Multi‑Objective Deep Reinforcement Learning Framework for E‑commerce Traffic Allocation (MODRL‑TA)
MODRL‑TA is a multi‑objective deep reinforcement learning framework that unites independent Q‑learning agents, a cross‑entropy‑based decision‑fusion module, and progressive data‑augmentation to overcome cold‑start and multi‑objective trade‑offs in e‑commerce traffic allocation, delivering up to 18% more impressions, 4% higher CTR and 5% higher CVR in live tests.
