Alimama Tech
Sep 7, 2022 · Artificial Intelligence
Curriculum-Guided Bayesian Reinforcement Learning for ROI-Constrained Real-Time Bidding
The paper presents a Curriculum‑Guided Bayesian Reinforcement Learning (CBRL) framework that models ROI‑constrained real‑time bidding as a partially observable constrained MDP, using hard‑margin indicator rewards and a curriculum of relaxed proxy problems to achieve fast, constraint‑satisfying, Bayes‑optimal policies that outperform existing methods on large‑scale industrial data.
Bayesian RLMDPROI constraint
0 likes · 15 min read