CTR-Driven Advertising Image Generation with Multimodal Large Language Models
This paper proposes CAIG, a novel method for generating high-CTR advertising images using multimodal large language models, combining reinforcement learning and preference optimization to align generated content with product features.