Tag

multimodal large language models

1 view collected around this technical thread.

JD Tech Talk
Mar 13, 2025 · Artificial Intelligence

CTR-Driven Advertising Image Generation with Multimodal Large Language Models

This paper proposes CAIG, a method that uses multimodal large language models to generate advertising images with a high click-through rate (CTR). It combines reinforcement learning with preference optimization to align the generated content with product features.

CTR prediction · advertising image generation · e-commerce
10 min read