Relation-Aware Diffusion Models for Automated Poster Layout and Product Background Generation

This article presents JD Advertising's 2023 AI-driven framework that uses a relation‑aware diffusion model with visual‑text and geometric modules, combined with category‑common and personalized generators and a planning‑and‑rendering network, to automate high‑quality, scalable e‑commerce poster creation and background synthesis.

JD Tech
JD Tech
JD Tech
Relation-Aware Diffusion Models for Automated Poster Layout and Product Background Generation

In 2023, JD Advertising introduced a series of AI-driven methods to automate e‑commerce poster creation, addressing the inefficiencies of manual design by leveraging a relation‑aware diffusion model that incorporates visual‑text and geometric relationships.

The model uses a Visual‑Text Relation Awareness Module (VTRAM) to align image and textual features via cross‑attention, and a Geometric Relation Awareness Module (GRAM) to encode relative positions of Regions of Interest, enabling controllable layout generation.

To achieve scalable and personalized backgrounds, a category‑common generator extracts generic background cues from product images, while a personalized generator learns style from reference images; both are integrated into Stable Diffusion.

A planning‑and‑rendering framework (P&R) combines a PlanNet that predicts element layouts from product visuals and text, and a RenderNet that fuses layout, visual, and spatial information through a spatial‑fusion module and ControlNet to produce final posters.

The paper concludes with a technical roadmap summarizing the three‑stage solution and outlines future research directions such as controllability, multimodal integration, and personalized advertising generation.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Multimodal AIImage GenerationDiffusion Modelse-commerce advertisingposter layout
JD Tech
Written by

JD Tech

Official JD technology sharing platform. All the cutting‑edge JD tech, innovative insights, and open‑source solutions you’re looking for, all in one place.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.