Tag

diffusion

1 views collected around this technical thread.

DaTaobao Tech
DaTaobao Tech
Mar 12, 2025 · Artificial Intelligence

Multimodal Automatic Layout Generation for E-commerce

The project develops a multimodal automatic layout generation system for e‑commerce by fine‑tuning the qwen‑vl‑7b vision‑language model with LoRA on poster and Taobao image‑layout data, employing diffusion‑based image generation and coordinate‑prediction methods to produce structured layouts that power poster, marketing image, and video‑cover creation with over 90% adoption, while exploring multi‑image, style‑aware, and iterative refinement extensions.

LLMdiffusione-commerce
0 likes · 12 min read
Multimodal Automatic Layout Generation for E-commerce
Alimama Tech
Alimama Tech
Dec 4, 2024 · Artificial Intelligence

AIGB: Generative Auto‑Bidding via Diffusion Modeling

AIGB, introduced by Alibaba Mama in 2023, reframes large‑scale ad‑auction auto‑bidding as a generative sequence task using diffusion models, achieving up to 5 % GMV gains, improved stability and interpretability, and is now commercialized, open‑sourced, and featured in a NeurIPS‑endorsed competition.

AIadvertisingauto-bidding
0 likes · 12 min read
AIGB: Generative Auto‑Bidding via Diffusion Modeling
DaTaobao Tech
DaTaobao Tech
Nov 20, 2024 · Mobile Development

MNN-Transformer: Efficient On‑Device Large Language and Diffusion Model Deployment

MNN‑Transformer provides an end‑to‑end framework that enables large language and diffusion models to run efficiently on modern smartphones by exporting, quantizing (including dynamic int4/int8 and KV cache compression) and executing via a plugin‑engine runtime, achieving up to 35 tokens/s decoding and 2‑3× faster image generation compared with existing on‑device solutions.

LLMMNNMobile AI
0 likes · 15 min read
MNN-Transformer: Efficient On‑Device Large Language and Diffusion Model Deployment
DataFunTalk
DataFunTalk
May 20, 2024 · Artificial Intelligence

Deploying OPPO Multi‑Modal Pretrained Models in Edge‑Cloud Scenarios: Techniques and Optimizations

This article presents OPPO's practical research on deploying multi‑modal pre‑training models across mobile devices and cloud, covering edge image‑text retrieval, text‑image generation and understanding optimizations, and lightweight diffusion model techniques, with detailed algorithmic improvements, performance results, and real‑world application cases.

AIGCOPPOdiffusion
0 likes · 18 min read
Deploying OPPO Multi‑Modal Pretrained Models in Edge‑Cloud Scenarios: Techniques and Optimizations
DataFunSummit
DataFunSummit
May 6, 2024 · Artificial Intelligence

Advances, Model Types, and Open Challenges of AI‑Generated Content (AIGC) with XiaoBu’s Image Generation Progress

This article reviews the definition, key metrics, and major model families of AI‑generated content, details XiaoBu’s recent breakthroughs in image generation, and discusses open research problems such as evaluation gaps, transformer limitations, and the need for richer multimodal intelligence representations.

AI researchAIGCGAN
0 likes · 14 min read
Advances, Model Types, and Open Challenges of AI‑Generated Content (AIGC) with XiaoBu’s Image Generation Progress
JD Cloud Developers
JD Cloud Developers
Apr 25, 2024 · Artificial Intelligence

How AI Diffusion Models Revolutionize E‑commerce Ad Image Creation

This article presents JD Advertising's 2023 innovations that combine relation‑aware diffusion models, category‑aware background generation, and planning‑and‑rendering pipelines to automatically produce high‑quality, scalable, and personalized e‑commerce ad posters, addressing efficiency, cost, and creative limitations of manual design.

AIadvertisingdiffusion
0 likes · 18 min read
How AI Diffusion Models Revolutionize E‑commerce Ad Image Creation
Alimama Tech
Alimama Tech
Apr 24, 2024 · Artificial Intelligence

Mask‑Guided Diffusion for Precise Product Image Generation

Mask‑Guided Diffusion combines instance‑mask training, Masked Canny ControlNet, and Mask‑guided Attribute Binding to preserve product details, correctly bind attributes, fix hand distortion, and generate uniform colored backgrounds, enabling merchants to quickly create high‑quality, controllable product images with Stable Diffusion.

AIComputer VisionControlNet
0 likes · 16 min read
Mask‑Guided Diffusion for Precise Product Image Generation
Sohu Tech Products
Sohu Tech Products
Mar 6, 2024 · Artificial Intelligence

Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications

OpenAI’s Sora video model unifies image and video data into latent spacetime patches via a VAE, trains on original resolutions with GPT‑4‑expanded captions, employs a Diffusion Transformer backbone for patch‑wise denoising, and demonstrates 3D‑consistent, long‑term world‑model capabilities that hint at a unified computer‑vision paradigm and steps toward AGI.

AI researchOpenAI Soradiffusion
0 likes · 9 min read
Analysis of OpenAI Sora: Data Engineering, Network Architecture, and World Model Implications
Model Perspective
Model Perspective
Nov 19, 2023 · Fundamentals

How Diffusion Models Explain Everyday Phenomena and Environmental Risks

This article introduces the fundamental concepts and mathematical description of diffusion, explores its wide-ranging applications from daily life to environmental engineering, and demonstrates its use through a detailed ink‑in‑water example and a lake‑spill case study.

Fick's lawdiffusionenvironmental science
0 likes · 10 min read
How Diffusion Models Explain Everyday Phenomena and Environmental Risks