Tagged articles

image synthesis

24 articles

May 23, 2026 · Artificial Intelligence

How FlashAR Achieves 22.9× Speedup with Only 0.05% of Training Data

FlashAR transforms pretrained autoregressive image models into highly parallel generators, delivering up to 22.9× end-to-end speedup while using just 0.05% of the original training data and preserving generation quality, thanks to intermediate branching, a learnable fusion gate, and a two-stage adaptation process.

FlashAR · autoregressive generation · image synthesis

0 likes · 10 min read

Black & White Path

Feb 20, 2026 · Industry Insights

When DeepFake Becomes Democratized: How One Face Photo Can Threaten Your Privacy

The rapid democratization of DeepFake technology lowers creation barriers, turning ordinary face photos into tools for malicious content, privacy breaches, and potential black‑market abuse, prompting urgent personal safeguards and regulatory action.

AI privacy · Regulation · deepfake

0 likes · 13 min read

JD Tech Talk

Nov 4, 2025 · Artificial Intelligence

How AI-Powered Virtual Try-On Transforms Fashion E‑Commerce

The article explains how JD.com's AI virtual try‑on system, Oxygen Tryon, uses advanced computer‑vision and generative models to let shoppers instantly preview clothing on their own photos, which improves purchase decisions and reduces return rates. It also outlines the technical challenges, innovations, and future development plans.

AI · Fashion E‑commerce · computer vision

0 likes · 7 min read

JD Cloud Developers

Nov 4, 2025 · Artificial Intelligence

How AI-Powered Virtual Try‑On Is Revolutionizing Fashion E‑Commerce

The article explains how JD.com's AI try‑on system, Oxygen Tryon, uses advanced computer‑vision models to let shoppers instantly preview garments on their own photos, which improves fit perception and reduces return rates. It also outlines future technical and business expansions.

AI · Fashion E‑commerce · computer vision

0 likes · 6 min read

JD Retail Technology

Oct 31, 2025 · Artificial Intelligence

How JD’s AI Try‑On “Oxygen Tryon” Revolutionizes Online Fashion Shopping

JD’s Oxygen Tryon leverages advanced AI, keypoint detection, and real‑time rendering to let shoppers virtually try on clothing, cutting return rates and boosting conversion. The article outlines the technical challenges, innovations, and future plans for broader fashion applications.

AI try-on · Fashion E‑commerce · computer vision

0 likes · 6 min read

AI Algorithm Path

Oct 15, 2025 · Artificial Intelligence

Building a Flow Matching Model from Scratch: Theory Explained

This article walks through the theory behind flow‑matching generative models, contrasting them with diffusion models, detailing the velocity‑field formulation, training objective, and sampling procedure, and includes visual illustrations of the core concepts.

Diffusion Models · ODE · flow matching

0 likes · 8 min read

AIWalker

Apr 2, 2025 · Artificial Intelligence

EasyControl: Plug‑and‑Play DiT Control with Arbitrary Aspect Ratios and Accelerated Inference

EasyControl introduces a lightweight condition‑injection LoRA module, a position‑aware training paradigm, and causal attention with KV‑cache to enable plug‑and‑play multi‑condition control for DiT models, supporting arbitrary image resolutions while cutting inference latency by up to 30% and preserving high‑quality generation.

Conditional Generation · DiT · EasyControl

0 likes · 17 min read

AIWalker

Mar 8, 2025 · Artificial Intelligence

IMAGPose: A Unified Conditional Framework for Photo‑Realistic Pose‑Guided Person Generation (NeurIPS 2024)

IMAGPose introduces a unified conditional diffusion framework that combines feature‑level, image‑level, and cross‑view attention modules to generate high‑fidelity, photo‑realistic person images under diverse pose and multi‑view scenarios, outperforming prior SOTA methods on DeepFashion and Market‑1501.

AI · Diffusion Models · computer vision

0 likes · 22 min read

AIWalker

Jan 14, 2025 · Artificial Intelligence

Pure 3×3 Convolutions for Image‑Generation Diffusion Models: The DiC Approach

The paper introduces DiC, a fully convolutional diffusion model that rethinks 3×3 convolutions, adds sparse skip connections, stage‑specific embeddings and conditional gating, and demonstrates superior FID/IS scores and faster inference compared to diffusion Transformers across multiple scales.

AI · Diffusion Models · convolutional networks

0 likes · 19 min read

DaTaobao Tech

Dec 20, 2024 · Artificial Intelligence

AIGC Applications in Fresh E‑Commerce: 2024 Overview

In 2024, AI‑generated content transforms fresh‑food e‑commerce by leveraging large language and multimodal models to automatically craft concise product copy, synthesize realistic images and short‑video GIFs, detect and fix visual defects, and build brand‑specific style libraries, paving the way for future voice‑driven storytelling and personalized shopping experiences.

AIGC · Content Generation · Fresh E-commerce

0 likes · 7 min read

AntTech

Dec 19, 2024 · Artificial Intelligence

Framer: Interactive Video Frame Interpolation Using Diffusion Models

Framer is an interactive video frame interpolation method that leverages large pretrained video diffusion models, allowing users to define custom motion trajectories or use an automatic mode, and demonstrates strong performance in image deformation, video generation, and cartoon‑to‑video applications.

AI · Framer · diffusion model

0 likes · 4 min read

Alimama Tech

Aug 16, 2024 · Artificial Intelligence

SPLAM: Sub‑Path Linear Approximation for Accelerating Diffusion Model Sampling

SPLAM (Sub‑Path Linear Approximation Model) accelerates diffusion‑model image synthesis by linearly approximating short sub‑paths of the probability‑flow ODE, allowing high‑quality generation in as few as four steps, outperforming prior fast‑sampling methods on COCO benchmarks and being deployed in Alimama’s recommendation system.

AI image generation · Diffusion Models · SPLAM

0 likes · 11 min read

JD Cloud Developers

Apr 25, 2024 · Artificial Intelligence

How AI Diffusion Models Revolutionize E‑commerce Ad Image Creation

This article presents JD Advertising's 2023 innovations that combine relation‑aware diffusion models, category‑aware background generation, and planning‑and‑rendering pipelines to automatically produce high‑quality, scalable, and personalized e‑commerce ad posters, addressing efficiency, cost, and creative limitations of manual design.

AI · Advertising · diffusion

0 likes · 18 min read

360 Tech Engineering

Apr 17, 2024 · Artificial Intelligence

HiCo: A Hierarchical Controllable Diffusion Model for Layout‑to‑Image Generation

The 360 AI Research Institute introduces HiCo, a hierarchical controllable diffusion model that enables fine‑grained layout control across up to eight image regions, integrates seamlessly with existing Stable Diffusion ecosystems, and demonstrates superior performance on the GRIT‑VAL benchmark for layout‑aware image synthesis.

AI drawing · Evaluation · HiCo

0 likes · 8 min read

Alibaba Cloud Developer

Dec 5, 2023 · Artificial Intelligence

How AIGC Advances Boost Image Quality: Lessons from Alibaba’s Cyber Project

Since July, the author has been developing AIGC projects and has observed dramatic image quality improvements driven by rapid advances in large models, open‑source plugins, and a deeper understanding of generation pipelines and parameters, prompting a comprehensive summary of the Cyber project’s solutions, industry landscape, and front‑end responsibilities.

AI-generated images · AIGC · Cyber project

0 likes · 8 min read

php Courses

Nov 14, 2023 · Artificial Intelligence

Google and UC Berkeley Introduce Idempotent Generative Network (IGN) as a New Generative AI Method

Google, in collaboration with UC Berkeley, has unveiled a novel generative AI approach called the Idempotent Generative Network (IGN) that can produce images from any input in a single step, offering an alternative to GANs, diffusion models, and consistency models.

Diffusion Models · GAN · Generative AI

0 likes · 3 min read

Tencent Tech

Oct 26, 2023 · Artificial Intelligence

Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips

This guide introduces Tencent Hunyuan's upgraded text‑to‑image model, explains its technical innovations, provides detailed prompt engineering advice, showcases example prompts and generated images across various styles, and highlights real‑world applications and performance metrics for developers and creators.

AI generation · Prompt engineering · Tencent Hunyuan

0 likes · 12 min read

Ximalaya Technology Team

Oct 10, 2023 · Artificial Intelligence

MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis

MiniGPT-5 is a novel multimodal generation model using generative vokens to interleave text and image synthesis, integrating Stable Diffusion and LLMs with a two-stage training that requires no domain-specific annotations, achieving state‑of‑the‑art coherence and quality on benchmarks like CC3M, VIST, and MMDialog.

AI research · Multimodal Generation · Stable Diffusion

0 likes · 9 min read

Tencent Cloud Developer

Jul 27, 2023 · Artificial Intelligence

Creating Artistic QR Code Images with ControlNet and Stable Diffusion

The article demonstrates how to create visually appealing, scannable QR‑code artworks using ControlNet and Stable Diffusion, explaining QR‑code structure, contrast preservation, and several pipelines—including tile‑based, OpenPose‑combined, and community QR‑code models—while detailing WebUI settings, prompt examples, weight tuning, and a custom ControlNet that reduces grid artifacts.

AI-generated QR codes · ControlNet · Stable Diffusion

0 likes · 13 min read

Alimama Tech

Jul 6, 2022 · Artificial Intelligence

Recent ACM MM and ECCV Papers on Intelligent Creative Technologies by Alibaba

Alibaba’s Creative & Video Platform showcases six newly accepted ACM MM and ECCV papers that introduce self‑supervised text‑erasing, a confidence‑driven action‑proposal module, a geometry‑aligned variational transformer for image‑conditioned layouts, a high‑resolution virtual‑try‑on system, a motion‑transformer for unsupervised animation, and a cross‑domain motion‑transfer framework, highlighting cutting‑edge AI for automated creative design, video editing, and e‑commerce applications.

image synthesis · motion transformer · video action detection

0 likes · 11 min read

NetEase Yanxuan Technology Product Team

Apr 19, 2021 · Artificial Intelligence

Smart Creative: Automated Advertising Content Generation at NetEase Yanxuan

Smart Creative at NetEase Yanxuan uses algorithmic templates to automatically generate personalized ad images and videos from product data, selecting colors, aesthetics, and sizes, reducing manual cost and boosting revenue across DSP, recommendation, and search advertising.

AI marketing · NetEase Yanxuan · algorithmic advertising

0 likes · 8 min read

Alibaba Cloud Developer

Aug 2, 2018 · Artificial Intelligence

Style‑Adversarial Autoencoder Enables Precise Content‑Style Image Generation

This paper introduces a style‑adversarial autoencoder that separates content and style latent variables, uses a multi‑class discriminator, and demonstrates superior image generation and data augmentation across MNIST, face, and text datasets.

Generative Adversarial Networks · Style Transfer · autoencoders

0 likes · 15 min read

JD Retail Technology

Jul 18, 2018 · Artificial Intelligence

JD's AI-Driven Image Automation and Knowledge Graph Applications in E-commerce

JD.com describes its AI-powered image automation pipeline—including intelligent cutout, layout learning, and batch synthesis—along with a large-scale product knowledge graph that enables applications such as the JIMI customer service robot and the Li Bai writing assistant for global e-commerce.

AI · Customer Service Robot · Knowledge Graph

0 likes · 7 min read

Alibaba Cloud Developer

Feb 27, 2018 · Artificial Intelligence

How AR Transforms Coffee Retail: Inside Alibaba’s AI‑Powered Cloud Recognition

Alibaba’s AI Lab built an AR‑enhanced Starbucks coffee workshop in Shanghai, using client‑side object detection, deep‑learning cloud recognition, image synthesis, and color‑simulation techniques to overcome challenges like metal reflections, transparency, and varying lighting, illustrating how AR can revamp new‑retail experiences.

AR · augmented reality · cloud recognition

0 likes · 8 min read
