Tagged articles
23 articles
JD Tech Talk
Nov 4, 2025 · Artificial Intelligence

How AI-Powered Virtual Try-On Transforms Fashion E‑Commerce

The article explains how JD.com's AI virtual try‑on system, Oxygen Tryon, uses advanced computer‑vision and generative models to let shoppers instantly preview clothing on their own photos, which improves purchase decisions and reduces return rates. It also outlines the technical challenges, innovations, and future development plans.

AI · Computer Vision · Deep Learning
0 likes · 7 min read
JD Cloud Developers
Nov 4, 2025 · Artificial Intelligence

How AI-Powered Virtual Try‑On Is Revolutionizing Fashion E‑Commerce

The article explains how JD.com's AI try‑on system, Oxygen Tryon, uses advanced computer‑vision models to let shoppers instantly preview garments on their own photos, which improves fit perception and reduces return rates. It also outlines planned technical and business expansions.

AI · Computer Vision · Fashion E‑commerce
0 likes · 6 min read
AI Algorithm Path
Oct 15, 2025 · Artificial Intelligence

Building a Flow Matching Model from Scratch: Theory Explained

This article walks through the theory behind flow‑matching generative models, contrasting them with diffusion models, detailing the velocity‑field formulation, training objective, and sampling procedure, and includes visual illustrations of the core concepts.

Generative Models · ODE · diffusion models
0 likes · 8 min read
AIWalker
Apr 2, 2025 · Artificial Intelligence

EasyControl: Plug‑and‑Play DiT Control with Arbitrary Aspect Ratios and Accelerated Inference

EasyControl introduces a lightweight condition‑injection LoRA module, a position‑aware training paradigm, and causal attention with KV‑cache to enable plug‑and‑play multi‑condition control for DiT models, supporting arbitrary image resolutions while cutting inference latency by up to 30% and preserving high‑quality generation.

Conditional Generation · DiT · EasyControl
0 likes · 17 min read
AIWalker
Mar 8, 2025 · Artificial Intelligence

IMAGPose: A Unified Conditional Framework for Photo‑Realistic Pose‑Guided Person Generation (NeurIPS 2024)

IMAGPose introduces a unified conditional diffusion framework that combines feature‑level, image‑level, and cross‑view attention modules to generate high‑fidelity, photo‑realistic person images under diverse pose and multi‑view scenarios, outperforming prior SOTA methods on DeepFashion and Market‑1501.

AI · Computer Vision · diffusion models
0 likes · 22 min read
AIWalker
Jan 14, 2025 · Artificial Intelligence

Pure 3×3 Convolutions for Image‑Generation Diffusion Models: The DiC Approach

The paper introduces DiC, a fully convolutional diffusion model that rethinks 3×3 convolutions, adds sparse skip connections, stage‑specific embeddings and conditional gating, and demonstrates superior FID/IS scores and faster inference compared to diffusion Transformers across multiple scales.

AI · convolutional networks · diffusion models
0 likes · 19 min read
DaTaobao Tech
Dec 20, 2024 · Artificial Intelligence

AIGC Applications in Fresh E‑Commerce: 2024 Overview

In 2024, AI‑generated content transforms fresh‑food e‑commerce by leveraging large language and multimodal models to automatically craft concise product copy, synthesize realistic images and short‑video GIFs, detect and fix visual defects, and build brand‑specific style libraries, paving the way for future voice‑driven storytelling and personalized shopping experiences.

AIGC · Content Generation · Fresh E-commerce
0 likes · 7 min read
AntTech
Dec 19, 2024 · Artificial Intelligence

Framer: Interactive Video Frame Interpolation Using Diffusion Models

Framer is an interactive video frame interpolation method that leverages large pretrained video diffusion models, allowing users to define custom motion trajectories or use an automatic mode, and demonstrates strong performance in image deformation, video generation, and cartoon‑to‑video applications.

AI · Framer · diffusion model
0 likes · 4 min read
Alimama Tech
Aug 16, 2024 · Artificial Intelligence

SPLAM: Sub‑Path Linear Approximation for Accelerating Diffusion Model Sampling

SPLAM (Sub‑Path Linear Approximation Model) accelerates diffusion‑model image synthesis by linearly approximating short sub‑paths of the probability‑flow ODE, allowing high‑quality generation in as few as four steps. It outperforms prior fast‑sampling methods on COCO benchmarks and is deployed in Alimama’s recommendation system.

AI image generation · SPLAM · diffusion models
0 likes · 11 min read
JD Cloud Developers
Apr 25, 2024 · Artificial Intelligence

How AI Diffusion Models Revolutionize E‑commerce Ad Image Creation

This article presents JD Advertising's 2023 innovations that combine relation‑aware diffusion models, category‑aware background generation, and planning‑and‑rendering pipelines to automatically produce high‑quality, scalable, and personalized e‑commerce ad posters, addressing efficiency, cost, and creative limitations of manual design.

AI · Advertising · diffusion
0 likes · 18 min read
360 Tech Engineering
Apr 17, 2024 · Artificial Intelligence

HiCo: A Hierarchical Controllable Diffusion Model for Layout‑to‑Image Generation

The 360 AI Research Institute introduces HiCo, a hierarchical controllable diffusion model that enables fine‑grained layout control across up to eight image regions, integrates seamlessly with existing Stable Diffusion ecosystems, and demonstrates superior performance on the GRIT‑VAL benchmark for layout‑aware image synthesis.

AI drawing · Controllable Generation · HiCo
0 likes · 8 min read
Alibaba Cloud Developer
Dec 5, 2023 · Artificial Intelligence

How AIGC Advances Boost Image Quality: Lessons from Alibaba’s Cyber Project

Since July, the author has been developing AIGC projects and has observed dramatic image quality improvements driven by rapid advances in large models, open‑source plugins, and a deeper understanding of generation pipelines and parameters. This prompted a comprehensive summary of the Cyber project’s solutions, the industry landscape, and front‑end responsibilities.

AI-generated images · AIGC · Cyber project
0 likes · 8 min read
Tencent Tech
Oct 26, 2023 · Artificial Intelligence

Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips

This guide introduces Tencent Hunyuan's upgraded text‑to‑image model, explains its technical innovations, provides detailed prompt engineering advice, showcases example prompts and generated images across various styles, and highlights real‑world applications and performance metrics for developers and creators.

AI Generation · Large Model · Prompt engineering
0 likes · 12 min read
Ximalaya Technology Team
Oct 10, 2023 · Artificial Intelligence

MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis

MiniGPT-5 is a novel multimodal generation model using generative vokens to interleave text and image synthesis, integrating Stable Diffusion and LLMs with a two-stage training that requires no domain-specific annotations, achieving state‑of‑the‑art coherence and quality on benchmarks like CC3M, VIST, and MMDialog.

AI research · Stable Diffusion · Vision Transformer
0 likes · 9 min read
Tencent Cloud Developer
Jul 27, 2023 · Artificial Intelligence

Creating Artistic QR Code Images with ControlNet and Stable Diffusion

The article demonstrates how to create visually appealing, scannable QR‑code artworks using ControlNet and Stable Diffusion, explaining QR‑code structure, contrast preservation, and several pipelines—including tile‑based, OpenPose‑combined, and community QR‑code models—while detailing WebUI settings, prompt examples, weight tuning, and a custom ControlNet that reduces grid artifacts.

AI-generated QR codes · ControlNet · Stable Diffusion
0 likes · 13 min read
Alimama Tech
Jul 6, 2022 · Artificial Intelligence

Recent ACM MM and ECCV Papers on Intelligent Creative Technologies by Alibaba

Alibaba’s Creative & Video Platform showcases six newly accepted ACM MM and ECCV papers that introduce self‑supervised text‑erasing, a confidence‑driven action‑proposal module, a geometry‑aligned variational transformer for image‑conditioned layouts, a high‑resolution virtual‑try‑on system, a motion‑transformer for unsupervised animation, and a cross‑domain motion‑transfer framework, highlighting cutting‑edge AI for automated creative design, video editing, and e‑commerce applications.

image synthesis · motion transformer · video action detection
0 likes · 11 min read
JD Retail Technology
Jul 18, 2018 · Artificial Intelligence

JD's AI-Driven Image Automation and Knowledge Graph Applications in E-commerce

JD.com describes its AI-powered image automation pipeline—including intelligent cutout, layout learning, and batch synthesis—along with a large-scale product knowledge graph that enables applications such as the JIMI customer service robot and the Li Bai writing assistant for global e-commerce.

AI · Customer Service Robot · Knowledge Graph
0 likes · 7 min read
Alibaba Cloud Developer
Feb 27, 2018 · Artificial Intelligence

How AR Transforms Coffee Retail: Inside Alibaba’s AI‑Powered Cloud Recognition

Alibaba’s AI Lab built an AR‑enhanced Starbucks coffee workshop in Shanghai, using client‑side object detection, deep‑learning cloud recognition, image synthesis, and color‑simulation techniques to overcome challenges like metal reflections, transparency, and varying lighting, illustrating how AR can revamp new‑retail experiences.

AR · Deep Learning · augmented reality
0 likes · 8 min read