Tag

image synthesis

0 views collected around this technical thread.

DaTaobao Tech
DaTaobao Tech
Dec 20, 2024 · Artificial Intelligence

AIGC Applications in Fresh E‑Commerce: 2024 Overview

In 2024, AI‑generated content transforms fresh‑food e‑commerce by leveraging large language and multimodal models to automatically craft concise product copy, synthesize realistic images and short‑video GIFs, detect and fix visual defects, and build brand‑specific style libraries, paving the way for future voice‑driven storytelling and personalized shopping experiences.

AIGCContent generationFresh E-commerce
0 likes · 7 min read
AIGC Applications in Fresh E‑Commerce: 2024 Overview
AntTech
AntTech
Dec 19, 2024 · Artificial Intelligence

Framer: Interactive Video Frame Interpolation Using Diffusion Models

Framer is an interactive video frame interpolation method that leverages large‑pretrained video diffusion models, allowing users to define custom motion trajectories or use an automatic mode, and demonstrates strong performance in image deformation, video generation, and cartoon‑to‑video applications.

AIFramercomputer vision
0 likes · 4 min read
Framer: Interactive Video Frame Interpolation Using Diffusion Models
DaTaobao Tech
DaTaobao Tech
Nov 27, 2024 · Artificial Intelligence

FuseAnyPart: Diffusion‑Driven Facial Parts Swapping via Multiple Reference Images

FuseAnyPart is a diffusion‑model‑based facial part swapping technique that fuses features from multiple reference images via mask‑based fusion and additive injection modules, delivering high‑fidelity, consistent face edits with lower computational cost, outperforming prior methods on CelebA‑HQ and FaceForensics++ and already boosting commercial AIGC applications.

computer visiondiffusion modelfacial part swapping
0 likes · 9 min read
FuseAnyPart: Diffusion‑Driven Facial Parts Swapping via Multiple Reference Images
Alimama Tech
Alimama Tech
Aug 16, 2024 · Artificial Intelligence

SPLAM: Sub‑Path Linear Approximation for Accelerating Diffusion Model Sampling

SPLAM (Sub‑Path Linear Approximation Model) accelerates diffusion‑model image synthesis by linearly approximating short sub‑paths of the probability‑flow ODE, allowing high‑quality generation in as few as four steps, outperforming prior fast‑sampling methods on COCO benchmarks and being deployed in Alibaba Mama’s recommendation system.

AI image generationSPLAMdiffusion models
0 likes · 11 min read
SPLAM: Sub‑Path Linear Approximation for Accelerating Diffusion Model Sampling
360 Tech Engineering
360 Tech Engineering
Apr 17, 2024 · Artificial Intelligence

HiCo: A Hierarchical Controllable Diffusion Model for Layout‑to‑Image Generation

The 360 AI Research Institute introduces HiCo, a hierarchical controllable diffusion model that enables fine‑grained layout control across up to eight image regions, integrates seamlessly with existing Stable Diffusion ecosystems, and demonstrates superior performance on the GRIT‑VAL benchmark for layout‑aware image synthesis.

AI drawingHiCocontrollable generation
0 likes · 8 min read
HiCo: A Hierarchical Controllable Diffusion Model for Layout‑to‑Image Generation
Tencent Tech
Tencent Tech
Oct 26, 2023 · Artificial Intelligence

Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips

This guide introduces Tencent Hunyuan's upgraded text‑to‑image model, explains its technical innovations, provides detailed prompt engineering advice, showcases example prompts and generated images across various styles, and highlights real‑world applications and performance metrics for developers and creators.

AI generationTencent Hunyuanimage synthesis
0 likes · 12 min read
Unlocking Tencent Hunyuan Text‑to‑Image: A Complete Guide and Prompt Tips
Ximalaya Technology Team
Ximalaya Technology Team
Oct 10, 2023 · Artificial Intelligence

MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis

MiniGPT-5 is a novel multimodal generation model using generative vokens to interleave text and image synthesis, integrating Stable Diffusion and LLMs with a two-stage training that requires no domain-specific annotations, achieving state‑of‑the‑art coherence and quality on benchmarks like CC3M, VIST, and MMDialog.

AI researchMultimodal GenerationStable Diffusion
0 likes · 9 min read
MiniGPT-5: A Novel Multimodal Generation Model for Coherent Text-Image Synthesis
Tencent Cloud Developer
Tencent Cloud Developer
Jul 27, 2023 · Artificial Intelligence

Creating Artistic QR Code Images with ControlNet and Stable Diffusion

The article demonstrates how to create visually appealing, scannable QR‑code artworks using ControlNet and Stable Diffusion, explaining QR‑code structure, contrast preservation, and several pipelines—including tile‑based, OpenPose‑combined, and community QR‑code models—while detailing WebUI settings, prompt examples, weight tuning, and a custom ControlNet that reduces grid artifacts.

AI-generated QR codesControlNetPrompt engineering
0 likes · 13 min read
Creating Artistic QR Code Images with ControlNet and Stable Diffusion
Alimama Tech
Alimama Tech
Jul 6, 2022 · Artificial Intelligence

Recent ACM MM and ECCV Papers on Intelligent Creative Technologies by Alibaba

Alibaba’s Creative & Video Platform showcases six newly accepted ACM MM and ECCV papers that introduce self‑supervised text‑erasing, a confidence‑driven action‑proposal module, a geometry‑aligned variational transformer for image‑conditioned layouts, a high‑resolution virtual‑try‑on system, a motion‑transformer for unsupervised animation, and a cross‑domain motion‑transfer framework, highlighting cutting‑edge AI for automated creative design, video editing, and e‑commerce applications.

computer visionimage synthesislayout generation
0 likes · 11 min read
Recent ACM MM and ECCV Papers on Intelligent Creative Technologies by Alibaba
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
Apr 19, 2021 · Artificial Intelligence

Smart Creative: Automated Advertising Content Generation at NetEase Yanxuan

Smart Creative at NetEase Yanxuan uses algorithmic templates to automatically generate personalized ad images and videos from product data, selecting colors, aesthetics, and sizes, reducing manual cost and boosting revenue across DSP, recommendation, and search advertising.

AI marketingNetEase Yanxuanalgorithmic advertising
0 likes · 8 min read
Smart Creative: Automated Advertising Content Generation at NetEase Yanxuan
JD Retail Technology
JD Retail Technology
Jul 18, 2018 · Artificial Intelligence

JD's AI-Driven Image Automation and Knowledge Graph Applications in E-commerce

JD.com describes its AI-powered image automation pipeline—including intelligent cutout, layout learning, and batch synthesis—along with a large-scale product knowledge graph that enables applications such as the JIMI customer service robot and the Li Bai writing assistant for global e-commerce.

AICustomer Service RobotNatural Language Generation
0 likes · 7 min read
JD's AI-Driven Image Automation and Knowledge Graph Applications in E-commerce