Tag

ControlNet

0 views collected around this technical thread.

Ele.me Technology
Ele.me Technology
Apr 10, 2025 · Artificial Intelligence

Ele.me Vertical Business AIGC Image Model: Architecture, Training Pipeline, and Evaluation

Ele.me created a domain-specific AIGC image model built from scratch on its own data using the DiT backbone, a three-stage training pipeline (transformer pre-training, prompt alignment, aesthetic fine-tuning), custom T5‑E‑CLIP text and visual encoders, ControlNet for layout control, and evaluated via FID, CLIP scores and a human rubric, enabling automated dish-image generation and UI asset creation for its vertical business.

AIGCControlNetDiT
0 likes · 8 min read
Ele.me Vertical Business AIGC Image Model: Architecture, Training Pipeline, and Evaluation
Alimama Tech
Alimama Tech
Oct 17, 2024 · Artificial Intelligence

FLUX ControlNet Inpainting and 8-Step Turbo Acceleration Models

Alibaba’s Mama Intelligent Creation team has open‑sourced a FLUX‑based ControlNet inpainting model that leverages a DiT‑backed Interleave design for superior repair quality, and an 8‑step LoRA‑Turbo model that cuts inference time three‑fold while preserving near‑original image fidelity, both now available on Hugging Face and ModelScope.

AIControlNetFlux
0 likes · 9 min read
FLUX ControlNet Inpainting and 8-Step Turbo Acceleration Models
JD Tech Talk
JD Tech Talk
May 31, 2024 · Artificial Intelligence

Getting Started with ControlNet in ComfyUI: Installation, Plugins, and Workflow Guide

This article introduces ControlNet for ComfyUI, explains its benefits for AI image generation, walks through local and cloud installation, plugin setup, node configuration, and provides practical examples and resources for creating high‑quality AI artwork.

AI artComfyUIControlNet
0 likes · 14 min read
Getting Started with ControlNet in ComfyUI: Installation, Plugins, and Workflow Guide
Alimama Tech
Alimama Tech
May 15, 2024 · Artificial Intelligence

EcomXL: Optimizing SDXL for Large‑Scale E‑commerce Image Generation

EcomXL enhances SDXL for large‑scale e‑commerce image generation by leveraging tens of millions of curated images, a two‑stage fine‑tuning with denoising‑weighted distillation and layer‑wise fusion, specialized ControlNets for inpainting and soft‑edge consistency, and the SLAM inference accelerator to achieve sub‑second generation while boosting visual quality and adoption metrics.

AIGCControlNetEcomXL
0 likes · 15 min read
EcomXL: Optimizing SDXL for Large‑Scale E‑commerce Image Generation
Alimama Tech
Alimama Tech
Apr 24, 2024 · Artificial Intelligence

Mask‑Guided Diffusion for Precise Product Image Generation

Mask‑Guided Diffusion combines instance‑mask training, Masked Canny ControlNet, and Mask‑guided Attribute Binding to preserve product details, correctly bind attributes, fix hand distortion, and generate uniform colored backgrounds, enabling merchants to quickly create high‑quality, controllable product images with Stable Diffusion.

AIControlNetMask Guidance
0 likes · 16 min read
Mask‑Guided Diffusion for Precise Product Image Generation
DaTaobao Tech
DaTaobao Tech
Mar 6, 2024 · Artificial Intelligence

AI Clothing Graffiti Project: Implementation and Optimization of AIGC Technology in Taobao Life 2

The AI Clothing Graffiti Project in Taobao Life 2 leverages Stable Diffusion, ControlNet, and LoRA to let users generate and stylize clothing designs via text‑image prompts, employing parallel processing, face repair, and content filtering, and has launched successfully, inviting algorithm engineers to join the team.

AIAIGCComputer Vision
0 likes · 14 min read
AI Clothing Graffiti Project: Implementation and Optimization of AIGC Technology in Taobao Life 2
Ximalaya Technology Team
Ximalaya Technology Team
Feb 1, 2024 · Artificial Intelligence

Understanding AI Image Generation: Diffusion Models, CLIP, and Control Techniques

This guide explains how AI image generators such as Stable Diffusion and DALL·E 3 turn text prompts into pictures by using diffusion models, CLIP‑aligned embeddings, and optional controls like negative prompts, fine‑tuned LoRA checkpoints and ControlNet conditioning, highlighting their differences, workflow, and practical customization.

AI image generationClipControlNet
0 likes · 18 min read
Understanding AI Image Generation: Diffusion Models, CLIP, and Control Techniques
DaTaobao Tech
DaTaobao Tech
Dec 4, 2023 · Artificial Intelligence

AIGC Poster Generation Project: Methods and Optimizations

The AIGC Poster Generation Project employs Stable Diffusion enhanced with VAE, ControlNet, LoRA and other extensions to create product posters in four visual styles, exploring outpainting, inpainting, reference‑based diffusion and DreamBooth prototypes, and optimizes detail preservation, super‑resolution text, and masking to achieve over 90% detail fidelity, 95% success rate, and 3–5 second inference per image.

AIGCControlNetPoster Design
0 likes · 7 min read
AIGC Poster Generation Project: Methods and Optimizations
DaTaobao Tech
DaTaobao Tech
Sep 1, 2023 · Artificial Intelligence

Research on Virtual Clothing Try-On Using Stable Diffusion and LoRA

This study evaluates virtual clothing try‑on by fine‑tuning LoRA models on a handful of garment images and integrating Stable Diffusion with ControlNet, Inpainting, and Segment Anything, showing that the AI‑generated pipeline matches or exceeds traditional geometric‑matching VTON in visual fidelity, especially for upper‑body apparel, while running locally on an RTX 3060.

AIControlNetFashion Generation
0 likes · 9 min read
Research on Virtual Clothing Try-On Using Stable Diffusion and LoRA
DaTaobao Tech
DaTaobao Tech
Aug 11, 2023 · Artificial Intelligence

Practical Guide to Stable Diffusion WebUI: Prompt Engineering, LoRA, VAE, and ControlNet

This practical guide walks users through installing Stable Diffusion WebUI, explains the differences between base, LoRA, VAE, and ControlNet models, shows how to derive prompts with CLIP or DeepBooru, and provides detailed text‑to‑image and image‑to‑image examples for effective prompt engineering.

AI image generationControlNetLoRA
0 likes · 12 min read
Practical Guide to Stable Diffusion WebUI: Prompt Engineering, LoRA, VAE, and ControlNet
Tencent Cloud Developer
Tencent Cloud Developer
Jul 27, 2023 · Artificial Intelligence

Creating Artistic QR Code Images with ControlNet and Stable Diffusion

The article demonstrates how to create visually appealing, scannable QR‑code artworks using ControlNet and Stable Diffusion, explaining QR‑code structure, contrast preservation, and several pipelines—including tile‑based, OpenPose‑combined, and community QR‑code models—while detailing WebUI settings, prompt examples, weight tuning, and a custom ControlNet that reduces grid artifacts.

AI-generated QR codesControlNetPrompt engineering
0 likes · 13 min read
Creating Artistic QR Code Images with ControlNet and Stable Diffusion
DaTaobao Tech
DaTaobao Tech
Jun 16, 2023 · Artificial Intelligence

Introduction to Stable Diffusion: Concepts, Prompts, and Advanced Techniques

The article introduces Stable Diffusion, explains key terms and parameters, guides model checkpoint merging and fine‑tuning with embeddings, LoRA, and hypernetworks, details ControlNet pose control, sampling choices, prompt engineering techniques—including weighting and negative prompts—and explores advanced uses such as inpainting, Pix2Pix, custom training, highlighting personal and commercial applications and the technology’s growing impact across industries.

AI image generationControlNetStable Diffusion
0 likes · 18 min read
Introduction to Stable Diffusion: Concepts, Prompts, and Advanced Techniques
Tencent Cloud Developer
Tencent Cloud Developer
Apr 10, 2023 · Artificial Intelligence

How Computers Generate Realistic Images: An In‑Depth Guide to AI Image Generation, Diffusion Models, ControlNet, LoRA and More

This guide explains how AI creates photorealistic images, tracing the shift from VAEs and GANs to diffusion models, detailing latent diffusion, ControlNet conditioning, CLIP text‑image alignment, and lightweight fine‑tuning methods like DreamBooth and LoRA, plus practical tips for higher‑resolution results.

AI image generationControlNetLoRA
0 likes · 22 min read
How Computers Generate Realistic Images: An In‑Depth Guide to AI Image Generation, Diffusion Models, ControlNet, LoRA and More
Laiye Technology Team
Laiye Technology Team
Mar 3, 2023 · Artificial Intelligence

Survey of Text‑Controlled Image Generation Models: DALL·E‑2, Imagen, Stable Diffusion, and ControlNet

This article reviews the key components and design choices of recent text‑controlled image generation systems—including DALL·E‑2, Google Imagen, Stability AI's Latent Stable Diffusion, and the ControlNet extension—highlighting how diffusion models, text encoders, prior modules, super‑resolution, and conditioning mechanisms enable high‑quality, controllable visual synthesis.

AIControlNetDALL-E-2
0 likes · 16 min read
Survey of Text‑Controlled Image Generation Models: DALL·E‑2, Imagen, Stable Diffusion, and ControlNet