Tagged articles
26 articles
Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 30, 2026 · Artificial Intelligence

Editable AI-Generated Research Figures: Introducing AutoFigure-Edit from Westlake University

The article presents AutoFigure-Edit, an open‑source AI system that turns long‑form scientific text into fully editable SVG figures, solves the uneditable‑image problem of existing AIGC tools, and demonstrates superior performance on the FigureBench benchmark and real‑user studies.

AIAutoFigureSVG
0 likes · 11 min read
Editable AI-Generated Research Figures: Introducing AutoFigure-Edit from Westlake University
DaTaobao Tech
DaTaobao Tech
Jan 30, 2026 · Artificial Intelligence

Human‑like LLM Replies for Live Digital Hosts: ASR‑Based Style Transfer and Reward Modeling

This article proposes an ASR‑driven pipeline that creates high‑quality AI‑reply vs. human‑like reply pairs, trains a rewrite model and a reward model, and uses GRPO reinforcement learning to generate natural, helpful, and less AI‑sounding responses in digital‑human live streaming, achieving 92% accuracy and 97% helpfulness while improving user experience.

ASR dataLLMQwen
0 likes · 20 min read
Human‑like LLM Replies for Live Digital Hosts: ASR‑Based Style Transfer and Reward Modeling
Old Meng AI Explorer
Old Meng AI Explorer
Nov 30, 2025 · Artificial Intelligence

Unlock 1‑Minute AI Video Generation with TTT‑Video‑Dit: Break the 3‑Second Limit

TTT‑Video‑Dit is an open‑source framework that uses test‑time‑training and hierarchical attention to generate coherent 63‑second videos with style‑transfer, dramatically reducing GPU memory requirements so a single RTX 4090 can replace costly H100 clusters, enabling creators and developers to produce long AI videos efficiently.

GPU OptimizationStyle TransferTTT-Video-Dit
0 likes · 11 min read
Unlock 1‑Minute AI Video Generation with TTT‑Video‑Dit: Break the 3‑Second Limit
AI Algorithm Path
AI Algorithm Path
Sep 3, 2025 · Artificial Intelligence

15 Real-World Applications of Google’s Nano Banana AI Image Tool

Google’s Nano Banana, an advanced multimodal AI model integrated into Gemini, delivers unprecedented role‑consistency and multi‑step editing, and this article walks through fifteen concrete use cases—from virtual try‑on and background swapping to style transfer, product visualisation, educational graphics, and 3D conversion—showcasing how the tool can streamline creative workflows across industries.

AI image generationGeminiGoogle
0 likes · 9 min read
15 Real-World Applications of Google’s Nano Banana AI Image Tool
ShiZhen AI
ShiZhen AI
Sep 1, 2025 · Artificial Intelligence

Nano Banana: A Next‑Gen AI Image Creation and Editing Guide

Nano Banana, Google’s internal code name for Gemini 2.5 Flash Image, reshapes AI image creation with ten‑fold speed gains over Photoshop, consistent multi‑step editing, dialogue‑driven image manipulation, style‑transfer capabilities, and a community‑validated reputation earned through blind tests on LMArena, while also exposing typical generative‑AI limits such as text rendering glitches and occasional anatomical errors.

AI image generationGemini 2.5 Flash ImageLMArena
0 likes · 20 min read
Nano Banana: A Next‑Gen AI Image Creation and Editing Guide
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 19, 2025 · Artificial Intelligence

How Single Trajectory Distillation Boosts Diffusion Model Speed and Style Quality

The paper introduces Single Trajectory Distillation (STD), a novel training framework that aligns full PF‑ODE trajectories from a fixed noisy state, uses a Trajectory Bank to cut training cost, and adds an Asymmetric Adversarial Loss to markedly improve style consistency and aesthetic quality while accelerating image and video style‑transfer diffusion models.

AI accelerationStyle Transferconsistency models
0 likes · 14 min read
How Single Trajectory Distillation Boosts Diffusion Model Speed and Style Quality
AIWalker
AIWalker
Mar 5, 2025 · Artificial Intelligence

Attention Distillation in Diffusion Models: CVPR 2025 Technique Outperforms Traditional Image Generation

The paper introduces a novel attention‑distillation loss and a guided‑sampling scheme that together enable diffusion models to faithfully transfer visual features from reference images, dramatically speeding synthesis and surpassing prior plug‑and‑play attention methods across style transfer, text‑to‑image generation, and texture synthesis tasks.

AI researchStyle Transferattention distillation
0 likes · 15 min read
Attention Distillation in Diffusion Models: CVPR 2025 Technique Outperforms Traditional Image Generation
AIWalker
AIWalker
Jan 13, 2025 · Artificial Intelligence

ArtCrafter: A Controllable, Diverse Style Transfer Framework from Tsinghua

ArtCrafter introduces a novel text‑image style transfer framework that leverages attention‑based style extraction, text‑image alignment enhancement, and explicit modulation to achieve controllable, diverse, and high‑fidelity visual results, outperforming existing methods in both qualitative and quantitative evaluations.

Attention MechanismStyle Transferdiffusion models
0 likes · 10 min read
ArtCrafter: A Controllable, Diverse Style Transfer Framework from Tsinghua
Baidu Geek Talk
Baidu Geek Talk
Jul 24, 2024 · Artificial Intelligence

AI-Driven Fusion of Peking Opera Characters with Ink-Wash Painting Style Using PaddleGAN

Li Yilin’s AI project blends Peking Opera characters with traditional ink‑wash painting by using PaddleHub for style transfer and PaddleGAN’s First‑Order Motion model for facial motion, then adds music and Wav2Lip lip‑sync, producing videos that modernize Chinese heritage and gauge public cultural awareness.

AIComputer VisionDeep Learning
0 likes · 9 min read
AI-Driven Fusion of Peking Opera Characters with Ink-Wash Painting Style Using PaddleGAN
Baidu MEUX
Baidu MEUX
Jun 19, 2024 · Artificial Intelligence

How Baidu’s AI Publisher Transforms Holiday Images with Offline and Online Style Transfer

This article details Baidu APP’s AI Publisher, explaining the research behind its offline and online stylization modes, the complete generation pipelines, core AI technologies such as template creation, face‑merging, large‑model style transfer, custom model training, and showcases the resulting festive visual effects.

AI image generationBaidu AIStyle Transfer
0 likes · 10 min read
How Baidu’s AI Publisher Transforms Holiday Images with Offline and Online Style Transfer
DaTaobao Tech
DaTaobao Tech
Jun 3, 2024 · Artificial Intelligence

Transforming Interior Design: AIGC’s Text‑to‑Image, Lora, and IP‑Adapter Techniques

This article explains how AI‑generated content (AIGC) technologies such as text‑to‑image diffusion models, Lora fine‑tuning, and IP‑Adapter style transfer are applied to interior design, dramatically reducing design time, cutting costs, and enabling personalized, high‑quality visualizations for both consumers and furniture merchants.

AIGCIP-AdapterLoRA
0 likes · 9 min read
Transforming Interior Design: AIGC’s Text‑to‑Image, Lora, and IP‑Adapter Techniques
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Aug 24, 2023 · Artificial Intelligence

Neural Style Transfer with PyTorch: Theory and Implementation

This article introduces neural style transfer, explains its underlying principles using VGG19 feature extraction, content and style loss definitions, and provides a complete PyTorch implementation with code for loading images, extracting features, computing Gram matrices, and optimizing the output image.

Computer VisionDeep LearningPyTorch
0 likes · 14 min read
Neural Style Transfer with PyTorch: Theory and Implementation
DataFunSummit
DataFunSummit
Apr 1, 2023 · Artificial Intelligence

Overview of Understanding and Editing Vision Models in the ModelScope Community

This article introduces ModelScope's community‑released visual models, detailing the categorization of understanding versus editing models, the two‑stage coarse‑to‑fine segmentation pipeline for matting, and four editing applications—style transfer, portrait beautification, skin enhancement, and anime‑style conversion—while also previewing upcoming sky‑replacement and video‑matting models.

AIModelScopeStyle Transfer
0 likes · 9 min read
Overview of Understanding and Editing Vision Models in the ModelScope Community
Alimama Tech
Alimama Tech
Aug 17, 2022 · Artificial Intelligence

How Multimodal AI Transforms Advertising Copy: From Image Text to Video Scripts

Alibaba’s advertising AI team presents a comprehensive study of four new multimodal copywriting tasks—image overlay text generation, video narration, text style transfer, and detail-page extraction—detailing model architectures, training on billions of images, experimental results, and practical deployment in the “Xiyu” product.

Large-Scale TrainingMultimodal AIStyle Transfer
0 likes · 17 min read
How Multimodal AI Transforms Advertising Copy: From Image Text to Video Scripts
Kuaishou Tech
Kuaishou Tech
Apr 18, 2022 · Artificial Intelligence

SSAN: A Novel Dual‑Stream Network for Domain‑Generalized Face Anti‑Spoofing

This paper proposes SSAN, a novel dual‑stream network that separates content and style features to achieve domain‑generalized face anti‑spoofing, employing adversarial learning for content, contrastive learning for style, and a large‑scale evaluation protocol across twelve public datasets, achieving state‑of‑the‑art performance.

SSANStyle Transfercontrastive learning
0 likes · 16 min read
SSAN: A Novel Dual‑Stream Network for Domain‑Generalized Face Anti‑Spoofing
IT Services Circle
IT Services Circle
Apr 13, 2022 · Artificial Intelligence

Introducing DualStyleGAN, RQ‑VAE Transformer, and VFD: Recent CVPR 2022 Open‑Source Algorithms

Jack Cui presents three recently open‑sourced CVPR 2022 algorithms—DualStyleGAN for high‑resolution portrait style transfer, RQ‑VAE Transformer for improved text‑to‑image generation, and VFD for deep‑fake detection—detailing their functionality, usage options, and providing links to code repositories and demo platforms.

AIGenerative ModelsStyle Transfer
0 likes · 5 min read
Introducing DualStyleGAN, RQ‑VAE Transformer, and VFD: Recent CVPR 2022 Open‑Source Algorithms
DataFunSummit
DataFunSummit
Jan 25, 2022 · Artificial Intelligence

Intelligent Lyric Generation for Music: Techniques, Models, and Future Directions

This article explores how AI and natural language processing technologies are applied to music lyric creation, covering background challenges, rhyme retrieval methods, advanced language models such as SongNet, decoding strategies, style transfer, and a multi‑level generation platform that aims to streamline professional songwriting.

AI lyric generationSongNetStyle Transfer
0 likes · 14 min read
Intelligent Lyric Generation for Music: Techniques, Models, and Future Directions
DataFunSummit
DataFunSummit
Dec 3, 2020 · Artificial Intelligence

GAN Fundamentals, Variants, and Practical Applications in Image Style Transfer and Handwriting Font Generation

This article provides a comprehensive overview of Generative Adversarial Networks, covering their original formulation, training dynamics, loss functions, major variants such as DCGAN and WGAN, and practical implementations for image‑to‑image translation, style transfer, and handwriting font synthesis at Laiye Technology.

Computer VisionDeep LearningGAN
0 likes · 28 min read
GAN Fundamentals, Variants, and Practical Applications in Image Style Transfer and Handwriting Font Generation
Laiye Technology Team
Laiye Technology Team
Nov 25, 2020 · Artificial Intelligence

Comprehensive Overview of GANs: History, Improvements, Applications, and Handwriting Style Transfer

This article provides an in‑depth overview of Generative Adversarial Networks (GANs), covering their original formulation, major variants such as DCGAN and WGAN, challenges like mode collapse, image‑to‑image translation techniques (cGAN, pix2pix, CycleGAN), and practical handwriting style‑transfer implementations using BicycleGAN and Zi2Zi.

GANGenerative Adversarial NetworksImage-to-Image Translation
0 likes · 27 min read
Comprehensive Overview of GANs: History, Improvements, Applications, and Handwriting Style Transfer
DataFunTalk
DataFunTalk
Jun 8, 2020 · Artificial Intelligence

Recap of Baidu ACL 2020 Paper Sharing Session – Papers 4 to 6

The Baidu ACL 2020 paper sharing live session recap presents three NLP research papers—on unsupervised style transfer, sentiment‑knowledge‑enhanced pre‑training, and conversational recommendation over multi‑type dialogs—detailing their novel models, methodologies, and key contributions.

ACL 2020NLPSentiment Analysis
0 likes · 4 min read
Recap of Baidu ACL 2020 Paper Sharing Session – Papers 4 to 6
Programmer DD
Programmer DD
Apr 24, 2020 · Artificial Intelligence

Turn Photos into Studio Ghibli‑Style Anime with AnimeGAN – A Hands‑On Guide

This article introduces AnimeGAN, a lightweight GAN that converts real photos into Japanese anime‑style illustrations, explains its architecture, loss functions, model size advantages, and provides step‑by‑step instructions with code for setting up, training, and testing the TensorFlow implementation.

AnimeGANDeep LearningGAN
0 likes · 8 min read
Turn Photos into Studio Ghibli‑Style Anime with AnimeGAN – A Hands‑On Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 27, 2019 · Artificial Intelligence

Inside Alibaba’s AliPlayStudio: Real-Time AI Video Interaction Techniques

This article details how Alibaba’s AliPlayStudio combines advanced computer‑vision algorithms—such as human semantic segmentation, gesture and pose detection, controllable style transfer, and face‑fusion—optimised for low‑power mobile and embedded devices, to deliver engaging real‑time video interactions across online and offline marketing scenarios.

Mobile AIStyle Transferface fusion
0 likes · 17 min read
Inside Alibaba’s AliPlayStudio: Real-Time AI Video Interaction Techniques