Tag

GAN

0 views collected around this technical thread.

DataFunSummit
DataFunSummit
May 6, 2024 · Artificial Intelligence

Advances, Model Types, and Open Challenges of AI‑Generated Content (AIGC) with XiaoBu’s Image Generation Progress

This article reviews the definition, key metrics, and major model families of AI‑generated content, details XiaoBu’s recent breakthroughs in image generation, and discusses open research problems such as evaluation gaps, transformer limitations, and the need for richer multimodal intelligence representations.

AI researchAIGCGAN
0 likes · 14 min read
Advances, Model Types, and Open Challenges of AI‑Generated Content (AIGC) with XiaoBu’s Image Generation Progress
DaTaobao Tech
DaTaobao Tech
Feb 28, 2024 · Artificial Intelligence

A Survey of Image Quality Evaluation Metrics for Text-to-Image Generation

The survey traces the evolution of image‑quality evaluation for text‑to‑image generation—from early handcrafted edge and color cues, through GAN‑era similarity scores such as IS, FID and KID, to modern perceptual and CLIP‑based metrics like LPIPS, CLIPScore, TRIQ, IQT and human‑preference models—highlighting a shift toward semantic, aesthetic, and text‑image alignment measures and forecasting domain‑specific metrics for future diffusion models.

GANGenerative ModelsTransformer
0 likes · 18 min read
A Survey of Image Quality Evaluation Metrics for Text-to-Image Generation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 28, 2024 · Artificial Intelligence

A Survey of Multimodal Image Synthesis and Editing with Generative AI

This comprehensive review examines the rapid advances in generative AI for multimodal image synthesis and editing, covering visual, textual, and audio guidance, model families such as GANs, diffusion, autoregressive, and NeRF, as well as datasets, challenges, and future research directions.

GANdiffusion modelsgenerative AI
0 likes · 6 min read
A Survey of Multimodal Image Synthesis and Editing with Generative AI
Kuaishou Tech
Kuaishou Tech
Dec 28, 2023 · Artificial Intelligence

Kuaishou Audio Team Wins ICASSP 2024 SSI and PLC Challenges with Advanced Speech Enhancement and Packet Loss Concealment

The Kuaishou audio team secured first place in both the ICASSP 2024 Speech Signal Improvement and Audio Deep Packet Loss Concealment challenges by deploying a two‑stage GAN‑based speech enhancement system and a hybrid time‑frequency packet‑loss concealment model that dramatically improve real‑time communication quality.

Audio ProcessingGANICASSP 2024
0 likes · 8 min read
Kuaishou Audio Team Wins ICASSP 2024 SSI and PLC Challenges with Advanced Speech Enhancement and Packet Loss Concealment
php中文网 Courses
php中文网 Courses
Aug 26, 2023 · Artificial Intelligence

Understanding Generative AI: Concepts, Common Models, and Development Guide

Generative AI, a branch of artificial intelligence that creates novel content such as text, images, and music, works by learning patterns from training data, with common models including GANs, VAEs, autoregressive and Transformer-based architectures, and its development involves task definition, data preparation, model design, training, evaluation, and ethical considerations.

Artificial IntelligenceGANModel Development
0 likes · 8 min read
Understanding Generative AI: Concepts, Common Models, and Development Guide
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 31, 2023 · Artificial Intelligence

Overview of Deep Neural Network Architectures

This article provides a comprehensive overview of deep neural network families, introducing twelve major architectures—including Feedforward, CNN, RNN, LSTM, DBN, GAN, Autoencoder, Residual, Capsule, Transformer, Attention, and Deep Reinforcement Learning—explaining their principles, structures, training methods, and offering Python/TensorFlow/PyTorch code examples.

CNNGANPython
0 likes · 29 min read
Overview of Deep Neural Network Architectures
Alimama Tech
Alimama Tech
May 31, 2023 · Artificial Intelligence

Unsupervised Domain Adaptation with Pixel-level Discriminator for Image-aware Layout Generation

Alibaba Mama’s team introduces PDA‑GAN, an unsupervised domain‑adaptation framework employing a lightweight pixel‑level discriminator to align repaired and original image features, enabling image‑aware layout generation that outperforms prior methods on visual‑quality and layout metrics for advertising creatives.

GANLayout Generationcomputer vision
0 likes · 10 min read
Unsupervised Domain Adaptation with Pixel-level Discriminator for Image-aware Layout Generation
DataFunTalk
DataFunTalk
May 10, 2023 · Artificial Intelligence

AI‑Driven Predictive Maintenance for NIO Power: GAN and Conceptor Techniques for PHM

This article presents NIO Power's intelligent equipment health management solution, detailing business background, operational challenges, PHM difficulties, and frontier AI technologies such as GAN‑based unsupervised anomaly detection and Conceptor‑based small‑sample fault diagnosis, illustrated with real‑world case studies and a comprehensive Q&A.

Anomaly DetectionConceptorGAN
0 likes · 28 min read
AI‑Driven Predictive Maintenance for NIO Power: GAN and Conceptor Techniques for PHM
DeWu Technology
DeWu Technology
Feb 13, 2023 · Artificial Intelligence

Overview of AI-Generated Art: GAN, Diffusion Models, and Stable Diffusion Applications

The article surveys AI‑generated art, explaining how GANs’ limitations gave way to diffusion models and the open‑source Stable Diffusion platform, which offers text‑to‑image, img2img, inpainting, DreamBooth fine‑tuning, and widespread commercial and DIY deployments via cloud or local WebUI setups.

AI artGANStable Diffusion
0 likes · 13 min read
Overview of AI-Generated Art: GAN, Diffusion Models, and Stable Diffusion Applications
DataFunSummit
DataFunSummit
Feb 4, 2023 · Artificial Intelligence

Overview of Deep Learning Algorithms: Supervised, Unsupervised, and Semi‑Supervised Methods

This article introduces deep learning as a powerful AI technique, explains its core algorithms—including supervised, unsupervised, and semi‑supervised approaches—and provides concrete examples such as CNN, RNN, autoencoders, GAN, self‑supervised and transfer learning, illustrated with visual demos.

AIGANdeep learning
0 likes · 6 min read
Overview of Deep Learning Algorithms: Supervised, Unsupervised, and Semi‑Supervised Methods
AntTech
AntTech
Dec 20, 2022 · Artificial Intelligence

Towards Smooth Video Composition: A New Benchmark for GAN‑Based Video Generation

Researchers from multiple institutions propose a GAN‑based video generation framework that explicitly models short‑, medium‑, and long‑range temporal relations, introduces B‑spline motion embeddings and temporal shift modules, and demonstrates substantial quality improvements across several video datasets.

B-splineGANStyleGAN-V
0 likes · 7 min read
Towards Smooth Video Composition: A New Benchmark for GAN‑Based Video Generation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 24, 2022 · Artificial Intelligence

Spectral Normalization in GANs: Theory, Implementation, and Code Walkthrough

This article explains the motivation behind Spectral Normalization for stabilizing GAN training, derives its mathematical foundation, shows how it enforces the Lipschitz constraint, and provides a complete PyTorch implementation with detailed code explanations.

GANLipschitz ContinuityPyTorch
0 likes · 11 min read
Spectral Normalization in GANs: Theory, Implementation, and Code Walkthrough
HelloTech
HelloTech
Oct 19, 2022 · Artificial Intelligence

Intelligent Creative System: Types, Quality Evaluation, Generation Models, and Optimization

The Intelligent Creative System defines advertising creatives across formats, evaluates image and text quality using reference‑based metrics and models like DeepBIQ, generates multimodal ads via GANs and Transformers, and selects optimal variants through bandit‑based CTR prediction and multimodal fusion, enabling scalable, data‑driven creative production.

AIBandit ModelGAN
0 likes · 10 min read
Intelligent Creative System: Types, Quality Evaluation, Generation Models, and Optimization
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 11, 2022 · Artificial Intelligence

GANomaly: Theory and Source Code Analysis

This article explains the GANomaly model for semi‑supervised anomaly detection, detailing its generator‑encoder‑discriminator architecture, loss functions, testing phase scoring, and provides annotated PyTorch source code to help readers implement and understand the approach.

Anomaly DetectionEncoder-DecoderGAN
0 likes · 15 min read
GANomaly: Theory and Source Code Analysis
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 8, 2022 · Artificial Intelligence

Wasserstein GAN (WGAN): Theory and Hands‑On Implementation

This article explains why traditional GANs suffer from training instability, introduces the Wasserstein (Earth‑Mover) distance as a smoother alternative, derives the WGAN objective, discusses Lipschitz constraints, and provides practical PyTorch code modifications to convert a vanilla GAN into a stable WGAN.

GANPyTorchWGAN
0 likes · 21 min read
Wasserstein GAN (WGAN): Theory and Hands‑On Implementation
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Sep 28, 2022 · Artificial Intelligence

EGBAD: Efficient GAN‑Based Anomaly Detection – Theory and Practical Implementation

This article introduces the EGBAD model, an efficient GAN‑based anomaly detection method that replaces AnoGAN's costly latent variable search with an encoder, provides detailed PyTorch code for data loading, model construction, training, and inference, and compares its testing speed with AnoGAN.

Anomaly DetectionDiscriminatorEGBAD
0 likes · 18 min read
EGBAD: Efficient GAN‑Based Anomaly Detection – Theory and Practical Implementation
DataFunSummit
DataFunSummit
Sep 4, 2022 · Artificial Intelligence

Sparse Features in Machine Learning: Challenges, NVIDIA Ampere Structured Sparsity, Knowledge Distillation, and GAN Model Compression

This talk explores the challenges and opportunities of leveraging sparsity in machine learning models, covering fine‑grained and coarse‑grained sparsity, NVIDIA Ampere’s 2:4 structured sparsity, knowledge‑distillation techniques for converting unstructured to structured sparsity, and model compression strategies for generative adversarial networks.

GANGPU Accelerationdeep learning
0 likes · 14 min read
Sparse Features in Machine Learning: Challenges, NVIDIA Ampere Structured Sparsity, Knowledge Distillation, and GAN Model Compression
Beike Product & Technology
Beike Product & Technology
Aug 12, 2022 · Artificial Intelligence

Green Area Generation Method Based on Pix2pix Model

This paper proposes a pix2pix‑based method to automatically generate green areas for large‑scale outdoor 3D scene modeling, detailing dataset creation via OpenCV segmentation, model training, region partitioning, and experimental results showing a 93.8% acceptance rate, significantly improving efficiency over manual drawing.

3D modelingGANcomputer vision
0 likes · 14 min read
Green Area Generation Method Based on Pix2pix Model
Alimama Tech
Alimama Tech
Jul 27, 2022 · Artificial Intelligence

Content-aware Automatic Graphic Layout Generation

The paper introduces a perception‑driven automatic graphic layout system that aligns advertising creatives with product images via a domain‑alignment module and generates content‑aware layouts using a multi‑scale CNN‑Transformer generator, achieving superior aesthetic quality and lower element overlap compared with existing template‑based and deep‑learning methods.

GANadvertisingcontent-aware
0 likes · 10 min read
Content-aware Automatic Graphic Layout Generation