Tagged articles

generative modeling

13 articles · Page 1 of 1

Jun 23, 2026 · Artificial Intelligence

How Diffusion Models Achieve Generalization: Insights from a CVPR 2026 Tutorial

Diffusion models have set the state‑of‑the‑art in image, video, and audio generation, yet their training objective admits a unique closed‑form solution that merely memorizes training data; this tutorial examines why they still generalize by exploring score smoothing, architectural inductive bias, training dynamics, and data geometry, all illustrated with hands‑on Jupyter notebooks.

CVPR 2026Diffusion Modelsdata geometry

0 likes · 2 min read

How Diffusion Models Achieve Generalization: Insights from a CVPR 2026 Tutorial

DeepHub IMBA

Jun 18, 2026 · Artificial Intelligence

From Bayesian Models to Generative Pre‑trained Transformers (GPT): A Brief History of Generative Learning

The article traces generative learning from its probabilistic roots in Bayesian classification, through Gaussian mixture models, hidden Markov models, N‑gram and neural language models, to attention mechanisms, Transformers and GPT, highlighting how each innovation expanded the ability to model data‑generating processes.

BayesianGPTGaussian Mixture

0 likes · 26 min read

From Bayesian Models to Generative Pre‑trained Transformers (GPT): A Brief History of Generative Learning

Xiaomi Tech

May 26, 2026 · Artificial Intelligence

Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks

Xiaomi Auto introduces a novel world‑model framework that tightly couples 3D reconstruction and generative prediction, delivering state‑of‑the‑art performance on Waymo and nuScenes benchmarks while enabling high‑fidelity, long‑duration video synthesis for autonomous‑driving scenarios.

3D reconstructionBenchmark SOTAXiaomi Auto

0 likes · 10 min read

Xiaomi Auto Unveils Integrated Reconstruction‑Generation World Model Framework Achieving SOTA on Major Benchmarks

Machine Heart

May 10, 2026 · Artificial Intelligence

The First Industry Survey of Vision World Models: Toward a Higher‑Intelligence Visual Paradigm

This survey introduces vision world models as a central driver for AI to learn physical and causal dynamics directly from visual data, presents a unified "representation‑learning‑simulation" framework, categorises four major technical routes, outlines evaluation metrics and datasets, and proposes a 3R roadmap for the next generation of world models.

Evaluation MetricsFuture DirectionsMultimodal AI

0 likes · 15 min read

The First Industry Survey of Vision World Models: Toward a Higher‑Intelligence Visual Paradigm

Bighead's Algorithm Notes

Apr 2, 2026 · Artificial Intelligence

Diffolio: A Diffusion‑Model Framework for Risk‑Aware Portfolio Optimization

Diffolio introduces a diffusion‑model‑based approach that directly learns a pseudo‑optimal portfolio distribution conditioned on user risk preferences, generating diverse high‑quality portfolios and outperforming classic and recent baselines on six real‑world market datasets, with annualized returns improving up to 12.1 percentage points.

diffusion modelfinancial AIgenerative modeling

0 likes · 22 min read

Diffolio: A Diffusion‑Model Framework for Risk‑Aware Portfolio Optimization

Alimama Tech

Mar 26, 2026 · Industry Insights

How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters

The article analyzes the evolution from traditional modular recommendation models to a generative Large User Model (LUM), detailing its three‑stage paradigm, tokenization, training objectives, scaling‑law findings, offline and online experiments, and the AI‑infra innovations that enabled a 4.5% CTR lift in production.

CTR PredictionLarge Language ModelsRecommendation Systems

0 likes · 18 min read

How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters

Machine Learning Algorithms & Natural Language Processing

Mar 15, 2026 · Artificial Intelligence

A 17‑Year‑Old High‑Schooler Becomes First‑Author on a CVPR Paper

A 17‑year‑old high‑school student from Anhui Ansheng School led the first‑author CVPR 2026 paper "CraftMesh," a novel 3D mesh editing framework that combines image editing, mesh generation, and Poisson seamless fusion, achieving superior quantitative metrics and showcasing the growing impact of young researchers in top AI conferences.

3D mesh generationCVPRCraftMesh

0 likes · 7 min read

A 17‑Year‑Old High‑Schooler Becomes First‑Author on a CVPR Paper

AIWalker

Mar 4, 2026 · Artificial Intelligence

Drifting Models Enable One‑Step Generation, Shattering Speed Records

The paper introduces Drifting Models, a new generative paradigm that moves the distribution evolution to the training phase, achieving true one‑step (1‑NFE) generation with state‑of‑the‑art ImageNet FID scores of 1.54 in latent space and 1.61 in pixel space, while eliminating the need for distillation or classifier‑free guidance.

Drifting ModelsImageNetOne-step Generation

0 likes · 24 min read

Drifting Models Enable One‑Step Generation, Shattering Speed Records

Data Party THU

Oct 31, 2025 · Artificial Intelligence

Can AI Generate High‑Fidelity Spectra? Inside MIT’s SpectroGen Breakthrough

MIT’s SpectroGen model uses physics‑informed generative AI to convert a single spectral modality into high‑fidelity cross‑modal spectra, achieving up to 99% correlation with experimental data, dramatically reducing the cost and time of material spectroscopy while preserving detailed spectral features.

AIMaterials ScienceVariational Autoencoder

0 likes · 9 min read

Can AI Generate High‑Fidelity Spectra? Inside MIT’s SpectroGen Breakthrough

AI Frontier Lectures

Jul 8, 2025 · Artificial Intelligence

How LaVin-DiT Unifies Vision Tasks with a Large Diffusion Transformer

The LaVin-DiT paper presents a large vision diffusion transformer that integrates a spatio‑temporal variational auto‑encoder, a joint diffusion transformer with full‑sequence joint attention, and 3D rotary position encoding to enable unified, efficient multi‑task generation for images and videos, and details its training via flow‑matching and experimental results.

3D RoPEJoint Diffusion TransformerST-VAE

0 likes · 12 min read

How LaVin-DiT Unifies Vision Tasks with a Large Diffusion Transformer

Kuaishou Tech

Jun 20, 2025 · Artificial Intelligence

How OneRec Redefines Recommendation with End‑to‑End Generative Modeling and RL Alignment

The OneRec system from Kuaishou replaces traditional cascade recommendation pipelines with an encoder‑decoder architecture, leverages reward‑based preference alignment via reinforcement learning, achieves ten‑fold FLOPs gains, cuts operational costs by 90%, and delivers significant user‑engagement improvements across short‑video and local‑service scenarios.

KuaishouOneRecgenerative modeling

0 likes · 17 min read

How OneRec Redefines Recommendation with End‑to‑End Generative Modeling and RL Alignment

AntTech

Jun 4, 2025 · Artificial Intelligence

LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions

This article presents the LLaDA series of diffusion‑based large language models, explains how their generative‑modeling principle yields language intelligence comparable to autoregressive models, and details the multimodal LLaDA‑V architecture, training methods, experimental results, and broader implications for AI research.

Diffusion ModelsLarge Language ModelsMultimodal AI

0 likes · 10 min read

LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions

DaTaobao Tech

Apr 7, 2025 · Artificial Intelligence

Flow Matching for Generative Modeling

Flow Matching reformulates generative modeling by learning a time‑dependent vector field that deterministically transports Gaussian noise to data, using a neural network trained with an analytically derived L2 loss, yielding simpler training, faster convergence, and deterministic sampling that matches or exceeds diffusion model quality.

AIDiffusion Modelscontinuous normalizing flow

0 likes · 13 min read