Tagged articles
10 articles
Page 1 of 1
Machine Heart
Machine Heart
May 10, 2026 · Artificial Intelligence

The First Industry Survey of Vision World Models: Toward a Higher‑Intelligence Visual Paradigm

This survey introduces vision world models as a central driver for AI to learn physical and causal dynamics directly from visual data, presents a unified "representation‑learning‑simulation" framework, categorises four major technical routes, outlines evaluation metrics and datasets, and proposes a 3R roadmap for the next generation of world models.

Evaluation MetricsFuture DirectionsGenerative Modeling
0 likes · 15 min read
The First Industry Survey of Vision World Models: Toward a Higher‑Intelligence Visual Paradigm
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Apr 2, 2026 · Artificial Intelligence

Diffolio: A Diffusion‑Model Framework for Risk‑Aware Portfolio Optimization

Diffolio introduces a diffusion‑model‑based approach that directly learns a pseudo‑optimal portfolio distribution conditioned on user risk preferences, generating diverse high‑quality portfolios and outperforming classic and recent baselines on six real‑world market datasets, with annualized returns improving up to 12.1 percentage points.

Financial AIGenerative ModelingQuantitative Finance
0 likes · 22 min read
Diffolio: A Diffusion‑Model Framework for Risk‑Aware Portfolio Optimization
Alimama Tech
Alimama Tech
Mar 26, 2026 · Industry Insights

How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters

The article analyzes the evolution from traditional modular recommendation models to a generative Large User Model (LUM), detailing its three‑stage paradigm, tokenization, training objectives, scaling‑law findings, offline and online experiments, and the AI‑infra innovations that enabled a 4.5% CTR lift in production.

CTR predictionGenerative ModelingLarge Language Models
0 likes · 18 min read
How Alibaba’s Large User Model (LUM) Boosted CTR by 4.5% and Scaled to Billions of Parameters
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 15, 2026 · Artificial Intelligence

A 17‑Year‑Old High‑Schooler Becomes First‑Author on a CVPR Paper

A 17‑year‑old high‑school student from Anhui Ansheng School led the first‑author CVPR 2026 paper "CraftMesh," a novel 3D mesh editing framework that combines image editing, mesh generation, and Poisson seamless fusion, achieving superior quantitative metrics and showcasing the growing impact of young researchers in top AI conferences.

3D mesh generationCVPRComputer Vision
0 likes · 7 min read
A 17‑Year‑Old High‑Schooler Becomes First‑Author on a CVPR Paper
AIWalker
AIWalker
Mar 4, 2026 · Artificial Intelligence

Drifting Models Enable One‑Step Generation, Shattering Speed Records

The paper introduces Drifting Models, a new generative paradigm that moves the distribution evolution to the training phase, achieving true one‑step (1‑NFE) generation with state‑of‑the‑art ImageNet FID scores of 1.54 in latent space and 1.61 in pixel space, while eliminating the need for distillation or classifier‑free guidance.

Drifting ModelsGenerative ModelingImageNet
0 likes · 24 min read
Drifting Models Enable One‑Step Generation, Shattering Speed Records
Data Party THU
Data Party THU
Oct 31, 2025 · Artificial Intelligence

Can AI Generate High‑Fidelity Spectra? Inside MIT’s SpectroGen Breakthrough

MIT’s SpectroGen model uses physics‑informed generative AI to convert a single spectral modality into high‑fidelity cross‑modal spectra, achieving up to 99% correlation with experimental data, dramatically reducing the cost and time of material spectroscopy while preserving detailed spectral features.

Generative ModelingMaterials ScienceVariational Autoencoder
0 likes · 9 min read
Can AI Generate High‑Fidelity Spectra? Inside MIT’s SpectroGen Breakthrough
AI Frontier Lectures
AI Frontier Lectures
Jul 8, 2025 · Artificial Intelligence

How LaVin-DiT Unifies Vision Tasks with a Large Diffusion Transformer

The LaVin-DiT paper presents a large vision diffusion transformer that integrates a spatio‑temporal variational auto‑encoder, a joint diffusion transformer with full‑sequence joint attention, and 3D rotary position encoding to enable unified, efficient multi‑task generation for images and videos, and details its training via flow‑matching and experimental results.

3D RoPEComputer VisionGenerative Modeling
0 likes · 12 min read
How LaVin-DiT Unifies Vision Tasks with a Large Diffusion Transformer
Kuaishou Tech
Kuaishou Tech
Jun 20, 2025 · Artificial Intelligence

How OneRec Redefines Recommendation with End‑to‑End Generative Modeling and RL Alignment

The OneRec system from Kuaishou replaces traditional cascade recommendation pipelines with an encoder‑decoder architecture, leverages reward‑based preference alignment via reinforcement learning, achieves ten‑fold FLOPs gains, cuts operational costs by 90%, and delivers significant user‑engagement improvements across short‑video and local‑service scenarios.

Generative ModelingKuaishouOneRec
0 likes · 17 min read
How OneRec Redefines Recommendation with End‑to‑End Generative Modeling and RL Alignment
AntTech
AntTech
Jun 4, 2025 · Artificial Intelligence

LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions

This article presents the LLaDA series of diffusion‑based large language models, explains how their generative‑modeling principle yields language intelligence comparable to autoregressive models, and details the multimodal LLaDA‑V architecture, training methods, experimental results, and broader implications for AI research.

Diffusion ModelsGenerative ModelingLarge Language Models
0 likes · 10 min read
LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions
DaTaobao Tech
DaTaobao Tech
Apr 7, 2025 · Artificial Intelligence

Flow Matching for Generative Modeling

Flow Matching reformulates generative modeling by learning a time‑dependent vector field that deterministically transports Gaussian noise to data, using a neural network trained with an analytically derived L2 loss, yielding simpler training, faster convergence, and deterministic sampling that matches or exceeds diffusion model quality.

Diffusion ModelsGenerative Modelingai
0 likes · 13 min read
Flow Matching for Generative Modeling