Tagged articles
17 articles
Page 1 of 1
AIWalker
AIWalker
Mar 4, 2026 · Artificial Intelligence

Drifting Models Enable One‑Step Generation, Shattering Speed Records

The paper introduces Drifting Models, a new generative paradigm that moves the distribution evolution to the training phase, achieving true one‑step (1‑NFE) generation with state‑of‑the‑art ImageNet FID scores of 1.54 in latent space and 1.61 in pixel space, while eliminating the need for distillation or classifier‑free guidance.

Drifting ModelsGenerative ModelingImageNet
0 likes · 24 min read
Drifting Models Enable One‑Step Generation, Shattering Speed Records
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Feb 14, 2026 · Artificial Intelligence

Latent Forcing: Reordering Diffusion Steps Boosts Pixel‑Level Image Quality

The new Latent Forcing technique from Fei‑Fei Li’s team reorders the diffusion trajectory, first generating a latent structural sketch and then refining pixel details, which restores efficiency of latent‑space models while preserving 100 % pixel fidelity, achieving state‑of‑the‑art FID scores on ImageNet‑256.

AI researchImageNetdiffusion models
0 likes · 6 min read
Latent Forcing: Reordering Diffusion Steps Boosts Pixel‑Level Image Quality
Data Party THU
Data Party THU
Nov 11, 2025 · Artificial Intelligence

Why Early Adversarial Attacks Still Beat Modern Ones: A Fair Transferability Study

This paper systematically evaluates 23 transferable adversarial attacks and 11 defenses on ImageNet, revealing that early methods like DI outperform many newer attacks when hyper‑parameters are fairly matched, that diffusion‑based defenses give a false sense of security, and that higher transferability often comes at the cost of reduced stealthiness.

ImageNetadversarial attacksdeep learning security
0 likes · 8 min read
Why Early Adversarial Attacks Still Beat Modern Ones: A Fair Transferability Study
AI Frontier Lectures
AI Frontier Lectures
Oct 29, 2025 · Artificial Intelligence

Why Early DI Attacks Outperform Modern Methods: A Systematic Study of Transferable Adversarial Images

This paper systematically evaluates 23 transferable adversarial attacks and 11 defenses on ImageNet, revealing that early DI attacks surpass newer methods when hyper‑parameters are fairly set, diffusion defenses offer false security, and higher transferability often reduces stealthiness, urging fair benchmarking and comprehensive metrics.

ImageNetadversarial attacksdeep learning robustness
0 likes · 7 min read
Why Early DI Attacks Outperform Modern Methods: A Systematic Study of Transferable Adversarial Images
Data Party THU
Data Party THU
Sep 20, 2025 · Artificial Intelligence

How Mamba-Adaptor Revives State‑Space Models for Vision Tasks

The Mamba-Adaptor introduces a dual‑module adapter that overcomes causal computation limits, long‑range memory decay, and spatial structure loss in state‑space models, delivering state‑of‑the‑art results on ImageNet, COCO, and various downstream visual tasks with minimal overhead.

AdapterCOCODeep Learning
0 likes · 8 min read
How Mamba-Adaptor Revives State‑Space Models for Vision Tasks
AI Frontier Lectures
AI Frontier Lectures
May 27, 2025 · Artificial Intelligence

Can One-Step Generative Modeling Beat Multi-Step Diffusion? Inside MeanFlow

The article presents MeanFlow, a novel one‑step generative modeling framework that replaces instantaneous velocity with an average‑velocity field, achieving a record‑low FID of 3.43 on ImageNet 256×256 with a single function evaluation and outperforming both prior single‑step and multi‑step diffusion models.

AI researchFIDImageNet
0 likes · 7 min read
Can One-Step Generative Modeling Beat Multi-Step Diffusion? Inside MeanFlow
DataFunTalk
DataFunTalk
Jun 14, 2024 · Artificial Intelligence

Midjourney’s Diverse Data Sources: Public Datasets, Academic Research, Partner and Proprietary Data

Midjourney enhances its AI models by integrating a wide range of data sources—including public datasets like ImageNet and COCO, academic research from top conferences, partner collaborations, and its own proprietary data—while continuously updating and managing these datasets for quality, privacy, and security.

AI trainingBright DataCOCO
0 likes · 9 min read
Midjourney’s Diverse Data Sources: Public Datasets, Academic Research, Partner and Proprietary Data
Meituan Technology Team
Meituan Technology Team
Mar 24, 2022 · Artificial Intelligence

Twins: Efficient Visual Attention Models for Vision Transformers

The Twins series, a collaboration between Meituan and the University of Adelaide, introduces conditional positional encoding and spatially separable self‑attention to improve efficiency and performance of vision transformers, achieving state‑of‑the‑art results on ImageNet, ADE20K, COCO and high‑precision map segmentation.

ADE20KCOCOConditional Positional Encoding
0 likes · 20 min read
Twins: Efficient Visual Attention Models for Vision Transformers
JD Cloud Developers
JD Cloud Developers
Mar 21, 2022 · Artificial Intelligence

ViTAEv2 Breaks ImageNet Real Record with 91.2% Accuracy – How a 600M‑Parameter Model Redefines Few‑Shot Learning

JD Research Institute and the University of Sydney introduced ViTAEv2, a 600‑million‑parameter deep learning model that achieved a world‑leading 91.2% top‑1 accuracy on ImageNet Real without external data, demonstrating strong few‑shot learning, reducing labeling costs, and promising advances across many computer‑vision tasks.

AI modelComputer VisionDeep Learning
0 likes · 4 min read
ViTAEv2 Breaks ImageNet Real Record with 91.2% Accuracy – How a 600M‑Parameter Model Redefines Few‑Shot Learning
Tencent Tech
Tencent Tech
Aug 26, 2020 · Artificial Intelligence

How Tencent Engineers Shattered the 128‑GPU ImageNet Training Record in 2m31s

Tencent engineers broke the world record for training ImageNet with 128 V100 GPUs in just 2 minutes 31 seconds, detailing a suite of optimizations—including a new Light distributed training framework, single‑machine speed boosts, multi‑machine communication enhancements, and advanced batch convergence techniques—that together dramatically cut training time while maintaining high accuracy.

GPUImageNetTencent Cloud
0 likes · 9 min read
How Tencent Engineers Shattered the 128‑GPU ImageNet Training Record in 2m31s
21CTO
21CTO
Sep 14, 2018 · Artificial Intelligence

From Stanford to Google: How Fei‑Fei Li Built ImageNet and Shaped AI

Fei‑Fei Li, the pioneering AI researcher and former Google Cloud AI lead, rose from humble beginnings in China to create the ImageNet dataset, drive breakthroughs in computer vision, and now returns to Stanford, illustrating how curiosity and perseverance can transform both academia and industry.

Computer VisionFei-Fei LiGoogle AI
0 likes · 12 min read
From Stanford to Google: How Fei‑Fei Li Built ImageNet and Shaped AI
Tencent Architect
Tencent Architect
Jul 30, 2018 · Artificial Intelligence

Four‑Minute ImageNet Training: Tencent’s AI Platform Sets a New World Record

Tencent’s intelligent machine‑learning platform achieved a world‑record by training AlexNet in 4 minutes and ResNet‑50 in 6.6 minutes on ImageNet, using large batch sizes, mixed‑precision, LARS optimization, hierarchical synchronization, gradient fusion, and pipeline I/O techniques to overcome accuracy and scalability challenges.

AI accelerationDeep LearningImageNet
0 likes · 24 min read
Four‑Minute ImageNet Training: Tencent’s AI Platform Sets a New World Record