Tagged articles

generalization

18 articles · Page 1 of 1

Jun 23, 2026 · Artificial Intelligence

How Diffusion Models Achieve Generalization: Insights from a CVPR 2026 Tutorial

Diffusion models have set the state‑of‑the‑art in image, video, and audio generation, yet their training objective admits a unique closed‑form solution that merely memorizes training data; this tutorial examines why they still generalize by exploring score smoothing, architectural inductive bias, training dynamics, and data geometry, all illustrated with hands‑on Jupyter notebooks.

CVPR 2026Diffusion Modelsdata geometry

0 likes · 2 min read

How Diffusion Models Achieve Generalization: Insights from a CVPR 2026 Tutorial

Machine Heart

Jun 10, 2026 · Artificial Intelligence

How BEV Propels Embodied Intelligence: Scaling Robot Data with Dexterity‑BEV

The article analyzes how the Dexterity‑BEV approach unifies heterogeneous robot sensor streams into a single bird's‑eye‑view coordinate system, aligning vision, state, action, and time to enable scalable, generalizable embodied AI, drawing parallels with the transformative impact of BEV in autonomous driving.

BEVDexterity-BEVEmbodied AI

0 likes · 11 min read

How BEV Propels Embodied Intelligence: Scaling Robot Data with Dexterity‑BEV

Machine Heart

Jun 7, 2026 · Artificial Intelligence

How RoboScience’s Bi-Adapt Framework Tackles Embodied Intelligence Generalization Bottlenecks

RoboScience’s team secured consecutive ICRA best‑paper finalist spots with Bi‑Adapt and D(R,O) Grasp, presenting a few‑shot bimanual adaptation framework and a unified grasp model that together bridge top‑tier research to scalable embodied AI by overcoming cross‑category generalization challenges.

Embodied AIICRAVLOA

0 likes · 11 min read

How RoboScience’s Bi-Adapt Framework Tackles Embodied Intelligence Generalization Bottlenecks

Data Party THU

Mar 25, 2026 · Artificial Intelligence

How Knowledge‑Guided Context Optimization Boosts Zero‑Shot Vision‑Language Models

The article analyzes the Base‑to‑New generalization problem of CLIP‑based visual‑language models, explains why standard prompt tuning (CoOp) forgets base knowledge, and presents the KgCoOp framework that adds a knowledge‑guided loss to keep learned prompts close to hand‑crafted ones, dramatically improving unseen‑class performance while preserving efficiency.

CLIPKnowledge-guided OptimizationPrompt Tuning

0 likes · 12 min read

How Knowledge‑Guided Context Optimization Boosts Zero‑Shot Vision‑Language Models

Tencent Advertising Technology

Nov 28, 2025 · Artificial Intelligence

How Retrv-R1 Redefines Universal Multimodal Retrieval with Reasoning‑Driven MLLM

Retrv‑R1, a reasoning‑driven multimodal large language model framework, tackles the precision‑efficiency dilemma of universal multimodal retrieval by introducing a two‑stage coarse‑to‑fine pipeline, an information‑compression module, a detail‑inspection mechanism, and a three‑stage training strategy, achieving SOTA performance across accuracy, efficiency, and generalization benchmarks.

EfficiencyMLLMdetail inspection

0 likes · 21 min read

How Retrv-R1 Redefines Universal Multimodal Retrieval with Reasoning‑Driven MLLM

Baobao Algorithm Notes

Aug 14, 2025 · Artificial Intelligence

Why Standard SFT Fails to Generalize and How One‑Line Dynamic Fine‑Tuning Fixes It

The article analyzes the poor generalization of supervised fine‑tuning (SFT) for large language models, reveals its gradient as a high‑variance inverse‑probability policy gradient, proposes a one‑line Dynamic Fine‑Tuning correction, and shows substantial gains on challenging math and offline RL benchmarks.

Dynamic Fine-TuningLLM alignmentSupervised Fine‑Tuning

0 likes · 7 min read

Why Standard SFT Fails to Generalize and How One‑Line Dynamic Fine‑Tuning Fixes It

DataFunTalk

Feb 26, 2024 · Artificial Intelligence

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

With the rapid rise of ChatGPT and large language models, recommendation systems are undergoing a transformative shift, moving beyond traditional behavior‑based methods to leverage LLMs for improved generalization, representation, and prompt‑based learning, while addressing challenges such as scalability, interpretability, bias, and deployment costs.

AILLMRepresentation

0 likes · 19 min read

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

Baobao Algorithm Notes

Jan 13, 2024 · Artificial Intelligence

How to Boost Reward Model Performance in RLHF: Data and Algorithm Strategies from the MOSS Report

This article analyzes the MOSS technical report on RLHF, identifying low data quality and poor model generalization as key challenges, and presents data‑centric and algorithmic solutions—including multi‑model preference strength measurement, soft labels, adaptive margins, contrastive learning, and MetaRM—backed by detailed experiments and visualizations.

Meta LearningPreference StrengthRLHF

0 likes · 12 min read

How to Boost Reward Model Performance in RLHF: Data and Algorithm Strategies from the MOSS Report

DataFunSummit

Nov 7, 2023 · Artificial Intelligence

Instrumental Variable Based Causal Inference and Generalizable Causal Learning

This article presents a comprehensive overview of using instrumental variables for causal inference and causal generalization in machine learning, discussing deep learning limitations, Pearl's causal hierarchy, two‑stage regression, challenges with unobserved confounders, automatic IV generation, and applications in economics and social networks.

causal inferencecausal learninggeneralization

0 likes · 16 min read

Instrumental Variable Based Causal Inference and Generalizable Causal Learning

21CTO

Aug 15, 2023 · Artificial Intelligence

Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google

Google’s recent research reveals that when small neural networks are trained for extended periods on tasks like modular addition, they can abruptly shift from memorizing training data to genuinely generalizing—a sudden “grokking” phenomenon driven by weight decay and the emergence of periodic weight structures.

AI researchMLPWeight Decay

0 likes · 9 min read

Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google

DataFunTalk

Oct 10, 2021 · Artificial Intelligence

Adaptive Universal Generalized PageRank Graph Neural Network (GPR‑GNN): Overview, Challenges, and Experimental Insights

This article presents an in‑depth overview of the Adaptive Universal Generalized PageRank Graph Neural Network (GPR‑GNN), explains the two main limitations of conventional GNNs—lack of generality across homophilic and heterophilic graphs and over‑smoothing—describes the GPR‑GNN architecture with learnable propagation weights, and summarizes synthetic and real‑world experiments that demonstrate its superior generality, resistance to over‑smoothing, interpretability, and potential future extensions.

GNNGeneralized PageRankOver‑smoothing

0 likes · 18 min read

Adaptive Universal Generalized PageRank Graph Neural Network (GPR‑GNN): Overview, Challenges, and Experimental Insights

Architects Research Society

Sep 19, 2021 · Fundamentals

Understanding UML Associations, Aggregations, Compositions, Generalization and Specialization

This article explains UML relationship types—including association, aggregation, composition, generalization and specialization—by describing their definitions, visual notations, multiplicity, role names, and real‑world examples, helping readers distinguish each concept and apply them in software modeling.

AggregationAssociationSpecialization

0 likes · 7 min read

Understanding UML Associations, Aggregations, Compositions, Generalization and Specialization

DataFunSummit

Mar 16, 2021 · Artificial Intelligence

Myths and Misconceptions in Reinforcement Learning – Summary of Csaba Szepesvári’s KDD 2020 Deep Learning Day Talk

This article summarizes Csaba Szepesvári’s 2020 KDD Deep Learning Day presentation on common myths and misconceptions in reinforcement learning, covering the scope of RL, safety concerns, generalization challenges, causal reasoning, and broader meta‑considerations for the field.

Meta‑LearningMythsSafety

0 likes · 16 min read

Myths and Misconceptions in Reinforcement Learning – Summary of Csaba Szepesvári’s KDD 2020 Deep Learning Day Talk

Python Programming Learning Circle

Dec 14, 2020 · Artificial Intelligence

Notes on Feasibility, Hoeffding Inequality, and VC Theory from Lin Xuantian's Machine Learning Foundations Course

These concise notes summarize key concepts from Professor Lin Xuantian's Machine Learning Foundations course, covering feasibility of learning, Hoeffding and multi‑bin Hoeffding inequalities, VC bounds, dichotomies, growth and bounding functions, VC dimension, and their implications for model and sample complexity.

Hoeffding InequalityVC Theorygeneralization

0 likes · 8 min read

Notes on Feasibility, Hoeffding Inequality, and VC Theory from Lin Xuantian's Machine Learning Foundations Course

Full-Stack Internet Architecture

Jun 27, 2020 · Fundamentals

Understanding UML Class Diagram Relationships: Generalization, Realization, Aggregation, Composition, Association, and Dependency

This article explains how to correctly draw UML class diagrams by describing the six main relationship types—generalization, realization, aggregation, composition, association, and dependency—using clear examples and visual illustrations to help readers understand and apply each relationship in software modeling.

Class DiagramRelationshipsUML

0 likes · 5 min read

Understanding UML Class Diagram Relationships: Generalization, Realization, Aggregation, Composition, Association, and Dependency

Alibaba Cloud Developer

Jul 26, 2018 · Artificial Intelligence

How We Won OpenAI’s Retro Contest: Joint PPO and Generalization in Sonic

This article details the technical journey behind Alibaba’s champion solution in OpenAI’s Retro Contest, explaining the reinforcement‑learning challenges of playing Sonic, the joint PPO approach, distributed training optimizations, reward shaping, fine‑tuning with DeepMimic, and the final performance that secured first place.

OpenAI Retro Contestgeneralizationjoint PPO

0 likes · 20 min read

How We Won OpenAI’s Retro Contest: Joint PPO and Generalization in Sonic

Alibaba Cloud Developer

Jun 13, 2018 · Artificial Intelligence

What Machine Learning Can Teach Us About Growing Up

Using a stroll conversation among Ant Financial AI team members, the article likens machine learning concepts such as overfitting, generalization, supervised and unsupervised learning, transfer learning, and model interpretability to human development stages, illustrating how both require diverse data, training, and evolving algorithms.

AI Educationgeneralizationhuman development

0 likes · 10 min read

What Machine Learning Can Teach Us About Growing Up

21CTO

Dec 16, 2017 · Artificial Intelligence

Unveiling the Mathematics Behind Deep Learning Success

This article reviews recent research that mathematically explains why deep learning, especially convolutional neural networks, achieve remarkable performance by examining core factors such as architecture, regularization, and optimization, and discusses properties like global optimality, geometric stability, and invariant representations.

Deep Learninggeneralizationinformation theory

0 likes · 16 min read

Unveiling the Mathematics Behind Deep Learning Success