Tagged articles
15 articles
Page 1 of 1
Data Party THU
Data Party THU
Mar 25, 2026 · Artificial Intelligence

How Knowledge‑Guided Context Optimization Boosts Zero‑Shot Vision‑Language Models

The article analyzes the Base‑to‑New generalization problem of CLIP‑based visual‑language models, explains why standard prompt tuning (CoOp) forgets base knowledge, and presents the KgCoOp framework that adds a knowledge‑guided loss to keep learned prompts close to hand‑crafted ones, dramatically improving unseen‑class performance while preserving efficiency.

CLIPGeneralizationKnowledge-guided Optimization
0 likes · 12 min read
How Knowledge‑Guided Context Optimization Boosts Zero‑Shot Vision‑Language Models
Tencent Advertising Technology
Tencent Advertising Technology
Nov 28, 2025 · Artificial Intelligence

How Retrv-R1 Redefines Universal Multimodal Retrieval with Reasoning‑Driven MLLM

Retrv‑R1, a reasoning‑driven multimodal large language model framework, tackles the precision‑efficiency dilemma of universal multimodal retrieval by introducing a two‑stage coarse‑to‑fine pipeline, an information‑compression module, a detail‑inspection mechanism, and a three‑stage training strategy, achieving SOTA performance across accuracy, efficiency, and generalization benchmarks.

GeneralizationMLLMMultimodal Retrieval
0 likes · 21 min read
How Retrv-R1 Redefines Universal Multimodal Retrieval with Reasoning‑Driven MLLM
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 14, 2025 · Artificial Intelligence

Why Standard SFT Fails to Generalize and How One‑Line Dynamic Fine‑Tuning Fixes It

The article analyzes the poor generalization of supervised fine‑tuning (SFT) for large language models, reveals its gradient as a high‑variance inverse‑probability policy gradient, proposes a one‑line Dynamic Fine‑Tuning correction, and shows substantial gains on challenging math and offline RL benchmarks.

Dynamic Fine-TuningGeneralizationLLM alignment
0 likes · 7 min read
Why Standard SFT Fails to Generalize and How One‑Line Dynamic Fine‑Tuning Fixes It
DataFunTalk
DataFunTalk
Feb 26, 2024 · Artificial Intelligence

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

With the rapid rise of ChatGPT and large language models, recommendation systems are undergoing a transformative shift, moving beyond traditional behavior‑based methods to leverage LLMs for improved generalization, representation, and prompt‑based learning, while addressing challenges such as scalability, interpretability, bias, and deployment costs.

AIGeneralizationLLM
0 likes · 19 min read
Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 13, 2024 · Artificial Intelligence

How to Boost Reward Model Performance in RLHF: Data and Algorithm Strategies from the MOSS Report

This article analyzes the MOSS technical report on RLHF, identifying low data quality and poor model generalization as key challenges, and presents data‑centric and algorithmic solutions—including multi‑model preference strength measurement, soft labels, adaptive margins, contrastive learning, and MetaRM—backed by detailed experiments and visualizations.

GeneralizationMeta LearningPreference Strength
0 likes · 12 min read
How to Boost Reward Model Performance in RLHF: Data and Algorithm Strategies from the MOSS Report
DataFunSummit
DataFunSummit
Nov 7, 2023 · Artificial Intelligence

Instrumental Variable Based Causal Inference and Generalizable Causal Learning

This article presents a comprehensive overview of using instrumental variables for causal inference and causal generalization in machine learning, discussing deep learning limitations, Pearl's causal hierarchy, two‑stage regression, challenges with unobserved confounders, automatic IV generation, and applications in economics and social networks.

Generalizationcausal inferencecausal learning
0 likes · 16 min read
Instrumental Variable Based Causal Inference and Generalizable Causal Learning
21CTO
21CTO
Aug 15, 2023 · Artificial Intelligence

Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google

Google’s recent research reveals that when small neural networks are trained for extended periods on tasks like modular addition, they can abruptly shift from memorizing training data to genuinely generalizing—a sudden “grokking” phenomenon driven by weight decay and the emergence of periodic weight structures.

AI researchGeneralizationMLP
0 likes · 9 min read
Why Do Neural Networks Suddenly ‘Grok’ After Long Training? Insights from Google
DataFunTalk
DataFunTalk
Oct 10, 2021 · Artificial Intelligence

Adaptive Universal Generalized PageRank Graph Neural Network (GPR‑GNN): Overview, Challenges, and Experimental Insights

This article presents an in‑depth overview of the Adaptive Universal Generalized PageRank Graph Neural Network (GPR‑GNN), explains the two main limitations of conventional GNNs—lack of generality across homophilic and heterophilic graphs and over‑smoothing—describes the GPR‑GNN architecture with learnable propagation weights, and summarizes synthetic and real‑world experiments that demonstrate its superior generality, resistance to over‑smoothing, interpretability, and potential future extensions.

GNNGeneralizationGeneralized PageRank
0 likes · 18 min read
Adaptive Universal Generalized PageRank Graph Neural Network (GPR‑GNN): Overview, Challenges, and Experimental Insights
Architects Research Society
Architects Research Society
Sep 19, 2021 · Fundamentals

Understanding UML Associations, Aggregations, Compositions, Generalization and Specialization

This article explains UML relationship types—including association, aggregation, composition, generalization and specialization—by describing their definitions, visual notations, multiplicity, role names, and real‑world examples, helping readers distinguish each concept and apply them in software modeling.

AssociationGeneralizationSpecialization
0 likes · 7 min read
Understanding UML Associations, Aggregations, Compositions, Generalization and Specialization
DataFunSummit
DataFunSummit
Mar 16, 2021 · Artificial Intelligence

Myths and Misconceptions in Reinforcement Learning – Summary of Csaba Szepesvári’s KDD 2020 Deep Learning Day Talk

This article summarizes Csaba Szepesvári’s 2020 KDD Deep Learning Day presentation on common myths and misconceptions in reinforcement learning, covering the scope of RL, safety concerns, generalization challenges, causal reasoning, and broader meta‑considerations for the field.

GeneralizationMeta‑LearningMyths
0 likes · 16 min read
Myths and Misconceptions in Reinforcement Learning – Summary of Csaba Szepesvári’s KDD 2020 Deep Learning Day Talk
Python Programming Learning Circle
Python Programming Learning Circle
Dec 14, 2020 · Artificial Intelligence

Notes on Feasibility, Hoeffding Inequality, and VC Theory from Lin Xuantian's Machine Learning Foundations Course

These concise notes summarize key concepts from Professor Lin Xuantian's Machine Learning Foundations course, covering feasibility of learning, Hoeffding and multi‑bin Hoeffding inequalities, VC bounds, dichotomies, growth and bounding functions, VC dimension, and their implications for model and sample complexity.

GeneralizationHoeffding InequalityVC Theory
0 likes · 8 min read
Notes on Feasibility, Hoeffding Inequality, and VC Theory from Lin Xuantian's Machine Learning Foundations Course
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Jun 27, 2020 · Fundamentals

Understanding UML Class Diagram Relationships: Generalization, Realization, Aggregation, Composition, Association, and Dependency

This article explains how to correctly draw UML class diagrams by describing the six main relationship types—generalization, realization, aggregation, composition, association, and dependency—using clear examples and visual illustrations to help readers understand and apply each relationship in software modeling.

Class DiagramGeneralizationModeling
0 likes · 5 min read
Understanding UML Class Diagram Relationships: Generalization, Realization, Aggregation, Composition, Association, and Dependency
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 26, 2018 · Artificial Intelligence

How We Won OpenAI’s Retro Contest: Joint PPO and Generalization in Sonic

This article details the technical journey behind Alibaba’s champion solution in OpenAI’s Retro Contest, explaining the reinforcement‑learning challenges of playing Sonic, the joint PPO approach, distributed training optimizations, reward shaping, fine‑tuning with DeepMimic, and the final performance that secured first place.

GeneralizationOpenAI Retro Contestjoint PPO
0 likes · 20 min read
How We Won OpenAI’s Retro Contest: Joint PPO and Generalization in Sonic
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 13, 2018 · Artificial Intelligence

What Machine Learning Can Teach Us About Growing Up

Using a stroll conversation among Ant Financial AI team members, the article likens machine learning concepts such as overfitting, generalization, supervised and unsupervised learning, transfer learning, and model interpretability to human development stages, illustrating how both require diverse data, training, and evolving algorithms.

AI educationGeneralizationhuman development
0 likes · 10 min read
What Machine Learning Can Teach Us About Growing Up
21CTO
21CTO
Dec 16, 2017 · Artificial Intelligence

Unveiling the Mathematics Behind Deep Learning Success

This article reviews recent research that mathematically explains why deep learning, especially convolutional neural networks, achieve remarkable performance by examining core factors such as architecture, regularization, and optimization, and discusses properties like global optimality, geometric stability, and invariant representations.

Deep LearningGeneralizationNeural Networks
0 likes · 16 min read
Unveiling the Mathematics Behind Deep Learning Success