Tagged articles
12 articles
Baidu Intelligent Cloud Tech Hub
Jun 10, 2026 · Artificial Intelligence

LU‑KV Sets New SOTA at ICML 2026 by Redefining KV Cache Eviction

A joint effort by Baidu Baige and Fudan University introduces the LU‑KV framework, which treats KV‑cache budget allocation as a global combinatorial optimization problem, achieving only 0.52% relative performance loss at 80% compression and establishing a new efficiency‑accuracy SOTA on LongBench.

Cache Eviction · ICML 2026 · KV cache
0 likes · 5 min read
Machine Heart
Jun 8, 2026 · Artificial Intelligence

Can Text-to-Image Models Forget Prompts? Prompt Reinjection Boosts Instruction Following Without Retraining

The paper reveals that multimodal diffusion transformers often lose fine‑grained textual semantics in deeper layers—a phenomenon called Prompt Forgetting—and introduces Prompt Reinjection, a training‑free inference technique that re‑injects shallow text features to markedly improve text‑image alignment and instruction compliance while preserving visual quality and incurring negligible computational overhead.

ICML 2026 · Multimodal Diffusion Transformers · Prompt Forgetting
0 likes · 9 min read
Alimama Tech
Jun 4, 2026 · Artificial Intelligence

ICML 2026 Highlights: Five Taotian Group Papers Pushing Multimodal AI Boundaries

The article showcases five ICML 2026 papers from the Taotian Group that tackle core multimodal AI challenges—interactive video try‑on, high‑resolution vision, e‑commerce video reasoning, sparse‑reward reinforcement learning, and curriculum learning for large language models—detailing their problem statements, novel solutions, and strong experimental results.

Curriculum Learning · ICML 2026 · Reinforcement Learning
0 likes · 15 min read
Data Party THU
May 31, 2026 · Artificial Intelligence

Why Do AI Agents Get Dumber Over Time? ICML 2026's Theory of Agent Explains

The article introduces the ICML 2026 Theory of Agent (ToA), analyzes four common failure modes of modern agents, explains the internal‑vs‑external tool trade‑off through a knowledge‑boundary framework, and outlines how effort‑conservation and the β parameter guide self‑evolving agent design and future research.

AI agents · ICML 2026 · Self‑Evolution
0 likes · 24 min read
Machine Heart
May 31, 2026 · Artificial Intelligence

LMNet: Enabling Language Models to Self‑Organize into Networks

The paper introduces Language Model Networks (LMNet), a framework that lets pretrained large language models act as reusable compute nodes communicating via dense, trainable vectors, showing measurable performance gains on general and supervised adaptation tasks with minimal extra training cost.

ICML 2026 · LLM collaboration · LMNet
0 likes · 10 min read
Machine Heart
May 22, 2026 · Artificial Intelligence

Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts

The paper presents MP‑MoE, a new Mixture‑of‑Experts architecture that replaces top‑k routing with Mahalanobis‑based ensemble pruning, explicitly encouraging expert diversity via a co‑occurrence matrix, and uses an efficient greedy algorithm with incremental Cholesky updates, achieving higher performance with minimal training overhead and no inference cost.

Dynamic Routing · Ensemble Pruning · Expert Diversity
0 likes · 8 min read
Machine Learning Algorithms & Natural Language Processing
May 21, 2026 · Artificial Intelligence

Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development

The paper introduces PACE, a Parameter‑Change based Unsupervised Environment Design method that evaluates training levels by the magnitude of induced policy‑parameter updates, offering a low‑variance, computationally cheap signal that consistently outperforms prior UED approaches on MiniGrid and Craftax benchmarks.

Craftax · Curriculum Learning · ICML 2026
0 likes · 11 min read
Machine Heart
May 21, 2026 · Artificial Intelligence

Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development

The paper introduces PACE, a Parameter Change Environment Design method that evaluates training levels by measuring induced policy parameter updates, offering a low‑variance learning‑progress signal that outperforms prior UED approaches on MiniGrid and Craftax benchmarks, achieving higher success rates and more stable generalization.

Craftax · Curriculum Learning · ICML 2026
0 likes · 10 min read
Machine Learning Algorithms & Natural Language Processing
May 14, 2026 · Artificial Intelligence

Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)

APO (Autonomous Preference Optimization) converts the drift and conflict among multiple teacher multimodal LLMs into dynamic negative constraints while treating consensus as a positive preference, enabling robust concept alignment and superior diagnostic accuracy on the CXR‑MAX benchmark, as shown by the extensive experiments in the ICML 2026 paper.

APO · ICML 2026 · Preference Optimization
0 likes · 11 min read
Machine Heart
May 13, 2026 · Artificial Intelligence

Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)

The paper introduces APO, a novel autonomous preference optimization framework that converts concept drift among multiple teacher multimodal LLMs into dynamic negative constraints and treats consensus as a positive preference, achieving robust concept alignment and surpassing strong teachers on a high‑risk medical X‑ray benchmark.

APO · CXR-MAX · ICML 2026
0 likes · 11 min read
DataFunTalk
Nov 6, 2025 · Artificial Intelligence

What New AI Policies Are Shaping ICML 2026 Submissions?

ICML 2026 opens paper submissions with strict AI usage rules—LLMs cannot be listed as authors, prompt injection is banned, and AI reviewing is expanded—while outlining submission formats, important dates, reciprocal review limits, and ethical guidelines for authors.

AI policy · ICML 2026 · conference
0 likes · 11 min read