Tagged articles

12 articles

Page 1 of 1

Jun 10, 2026 · Artificial Intelligence

LU‑KV Sets New SOTA at ICML 2026 by Redefining KV Cache Eviction

A joint effort by Baidu Baige and Fudan University introduces the LU‑KV framework, which treats KV‑cache budget allocation as a global combinatorial optimization problem, achieving only 0.52% relative performance loss at 80% compression and establishing a new efficiency‑accuracy SOTA on LongBench.

Cache EvictionICML 2026KV cache

0 likes · 5 min read

LU‑KV Sets New SOTA at ICML 2026 by Redefining KV Cache Eviction

Machine Heart

Jun 8, 2026 · Artificial Intelligence

Can Text-to-Image Models Forget Prompts? Prompt Reinjection Boosts Instruction Following Without Retraining

The paper reveals that multimodal diffusion transformers often lose fine‑grained textual semantics in deeper layers—a phenomenon called Prompt Forgetting—and introduces Prompt Reinjection, a training‑free inference technique that re‑injects shallow text features to markedly improve text‑image alignment and instruction compliance while preserving visual quality and incurring negligible computational overhead.

ICML 2026Multimodal Diffusion TransformersPrompt Forgetting

0 likes · 9 min read

Can Text-to-Image Models Forget Prompts? Prompt Reinjection Boosts Instruction Following Without Retraining

Alimama Tech

Jun 4, 2026 · Artificial Intelligence

ICML 2026 Highlights: Five Taotian Group Papers Pushing Multimodal AI Boundaries

The article showcases five ICML 2026 papers from the Taotian Group that tackle core multimodal AI challenges—interactive video try‑on, high‑resolution vision, e‑commerce video reasoning, sparse‑reward reinforcement learning, and curriculum learning for large language models—detailing their problem statements, novel solutions, and strong experimental results.

Curriculum LearningICML 2026Reinforcement Learning

0 likes · 15 min read

ICML 2026 Highlights: Five Taotian Group Papers Pushing Multimodal AI Boundaries

Data Party THU

May 31, 2026 · Artificial Intelligence

Why AI Agents Get Dumber Over Time? ICML 2026 Theory of Agent Explains

The article introduces the ICML 2026 Theory of Agent (ToA), analyzes four common failure modes of modern agents, explains the internal‑vs‑external tool trade‑off through a knowledge‑boundary framework, and outlines how effort‑conservation and the β parameter guide self‑evolving agent design and future research.

AI agentsICML 2026Self‑Evolution

0 likes · 24 min read

Why AI Agents Get Dumber Over Time? ICML 2026 Theory of Agent Explains

Machine Heart

May 31, 2026 · Artificial Intelligence

LMNet: Enabling Language Models to Self‑Organize into Networks

The paper introduces Language Model Networks (LMNet), a framework that lets pretrained large language models act as reusable compute nodes communicating via dense, trainable vectors, showing measurable performance gains on general and supervised adaptation tasks with minimal extra training cost.

ICML 2026LLM collaborationLMNet

0 likes · 10 min read

LMNet: Enabling Language Models to Self‑Organize into Networks

Machine Heart

May 22, 2026 · Artificial Intelligence

Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts

The paper presents MP‑MoE, a new Mixture‑of‑Experts architecture that replaces top‑k routing with Mahalanobis‑based ensemble pruning, explicitly encouraging expert diversity via a co‑occurrence matrix, and uses an efficient greedy algorithm with incremental Cholesky updates, achieving higher performance with minimal training overhead and no inference cost.

Dynamic RoutingEnsemble PruningExpert Diversity

0 likes · 8 min read

Breaking the Echo Chamber: MP‑MoE Introduces Ensemble‑Pruning for Diverse Experts

Machine Learning Algorithms & Natural Language Processing

May 21, 2026 · Artificial Intelligence

Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development

The paper introduces PACE, a Parameter‑Change based Unsupervised Environment Design method that evaluates training levels by the magnitude of induced policy‑parameter updates, offering a low‑variance, computationally cheap signal that consistently outperforms prior UED approaches on MiniGrid and Craftax benchmarks.

CraftaxCurriculum LearningICML 2026

0 likes · 11 min read

Breaking the UED Bottleneck: PACE Locates the Reinforcement‑Learning Zone of Proximal Development

Machine Heart

May 21, 2026 · Artificial Intelligence

Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development

The paper introduces PACE, a Parameter Change Environment Design method that evaluates training levels by measuring induced policy parameter updates, offering a low‑variance learning‑progress signal that outperforms prior UED approaches on MiniGrid and Craftax benchmarks, achieving higher success rates and more stable generalization.

CraftaxCurriculum LearningICML 2026

0 likes · 10 min read

Breaking the Traditional UED Bottleneck: Using RL to Precisely Locate the Zone of Proximal Development

Machine Learning Algorithms & Natural Language Processing

May 14, 2026 · Artificial Intelligence

Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)

APO (Autonomous Preference Optimization) converts the drift and conflict among multiple teacher multimodal LLMs into dynamic negative constraints while treating consensus as a positive preference, enabling robust concept alignment and superior diagnostic accuracy on the CXR‑MAX benchmark, as demonstrated by extensive ICML‑2026 experiments.

APOICML 2026Preference Optimization

0 likes · 11 min read

Turning Multi‑Teacher Conflict into Dynamic Constraints: Robust Reasoning Alignment for Multimodal LLMs (ICML 2026)

Machine Heart

May 13, 2026 · Artificial Intelligence

Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)

The paper introduces APO, a novel autonomous preference optimization framework that converts concept drift among multiple teacher multimodal LLMs into dynamic negative constraints and treats consensus as a positive preference, achieving robust concept alignment and surpassing strong teachers on a high‑risk medical X‑ray benchmark.

APOCXR-MAXICML 2026

0 likes · 11 min read

Turning Multi-Teacher Conflict into Dynamic Constraints for Precise Multimodal Model Alignment (ICML 2026)

Machine Learning Algorithms & Natural Language Processing

Feb 16, 2026 · Artificial Intelligence

How ICML 2026 Used Prompt Injection to Trap Automated Reviewers

Reviewers discovered hidden text in ICML 2026 PDFs that injects specific phrases into large‑language‑model generated reviews, turning an attack technique into a defense mechanism and prompting new safeguards such as watermarking and OCR‑based checks.

AI securityAcademic Peer ReviewICML 2026

0 likes · 6 min read

How ICML 2026 Used Prompt Injection to Trap Automated Reviewers

DataFunTalk

Nov 6, 2025 · Artificial Intelligence

What New AI Policies Are Shaping ICML 2026 Submissions?

ICML 2026 opens paper submissions with strict AI usage rules—LLMs cannot be listed as authors, prompt injection is banned, and AI reviewing is expanded—while outlining submission formats, important dates, reciprocal review limits, and ethical guidelines for authors.

AI policyICML 2026conference

0 likes · 11 min read

What New AI Policies Are Shaping ICML 2026 Submissions?