Tagged articles
9 articles
Page 1 of 1
Machine Heart
Machine Heart
May 8, 2026 · Artificial Intelligence

How Laser Cuts Token Use by 97% with Probabilistic Superposition for Implicit Multimodal Reasoning

Laser introduces a latent‑superposition paradigm that replaces step‑by‑step token prediction with dynamic windowed alignment, achieving over 97% token‑consumption reduction, new SOTA performance on six visual benchmarks, and improved interpretability for multimodal large models.

ACL 2026Dynamic Window AlignmentLatent Superposition
0 likes · 13 min read
How Laser Cuts Token Use by 97% with Probabilistic Superposition for Implicit Multimodal Reasoning
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 4, 2026 · Artificial Intelligence

SignThought: A New Gloss‑Free Sign Language Translation Framework for the Deaf Community

The paper introduces SignThought, a gloss‑free sign language translation model that inserts an ordered latent‑thought chain between video encoding and text generation, uses a plan‑then‑ground decoding strategy, and is evaluated on five benchmarks and a newly built 1,311‑hour LC‑HKSLT dataset, achieving state‑of‑the‑art BLEU‑4 and ROUGE scores.

ACL2026DatasetGloss-Free
0 likes · 11 min read
SignThought: A New Gloss‑Free Sign Language Translation Framework for the Deaf Community
Machine Heart
Machine Heart
May 4, 2026 · Artificial Intelligence

Thought-Based Gloss-Free Sign Language Translation Model for the Deaf (ACL 2026)

The paper introduces SignThought, a gloss‑free sign language translation framework that uses a latent chain‑of‑thought reasoning layer and a plan‑then‑ground decoder, evaluates it on five benchmarks with state‑of‑the‑art BLEU‑4 and ROUGE scores, and releases a large new Hong Kong sign language dataset.

ACL 2026Gloss-FreeLatent Thoughts
0 likes · 11 min read
Thought-Based Gloss-Free Sign Language Translation Model for the Deaf (ACL 2026)
AIWalker
AIWalker
Mar 19, 2026 · Artificial Intelligence

Vision‑R1 Multimodal Reasoning Model Delivers Human‑Level Logic and Near‑OpenAI O1 Accuracy

Vision‑R1 introduces a 7B multimodal large language model that leverages 200K unsupervised CoT data, Modality Bridging, and Progressive Thinking Suppression Training to overcome data scarcity and over‑thinking, achieving 73.5% accuracy on MathVista—within 0.4% of OpenAI’s O1.

Chain-of-ThoughtLarge Language ModelsMultimodal Reasoning
0 likes · 12 min read
Vision‑R1 Multimodal Reasoning Model Delivers Human‑Level Logic and Near‑OpenAI O1 Accuracy
AIWalker
AIWalker
Mar 12, 2026 · Artificial Intelligence

Mind-Brush: ‘Think‑Research‑Create’ Intent Reasoning for Image Generation

Mind-Brush introduces a ‘think‑research‑create’ agentic framework that unifies intent analysis, multimodal evidence retrieval, and knowledge‑driven reasoning to transform text‑to‑image generation from static decoding into an active cognitive workflow, achieving large accuracy gains on the new Mind‑Bench benchmark and surpassing existing SOTA models.

Agentic AIImage GenerationMind-Brush
0 likes · 15 min read
Mind-Brush: ‘Think‑Research‑Create’ Intent Reasoning for Image Generation
AI Engineering
AI Engineering
Feb 20, 2026 · Artificial Intelligence

Gemini 3.1 Pro Doubles Reasoning Power and Outperforms Claude Opus 4.6

Google's Gemini 3.1 Pro achieves a 77.1% ARC‑AGI‑2 score—more than double its predecessor—leads in multiple benchmark categories, cuts inference cost by half compared to top rivals, and demonstrates advanced multimodal and programming capabilities through real‑world demos.

AI benchmarksARC-AGI-2Claude Opus 4.6
0 likes · 9 min read
Gemini 3.1 Pro Doubles Reasoning Power and Outperforms Claude Opus 4.6
Data Party THU
Data Party THU
Oct 8, 2025 · Artificial Intelligence

Why Reinforcement Learning Unlocks Hierarchical Reasoning in LLMs: The HICRA Breakthrough

The article explains how reinforcement learning induces a hierarchical learning dynamic in large language models, introduces the HICRA training paradigm that concentrates gradient updates on planning tokens, and shows through extensive text and multimodal benchmarks that this approach consistently yields earlier Aha moments and superior reasoning performance.

HICRAHierarchical ReasoningMultimodal Reasoning
0 likes · 10 min read
Why Reinforcement Learning Unlocks Hierarchical Reasoning in LLMs: The HICRA Breakthrough
DataFunSummit
DataFunSummit
Dec 6, 2022 · Artificial Intelligence

Multimodal Reasoning, Logic Inference, and Machine Learning: An Integrated Survey

This article surveys the development of artificial intelligence from symbolic and connectionist perspectives, covering deductive and inductive reasoning, multimodal and cross‑modal inference, knowledge‑graph reasoning, text and visual understanding, and their applications in causal inference, dialogue consistency, and security vulnerability analysis.

Knowledge GraphsMultimodal Reasoningcausal inference
0 likes · 18 min read
Multimodal Reasoning, Logic Inference, and Machine Learning: An Integrated Survey