Tagged articles

Chain-of-Thought

106 articles · Page 1 of 2

Jul 3, 2026 · Artificial Intelligence

What Fable 5’s Leaked Chain‑of‑Thought Reveals About AI’s “Neuralese”

A Reddit post exposing Claude Fable 5’s raw chain‑of‑thought shows garbled internal monologue, sparking debate about illegible reasoning, its prevalence across large language models, and the safety implications of AI potentially developing a private, unreadable language.

AI safetyChain-of-ThoughtClaude

0 likes · 10 min read

What Fable 5’s Leaked Chain‑of‑Thought Reveals About AI’s “Neuralese”

Data Party THU

Jun 29, 2026 · Artificial Intelligence

Mapping LLM Reasoning: Paradigms, Methods, and Failure Modes in a Periodic Table

This 103‑page survey of over 300 recent papers organizes large language model reasoning into a periodic‑table framework, explains where reasoning emerges, categorizes 36 method families across six dimensions, critiques accuracy‑only evaluation, and outlines key open challenges such as fidelity, robustness, calibration, generalization, efficiency, and safety.

AI safetyChain-of-ThoughtEvaluation

0 likes · 13 min read

Mapping LLM Reasoning: Paradigms, Methods, and Failure Modes in a Periodic Table

Machine Heart

Jun 24, 2026 · Artificial Intelligence

How APEIRIA Breaks the Black‑Box Barrier of 3D MLLMs (ICML 2026)

The paper introduces APEIRIA, a three‑stage curriculum that distills neuro‑symbolic program traces into 3D multi‑modal LLMs, enabling transparent spatial reasoning while preserving open‑vocabulary understanding, and demonstrates strong benchmark gains, modular upgrades, and zero‑shot generalization.

3D MLLMChain-of-ThoughtModular AI

0 likes · 11 min read

How APEIRIA Breaks the Black‑Box Barrier of 3D MLLMs (ICML 2026)

Machine Learning Algorithms & Natural Language Processing

Jun 22, 2026 · Artificial Intelligence

Why Large Language Models Need Not Run CoT on Every Question: Tencent Hunyuan’s On‑Demand CoT Trigger

The paper analyzes the efficiency and reward‑signal shortcomings of conventional generative reward models (GRM) and presents the E‑GRM framework, which uses model‑internal uncertainty to dynamically trigger chain‑of‑thought reasoning, employs a consensus‑based routing decision and a mixed‑loss discriminative scorer, achieving significant speed‑up and accuracy gains on benchmarks such as MATH, RM‑Bench and RewardBench.

Chain-of-ThoughtDynamic RoutingEfficiency

0 likes · 15 min read

Why Large Language Models Need Not Run CoT on Every Question: Tencent Hunyuan’s On‑Demand CoT Trigger

Machine Heart

Jun 17, 2026 · Artificial Intelligence

Why Transformers Struggle with State Tracking and How Recurrence Could Fix It

The DeepMind paper “The Topological Trouble With Transformers” reveals that the Transformer architecture inherently fails at state tracking, making chain‑of‑thought prompting only a costly patch, and proposes returning to recurrent mechanisms—such as looped or sequence‑wise recurrence—to achieve true, continuous memory.

AI researchChain-of-ThoughtDeepMind

0 likes · 9 min read

Why Transformers Struggle with State Tracking and How Recurrence Could Fix It

Machine Learning Algorithms & Natural Language Processing

Jun 17, 2026 · Artificial Intelligence

Is More Chain‑of‑Thought Always Better? Introducing E‑GRM for On‑Demand LLM Reasoning

The article critically examines the assumption that longer chain‑of‑thought reasoning always improves large language model performance, presents the E‑GRM framework that dynamically decides when to invoke full CoT based on model‑internal uncertainty, and validates its efficiency and accuracy gains through extensive experiments and ablations.

Ablation StudyChain-of-ThoughtDynamic Routing

0 likes · 16 min read

Is More Chain‑of‑Thought Always Better? Introducing E‑GRM for On‑Demand LLM Reasoning

AI Engineer Programming

Jun 13, 2026 · Artificial Intelligence

8 Prompt Templates to Structure AI Reasoning, Review, and Creative Output

These eight prompt templates guide AI through chain-of-thought reasoning, self-review iteration, role-and-constraint framing, parallel solution generation, code-performance analysis, multi-style title creation, meta-prompt nesting, and Socratic questioning, helping users craft structured, reliable, and creative interactions.

AI interactionChain-of-ThoughtMeta-Prompt

0 likes · 5 min read

8 Prompt Templates to Structure AI Reasoning, Review, and Creative Output

Machine Learning Algorithms & Natural Language Processing

Jun 10, 2026 · Artificial Intelligence

OneReason: Enabling Recommendation Systems to Reason

OneReason introduces a systematic reasoning capability into industrial recommendation models through multi‑stage pre‑training, chain‑of‑thought fine‑tuning, and reinforcement learning, achieving significant gains in click‑through, revenue, and cross‑domain recommendation performance while preserving the underlying language abilities of the base model.

Chain-of-ThoughtRecommendation Systemsindustrial AI

0 likes · 29 min read

OneReason: Enabling Recommendation Systems to Reason

Machine Heart

Jun 7, 2026 · Artificial Intelligence

Can AI Learn Mental Math? Implicit Chain‑of‑Thought Proven Theoretically (Stuart Russell)

The article reviews a new UC Berkeley and Princeton study that mathematically proves the feasibility of Implicit Chain‑of‑Thought (ICoT), showing how a tree‑structured training curriculum lets Transformers internalize reasoning steps, dramatically reducing token cost and training stages while achieving 100 % accuracy on the k‑parity task.

Chain-of-ThoughtImplicit ReasoningTheoretical Proof

0 likes · 11 min read

Can AI Learn Mental Math? Implicit Chain‑of‑Thought Proven Theoretically (Stuart Russell)

DeepHub IMBA

Jun 1, 2026 · Artificial Intelligence

The Essence of Prompt Engineering: Roles, Tasks, Context, Format, and Constraints

Prompt engineering designs inputs for large language models by combining clear intent, relevant context, explicit format, and constraints, turning ambiguous queries into reliable, high‑quality outputs through a structured, iterative process illustrated with concrete examples and advanced techniques.

AI communicationChain-of-ThoughtLLM reliability

0 likes · 23 min read

The Essence of Prompt Engineering: Roles, Tasks, Context, Format, and Constraints

Machine Heart

May 30, 2026 · Artificial Intelligence

How Abstract Symbols Cut AI Inference Cost by 11×

The article examines IBM Research's Abstract‑CoT approach, which replaces verbose natural‑language chain‑of‑thought reasoning with a compact abstract token vocabulary, achieving up to an 11‑fold reduction in inference tokens while maintaining comparable accuracy across math, instruction‑following, and multi‑hop QA benchmarks.

AI inferenceAbstract-CoTChain-of-Thought

0 likes · 11 min read

How Abstract Symbols Cut AI Inference Cost by 11×

Weekly Large Model Application

May 29, 2026 · Artificial Intelligence

From Direct Transcription to Reasoning ASR and Parallel Decoding: CoT‑ASR vs Whisfusion

ASR is shifting from direct verbatim transcription to two new paradigms—Chain‑of‑Thought reasoning (CoT‑ASR) that cuts WER and entity error rates, and diffusion‑based parallel decoding (Whisfusion) that slashes latency by over eight times—offering complementary routes for smarter, faster speech recognition.

ASRChain-of-ThoughtCoT-ASR

0 likes · 12 min read

From Direct Transcription to Reasoning ASR and Parallel Decoding: CoT‑ASR vs Whisfusion

James' Growth Diary

May 28, 2026 · Artificial Intelligence

Mastering Prompt Engineering: Few‑Shot, Chain‑of‑Thought, and Self‑Consistency Techniques

This article breaks down three core prompt‑engineering techniques—Few‑Shot prompting for output format stability, Chain‑of‑Thought for multi‑step reasoning, and Self‑Consistency for answer robustness—showing when to use each, how to combine them in LangChain, and providing concrete code examples, performance data, and common pitfalls.

Chain-of-ThoughtDynamic RoutingFew-shot

0 likes · 30 min read

Mastering Prompt Engineering: Few‑Shot, Chain‑of‑Thought, and Self‑Consistency Techniques

AI Engineering

May 24, 2026 · Artificial Intelligence

Build a Local AI Agent from Scratch: A Deep‑Dive, Non‑Fast‑Food Tutorial

This tutorial walks you through the open‑source “AI Agents From Scratch” project, teaching how to build a fully local AI agent without any pre‑made framework by covering core modules, 14 step‑by‑step examples, advanced reasoning architectures, and minimal system requirements.

AI AgentChain-of-ThoughtPrompt engineering

0 likes · 6 min read

Build a Local AI Agent from Scratch: A Deep‑Dive, Non‑Fast‑Food Tutorial

Machine Learning Algorithms & Natural Language Processing

May 20, 2026 · Artificial Intelligence

How 800 Data Points Halve LLM Chain‑of‑Thought Length and Boost Accuracy

The ICLR‑2026 paper introduces LCPO, a lightweight preference‑optimization technique that uses only 800 curated examples and 50 training steps to cut large‑model chain‑of‑thought generation length by about 50% while maintaining or even improving answer accuracy, dramatically reducing training and inference costs.

Chain-of-ThoughtEfficient InferenceLCPO

0 likes · 8 min read

How 800 Data Points Halve LLM Chain‑of‑Thought Length and Boost Accuracy

Machine Heart

May 18, 2026 · Artificial Intelligence

Can Large Models Reason Deeply with Only a Few Thinking Tokens?

The paper introduces Heima, a framework that compresses chain‑of‑thought reasoning into a small set of abstract “thinking tokens” for multimodal large models, dramatically reducing generated tokens while preserving inference capability, and provides an adaptive interpreter to reconstruct human‑readable reasoning for analysis.

Chain-of-ThoughtEfficient InferenceMultimodal AI

0 likes · 12 min read

Can Large Models Reason Deeply with Only a Few Thinking Tokens?

Lao Guo's Learning Space

May 12, 2026 · Artificial Intelligence

Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek

This article breaks down the key algorithms that power large‑language models—Transformer, Mixture‑of‑Experts, Flash Attention, KV‑Cache, Multi‑Token Prediction, quantization, Chain‑of‑Thought and Retrieval‑Augmented Generation—explaining how each contributes to the performance of ChatGPT, GPT‑4 and DeepSeek.

Chain-of-ThoughtFlash AttentionKV cache

0 likes · 10 min read

Demystifying the Core Technologies Behind ChatGPT, GPT‑4, and DeepSeek

DataFunTalk

May 10, 2026 · Artificial Intelligence

DeepSeek vs MCTS: Decoding the ‘Chicken & Liquor’ Dilemma in LLM Training

The article analyzes why DeepSeek’s large‑model training struggles with Monte‑Carlo Tree Search, explains its use of Chain‑of‑Thought prompting, GRPO entropy‑boosting and rejection‑sampling fine‑tuning, compares these methods with Google’s OmegaPRM and PRM approaches, and proposes a concrete MCTS‑driven data‑generation pipeline to overcome the “chicken and liquor” trade‑off.

Chain-of-ThoughtDeepSeekGRPO

0 likes · 14 min read

DeepSeek vs MCTS: Decoding the ‘Chicken & Liquor’ Dilemma in LLM Training

DataFunSummit

May 4, 2026 · Artificial Intelligence

DeepSeek’s MCTS Failure: The ‘Roast Chicken and Baijiu’ Dilemma in LLM Training

The article examines why DeepSeek’s large‑model training cannot yet leverage Monte‑Carlo Tree Search, detailing its reliance on SFT, GRPO‑driven CoT activation and rejection‑sampling, contrasting this with Google’s PRM‑based approaches, and proposing a MCTS‑powered data‑generation pipeline to overcome the “roast chicken and baijiu” training dilemma.

Chain-of-ThoughtGRPOMonte Carlo Tree Search

0 likes · 14 min read

DeepSeek’s MCTS Failure: The ‘Roast Chicken and Baijiu’ Dilemma in LLM Training

Old Zhang's AI Learning

May 4, 2026 · Artificial Intelligence

How DeepSeek’s New Paper Redefines Multimodal Reasoning with Visual Primitives

DeepSeek’s new paper "Thinking with Visual Primitives" tackles the reference gap in multimodal models by introducing points and boxes as reasoning units, achieving up to 8× token efficiency and leading benchmark scores in counting, spatial reasoning, and maze navigation compared with GPT‑5.4, Claude‑Sonnet‑4.6 and Gemini‑3‑Flash.

Chain-of-ThoughtDeepSeekMultimodal

0 likes · 10 min read

How DeepSeek’s New Paper Redefines Multimodal Reasoning with Visual Primitives

Machine Learning Algorithms & Natural Language Processing

Apr 25, 2026 · Artificial Intelligence

How Anthropic and OpenAI Monitor Frontier AI Agent Behavior – A Comprehensive Review

This article systematically reviews Anthropic and OpenAI’s public research on monitoring intelligent agent trajectories, covering infrastructure such as Clio, Petri, Bloom, chain‑of‑thought monitoring, the Confessions mechanism, internal coding‑agent audits, and the Docent tool, while highlighting mitigation strategies for reward hacking and hidden objectives.

AI alignmentAnthropicChain-of-Thought

0 likes · 40 min read

How Anthropic and OpenAI Monitor Frontier AI Agent Behavior – A Comprehensive Review

JavaGuide

Apr 14, 2026 · Artificial Intelligence

Interview Question: How to Build Prompt Engineering for an Agent and Defend Against Malicious Prompt Injection

The article explains how industrial‑grade AI agents require structured prompt engineering, chain‑of‑thought reasoning, task decomposition, and a three‑layer defense (sandbox, prompt isolation, and human approval) to prevent prompt‑injection attacks, while also covering context engineering, retrieval‑augmented generation, and tool design best practices.

Agent DesignChain-of-ThoughtLLM security

0 likes · 23 min read

Interview Question: How to Build Prompt Engineering for an Agent and Defend Against Malicious Prompt Injection

Big Data and Microservices

Apr 12, 2026 · Artificial Intelligence

Master Structured Prompt Engineering: From Simple Commands to Powerful AI Agents

This article explains how vague AI queries lead to generic answers and shows how structured prompt engineering—using clear roles, goals, constraints, and frameworks like RTF and BROKE—can turn ambiguous business needs into precise, high‑quality AI outputs, including advanced chain‑of‑thought and few‑shot techniques for agents.

AIAgentChain-of-Thought

0 likes · 10 min read

Master Structured Prompt Engineering: From Simple Commands to Powerful AI Agents

Smart Workplace Lab

Apr 2, 2026 · Artificial Intelligence

Master Reverse Prompt Debugging: Turn AI into Your Red‑Team Tester

Learn how to apply reverse debugging to AI prompts by letting the model act as an attacker, uncover hidden logical flaws, and use chain‑of‑thought logs to refine your instructions before they reach production, reducing costly errors and improving reliability.

AI promptingChain-of-ThoughtPrompt engineering

0 likes · 3 min read

Master Reverse Prompt Debugging: Turn AI into Your Red‑Team Tester

AI Large-Model Wave and Transformation Guide

Mar 28, 2026 · Artificial Intelligence

How a 17‑Year‑Old Prompt Turned Claude 3.5 into a Free O1‑Level AI

A teenage prodigy engineered a "Thinking Claude" prompt that adds a human‑like chain‑of‑thought protocol to Claude 3.5, enabling free O1‑level reasoning and producing impressive outputs such as a functional calculator, sci‑fi story, and playable games, while the article details the prompt’s design process and usage.

AI reasoningChain-of-ThoughtClaude 3.5

0 likes · 8 min read

How a 17‑Year‑Old Prompt Turned Claude 3.5 into a Free O1‑Level AI

DeepHub IMBA

Mar 27, 2026 · Artificial Intelligence

AI Agent Architecture: Chain‑of‑Thought, ReAct, and Tool Calls

From a simple black‑box view where an agent receives a user request and returns an answer, the article breaks down modern AI agent designs—detailing the pure Chain‑of‑Thought reasoning loop, the ReAct reasoning‑acting cycle, tool integration, iteration tuning, and how to choose the optimal architecture for production.

AI agentsChain-of-ThoughtLLM architecture

0 likes · 9 min read

AI Agent Architecture: Chain‑of‑Thought, ReAct, and Tool Calls

AIWalker

Mar 19, 2026 · Artificial Intelligence

Vision‑R1 Multimodal Reasoning Model Delivers Human‑Level Logic and Near‑OpenAI O1 Accuracy

Vision‑R1 introduces a 7B multimodal large language model that leverages 200K unsupervised CoT data, Modality Bridging, and Progressive Thinking Suppression Training to overcome data scarcity and over‑thinking, achieving 73.5% accuracy on MathVista—within 0.4% of OpenAI’s O1.

Chain-of-ThoughtMultimodal Reasoningbenchmark performance

0 likes · 12 min read

Vision‑R1 Multimodal Reasoning Model Delivers Human‑Level Logic and Near‑OpenAI O1 Accuracy

Machine Learning Algorithms & Natural Language Processing

Mar 18, 2026 · Artificial Intelligence

Breaking the ‘See‑then‑Think’ Barrier: Real‑Time ‘See‑and‑Think’ for VLMs (CVPR 2026)

The paper introduces TaYS (Think‑as‑You‑See), a streaming chain‑of‑thought framework that replaces the traditional “watch‑then‑think” video inference pipeline with a parallel, real‑time “watch‑and‑think” approach, dramatically reducing latency and improving accuracy on complex video reasoning tasks.

Chain-of-ThoughtDual KV-CacheStreaming Inference

0 likes · 8 min read

Breaking the ‘See‑then‑Think’ Barrier: Real‑Time ‘See‑and‑Think’ for VLMs (CVPR 2026)

AI Cyberspace

Mar 10, 2026 · Artificial Intelligence

Mastering Prompt Engineering: Techniques to Guide LLMs Effectively

This article explains the fundamentals of prompt engineering for large language models, covering LLM output configuration, length and sampling controls, various prompt types, chain‑of‑thought and tree‑of‑thought reasoning methods, and practical best‑practice guidelines for creating high‑quality prompts.

AI Prompt DesignChain-of-ThoughtLLM

0 likes · 18 min read

Mastering Prompt Engineering: Techniques to Guide LLMs Effectively

AIWalker

Mar 5, 2026 · Artificial Intelligence

How ViDA-UGC Leverages Large Multimodal Models for Fine-Grained Visual Quality Assessment

The article introduces ViDA-UGC, a large‑scale UGC visual‑quality dataset and its companion benchmark ViDA‑Bench, explains the MILP‑driven sampling, expert annotation pipeline, and CoT‑based evaluation framework, and shows how fine‑tuning popular multimodal LLMs on this data markedly improves low‑level quality perception, grounding, and description capabilities.

Chain-of-Thoughtbenchmarkdataset

0 likes · 12 min read

How ViDA-UGC Leverages Large Multimodal Models for Fine-Grained Visual Quality Assessment

SuanNi

Feb 27, 2026 · Artificial Intelligence

Can Deep Thought Ratio Reveal the True Reasoning Power of LLMs?

This article introduces the Deep Thought Ratio (DTR) metric, explains how tracking token modifications across neural network layers quantifies genuine inference effort, and shows through extensive experiments that DTR predicts accuracy far better than token length while enabling a sampling strategy that halves computational cost.

AI metricsChain-of-ThoughtLLM evaluation

0 likes · 9 min read

Can Deep Thought Ratio Reveal the True Reasoning Power of LLMs?

Tencent Technical Engineering

Jan 30, 2026 · Artificial Intelligence

Can Rendering Thought Chains as Images Speed Up LLM Reasoning?

This article introduces Render‑of‑Thought (RoT), a novel paradigm that compresses chain‑of‑thought reasoning into visual embeddings using frozen vision encoders, achieving 3‑4× token reduction, faster inference, and improved interpretability while requiring minimal pre‑training.

Chain-of-ThoughtInference OptimizationMultimodal

0 likes · 12 min read

Can Rendering Thought Chains as Images Speed Up LLM Reasoning?

Kuaishou Tech

Jan 28, 2026 · Artificial Intelligence

BLM‑Guard: Explainable Multimodal Ad Moderation Using Chain‑of‑Thought and Policy‑Aligned RL

The paper introduces BLM‑Guard, an explainable multimodal ad‑moderation framework that combines interleaved‑modal chain‑of‑thought reasoning with a policy‑aligned reinforcement‑learning reward to detect hidden cross‑modal violations in short‑video ads, and presents a new benchmark that demonstrates state‑of‑the‑art performance across multiple risk scenarios.

Chain-of-Thoughtad risk detectionbenchmark

0 likes · 12 min read

BLM‑Guard: Explainable Multimodal Ad Moderation Using Chain‑of‑Thought and Policy‑Aligned RL

PaperAgent

Dec 19, 2025 · Artificial Intelligence

Can We Trust AI? Inside GPT‑5.2‑Codex’s Monitorability Breakthrough

OpenAI’s new GPT‑5.2‑Codex model achieves state‑of‑the‑art performance on SWE‑Bench Pro and Terminal‑Bench 2.0, and a 90‑page technical report introduces the concept of monitorability, defining metrics, benchmark suites, and key findings about chain‑of‑thought length, RL training, and model size.

AI safetyChain-of-ThoughtGPT-5.2

0 likes · 10 min read

Can We Trust AI? Inside GPT‑5.2‑Codex’s Monitorability Breakthrough

Bilibili Tech

Dec 19, 2025 · Artificial Intelligence

SABER: Switchable and Balanced Training for Efficient LLM Reasoning

SABER introduces a reinforcement‑learning framework that lets large language models dynamically switch among four token‑budgeted reasoning modes, dramatically cutting inference length while preserving or improving accuracy across math, code, and logic tasks.

Budgeted ComputationChain-of-ThoughtEfficient Reasoning

0 likes · 13 min read

SABER: Switchable and Balanced Training for Efficient LLM Reasoning

KooFE Frontend Team

Dec 13, 2025 · Artificial Intelligence

Unlocking LLM Reasoning: Advanced Chain‑of‑Thought Prompting Techniques Explained

This article explains how Chain‑of‑Thought prompting and its variants—zero‑shot CoT, Thread of Thought, Tabular CoT, Analogical Prompting, and Step‑back Prompting—enable large language models to perform multi‑step reasoning by breaking problems into intermediate steps, with practical prompts, examples, and implementation details.

Chain-of-Thoughtreasoningzero-shot learning

0 likes · 12 min read

Unlocking LLM Reasoning: Advanced Chain‑of‑Thought Prompting Techniques Explained

Frontend AI Walk

Dec 5, 2025 · Artificial Intelligence

Master Prompt Engineering: From Random Chat to Precise Control with Zero-shot, Few-shot, and Chain‑of‑Thought

This article explains how to converse effectively with large language models by mastering three core prompting techniques—Zero‑shot, Few‑shot, and Chain‑of‑Thought—illustrated with front‑end analogies, code snippets, and a step‑by‑step DeepSeek JSON‑generation exercise that shows common pitfalls and best practices.

Chain-of-ThoughtDeepSeekFew-shot

0 likes · 12 min read

Master Prompt Engineering: From Random Chat to Precise Control with Zero-shot, Few-shot, and Chain‑of‑Thought

Alimama Tech

Dec 3, 2025 · Artificial Intelligence

How LORE Transforms E‑Commerce Search Relevance with Generative AI

The article details the development and deployment of LORE, a large generative model that reshapes e‑commerce search relevance by combining knowledge injection, chain‑of‑thought reasoning, and multimodal alignment, achieving simultaneous improvements in user experience and revenue metrics.

Chain-of-ThoughtMultimodale-commerce

0 likes · 15 min read

How LORE Transforms E‑Commerce Search Relevance with Generative AI

Tencent Technical Engineering

Dec 1, 2025 · Artificial Intelligence

Do Machines Really Think? Inside Deep Reasoning, Scaling Laws & RLHF for LLMs

This article examines whether large language models truly think, explores the origins of deep reasoning through transformer architectures and scaling laws, reviews chain‑of‑thought and its variants, and analyzes how reinforcement learning from human feedback—including PPO, DPO, and GRPO—helps internalise step‑by‑step reasoning while pointing to future directions such as atomic thought, hierarchical models, and training‑free in‑context knowledge bases.

AI alignmentChain-of-ThoughtLLM

0 likes · 35 min read

Do Machines Really Think? Inside Deep Reasoning, Scaling Laws & RLHF for LLMs

AI Tech Publishing

Nov 13, 2025 · Artificial Intelligence

Claude’s Prompt Engineering Best Practices: A Step‑by‑Step Guide

This guide outlines Claude team’s best practices for prompt engineering, covering core techniques such as clear instructions, background context, specificity, examples, and advanced methods like pre‑filled responses, chain‑of‑thought, output formatting, and prompt chaining, with concrete examples and code snippets.

AI promptingChain-of-ThoughtClaude

0 likes · 18 min read

Claude’s Prompt Engineering Best Practices: A Step‑by‑Step Guide

Amap Tech

Oct 7, 2025 · Artificial Intelligence

Farsighted-LAM & SSM-VLA: Boosting Spatial‑Temporal Reasoning for Embodied AI

Introducing Farsighted-LAM, a novel latent action model that integrates geometric perception and multi‑scale temporal modeling, and its end‑to‑end SSM‑VLA framework with a Chain‑of‑Thought reasoning module, the authors demonstrate markedly improved spatial‑temporal fidelity, interpretability, and state‑of‑the‑art performance on challenging VLA benchmarks.

Chain-of-ThoughtEmbodied AIlatent action models

0 likes · 11 min read

Farsighted-LAM & SSM-VLA: Boosting Spatial‑Temporal Reasoning for Embodied AI

Tencent Technical Engineering

Sep 12, 2025 · Artificial Intelligence

A Structured Prompt Engineering Guide to Make LLMs Obey

Learn how to craft effective prompts for large language models by using a systematic structure—role and task, core principles, context handling, chain‑of‑thought, output specifications, and few‑shot examples—and discover techniques for generating and iteratively refining prompts with the model itself.

AI promptingChain-of-Thoughtcontext handling

0 likes · 10 min read

A Structured Prompt Engineering Guide to Make LLMs Obey

Data Party THU

Sep 9, 2025 · Artificial Intelligence

From Chain‑of‑Thought to Graph‑of‑Thought: The Evolution of LLM Reasoning

This article examines how large language model reasoning has progressed from linear Chain‑of‑Thought prompting to parallel Tree‑of‑Thought and flexible Graph‑of‑Thought approaches, highlighting each method’s mechanism, strengths, limitations, computational costs, and the broader shift toward cognitive‑centric AI research.

AI researchChain-of-ThoughtGraph-of-Thought

0 likes · 7 min read

From Chain‑of‑Thought to Graph‑of‑Thought: The Evolution of LLM Reasoning

Data Party THU

Sep 1, 2025 · Artificial Intelligence

Why Intermediate Tokens Make LLMs Reason Better: Insights from Denny Zhou

The article analyzes Denny Zhou's Stanford CS25 lecture on large language model reasoning, explaining how intermediate token generation, chain‑of‑thought prompting, self‑consistency, reinforcement‑learning fine‑tuning, and answer aggregation together unlock powerful reasoning capabilities beyond traditional greedy decoding.

AI researchChain-of-ThoughtLLM

0 likes · 17 min read

Why Intermediate Tokens Make LLMs Reason Better: Insights from Denny Zhou

AI2ML AI to Machine Learning

Aug 25, 2025 · Artificial Intelligence

Decoding OpenAI’s Multi‑Level AGI Roadmap

The article analyzes OpenAI’s five‑layer AGI roadmap, compares it with DeepMind’s ECEVS framework, and examines the technical progress from L1 to L5—including RL‑enhanced chain‑of‑thought, ReAct agents, deep research, and upcoming innovations—while highlighting the commercial implications of each stage.

AGIChain-of-ThoughtDeepMind

0 likes · 7 min read

Decoding OpenAI’s Multi‑Level AGI Roadmap

Data Thinking Notes

Aug 21, 2025 · Artificial Intelligence

Why Intermediate Tokens Matter: Denny Zhou’s Deep Insights into LLM Reasoning

This article distills Denny Zhou’s Stanford CS25 lecture, explaining how large language models achieve reasoning through intermediate token generation, chain‑of‑thought prompting, self‑consistency, reinforcement‑learning fine‑tuning, and answer aggregation, while highlighting theoretical foundations and practical breakthroughs.

Chain-of-ThoughtLLMreasoning

0 likes · 18 min read

Why Intermediate Tokens Matter: Denny Zhou’s Deep Insights into LLM Reasoning

Alibaba Cloud Developer

Aug 18, 2025 · Artificial Intelligence

Mastering Claude Prompt Engineering: 9 Proven Strategies to Boost LLM Performance

This guide systematically breaks down Anthropic's official prompt‑engineering recommendations—clear instructions, multishot examples, chain‑of‑thought prompting, XML structuring, response pre‑filling, prompt chaining, long‑context handling, extended thinking, and practical code snippets—showing how to unlock Claude's full potential across complex tasks.

AIChain-of-ThoughtClaude

0 likes · 15 min read

Mastering Claude Prompt Engineering: 9 Proven Strategies to Boost LLM Performance

DaTaobao Tech

Aug 13, 2025 · Artificial Intelligence

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

This article explores the emerging field of prompt engineering, detailing its fundamentals, advanced strategies such as chain‑of‑thought, ReAct, and structured frameworks, and demonstrates practical applications in AI agents for data retrieval, SQL generation, and market insight, offering actionable guidance for developers and business users alike.

AI agentsChain-of-ThoughtData Retrieval

0 likes · 42 min read

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

Data Party THU

Aug 12, 2025 · Artificial Intelligence

Unlocking Chain-of-Thought: How AI Reasoning Boosts Accuracy Across Domains

Chain‑of‑Thought (CoT) enables large language models to solve complex tasks by breaking problems into sequential reasoning steps, improving accuracy in mathematics, commonsense, code generation, business strategy, and medical diagnosis, while highlighting its principles, advantages, challenges, and future prospects.

Chain-of-ThoughtLLMPrompt Design

0 likes · 13 min read

Unlocking Chain-of-Thought: How AI Reasoning Boosts Accuracy Across Domains

JD Retail Technology

Jul 21, 2025 · Artificial Intelligence

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

This article presents a comprehensive approach that combines causal inference, large language models, and retrieval‑augmented generation to automate e‑commerce price recommendation, detailing the three‑step workflow, challenges across product categories, the RAG architecture, process‑reward‑guided tree search, reinforcement learning refinements, and experimental results showing significant accuracy and speed improvements.

Chain-of-Thoughtcausal inferencee‑commerce pricing

0 likes · 16 min read

How Causal Inference Meets Large Language Models to Revolutionize E‑commerce Pricing

Kuaishou Tech

Jul 16, 2025 · Artificial Intelligence

How KuaiMM Conversation Revolutionizes Multimodal Dialogue on Short‑Video Platforms

The KuaiMM Conversation project introduces a multimodal large‑model‑driven dialogue system for Kuaishou, featuring the world‑first short‑video mixed‑dialogue dataset, a Chain‑of‑Thought interaction framework, and large‑scale industrial deployments that dramatically improve live‑stream comments and intelligent customer service.

Chain-of-ThoughtKuaishouconversation AI

0 likes · 11 min read

How KuaiMM Conversation Revolutionizes Multimodal Dialogue on Short‑Video Platforms

Alimama Tech

Jul 9, 2025 · Artificial Intelligence

How to Make LLMs Recognize and Resolve Their Own Uncertainty

This article introduces ConfuseBench, a benchmark that classifies LLM uncertainty into document‑missing, ability‑limited, and ambiguous types, and presents methods—including retrieval, chain‑of‑thought, and clarification—to detect and actively resolve uncertainty, improving answer quality across diverse tasks.

Chain-of-ThoughtClarificationInquiry

0 likes · 17 min read

How to Make LLMs Recognize and Resolve Their Own Uncertainty

Alibaba Cloud Big Data AI Platform

Jun 30, 2025 · Artificial Intelligence

Unlocking Small LLM Power: Variable‑Length Chain Distillation with DistillQwen‑ThoughtY

This article introduces a variable‑length chain‑of‑thought distillation technique built on Alibaba Cloud PAI’s EasyDistill toolkit, presents the high‑quality OmniThought‑0528 dataset, details the training of the DistillQwen‑ThoughtY 4B/8B/32B models, and provides code and usage examples for researchers and practitioners.

Chain-of-ThoughtDistillationLLM

0 likes · 15 min read

Unlocking Small LLM Power: Variable‑Length Chain Distillation with DistillQwen‑ThoughtY

Xiaohongshu Tech REDtech

Jun 19, 2025 · Artificial Intelligence

Can Adaptive Chain‑of‑Thought Learning Halve LLM Thinking Time?

The article introduces the Think When You Need (TWYN) method, a reinforcement‑learning approach that dynamically adapts chain‑of‑thought length, dramatically cuts redundant token generation in large language models, and maintains or improves accuracy across diverse reasoning benchmarks.

Chain-of-ThoughtEfficiencyadaptive inference

0 likes · 9 min read

Can Adaptive Chain‑of‑Thought Learning Halve LLM Thinking Time?

Fighter's World

Jun 14, 2025 · Artificial Intelligence

How Can LLMs Learn to “Think” in Complex Industry Scenarios?

The article analyzes how large language models can acquire true reasoning abilities for hard‑to‑score industry tasks by combining Chain‑of‑Thought prompting with reinforcement learning, addressing vague reward signals, reward hacking, and loyalty, and proposing a toolbox of reward engineering, synthetic data, hierarchical RL and multi‑agent collaboration.

Chain-of-ThoughtLLMMulti-Agent Systems

0 likes · 22 min read

How Can LLMs Learn to “Think” in Complex Industry Scenarios?

Code Mala Tang

Jun 5, 2025 · Artificial Intelligence

Mastering LLM Prompts: Proven Techniques to Get Precise Answers

By rethinking how we interact with large language models—using role‑play, task decomposition, chain‑of‑thought, ReAct, and other advanced prompting strategies—readers can transform generic ChatGPT answers into precise, context‑aware responses, leveraging pattern recognition and context windows for superior AI assistance.

AI reasoningChain-of-ThoughtLLM techniques

0 likes · 21 min read

Mastering LLM Prompts: Proven Techniques to Get Precise Answers

AI Frontier Lectures

May 30, 2025 · Artificial Intelligence

Can Diffusion Chains Unlock More Creative Reasoning in Large Language Models?

Recent work from West Lake University's MAPLE Lab introduces a diffusion‑based “Divergent Thought Chain” that treats each intermediate denoising step of a diffusion language model as a reasoning step, using result‑based reinforcement learning to optimize non‑linear token generation and achieving state‑of‑the‑art performance on math and code tasks.

Chain-of-Thoughtcode generationdiffusion language models

0 likes · 14 min read

Can Diffusion Chains Unlock More Creative Reasoning in Large Language Models?

Efficient Ops

May 29, 2025 · Artificial Intelligence

DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3

DeepSeek quietly launched the R1 0528 model, which early testers report matches OpenAI’s o3 in benchmarks and style, while adding deeper chain‑of‑thought reasoning, better writing output, and extended thinking windows, and the announcement is followed by a promotion for the GOPS Global Ops Conference.

AI performanceChain-of-ThoughtDeepSeek

0 likes · 3 min read

DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3

Baobao Algorithm Notes

May 26, 2025 · Artificial Intelligence

Why Do Reasoning LLMs Lose Instruction-Following Ability? A Deep Dive into Recent Findings

This article compares two recent papers that investigate why large reasoning models such as Llama and Qwen show degraded instruction‑following performance when using chain‑of‑thought prompting, analyzing attention patterns, training effects, and proposed mitigation strategies.

Chain-of-ThoughtEvaluationLLM

0 likes · 11 min read

Why Do Reasoning LLMs Lose Instruction-Following Ability? A Deep Dive into Recent Findings

AI Frontier Lectures

May 25, 2025 · Artificial Intelligence

Can Alternating Generation‑Reduction Make LLMs Think Faster? Introducing PENCIL

The paper presents PENCIL, a novel alternating generation‑and‑erasure reasoning paradigm that achieves optimal space‑time complexity for chain‑of‑thought tasks, dramatically improves accuracy and efficiency on hard SAT, QBF, and Einstein puzzle benchmarks, and is provably Turing‑complete.

Chain-of-ThoughtPencilbenchmark results

0 likes · 12 min read

Can Alternating Generation‑Reduction Make LLMs Think Faster? Introducing PENCIL

Alimama Tech

Apr 23, 2025 · Artificial Intelligence

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning

The paper introduces an explainable LLM framework (ELLM‑rele) that uses chain‑of‑thought reasoning and a multi‑dimensional knowledge distillation pipeline to compress large‑model relevance judgments into lightweight student models, achieving superior offline relevance scores and online click‑through and conversion improvements in Taobao’s search advertising.

Chain-of-ThoughtLLMexplainability

0 likes · 17 min read

Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning

Alibaba Cloud Big Data AI Platform

Apr 22, 2025 · Artificial Intelligence

How DistilQwen2.5-DS3-0324 Achieves Fast, Accurate Reasoning via Quick‑Think Distillation

This article introduces DistilQwen2.5-DS3-0324, a distilled language model series that balances rapid inference with strong reasoning by applying a fast‑thinking chain‑of‑thought strategy, details its two‑stage distillation framework, evaluation on diverse benchmarks, and provides code for downloading and using the models.

Chain-of-Thoughtdeep learningfast inference

0 likes · 17 min read

How DistilQwen2.5-DS3-0324 Achieves Fast, Accurate Reasoning via Quick‑Think Distillation

Amap Tech

Apr 21, 2025 · Artificial Intelligence

Lenna: Language‑Enhanced Reasoning Detection Assistant and a Chain‑of‑Thought Image Editing Framework Using Multimodal Large Language Models

At ICASSP 2025, Gaode’s two accepted papers present Lenna, a language‑enhanced reasoning detection assistant that adds a DET token to multimodal LLMs and achieves state‑of‑the‑art accuracy on RefCOCO benchmarks, and a chain‑of‑thought image‑editing framework that converts complex prompts into segmented masks and repair prompts for diffusion‑based inpainting, surpassing existing methods.

AIChain-of-ThoughtICASSP

0 likes · 10 min read

Lenna: Language‑Enhanced Reasoning Detection Assistant and a Chain‑of‑Thought Image Editing Framework Using Multimodal Large Language Models

AI Algorithm Path

Apr 20, 2025 · Artificial Intelligence

Boosting Visual Reasoning in VLMs with Reinforcement Learning

The article analyzes how reinforcement learning, which transformed LLM reasoning in DeepSeek, can be applied to visual‑language models to overcome the limitations of traditional chain‑of‑thought prompting and supervised fine‑tuning, presenting concrete reward designs, training pipelines, and a critical assessment of their strengths and weaknesses.

Chain-of-ThoughtLLMRL Training

0 likes · 10 min read

Boosting Visual Reasoning in VLMs with Reinforcement Learning

Alibaba Cloud Developer

Apr 9, 2025 · Artificial Intelligence

Unlocking LLM Reasoning: A Deep Dive into Prompt Engineering Techniques

This article surveys classic prompt‑engineering methods such as Chain‑of‑Thought, Self‑Consistency, Least‑to‑Most, Boosting of Thoughts, Tree of Thoughts, and AutoGPT, summarizing their core ideas, advantages, limitations, and experimental results to help readers understand how to enhance large language model reasoning without model fine‑tuning.

AI reasoningChain-of-ThoughtSelf-Consistency

0 likes · 22 min read

Unlocking LLM Reasoning: A Deep Dive into Prompt Engineering Techniques

Cognitive Technology Team

Apr 4, 2025 · Artificial Intelligence

Reasoning Models Do Not Always Reveal Their Thoughts: Evaluating Chain‑of‑Thought Fidelity

The article examines how modern reasoning models like Claude 3.7 Sonnet display chain‑of‑thought explanations, but often hide or distort their true reasoning, presenting challenges for AI safety and alignment, and evaluates methods to test and improve fidelity.

AI alignmentAI safetyChain-of-Thought

0 likes · 13 min read

Reasoning Models Do Not Always Reveal Their Thoughts: Evaluating Chain‑of‑Thought Fidelity

JavaEdge

Mar 24, 2025 · Artificial Intelligence

Why Large Language Models Still Struggle with Complex Reasoning – Challenges and Solutions

This article examines the fundamental reasoning limitations of large language models, illustrates real‑world failure cases, and outlines current research directions such as better datasets, chain‑of‑thought prompting, external verification, and specialized solvers to improve their logical capabilities.

AIChain-of-ThoughtLLM

0 likes · 8 min read

Why Large Language Models Still Struggle with Complex Reasoning – Challenges and Solutions

DataFunTalk

Mar 24, 2025 · Artificial Intelligence

DeepSeek R1: Open‑Source Reasoning Model and Multi‑Stage Training Insights

The interview explores DeepSeek R1's open‑source weights, its multi‑stage training pipeline—including pre‑training, supervised fine‑tuning, and RLHF—alongside innovations such as self‑consistency, chain‑of‑thought prompting, distillation, MoE architectures, and cost considerations, highlighting its impact on the future of large language models.

AI trainingChain-of-ThoughtDeepSeek

0 likes · 20 min read

DeepSeek R1: Open‑Source Reasoning Model and Multi‑Stage Training Insights

AI Frontier Lectures

Mar 21, 2025 · Artificial Intelligence

Can Chain‑of‑Thought Templates Unlock Higher Reasoning Limits in LLMs?

The article examines how chain‑of‑thought (CoT) templates are evolving from short‑term heuristics to long‑range planning in large language models, highlighting recent advances such as OpenAI o1, DeepSeek R1, and Kimi 1.5, and explores template designs that boost reasoning performance, efficiency, and multimodal capabilities.

AI reasoningChain-of-ThoughtEfficiency

0 likes · 7 min read

Can Chain‑of‑Thought Templates Unlock Higher Reasoning Limits in LLMs?

Architect

Mar 16, 2025 · Artificial Intelligence

Training a 0.5B LLM with Chain‑of‑Thought Reasoning: From Pre‑training to GRPO Fine‑tuning

This article walks through the complete lifecycle of building a small large‑language model, covering token‑level inference, pre‑training, post‑training steps such as supervised fine‑tuning, reward‑model creation, and reinforcement‑learning methods like DPO, PPO and GRPO, culminating in a practical 0.5B model fine‑tuned for chain‑of‑thought reasoning.

Chain-of-ThoughtGRPOLLM training

0 likes · 22 min read

Training a 0.5B LLM with Chain‑of‑Thought Reasoning: From Pre‑training to GRPO Fine‑tuning

Selected Java Interview Questions

Mar 15, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Spring Boot Integration for DeepSeek with Full Chain‑of‑Thought and Streaming Support

DeepSeek4j 1.4 introduces a Java‑centric, Spring Boot‑compatible framework that fully preserves DeepSeek's chain‑of‑thought capabilities, adds reactive streaming, and provides simple one‑line API integration, addressing previous limitations in mainstream frameworks and offering ready‑to‑use configuration and code examples.

AI integrationChain-of-ThoughtDeepSeek

0 likes · 5 min read

DeepSeek4j 1.4: Java Spring Boot Integration for DeepSeek with Full Chain‑of‑Thought and Streaming Support

Fun with Large Models

Mar 8, 2025 · Artificial Intelligence

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

This tutorial explains how to enhance large language models' logical reasoning by using DeepSeek‑R1's deep‑thinking mode, few‑shot prompting, chain‑of‑thought, and zero‑shot chain‑of‑thought techniques, providing concrete examples, comparisons, and a step‑by‑step template for effective prompt design.

AI reasoningChain-of-ThoughtDeepSeek

0 likes · 10 min read

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

Java Architect Essentials

Mar 7, 2025 · Artificial Intelligence

Introducing DeepSeek4j 1.4: A Java Spring Boot Integration for DeepSeek AI with Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java Spring Boot library that overcomes existing framework limitations by preserving DeepSeek's chain‑of‑thought capabilities, adding full reactive streaming, and providing a simple one‑line API along with quick‑start instructions and code examples.

AI integrationChain-of-ThoughtDeepSeek

0 likes · 5 min read

Introducing DeepSeek4j 1.4: A Java Spring Boot Integration for DeepSeek AI with Chain‑of‑Thought and Streaming Support

AI Code to Success

Mar 6, 2025 · Artificial Intelligence

Inside the Brain Module: How AI Agents Process, Remember, and Decide

This article provides a comprehensive analysis of the Brain module in AI agents, covering its multi‑step workflow, knowledge integration, memory mechanisms, intent recognition, planning strategies, reasoning techniques, and the role of reflection and emotion in enhancing adaptability and robustness.

Brain moduleChain-of-ThoughtEmotion simulation

0 likes · 17 min read

Inside the Brain Module: How AI Agents Process, Remember, and Decide

Code Mala Tang

Feb 27, 2025 · Artificial Intelligence

Do New AI Reasoning Models Really Think? Unpacking the Debate

The article examines whether the latest AI models that claim to perform true reasoning—by breaking problems into steps and using chain‑of‑thought—actually reason like humans, presenting skeptical and supportive expert viewpoints, and offering practical guidance on how to use such models responsibly.

AI reasoningAI safetyChain-of-Thought

0 likes · 14 min read

Do New AI Reasoning Models Really Think? Unpacking the Debate

Java Web Project

Feb 25, 2025 · Artificial Intelligence

How DeepSeek4j 1.4 Solves Spring AI’s Chain‑of‑Thought and Streaming Gaps

The article explains why existing Java AI frameworks struggle with DeepSeek R1’s chain‑of‑thought and streaming features, introduces DeepSeek4j 1.4 as a targeted solution, details its core capabilities, and provides a step‑by‑step guide to integrate it with Spring Boot and Project Reactor.

AI integrationChain-of-ThoughtDeepSeek

0 likes · 5 min read

How DeepSeek4j 1.4 Solves Spring AI’s Chain‑of‑Thought and Streaming Gaps

Top Architect

Feb 21, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑based framework that overcomes Spring AI’s limitations by fully preserving DeepSeek’s chain‑of‑thought and billing features, adding reactive streaming, providing Spring Boot starter integration, and offering quick‑start code samples and configuration guidance.

AIChain-of-ThoughtDeepSeek

0 likes · 8 min read

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

Software Engineering 3.0 Era

Feb 18, 2025 · Artificial Intelligence

Deep Dive into Grok 3: How the New Reasoning Model Beats OpenAI o3-mini and DeepSeek R1

The article examines xAI's newly released Grok 3, detailing its chain‑of‑thought reasoning, synthetic‑data training, benchmark dominance over rivals like DeepSeek V3 and GPT‑4o, internal controversy, massive GPU investment, pricing, and its broader impact on the competitive AI landscape.

AI benchmarkingChain-of-ThoughtGrok 3

0 likes · 9 min read

Deep Dive into Grok 3: How the New Reasoning Model Beats OpenAI o3-mini and DeepSeek R1

Tencent Technical Engineering

Feb 17, 2025 · Artificial Intelligence

Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques

The guide defines prompts as structured queries that unlock large‑language‑model abilities, outlines five core frameworks (RTF, Chain‑of‑Thought, RISEN, RODES, Density‑Chain), presents two key principles—clear, delimited instructions and explicit reasoning steps—to reduce hallucinations, and surveys advanced techniques such as zero‑shot, few‑shot, RAG, Tree‑of‑Thought and automatic prompt engineering.

AIChain-of-ThoughtRetrieval-Augmented Generation

0 likes · 29 min read

Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques

AI Algorithm Path

Feb 12, 2025 · Artificial Intelligence

Essential DeepSeek‑R1 Reading List: Papers Behind the 2025 Hottest LLM

This article compiles a curated reading list of foundational and recent research papers—from the original Transformer to chain‑of‑thought, mixture‑of‑experts, and reinforcement‑learning studies—that together explain the breakthroughs behind DeepSeek‑R1 and guide readers through the technical evolution of modern large language models.

Chain-of-ThoughtDeepSeekLarge Language Model

0 likes · 15 min read

Essential DeepSeek‑R1 Reading List: Papers Behind the 2025 Hottest LLM

DevOps

Feb 7, 2025 · Artificial Intelligence

OpenAI Releases o3-mini Chain‑of‑Thought: First Tests, Community Reactions, and Critical Analysis

OpenAI has publicly disclosed the chain‑of‑thought reasoning of its o3‑mini model, prompting a wave of community experiments, critiques about authenticity, and discussions on the model’s limitations, prompting insights into AI interpretability and the trade‑offs of revealing internal reasoning.

Chain-of-ThoughtO3-miniOpenAI

0 likes · 6 min read

OpenAI Releases o3-mini Chain‑of‑Thought: First Tests, Community Reactions, and Critical Analysis

Java Architecture Diary

Feb 5, 2025 · Artificial Intelligence

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

This article examines why mainstream AI frameworks like Spring AI and LangChain4j cannot fully support DeepSeek’s R1 model, explains its unique chain‑of‑thought response format and parameter constraints, and provides a complete Spring WebFlux‑based solution—including API calls, streaming handling, and response parsing—to preserve reasoning content.

Chain-of-ThoughtDeepSeekR1

0 likes · 8 min read

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

AIWalker

Feb 4, 2025 · Artificial Intelligence

How Chain‑of‑Thought Boosts Text‑to‑Image Generation: The New o1 Inference Scheme

This article reviews a comprehensive study that applies Chain‑of‑Thought reasoning to autoregressive text‑to‑image generation, introducing extended test‑time computation, direct preference optimization, and two custom reward models (PARM and PARM++) that together improve generation quality by up to 15% over Stable Diffusion 3.

Chain-of-ThoughtDirect Preference OptimizationMultimodal AI

0 likes · 13 min read

How Chain‑of‑Thought Boosts Text‑to‑Image Generation: The New o1 Inference Scheme

DaTaobao Tech

Jan 24, 2025 · Artificial Intelligence

MktAI Assistant: AI‑Driven Marketing Data Query and Insight Platform

The MktAI Assistant combines LLM‑powered memory, skill planning, and tool‑calling with real‑time API data to replace slow, manual SQL dashboards, delivering sub‑minute, fresh, explainable marketing queries and attribution insights that boost decision speed, accuracy, and collaboration between data scientists and business users.

AI AgentChain-of-ThoughtFunction Calling

0 likes · 16 min read

MktAI Assistant: AI‑Driven Marketing Data Query and Insight Platform

Baobao Algorithm Notes

Nov 24, 2024 · Artificial Intelligence

How Marco‑o1 Merges Chain‑of‑Thought Fine‑Tuning with Monte‑Carlo Tree Search for Superior Reasoning

The article introduces Marco‑o1, an open‑source LLM that enhances complex reasoning by fine‑tuning on Chain‑of‑Thought data, integrating Monte‑Carlo Tree Search, introducing mini‑step actions and a reflection mechanism, and evaluates its performance on multilingual math and translation benchmarks.

Chain-of-ThoughtLLMMachine Translation

0 likes · 15 min read

How Marco‑o1 Merges Chain‑of‑Thought Fine‑Tuning with Monte‑Carlo Tree Search for Superior Reasoning

JD Tech

Nov 12, 2024 · Artificial Intelligence

Prompt Engineering: Concepts, Evolution, Techniques, and JD Logistics Application

This article explains what Prompt Engineering is, traces its development from early NLP commands to modern adaptive and multimodal prompting techniques, describes various prompting strategies such as Zero‑shot, Few‑shot, Chain‑of‑Thought, Auto‑CoT, and showcases a JD Logistics case study using these methods to classify product types with code examples.

AI Prompt DesignChain-of-ThoughtFew-shot

0 likes · 27 min read

Prompt Engineering: Concepts, Evolution, Techniques, and JD Logistics Application

JD Tech Talk

Nov 11, 2024 · Artificial Intelligence

Prompt Engineering: Concepts, Evolution, Techniques, and a Logistics Application Case

This article explains what Prompt Engineering is, traces its development from early command‑based interactions to modern adaptive and multimodal prompting, details various prompting techniques such as zero‑shot, few‑shot, Chain‑of‑Thought, hallucination‑reduction methods, and demonstrates their practical use in a JD Logistics SKU piece‑type classification case with code examples.

AI promptingChain-of-ThoughtLLM applications

0 likes · 26 min read

Prompt Engineering: Concepts, Evolution, Techniques, and a Logistics Application Case

JD Cloud Developers

Nov 11, 2024 · Artificial Intelligence

Mastering Prompt Engineering: History, Techniques, and Real-World Applications

This article explains what Prompt Engineering is, traces its evolution from early NLP commands to modern adaptive and multimodal prompting, details core techniques such as Zero‑shot, Chain‑of‑Thought, Auto‑CoT, and reduction of hallucinations, and showcases a logistics case study using various prompting strategies.

AIChain-of-ThoughtLLM

0 likes · 26 min read

Mastering Prompt Engineering: History, Techniques, and Real-World Applications

DaTaobao Tech

Oct 30, 2024 · Artificial Intelligence

Understanding OpenAI o1: Chain‑of‑Thought, Scaling Laws, and Training Strategies

The article explains how OpenAI’s o1 model leverages chain‑of‑thought prompting, dual‑system cognitive theory, and new scaling laws—pre‑training on code/math and post‑training reinforcement with step‑wise reward models—to achieve superior reasoning, safety, and performance over GPT‑4, heralding a shift toward models that learn to think.

Chain-of-ThoughtLLMSafety

0 likes · 42 min read

Understanding OpenAI o1: Chain‑of‑Thought, Scaling Laws, and Training Strategies

Architect

Sep 26, 2024 · Artificial Intelligence

Decoding OpenAI o1: How RL‑LLM Fusion Powers Next‑Gen Reasoning

This article provides a detailed technical analysis of OpenAI’s o1 model, exploring its enhanced logical reasoning, the likely use of reinforcement learning with hidden chain‑of‑thought generation, multi‑model architecture, training data pipelines, reward modeling, and how these innovations could reshape AI safety and scaling strategies.

AI safetyChain-of-ThoughtLLM

0 likes · 43 min read

Decoding OpenAI o1: How RL‑LLM Fusion Powers Next‑Gen Reasoning

iQIYI Technical Product Team

Sep 26, 2024 · Artificial Intelligence

AI-Powered Search in iQIYI: Techniques, Architecture, and Implementation

iQIYI’s AI‑powered search expands beyond title‑only queries by handling fuzzy role, plot, star, award, and semantic searches, using Chain‑of‑Thought‑generated TIPS, Retrieval‑Augmented Generation with sophisticated indexing, chunking, embedding, reranking, and prompt‑engineering to deliver personalized, accurate video recommendations that boost user engagement.

AI SearchChain-of-ThoughtEmbedding

0 likes · 15 min read

AI-Powered Search in iQIYI: Techniques, Architecture, and Implementation

CSS Magic

Sep 14, 2024 · Artificial Intelligence

Why OpenAI’s New o1 Model Outperforms Its Rivals

The article examines OpenAI’s newly released o1 model, highlighting its superior performance in complex reasoning tasks such as math, programming, and science, and explains how model‑level chain‑of‑thought optimization and product‑level UI design give it an edge over competitors like Claude.

AI evaluationChain-of-ThoughtChatGPT

0 likes · 8 min read

Why OpenAI’s New o1 Model Outperforms Its Rivals

MaGe Linux Operations

Sep 13, 2024 · Artificial Intelligence

Can OpenAI’s New o1 Model Reach Human‑Level Reasoning?

OpenAI’s newly released o1 series introduces a reinforcement‑learning‑trained LLM that generates long chain‑of‑thought reasoning, achieving top‑50% scores on IOI contests, high rankings on Codeforces and AIME, and dramatically outperforming GPT‑4o across scientific and mathematical tasks.

AI reasoningChain-of-ThoughtLarge Language Model

0 likes · 8 min read

Can OpenAI’s New o1 Model Reach Human‑Level Reasoning?

Tencent Cloud Developer

Jul 30, 2024 · Artificial Intelligence

A Systematic Guide to Prompt Engineering: From Zero to One

This guide walks readers from beginner to proficient Prompt Engineer by outlining the evolution of prompting, introducing a universal four‑component template, and detailing a five‑step workflow—including refinement, retrieval‑augmented generation, chain‑of‑thought reasoning, and advanced tuning techniques—plus evaluation metrics for LLM performance.

AI promptingChain-of-ThoughtLLM Optimization

0 likes · 51 min read

A Systematic Guide to Prompt Engineering: From Zero to One

NewBeeNLP

Jun 19, 2024 · Artificial Intelligence

Can Symbolic Chain‑of‑Thought Boost LLM Logical Reasoning?

The paper introduces SymbCoT, a Symbolic Chain‑of‑Thought framework that translates natural‑language problems into symbolic form, plans, solves, and verifies reasoning steps, achieving significantly higher logical reasoning performance than traditional CoT methods across multiple benchmark datasets.

ACL 2024Chain-of-ThoughtLLM

0 likes · 13 min read

Can Symbolic Chain‑of‑Thought Boost LLM Logical Reasoning?

Xiaohongshu Tech REDtech

Apr 10, 2024 · Artificial Intelligence

Early‑Stopping Self‑Consistency (ESC): Reducing Sampling Cost for Large Language Model Reasoning

Early‑Stopping Self‑Consistency (ESC) dynamically halts sampling once a sliding‑window answer distribution reaches zero entropy, cutting the number of required LLM reasoning samples by 34‑84 % across arithmetic, commonsense, and symbolic benchmarks while preserving accuracy and offering a theoretically‑bounded, robust, budget‑adaptive alternative to traditional Self‑Consistency.

AIChain-of-ThoughtEarly Stopping

0 likes · 14 min read

Early‑Stopping Self‑Consistency (ESC): Reducing Sampling Cost for Large Language Model Reasoning

DataFunTalk

Feb 2, 2024 · Artificial Intelligence

Utilizing Negative Samples for Knowledge Distillation of Large Language Models

This paper presents a novel framework that leverages negative samples during large language model distillation through three stages—Negative Assistive Training, Negative Calibration Enhancement, and Adaptive Self‑Consistency—demonstrating significant accuracy gains on challenging mathematical reasoning benchmarks and improved generalization to out‑of‑distribution tasks.

Chain-of-ThoughtKnowledge TransferLLM distillation

0 likes · 13 min read

Utilizing Negative Samples for Knowledge Distillation of Large Language Models

Sohu Tech Products

Aug 23, 2023 · Artificial Intelligence

Engineering GPT Applications: Capabilities, Limitations, and Solutions

The guide explains GPT’s core capabilities—natural language mastery, domain reasoning, and code generation—while detailing its limits such as prompt sensitivity, token caps, and lack of memory, then offers engineering workarounds like systematic prompting, chain‑of‑thought, external memory, tool integration, safety checks, and a six‑layer architecture for building robust commercial AI applications.

AI Application ArchitectureChain-of-ThoughtGPT

0 likes · 20 min read

Engineering GPT Applications: Capabilities, Limitations, and Solutions

Alibaba Cloud Developer

Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Chain-of-ThoughtFew-shotLLM

0 likes · 33 min read

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

Tencent Cloud Developer

Jun 28, 2023 · Artificial Intelligence

Prompt Engineering: Fundamentals, Techniques, and Advanced Strategies

Prompt engineering teaches how to craft effective instructions, context, input data, and output formats for large language models, using clear commands, iterative refinement, and advanced methods such as zero‑shot, few‑shot, chain‑of‑thought, Tree of Thoughts, retrieval‑augmented and progressive‑hint prompting to achieve precise, reliable results across diverse tasks.

AIChain-of-ThoughtMultimodal

0 likes · 17 min read

Prompt Engineering: Fundamentals, Techniques, and Advanced Strategies