JD Cloud Developers
JD Cloud Developers
Feb 4, 2026 · Artificial Intelligence

How Deep Research Transforms LLMs into Autonomous AI Researchers

This article examines Deep Research, an AI system that adds autonomous planning and deep reasoning to large language models, enabling them to browse the web, perform long‑chain reasoning, and generate professional, citation‑rich reports for complex tasks such as industry trend analysis and technical competitive research.

AI researchInformation RetrievalLLM
0 likes · 22 min read
How Deep Research Transforms LLMs into Autonomous AI Researchers
Tencent Technical Engineering
Tencent Technical Engineering
Dec 1, 2025 · Artificial Intelligence

Do Machines Really Think? Inside Deep Reasoning, Scaling Laws & RLHF for LLMs

This article examines whether large language models truly think, explores the origins of deep reasoning through transformer architectures and scaling laws, reviews chain‑of‑thought and its variants, and analyzes how reinforcement learning from human feedback—including PPO, DPO, and GRPO—helps internalise step‑by‑step reasoning while pointing to future directions such as atomic thought, hierarchical models, and training‑free in‑context knowledge bases.

AI alignmentLLMRLHF
0 likes · 35 min read
Do Machines Really Think? Inside Deep Reasoning, Scaling Laws & RLHF for LLMs
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 20, 2025 · Artificial Intelligence

How DeepAgent Achieves End‑to‑End Reasoning with 16,000+ Scalable Tools

DeepAgent is a new end‑to‑end reasoning agent that unifies autonomous thinking, dynamic tool search, and execution, handling over 16,000 real APIs, supporting embodied environments and research assistance, and achieving state‑of‑the‑art results across multiple benchmarks through its unified reasoning core, memory‑folding mechanisms, structured memory, and the ToolPO training framework.

AI agentsTool Integrationdeep reasoning
0 likes · 14 min read
How DeepAgent Achieves End‑to‑End Reasoning with 16,000+ Scalable Tools
AntTech
AntTech
Oct 14, 2025 · Artificial Intelligence

How Ring-1T Achieves Trillion-Scale Deep Thinking and Competitive Benchmarks

The Ring-1T model, a trillion-parameter AI system released as open source, leverages advanced reinforcement learning techniques, extensive benchmark evaluations, and custom training frameworks to deliver balanced performance across math, code, reasoning, and creative tasks while highlighting current limitations and future development plans.

AI modelLarge Language Modelbenchmark evaluation
0 likes · 8 min read
How Ring-1T Achieves Trillion-Scale Deep Thinking and Competitive Benchmarks
DataFunSummit
DataFunSummit
Oct 7, 2025 · Artificial Intelligence

Deep Thinking in Large Language Models: Overcoming Domain Challenges

This presentation explores how large language models can transcend their general knowledge limits by developing domain‑specific deep thinking abilities, addressing challenges such as complex instruction execution, expert reasoning gaps, and tool integration, and proposes reinforcement‑learning‑driven frameworks, structured thinking pipelines, and tool‑calling mechanisms to achieve rational intelligence.

Domain AdaptationTool Integrationdeep reasoning
0 likes · 27 min read
Deep Thinking in Large Language Models: Overcoming Domain Challenges
AntTech
AntTech
Aug 6, 2025 · Artificial Intelligence

Ring-lite-2507: Boosted Deep Reasoning and Balanced General Capabilities

The AntBailing team releases Ring-lite-2507, enhancing deep reasoning through a Two‑staged RL pipeline while simultaneously balancing overall model abilities, showcasing notable gains on benchmarks like ARC‑AGI‑v1 and offering the model as an open‑source resource across major platforms.

Large Language ModelRL trainingRing-lite
0 likes · 5 min read
Ring-lite-2507: Boosted Deep Reasoning and Balanced General Capabilities
AI Algorithm Path
AI Algorithm Path
Jun 11, 2025 · Artificial Intelligence

OpenAI's O3‑Pro Model: Deep Reasoning, Pricing, Benchmarks, and Access Guide

OpenAI introduced the O3‑Pro multimodal deep‑reasoning model with an 80% price cut for O3, detailed its training via large‑scale reinforcement learning, compared its capabilities and costs against GPT‑4o, GPT‑4.1 and O3‑Pro, listed its core specs, limitations, access methods, and presented benchmark tests that highlight both strengths and weaknesses.

AIO3-ProOpenAI
0 likes · 10 min read
OpenAI's O3‑Pro Model: Deep Reasoning, Pricing, Benchmarks, and Access Guide