Tagged articles
1023 articles
Instant Consumer Technology Team
Dec 16, 2025 · Artificial Intelligence

How Mind Lab Trained a Trillion‑Parameter Agentic Memory with Only 10% GPU Power

This article explains how the Mind Lab team tackled the challenges of training a 1‑trillion‑parameter mixture‑of‑experts model for agentic memory using reinforcement learning, LoRA, and a custom Megatron‑Bridge architecture, achieving a ten‑fold speedup while consuming just a fraction of the usual GPU resources.

AI · Agentic Apps · LoRA
0 likes · 9 min read
DataFunSummit
Dec 14, 2025 · Artificial Intelligence

How Sina Weibo Scaled Enterprise AI with a Unified Multi‑Agent Platform

Sina Weibo’s engineering team tackled the high technical barriers, low reuse, and long cycles of large‑model AI deployment by building a unified AI application platform that combines a layered architecture, low‑code workflow, multi‑agent orchestration, and knowledge‑base integration, enabling rapid, reliable AI solutions across the company.

AI Platform · Enterprise AI · Knowledge Base
0 likes · 26 min read
PaperAgent
Dec 12, 2025 · Artificial Intelligence

What Makes GPT‑5.2 and Gemini‑3‑Pro So Fast? Inside Their Key Features and Real‑World Tests

Gemini‑3‑pro’s surprise debut and OpenAI’s emergency release of GPT‑5.2 highlight a shift toward faster inference, deeper reasoning, and lower hallucination rates, with detailed performance metrics, three‑tier model options, extended context windows, and mixed community test results that reveal both strengths and shortcomings.

AI model performance · GPT-5.2 · Gemini 3 Pro
0 likes · 4 min read
Amap Tech
Dec 11, 2025 · Artificial Intelligence

How ACoder Achieved Up to 24× Faster Multi‑Platform Development with AI

The ACoder platform combines multi‑model AI, a panoramic code‑understanding engine, and a layered knowledge‑management system to automate the entire software‑development lifecycle, delivering 5‑20× overall efficiency gains, up to 24× speed‑up for cross‑platform code migration, and dramatically higher code‑recall accuracy.

AI Coding · Code Generation · Software Engineering
0 likes · 19 min read
AI Frontier Lectures
Dec 9, 2025 · Artificial Intelligence

Can Token‑Level Surrogates Stabilize RL for Large Language Models? A Deep Dive

This article analyzes why optimizing sequence‑level rewards for LLMs with token‑level surrogate objectives can improve reinforcement‑learning stability, explains the theoretical conditions required, introduces Routing Replay for MoE models, and presents extensive experiments validating the approach.

Importance Sampling · Mixture of Experts · large language models
0 likes · 12 min read
Tencent Cloud Developer
Dec 9, 2025 · Artificial Intelligence

How Do Large Language Models Turn Text into Math? A Deep Dive into Transformers

This article walks through the complete workflow of AI large language models, from turning user queries into token matrices via tokenization and embedding, through the Transformer’s self‑attention and multi‑head mechanisms, to decoding logits into human‑readable text, while also covering position encoding, long‑context strategies, generation parameters, and practical engineering tips.

Inference Optimization · Self-Attention · Transformer
0 likes · 29 min read
PaperAgent
Dec 6, 2025 · Artificial Intelligence

How Titans and MIRAS Enable AI Models to Remember 1 Million Tokens

Google's Titans architecture and the MIRAS theoretical framework introduce a deep neural memory that lets large language models learn in real time, retain surprising information, and handle context windows of up to two million tokens, outperforming existing Transformers and linear RNNs on a range of benchmarks.

AI memory · MIRAS framework · Sequence Modeling
0 likes · 10 min read
HyperAI Super Neural
Dec 6, 2025 · Artificial Intelligence

Quick Look at This Week’s Frontier AI Papers: DeepSeekMath‑V2, MedSAM‑3, SAM 3D, Qwen3‑VL, and M²

This roundup surveys five cutting‑edge AI papers—DeepSeekMath‑V2’s self‑verifiable mathematical reasoning, MedSAM‑3’s promptable medical image and video segmentation, SAM 3D’s single‑image 3D reconstruction, Qwen3‑VL’s high‑capacity vision‑language model, and the M² memory‑mesh transformer for image captioning—highlighting their key methods, benchmarks, and code links.

3D reconstruction · Image Captioning · Mathematical Reasoning
0 likes · 6 min read
JD Retail Technology
Dec 4, 2025 · Artificial Intelligence

Twin Networks Reveal How to Optimize Data Mixtures for Large Language Models

This article presents TANDEM, a bi‑level data‑mixture optimization framework that uses twin networks to automatically adjust domain‑specific training data ratios, offering theoretical guarantees, broader applicability, and significant performance gains across pre‑training, fine‑tuning, and e‑commerce product‑understanding tasks.

NeurIPS · bi-level optimization · data mixture optimization
0 likes · 6 min read
Tencent Cloud Developer
Dec 4, 2025 · Artificial Intelligence

From Tapestry to LLMs: 30+ Years of Recommender System Evolution

This article traces the three‑decade evolution of recommender systems—from early collaborative‑filtering prototypes like Tapestry, through the Netflix Prize era and deep‑learning breakthroughs such as Wide&Deep and DIN, to the current generative‑AI wave driven by large language models—highlighting key milestones, technical shifts, industrial deployments, and future challenges.

Deep Learning · Industrial Deployment · collaborative filtering
0 likes · 38 min read
PaperAgent
Dec 4, 2025 · Artificial Intelligence

From Code Foundations to AI Agents: A Deep Dive into Code LLMs and Their Applications

This article reviews a comprehensive 303‑page survey on code foundation models, tracing the evolution of code‑focused large language models from 2021 to 2025, comparing general‑purpose and specialized LLMs, and presenting extensive experiments on prompting, fine‑tuning, reinforcement learning, and autonomous coding agents.

AI Coding · Code LLM · Model Evaluation
0 likes · 5 min read
AI2ML AI to Machine Learning
Dec 3, 2025 · Artificial Intelligence

2026 Forecast: How Large‑Model AI Will Evolve After 2025 Breakthroughs

The article reviews the major 2025 breakthroughs in multimodal, open‑source, and deployment technologies for large models and outlines four 2026 trends—including ToC vs. ToB service split, dual‑hand data generation, MoE routing advances, and AI4Science breakthroughs—that will shape the next wave of AI development.

AI deployment · AI4Science · Mixture of Experts
0 likes · 6 min read
Baidu MEUX
Dec 3, 2025 · User Experience Design

Boost User Research with AI: Automating Short Feedback Classification & Long‑Form Insight Extraction

This article explains how AI large‑language models can automate short user‑feedback classification and extract insights from long interview texts, offering practical prompting tips, fine‑tuning strategies, and Retrieval‑Augmented Generation workflows to make user research faster, more accurate, and less labor‑intensive.

AI · Feedback Classification · Prompt engineering
0 likes · 11 min read
ShiZhen AI
Dec 2, 2025 · Artificial Intelligence

What Is a Prompt? Mastering Question Techniques for Better AI Results

Episode 4 of the Comic‑AI series explains that a prompt is the art of formulating precise questions to guide large language models, covering content and format constraints, positive and negative prompting, and showing how specific instructions lead to more predictable AI behavior.

AI · AI interaction · Prompt Design
0 likes · 3 min read
ShiZhen AI
Dec 1, 2025 · Artificial Intelligence

AI Comic Episode 3: What Exactly Is a Token?

This episode explains that a token is the smallest text chunk an LLM processes—ranging from characters to subwords—covers why subword tokenization avoids vocabulary explosion, compares token counts across languages, describes the computational cost of sequential generation, and introduces visual tokens for multimodal models.

AI fundamentals · Multimodal · large language models
0 likes · 7 min read
JD Tech
Nov 28, 2025 · Artificial Intelligence

How JD Ads Uses Large Language Models to Transform Advertising

This article details JD Advertising's shift from generic to domain‑specific large models, the design of AI‑driven ad agents, the end‑to‑end GRAM retrieval‑alignment system, CTR‑guided AIGC for creatives, ultra‑low‑latency inference techniques, and ARM‑based optimizations that together reshape modern ad marketing.

CTR optimization · Intelligent agents · advertising AI
0 likes · 19 min read
Meituan Technology Team
Nov 27, 2025 · Artificial Intelligence

AMO‑Bench: A New High‑Difficulty, Original Math Reasoning Benchmark for LLMs

AMO‑Bench, released by Meituan's LongCat team, is a 50‑question, IMO‑level math reasoning benchmark that combines original, high‑difficulty problems with automated scoring. It exposes the current limits of top large language models, whose best accuracy hovers around 52%, and offers a more discriminative evaluation tool for future model improvements.

AI Evaluation · AMO-Bench · benchmark
0 likes · 12 min read
DataFunTalk
Nov 25, 2025 · Artificial Intelligence

Unlocking Agentic RAG and Generative Ranking: AI Search & Recommendation Breakthroughs

This article summarizes cutting‑edge techniques from Alibaba Cloud AI Search’s Agentic RAG architecture, Huawei Noah’s LLM‑enhanced recommendation evolution, and Baidu’s GRAB generative ranking model, detailing multi‑agent retrieval, multimodal data handling, scaling laws, causal attention, and performance gains demonstrated through benchmarks and real‑world deployments.

AI search · Agentic RAG · Generative Ranking
0 likes · 8 min read
ITPUB
Nov 24, 2025 · Artificial Intelligence

Why Memory, Not Size, Is the Next Bottleneck for Large Language Models

In a detailed interview, the CTO of Memory Tensor (Shanghai) explains how limited memory capacity hampers large models, outlines the MemOS memory operating system, discusses information‑theoretic metrics, multimodal extensions, and reinforcement‑learning strategies for scalable, secure, and explainable AI memory management.

AI Architecture · Multimodal AI · information theory
0 likes · 23 min read
DataFunSummit
Nov 23, 2025 · Artificial Intelligence

How Large Language Models Are Revolutionizing Banking Data Integration

This article examines the challenges of traditional banking data, explains how large language models can fuse structured and unstructured information, outlines a new data‑centric infrastructure and governance approach, and describes the DiFY platform’s AI‑agent and DataOps capabilities for agile, non‑intrusive integration with core banking systems.

AI agents · Big Data · Data Governance
0 likes · 16 min read
Kuaishou Tech
Nov 20, 2025 · Artificial Intelligence

How UniDex and UniSearch Redefine Video Search with Semantic Indexing and Generative Models

This article explains how Kuaishou’s UniDex replaces traditional term‑based inverted indexes with model‑driven semantic posting lists and how the end‑to‑end UniSearch framework generates video IDs directly from queries, delivering higher relevance, lower latency, and significant online performance gains.

AI · Generative Models · Search
0 likes · 17 min read
360 Zhihui Cloud Developer
Nov 20, 2025 · Artificial Intelligence

How DeepAgent Redefines AI Agents with Memory Folding and ToolPO

This article breaks down the DeepAgent paper, explaining its novel "main model + auxiliary model" architecture, the memory‑folding mechanism that compresses long‑context reasoning, and the ToolPO reinforcement strategy that enables efficient tool discovery and usage.

AI agents · ToolPO · large language models
0 likes · 8 min read
Tencent Advertising Technology
Nov 20, 2025 · Artificial Intelligence

CoderRec: Latent Reasoning Boosts Sequential Recommendation

CoderRec, a new sequential recommendation framework jointly developed by Tencent Advertising Technology and Tsinghua University, combines domain‑specific latent reasoning with cross‑scale model collaboration to capture implicit user intent and fuse large‑language‑model semantics with traditional recommender signals, achieving state‑of‑the‑art performance on multiple Amazon datasets.

artificial intelligence · cross-scale collaboration · large language models
0 likes · 17 min read
Alibaba Cloud Developer
Nov 18, 2025 · Artificial Intelligence

How ReAct and Reflexion Boost Large Language Models for Complex, Real‑World Tasks

The article explains the limitations of large language models on multi‑step reasoning, real‑time information retrieval, and planning, then introduces the ReAct (Reasoning + Acting) framework and its Reflexion extension, detailing their mechanisms, examples, performance gains, practical applications, and future research directions.

Agentic AI · LLM Reasoning · Prompt engineering
0 likes · 16 min read
AI Tech Publishing
Nov 17, 2025 · Artificial Intelligence

Frontier AI Models in RL Environments Reveal an Agent Capability Hierarchy

The article evaluates nine cutting‑edge AI models on 150 simulated workplace tasks, showing that even the strongest models complete fewer than 40% of tasks, and uses these results to propose a hierarchical framework of agentic capabilities ranging from tool use to common‑sense reasoning.

AI model evaluation · Tool Use · agentic capabilities
0 likes · 19 min read
Data Thinking Notes
Nov 16, 2025 · Artificial Intelligence

How AI Agents Transform Automation: Architecture, Challenges & Future Trends

This comprehensive overview examines AI agents powered by large language models, detailing their definition, core components, architectural patterns, key technologies such as prompt engineering and retrieval‑augmented generation, diverse application domains, current challenges, security solutions, and emerging research directions.

Prompt engineering · Retrieval Augmented Generation · architecture
0 likes · 81 min read
Liangxu Linux
Nov 12, 2025 · Artificial Intelligence

Top Open‑Source AI‑Powered Tools to Boost Your Workflow (2024)

This article introduces several open-source projects: MarkItDown for document-to‑Markdown conversion, the Codebuff AI coding assistant, Twitter’s recommendation algorithm, mlx‑lm for running LLMs on Apple silicon, Perplexica AI search, and the ChinaTextbook dataset. It highlights their features, usage, and GitHub links.

AI · Document Conversion · education
0 likes · 6 min read
AntTech
Nov 11, 2025 · Artificial Intelligence

Breaking the Efficiency Wall: Ant Group’s Bailing Model Paves the Way to AGI

At CNCC 2025, Ant Group’s Vice President Zhou Jun outlined the Bailing large model’s five‑layer architecture, hybrid linear attention, Ling Scaling Law, and novel training algorithms that dramatically cut costs and latency, achieving state‑of‑the‑art performance on math and code benchmarks while promoting open‑source collaboration toward AGI.

AGI · Mixture of Experts · Multimodal AI
0 likes · 8 min read
Alimama Tech
Nov 11, 2025 · Artificial Intelligence

Accelerating LLM RL with Async Training, Mini‑Critics, and Attention Rewards

This article introduces the 3A collaborative framework—Async architecture, Asymmetric PPO mini‑critics, and an attention‑based reasoning rhythm—demonstrating how decoupled, fine‑grained parallel training and structure‑aware reward allocation dramatically improve efficiency, scalability, and interpretability of reinforcement learning for large language models.

asynchronous training · attention mechanisms · large language models
0 likes · 23 min read
Network Intelligence Research Center (NIRC)
Nov 11, 2025 · Artificial Intelligence

What Is Mechanistic Interpretability and Why It Matters for Large Language Models

The article defines mechanistic interpretability as reverse‑engineering LLMs to reveal how they represent knowledge and make decisions, explains its importance for transparency, risk mitigation, and model improvement, and surveys key techniques such as causal tracing, zero‑making, noise‑making, and logit‑lens methods with illustrative examples.

causal tracing · large language models · logit lens
0 likes · 8 min read
DaTaobao Tech
Nov 10, 2025 · Artificial Intelligence

How Tmall’s AI Transforms Test Case Generation for Faster, Smarter QA

This article details Tmall's technology team's deep AI‑driven testing practice, outlining industry challenges, the need for intelligent test case generation, and a comprehensive strategy that combines prompt engineering, RAG‑based knowledge bases, and platform integration to boost coverage, reduce manual effort, and accelerate release cycles.

AI testing · Knowledge Base · Prompt engineering
0 likes · 10 min read
Tencent Technical Engineering
Nov 10, 2025 · Artificial Intelligence

How Large Language Models Evolved in 2025: From DeepSeek to Kimi‑K2 and Beyond

This article maps the rapid evolution of open‑source large language models in 2025, explains the underlying architectural breakthroughs such as MLA, MoE, and NSA, compares dozens of models—including DeepSeek‑V3, OLMo2, Gemma3, Llama4, Qwen3, and Kimi‑K2—and highlights the emergence of powerful AI assistants like Dola, providing developers with a concise technical roadmap.

AI Assistant · LLM efficiency · Mixture of Experts
0 likes · 44 min read
DataFunSummit
Nov 9, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks

This article reviews Kuaishou's two‑year exploration of large‑model techniques in advertising, detailing the challenges of content‑domain ad estimation, the use of multimodal and LLM technologies to harness full‑scope user behavior and external knowledge, and the COPE and LEARN frameworks that delivered measurable business gains.

Advertising · Knowledge Transfer · Multimodal AI
0 likes · 6 min read
AntTech
Nov 8, 2025 · Artificial Intelligence

Ant Group’s AntBaiLing Model: Pushing AI Scaling Limits with Trillion‑Parameter Efficiency

Ant Group’s President Luo Ji outlined how the AntBaiLing suite, featuring trillion‑parameter open‑source models, three efficiency breakthroughs, and a domestic compute cluster, is advancing AGI research and inclusive applications, especially in healthcare, while emphasizing ethical, trustworthy AI.

AGI · large language models · model efficiency
0 likes · 5 min read
Sohu Tech Products
Nov 5, 2025 · Artificial Intelligence

Do AI Models Really Have Introspective Awareness? Anthropic’s New Findings

Anthropic’s recent study reports that large language models like Claude Opus 4 exhibit functional introspective awareness. It defines rigorous criteria for true introspection and demonstrates through four experiments how models can recognize, report, and even control their internal states, though the capability remains unstable and context‑dependent.

AI · Claude Opus · Concept Injection
0 likes · 15 min read
Zhihu Tech Column
Nov 4, 2025 · Artificial Intelligence

How Multimodal Large Models Transform Recommendation Systems: From Tags to Embeddings

This article explores how multimodal large models like Qwen2.5‑VL enable high‑dimensional tag generation and universal embeddings for recommendation systems, detailing data synthesis, model training, quantization, fine‑tuning, and the resulting improvements in click‑through rate and exposure interaction.

Embedding · Multimodal AI · Recommendation Systems
0 likes · 17 min read
Alibaba Cloud Big Data AI Platform
Nov 4, 2025 · Artificial Intelligence

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

EMNLP 2025 in Suzhou will feature Alibaba Cloud’s AI platform PAI presenting four accepted papers on knowledge distillation, small‑model reasoning, distilled reasoning models, and an automated RAG benchmark framework, alongside exhibition demos, networking events, and recruitment opportunities for AI talent.

AI Platform · EMNLP 2025 · Retrieval Augmented Generation
0 likes · 10 min read
JD Retail Technology
Nov 4, 2025 · Artificial Intelligence

How AIGC Is Transforming E‑commerce with Personalized Visual Content

This article explains how large‑model AIGC technology reshapes e‑commerce by enabling mass‑produced, user‑profile‑driven visual assets, detailing the evolution from early online trade to the 2.0 era, the technical pipeline of multimodal models, and the practical impact on merchants.

AIGC · Multimodal AI · e‑commerce
0 likes · 17 min read
Baidu Intelligent Cloud Tech Hub
Nov 4, 2025 · Artificial Intelligence

How Baidu’s Baige Accelerates Multimodal Video Training with Context Parallelism

Baidu Baige’s enhanced veRL framework dramatically boosts video frame rates and resolution limits, cuts training time, reduces memory usage, and improves model accuracy by leveraging context parallelism and optimized attention on Ampere GPUs for multimodal mixed‑training scenarios.

AI acceleration · Context Parallelism · Multimodal Training
0 likes · 6 min read
DeWu Technology
Nov 3, 2025 · Artificial Intelligence

How Large Language Models Boost Search Relevance: A Real‑World Case Study

This article explains how a leading e‑commerce platform leveraged large language models to overcome traditional search relevance challenges, detailing the iterative workflow, model distillation, performance gains, deployment results, and future directions for smarter, more accurate product search.

AI · e‑commerce · large language models
0 likes · 10 min read
AI Info Trend
Nov 3, 2025 · Industry Insights

2025 Q3 AI Landscape: Key Players, Model Trends, and Hardware Shifts

Artificial Analysis’s Q3 2025 AI report reveals a rapidly accelerating industry across the entire stack, with US and Chinese labs neck‑and‑neck, fierce competition among OpenAI, Google, Anthropic, xAI, DeepSeek and Alibaba, cost‑efficient models, booming multimodal agents, and a hardware race led by NVIDIA’s Blackwell accelerators.

2025 · AI · Hardware
0 likes · 12 min read
DataFunSummit
Nov 1, 2025 · Artificial Intelligence

Large Language Models Revolutionize Legal Document Automation – Alibaba Expert Insights

This article explores how breakthrough large‑model technologies are reshaping legal document automation, covering current challenges, the evolution of intelligent document processing, large‑model applications in core legal scenarios, benchmark results, performance optimizations, and future directions, based on a talk by Alibaba senior algorithm engineer Huang Zhangfeng.

Document Automation · Enterprise Compliance · Information Extraction
0 likes · 18 min read
Bighead's Algorithm Notes
Oct 30, 2025 · Artificial Intelligence

FinSearchComp: ByteDance’s Expert‑Level Financial Search and Reasoning Benchmark for Real‑World Scenarios

FinSearchComp is the first fully open‑source benchmark that evaluates large‑language‑model agents' search and reasoning abilities in realistic financial workflows, featuring 635 expert‑annotated questions across three task types, built with 70 finance experts, and revealing that web‑enabled models with financial plugins markedly outperform API‑only models.

AI Evaluation · FinSearchComp · LLM agents
0 likes · 12 min read
Alimama Tech
Oct 29, 2025 · Artificial Intelligence

LLM Breakthroughs at EMNLP 2025: Embedding Compression, Complex Instructions, Knowledge Scaling

EMNLP 2025 in Suzhou showcases Taobao's booth featuring four cutting‑edge AI papers that introduce a novel embedding compression framework, an automatic iterative refinement method for complex instruction generation, a knowledge infusion scaling law for large language models, and a video caption optimization approach for text‑to‑video generation.

embedding compression · instruction generation · knowledge infusion
0 likes · 7 min read
DataFunTalk
Oct 29, 2025 · Artificial Intelligence

Voice Agents Transform Gaming & Insurance: Real‑World Lessons from Silicon Valley

At a Silicon Valley tech conference, Mu Shen shared how voice agents (real‑time, task‑oriented AI) were applied to an open‑world game as an AI NPC and to a Fortune‑500 insurer as an AI tele‑salesperson, revealing technical challenges, model architectures, training strategies, evaluation methods, and key lessons for future deployments.

Model architecture · game AI · insurance automation
0 likes · 19 min read
Code Mala Tang
Oct 28, 2025 · Artificial Intelligence

Unlocking AI Creativity with Just Eight Words: The Verbalized Sampling Breakthrough

A recent Stanford and West Virginia University study reveals that a simple eight‑word prompt technique, called Verbalized Sampling, can double the creative output of large language models without costly retraining, by exposing hidden diversity suppressed by conventional alignment methods.

AI creativity · LLM sampling techniques · Prompt engineering
0 likes · 9 min read
Ele.me Technology
Oct 27, 2025 · Artificial Intelligence

How IAK Transforms Multi‑Domain Recommendation with Pre‑Training and Fine‑Tuning

This paper introduces IAK, a unified multi‑domain recommendation paradigm that treats the system as a large model, leveraging pre‑training and fine‑tuning with an information‑aware adaptive kernel to capture rapid user interest shifts while reducing training costs and improving online performance.

Recommendation Systems · fine‑tuning · information bottleneck
0 likes · 18 min read
KooFE Frontend Team
Oct 26, 2025 · Artificial Intelligence

Master Zero-Shot Prompting: Advanced Techniques to Boost LLM Performance

Zero-shot prompting lets large language models perform tasks without examples, and by following principles of clarity and structured instructions, advanced strategies such as emotion prompting, zero-shot chain-of-thought, RE2 re-reading, Rephrase-and-Respond, role-play, and System-2 Attention can significantly improve accuracy and response quality across translation, reasoning, and QA tasks.

AI reasoning · LLM · Prompt engineering
0 likes · 13 min read
Data Party THU
Oct 25, 2025 · Artificial Intelligence

How InfLLM‑V2 Delivers Fast, Low‑Cost Sparse Attention for Long‑Context LLMs

InfLLM‑V2 introduces a zero‑parameter, train‑efficient sparse‑attention framework that dramatically speeds up long‑sequence processing while requiring only 5 B tokens for training, and the open‑source MiniCPM4.1 model demonstrates comparable performance to dense attention on both long‑text understanding and deep‑thinking benchmarks.

InfLLM-V2 · MiniCPM4.1 · efficiency
0 likes · 10 min read
Baidu Tech Salon
Oct 24, 2025 · Artificial Intelligence

How Wenxin X1.1 Tops China’s LLMs on the New SuperCLUE-CPIF Benchmark

The newly released SuperCLUE-CPIF benchmark shows Baidu’s Wenxin X1.1 achieving the highest score among Chinese large language models, surpassing competitors like DeepSeek‑V3.2‑Exp‑Thinking and Hunyuan‑T1, with notable advantages in precise instruction following and complex task handling.

AI Evaluation · Wenxin X1.1 · benchmark
0 likes · 4 min read
DataFunTalk
Oct 24, 2025 · Artificial Intelligence

Why OpenAI’s Adult Content Plans Could Reshape AI Performance and Markets

The article examines the prospect of opening AI models to adult content, tracing its historical role as a technology testbed and analyzing market incentives, data‑bias risks, alignment tax, and regulatory hurdles. It suggests that such a move could boost model capabilities while raising ethical and legal challenges.

AI · Market analysis · adult content
0 likes · 12 min read
DataFunTalk
Oct 22, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how Xiaomi’s Xiao AI assistant leverages large language models for intent routing, domain‑specific intent understanding, and response generation, detailing the system architecture, challenges such as knowledge requirements and latency constraints, and the shift from prompt engineering to model fine‑tuning.

AI Assistant · Intent Routing · Prompt engineering
0 likes · 5 min read
Wuming AI
Oct 20, 2025 · Artificial Intelligence

How to Let AI Instantly Draw Professional UML Diagrams with Mermaid

This article walks through using large language models such as Claude, Gemini, DeepSeek, and Kimi to generate accurate, colorful UML diagrams via Mermaid syntax, covering model selection, prompt engineering, step‑by‑step demonstrations, and practical tips for reliable AI‑driven diagram creation.

AI‑generated diagrams · Mermaid · Prompt engineering
0 likes · 5 min read
Alibaba Cloud Infrastructure
Oct 20, 2025 · Artificial Intelligence

How ACK Inference Gateway Tripled Large‑Model Performance for an Insurance Giant

This article details how Guotai Insurance tackled the high latency and cost of large‑model inference by deploying Alibaba Cloud's ACK Inference Gateway, which uses load‑aware, prefix‑aware routing, intelligent queuing, and comprehensive observability to boost efficiency threefold while reducing expenses.

ACK Gateway · AI inference · Cloud Native
0 likes · 18 min read
AntTech
Oct 20, 2025 · Artificial Intelligence

How a Constraint-Aware Multi-Agent System Won the IJCAI Travel Planning Challenge

Leveraging a proprietary “large model + optimization” approach, Alibaba’s Ant Group and East China Normal University built a constraint-aware multi-agent framework that secured first place in the Original OS track and second in the DSL track of the IJCAI-2025 Autonomous Travel Planning Competition.

AI Optimization · IJCAI · Travel Planning
0 likes · 7 min read
Data Thinking Notes
Oct 19, 2025 · Artificial Intelligence

How GSPO Improves Stability in Large Language Model Training

GSPO (Group Sequence Policy Optimization) is a reinforcement‑learning algorithm for LLMs that replaces token‑level GRPO with sequence‑level optimization, addressing instability in ultra‑large model training, especially for long‑sequence and MoE architectures, by aligning reward granularity and reducing variance.

GRPO · GSPO · large language models
0 likes · 11 min read
IT Services Circle
Oct 18, 2025 · Artificial Intelligence

Unlock Multi‑Model AI Collaboration with Zen MCP – A Deep Dive

The Zen MCP open‑source server, now with over 8.6K stars, acts as a bridge that lets Claude Code, Codex CLI, Gemini CLI and other AI tools invoke dozens of large models simultaneously, offering seamless multi‑model cooperation, automatic model selection, conversation continuity, and local execution for privacy‑preserving AI workflows.

AI orchestration · AI tooling · Multi-Model Collaboration
0 likes · 5 min read
Amap Tech
Oct 17, 2025 · Artificial Intelligence

How Ranking Improves In-Context Example Retrieval: Insights from NeurIPS ’25

This article explains the limitations of current pointwise in‑context learning methods, introduces a novel ranking‑based approach called SeDPO that learns preference orders among examples, and demonstrates its superior performance across multiple NLP tasks through extensive experiments and ablation studies.

In-Context Learning · NeurIPS · SeDPO
0 likes · 10 min read
Wuming AI
Oct 16, 2025 · Industry Insights

Top AI Model Releases This Week: NanoChat, Ring‑1T, Qwen3‑VL, Veo 3.1, Claude Haiku 4.5

This week’s AI landscape saw Karpathy open‑source NanoChat, an 8‑K‑line ChatGPT replica; Ant Group unveil the trillion‑parameter Ring‑1T model; Alibaba release the 4B/8B Qwen3‑VL visual language models, which outperform Gemini 2.5 Flash Lite and GPT‑5 Nano; Google launch Veo 3.1 for high‑fidelity video generation; and Anthropic announce Claude Haiku 4.5, a faster and cheaper LLM that excels on SWE‑bench.

AI models · Multimodal · Video Generation
0 likes · 7 min read
Meituan Technology Team
Oct 15, 2025 · Artificial Intelligence

What’s New in Large Model Research? Top Meituan AI Papers Up to Oct 2025

This curated list showcases Meituan’s latest large‑model breakthroughs and academic papers up to October 2025, spanning LLM system optimizations, multimodal generation, evaluation benchmarks, quantization techniques, and reinforcement‑learning‑driven improvements, offering researchers valuable insights and resources across the AI landscape.

AI research · Benchmarking · Multimodal AI
0 likes · 10 min read
Shopee Tech Team
Oct 14, 2025 · Artificial Intelligence

How SPEC‑RL Boosts On‑Policy Reinforcement Learning Speed by Up to 3×

SPEC‑RL introduces speculative rollouts that reuse verified historical rollouts as prefixes, cutting rollout time by 2–3× while maintaining or improving performance across various math and reasoning benchmarks, and works seamlessly with PPO, GRPO, DAPO and other on‑policy algorithms.

AI efficiency · Training Acceleration · large language models
0 likes · 8 min read
Practical DevOps Architecture
Oct 14, 2025 · Artificial Intelligence

Master AI Agents: From Basics to Advanced Multi-Model Development

This comprehensive AI agent development course covers 18 chapters, ranging from fundamental concepts and architecture to large‑model integration, tool and browser control, memory, RAG self‑learning, sandboxing, database manipulation, multi‑agent architectures, code assistance, and a real‑world frontend automation project, complete with source code and documentation.

AI agents · LangChain · RAG
0 likes · 3 min read
DataFunSummit
Oct 13, 2025 · Artificial Intelligence

How Large Language Models Supercharge Douyin’s User Experience

This article explains how Douyin leverages large language models to build an end‑to‑end user‑experience pipeline that detects signals, understands feedback, attributes issues, and automates governance, turning reactive fixes into proactive, data‑driven product improvements.

AI · Signal Processing · User experience
0 likes · 20 min read
Alibaba Cloud Developer
Oct 13, 2025 · Artificial Intelligence

Can AI Cut Taobao Recommendation Development from a Week to Two Days?

This article explains how Alibaba's WaterFlow, an AI‑driven end‑to‑end development platform, tackles the high demand volume, diverse tech stacks, and slow collaboration of Taobao's recommendation feed, enabling many features to be delivered in just two days instead of a week.

AI · continuous integration · large language models
0 likes · 16 min read
Data Party THU
Oct 11, 2025 · Artificial Intelligence

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

This article surveys the most influential large language models released since 2017, detailing the core innovations of Transformer, BERT, the GPT series, T5, Retrieval‑Augmented Generation, and Meta's latest LLaMA models, while highlighting their architectures, training paradigms, and impact on NLP research.

LLM · Model Scaling · large language models
0 likes · 21 min read
Kuaishou Large Model
Oct 11, 2025 · Artificial Intelligence

How Large-Scale Reinforcement Learning Boosted KAT-Dev-72B-Exp to 74.6% on SWE‑Bench

The KwaiPilot team introduced KAT-Dev-72B-Exp, an open‑source LLM trained with large‑scale reinforcement learning that achieved a record‑breaking 74.6% score on SWE‑Bench Verified, thanks to innovations like Trie Packing, entropy‑aware advantage scaling, and a decoupled data‑environment architecture.

KAT-Dev-72B-Exp · Trie Packing · entropy scaling
0 likes · 6 min read
Bilibili Tech
Oct 11, 2025 · Artificial Intelligence

Can Dual-Agent AI Transform Web Video Editing? Inside VibeCut’s Architecture

VibeCut introduces a novel Orchestrator‑Executor dual‑agent framework for WebCut, leveraging large language models, shared structured context, and modular tool integration to automate complex video editing tasks, demonstrating improved efficiency, transparency, and adaptability across diverse scenarios while addressing challenges of multi‑agent coordination.

AI video editing · WebCut · large language models
0 likes · 35 min read
Bighead's Algorithm Notes
Oct 10, 2025 · Artificial Intelligence

Quantitative Finance Paper Digest (Sep 27 – Oct 10 2025)

This digest summarizes recent arXiv papers that introduce new AI‑driven methods for portfolio similarity, Bayesian portfolio optimization, end‑to‑end deep‑learning portfolio construction, large‑language‑model‑based financial prediction, and multi‑agent crypto‑trading systems, highlighting their datasets, architectures, and empirical gains.

Bayesian Optimization · Deep Learning · asset allocation
0 likes · 18 min read
Baidu Tech Salon
Oct 10, 2025 · Artificial Intelligence

Navigating the 2025 AI Model Boom: Practical Evaluation Strategies

This article examines the rapid surge of large AI models in 2024‑2025, critiques the reliability of public leaderboards, and presents a business‑focused evaluation framework—including dataset construction, metric selection, automation, and LLM‑as‑judge techniques—to help developers choose the right model for real‑world applications.

AI Performance · AI benchmarks · LLM-as-judge
0 likes · 17 min read
Data Party THU
Oct 10, 2025 · Artificial Intelligence

Can Language Models Self‑Train Without Data? Inside the Language Self‑Play Framework

This article examines the Language Self‑Play (LSP) approach for data‑free training of large language models, detailing its challenger‑solver game formulation, advantage calculations, loss functions, self‑reward extension, experimental setup on AlpacaEval, and results that show LSP can match or surpass data‑driven baselines.

LLM · data-free training · large language models
0 likes · 14 min read
DataFunTalk
Oct 10, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how large language models are integrated into Xiaomi’s Xiao AI assistant, covering intent distribution, domain‑specific intent understanding, response generation, architectural design, challenges such as knowledge requirements and latency, and the shift from prompt engineering to model fine‑tuning.

AI Assistant · Intent Routing · Prompt engineering
0 likes · 5 min read
Data Party THU
Oct 9, 2025 · Artificial Intelligence

Can One Model Master All Audio‑Visual Tasks? Introducing Crab’s Unified Approach

This article presents Crab, a unified audio‑visual scene understanding model that leverages a novel explicit‑cooperation learning paradigm, introduces the AV‑UIE dataset with explicit reasoning steps, and demonstrates superior performance across temporal, spatial, pixel‑level, and spatio‑temporal tasks through extensive experiments and ablations.

Dataset · LoRA · Multimodal
0 likes · 12 min read
DataFunTalk
Oct 9, 2025 · Artificial Intelligence

From Physics to DeepMind: How a Tsinghua Star Is Shaping AI Research

Google DeepMind hired Shunyu Yao, a Tsinghua physics prodigy and former Anthropic researcher, whose rapid transition from theoretical physics to AI highlights the intense workload, values clash, and the accelerating pace of large‑model research.

AI research · DeepMind · Physics
0 likes · 9 min read
HyperAI Super Neural
Oct 5, 2025 · Artificial Intelligence

Which Sci‑Fi AI Are Already Real? Voice Assistants, Companion Bots, Digital Immortality

The article reviews iconic AI portrayals from movies such as Iron Man, Her, The Wandering Earth 2, Terminator and The Matrix, then compares each vision with today’s voice assistants, large‑language‑model chatbots, companion robots, brain‑computer interfaces and autonomous weapon systems, highlighting what has materialized and what remains speculative.

AI · autonomous weapons · brain-computer interface
0 likes · 15 min read
AI2ML AI to Machine Learning
Oct 1, 2025 · Artificial Intelligence

2025 Large Model Engineering Breakthroughs: Cutting Costs, Boosting Performance, and Extending Context

The 2025 open‑source reports reveal major advances in large‑model engineering, including drastic cost cuts such as DeepSeek‑V3 training for $5.57 M, performance gains where Gemma 3 4B matches Gemma 2 27B, memory efficiencies like 85 % KV‑cache reduction, and a suite of new techniques—from loss‑free MoE balancing to multi‑token prediction—that together push context lengths to one million tokens and enable multimodal, aligned, and industry‑specific models.

Cost reduction · Multimodal AI · attention mechanisms
0 likes · 13 min read
DataFunSummit
Sep 29, 2025 · Artificial Intelligence

How Large Language Models Power XiaoAI: From Intent Routing to Response Generation

This article explores how large language models are integrated into Xiaomi’s XiaoAI assistant, detailing the system’s architecture, intent distribution, domain-specific understanding, and response generation, while sharing practical challenges, prompt engineering solutions, and fine‑tuning strategies that boosted user retention and query satisfaction.

AI assistants · Intent Routing · XiaoAI
0 likes · 4 min read
21CTO
Sep 29, 2025 · Artificial Intelligence

Why Open‑Source Is the Key to China’s AI Future, According to Li Kaifu

Li Kaifu argues that open‑source large‑model ecosystems are essential for China to close the AI gap with the United States, highlighting DeepSeek’s impact, shifting scaling laws, and the emerging role of AI‑to‑AI teaching as the next development frontier.

China AI · artificial intelligence · large language models
0 likes · 4 min read
Volcano Engine Developer Services
Sep 28, 2025 · Artificial Intelligence

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

This guide breaks down the complex terminology of large language models—explaining tokens, transformers, self‑attention, RAG, scaling laws, dense vs. sparse architectures, and training stages—using clear analogies and step‑by‑step explanations so readers can confidently understand and work with modern AI systems.

AI fundamentals · Model Training · RAG
0 likes · 35 min read
HyperAI Super Neural
Sep 26, 2025 · Artificial Intelligence

Nvidia’s ReaSyn Uses Chain‑of‑Reaction Reasoning to Boost Molecule Reconstruction and Path Diversity

ReaSyn, a new framework from Nvidia’s research team, treats synthesis pathways as chain‑of‑thought reasoning using a novel Chain‑of‑Reaction representation, achieving the highest reconstruction rates and path diversity in molecule synthesis tasks, and outperforming prior methods across multiple benchmark optimizations.

AI drug discovery · ReaSyn · chain-of-reaction
0 likes · 14 min read
Instant Consumer Technology Team
Sep 25, 2025 · Artificial Intelligence

Inside Qwen’s Midnight Release: New Guard, Travel Agent, LiveTranslate, Code & Vision Models Unveiled

Late at night on the 23rd, Lin Junyang of Tongyi Lab announced six AI model releases—including a safety‑audit guard, a personal travel planner, a real‑time multilingual translator, upgraded coding models, a powerful vision‑language model, and the flagship Qwen3‑Max—each detailed with capabilities, highlights, and direct download links.

Multimodal AI · Safety · artificial intelligence
0 likes · 11 min read
Data Thinking Notes
Sep 24, 2025 · Artificial Intelligence

How AI Agents Are Transforming Smart Logistics at SF Express

This article explains how SF Express leverages AI agents and large language models to create a full‑process intelligent management framework that optimizes order forecasting, dynamic scheduling, resource allocation, and operational decision‑making across the entire logistics chain.

AI · Intelligent agents · Logistics
0 likes · 21 min read
Fun with Large Models
Sep 24, 2025 · Artificial Intelligence

Interview Guide: Core Differences Between PPO and GRPO Algorithms for Large Model Fine‑Tuning

The article explains the fundamental principles of PPO and GRPO reinforcement‑learning algorithms, compares their architectures and training workflows, highlights why GRPO is gaining traction in large‑model fine‑tuning, discusses associated risks, and offers practical guidance on group size selection for engineers preparing for interviews.

GRPO · PPO · RLHF
0 likes · 9 min read
Data Thinking Notes
Sep 21, 2025 · Artificial Intelligence

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

Amid the rapid rise of generative AI, this article examines the limitations of large language models and explains how Retrieval‑Augmented Generation (RAG), followed by the advanced paradigms DeepSearch and DeepResearch, progressively enhance knowledge handling through dynamic retrieval, multi‑agent reasoning, and autonomous research capabilities.

AI Knowledge Management · DeepResearch · DeepSearch
0 likes · 16 min read
Bighead's Algorithm Notes
Sep 20, 2025 · Artificial Intelligence

Weekly Quantitative Finance Paper Digest (Sep 13‑19, 2025)

This digest summarizes seven recent arXiv papers that apply reinforcement learning, multi‑agent frameworks, dynamic factor models, high‑frequency trading LLMs, quantum GANs, multi‑LLM sentiment analysis, and context‑aware language models to advance quantitative finance and AI‑driven market prediction.

Quantitative Finance · Quantum Machine Learning · large language models
0 likes · 12 min read
DataFunTalk
Sep 19, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, GraphRAG, and autonomous agents boost model intelligence, improve user experience, and enable advanced content generation, understanding, and multi‑step reasoning.

Autonomous Agents · GraphRAG · Retrieval Augmented Generation
0 likes · 4 min read
Data Party THU
Sep 19, 2025 · Artificial Intelligence

How RepoMaster Enables AI Agents to Master GitHub Repositories for Complex Tasks

RepoMaster is an AI‑driven framework that automatically discovers, analyzes, and executes code from massive GitHub repositories, turning them into reusable tools and achieving state‑of‑the‑art performance on challenging benchmarks while drastically reducing token consumption and engineering effort.

AI agents · RepoMaster · Software Engineering
0 likes · 9 min read
Data Party THU
Sep 19, 2025 · Artificial Intelligence

How DeepSeek R1 Redefines AI Reasoning with Pure Reinforcement Learning

DeepSeek R1 replaces traditional supervised fine‑tuning with a pure reinforcement‑learning pipeline, introducing the GRPO algorithm and a four‑stage training regime that dramatically lowers cost, boosts reasoning and code‑generation performance, and raises important ethical, privacy, and societal considerations for large language models.

AI reasoning · DeepSeek · GRPO
0 likes · 14 min read
HyperAI Super Neural
Sep 19, 2025 · Artificial Intelligence

Weekly AI Paper Roundup: RL Advances, Tree‑Structured QA, and GraphRAG Breakthroughs

This article surveys five recent AI papers, covering reinforcement learning for large reasoning models, a tree‑structured table QA framework (ST‑Raptor), visual representation alignment for multimodal LLMs, GraphRAG‑based generation, and an LLM‑driven cryptographic vulnerability detector, each with key insights and links.

cryptographic vulnerability detection · graph retrieval · large language models
0 likes · 5 min read
AsiaInfo Technology: New Tech Exploration
Sep 16, 2025 · Industry Insights

Can Ontology Bridge the Gap Between Large Language Models and Executable Code?

This article analyzes how combining ontology with large language models can create a new intelligent application development paradigm that unites semantic understanding and executable behavior, proposing a three‑layer architecture, a Model Control Protocol, and real‑world case studies to illustrate its potential and challenges.

AI integration · Software Architecture · knowledge representation
0 likes · 22 min read
DataFunTalk
Sep 15, 2025 · Artificial Intelligence

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

In an interview, Di Xingxing of Autohome details their AI+Data framework—unified lake‑warehouse, intelligent engine, and agent services—that breaks data silos, blends traditional models with LLMs, leverages causal inference and RAG knowledge bases, and uses continuous feedback to build explainable, evolving data agents for accurate sales forecasting, competitive analysis, and end‑to‑end business automation in the automotive industry.

AI · RAG · automotive
0 likes · 10 min read
DataFunSummit
Sep 14, 2025 · Artificial Intelligence

How AI is Revolutionizing Chemistry and Drug Discovery: From Data to Breakthroughs

This article explores how AI-driven models and data pipelines are transforming the chemistry and pharmaceutical sectors by accelerating drug design, improving protein‑antibody predictions, automating patent data extraction, and outlining future goals for end‑to‑end AI‑enabled scientific discovery.

AI for Science · Chemistry AI · data mining
0 likes · 13 min read