Tagged articles

Large Language Models

1206 articles · Page 6 of 13

Nov 10, 2025 · Artificial Intelligence

How Tmall’s AI Transforms Test Case Generation for Faster, Smarter QA

This article details the Tmall technology team's AI‑driven testing practice, outlining industry challenges, the need for intelligent test case generation, and a comprehensive strategy that combines prompt engineering, RAG‑based knowledge bases, and platform integration to boost coverage, reduce manual effort, and accelerate release cycles.

AI testing · Knowledge Base · Large Language Models

10 min read

Tencent Technical Engineering

Nov 10, 2025 · Artificial Intelligence

How Large Language Models Evolved in 2025: From DeepSeek to Kimi‑K2 and Beyond

This article maps the rapid evolution of open‑source large language models in 2025, explains the underlying architectural breakthroughs such as MLA, MoE, and NSA, compares dozens of models—including DeepSeek‑V3, OLMo2, Gemma3, Llama4, Qwen3, and Kimi‑K2—and highlights the emergence of powerful AI assistants like Dola, providing developers with a concise technical roadmap.

AI assistant · LLM efficiency · Large Language Models

44 min read

DataFunSummit

Nov 9, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks

This article reviews Kuaishou's two‑year exploration of large‑model techniques in advertising, detailing the challenges of content‑domain ad estimation, the use of multimodal and LLM technologies to harness full‑scope user behavior and external knowledge, and the COPE and LEARN frameworks that delivered measurable business gains.

Advertising · Knowledge Transfer · Large Language Models

6 min read

AntTech

Nov 8, 2025 · Artificial Intelligence

Ant Group’s AntBaiLing Model: Pushing AI Scaling Limits with Trillion‑Parameter Efficiency

Ant Group’s President Luo Ji outlined how the AntBaiLing suite, featuring trillion‑parameter open‑source models, three efficiency breakthroughs, and a domestic compute cluster, is advancing AGI research and inclusive applications, especially in healthcare, while emphasizing ethical, trustworthy AI.

AGI · Large Language Models · Model Efficiency

5 min read

Software Engineering 3.0 Era

Nov 7, 2025 · Artificial Intelligence

Why Context Engineering Is the Key to Effective AI Assistants

The article explains how AI assistants often fall short because of missing or poor context, traces the philosophical roots of context, maps its pervasive role in software engineering, and proposes a three‑level context‑engineering framework to turn context into a production asset for large‑model AI.

AI · Context Engineering · Large Language Models

9 min read

Sohu Tech Products

Nov 5, 2025 · Artificial Intelligence

Do AI Models Really Have Introspective Awareness? Anthropic’s New Findings

Anthropic’s recent study reveals that large language models like Claude Opus 4 exhibit functional introspective awareness, defining rigorous criteria for true introspection and demonstrating through four experiments how models can recognize, report, and even control their internal states, though the capability remains unstable and context‑dependent.

AI · Claude Opus · Concept Injection

15 min read

Zhihu Tech Column

Nov 4, 2025 · Artificial Intelligence

How Multimodal Large Models Transform Recommendation Systems: From Tags to Embeddings

This article explores how multimodal large models like Qwen2.5‑VL enable high‑dimensional tag generation and universal embeddings for recommendation systems, detailing data synthesis, model training, quantization, fine‑tuning, and the resulting improvements in click‑through rate and exposure interaction.

Embedding · Large Language Models · Multimodal AI

17 min read

Alibaba Cloud Big Data AI Platform

Nov 4, 2025 · Artificial Intelligence

How Alibaba Cloud’s PAI Powers Cutting‑Edge LLM Research at EMNLP 2025

EMNLP 2025 in Suzhou will feature Alibaba Cloud’s AI platform PAI presenting four accepted papers on knowledge distillation, small‑model reasoning, distilled reasoning models, and an automated RAG benchmark framework, alongside exhibition demos, networking events, and recruitment opportunities for AI talent.

AI platform · EMNLP 2025 · Knowledge Distillation

10 min read

JD Retail Technology

Nov 4, 2025 · Artificial Intelligence

How AIGC Is Transforming E‑commerce with Personalized Visual Content

This article explains how large‑model AIGC technology reshapes e‑commerce by enabling mass‑produced, user‑profile‑driven visual assets, detailing the evolution from early online trade to the 2.0 era, the technical pipeline of multimodal models, and the practical impact on merchants.

AIGC · E‑Commerce · Large Language Models

17 min read

Baidu Intelligent Cloud Tech Hub

Nov 4, 2025 · Artificial Intelligence

How Baidu’s Baige Accelerates Multimodal Video Training with Context Parallelism

Baidu Baige’s enhanced veRL framework dramatically boosts video frame rates and resolution limits, cuts training time, reduces memory usage, and improves model accuracy by leveraging context parallelism and optimized attention on Ampere GPUs for multimodal mixed‑training scenarios.

AI acceleration · Context Parallelism · Large Language Models

6 min read

DeWu Technology

Nov 3, 2025 · Artificial Intelligence

How Large Language Models Boost Search Relevance: A Real‑World Case Study

This article explains how a leading e‑commerce platform leveraged large language models to overcome traditional search relevance challenges, detailing the iterative workflow, model distillation, performance gains, deployment results, and future directions for smarter, more accurate product search.

AI · E‑Commerce · Large Language Models

10 min read

AI Info Trend

Nov 3, 2025 · Industry Insights

2025 Q3 AI Landscape: Key Players, Model Trends, and Hardware Shifts

Artificial Analysis’s Q3 2025 AI report reveals a rapidly accelerating industry across the entire stack, with US and Chinese labs neck‑and‑neck, fierce competition among OpenAI, Google, Anthropic, xAI, DeepSeek and Alibaba, cost‑efficient models, booming multimodal agents, and a hardware race led by NVIDIA’s Blackwell accelerators.

2025 · AI · Agents

12 min read

DataFunSummit

Nov 1, 2025 · Artificial Intelligence

Large Language Models Revolutionize Legal Document Automation – Alibaba Expert Insights

This article explores how breakthrough large‑model technologies are reshaping legal document automation, covering current challenges, the evolution of intelligent document processing, large‑model applications in core legal scenarios, benchmark results, performance optimizations, and future directions, based on a talk by Alibaba senior algorithm engineer Huang Zhangfeng.

Document Automation · Enterprise Compliance · Large Language Models

18 min read

Tencent Technical Engineering

Oct 31, 2025 · Artificial Intelligence

How SpecExit Cuts LLM Reasoning Chains by 66% and Boosts Inference Speed 2.5×

SpecExit combines speculative sampling with a lightweight draft model to predict early‑exit signals, shortening large‑reasoning model chains by up to two‑thirds and achieving up to 2.5× end‑to‑end inference acceleration on vLLM without sacrificing accuracy.

AI efficiency · Early Stopping · Inference Optimization

12 min read

Bighead's Algorithm Notes

Oct 30, 2025 · Artificial Intelligence

FinSearchComp: ByteDance’s Expert‑Level Financial Search and Reasoning Benchmark for Real‑World Scenarios

FinSearchComp is the first fully open‑source benchmark that evaluates large‑language‑model agents' search and reasoning abilities in realistic financial workflows, featuring 635 expert‑annotated questions across three task types, built with 70 finance experts, and revealing that web‑enabled models with financial plugins markedly outperform API‑only models.

AI evaluation · FinSearchComp · LLM Agents

12 min read

Alimama Tech

Oct 29, 2025 · Artificial Intelligence

LLM Breakthroughs at EMNLP 2025: Embedding Compression, Complex Instructions, Knowledge Scaling

EMNLP 2025 in Suzhou showcases Taobao's booth featuring four cutting‑edge AI papers that introduce a novel embedding compression framework, an automatic iterative refinement method for complex instruction generation, a knowledge infusion scaling law for large language models, and a video caption optimization approach for text‑to‑video generation.

Large Language Models · embedding compression · instruction generation

7 min read

DataFunTalk

Oct 29, 2025 · Artificial Intelligence

Voice Agents Transform Gaming & Insurance: Real‑World Lessons from Silicon Valley

At a Silicon Valley tech conference, Mu Shen shared how voice agents—real‑time, task‑oriented AI—were applied to an open‑world game as an AI NPC and to a Fortune‑500 insurer as an AI tele‑salesperson, revealing technical challenges, model architectures, training strategies, evaluation methods, and key lessons for future deployments.

Large Language Models · Real-time AI · game AI

19 min read

Code Mala Tang

Oct 28, 2025 · Artificial Intelligence

Unlocking AI Creativity with Just Eight Words: The Verbalized Sampling Breakthrough

A recent Stanford and West Virginia University study reveals that a simple eight‑word prompt technique, called Verbalized Sampling, can double the creative output of large language models without costly retraining, by exposing hidden diversity suppressed by conventional alignment methods.

AI creativity · LLM sampling techniques · Large Language Models

9 min read

Ele.me Technology

Oct 27, 2025 · Artificial Intelligence

How IAK Transforms Multi‑Domain Recommendation with Pre‑Training and Fine‑Tuning

This paper introduces IAK, a unified multi‑domain recommendation paradigm that treats the system as a large model, leveraging pre‑training and fine‑tuning with an information‑aware adaptive kernel to capture rapid user interest shifts while reducing training costs and improving online performance.

Large Language Models · Recommendation Systems · fine‑tuning

18 min read

KooFE Frontend Team

Oct 26, 2025 · Artificial Intelligence

Master Zero-Shot Prompting: Advanced Techniques to Boost LLM Performance

Zero-shot prompting lets large language models perform tasks without examples, and by following principles of clarity and structured instructions, advanced strategies such as emotion prompting, zero-shot chain-of-thought, RE2 re-reading, Rephrase-and-Respond, role-play, and System-2 Attention can significantly improve accuracy and response quality across translation, reasoning, and QA tasks.

AI reasoning · LLM · Large Language Models

13 min read

Data Party THU

Oct 25, 2025 · Artificial Intelligence

How InfLLM‑V2 Delivers Fast, Low‑Cost Sparse Attention for Long‑Context LLMs

InfLLM‑V2 introduces a zero‑parameter, train‑efficient sparse‑attention framework that dramatically speeds up long‑sequence processing while requiring only 5 B tokens for training, and the open‑source MiniCPM4.1 model demonstrates comparable performance to dense attention on both long‑text understanding and deep‑thinking benchmarks.

InfLLM-V2 · Large Language Models · MiniCPM4.1

10 min read

Baidu Tech Salon

Oct 24, 2025 · Artificial Intelligence

How Wenxin X1.1 Tops China’s LLMs on the New SuperCLUE-CPIF Benchmark

The recently released SuperCLUE-CPIF benchmark shows Baidu’s Wenxin X1.1 achieving the highest score among Chinese large language models, surpassing competitors like DeepSeek‑V3.2‑Exp‑Thinking and Hunyuan‑T1, with notable advantages in precise instruction following and complex task handling.

AI evaluation · Large Language Models · Wenxin X1.1

4 min read

DataFunTalk

Oct 24, 2025 · Artificial Intelligence

Why OpenAI’s Adult Content Plans Could Reshape AI Performance and Markets

The article examines how opening AI models to adult content—tracing its historical role as a technology testbed, analyzing market incentives, data‑bias risks, alignment tax, and regulatory hurdles—suggests that such a move could boost model capabilities while raising ethical and legal challenges.

AI · Large Language Models · adult content

12 min read

DataFunSummit

Oct 22, 2025 · Artificial Intelligence

When Claude Leaves China: How Domestic AI Models Are Rising to Fill the Gap

Anthropic's ban on Claude for Chinese‑owned firms forces developers to seek home‑grown alternatives, prompting a deep dive into Claude's strengths, the rapid growth of Chinese AI models, and the gaps that still separate them from the international benchmark.

AI models · Chinese AI · Claude

10 min read

DataFunTalk

Oct 22, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how Xiaomi’s Xiao AI assistant leverages large language models for intent routing, domain‑specific intent understanding, and response generation, detailing the system architecture, challenges such as knowledge requirements and latency constraints, and the shift from prompt engineering to model fine‑tuning.

AI assistant · Intent Routing · Large Language Models

5 min read

Wuming AI

Oct 20, 2025 · Artificial Intelligence

How to Let AI Instantly Draw Professional UML Diagrams with Mermaid

This article walks through using large language models such as Claude, Gemini, DeepSeek, and Kimi to generate accurate, colorful UML diagrams via Mermaid syntax, covering model selection, prompt engineering, step‑by‑step demonstrations, and practical tips for reliable AI‑driven diagram creation.

AI‑generated diagrams · Large Language Models · Mermaid

5 min read

Alibaba Cloud Infrastructure

Oct 20, 2025 · Artificial Intelligence

How ACK Inference Gateway Tripled Large‑Model Performance for an Insurance Giant

This article details how Guotai Insurance tackled the high latency and cost of large‑model inference by deploying Alibaba Cloud's ACK Inference Gateway, which uses load‑aware, prefix‑aware routing, intelligent queuing, and comprehensive observability to boost efficiency threefold while reducing expenses.

ACK Gateway · AI inference · Large Language Models

18 min read

AntTech

Oct 20, 2025 · Artificial Intelligence

How a Constraint-Aware Multi-Agent System Won the IJCAI Travel Planning Challenge

Leveraging a proprietary “large model + optimization” approach, Alibaba’s Ant Group and East China Normal University built a constraint-aware multi-agent framework that secured first place in the Original OS track and second in the DSL track of the IJCAI-2025 Autonomous Travel Planning Competition.

IJCAI · Large Language Models · Multi-Agent Systems

7 min read

Data Thinking Notes

Oct 19, 2025 · Artificial Intelligence

How GSPO Improves Stability in Large Language Model Training

GSPO (Group Sequence Policy Optimization) is a reinforcement‑learning algorithm for LLMs that replaces token‑level GRPO with sequence‑level optimization, addressing instability in ultra‑large model training, especially for long‑sequence and MoE architectures, by aligning reward granularity and reducing variance.

GRPO · GSPO · Large Language Models

11 min read

IT Services Circle

Oct 18, 2025 · Artificial Intelligence

Unlock Multi‑Model AI Collaboration with Zen MCP – A Deep Dive

The Zen MCP open‑source server, now with over 8.6K stars, acts as a bridge that lets Claude Code, Codex CLI, Gemini CLI and other AI tools invoke dozens of large models simultaneously, offering seamless multi‑model cooperation, automatic model selection, conversation continuity, and local execution for privacy‑preserving AI workflows.

AI Tooling · AI orchestration · Large Language Models

5 min read

Amap Tech

Oct 17, 2025 · Artificial Intelligence

How Ranking Improves In-Context Example Retrieval: Insights from NeurIPS ’25

This article explains the limitations of current pointwise in‑context learning methods, introduces a novel ranking‑based approach called SeDPO that learns preference orders among examples, and demonstrates its superior performance across multiple NLP tasks through extensive experiments and ablation studies.

In-Context Learning · Large Language Models · NeurIPS

10 min read

Wuming AI

Oct 16, 2025 · Industry Insights

Top AI Model Releases This Week: NanoChat, Ring‑1T, Qwen3‑VL, Veo 3.1, Claude Haiku 4.5

This week’s AI landscape saw Karpathy’s NanoChat open‑sourcing an 8‑K‑line ChatGPT replica, Ant Group unveiling a trillion‑parameter Ring‑1T model, Alibaba releasing the 4B/8B Qwen3‑VL visual language models that outperform Gemini 2.5 Flash Lite and GPT‑5 Nano, Google launching Veo 3.1 for high‑fidelity video generation, and Anthropic announcing Claude Haiku 4.5, a faster and cheaper LLM that excels on SWE‑bench benchmarks.

AI models · Large Language Models · multimodal

7 min read

Meituan Technology Team

Oct 15, 2025 · Artificial Intelligence

What’s New in Large Model Research? Top Meituan AI Papers Up to Oct 2025

This curated list showcases Meituan’s latest large‑model breakthroughs and academic papers up to October 2025, spanning LLM system optimizations, multimodal generation, evaluation benchmarks, quantization techniques, and reinforcement‑learning‑driven improvements, offering researchers valuable insights and resources across the AI landscape.

AI research · Benchmarking · Large Language Models

10 min read

Shopee Tech Team

Oct 14, 2025 · Artificial Intelligence

How SPEC‑RL Boosts On‑Policy Reinforcement Learning Speed by Up to 3×

SPEC‑RL introduces speculative rollouts that reuse verified historical rollouts as prefixes, cutting rollout time by 2–3× while maintaining or improving performance across various math and reasoning benchmarks, and works seamlessly with PPO, GRPO, DAPO and other on‑policy algorithms.

AI efficiency · Large Language Models · Training Acceleration

8 min read

HyperAI Super Neural

Oct 14, 2025 · Artificial Intelligence

NeurIPS 2025: OCRBench v2 Shows Gemini Leads Chinese OCR Ranking Yet Scores Only Pass

OCRBench v2, introduced at NeurIPS 2025, evaluates 58 multimodal models on 23 OCR‑related tasks in Chinese and English, revealing that even top models like Gemini‑2.5‑Pro barely exceed the passing threshold and that most models struggle with fine‑grained text localization and multilingual performance.

Evaluation · Gemini · Large Language Models

8 min read

Practical DevOps Architecture

Oct 14, 2025 · Artificial Intelligence

Master AI Agents: From Basics to Advanced Multi-Model Development

This comprehensive AI agent development course covers 18 chapters, ranging from fundamental concepts and architecture to large‑model integration, tool and browser control, memory, RAG self‑learning, sandboxing, database manipulation, multi‑agent architectures, code assistance, and a real‑world frontend automation project, complete with source code and documentation.

AI agents · LangChain · Large Language Models

3 min read

DataFunSummit

Oct 13, 2025 · Artificial Intelligence

How Large Language Models Supercharge Douyin’s User Experience

This article explains how Douyin leverages large language models to build an end‑to‑end user‑experience pipeline that detects signals, understands feedback, attributes issues, and automates governance, turning reactive fixes into proactive, data‑driven product improvements.

AI · Large Language Models · Signal Processing

20 min read

Alibaba Cloud Developer

Oct 13, 2025 · Artificial Intelligence

Can AI Cut Taobao Recommendation Development from a Week to Two Days?

This article explains how Alibaba's WaterFlow, an AI‑driven end‑to‑end development platform, tackles the high demand volume, diverse tech stacks, and slow collaboration of Taobao's recommendation feed, enabling many features to be delivered in just two days instead of a week.

AI · Continuous Integration · Large Language Models

16 min read

Data Party THU

Oct 11, 2025 · Artificial Intelligence

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

This article surveys the most influential large language models released since 2017, detailing the core innovations of Transformer, BERT, GPT series, T5, Retrieval‑Augmented Generation, and the latest LLaMA and Meta models, while highlighting their architectures, training paradigms, and impact on NLP research.

LLM · Large Language Models · Model Scaling

21 min read

Kuaishou Large Model

Oct 11, 2025 · Artificial Intelligence

How Large-Scale Reinforcement Learning Boosted KAT-Dev-72B-Exp to 74.6% on SWE‑Bench

The KwaiPilot team introduced KAT-Dev-72B-Exp, an open‑source LLM trained with large‑scale reinforcement learning that achieved a record‑breaking 74.6% score on SWE‑Bench Verified, thanks to innovations like Trie Packing, entropy‑aware advantage scaling, and a decoupled data‑environment architecture.

KAT-Dev-72B-Exp · Large Language Models · Trie Packing

6 min read

Bilibili Tech

Oct 11, 2025 · Artificial Intelligence

Can Dual-Agent AI Transform Web Video Editing? Inside VibeCut’s Architecture

VibeCut introduces a novel Orchestrator‑Executor dual‑agent framework for WebCut, leveraging large language models, shared structured context, and modular tool integration to automate complex video editing tasks, demonstrating improved efficiency, transparency, and adaptability across diverse scenarios while addressing challenges of multi‑agent coordination.

AI video editing · Large Language Models · WebCut

35 min read

Bighead's Algorithm Notes

Oct 10, 2025 · Artificial Intelligence

Quantitative Finance Paper Digest (Sep 27 – Oct 10 2025)

This digest summarizes recent arXiv papers that introduce new AI‑driven methods for portfolio similarity, Bayesian portfolio optimization, end‑to‑end deep‑learning portfolio construction, large‑language‑model‑based financial prediction, and multi‑agent crypto‑trading systems, highlighting their datasets, architectures, and empirical gains.

Bayesian Optimization · Large Language Models · Multi-Agent Systems

18 min read

Baidu Tech Salon

Oct 10, 2025 · Artificial Intelligence

Navigating the 2025 AI Model Boom: Practical Evaluation Strategies

This article examines the rapid surge of large AI models in 2024‑2025, critiques the reliability of public leaderboards, and presents a business‑focused evaluation framework—including dataset construction, metric selection, automation, and LLM‑as‑judge techniques—to help developers choose the right model for real‑world applications.

AI benchmarks · AI performance · Dataset Construction

17 min read

Data Party THU

Oct 10, 2025 · Artificial Intelligence

Can Language Models Self‑Train Without Data? Inside the Language Self‑Play Framework

This article examines the Language Self‑Play (LSP) approach for data‑free training of large language models, detailing its challenger‑solver game formulation, advantage calculations, loss functions, self‑reward extension, experimental setup on AlpacaEval, and results that show LSP can match or surpass data‑driven baselines.

LLM · Large Language Models · data-free training

14 min read

DataFunTalk

Oct 10, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how large language models are integrated into Xiaomi’s Xiao AI assistant, covering intent distribution, domain‑specific intent understanding, response generation, architectural design, challenges such as knowledge requirements and latency, and the shift from prompt engineering to model fine‑tuning.

AI assistant · Intent Routing · Large Language Models

5 min read

Data Party THU

Oct 9, 2025 · Artificial Intelligence

How Reinforcement Learning Is Transforming the Full Lifecycle of Large Language Models

This survey systematically reviews recent advances in applying reinforcement learning across the entire lifecycle of large language models, detailing methods, datasets, benchmarks, open‑source tools, and future challenges such as scalability, reward design, and evaluation standards.

AI Survey · LLM lifecycle · Large Language Models

9 min read

Data Party THU

Oct 9, 2025 · Artificial Intelligence

Can One Model Master All Audio‑Visual Tasks? Introducing Crab’s Unified Approach

This article presents Crab, a unified audio‑visual scene understanding model that leverages a novel explicit‑cooperation learning paradigm, introduces the AV‑UIE dataset with explicit reasoning steps, and demonstrates superior performance across temporal, spatial, pixel‑level, and spatio‑temporal tasks through extensive experiments and ablations.

Audio-Visual · Large Language Models · LoRA

12 min read

DataFunTalk

Oct 9, 2025 · Artificial Intelligence

From Physics to DeepMind: How a Tsinghua Star Is Shaping AI Research

Google DeepMind hired Shunyu Yao, a Tsinghua physics prodigy and former Anthropic researcher, whose rapid transition from theoretical physics to AI highlights the intense workload, clashes over values, and accelerating pace of large‑model research.

AI research · DeepMind · Large Language Models

9 min read

HyperAI Super Neural

Oct 5, 2025 · Artificial Intelligence

Which Sci‑Fi AI Are Already Real? Voice Assistants, Companion Bots, Digital Immortality

The article reviews iconic AI portrayals from movies such as Iron Man, Her, The Wandering Earth 2, Terminator and The Matrix, then compares each vision with today’s voice assistants, large‑language‑model chatbots, companion robots, brain‑computer interfaces and autonomous weapon systems, highlighting what has materialized and what remains speculative.

AI · Large Language Models · autonomous weapons

15 min read

AI2ML AI to Machine Learning

Oct 1, 2025 · Artificial Intelligence

2025 Large Model Engineering Breakthroughs: Cutting Costs, Boosting Performance, and Extending Context

The 2025 open‑source reports reveal major advances in large‑model engineering, including drastic cost cuts such as DeepSeek‑V3 training for $5.57 M, performance gains where Gemma 3 4B matches Gemma 2 27B, memory efficiencies like 85 % KV‑cache reduction, and a suite of new techniques—from loss‑free MoE balancing to multi‑token prediction—that together push context lengths to one million tokens and enable multimodal, aligned, and industry‑specific models.

Large Language Models · Memory Efficiency · Multimodal AI

13 min read

DataFunSummit

Sep 29, 2025 · Artificial Intelligence

How Large Language Models Power XiaoAI: From Intent Routing to Response Generation

This article explores how large language models are integrated into Xiaomi’s XiaoAI assistant, detailing the system’s architecture, intent distribution, domain-specific understanding, and response generation, while sharing practical challenges, prompt engineering solutions, and fine‑tuning strategies that boosted user retention and query satisfaction.

AI assistants · Intent Routing · Large Language Models

4 min read

21CTO

Sep 29, 2025 · Artificial Intelligence

Why Open‑Source Is the Key to China’s AI Future, According to Li Kaifu

Li Kaifu argues that open‑source large‑model ecosystems are essential for China to close the AI gap with the United States, highlighting DeepSeek’s impact, shifting scaling laws, and the emerging role of AI‑to‑AI teaching as the next development frontier.

Artificial Intelligence · China AI · Large Language Models

4 min read

Software Engineering 3.0 Era

Sep 28, 2025 · Artificial Intelligence

Why Large Language Models Appear So Smart: The Science of Emergence

The article explains how massive language models achieve seemingly intelligent behavior through emergence at a critical scale, hierarchical planning, attention-driven global coherence, multimodal understanding, and progressive training techniques that turn simple token prediction into sophisticated reasoning and creativity.

Attention Mechanism · Large Language Models · Multimodal AI

15 min read

Volcano Engine Developer Services

Sep 28, 2025 · Artificial Intelligence

Demystifying AI Jargon: A Beginner’s Guide to Large Language Models

This guide breaks down the complex terminology of large language models—explaining tokens, transformers, self‑attention, RAG, scaling laws, dense vs. sparse architectures, and training stages—using clear analogies and step‑by‑step explanations so readers can confidently understand and work with modern AI systems.

AI Fundamentals · Large Language Models · Model Training

0 likes · 35 min read

DataFunSummit

Sep 26, 2025 · Artificial Intelligence

How Large Language Models are Transforming Recommendation Systems: Insights from Huawei

This article reviews Huawei Noah's Ark Lab's exploration of large language models in recommendation systems, covering background challenges, the KAR and Uni-CTR projects, experimental results, and future research directions for open, knowledge‑driven recommendation pipelines.

AI research · Huawei · Large Language Models

0 likes · 13 min read

HyperAI Super Neural

Sep 26, 2025 · Artificial Intelligence

Nvidia’s ReaSyn Uses Chain‑of‑Reaction Reasoning to Boost Molecule Reconstruction and Path Diversity

ReaSyn, a new framework from Nvidia’s research team, treats synthesis pathways as chain‑of‑thought reasoning using a novel Chain‑of‑Reaction representation, achieving the highest reconstruction rates and path diversity in molecule synthesis tasks, and outperforming prior methods across multiple benchmark optimizations.

AI drug discovery · Large Language Models · ReaSyn

0 likes · 14 min read

Instant Consumer Technology Team

Sep 25, 2025 · Artificial Intelligence

Inside Qwen’s Midnight Release: New Guard, Travel Agent, LiveTranslate, Code & Vision Models Unveiled

Late at night on the 23rd, Lin Junyang of Tongyi Lab announced six AI model releases—including a safety‑audit guard, a personal travel planner, a real‑time multilingual translator, upgraded coding models, a powerful vision‑language model, and the flagship Qwen3‑Max—each detailed with capabilities, highlights, and direct download links.

Artificial Intelligence · Large Language Models · Multimodal AI

0 likes · 11 min read

Data Thinking Notes

Sep 24, 2025 · Artificial Intelligence

How AI Agents Are Transforming Smart Logistics at SF Express

This article explains how SF Express leverages AI agents and large language models to create a full‑process intelligent management framework that optimizes order forecasting, dynamic scheduling, resource allocation, and operational decision‑making across the entire logistics chain.

AI · Intelligent agents · Large Language Models

0 likes · 21 min read

Fun with Large Models

Sep 24, 2025 · Artificial Intelligence

Interview Guide: Core Differences Between PPO and GRPO Algorithms for Large Model Fine‑Tuning

The article explains the fundamental principles of PPO and GRPO reinforcement‑learning algorithms, compares their architectures and training workflows, highlights why GRPO is gaining traction in large‑model fine‑tuning, discusses associated risks, and offers practical guidance on group size selection for engineers preparing for interviews.

GRPO · Large Language Models · PPO

0 likes · 9 min read

Data Thinking Notes

Sep 21, 2025 · Artificial Intelligence

From RAG to DeepSearch & DeepResearch: How AI Is Mastering Knowledge Retrieval

Amid the rapid rise of generative AI, this article examines the limitations of large language models and explains how Retrieval‑Augmented Generation (RAG), followed by the advanced paradigms DeepSearch and DeepResearch, progressively enhance knowledge handling through dynamic retrieval, multi‑agent reasoning, and autonomous research capabilities.

AI Knowledge Management · DeepResearch · DeepSearch

0 likes · 16 min read

Bighead's Algorithm Notes

Sep 20, 2025 · Artificial Intelligence

Weekly Quantitative Finance Paper Digest (Sep 13‑19, 2025)

This digest summarizes seven recent arXiv papers that apply reinforcement learning, multi‑agent frameworks, dynamic factor models, high‑frequency trading LLMs, quantum GANs, multi‑LLM sentiment analysis, and context‑aware language models to advance quantitative finance and AI‑driven market prediction.

Large Language Models · Multi-Agent Systems · Quantum Machine Learning

0 likes · 12 min read

DataFunTalk

Sep 19, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Transform Business with RAG, GraphRAG, and Agents

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, GraphRAG, and autonomous agents boost model intelligence, improve user experience, and enable advanced content generation, understanding, and multi‑step reasoning.

Artificial Intelligence · Autonomous Agents · GraphRAG

0 likes · 4 min read

Data Party THU

Sep 19, 2025 · Artificial Intelligence

How RepoMaster Enables AI Agents to Master GitHub Repositories for Complex Tasks

RepoMaster is an AI‑driven framework that automatically discovers, analyzes, and executes code from massive GitHub repositories, turning them into reusable tools and achieving state‑of‑the‑art performance on challenging benchmarks while drastically reducing token consumption and engineering effort.

AI agents · Large Language Models · RepoMaster

0 likes · 9 min read

Data Party THU

Sep 19, 2025 · Artificial Intelligence

How DeepSeek R1 Redefines AI Reasoning with Pure Reinforcement Learning

DeepSeek R1 replaces traditional supervised fine‑tuning with a pure reinforcement‑learning pipeline, introducing the GRPO algorithm and a four‑stage training regime that dramatically lowers cost, boosts reasoning and code‑generation performance, and raises important ethical, privacy, and societal considerations for large language models.

AI reasoning · DeepSeek · GRPO

0 likes · 14 min read

HyperAI Super Neural

Sep 19, 2025 · Artificial Intelligence

Weekly AI Paper Roundup: RL Advances, Tree‑Structured QA, and GraphRAG Breakthroughs

This article surveys five recent AI papers, covering reinforcement learning for large reasoning models, a tree‑structured table QA framework (ST‑Raptor), visual representation alignment for multimodal LLMs, GraphRAG‑based generation, and an LLM‑driven cryptographic vulnerability detector, each with key insights and links.

Large Language Models · cryptographic vulnerability detection · graph retrieval

0 likes · 5 min read

AsiaInfo Technology: New Tech Exploration

Sep 16, 2025 · Industry Insights

Can Ontology Bridge the Gap Between Large Language Models and Executable Code?

This article analyzes how combining ontology with large language models can create a new intelligent application development paradigm that unites semantic understanding and executable behavior, proposing a three‑layer architecture, a Model Control Protocol, and real‑world case studies to illustrate its potential and challenges.

AI integration · Large Language Models · Software Architecture

0 likes · 22 min read

DataFunTalk

Sep 15, 2025 · Artificial Intelligence

How AI+Data Agents Are Transforming the Automotive Industry’s Digital Leap

In an interview, Di Xingxing of Autohome details their AI+Data framework—unified lake‑warehouse, intelligent engine, and agent services—that breaks data silos, blends traditional models with LLMs, leverages causal inference and RAG knowledge bases, and uses continuous feedback to build explainable, evolving data agents for accurate sales forecasting, competitive analysis, and end‑to‑end business automation in the automotive industry.

AI · Automotive · Data Engineering

0 likes · 10 min read

DataFunSummit

Sep 14, 2025 · Artificial Intelligence

How AI is Revolutionizing Chemistry and Drug Discovery: From Data to Breakthroughs

This article explores how AI-driven models and data pipelines are transforming the chemistry and pharmaceutical sectors by accelerating drug design, improving protein‑antibody predictions, automating patent data extraction, and outlining future goals for end‑to‑end AI‑enabled scientific discovery.

AI for Science · Chemistry AI · Large Language Models

0 likes · 13 min read

Alibaba Cloud Developer

Sep 12, 2025 · Operations

How to Build End‑to‑End Observability for Large‑Model Applications on Alibaba Cloud

This guide explains how to design and implement a complete observability solution for large‑model AI services on Alibaba Cloud, covering architecture, core metrics, logging standards, demo code, log collection, dashboard design, alerting, monitoring tools, troubleshooting SOPs, and recovery procedures.

AI Operations · Alibaba Cloud · Cloud Monitoring

0 likes · 21 min read

Fun with Large Models

Sep 12, 2025 · Artificial Intelligence

When to Choose Model Fine‑Tuning vs RAG for Large‑Model Engineering Interviews

The article explains the technical background and suitable scenarios for Retrieval‑Augmented Generation (RAG) and model fine‑tuning, compares their strengths, discusses how they can be combined, and provides interview‑style Q&A on their capabilities, risks, and differences from model distillation.

AI interview · Fine‑Tuning · Large Language Models

0 likes · 7 min read

AI2ML AI to Machine Learning

Sep 11, 2025 · Industry Insights

Key Takeaways from Asset Management Leaders on Large‑Model AI at the Bund Conference

The article compiles senior asset‑management executives' perspectives on applying large‑model AI—covering vertical versus generic models, integration strategies, talent and cost considerations, innovative C2C development, AI‑native platforms, and the practical challenges of using LLMs in investment research.

AI Applications · Asset Management · C2C development

0 likes · 5 min read

Baidu Geek Talk

Sep 10, 2025 · Artificial Intelligence

How to Cut Through the LLM SOTA Hype: Practical Evaluation Strategies for 2025

Amid the 2025 surge of large language models, this article demystifies misleading SOTA claims, critiques benchmark reliability, and presents a comprehensive, business‑focused evaluation framework—including dataset construction, metric selection, automated scoring, and practical guidelines—to help developers and product teams choose the right model for real‑world applications.

AI benchmarking · LLM-as-Judge · Large Language Models

0 likes · 18 min read

Baobao Algorithm Notes

Sep 10, 2025 · Artificial Intelligence

Qwen3-Next Unveiled: Sparse MoE, Hybrid Attention & Multi‑Token Prediction

A recent Hugging Face pull request reveals Alibaba’s upcoming Qwen3‑Next series, highlighting its extreme‑context, parameter‑efficient design that combines a 1:50 high‑sparsity MoE, a hybrid attention architecture mixing gated attention with Gated DeltaNet, and a Multi‑Token Prediction technique, promising ten‑fold throughput gains for 32K‑plus token contexts.

AI Architecture · Hybrid Attention · Large Language Models

0 likes · 8 min read

DataFunSummit

Sep 9, 2025 · Artificial Intelligence

How Baidu’s GRAB Model Uses Scaling Laws to Transform Ad Ranking

This article explains Baidu's generative ranking model GRAB, detailing how scaling laws from large language models inspire a new recommendation paradigm, the model's architecture, custom attention mechanisms, training strategies, deployment optimizations, and the resulting business gains in CTR and revenue.

Baidu · CTR Prediction · Generative AI

0 likes · 22 min read

JD Cloud Developers

Sep 9, 2025 · Artificial Intelligence

How JD’s PODM‑MI Framework Revolutionized E‑commerce Search Ranking

This article recounts a JD engineer’s journey from theory to practice, detailing the development of the PODM‑MI re‑ranking framework, its three‑layer distribution‑based design, the discovery of a novel SID bottleneck, and the resulting multi‑million‑order impact validated at SIGIR 2024.

Large Language Models · Re‑ranking · SIGIR

0 likes · 8 min read

DataFunSummit

Sep 8, 2025 · Artificial Intelligence

How High‑Quality Inference Data Is Powering the Next AI Revolution

This article explores how high‑quality inference data has become a new paradigm driving AI breakthroughs, detailing Ant Group's research on inference data paradigms, financial‑sector applications, intelligent labeling and quality inspection, and the AIGD AI data synthesis platform, followed by a technical Q&A.

AI data · AIGD · Data Synthesis

0 likes · 11 min read

DaTaobao Tech

Sep 8, 2025 · Artificial Intelligence

How to Make Large Language Models Understand Third‑Party Java Packages: From Failure to Success

This article explains why AI coding assistants like Cursor and Claude fail to read external Java libraries, explores naive "feed‑the‑code" tricks, evaluates built‑in IDE tools, and ultimately presents a robust solution using a local decompilation pipeline (MCP) that lets LLMs query class definitions and generate correct backend code.

AI code generation · Java decompilation · Large Language Models

0 likes · 19 min read

DataFunTalk

Sep 8, 2025 · Artificial Intelligence

When Claude Leaves China: How Domestic AI Models Are Rising to Fill the Gap

Anthropic's new ban on Claude for Chinese‑controlled firms forces developers to seek home‑grown alternatives, prompting a deep dive into Claude's strengths, the rapid rise of Chinese large‑language models, and the gaps that still separate them from the world‑leading offering.

AI models · AI safety · Chinese AI

0 likes · 11 min read

Bighead's Algorithm Notes

Sep 5, 2025 · Artificial Intelligence

Weekly Quantitative Finance Paper Digest (Aug 30 – Sep 5, 2025)

This digest reviews four recent AI‑driven finance papers: a robust MCVaR portfolio optimizer with ellipsoidal support and RKHS uncertainty, a PPO‑based adaptive weighting system for LLM‑generated alphas, an empirical comparison of price‑based, GICS‑based, and LLM‑embedding stock clustering, and a diffusion‑model approach that generates future financial chart images from current charts and text prompts.

Large Language Models · diffusion models · portfolio optimization

0 likes · 9 min read

ShiZhen AI

Sep 5, 2025 · Artificial Intelligence

Andrew Ng Highlights Core AI Engineer Skills Amidst Major AI Industry Updates

The article reports that ChatGPT now supports branch conversations, Anthropic restricts service use in certain regions, Andrew Ng outlines essential AI engineer capabilities such as AI‑assisted software building, prompting and agentic workflows, and highlights the market demand, while also covering the Kimi K2 model upgrade, Hugging Face’s FineVision dataset release, and Google’s AI‑driven Deep Loop Shaping method published in *Science*.

AI engineering · AI for astronomy · AI safety

0 likes · 8 min read

Instant Consumer Technology Team

Sep 5, 2025 · Artificial Intelligence

Why Context Engineering Is the Next Frontier for Large Language Models

This article surveys over 1,400 papers to define context engineering as a systematic discipline that structures retrieval, memory, tools, and multi‑agent coordination for LLMs, highlighting the critical asymmetry between understanding long contexts and generating equally complex outputs.

Context Engineering · LLM evaluation · Large Language Models

0 likes · 8 min read

DataFunSummit

Sep 4, 2025 · Artificial Intelligence

Unlocking Multi‑Agent AI: How Ant Group’s agentUniverse Transforms Financial Services

The article explores Ant Group’s agentUniverse team’s experience applying multi‑agent technology in finance, covering background on large language models, the agentUniverse framework, real‑world implementations, and the advantages of coordinated multi‑agent collaboration for complex analytical and decision‑making tasks.

AI collaboration · Large Language Models · Multi-agent

0 likes · 4 min read

Amap Tech

Sep 4, 2025 · Artificial Intelligence

How Hierarchical Sampling Boosts Self‑Taught Reasoning in LLMs

HS‑STAR introduces a three‑stage hierarchical sampling framework that identifies high‑utility boundary problems, reallocates computation budget to them, and fine‑tunes large language models, achieving significant accuracy gains on math reasoning benchmarks without extra sampling cost.

HS-STAR · Hierarchical Sampling · Large Language Models

0 likes · 10 min read

Data Party THU

Sep 3, 2025 · Artificial Intelligence

Exploring Multimodal Generative AI: A Tsinghua Tutorial at IJCAI 2025

This article introduces a 1.5‑hour tutorial presented by Tsinghua researchers at IJCAI 2025, covering the latest advances in multimodal generative AI, including multimodal large language models, diffusion models, post‑training generalization techniques, and unified understanding‑generation frameworks.

IJCAI 2025 · Large Language Models · Multimodal AI

0 likes · 5 min read

AI2ML AI to Machine Learning

Sep 2, 2025 · Artificial Intelligence

Why Enterprise Large‑Model Digitalization Is So Hard: Key Challenges and Capabilities

The article analyzes why enterprise‑wide large‑model AI projects face steep hurdles, outlining required human capabilities, historical labor shifts, current hot technologies such as RAG, Agent, CoT and multimodal, their limits, a three‑stage implementation roadmap, typical case pitfalls, and the key success factors for sustainable digital transformation.

Agent · CoT · Enterprise AI

0 likes · 15 min read

Amap Tech

Sep 2, 2025 · Artificial Intelligence

How Pos2Distill Eliminates Positional Bias in Large Language Models

This article introduces Pos2Distill, a novel knowledge‑distillation framework that transfers capabilities from advantageous to disadvantaged positions in large language models, effectively mitigating positional bias and improving performance on long‑text retrieval and in‑context reasoning tasks.

Knowledge Distillation · Large Language Models · in-context reasoning

0 likes · 10 min read

Alibaba Cloud Developer

Sep 2, 2025 · Artificial Intelligence

Turning Large Language Models into Business Results: Alibaba Cloud’s Playbook

In this talk, Alibaba Cloud CIO Jiang Linquan shares how his team systematically tackled organizational, technical, and operational challenges to deploy large‑language‑model applications across dozens of enterprise scenarios, presenting real‑world case studies, a RIDE methodology, and practical metrics for success.

AI · Case Studies · Enterprise AI

0 likes · 36 min read

Software Engineering 3.0 Era

Sep 1, 2025 · Industry Insights

How Large AI Models Will Redefine Software Development in the Next Few Years

The article analyzes how emerging large AI models are moving from simple code copying to intent‑driven programming, examines current tactical uses versus strategic design limits, presents real‑world examples like Vibe Coding, and forecasts both the opportunities and risks for software engineering over the next 2‑3 years.

AI agents · Large Language Models · Software Architecture

0 likes · 10 min read

DataFunSummit

Sep 1, 2025 · Artificial Intelligence

Turning Large AI Models into Real Business Value: A Logistics Ops Expert’s Playbook

In this interview, senior AI product operations manager Lu Xinting shares how to identify high‑value AI scenarios, apply three practical metrics, build a closed‑loop AIGC operation framework, and design user incentives to achieve product‑market fit for large language models in logistics.

AI Operations · AIGC · Large Language Models

0 likes · 8 min read

DataFunSummit

Aug 28, 2025 · Artificial Intelligence

Why Finance Needs Its Own Large Language Model: Insights from Du Xiaoman

This article explains how the unique data‑driven, knowledge‑intensive, and complex nature of the financial industry makes large language models especially valuable, outlines the limitations of generic models, and shows how domain‑specific, cost‑effective models can deliver superior performance for finance.

AI · Large Language Models · Model Training

0 likes · 5 min read

Architects' Tech Alliance

Aug 26, 2025 · Artificial Intelligence

How DeepSeek‑V3.1’s New FP8 Precision Supercharges Domestic Chip Performance

DeepSeek‑V3.1 introduces the UE8M0 FP8 Scale precision, cutting memory usage by up to 75% and enabling next‑generation Chinese chips such as Ascend 910B to run 128K context models efficiently, while the ecosystem rapidly adopts FP8, yet challenges in IP autonomy and software maturity remain before global competitiveness is achieved.

AI hardware · DeepSeek · FP8

0 likes · 10 min read

JD Tech

Aug 25, 2025 · Artificial Intelligence

How JD’s Large‑Model Tools are Shaping AI in Enterprise: Insights & Roadmap

JD’s recent technical salon reveals the rapid evolution of large‑model tools, detailing industry trends, JD’s JoyAI ecosystem—including JoyAgent, OxyGent and JoyCode—real‑world applications across office, code review, logistics and local services, and future policy and multi‑agent visions.

AI Applications · AI tools · Agent platforms

0 likes · 13 min read

Architecture and Beyond

Aug 24, 2025 · Artificial Intelligence

Why Master‑Slave Architecture Powers Modern Multi‑Agent AI Systems

The article explains how the master‑slave (or manager‑worker) architecture, inspired by both software micro‑services and biological systems, solves context fragmentation and coordination challenges in large‑model multi‑agent applications, detailing design principles, technical implementations, advantages, limitations, and suitable use cases.

AI Coordination · Context Management · Large Language Models

0 likes · 15 min read

Wu Shixiong's Large Model Academy

Aug 23, 2025 · Artificial Intelligence

Why LoRA, QLoRA, Prompt & Prefix Tuning Are Changing Large‑Model Fine‑Tuning

This article explains the mathematical basis of LoRA, compares it with QLoRA, Prompt Tuning, Prefix Tuning and P‑tuning, shows practical PyTorch implementations, and provides mixed‑precision training tips so readers can choose the most memory‑efficient fine‑tuning method for their large language models.

Large Language Models · LoRA · Prompt Tuning

0 likes · 17 min read

DataFunSummit

Aug 23, 2025 · Artificial Intelligence

Mastering Role‑Playing AI Agents: Challenges, Techniques, and Future Directions

This article surveys the latest research on role‑playing AI agents, covering their definition, core components, application scenarios, three main challenges—role fidelity, long‑term memory, and evaluation—and presents four technical approaches for each challenge along with future research directions and references.

AI agents · Evaluation · Large Language Models

0 likes · 22 min read

JD Retail Technology

Aug 22, 2025 · Artificial Intelligence

How JD’s Open‑Source Large‑Model Tools Are Shaping the Future of Enterprise AI

This article explores the rapid evolution of large‑model AI tools, outlines JD’s open‑source solutions such as JoyAI, JoyAgent, OxyGent and JoyCode, and examines real‑world applications, design principles, policy considerations, and future directions for AI agents and embodied intelligence.

AI Applications · AI policy · Enterprise AI

0 likes · 12 min read

JD Tech Talk

Aug 20, 2025 · Artificial Intelligence

How Large AI Models Are Transforming Software Testing

This article explains what large AI models are, how they enhance capabilities across domains, and details their practical use in software testing—covering code review, automated test case generation, security and performance checks—while envisioning future impacts on manual testing efficiency.

AI in QA · Large Language Models · model‑driven testing

0 likes · 4 min read

Data Party THU

Aug 20, 2025 · Artificial Intelligence

How Dual‑Granularity Prompting Boosts Graph‑Enhanced LLMs for Fraud Detection

The article analyzes the Dual Granularity Prompting (DGP) framework, which mitigates information overload in graph‑enhanced large language models for fraud detection by applying fine‑grained processing to target nodes and coarse‑grained summarization to neighbors, achieving superior accuracy and token efficiency across multiple public and industrial datasets.

Graph Neural Networks · Large Language Models · dual granularity prompting

0 likes · 6 min read

Kuaishou Large Model

Aug 19, 2025 · Artificial Intelligence

How Klear-Reasoner Achieves SOTA Math & Code Reasoning with GPPO

Klear-Reasoner, built on Qwen3‑8B‑Base, introduces the Gradient‑Preserving Clipping Policy Optimization (GPPO) algorithm to overcome traditional clip limitations, achieving state‑of‑the‑art performance on AIME2024/2025 and LiveCodeBench while providing detailed experimental analysis and data‑quality insights.

GPPO · Gradient Clipping · Large Language Models

0 likes · 11 min read

Alibaba Cloud Developer

Aug 18, 2025 · Artificial Intelligence

Mastering Claude Prompt Engineering: 9 Proven Strategies to Boost LLM Performance

This guide systematically breaks down Anthropic's official prompt‑engineering recommendations—clear instructions, multishot examples, chain‑of‑thought prompting, XML structuring, response pre‑filling, prompt chaining, long‑context handling, extended thinking, and practical code snippets—showing how to unlock Claude's full potential across complex tasks.

AI · Chain-of-Thought · Claude

0 likes · 15 min read
