Tagged articles
2015 articles
Page 10 of 21
Data Party THU
Data Party THU
Oct 11, 2025 · Artificial Intelligence

From Transformers to LLaMA 4: A Journey Through the Biggest LLMs

This article surveys the most influential large language models released since 2017, detailing the core innovations of Transformer, BERT, GPT series, T5, Retrieval‑Augmented Generation, and the latest LLaMA and Meta models, while highlighting their architectures, training paradigms, and impact on NLP research.

LLMModel Scalinglarge language models
0 likes · 21 min read
From Transformers to LLaMA 4: A Journey Through the Biggest LLMs
BirdNest Tech Talk
BirdNest Tech Talk
Oct 10, 2025 · Artificial Intelligence

How to Build a Custom Output Parser in LangChain for Non‑Standard LLM Formats

This guide explains why custom output parsers are needed for LangChain when dealing with non‑JSON or XML responses, walks through inheriting BaseOutputParser, implementing parse() and optional format instructions, and provides a complete Python example that converts a simple "Key: Value" string into a dictionary.

CustomParserLLMLangChain
0 likes · 6 min read
How to Build a Custom Output Parser in LangChain for Non‑Standard LLM Formats
Programmer DD
Programmer DD
Oct 10, 2025 · Artificial Intelligence

How to Build a Resilient Multi‑LLM Chatbot with Spring AI

This tutorial demonstrates how to integrate multiple large language models from different providers into a Spring Boot application using Spring AI, configure primary, secondary, and tertiary models, and implement a fallback mechanism with Spring Retry to ensure high availability of the chatbot.

LLMResilienceSpring Boot
0 likes · 12 min read
How to Build a Resilient Multi‑LLM Chatbot with Spring AI
Data Party THU
Data Party THU
Oct 10, 2025 · Artificial Intelligence

Can Language Models Self‑Train Without Data? Inside the Language Self‑Play Framework

This article examines the Language Self‑Play (LSP) approach for data‑free training of large language models, detailing its challenger‑solver game formulation, advantage calculations, loss functions, self‑reward extension, experimental setup on AlpacaEval, and results that show LSP can match or surpass data‑driven baselines.

LLMReinforcement Learningdata-free training
0 likes · 14 min read
Can Language Models Self‑Train Without Data? Inside the Language Self‑Play Framework
JD Tech
JD Tech
Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

LLMRAGVector Search
0 likes · 14 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?
AntTech
AntTech
Oct 9, 2025 · Artificial Intelligence

Ling-1T: The Trillion‑Parameter AI Model Redefining Efficient Reasoning

Ling-1T, a trillion‑parameter flagship non‑thinking model, combines 50 billion active parameters per token, 128 K context, Evo‑CoT reasoning, and FP8 mixed‑precision training to achieve state‑of‑the‑art performance on complex reasoning, code generation, and multimodal tasks while outlining its architecture, benchmarks, limitations, and future roadmap.

AIFP8Inference
0 likes · 11 min read
Ling-1T: The Trillion‑Parameter AI Model Redefining Efficient Reasoning
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Oct 9, 2025 · Artificial Intelligence

How Short‑Term and Long‑Term Memory Power LLM‑Based Agents

This article explains the definitions, technical implementations, functions, limitations, and collaborative workflow of short‑term and long‑term memory in large‑language‑model agents, detailing context windows, attention mechanisms, vector storage, retrieval strategies, and future research directions for building personalized, continuously learning AI agents.

Agent MemoryArtificial IntelligenceLLM
0 likes · 11 min read
How Short‑Term and Long‑Term Memory Power LLM‑Based Agents
Data Party THU
Data Party THU
Oct 9, 2025 · Information Security

How to Secure MCP Tools: Risks, Real‑World Cases, and the Open‑Source MCPScan Framework

The article analyzes the security challenges introduced by the open Model Context Protocol (MCP) ecosystem, outlines typical attack vectors such as command‑execution hijacking and indirect prompt injection, and presents MCPScan—an open‑source scanner that combines static taint analysis with LLM‑driven reasoning to detect exploitable tool chains before deployment.

LLMMCPOpen-source
0 likes · 7 min read
How to Secure MCP Tools: Risks, Real‑World Cases, and the Open‑Source MCPScan Framework
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Oct 9, 2025 · Artificial Intelligence

Paper Review: TradingGroup – A Multi‑Agent Quantitative Trading System with Self‑Reflection and Data Synthesis

The paper introduces TradingGroup, a five‑agent LLM‑based quantitative trading framework that incorporates a self‑reflection mechanism, dynamic risk management, and an automated data‑synthesis pipeline, and demonstrates superior cumulative returns, Sharpe ratios, and lower drawdowns than rule‑based, ML, RL, and existing LLM strategies on five real‑world stock datasets.

Financial AILLMMulti-Agent System
0 likes · 14 min read
Paper Review: TradingGroup – A Multi‑Agent Quantitative Trading System with Self‑Reflection and Data Synthesis
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 9, 2025 · Operations

How AI‑Powered Multi‑Agent Systems Turn Fault Postmortems into Proactive Risk Prevention

This article explains how an AI‑driven multi‑agent platform automates fault postmortem generation, enriches analysis with memory management, prompt engineering, and RAG techniques, and delivers actionable insights for SREs, developers, and non‑technical stakeholders, ultimately shifting incident handling from reactive to proactive.

AILLMSRE
0 likes · 44 min read
How AI‑Powered Multi‑Agent Systems Turn Fault Postmortems into Proactive Risk Prevention
FunTester
FunTester
Oct 9, 2025 · Artificial Intelligence

How AI Turns Natural Language Into Automated End‑to‑End Tests

This article explains how the browser‑use/qa‑use platform leverages large language models to let testers describe test cases in natural language, automatically generates browser actions, executes them, and provides detailed reports, dramatically reducing script maintenance and boosting testing efficiency.

AI testingBrowser AutomationLLM
0 likes · 10 min read
How AI Turns Natural Language Into Automated End‑to‑End Tests
DataFunSummit
DataFunSummit
Oct 8, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal LLMs and the COPE Framework

This article reviews Kuaishou’s two‑year exploration of large‑model techniques in advertising, detailing the content‑domain estimation challenges, how multimodal and LLM approaches improve full‑domain behavior utilization and external knowledge integration, and introducing the COPE product‑content representation framework and the LEARN LLM knowledge‑transfer system.

AdvertisingKuaishouLLM
0 likes · 7 min read
How Kuaishou Boosted Ad Performance with Multimodal LLMs and the COPE Framework
BirdNest Tech Talk
BirdNest Tech Talk
Oct 8, 2025 · Artificial Intelligence

How to Turn LLM Text into Structured Data with LangChain Output Parsers

This article explains why LLMs output plain text, introduces LangChain output parsers as the bridge to structured data, details their workflow, reviews built‑in parsers, and walks through a complete Python example that builds a prompt‑model‑parser chain to generate a JSON‑based joke.

LLMLangChainOutputParser
0 likes · 10 min read
How to Turn LLM Text into Structured Data with LangChain Output Parsers
AI Cyberspace
AI Cyberspace
Oct 5, 2025 · Artificial Intelligence

AI Agent vs AI Workflow: Which Approach Suits Your Projects?

The article explains the differences between AI Agents and AI Workflows, compares their characteristics, introduces the hybrid Agentic Workflow concept, and offers practical recommendations for building enhanced LLM applications using simple prompts or advanced frameworks.

AI workflowArtificial IntelligenceLLM
0 likes · 10 min read
AI Agent vs AI Workflow: Which Approach Suits Your Projects?
DataFunSummit
DataFunSummit
Oct 5, 2025 · Artificial Intelligence

How Bilibili Uses LLM‑Powered Assistants to Tackle Big‑Data Task Failures

Bilibili’s massive video platform relies on a five‑layer, storage‑compute separated big‑data architecture, handling hundreds of thousands of daily tasks, and now leverages large‑language‑model assistants to automatically diagnose and resolve frequent task failures and performance slowdowns.

AI assistanceBilibiliDistributed Systems
0 likes · 4 min read
How Bilibili Uses LLM‑Powered Assistants to Tackle Big‑Data Task Failures
BirdNest Tech Talk
BirdNest Tech Talk
Oct 2, 2025 · Artificial Intelligence

How Function Calling Empowers LLMs: A Step‑by‑Step LangChain Guide

This article explains how function (tool) calling lets large language models like GPT or Gemini invoke external APIs, walks through defining tools with LangChain, and demonstrates a complete Python example that fetches real‑time weather data and returns a natural‑language answer.

AI agentsFunction CallingLLM
0 likes · 9 min read
How Function Calling Empowers LLMs: A Step‑by‑Step LangChain Guide
Data Party THU
Data Party THU
Oct 1, 2025 · Artificial Intelligence

Why SFT and RL Are Two Sides of the Same Coin: A Unified Gradient Theory for LLM Post‑Training

This article analyzes a recent paper that unifies supervised fine‑tuning (SFT) and reinforcement learning (RL) for large language models under a single gradient estimator, introduces the Unified Policy Gradient Estimator (UPGE) and the Hybrid Post‑Training (HPT) algorithm, and demonstrates their superior performance on math reasoning benchmarks.

AI researchHybrid TrainingLLM
0 likes · 11 min read
Why SFT and RL Are Two Sides of the Same Coin: A Unified Gradient Theory for LLM Post‑Training
BirdNest Tech Talk
BirdNest Tech Talk
Sep 30, 2025 · Artificial Intelligence

LLM vs. ChatModel in LangChain: Choosing the Right Interface

This article explains LangChain's two core abstractions—LLM for simple text completion and ChatModel for multi‑turn conversational AI—detailing their input/output formats, practical code examples, and why ChatModel is generally preferred for modern dialogue applications.

AIChatModelLLM
0 likes · 6 min read
LLM vs. ChatModel in LangChain: Choosing the Right Interface
DataFunSummit
DataFunSummit
Sep 30, 2025 · Artificial Intelligence

How Kuaishou Uses Large Models to Boost Ad Performance with COPE and LEARN

This article outlines Kuaishou's two‑year exploration of large‑model techniques in advertising, detailing challenges of sparse cross‑domain data, the COPE unified product representation framework, and the LEARN LLM knowledge‑transfer approach that together improve ad system effectiveness.

COPELLMMultimodal
0 likes · 6 min read
How Kuaishou Uses Large Models to Boost Ad Performance with COPE and LEARN
Data Party THU
Data Party THU
Sep 30, 2025 · Artificial Intelligence

Do Large Language Models Really Have Personalities? New Study Reveals a ‘Personality Illusion’

A recent interdisciplinary study from Caltech, Cambridge and others shows that while large language models can present idealized personalities on questionnaires, their actual behavior in tasks diverges sharply, exposing a ‘personality illusion’ that challenges current AI alignment approaches.

AI AlignmentBehavioral TestingLLM
0 likes · 12 min read
Do Large Language Models Really Have Personalities? New Study Reveals a ‘Personality Illusion’
DataFunSummit
DataFunSummit
Sep 30, 2025 · Artificial Intelligence

How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks

Over the past two years, Kuaishou has leveraged multimodal large‑model techniques to overcome sparse advertising data, integrating full‑domain user behavior and external knowledge via the COPE unified product representation framework and the LEARN LLM knowledge‑transfer system, achieving measurable business gains.

KuaishouLLMMultimodal
0 likes · 6 min read
How Kuaishou Boosted Ad Performance with Multimodal LLMs: COPE & LEARN Frameworks
BirdNest Tech Talk
BirdNest Tech Talk
Sep 29, 2025 · Artificial Intelligence

Mastering LangChain Serialization: Save, Load, and Share Your AI Workflows

Learn how to serialize LangChain components—including prompts, chains, and agents—using JSON and YAML, enabling reproducibility, collaboration, persistence, and decoupling, with step‑by‑step code examples for dumping objects to files and loading them back into executable LLM pipelines.

AI workflowLLMLangChain
0 likes · 8 min read
Mastering LangChain Serialization: Save, Load, and Share Your AI Workflows
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 29, 2025 · Artificial Intelligence

AlphaAgents: BlackRock’s LLM‑Driven Multi‑Agent System for Stock Portfolio Management

AlphaAgents introduces a role‑based multi‑agent framework—Fundamental, Sentiment, and Valuation agents—leveraging LLMs to analyze 10‑K reports, news, and price data, with a debate mechanism via Microsoft AutoGen; experiments on 15 tech stocks show superior cumulative returns and Sharpe ratios under risk‑neutral and risk‑averse settings compared to single‑agent baselines.

AlphaAgentsFinancial AILLM
0 likes · 10 min read
AlphaAgents: BlackRock’s LLM‑Driven Multi‑Agent System for Stock Portfolio Management
DataFunSummit
DataFunSummit
Sep 29, 2025 · Artificial Intelligence

How to Detect and Prevent Hallucinations in LLM‑Powered NL2SQL Systems

This article explains the nature, types, and causes of hallucinations in large language models used for NL2SQL, reviews both unsupervised and supervised detection methods, and introduces an efficient token‑confidence based Active Sampling Detection (ASD) approach with practical deployment examples and future research directions.

AI SafetyASDLLM
0 likes · 19 min read
How to Detect and Prevent Hallucinations in LLM‑Powered NL2SQL Systems
Alibaba Cloud Observability
Alibaba Cloud Observability
Sep 29, 2025 · Artificial Intelligence

Building a Cloud‑Native Observability Stack for LLM Apps with Alibaba SLS

This article details the engineering practice of constructing a complete data infrastructure for large‑language‑model (LLM) applications using Alibaba Cloud SLS, covering the observability challenges of the Dify platform, the redesign of the architecture, and the resulting improvements in monitoring, diagnosis, and quality optimization.

Cloud NativeData InfrastructureDify
0 likes · 23 min read
Building a Cloud‑Native Observability Stack for LLM Apps with Alibaba SLS
BirdNest Tech Talk
BirdNest Tech Talk
Sep 28, 2025 · Artificial Intelligence

Mastering LangChain Callbacks: Track LLM Execution Step‑by‑Step

LangChain’s callback system lets developers hook into every stage of an LLM chain— from chain start/end to token generation—using built‑in handlers like StdOutCallbackHandler or custom handlers derived from BaseCallbackHandler, with examples showing constructor‑level and request‑level attachment, plus a custom handler implementation.

AICallbacksLLM
0 likes · 6 min read
Mastering LangChain Callbacks: Track LLM Execution Step‑by‑Step
DataFunSummit
DataFunSummit
Sep 28, 2025 · Artificial Intelligence

Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs

This article examines the challenges of processing massive multimodal data in enterprises and presents a knowledge‑augmentation framework that leverages Retrieval‑Augmented Generation, memory‑inspired architecture, and feedback loops to enable reliable, scalable AI‑driven decision making across diverse business scenarios.

Enterprise KnowledgeLLMMultimodal AI
0 likes · 29 min read
Unlocking Enterprise Knowledge: Building Multimodal AI Systems with LLMs
DataFunSummit
DataFunSummit
Sep 28, 2025 · Artificial Intelligence

How Bilibili Built an LLM‑Powered Assistant to Tackle Massive Data Tasks

This article explains Bilibili's implementation of a large‑language‑model based intelligent assistant, detailing the platform's five‑layer architecture, the huge volume of offline and real‑time jobs, common user issues like task failures and slowdowns, and how AI can help automate troubleshooting.

BilibiliLLM
0 likes · 4 min read
How Bilibili Built an LLM‑Powered Assistant to Tackle Massive Data Tasks
Data STUDIO
Data STUDIO
Sep 28, 2025 · Artificial Intelligence

Top Reranker Models for RAG in 2025: A Comparative Review

This article explains why initial retrieval in Retrieval‑Augmented Generation often yields noisy results, describes how rerankers act as quality filters to improve relevance, compares the leading 2025 reranker models—including Cohere, bge‑reranker, Voyage, Jina, FlashRank, and MixedBread—and provides code snippets, evaluation metrics, and guidance for selecting the right model for specific use cases.

AICross-EncoderLLM
0 likes · 31 min read
Top Reranker Models for RAG in 2025: A Comparative Review
JavaGuide
JavaGuide
Sep 28, 2025 · Artificial Intelligence

JD Open‑Sources JoyAgent‑JDGenie: A Product‑Grade Java Multi‑Agent AI Platform

JD Cloud has released JoyAgent‑JDGenie, the first fully product‑grade open‑source Java multi‑agent system that bundles front‑end, back‑end, framework, engine and core agents, supports major LLMs, offers layered architecture, Docker or manual deployment, and showcases demos such as PPT generation and sales analysis.

AIDockerLLM
0 likes · 6 min read
JD Open‑Sources JoyAgent‑JDGenie: A Product‑Grade Java Multi‑Agent AI Platform
JD Tech Talk
JD Tech Talk
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, detailing its core workflow—from knowledge preparation, chunking, and embedding to vector database storage and the question‑answering stage—while highlighting key challenges, tools, and optimization strategies.

AIEmbeddingLLM
0 likes · 15 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?
JD Cloud Developers
JD Cloud Developers
Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, covering its core workflow—from knowledge preparation, data cleaning, and metadata extraction to query preprocessing, vector retrieval, reranking, information integration, and final LLM generation, while also reviewing common embedding models and vector databases.

Artificial IntelligenceLLMRAG
0 likes · 13 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Work?
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 28, 2025 · Artificial Intelligence

How Much GPU Memory Do LLMs Really Need? A Deep Dive into Training & Inference

This article breaks down the GPU memory requirements of large language models during training and inference, detailing the contributions of model weights, optimizer states, activations, KV cache, and activation recomputation, and provides concrete formulas, examples, and scaling insights for models like Qwen3 and DeepSeek V3.

GPU MemoryKV cacheLLM
0 likes · 18 min read
How Much GPU Memory Do LLMs Really Need? A Deep Dive into Training & Inference
DataFunSummit
DataFunSummit
Sep 27, 2025 · Artificial Intelligence

Bridging the Gap: Enforcing Discipline in AI Agents for Reliable Performance

This article examines the challenges of building production‑grade AI agents—such as context drift, knowledge leakage, and fragile state handling—and presents a disciplined architecture that combines code locks, attention anchors, and Redis‑backed state management to turn a prototype travel planner into a robust, industrial‑strength system.

AI AgentLLMState Management
0 likes · 14 min read
Bridging the Gap: Enforcing Discipline in AI Agents for Reliable Performance
DataFunTalk
DataFunTalk
Sep 27, 2025 · Artificial Intelligence

How Bilibili Uses LLMs to Diagnose Big Data Platform Issues

This article explains how Bilibili leverages a large‑language‑model‑driven assistant to diagnose and resolve failures and slowdowns in its massive big‑data platform, detailing the platform’s five‑layer architecture, common task issues, and the need for intelligent troubleshooting tools.

AI AssistantBig DataBilibili
0 likes · 5 min read
How Bilibili Uses LLMs to Diagnose Big Data Platform Issues
Architecture and Beyond
Architecture and Beyond
Sep 27, 2025 · Artificial Intelligence

Mastering AI Agent Tool Management: OpenManus, Gemini CLI & Shopify Sidekick

This article explains how AI agents work, examines OpenManus’s comprehensive tool framework, reviews Gemini CLI’s minimalist tool scheduling and error handling, and discusses Shopify Sidekick’s scaling challenges and Just‑in‑Time instruction strategy, offering practical guidance for building robust, production‑ready agentic systems.

AI agentsError HandlingJust-in-Time
0 likes · 15 min read
Mastering AI Agent Tool Management: OpenManus, Gemini CLI & Shopify Sidekick
Tech Freedom Circle
Tech Freedom Circle
Sep 27, 2025 · Artificial Intelligence

What Is an AI‑Native Application and How to Design One?

The article explains the concept of AI‑native applications, distinguishes them from AI‑plugin extensions, outlines their core principles such as model‑first design, data flywheel, event‑driven agents, multimodal semantics, continuous learning, and provides a seven‑step practical guide with code examples for building an AI‑native app.

AI AssistantAI-nativeData Flywheel
0 likes · 23 min read
What Is an AI‑Native Application and How to Design One?
Raymond Ops
Raymond Ops
Sep 26, 2025 · Artificial Intelligence

How to Build and Deploy a Dify LLM Application Platform on CentOS

This comprehensive guide walks you through the fundamentals of Dify, an open‑source LLM application platform, its key features and use cases, and provides step‑by‑step instructions for preparing the environment, installing Docker and Docker‑Compose, and deploying Dify on a CentOS 7.9 server.

AI PlatformDifyDocker
0 likes · 13 min read
How to Build and Deploy a Dify LLM Application Platform on CentOS
Bilibili Tech
Bilibili Tech
Sep 26, 2025 · Artificial Intelligence

How RAG Transforms Natural Language Queries into Accurate SQL for Business Users

This article explains how Retrieval‑Augmented Generation (RAG) combines large language models with vector databases to let non‑technical staff query massive membership data using plain language, detailing the workflow, technical architecture, optimization challenges, and real‑world impact on data‑driven decision making.

AIData PlatformLLM
0 likes · 17 min read
How RAG Transforms Natural Language Queries into Accurate SQL for Business Users
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Operations

RAGFlow Link Tracing: GPS‑Style Observability for LLM‑Powered Applications

The article explains why RAGFlow needs end‑to‑end link tracing, introduces OpenTelemetry’s core concepts, shows how custom tracing utilities are implemented in Python, describes the layered architecture, provides concrete Docker and YAML configurations, and offers best‑practice guidelines for performance monitoring and fault diagnosis.

Distributed SystemsLLMOpenTelemetry
0 likes · 24 min read
RAGFlow Link Tracing: GPS‑Style Observability for LLM‑Powered Applications
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

RAGFlow Primer Part 1: Introduction and Concept Deep Dive

This article provides a comprehensive technical overview of RAGFlow, an industrial‑grade Retrieval‑Augmented Generation platform, detailing its architecture, core components such as DeepDoc, intelligent chunking, embedding integration, multi‑stage retrieval, and agent workflow, while comparing it with traditional RAG shortcomings.

DeepDocIntelligent ChunkingKnowledge Base
0 likes · 32 min read
RAGFlow Primer Part 1: Introduction and Concept Deep Dive
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

RAGFlow Deep Dive: Data Parsing and Knowledge Graph Construction

This article examines RAGFlow's end‑to‑end pipeline for turning diverse documents into structured knowledge, detailing the TaskExecutor factory, the DeepDoc layout‑aware parser, chunking strategies, embedding and storage mechanisms, and the GraphRAG‑based knowledge‑graph extraction that together enable high‑precision retrieval and reasoning.

Data ParsingDeepDocElasticsearch
0 likes · 15 min read
RAGFlow Deep Dive: Data Parsing and Knowledge Graph Construction
BirdNest Tech Talk
BirdNest Tech Talk
Sep 25, 2025 · Artificial Intelligence

How to Install and Configure LangChain for LLM Development

This guide walks you through installing the LangChain library, adding model‑specific packages, verifying the setup with a Python script, configuring API keys via environment variables or a .env file, and preparing to use OpenAI‑compatible models such as DeepSeek or Qwen.

API keysEnvironmentInstallation
0 likes · 8 min read
How to Install and Configure LangChain for LLM Development
BirdNest Tech Talk
BirdNest Tech Talk
Sep 25, 2025 · Artificial Intelligence

Mastering LangChain: A Hands‑On Guide to Building LLM Applications

This repository offers a comprehensive, step‑by‑step LangChain tutorial series that walks developers through installation, the LangChain Expression Language, streaming, parallel execution, callbacks, serialization, model customization, prompt templates, memory, multimodal support, and advanced tools like LangGraph and LangSmith, enabling the creation of sophisticated AI applications.

AI DevelopmentLLMLangChain
0 likes · 9 min read
Mastering LangChain: A Hands‑On Guide to Building LLM Applications
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Sep 24, 2025 · Artificial Intelligence

Key Points for Evaluating AI Agents

The article explains how Coze's Compass introduces a flexible evaluation system for AI agents, outlines a four‑dimensional submodule assessment (planning, tool use, self‑reflection, memory), and details specific testing criteria and challenges for web, scientific, dialogue, and programming agents.

AI agentsBenchmarkingCoze
0 likes · 6 min read
Key Points for Evaluating AI Agents
DataFunSummit
DataFunSummit
Sep 24, 2025 · Artificial Intelligence

Taming LLM Hallucinations: Strategies and Solutions from 360

This article explores the problem of large‑model hallucinations, explains its definitions and classifications, analyzes root causes in data, algorithms and inference, and presents detection methods and practical mitigation techniques such as RAG, decoding strategies, and model‑enhancement approaches, illustrated with real‑world 360 use cases and future research directions.

AI SafetyLLMModel Alignment
0 likes · 22 min read
Taming LLM Hallucinations: Strategies and Solutions from 360
Huolala Tech
Huolala Tech
Sep 24, 2025 · Artificial Intelligence

How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval

The article introduces CID-GraphRAG, a novel framework that combines intent‑driven graphs with semantic similarity search to improve multi‑turn intelligent customer service, detailing its architecture, dual‑layer retrieval mechanism, evaluation against baseline models, and future research directions.

AIDialogue SystemsLLM
0 likes · 14 min read
How CID-GraphRAG Boosts Multi‑Turn AI Customer Service with Dual‑Layer Retrieval
AI Large Model Application Practice
AI Large Model Application Practice
Sep 23, 2025 · Artificial Intelligence

How MindsDB Turns Any Data Source into an AI‑Powered Query Engine

This article walks through installing MindsDB, configuring its unified data access layer, and demonstrates how to query across relational databases, files, and vector stores while injecting AI models—including traditional ML, LLMs, and embedding models—directly into SQL for intelligent data retrieval and analysis.

AI data integrationLLMMindsDB
0 likes · 16 min read
How MindsDB Turns Any Data Source into an AI‑Powered Query Engine
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 22, 2025 · Artificial Intelligence

How to Add Special Tokens to LLMs Without Losing Performance

This guide explains why naïvely adding special tokens during supervised fine‑tuning can destabilize a large language model, and provides step‑by‑step strategies—including tokenizer updates, embedding resizing, smart initialization, and LoRA‑based PEFT—to integrate new tokens while preserving the model's original capabilities.

LLMLoRAspecial tokens
0 likes · 9 min read
How to Add Special Tokens to LLMs Without Losing Performance
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 21, 2025 · Artificial Intelligence

FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9

This article reviews the FinKario paper, which introduces an event‑augmented financial knowledge graph and a two‑stage RAG retrieval strategy that together enable real‑time knowledge updates and efficient integration of long‑form research reports, yielding a Sharpe ratio of 4.9 and outperforming baseline LLMs and institutional strategies in back‑testing.

FinKarioLLMRAG
0 likes · 10 min read
FinKario: Event‑Enhanced Financial Knowledge Graphs Boost A‑Share Sharpe Ratio to 4.9
AntTech
AntTech
Sep 19, 2025 · Artificial Intelligence

How Reinforcement Learning Cuts Hallucinations in Large Language Models: Ant Insurance’s Proven Approach

Ant Insurance’s tech team leveraged reinforcement learning, focused data selection, and a multi‑dimensional reward system to dramatically reduce hallucinations in LLMs, achieving top‑rank performance on the HHEM leaderboard and robust improvements across instruction‑following and reasoning‑enhanced models.

Hallucination ControlLLMLLM-as-judge
0 likes · 6 min read
How Reinforcement Learning Cuts Hallucinations in Large Language Models: Ant Insurance’s Proven Approach
DataFunSummit
DataFunSummit
Sep 19, 2025 · Artificial Intelligence

How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑play, and explains the underlying technologies—Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and intelligent agents—that enable these applications.

AIAgentLLM
0 likes · 4 min read
How Tencent Leverages LLMs: RAG, GraphRAG, and Agents in Real‑World Apps
JD Tech
JD Tech
Sep 18, 2025 · Artificial Intelligence

How I Turned a General LLM into a Precise E‑commerce Risk Detector

The article recounts how a risk‑control algorithm engineer progressively refined a generic large language model through four stages of prompt engineering—role‑playing, business knowledge injection, deeper analysis, and a double‑hypothesis decision framework—to transform it into a precise e‑commerce fraud detection expert.

AILLMPrompt engineering
0 likes · 12 min read
How I Turned a General LLM into a Precise E‑commerce Risk Detector
DataFunSummit
DataFunSummit
Sep 18, 2025 · Artificial Intelligence

Boosting LLM Function Call: Data, Training, and Agent Optimization Strategies

This presentation by Yao Yitong of China Telecom AI Research Institute explains why Function Call is essential for LLM deployment, outlines data‑centric and training‑centric optimization methods, discusses common pitfalls and reward‑function design for reinforcement learning, and showcases practical Agent application patterns for real‑world tasks.

AgentLLMReinforcement Learning
0 likes · 36 min read
Boosting LLM Function Call: Data, Training, and Agent Optimization Strategies
AI Cyberspace
AI Cyberspace
Sep 18, 2025 · Artificial Intelligence

LangChain vs LangGraph vs LangSmith: Which AI Framework Fits Your Needs?

This article compares LangChain, LangGraph, and LangSmith—three complementary frameworks for building LLM-powered applications—explaining their distinct architectures, use cases, and features, and also introduces related concepts such as RAG, MCP, A2A protocols, hierarchical memory systems, context engineering, and knowledge graphs to guide developers in selecting and integrating the appropriate tools.

AgentContext EngineeringLLM
0 likes · 21 min read
LangChain vs LangGraph vs LangSmith: Which AI Framework Fits Your Needs?
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 17, 2025 · Artificial Intelligence

LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling

This article details how Zhuanzhuan leverages large language models to enhance on‑site device inspection through a three‑stage pipeline—intent understanding, retrieval‑augmented generation QA, and automated knowledge‑base upkeep—highlighting technical innovations, workflow integration, and the resulting operational benefits.

AIIntent UnderstandingKnowledge Base
0 likes · 14 min read
LLM‑Powered Intent Understanding, RAG QA, and Knowledge Base Maintenance for Recycling
AntTech
AntTech
Sep 16, 2025 · Information Security

Cutting-Edge Privacy Tech Unveiled: Gibbon, Panther & PromeFuzz at ACM CCS 2025

At the ACM CCS 2025 live paper showcase, three groundbreaking studies—Gibbon’s fast secure two‑party GBDT training, Panther’s efficient private approximate nearest‑neighbor search on a single server, and PromeFuzz’s knowledge‑driven LLM approach to fuzzing harness generation—are presented, highlighting significant performance and security advances.

LLMMPCapproximate nearest neighbor
0 likes · 8 min read
Cutting-Edge Privacy Tech Unveiled: Gibbon, Panther & PromeFuzz at ACM CCS 2025
Baidu Geek Talk
Baidu Geek Talk
Sep 15, 2025 · Artificial Intelligence

How Baidu’s AI Navigation Turns Voice Commands into Precise Actions

This article explains how Baidu Map’s AI navigation system converts spoken queries into accurate map instructions by combining speech recognition, intent parsing, large‑language‑model reasoning, tool calling, and memory‑reflection techniques, showcasing the underlying technologies that enable instant, context‑aware responses.

AILLMMap Services
0 likes · 13 min read
How Baidu’s AI Navigation Turns Voice Commands into Precise Actions
Data Party THU
Data Party THU
Sep 15, 2025 · Artificial Intelligence

Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents

This survey formalizes the shift from preference‑based reinforcement fine‑tuning to Agentic Reinforcement Learning, defines Agentic RL via MDP/POMDP abstractions, proposes a dual taxonomy of capabilities and task domains, compiles over 500 recent works, and outlines open challenges for scalable, robust AI agents.

AI agentsLLMPOMDP
0 likes · 12 min read
Agentic RL: Transforming LLMs into Autonomous Decision‑Making Agents
Data Party THU
Data Party THU
Sep 15, 2025 · Artificial Intelligence

Why Merge SFT and RL? Exploring Unified Fine‑Tuning Strategies for LLMs

This article examines the necessity of integrating Supervised Fine‑Tuning (SFT) with Reinforcement Learning (RL) for large language models, surveys alternating, sample‑reuse, simultaneous, and hint‑guided fusion methods, presents the underlying loss functions, and discusses practical trade‑offs such as entropy collapse and importance‑sampling corrections.

AILLMRL
0 likes · 14 min read
Why Merge SFT and RL? Exploring Unified Fine‑Tuning Strategies for LLMs
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 15, 2025 · Artificial Intelligence

Why MCP Isn't a Magic AI Upgrade: Deep Dive into Its Architecture, Host Role, and Real Costs

This article debunks common misconceptions about the Model Context Protocol (MCP), explains its client‑host‑server (CHS) architecture, shows how the Host drives AI decisions while Server and Client remain model‑agnostic, compares MCP with Function Calling, analyzes SDK source code, evaluates practical trade‑offs, and outlines the true engineering value and costs of using MCP in AI applications.

AI EngineeringFunction CallingLLM
0 likes · 35 min read
Why MCP Isn't a Magic AI Upgrade: Deep Dive into Its Architecture, Host Role, and Real Costs
Data Thinking Notes
Data Thinking Notes
Sep 14, 2025 · Artificial Intelligence

How to Build a Robust Tool Integration Module for AI Agents

This article explains the architecture, core components, and step‑by‑step implementation of a tool usage module that enables AI agents to standardize, select, execute, and transform external tools, illustrated with a sales data analysis case and detailed code snippets.

AI AgentLLMmetadata management
0 likes · 9 min read
How to Build a Robust Tool Integration Module for AI Agents
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 14, 2025 · Artificial Intelligence

How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading

The article reviews the MM‑DREX framework, which tackles the non‑stationarity of financial markets by modeling trading as a POMDP, employing a vision‑language model‑driven dynamic router to allocate four heterogeneous experts, and demonstrates superior returns, Sharpe ratios, and drawdown control across stocks, futures, and crypto compared with 15 strong baselines.

LLMPOMDPReinforcement Learning
0 likes · 13 min read
How MM‑DREX Uses Multimodal LLMs for Dynamic Expert Routing in Financial Trading
JavaEdge
JavaEdge
Sep 14, 2025 · Artificial Intelligence

Exploring Hugging Face AI Sheets: No‑Code LLM‑Powered Data Manipulation

Hugging Face AI Sheets lets users employ large language models through a spreadsheet‑like interface to clean, transform, enrich, and generate datasets without writing code, offering both zero‑shot dataset creation and import‑based bulk processing, with optional self‑hosting via Docker for privacy‑sensitive workflows.

AI SheetsDocker deploymentHugging Face
0 likes · 5 min read
Exploring Hugging Face AI Sheets: No‑Code LLM‑Powered Data Manipulation
AI Algorithm Path
AI Algorithm Path
Sep 14, 2025 · Artificial Intelligence

Qwen3-Next: Achieving Unmatched Training and Inference Cost‑Effectiveness

Alibaba's Qwen team unveils Qwen3-Next, a hybrid expert LLM with 800 B parameters but only 30 B active, delivering training costs under one‑tenth of comparable dense models and more than ten‑fold inference throughput for long contexts, while matching or surpassing larger models on benchmark tasks.

AILLMMulti-token Prediction
0 likes · 9 min read
Qwen3-Next: Achieving Unmatched Training and Inference Cost‑Effectiveness
Data Party THU
Data Party THU
Sep 14, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Uncovering the Root Causes and Practical Fixes

The article analyzes why large language models frequently generate confidently wrong answers, attributing hallucinations to statistical inevitability, data scarcity, and limited model expressiveness, and shows how RLHF exacerbates the problem by rewarding guesses, then proposes confidence‑threshold and "I don't know" strategies to mitigate it.

AISafetyConfidenceThresholdLLM
0 likes · 6 min read
Why Do Large Language Models Hallucinate? Uncovering the Root Causes and Practical Fixes
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 13, 2025 · Artificial Intelligence

Paper Summary: Recent AI Advances in Financial Time-Series (Sep 6‑12, 2025)

This article summarizes four recent AI research papers that explore zero‑shot PDE extrapolation with text‑trained LLMs, causal hidden‑state interventions for rare financial events, tabular reformulation of graph node classification, and a multimodal model for financial time‑series forecasting, detailing their methods, experiments, and key findings.

LLMTime Seriescausal intervention
0 likes · 10 min read
Paper Summary: Recent AI Advances in Financial Time-Series (Sep 6‑12, 2025)
DataFunSummit
DataFunSummit
Sep 13, 2025 · Artificial Intelligence

How Pinterest Scaled LLM Data Pipelines with Ray: Boosting Throughput and Cutting Costs

This article details how Pinterest’s senior staff engineer Dr. Luo leveraged the open‑source Ray framework to overcome LLM data‑preprocessing bottlenecks, describing the system’s architecture, key features such as map_batches, Carry‑Over Columns and Accumulators, and the dramatic performance and cost improvements achieved.

LLMPinterestRay
0 likes · 12 min read
How Pinterest Scaled LLM Data Pipelines with Ray: Boosting Throughput and Cutting Costs
Architecture and Beyond
Architecture and Beyond
Sep 12, 2025 · Artificial Intelligence

How Gemini CLI and Claude Code Achieve Context Isolation for AI Agents

This article examines the context isolation strategies employed by Gemini CLI and Claude Code in AI agents, detailing why isolation is essential, the multi‑layer memory architecture, tool execution pipelines, concurrency controls, session management, and practical recommendations for building robust, cost‑effective agent systems.

AI agentsClaude CodeGemini CLI
0 likes · 15 min read
How Gemini CLI and Claude Code Achieve Context Isolation for AI Agents
Volcano Engine Developer Services
Volcano Engine Developer Services
Sep 11, 2025 · Artificial Intelligence

Why Do Large Language Models Hallucinate? Causes, Types, and Mitigation Strategies

This article examines the growing problem of hallucinations in large language models, outlining their causes across the model lifecycle, classifying four main hallucination types, and presenting both retrieval‑augmented generation and detection techniques—white‑box and black‑box—to reduce factual errors in critical applications.

AI SafetyLLMModel Evaluation
0 likes · 15 min read
Why Do Large Language Models Hallucinate? Causes, Types, and Mitigation Strategies
Data Party THU
Data Party THU
Sep 11, 2025 · Artificial Intelligence

How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores

ComRAG tackles the static‑knowledge gaps, uneven QA quality, and storage explosion of community question‑answer platforms by integrating a static documentation vector store with dual dynamic CQA stores managed via a centroid‑based memory, delivering higher accuracy, lower latency, and scalable storage for industrial retrieval‑augmented generation.

Artificial IntelligenceCommunity QADynamic Retrieval
0 likes · 7 min read
How ComRAG Revolutionizes Real‑Time Community QA with Dynamic Vector Stores
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 11, 2025 · Artificial Intelligence

How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework

REFRAG (REpresentation For RAG) introduces a novel decoding framework that compresses, senses, and expands context using precomputed chunk embeddings, achieving up to 30.85× faster first-token generation and 16× larger context windows without sacrificing perplexity, as validated across diverse long‑context tasks.

LLMRAGReinforcement Learning
0 likes · 18 min read
How REFRAG Cuts LLM Decoding Time by 30×: A New Efficient RAG Framework
DataFunTalk
DataFunTalk
Sep 11, 2025 · Artificial Intelligence

How Google's AI Is Transforming Scientific Code Development

Google researchers have built a breakthrough AI system that uses large language models and tree‑search to automatically write, rewrite, and optimize scientific computing code, delivering solutions that surpass human experts across biology, epidemiology, remote sensing, neuroscience, time‑series analysis, and computational mathematics.

AICross‑Domain InnovationLLM
0 likes · 6 min read
How Google's AI Is Transforming Scientific Code Development
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Sep 11, 2025 · Artificial Intelligence

How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions

This article explores how abstract syntax trees (AST) can enrich large language model (LLM) based code question‑answering by providing precise structural context, detailing LLM strengths and limits, describing AST‑LLM collaboration, RAG integration, cutting‑edge models, practical tooling, challenges, standardisation efforts, and future research avenues.

ASTLLMRAG
0 likes · 30 min read
How AST Boosts LLM‑Powered Code Question Answering: Theory, Practice, and Future Directions
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 10, 2025 · Artificial Intelligence

Why AI Agents Fail and How Parlant Ensures Reliable, Controllable Bots

This article explains why most AI agents underperform due to low problem‑resolution rates, critiques traditional prompting methods, and introduces the Parlant framework with conditional rule activation, dual protection, and state‑machine architecture, followed by a complete implementation example and best‑practice guidance.

AI agentsCustomer Service AutomationLLM
0 likes · 12 min read
Why AI Agents Fail and How Parlant Ensures Reliable, Controllable Bots
Bighead's Algorithm Notes
Bighead's Algorithm Notes
Sep 9, 2025 · Artificial Intelligence

How EFS Leverages Large Language Models for Sparse Portfolio Optimization

The paper introduces the Evolutionary Factor Search (EFS) framework, which uses large language models to automatically generate and evolve alpha factors, turning sparse portfolio selection into an LLM‑guided top‑m ranking task, and demonstrates superior performance on multiple Fama‑French benchmarks and real‑world market datasets.

Alpha FactorsEvolutionary AlgorithmsFactor Search
0 likes · 11 min read
How EFS Leverages Large Language Models for Sparse Portfolio Optimization
DataFunSummit
DataFunSummit
Sep 8, 2025 · Artificial Intelligence

How Ant Group’s Ragent Redefines LLM‑Based AI Agents on Ray

This article introduces Ant Group’s new Ray‑based distributed agent framework Ragent, outlines its background and motivation, and details the four core modules—Profile, Memory, Planning, and Action—that together enable sophisticated LLM‑driven AI agents for large‑scale applications.

AI agentsAnt GroupDistributed Systems
0 likes · 4 min read
How Ant Group’s Ragent Redefines LLM‑Based AI Agents on Ray
Data Party THU
Data Party THU
Sep 8, 2025 · Artificial Intelligence

Why Small Language Models Will Dominate Agentic AI by 2025

By 2025, Agentic AI is shifting from massive LLMs to cost‑effective Small Language Models (SLMs), driven by their comparable performance, lower latency, and dramatically reduced inference and fine‑tuning costs, as detailed through market data, model benchmarks, migration steps, and real‑world case studies.

AIAgentic AILLM
0 likes · 6 min read
Why Small Language Models Will Dominate Agentic AI by 2025
21CTO
21CTO
Sep 8, 2025 · Artificial Intelligence

Alibaba Unveils Qwen3‑Max‑Preview: First Trillion‑Parameter LLM and What It Means

Alibaba introduced the Qwen3‑Max‑Preview model, a trillion‑parameter LLM that boosts multilingual understanding, complex instruction handling, and tool use while cutting hallucinations, offers competitive benchmark scores, supports 262K context, and comes with tiered token‑based pricing that may limit broader adoption.

AIAlibabaLLM
0 likes · 5 min read
Alibaba Unveils Qwen3‑Max‑Preview: First Trillion‑Parameter LLM and What It Means
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Sep 8, 2025 · Artificial Intelligence

Unlocking Precise Code Q&A: How ASTs Power AI-Driven Development

With software systems growing ever more complex, traditional text‑based code search falls short; this article explains how abstract syntax trees (AST) provide deeper structural understanding, improve query precision, enable advanced features like control‑flow analysis and knowledge‑graph construction, and outlines a full architecture for building AI‑enhanced code question‑answering systems.

ASTLLMcode question answering
0 likes · 33 min read
Unlocking Precise Code Q&A: How ASTs Power AI-Driven Development
JD Tech Talk
JD Tech Talk
Sep 8, 2025 · Artificial Intelligence

How I Turned a Generic LLM into a Precise E‑Commerce Risk Detector

The article recounts how a risk‑control algorithm engineer progressively refined a generic large language model through four stages of prompt engineering—defining roles, dimensions, structured I/O, business rules, behavior fingerprints, and a dual‑hypothesis decision framework—to transform it into a precise e‑commerce fraud detection expert.

AILLMPrompt engineering
0 likes · 10 min read
How I Turned a Generic LLM into a Precise E‑Commerce Risk Detector
JD Cloud Developers
JD Cloud Developers
Sep 8, 2025 · Artificial Intelligence

Turn a Generic LLM into an E‑Commerce Risk Detector with Prompt Engineering

In this detailed case study, a risk‑control algorithm engineer explains how he progressively refined prompts for a large language model—starting from a basic role‑playing instruction, adding business‑specific exemption rules, structuring input/output, and finally implementing a dual‑hypothesis decision framework—to transform the model into a reliable e‑commerce fraud detection expert.

AIE-commerce FraudLLM
0 likes · 10 min read
Turn a Generic LLM into an E‑Commerce Risk Detector with Prompt Engineering
Data Thinking Notes
Data Thinking Notes
Sep 7, 2025 · Artificial Intelligence

Unlocking AI Agent Memory: How LLMs Use Retrieval and Planning to Stay Smart

This article explains the core architecture of AI agents powered by large language models, detailing how planning, short‑term and long‑term memory, and tool integration work together through vector databases, retrieval‑augmented generation, and summarization to enable stateful, intelligent interactions across multiple sessions.

AI AgentLLMMemory
0 likes · 10 min read
Unlocking AI Agent Memory: How LLMs Use Retrieval and Planning to Stay Smart
Architecture and Beyond
Architecture and Beyond
Sep 6, 2025 · Artificial Intelligence

How AI Agents Manage Context: Compression Strategies from Manus, Claude Code, and Gemini CLI

This article examines the context explosion problem in AI agents and compares three distinct compression approaches—Manus's never‑lose philosophy, Claude Code's aggressive 92% threshold with eight‑section summaries, and Gemini CLI's balanced 70% trigger with curated history—highlighting their trade‑offs in performance, cost, and reliability.

AIAgent DesignLLM
0 likes · 19 min read
How AI Agents Manage Context: Compression Strategies from Manus, Claude Code, and Gemini CLI
IT Services Circle
IT Services Circle
Sep 5, 2025 · Artificial Intelligence

10 Must‑Know Tencent AI Interview Topics: Overfitting, Dropout, Transformers & Beyond

This article compiles the ten core questions from a Tencent algorithm interview, covering overfitting, regularization, generalization error, dropout, residual connections, attention, embeddings, BART vs BERT, instruction‑tuning data, LLM hallucination, and why GANs collapse more than diffusion models, with concise explanations and interview‑ready tips.

GANLLMRegularization
0 likes · 22 min read
10 Must‑Know Tencent AI Interview Topics: Overfitting, Dropout, Transformers & Beyond