Tagged articles
2015 articles
Page 18 of 21
Continuous Delivery 2.0
Continuous Delivery 2.0
Jul 3, 2024 · Artificial Intelligence

Applying Large Language Models to Software Engineering: Challenges, Cross‑File Editing Issues, Bug‑Fixing Evaluation, and SWE‑Bench Results

This article examines the practical challenges of using large language models in software development, including handling long contexts, cross‑file editing, bug‑fixing evaluation methods, and presents benchmark results from SWE‑Bench and its Lite subset to assess model capabilities.

Cross-File EditingLLMSWE-bench
0 likes · 7 min read
Applying Large Language Models to Software Engineering: Challenges, Cross‑File Editing Issues, Bug‑Fixing Evaluation, and SWE‑Bench Results
DataFunTalk
DataFunTalk
Jul 2, 2024 · Artificial Intelligence

Application of Large Language Models in Recommendation Systems: Overview and Future Directions

This article provides a comprehensive overview of how large language models (LLMs) are applied in recommendation systems, covering two main paradigms—LLM+RS as a component and LLM as a standalone recommender—detailing their impact on pre‑training, fine‑tuning, prompting, and future research challenges.

Fine-tuningFuture DirectionsLLM
0 likes · 6 min read
Application of Large Language Models in Recommendation Systems: Overview and Future Directions
AntTech
AntTech
Jul 2, 2024 · Artificial Intelligence

Design and Implementation of a Generalized Retrieval‑Augmented Generation (RAG) Framework with Graph RAG Support

This article surveys Retrieval‑Augmented Generation (RAG), analyzes the limitations of traditional vector‑based RAG, introduces Graph RAG that leverages knowledge graphs for more reliable context, proposes a universal RAG architecture compatible with vector, graph and full‑text indexes, and details its open‑source implementation, code components, testing, and future research directions.

AIEngineeringGraphRAGKnowledgeGraph
0 likes · 26 min read
Design and Implementation of a Generalized Retrieval‑Augmented Generation (RAG) Framework with Graph RAG Support
JD Retail Technology
JD Retail Technology
Jul 1, 2024 · Artificial Intelligence

Generative Recommendation Systems for JD Alliance Advertising: Design, Implementation, and Evaluation

This article surveys how large language models reshape recommendation systems, details a generative recommender framework for JD Alliance ads—including item representation, model input, training, and inference—presents extensive offline and online experiments, and discusses future optimization directions.

Generative RecommendationJD AllianceLLM
0 likes · 25 min read
Generative Recommendation Systems for JD Alliance Advertising: Design, Implementation, and Evaluation
Continuous Delivery 2.0
Continuous Delivery 2.0
Jun 29, 2024 · Artificial Intelligence

AI in Software Engineering at Google: Progress and the Path Ahead

The article describes how Google has integrated AI, particularly large language models, into its internal software development tools to improve developer productivity, outlines the challenges faced, shares lessons learned, and outlines future directions for AI‑driven engineering assistance.

AIGoogleLLM
0 likes · 10 min read
AI in Software Engineering at Google: Progress and the Path Ahead
NewBeeNLP
NewBeeNLP
Jun 28, 2024 · Artificial Intelligence

Why Large Language Models Aren’t Magic: Understanding Compression and Prompt Engineering

This article demystifies large language models by comparing them to classic compression algorithms, explains how they compress massive data into compact parameters, explores their ability to learn abstract patterns, and provides practical insights into prompt engineering, sampling strategies, and multi‑step agent architectures for real‑world applications.

Agent ArchitectureLLMSampling
0 likes · 19 min read
Why Large Language Models Aren’t Magic: Understanding Compression and Prompt Engineering
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 27, 2024 · Artificial Intelligence

Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design

This article presents a comprehensive guide to data engineering for research‑focused large language models, covering domain‑adaptive pre‑training, supervised fine‑tuning, retrieval‑augmented generation, dataset construction, data cleaning pipelines, token‑izer adaptation, and prompt engineering best practices to boost model performance in specialized tasks.

Fine‑TuningLLMRAG
0 likes · 20 min read
Engineering Data for R&D Large Language Models: From Pre‑training to Prompt Design
Baidu Geek Talk
Baidu Geek Talk
Jun 26, 2024 · Artificial Intelligence

Build a Conversational 24‑Point Game with Baidu AppBuilder’s AI Agent

This guide walks through the complete workflow of creating an AI‑native 24‑point game using Baidu Cloud's AppBuilder, covering the three‑step methodology, Agent architecture, component design, custom workflow implementation, and practical tips for optimal model selection.

24-point gameAI native appAgent Architecture
0 likes · 14 min read
Build a Conversational 24‑Point Game with Baidu AppBuilder’s AI Agent
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 26, 2024 · Cloud Native

Securing LLM Calls with Alibaba Cloud ASM Service Mesh Using a Wasm Plugin

This article demonstrates how to protect large language model (LLM) requests in a cloud‑native environment by using Alibaba Cloud ASM service mesh and a custom Wasm plugin to dynamically inject API keys, enforce custom denial patterns, and optionally route requests through a private LLM for intelligent data‑leak detection.

Cloud NativeKubernetesLLM
0 likes · 13 min read
Securing LLM Calls with Alibaba Cloud ASM Service Mesh Using a Wasm Plugin
JavaEdge
JavaEdge
Jun 23, 2024 · Artificial Intelligence

Build a Cultural Name‑Generator with LangChain, Custom Prompts, and Output Parsers

This tutorial walks through installing LangChain, creating an LLM (via own GPU resources or third‑party APIs), designing parameterized prompt templates, implementing a custom output parser for structured results, and running a complete Python example that generates culturally specific names.

AILLMLangChain
0 likes · 7 min read
Build a Cultural Name‑Generator with LangChain, Custom Prompts, and Output Parsers
JavaEdge
JavaEdge
Jun 23, 2024 · Artificial Intelligence

What Is LangChain? Features, Pros, Cons, and Setup Guide

This article introduces LangChain, an open‑source framework for building LLM‑powered applications, outlines its key components such as prompts, chains, agents, and retrieval‑augmented generation, compares its advantages and drawbacks, and provides step‑by‑step instructions for setting up a Python development environment.

AIFrameworkLLM
0 likes · 7 min read
What Is LangChain? Features, Pros, Cons, and Setup Guide
DataFunTalk
DataFunTalk
Jun 21, 2024 · Artificial Intelligence

Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment

This article introduces the Alibaba Cloud PAI platform for large language model (LLM) fine‑tuning, covering model‑training pipelines, performance‑cost trade‑offs, retrieval‑augmented generation, fine‑tuning methods such as full‑parameter, LoRA and QLoRA, model selection, data preparation, evaluation, and real‑world deployment examples.

AI PlatformFine-tuningLLM
0 likes · 20 min read
Fine‑tuning Large Language Models with Alibaba Cloud PAI: Practices, Techniques, and Deployment
Data Thinking Notes
Data Thinking Notes
Jun 20, 2024 · Artificial Intelligence

Leveraging LLMs for Data: Embedding Search, Knowledge Bases, Text2SQL, and EDA

This article explores how large language models can transform data workflows by using embeddings for semantic search, building private domain knowledge bases, generating SQL code from natural language with visualized results, and enhancing exploratory data analysis, outlining practical steps and benefits for enterprises.

EDAEmbeddingKnowledge Base
0 likes · 7 min read
Leveraging LLMs for Data: Embedding Search, Knowledge Bases, Text2SQL, and EDA
NewBeeNLP
NewBeeNLP
Jun 20, 2024 · Artificial Intelligence

How LLMs Transform Recommendation Systems: Insights from Kuaishou’s LERAN Paper

This article analyzes Kuaishou’s May 2024 paper on LLM‑driven recommendation, detailing its dual‑tower architecture, contrastive learning of user and item embeddings, and a CVR‑auxiliary task that together improve cold‑start handling and boost both offline and online AUC metrics.

Industrial ApplicationItem EmbeddingLLM
0 likes · 10 min read
How LLMs Transform Recommendation Systems: Insights from Kuaishou’s LERAN Paper
NewBeeNLP
NewBeeNLP
Jun 19, 2024 · Artificial Intelligence

Can Symbolic Chain‑of‑Thought Boost LLM Logical Reasoning?

The paper introduces SymbCoT, a Symbolic Chain‑of‑Thought framework that translates natural‑language problems into symbolic form, plans, solves, and verifies reasoning steps, achieving significantly higher logical reasoning performance than traditional CoT methods across multiple benchmark datasets.

ACL 2024Chain-of-ThoughtLLM
0 likes · 13 min read
Can Symbolic Chain‑of‑Thought Boost LLM Logical Reasoning?
dbaplus Community
dbaplus Community
Jun 18, 2024 · Artificial Intelligence

How to Effectively Evaluate RAG Systems: Metrics, Tools, and Best Practices

Evaluating Retrieval‑Augmented Generation (RAG) systems requires both component‑level and end‑to‑end metrics—such as context relevance, recall, answer relevance, and groundedness—and can be automated with tools like TruLens, RAGAS, LangSmith, and Langfuse, enabling systematic selection and optimization of LLM applications.

AI metricsLLMLangSmith
0 likes · 8 min read
How to Effectively Evaluate RAG Systems: Metrics, Tools, and Best Practices
JavaEdge
JavaEdge
Jun 17, 2024 · Artificial Intelligence

Build Simple LLM Agents with LangChain: A Hands‑On Tutorial

This guide explains what AI agents are, how they combine large language models with planning, memory, and tool use, and provides a step‑by‑step LangChain implementation—including environment setup, tool integration, and a runnable example that solves math and performs web searches.

LLMLangChainPython
0 likes · 6 min read
Build Simple LLM Agents with LangChain: A Hands‑On Tutorial
Bilibili Tech
Bilibili Tech
Jun 14, 2024 · Artificial Intelligence

Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments

The report presents the open‑source Index‑1.9B family—base, pure, chat, and character variants—detailing benchmark results, pre‑training optimizations such as a normalized LM‑Head and deeper‑slim architectures, the importance of modest instruction data, alignment via SFT/DPO, role‑play enhancements with RAG, and acknowledges remaining safety and factual limitations.

AlignmentInstruction TuningLLM
0 likes · 15 min read
Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 14, 2024 · Artificial Intelligence

Boost LLM Speed: How KV Cache Quantization Cuts Memory While Preserving Quality

This article explains Hugging Face's KV cache quantization technique, detailing how it reduces memory usage for long‑context LLM generation, the underlying quantization methods, implementation steps in 🤗 Transformers, benchmark results versus fp16, and the trade‑offs between speed, memory, and accuracy.

LLMMemory OptimizationTransformers
0 likes · 15 min read
Boost LLM Speed: How KV Cache Quantization Cuts Memory While Preserving Quality
Continuous Delivery 2.0
Continuous Delivery 2.0
Jun 14, 2024 · Artificial Intelligence

AI Code Generation Tools: Benefits, Risks, and Top Choices

This article explains how AI-powered code generators create high‑quality code, outlines their capabilities such as language translation and documentation assistance, discusses safety and copyright concerns highlighted by research, and emphasizes that while popular, these tools should augment rather than replace developers.

AILLMcode-generation
0 likes · 2 min read
AI Code Generation Tools: Benefits, Risks, and Top Choices
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jun 14, 2024 · Artificial Intelligence

How Alibaba Cloud OpenSearch Powers RAG: Insights from AICon 2024

In this talk, Alibaba Cloud's OpenSearch RAG team shares their year‑long journey of building retrieval‑augmented generation systems, covering data parsing, slicing, vectorization, hybrid retrieval, model fine‑tuning, performance optimizations, cost reduction, and future directions such as multimodal queries and agents.

AI searchHybrid RetrievalLLM
0 likes · 25 min read
How Alibaba Cloud OpenSearch Powers RAG: Insights from AICon 2024
JD Cloud Developers
JD Cloud Developers
Jun 13, 2024 · Artificial Intelligence

How LLMs Are Redefining Recommender Systems for JD Union Ads

This article surveys the impact of large language models on recommendation systems, outlines generative recommender architectures, discusses challenges of JD Union advertising, presents a semantic‑ID based solution with training and inference details, and reports offline and online experimental results.

AILLMcold start
0 likes · 22 min read
How LLMs Are Redefining Recommender Systems for JD Union Ads
JD Tech Talk
JD Tech Talk
Jun 13, 2024 · Artificial Intelligence

Generative Recommender Systems for JD Affiliate Advertising: Architecture, Methods, and Experimental Evaluation

This article surveys how large language models can reshape recommendation systems, describes the four-stage generative pipeline, details item representation techniques such as semantic IDs, presents a JD affiliate advertising use case with offline and online experiments, and outlines future optimization directions.

LLMcold startgenerative recommender
0 likes · 25 min read
Generative Recommender Systems for JD Affiliate Advertising: Architecture, Methods, and Experimental Evaluation
DataFunSummit
DataFunSummit
Jun 12, 2024 · Artificial Intelligence

Large Language Model (LLM) Powered Recommendation Systems: Overview, Techniques, Challenges, and Future Directions

This article reviews how large language models are transforming recommendation systems, covering their fundamentals, recent LLM‑enabled methods for representation, learning and generalization, challenges such as scalability, bias and privacy, and future research directions including personalized prompts and robust model integration.

LLMmodel generalizationprompt learning
0 likes · 19 min read
Large Language Model (LLM) Powered Recommendation Systems: Overview, Techniques, Challenges, and Future Directions
DataFunSummit
DataFunSummit
Jun 10, 2024 · Artificial Intelligence

Xiaomi Agent Technology: Architecture, Prompt Management, and Evaluation

This article presents Xiaomi's work on LLM‑based Agent technology, covering its perception‑thinking‑action pipeline, technical framework, prompt management, executor and API platform, workflow, optimization strategies, evaluation metrics, and future directions for AI assistants.

AI AssistantAgentLLM
0 likes · 17 min read
Xiaomi Agent Technology: Architecture, Prompt Management, and Evaluation
DaTaobao Tech
DaTaobao Tech
Jun 7, 2024 · Artificial Intelligence

Exploring AI Agent Integration in HandCat App: Architecture, Tool Management, and Implementation

The HandCat team designed an end‑to‑LLM pipeline that separates agent templates, tool protocols, and view layers, enabling LLM‑driven agents with memory, planning, and three tool types—general, selector, and interruptor—to safely manage sessions, handle errors, and balance granularity for performance within a commercial mobile app.

AI AgentAgent LabLLM
0 likes · 18 min read
Exploring AI Agent Integration in HandCat App: Architecture, Tool Management, and Implementation
JD Tech Talk
JD Tech Talk
Jun 7, 2024 · Artificial Intelligence

AI‑Powered JUnit Rule for Automatic Error Reporting to GPT

This tutorial shows Java engineers how to build a JUnit Rule that captures test failures, extracts the exception stack and source file, and automatically sends the information to OpenAI's GPT for analysis and code‑fix suggestions, complete with reusable data‑model classes and utility methods.

AIGPTJUnit
0 likes · 13 min read
AI‑Powered JUnit Rule for Automatic Error Reporting to GPT
AI Large Model Application Practice
AI Large Model Application Practice
Jun 7, 2024 · Artificial Intelligence

Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG

This article explores two advanced retrieval paradigms—Fusion Retrieval, which merges results from multiple retrievers using re‑ranking, and Recursive Retrieval, which builds hierarchical chunk‑to‑chunk or chunk‑to‑retriever links—to boost the quality and flexibility of Retrieval‑Augmented Generation pipelines.

Fusion RetrievalLLMLangChain
0 likes · 12 min read
Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG
JD Tech
JD Tech
Jun 7, 2024 · Artificial Intelligence

Automated Test Case Generation Using LangChain, Vector Databases, and Large Language Models

This article presents a practical approach to automatically generate software test cases by leveraging LangChain, PDF parsing, vector‑database retrieval, and large language models, comparing it with existing tools, detailing implementation steps, code examples, experimental results, and future improvement directions.

LLMLangChainPDF parsing
0 likes · 14 min read
Automated Test Case Generation Using LangChain, Vector Databases, and Large Language Models
Sohu Tech Products
Sohu Tech Products
Jun 5, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

The article outlines LLM issues such as hallucination, outdated knowledge, and data privacy, then explains Retrieval‑Augmented Generation—detailing its data‑preparation and query‑time retrieval workflow, demonstrates a full LangChain implementation, and contrasts RAG with fine‑tuning as complementary strategies for up‑to‑date, grounded responses.

LLMLangChainRAG
0 likes · 15 min read
Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation
JavaEdge
JavaEdge
Jun 5, 2024 · Artificial Intelligence

Step‑by‑Step Guide to Building a Name‑Generator with LangChain and OpenAI

This tutorial walks through installing LangChain, creating an LLM with either self‑hosted or third‑party models, designing custom prompt templates, configuring output parsers for structured results, and running a complete Python example that generates culturally specific names using OpenAI's API.

LLMLangChainOpenAI
0 likes · 8 min read
Step‑by‑Step Guide to Building a Name‑Generator with LangChain and OpenAI
JD Tech
JD Tech
May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI EngineeringAI agentsLLM
0 likes · 41 min read
Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications
NewBeeNLP
NewBeeNLP
May 31, 2024 · Artificial Intelligence

Can Cleaned Web Data Rival Proprietary Corpora for LLM Training?

This article analyzes whether large‑scale web crawls, when meticulously filtered and deduplicated, can match or surpass the performance of high‑quality curated datasets in training large language models, covering dataset composition, processing pipelines, experimental results, scaling‑law implications, and future data‑efficiency strategies.

Artificial IntelligenceDataset CleaningLLM
0 likes · 23 min read
Can Cleaned Web Data Rival Proprietary Corpora for LLM Training?
21CTO
21CTO
May 30, 2024 · Artificial Intelligence

Why AI Leaders Urge Students to Move Beyond Large Language Models

At VivaTech, Meta AI chief Yann LeCun warned students that building next‑generation AI systems means steering clear of large language model research, while other experts highlight emerging architectures and multimodal models like GPT‑4o as the future of artificial intelligence.

AIGPT-4oLLM
0 likes · 3 min read
Why AI Leaders Urge Students to Move Beyond Large Language Models
Practical DevOps Architecture
Practical DevOps Architecture
May 30, 2024 · Artificial Intelligence

Eight‑Week LLM and Large Model Training Course Outline

This article outlines an eight‑week curriculum covering LLM evolution, PyTorch fundamentals, CUDA training, large‑model fine‑tuning, LangChain application development, cloud‑based quantization, industry case studies, and a recruitment session, providing video resources for each topic.

AIFine-tuningLLM
0 likes · 5 min read
Eight‑Week LLM and Large Model Training Course Outline
Continuous Delivery 2.0
Continuous Delivery 2.0
May 30, 2024 · Artificial Intelligence

Meta’s TestGen‑LLM: AI‑Driven Automatic Unit Test Generation for Kotlin Code

In 2024 Meta introduced TestGen‑LLM, an AI‑powered tool that automatically generates Kotlin unit tests using large language models, improving test coverage through a multi‑stage pipeline of candidate generation, compilation filtering, execution filtering, coverage validation, refactoring, and engineer review, with reported coverage gains across Facebook and Instagram codebases.

AIKotlinLLM
0 likes · 6 min read
Meta’s TestGen‑LLM: AI‑Driven Automatic Unit Test Generation for Kotlin Code
Baobao Algorithm Notes
Baobao Algorithm Notes
May 30, 2024 · Artificial Intelligence

What’s the Latest RLHF Landscape? From PPO to ORPO Explained

This article surveys the current RLHF ecosystem, comparing on‑policy methods like PPO with off‑policy approaches such as DPO, and examines recent variants—including ReMax, GRPO, DPOP, TDPO, and ORPO—highlighting their algorithmic differences, resource trade‑offs, and practical performance insights.

AlignmentDPOLLM
0 likes · 23 min read
What’s the Latest RLHF Landscape? From PPO to ORPO Explained
21CTO
21CTO
May 29, 2024 · Artificial Intelligence

How AI PCs Are Redefining the Desktop: Inside Microsoft’s Copilot+ Vision

Microsoft’s vision of AI PCs, highlighted by the Copilot+ concept, details how integrated NPU hardware, local large‑language models, and the Windows Copilot Runtime enable on‑device AI inference, reducing data‑center load and offering developers a unified platform for building next‑generation AI applications.

AI PCCopilot+Edge Computing
0 likes · 11 min read
How AI PCs Are Redefining the Desktop: Inside Microsoft’s Copilot+ Vision
JD Cloud Developers
JD Cloud Developers
May 29, 2024 · Artificial Intelligence

How Multi‑Agent AI Is Revolutionizing E‑Commerce Decision Making

This article explores JD Retail's AI‑driven multi‑agent system that mimics real‑world merchant decision processes, detailing the ReAct paradigm, agent roles, workflow, training methods, monitoring, and future directions for building intelligent e‑commerce assistants.

AIAgent ArchitectureFine-tuning
0 likes · 21 min read
How Multi‑Agent AI Is Revolutionizing E‑Commerce Decision Making
21CTO
21CTO
May 28, 2024 · Artificial Intelligence

13 Open‑Source AI Projects That Made the 2024 GitHub Accelerator – A Deep Dive

This article showcases the 13 award‑winning open‑source AI projects featured in the 2024 GitHub Accelerator, highlighting each project's purpose, founders, key technologies, and how they advance machine‑learning, model training, deployment, and innovative AI applications across various domains.

AI toolsGitHub AcceleratorLLM
0 likes · 9 min read
13 Open‑Source AI Projects That Made the 2024 GitHub Accelerator – A Deep Dive
NewBeeNLP
NewBeeNLP
May 28, 2024 · Artificial Intelligence

How Generative Models Are Redefining Recommendation Systems

This article reviews recent advances in generative recommendation, highlighting challenges such as item representation and multimodal fusion, and summarizing four key research papers that propose novel tokenization, collaborative integration, and transformer-based multimodal approaches to improve recommendation performance.

AI researchGenerative RecommendationLLM
0 likes · 8 min read
How Generative Models Are Redefining Recommendation Systems
37 Interactive Technology Team
37 Interactive Technology Team
May 27, 2024 · Artificial Intelligence

Enhancing AI Code Review Quality with Contextual Embedding and Function Calling

The article explains how AI code reviews suffer from missing context, and improves them by embedding the codebase, using Retrieval‑Augmented Generation to fetch relevant snippets, and adding a function‑calling tool that lets the model autonomously request additional code, resulting in precise, bug‑detecting feedback.

AI code reviewEmbeddingFunction Calling
0 likes · 8 min read
Enhancing AI Code Review Quality with Contextual Embedding and Function Calling
NewBeeNLP
NewBeeNLP
May 24, 2024 · Artificial Intelligence

How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning

This article reviews the NoteLLM paper, which leverages Llama 2 to create richer text embeddings and automatically generate tags and categories for note recommendation, addressing cold‑start issues through a multitask prompt design, generative‑contrastive learning, and collaborative supervised fine‑tuning, and demonstrates strong offline and online gains.

EmbeddingGenerative Contrastive LearningLLM
0 likes · 14 min read
How NoteLLM Boosts Cold‑Start Recommendation with Generative Contrastive Learning
DevOps
DevOps
May 23, 2024 · Information Security

Guidelines for Evaluating Large Language Models in Cybersecurity Tasks

The article examines the opportunities and risks of applying large language models (LLMs) to cybersecurity, outlines fourteen practical recommendations for assessing their real‑world capabilities, and concludes with an invitation to the upcoming R&D Efficiency Conference covering AI, product management, and related topics.

AI SafetyInformation SecurityLLM
0 likes · 11 min read
Guidelines for Evaluating Large Language Models in Cybersecurity Tasks
Cognitive Technology Team
Cognitive Technology Team
May 23, 2024 · Operations

eBPF + LLM: Building the Infrastructure for Observability Agents

The article explains how zero‑intrusion eBPF provides full‑stack, high‑quality observability data that, when combined with large language models, enables AI‑driven agents to automate ticket handling, change impact analysis, and vulnerability triage, dramatically improving operational efficiency.

AI AgentDistributed TracingLLM
0 likes · 17 min read
eBPF + LLM: Building the Infrastructure for Observability Agents
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 23, 2024 · Artificial Intelligence

Building a Multi‑Agent Bookmark Assistant Bot with Coze: From File Upload to AI‑Powered Search

This tutorial walks through creating a Coze bot that uses multi‑agent orchestration, memory variables, triggers, and large‑language‑model integration to upload bookmark files, extract and clean data, classify sites, generate importable HTML bookmarks, and provide AI‑driven search functionality, complete with Python code examples and deployment tips.

AI agentsBookmark ManagementBot Development
0 likes · 24 min read
Building a Multi‑Agent Bookmark Assistant Bot with Coze: From File Upload to AI‑Powered Search
Eric Tech Circle
Eric Tech Circle
May 22, 2024 · Artificial Intelligence

Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide

This article introduces Dify, an open‑source LLM application platform, outlines its core features such as workflows, model support, RAG pipelines, agents, and observability, compares it with alternatives, and provides step‑by‑step deployment instructions using Docker Compose and Helm for local and Kubernetes environments.

AI PlatformDockerKubernetes
0 likes · 7 min read
Deploy and Build AI Apps with Dify: A Complete Open‑Source Guide
JD Tech
JD Tech
May 22, 2024 · Artificial Intelligence

AI Multi‑Agent System for E‑commerce Merchant Assistance: Design, ReAct Architecture, and Implementation

The article describes JD Retail's AI‑driven multi‑agent platform that models real‑world merchant decision‑making with ReAct‑based LLM agents, detailing the system architecture, agent roles, reasoning loops, workflow examples, training pipelines, monitoring, and future directions for e‑commerce support.

AILLMMulti-Agent
0 likes · 21 min read
AI Multi‑Agent System for E‑commerce Merchant Assistance: Design, ReAct Architecture, and Implementation
Baidu Geek Talk
Baidu Geek Talk
May 22, 2024 · Artificial Intelligence

How AI Can Auto‑Generate Perfect Git Commit Messages

This article explains how a large‑language‑model‑driven tool can automatically create standardized Git commit messages by extracting change summaries, applying customizable plugins, measuring performance with MSE and adoption rate, and optimizing prompts, data pipelines, and fine‑tuning strategies.

AICommitMessageDataProcessing
0 likes · 17 min read
How AI Can Auto‑Generate Perfect Git Commit Messages
JD Retail Technology
JD Retail Technology
May 22, 2024 · Artificial Intelligence

AI Multi‑Agent System for E‑commerce: Design, Implementation, and Operational Insights

This article presents a comprehensive overview of JD Retail's AI‑driven multi‑agent architecture for e‑commerce assistance, detailing how real‑world merchant decision processes are modeled with ReAct‑based LLM agents, the hierarchical workflow, training pipelines, monitoring mechanisms, and future directions for scalable intelligent commerce support.

AIAgent ArchitectureKnowledge Retrieval
0 likes · 20 min read
AI Multi‑Agent System for E‑commerce: Design, Implementation, and Operational Insights
NewBeeNLP
NewBeeNLP
May 18, 2024 · Artificial Intelligence

How to Detect Test Set Contamination in Black‑Box Language Models

Researchers propose a black‑box method to expose test‑set leakage in large language models by comparing log‑probability shifts when test items are shuffled, using Monte‑Carlo estimation and a sharded likelihood test, and demonstrate its effectiveness on several models including Mistral‑7B.

LLMblack-box detectionevaluation
0 likes · 8 min read
How to Detect Test Set Contamination in Black‑Box Language Models
Open Source Tech Hub
Open Source Tech Hub
May 16, 2024 · Artificial Intelligence

Deploy and Run Llama 3 Locally with Ollama in Minutes

This guide explains how to download a GGUF‑format Llama 3 model, create a Modelfile, use Ollama commands to build and run the model locally, test it, and interact via the built‑in REST API, including useful Docker and model‑management tips.

DockerGGUFLLM
0 likes · 7 min read
Deploy and Run Llama 3 Locally with Ollama in Minutes
Alibaba Cloud Native
Alibaba Cloud Native
May 15, 2024 · Cloud Native

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

This article walks through setting up a cloud‑native test environment using the open‑source NextChat UI and Higress API gateway to let Qwen‑2.5 masquerade as GPT‑4o, enabling a side‑by‑side comparison of their responses while showcasing Higress’s streaming, hot‑update, and security features for AI workloads.

AI gatewayDockerGPT-4o
0 likes · 8 min read
Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress
Efficient Ops
Efficient Ops
May 14, 2024 · Artificial Intelligence

How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations

This article explores why large‑model Agent technology is essential for AIOps, explains single‑ and multi‑Agent architectures, memory and tool integration, and demonstrates practical applications such as anomaly detection, fault diagnosis, automated remediation, ChatOps, and future directions for intelligent, autonomous operations.

AI agentsLLMLarge Model
0 likes · 14 min read
How Large‑Model Agents Are Revolutionizing AIOps and Modern Operations
StarRocks
StarRocks
May 14, 2024 · Artificial Intelligence

How Tencent Games Boosted AI‑Generated SQL Accuracy to 89% with a Lakehouse Architecture

Tencent Games tackled the low accuracy of AI‑generated SQL in production by combining large language models with a StarRocks lake‑warehouse, introducing a semantic layer, async materialized views, and an agent‑based multi‑intelligence framework, ultimately raising one‑shot SQL correctness to 89% and cutting delivery time from 2 hours to 0.33 hours.

AILLMLakehouse
0 likes · 13 min read
How Tencent Games Boosted AI‑Generated SQL Accuracy to 89% with a Lakehouse Architecture
NewBeeNLP
NewBeeNLP
May 13, 2024 · Artificial Intelligence

Why DPO Treats LLMs as Q‑Functions: A Deep Theoretical Dive

This article offers a detailed theoretical interpretation of the DPO algorithm, showing how large language models can be viewed as Q‑functions, unifying sequence‑wise and step‑wise decision perspectives, and discussing the resulting implications for reinforcement‑learning‑based alignment research.

DPOLLMPreference Optimization
0 likes · 14 min read
Why DPO Treats LLMs as Q‑Functions: A Deep Theoretical Dive
DataFunTalk
DataFunTalk
May 9, 2024 · Databases

ByteHouse Vector Search Technical Guide: Architecture, Design, and Performance Optimizations

This guide explains ByteHouse’s high‑performance vector search capabilities, covering the background of vector retrieval for LLMs, the limitations of its existing skip‑index architecture, the new vector‑index design with HNSW and IVF, query‑time optimizations, performance benchmarks against Milvus, and future development plans.

ByteHouseLLMindexing
0 likes · 8 min read
ByteHouse Vector Search Technical Guide: Architecture, Design, and Performance Optimizations
Baidu App Technology
Baidu App Technology
May 8, 2024 · Artificial Intelligence

How AI Can Auto‑Generate Standardized Git Commit Messages

This article details the design, implementation, and evaluation of an AI‑powered tool that automatically creates compliant Git commit messages by leveraging large language models, custom plugins, and performance‑focused optimizations to improve developer productivity and commit quality.

AIGitLLM
0 likes · 16 min read
How AI Can Auto‑Generate Standardized Git Commit Messages
DataFunTalk
DataFunTalk
May 8, 2024 · Artificial Intelligence

Intelligent NPCs: Infusing Soul into Game Characters with AI and the Art and Science of Deep Model Inference Acceleration

This talk explores how large‑model AI can give game NPCs personality, outlines the opportunities and challenges of intelligent NPCs, presents a case study of the "Jue Zhi An Nuan" NPC, and discusses future directions, safety compliance, and real‑time multimodal interaction solutions.

AIGame DevelopmentGame NPC
0 likes · 3 min read
Intelligent NPCs: Infusing Soul into Game Characters with AI and the Art and Science of Deep Model Inference Acceleration
Java Backend Technology
Java Backend Technology
May 8, 2024 · Artificial Intelligence

Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow

This article highlights four cutting‑edge open‑source AI initiatives—Meta’s Llama 3 large language model, the MaxKB knowledge‑base Q&A system, the Phidata framework for building AI assistants, and the RAGFlow retrieval‑augmented generation engine—detailing their capabilities, licensing, and where to access the code.

AIKnowledge BaseLLM
0 likes · 7 min read
Explore the Latest Open‑Source AI Projects: Llama 3, MaxKB, Phidata & RAGFlow
Architect
Architect
May 5, 2024 · Artificial Intelligence

The Rise of Small Language Models (SLM) and Their Impact on AI Development

Amidst a growing trend that narrows performance gaps between large and small language models, researchers highlight the efficiency, adaptability, and specialized advantages of small language models (SLM), while also discussing the high costs, hallucinations, and security concerns that still challenge large‑scale LLMs.

AI efficiencyEdge ComputingLLM
0 likes · 9 min read
The Rise of Small Language Models (SLM) and Their Impact on AI Development
Xiaohe Frontend Team
Xiaohe Frontend Team
May 5, 2024 · Artificial Intelligence

SenseNova 5.0 Takes on GPT‑4 Turbo and Other AI Breakthroughs This Week

This roundup covers SenseTime's new SenseNova 5.0 model rivaling GPT‑4 Turbo, Apple's ReALM model that outperforms GPT‑4, the free‑to‑try Meshy 3 3D generator, Lamini's $25 M funding for enterprise generative AI, and OpenAI's upcoming ChatGPT‑powered search engine challenging Google.

AIFundingGenerativeAI
0 likes · 13 min read
SenseNova 5.0 Takes on GPT‑4 Turbo and Other AI Breakthroughs This Week
Baobao Algorithm Notes
Baobao Algorithm Notes
May 5, 2024 · Artificial Intelligence

Deep Dive into Transformer Mechanics: Scaling, Q/K Projections, FFNs, and More

This article provides concise technical explanations for 25 common questions about Transformer models, covering scaled dot‑product attention scaling, separate Q/K projections, feed‑forward network design, attention variants, normalization, LoRA versus full‑parameter training, KV‑cache, pre‑ and post‑norm, computational cost analysis, and advanced position‑encoding techniques.

LLMLoRATransformer
0 likes · 25 min read
Deep Dive into Transformer Mechanics: Scaling, Q/K Projections, FFNs, and More
DataFunSummit
DataFunSummit
May 4, 2024 · Artificial Intelligence

Applications of Large Language Models in Recommendation Systems: Overview and Future Directions

This article provides a comprehensive overview of how large language models (LLMs) are integrated into recommendation systems, detailing two main paradigms—LLM as a component and LLM as a standalone system—while discussing their impact on retrieval, ranking, prompting, and outlining future research challenges such as multimodal recommendation, hallucination mitigation, bias reduction, and agent‑based approaches.

AIFuture DirectionsLLM
0 likes · 6 min read
Applications of Large Language Models in Recommendation Systems: Overview and Future Directions
AI Large Model Application Practice
AI Large Model Application Practice
May 3, 2024 · Artificial Intelligence

Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval

This article examines whether the rapid growth of large‑language‑model context windows can eliminate the need for retrieval‑augmented generation, presenting experimental needle‑in‑a‑haystack tests, analysis of model performance across token lengths and needle positions, and practical guidance using an open‑source evaluation tool.

AILLMNeedle-in-a-Haystack
0 likes · 13 min read
Can Giant Context LLMs Replace RAG? Exploring the Limits of Long‑Context Retrieval
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
May 2, 2024 · Artificial Intelligence

Understanding Large Language Models: Principles, Training, Risks, and Application Security

This article provides a comprehensive overview of large language models (LLMs), explaining their core concepts, transformer architecture, training stages, known shortcomings such as hallucination and reversal curse, and highlights emerging security threats like prompt injection and jailbreaking, offering guidance for safe deployment.

AI SafetyLLMjailbreaking
0 likes · 21 min read
Understanding Large Language Models: Principles, Training, Risks, and Application Security
21CTO
21CTO
Apr 29, 2024 · Artificial Intelligence

Fine‑Tuning vs. Context Learning: Building Apps with the Emerging LLM Tech Stack

This article explores how developers can integrate large language models into applications by comparing fine‑tuning and context learning, detailing each method’s advantages and drawbacks, and presenting a four‑layer LLM tech stack—data, model, orchestration, and operations—with practical tooling examples.

AI StackFine-tuningLLM
0 likes · 16 min read
Fine‑Tuning vs. Context Learning: Building Apps with the Emerging LLM Tech Stack
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 29, 2024 · Artificial Intelligence

Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices

This comprehensive guide explores the complexities of building enterprise‑level Retrieval‑Augmented Generation (RAG) systems, detailing common failure points, architectural components such as authentication, input guards, query rewriting, document ingestion, indexing, storage, retrieval, generation, observability, caching, and multi‑tenant considerations, and provides actionable best‑practice recommendations for developers and technical leaders.

Enterprise AILLMRAG
0 likes · 32 min read
Building Enterprise‑Grade Retrieval‑Augmented Generation (RAG) Systems: Challenges, Fault Points, and Best Practices
NewBeeNLP
NewBeeNLP
Apr 25, 2024 · Artificial Intelligence

How Apple’s OpenELM Redefines Efficient LLM Scaling with Layer‑Wise Design

Apple’s OpenELM introduces a layer‑wise scaling Transformer family ranging from 270 M to 3 B parameters, provides a full open‑source training framework, and demonstrates superior zero‑shot and few‑shot performance over existing open LLMs despite using less public data, while also analyzing inference bottlenecks and PEFT results.

LLMOpen-sourceOpenELM
0 likes · 8 min read
How Apple’s OpenELM Redefines Efficient LLM Scaling with Layer‑Wise Design
Huolala Tech
Huolala Tech
Apr 25, 2024 · Artificial Intelligence

How LLM‑Powered Multi‑Agent AI Boosts Vehicle Selection in HuoLala’s Customer Service

This article details the design and implementation of an LLM‑driven multi‑agent AI customer‑service assistant for vehicle selection at HuoLala, covering system architecture, algorithmic solutions, retrieval‑augmented generation, NLU/NLG agents, performance improvements, and future outlooks.

AI Customer ServiceLLMMulti-Agent System
0 likes · 12 min read
How LLM‑Powered Multi‑Agent AI Boosts Vehicle Selection in HuoLala’s Customer Service
21CTO
21CTO
Apr 24, 2024 · Artificial Intelligence

Microsoft’s Phi‑3 Mini: The Smallest LLM That Beats GPT‑3.5 on iPhone

Microsoft unveiled the open‑source Phi‑3 series, a lightweight family of large language models that outperform larger rivals, run offline on smartphones, and cost a fraction of comparable AI models, opening new possibilities for edge and mobile AI applications.

LLMPhi-3offline-inference
0 likes · 8 min read
Microsoft’s Phi‑3 Mini: The Smallest LLM That Beats GPT‑3.5 on iPhone
JavaEdge
JavaEdge
Apr 22, 2024 · Artificial Intelligence

Why Large Language Models Still Struggle and How to Fix Them

Large language models still suffer from limited memory, constrained context windows, outdated knowledge, inability to control external systems, and poor domain expertise, but the article outlines two main remedies—fine‑tuning (Model‑as‑a‑Service) and prompt‑engineering—detailing their mechanisms, suitable scenarios, and trade‑offs.

Artificial IntelligenceFine-tuningLLM
0 likes · 9 min read
Why Large Language Models Still Struggle and How to Fix Them
21CTO
21CTO
Apr 20, 2024 · Artificial Intelligence

How OpenAI Revitalized Microsoft: The AI Strategy Behind a Tech Giant’s Comeback

The article chronicles Microsoft’s transformation from a stagnant software behemoth to the world’s most valuable company by embracing artificial intelligence, detailing Sophia Velastegui’s pivotal role, the strategic partnership with OpenAI, Azure OpenAI services, product integrations, associated risks, and future outlook.

LLMOpenAI
0 likes · 9 min read
How OpenAI Revitalized Microsoft: The AI Strategy Behind a Tech Giant’s Comeback
dbaplus Community
dbaplus Community
Apr 19, 2024 · Backend Development

How Justine Tunney Built a Six‑OS C Web Server and Other Groundbreaking Projects

The article showcases Justine Tunney’s remarkable engineering feats—from the RedBean web server that runs the same binary on six operating systems, to the cosmopolitan libc, a 512‑byte sectorLisp, the Blinkenlights visual debugger, the RoseHub security effort, and the llamafile tool that packages large language models into a single portable executable.

LLMLispWeb server
0 likes · 9 min read
How Justine Tunney Built a Six‑OS C Web Server and Other Groundbreaking Projects
JD Tech
JD Tech
Apr 18, 2024 · Artificial Intelligence

Getting Started with LangChain: Overview, Core Components, and Python Code Samples

This article introduces the LangChain framework for large language model integration, explains its key components and advantages, and provides step‑by‑step Python examples for setting up environment variables, creating prompts, chaining models, and using embeddings, completions, and chat models.

ChatModelEmbeddingLLM
0 likes · 7 min read
Getting Started with LangChain: Overview, Core Components, and Python Code Samples
Alimama Tech
Alimama Tech
Apr 17, 2024 · Artificial Intelligence

Applying Large Language Models to Advertising Copy Generation

The article examines how large language models can streamline advertising copy creation by addressing format diversity, creativity, and new media demands, detailing model evaluation, fine‑tuning of Chinese‑adapted LLMs—ultimately selecting QWen 1.5‑7B—and showing that deployment boosts copy quality, click‑through and conversion rates while outlining future personalization and data‑efficient scaling.

AICopy GenerationFine-tuning
0 likes · 18 min read
Applying Large Language Models to Advertising Copy Generation
DaTaobao Tech
DaTaobao Tech
Apr 17, 2024 · Artificial Intelligence

Challenges and Practices of LLM‑Based Knowledge Bases and Personal Assistants

The article examines how LLM‑driven knowledge‑base QA and personal‑assistant agents struggle with context management, token limits, multimodal data, and tool‑parameter parsing, reviews open‑source frameworks such as LangChain, AutoGen and MetaGPT, and argues that fine‑tuning (e.g., LoRA) is essential for domain‑specific, scalable solutions.

AgentFine-tuningKnowledge Base
0 likes · 11 min read
Challenges and Practices of LLM‑Based Knowledge Bases and Personal Assistants
DevOps
DevOps
Apr 14, 2024 · Artificial Intelligence

Exploring the Application of Large Language Models in DevOps: Practices, Principles, and Future Prospects

This article examines how large language models (LLMs) are being integrated into DevOps workflows, detailing practical implementations, organizational adoption, efficiency‑boosting techniques, underlying principles, limitations, and future directions for software engineers seeking to leverage AI as a reliable development partner.

Artificial IntelligenceLLMautomation
0 likes · 22 min read
Exploring the Application of Large Language Models in DevOps: Practices, Principles, and Future Prospects
21CTO
21CTO
Apr 12, 2024 · Artificial Intelligence

How I Built an AI‑Powered Resume Chatbot with LLMs and RAG

Senior developer Jon Olson shares how he created an AI resume assistant using GPT‑4/3.5, LangChain, LlamaIndex, and retrieval‑augmented generation, detailing prompt engineering, backend integration, and future routing features to help job seekers showcase their skills.

AI chatbotLLMLangChain
0 likes · 8 min read
How I Built an AI‑Powered Resume Chatbot with LLMs and RAG
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 12, 2024 · Artificial Intelligence

Typical Business and Technical Architectures for Large Language Model Applications

This article reviews the common business and technical architectures used in large language model (LLM) applications, explains AI Embedded, AI Copilot, and AI Agent modes—including single‑ and multi‑agent systems—and offers guidance on selecting appropriate technology stacks such as prompt‑only, function‑calling agents, RAG, and fine‑tuning.

AI AgentFine-tuningLLM
0 likes · 9 min read
Typical Business and Technical Architectures for Large Language Model Applications
AI Large Model Application Practice
AI Large Model Application Practice
Apr 10, 2024 · Artificial Intelligence

What Is Self‑RAG? A Simple Guide to Self‑Reflective Retrieval‑Augmented Generation

This article explains the motivation behind Self‑RAG, describes its core workflow—including conditional retrieval, enhanced generation, and self‑evaluation tokens—details the four evaluation metrics (Retrieve, IsRel, IsSup, IsUse), and provides a Python scoring example using log‑probabilities.

Evaluation MetricsLLMLogprobs
0 likes · 13 min read
What Is Self‑RAG? A Simple Guide to Self‑Reflective Retrieval‑Augmented Generation
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Apr 10, 2024 · Artificial Intelligence

Early‑Stopping Self‑Consistency (ESC): Reducing Sampling Cost for Large Language Model Reasoning

Early‑Stopping Self‑Consistency (ESC) dynamically halts sampling once a sliding‑window answer distribution reaches zero entropy, cutting the number of required LLM reasoning samples by 34‑84 % across arithmetic, commonsense, and symbolic benchmarks while preserving accuracy and offering a theoretically‑bounded, robust, budget‑adaptive alternative to traditional Self‑Consistency.

AIChain-of-ThoughtEarly Stopping
0 likes · 14 min read
Early‑Stopping Self‑Consistency (ESC): Reducing Sampling Cost for Large Language Model Reasoning
HelloTech
HelloTech
Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringJavaScriptLLM
0 likes · 19 min read
An Overview of LangChain: Architecture, Core Components, and Code Examples
DaTaobao Tech
DaTaobao Tech
Apr 10, 2024 · Artificial Intelligence

Survey of Popular AI Agent Frameworks and Their Architectures

The article surveys modern open‑source AI agent frameworks, defining agents as autonomous perception‑planning‑action systems, outlining core modules (inference, memory, tools, action), comparing single‑agent designs like BabyAGI and AutoGPT with multi‑agent platforms such as MetaGPT and AutoGen, and discussing their benefits, trade‑offs, and future research directions.

AI agentsAgent FrameworksLLM
0 likes · 28 min read
Survey of Popular AI Agent Frameworks and Their Architectures
NewBeeNLP
NewBeeNLP
Apr 10, 2024 · Artificial Intelligence

What Scaling Laws Reveal About LLM Fine‑Tuning and RLHF Performance

This article reviews recent scaling‑law research on large‑language‑model fine‑tuning and RLHF, explaining how data quantity, model size, PET parameters, reward‑model size and KL‑penalty affect downstream performance and offering practical insights for efficient training.

Artificial IntelligenceLLMRLHF
0 likes · 11 min read
What Scaling Laws Reveal About LLM Fine‑Tuning and RLHF Performance
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 10, 2024 · Artificial Intelligence

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

This guide walks AI engineers through a rapid 10‑minute boot‑strap of LangChain, explaining its purpose, core concepts, design questions, environment setup, and step‑by‑step code examples that cover APIs, chains, memory, retrieval‑augmented generation, tools, agents, and the overall architecture.

AI EngineeringLLMLangChain
0 likes · 28 min read
Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering
DataFunTalk
DataFunTalk
Apr 8, 2024 · Artificial Intelligence

LLM‑Based Agents: Architecture, Key Challenges, and Future Directions

This article surveys the emerging field of large‑language‑model (LLM) based agents, detailing their modular architecture—including profiling, memory, planning, and action components—while discussing critical challenges such as role‑playing, memory design, reasoning, multi‑agent collaboration, and outlining promising research directions and practical case studies.

AI AgentAgent ArchitectureLLM
0 likes · 11 min read
LLM‑Based Agents: Architecture, Key Challenges, and Future Directions