Tagged articles
2016 articles
Page 17 of 21
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 7, 2024 · Artificial Intelligence

Mastering LLM Supervised Fine‑Tuning: Practical Tips, Data Strategies, and Debugging

This article provides a comprehensive, experience‑driven guide to supervised fine‑tuning (SFT) of large language models, covering special tokens, latency considerations, data diversity and production, training frameworks and hyper‑parameters, over‑/under‑fitting diagnostics, and evaluation metrics such as helpfulness, honesty, and harmlessness.

AILLMSFT
0 likes · 40 min read
Mastering LLM Supervised Fine‑Tuning: Practical Tips, Data Strategies, and Debugging
Fighter's World
Fighter's World
Sep 30, 2024 · Artificial Intelligence

Exploring Google NotebookLM: Use Cases, Interaction Experience, and Key Insights

The author reviews Google NotebookLM, describing how it aids deep paper reading, boosts chat willingness with guided prompts, maintains conversation coherence through self‑play insights, highlights the audio‑overview feature, and reflects on AI concepts such as the "bitter lesson" and the limits of self‑play in open scenarios.

AI researchGoogleLLM
0 likes · 22 min read
Exploring Google NotebookLM: Use Cases, Interaction Experience, and Key Insights
21CTO
21CTO
Sep 30, 2024 · Artificial Intelligence

How LLM‑Powered IDEs Can Cut Your Coding Time in Half

Using an LLM-powered IDE, the author built a full‑stack weekend project without writing a single line of code, discovering faster development cycles, new debugging habits, and the strengths and limits of AI assistants compared to traditional Google searches.

AI CodingLLMSoftware Development
0 likes · 10 min read
How LLM‑Powered IDEs Can Cut Your Coding Time in Half
JD Cloud Developers
JD Cloud Developers
Sep 29, 2024 · Artificial Intelligence

Build a Local AI Q&A System with Java, Ollama, and LangChain4J

This article walks through building a local AI question‑answer system using Java, Ollama, LangChain4J, embeddings, and a Chroma vector database, covering LLM fundamentals, embedding techniques, RAG architecture, setup steps, Maven dependencies, and sample code to retrieve and answer queries.

AIEmbeddingLLM
0 likes · 19 min read
Build a Local AI Q&A System with Java, Ollama, and LangChain4J
Architect
Architect
Sep 26, 2024 · Artificial Intelligence

Decoding OpenAI o1: How RL‑LLM Fusion Powers Next‑Gen Reasoning

This article provides a detailed technical analysis of OpenAI’s o1 model, exploring its enhanced logical reasoning, the likely use of reinforcement learning with hidden chain‑of‑thought generation, multi‑model architecture, training data pipelines, reward modeling, and how these innovations could reshape AI safety and scaling strategies.

AI SafetyChain-of-ThoughtLLM
0 likes · 43 min read
Decoding OpenAI o1: How RL‑LLM Fusion Powers Next‑Gen Reasoning
Huolala Tech
Huolala Tech
Sep 26, 2024 · Artificial Intelligence

How LLM-Powered AI Assistants Transform Logistics Operations

This article details Huolala's exploration of large‑language‑model (LLM) based AI assistants across multiple business scenarios, describing their architecture, implementation challenges, prompt engineering techniques, and the progressive stages from professional assistants to multi‑agent systems that drive efficiency and innovation in logistics.

AI AssistantLLMMulti-Agent
0 likes · 12 min read
How LLM-Powered AI Assistants Transform Logistics Operations
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 25, 2024 · Industry Insights

Decoding OpenAI o1: How RL and LLM Fuse to Power Hidden Chain‑of‑Thought

This article analytically reconstructs OpenAI o1’s architecture, training pipeline, and inference workflow, exploring its reinforcement‑learning‑enhanced hidden chain‑of‑thought, multi‑model composition, scaling laws, reward modeling, and potential implications for future AI safety and small‑model strategies.

AI SafetyHidden COTLLM
0 likes · 43 min read
Decoding OpenAI o1: How RL and LLM Fuse to Power Hidden Chain‑of‑Thought
ByteDance Data Platform
ByteDance Data Platform
Sep 25, 2024 · Artificial Intelligence

How LLMs Power the “Find Data Assistant” for Smarter Data Retrieval

This article explains how the Volcano Engine DataLeap team leveraged large‑language models to build the “Find Data Assistant”, detailing its design, challenges, embedding‑and‑reranker enhancements, LLM‑driven semantic search, mixing architecture, and practical lessons for improving data asset management and retrieval.

Data Asset ManagementData RetrievalEmbedding
0 likes · 17 min read
How LLMs Power the “Find Data Assistant” for Smarter Data Retrieval
JavaEdge
JavaEdge
Sep 24, 2024 · Artificial Intelligence

Mastering RAG with LangChain4j: From Simple Setup to Advanced Retrieval‑Augmented Generation

This article explains how to extend large language models with domain‑specific knowledge using Retrieval‑Augmented Generation (RAG) in LangChain4j, covering the concepts of RAG, its indexing and retrieval stages, simple RAG setup, detailed API usage, and advanced customization options such as query transformers and content injectors.

EmbeddingLLMLangChain4j
0 likes · 24 min read
Mastering RAG with LangChain4j: From Simple Setup to Advanced Retrieval‑Augmented Generation
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Sep 23, 2024 · Artificial Intelligence

How Large Language Models Power Multi‑Turn Dialogue for Smart Marketing

This article presents a comprehensive technical analysis of using large language models to build a task‑oriented multi‑turn dialogue system for intelligent marketing, detailing architecture, intent detection, slot extraction, prompt design, dialogue management, practical experience, and future research directions.

LLMintelligent marketingintent recognition
0 likes · 21 min read
How Large Language Models Power Multi‑Turn Dialogue for Smart Marketing
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 23, 2024 · Artificial Intelligence

Boosting Aviator Script Development with AI—No Model Training Required

This article details an engineering‑focused practice that uses large language models, RAG, prompt engineering, and reranking to automatically generate, review, and refine Aviator scripts for decision‑center policies without any model pre‑training, offering practical insights and code examples for developers.

AI code generationAviator scriptLLM
0 likes · 29 min read
Boosting Aviator Script Development with AI—No Model Training Required
JavaEdge
JavaEdge
Sep 21, 2024 · Artificial Intelligence

Understanding LLM API Types and Usage in LangChain4j

This article explains the different low‑level LLM API types in LangChain4j, including LanguageModel, ChatLanguageModel, and other model interfaces, and shows how to create and combine ChatMessage objects for multi‑turn conversations.

AI APIChatLanguageModelChatMessage
0 likes · 8 min read
Understanding LLM API Types and Usage in LangChain4j
DataFunSummit
DataFunSummit
Sep 21, 2024 · Artificial Intelligence

DataLeap "Find Data Assistant": Leveraging Large Language Models for Data Asset Retrieval and Management

This article details how the DataLeap team applied large language model technology to build the "Find Data Assistant" platform, addressing the challenges of locating and using massive data assets through a hybrid retrieval architecture, enhanced embedding, reranking, mixed ranking, and answer summarization, while sharing practical lessons and future directions.

Data Asset ManagementData RetrievalEmbedding
0 likes · 17 min read
DataLeap "Find Data Assistant": Leveraging Large Language Models for Data Asset Retrieval and Management
Senior Brother's Insights
Senior Brother's Insights
Sep 19, 2024 · Artificial Intelligence

Rule Engines vs AI Models: Choosing the Right Approach for Product Logic

The article compares traditional rule‑engine architectures with AI‑driven models, explains their differing characteristics, outlines when deterministic rule matching is preferable over flexible AI inference, and recommends practical technologies such as Drools for rule‑based solutions and LLM‑based RAG/Agent frameworks for AI‑centric scenarios.

AIDroolsLLM
0 likes · 9 min read
Rule Engines vs AI Models: Choosing the Right Approach for Product Logic
JavaEdge
JavaEdge
Sep 19, 2024 · Artificial Intelligence

Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture

LangChain4j streamlines the integration of large language models into Java applications by offering a standardized API, extensive support for over a dozen LLM providers and vector stores, a rich toolbox for RAG, chat memory, and tool calling, plus two abstraction layers that cater to both low‑level control and high‑level convenience.

AIIntegrationLLM
0 likes · 10 min read
Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture
DevOps
DevOps
Sep 13, 2024 · Artificial Intelligence

15 Advanced Retrieval‑Augmented Generation (RAG) Techniques for Production‑Ready AI Solutions

The article outlines fifteen advanced Retrieval‑Augmented Generation (RAG) techniques—from hierarchical indexing and context caching to multimodal alignment and microservice orchestration—explaining how they help transform AI prototypes into scalable, reliable production systems while highlighting common pitfalls and a concluding call to action.

AI productionLLMRAG
0 likes · 8 min read
15 Advanced Retrieval‑Augmented Generation (RAG) Techniques for Production‑Ready AI Solutions
Code Mala Tang
Code Mala Tang
Sep 12, 2024 · Artificial Intelligence

Unlocking LangChain.js: The Swiss Army Knife for LLM Applications

This article introduces LangChain.js, explains its origins, core concepts such as chats, templates, tools, and chains, demonstrates practical JavaScript code examples, and explores the LangChain Execution Language (LCEL) for building flexible, conditional AI workflows.

AI workflowLCELLLM
0 likes · 17 min read
Unlocking LangChain.js: The Swiss Army Knife for LLM Applications
Code Mala Tang
Code Mala Tang
Sep 12, 2024 · Artificial Intelligence

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

This article explains the core concepts of Retrieval‑Augmented Generation (RAG), walks through its implementation steps with LangChain.js—including text chunking, embedding, storage, retrieval, and generation—and showcases practical use cases, challenges, and best practices for building reliable AI‑powered applications.

AI applicationsEmbeddingLLM
0 likes · 16 min read
Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js
DataFunTalk
DataFunTalk
Sep 12, 2024 · Artificial Intelligence

MetaGPT: Advances in Multi‑Agent Collaboration and Agent Capability Enhancement

This article reviews MetaGPT, an open‑source multi‑agent framework that integrates human‑engineered SOPs into LLM‑based agents to improve software generation, data interpretation, and simulation tasks, highlighting its rapid community growth, experimental successes, tool integration strategies, and future research directions.

Agent CollaborationLLMMetaGPT
0 likes · 20 min read
MetaGPT: Advances in Multi‑Agent Collaboration and Agent Capability Enhancement
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 10, 2024 · Artificial Intelligence

Do LLMs Silence Human Voices? Unveiling the ‘Spiral of Silence’ in Retrieval‑Augmented Generation

This article reviews the ACL 2024 paper that investigates how large language model‑generated text influences retrieval‑augmented generation pipelines, revealing short‑term retrieval gains but a long‑term “spiral of silence” that marginalizes human‑generated content and homogenizes open‑domain QA results.

AI ImpactLLMOpen Domain QA
0 likes · 9 min read
Do LLMs Silence Human Voices? Unveiling the ‘Spiral of Silence’ in Retrieval‑Augmented Generation
Code Mala Tang
Code Mala Tang
Sep 7, 2024 · Artificial Intelligence

Unlocking LangChain.js: The Swiss Army Knife for LLM Applications

This article introduces LangChain.js, its core concepts such as chats, templates, tools, and chains, demonstrates how to use LCEL for flexible workflow composition, and shows practical JavaScript code examples for building AI-powered applications with large language models.

AI workflowLCELLLM
0 likes · 17 min read
Unlocking LangChain.js: The Swiss Army Knife for LLM Applications
iKang Technology Team
iKang Technology Team
Sep 5, 2024 · Artificial Intelligence

What Is LangChain? Overview, Core Advantages, Components, and Use Cases

LangChain is a modular framework that streamlines integration of large language models by providing unified model interfaces, prompt optimization, memory handling, indexing, chains, and agents, enabling developers to quickly build and deploy sophisticated NLP applications such as text generation, information extraction, and dynamic tool‑driven workflows across various industries.

AI FrameworkChainsLLM
0 likes · 6 min read
What Is LangChain? Overview, Core Advantages, Components, and Use Cases
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Sep 4, 2024 · Artificial Intelligence

Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning

This article introduces Kotaemon, an open‑source Retrieval‑Augmented Generation platform that lets users chat with their documents, offering a self‑hosted web UI, support for local and API LLMs, hybrid retrieval, multimodal question answering, GraphRAG indexing, and advanced reasoning capabilities, along with step‑by‑step installation via App or Docker.

GraphRAGLLMMultimodal QA
0 likes · 6 min read
Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning
AI Large Model Application Practice
AI Large Model Application Practice
Sep 4, 2024 · Artificial Intelligence

When to Use GraphRAG vs. Traditional RAG and How to Combine Them

This article compares GraphRAG with traditional RAG across seven dimensions—suitable scenarios, knowledge representation, retrieval, comprehensive queries, hidden‑relationship understanding, scalability, and performance‑cost trade‑offs—explains how they can be fused, and offers guidance on selecting the right approach for complex data‑driven applications.

Artificial IntelligenceGraphRAGLLM
0 likes · 13 min read
When to Use GraphRAG vs. Traditional RAG and How to Combine Them
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 2, 2024 · Artificial Intelligence

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

This article explains why generic large language models struggle with domain‑specific data, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares Word and PDF formats, outlines document‑parsing pipelines, reviews open‑source PDF tools, and presents Alibaba Cloud's rule‑based parsing architecture with performance results.

AIDocument ParsingLLM
0 likes · 13 min read
Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems
Data Thinking Notes
Data Thinking Notes
Sep 1, 2024 · Artificial Intelligence

Master LLMs: Basics, Prompt Engineering, RAG, Agents & Multimodal AI

This article provides a comprehensive overview of large language models, covering their fundamental concepts, historical milestones, parameter scaling, prompt engineering techniques, retrieval‑augmented generation, autonomous agents, and multimodal model applications, illustrating how these technologies reshape AI capabilities across domains.

AI agentsLLMRAG
0 likes · 22 min read
Master LLMs: Basics, Prompt Engineering, RAG, Agents & Multimodal AI
DataFunSummit
DataFunSummit
Aug 29, 2024 · Artificial Intelligence

Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations

This article details Tencent Game's end‑to‑end approach to building intelligent NPCs, covering the opportunities brought by AI, the practical implementation of multimodal LLM‑driven dialogue, knowledge‑augmented retrieval, long‑context handling, safety measures, multimodal expression (voice and facial animation), and system‑level performance optimizations for real‑time deployment.

AILLMMultimodal
0 likes · 18 min read
Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 27, 2024 · Industry Insights

What Real‑World LLM Researchers Face: Scaling Limits, Data Bottlenecks, and Deployment Challenges

The author shares a candid account of recent large‑model experiments, highlighting why most labs struggle to exceed 100 B parameters, how data and hardware constraints shape model iteration, and the practical engineering, safety, and multimodal challenges that dictate real‑world LLM deployment.

AI industryAI scalingHardware acceleration
0 likes · 6 min read
What Real‑World LLM Researchers Face: Scaling Limits, Data Bottlenecks, and Deployment Challenges
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 22, 2024 · Industry Insights

How LLMs Turned Into a Powerful Productivity Boost for Developers

The author reflects on two years of LLM work, describing how large language models shattered old NLP expertise, reshaped daily engineering tasks, leveled the playing field for newcomers, sparked a golden era of AI productivity, and why concerns about a bubble are largely irrelevant for programmers.

Artificial IntelligenceDeveloper ExperienceLLM
0 likes · 10 min read
How LLMs Turned Into a Powerful Productivity Boost for Developers
21CTO
21CTO
Aug 19, 2024 · Artificial Intelligence

How to Become an AI Agent Developer: A Step‑by‑Step Roadmap

This article explains what AI agents are, why they matter, the essential soft and hard skills needed, and provides a detailed step‑by‑step roadmap for anyone aspiring to become an AI agent developer in the emerging AI‑driven software landscape.

AI AgentBackendLLM
0 likes · 6 min read
How to Become an AI Agent Developer: A Step‑by‑Step Roadmap
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 19, 2024 · Artificial Intelligence

Ensuring Stable AI Agents: Engineering Practices, RAG, and Monitoring

This article shares engineering insights from Hema’s AI smart customer service deployment, detailing key stability factors for AI agents—including hallucination mitigation, memory integration, RAG enhancement, exception handling, and comprehensive monitoring—to improve reliability and performance in real‑world e‑commerce chatbot scenarios.

AI AgentLLMRAG
0 likes · 13 min read
Ensuring Stable AI Agents: Engineering Practices, RAG, and Monitoring
JavaEdge
JavaEdge
Aug 17, 2024 · Artificial Intelligence

Exploring LangGraph Studio: A Visual IDE for Building LLM Agents

LangGraph Studio is a new visual IDE that simplifies the development, debugging, and interactive iteration of complex LLM‑based agent applications, offering features such as graph visualization, real‑time state inspection, code‑aware debugging, and seamless integration with LangSmith, with step‑by‑step guidance for desktop users.

AI DevelopmentAgent IDELLM
0 likes · 8 min read
Exploring LangGraph Studio: A Visual IDE for Building LLM Agents
Meituan Technology Team
Meituan Technology Team
Aug 15, 2024 · Artificial Intelligence

Meituan's Exploration and Practice in Advertising Algorithm: Information Flow Ad Estimation

This article details Meituan Waimai's feed advertising system, covering business characteristics, the evolution of estimation models, and practical implementations such as decision‑path modeling, ultra‑long/wide user modeling, full‑reconstruction techniques, and the integration of large language models for CTR prediction.

CTR estimationLLMMeituan
0 likes · 22 min read
Meituan's Exploration and Practice in Advertising Algorithm: Information Flow Ad Estimation
DataFunSummit
DataFunSummit
Aug 15, 2024 · Artificial Intelligence

Building an LLM‑Driven Metric Platform for Data Democratization

This article explains how large language models (LLMs) can launch data democratization by constructing a metric platform that combines LLM agents, semantic layers, NL2SQL/NL2API pipelines, warehouse‑internal and external semantics, and showcases SwiftAgent/SwiftMetrics innovations, real‑world case studies, and future directions.

Big DataData DemocratizationLLM
0 likes · 13 min read
Building an LLM‑Driven Metric Platform for Data Democratization
21CTO
21CTO
Aug 11, 2024 · Artificial Intelligence

Demystifying LLMs: How Tokens, Training, and Transformers Power Generative AI

This article explains the fundamentals of large language models, covering tokenization, probability prediction, Markov chain basics, training data limitations, context windows, and the transition to neural network architectures like Transformers, while providing Python examples and insights into model scaling and the illusion of intelligence.

AILLMNeural Networks
0 likes · 18 min read
Demystifying LLMs: How Tokens, Training, and Transformers Power Generative AI
Architect
Architect
Aug 11, 2024 · Artificial Intelligence

Understanding Large Language Models: Tokens, Tokenization, and the Evolution from Markov Chains to Transformers

This article explains how generative AI models work by demystifying tokens, tokenization with tools like tiktoken, simple Markov‑chain training, the limitations of small context windows, and how modern LLMs use neural networks, transformers and attention mechanisms to predict the next token.

LLMMarkov chainTransformer
0 likes · 20 min read
Understanding Large Language Models: Tokens, Tokenization, and the Evolution from Markov Chains to Transformers
DataFunSummit
DataFunSummit
Aug 10, 2024 · Artificial Intelligence

Leveraging Large Language Models for Graph Recommendation System Optimization

This article reviews cutting‑edge research on integrating large language models with graph‑based recommendation systems, detailing four key strategies—LLM node embeddings, deep graph‑LLM fusion, model‑driven graph data training, and text‑modal enhancements—while analyzing representation learning, InfoNCE optimization, explainable recommendations, and extensive experimental validation.

InfoNCELLMexplainability
0 likes · 18 min read
Leveraging Large Language Models for Graph Recommendation System Optimization
JavaEdge
JavaEdge
Aug 9, 2024 · Artificial Intelligence

Build a Graph‑Based LLM Agent with LangGraph: Step‑by‑Step Tutorial

This article introduces LangGraph, a Python library for creating stateful, multi‑agent LLM workflows, explains its loop, persistence, and human‑in‑the‑loop features, shows how to install it, and provides a complete code example that builds, runs, and reuses a searchable AI agent with thread‑level state saving.

AILLMLangChain
0 likes · 10 min read
Build a Graph‑Based LLM Agent with LangGraph: Step‑by‑Step Tutorial
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 9, 2024 · Artificial Intelligence

Testing 1M‑Token LLMs with a Novel Medal‑Insertion Benchmark

The article presents a practical method for evaluating 1‑million‑token LLMs by inserting structured medal data into a classic Chinese novel, provides a full Python script for the test, shares results on GLM‑4‑long, and discusses training techniques and open‑source resources for long‑context models.

AILLMPython
0 likes · 10 min read
Testing 1M‑Token LLMs with a Novel Medal‑Insertion Benchmark
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Aug 8, 2024 · Artificial Intelligence

MegaParse: A Precision Document Parser Built for LLMs

MegaParse is an open‑source document parser that transforms PDFs, Word, PPT, Excel and CSV files into LLM‑friendly formats, preserving full information, boosting processing efficiency, and enabling deeper semantic analysis, with quick‑start installation steps and a roadmap for future features.

AI toolsDocument ParsingLLM
0 likes · 4 min read
MegaParse: A Precision Document Parser Built for LLMs
DaTaobao Tech
DaTaobao Tech
Aug 7, 2024 · Artificial Intelligence

Overview of Large Model Development, AIGC Practices, and Prompt Engineering

The article surveys the rapid emergence of large AI models and AIGC, explains core concepts like AI, AGI, and LLMs, details prompt‑engineering techniques such as chain‑of‑thought, outlines a seven‑layer AIGC stack, discusses technical and ethical challenges, and highlights future multimodal and industry‑specific applications.

AIAIGCLLM
0 likes · 25 min read
Overview of Large Model Development, AIGC Practices, and Prompt Engineering
NewBeeNLP
NewBeeNLP
Aug 7, 2024 · Artificial Intelligence

Can Intuitive Fine‑Tuning Replace Expensive RLHF and DPO for LLM Alignment?

This article analyses the shortcomings of current large language model training methods such as SFT, RLHF and DPO, explains why they incur high data and compute costs, and introduces Intuitive Fine‑Tuning (IFT) with temporal residual connections as a cheaper yet effective alternative that better aligns training objectives with real generation tasks.

DPOIntuitive Fine-TuningLLM
0 likes · 15 min read
Can Intuitive Fine‑Tuning Replace Expensive RLHF and DPO for LLM Alignment?
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 6, 2024 · Artificial Intelligence

How an AI-Powered Bot Turns Excel Files into Interactive Reports

This article introduces an AI‑driven Smart Report Assistant Bot that automatically converts uploaded Excel files into recommended charts, allows users to customize reports, and details the underlying workflow—including Excel parsing, LLM‑generated SQL, dynamic table creation, chart rendering with ECharts, and image‑merging plugins.

AIBotECharts
0 likes · 8 min read
How an AI-Powered Bot Turns Excel Files into Interactive Reports
JD Tech Talk
JD Tech Talk
Aug 5, 2024 · Artificial Intelligence

An Introduction to STORM: An LLM‑Powered Knowledge Management System for Automated Research and Writing

This article introduces STORM, a Stanford‑developed large‑language‑model‑based knowledge‑management platform that automates topic research, outline generation, citation‑rich article writing, and iterative refinement through perspective‑guided questioning and simulated conversations, dramatically improving technical investigation efficiency.

AI toolsLLMStorm
0 likes · 7 min read
An Introduction to STORM: An LLM‑Powered Knowledge Management System for Automated Research and Writing
JD Cloud Developers
JD Cloud Developers
Aug 5, 2024 · Artificial Intelligence

How STORM Uses LLMs to Automate Technical Research and Writing

This article introduces STORM, a Stanford‑developed LLM‑based knowledge‑management system that automates topic research, outline creation, content generation with citations, and optimization through perspective‑guided questioning and simulated dialogue, dramatically improving technical investigation efficiency.

AI writingLLMautomated research
0 likes · 5 min read
How STORM Uses LLMs to Automate Technical Research and Writing
Alibaba Cloud Native
Alibaba Cloud Native
Aug 2, 2024 · Cloud Native

How to Build an AI‑Native API Gateway with Higress: ChatGPT‑Next‑Web, RAG, Token Limits & More

This guide walks through creating a full‑featured AI‑native API gateway using Higress, covering architecture setup, AI agent integration, observability, content security, token rate limiting, caching, retrieval‑augmented generation, prompt templates, and intelligent request/response transformation with concrete configuration examples.

AILLMToken Limiting
0 likes · 11 min read
How to Build an AI‑Native API Gateway with Higress: ChatGPT‑Next‑Web, RAG, Token Limits & More
Open Source Tech Hub
Open Source Tech Hub
Jul 31, 2024 · Artificial Intelligence

Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges

This article explains the fundamentals of large language models, artificial general intelligence, AI-generated content, AI agents, retrieval‑augmented generation, knowledge bases, multimodal processing, fine‑tuning, alignment, tokens, vectors, and related tools, highlighting their capabilities, limitations, and practical considerations.

AI AgentArtificial IntelligenceFine-tuning
0 likes · 14 min read
Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges
NewBeeNLP
NewBeeNLP
Jul 31, 2024 · Artificial Intelligence

How Continual Pre‑Training Boosts Llama‑3’s Chinese and Scientific Reasoning

This report presents a continual pre‑training approach that significantly enhances Llama‑3 (8B)’s Chinese language proficiency and scientific reasoning by using a carefully mixed corpus of existing and synthetic data, detailing the bilingual adaptation and synthetic‑enhancement stages, data‑mixing and curriculum strategies, and demonstrating strong results across multilingual and scientific benchmarks without sacrificing original capabilities.

BenchmarkingLLMLlama-3
0 likes · 9 min read
How Continual Pre‑Training Boosts Llama‑3’s Chinese and Scientific Reasoning
DataFunSummit
DataFunSummit
Jul 29, 2024 · Artificial Intelligence

Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions

This article reviews the state‑of‑the‑art applications of large language models in recommendation systems, summarizing background knowledge, recent advances such as LLM4Rec, various tuning strategies, agent‑based approaches, open research problems, and future directions for generative recommendation.

AIIn-Context LearningLLM
0 likes · 24 min read
Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 25, 2024 · Artificial Intelligence

Designing Autonomous LLM Agents: Architecture, Memory, Planning, and Learning Strategies

This article surveys the design of autonomous large‑language‑model agents, detailing their modular architecture—including profiling, memory, planning, and execution—while also reviewing common profiling methods, memory structures, planning techniques, action strategies, and various learning approaches such as exemplar, human‑in‑the‑loop, and environment‑feedback training.

AIAgent ArchitectureAutonomous Agents
0 likes · 36 min read
Designing Autonomous LLM Agents: Architecture, Memory, Planning, and Learning Strategies
phodal
phodal
Jul 24, 2024 · Artificial Intelligence

How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow

This article explains how to use the Shire language to create reliable coding agents by defining custom RAG workflows, leveraging IDE APIs, code verification functions, and vector‑based search, with detailed examples, configuration snippets, and a roadmap for future enhancements.

AICoding AgentIDE
0 likes · 10 min read
How to Build Trustworthy Coding Agents with Shire’s Custom RAG Workflow
Alibaba Cloud Native
Alibaba Cloud Native
Jul 24, 2024 · Cloud Native

How to Observe and Optimize LLM Applications with Alibaba Cloud ARMS

This article explains the challenges of deploying large language model (LLM) applications, outlines the need for end‑to‑end observability, and details Alibaba Cloud ARMS' LLM‑specific tracing, metrics, and Python agent solutions for monitoring, debugging, and performance optimization.

AILLMMetrics
0 likes · 20 min read
How to Observe and Optimize LLM Applications with Alibaba Cloud ARMS
DataFunSummit
DataFunSummit
Jul 24, 2024 · Artificial Intelligence

Overview of Large Language Model‑Based AI Agents: Architecture, Challenges, and Future Directions

This article reviews the emerging field of large language model‑based AI agents, outlining their overall architecture, key challenges such as role‑playing, memory, planning, and multi‑agent collaboration, and discusses future research directions and practical examples in user behavior simulation and software development.

AI agentsLLMMemory Mechanisms
0 likes · 11 min read
Overview of Large Language Model‑Based AI Agents: Architecture, Challenges, and Future Directions
21CTO
21CTO
Jul 23, 2024 · Artificial Intelligence

How AI Coding Assistants Are Redefining Software Development

This article explores how large language model‑powered coding assistants boost developer productivity, shift the role of engineers toward higher‑level design and problem‑solving, and raise new responsibilities for code safety, performance, and ethical use in the evolving software development paradigm.

AI CodingLLMSoftware Development
0 likes · 11 min read
How AI Coding Assistants Are Redefining Software Development
Model Perspective
Model Perspective
Jul 23, 2024 · Artificial Intelligence

Building Your Own AI Agent with LangChain: A Hands‑On Guide

This article walks through the author’s experience creating a custom AI agent using LangChain and OpenAI APIs, explains the concepts of AI agents and the ReAct reasoning framework, provides step‑by‑step code, discusses required libraries and APIs, and shares practical tips and challenges encountered.

AI AgentLLMLangChain
0 likes · 16 min read
Building Your Own AI Agent with LangChain: A Hands‑On Guide
DataFunSummit
DataFunSummit
Jul 22, 2024 · Artificial Intelligence

From BERT to LLM: Language Model Applications in 360 Advertising Recommendation

This talk explores how 360's advertising recommendation system leverages language models—from BERT to large‑scale LLMs—to improve user interest modeling, feature extraction, and conversion‑rate prediction, detailing practical challenges, engineering solutions, experimental results, and future research directions.

AdvertisingBERTLLM
0 likes · 18 min read
From BERT to LLM: Language Model Applications in 360 Advertising Recommendation
JD Tech
JD Tech
Jul 22, 2024 · Artificial Intelligence

Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models

This article presents Task‑aware Decoding (TaD), a plug‑and‑play technique introduced by JD Tech and Tsinghua University and accepted at IJCAI 2024, which reduces intrinsic hallucinations in large language models by comparing pre‑ and post‑fine‑tuning outputs, and demonstrates its effectiveness combined with Retrieval‑Augmented Generation across various tasks.

Fine-tuningLLMRetrieval Augmented Generation
0 likes · 18 min read
Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models
DevOps
DevOps
Jul 21, 2024 · Artificial Intelligence

LLM Fundamentals, Applications, Prompt Engineering, RAG, and Agentic Workflows

This article provides a comprehensive overview of large language models (LLMs), covering their basic concepts, relationship with NLP, development history, parameter scaling, offline deployment, practical applications, prompt‑engineering frameworks, retrieval‑augmented generation, LangChain integration, agents, workflow orchestration, and future directions toward multimodal AI and AGI.

AI applicationsAgentArtificial Intelligence
0 likes · 36 min read
LLM Fundamentals, Applications, Prompt Engineering, RAG, and Agentic Workflows
21CTO
21CTO
Jul 21, 2024 · Artificial Intelligence

How JetBrains AI Boosts Code Completion and Refactoring in Rider

This article reviews JetBrains AI, an LLM‑powered assistant for JetBrains IDEs, exploring its code‑completion, code‑explanation, unit‑test generation, and refactoring capabilities through real C# examples and discussing its impact on developer workflows.

IDEJetBrains AILLM
0 likes · 8 min read
How JetBrains AI Boosts Code Completion and Refactoring in Rider
DataFunTalk
DataFunTalk
Jul 20, 2024 · Artificial Intelligence

Exploring and Applying Large Language Models in Recommendation Systems

The talk by Huawei Noah's Ark Lab researcher Wang Yichao presents a comprehensive exploration of large language models (LLMs) for recommendation systems, covering background challenges, the KAR and Uni-CTR projects, experimental results, and future directions for open‑world, generative recommendation pipelines.

KARLLMUni-CTR
0 likes · 13 min read
Exploring and Applying Large Language Models in Recommendation Systems
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Jul 20, 2024 · Artificial Intelligence

Beyond RAG: How Mem0 Gives Large Language Models Super Memory for Personalized AI Apps

Mem0 is an open‑source memory‑management middleware for large language models that provides dynamic, context‑aware, and adaptive memory, outperforming traditional Retrieval‑Augmented Generation (RAG) and enabling personalized AI assistants, travel planners, and support agents with concrete Python APIs and examples.

AI agentsLLMMem0
0 likes · 9 min read
Beyond RAG: How Mem0 Gives Large Language Models Super Memory for Personalized AI Apps
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 18, 2024 · Artificial Intelligence

Why AI Agents Are the Next Frontier of Intelligent Systems

This article surveys the rapid rise of AI agents powered by large language models, explaining their core perception‑planning‑action loop, memory architectures, tool‑use mechanisms, self‑reflection techniques, and real‑world case studies while highlighting current challenges and future prospects for autonomous intelligent systems.

AI AgentLLMTool Use
0 likes · 29 min read
Why AI Agents Are the Next Frontier of Intelligent Systems
Tencent Cloud Developer
Tencent Cloud Developer
Jul 18, 2024 · Artificial Intelligence

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

Exploring Large Language Models, this article surveys their core concepts, evolution through Transformers, GPT and BERT, generation challenges, diverse applications such as QA, multimodal creation, summarization and retrieval‑augmented generation, prompt‑engineering frameworks and tools, LangChain‑based pipelines, AI‑driven agents, and future prospects toward domain‑specific use, multimodality, and AGI.

AIAgentLLM
0 likes · 35 min read
Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions
DataFunSummit
DataFunSummit
Jul 17, 2024 · Artificial Intelligence

Overview of LLM‑Based Agents: Architecture, Key Challenges, and Future Directions

This article reviews the emerging field of large‑language‑model (LLM) based AI agents, outlining their overall architecture, core modules such as profiling, memory, planning and action, discussing current challenges, presenting concrete use‑cases, and highlighting promising research directions.

AI AgentAgent ArchitectureLLM
0 likes · 11 min read
Overview of LLM‑Based Agents: Architecture, Key Challenges, and Future Directions
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 17, 2024 · Artificial Intelligence

How Alibaba Cloud Built Service‑Domain AI Agents: Design, Practice, and Results

This article explains how Alibaba Cloud designed and deployed large‑language‑model agents for its service domain, covering background, ideal LLM deployment, the shift from explanation to problem solving, the agent framework, practical implementation, automation trade‑offs, training, evaluation, and real‑world impact.

AI AgentAlibaba CloudCustomer Service Automation
0 likes · 20 min read
How Alibaba Cloud Built Service‑Domain AI Agents: Design, Practice, and Results
Java Tech Enthusiast
Java Tech Enthusiast
Jul 16, 2024 · Artificial Intelligence

LLMs Misjudge Simple Number Comparison: 9.11 vs 9.9

Recent tests reveal that popular large language models—including GPT‑4o, Gemini Advanced, and Claude 3.5—often claim 9.11 is larger than 9.9 because their tokenizers split the numbers, but rephrasing, zero‑shot chain‑of‑thought prompts, or treating the values as floating‑point numbers can correct the mistake, a pattern also seen variably in Chinese models.

AI EvaluationLLMnumeric comparison
0 likes · 7 min read
LLMs Misjudge Simple Number Comparison: 9.11 vs 9.9
JD Tech Talk
JD Tech Talk
Jul 16, 2024 · Artificial Intelligence

Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models

TaD, a task‑aware decoding technique jointly developed by JD.com and Tsinghua University and presented at IJCAI 2024, leverages differences between pre‑ and post‑fine‑tuned LLM outputs to construct knowledge vectors, significantly reducing hallucinations across various models, tasks, and data‑scarce scenarios, especially when combined with RAG.

AILLMRAG
0 likes · 18 min read
Task‑Aware Decoding (TaD): A Plug‑and‑Play Method to Mitigate Hallucinations in Large Language Models
Alibaba Cloud Native
Alibaba Cloud Native
Jul 15, 2024 · Cloud Native

How AI-Driven Gateways Are Evolving to Meet LLM Demands

The article examines how AI-era large language model (LLM) applications impose new traffic, security, and scalability requirements on gateways, and explains how the Envoy‑based open‑source Higress gateway addresses these challenges with hot configuration updates, token‑based rate limiting, streaming support, and multi‑tenant capabilities.

AIInfraLLM
0 likes · 19 min read
How AI-Driven Gateways Are Evolving to Meet LLM Demands
Architect
Architect
Jul 13, 2024 · Artificial Intelligence

Practical Guide to Building LLM Products: Prompt Engineering, RAG, Evaluation, and Operations

This article provides a comprehensive, step‑by‑step guide for developing large‑language‑model (LLM) applications, covering prompt design techniques, n‑shot and chain‑of‑thought strategies, retrieval‑augmented generation, structured I/O, workflow optimization, evaluation pipelines, operational best practices, and team organization to create reliable, scalable AI products.

AI OperationsLLMProduct Development
0 likes · 54 min read
Practical Guide to Building LLM Products: Prompt Engineering, RAG, Evaluation, and Operations
DataFunSummit
DataFunSummit
Jul 10, 2024 · Artificial Intelligence

Applying Large Language Models to Recommendation Systems at Ant Group

The article presents Ant Group's research on integrating large language models into recommendation pipelines, covering background challenges, knowledge extraction, teacher‑model distillation, efficient deployment, experimental results, and future directions to improve accuracy and reduce bias.

AILLMmodel distillation
0 likes · 13 min read
Applying Large Language Models to Recommendation Systems at Ant Group
JD Tech
JD Tech
Jul 10, 2024 · Artificial Intelligence

Implementing Retrieval‑Augmented Generation (RAG) with LangChain4j in Java

This article provides a step‑by‑step guide for Java engineers on building a Retrieval‑Augmented Generation (RAG) application using the LangChain4j framework, covering RAG fundamentals, environment setup, Maven integration, document loading, splitting, embedding with OpenAI, vector store management with Chroma, and prompt‑based LLM interaction.

EmbeddingLLMRAG
0 likes · 35 min read
Implementing Retrieval‑Augmented Generation (RAG) with LangChain4j in Java
NewBeeNLP
NewBeeNLP
Jul 8, 2024 · Artificial Intelligence

How LLMs Transform Recommendation Systems: The LEARN Framework Explained

This article reviews the Kuaishou paper on adapting large language models for recommendation, detailing the LEARN framework's dual‑tower architecture, embedding generation, loss functions, and experimental results that address cold‑start and long‑tail challenges in modern recommender systems.

InfoNCELLMLong Tail
0 likes · 8 min read
How LLMs Transform Recommendation Systems: The LEARN Framework Explained
21CTO
21CTO
Jul 7, 2024 · Artificial Intelligence

How to Build a Secure Local LLM Chatbot with Ollama, Python, and ChromaDB

This tutorial walks you through creating a privacy‑preserving, locally hosted large language model chatbot using Ollama, Python 3, and ChromaDB, covering RAG fundamentals, GPU selection, environment setup, and full source code for a Flask‑based application.

ChromaDBLLMOllama
0 likes · 19 min read
How to Build a Secure Local LLM Chatbot with Ollama, Python, and ChromaDB
Architecture Development Notes
Architecture Development Notes
Jul 5, 2024 · Artificial Intelligence

Simplify Multi‑LLM Integration in Rust with the genai Library

genai is a Rust library that unifies the APIs of major large language models, offering a lightweight, native solution with simple chat‑focused examples, demonstrated through a dual‑model Rust program, and outlines future expansions such as additional model support, multimodal capabilities, and performance optimizations.

APIArtificial IntelligenceGenAI
0 likes · 6 min read
Simplify Multi‑LLM Integration in Rust with the genai Library
JD Tech
JD Tech
Jul 5, 2024 · Artificial Intelligence

Generative Recommendation Systems for JD Alliance Advertising: Architecture, Implementation, and Experimental Evaluation

This article surveys how large language models reshape recommendation systems, presents a generative RS framework tailored for JD Alliance advertising, details material representation, model input, training and inference pipelines, and reports extensive offline and online experiments demonstrating its effectiveness on sparse user data.

Generative RecommendationLLMe-commerce advertising
0 likes · 27 min read
Generative Recommendation Systems for JD Alliance Advertising: Architecture, Implementation, and Experimental Evaluation