Tagged articles
2011 articles
Page 20 of 21
Ximalaya Technology Team
Ximalaya Technology Team
Oct 18, 2023 · Artificial Intelligence

The Evolution of AI Agents: From Philosophy to Modern Implementations

Tracing AI agents from Aristotle’s and Zhuangzi’s philosophical notions through the coining of “agent” in computer science to today’s learning‑based systems powered by large language models, the article outlines key milestones, core components—LLM brain, memory, planning, tool use—and showcases applications such as AlphaGo, Siri, and autonomous platforms, while forecasting their expanding, industry‑wide ubiquity.

AI agentsGenerative AgentsHistory of AI
0 likes · 21 min read
The Evolution of AI Agents: From Philosophy to Modern Implementations
Model Perspective
Model Perspective
Oct 15, 2023 · Artificial Intelligence

How to Use Large Language Models Ethically in Math Modeling Contests

COMAP’s new policy outlines why and how teams in mathematical modeling competitions should responsibly employ large language models and generative AI, detailing guiding principles, risks, citation requirements, and ethical considerations to ensure fairness, transparency, and academic integrity.

AI policyLLMacademic integrity
0 likes · 9 min read
How to Use Large Language Models Ethically in Math Modeling Contests
dbaplus Community
dbaplus Community
Oct 14, 2023 · Artificial Intelligence

Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot

This guide explains the Retrieval‑Augmented Generation (RAG) technique, detailing how user queries are matched to private knowledge bases, how relevant passages are retrieved, and how large language models use those passages to generate context‑aware answers, complete with code examples and practical tips.

ChatbotEmbeddingLLM
0 likes · 19 min read
Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot
21CTO
21CTO
Oct 12, 2023 · Frontend Development

How Vercel’s AI‑Powered v0 Tool Is Transforming Frontend Development

Vercel has launched v0, an AI‑driven tool that lets developers describe desired UI components in plain text and receive generated frontend code, streamlining creation, offering multiple design options, and shifting developer focus toward creativity and design.

AILLMVercel
0 likes · 4 min read
How Vercel’s AI‑Powered v0 Tool Is Transforming Frontend Development
Zhuanzhuan Tech
Zhuanzhuan Tech
Oct 11, 2023 · Artificial Intelligence

Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering

This article describes how to construct an intelligent customer‑service assistant using ChatGPT for natural‑language understanding, BERT for user‑question classification, and Sentence‑BERT for knowledge‑selection, detailing system architecture, prompt design, model training, performance results, and practical cost reductions.

BERTChatGPTIntelligent Customer Service
0 likes · 16 min read
Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering
ByteFE
ByteFE
Oct 11, 2023 · Artificial Intelligence

CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base

This article describes the design and implementation of a code‑review assistant powered by open‑source large language models and a privately hosted knowledge base, covering background, pain points, system architecture, model selection, vector‑store integration, prompt engineering, diff parsing, and practical reflections.

AICode reviewKnowledge Base
0 likes · 24 min read
CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base
DataFunTalk
DataFunTalk
Oct 10, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article surveys how large language models can be incorporated into recommender systems, discussing their strengths and limitations, outlining where and how they can be applied across the recommendation pipeline, presenting recent research examples, and highlighting challenges and future directions for industrial deployment.

LLMfeature engineeringrecommender systems
0 likes · 20 min read
Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 10, 2023 · Artificial Intelligence

Create a Custom Enterprise Conversational Search with Alibaba Cloud OpenSearch Vector & LLM

This guide walks you through setting up Alibaba Cloud OpenSearch Vector Search and LLM Intelligent Q&A editions, covering environment preparation, instance creation, data source configuration, field and index setup, document ingestion, query processing, and a complete Java SDK demo for building a flexible enterprise conversational search system.

Alibaba CloudConversational AIJava SDK
0 likes · 20 min read
Create a Custom Enterprise Conversational Search with Alibaba Cloud OpenSearch Vector & LLM
Baidu Geek Talk
Baidu Geek Talk
Oct 9, 2023 · Artificial Intelligence

Code Understanding Technology: Building White-Box Software Knowledge Graph at Baidu

Baidu’s white‑box code understanding platform combines static, dynamic, non‑code and LLM‑based analyses in a three‑layer architecture that accelerates C/C++ processing ninefold, supports multiple languages, and powers applications such as intelligent unit testing, orphan‑function cleanup and AI‑driven risk detection, while future integration with models like GPT‑4 aims to enable multi‑turn code Q&A, automated refactoring and predictive testing.

ASTBaiduCode Understanding
0 likes · 15 min read
Code Understanding Technology: Building White-Box Software Knowledge Graph at Baidu
Baobao Algorithm Notes
Baobao Algorithm Notes
Oct 8, 2023 · Interview Experience

Must‑Know Large‑Model Interview Questions for RLHF Candidates

The article shares a practitioner’s transition story from reinforcement‑learning‑focused game AI to large‑model work, outlines the challenges faced during job hunting at major Chinese tech firms, and provides a curated list of 23 technical interview questions covering PPO, RLHF, dataset evaluation, model fine‑tuning, and broader LLM concepts.

AI researchInterview PreparationLLM
0 likes · 10 min read
Must‑Know Large‑Model Interview Questions for RLHF Candidates
21CTO
21CTO
Oct 4, 2023 · Artificial Intelligence

How LangStream Merges Data Streams with Generative AI for Real‑Time LLM Apps

LangStream, the new open‑source framework from DataStax, combines event‑driven data streaming with generative AI, offering seamless integration with vector databases like Astra DB, Milvus, and Pinecone, and providing a Kubernetes‑based runtime that enables real‑time LLM applications without extensive coding.

Data StreamingKubernetesLLM
0 likes · 7 min read
How LangStream Merges Data Streams with Generative AI for Real‑Time LLM Apps
AI Large Model Application Practice
AI Large Model Application Practice
Oct 4, 2023 · Artificial Intelligence

Controlling LLM‑Based AI Agents with the Open‑Source ‘Agents’ Framework

This article introduces the experimental open‑source project ‘Agents’, explains common challenges of LLM‑based AI agents, compares it with tools like AutoGPT, LangChain and MetaGPT, and demonstrates how its configuration‑driven SOP approach enables more controllable, multi‑agent interactions and easier deployment.

ConfigurationLLMMulti-Agent
0 likes · 14 min read
Controlling LLM‑Based AI Agents with the Open‑Source ‘Agents’ Framework
HomeTech
HomeTech
Sep 26, 2023 · Artificial Intelligence

Integrating Large Language Models with Search for Automotive Knowledge Retrieval

This article explores how combining traditional keyword search with large language models (LLMs) enhances understanding of user intent, builds a robust automotive knowledge base, and delivers more accurate, context‑aware answers through a multi‑stage retrieval and generation pipeline.

AIKnowledge BaseLLM
0 likes · 17 min read
Integrating Large Language Models with Search for Automotive Knowledge Retrieval
AI Large Model Application Practice
AI Large Model Application Practice
Sep 25, 2023 · Artificial Intelligence

How LangSmith Turns LLM Debugging, Testing, and Production Monitoring into a Seamless Workflow

This article explores LangSmith, the experimental platform from the creators of LangChain, detailing how it tracks complex LLM reasoning, supports batch testing and evaluation of AI applications, and offers a community Hub for sharing prompts and chains, ultimately helping move LLM projects from prototype to production.

AI testingDebuggingLLM
0 likes · 10 min read
How LangSmith Turns LLM Debugging, Testing, and Production Monitoring into a Seamless Workflow
phodal
phodal
Sep 24, 2023 · Artificial Intelligence

Designing a JVM‑Based LLM Framework: Insights from Chocolate Factory

This article explores the design principles, architectural decisions, and practical code examples behind the Chocolate Factory framework, a JVM‑centric LLM development platform inspired by LangChain, LlamaIndex, Spring AI, and PromptFlow, highlighting SDK construction, RAG workflows, and prompt engineering challenges.

AI DevelopmentFrameworkJVM
0 likes · 11 min read
Designing a JVM‑Based LLM Framework: Insights from Chocolate Factory
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Sep 22, 2023 · Artificial Intelligence

Understanding Large Language Models and Prompt Engineering: A Practical Guide

This article provides an introductory overview of large language models (LLMs), compares popular models, explains their underlying principles, and offers practical guidance on prompt engineering, model evaluation, usage tips, and safety considerations, helping readers effectively select and apply LLMs in various scenarios.

AILLMModel Evaluation
0 likes · 44 min read
Understanding Large Language Models and Prompt Engineering: A Practical Guide
DataFunSummit
DataFunSummit
Sep 22, 2023 · Artificial Intelligence

Exploring Game AI Agents: Review, LLM‑Driven Exploration, and Future Directions

This article reviews the evolution of game AI agents, examines how large language models (LLMs) can drive new AI behaviors in games, and discusses practical case studies across genres such as Werewolf‑style, war‑SLG, and MOBA games, concluding with challenges and future research directions.

AI agentsGame DevelopmentLLM
0 likes · 31 min read
Exploring Game AI Agents: Review, LLM‑Driven Exploration, and Future Directions
21CTO
21CTO
Sep 21, 2023 · Artificial Intelligence

Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?

This article compares the open‑source large language models Falcon 180B and Meta’s Llama 2, detailing their parameter sizes, training data, licensing, variants, infrastructure, language support, and safety policies, while providing links to official resources and a side‑by‑side feature table.

AI comparisonFalcon 180BLLM
0 likes · 8 min read
Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?
Ximalaya Technology Team
Ximalaya Technology Team
Sep 18, 2023 · Artificial Intelligence

Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts

The article surveys the rise of LLM‑powered AI agents, defining them as LLM + memory + planning + tool use, contrasting fully autonomous agents with human‑guided autopilot/copilot variants, outlining their benefits, risks such as hallucinations and unsafe actions, and urging modular frameworks and oversight for reliable enterprise deployment.

AI agentsAgent FrameworkLLM
0 likes · 27 min read
Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts
Yunxuetang Frontend Team
Yunxuetang Frontend Team
Sep 15, 2023 · Frontend Development

Front-End Insights: Architecture, Code Review, Storage, and New Tools

This article explores front‑end architecture decisions between Vue and React, shares a comprehensive code‑review methodology from 13 years at Tencent, evaluates the most robust local storage solutions for large offline data, details two screen‑adaptation approaches for Vue 3/Vite, and introduces emerging technologies such as LangChain’s RAG and Agents and the high‑performance Bun 1.0 JavaScript runtime.

BunCode reviewLLM
0 likes · 4 min read
Front-End Insights: Architecture, Code Review, Storage, and New Tools
DaTaobao Tech
DaTaobao Tech
Sep 13, 2023 · Artificial Intelligence

Integrating Large Language Models with Recommendation Systems: Paradigms, Methods, and Experiments

The article surveys how large language models can be integrated into recommendation systems, either as feature extractors or as end‑to‑end recommenders, showing that LLM‑derived semantics improve recall, ranking, diversity, and user experience, and outlining future multimodal, efficiency, and re‑ranking directions.

EmbeddingLLMPrompt engineering
0 likes · 19 min read
Integrating Large Language Models with Recommendation Systems: Paradigms, Methods, and Experiments
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 13, 2023 · Artificial Intelligence

Pai‑Megatron‑Patch: Design Principles, Key Features, and End‑to‑End Usage for Large Language Model Training

This article introduces the open‑source Pai‑Megatron‑Patch tool from Alibaba Cloud, explains its non‑intrusive patch architecture, enumerates supported models and features such as weight conversion, Flash‑Attention 2.0, FP8 training with Transformer Engine, and provides detailed command‑line examples for model conversion, pre‑training, supervised fine‑tuning, inference, and RLHF reinforcement learning pipelines.

Deep LearningFP8LLM
0 likes · 19 min read
Pai‑Megatron‑Patch: Design Principles, Key Features, and End‑to‑End Usage for Large Language Model Training
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 12, 2023 · Artificial Intelligence

Why RTX 4090 Beats H100 for LLM Inference but Fails at Training

The article analyses the performance, memory, bandwidth and cost of NVIDIA H100, A100 and RTX 4090 GPUs, explains why the 4090 cannot handle large‑model training due to communication and memory limits, and shows how its high compute‑to‑price ratio makes it attractive for inference, backed by detailed parallelism calculations and cost‑per‑token estimates.

CostGPULLM
0 likes · 46 min read
Why RTX 4090 Beats H100 for LLM Inference but Fails at Training
Continuous Delivery 2.0
Continuous Delivery 2.0
Sep 12, 2023 · Artificial Intelligence

Compression as a Measure of Intelligence in Large Language Models

The article argues that a large language model's ability to compress data through next‑token prediction reflects its intelligence, reviews theoretical and empirical evidence linking compression efficiency to model scale, and proposes a circuit‑competition framework to explain emergent capabilities, in‑context learning, and fine‑tuning effects.

GPT-4IntelligenceLLM
0 likes · 58 min read
Compression as a Measure of Intelligence in Large Language Models
AI Large Model Application Practice
AI Large Model Application Practice
Sep 6, 2023 · Artificial Intelligence

Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs

This article compares Prompt Engineering and Supervised Fine‑Tuning for large language models, explains their principles, showcases common prompt patterns such as Chain‑of‑Thought, ReAct and Self‑Ask, outlines fine‑tuning stages and trade‑offs, and provides practical guidance on selecting the most suitable approach for specific enterprise AI Agent scenarios.

AI AgentFine-tuningLLM
0 likes · 17 min read
Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Sep 1, 2023 · Artificial Intelligence

Understanding Function Calling and ReAct for LLM Agents with LangChain

This article explains how large language models can act as agents by using OpenAI's Function Calling and the ReAct prompting paradigm, compares their trade‑offs, and demonstrates practical implementations with LangChain, including code examples for defining tools, invoking functions, and orchestrating multi‑step reasoning.

AIAgentFunction Calling
0 likes · 21 min read
Understanding Function Calling and ReAct for LLM Agents with LangChain
Architect
Architect
Aug 31, 2023 · Artificial Intelligence

Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2

This tutorial explains how to leverage generative AI tools—including LLMs, embedding models, vector databases, and the LangChain framework—to create a custom chatbot that answers user queries using a knowledge base, with step‑by‑step code examples for Google Colab.

ChatbotEmbeddingLLM
0 likes · 15 min read
Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2
DataFunSummit
DataFunSummit
Aug 30, 2023 · Databases

Milvus: An AI‑Native Vector Database for Large Language Model Applications

This article introduces Milvus, an open‑source, cloud‑native vector database designed for AI workloads, explains how it helps mitigate large‑model hallucinations, outlines its CVP architecture, showcases performance benchmarks, and explores diverse application scenarios and future directions for LLM‑vector database integration.

AILLMMilvus
0 likes · 13 min read
Milvus: An AI‑Native Vector Database for Large Language Model Applications
21CTO
21CTO
Aug 28, 2023 · Artificial Intelligence

What Is Code Llama? Meta’s Open-Source LLM for Generating Code

Code Llama, Meta’s specialized extension of Llama 2, is a large language model fine‑tuned on code data that can generate, complete, and debug software across multiple languages, supports up to 100 k tokens, and is freely available for research and commercial use.

Code GenerationCode LlamaLLM
0 likes · 5 min read
What Is Code Llama? Meta’s Open-Source LLM for Generating Code
21CTO
21CTO
Aug 26, 2023 · Artificial Intelligence

How MetaGPT Leverages SOP to Boost Multi‑Agent LLM Collaboration

MetaGPT is a meta‑programming framework that encodes standard operating procedures as prompts, enabling LLM‑driven multi‑agent systems to automatically generate software artifacts, coordinate roles, and build complex applications like a Blackjack CLI with near‑perfect task completion.

AI CollaborationLLMMetaGPT
0 likes · 4 min read
How MetaGPT Leverages SOP to Boost Multi‑Agent LLM Collaboration
phodal
phodal
Aug 26, 2023 · Artificial Intelligence

How CoUnit Turns LLMs Into a Smart Team API for Faster Collaboration

CoUnit is an open‑source Rust‑based tool that uses local semantic search and LLMs to create a virtual team interface, enabling low‑cost, offline knowledge retrieval, API discovery, and cross‑team assistance for software development teams.

AI integrationLLMRust
0 likes · 7 min read
How CoUnit Turns LLMs Into a Smart Team API for Faster Collaboration
DataFunTalk
DataFunTalk
Aug 23, 2023 · Artificial Intelligence

Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems

This article investigates whether replacing traditional ID-based item encoders with massive LLMs such as GPT‑3 improves recommendation performance, by conducting extensive experiments on three real‑world datasets, analyzing performance limits, generality of item representations, and comparing against ID‑based and prompt‑based methods.

AIGPT-3LLM
0 likes · 15 min read
Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems
Ximalaya Technology Team
Ximalaya Technology Team
Aug 22, 2023 · Artificial Intelligence

Guidelines and Best Practices for Prompt Engineering with Large Language Models

The guide outlines prompt‑engineering best practices for large language models, distinguishing base and instruction‑tuned LLMs, emphasizing clear, structured, step‑by‑step prompts, handling hallucinations, iterating through idea‑code‑data cycles, applying techniques to summarization, reasoning and expansion, managing token costs, and providing concrete OpenAI API examples.

AIAPI UsageLLM
0 likes · 14 min read
Guidelines and Best Practices for Prompt Engineering with Large Language Models
php Courses
php Courses
Aug 14, 2023 · Artificial Intelligence

Guide to the Five Most Powerful Large Language Models and How to Choose Them

This article explains the fundamentals of modern large language models, outlines the top five most powerful LLMs—including GPT‑4, Claude 2, Llama 2, Orca, and Cohere—and provides practical guidance on selecting and applying them across business and development use cases.

AI applicationsClaude 2GPT-4
0 likes · 9 min read
Guide to the Five Most Powerful Large Language Models and How to Choose Them
AI Large Model Application Practice
AI Large Model Application Practice
Aug 11, 2023 · Artificial Intelligence

How LLMs Are Revolutionizing Enterprise Apps: Scenarios, Architecture & Challenges

This article examines how large language models (LLMs) are reshaping enterprise applications by enabling natural‑language interfaces, automating workflows, and enhancing data analysis, while also detailing typical use cases, integration complexities, and practical solutions such as prompt engineering and vector‑database retrieval.

LLMPrompt engineeringartificial intelligence
0 likes · 11 min read
How LLMs Are Revolutionizing Enterprise Apps: Scenarios, Architecture & Challenges
Architect's Guide
Architect's Guide
Aug 10, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM Applications in Python

This tutorial introduces LangChain, an open‑source Python framework that provides unified model access, prompt management, memory, retrieval, and tool integration, enabling developers to quickly prototype AI‑driven applications using large language models and various external data sources.

LLMLangChainPrompt engineering
0 likes · 13 min read
Getting Started with LangChain: Building LLM Applications in Python
JD Tech
JD Tech
Aug 4, 2023 · Artificial Intelligence

Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine

This article details a step‑by‑step guide to deploying the Vicuna open‑source LLM on a single server, covering model preparation, environment setup, dependency installation, GPU and CUDA configuration, inference commands, performance evaluation, and attempted fine‑tuning, while sharing practical observations and results.

Fine‑tuningGPUInference
0 likes · 16 min read
Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine
Architect
Architect
Jul 31, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM‑Powered Applications

This article introduces LangChain, explains why it’s useful for building applications with large language models, walks through installation, API‑key setup, model and embedding selection, prompt engineering, chaining, memory, agents, and vector‑store indexing, and provides runnable Python code examples throughout.

LLMLangChainPromptEngineering
0 likes · 16 min read
Getting Started with LangChain: Building LLM‑Powered Applications
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 30, 2023 · Artificial Intelligence

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

This article reviews the evolution of the GPT family from GPT‑1 to GPT‑3, comparing encoder‑decoder architectures, explaining the shift from supervised fine‑tuning to zero‑shot and few‑shot learning, and highlighting the architectural and training innovations that enabled large‑scale language models.

Fine-tuningGPTLLM
0 likes · 13 min read
ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)
AI Large Model Application Practice
AI Large Model Application Practice
Jul 29, 2023 · Artificial Intelligence

How TypeChat Turns Natural Language into Structured JSON with LLMs

TypeChat, an open‑source project from Microsoft’s TypeScript creator, demonstrates how large language models can translate free‑form user requests into strongly‑typed JSON structures, enabling natural‑language interfaces for applications such as coffee ordering while reducing the complexity of traditional intent‑recognition pipelines.

AILLMTypeChat
0 likes · 7 min read
How TypeChat Turns Natural Language into Structured JSON with LLMs
MoonWebTeam
MoonWebTeam
Jul 28, 2023 · Artificial Intelligence

Unlocking LangChain: A Complete Guide to Building LLM Applications

This article introduces LangChain, explains its architecture and core components, and provides step‑by‑step Python examples for chat models, embeddings, prompts, indexes, chains, memory, agents, and practical use‑cases such as QA bots, web search, summarization, and persistent vector stores.

LLMLangChainPython
0 likes · 34 min read
Unlocking LangChain: A Complete Guide to Building LLM Applications
Alibaba Terminal Technology
Alibaba Terminal Technology
Jul 27, 2023 · Artificial Intelligence

How LLMs Transform Industrial Configuration Software: Architecture & Use Cases

This article explains how integrating large language models into industrial configuration tools creates AI‑driven features such as knowledge Q&A, automatic application generation, smart drawing, script and material generation, and outlines the three‑layer architecture, prompt engineering, and implementation lessons for developers.

AI integrationLLMPrompt engineering
0 likes · 30 min read
How LLMs Transform Industrial Configuration Software: Architecture & Use Cases
Tencent Cloud Developer
Tencent Cloud Developer
Jul 24, 2023 · Artificial Intelligence

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

The author builds an internal code knowledge base for the TDesign Vue‑Next library by scraping documentation, chunking and embedding texts with OpenAI’s ada model into a vector store, then retrieving relevant chunks for LLM answers, and enhances context continuity using a JavaScript AST interpreter, achieving up to 90 % query accuracy and a 20 % productivity boost.

ASTEmbeddingKnowledge Base
0 likes · 11 min read
Building an Internal Code Knowledge Base with Embedding and AST Interpreter
Baobao Algorithm Notes
Baobao Algorithm Notes
Jul 23, 2023 · Artificial Intelligence

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

The article analyzes key challenges in large‑language‑model pipelines—including the necessity of cold‑start pretraining, the pitfalls of reward‑model hacking, efficiency‑effectiveness trade‑offs, evaluation difficulties, and downstream fine‑tuning limits—offering practical insights for more reliable LLM development.

Fine-tuningLLMRLHF
0 likes · 9 min read
Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training
Tencent Cloud Developer
Tencent Cloud Developer
Jul 19, 2023 · Artificial Intelligence

Build a Full‑Scale LLM from Scratch in 61 Lines of Python

This step‑by‑step tutorial shows how to set up a GPU environment, prepare custom text data, train a tokenizer, configure and train a GPT‑2‑based large language model, test its generation, and run the entire pipeline using only 61 lines of Python code.

AIDockerGPT-2
0 likes · 10 min read
Build a Full‑Scale LLM from Scratch in 61 Lines of Python
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Few-ShotLLMPrompt engineering
0 likes · 33 min read
Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples
NetEase Yanxuan Technology Product Team
NetEase Yanxuan Technology Product Team
Jul 17, 2023 · Artificial Intelligence

Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers

This article analyzes the evolving landscape of large‑model applications, detailing prompt engineering, engineering challenges, AI‑engineer roles, domain‑driven design, and agent frameworks, while offering practical guidance and references for building production‑grade LLM‑driven systems.

AI EngineerAgent FrameworkDomain-Driven Design
0 likes · 14 min read
Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers
DaTaobao Tech
DaTaobao Tech
Jul 12, 2023 · Artificial Intelligence

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

The article details a workflow that converts the PyTorch ChatGLM‑6B model to MNN, splits and compresses embeddings, applies int4/int8 quantization, supports dynamic shapes, and uses hybrid GPU/CPU or CPU‑only loading to enable low‑memory edge inference on PCs and mobile devices with competitive token‑per‑second performance.

ChatGLMLLMMNN
0 likes · 16 min read
Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 12, 2023 · Artificial Intelligence

Can Large Language Models Transform Recommendation Systems?

This article reviews how recent large language models (LLMs) are reshaping recommendation systems, covering their emergence, in‑context learning, prompt‑based strategies, three main LLM‑driven recommendation paradigms, key research papers, experimental results, and future research directions.

In-Context LearningLLMPrompt engineering
0 likes · 20 min read
Can Large Language Models Transform Recommendation Systems?
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 10, 2023 · Artificial Intelligence

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

This article explains the limitations of large language models, introduces prompt engineering as a remedy, and provides a comprehensive guide to using the LangChain framework—including models, prompts, chains, agents, vector search, and practical Node.js code examples—to enable LLMs to interact with external tools and data sources.

AI DevelopmentLLMLangChain
0 likes · 35 min read
Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation
phodal
phodal
Jul 9, 2023 · Artificial Intelligence

How LLMs Can Transform Software Architecture Governance and Code Generation

This article explores how large language models can be integrated into software architecture governance, turning architectural rules into code, enhancing design, development, and runtime phases, and improving code generation quality through explicit knowledge, DSLs, and full‑process AI assistance.

AI-assisted developmentDSLLLM
0 likes · 14 min read
How LLMs Can Transform Software Architecture Governance and Code Generation
Architect
Architect
Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory

LangChain is an innovative framework that simplifies building language‑model‑driven applications by providing modular components such as models, prompts, memory, chains, and agents, with examples of asynchronous LLM calls, custom prompt templates, vector stores, token handling, and a simple Gradio chatbot implementation.

AIChatbotLLM
0 likes · 21 min read
Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory
DaTaobao Tech
DaTaobao Tech
Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Concepts, Tools, and Applications

The article introduces LangChain, a framework that unifies language models, prompts, memory, retrieval, and tool‑driven agents into composable chains, illustrating its core components, code examples, end‑to‑end applications such as retrieval‑augmented QA and image generation, and outlining future uses in customer service, recommendation, and automated code review.

AILLMLangChain
0 likes · 25 min read
Introduction to LangChain: Concepts, Tools, and Applications
ITPUB
ITPUB
Jul 5, 2023 · Databases

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Vector databases enable efficient similarity search and storage of high‑dimensional embeddings, allowing enterprises to combine large language models with proprietary knowledge assets to create domain‑specific, accurate, and up‑to‑date AI services, as illustrated with open‑source solutions Chroma and Milvus.

AIChromaLLM
0 likes · 11 min read
Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications
Tencent Tech
Tencent Tech
Jul 4, 2023 · Databases

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

This article explains what vectors and vector databases are, how they differ from traditional databases, their core technologies, their relationship with large language models, market trends, and details of Tencent Cloud VectorDB’s capabilities, architecture, real‑world applications, and future competitive challenges.

AIEmbeddingLLM
0 likes · 10 min read
What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?
php Courses
php Courses
Jul 3, 2023 · Databases

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

Vector databases store and search large-scale vector data, and in 2023 the five leading solutions—Chroma, Pinecone, Weaviate, Milvus, and Faiss—offer scalable, high-performance options for applications such as LLM-driven services, audio search, recommendation systems, image/video analysis, and semantic retrieval across various industries.

AILLMdata storage
0 likes · 4 min read
Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)
phodal
phodal
Jul 2, 2023 · Industry Insights

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

This article explores how large language models can standardize software engineering practices by converting requirements and designs into DSL formats, enabling more automated and efficient code generation, while discussing the challenges of dynamic context building, DSL specification, and the evolving role of LLMs in development pipelines.

AI automationDSLLLM
0 likes · 14 min read
Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development
ByteFE
ByteFE
Jun 28, 2023 · Frontend Development

How GPT is Transforming Frontend Development and UI Interaction

The article examines the rapid rise of GPT models, their technical capabilities and limitations, and how their integration is reshaping software interaction from command‑line to GUI‑plus‑Language UI, offering frontend engineers new opportunities, practical examples, and guidance on leveraging large‑model AI in product design.

AI integrationGPTLLM
0 likes · 21 min read
How GPT is Transforming Frontend Development and UI Interaction
phodal
phodal
Jun 27, 2023 · Artificial Intelligence

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

This article presents a detailed reference architecture for building LLM‑driven applications, using the ArchGuard Co‑mate project to illustrate layered design, local model integration, DSL‑based orchestration, and streaming LLM interfaces, complete with code examples and practical implementation notes.

AI OpsKotlinLLM
0 likes · 10 min read
Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model
ITPUB
ITPUB
Jun 27, 2023 · Artificial Intelligence

Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI

Databricks' $1.3 billion acquisition of MosaicML brings the startup's open‑source MPT models and high‑efficiency training stack into the Lakehouse platform, reflecting a strategic push to embed generative AI across enterprises while emphasizing data control, cost reduction, and open‑source policies.

AIGCDatabricksLLM
0 likes · 8 min read
Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI
DataFunTalk
DataFunTalk
Jun 23, 2023 · Artificial Intelligence

DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction

DeepKE-LLM is an open‑source, extensible knowledge‑graph extraction framework that leverages large language models for entity, relation, and attribute extraction, supports multiple LLMs, provides installation scripts, various usage modes, fine‑tuning pipelines, and integrates with the KnowLM project for advanced instruction‑following capabilities.

DeepKEFine-tuningLLM
0 likes · 8 min read
DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction
Alibaba Cloud Native
Alibaba Cloud Native
Jun 23, 2023 · Cloud Native

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

This guide explains how to deploy large language models on Alibaba Cloud's ACK using KServe for serverless inference, integrates Fluid for distributed data caching to cut cold‑start latency, provides step‑by‑step commands, performance benchmarks, and practical tips for production‑grade AI model serving.

Cloud NativeFluidKServe
0 likes · 22 min read
Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid
DataFunTalk
DataFunTalk
Jun 19, 2023 · Artificial Intelligence

Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview

The announcement introduces RPI's prestigious computer science department, its extensive GPU resources, collaborations with IBM Research, and detailed profiles of three incoming faculty members—highlighting their research areas in graph neural networks, trustworthy AI, data‑centric AI, drug‑design generative models, and neural‑symbolic reasoning—while inviting PhD and intern applicants to apply with full scholarships and funding support.

Data‑Centric AILLMPhD Recruitment
0 likes · 8 min read
Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview
phodal
phodal
Jun 18, 2023 · Artificial Intelligence

How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns

The article outlines five practical patterns—natural‑language DSL, streaming DSL, DSL‑guided generation, explicit retry, and dynamic proxying—that enable developers to treat large‑language‑model interactions as first‑class APIs, improving efficiency, accuracy, and user experience across diverse scenarios.

DSLDynamic ProxyLLM
0 likes · 10 min read
How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns
21CTO
21CTO
Jun 16, 2023 · Artificial Intelligence

Why Are LLM Stacks Becoming Essential for Modern Companies?

A comprehensive look at how companies are rapidly adopting large language model APIs, retrieval techniques, and custom model strategies, revealing key statistics, emerging toolchains, and the shifting balance between closed‑source LLM services and open‑source custom stacks.

AI adoptionCustom ModelsLLM
0 likes · 8 min read
Why Are LLM Stacks Becoming Essential for Modern Companies?
ByteFE
ByteFE
Jun 15, 2023 · Artificial Intelligence

Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies

This article explains how to craft efficient prompts by combining clear instructions and questions, discusses prompt injection risks and mitigation with delimiters, addresses hallucinations, and introduces zero‑shot, few‑shot, and chain‑of‑thought prompting techniques for large language models.

Few-ShotLLMPrompt engineering
0 likes · 16 min read
Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies
phodal
phodal
Jun 14, 2023 · Industry Insights

What Are the Four Core Principles for LLM‑Powered Software Architecture?

This article outlines four foundational design principles—user‑intent‑driven design, context awareness, atomic capability mapping, and language‑API integration—for building LLM‑centric software architectures, illustrating each with DSL examples, Kotlin implementations, and practical insights on prompt engineering, dynamic context layering, and API evolution.

DSLLLMPrompt engineering
0 likes · 10 min read
What Are the Four Core Principles for LLM‑Powered Software Architecture?
phodal
phodal
Jun 11, 2023 · Artificial Intelligence

How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions

This article explains how ArchGuard 2.0 leverages large language models to transform tacit architectural knowledge into dynamic, DSL‑based governance functions, detailing the challenges of traditional tools, the design of the Co‑mate system, and the practical implementation using Kotlin REPL.

APIDSLKotlin
0 likes · 8 min read
How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions
Tencent Cloud Developer
Tencent Cloud Developer
Jun 7, 2023 · Artificial Intelligence

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

The article explains prompt‑engineering fundamentals—definitions, instruction, context, and output formatting—and showcases tricks like few‑shot, chain‑of‑thought, and ReAct, then demonstrates testing with OpenAI APIs, token management, LangChain integration, and low‑code applications such as AI‑generated SQL, API gateways, DSL‑driven UI, chart creation, and vector‑based semantic search.

AIGPTKnowledge Generation
0 likes · 30 min read
Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain
phodal
phodal
Jun 2, 2023 · Artificial Intelligence

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

This comprehensive guide walks programmers through the fundamentals of large language model capabilities, prompt writing and management, new interaction and workflow designs, advanced scenario‑specific applications, and context engineering, offering practical strategies and architectural insights for AI‑native development.

AI ArchitectureContext EngineeringLLM
0 likes · 14 min read
Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI
21CTO
21CTO
Jun 2, 2023 · Artificial Intelligence

Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps

LangChain, launched in 2022, quickly evolved from a Python library to a multi‑environment framework that helps developers build chat‑based applications, agents, and memory‑aware LLM solutions, while integrating with major cloud and AI tooling ecosystems.

AI agentsAuto-GPTChat applications
0 likes · 8 min read
Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps
Architect
Architect
May 29, 2023 · Artificial Intelligence

Understanding Embeddings and Vector Databases for LLM Applications

This article explains what embeddings and vector databases are, how they are generated with models like OpenAI's Ada, why they enable semantic search and help overcome large language model token limits, and demonstrates a practical workflow for retrieving relevant document chunks using cosine similarity.

LLMembeddingsinformation retrieval
0 likes · 7 min read
Understanding Embeddings and Vector Databases for LLM Applications
DataFunSummit
DataFunSummit
May 24, 2023 · Artificial Intelligence

Digital Humans and XR Interaction Forum at DataFunCon2023

The DataFunCon2023 Digital Humans and XR Interaction Forum showcases the latest advances in digital human technology, multimodal interaction, emotion computing, and LLM‑enhanced AI beings through expert talks, detailed outlines, and audience benefits, highlighting both research progress and practical applications.

AILLMXR
0 likes · 8 min read
Digital Humans and XR Interaction Forum at DataFunCon2023
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
May 22, 2023 · Artificial Intelligence

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

Microsoft researchers fine‑tuned GPT‑3.x models with LoRA on over 40,000 cloud incident records, evaluated them with six NLP metrics and human interviews, and found that LLMs can generate root‑cause analyses and mitigation steps comparable to BERT models, especially for machine‑detected failures.

AI for operationsGPT-3LLM
0 likes · 8 min read
How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes
JD Retail Technology
JD Retail Technology
May 16, 2023 · Artificial Intelligence

Deploying and Fine‑Tuning the Alpaca‑LoRA Large Language Model on a Multi‑GPU Server

This guide details the end‑to‑end process of installing GPU drivers, setting up a Python environment, deploying the open‑source Alpaca‑LoRA model, fine‑tuning it with Chinese data on a multi‑GPU server, and performing inference, while highlighting practical challenges and performance observations.

Alpaca-LoRADeep LearningFine-tuning
0 likes · 11 min read
Deploying and Fine‑Tuning the Alpaca‑LoRA Large Language Model on a Multi‑GPU Server
Alibaba Cloud Developer
Alibaba Cloud Developer
May 16, 2023 · Artificial Intelligence

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

This article explains why combining large language models with vector databases enables enterprises to create specialized, up‑to‑date chatbots, outlines the underlying principles, describes the ADB‑PG vector‑search capabilities, and provides step‑by‑step implementation details including data processing, indexing, and query examples.

AnalyticDBChatbotLLM
0 likes · 17 min read
How to Build a Company‑Specific Chatbot with LLMs and Vector Databases
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
May 15, 2023 · Artificial Intelligence

How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development

This article analyzes the current landscape of large‑language‑model application frameworks—LangChain, Auto‑GPT, and HuggingGPT—detailing their core capabilities, workflow patterns, limitations, and how they enable developers to integrate LLMs into real‑world products while outlining future platform requirements.

AI frameworksApplication DevelopmentAuto-GPT
0 likes · 17 min read
How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development
DataFunSummit
DataFunSummit
May 4, 2023 · Artificial Intelligence

LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots

A recent study by the LMSYS organization introduces an Elo‑rated, 1v1 battle arena for large language models, ranking open‑source chatbots like Vicuna, Koala, and ChatGLM, while discussing the limitations of traditional benchmarks and the advantages of crowd‑sourced, scalable evaluation.

AI benchmarkingChatbot ArenaElo Rating
0 likes · 7 min read
LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots
Top Architect
Top Architect
Apr 21, 2023 · Artificial Intelligence

Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT

This article explains why and how to fine‑tune the LLaMA‑7B model using the cheap Alpaca‑LoRA approach, covering hardware requirements, dataset preparation, LoRA training, optional model merging and quantization, and provides ready‑to‑run code snippets for single‑ and multi‑GPU setups.

Alpaca-LoRAFine-tuningGPU
0 likes · 10 min read
Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT
Continuous Delivery 2.0
Continuous Delivery 2.0
Apr 20, 2023 · Artificial Intelligence

AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide

AutoGPT, the latest autonomous AI system, can independently browse the web, use third‑party tools, and execute tasks without human intervention, exemplified by building a React website in minutes, and this article explains its principles, showcases demos, and provides step‑by‑step installation instructions.

AI toolsAgentGPTAutoGPT
0 likes · 10 min read
AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide
Architect
Architect
Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explains their architectures, scaling trade‑offs, and then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca and ChatGLM, providing references for further study.

AIChatGPTGPT-3
0 likes · 27 min read
Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques
DataFunSummit
DataFunSummit
Mar 25, 2023 · Artificial Intelligence

How GPT‑4 Has Changed NLP Research: Community Perspectives

A collection of Zhihu answers reflects on how the release of GPT‑4 has reshaped NLP research, dividing the community into LLM‑enthusiasts and skeptics, discussing the relevance of parsing, resource‑driven research directions, and the existential challenges faced by researchers.

AIAcademic CommunityGPT-4
0 likes · 10 min read
How GPT‑4 Has Changed NLP Research: Community Perspectives