Tagged articles

2011 articles

Page 20 of 21

Oct 18, 2023 · Artificial Intelligence

The Evolution of AI Agents: From Philosophy to Modern Implementations

Tracing AI agents from Aristotle’s and Zhuangzi’s philosophical notions through the coining of “agent” in computer science to today’s learning‑based systems powered by large language models, the article outlines key milestones, core components—LLM brain, memory, planning, tool use—and showcases applications such as AlphaGo, Siri, and autonomous platforms, while forecasting their expanding, industry‑wide ubiquity.

AI agentsGenerative AgentsHistory of AI

0 likes · 21 min read

The Evolution of AI Agents: From Philosophy to Modern Implementations

Model Perspective

Oct 15, 2023 · Artificial Intelligence

How to Use Large Language Models Ethically in Math Modeling Contests

COMAP’s new policy outlines why and how teams in mathematical modeling competitions should responsibly employ large language models and generative AI, detailing guiding principles, risks, citation requirements, and ethical considerations to ensure fairness, transparency, and academic integrity.

AI policyLLMacademic integrity

0 likes · 9 min read

How to Use Large Language Models Ethically in Math Modeling Contests

phodal

Oct 15, 2023 · Artificial Intelligence

How Prompt Design Shapes AIGC Tool Architecture: Lessons from Copilot, Bloop, and JetBrains AI

This article analyzes how carefully crafted prompts influence the architecture of complex AIGC applications, comparing tools like GitHub Copilot, JetBrains AI Assistant, and Bloop, and presents practical strategies and design patterns for building robust, context‑aware AI‑driven development environments.

AIGCIDE pluginsLLM

0 likes · 13 min read

How Prompt Design Shapes AIGC Tool Architecture: Lessons from Copilot, Bloop, and JetBrains AI

dbaplus Community

Oct 14, 2023 · Artificial Intelligence

Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot

This guide explains the Retrieval‑Augmented Generation (RAG) technique, detailing how user queries are matched to private knowledge bases, how relevant passages are retrieved, and how large language models use those passages to generate context‑aware answers, complete with code examples and practical tips.

ChatbotEmbeddingLLM

0 likes · 19 min read

Demystifying Retrieval‑Augmented Generation: From Theory to Working Chatbot

21CTO

Oct 12, 2023 · Frontend Development

How Vercel’s AI‑Powered v0 Tool Is Transforming Frontend Development

Vercel has launched v0, an AI‑driven tool that lets developers describe desired UI components in plain text and receive generated frontend code, streamlining creation, offering multiple design options, and shifting developer focus toward creativity and design.

AILLMVercel

0 likes · 4 min read

How Vercel’s AI‑Powered v0 Tool Is Transforming Frontend Development

Zhuanzhuan Tech

Oct 11, 2023 · Artificial Intelligence

Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering

This article describes how to construct an intelligent customer‑service assistant using ChatGPT for natural‑language understanding, BERT for user‑question classification, and Sentence‑BERT for knowledge‑selection, detailing system architecture, prompt design, model training, performance results, and practical cost reductions.

BERTChatGPTIntelligent Customer Service

0 likes · 16 min read

Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering

ByteFE

Oct 11, 2023 · Artificial Intelligence

CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base

This article describes the design and implementation of a code‑review assistant powered by open‑source large language models and a privately hosted knowledge base, covering background, pain points, system architecture, model selection, vector‑store integration, prompt engineering, diff parsing, and practical reflections.

AICode reviewKnowledge Base

0 likes · 24 min read

CR Copilot: An Open‑Source LLM‑Based Code Review Assistant with Private Knowledge Base

DataFunTalk

Oct 10, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article surveys how large language models can be incorporated into recommender systems, discussing their strengths and limitations, outlining where and how they can be applied across the recommendation pipeline, presenting recent research examples, and highlighting challenges and future directions for industrial deployment.

LLMfeature engineeringrecommender systems

0 likes · 20 min read

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

Alibaba Cloud Big Data AI Platform

Oct 10, 2023 · Artificial Intelligence

Create a Custom Enterprise Conversational Search with Alibaba Cloud OpenSearch Vector & LLM

This guide walks you through setting up Alibaba Cloud OpenSearch Vector Search and LLM Intelligent Q&A editions, covering environment preparation, instance creation, data source configuration, field and index setup, document ingestion, query processing, and a complete Java SDK demo for building a flexible enterprise conversational search system.

Alibaba CloudConversational AIJava SDK

0 likes · 20 min read

Create a Custom Enterprise Conversational Search with Alibaba Cloud OpenSearch Vector & LLM

Baidu Geek Talk

Oct 9, 2023 · Artificial Intelligence

Code Understanding Technology: Building White-Box Software Knowledge Graph at Baidu

Baidu’s white‑box code understanding platform combines static, dynamic, non‑code and LLM‑based analyses in a three‑layer architecture that accelerates C/C++ processing ninefold, supports multiple languages, and powers applications such as intelligent unit testing, orphan‑function cleanup and AI‑driven risk detection, while future integration with models like GPT‑4 aims to enable multi‑turn code Q&A, automated refactoring and predictive testing.

ASTBaiduCode Understanding

0 likes · 15 min read

Code Understanding Technology: Building White-Box Software Knowledge Graph at Baidu

Baobao Algorithm Notes

Oct 8, 2023 · Interview Experience

Must‑Know Large‑Model Interview Questions for RLHF Candidates

The article shares a practitioner’s transition story from reinforcement‑learning‑focused game AI to large‑model work, outlines the challenges faced during job hunting at major Chinese tech firms, and provides a curated list of 23 technical interview questions covering PPO, RLHF, dataset evaluation, model fine‑tuning, and broader LLM concepts.

AI researchInterview PreparationLLM

0 likes · 10 min read

Must‑Know Large‑Model Interview Questions for RLHF Candidates

21CTO

Oct 4, 2023 · Artificial Intelligence

How LangStream Merges Data Streams with Generative AI for Real‑Time LLM Apps

LangStream, the new open‑source framework from DataStax, combines event‑driven data streaming with generative AI, offering seamless integration with vector databases like Astra DB, Milvus, and Pinecone, and providing a Kubernetes‑based runtime that enables real‑time LLM applications without extensive coding.

Data StreamingKubernetesLLM

0 likes · 7 min read

How LangStream Merges Data Streams with Generative AI for Real‑Time LLM Apps

AI Large Model Application Practice

Oct 4, 2023 · Artificial Intelligence

Controlling LLM‑Based AI Agents with the Open‑Source ‘Agents’ Framework

This article introduces the experimental open‑source project ‘Agents’, explains common challenges of LLM‑based AI agents, compares it with tools like AutoGPT, LangChain and MetaGPT, and demonstrates how its configuration‑driven SOP approach enables more controllable, multi‑agent interactions and easier deployment.

ConfigurationLLMMulti-Agent

0 likes · 14 min read

Controlling LLM‑Based AI Agents with the Open‑Source ‘Agents’ Framework

HomeTech

Sep 26, 2023 · Artificial Intelligence

Integrating Large Language Models with Search for Automotive Knowledge Retrieval

This article explores how combining traditional keyword search with large language models (LLMs) enhances understanding of user intent, builds a robust automotive knowledge base, and delivers more accurate, context‑aware answers through a multi‑stage retrieval and generation pipeline.

AIKnowledge BaseLLM

0 likes · 17 min read

Integrating Large Language Models with Search for Automotive Knowledge Retrieval

AI Large Model Application Practice

Sep 25, 2023 · Artificial Intelligence

How LangSmith Turns LLM Debugging, Testing, and Production Monitoring into a Seamless Workflow

This article explores LangSmith, the experimental platform from the creators of LangChain, detailing how it tracks complex LLM reasoning, supports batch testing and evaluation of AI applications, and offers a community Hub for sharing prompts and chains, ultimately helping move LLM projects from prototype to production.

AI testingDebuggingLLM

0 likes · 10 min read

How LangSmith Turns LLM Debugging, Testing, and Production Monitoring into a Seamless Workflow

phodal

Sep 24, 2023 · Artificial Intelligence

Designing a JVM‑Based LLM Framework: Insights from Chocolate Factory

This article explores the design principles, architectural decisions, and practical code examples behind the Chocolate Factory framework, a JVM‑centric LLM development platform inspired by LangChain, LlamaIndex, Spring AI, and PromptFlow, highlighting SDK construction, RAG workflows, and prompt engineering challenges.

AI DevelopmentFrameworkJVM

0 likes · 11 min read

Designing a JVM‑Based LLM Framework: Insights from Chocolate Factory

NetEase LeiHuo Testing Center

Sep 22, 2023 · Artificial Intelligence

Understanding Large Language Models and Prompt Engineering: A Practical Guide

This article provides an introductory overview of large language models (LLMs), compares popular models, explains their underlying principles, and offers practical guidance on prompt engineering, model evaluation, usage tips, and safety considerations, helping readers effectively select and apply LLMs in various scenarios.

AILLMModel Evaluation

0 likes · 44 min read

Understanding Large Language Models and Prompt Engineering: A Practical Guide

DataFunSummit

Sep 22, 2023 · Artificial Intelligence

Exploring Game AI Agents: Review, LLM‑Driven Exploration, and Future Directions

This article reviews the evolution of game AI agents, examines how large language models (LLMs) can drive new AI behaviors in games, and discusses practical case studies across genres such as Werewolf‑style, war‑SLG, and MOBA games, concluding with challenges and future research directions.

AI agentsGame DevelopmentLLM

0 likes · 31 min read

Exploring Game AI Agents: Review, LLM‑Driven Exploration, and Future Directions

21CTO

Sep 21, 2023 · Artificial Intelligence

Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?

This article compares the open‑source large language models Falcon 180B and Meta’s Llama 2, detailing their parameter sizes, training data, licensing, variants, infrastructure, language support, and safety policies, while providing links to official resources and a side‑by‑side feature table.

AI comparisonFalcon 180BLLM

0 likes · 8 min read

Falcon 180B vs Llama 2: Which Open‑Source LLM Leads the AI Race?

Ximalaya Technology Team

Sep 18, 2023 · Artificial Intelligence

Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts

The article surveys the rise of LLM‑powered AI agents, defining them as LLM + memory + planning + tool use, contrasting fully autonomous agents with human‑guided autopilot/copilot variants, outlining their benefits, risks such as hallucinations and unsafe actions, and urging modular frameworks and oversight for reliable enterprise deployment.

AI agentsAgent FrameworkLLM

0 likes · 27 min read

Understanding Autonomous and Autopilot AI Agents: Insights from Industry Experts

Yunxuetang Frontend Team

Sep 15, 2023 · Frontend Development

Front-End Insights: Architecture, Code Review, Storage, and New Tools

This article explores front‑end architecture decisions between Vue and React, shares a comprehensive code‑review methodology from 13 years at Tencent, evaluates the most robust local storage solutions for large offline data, details two screen‑adaptation approaches for Vue 3/Vite, and introduces emerging technologies such as LangChain’s RAG and Agents and the high‑performance Bun 1.0 JavaScript runtime.

BunCode reviewLLM

0 likes · 4 min read

Front-End Insights: Architecture, Code Review, Storage, and New Tools

DaTaobao Tech

Sep 13, 2023 · Artificial Intelligence

Integrating Large Language Models with Recommendation Systems: Paradigms, Methods, and Experiments

The article surveys how large language models can be integrated into recommendation systems, either as feature extractors or as end‑to‑end recommenders, showing that LLM‑derived semantics improve recall, ranking, diversity, and user experience, and outlining future multimodal, efficiency, and re‑ranking directions.

EmbeddingLLMPrompt engineering

0 likes · 19 min read

Integrating Large Language Models with Recommendation Systems: Paradigms, Methods, and Experiments

Alibaba Cloud Infrastructure

Sep 13, 2023 · Artificial Intelligence

Pai‑Megatron‑Patch: Design Principles, Key Features, and End‑to‑End Usage for Large Language Model Training

This article introduces the open‑source Pai‑Megatron‑Patch tool from Alibaba Cloud, explains its non‑intrusive patch architecture, enumerates supported models and features such as weight conversion, Flash‑Attention 2.0, FP8 training with Transformer Engine, and provides detailed command‑line examples for model conversion, pre‑training, supervised fine‑tuning, inference, and RLHF reinforcement learning pipelines.

Deep LearningFP8LLM

0 likes · 19 min read

Pai‑Megatron‑Patch: Design Principles, Key Features, and End‑to‑End Usage for Large Language Model Training

Baobao Algorithm Notes

Sep 12, 2023 · Artificial Intelligence

Why RTX 4090 Beats H100 for LLM Inference but Fails at Training

The article analyses the performance, memory, bandwidth and cost of NVIDIA H100, A100 and RTX 4090 GPUs, explains why the 4090 cannot handle large‑model training due to communication and memory limits, and shows how its high compute‑to‑price ratio makes it attractive for inference, backed by detailed parallelism calculations and cost‑per‑token estimates.

CostGPULLM

0 likes · 46 min read

Why RTX 4090 Beats H100 for LLM Inference but Fails at Training

Continuous Delivery 2.0

Sep 12, 2023 · Artificial Intelligence

Compression as a Measure of Intelligence in Large Language Models

The article argues that a large language model's ability to compress data through next‑token prediction reflects its intelligence, reviews theoretical and empirical evidence linking compression efficiency to model scale, and proposes a circuit‑competition framework to explain emergent capabilities, in‑context learning, and fine‑tuning effects.

GPT-4IntelligenceLLM

0 likes · 58 min read

Compression as a Measure of Intelligence in Large Language Models

AI Large Model Application Practice

Sep 6, 2023 · Artificial Intelligence

Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs

This article compares Prompt Engineering and Supervised Fine‑Tuning for large language models, explains their principles, showcases common prompt patterns such as Chain‑of‑Thought, ReAct and Self‑Ask, outlines fine‑tuning stages and trade‑offs, and provides practical guidance on selecting the most suitable approach for specific enterprise AI Agent scenarios.

AI AgentFine-tuningLLM

0 likes · 17 min read

Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs

phodal

Sep 3, 2023 · Artificial Intelligence

Engineering LLM Applications: Architecture, Prompt Modeling, and Multi‑Language Strategies

This article shares practical insights from months of building LLM proof‑of‑concepts, covering language‑agnostic architectures, FFI integration, prompt engineering, RAG patterns, DSL design, and four core architectural principles for scalable AI applications.

AI ArchitectureDSLFFI

0 likes · 13 min read

Engineering LLM Applications: Architecture, Prompt Modeling, and Multi‑Language Strategies

Rare Earth Juejin Tech Community

Sep 1, 2023 · Artificial Intelligence

Understanding Function Calling and ReAct for LLM Agents with LangChain

This article explains how large language models can act as agents by using OpenAI's Function Calling and the ReAct prompting paradigm, compares their trade‑offs, and demonstrates practical implementations with LangChain, including code examples for defining tools, invoking functions, and orchestrating multi‑step reasoning.

AIAgentFunction Calling

0 likes · 21 min read

Understanding Function Calling and ReAct for LLM Agents with LangChain

Architect

Aug 31, 2023 · Artificial Intelligence

Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2

This tutorial explains how to leverage generative AI tools—including LLMs, embedding models, vector databases, and the LangChain framework—to create a custom chatbot that answers user queries using a knowledge base, with step‑by‑step code examples for Google Colab.

ChatbotEmbeddingLLM

0 likes · 15 min read

Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2

21CTO

Aug 31, 2023 · Artificial Intelligence

Google Duet AI, IBM Mainframe AI, and NCSC LLM Warning: What’s Next for Enterprise Tech

The article reports on Google’s launch of Duet AI in Workspace, a Chinese firm’s apology over a VS Code‑based IDE, NCSC’s caution about LLMs, IBM’s generative‑AI tool for mainframe modernization, and OpenSSF’s new open‑source security manifesto.

AIEnterprise CollaborationLLM

0 likes · 9 min read

Google Duet AI, IBM Mainframe AI, and NCSC LLM Warning: What’s Next for Enterprise Tech

DataFunSummit

Aug 30, 2023 · Databases

Milvus: An AI‑Native Vector Database for Large Language Model Applications

This article introduces Milvus, an open‑source, cloud‑native vector database designed for AI workloads, explains how it helps mitigate large‑model hallucinations, outlines its CVP architecture, showcases performance benchmarks, and explores diverse application scenarios and future directions for LLM‑vector database integration.

AILLMMilvus

0 likes · 13 min read

Milvus: An AI‑Native Vector Database for Large Language Model Applications

21CTO

Aug 28, 2023 · Artificial Intelligence

What Is Code Llama? Meta’s Open-Source LLM for Generating Code

Code Llama, Meta’s specialized extension of Llama 2, is a large language model fine‑tuned on code data that can generate, complete, and debug software across multiple languages, supports up to 100 k tokens, and is freely available for research and commercial use.

Code GenerationCode LlamaLLM

0 likes · 5 min read

What Is Code Llama? Meta’s Open-Source LLM for Generating Code

21CTO

Aug 26, 2023 · Artificial Intelligence

How MetaGPT Leverages SOP to Boost Multi‑Agent LLM Collaboration

MetaGPT is a meta‑programming framework that encodes standard operating procedures as prompts, enabling LLM‑driven multi‑agent systems to automatically generate software artifacts, coordinate roles, and build complex applications like a Blackjack CLI with near‑perfect task completion.

AI CollaborationLLMMetaGPT

0 likes · 4 min read

How MetaGPT Leverages SOP to Boost Multi‑Agent LLM Collaboration

phodal

Aug 26, 2023 · Artificial Intelligence

How CoUnit Turns LLMs Into a Smart Team API for Faster Collaboration

CoUnit is an open‑source Rust‑based tool that uses local semantic search and LLMs to create a virtual team interface, enabling low‑cost, offline knowledge retrieval, API discovery, and cross‑team assistance for software development teams.

AI integrationLLMRust

0 likes · 7 min read

How CoUnit Turns LLMs Into a Smart Team API for Faster Collaboration

DataFunTalk

Aug 23, 2023 · Artificial Intelligence

Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems

This article investigates whether replacing traditional ID-based item encoders with massive LLMs such as GPT‑3 improves recommendation performance, by conducting extensive experiments on three real‑world datasets, analyzing performance limits, generality of item representations, and comparing against ID‑based and prompt‑based methods.

AIGPT-3LLM

0 likes · 15 min read

Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems

Ximalaya Technology Team

Aug 22, 2023 · Artificial Intelligence

Guidelines and Best Practices for Prompt Engineering with Large Language Models

The guide outlines prompt‑engineering best practices for large language models, distinguishing base and instruction‑tuned LLMs, emphasizing clear, structured, step‑by‑step prompts, handling hallucinations, iterating through idea‑code‑data cycles, applying techniques to summarization, reasoning and expansion, managing token costs, and providing concrete OpenAI API examples.

AIAPI UsageLLM

0 likes · 14 min read

Guidelines and Best Practices for Prompt Engineering with Large Language Models

AI Large Model Application Practice

Aug 21, 2023 · Artificial Intelligence

Building an AI Sales Assistant: Enterprise LLM Architecture and Agent Workflow

This article outlines a practical enterprise architecture for integrating large language models into a sales assistant, detailing knowledge ingestion, vector embedding, task planning, tool usage, and iterative dialogue, while introducing AI Agent concepts and open‑source frameworks such as LangChain.

AI AgentLLMLangChain

0 likes · 10 min read

Building an AI Sales Assistant: Enterprise LLM Architecture and Agent Workflow

JD Cloud Developers

Aug 15, 2023 · Artificial Intelligence

Unlocking LangChain: A Deep Dive into LLM‑Powered Application Development

This article explains what LangChain is, outlines its core components such as Models, Indexes, Chains, Memory and Agents, provides practical code examples for building summarization and QA pipelines, and discusses future directions for LLM‑centric development.

LLMLangChainMemory

0 likes · 22 min read

Unlocking LangChain: A Deep Dive into LLM‑Powered Application Development

php Courses

Aug 14, 2023 · Artificial Intelligence

Guide to the Five Most Powerful Large Language Models and How to Choose Them

This article explains the fundamentals of modern large language models, outlines the top five most powerful LLMs—including GPT‑4, Claude 2, Llama 2, Orca, and Cohere—and provides practical guidance on selecting and applying them across business and development use cases.

AI applicationsClaude 2GPT-4

0 likes · 9 min read

Guide to the Five Most Powerful Large Language Models and How to Choose Them

AI Large Model Application Practice

Aug 11, 2023 · Artificial Intelligence

How LLMs Are Revolutionizing Enterprise Apps: Scenarios, Architecture & Challenges

This article examines how large language models (LLMs) are reshaping enterprise applications by enabling natural‑language interfaces, automating workflows, and enhancing data analysis, while also detailing typical use cases, integration complexities, and practical solutions such as prompt engineering and vector‑database retrieval.

LLMPrompt engineeringartificial intelligence

0 likes · 11 min read

How LLMs Are Revolutionizing Enterprise Apps: Scenarios, Architecture & Challenges

Architect's Guide

Aug 10, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM Applications in Python

This tutorial introduces LangChain, an open‑source Python framework that provides unified model access, prompt management, memory, retrieval, and tool integration, enabling developers to quickly prototype AI‑driven applications using large language models and various external data sources.

LLMLangChainPrompt engineering

0 likes · 13 min read

Getting Started with LangChain: Building LLM Applications in Python

phodal

Aug 6, 2023 · Industry Insights

Can AI Really Supercharge Your IDE? Inside the AutoDev Open‑Source Copilot

This article examines the open‑source AutoDev project, outlining its assumptions about private LLMs, the three assistance modes it provides for IDEs, technical implementation details, integration challenges, and broader thoughts on LLMs as developer copilots.

AIIDELLM

0 likes · 8 min read

Can AI Really Supercharge Your IDE? Inside the AutoDev Open‑Source Copilot

JD Tech

Aug 4, 2023 · Artificial Intelligence

Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine

This article details a step‑by‑step guide to deploying the Vicuna open‑source LLM on a single server, covering model preparation, environment setup, dependency installation, GPU and CUDA configuration, inference commands, performance evaluation, and attempted fine‑tuning, while sharing practical observations and results.

Fine‑tuningGPUInference

0 likes · 16 min read

Deploying and Evaluating the Vicuna Open‑Source Large Language Model on a Single Machine

Architect

Jul 31, 2023 · Artificial Intelligence

Getting Started with LangChain: Building LLM‑Powered Applications

This article introduces LangChain, explains why it’s useful for building applications with large language models, walks through installation, API‑key setup, model and embedding selection, prompt engineering, chaining, memory, agents, and vector‑store indexing, and provides runnable Python code examples throughout.

LLMLangChainPromptEngineering

0 likes · 16 min read

Getting Started with LangChain: Building LLM‑Powered Applications

Rare Earth Juejin Tech Community

Jul 30, 2023 · Artificial Intelligence

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

This article reviews the evolution of the GPT family from GPT‑1 to GPT‑3, comparing encoder‑decoder architectures, explaining the shift from supervised fine‑tuning to zero‑shot and few‑shot learning, and highlighting the architectural and training innovations that enabled large‑scale language models.

Fine-tuningGPTLLM

0 likes · 13 min read

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

AI Large Model Application Practice

Jul 29, 2023 · Artificial Intelligence

How TypeChat Turns Natural Language into Structured JSON with LLMs

TypeChat, an open‑source project from Microsoft’s TypeScript creator, demonstrates how large language models can translate free‑form user requests into strongly‑typed JSON structures, enabling natural‑language interfaces for applications such as coffee ordering while reducing the complexity of traditional intent‑recognition pipelines.

AILLMTypeChat

0 likes · 7 min read

How TypeChat Turns Natural Language into Structured JSON with LLMs

MoonWebTeam

Jul 28, 2023 · Artificial Intelligence

Unlocking LangChain: A Complete Guide to Building LLM Applications

This article introduces LangChain, explains its architecture and core components, and provides step‑by‑step Python examples for chat models, embeddings, prompts, indexes, chains, memory, agents, and practical use‑cases such as QA bots, web search, summarization, and persistent vector stores.

LLMLangChainPython

0 likes · 34 min read

Unlocking LangChain: A Complete Guide to Building LLM Applications

Alibaba Terminal Technology

Jul 27, 2023 · Artificial Intelligence

How LLMs Transform Industrial Configuration Software: Architecture & Use Cases

This article explains how integrating large language models into industrial configuration tools creates AI‑driven features such as knowledge Q&A, automatic application generation, smart drawing, script and material generation, and outlines the three‑layer architecture, prompt engineering, and implementation lessons for developers.

AI integrationLLMPrompt engineering

0 likes · 30 min read

How LLMs Transform Industrial Configuration Software: Architecture & Use Cases

Alibaba Cloud Big Data AI Platform

Jul 26, 2023 · Artificial Intelligence

Build a Custom LLM Chatbot with Hologres Vector Store and LangChain

This guide shows how to create a domain‑specific chatbot by combining large language models, prompt‑tuning, and Alibaba Cloud's Hologres real‑time vector store, covering prerequisites, environment setup, code configuration, vector data preparation, verification, and optional DingTalk integration.

ChatbotHologresLLM

0 likes · 12 min read

Build a Custom LLM Chatbot with Hologres Vector Store and LangChain

Tencent Cloud Developer

Jul 24, 2023 · Artificial Intelligence

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

The author builds an internal code knowledge base for the TDesign Vue‑Next library by scraping documentation, chunking and embedding texts with OpenAI’s ada model into a vector store, then retrieving relevant chunks for LLM answers, and enhances context continuity using a JavaScript AST interpreter, achieving up to 90 % query accuracy and a 20 % productivity boost.

ASTEmbeddingKnowledge Base

0 likes · 11 min read

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

Baobao Algorithm Notes

Jul 23, 2023 · Artificial Intelligence

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

The article analyzes key challenges in large‑language‑model pipelines—including the necessity of cold‑start pretraining, the pitfalls of reward‑model hacking, efficiency‑effectiveness trade‑offs, evaluation difficulties, and downstream fine‑tuning limits—offering practical insights for more reliable LLM development.

Fine-tuningLLMRLHF

0 likes · 9 min read

Why Cold Starts, Reward Hacking, and Evaluation Matter in LLM Training

Volcano Engine Developer Services

Jul 21, 2023 · Artificial Intelligence

Build a Custom AI Q&A System Using Volcano Engine Cloud Search & LangChain

This guide explains how to create a domain‑specific intelligent Q&A system by leveraging prompt‑tuning, Volcano Engine Cloud Search vector store, and LangChain, covering model selection, data embedding, vector indexing, retrieval, and LLM integration with full code examples.

AI Q&AEmbeddingLLM

0 likes · 8 min read

Build a Custom AI Q&A System Using Volcano Engine Cloud Search & LangChain

Tencent Cloud Developer

Jul 19, 2023 · Artificial Intelligence

Build a Full‑Scale LLM from Scratch in 61 Lines of Python

This step‑by‑step tutorial shows how to set up a GPU environment, prepare custom text data, train a tokenizer, configure and train a GPT‑2‑based large language model, test its generation, and run the entire pipeline using only 61 lines of Python code.

AIDockerGPT-2

0 likes · 10 min read

Build a Full‑Scale LLM from Scratch in 61 Lines of Python

Java Backend Technology

Jul 19, 2023 · Artificial Intelligence

Unlocking LangChain: Build Powerful LLM Applications Step‑by‑Step

This article introduces LangChain, explains its modular architecture—including models, prompts, memory, chains, and agents—shows how to handle tokens and model selection, and provides complete Python code examples for building a conversational retrieval chatbot with OpenAI models.

EmbeddingLLMLangChain

0 likes · 20 min read

Unlocking LangChain: Build Powerful LLM Applications Step‑by‑Step

Alibaba Cloud Developer

Jul 19, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

This comprehensive guide explores prompt engineering for large language models, covering its background, fundamental concepts, prompt formats, construction principles, advanced techniques like few‑shot, zero‑shot, and chain‑of‑thought prompting, as well as practical examples, evaluation metrics, and future directions.

Few-ShotLLMPrompt engineering

0 likes · 33 min read

Mastering Prompt Engineering: Techniques, Tips, and Real-World Examples

NetEase Yanxuan Technology Product Team

Jul 17, 2023 · Artificial Intelligence

Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers

This article analyzes the evolving landscape of large‑model applications, detailing prompt engineering, engineering challenges, AI‑engineer roles, domain‑driven design, and agent frameworks, while offering practical guidance and references for building production‑grade LLM‑driven systems.

AI EngineerAgent FrameworkDomain-Driven Design

0 likes · 14 min read

Beyond Prompts: Designing Robust LLM Applications and the Rise of AI Engineers

DaTaobao Tech

Jul 12, 2023 · Artificial Intelligence

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

The article details a workflow that converts the PyTorch ChatGLM‑6B model to MNN, splits and compresses embeddings, applies int4/int8 quantization, supports dynamic shapes, and uses hybrid GPU/CPU or CPU‑only loading to enable low‑memory edge inference on PCs and mobile devices with competitive token‑per‑second performance.

ChatGLMLLMMNN

0 likes · 16 min read

Optimizing ChatGLM-6B Deployment with MNN: Model Conversion, Quantization, and Edge Inference

Alibaba Cloud Developer

Jul 12, 2023 · Artificial Intelligence

Can Large Language Models Transform Recommendation Systems?

This article reviews how recent large language models (LLMs) are reshaping recommendation systems, covering their emergence, in‑context learning, prompt‑based strategies, three main LLM‑driven recommendation paradigms, key research papers, experimental results, and future research directions.

In-Context LearningLLMPrompt engineering

0 likes · 20 min read

Can Large Language Models Transform Recommendation Systems?

Rare Earth Juejin Tech Community

Jul 10, 2023 · Artificial Intelligence

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

This article explains the limitations of large language models, introduces prompt engineering as a remedy, and provides a comprehensive guide to using the LangChain framework—including models, prompts, chains, agents, vector search, and practical Node.js code examples—to enable LLMs to interact with external tools and data sources.

AI DevelopmentLLMLangChain

0 likes · 35 min read

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

phodal

Jul 9, 2023 · Artificial Intelligence

How LLMs Can Transform Software Architecture Governance and Code Generation

This article explores how large language models can be integrated into software architecture governance, turning architectural rules into code, enhancing design, development, and runtime phases, and improving code generation quality through explicit knowledge, DSLs, and full‑process AI assistance.

AI-assisted developmentDSLLLM

0 likes · 14 min read

Architect

Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory

LangChain is an innovative framework that simplifies building language‑model‑driven applications by providing modular components such as models, prompts, memory, chains, and agents, with examples of asynchronous LLM calls, custom prompt templates, vector stores, token handling, and a simple Gradio chatbot implementation.

AIChatbotLLM

0 likes · 21 min read

Introduction to LangChain: Building LLM Applications with Chains, Agents, and Memory

DaTaobao Tech

Jul 7, 2023 · Artificial Intelligence

Introduction to LangChain: Concepts, Tools, and Applications

The article introduces LangChain, a framework that unifies language models, prompts, memory, retrieval, and tool‑driven agents into composable chains, illustrating its core components, code examples, end‑to‑end applications such as retrieval‑augmented QA and image generation, and outlining future uses in customer service, recommendation, and automated code review.

AILLMLangChain

0 likes · 25 min read

Introduction to LangChain: Concepts, Tools, and Applications

php Courses

Jul 6, 2023 · Artificial Intelligence

Microsoft Introduces Guidance: A Domain-Specific Language for Managing Large Language Models

Microsoft's newly released Guidance language unifies generation, prompting, and logical control for large language models, enabling seamless integration with providers like Hugging Face, enhancing prompt reliability, and simplifying complex LLM workflows for developers.

AILLMLangChain

0 likes · 3 min read

Microsoft Introduces Guidance: A Domain-Specific Language for Managing Large Language Models

ITPUB

Jul 5, 2023 · Databases

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Vector databases enable efficient similarity search and storage of high‑dimensional embeddings, allowing enterprises to combine large language models with proprietary knowledge assets to create domain‑specific, accurate, and up‑to‑date AI services, as illustrated with open‑source solutions Chroma and Milvus.

AIChromaLLM

0 likes · 11 min read

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Tencent Tech

Jul 4, 2023 · Databases

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

This article explains what vectors and vector databases are, how they differ from traditional databases, their core technologies, their relationship with large language models, market trends, and details of Tencent Cloud VectorDB’s capabilities, architecture, real‑world applications, and future competitive challenges.

AIEmbeddingLLM

0 likes · 10 min read

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

php Courses

Jul 3, 2023 · Databases

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

Vector databases store and search large-scale vector data, and in 2023 the five leading solutions—Chroma, Pinecone, Weaviate, Milvus, and Faiss—offer scalable, high-performance options for applications such as LLM-driven services, audio search, recommendation systems, image/video analysis, and semantic retrieval across various industries.

AILLMdata storage

0 likes · 4 min read

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

phodal

Jul 2, 2023 · Industry Insights

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

This article explores how large language models can standardize software engineering practices by converting requirements and designs into DSL formats, enabling more automated and efficient code generation, while discussing the challenges of dynamic context building, DSL specification, and the evolving role of LLMs in development pipelines.

AI automationDSLLLM

0 likes · 14 min read

Can LLMs Revive Classic Software Engineering? A Deep Dive into Standardized AI‑Driven Development

ByteFE

Jun 28, 2023 · Frontend Development

How GPT is Transforming Frontend Development and UI Interaction

The article examines the rapid rise of GPT models, their technical capabilities and limitations, and how their integration is reshaping software interaction from command‑line to GUI‑plus‑Language UI, offering frontend engineers new opportunities, practical examples, and guidance on leveraging large‑model AI in product design.

AI integrationGPTLLM

0 likes · 21 min read

How GPT is Transforming Frontend Development and UI Interaction

phodal

Jun 27, 2023 · Artificial Intelligence

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

This article presents a detailed reference architecture for building LLM‑driven applications, using the ArchGuard Co‑mate project to illustrate layered design, local model integration, DSL‑based orchestration, and streaming LLM interfaces, complete with code examples and practical implementation notes.

AI OpsKotlinLLM

0 likes · 10 min read

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

ITPUB

Jun 27, 2023 · Artificial Intelligence

Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI

Databricks' $1.3 billion acquisition of MosaicML brings the startup's open‑source MPT models and high‑efficiency training stack into the Lakehouse platform, reflecting a strategic push to embed generative AI across enterprises while emphasizing data control, cost reduction, and open‑source policies.

AIGCDatabricksLLM

0 likes · 8 min read

Why Databricks’ $1.3B MosaicML Deal Marks a Bold Bet on Generative AI

Kuaishou E-commerce Frontend Team

Jun 27, 2023 · Artificial Intelligence

Exploring AIGC in Enterprise: From OT Algorithms to 3D AI Innovations

The A2M Internet Architecture and AI Summit in Shanghai showcased 24 topics and 84 talks, highlighting AIGC applications across enterprise collaboration, large‑scale model integration, 3D visual content, music entertainment, and the broader impact of LLMs on software development and digital transformation.

3D AIAIGCCollaboration

0 likes · 10 min read

Exploring AIGC in Enterprise: From OT Algorithms to 3D AI Innovations

Alibaba Cloud Native

Jun 24, 2023 · Cloud Native

How to Deploy Distributed LLM Inference with DeepSpeed on Alibaba Cloud ACK

This guide walks through deploying a Bloom 7B1 large language model for distributed inference on Alibaba Cloud Container Service (ACK) using DeepSpeed, Arena, and Kubernetes, covering environment setup, model configuration, service launch, verification, and Ingress exposure.

ACKArenaCloud Native

0 likes · 14 min read

How to Deploy Distributed LLM Inference with DeepSpeed on Alibaba Cloud ACK

DataFunTalk

Jun 23, 2023 · Artificial Intelligence

DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction

DeepKE-LLM is an open‑source, extensible knowledge‑graph extraction framework that leverages large language models for entity, relation, and attribute extraction, supports multiple LLMs, provides installation scripts, various usage modes, fine‑tuning pipelines, and integrates with the KnowLM project for advanced instruction‑following capabilities.

DeepKEFine-tuningLLM

0 likes · 8 min read

DeepKE-LLM: An Open‑Source Large Language Model Toolkit for Knowledge Extraction

Alibaba Cloud Native

Jun 23, 2023 · Cloud Native

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

This guide explains how to deploy large language models on Alibaba Cloud's ACK using KServe for serverless inference, integrates Fluid for distributed data caching to cut cold‑start latency, provides step‑by‑step commands, performance benchmarks, and practical tips for production‑grade AI model serving.

Cloud NativeFluidKServe

0 likes · 22 min read

Accelerating LLM Inference on Alibaba Cloud with KServe and Fluid

Architects' Tech Alliance

Jun 23, 2023 · Artificial Intelligence

Can GPT‑4 Really Design a Chip? Inside NYU’s AI‑Generated Processor Breakthrough

A NYU Tandon study shows that GPT‑4, using plain‑language dialogue, can generate Verilog code for a novel 8‑bit accumulator microprocessor, which was tape‑out on Skywater 130 nm, demonstrating AI’s emerging ability to perform hardware design tasks at near‑human level.

AGIAI chip designGPT-4

0 likes · 5 min read

Can GPT‑4 Really Design a Chip? Inside NYU’s AI‑Generated Processor Breakthrough

DataFunTalk

Jun 19, 2023 · Artificial Intelligence

Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview

The announcement introduces RPI's prestigious computer science department, its extensive GPU resources, collaborations with IBM Research, and detailed profiles of three incoming faculty members—highlighting their research areas in graph neural networks, trustworthy AI, data‑centric AI, drug‑design generative models, and neural‑symbolic reasoning—while inviting PhD and intern applicants to apply with full scholarships and funding support.

Data‑Centric AILLMPhD Recruitment

0 likes · 8 min read

Rensselaer Polytechnic Institute (RPI) Computer Science Faculty, Resources, and PhD/Intern Recruitment Overview

phodal

Jun 18, 2023 · Artificial Intelligence

How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns

The article outlines five practical patterns—natural‑language DSL, streaming DSL, DSL‑guided generation, explicit retry, and dynamic proxying—that enable developers to treat large‑language‑model interactions as first‑class APIs, improving efficiency, accuracy, and user experience across diverse scenarios.

DSLDynamic ProxyLLM

0 likes · 10 min read

How to Build Language‑First APIs: 5 LLM‑Powered Architectural Patterns

21CTO

Jun 16, 2023 · Artificial Intelligence

Why Are LLM Stacks Becoming Essential for Modern Companies?

A comprehensive look at how companies are rapidly adopting large language model APIs, retrieval techniques, and custom model strategies, revealing key statistics, emerging toolchains, and the shifting balance between closed‑source LLM services and open‑source custom stacks.

AI adoptionCustom ModelsLLM

0 likes · 8 min read

Why Are LLM Stacks Becoming Essential for Modern Companies?

ByteFE

Jun 15, 2023 · Artificial Intelligence

Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies

This article explains how to craft efficient prompts by combining clear instructions and questions, discusses prompt injection risks and mitigation with delimiters, addresses hallucinations, and introduces zero‑shot, few‑shot, and chain‑of‑thought prompting techniques for large language models.

Few-ShotLLMPrompt engineering

0 likes · 16 min read

Effective Prompt Engineering: Techniques, Prompt Injection Prevention, Hallucination Mitigation, and Advanced Prompting Strategies

phodal

Jun 14, 2023 · Industry Insights

What Are the Four Core Principles for LLM‑Powered Software Architecture?

This article outlines four foundational design principles—user‑intent‑driven design, context awareness, atomic capability mapping, and language‑API integration—for building LLM‑centric software architectures, illustrating each with DSL examples, Kotlin implementations, and practical insights on prompt engineering, dynamic context layering, and API evolution.

DSLLLMPrompt engineering

0 likes · 10 min read

What Are the Four Core Principles for LLM‑Powered Software Architecture?

phodal

Jun 11, 2023 · Artificial Intelligence

How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions

This article explains how ArchGuard 2.0 leverages large language models to transform tacit architectural knowledge into dynamic, DSL‑based governance functions, detailing the challenges of traditional tools, the design of the Co‑mate system, and the practical implementation using Kotlin REPL.

APIDSLKotlin

0 likes · 8 min read

How ArchGuard 2.0 Uses LLMs to Dynamically Generate Architecture Governance Functions

dbaplus Community

Jun 10, 2023 · Artificial Intelligence

How Embeddings and Vector Databases Empower LLMs for Semantic Search

This article explains what embeddings and vector databases are, why they matter for large language models, and how they can be used to overcome token limits by storing and retrieving relevant text chunks for accurate, context‑aware responses.

LLMembeddingssemantic search

0 likes · 6 min read

How Embeddings and Vector Databases Empower LLMs for Semantic Search

Tencent Cloud Developer

Jun 7, 2023 · Artificial Intelligence

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

The article explains prompt‑engineering fundamentals—definitions, instruction, context, and output formatting—and showcases tricks like few‑shot, chain‑of‑thought, and ReAct, then demonstrates testing with OpenAI APIs, token management, LangChain integration, and low‑code applications such as AI‑generated SQL, API gateways, DSL‑driven UI, chart creation, and vector‑based semantic search.

AIGPTKnowledge Generation

0 likes · 30 min read

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

phodal

Jun 2, 2023 · Artificial Intelligence

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

This comprehensive guide walks programmers through the fundamentals of large language model capabilities, prompt writing and management, new interaction and workflow designs, advanced scenario‑specific applications, and context engineering, offering practical strategies and architectural insights for AI‑native development.

AI ArchitectureContext EngineeringLLM

0 likes · 14 min read

Mastering LLMs: A Programmer’s Guide to Prompt Engineering, Architecture, and Contextual AI

21CTO

Jun 2, 2023 · Artificial Intelligence

Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps

LangChain, launched in 2022, quickly evolved from a Python library to a multi‑environment framework that helps developers build chat‑based applications, agents, and memory‑aware LLM solutions, while integrating with major cloud and AI tooling ecosystems.

AI agentsAuto-GPTChat applications

0 likes · 8 min read

Why LangChain Is the Fast‑Growing Framework for LLM‑Powered Apps

Architect

May 29, 2023 · Artificial Intelligence

Understanding Embeddings and Vector Databases for LLM Applications

This article explains what embeddings and vector databases are, how they are generated with models like OpenAI's Ada, why they enable semantic search and help overcome large language model token limits, and demonstrates a practical workflow for retrieving relevant document chunks using cosine similarity.

LLMembeddingsinformation retrieval

0 likes · 7 min read

Understanding Embeddings and Vector Databases for LLM Applications

DataFunSummit

May 24, 2023 · Artificial Intelligence

Digital Humans and XR Interaction Forum at DataFunCon2023

The DataFunCon2023 Digital Humans and XR Interaction Forum showcases the latest advances in digital human technology, multimodal interaction, emotion computing, and LLM‑enhanced AI beings through expert talks, detailed outlines, and audience benefits, highlighting both research progress and practical applications.

AILLMXR

0 likes · 8 min read

Digital Humans and XR Interaction Forum at DataFunCon2023

Network Intelligence Research Center (NIRC)

May 22, 2023 · Artificial Intelligence

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

Microsoft researchers fine‑tuned GPT‑3.x models with LoRA on over 40,000 cloud incident records, evaluated them with six NLP metrics and human interviews, and found that LLMs can generate root‑cause analyses and mitigation steps comparable to BERT models, especially for machine‑detected failures.

AI for operationsGPT-3LLM

0 likes · 8 min read

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

JD Retail Technology

May 16, 2023 · Artificial Intelligence

Deploying and Fine‑Tuning the Alpaca‑LoRA Large Language Model on a Multi‑GPU Server

This guide details the end‑to‑end process of installing GPU drivers, setting up a Python environment, deploying the open‑source Alpaca‑LoRA model, fine‑tuning it with Chinese data on a multi‑GPU server, and performing inference, while highlighting practical challenges and performance observations.

Alpaca-LoRADeep LearningFine-tuning

0 likes · 11 min read

Deploying and Fine‑Tuning the Alpaca‑LoRA Large Language Model on a Multi‑GPU Server

Alibaba Cloud Developer

May 16, 2023 · Artificial Intelligence

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

This article explains why combining large language models with vector databases enables enterprises to create specialized, up‑to‑date chatbots, outlines the underlying principles, describes the ADB‑PG vector‑search capabilities, and provides step‑by‑step implementation details including data processing, indexing, and query examples.

AnalyticDBChatbotLLM

0 likes · 17 min read

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

phodal

May 15, 2023 · Artificial Intelligence

How GitHub Copilot Builds Context: Inside Its Tree‑Sitter Architecture and Token Management

This article reverse‑engineers GitHub Copilot's context construction, detailing visible and invisible processes, prompt composition, token allocation strategies, fast response mechanisms, and future directions for LLM context engineering.

AI code assistantGitHub CopilotLLM

0 likes · 11 min read

How GitHub Copilot Builds Context: Inside Its Tree‑Sitter Architecture and Token Management

AsiaInfo Technology: New Tech Exploration

May 15, 2023 · Artificial Intelligence

How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development

This article analyzes the current landscape of large‑language‑model application frameworks—LangChain, Auto‑GPT, and HuggingGPT—detailing their core capabilities, workflow patterns, limitations, and how they enable developers to integrate LLMs into real‑world products while outlining future platform requirements.

AI frameworksApplication DevelopmentAuto-GPT

0 likes · 17 min read

How LangChain, Auto‑GPT, and HuggingGPT Shape LLM‑Powered Application Development

DataFunSummit

May 4, 2023 · Artificial Intelligence

LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots

A recent study by the LMSYS organization introduces an Elo‑rated, 1v1 battle arena for large language models, ranking open‑source chatbots like Vicuna, Koala, and ChatGLM, while discussing the limitations of traditional benchmarks and the advantages of crowd‑sourced, scalable evaluation.

AI benchmarkingChatbot ArenaElo Rating

0 likes · 7 min read

LLM Ranking Arena: Elo‑Based Competitive Evaluation of Open‑Source Chatbots

Top Architect

Apr 21, 2023 · Artificial Intelligence

Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT

This article explains why and how to fine‑tune the LLaMA‑7B model using the cheap Alpaca‑LoRA approach, covering hardware requirements, dataset preparation, LoRA training, optional model merging and quantization, and provides ready‑to‑run code snippets for single‑ and multi‑GPU setups.

Alpaca-LoRAFine-tuningGPU

0 likes · 10 min read

Fine‑Tuning LLaMA‑7B with Alpaca‑LoRA to Build a Chinese ChatGPT

Continuous Delivery 2.0

Apr 20, 2023 · Artificial Intelligence

AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide

AutoGPT, the latest autonomous AI system, can independently browse the web, use third‑party tools, and execute tasks without human intervention, exemplified by building a React website in minutes, and this article explains its principles, showcases demos, and provides step‑by‑step installation instructions.

AI toolsAgentGPTAutoGPT

0 likes · 10 min read

AutoGPT: Autonomous AI Tool Overview, Demonstrations, and Setup Guide

Programmer DD

Apr 19, 2023 · Artificial Intelligence

Is Auto-GPT a Breakthrough or Overhyped? Uncovering Its Real Limitations

This article examines Auto‑GPT’s hype versus reality, detailing its architecture, cost per task, looping pitfalls, limited memory management, and why its current capabilities fall short of practical production use.

AI agentsAuto-GPTGPT-4

0 likes · 17 min read

Is Auto-GPT a Breakthrough or Overhyped? Uncovering Its Real Limitations

Architect

Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explains their architectures, scaling trade‑offs, and then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca and ChatGLM, providing references for further study.

AIChatGPTGPT-3

0 likes · 27 min read

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

Programmer DD

Apr 7, 2023 · Artificial Intelligence

How Vicuna-13B Achieves ChatGPT‑Level Performance with Low‑Cost Open‑Source Training

The Vicuna-13B open‑source chatbot, fine‑tuned from LLaMA on 70k ShareGPT conversations, matches over 90% of ChatGPT and Google Bard quality while costing only about $300 to train, thanks to memory optimizations, multi‑turn dialogue handling, and cheap spot‑instance training.

AIChatbotLLM

0 likes · 8 min read

How Vicuna-13B Achieves ChatGPT‑Level Performance with Low‑Cost Open‑Source Training

DataFunTalk

Apr 6, 2023 · Artificial Intelligence

A Comprehensive Survey of Large Language Models: Background, Capabilities, Key Technologies, and Future Directions

This article reviews the rapid progress of large language models (LLMs), covering their historical development, scaling laws, emergent abilities, core technologies such as training and alignment, resource ecosystems, evaluation methods, safety concerns, and prospective research challenges.

AI researchAlignmentLLM

0 likes · 21 min read

A Comprehensive Survey of Large Language Models: Background, Capabilities, Key Technologies, and Future Directions

DataFunSummit

Mar 25, 2023 · Artificial Intelligence

How GPT‑4 Has Changed NLP Research: Community Perspectives

A collection of Zhihu answers reflects on how the release of GPT‑4 has reshaped NLP research, dividing the community into LLM‑enthusiasts and skeptics, discussing the relevance of parsing, resource‑driven research directions, and the existential challenges faced by researchers.

AIAcademic CommunityGPT-4

0 likes · 10 min read

How GPT‑4 Has Changed NLP Research: Community Perspectives