Tagged articles
192 articles
Page 2 of 2
iKang Technology Team
iKang Technology Team
Feb 7, 2025 · Artificial Intelligence

Retrieval‑Augmented Generation (RAG) with LangChain: Concepts and Python Implementation

Retrieval‑Augmented Generation (RAG) using LangChain lets developers enhance large language models by embedding user queries, fetching relevant documents from a vector store, inserting the context into a prompt template, and generating concise, source‑grounded answers, offering low‑cost, up‑to‑date knowledge while reducing hallucinations and fine‑tuning expenses.

LLMLangChainRAG
0 likes · 10 min read
Retrieval‑Augmented Generation (RAG) with LangChain: Concepts and Python Implementation
Architect
Architect
Jan 27, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation QA Assistant for an Open Platform

This article details a step‑by‑step design of a RAG‑based intelligent Q&A assistant for the DeWu Open Platform, covering background, RAG fundamentals, system architecture, technology selection, prompt engineering with CO‑STAR, data preprocessing, vector store setup, LangChain.js implementation, similarity search, runnable chaining, debugging, and future prospects.

AILLMLangChain
0 likes · 28 min read
How to Build a Retrieval‑Augmented Generation QA Assistant for an Open Platform
JD Tech Talk
JD Tech Talk
Jan 9, 2025 · Artificial Intelligence

Practical Guide to Building Retrieval‑Augmented Generation (RAG) Applications with LangChain4j in Java

This article provides a step‑by‑step tutorial for Java engineers on using the LangChain4j framework to implement Retrieval‑Augmented Generation (RAG) with large language models, covering concepts, environment setup, code integration, document splitting, embedding, vector‑store operations, and prompt engineering.

EmbeddingJavaLangChain4j
0 likes · 35 min read
Practical Guide to Building Retrieval‑Augmented Generation (RAG) Applications with LangChain4j in Java
JD Cloud Developers
JD Cloud Developers
Jan 9, 2025 · Artificial Intelligence

Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide

This article walks Java developers through the fundamentals of Retrieval‑Augmented Generation (RAG), explains the LangChain4j framework, compares large‑model development with traditional Java coding, and provides step‑by‑step code examples for environment setup, document splitting, embedding, vector‑store operations, and LLM interaction.

EmbeddingJavaLangChain4j
0 likes · 34 min read
Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide
DeWu Technology
DeWu Technology
Jan 6, 2025 · Artificial Intelligence

Design and Implementation of a Retrieval‑Augmented Generation (RAG) Answering Assistant for the Dewu Open Platform

The paper describes building a Retrieval‑Augmented Generation assistant for the Dewu Open Platform that leverages GPT‑4o‑mini, OpenAI embeddings, Milvus vector store, and LangChain.js to semantically retrieve API documentation, structure user queries, and generate accurate, JSON‑formatted answers, thereby reducing manual support and hallucinations.

AILLMLangChain
0 likes · 28 min read
Design and Implementation of a Retrieval‑Augmented Generation (RAG) Answering Assistant for the Dewu Open Platform
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 15, 2024 · Artificial Intelligence

What Are the Best Practices for Retrieval‑Augmented Generation (RAG)?

This comprehensive study evaluates various components of Retrieval‑Augmented Generation pipelines—including query classification, chunking, embedding models, vector databases, retrieval, re‑ranking, summarization, and generator fine‑tuning—identifies optimal configurations, and proposes best‑practice guidelines for both performance‑maximizing and efficiency‑balanced RAG systems.

Fine-tuningLLMRAG
0 likes · 17 min read
What Are the Best Practices for Retrieval‑Augmented Generation (RAG)?
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 5, 2024 · Artificial Intelligence

How to Build a Financial RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for financial scenarios using Alibaba’s PAI platform—covering knowledge‑base construction with PAI‑Designer, template creation in PAI‑LangStudio, deployment of LLM and embedding models, and linking vector stores for accurate, context‑aware answers.

EmbeddingFinancial AIPAI
0 likes · 17 min read
How to Build a Financial RAG Solution with Alibaba PAI: Step-by-Step Guide
DataFunSummit
DataFunSummit
Nov 27, 2024 · Artificial Intelligence

Applying Large Language Models in Data Management and Risk Control at Ping An One Wallet

This presentation details how Ping An One Wallet leverages large language models across five key areas—current application status, data management, risk control, technical architecture, and a Q&A session—highlighting strategies such as vectorized rule storage, prompt engineering, RAG enhancements, and workflow agents to improve efficiency and accuracy in data governance and fraud detection.

AI ArchitectureData Governancerisk control
0 likes · 16 min read
Applying Large Language Models in Data Management and Risk Control at Ping An One Wallet
21CTO
21CTO
Nov 19, 2024 · Databases

Why Vector Databases Like Milvus Outperform Elasticsearch in Hybrid Search

This article explains how combining dense vector‑based semantic search with traditional keyword matching using a unified vector database such as Milvus delivers superior performance, scalability, and simplicity compared to maintaining separate Elasticsearch and vector‑search stacks.

ElasticsearchHybrid SearchMilvus
0 likes · 9 min read
Why Vector Databases Like Milvus Outperform Elasticsearch in Hybrid Search
ITPUB
ITPUB
Nov 15, 2024 · Databases

Why Vector Databases Matter: Deploying PgVector on PostgreSQL for Scalable AI Retrieval

This article explains the need for vector databases in the AI era, reviews PostgreSQL's extensible ecosystem, compares vector‑database options, provides step‑by‑step PgVector installation and usage, shares operational best practices, performance tuning tips, and real‑world Qunar & Tujia case studies.

AIPostgreSQLRAG
0 likes · 27 min read
Why Vector Databases Matter: Deploying PgVector on PostgreSQL for Scalable AI Retrieval
Architects' Tech Alliance
Architects' Tech Alliance
Nov 12, 2024 · Artificial Intelligence

How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), its four‑step workflow, architecture, and how Intel’s hardware and software optimizations—including vector search, quantized embeddings, and advanced inference extensions—enhance performance, security, and scalability for enterprise LLM applications.

AI inferenceEmbedding QuantizationIntel Optimization
0 likes · 14 min read
How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations
Baidu Tech Salon
Baidu Tech Salon
Nov 11, 2024 · Cloud Native

Baidu Cloud Native Data Platform: Empowering Enterprise AI in the LLM Era

To empower enterprise AI in the LLM era, Baidu Cloud unveils a cloud‑native data platform featuring upgraded databases—PegaDB, GaiaDB 5.0, Vector DB 2.0, Palo 2.0—and integrated services like DBSC 2.0, EDAP 2.0, and DBStack, delivering high‑performance, cost‑effective handling of structured, unstructured, and vector data for fine‑tuning and Enterprise RAG.

DBStackData LakehouseEDAP
0 likes · 10 min read
Baidu Cloud Native Data Platform: Empowering Enterprise AI in the LLM Era
JD Tech
JD Tech
Oct 31, 2024 · Artificial Intelligence

Design and Implementation of the Logistics Intelligent Robot “Yunli XiaoZhi” Powered by Large Language Models

The article details the development of Yunli XiaoZhi, an AI‑driven logistics chatbot that combines knowledge‑base Q&A, data‑analysis, proactive alerts and report‑pushing to streamline SOP access, reduce manual query effort, and improve operational efficiency for operators, carriers and drivers.

AI chatbotKnowledge BaseRAG
0 likes · 22 min read
Design and Implementation of the Logistics Intelligent Robot “Yunli XiaoZhi” Powered by Large Language Models
JavaEdge
JavaEdge
Oct 15, 2024 · Artificial Intelligence

Build a Real‑Time Search & Bazi AI Agent with LangChain & FastAPI

This tutorial walks through creating a LangChain tool‑calling agent that combines a real‑time web search tool, a Qdrant vector store for local knowledge retrieval, and a custom Bazi fortune‑telling service, all wrapped in a FastAPI application for interactive use.

AI AgentFastAPILangChain
0 likes · 15 min read
Build a Real‑Time Search & Bazi AI Agent with LangChain & FastAPI
JD Cloud Developers
JD Cloud Developers
Sep 29, 2024 · Artificial Intelligence

Build a Local AI Q&A System with Java, Ollama, and LangChain4J

This article walks through building a local AI question‑answer system using Java, Ollama, LangChain4J, embeddings, and a Chroma vector database, covering LLM fundamentals, embedding techniques, RAG architecture, setup steps, Maven dependencies, and sample code to retrieve and answer queries.

AIEmbeddingJava
0 likes · 19 min read
Build a Local AI Q&A System with Java, Ollama, and LangChain4J
DaTaobao Tech
DaTaobao Tech
Sep 20, 2024 · Databases

Database Technology Evolution: From Hierarchical to Vector Databases

The article chronicles the evolution of database technology from early hierarchical and network models through relational, column‑store, document, key‑value, graph, time‑series, HTAP, and finally vector databases, detailing each system’s architecture, strengths, limitations, typical uses, and future trends toward specialization, distributed cloud‑native designs, and AI‑driven applications.

HBaseHTAPInfluxDB
0 likes · 52 min read
Database Technology Evolution: From Hierarchical to Vector Databases
DataFunTalk
DataFunTalk
Sep 20, 2024 · Databases

Technical Paper Summaries on Graph Databases, Vector Databases, and Real-Time Data Warehousing

This article compiles concise English summaries of several technical papers covering Xiaohongshu's REDgraph graph database, DingoDB vector database, Tianqiong autonomous data platform, Douyin's real‑time data warehouse, financial‑grade data warehousing, Alibaba Cloud ClickHouse Serverless offering, best practices in financial data governance, and 58.com user‑profile data warehouse construction.

Big DataData Warehousegraph database
0 likes · 5 min read
Technical Paper Summaries on Graph Databases, Vector Databases, and Real-Time Data Warehousing
DataFunTalk
DataFunTalk
Sep 19, 2024 · Databases

Technical Topics Overview from DataFun Summit: Graph Database, Vector Database, Real-time Data Warehouse, and Cloud‑Native Solutions

The article presents a collection of technical overviews—including a graph database for distributed queries, a next‑generation vector database, real‑time data warehouse architectures at Douyin and Ant Group, a cloud‑native ClickHouse service, and best practices for financial data warehousing—while also explaining how to obtain the related e‑book.

Big DataCloud Nativegraph database
0 likes · 4 min read
Technical Topics Overview from DataFun Summit: Graph Database, Vector Database, Real-time Data Warehouse, and Cloud‑Native Solutions
ITPUB
ITPUB
Sep 18, 2024 · Databases

Why Vector Databases Are the Next Big Thing in GenAI Applications

The article examines how vector databases have become the most popular database type in the past three years, why they are essential for handling unstructured data in GenAI, compares proprietary and multi‑model solutions, and outlines future trends and practical deployment considerations.

AI applicationsDatabase TrendsGenAI
0 likes · 10 min read
Why Vector Databases Are the Next Big Thing in GenAI Applications
DataFunTalk
DataFunTalk
Sep 17, 2024 · Databases

Overview of Recent Advances in Graph, Vector, and Real-Time Data Warehouse Technologies

This article presents a collection of technical abstracts covering graph database parallel query optimization, next‑generation vector databases, real‑time data warehouse architectures, and cloud‑native analytics solutions, while also providing instructions for obtaining the full e‑book via a WeChat public account.

Big DataCloud NativeData Warehouse
0 likes · 5 min read
Overview of Recent Advances in Graph, Vector, and Real-Time Data Warehouse Technologies
Code Mala Tang
Code Mala Tang
Sep 12, 2024 · Artificial Intelligence

Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js

This article explains the core concepts of Retrieval‑Augmented Generation (RAG), walks through its implementation steps with LangChain.js—including text chunking, embedding, storage, retrieval, and generation—and showcases practical use cases, challenges, and best practices for building reliable AI‑powered applications.

AI applicationsEmbeddingLLM
0 likes · 16 min read
Boost LLM Accuracy with Retrieval‑Augmented Generation Using LangChain.js
Baidu Geek Talk
Baidu Geek Talk
Sep 11, 2024 · Databases

Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB

This article examines the 70‑year evolution of databases, explains how large‑model AI drives the rise of vector databases and Retrieval‑Augmented Generation (RAG), outlines the four‑stage RAG workflow, compares Baidu’s self‑built VectorDB with open‑source alternatives, and showcases real‑world deployments that highlight performance, scalability, and enterprise benefits.

AIDatabase ArchitectureRAG
0 likes · 16 min read
Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB
21CTO
21CTO
Sep 7, 2024 · Artificial Intelligence

Why AI Databases Are the Next Big Leap for Vector Search and Multimodal Data

The article explains how AI databases combine structured, unstructured, and vector data, integrate machine‑learning, NLP, and generative models, and why platforms like Vespa are emerging as open‑source solutions to meet the performance and scalability demands of modern generative AI applications.

AI DatabaseMultimodal DataVespa
0 likes · 8 min read
Why AI Databases Are the Next Big Leap for Vector Search and Multimodal Data
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Sep 5, 2024 · Databases

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

This article reviews the 70‑year evolution of databases, explains how vector databases and Retrieval‑Augmented Generation (RAG) are reshaping AI applications, and details Baidu Intelligent Cloud's VectorDB architecture, performance advantages, real‑world use cases, and future trends in data engineering.

AIDatabase ArchitectureDistributed Systems
0 likes · 16 min read
How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024
DaTaobao Tech
DaTaobao Tech
Aug 30, 2024 · Artificial Intelligence

Overview of Large Model Application Development Platforms: LangChain, Dify, Flowise, and Coze

The article reviews open‑source and commercial large‑model development platforms—LangChain, Dify, Flowise, and Coze—detailing their architectures, low‑code visual tools, model integrations, extensibility, and a step‑by‑step Dify example, and concludes they are essential infrastructure for rapid AI application deployment.

AI application developmentDifyFlowise
0 likes · 13 min read
Overview of Large Model Application Development Platforms: LangChain, Dify, Flowise, and Coze
JD Tech
JD Tech
Aug 23, 2024 · Artificial Intelligence

AI-Powered Automated Exam Generation for Aviation Maintenance Training

This article describes an AI-driven solution that uses vector databases and large language models to automatically generate, evaluate, and maintain training exam questions for aviation maintenance personnel, addressing high document volume, frequent updates, and low training effectiveness.

AIexam generationtraining automation
0 likes · 14 min read
AI-Powered Automated Exam Generation for Aviation Maintenance Training
Volcano Engine Developer Services
Volcano Engine Developer Services
Aug 20, 2024 · Databases

How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs

RAG technology leverages vector databases to provide context‑aware answers without updating model parameters, and this article explores how cloud search teams integrate multiple vector algorithms, balance cost, stability and latency, and adopt open‑source solutions like OpenSearch to build scalable, enterprise‑grade retrieval systems.

AIDiskANNOpenSearch
0 likes · 21 min read
How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs
21CTO
21CTO
Aug 17, 2024 · Artificial Intelligence

Vector Store vs Vector Database: Which Powers Your AI Apps Better?

This guide explains the differences between vector stores and vector databases, covering vector embeddings, performance, scalability, integration, and ideal use‑cases, helping developers choose the right tool—or a hybrid approach—for AI applications.

AI embeddingsVector Storescalable architecture
0 likes · 12 min read
Vector Store vs Vector Database: Which Powers Your AI Apps Better?
Alibaba Cloud Native
Alibaba Cloud Native
Aug 13, 2024 · Cloud Native

How to Build an AI Cache WASM Plugin for Higress Gateway

This guide explains how to set up a Higress gateway, compile a WebAssembly AI cache plugin, integrate Redis and DashVector for semantic caching of large‑model requests, and provides complete configuration and code examples for end‑to‑end deployment.

AI CacheHigressWebAssembly
0 likes · 16 min read
How to Build an AI Cache WASM Plugin for Higress Gateway
Open Source Tech Hub
Open Source Tech Hub
Jul 31, 2024 · Artificial Intelligence

Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges

This article explains the fundamentals of large language models, artificial general intelligence, AI-generated content, AI agents, retrieval‑augmented generation, knowledge bases, multimodal processing, fine‑tuning, alignment, tokens, vectors, and related tools, highlighting their capabilities, limitations, and practical considerations.

AI AgentFine-tuningLLM
0 likes · 14 min read
Understanding LLMs, AI Agents, and Retrieval-Augmented Generation: Key Concepts and Challenges
AntData
AntData
Jul 12, 2024 · Databases

Recent Advances in Vector Databases Presented at SIGMOD 2024

This article reviews the latest vector database research showcased at SIGMOD 2024, covering system designs such as Starling, Vexless, RaBitQ, and ACORN, and discusses current academic hotspots including query processing, index structures, optimization techniques, and hardware acceleration for large‑scale similarity search.

AISIGMOD 2024indexing
0 likes · 20 min read
Recent Advances in Vector Databases Presented at SIGMOD 2024
System Architect Go
System Architect Go
Jul 4, 2024 · Artificial Intelligence

Optimizing Image Search System Architecture with Client‑Side Feature Extraction Using MobileNet

This article explains the architecture of an image‑search system that extracts feature vectors, stores them in a vector database, and performs similarity queries, then proposes an optimized design that offloads feature extraction to a lightweight MobileNet model running in the browser, reducing latency, server load, and component complexity.

MobileNetSystem ArchitectureTensorFlow.js
0 likes · 9 min read
Optimizing Image Search System Architecture with Client‑Side Feature Extraction Using MobileNet
Data Thinking Notes
Data Thinking Notes
Jun 20, 2024 · Artificial Intelligence

Leveraging LLMs for Data: Embedding Search, Knowledge Bases, Text2SQL, and EDA

This article explores how large language models can transform data workflows by using embeddings for semantic search, building private domain knowledge bases, generating SQL code from natural language with visualized results, and enhancing exploratory data analysis, outlining practical steps and benefits for enterprises.

EDAEmbeddingKnowledge Base
0 likes · 7 min read
Leveraging LLMs for Data: Embedding Search, Knowledge Bases, Text2SQL, and EDA
21CTO
21CTO
Jun 7, 2024 · Artificial Intelligence

10 Essential Tools for Building a Modern AI Data Lake Architecture

This article outlines ten critical components of a modern data lake reference architecture for AI/ML, detailing each function, the supporting vendor tools and open‑source libraries, and how they enable scalable storage, MLOps, distributed training, model hubs, vector search, and data visualization.

AIData LakeDistributed Training
0 likes · 14 min read
10 Essential Tools for Building a Modern AI Data Lake Architecture
Sohu Tech Products
Sohu Tech Products
Jun 5, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

The article outlines LLM issues such as hallucination, outdated knowledge, and data privacy, then explains Retrieval‑Augmented Generation—detailing its data‑preparation and query‑time retrieval workflow, demonstrates a full LangChain implementation, and contrasts RAG with fine‑tuning as complementary strategies for up‑to‑date, grounded responses.

LLMLangChainPrompt engineering
0 likes · 15 min read
Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation
JD Retail Technology
JD Retail Technology
Jun 4, 2024 · Databases

How to Deploy and Query JD’s Open‑Source Vearch Vector Database for LLM Retrieval

This article walks through the practical use of JD’s self‑developed Vearch vector database—covering cluster creation, space setup, data insertion, and both text and vector search—illustrating how it integrates with LangChain and OpenAI embeddings to enable retrieval‑augmented generation for large language models.

EmbeddingLLM RetrievalLangChain
0 likes · 16 min read
How to Deploy and Query JD’s Open‑Source Vearch Vector Database for LLM Retrieval
JD Tech
JD Tech
May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI EngineeringAI agentsLLM
0 likes · 41 min read
Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications
JD Retail Technology
JD Retail Technology
May 27, 2024 · Artificial Intelligence

Automating Test Case Generation with Large Language Models and LangChain

This article describes how large language models and the LangChain framework can be combined with PDF parsing, text chunking, memory management, and a vector database to automatically generate software test cases, achieving significant efficiency gains while outlining implementation details, results, and future challenges.

AILangChainPython
0 likes · 10 min read
Automating Test Case Generation with Large Language Models and LangChain
Code Ape Tech Column
Code Ape Tech Column
May 25, 2024 · Artificial Intelligence

Introducing Spring AI: Integrating Artificial Intelligence into Spring Boot Applications

Spring AI brings artificial‑intelligence capabilities to the Spring Boot ecosystem, offering model support, vector‑database integration, SQL‑like filtering, and easy Maven configuration, enabling Java developers to add generative AI, semantic search, and AI‑driven image generation to their backend applications.

JavaSpring Bootartificial intelligence
0 likes · 6 min read
Introducing Spring AI: Integrating Artificial Intelligence into Spring Boot Applications
DataFunTalk
DataFunTalk
May 9, 2024 · Databases

ByteHouse Vector Search Technical Guide: Architecture, Design, and Performance Optimizations

This guide explains ByteHouse’s high‑performance vector search capabilities, covering the background of vector retrieval for LLMs, the limitations of its existing skip‑index architecture, the new vector‑index design with HNSW and IVF, query‑time optimizations, performance benchmarks against Milvus, and future development plans.

ByteHouseLLMindexing
0 likes · 8 min read
ByteHouse Vector Search Technical Guide: Architecture, Design, and Performance Optimizations
AntTech
AntTech
Apr 26, 2024 · Databases

Data Processing Technologies in the AI Era: Trends and Integration of Vector and Relational Databases

The talk explores how the rapid growth of multimodal data and large language models is reshaping data processing, highlighting three key trends—online‑offline integration, vector‑relational database convergence, and the fusion of data processing with AI computation—while presenting practical solutions and future visions for unified data‑AI ecosystems.

AIBig DataHTAP
0 likes · 12 min read
Data Processing Technologies in the AI Era: Trends and Integration of Vector and Relational Databases
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

Building a RAG Application with Baidu Vector Database and Qianfan Embedding

This tutorial walks through building a Retrieval‑Augmented Generation application by setting up Baidu’s Vector Database and Qianfan embedding service, configuring credentials, creating a document database and vector table, loading and chunking PDFs, generating embeddings, storing them, and performing scalar, vector and hybrid similarity searches, ready for integration with Wenxin LLM for answer generation.

AI applicationsBaidu QianfanEmbedding
0 likes · 11 min read
Building a RAG Application with Baidu Vector Database and Qianfan Embedding
Eric Tech Circle
Eric Tech Circle
Mar 24, 2024 · Artificial Intelligence

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

This guide compares Ollama and Hugging Face for running large language models locally, detailing API and local execution methods, installation steps, model selection, resource requirements, integration with AnythingLLM, container deployment, embedding and vector store setup, and practical observations on performance and limitations.

AnythingLLMDockerEmbedding
0 likes · 15 min read
Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison
Alimama Tech
Alimama Tech
Mar 20, 2024 · Artificial Intelligence

Dolphin VectorDB: A High-Performance Vector Database for AI Applications

Dolphin VectorDB, created by Alibaba’s Alimama team, is a high‑performance, scalable vector database that delivers fast, cost‑effective AI‑driven vector storage and real‑time updates, supporting multiple query modes and powering applications such as content risk control, marketing Q&A, and audience selection, with ongoing enhancements for multimodal computing.

AI applicationsReal-time Updatescontent risk control
0 likes · 13 min read
Dolphin VectorDB: A High-Performance Vector Database for AI Applications
Ops Development & AI Practice
Ops Development & AI Practice
Mar 16, 2024 · Databases

Why ChromaDB Is Becoming the Go-To Vector Store for AI Applications

ChromaDB is an open‑source, AI‑native vector database that efficiently stores, indexes, and retrieves high‑dimensional embeddings, offering fast similarity search, easy integration via flexible APIs, strong scalability, and active community support, making it suitable for recommendation systems, NLP, and image‑recognition workloads.

AIChromaDBembeddings
0 likes · 5 min read
Why ChromaDB Is Becoming the Go-To Vector Store for AI Applications
Baidu Geek Talk
Baidu Geek Talk
Mar 13, 2024 · Artificial Intelligence

Understanding Retrieval-Augmented Generation (RAG) and Building a Personal Knowledge Base with ERNIE SDK and LangChain

The article explains Retrieval-Augmented Generation (RAG), its workflow, advantages, comparison with fine-tuning, and provides a step-by-step implementation using Baidu's ERNIE SDK, LangChain, and ChromaDB to build a personal knowledge base that answers queries with retrieved context.

AIERNIE SDKKnowledge Base
0 likes · 13 min read
Understanding Retrieval-Augmented Generation (RAG) and Building a Personal Knowledge Base with ERNIE SDK and LangChain
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 25, 2024 · Artificial Intelligence

Pinecone Vector Database and Embedding Model Summary from DeepLearning.AI’s AI Course

This article reviews the author’s hands‑on experience with Pinecone’s serverless vector database, various embedding and generation models such as all‑MiniLM‑L6‑v2, text‑embedding‑ada‑002, clip‑ViT‑B‑32, and GPT‑3.5‑turbo‑instruct, and demonstrates how they are applied to semantic search, RAG, recommendation, hybrid, and facial similarity tasks using Python code examples.

AIPineconePython
0 likes · 9 min read
Pinecone Vector Database and Embedding Model Summary from DeepLearning.AI’s AI Course
Baidu Geek Talk
Baidu Geek Talk
Feb 7, 2024 · Artificial Intelligence

Design and Implementation of a Knowledge-Base Intelligent Q&A System for Database Operations Using Large Models

The paper details Baidu Intelligent Cloud’s design and deployment of a domain‑specific knowledge‑base Q&A system for database operations, combining prompt‑engineered LLMs with hybrid vector‑search using LangChain, BES vector store, and custom ingestion, addressing recall, token limits, and hallucination challenges across dashboard and IM bot interfaces.

AIDatabase operationsKnowledge Base
0 likes · 16 min read
Design and Implementation of a Knowledge-Base Intelligent Q&A System for Database Operations Using Large Models
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jan 31, 2024 · Artificial Intelligence

How Baidu Built an 80% Accurate AI-Powered Database Ops Knowledge Base

This article details Baidu Intelligent Cloud's database operations team’s end‑to‑end design of an AI‑driven knowledge‑base Q&A system, covering background, architecture, technical choices, module implementation, key challenges such as vector‑search recall and token limits, and real‑world deployment scenarios.

AIPrompt engineeringvector database
0 likes · 18 min read
How Baidu Built an 80% Accurate AI-Powered Database Ops Knowledge Base
DataFunTalk
DataFunTalk
Dec 29, 2023 · Artificial Intelligence

Enterprise Knowledge Assistant: Leveraging Vector Databases and Large Language Models

This article explores the emerging enterprise knowledge assistant paradigm in the era of large models, detailing traditional knowledge management challenges, solution architecture using vector databases and LLMs, core technologies such as ETL pipelines, reranking, secure fine‑tuning, and future prospects for intelligent enterprise applications.

Enterprise AILLM fine-tuningknowledge management
0 likes · 11 min read
Enterprise Knowledge Assistant: Leveraging Vector Databases and Large Language Models
Baidu Geek Talk
Baidu Geek Talk
Dec 11, 2023 · Industry Insights

How AI and Cloud Are Redefining the Database Landscape – Baidu’s Journey and Future Trends

This article traces the 70‑year evolution of databases, examines how the rise of AIGC, cloud computing and AI native architectures are reshaping the industry, and details Baidu Smart Cloud's historical milestones, flagship products such as GaiaDB and PegaDB, and the emerging trends that will drive the next generation of database solutions.

AICloud Nativecloud computing
0 likes · 22 min read
How AI and Cloud Are Redefining the Database Landscape – Baidu’s Journey and Future Trends
Architect
Architect
Nov 19, 2023 · Artificial Intelligence

Why AutoGPT Abandoned Vector Databases – A Deep Dive into Simpler Memory Strategies

The article examines AutoGPT's shift away from vector databases, detailing the original vision of using embeddings for long‑term memory, the performance calculations that exposed unnecessary complexity, the adoption of JSON‑based storage, and the emerging trend of specialized multi‑agent architectures.

AI agentsAutoGPTMemory Management
0 likes · 9 min read
Why AutoGPT Abandoned Vector Databases – A Deep Dive into Simpler Memory Strategies
DataFunTalk
DataFunTalk
Nov 17, 2023 · Databases

Cost as the Primary Driver of Vector Database Industry Development

Vector databases gain traction because they dramatically reduce storage, learning, scaling, and large‑model limitations costs by enabling semantic similarity search, RAG‑based prompt optimization, efficient high‑dimensional indexing, and cloud‑native architectures, making them essential for modern AI applications despite the promotional context.

AIBig DataRAG
0 likes · 11 min read
Cost as the Primary Driver of Vector Database Industry Development
DataFunTalk
DataFunTalk
Oct 30, 2023 · Databases

Engineering Practices and Evolution of Douyin’s Cloud‑Native Vector Database

This article outlines Douyin’s step‑by‑step engineering evolution of its cloud‑native vector database, covering the background of vector search, core concepts, algorithmic optimizations, storage‑compute separation, streaming updates, multi‑tenant orchestration, and future applications such as large language model integration.

ANNCloud NativeDouyin
0 likes · 17 min read
Engineering Practices and Evolution of Douyin’s Cloud‑Native Vector Database
Volcano Engine Developer Services
Volcano Engine Developer Services
Sep 19, 2023 · Databases

Unlocking AI with Vector Databases: Architecture, Optimization, and Real-World Cases

This article explores how vector databases serve as the memory layer for large AI models, detailing their distributed, compute‑separated architecture, performance optimizations, hybrid vector‑scalar retrieval, and practical deployments across TikTok’s ecosystem such as image search, intelligent Q&A, and multimodal AI services.

AIKnowledge Basedistributed architecture
0 likes · 11 min read
Unlocking AI with Vector Databases: Architecture, Optimization, and Real-World Cases
phodal
phodal
Sep 17, 2023 · Artificial Intelligence

How Chocolate Factory’s Codebase AI Assistant Boosts Code Search with RAG

This article explains the design and implementation of the Codebase AI Assistant in the Chocolate Factory framework, covering its problem‑solving DSL, retrieval‑augmented generation pipeline, indexing and querying stages, prompt strategies, and code‑splitting rules that together enable efficient semantic code search.

AI AssistantKotlinRetrieval Augmented Generation
0 likes · 11 min read
How Chocolate Factory’s Codebase AI Assistant Boosts Code Search with RAG
Architect
Architect
Aug 31, 2023 · Artificial Intelligence

Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2

This tutorial explains how to leverage generative AI tools—including LLMs, embedding models, vector databases, and the LangChain framework—to create a custom chatbot that answers user queries using a knowledge base, with step‑by‑step code examples for Google Colab.

ChatbotEmbeddingLLM
0 likes · 15 min read
Building a Custom LLM Chatbot with LangChain, ChromaDB, and LLaMA‑2
DataFunSummit
DataFunSummit
Aug 30, 2023 · Databases

Milvus: An AI‑Native Vector Database for Large Language Model Applications

This article introduces Milvus, an open‑source, cloud‑native vector database designed for AI workloads, explains how it helps mitigate large‑model hallucinations, outlines its CVP architecture, showcases performance benchmarks, and explores diverse application scenarios and future directions for LLM‑vector database integration.

AILLMMilvus
0 likes · 13 min read
Milvus: An AI‑Native Vector Database for Large Language Model Applications
dbaplus Community
dbaplus Community
Aug 26, 2023 · Databases

What Is a Vector Database? A Simple Guide from Kids to Engineers

This article demystifies vector databases by first explaining the concept with a five‑year‑old analogy, then expanding to technical details for developers, covering how embeddings work, the differences from relational databases, ANN search, indexing, similarity metrics, and why vector stores outperform raw NumPy arrays for large‑scale similarity retrieval.

ANNdatabasesmachine learning
0 likes · 9 min read
What Is a Vector Database? A Simple Guide from Kids to Engineers
Sohu Tech Products
Sohu Tech Products
Aug 23, 2023 · Artificial Intelligence

Engineering GPT Applications: Capabilities, Limitations, and Solutions

The guide explains GPT’s core capabilities—natural language mastery, domain reasoning, and code generation—while detailing its limits such as prompt sensitivity, token caps, and lack of memory, then offers engineering workarounds like systematic prompting, chain‑of‑thought, external memory, tool integration, safety checks, and a six‑layer architecture for building robust commercial AI applications.

AI Application ArchitectureGPTPrompt engineering
0 likes · 20 min read
Engineering GPT Applications: Capabilities, Limitations, and Solutions
ITPUB
ITPUB
Aug 18, 2023 · Databases

Key Takeaways from DTCC2023: Vector Databases, Data Privacy, and Intelligent Ops

The 14th China Database Technology Conference (DTCC2023) showcased cutting‑edge advances in vector databases, data privacy, MySQL security, and AI‑driven intelligent operations, featuring insights from industry leaders at Huawei, Tencent, eBay, Bilibili and more.

AIBig DataDatabase Security
0 likes · 10 min read
Key Takeaways from DTCC2023: Vector Databases, Data Privacy, and Intelligent Ops
DataFunSummit
DataFunSummit
Aug 3, 2023 · Artificial Intelligence

Integrating Vector Databases with Large Language Models for Enterprise AI Applications

The article explains how combining vector databases with large language models can help governments and enterprises leverage massive private data for AI, covering semantic search, approximate nearest neighbor techniques, alignment challenges across modalities, and future directions for fine‑grained data integration.

AIapproximate nearest neighborlarge language model
0 likes · 7 min read
Integrating Vector Databases with Large Language Models for Enterprise AI Applications
Tencent Cloud Developer
Tencent Cloud Developer
Jul 24, 2023 · Artificial Intelligence

Building an Internal Code Knowledge Base with Embedding and AST Interpreter

The author builds an internal code knowledge base for the TDesign Vue‑Next library by scraping documentation, chunking and embedding texts with OpenAI’s ada model into a vector store, then retrieving relevant chunks for LLM answers, and enhances context continuity using a JavaScript AST interpreter, achieving up to 90 % query accuracy and a 20 % productivity boost.

ASTEmbeddingKnowledge Base
0 likes · 11 min read
Building an Internal Code Knowledge Base with Embedding and AST Interpreter
Tencent Tech
Tencent Tech
Jul 4, 2023 · Databases

What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?

This article explains what vectors and vector databases are, how they differ from traditional databases, their core technologies, their relationship with large language models, market trends, and details of Tencent Cloud VectorDB’s capabilities, architecture, real‑world applications, and future competitive challenges.

AIEmbeddingLLM
0 likes · 10 min read
What Is a Vector Database and Why Is It the AI Engine’s Secret Weapon?
php Courses
php Courses
Jul 3, 2023 · Databases

Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)

Vector databases store and search large-scale vector data, and in 2023 the five leading solutions—Chroma, Pinecone, Weaviate, Milvus, and Faiss—offer scalable, high-performance options for applications such as LLM-driven services, audio search, recommendation systems, image/video analysis, and semantic retrieval across various industries.

AILLMdata storage
0 likes · 4 min read
Top 5 Revolutionary Vector Databases Transforming Machine Learning and Similarity Search (2023)
IT Services Circle
IT Services Circle
Jun 26, 2023 · Databases

Understanding Vector Databases and Embedding Techniques

The article explains what vector databases are, how vectors and embeddings work, the main embedding methods such as matrix factorization, NLP and graph techniques, the characteristics and high‑availability requirements of vector databases, and common AI‑driven application scenarios like semantic search, recommendation and anomaly detection.

AIEmbeddingmachine learning
0 likes · 8 min read
Understanding Vector Databases and Embedding Techniques
Alibaba Cloud Developer
Alibaba Cloud Developer
May 16, 2023 · Artificial Intelligence

How to Build a Company‑Specific Chatbot with LLMs and Vector Databases

This article explains why combining large language models with vector databases enables enterprises to create specialized, up‑to‑date chatbots, outlines the underlying principles, describes the ADB‑PG vector‑search capabilities, and provides step‑by‑step implementation details including data processing, indexing, and query examples.

AnalyticDBChatbotLLM
0 likes · 17 min read
How to Build a Company‑Specific Chatbot with LLMs and Vector Databases
DeWu Technology
DeWu Technology
Mar 17, 2023 · Artificial Intelligence

Prompt‑Ops and LangChain: Engineering LLM Applications

Prompt‑Ops frameworks like LangChain let developers turn pre‑trained LLMs into versatile applications by abstracting model calls, chaining prompts, integrating tools, managing memory, and handling private data, while addressing challenges such as nondeterminism, version control, and prompt injection in production environments.

AI applicationsAgentLLM
0 likes · 15 min read
Prompt‑Ops and LangChain: Engineering LLM Applications
21CTO
21CTO
Aug 5, 2022 · Databases

Why Milvus 2.1 Is Revolutionizing Vector Databases for Unstructured Data

Milvus 2.1, the newly released open‑source vector database, brings trillion‑byte storage, sub‑5 ms search latency, and seamless scalability for both structured and unstructured data, positioning it as a game‑changing infrastructure for similarity search across industries.

Milvusunstructured datavector database
0 likes · 4 min read
Why Milvus 2.1 Is Revolutionizing Vector Databases for Unstructured Data
360 Quality & Efficiency
360 Quality & Efficiency
Jul 1, 2022 · Artificial Intelligence

Building an End-to-End Image Search System with Milvus and VGG

This article presents a complete image‑search solution that extracts visual features with the VGG16 model, stores them in the Milvus vector database, and provides a set of web APIs for training, querying, counting, searching, and deleting image vectors, all deployed via Docker containers.

AIDeep LearningMilvus
0 likes · 7 min read
Building an End-to-End Image Search System with Milvus and VGG
DataFunSummit
DataFunSummit
Mar 29, 2022 · Databases

AI-Driven Unstructured Data Analysis and Retrieval with Milvus and Towhee

This article explains how the Milvus vector database and the Towhee embedding framework together enable large‑scale, high‑throughput semantic analysis and retrieval of unstructured data such as images, video, and audio by leveraging AI‑powered vectorization and search pipelines.

AIMilvusTowhee
0 likes · 13 min read
AI-Driven Unstructured Data Analysis and Retrieval with Milvus and Towhee
Code DAO
Code DAO
Dec 14, 2021 · Artificial Intelligence

Semantic Search on Wikipedia with Weaviate, GraphQL, Sentence‑BERT, and BERT Q&A

This article walks through building a large‑scale semantic search system on the English Wikipedia using the Weaviate vector database, GraphQL queries, and pre‑trained Sentence‑BERT and BERT Q&A models, covering dataset preparation, schema design, import pipelines, query examples, and production deployment strategies.

GraphQLSentence-BERTWeaviate
0 likes · 8 min read
Semantic Search on Wikipedia with Weaviate, GraphQL, Sentence‑BERT, and BERT Q&A
DataFunTalk
DataFunTalk
Aug 2, 2021 · Databases

From Text Search to Vector Search: Generalizing Unstructured Data Retrieval

The article explains why traditional text‑based search engines like ElasticSearch struggle with modern multimodal data, introduces vector databases that store implicit semantic embeddings, and proposes a generalized search architecture that decouples data‑to‑vector mapping from the engine while leveraging clustering or graph indexes for similarity search.

AIEmbeddinginformation retrieval
0 likes · 12 min read
From Text Search to Vector Search: Generalizing Unstructured Data Retrieval
System Architect Go
System Architect Go
Apr 11, 2020 · Artificial Intelligence

How to Build an Image Search Engine with CNN and Milvus: A Step‑by‑Step Guide

This article walks through the complete engineering workflow for building an image‑search system, covering CNN‑based feature extraction with VGG16, vector normalization, image preprocessing, black‑edge removal, and practical deployment of the Milvus vector database including hardware requirements, capacity planning, collection/partition design, and search result handling.

CNNMilvusPython
0 likes · 11 min read
How to Build an Image Search Engine with CNN and Milvus: A Step‑by‑Step Guide