Tagged articles

FAISS

37 articles · Page 1 of 1

Jun 25, 2026 · Artificial Intelligence

Transform a Single RAG Pipeline with LangGraph – Agent Picks Vector, Graph or Web Search

This article demonstrates how to use LangGraph to build a state‑machine‑based hybrid RAG agent that routes each query to the most suitable retriever—vector similarity, graph traversal, or web search—through a Router, and then validates answers with grading, rewriting, generation, and hallucination‑checking components.

Agentic RetrievalFAISSLLM

0 likes · 12 min read

Transform a Single RAG Pipeline with LangGraph – Agent Picks Vector, Graph or Web Search

AI Architect Hub

May 3, 2026 · Artificial Intelligence

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

This article compares five popular vector databases—Chroma, Milvus, Weaviate, Qdrant, and FAISS—detailing their positions, strengths, weaknesses, suitable scenarios, a selection‑dimension matrix, common pitfalls, code implementations for a unified RAG pipeline, best‑practice recommendations, and thought questions to guide engineers in choosing and migrating vector stores.

ChromaFAISSMilvus

0 likes · 23 min read

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

AI Architect Hub

Apr 27, 2026 · Artificial Intelligence

Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes

The article explains why brute‑force vector search is painfully slow, introduces Flat, IVF, and HNSW index structures, compares their speed, memory and accuracy, shows common pitfalls, provides production‑grade Python code, and presents benchmark results that demonstrate HNSW’s superior speed‑accuracy trade‑off.

AIFAISSHNSW

0 likes · 12 min read

Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes

Data STUDIO

Apr 2, 2026 · Artificial Intelligence

Building a Dual‑Stack Memory Agent: Situational + Semantic Memory for Long‑Term AI Understanding

This tutorial walks through designing and implementing a dual‑stack memory architecture for AI agents—combining episodic vector‑based situational memory with graph‑based semantic memory—using LangChain, FAISS, and Neo4j, and demonstrates a complete end‑to‑end workflow with code examples.

Agent MemoryFAISSKnowledge Graph

0 likes · 14 min read

Building a Dual‑Stack Memory Agent: Situational + Semantic Memory for Long‑Term AI Understanding

Huawei Cloud Developer Alliance

Mar 26, 2026 · Artificial Intelligence

How to Build a Full‑Stack RAG Chatbot Using LangChain, FAISS & Langfuse

This guide walks through an end‑to‑end RAG implementation with LangChain, covering multi‑format document loading, recursive text splitting, embedding selection, FAISS vector storage, ConversationalRetrievalChain setup, prompt engineering, source citation, Langfuse observability, and best‑practice configuration management.

FAISSLLMOpsLangChain

0 likes · 13 min read

How to Build a Full‑Stack RAG Chatbot Using LangChain, FAISS & Langfuse

Wu Shixiong's Large Model Academy

Mar 26, 2026 · Artificial Intelligence

Why Hybrid Retrieval Beats Pure Vector Search: BM25, RRF, and Real‑World Gains

This article explains why combining BM25 with dense vector search using Reciprocal Rank Fusion (RRF) improves recall for both exact‑term and semantic queries in a financial‑insurance document corpus, details the underlying algorithms, parameter choices such as k=60, provides Python implementations, and shows measurable performance gains in production.

BM25FAISSHybrid Retrieval

0 likes · 28 min read

Why Hybrid Retrieval Beats Pure Vector Search: BM25, RRF, and Real‑World Gains

DeepHub IMBA

Mar 18, 2026 · Artificial Intelligence

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

The article analyzes how most RAG pipelines blindly feed retrieved documents to LLMs, introduces CRAG's lightweight evaluator with confidence thresholds, describes its sentence‑level decomposition, filtering, and dual‑knowledge routing, and provides a full implementation walkthrough with a real insurance query example.

CRAGFAISSLLM

0 likes · 13 min read

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

AI Tech Publishing

Feb 19, 2026 · Artificial Intelligence

Add Long-Term Memory to Your Agent with Lightweight RAG (Lesson 5)

This tutorial shows how to equip an AI agent with long‑term memory using Retrieval‑Augmented Generation (RAG), covering the concepts of vector embeddings, FAISS indexing, building and querying a knowledge base, and providing complete Python code examples.

AgentEmbeddingFAISS

0 likes · 13 min read

Add Long-Term Memory to Your Agent with Lightweight RAG (Lesson 5)

BirdNest Tech Talk

Oct 27, 2025 · Artificial Intelligence

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

This article explains how LangChain's Indexing API adds state management and synchronization to the classic load‑split‑embed‑store RAG pipeline, detailing the RecordManager component, the index function workflow, key parameters, implementation considerations, and best‑practice code examples for production‑grade vector stores.

FAISSIndexing APILangChain

0 likes · 12 min read

How LangChain’s Indexing API Enables Efficient Incremental Updates for RAG Systems

BirdNest Tech Talk

Oct 21, 2025 · Artificial Intelligence

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

This article explains what vector stores are, outlines their core workflow of adding, querying, and searching embeddings, compares popular back‑ends like FAISS, Chroma, and Pinecone, and walks through a complete Chinese‑language example using LangChain’s FAISS integration with detailed code and result analysis.

AIFAISSLangChain

0 likes · 10 min read

How Vector Stores Enable Lightning‑Fast Semantic Search in LangChain

Amazon Cloud Developers

Sep 15, 2025 · Artificial Intelligence

Amazon Bedrock Enables One‑Stop SEO: From Keyword Mining to AI‑Generated Content

This article walks through a complete, low‑cost SEO pipeline built on Amazon Bedrock’s generative AI, covering automated keyword extraction, search‑intent analysis, product‑vector matching with FAISS, and AI‑driven content creation within a serverless, event‑driven architecture that scales efficiently while reducing manual effort.

Amazon BedrockFAISSGenerative AI

0 likes · 16 min read

Amazon Bedrock Enables One‑Stop SEO: From Keyword Mining to AI‑Generated Content

Fun with Large Models

Aug 22, 2025 · Artificial Intelligence

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

This tutorial shows how to create a lightweight Retrieval‑Augmented Generation (RAG) system that indexes multiple PDF files, stores their embeddings in a FAISS vector database, and answers user queries through a LangChain agent powered by DashScope embeddings and the DeepSeek‑Chat model, all wrapped in a Streamlit UI.

DashScopeDeepSeekFAISS

0 likes · 13 min read

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

Architect

May 23, 2025 · Artificial Intelligence

How We Won the RAG Challenge: Multi‑Router & Dynamic Knowledge Base Techniques Revealed

This article details the end‑to‑end design, parsing tricks, vector database setup, retrieval strategies, prompt engineering, and LLM reranking that powered the winning solution in a company‑annual‑report question‑answering competition.

FAISSLLMPDF parsing

0 likes · 37 min read

How We Won the RAG Challenge: Multi‑Router & Dynamic Knowledge Base Techniques Revealed

DevOps

Apr 20, 2025 · Artificial Intelligence

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

This article demonstrates how to construct an AI‑powered medical knowledge base for diabetes treatment by preprocessing literature, performing semantic chunking, generating BioBERT embeddings, storing them in a FAISS vector database, and using a RAG framework together with a knowledge graph to retrieve and generate accurate answers.

BioBERTFAISSKnowledge Graph

0 likes · 12 min read

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

AI Algorithm Path

Feb 13, 2025 · Artificial Intelligence

How to Build a Local RAG Knowledge Base with DeepSeek‑R1 and Ollama

This article walks through setting up a local Retrieval‑Augmented Generation (RAG) system using the open‑source DeepSeek‑R1 model run via Ollama, covering installation, model selection, PDF ingestion with LangChain, semantic chunking, FAISS vector store creation, RetrievalQA chain construction, and a Streamlit UI for querying.

DeepSeek-R1FAISSLangChain

0 likes · 8 min read

How to Build a Local RAG Knowledge Base with DeepSeek‑R1 and Ollama

DataFunSummit

Apr 15, 2024 · Artificial Intelligence

Deep Learning Practices for Internet Real‑Estate Recommendation at 58.com

This article details the end‑to‑end deep‑learning pipeline used by 58.com for real‑estate recommendation, covering business background, a six‑layer architecture, vector‑based recall, various embedding and ranking models, multi‑task and multi‑scenario optimization techniques, and future directions for large‑model integration.

Deep LearningFAISSMulti-Task Learning

0 likes · 19 min read

Deep Learning Practices for Internet Real‑Estate Recommendation at 58.com

Meituan Technology Team

Apr 11, 2024 · Artificial Intelligence

GPU-Accelerated Mixed Vector-Scalar Retrieval System for Meituan Takeaway Search

Meituan Waimai’s search team built a GPU‑accelerated, mixed vector‑and‑scalar retrieval engine that supports billions of items, achieving over 99% recall and up to 89% latency reduction by combining pre‑filtering, optimized data layouts, multi‑GPU parallelism, and FP16 precision.

ANNFAISSGPU

0 likes · 20 min read

GPU-Accelerated Mixed Vector-Scalar Retrieval System for Meituan Takeaway Search

Baidu Intelligent Cloud Tech Hub

Aug 31, 2023 · Databases

How Baidu Cloud Transforms Elasticsearch into a High‑Performance Vector Database

In this interview, Baidu Smart Cloud’s big‑data lead Li Li explains how the team combined Elasticsearch with the open‑source Faiss engine, built custom columnar storage, and iteratively optimized their vector‑search service to support billions of high‑dimensional vectors for large‑model applications.

AIBig DataCloud Computing

0 likes · 19 min read

How Baidu Cloud Transforms Elasticsearch into a High‑Performance Vector Database

Architect

Aug 17, 2023 · Backend Development

Design and Implementation of Bilibili's New Customer Service System

This article details Bilibili's transition from a purchased customer‑service platform to a self‑developed system, describing the background, architectural design, core modules such as intelligent QA, seat scheduling, workbench, permission management, the use of Faiss for vector search, and future explorations with large language models, highlighting the technical challenges and solutions across backend development and AI integration.

AIFAISSbackend

0 likes · 22 min read

Design and Implementation of Bilibili's New Customer Service System

Zhuanzhuan Tech

Sep 29, 2022 · Artificial Intelligence

Design and Implementation of ZhiZhuan's Low-Result Search Module with Hybrid Hard and Soft Retrieval

The article details the architecture and techniques of ZhiZhuan's low-result search module, explaining how it combines ElasticSearch hard matching and sBert semantic vector soft matching, along with sophisticated negative sample strategies, to improve recommendation coverage and user experience.

FAISSSearchrecommendation

0 likes · 17 min read

Design and Implementation of ZhiZhuan's Low-Result Search Module with Hybrid Hard and Soft Retrieval

Zhuanzhuan Tech

Sep 21, 2022 · Artificial Intelligence

Vector Retrieval and Product Quantization with Faiss

This article explains the challenges of large‑scale vector retrieval, compares Faiss index types such as brute‑force, graph‑based and product quantization, and details how product quantization works, its memory‑speed trade‑offs, hierarchical quantization, and practical hyper‑parameter tuning.

ANNEmbeddingFAISS

0 likes · 9 min read

Vector Retrieval and Product Quantization with Faiss

Alimama Tech

Aug 24, 2022 · Artificial Intelligence

Distributed High‑Performance Vector Retrieval with gpdb‑faiss‑vector Plugin on Dolphin Engine

The gpdb‑faiss‑vector plugin embeds Facebook’s Faiss library into the Dolphin (Greenplum‑compatible) engine, exposing SQL functions for distributed, high‑performance approximate nearest‑neighbor vector retrieval with caching, parallel search, configurable indexes, and sub‑millisecond latency, enabling scalable recommendation and advertising workloads.

AIFAISSSQL

0 likes · 15 min read

Distributed High‑Performance Vector Retrieval with gpdb‑faiss‑vector Plugin on Dolphin Engine

DeWu Technology

Jul 27, 2022 · Artificial Intelligence

Overview of Nearest Neighbor Search Algorithms

The article reviews how high‑dimensional vector representations in deep‑learning applications require efficient approximate nearest‑neighbor search, comparing K‑d trees, hierarchical k‑means trees, locality‑sensitive hashing, product quantization, and HNSW graphs, and discusses practical FAISS implementations and how algorithm choice depends on data size, recall, latency, and resources.

FAISSHNSWKD-Tree

0 likes · 8 min read

Overview of Nearest Neighbor Search Algorithms

ITPUB

Jul 16, 2022 · Artificial Intelligence

How Huya Live Uses Vector Search and Fine‑Ranking to Power Real‑Time Recommendations

This article explains Huya Live's recommendation architecture, covering business background, system design, vector retrieval challenges and solutions with ScaNN, and the fine‑ranking pipeline, while highlighting performance optimizations, scalability, and future directions for their live‑streaming platform.

FAISSHuya LiveScaNN

0 likes · 11 min read

How Huya Live Uses Vector Search and Fine‑Ranking to Power Real‑Time Recommendations

Laiye Technology Team

Apr 29, 2022 · Artificial Intelligence

Using Faiss for Efficient Vector Similarity Search: Installation, Index Construction, and Performance Optimization

This tutorial explains what Faiss is, how to install it, construct various indexes such as IndexFlatL2, IndexIVFFlat, and IndexIVFPQ, and demonstrates code examples for building and querying vector similarity search pipelines while discussing speed‑accuracy trade‑offs.

AIFAISSIndexing

0 likes · 11 min read

Using Faiss for Efficient Vector Similarity Search: Installation, Index Construction, and Performance Optimization

Code DAO

Apr 18, 2022 · Artificial Intelligence

Transformer‑Based Denoising AutoEncoder (TSDAE) for Job Description Embeddings (Job2Vec)

This article explains how TSDAE, a transformer‑based denoising auto‑encoder, converts noisy job description sentences into robust vector embeddings, details its training process, loss function, dataset preparation, and demonstrates using FAISS for similarity search on the resulting Job2Vec representations.

AutoencoderFAISSNLP

0 likes · 4 min read

Transformer‑Based Denoising AutoEncoder (TSDAE) for Job Description Embeddings (Job2Vec)

Kuaishou Tech

Dec 10, 2021 · Artificial Intelligence

Kuaishou and Tsinghua University Win NeurIPS'21 Billion-Scale ANN Challenge with FAISS‑Optimized KST_ANN Solution

On December 6, Kuaishou and Tsinghua University’s joint team secured first place in the NeurIPS'21 Billion‑Scale Approximate Nearest Neighbor Search Challenge by leveraging a FAISS‑optimized, memory‑efficient KST_ANN algorithm that achieved over 6% higher recall on multiple billion‑scale datasets, showcasing the practical impact of large‑scale vector retrieval in AI‑driven services.

AIANNFAISS

0 likes · 5 min read

Kuaishou and Tsinghua University Win NeurIPS'21 Billion-Scale ANN Challenge with FAISS‑Optimized KST_ANN Solution

DataFunTalk

Sep 19, 2021 · Artificial Intelligence

Second‑hand Housing Recommendation System: Business Background, Vector Recall, Multi‑objective Optimization and Future Plans

This article presents the end‑to‑end practice of a second‑hand housing recommendation system at 58.com and Anjuke, covering business background, embedding‑based vector recall, multi‑objective ranking methods such as ESMM and MMOE, experimental results, and future development directions.

ESMMEmbeddingFAISS

0 likes · 14 min read

Second‑hand Housing Recommendation System: Business Background, Vector Recall, Multi‑objective Optimization and Future Plans

58 Tech

Apr 9, 2021 · Artificial Intelligence

Vectorized Recall and Dual‑Tower Model for Home Page Recommendation at 58.com

This article details how 58.com improved its home‑page recommendation system by introducing vectorized recall with Word2Vec, optimizing negative sampling, deploying FAISS for fast nearest‑neighbor search, and later adopting a dual‑tower deep learning model with user interest features, achieving higher click‑through and conversion rates.

FAISSWord2Vecdual-tower

0 likes · 19 min read

Vectorized Recall and Dual‑Tower Model for Home Page Recommendation at 58.com

Big Data Technology & Architecture

Jun 6, 2020 · Artificial Intelligence

Embedding Techniques and Practices in Tencent Mobile News Recommendation System

This article reviews the concept, history, and practical implementations of embedding—including item, image, and user embeddings—and describes various vector‑based recall strategies such as i2i, u2i, clustering, and deep‑learning models used in Tencent's mobile news recommendation platform.

ClusteringDSSMEmbedding

0 likes · 18 min read

Embedding Techniques and Practices in Tencent Mobile News Recommendation System

iQIYI Technical Product Team

Apr 10, 2020 · Big Data

Video Copyright Detection Solution Using SE-ResNeXt and Faiss in the 2019 CCF Big Data & Computational Intelligence Competition

The iQiyi team “都挺好” tackled the 2019 CCF Video Copyright Detection contest by extracting frame‑level features with SE‑ResNeXt, indexing them with Faiss, aligning temporally via a critical‑path method, and refining copy boundaries using SIFT re‑matching and a sliding‑window approach, ultimately achieving an F1 score of 0.9678 after three iterative stages of model selection, cascade detection, and feature fusion.

CCF competitionFAISSSE-ResNeXt

0 likes · 6 min read

Video Copyright Detection Solution Using SE-ResNeXt and Faiss in the 2019 CCF Big Data & Computational Intelligence Competition

58 Tech

Mar 30, 2020 · Artificial Intelligence

Embedding Techniques for Advertising Recall and Ranking in a Second-Hand Car Platform

This article details the commercial strategy team's exploration of embedding technologies for a second‑hand car platform, covering mainstream embedding methods, their application in advertising recall and ranking pipelines, system architecture, model optimizations, evaluation results, and future directions.

AdvertisingDSSMDeep Learning

0 likes · 22 min read

Embedding Techniques for Advertising Recall and Ranking in a Second-Hand Car Platform

MXPlayer Technical Team

Sep 4, 2019 · Artificial Intelligence

How gRPC Transformed Our Faiss Vector Search Service for Faster Recommendations

This article details how the MX recommendation team rebuilt their Faiss‑based similarity search service using gRPC, covering service selection, multi‑type vector handling, dynamic index updates, deployment strategies, and performance gains that doubled QPS and cut latency by two‑thirds.

FAISSPythongrpc

0 likes · 10 min read

How gRPC Transformed Our Faiss Vector Search Service for Faster Recommendations

360 Quality & Efficiency

Aug 23, 2019 · Artificial Intelligence

High‑Performance High‑Dimensional Vector KNN Search Using FAISS

This article introduces the background of vector representations in machine learning, explains the K‑Nearest Neighbors algorithm and its key parameters, reviews traditional tree‑based and modern high‑performance search solutions, and demonstrates how FAISS can achieve microsecond‑level KNN queries on large‑scale high‑dimensional data.

FAISShigh-dimensionalkNN

0 likes · 5 min read

High‑Performance High‑Dimensional Vector KNN Search Using FAISS

MXPlayer Technical Team

Aug 9, 2019 · Artificial Intelligence

How MX Player’s Recommendation Engine Is Built: Architecture & Core Processes

This article explains MX Player’s recommendation engine architecture, detailing the division between functional and algorithmic components, API communication via Thrift, and the configurable core processing pipeline that handles recall, filtering, scoring, ranking, rule‑based adjustments, packaging, and caching.

AIFAISSThrift

0 likes · 7 min read

How MX Player’s Recommendation Engine Is Built: Architecture & Core Processes

vivo Internet Technology

Nov 16, 2018 · Artificial Intelligence

Efficient Vector Search with Deep Learning Embeddings in Elasticsearch

The article explains how to replace keyword matching with deep‑learning document embeddings in Elasticsearch by applying PCA dimensionality reduction, indexing vectors using Lucene’s KD‑tree structures via a custom plugin, and leveraging FAISS‑style nearest‑neighbour techniques to achieve fast, semantically aware similarity search.

Deep LearningElasticsearchFAISS

0 likes · 7 min read

Efficient Vector Search with Deep Learning Embeddings in Elasticsearch

Xianyu Technology

Sep 7, 2018 · Artificial Intelligence

Video Deduplication on Xianyu Using High‑Dimensional Vector Retrieval

The Xianyu platform combats video plagiarism by extracting key frames, converting them into 1024‑dimensional vectors, and using product quantization‑based high‑dimensional vector retrieval to achieve over 95% recall with ~100 ms latency and more than 1000 QPS, enabling scalable video, image, and product deduplication.

FAISSPQVector Retrieval

0 likes · 12 min read

Video Deduplication on Xianyu Using High‑Dimensional Vector Retrieval