Tag

faiss

0 views collected around this technical thread.

DevOps
DevOps
Apr 20, 2025 · Artificial Intelligence

Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example

This article demonstrates how to construct an AI‑powered medical knowledge base for diabetes treatment by preprocessing literature, performing semantic chunking, generating BioBERT embeddings, storing them in a FAISS vector database, and using a RAG framework together with a knowledge graph to retrieve and generate accurate answers.

BioBERTMedical AIRAG
0 likes · 12 min read
Building a Medical Knowledge Base with RAG: A Step‑by‑Step Example
DataFunSummit
DataFunSummit
Apr 15, 2024 · Artificial Intelligence

Deep Learning Practices for Internet Real‑Estate Recommendation at 58.com

This article details the end‑to‑end deep‑learning pipeline used by 58.com for real‑estate recommendation, covering business background, a six‑layer architecture, vector‑based recall, various embedding and ranking models, multi‑task and multi‑scenario optimization techniques, and future directions for large‑model integration.

Vector Searchdeep learningfaiss
0 likes · 19 min read
Deep Learning Practices for Internet Real‑Estate Recommendation at 58.com
Architect
Architect
Aug 17, 2023 · Backend Development

Design and Implementation of Bilibili's New Customer Service System

This article details Bilibili's transition from a purchased customer‑service platform to a self‑developed system, describing the background, architectural design, core modules such as intelligent QA, seat scheduling, workbench, permission management, the use of Faiss for vector search, and future explorations with large language models, highlighting the technical challenges and solutions across backend development and AI integration.

AICustomer ServiceSystem Architecture
0 likes · 22 min read
Design and Implementation of Bilibili's New Customer Service System
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 29, 2022 · Artificial Intelligence

Design and Implementation of ZhiZhuan's Low-Result Search Module with Hybrid Hard and Soft Retrieval

The article details the architecture and techniques of ZhiZhuan's low-result search module, explaining how it combines ElasticSearch hard matching and sBert semantic vector soft matching, along with sophisticated negative sample strategies, to improve recommendation coverage and user experience.

RetrievalVector Searchelasticsearch
0 likes · 17 min read
Design and Implementation of ZhiZhuan's Low-Result Search Module with Hybrid Hard and Soft Retrieval
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 21, 2022 · Artificial Intelligence

Vector Retrieval and Product Quantization with Faiss

This article explains the challenges of large‑scale vector retrieval, compares Faiss index types such as brute‑force, graph‑based and product quantization, and details how product quantization works, its memory‑speed trade‑offs, hierarchical quantization, and practical hyper‑parameter tuning.

ANNVector Searchembedding
0 likes · 9 min read
Vector Retrieval and Product Quantization with Faiss
Alimama Tech
Alimama Tech
Aug 24, 2022 · Artificial Intelligence

Distributed High‑Performance Vector Retrieval with gpdb‑faiss‑vector Plugin on Dolphin Engine

The gpdb‑faiss‑vector plugin embeds Facebook’s Faiss library into the Dolphin (Greenplum‑compatible) engine, exposing SQL functions for distributed, high‑performance approximate nearest‑neighbor vector retrieval with caching, parallel search, configurable indexes, and sub‑millisecond latency, enabling scalable recommendation and advertising workloads.

AIDistributed DatabaseSQL
0 likes · 15 min read
Distributed High‑Performance Vector Retrieval with gpdb‑faiss‑vector Plugin on Dolphin Engine
DeWu Technology
DeWu Technology
Jul 27, 2022 · Artificial Intelligence

Overview of Nearest Neighbor Search Algorithms

The article reviews how high‑dimensional vector representations in deep‑learning applications require efficient approximate nearest‑neighbor search, comparing K‑d trees, hierarchical k‑means trees, locality‑sensitive hashing, product quantization, and HNSW graphs, and discusses practical FAISS implementations and how algorithm choice depends on data size, recall, latency, and resources.

HNSWKD-TreeLSH
0 likes · 8 min read
Overview of Nearest Neighbor Search Algorithms
Kuaishou Tech
Kuaishou Tech
Dec 10, 2021 · Artificial Intelligence

Kuaishou and Tsinghua University Win NeurIPS'21 Billion-Scale ANN Challenge with FAISS‑Optimized KST_ANN Solution

On December 6, Kuaishou and Tsinghua University’s joint team secured first place in the NeurIPS'21 Billion‑Scale Approximate Nearest Neighbor Search Challenge by leveraging a FAISS‑optimized, memory‑efficient KST_ANN algorithm that achieved over 6% higher recall on multiple billion‑scale datasets, showcasing the practical impact of large‑scale vector retrieval in AI‑driven services.

AIANNKST_ANN
0 likes · 5 min read
Kuaishou and Tsinghua University Win NeurIPS'21 Billion-Scale ANN Challenge with FAISS‑Optimized KST_ANN Solution
DataFunTalk
DataFunTalk
Sep 19, 2021 · Artificial Intelligence

Second‑hand Housing Recommendation System: Business Background, Vector Recall, Multi‑objective Optimization and Future Plans

This article presents the end‑to‑end practice of a second‑hand housing recommendation system at 58.com and Anjuke, covering business background, embedding‑based vector recall, multi‑objective ranking methods such as ESMM and MMOE, experimental results, and future development directions.

ESMMReal Estateembedding
0 likes · 14 min read
Second‑hand Housing Recommendation System: Business Background, Vector Recall, Multi‑objective Optimization and Future Plans
DataFunTalk
DataFunTalk
Aug 4, 2021 · Artificial Intelligence

Deep Learning Practices for Personalized Recommendation in a Cultural Artifact Auction Platform

This article presents a comprehensive case study of applying deep learning techniques—including item and user embedding, cross‑domain keyword intent modeling, and multi‑interest representation—to improve the recall stage of personalized recommendation for a cultural‑artifact auction platform, addressing unique data sparsity and diversity challenges.

cross-domain learningdeep learningembedding
0 likes · 16 min read
Deep Learning Practices for Personalized Recommendation in a Cultural Artifact Auction Platform
58 Tech
58 Tech
Apr 9, 2021 · Artificial Intelligence

Vectorized Recall and Dual‑Tower Model for Home Page Recommendation at 58.com

This article details how 58.com improved its home‑page recommendation system by introducing vectorized recall with Word2Vec, optimizing negative sampling, deploying FAISS for fast nearest‑neighbor search, and later adopting a dual‑tower deep learning model with user interest features, achieving higher click‑through and conversion rates.

Word2Vecdual‑towerfaiss
0 likes · 19 min read
Vectorized Recall and Dual‑Tower Model for Home Page Recommendation at 58.com
58 Tech
58 Tech
Mar 3, 2021 · Artificial Intelligence

Design and Implementation of a Faiss‑Based Vector Search Platform

The article describes the design, architecture, and key components of a vector search platform built on Faiss that supports full‑index construction, incremental and distributed indexing, online retrieval, city‑level search, and vector update/delete operations to meet large‑scale AI application needs.

AIKubernetesVector Search
0 likes · 10 min read
Design and Implementation of a Faiss‑Based Vector Search Platform
iQIYI Technical Product Team
iQIYI Technical Product Team
Apr 10, 2020 · Big Data

Video Copyright Detection Solution Using SE-ResNeXt and Faiss in the 2019 CCF Big Data & Computational Intelligence Competition

The iQiyi team “都挺好” tackled the 2019 CCF Video Copyright Detection contest by extracting frame‑level features with SE‑ResNeXt, indexing them with Faiss, aligning temporally via a critical‑path method, and refining copy boundaries using SIFT re‑matching and a sliding‑window approach, ultimately achieving an F1 score of 0.9678 after three iterative stages of model selection, cascade detection, and feature fusion.

CCF competitionSE-ResNeXtVideo Copyright Detection
0 likes · 6 min read
Video Copyright Detection Solution Using SE-ResNeXt and Faiss in the 2019 CCF Big Data & Computational Intelligence Competition
58 Tech
58 Tech
Mar 30, 2020 · Artificial Intelligence

Embedding Techniques for Advertising Recall and Ranking in a Second-Hand Car Platform

This article details the commercial strategy team's exploration of embedding technologies for a second‑hand car platform, covering mainstream embedding methods, their application in advertising recall and ranking pipelines, system architecture, model optimizations, evaluation results, and future directions.

DSSMRankingadvertising
0 likes · 22 min read
Embedding Techniques for Advertising Recall and Ranking in a Second-Hand Car Platform
360 Quality & Efficiency
360 Quality & Efficiency
Aug 23, 2019 · Artificial Intelligence

High‑Performance High‑Dimensional Vector KNN Search Using FAISS

This article introduces the background of vector representations in machine learning, explains the K‑Nearest Neighbors algorithm and its key parameters, reviews traditional tree‑based and modern high‑performance search solutions, and demonstrates how FAISS can achieve microsecond‑level KNN queries on large‑scale high‑dimensional data.

Vector Searchfaisshigh-dimensional
0 likes · 5 min read
High‑Performance High‑Dimensional Vector KNN Search Using FAISS
vivo Internet Technology
vivo Internet Technology
Nov 16, 2018 · Artificial Intelligence

Efficient Vector Search with Deep Learning Embeddings in Elasticsearch

The article explains how to replace keyword matching with deep‑learning document embeddings in Elasticsearch by applying PCA dimensionality reduction, indexing vectors using Lucene’s KD‑tree structures via a custom plugin, and leveraging FAISS‑style nearest‑neighbour techniques to achieve fast, semantically aware similarity search.

KD-TreePCAVector Search
0 likes · 7 min read
Efficient Vector Search with Deep Learning Embeddings in Elasticsearch
Xianyu Technology
Xianyu Technology
Sep 7, 2018 · Artificial Intelligence

Video Deduplication on Xianyu Using High‑Dimensional Vector Retrieval

The Xianyu platform combats video plagiarism by extracting key frames, converting them into 1024‑dimensional vectors, and using product quantization‑based high‑dimensional vector retrieval to achieve over 95% recall with ~100 ms latency and more than 1000 QPS, enabling scalable video, image, and product deduplication.

PQVector Retrievalfaiss
0 likes · 12 min read
Video Deduplication on Xianyu Using High‑Dimensional Vector Retrieval