Tagged articles
187 articles
Page 2 of 2
StarRocks
StarRocks
Feb 11, 2025 · Databases

How StarRocks Supercharges Vector Search: 7× Faster Queries and 1/3 Cost

This article explains the principles and practical implementation of vector retrieval in StarRocks, covering approximate nearest‑neighbor algorithms, index design, query planning, performance optimizations, real‑world case studies, and future challenges, showing how query latency dropped from 15 seconds to 2 seconds while cutting costs to a third.

ANNHNSWIVFPQ
0 likes · 25 min read
How StarRocks Supercharges Vector Search: 7× Faster Queries and 1/3 Cost
Architecture Digest
Architecture Digest
Jan 16, 2025 · Artificial Intelligence

Redis Introduces Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has unveiled a multi‑threaded query engine that dramatically increases query throughput and lowers latency for vector similarity searches, offering up to 16× performance gains and enabling real‑time Retrieval‑Augmented Generation (RAG) workloads in generative AI applications.

Database PerformanceRAGgenerative AI
0 likes · 7 min read
Redis Introduces Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI
Bilibili Tech
Bilibili Tech
Jan 7, 2025 · Cloud Native

Design and Implementation of Bilibili's Large-Scale Recall System

Bilibili’s large‑scale recall system separates online processing into a two‑tier merge service and an index service, supports multi‑channel text, item‑to‑item and vector indexes with real‑time updates, uses horizontal sharding, robust CI/CD, monitoring and degradation mechanisms, and is being extended toward model‑based recall and greater automation.

BilibiliSearch Architecturecloud-native
0 likes · 16 min read
Design and Implementation of Bilibili's Large-Scale Recall System
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 18, 2024 · Artificial Intelligence

Can GPU Graph Algorithms Boost Vector Search Performance by 10×?

This article explains how OpenSearch's GPU‑accelerated vector search leverages parallel graph algorithms to achieve up to tenfold speed improvements over CPU solutions, detailing ANNS techniques, performance benchmarks, and practical GPU specifications for high‑QPS AI applications.

GPU AccelerationOpenSearchapproximate nearest neighbor
0 likes · 11 min read
Can GPU Graph Algorithms Boost Vector Search Performance by 10×?
21CTO
21CTO
Dec 15, 2024 · Databases

Why Antirez Is Returning to Redis: Insights on Licensing, AI, and Vector Search

Redis Labs announces the return of its founder Antirez, who shares his reasons for rejoining, discusses the recent licensing shift, reflects on his past projects, and outlines future plans for AI integration, vector indexing, and community engagement within the Redis ecosystem.

AILicensingdatabase
0 likes · 16 min read
Why Antirez Is Returning to Redis: Insights on Licensing, AI, and Vector Search
Sohu Tech Products
Sohu Tech Products
Nov 27, 2024 · Artificial Intelligence

RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search

The article explains how Retrieval‑Augmented Generation (RAG) outperforms direct LLM inference by enabling real‑time knowledge updates and lower costs, and demonstrates a practical multi‑modal RAG pipeline that uses Chinese‑CLIP for vector encoding, various chunking strategies, and Redis Search for fast vector storage and retrieval.

Chinese-CLIPLLMRAG
0 likes · 17 min read
RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search
Baidu Tech Salon
Baidu Tech Salon
Nov 22, 2024 · Artificial Intelligence

How GPU‑Accelerated ANN Search Cuts Costs and Boosts Throughput in High‑Volume Retrieval

This article analyzes a GPU‑based approximate nearest neighbor (ANN) retrieval solution built on NVIDIA's RAFT library, detailing algorithm selection, offline indexing tricks, batch online search design, performance results on a 25‑million‑vector workload, and cost‑saving implications for large‑scale search services.

ANNGPUIVF_INT8
0 likes · 21 min read
How GPU‑Accelerated ANN Search Cuts Costs and Boosts Throughput in High‑Volume Retrieval
Baidu Geek Talk
Baidu Geek Talk
Nov 20, 2024 · Artificial Intelligence

Boosting ANN Search with GPU: Inside RAFT’s IVF_INT8 Implementation

This article examines how Baidu and NVIDIA leveraged the open‑source RAFT library to build a GPU‑accelerated approximate nearest neighbor (ANN) retrieval system, detailing algorithm choices, offline indexing, online batch processing, performance results, and practical guidelines for deploying ANN on GPUs.

ANNGPUIVF_INT8
0 likes · 20 min read
Boosting ANN Search with GPU: Inside RAFT’s IVF_INT8 Implementation
DataFunSummit
DataFunSummit
Nov 19, 2024 · Databases

From DIKW to Distributed Data Warebase: Evolution of Data Systems and AI‑Driven Architecture

The article traces the progression from the human DIKW information hierarchy to its computer‑world counterpart, illustrates how a homestay platform’s data architecture evolves through relational, NoSQL, search, and data‑warehouse layers, and introduces the next‑generation distributed Data Warebase that unifies structured, semi‑structured, and vectorized knowledge to meet modern AI‑driven business demands.

AIDIKW ModelDistributed Data Warehouse
0 likes · 26 min read
From DIKW to Distributed Data Warebase: Evolution of Data Systems and AI‑Driven Architecture
AntData
AntData
Nov 18, 2024 · Databases

Modern Data Paradigms: From Relational Databases to Vector Retrieval and AI

This article surveys the evolution of modern data technologies—from the 4V characteristics of big data and the limitations of traditional relational databases, through the rise of NoSQL and polyglot persistence, to embedding‑driven vector search, hybrid retrieval and RAG, illustrating how each paradigm frees applications from data constraints.

Big DataData ArchitectureEmbedding
0 likes · 30 min read
Modern Data Paradigms: From Relational Databases to Vector Retrieval and AI
DaTaobao Tech
DaTaobao Tech
Oct 9, 2024 · Artificial Intelligence

Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT

This guide walks entry‑level developers through building a logistics‑focused QA bot by first embedding documents for vector similarity search, then adding retrieval‑augmented generation, fine‑tuning a small model, integrating hybrid checks, and optimizing deployment with feedback loops to achieve fast, accurate, out‑of‑scope‑aware answers.

AIChatbotFine-tuning
0 likes · 15 min read
Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT
21CTO
21CTO
Oct 3, 2024 · Databases

MongoDB 8.0 Unveiled: Massive Performance Gains and New Vector Support

MongoDB 8.0 launches with up to 36% higher read throughput, 56% faster batch writes, 200% faster time‑series processing, 50‑fold faster sharding distribution, quantized vector search, and queryable encryption, delivering significant performance, cost, and security improvements for modern workloads.

MongoDBdatabaseencryption
0 likes · 4 min read
MongoDB 8.0 Unveiled: Massive Performance Gains and New Vector Support
DataFunSummit
DataFunSummit
Sep 4, 2024 · Artificial Intelligence

How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications

This article explains how Elasticsearch’s advanced search capabilities—including vector and semantic search, hardware acceleration, hybrid retrieval, model re‑ranking, multi‑vector support, and integrated security—enable robust RAG implementations and outlines future directions such as a new compute engine, stronger vector engines, and cloud‑native serverless deployment.

AIElasticsearchHybrid Search
0 likes · 9 min read
How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications
DataFunTalk
DataFunTalk
Sep 4, 2024 · Artificial Intelligence

Data+AI Data Lake Technologies: Challenges, Apache Iceberg Overview, and Vector Table Implementations with PyIceberg

This article explores the evolution of data lakes for AI, discusses the challenges of AI-era data management, introduces Apache Iceberg and its architecture, demonstrates PyIceberg-based AI training and inference pipelines, and presents vector table designs with LSH indexing and performance optimizations.

AIApache IcebergBig Data
0 likes · 22 min read
Data+AI Data Lake Technologies: Challenges, Apache Iceberg Overview, and Vector Table Implementations with PyIceberg
Selected Java Interview Questions
Selected Java Interview Questions
Aug 18, 2024 · Backend Development

Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI

Redis has launched a multi‑threaded query engine that vertically scales its in‑memory database, dramatically increasing query throughput and lowering latency for vector similarity searches, thereby addressing the performance demands of real‑time retrieval‑augmented generation in generative AI applications.

BackendRAGgenerative AI
0 likes · 9 min read
Redis Introduces a Multi‑Threaded Query Engine to Boost Vector Search Performance for Generative AI
AI Large Model Application Practice
AI Large Model Application Practice
Aug 16, 2024 · Artificial Intelligence

How to Query a Microsoft GraphRAG Knowledge Graph with Neo4j: Local and Global Modes

This guide explains how to query a Microsoft GraphRAG knowledge graph using the official CLI, API, and a custom Neo4j implementation, covering both local and global retrieval modes, vector index creation, Cypher query customization, and integration with LangChain for end‑to‑end RAG pipelines.

LangChainMicrosoft GraphRAGNeo4j
0 likes · 13 min read
How to Query a Microsoft GraphRAG Knowledge Graph with Neo4j: Local and Global Modes
21CTO
21CTO
Jul 30, 2024 · Databases

What Goes Around: 20‑Year Evolution of Database Systems and Future Trends

This article reviews two decades of database research, analyzing the rise and decline of various data models—from hierarchical and relational to NoSQL, vector, and graph databases—while highlighting how AI, cloud, and hardware advances are reshaping DBMS architecture and predicting which approaches will dominate tomorrow’s data landscape.

DBMS EvolutionNoSQLSQL
0 likes · 30 min read
What Goes Around: 20‑Year Evolution of Database Systems and Future Trends
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 24, 2024 · Artificial Intelligence

How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability

This case study details how ShiHuo, an e‑commerce recommendation platform, overcame rapid product growth, cluster instability, and high operational overhead by adopting Alibaba Cloud’s fully managed Milvus vector search service, achieving higher performance, better availability, and reduced management costs.

AIMilvusScalability
0 likes · 8 min read
How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability
DataFunTalk
DataFunTalk
Jun 26, 2024 · Big Data

Evolution of the Big Data + AI Development Paradigm and Alibaba Cloud’s Integrated Architecture

This article examines how the big‑data AI development paradigm has shifted from model‑centric to data‑centric workflows, outlines the challenges of integrating data and AI teams, and details Alibaba Cloud’s end‑to‑end, serverless big‑data platform—including MaxCompute, Hologres, MaxFrame, Object Table, and vector search—designed to accelerate large‑scale AI applications.

AI integrationBig DataData Platform
0 likes · 20 min read
Evolution of the Big Data + AI Development Paradigm and Alibaba Cloud’s Integrated Architecture
DataFunSummit
DataFunSummit
Jun 20, 2024 · Big Data

Data+AI Data Lake Technologies: Apache Iceberg, PyIceberg, and Vector Table Solutions

This article presents a comprehensive overview of modern Data+AI data lake challenges and solutions, covering the evolution of data lakes, an introduction to Apache Iceberg, practical use of PyIceberg for AI training and inference pipelines, and advanced vector table and indexing techniques for efficient similarity search.

AI trainingApache IcebergBig Data
0 likes · 22 min read
Data+AI Data Lake Technologies: Apache Iceberg, PyIceberg, and Vector Table Solutions
AI Large Model Application Practice
AI Large Model Application Practice
Jun 7, 2024 · Artificial Intelligence

Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG

This article explores two advanced retrieval paradigms—Fusion Retrieval, which merges results from multiple retrievers using re‑ranking, and Recursive Retrieval, which builds hierarchical chunk‑to‑chunk or chunk‑to‑retriever links—to boost the quality and flexibility of Retrieval‑Augmented Generation pipelines.

Fusion RetrievalLLMLangChain
0 likes · 12 min read
Mastering Advanced Retrieval: Fusion and Recursive Strategies for RAG
21CTO
21CTO
May 6, 2024 · Databases

How Oracle’s New 23ai Database Brings AI-Powered Vector Search to Enterprises

Oracle’s latest release, Database 23ai, upgrades its 23c platform with AI-driven vector search, RAG capabilities, and enhanced JSON and graph querying, positioning the database as a unified, secure, and scalable solution for handling structured, semi‑structured, and unstructured data across cloud and on‑premises environments.

AIOracleRAG
0 likes · 7 min read
How Oracle’s New 23ai Database Brings AI-Powered Vector Search to Enterprises
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Mar 22, 2024 · Artificial Intelligence

Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration

This article explains how traditional keyword search with Elasticsearch often yields inaccurate or incomplete results for document retrieval, introduces vectorization and semantic search using NLP embeddings, and demonstrates a practical workflow that combines these techniques with the Milvus vector database to achieve more accurate and efficient document search.

AIElasticsearchMilvus
0 likes · 13 min read
Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration
Sohu Tech Products
Sohu Tech Products
Mar 13, 2024 · Databases

DingoDB Multi-Modal Vector Database: Design Philosophy, Architecture and Applications

DingoDB is a multi‑modal vector database that unifies storage and analysis of structured, semi‑structured and unstructured data through a Raft‑based distributed architecture, offering MySQL‑compatible SQL, high‑performance APIs, automatic sharding, real‑time index optimization, and hybrid scalar‑vector queries for enterprise knowledge bases, LLM memory, and real‑time decision‑making.

Data ArchitectureDingoDBLLM applications
0 likes · 11 min read
DingoDB Multi-Modal Vector Database: Design Philosophy, Architecture and Applications
Baidu Geek Talk
Baidu Geek Talk
Jan 24, 2024 · Artificial Intelligence

Building AI‑Native Applications with Baidu Cloud AppBuilder

Sun Ke’s keynote at the 2023 Baidu Cloud Intelligence Conference explains how AI‑native development has shifted from model selection to building practical applications, and introduces Baidu Cloud AppBuilder—a three‑layer, low‑code‑and‑code platform that provides multimodal, LLM, and infrastructure services, enabling rapid prototyping of solutions such as automated resume screening and interview preparation.

AIAppBuilderNL2SQL
0 likes · 12 min read
Building AI‑Native Applications with Baidu Cloud AppBuilder
JD Cloud Developers
JD Cloud Developers
Jan 10, 2024 · Artificial Intelligence

Boosting Elasticsearch with Generative AI: Relevance Engine & Vector Search

This article explores the rise of generative AI, outlines popular models like ChatGPT, DALL‑E, and Google Bard, examines their limitations, and then delves into Elasticsearch’s Relevance Engine and vector capabilities, demonstrating how to store, index, and query dense embeddings with practical code examples.

ElasticsearchLLM integrationgenerative AI
0 likes · 17 min read
Boosting Elasticsearch with Generative AI: Relevance Engine & Vector Search
Tongcheng Travel Technology Center
Tongcheng Travel Technology Center
Dec 27, 2023 · Big Data

Recap of Tongcheng Travel’s 7th Big Data Technology Salon – Talks on StarRocks, Paimon, Iceberg, Data+AI, Vector Retrieval, Real‑Time Computing, and Hotel Ranking

The 7th Tongcheng Travel Big Data Technology Salon in Beijing featured a series of expert talks covering StarRocks architecture evolution, lake‑house solutions with Paimon, Iceberg real‑time upsert, Data+AI for travel recommendation, vector retrieval in AI, JD Logistics real‑time computing governance, and multi‑task hotel ranking modeling, providing deep technical insights and future roadmaps.

AIBig DataLakehouse
0 likes · 10 min read
Recap of Tongcheng Travel’s 7th Big Data Technology Salon – Talks on StarRocks, Paimon, Iceberg, Data+AI, Vector Retrieval, Real‑Time Computing, and Hotel Ranking
dbaplus Community
dbaplus Community
Nov 27, 2023 · Artificial Intelligence

Build an Image‑Search Engine with Elasticsearch 8.x and CLIP

This guide explains how to implement reverse image search by extracting visual features with a multilingual CLIP model, storing the vectors in Elasticsearch 8.x, and using its k‑NN plugin to retrieve similar images, covering architecture, tools, code snippets, and results.

CLIPDeep Learningimage search
0 likes · 9 min read
Build an Image‑Search Engine with Elasticsearch 8.x and CLIP
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 27, 2023 · Artificial Intelligence

How OpenSearch Supercharges Vector Search for Large‑Model Applications

This article explains how Alibaba Cloud OpenSearch leverages vector retrieval, engineering and algorithmic optimizations, heterogeneous CPU‑GPU computing, and dense‑sparse hybrid memory to deliver billion‑scale, high‑throughput search performance and enable conversational AI use cases such as intelligent Q&A and SmartArXiv.

AIOpenSearchretrieval
0 likes · 16 min read
How OpenSearch Supercharges Vector Search for Large‑Model Applications
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 1, 2023 · Databases

How BES Powers Large-Scale Vector Search for AI Applications

This article explains the principles of vector databases, outlines the engineering practices of Baidu Intelligent Cloud BES for large‑scale vector retrieval, discusses optimization techniques such as HNSW, IVF and filter integration, and presents real‑world AI use cases and future development directions.

AIBESElasticsearch
0 likes · 16 min read
How BES Powers Large-Scale Vector Search for AI Applications
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Oct 19, 2023 · Artificial Intelligence

How to Build a Retrieval‑Augmented LLM Knowledge Base on Alibaba Cloud

This guide details a complete end‑to‑end solution for constructing a large‑language‑model knowledge‑base chatbot on Alibaba Cloud, covering background, modular architecture, vector database selection, text preprocessing, embedding models, LLM fine‑tuning, prompt engineering, deployment with PAI‑EAS and BladeLLM, and real‑world results.

AILLMLangChain
0 likes · 37 min read
How to Build a Retrieval‑Augmented LLM Knowledge Base on Alibaba Cloud
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 21, 2023 · Artificial Intelligence

How Vector Search Powers AI: From Embeddings to Real‑World Applications

This article explains how vector search converts unstructured data such as speech, images, video, and text into high‑dimensional embeddings, explores common algorithms like Brute‑Force, ANN, and HNSW, and presents optimization techniques that dramatically improve recall and query‑per‑second performance for large‑scale AI retrieval systems.

AIANNEmbedding
0 likes · 27 min read
How Vector Search Powers AI: From Embeddings to Real‑World Applications
ZhongAn Tech Team
ZhongAn Tech Team
Sep 4, 2023 · Artificial Intelligence

Embedding Technology for FAQ Retrieval: Cases, Evaluation Metrics, and Model Comparison

This article introduces the evolution of embedding techniques, presents real‑world case studies of embedding‑based FAQ retrieval, explains evaluation metrics such as Recall and MRR, and compares the performance of a proprietary ZhongAn embedding model with OpenAI and Sentence‑BERT models on Chinese FAQ datasets.

EmbeddingEvaluation MetricsFAQ Retrieval
0 likes · 18 min read
Embedding Technology for FAQ Retrieval: Cases, Evaluation Metrics, and Model Comparison
Java High-Performance Architecture
Java High-Performance Architecture
Aug 18, 2023 · Databases

Redis 7.2 Unified Release: Boost AI, Vector Search, and Real‑Time Functions

Redis 7.2, the first Unified Redis Release, introduces AI‑ready vector indexing, hybrid semantic search, scalable RAG support, server‑side Triggers and Functions, enhanced geospatial queries, and a preview of high‑performance searchable indexes, while expanding client library support and integrating Redis Data Integration for seamless enterprise data pipelines.

AIRAGServerless Functions
0 likes · 8 min read
Redis 7.2 Unified Release: Boost AI, Vector Search, and Real‑Time Functions
Architect
Architect
Aug 17, 2023 · Backend Development

Design and Implementation of Bilibili's New Customer Service System

This article details Bilibili's transition from a purchased customer‑service platform to a self‑developed system, describing the background, architectural design, core modules such as intelligent QA, seat scheduling, workbench, permission management, the use of Faiss for vector search, and future explorations with large language models, highlighting the technical challenges and solutions across backend development and AI integration.

AIBackendFAISS
0 likes · 22 min read
Design and Implementation of Bilibili's New Customer Service System
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 10, 2023 · Artificial Intelligence

Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation

This article explains the limitations of large language models, introduces prompt engineering as a remedy, and provides a comprehensive guide to using the LangChain framework—including models, prompts, chains, agents, vector search, and practical Node.js code examples—to enable LLMs to interact with external tools and data sources.

AI DevelopmentLLMLangChain
0 likes · 35 min read
Enhancing Large Language Models with LangChain: Prompt Engineering, Chains, Agents, and Node.js Implementation
21CTO
21CTO
May 16, 2023 · Databases

How Cassandra’s New Vector Search Transforms AI Applications

This article explains how Cassandra’s newly added vector data type and ANN search capabilities empower AI developers to store, index, and query high‑dimensional embeddings at scale, enabling use cases such as image retrieval, recommendation, and large‑language‑model integration.

AIANNcassandra
0 likes · 10 min read
How Cassandra’s New Vector Search Transforms AI Applications
High Availability Architecture
High Availability Architecture
Apr 27, 2023 · Artificial Intelligence

Design and Optimization of Bilibili's Large‑Scale Video Duplicate Detection System

This article describes the design, algorithmic improvements, and engineering performance optimizations of Bilibili's massive video duplicate detection (collision) system, covering challenges of low‑edit‑degree reposts, two‑stage retrieval, self‑supervised feature extraction, GPU‑accelerated preprocessing, and the resulting gains in accuracy and throughput.

BilibiliDeep Learningfeature extraction
0 likes · 17 min read
Design and Optimization of Bilibili's Large‑Scale Video Duplicate Detection System
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Mar 21, 2023 · Artificial Intelligence

From Daily to Minute-Level Updates: Real-Time Recommendation System Enhancements at Xiaohongshu

Xiaohongshu transformed its recommendation pipeline from daily to minute‑level updates by redesigning recall, ranking and feature‑joining components, deploying a base‑plus‑incremental training scheme, migrating Spark to Flink, rewriting services in C++, and optimizing RocksDB, which yielded over 10% longer dwell time, 15% more interactions and roughly 50% higher new‑note efficiency.

Model ServingReal-time Traininglarge-scale systems
0 likes · 20 min read
From Daily to Minute-Level Updates: Real-Time Recommendation System Enhancements at Xiaohongshu
Alimama Tech
Alimama Tech
Feb 8, 2023 · Artificial Intelligence

Evolution of Recall Indexes in Alibaba Advertising: From Quantization to Graph-based HNSW

Alibaba’s advertising pipeline progressed from low‑dimensional quantization partitions to hierarchical tree indexes, then to graph‑based HNSW structures—including multi‑category, multi‑level graphs and a BlazeOp‑driven scoring service—dramatically boosting recall efficiency, scalability and maintainability while meeting strict latency constraints.

HNSWlarge scalerecall
0 likes · 13 min read
Evolution of Recall Indexes in Alibaba Advertising: From Quantization to Graph-based HNSW
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 10, 2023 · Big Data

How Alibaba’s Dolphin Engine Uses Flink + Hologres for Real‑Time Big Data

The Dolphin engine, built by Alibaba’s Data Engine team, combines Flink and Hologres to deliver ultra‑large‑scale OLAP, streaming, batch, and AI capabilities for real‑time advertising analytics, offering smart materialization, intelligent indexing, and vector recall while supporting millions of advertisers and petabyte‑level data.

AIBig DataFlink
0 likes · 13 min read
How Alibaba’s Dolphin Engine Uses Flink + Hologres for Real‑Time Big Data
HomeTech
HomeTech
Dec 16, 2022 · Artificial Intelligence

Building and Optimizing a Milvus‑Based Vector Search Platform

This article describes the background, technical selection, architecture, deployment, performance tuning, and operational practices of a Milvus‑driven vector retrieval platform, including cloud‑native deployment, index choices, capacity planning, and real‑world application cases that improve recall latency and resource efficiency.

AIMilvusPerformance Optimization
0 likes · 12 min read
Building and Optimizing a Milvus‑Based Vector Search Platform
DeWu Technology
DeWu Technology
Nov 25, 2022 · Databases

Milvus Vector Database Performance Testing and Architecture Analysis

The author stress‑tested Milvus 2.1.4’s cloud‑native, micro‑service architecture—detailing its write and search paths, evaluating FLAT index performance across 100 K to 10 M 512‑dim vectors, uncovering scaling, scheduler, segment‑rebalance, and upgrade issues, and concluding the system is robust but benefits from graph‑based indexes and Helm‑driven scaling.

Database ArchitectureMilvusPerformance Testing
0 likes · 10 min read
Milvus Vector Database Performance Testing and Architecture Analysis
Alimama Tech
Alimama Tech
Aug 24, 2022 · Artificial Intelligence

Distributed High‑Performance Vector Retrieval with gpdb‑faiss‑vector Plugin on Dolphin Engine

The gpdb‑faiss‑vector plugin embeds Facebook’s Faiss library into the Dolphin (Greenplum‑compatible) engine, exposing SQL functions for distributed, high‑performance approximate nearest‑neighbor vector retrieval with caching, parallel search, configurable indexes, and sub‑millisecond latency, enabling scalable recommendation and advertising workloads.

AIFAISSSQL
0 likes · 15 min read
Distributed High‑Performance Vector Retrieval with gpdb‑faiss‑vector Plugin on Dolphin Engine
ITPUB
ITPUB
Jul 16, 2022 · Artificial Intelligence

How Huya Live Uses Vector Search and Fine‑Ranking to Power Real‑Time Recommendations

This article explains Huya Live's recommendation architecture, covering business background, system design, vector retrieval challenges and solutions with ScaNN, and the fine‑ranking pipeline, while highlighting performance optimizations, scalability, and future directions for their live‑streaming platform.

FAISSHuya LiveScaNN
0 likes · 11 min read
How Huya Live Uses Vector Search and Fine‑Ranking to Power Real‑Time Recommendations
ITPUB
ITPUB
Jun 25, 2022 · Artificial Intelligence

How We Revamped a Content Community’s Recommendation Engine for Real‑Time, Personalized Results

This article details the evolution of the ‘逛逛’ content community’s recommendation system, comparing the legacy rule‑based Hive workflow with a new algorithm‑driven architecture that leverages Elasticsearch, Redis, multi‑stage recall, coarse‑ and fine‑ranking, re‑ranking, exposure filtering, cold‑start handling, performance tuning, and future plans for vector‑based recall and platformization.

Real-Timealgorithmic rankingcold start
0 likes · 18 min read
How We Revamped a Content Community’s Recommendation Engine for Real‑Time, Personalized Results
Laiye Technology Team
Laiye Technology Team
Apr 29, 2022 · Artificial Intelligence

Using Faiss for Efficient Vector Similarity Search: Installation, Index Construction, and Performance Optimization

This tutorial explains what Faiss is, how to install it, construct various indexes such as IndexFlatL2, IndexIVFFlat, and IndexIVFPQ, and demonstrates code examples for building and querying vector similarity search pipelines while discussing speed‑accuracy trade‑offs.

AIFAISSapproximate nearest neighbor
0 likes · 11 min read
Using Faiss for Efficient Vector Similarity Search: Installation, Index Construction, and Performance Optimization
System Architect Go
System Architect Go
Apr 15, 2022 · Artificial Intelligence

Elasticsearch Vector Search: script_score and _knn_search Methods

This article explains Elasticsearch's vector search capabilities, detailing two approaches—script_score using dense_vector fields for exact similarity scoring and the experimental _knn_search for approximate nearest neighbor queries—along with data modeling examples, code snippets, performance considerations, and usage guidelines.

Elasticsearch_knn_searchdense_vector
0 likes · 6 min read
Elasticsearch Vector Search: script_score and _knn_search Methods
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Mar 31, 2022 · Industry Insights

How Implicit Relationship Chains Solve Cold‑Start Problems at NetEase Cloud Music

This article details NetEase Cloud Music's technical approach to building implicit user relationship chains—using SimHash, Item2Vec, and MetaPath2Vec embeddings, large‑scale vector search, and a unified service architecture—to address cold‑start challenges across multiple business scenarios.

Item2VecMetaPath2VecRecommendation Systems
0 likes · 20 min read
How Implicit Relationship Chains Solve Cold‑Start Problems at NetEase Cloud Music
IEG Growth Platform Technology Team
IEG Growth Platform Technology Team
Mar 21, 2022 · Backend Development

Optimization of Local Vector Retrieval: Filtering, Storage, and Sorting Strategies

This article presents a comprehensive study of local vector retrieval optimization, covering memory‑based filtering techniques, Redis‑backed vector storage designs, and various sorting algorithms—including radix and heap‑based approaches—to achieve lower latency and higher throughput for large‑scale ad recommendation systems.

GolangSortingbackend optimization
0 likes · 13 min read
Optimization of Local Vector Retrieval: Filtering, Storage, and Sorting Strategies
DataFunTalk
DataFunTalk
Mar 2, 2022 · Artificial Intelligence

Huya Live Streaming Recommendation Architecture: Business Background, System Design, Vector Retrieval, and Ranking

This article presents a comprehensive overview of Huya Live's recommendation system, covering business background, system architecture, vector retrieval techniques, ranking pipeline, technical challenges, implementation details, and future outlook, highlighting scalability and performance optimizations.

AIHuyalive streaming
0 likes · 14 min read
Huya Live Streaming Recommendation Architecture: Business Background, System Design, Vector Retrieval, and Ranking
Baidu Geek Talk
Baidu Geek Talk
Feb 14, 2022 · Artificial Intelligence

How Baidu’s PUCK Dominated the First BigANN Vector Search Competition

The inaugural BigANN competition, organized by NeurIPS, showcased large‑scale ANN research, and Baidu's self‑developed PUCK algorithm secured top scores across all four tracks by leveraging multi‑layer quantization, two‑level inverted indexing, and extensive system‑level optimizations.

ANNBigANNPUCK
0 likes · 8 min read
How Baidu’s PUCK Dominated the First BigANN Vector Search Competition
Code DAO
Code DAO
Dec 26, 2021 · Artificial Intelligence

Building a Vector‑Based Movie Recommendation System with Transformers

This tutorial walks through constructing a movie recommendation engine by downloading a dataset, cleaning and de‑duplicating entries, encoding plot summaries into vectors with transformer models, and performing nearest‑neighbor searches using scikit‑learn, while handling misspellings with Levenshtein distance.

Levenshtein distanceTransformersmovie recommendation
0 likes · 8 min read
Building a Vector‑Based Movie Recommendation System with Transformers
Laravel Tech Community
Laravel Tech Community
Dec 9, 2021 · Backend Development

Apache Lucene 9.0 Released – New Features and Improvements

Apache Lucene 9.0, a high‑performance Java full‑text search library, introduces high‑dimensional vector indexing, new language analyzers, faster faceting and sorting, updated file formats, and several performance optimizations, providing developers with a richer, more efficient search toolkit.

Apache LuceneFull‑Text SearchJava
0 likes · 3 min read
Apache Lucene 9.0 Released – New Features and Improvements
Kuaishou Tech
Kuaishou Tech
Nov 29, 2021 · Artificial Intelligence

Starry Vector Retrieval Platform: Architecture, Features, and Performance

The article describes the design, challenges, architecture, key features, algorithm optimizations, and future roadmap of Kuaishou's Starry vector retrieval platform, which delivers high‑performance, high‑reliability, and easy‑to‑use large‑scale ANN search for diverse business scenarios.

AI PlatformANNPerformance Optimization
0 likes · 14 min read
Starry Vector Retrieval Platform: Architecture, Features, and Performance
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 20, 2021 · Artificial Intelligence

Engineering Practice of Online Vector Recall Service at iQIYI

iQIYI’s engineering team built an online vector‑recall service on Milvus, wrapping it with a Dubbo‑gRPC interface to serve 6 M 64‑dimensional embeddings at roughly 3 k QPS and 20 ms p99 latency, integrating query‑embedding generation, simplifying recommendation pipelines, and demonstrating the performance and operational advantages of a platformized ANN‑based recall layer.

AIEngineeringMilvus
0 likes · 14 min read
Engineering Practice of Online Vector Recall Service at iQIYI
DataFunTalk
DataFunTalk
Aug 2, 2021 · Databases

From Text Search to Vector Search: Generalizing Unstructured Data Retrieval

The article explains why traditional text‑based search engines like ElasticSearch struggle with modern multimodal data, introduces vector databases that store implicit semantic embeddings, and proposes a generalized search architecture that decouples data‑to‑vector mapping from the engine while leveraging clustering or graph indexes for similarity search.

AIEmbeddinginformation retrieval
0 likes · 12 min read
From Text Search to Vector Search: Generalizing Unstructured Data Retrieval
DataFunTalk
DataFunTalk
Jul 2, 2021 · Artificial Intelligence

Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan

This article describes how Dingxiangyuan's algorithm team adopted Milvus for distributed vector indexing to improve semantic search in their community forum, detailing the background, retrieval workflow, various embedding models—including Bi‑Encoder, Spherical Embedding, and Knowledge Embedding—and summarizing the benefits and future applications.

EmbeddingMilvusNLP
0 likes · 10 min read
Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan
Baidu Geek Talk
Baidu Geek Talk
May 10, 2021 · Industry Insights

How Baidu’s GNOIMI Powers Billion‑Scale Rich Media Retrieval

Baidu’s rich‑media retrieval system combines CNN‑based feature extraction with an Approximate Nearest Neighbor engine called GNOIMI, employing hierarchical clustering, product quantization, and optimized indexing to achieve sub‑millisecond search over billions of images, videos and audio, supporting anti‑spam, recommendation and risk‑control across dozens of services.

ANNGNOIMIHNSW
0 likes · 16 min read
How Baidu’s GNOIMI Powers Billion‑Scale Rich Media Retrieval
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 4, 2021 · Artificial Intelligence

How Alibaba’s Proxima Engine Revolutionizes Vector Search for AI Applications

Alibaba’s Damo Academy unveils Proxima, a high‑performance vector search engine that powers e‑commerce, video, and payment services, detailing its core capabilities, large‑scale indexing, distributed construction, real‑time updates, and challenges such as algorithm diversity, scalability, and multi‑modal retrieval.

AIAlibaba Proximalarge-scale indexing
0 likes · 17 min read
How Alibaba’s Proxima Engine Revolutionizes Vector Search for AI Applications
58 Tech
58 Tech
Mar 3, 2021 · Artificial Intelligence

Design and Implementation of a Faiss‑Based Vector Search Platform

The article describes the design, architecture, and key components of a vector search platform built on Faiss that supports full‑index construction, incremental and distributed indexing, online retrieval, city‑level search, and vector update/delete operations to meet large‑scale AI application needs.

AIKubernetesdistributed indexing
0 likes · 10 min read
Design and Implementation of a Faiss‑Based Vector Search Platform
System Architect Go
System Architect Go
Jun 4, 2020 · Artificial Intelligence

Evolution and Underlying Principles of the Billion‑Scale Image Search System at Youpai Image Manager

This article describes the two‑generation evolution of Youpai Image Manager's billion‑scale image search system, explaining the mathematical representation of images, the limitations of MD5, the first‑generation pHash‑ElasticSearch solution, and the second‑generation CNN‑Milvus approach for robust, large‑scale visual similarity search.

CNNMilvusimage search
0 likes · 9 min read
Evolution and Underlying Principles of the Billion‑Scale Image Search System at Youpai Image Manager
System Architect Go
System Architect Go
Mar 30, 2020 · Artificial Intelligence

Overview of Image Search System

This article explains the fundamentals of building an image‑by‑image search system, covering image feature extraction methods such as hashing, traditional descriptors, CNN‑based vectors, and the use of vector search engines like Milvus for similarity retrieval.

CNNMilvusfeature extraction
0 likes · 6 min read
Overview of Image Search System
DataFunTalk
DataFunTalk
Oct 24, 2019 · Artificial Intelligence

Evolution and Engineering Practices of the 360 Display Advertising Recall System

This article details the 360 display advertising system's architecture and the progressive evolution of its recall module, covering business overview, overall pipeline, various recall strategies—including Boolean, vectorized, and deep‑tree approaches—and the performance optimizations applied to meet real‑time constraints.

AdvertisingDeep Learningrecall system
0 likes · 14 min read
Evolution and Engineering Practices of the 360 Display Advertising Recall System
360 Quality & Efficiency
360 Quality & Efficiency
Aug 23, 2019 · Artificial Intelligence

High‑Performance High‑Dimensional Vector KNN Search Using FAISS

This article introduces the background of vector representations in machine learning, explains the K‑Nearest Neighbors algorithm and its key parameters, reviews traditional tree‑based and modern high‑performance search solutions, and demonstrates how FAISS can achieve microsecond‑level KNN queries on large‑scale high‑dimensional data.

FAISShigh-dimensionalkNN
0 likes · 5 min read
High‑Performance High‑Dimensional Vector KNN Search Using FAISS
vivo Internet Technology
vivo Internet Technology
Nov 16, 2018 · Artificial Intelligence

Efficient Vector Search with Deep Learning Embeddings in Elasticsearch

The article explains how to replace keyword matching with deep‑learning document embeddings in Elasticsearch by applying PCA dimensionality reduction, indexing vectors using Lucene’s KD‑tree structures via a custom plugin, and leveraging FAISS‑style nearest‑neighbour techniques to achieve fast, semantically aware similarity search.

Deep LearningElasticsearchFAISS
0 likes · 7 min read
Efficient Vector Search with Deep Learning Embeddings in Elasticsearch
Xianyu Technology
Xianyu Technology
Aug 31, 2018 · Artificial Intelligence

Personalized Recommendation for Xianyu Small Item Pools: Challenges and Solutions

Xianyu’s personalized recommendation system struggles with tiny, fast‑turnover item pools because traditional X2I matrices provide insufficient recall, so the team introduced pool‑specific pre‑filtering, high‑dimensional vector search, and a real‑time search‑engine recall, the latter boosting clicks by 14 % and transactions by 0.14 %.

EngineeringXianyupersonalization
0 likes · 15 min read
Personalized Recommendation for Xianyu Small Item Pools: Challenges and Solutions