Tagged articles
77 articles
Page 1 of 1
AI Architect Hub
AI Architect Hub
May 3, 2026 · Artificial Intelligence

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

This article compares five popular vector databases—Chroma, Milvus, Weaviate, Qdrant, and FAISS—detailing their positions, strengths, weaknesses, suitable scenarios, a selection‑dimension matrix, common pitfalls, code implementations for a unified RAG pipeline, best‑practice recommendations, and thought questions to guide engineers in choosing and migrating vector stores.

ChromaFAISSMilvus
0 likes · 23 min read
Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
May 1, 2026 · Artificial Intelligence

Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play

The article explains how Alibaba Cloud's Milvus Embedding Service eliminates the need for self‑hosted embedding models by integrating model inference, vector generation and Milvus indexing into a managed pipeline, dramatically reducing deployment complexity, operational overhead, and time‑to‑value for semantic search, RAG and multimodal retrieval use cases.

Alibaba CloudEmbeddingMilvus
0 likes · 19 min read
Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play
Shuge Unlimited
Shuge Unlimited
Apr 30, 2026 · Databases

Milvus VTS Deep Dive: Two Write Modes, Sharding, and Migration Best Practices

The article provides a source‑code level analysis of Milvus Vector Transport Service (VTS), detailing its three‑stage architecture, partition‑aware sharding logic, two distinct sink write mechanisms (BufferBatchWriter and BulkWriter), schema conversion rules, error‑handling strategies, performance tuning parameters, and practical configuration examples for efficient vector data migration across various data sources.

BufferBatchWriterBulkWriterMilvus
0 likes · 20 min read
Milvus VTS Deep Dive: Two Write Modes, Sharding, and Migration Best Practices
Shuge Unlimited
Shuge Unlimited
Apr 29, 2026 · Databases

Milvus Storage Tuning in Practice: 25× Query Speedup and Three Tricks to Cut Memory Usage by Half

This article walks through Milvus 2.3‑2.6.x storage optimizations—Mmap, tiered storage, and clustering compaction—explaining their principles, configuration hierarchy, benchmark results, and concrete deployment templates that together can boost query performance up to 25‑fold while halving memory consumption.

MilvusStorage Optimizationclustering compaction
0 likes · 24 min read
Milvus Storage Tuning in Practice: 25× Query Speedup and Three Tricks to Cut Memory Usage by Half
James' Growth Diary
James' Growth Diary
Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

Hybrid SearchLangChainMilvus
0 likes · 18 min read
Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices
Senior Tony
Senior Tony
Apr 11, 2026 · Databases

Why Vectors Need a Dedicated Database and How Milvus Solves It

This article explains what vectors are, why traditional relational databases struggle with high‑dimensional similarity queries, and how the open‑source Milvus vector database efficiently stores, indexes, and retrieves massive vectors for AI applications such as semantic search, image matching, and recommendation.

AI applicationsANNMilvus
0 likes · 5 min read
Why Vectors Need a Dedicated Database and How Milvus Solves It
Shuge Unlimited
Shuge Unlimited
Apr 10, 2026 · Artificial Intelligence

How Zilliz’s Two Skills Enable AI to Code with pymilvus and Manage Cloud Clusters

This article dissects Zilliz’s Milvus Skill and Zilliz Cloud Skill, showing how a modular set of reference files teaches AI agents to generate pymilvus Python code for vector databases and to operate Zilliz Cloud via CLI, while comparing their architecture, security design, and ecosystem role.

AI AgentCloud ManagementHybrid Search
0 likes · 20 min read
How Zilliz’s Two Skills Enable AI to Code with pymilvus and Manage Cloud Clusters
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 1, 2026 · Artificial Intelligence

How to Design an Effective Agent Memory System for Enterprise AI Assistants

This article explains why AI agents need a structured memory module, outlines three memory types from cognitive science, details short‑term and long‑term storage architectures using vector databases, and provides concrete code and management strategies—including conflict resolution, TTL expiration, and privacy compliance—to build a robust Agent Memory system.

Agent MemoryLLMMem0
0 likes · 23 min read
How to Design an Effective Agent Memory System for Enterprise AI Assistants
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 27, 2026 · Artificial Intelligence

Securing RAG Systems: A Three‑Layer Permission Framework for Banking AI

This article explains why vector databases lack row‑level security, presents a three‑layer permission architecture—including JWT authentication, Milvus metadata or partition filtering, and post‑retrieval validation—covers document security levels, PostgreSQL RLS, audit logging, caching strategies, and offers interview‑ready talking points.

JWTMilvusPostgreSQL RLS
0 likes · 18 min read
Securing RAG Systems: A Three‑Layer Permission Framework for Banking AI
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Mar 21, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

This article breaks down the end‑to‑end retrieval function used in a RAG system, detailing each of the five stages—from request construction, hybrid vector + BM25 search, RRF fusion, cross‑encoder reranking, to threshold filtering—and provides concrete Python code, parameter choices, and performance insights.

Cross-EncoderElasticsearchHybrid Retrieval
0 likes · 13 min read
Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion
Shuge Unlimited
Shuge Unlimited
Feb 27, 2026 · Databases

Why Is Milvus, the 43K‑Star Vector Database, So Powerful?

This article analyzes Milvus—its open‑source origins, three deployment modes, four‑layer architecture, eight‑plus indexing algorithms, real‑world case studies, and a detailed comparison with competitors—highlighting its strengths, weaknesses, common pitfalls, and when it’s the right choice for large‑scale AI workloads.

AI workloadsCloud NativeDeployment
0 likes · 15 min read
Why Is Milvus, the 43K‑Star Vector Database, So Powerful?
Shuge Unlimited
Shuge Unlimited
Feb 11, 2026 · Operations

How to Easily Manage Operations of 10 Milvus Clusters with an Agent Skill

This article walks through the real‑world pain points of monitoring dozens of Milvus collections across multiple clusters, then details a Python‑based Skill that automates connection handling, aggregates collection metadata, evaluates index health with a three‑state model, and provides unified health checks, performance testing, and capacity analysis for reliable large‑scale vector database operations.

Index ManagementMilvusOperations Automation
0 likes · 18 min read
How to Easily Manage Operations of 10 Milvus Clusters with an Agent Skill
Instant Consumer Technology Team
Instant Consumer Technology Team
Feb 6, 2026 · Artificial Intelligence

How AI‑Powered Agentic Labeling Transforms Customer Conversation Tagging

This article details an end‑to‑end AI system that replaces manual, error‑prone tagging of customer dialogues with a large‑language‑model‑driven, vector‑based pipeline that automatically discovers, clusters, and iteratively refines business‑level tags, dramatically cutting cycle time and improving coverage.

Agentic AIHDBSCANLLM
0 likes · 33 min read
How AI‑Powered Agentic Labeling Transforms Customer Conversation Tagging
Yiche Technology
Yiche Technology
Dec 3, 2025 · Artificial Intelligence

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

This article details how an automotive AI assistant migrated from keyword matching to a Milvus‑based vector retrieval system, overcoming semantic gaps, scaling to millions of daily queries, optimizing indexing, introducing multi‑vector and sparse‑vector search, and building a real‑time RAG pipeline with Flink.

AI AssistantMilvusRAG
0 likes · 12 min read
How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search
Raymond Ops
Raymond Ops
Nov 23, 2025 · Databases

How to Install and Run Milvus Vector Database with Docker Compose

This guide introduces Milvus, an open‑source vector database for AI workloads, outlines its key features and common use cases, and provides step‑by‑step Docker‑Compose commands to set up Milvus, its storage backend MinIO, and the Attu management UI.

AttuDocker ComposeMilvus
0 likes · 8 min read
How to Install and Run Milvus Vector Database with Docker Compose
TAL Education Technology
TAL Education Technology
Jul 31, 2025 · Databases

How Milvus Powers Billion-Scale Vector Search for AI at TAL Education

This article explains how TAL Education leverages the open‑source Milvus vector database—covering its architecture, features, cloud‑native deployment, monitoring, and real‑world AI applications such as intelligent grading and multimodal search—to handle billions of vectors with millisecond‑level similarity retrieval.

AICloud NativeEducation Technology
0 likes · 14 min read
How Milvus Powers Billion-Scale Vector Search for AI at TAL Education
DeWu Technology
DeWu Technology
Jul 30, 2025 · Databases

Why Milvus Outperforms Traditional Databases: Deep Dive into Vector DB Architecture

This article explores the evolution, architecture, and operational challenges of vector databases like Milvus and Zilliz, comparing them with traditional databases, detailing indexing strategies such as HNSW and DiskANN, migration plans, performance benchmarks, and future directions for large‑scale AI‑driven search systems.

AIMilvusindexing
0 likes · 26 min read
Why Milvus Outperforms Traditional Databases: Deep Dive into Vector DB Architecture
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 19, 2025 · Artificial Intelligence

Build Efficient Multimodal Text‑Image Search with Alibaba Cloud Milvus

This guide explains how to use Alibaba Cloud Milvus to create a scalable, high‑performance multimodal search system that supports text‑to‑image, image‑to‑image, and cross‑modal queries across various business scenarios, detailing architecture, deployment steps, validation, and resource cleanup.

AIMilvusMultimodal Retrieval
0 likes · 8 min read
Build Efficient Multimodal Text‑Image Search with Alibaba Cloud Milvus
ITPUB
ITPUB
Jun 15, 2025 · Artificial Intelligence

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

This article presents a step‑by‑step guide for constructing a scalable enterprise Retrieval‑Augmented Generation (RAG) solution using the Model Context Protocol (MCP), covering architecture comparison, system design, Milvus‑backed knowledge store, Python client implementation, deployment scripts, code examples, and best‑practice recommendations.

KnowledgeBaseLLMMCP
0 likes · 22 min read
How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus
0 likes · 19 min read
How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Apr 24, 2025 · Big Data

Boosting Product Recommendations with Serverless Spark and Milvus: A Real‑World Case Study

蝉妈妈 migrated its recommendation platform to Alibaba Cloud Serverless Spark and Milvus, replacing traditional vector search and Spark clusters, achieving 40% faster offline tasks, 80% lower failure rates, significant cost savings, and scalable, low‑latency similar‑product retrieval for personalized marketing.

Big DataMilvusrecommendation system
0 likes · 8 min read
Boosting Product Recommendations with Serverless Spark and Milvus: A Real‑World Case Study
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

EmbeddingLLMMilvus
0 likes · 11 min read
Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 3, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

This article explains the Model Context Protocol (MCP) as a standard for LLM‑data integration, describes Retrieval‑Augmented Generation (RAG) techniques to reduce hallucinations, and introduces vector databases like Milvus that store high‑dimensional embeddings for efficient AI retrieval tasks.

LLMMCPMilvus
0 likes · 7 min read
Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 25, 2025 · Artificial Intelligence

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

This step‑by‑step guide shows how to assemble a Retrieval‑Augmented Generation (RAG) system using Alibaba Cloud Milvus vector search, the DeepSeek large language model, and PAI LangStudio, covering instance creation, data upload, model deployment, connection setup, flow design, and service invocation.

AI TutorialDeepSeekLLM
0 likes · 9 min read
Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio
macrozheng
macrozheng
Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jMilvus
0 likes · 12 min read
Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot
Java Architecture Diary
Java Architecture Diary
Feb 13, 2025 · Artificial Intelligence

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

This guide walks through building a Java RAG system with DeepSeek R1, Milvus, and Spring, covering environment setup, vector model integration via OpenAI protocol, Maven dependencies, data embedding, and a chat endpoint that combines semantic retrieval with LLM generation.

AI integrationDeepSeekMilvus
0 likes · 11 min read
Create a Java RAG System Using DeepSeek R1, Milvus, and Spring
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 3, 2025 · Artificial Intelligence

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

This guide walks you through creating a RAG‑enhanced AI solution for education using Alibaba PAI, covering prerequisite setup, knowledge‑base construction with PAI‑Designer, model deployment, connection configuration, workflow assembly, and a side‑by‑side comparison of RAG versus non‑RAG answers.

AI PlatformLLMMilvus
0 likes · 16 min read
Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 24, 2024 · Artificial Intelligence

Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for medical applications using Alibaba's PAI platform, covering knowledge‑base construction with PAI‑Designer, template setup in PAI‑LangStudio, deployment of LLM and embedding models, vector database integration, and end‑to‑end workflow configuration.

EmbeddingLLMMilvus
0 likes · 18 min read
Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 20, 2024 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI

This guide walks you through setting up Alibaba Cloud Milvus, configuring public access, deploying a RAG system via PAI, uploading a knowledge base, interacting with the model through the Web UI, and inspecting vector collections with Attu, all with step‑by‑step instructions and configuration details.

AIAlibaba CloudMilvus
0 likes · 10 min read
How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Dec 10, 2024 · Databases

Exploring Domestic Alternatives to Elasticsearch When ES Is Restricted

This article analyzes why organizations are seeking domestic replacements for Elasticsearch due to policy, licensing, and cost constraints, and evaluates several options—including OpenSearch, XunSearch, Lucene/Solr, native database extensions, Milvus, and EasySearch—highlighting their strengths, weaknesses, and suitable scenarios.

EasysearchElasticsearchMilvus
0 likes · 13 min read
Exploring Domestic Alternatives to Elasticsearch When ES Is Restricted
21CTO
21CTO
Nov 19, 2024 · Databases

Why Vector Databases Like Milvus Outperform Elasticsearch in Hybrid Search

This article explains how combining dense vector‑based semantic search with traditional keyword matching using a unified vector database such as Milvus delivers superior performance, scalability, and simplicity compared to maintaining separate Elasticsearch and vector‑search stacks.

ElasticsearchHybrid SearchMilvus
0 likes · 9 min read
Why Vector Databases Like Milvus Outperform Elasticsearch in Hybrid Search
DaTaobao Tech
DaTaobao Tech
Sep 20, 2024 · Databases

Database Technology Evolution: From Hierarchical to Vector Databases

The article chronicles the evolution of database technology from early hierarchical and network models through relational, column‑store, document, key‑value, graph, time‑series, HTAP, and finally vector databases, detailing each system’s architecture, strengths, limitations, typical uses, and future trends toward specialization, distributed cloud‑native designs, and AI‑driven applications.

HBaseHTAPInfluxDB
0 likes · 52 min read
Database Technology Evolution: From Hierarchical to Vector Databases
DataFunSummit
DataFunSummit
Aug 24, 2024 · Databases

Cloud‑Native Storage Solutions for Large‑Scale Vector Data with Milvus and Zilliz

This article presents a comprehensive overview of Zilliz’s cloud‑native vector database ecosystem, detailing Milvus’s distributed architecture, indexing and query capabilities, related tools such as Towhee and GPTCache, storage challenges, tiered storage designs, performance metrics, and real‑world AI use cases like code‑assist and RAG‑based Q&A systems.

ANN searchMilvuslarge-scale storage
0 likes · 21 min read
Cloud‑Native Storage Solutions for Large‑Scale Vector Data with Milvus and Zilliz
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jul 24, 2024 · Artificial Intelligence

How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability

This case study details how ShiHuo, an e‑commerce recommendation platform, overcame rapid product growth, cluster instability, and high operational overhead by adopting Alibaba Cloud’s fully managed Milvus vector search service, achieving higher performance, better availability, and reduced management costs.

AIMilvusScalability
0 likes · 8 min read
How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability
JD Tech
JD Tech
Jul 15, 2024 · Databases

A Comprehensive Overview of Nine Database Types and Polyglot Persistence Practices

This article provides an in‑depth survey of nine database categories—including relational, key‑value, columnar, document, graph, time‑series, and vector databases—detailing their architectures, advantages, disadvantages, best‑practice recommendations, typical use cases, and how they can be combined in polyglot persistence solutions.

ClickHouseDatabase TypesHBase
0 likes · 41 min read
A Comprehensive Overview of Nine Database Types and Polyglot Persistence Practices
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Mar 22, 2024 · Artificial Intelligence

Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration

This article explains how traditional keyword search with Elasticsearch often yields inaccurate or incomplete results for document retrieval, introduces vectorization and semantic search using NLP embeddings, and demonstrates a practical workflow that combines these techniques with the Milvus vector database to achieve more accurate and efficient document search.

AIElasticsearchMilvus
0 likes · 13 min read
Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration
DataFunSummit
DataFunSummit
Aug 30, 2023 · Databases

Milvus: An AI‑Native Vector Database for Large Language Model Applications

This article introduces Milvus, an open‑source, cloud‑native vector database designed for AI workloads, explains how it helps mitigate large‑model hallucinations, outlines its CVP architecture, showcases performance benchmarks, and explores diverse application scenarios and future directions for LLM‑vector database integration.

AILLMMilvus
0 likes · 13 min read
Milvus: An AI‑Native Vector Database for Large Language Model Applications
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 25, 2023 · Artificial Intelligence

Building a Reverse Image Search Engine with Geometric Distance, ResNet Feature Embeddings, Clustering, and Milvus Vector Database

This article walks through implementing a reverse image search system, starting with simple pixel‑based geometric distance, then improving accuracy using ResNet‑derived feature embeddings, accelerating queries with K‑means clustering, and finally deploying a Milvus vector database for fast, scalable similarity retrieval.

MilvusResNet50clustering
0 likes · 17 min read
Building a Reverse Image Search Engine with Geometric Distance, ResNet Feature Embeddings, Clustering, and Milvus Vector Database
ITPUB
ITPUB
Jul 5, 2023 · Databases

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Vector databases enable efficient similarity search and storage of high‑dimensional embeddings, allowing enterprises to combine large language models with proprietary knowledge assets to create domain‑specific, accurate, and up‑to‑date AI services, as illustrated with open‑source solutions Chroma and Milvus.

AIChromaLLM
0 likes · 11 min read
Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications
HomeTech
HomeTech
Dec 16, 2022 · Artificial Intelligence

Building and Optimizing a Milvus‑Based Vector Search Platform

This article describes the background, technical selection, architecture, deployment, performance tuning, and operational practices of a Milvus‑driven vector retrieval platform, including cloud‑native deployment, index choices, capacity planning, and real‑world application cases that improve recall latency and resource efficiency.

AIMilvusPerformance Optimization
0 likes · 12 min read
Building and Optimizing a Milvus‑Based Vector Search Platform
DeWu Technology
DeWu Technology
Nov 25, 2022 · Databases

Milvus Vector Database Performance Testing and Architecture Analysis

The author stress‑tested Milvus 2.1.4’s cloud‑native, micro‑service architecture—detailing its write and search paths, evaluating FLAT index performance across 100 K to 10 M 512‑dim vectors, uncovering scaling, scheduler, segment‑rebalance, and upgrade issues, and concluding the system is robust but benefits from graph‑based indexes and Helm‑driven scaling.

Database ArchitectureMilvusPerformance Testing
0 likes · 10 min read
Milvus Vector Database Performance Testing and Architecture Analysis
Youzan Coder
Youzan Coder
Oct 24, 2022 · Artificial Intelligence

Knowledge Base Retrieval Matching: Algorithm and Engineering Service Practice

The article outlines a comprehensive knowledge‑base retrieval matching solution—combining PageRank‑enhanced DSL rewriting, keyword and dual‑tower vector recall, contrastive fine‑ranking, and optimized vector‑based ranking—implemented via offline DP training and Sunfish online inference on Milvus, with applications in enterprise search and recommendations and future plans for graph‑neural embeddings.

InfoNCEMilvusNLP
0 likes · 12 min read
Knowledge Base Retrieval Matching: Algorithm and Engineering Service Practice
21CTO
21CTO
Aug 5, 2022 · Databases

Why Milvus 2.1 Is Revolutionizing Vector Databases for Unstructured Data

Milvus 2.1, the newly released open‑source vector database, brings trillion‑byte storage, sub‑5 ms search latency, and seamless scalability for both structured and unstructured data, positioning it as a game‑changing infrastructure for similarity search across industries.

Milvusunstructured datavector database
0 likes · 4 min read
Why Milvus 2.1 Is Revolutionizing Vector Databases for Unstructured Data
360 Quality & Efficiency
360 Quality & Efficiency
Jul 1, 2022 · Artificial Intelligence

Building an End-to-End Image Search System with Milvus and VGG

This article presents a complete image‑search solution that extracts visual features with the VGG16 model, stores them in the Milvus vector database, and provides a set of web APIs for training, querying, counting, searching, and deleting image vectors, all deployed via Docker containers.

AIDeep LearningMilvus
0 likes · 7 min read
Building an End-to-End Image Search System with Milvus and VGG
DataFunSummit
DataFunSummit
Mar 29, 2022 · Databases

AI-Driven Unstructured Data Analysis and Retrieval with Milvus and Towhee

This article explains how the Milvus vector database and the Towhee embedding framework together enable large‑scale, high‑throughput semantic analysis and retrieval of unstructured data such as images, video, and audio by leveraging AI‑powered vectorization and search pipelines.

AIMilvusTowhee
0 likes · 13 min read
AI-Driven Unstructured Data Analysis and Retrieval with Milvus and Towhee
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 20, 2021 · Artificial Intelligence

Engineering Practice of Online Vector Recall Service at iQIYI

iQIYI’s engineering team built an online vector‑recall service on Milvus, wrapping it with a Dubbo‑gRPC interface to serve 6 M 64‑dimensional embeddings at roughly 3 k QPS and 20 ms p99 latency, integrating query‑embedding generation, simplifying recommendation pipelines, and demonstrating the performance and operational advantages of a platformized ANN‑based recall layer.

AIEngineeringMilvus
0 likes · 14 min read
Engineering Practice of Online Vector Recall Service at iQIYI
DataFunTalk
DataFunTalk
Jul 2, 2021 · Artificial Intelligence

Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan

This article describes how Dingxiangyuan's algorithm team adopted Milvus for distributed vector indexing to improve semantic search in their community forum, detailing the background, retrieval workflow, various embedding models—including Bi‑Encoder, Spherical Embedding, and Knowledge Embedding—and summarizing the benefits and future applications.

EmbeddingMilvusNLP
0 likes · 10 min read
Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan
System Architect Go
System Architect Go
Jun 4, 2020 · Artificial Intelligence

Evolution and Underlying Principles of the Billion‑Scale Image Search System at Youpai Image Manager

This article describes the two‑generation evolution of Youpai Image Manager's billion‑scale image search system, explaining the mathematical representation of images, the limitations of MD5, the first‑generation pHash‑ElasticSearch solution, and the second‑generation CNN‑Milvus approach for robust, large‑scale visual similarity search.

CNNMilvusimage search
0 likes · 9 min read
Evolution and Underlying Principles of the Billion‑Scale Image Search System at Youpai Image Manager
System Architect Go
System Architect Go
Apr 11, 2020 · Artificial Intelligence

How to Build an Image Search Engine with CNN and Milvus: A Step‑by‑Step Guide

This article walks through the complete engineering workflow for building an image‑search system, covering CNN‑based feature extraction with VGG16, vector normalization, image preprocessing, black‑edge removal, and practical deployment of the Milvus vector database including hardware requirements, capacity planning, collection/partition design, and search result handling.

CNNMilvusPython
0 likes · 11 min read
How to Build an Image Search Engine with CNN and Milvus: A Step‑by‑Step Guide
System Architect Go
System Architect Go
Mar 30, 2020 · Artificial Intelligence

Overview of Image Search System

This article explains the fundamentals of building an image‑by‑image search system, covering image feature extraction methods such as hashing, traditional descriptors, CNN‑based vectors, and the use of vector search engines like Milvus for similarity retrieval.

CNNMilvusfeature extraction
0 likes · 6 min read
Overview of Image Search System