Tagged articles

Milvus

87 articles · Page 1 of 1

Jun 29, 2026 · Databases

Inside Milvus’ Index Engine: 3‑Layer Parameter Filling, Compile‑time Hardware Split, and a 16× Memory Trade‑off

The article dissects Milvus’ index engine, revealing that AUTOINDEX relies on a three‑stage default‑parameter pipeline, that CPU/GPU index selection is fixed at compile time via Go build tags, that the C++ Knowhere engine executes the algorithms, and that version aggregation, scalar V3 format, and the new AISAQ index embody deliberate memory‑vs‑IO trade‑offs.

AISAQAUTOINDEXCPU/GPU build tags

0 likes · 26 min read

Inside Milvus’ Index Engine: 3‑Layer Parameter Filling, Compile‑time Hardware Split, and a 16× Memory Trade‑off

Shuge Unlimited

Jun 27, 2026 · Artificial Intelligence

How MFS Unifies 20+ Data Sources with a Single Verb Set and How Open Tag Replicates Claude Tag

The article dissects Zilliztech's MFS, showing how a thin‑client, stateful‑server architecture uses a unified verb set to access over twenty heterogeneous data sources, and explains how the Open Tag demo re‑creates Claude Tag's brain‑memory‑tools workflow on top of MFS while highlighting its design trade‑offs and production‑readiness limits.

AI AgentsClaude TagContext Management

0 likes · 16 min read

How MFS Unifies 20+ Data Sources with a Single Verb Set and How Open Tag Replicates Claude Tag

Shuge Unlimited

Jun 21, 2026 · Databases

Why Deleting 1 Million Vectors in Milvus Doesn't Shrink Disk Space: A Deep Dive into 11 CompactionTypes

When Milvus appears to keep disk usage unchanged after deleting a million vectors, the cause is not a bug but a sophisticated compaction system that splits the single compact() API into eleven enum values, six independent policies, and seven special handling paths that together manage different kinds of data waste and ensure safe, incremental reclamation.

ClusteringCompactionDataCoord

0 likes · 23 min read

Why Deleting 1 Million Vectors in Milvus Doesn't Shrink Disk Space: A Deep Dive into 11 CompactionTypes

Shuge Unlimited

Jun 20, 2026 · Databases

From 64 to a Million Tenants: Choosing the Right Milvus Multi‑Tenant Layer and Avoiding the 65,536 Ceiling

The article dissects Milvus's four‑layer multi‑tenant architecture—Database, Collection, Partition, and Partition Key—detailing each layer's default tenant limits, isolation strength versus scalability trade‑offs, hidden constraints like the 65,536 capacity ceiling, the Partition Key isolation switch, and practical guidance for selecting the appropriate layer in SaaS and regulated scenarios.

Database isolationMilvusMulti-Tenancy

0 likes · 17 min read

From 64 to a Million Tenants: Choosing the Right Milvus Multi‑Tenant Layer and Avoiding the 65,536 Ceiling

Java Architect Handbook

Jun 17, 2026 · Artificial Intelligence

What Is Hybrid Search in RAG and Why Choose It Over Pure Vector Retrieval?

Hybrid search combines dense vector retrieval with sparse keyword search, using RRF fusion and optional reranking, to overcome the limitations of each method—semantic understanding versus exact matching—making it the production‑grade standard for RAG systems by 2025‑2026.

BM25ElasticsearchHybrid Search

0 likes · 19 min read

What Is Hybrid Search in RAG and Why Choose It Over Pure Vector Retrieval?

Shuge Unlimited

Jun 14, 2026 · Artificial Intelligence

Beyond Vector Storage: Inside Milvus 2.6’s Three‑Layer AI Agent Architecture

Milvus 2.6 transforms from a pure vector‑storage backend into a full‑stack AI‑Agent infrastructure by introducing a three‑layer capability system—coding‑rule, protocol, and runtime—covering memory, retrieval, and tool backends, hybrid search, strict operation ordering, and multiple integration paths, while contrasting traditional RAG with agent‑driven modes.

AI AgentsHybrid SearchMCP

0 likes · 20 min read

Beyond Vector Storage: Inside Milvus 2.6’s Three‑Layer AI Agent Architecture

Coder Trainee

Jun 10, 2026 · Artificial Intelligence

Building Production‑Ready RAG with Vector Databases: Deep Dive into Chroma, Pinecone, Milvus and Optimizations

This article explains why Retrieval‑Augmented Generation is needed, compares popular vector databases, provides step‑by‑step Docker and Python examples for Chroma, Pinecone, and Milvus, and shows how to optimize a full RAG agent with hybrid search, reranking, and caching.

CacheChromaHybrid Search

0 likes · 20 min read

Building Production‑Ready RAG with Vector Databases: Deep Dive into Chroma, Pinecone, Milvus and Optimizations

Alibaba Cloud Big Data AI Platform

May 29, 2026 · Artificial Intelligence

How Alibaba Cloud Milvus Achieves 20× Faster Billion‑Scale Vector Search with DiskANN and RaBitQ

Alibaba Cloud Milvus combines DiskANN graph indexing with the RaBitQ quantization algorithm, delivering over 20× higher QPS, sub‑10% P99 latency, 29% lower memory usage and more than 98% recall on a 100 million‑vector, 768‑dimensional benchmark, while also cutting index build time from 20 h to about 6 h.

DiskANNMilvusQuantization

0 likes · 7 min read

How Alibaba Cloud Milvus Achieves 20× Faster Billion‑Scale Vector Search with DiskANN and RaBitQ

Senior Brother's Insights

May 10, 2026 · Backend Development

Hands‑On AI Coding: My Real‑World Claude Code Project Experience

In an urgent backend project, the author used Claude Code (GML‑5.1) with Go and go‑zero to generate documentation, integrate Milvus, design a vector table, and add embedding support, spending 200k‑300k tokens for about 2‑3 CNY, while discussing speed bottlenecks, workflow choices, and practical tips for parallelism and mode configuration.

AI codingBackend DevelopmentClaude Code

0 likes · 8 min read

Hands‑On AI Coding: My Real‑World Claude Code Project Experience

AI Architect Hub

May 3, 2026 · Artificial Intelligence

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

This article compares five popular vector databases—Chroma, Milvus, Weaviate, Qdrant, and FAISS—detailing their positions, strengths, weaknesses, suitable scenarios, a selection‑dimension matrix, common pitfalls, code implementations for a unified RAG pipeline, best‑practice recommendations, and thought questions to guide engineers in choosing and migrating vector stores.

ChromaFAISSMilvus

0 likes · 23 min read

Choosing the Right Vector Database: Milvus, Chroma, Weaviate, Qdrant, FAISS Compared

Alibaba Cloud Big Data AI Platform

May 1, 2026 · Artificial Intelligence

Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play

The article explains how Alibaba Cloud's Milvus Embedding Service eliminates the need for self‑hosted embedding models by integrating model inference, vector generation and Milvus indexing into a managed pipeline, dramatically reducing deployment complexity, operational overhead, and time‑to‑value for semantic search, RAG and multimodal retrieval use cases.

Alibaba CloudEmbeddingMilvus

0 likes · 19 min read

Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play

Shuge Unlimited

Apr 30, 2026 · Databases

Milvus VTS Deep Dive: Two Write Modes, Sharding, and Migration Best Practices

The article provides a source‑code level analysis of Milvus Vector Transport Service (VTS), detailing its three‑stage architecture, partition‑aware sharding logic, two distinct sink write mechanisms (BufferBatchWriter and BulkWriter), schema conversion rules, error‑handling strategies, performance tuning parameters, and practical configuration examples for efficient vector data migration across various data sources.

BufferBatchWriterBulkWriterMilvus

0 likes · 20 min read

Milvus VTS Deep Dive: Two Write Modes, Sharding, and Migration Best Practices

Shuge Unlimited

Apr 29, 2026 · Databases

Milvus Storage Tuning in Practice: 25× Query Speedup and Three Tricks to Cut Memory Usage by Half

This article walks through Milvus 2.3‑2.6.x storage optimizations—Mmap, tiered storage, and clustering compaction—explaining their principles, configuration hierarchy, benchmark results, and concrete deployment templates that together can boost query performance up to 25‑fold while halving memory consumption.

MilvusPerformance TuningStorage Optimization

0 likes · 24 min read

Milvus Storage Tuning in Practice: 25× Query Speedup and Three Tricks to Cut Memory Usage by Half

James' Growth Diary

Apr 22, 2026 · Artificial Intelligence

Boost RAG Performance: Chunking Strategies, Rerank, and Hybrid Retrieval Explained

This article breaks down why RAG pipelines often underperform and shows how proper chunking, overlap settings, hybrid vector‑plus‑BM25 retrieval, and a Rerank step can dramatically improve recall and precision, with concrete code examples and tuning tips.

BM25ChunkingHybrid Retrieval

0 likes · 14 min read

Boost RAG Performance: Chunking Strategies, Rerank, and Hybrid Retrieval Explained

James' Growth Diary

Apr 21, 2026 · Databases

Build a Production-Ready Milvus Vector Database for Semantic Search

This article walks through deploying Milvus with Docker Compose, creating a persistent collection, tuning HNSW indexes, integrating LangChain.js for semantic retrieval, and covering performance tips and common pitfalls to run a production‑grade vector database.

Docker ComposeHNSWLangChain

0 likes · 21 min read

Build a Production-Ready Milvus Vector Database for Semantic Search

James' Growth Diary

Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

ChunkingHybrid SearchLangChain

0 likes · 18 min read

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

Linyb Geek Road

Apr 20, 2026 · Artificial Intelligence

Designing a Scalable RAG Storage Architecture: Lessons from a Real‑World Project

The article explains why RAG storage must be layered, describes the four data types involved, presents a typical three‑layer architecture with vector, content, and metadata stores, and shows how the design evolves with scale, multi‑level indexing, update handling, and tenant isolation.

Chunk IndexingElasticsearchMilvus

0 likes · 16 min read

Designing a Scalable RAG Storage Architecture: Lessons from a Real‑World Project

Senior Tony

Apr 11, 2026 · Databases

Why Vectors Need a Dedicated Database and How Milvus Solves It

This article explains what vectors are, why traditional relational databases struggle with high‑dimensional similarity queries, and how the open‑source Milvus vector database efficiently stores, indexes, and retrieves massive vectors for AI applications such as semantic search, image matching, and recommendation.

AI ApplicationsANNDatabases

0 likes · 5 min read

Why Vectors Need a Dedicated Database and How Milvus Solves It

Shuge Unlimited

Apr 10, 2026 · Artificial Intelligence

How Zilliz’s Two Skills Enable AI to Code with pymilvus and Manage Cloud Clusters

This article dissects Zilliz’s Milvus Skill and Zilliz Cloud Skill, showing how a modular set of reference files teaches AI agents to generate pymilvus Python code for vector databases and to operate Zilliz Cloud via CLI, while comparing their architecture, security design, and ecosystem role.

AI AgentCloud ManagementHybrid Search

0 likes · 20 min read

How Zilliz’s Two Skills Enable AI to Code with pymilvus and Manage Cloud Clusters

Wu Shixiong's Large Model Academy

Apr 1, 2026 · Artificial Intelligence

How to Design an Effective Agent Memory System for Enterprise AI Assistants

This article explains why AI agents need a structured memory module, outlines three memory types from cognitive science, details short‑term and long‑term storage architectures using vector databases, and provides concrete code and management strategies—including conflict resolution, TTL expiration, and privacy compliance—to build a robust Agent Memory system.

Agent MemoryLLMMemory Management

0 likes · 23 min read

How to Design an Effective Agent Memory System for Enterprise AI Assistants

Wu Shixiong's Large Model Academy

Mar 27, 2026 · Artificial Intelligence

Securing RAG Systems: A Three‑Layer Permission Framework for Banking AI

This article explains why vector databases lack row‑level security, presents a three‑layer permission architecture—including JWT authentication, Milvus metadata or partition filtering, and post‑retrieval validation—covers document security levels, PostgreSQL RLS, audit logging, caching strategies, and offers interview‑ready talking points.

JWTMilvusPermission Management

0 likes · 18 min read

Securing RAG Systems: A Three‑Layer Permission Framework for Banking AI

Wu Shixiong's Large Model Academy

Mar 21, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

This article breaks down the end‑to‑end retrieval function used in a RAG system, detailing each of the five stages—from request construction, hybrid vector + BM25 search, RRF fusion, cross‑encoder reranking, to threshold filtering—and provides concrete Python code, parameter choices, and performance insights.

Cross-EncoderElasticsearchHybrid Retrieval

0 likes · 13 min read

Step‑by‑Step Guide to Implementing a Hybrid Retrieval Function with RRF Fusion

Shuge Unlimited

Feb 27, 2026 · Databases

Why Is Milvus, the 43K‑Star Vector Database, So Powerful?

This article analyzes Milvus—its open‑source origins, three deployment modes, four‑layer architecture, eight‑plus indexing algorithms, real‑world case studies, and a detailed comparison with competitors—highlighting its strengths, weaknesses, common pitfalls, and when it’s the right choice for large‑scale AI workloads.

AI workloadsCloud NativeIndexing

0 likes · 15 min read

Why Is Milvus, the 43K‑Star Vector Database, So Powerful?

Shuge Unlimited

Feb 11, 2026 · Operations

How to Easily Manage Operations of 10 Milvus Clusters with an Agent Skill

This article walks through the real‑world pain points of monitoring dozens of Milvus collections across multiple clusters, then details a Python‑based Skill that automates connection handling, aggregates collection metadata, evaluates index health with a three‑state model, and provides unified health checks, performance testing, and capacity analysis for reliable large‑scale vector database operations.

Index managementMilvusMonitoring

0 likes · 18 min read

How to Easily Manage Operations of 10 Milvus Clusters with an Agent Skill

Instant Consumer Technology Team

Feb 6, 2026 · Artificial Intelligence

How AI‑Powered Agentic Labeling Transforms Customer Conversation Tagging

This article details an end‑to‑end AI system that replaces manual, error‑prone tagging of customer dialogues with a large‑language‑model‑driven, vector‑based pipeline that automatically discovers, clusters, and iteratively refines business‑level tags, dramatically cutting cycle time and improving coverage.

Agentic AIHDBSCANLLM

0 likes · 33 min read

How AI‑Powered Agentic Labeling Transforms Customer Conversation Tagging

Big Data Technology & Architecture

Jan 9, 2026 · Databases

Why Vector Databases Matter and How Milvus Stacks Up Against OceanBase

This article explains why vector databases are essential for AI-driven unstructured data, surveys the current landscape of professional and extended vector databases, and provides a detailed technical comparison between Milvus and OceanBase, covering architecture, features, performance, and product positioning.

AIDatabase ArchitectureMilvus

0 likes · 11 min read

Why Vector Databases Matter and How Milvus Stacks Up Against OceanBase

dbaplus Community

Dec 15, 2025 · Databases

Understanding Milvus Vector Indexes: Structures, Quantization, and Future Trends

This article explains the core concepts of vector database indexing, details the composition of Milvus indexes—including data structures, quantization methods, and specific algorithms like IVF, HNSW, DISKANN, PQ, RABITQ, PRQ, SCANN, AISAQ, and MINHASH_LSH—and offers speculation on future developments.

ANNHNSWMilvus

0 likes · 32 min read

Understanding Milvus Vector Indexes: Structures, Quantization, and Future Trends

360 Zhihui Cloud Developer

Dec 9, 2025 · Databases

How Milvus Implements Multi‑Tenant Architecture: Design, RBAC, and Resource Group Isolation

This article explains Milvus's multi‑tenant architecture, covering its layered data structures, role‑based access control, resource‑group based physical isolation, practical code examples, and future directions for cost‑effective, cloud‑native vector database deployments.

MilvusMulti‑tenantRBAC

0 likes · 11 min read

How Milvus Implements Multi‑Tenant Architecture: Design, RBAC, and Resource Group Isolation

Yiche Technology

Dec 3, 2025 · Artificial Intelligence

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

This article details how an automotive AI assistant migrated from keyword matching to a Milvus‑based vector retrieval system, overcoming semantic gaps, scaling to millions of daily queries, optimizing indexing, introducing multi‑vector and sparse‑vector search, and building a real‑time RAG pipeline with Flink.

AI assistantMilvusRAG

0 likes · 12 min read

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

Raymond Ops

Nov 23, 2025 · Databases

How to Install and Run Milvus Vector Database with Docker Compose

This guide introduces Milvus, an open‑source vector database for AI workloads, outlines its key features and common use cases, and provides step‑by‑step Docker‑Compose commands to set up Milvus, its storage backend MinIO, and the Attu management UI.

AttuDocker ComposeMilvus

0 likes · 8 min read

How to Install and Run Milvus Vector Database with Docker Compose

Volcano Engine Developer Services

Oct 20, 2025 · Artificial Intelligence

How DiskANN + RaBitQ Supercharges Milvus: 5× Faster, 90% Cheaper Vector Search

This article explains how integrating the disk‑based DiskANN index with the ultra‑compact RaBitQ quantization dramatically boosts Milvus's vector search performance and cuts costs, delivering over five times higher QPS and more than 90% cost reduction for billion‑scale AI workloads.

AIDiskANNMilvus

0 likes · 11 min read

How DiskANN + RaBitQ Supercharges Milvus: 5× Faster, 90% Cheaper Vector Search

Alibaba Cloud Big Data AI Platform

Sep 10, 2025 · Artificial Intelligence

How Mem0 Gives AI Agents Persistent Memory Using Milvus Vector Store

This article explains how Mem0 provides a durable memory layer for AI agents, enabling them to recall past interactions, store user preferences, and continuously improve by integrating with Milvus vector databases and graph stores, complete with step‑by‑step code examples and performance comparisons.

LLMLangGraphMilvus

0 likes · 19 min read

How Mem0 Gives AI Agents Persistent Memory Using Milvus Vector Store

Alibaba Cloud Big Data AI Platform

Aug 7, 2025 · Databases

Unlock Powerful Hybrid Search with Milvus 2.5: Full-Text, BM25, and RAG Guide

This tutorial explains how to use Milvus 2.5's new full‑text, BM25 keyword matching, and hybrid search capabilities—including step‑by‑step setup, schema design, code examples, and RAG integration—to achieve high recall and precision in large‑scale AI vector retrieval scenarios.

Full-Text SearchHybrid SearchMilvus

0 likes · 13 min read

Unlock Powerful Hybrid Search with Milvus 2.5: Full-Text, BM25, and RAG Guide

Alibaba Cloud Big Data AI Platform

Aug 6, 2025 · Artificial Intelligence

How to Build an Enterprise‑Grade Vector Search Q&A Bot with Milvus and n8n

This article explains how to combine Alibaba Cloud Milvus, a high‑performance vector database, with the low‑code workflow platform n8n to create an enterprise‑level, domain‑specific intelligent Q&A system, covering challenges, architecture, setup, workflow configuration, and verification steps.

LLMMilvusWorkflow Automation

0 likes · 18 min read

How to Build an Enterprise‑Grade Vector Search Q&A Bot with Milvus and n8n

TAL Education Technology

Jul 31, 2025 · Databases

How Milvus Powers Billion-Scale Vector Search for AI at TAL Education

This article explains how TAL Education leverages the open‑source Milvus vector database—covering its architecture, features, cloud‑native deployment, monitoring, and real‑world AI applications such as intelligent grading and multimodal search—to handle billions of vectors with millisecond‑level similarity retrieval.

AICloud NativeEducation Technology

0 likes · 14 min read

How Milvus Powers Billion-Scale Vector Search for AI at TAL Education

DeWu Technology

Jul 30, 2025 · Databases

Why Milvus Outperforms Traditional Databases: Deep Dive into Vector DB Architecture

This article explores the evolution, architecture, and operational challenges of vector databases like Milvus and Zilliz, comparing them with traditional databases, detailing indexing strategies such as HNSW and DiskANN, migration plans, performance benchmarks, and future directions for large‑scale AI‑driven search systems.

AIIndexingMilvus

0 likes · 26 min read

Why Milvus Outperforms Traditional Databases: Deep Dive into Vector DB Architecture

Alibaba Cloud Big Data AI Platform

Jul 30, 2025 · Artificial Intelligence

Build Enterprise‑Grade RAG Apps with Milvus and Dify: A Step‑by‑Step Guide

This guide explains how to combine Alibaba Cloud Milvus, a high‑performance vector database, with the low‑code AI platform Dify to create an enterprise‑level Retrieval‑Augmented Generation (RAG) application, covering architecture, installation, data ingestion, and verification steps.

AI ApplicationDifyLow‑code

0 likes · 9 min read

Build Enterprise‑Grade RAG Apps with Milvus and Dify: A Step‑by‑Step Guide

Spring Full-Stack Practical Cases

Jul 23, 2025 · Backend Development

Build a Spring Boot 3 Web QA App with Vector Store and AI in Minutes

This guide walks through creating a Spring Boot 3 application that loads web pages into a vector database using Spring AI, configures AI models and embeddings, and exposes REST endpoints for storing documents and answering queries with retrieval‑augmented generation.

MilvusSpring AISpring Boot

0 likes · 10 min read

Build a Spring Boot 3 Web QA App with Vector Store and AI in Minutes

Huolala Tech

Jul 9, 2025 · Artificial Intelligence

How HuoLala Overcomes Large‑Model Limits with Retrieval‑Augmented Vector Databases

This article details HuoLala's journey from confronting large‑model knowledge gaps and latency issues to adopting Retrieval‑Augmented Generation (RAG) with vector databases, outlining their architecture challenges, selection criteria, deployment scenarios, and future plans for AI‑driven logistics.

AIElasticsearchMilvus

0 likes · 14 min read

How HuoLala Overcomes Large‑Model Limits with Retrieval‑Augmented Vector Databases

Zhuanzhuan Tech

Jul 2, 2025 · Artificial Intelligence

How to Build an Image Similarity Search System with ResNet, Milvus, and YOLO

This article walks through the end‑to‑end process of building an image similarity solution—from vectorizing images with ResNet, storing high‑dimensional vectors in Milvus, using HNSW for fast ANN search, to applying YOLO for object detection and practical training tips.

HNSWMilvusResNet

0 likes · 15 min read

How to Build an Image Similarity Search System with ResNet, Milvus, and YOLO

MaGe Linux Operations

Jun 29, 2025 · Databases

Step‑by‑Step Guide to Install Milvus Vector Database with Docker Compose

This article introduces Milvus, an open‑source vector database for AI and machine‑learning workloads, outlines its key features and application scenarios, and provides detailed Docker‑Compose commands to set up Milvus, MinIO, and the Attu graphical management tool.

AIAttuDocker

0 likes · 7 min read

Step‑by‑Step Guide to Install Milvus Vector Database with Docker Compose

Alibaba Cloud Developer

Jun 19, 2025 · Artificial Intelligence

Build Efficient Multimodal Text‑Image Search with Alibaba Cloud Milvus

This guide explains how to use Alibaba Cloud Milvus to create a scalable, high‑performance multimodal search system that supports text‑to‑image, image‑to‑image, and cross‑modal queries across various business scenarios, detailing architecture, deployment steps, validation, and resource cleanup.

AIMilvusServerless

0 likes · 8 min read

Build Efficient Multimodal Text‑Image Search with Alibaba Cloud Milvus

ITPUB

Jun 15, 2025 · Artificial Intelligence

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

This article presents a step‑by‑step guide for constructing a scalable enterprise Retrieval‑Augmented Generation (RAG) solution using the Model Context Protocol (MCP), covering architecture comparison, system design, Milvus‑backed knowledge store, Python client implementation, deployment scripts, code examples, and best‑practice recommendations.

KnowledgeBaseLLMMCP

0 likes · 22 min read

How to Build a High‑Performance Enterprise RAG System with Model Context Protocol (MCP)

Instant Consumer Technology Team

Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

EmbeddingLLMMilvus

0 likes · 19 min read

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

Alibaba Cloud Big Data AI Platform

Jun 11, 2025 · Big Data

Sync MaxCompute Tables to Milvus with DataWorks: Step‑by‑Step Guide

This guide explains how to use Alibaba Cloud DataWorks to create the necessary resources, configure Milvus and MaxCompute data sources, set up an offline single‑table synchronization task, and verify the imported vectors, enabling efficient AI‑driven vector search on large structured datasets.

Big DataData IntegrationDataWorks

0 likes · 8 min read

Sync MaxCompute Tables to Milvus with DataWorks: Step‑by‑Step Guide

Alibaba Cloud Big Data AI Platform

May 6, 2025 · Artificial Intelligence

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

This guide walks you through using Alibaba Cloud's PAI‑LangStudio together with the Qwen3 large language model to create an AI‑powered question‑answering system that combines Retrieval‑Augmented Generation, web search, secure deployment, and flexible customization for production use.

LangStudioMilvusQwen3

0 likes · 10 min read

Build a Powerful RAG‑Enabled AI Q&A App with PAI‑LangStudio and Qwen3

Alibaba Cloud Big Data AI Platform

Apr 24, 2025 · Artificial Intelligence

Build Multimodal Image Search with Alibaba Cloud Milvus and Qwen VL

This tutorial shows how to combine Alibaba Cloud Milvus vector search with the Qwen VL large model to extract image descriptions, generate multimodal embeddings, store them in Milvus, and perform fast multimodal searches such as text‑to‑image and image‑to‑text queries using Python.

MilvusPythonQwen-VL

0 likes · 18 min read

Build Multimodal Image Search with Alibaba Cloud Milvus and Qwen VL

Alibaba Cloud Big Data AI Platform

Apr 24, 2025 · Big Data

Boosting Product Recommendations with Serverless Spark and Milvus: A Real‑World Case Study

蝉妈妈 migrated its recommendation platform to Alibaba Cloud Serverless Spark and Milvus, replacing traditional vector search and Spark clusters, achieving 40% faster offline tasks, 80% lower failure rates, significant cost savings, and scalable, low‑latency similar‑product retrieval for personalized marketing.

Big DataMilvusrecommendation system

0 likes · 8 min read

Boosting Product Recommendations with Serverless Spark and Milvus: A Real‑World Case Study

Spring Full-Stack Practical Cases

Apr 10, 2025 · Artificial Intelligence

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

This guide walks through creating a Retrieval‑Augmented Generation (RAG) system using Spring Boot 3.4.2, Milvus vector database, and the bge‑m3 embedding model via Ollama, covering environment setup, dependency configuration, vector store operations, and integration with a large language model to deliver refined, similarity‑based answers.

EmbeddingLLMMilvus

0 likes · 11 min read

Build a RAG-Powered Knowledge Base with Spring Boot, Milvus, and Ollama

Big Data Technology & Architecture

Apr 3, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

This article explains the Model Context Protocol (MCP) as a standard for LLM‑data integration, describes Retrieval‑Augmented Generation (RAG) techniques to reduce hallucinations, and introduces vector databases like Milvus that store high‑dimensional embeddings for efficient AI retrieval tasks.

LLMMCPMilvus

0 likes · 7 min read

Understanding Model Context Protocol (MCP), Retrieval-Augmented Generation (RAG), and Vector Databases for LLM Integration

Alibaba Cloud Big Data AI Platform

Mar 5, 2025 · Artificial Intelligence

How to Unlock Milvus 2.5’s Full‑Text and Hybrid Search for RAG Applications

This guide walks you through Milvus 2.5’s new full‑text, keyword‑matching, and hybrid search capabilities, showing how to set up the schema, install dependencies, prepare data with LangChain, and combine dense and sparse vectors for RAG‑enabled retrieval using Python.

Full-Text SearchHybrid SearchLangChain

0 likes · 14 min read

How to Unlock Milvus 2.5’s Full‑Text and Hybrid Search for RAG Applications

Alibaba Cloud Big Data AI Platform

Feb 25, 2025 · Artificial Intelligence

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

This step‑by‑step guide shows how to assemble a Retrieval‑Augmented Generation (RAG) system using Alibaba Cloud Milvus vector search, the DeepSeek large language model, and PAI LangStudio, covering instance creation, data upload, model deployment, connection setup, flow design, and service invocation.

AI TutorialDeepSeekLLM

0 likes · 9 min read

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

macrozheng

Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jMilvus

0 likes · 12 min read

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

Java Architecture Diary

Feb 13, 2025 · Artificial Intelligence

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

This guide walks through building a Java RAG system with DeepSeek R1, Milvus, and Spring, covering environment setup, vector model integration via OpenAI protocol, Maven dependencies, data embedding, and a chat endpoint that combines semantic retrieval with LLM generation.

AI integrationDeepSeekMilvus

0 likes · 11 min read

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

Alibaba Cloud Big Data AI Platform

Jan 3, 2025 · Artificial Intelligence

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

This guide walks you through creating a RAG‑enhanced AI solution for education using Alibaba PAI, covering prerequisite setup, knowledge‑base construction with PAI‑Designer, model deployment, connection configuration, workflow assembly, and a side‑by‑side comparison of RAG versus non‑RAG answers.

AI platformLLMMilvus

0 likes · 16 min read

Build an Education‑Focused Retrieval‑Augmented Generation (RAG) Solution with Alibaba PAI

Alibaba Cloud Big Data AI Platform

Dec 24, 2024 · Artificial Intelligence

Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide

Learn how to create a Retrieval‑Augmented Generation (RAG) system for medical applications using Alibaba's PAI platform, covering knowledge‑base construction with PAI‑Designer, template setup in PAI‑LangStudio, deployment of LLM and embedding models, vector database integration, and end‑to‑end workflow configuration.

EmbeddingLLMMilvus

0 likes · 18 min read

Build a Medical RAG Solution with Alibaba PAI: Step-by-Step Guide

Alibaba Cloud Big Data AI Platform

Dec 20, 2024 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI

This guide walks you through setting up Alibaba Cloud Milvus, configuring public access, deploying a RAG system via PAI, uploading a knowledge base, interacting with the model through the Web UI, and inspecting vector collections with Attu, all with step‑by‑step instructions and configuration details.

AIAlibaba CloudMilvus

0 likes · 10 min read

How to Build a Retrieval‑Augmented Generation (RAG) System with Alibaba Cloud Milvus and PAI

Alibaba Cloud Big Data AI Platform

Dec 16, 2024 · Artificial Intelligence

Build a RAG-Powered Q&A App with Alibaba Cloud Milvus, DashScope & PAI

This guide walks you through creating a Retrieval‑Augmented Generation (RAG) question‑answering application by integrating Alibaba Cloud Milvus vector search, DashScope embedding models, and PAI EAS LLM services, covering prerequisites, service deployment, configuration, Python code setup, and execution steps.

LLMLangChainMilvus

0 likes · 12 min read

Build a RAG-Powered Q&A App with Alibaba Cloud Milvus, DashScope & PAI

Mingyi World Elasticsearch

Dec 10, 2024 · Databases

Exploring Domestic Alternatives to Elasticsearch When ES Is Restricted

This article analyzes why organizations are seeking domestic replacements for Elasticsearch due to policy, licensing, and cost constraints, and evaluates several options—including OpenSearch, XunSearch, Lucene/Solr, native database extensions, Milvus, and EasySearch—highlighting their strengths, weaknesses, and suitable scenarios.

EasysearchElasticsearchMilvus

0 likes · 13 min read

Exploring Domestic Alternatives to Elasticsearch When ES Is Restricted

21CTO

Nov 19, 2024 · Databases

Why Vector Databases Like Milvus Outperform Elasticsearch in Hybrid Search

This article explains how combining dense vector‑based semantic search with traditional keyword matching using a unified vector database such as Milvus delivers superior performance, scalability, and simplicity compared to maintaining separate Elasticsearch and vector‑search stacks.

ElasticsearchHybrid SearchMilvus

0 likes · 9 min read

Why Vector Databases Like Milvus Outperform Elasticsearch in Hybrid Search

DataFunSummit

Oct 30, 2024 · Databases

Design and Implementation of Vector Databases: Architecture, Indexing, and AI Optimizations

This article introduces vector databases as the foundation for efficient high‑dimensional data retrieval in generative AI, covering their background, Milvus’s cloud‑native architecture, key indexing techniques, performance‑trade‑offs, AI‑driven optimizations, and a Q&A session.

AIIndexingMilvus

0 likes · 15 min read

Design and Implementation of Vector Databases: Architecture, Indexing, and AI Optimizations

DaTaobao Tech

Sep 20, 2024 · Databases

Database Technology Evolution: From Hierarchical to Vector Databases

The article chronicles the evolution of database technology from early hierarchical and network models through relational, column‑store, document, key‑value, graph, time‑series, HTAP, and finally vector databases, detailing each system’s architecture, strengths, limitations, typical uses, and future trends toward specialization, distributed cloud‑native designs, and AI‑driven applications.

HBaseHTAPInfluxDB

0 likes · 52 min read

Database Technology Evolution: From Hierarchical to Vector Databases

21CTO

Sep 17, 2024 · Artificial Intelligence

How to Build AI-Powered Semantic Search with OpenAI Embeddings and Milvus

This guide explains vector embeddings, OpenAI's text‑embedding models, and how to use PyMilvus with Zilliz Cloud to create a Retrieval‑Augmented Generation (RAG) system for fast, accurate semantic search.

MilvusOpenAIPython

0 likes · 12 min read

How to Build AI-Powered Semantic Search with OpenAI Embeddings and Milvus

DataFunSummit

Aug 24, 2024 · Databases

Cloud‑Native Storage Solutions for Large‑Scale Vector Data with Milvus and Zilliz

This article presents a comprehensive overview of Zilliz’s cloud‑native vector database ecosystem, detailing Milvus’s distributed architecture, indexing and query capabilities, related tools such as Towhee and GPTCache, storage challenges, tiered storage designs, performance metrics, and real‑world AI use cases like code‑assist and RAG‑based Q&A systems.

ANN searchMilvuslarge-scale storage

0 likes · 21 min read

Cloud‑Native Storage Solutions for Large‑Scale Vector Data with Milvus and Zilliz

Alibaba Cloud Big Data AI Platform

Jul 24, 2024 · Artificial Intelligence

How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability

This case study details how ShiHuo, an e‑commerce recommendation platform, overcame rapid product growth, cluster instability, and high operational overhead by adopting Alibaba Cloud’s fully managed Milvus vector search service, achieving higher performance, better availability, and reduced management costs.

AIMilvuse-commerce

0 likes · 8 min read

How Alibaba Cloud’s Milvus Service Boosted E‑commerce Search Stability and Scalability

Rare Earth Juejin Tech Community

Jul 18, 2024 · Backend Development

Implementing Full‑Text Document Search with Elasticsearch and Milvus

This article describes how to combine Elasticsearch’s keyword matching with Milvus’s vector‑based semantic search to build a scalable document search service, covering data preprocessing, architecture, query handling, custom scoring, DSL configuration, and result merging.

ElasticsearchFull-Text SearchMilvus

0 likes · 12 min read

Implementing Full‑Text Document Search with Elasticsearch and Milvus

JD Tech

Jul 15, 2024 · Databases

A Comprehensive Overview of Nine Database Types and Polyglot Persistence Practices

This article provides an in‑depth survey of nine database categories—including relational, key‑value, columnar, document, graph, time‑series, and vector databases—detailing their architectures, advantages, disadvantages, best‑practice recommendations, typical use cases, and how they can be combined in polyglot persistence solutions.

ClickHouseDatabase TypesHBase

0 likes · 41 min read

A Comprehensive Overview of Nine Database Types and Polyglot Persistence Practices

Alibaba Cloud Big Data AI Platform

May 11, 2024 · Artificial Intelligence

How to Build a High‑Performance RAG System with Milvus on Alibaba Cloud PAI

This guide explains how to integrate Milvus vector search with Alibaba Cloud PAI to create a Retrieval‑Augmented Generation (RAG) solution, covering background, prerequisites, deployment steps, configuration parameters, and practical usage through the Web UI.

AIAlibaba CloudLangChain

0 likes · 7 min read

How to Build a High‑Performance RAG System with Milvus on Alibaba Cloud PAI

Rare Earth Juejin Tech Community

May 9, 2024 · Artificial Intelligence

Exploring Sentence and Paragraph Search with Milvus Vector Search vs Elasticsearch

This article examines how vector search using Milvus handles sentence and paragraph queries compared to traditional Elasticsearch, demonstrating the advantages of embedding‑based semantic matching for document search scenarios through practical experiments and visual results.

ElasticsearchEmbeddingMilvus

0 likes · 11 min read

Exploring Sentence and Paragraph Search with Milvus Vector Search vs Elasticsearch

Alibaba Cloud Big Data AI Platform

Apr 18, 2024 · Databases

Build a Serverless AI Vector Search with Alibaba Cloud Milvus and DashScope

This tutorial walks you through creating a serverless Milvus vector search service on Alibaba Cloud, preparing a news corpus, generating embeddings with DashScope, storing them in Milvus, and performing vector‑based retrieval and question answering using a large language model.

AI embeddingsDashScopeKnowledge QA

0 likes · 11 min read

Build a Serverless AI Vector Search with Alibaba Cloud Milvus and DashScope

Rare Earth Juejin Tech Community

Mar 22, 2024 · Artificial Intelligence

Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration

This article explains how traditional keyword search with Elasticsearch often yields inaccurate or incomplete results for document retrieval, introduces vectorization and semantic search using NLP embeddings, and demonstrates a practical workflow that combines these techniques with the Milvus vector database to achieve more accurate and efficient document search.

AIElasticsearchMilvus

0 likes · 13 min read

Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration

Zhuanzhuan Tech

Nov 29, 2023 · Artificial Intelligence

Applying CLIP and Milvus for Image Similarity Search in E‑commerce Risk Control

The article explains how an e‑commerce risk‑control team leverages OpenAI's CLIP model to generate image and text embeddings and stores them in the Milvus cloud‑native vector database to enable fast, scalable similarity searches for compliance verification and risk detection.

AICLIPMilvus

0 likes · 11 min read

Applying CLIP and Milvus for Image Similarity Search in E‑commerce Risk Control

DataFunSummit

Aug 30, 2023 · Databases

Milvus: An AI‑Native Vector Database for Large Language Model Applications

This article introduces Milvus, an open‑source, cloud‑native vector database designed for AI workloads, explains how it helps mitigate large‑model hallucinations, outlines its CVP architecture, showcases performance benchmarks, and explores diverse application scenarios and future directions for LLM‑vector database integration.

AILLMMilvus

0 likes · 13 min read

Milvus: An AI‑Native Vector Database for Large Language Model Applications

Rare Earth Juejin Tech Community

Jul 27, 2023 · Artificial Intelligence

Implementing Text‑Based Image Search Using OCR, Transformers, and Vector Databases

This article explains how to build a text‑to‑image search system by first extracting text with OCR, then storing image paths and textual embeddings in a SQLite or Milvus vector database, and finally improving retrieval with Transformer‑based sentence embeddings and image‑captioning models.

MilvusOCRPython

0 likes · 16 min read

Implementing Text‑Based Image Search Using OCR, Transformers, and Vector Databases

Rare Earth Juejin Tech Community

Jul 25, 2023 · Artificial Intelligence

Building a Reverse Image Search Engine with Geometric Distance, ResNet Feature Embeddings, Clustering, and Milvus Vector Database

This article walks through implementing a reverse image search system, starting with simple pixel‑based geometric distance, then improving accuracy using ResNet‑derived feature embeddings, accelerating queries with K‑means clustering, and finally deploying a Milvus vector database for fast, scalable similarity retrieval.

ClusteringMilvusResNet50

0 likes · 17 min read

Building a Reverse Image Search Engine with Geometric Distance, ResNet Feature Embeddings, Clustering, and Milvus Vector Database

ITPUB

Jul 5, 2023 · Databases

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

Vector databases enable efficient similarity search and storage of high‑dimensional embeddings, allowing enterprises to combine large language models with proprietary knowledge assets to create domain‑specific, accurate, and up‑to‑date AI services, as illustrated with open‑source solutions Chroma and Milvus.

AIChromaLLM

0 likes · 11 min read

Why Vector Databases Are Essential for Building Industry‑Specific LLM Applications

HomeTech

Dec 16, 2022 · Artificial Intelligence

Building and Optimizing a Milvus‑Based Vector Search Platform

This article describes the background, technical selection, architecture, deployment, performance tuning, and operational practices of a Milvus‑driven vector retrieval platform, including cloud‑native deployment, index choices, capacity planning, and real‑world application cases that improve recall latency and resource efficiency.

AIMilvusPerformance Optimization

0 likes · 12 min read

Building and Optimizing a Milvus‑Based Vector Search Platform

DeWu Technology

Nov 25, 2022 · Databases

Milvus Vector Database Performance Testing and Architecture Analysis

The author stress‑tested Milvus 2.1.4’s cloud‑native, micro‑service architecture—detailing its write and search paths, evaluating FLAT index performance across 100 K to 10 M 512‑dim vectors, uncovering scaling, scheduler, segment‑rebalance, and upgrade issues, and concluding the system is robust but benefits from graph‑based indexes and Helm‑driven scaling.

Database ArchitectureMilvusperformance testing

0 likes · 10 min read

Milvus Vector Database Performance Testing and Architecture Analysis

Youzan Coder

Oct 24, 2022 · Artificial Intelligence

Knowledge Base Retrieval Matching: Algorithm and Engineering Service Practice

The article outlines a comprehensive knowledge‑base retrieval matching solution—combining PageRank‑enhanced DSL rewriting, keyword and dual‑tower vector recall, contrastive fine‑ranking, and optimized vector‑based ranking—implemented via offline DP training and Sunfish online inference on Milvus, with applications in enterprise search and recommendations and future plans for graph‑neural embeddings.

InfoNCEMilvusNLP

0 likes · 12 min read

Knowledge Base Retrieval Matching: Algorithm and Engineering Service Practice

21CTO

Aug 5, 2022 · Databases

Why Milvus 2.1 Is Revolutionizing Vector Databases for Unstructured Data

Milvus 2.1, the newly released open‑source vector database, brings trillion‑byte storage, sub‑5 ms search latency, and seamless scalability for both structured and unstructured data, positioning it as a game‑changing infrastructure for similarity search across industries.

Milvusunstructured datavector database

0 likes · 4 min read

Why Milvus 2.1 Is Revolutionizing Vector Databases for Unstructured Data

360 Quality & Efficiency

Jul 1, 2022 · Artificial Intelligence

Building an End-to-End Image Search System with Milvus and VGG

This article presents a complete image‑search solution that extracts visual features with the VGG16 model, stores them in the Milvus vector database, and provides a set of web APIs for training, querying, counting, searching, and deleting image vectors, all deployed via Docker containers.

AIMilvusVGG

0 likes · 7 min read

Building an End-to-End Image Search System with Milvus and VGG

DataFunSummit

Mar 29, 2022 · Databases

AI-Driven Unstructured Data Analysis and Retrieval with Milvus and Towhee

This article explains how the Milvus vector database and the Towhee embedding framework together enable large‑scale, high‑throughput semantic analysis and retrieval of unstructured data such as images, video, and audio by leveraging AI‑powered vectorization and search pipelines.

AIMilvusTowhee

0 likes · 13 min read

AI-Driven Unstructured Data Analysis and Retrieval with Milvus and Towhee

iQIYI Technical Product Team

Aug 20, 2021 · Artificial Intelligence

Engineering Practice of Online Vector Recall Service at iQIYI

iQIYI’s engineering team built an online vector‑recall service on Milvus, wrapping it with a Dubbo‑gRPC interface to serve 6 M 64‑dimensional embeddings at roughly 3 k QPS and 20 ms p99 latency, integrating query‑embedding generation, simplifying recommendation pipelines, and demonstrating the performance and operational advantages of a platformized ANN‑based recall layer.

AIMilvusengineering

0 likes · 14 min read

Engineering Practice of Online Vector Recall Service at iQIYI

DataFunTalk

Jul 2, 2021 · Artificial Intelligence

Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan

This article describes how Dingxiangyuan's algorithm team adopted Milvus for distributed vector indexing to improve semantic search in their community forum, detailing the background, retrieval workflow, various embedding models—including Bi‑Encoder, Spherical Embedding, and Knowledge Embedding—and summarizing the benefits and future applications.

EmbeddingMilvusNLP

0 likes · 10 min read

Vector Retrieval for Community Forum Search Using Milvus at Dingxiangyuan

System Architect Go

Jun 4, 2020 · Artificial Intelligence

Evolution and Underlying Principles of the Billion‑Scale Image Search System at Youpai Image Manager

This article describes the two‑generation evolution of Youpai Image Manager's billion‑scale image search system, explaining the mathematical representation of images, the limitations of MD5, the first‑generation pHash‑ElasticSearch solution, and the second‑generation CNN‑Milvus approach for robust, large‑scale visual similarity search.

CNNMilvusimage search

0 likes · 9 min read

Evolution and Underlying Principles of the Billion‑Scale Image Search System at Youpai Image Manager

System Architect Go

Apr 11, 2020 · Artificial Intelligence

How to Build an Image Search Engine with CNN and Milvus: A Step‑by‑Step Guide

This article walks through the complete engineering workflow for building an image‑search system, covering CNN‑based feature extraction with VGG16, vector normalization, image preprocessing, black‑edge removal, and practical deployment of the Milvus vector database including hardware requirements, capacity planning, collection/partition design, and search result handling.

CNNMilvusPython

0 likes · 11 min read

How to Build an Image Search Engine with CNN and Milvus: A Step‑by‑Step Guide

System Architect Go

Mar 30, 2020 · Artificial Intelligence

Overview of Image Search System

This article explains the fundamentals of building an image‑by‑image search system, covering image feature extraction methods such as hashing, traditional descriptors, CNN‑based vectors, and the use of vector search engines like Milvus for similarity retrieval.

CNNMilvusfeature extraction

0 likes · 6 min read