Tagged articles

vector search

209 articles · Page 1 of 3

Jul 2, 2026 · Artificial Intelligence

How Cognee’s Single‑Postgres AI Memory Outperforms Traditional RAG (23K+ Stars)

Cognee is an open‑source AI memory platform that combines vector embeddings and knowledge‑graph reasoning on a single Postgres database, delivering dual retrieval, automatic ontology generation, and BEAM benchmark scores up to 0.8—more than double traditional RAG—while offering multi‑language SDKs and flexible deployment options.

AI memory · Knowledge Graph · Postgres

0 likes · 15 min read

Shuge Unlimited

Jun 27, 2026 · Artificial Intelligence

How MFS Unifies 20+ Data Sources with a Single Verb Set and How Open Tag Replicates Claude Tag

The article dissects Zilliztech's MFS, showing how a thin‑client, stateful‑server architecture uses a unified verb set to access over twenty heterogeneous data sources, and explains how the Open Tag demo re‑creates Claude Tag's brain‑memory‑tools workflow on top of MFS while highlighting its design trade‑offs and production‑readiness limits.

AI Agents · Claude Tag · Context Management

0 likes · 16 min read

Code Mala Tang

Jun 25, 2026 · Artificial Intelligence

Why Rerank Is Essential: From 100 Retrieved Docs to the 5 Correct Answers in RAG

Even with a perfectly populated vector database, a RAG pipeline often returns irrelevant answers because the initial Bi‑encoder retrieval only narrows the pool to about 100 candidates, and without a Cross‑encoder rerank step the truly correct document—often buried around rank 37—never reaches the LLM for answering.

Bi-Encoder · Cross-Encoder · Embedding

0 likes · 9 min read

IT Services Circle

Jun 20, 2026 · Artificial Intelligence

How I Doubled RAG Accuracy with These Optimizations

This article walks through a complete RAG pipeline, identifying common pitfalls from document preprocessing to prompt construction, and provides concrete Python and Java examples, chunking strategies, embedding tweaks, hybrid retrieval, reranking, advanced techniques, and evaluation methods to reliably double retrieval accuracy.

Embedding · Java · Prompt Engineering

0 likes · 35 min read

Shuge Unlimited

Jun 16, 2026 · Artificial Intelligence

Beyond mem0: How YC CEO’s Open‑Source AI Memory Engine Uses Regex Instead of LLMs to Power a Knowledge Graph

The article dissects GBrain, an open‑source AI memory engine from Y Combinator’s Garry Tan, showing how a dual‑engine contract, zero‑LLM regex‑based knowledge‑graph extraction, and a layered hybrid retrieval pipeline boost P@5 from ~18 to 49.1 while detailing engineering trade‑offs, batch‑write work‑arounds, weighting constants, and reliability mechanisms.

AI Agent · Hybrid Retrieval · Knowledge Graph

0 likes · 21 min read

Mingyi World Elasticsearch

Jun 7, 2026 · Artificial Intelligence

Build an Enterprise RAG Vector Search System from Scratch with LangChain, Easysearch, and MiMo

This article walks through the complete end‑to‑end pipeline for building a production‑grade RAG system—including document chunking, embedding generation via MiMo, vector storage and kNN retrieval in Easysearch, hybrid search configuration, prompt engineering, answer generation, interactive chat, and a detailed list of common pitfalls and fixes.

Easysearch · LangChain · MiMo

0 likes · 17 min read

Alibaba Cloud Big Data AI Platform

Jun 4, 2026 · Big Data

Scalar‑Vector Hybrid Search in a Data Lake with One SQL on EMR Serverless Spark

EMR Serverless Spark now supports scalar‑vector hybrid search via DLF Global Index, allowing a single Spark SQL statement to perform vector similarity and scalar filtering together, eliminating data movement, reducing latency, and boosting performance for scenarios such as autonomous driving, e‑commerce, and knowledge‑base retrieval.

Big Data · DLF Global Index · EMR Serverless Spark

0 likes · 17 min read

PMTalk Product Manager Community

May 30, 2026 · Product Management

5 Skills to Double an AI Product Manager’s Efficiency

The article explains why AI product managers must focus on turning AI into problem‑solving products rather than reciting jargon, outlines three development stages—from basic language understanding to retrieval‑augmented generation and autonomous agents—and shares a real‑world customer‑support case that achieved over 80% automation and a 45% boost in efficiency.

AI Agents · AI product management · Prompt Engineering

0 likes · 8 min read

AI Engineer Programming

May 30, 2026 · Artificial Intelligence

Should You Pre‑filter or Post‑filter in RAG Vector Search?

The article examines RAG vector retrieval filtering strategies, comparing pre‑filtering (filter before vector search) and post‑filtering (filter after ANN search), and introduces single‑stage filtering, discussing their principles, trade‑offs, suitable scenarios, and architectural implications for accuracy and performance.

ANN · RAG · metadata filtering

0 likes · 15 min read

Alibaba Cloud Big Data AI Platform

May 29, 2026 · Artificial Intelligence

How Alibaba Cloud Milvus Achieves 20× Faster Billion‑Scale Vector Search with DiskANN and RaBitQ

Alibaba Cloud Milvus combines DiskANN graph indexing with the RaBitQ quantization algorithm, delivering over 20× higher QPS, sub‑10% P99 latency, 29% lower memory usage and more than 98% recall on a 100 million‑vector, 768‑dimensional benchmark, while also cutting index build time from 20 h to about 6 h.

DiskANN · Milvus · Quantization

0 likes · 7 min read

Alibaba Cloud Infrastructure

May 26, 2026 · Cloud Computing

How OSS Vector Bucket Eliminates Needle‑in‑a‑Haystack Searches for Media Asset Platforms

The article examines how Alibaba Cloud OSS Vector Bucket addresses the scattered data, high cost, and inefficient retrieval of massive multimodal media asset platforms by unifying storage, providing semantic vector search, and cutting operational expenses by up to 95%.

Multimodal Data · OSS Vector Bucket · Semantic Retrieval

0 likes · 9 min read

James' Growth Diary

May 23, 2026 · Artificial Intelligence

Choosing the Right Retrieval Strategy: Full‑Text vs Vector vs Graph Search

This article breaks down the underlying logic, ideal scenarios, benchmark data, decision trees, and real‑world case studies for full‑text (BM25), vector, and graph retrieval, showing why hybrid approaches dominate production while each technique has distinct strengths and trade‑offs.

Full-Text Search · Hybrid Search · RAG

0 likes · 25 min read

DataFunSummit

May 21, 2026 · Big Data

Alibaba Cloud’s Agent-Ready Big Data AI Infrastructure: Boosting Data Development from Hours to Minutes

Facing a projected 85% of enterprises deploying internal agents within two years, Alibaba Cloud proposes an Agent-Ready big‑data AI infrastructure—comprising a unified data lake, real‑time processing, high‑dimensional vector retrieval, elastic model serving, and comprehensive security governance—that has already cut data‑development cycles from hours to 5‑10 minutes in internal model‑training and Taobao flash‑sale scenarios.

AI · Agent-Ready · Big Data

0 likes · 15 min read

StarRocks

May 20, 2026 · Big Data

How StarRocks, Paimon, and Fluss Enable Multimodal Fusion Search in a Lakehouse

The Streaming Lakehouse Meetup (May 27) explores breaking data silos by unifying structured tables, images, video, audio, and high‑dimensional vectors through StarRocks‑Paimon‑Fluss integration, covering multimodal fusion retrieval, vector search internals, native reader/writer performance gains, and real‑world ANN indexing practices.

Fluss · Lakehouse · Multimodal

0 likes · 5 min read

DataFunSummit

May 20, 2026 · Databases

Apache Doris 4.1: A Unified Data Store and Retrieval Engine for AI & Search

Apache Doris 4.1 introduces a systematic evolution for AI and search workloads, adding low‑cost massive vector storage, unified structured, full‑text and vector search, 100 MB JSON document support, Segment V3 metadata decoupling, sparse column optimizations, lakehouse lifecycle management, and a suite of performance‑boosting features such as aggregate push‑down, condition cache, and spill‑to‑disk, all backed by detailed benchmark results.

AI · Apache Doris · Lakehouse

0 likes · 30 min read

Big Data Technology & Architecture

May 20, 2026 · Databases

Deep Dive into Apache Doris’ Multimodal Capabilities: Architecture and Enterprise Deployments

Apache Doris 4.0 introduces native vector indexes, built‑in AI functions, and hybrid search, turning the OLAP engine into an AI‑centric analytics hub; the article details the technical design, performance optimizations, and real‑world deployments at ByteDance, Squirrel AI, NetEase and a security vendor, highlighting storage savings, query speedups and reduced operational complexity.

AI Functions · Apache Doris · Enterprise Case Study

0 likes · 19 min read

AI Engineer Programming

May 20, 2026 · Artificial Intelligence

Why Chunk‑Based RAG Fails and How IdeaBlocks Improve Retrieval

The article argues that the common assumption that text chunks are the proper knowledge unit in RAG pipelines is flawed, leading to versioning, metadata, and redundancy problems, and demonstrates that replacing chunks with structured IdeaBlocks dramatically reduces corpus size and token usage while improving vector relevance.

IdeaBlock · LLM · Metadata

0 likes · 10 min read

Xiaohongshu Tech REDtech

May 18, 2026 · Artificial Intelligence

CCD‑Aware Thread Orchestration Shatters Multi‑Core CPU Vector Search Performance Ceiling

The paper presents a CCD‑level load‑aware thread orchestration framework that boosts vector ANNS throughput up to 3.7×, cuts P999 tail latency by 30%‑90%, reduces L3 cache miss rates by 6%‑30% and CPU stall time by 20%‑80% on AMD EPYC multi‑chiplet CPUs.

ANNS · CCD · CPU cache

0 likes · 19 min read

Tech Minimalism

May 16, 2026 · Artificial Intelligence

One‑page guide to the three RAG architectures: Classic, Graph, and Agentic

The article explains why plain large language models cannot answer internal company questions, introduces Retrieval‑Augmented Generation (RAG) as a solution, and compares three RAG variants—Classic, Graph, and Agentic—detailing their workflows, strengths, limitations, and how to choose the right one for a given problem.

Agentic RAG · Classic RAG · Graph RAG

0 likes · 17 min read

AI Engineer Programming

May 15, 2026 · Artificial Intelligence

Hybrid Retrieval in RAG: Combining BM25 Precision with Dense Vector Semantics

The article examines why pure vector retrieval in RAG lacks lexical precision and traceable relevance scores, explains BM25's strengths, and presents hybrid retrieval architectures—including RRF and linear combination fusion—as well as the trade‑offs of externalizing the fusion process.

BM25 · Hybrid Search · Information Retrieval

0 likes · 9 min read

DeepHub IMBA

May 14, 2026 · Artificial Intelligence

How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding

The article explains how Hypothetical Document Embeddings (HyDE) improve Retrieval‑Augmented Generation by generating a synthetic answer before vector search, allowing the system to embed richer semantic intent rather than relying on shallow keyword similarity, and provides a step‑by‑step implementation using LangChain.

HyDE · LLM · LangChain

0 likes · 6 min read

AI Engineer Programming

May 8, 2026 · Artificial Intelligence

Is Non-Vector RAG the Next Generation of Retrieval‑Augmented Generation?

The article analyses the relevance and accuracy shortcomings of traditional vector‑based RAG, explains how non‑vector approaches like PageIndex let LLMs navigate document trees for relevance classification and auditability, and evaluates their complexity, latency, metadata risks, and suitable use cases compared with hybrid retrieval.

Hybrid Retrieval · LLM · RAG

0 likes · 8 min read

Lao Guo's Learning Space

May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

Chunking · Enterprise AI · Hybrid Retrieval

0 likes · 12 min read

Amazon Cloud Developers

May 6, 2026 · Artificial Intelligence

How JoyCastle Accelerated a 100k+ Ad Asset Library with Amazon Nova Multimodal Embeddings

JoyCastle faced a growing ad‑asset library that slowed creative production, so it built an AI‑powered management system using Amazon Nova Multimodal Embeddings, achieving unified semantic search, automatic video segmentation, 96.7% recall and a 73.3% top‑2 precision while reducing manual labeling effort.

AWS · Amazon Nova · Asset Management

0 likes · 13 min read

Linyb Geek Road

May 5, 2026 · Artificial Intelligence

Optimizing Retrieval and Generation Latency in High‑Concurrency RAG Agents

The article dissects latency in high‑concurrency RAG Agent pipelines, showing how retrieval, re‑ranking, and LLM generation each contribute milliseconds of delay, and presents system‑level tactics—from ANN index tuning and partitioned search to vLLM PagedAttention, continuous batching, speculative decoding, model quantization, routing, semantic caching, and pipeline parallelism—to dramatically cut end‑to‑end response time.

ANN · LLM · RAG

0 likes · 15 min read

Spring Full-Stack Practical Cases

May 3, 2026 · Artificial Intelligence

9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained

This article introduces Retrieval‑Augmented Generation (RAG) and systematically details nine distinct RAG architectures—standard, conversational with memory, corrective (CRAG), adaptive, self‑RAG, fusion, HyDE, agentic, and Graph RAG—highlighting their workflows, real‑world examples, advantages, and trade‑offs.

AI Architecture · GraphRAG · LLM

0 likes · 17 min read

Alibaba Cloud Big Data AI Platform

May 1, 2026 · Artificial Intelligence

Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play

The article explains how Alibaba Cloud's Milvus Embedding Service eliminates the need for self‑hosted embedding models by integrating model inference, vector generation and Milvus indexing into a managed pipeline, dramatically reducing deployment complexity, operational overhead, and time‑to‑value for semantic search, RAG and multimodal retrieval use cases.

Alibaba Cloud · Embedding · Milvus

0 likes · 19 min read

DeepHub IMBA

Apr 30, 2026 · Artificial Intelligence

Why Real RAG Systems Need Both BM25 and Vector Search

The article analyzes how BM25 excels at exact token matching while vector embeddings capture semantic intent, explains their distinct failure modes, and shows that a hybrid retriever—combined with metadata filtering, proper chunking, and reciprocal rank fusion—delivers the most reliable results for RAG pipelines.

BM25 · Embedding · Hybrid Retrieval

0 likes · 17 min read

AI Architect Hub

Apr 30, 2026 · Artificial Intelligence

How AI Understands Your Queries: Core Techniques of Semantic Vector Search

The article explains why traditional keyword search often fails when user questions differ from knowledge‑base wording, introduces semantic search that matches queries and documents via vector similarity, details query understanding and rewriting techniques, lists common pitfalls, provides a full Python implementation, and shares best‑practice recommendations.

AI · Python · RAG

0 likes · 16 min read

Architect's Tech Stack

Apr 29, 2026 · Databases

Redis 8.0 Beyond Simple Caching: 16 Powerful Use Cases You Must Try

Redis 8.0 consolidates many previously external modules—JSON, time‑series, vector search, probabilistic data structures, and more—into a single package, and this article walks through 16 concrete scenarios ranging from field‑level cache expiration to AI‑ready vector similarity search, showing exact commands and when to prefer each feature.

Caching · Distributed Lock · Full-Text Search

0 likes · 19 min read

AI Architect Hub

Apr 27, 2026 · Artificial Intelligence

Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes

The article explains why brute‑force vector search is painfully slow, introduces Flat, IVF, and HNSW index structures, compares their speed, memory and accuracy, shows common pitfalls, provides production‑grade Python code, and presents benchmark results that demonstrate HNSW’s superior speed‑accuracy trade‑off.

AI · FAISS · HNSW

0 likes · 12 min read

The Dominant Programmer

Apr 27, 2026 · Artificial Intelligence

Building a Private Document Vector Search with SpringBoot, LangChain4j, and Ollama RAG

This guide walks through why Retrieval‑Augmented Generation (RAG) is needed for large language models, explains the three‑step indexing and query workflow, details LangChain4j’s core components, and provides a complete SpringBoot example—including Maven setup, configuration, service code, and troubleshooting—to create a private document‑vector search system powered by Ollama.

Embedding · LangChain4j · Ollama

0 likes · 13 min read

dbaplus Community

Apr 26, 2026 · Databases

Why PostgreSQL Is the Better Choice in 99% of Scenarios

The article argues that relying on many specialized databases creates operational complexity, higher costs, and maintenance overhead, while PostgreSQL’s extensible ecosystem—offering full‑text search, vector, time‑series, JSONB, and more—delivers comparable or superior algorithms, proven performance, and a simpler, more reliable stack for the vast majority of use cases, especially in AI applications.

AI · Extensions · Full-Text Search

0 likes · 19 min read

AI Engineer Programming

Apr 25, 2026 · Artificial Intelligence

Quantization Across Signal Processing, AI Inference, and RAG Vector Search

This article explains how quantization—originating from signal processing—reduces precision to save resources, details its application to neural network weights and activations via PTQ, QAT, GPTQ, AWQ, and SmoothQuant, and shows how vector quantization enables fast, memory‑efficient retrieval in large‑scale RAG systems.

AWQ · GPTQ · LLM

0 likes · 19 min read

DataFunTalk

Apr 24, 2026 · Databases

DM GDMBASE V4.0: HyperRAG, Long‑Term Memory & NL Agents for Graph‑Vector AI

At the 2026 China Database Technology & Industry Conference, DM unveiled GDMBASE V4.0, a graph database that natively fuses vectors and graphs, introduces HyperRAG, long‑term memory, and a natural‑language agent, and delivers sub‑500 ms retrieval, 30% higher recall and 60% lower hallucination rates for AI workloads.

AI integration · Hybrid Retrieval · HyperRAG

0 likes · 12 min read

James' Growth Diary

Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

Chunking · Hybrid Search · LangChain

0 likes · 18 min read

AI Engineer Programming

Apr 21, 2026 · Artificial Intelligence

From Bag‑of‑Words to Semantic Vectors: Understanding Embeddings and Similarity Search (Part 1)

The article explains how diverse data can be represented as high‑dimensional vectors, describes exact and approximate nearest‑neighbor search, explores vector quantization, product quantization, locality‑sensitive hashing, and HNSW graphs, and analyzes their speed, accuracy, and memory trade‑offs for large‑scale similarity retrieval.

HNSW · LSH · embeddings

0 likes · 16 min read

Linyb Geek Road

Apr 20, 2026 · Artificial Intelligence

How to Choose the Right Embedding Model for RAG Architectures

This article explains why embedding models are the foundation of Retrieval‑Augmented Generation, outlines five evaluation dimensions, compares leading open‑source and commercial models, provides a decision tree, practical validation steps, common pitfalls, and future trends to help developers select the most suitable embedding model for their RAG system.

Embedding · Hybrid Search · MTEB

0 likes · 10 min read

Mingyi World Elasticsearch

Apr 19, 2026 · Industry Insights

ElasticStack 2026: Beyond New Versions, It’s Becoming an Agent Platform

In early 2026 ElasticStack transformed from a traditional search‑log‑visualization stack into an Agent platform, accelerating releases across three lines, elevating Elasticsearch to a context‑engineered infrastructure, unifying ES|QL as a platform‑wide interaction layer, and integrating Workflows, MCP, and vector enhancements to drive autonomous observability and security operations.

Agent Platform · ElasticStack · Elasticsearch

0 likes · 20 min read

DataFunTalk

Apr 18, 2026 · Databases

How Will Apache Doris Evolve in 2026 to Power AI‑Driven Data Workloads?

The article outlines Apache Doris's 2026 roadmap, detailing how the database will shift from pure analytics to a unified AI‑enabled platform with enhanced semi‑structured data support, vector and hybrid search, agent‑focused capabilities, and expanded storage and lakehouse integrations to meet emerging AI workloads.

AI integration · Apache Doris · Data Lake

0 likes · 14 min read

DataFunSummit

Apr 17, 2026 · Artificial Intelligence

Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions

This article dissects the hype‑versus‑reality gap of Retrieval‑Augmented Generation in enterprises, exposing low recall, hallucinations, and cost overruns, then offers a systematic diagnosis, hybrid search, reranking, security controls, and advanced GraphRAG and Agentic RAG strategies to achieve reliable production deployments.

Enterprise AI · LLM · RAG

0 likes · 17 min read

AI Explorer

Apr 16, 2026 · Artificial Intelligence

Build an AI Agent Memory Engine with Just Six Lines of Code

The open‑source Cognee project lets developers give AI agents a dynamic, long‑term memory by combining vector search, graph databases and cognitive techniques, and it can be set up with only six lines of Python code, as demonstrated with a quick‑start example.

AI memory · Python · cognee

0 likes · 6 min read

Alibaba Cloud Infrastructure

Apr 13, 2026 · Artificial Intelligence

How to Speed Up Bulk Vector Searches with CLI and SDK Concurrency

This guide explains how to dramatically reduce latency for batch semantic search, RAG multi‑path retrieval, and multimodal vector queries by running multiple OSS Vectors embed requests in parallel, using CLI‑level techniques (xargs and shell background jobs), Python asyncio, and SDK‑level concurrency.

CLI · Go · OSS

0 likes · 21 min read

DeepHub IMBA

Apr 11, 2026 · Artificial Intelligence

Understanding Vector Similarity Search: Flat Index, IVF, and HNSW

This article explains why vector databases are needed for semantic search of unstructured data, introduces cosine similarity as the similarity measure, and provides a detailed, step‑by‑step comparison of three core index types (Flat Index, IVF, and HNSW), highlighting their trade‑offs in accuracy and speed.

HNSW · IVF · embeddings

0 likes · 10 min read

Mingyi World Elasticsearch

Apr 10, 2026 · Artificial Intelligence

Easysearch vs Elasticsearch Vector Search: Compatibility Explained in One Guide

The article compares Easysearch and Elasticsearch vector‑search capabilities, showing that both support vector queries but use different field types and DSL structures, and it outlines migration pitfalls and practical advice for choosing the right system.

API compatibility · Easysearch · Elasticsearch

0 likes · 7 min read

Wu Shixiong's Large Model Academy

Apr 7, 2026 · Artificial Intelligence

Why Hybrid Retrieval Beats Pure Vector Search: BM25, RRF, and Real‑World Experiments

This article dissects the shortcomings of pure vector retrieval, explains how BM25 complements it, compares weighted‑sum and Reciprocal Rank Fusion (RRF) strategies, shows experimental results that identify optimal weight and k values, and provides practical engineering tips for deploying hybrid search in RAG systems.

BM25 · Hybrid Retrieval · RAG Systems

0 likes · 24 min read

DataFunTalk

Apr 1, 2026 · Industry Insights

How Oracle’s AI‑Powered Database Is Turning Data Sovereignty into a Competitive Edge

Oracle’s 2026 AI database rollout fuses vector search, private AI agents, unified memory, and deep data security directly into the database engine, challenging the cloud‑centric data‑movement paradigm and prompting a market shift that could revive Oracle’s dominance while reshaping strategies for DBAs, AI engineers, and decision makers.

AI Database · Database Architecture · Industry Trends

0 likes · 13 min read

Ray's Galactic Tech

Mar 30, 2026 · Artificial Intelligence

From Demo to Production: Building an Enterprise‑Grade RAG System with Spring AI & PGVector

This comprehensive guide explains how to design, implement, and operate a production‑ready Retrieval‑Augmented Generation (RAG) platform using Spring AI and PostgreSQL PGVector, covering architecture, indexing, hybrid retrieval, prompt engineering, scaling, security, observability, deployment, and common pitfalls for enterprise knowledge‑base applications.

Enterprise AI · Hybrid Retrieval · Observability

0 likes · 42 min read

Alibaba Cloud Infrastructure

Mar 27, 2026 · Artificial Intelligence

Build a Scalable Multimodal Image Search with Alibaba Cloud OSS Vector Buckets

This guide walks through setting up Alibaba Cloud OSS Vector Buckets, installing the necessary SDKs, uploading image datasets, creating vector indexes, generating embeddings with the Bailian multimodal model, writing vectors, performing semantic searches, and visualizing results via a Gradio web UI.

AI Retrieval · Gradio UI · OSS

0 likes · 27 min read

Open Source Tech Hub

Mar 25, 2026 · Artificial Intelligence

How to Build Hybrid Vector and Full‑Text Search with PHPVector in PHP 8.2

This guide introduces PHPVector, a pure‑PHP vector database that combines HNSW‑based approximate nearest‑neighbor search with BM25 full‑text ranking, showing installation, document insertion, vector and text queries, hybrid ranking modes, configuration options, distance metrics, tuning tips, and persistence mechanisms.

AI · BM25 · HNSW

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

Mar 24, 2026 · Artificial Intelligence

How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs

This article explains how the combination of Hologres, a unified real‑time data warehouse, and Mem0, an open‑source LLM memory framework, overcomes the limited context window of large language models by providing scalable, low‑latency, and cost‑effective long‑term memory for AI applications.

AI Infrastructure · Hologres · LLM

0 likes · 11 min read

Data Party THU

Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLM · Query Optimization · RAG

0 likes · 9 min read

Alibaba Cloud Developer

Mar 19, 2026 · Artificial Intelligence

How Engineering Knowledge Engines Turn AI Coders into Reliable Collaborators

The article analyzes the limitations of current AI coding agents—narrow perception, fragmented knowledge, and missing high‑dimensional context—and presents an Engineering Knowledge Engine that integrates vector retrieval, code and commit graphs, RepoWiki, memory, and Agentic Search to provide structured, evolving context, dramatically improving task success, token efficiency, and code quality.

AI · Agentic Search · Code Graph

0 likes · 11 min read

Tech Freedom Circle

Mar 19, 2026 · Artificial Intelligence

Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need

The article dissects a failed Alibaba second‑round interview where the candidate answered only “vector‑search‑enhanced” for a RAG design, and then presents a systematic, four‑module RAG architecture together with six design principles, detailed indexing, query understanding, multi‑path recall, and context generation techniques to help candidates demonstrate comprehensive technical depth.

AI Architecture · Knowledge Graph · Multi‑Path Recall

0 likes · 22 min read

DeepHub IMBA

Mar 17, 2026 · Artificial Intelligence

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

The article examines how retrieval‑augmented generation suffers from poor query formulation and presents two advanced strategies—query translation, which generates multiple semantically similar variants, and query decomposition, which breaks complex questions into finer sub‑queries—detailing methods such as fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and explains when to combine them.

Hybrid Retrieval · LLM · Query Decomposition

0 likes · 9 min read

Mingyi World Elasticsearch

Mar 11, 2026 · Backend Development

How to Achieve One‑Line Semantic Search for Nearby Clean Coffee Shops with Elasticsearch

This article walks through building a practical Elasticsearch demo that lets users type a single query like “nearby clean coffee shop” and get results by combining dense‑vector semantic search, geo filtering, BM25, and a hybrid RRF‑style ranking, with both LLM‑based structuring and a fallback hash‑based embedding.

BM25 · Flask · Hybrid Search

0 likes · 10 min read

AI Explorer

Mar 11, 2026 · Artificial Intelligence

Gemini Embedding 2: Google’s First Native Multimodal Embedding Model

Google’s Gemini Embedding 2 introduces a native multimodal embedding model that maps text, images, video, audio, and documents into a single vector space, offers three configurable dimensions, achieves state‑of‑the‑art benchmarks across modalities, and enables cross‑modal search, RAG, and seamless integration with major vector databases.

AI models · Gemini Embedding · Matryoshka representation

0 likes · 8 min read

Big Data Technology Tribe

Mar 10, 2026 · Databases

How Lance Builds Scalar and Vector Indexes: A Deep Dive into create_index

This article explains how Lance's Python API creates scalar and vector indexes, walks through the internal Rust implementation of the create_index workflow, and details the transaction, commit, and error‑handling mechanisms that ensure atomic and consistent index creation.

Indexing · Lance · create_index

0 likes · 12 min read

Data STUDIO

Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

Embedding · Knowledge Graph · LLM

0 likes · 31 min read

AI2ML AI to Machine Learning

Feb 27, 2026 · Artificial Intelligence

Why No Single Algorithm Dominates Vector Search: A Deep Dive into Modern Vector DBs

The article surveys emerging vector databases, explains how various vector‑search algorithms such as FLAT, IVF, HNSW, DiskANN and ScaNN differ in accuracy, speed, memory use and build time, and provides practical guidance for choosing the right index based on data size, latency and resource constraints.

DiskANN · HNSW · ScaNN

0 likes · 9 min read

Alibaba Cloud Big Data AI Platform

Feb 25, 2026 · Artificial Intelligence

How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service

The Taobao‑Tmall customer operations team built an integrated vector‑plus‑full‑text retrieval solution on Hologres, achieving millisecond‑level recall for massive unstructured knowledge bases and boosting intelligent customer service, rule comparison, and sentiment analysis across multiple business scenarios.

AI Retrieval · Full-Text Search · Hologres

0 likes · 12 min read

ByteDance Data Platform

Feb 11, 2026 · Databases

How ByteHouse Redefines Real‑Time Multimodal Analytics with a Cloud‑Native Data Warehouse

ByteHouse, ByteDance's cloud‑native data warehouse, evolves from a traditional warehouse to a next‑generation AI‑ready platform that handles 800+ PB of data, supports 25,000 nodes, and delivers real‑time, multimodal analytics through a decoupled storage‑compute architecture, AI‑driven query optimization, and native vector search integration.

Cloud Native · Databases · ai-optimization

0 likes · 9 min read

SpringMeng

Feb 7, 2026 · Databases

Redis’s Multithreaded Query Engine Boosts RAG Performance

Redis introduces a multithreaded query engine that keeps average latency under 10 ms while delivering up to 16× higher throughput for vector‑search workloads, enabling faster retrieval‑augmented generation (RAG) applications and outperforming pure vector databases and managed Redis services in benchmark tests.

Multithreaded Query · RAG · Redis

0 likes · 6 min read

Amazon Cloud Developers

Feb 5, 2026 · Cloud Computing

How to Build a Fast, Accurate AI‑Powered Knowledge Base with Amazon OpenSearch and DeepSeek

This article walks through using Amazon OpenSearch Service’s vector search and ML connector together with the DeepSeek large language model to create a low‑cost, high‑efficiency enterprise knowledge base, covering architecture, step‑by‑step deployment, RAG pipeline configuration, and conversational search extensions.

Amazon OpenSearch · DeepSeek · Knowledge Base

0 likes · 17 min read

Architecture and Beyond

Feb 1, 2026 · Artificial Intelligence

5 High‑ROI Strategies to Supercharge RAG Retrieval Performance

This article outlines five practical engineering strategies—multi‑vector retrieval, manual splitting and labeling, scalar enhancement, context augmentation, and dense‑sparse vector integration—that together address common RAG retrieval bottlenecks and dramatically improve recall stability and answer quality.

BM25 · LLM · RAG

0 likes · 17 min read

Tech Musings

Jan 29, 2026 · Databases

Mastering Redis 8 Vector Search: Indexing, Hybrid Retrieval, and Re‑ranking Techniques

This article explains how to use Redis 8.4.0 for vector recall and keyword filtering, covering index selection (FLAT vs HNSW), schema creation with redisvl, full‑text BM25 search, pure KNN vector queries, hybrid text‑plus‑vector retrieval, query cleaning, score fusion, and optional in‑Redis Lua re‑ranking or TAG‑based filtering extensions.

Indexing · Python · vector search

0 likes · 15 min read

PaperAgent

Jan 28, 2026 · Artificial Intelligence

How Clawdbot Achieves Persistent, Local Memory for LLM Agents

Clawdbot implements a fully local, persistent memory system for LLM agents by storing context and long‑term knowledge in editable Markdown files, indexing them with SQLite‑vec and FTS5, supporting multi‑agent isolation, compression, pruning, and configurable session lifecycles to maintain efficient, cost‑effective interactions.

LLM Agents · context compression · local storage

0 likes · 13 min read

Tech Musings

Jan 28, 2026 · Databases

Building a CPU‑Only Poetry Retrieval Engine with Qwen Embeddings and Redis Vector Search

This article details a lightweight, CPU‑only knowledge‑base retrieval experiment that uses Qwen3‑Embedding‑0.6B to vectorize Chinese poetry, stores vectors in Redis with HNSW indexing, and implements a hybrid keyword‑plus‑vector search pipeline with configurable weighting and performance optimizations.

CPU · Embedding · Knowledge Base

0 likes · 11 min read

StarRocks

Jan 15, 2026 · Artificial Intelligence

How AI‑First Lakehouse Redefines Data Platforms for Multimodal Analytics

The article outlines the evolution from traditional OLAP to an AI‑first Lakehouse, detailing unified multimodal storage, CPU/GPU heterogeneous scheduling, native vector search, in‑database AI inference, agent‑centric execution, and self‑evolving platform capabilities that together reshape modern data analytics.

AI · Big Data · In‑Database Inference

0 likes · 11 min read

Alibaba Cloud Big Data AI Platform

Dec 29, 2025 · Cloud Native

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

This case study details how a major internet visual platform consolidated its log, keyword, and vector search workloads onto Alibaba Cloud Elasticsearch, eliminating three separate pipelines, reducing write‑costs by 60%, cutting storage expenses over 60%, and achieving multi‑fold performance gains through serverless scaling, FalconSeek engine optimizations, and unified monitoring.

Elasticsearch · RAG · Search Architecture

0 likes · 10 min read

360 Tech Engineering

Dec 25, 2025 · Artificial Intelligence

Choosing Between Vector Knowledge Bases and Knowledge Graphs for RAG

This article explains the definitions, differences, and integration trends of Knowledge Bases and Knowledge Graphs within Retrieval‑Augmented Generation, helping developers decide which technology best fits their AI system requirements.

AI Retrieval · GraphRAG · Hybrid Search

0 likes · 9 min read

Data STUDIO

Dec 23, 2025 · Databases

Is the Vector Database Dead? PostgreSQL’s New pgvector Feature Puts Closed‑Source Solutions on the Spot

The article examines how PostgreSQL’s latest pgvector 0.8.0 release adds iterative index scans and smart query planning, enabling fully free vector search within an existing relational database, compares performance, cost, and architecture against dedicated vector databases like Pinecone, and outlines migration steps and best‑practice guidelines.

AI · PostgreSQL · benchmark

0 likes · 14 min read

Data STUDIO

Dec 17, 2025 · Databases

One‑Stop Python Database Guide: From MySQL & PostgreSQL to MongoDB, Redis, Neo4j and Vector Stores

This article walks through the strengths, typical use cases, and Python connection code for relational databases (MySQL, PostgreSQL, SQLite), NoSQL stores (MongoDB, Redis, Neo4j), cloud‑native options (DynamoDB), and emerging vector databases such as Milvus, helping you decide which fits your project.

MongoDB · PostgreSQL · Python

0 likes · 23 min read

Volcano Engine Developer Services

Dec 5, 2025 · Artificial Intelligence

Why Vectors Power Scalable AI Search and How S3 Vectors Redefines Storage

This article explains how high‑dimensional vectors enable semantic AI search, compares exact and approximate nearest‑neighbor algorithms, examines the challenges of large‑scale vector storage, and evaluates AWS S3 Vectors' architecture, pricing, and hybrid solutions for cost‑effective, high‑performance retrieval.

AI semantics · ANN · S3 Vectors

0 likes · 17 min read

Amazon Cloud Developers

Dec 5, 2025 · Cloud Computing

Hard‑Core Cloud Foundations Power Agentic AI: Highlights from re:Invent 2025 Peter & Dave Keynote

At re:Invent 2025, AWS executives Peter DeSantis and Dave Brown detailed a series of hardware and service innovations—including Graviton5, Trainium3/4, Lambda Managed Instances, Project Mantle, and S3 Vectors—showcasing how security, availability, elasticity, cost, and agility are becoming even more critical for the AI era, with concrete performance benchmarks from customers such as Airbnb, Apple, and Twelve Labs.

AI · AWS · Cloud

0 likes · 14 min read

Yiche Technology

Dec 3, 2025 · Artificial Intelligence

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

This article details how an automotive AI assistant migrated from keyword matching to a Milvus‑based vector retrieval system, overcoming semantic gaps, scaling to millions of daily queries, optimizing indexing, introducing multi‑vector and sparse‑vector search, and building a real‑time RAG pipeline with Flink.

AI assistant · Milvus · RAG

0 likes · 12 min read

Data STUDIO

Dec 3, 2025 · Artificial Intelligence

Pixeltable: One Table to Power Multimodal AI with Declarative Python

Pixeltable introduces a unified table abstraction that treats images, text, embeddings and model outputs as columns, enabling declarative multimodal AI pipelines, eliminating glue code, supporting built‑in vector indexing, versioned experiments, extensible custom functions, and a concise 30‑line RAG implementation.

Declarative programming · Multimodal AI · Pixeltable

0 likes · 15 min read

Xiaolei Talks DB

Nov 25, 2025 · Databases

What’s New in MongoDB 8.2? Performance Boosts, AI Features, and Multi‑Cloud Power

The article reviews MongoDB 8.2’s major upgrades, highlighting up to 36% read throughput gains, 59% write speed improvements, 200% faster time‑series aggregation, 50‑fold faster shard rebalancing, enhanced queryable encryption, native vector search, multi‑cloud Atlas support, and AI‑driven capabilities such as hybrid search and the MongoDB AMP platform.

AI · Encryption · MongoDB

0 likes · 7 min read

Wu Shixiong's Large Model Academy

Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI Engineering · Caching · LLM

0 likes · 10 min read

Aikesheng Open Source Community

Nov 19, 2025 · Databases

Getting Started with SeekDB: An AI‑Native Embedded Search Database

This guide introduces SeekDB, an AI‑native embedded database that unifies vector, text, and structured search, walks through installation with uv, shows a complete Python demo, and shares practical pros, cons, and usage tips for developers.

AI-native database · Python SDK · SeekDB

0 likes · 5 min read

DevOps Coach

Nov 13, 2025 · Databases

Explore ClickHouse 25.10: 20 JOIN Boosts, Vector Search & New SQL

ClickHouse 25.10 introduces a suite of enhancements—including 20 JOIN performance upgrades, lazy column replication, Bloom filter runtime filters, disjunction push‑down, automatic column statistics, the QBit vector type, expanded SQL operators, negative LIMIT/OFFSET, Arrow Flight support, and delayed secondary index materialization—backed by detailed benchmarks and contributor acknowledgments.

ClickHouse · SQL Extensions · database

0 likes · 23 min read

Alibaba Cloud Big Data AI Platform

Nov 6, 2025 · Artificial Intelligence

How GPU‑Accelerated NN‑Descent Boosts Vector Search Speed by Up to 13×

This article explains how unstructured multimedia data is transformed into vectors for similarity search, introduces GPU parallelism and the NN‑Descent algorithm to replace traditional HNSW indexing in OpenSearch, and presents benchmark results showing up to a thirteen‑fold speed improvement while maintaining comparable recall.

GPU Acceleration · NN-Descent · OpenSearch

0 likes · 12 min read

Data STUDIO

Nov 4, 2025 · Artificial Intelligence

How to Build a Memory-Enabled AI Agent with SQLite and Vector Search

This article explains how to give AI agents persistent memory, reflection, and goal‑tracking by storing interaction summaries in SQLite, embedding them for semantic retrieval with a vector database, and using LLM‑generated prompts to recall, reflect, and manage objectives across sessions.

AI Agent · Goal Tracking · LLM

0 likes · 10 min read

Amazon Cloud Developers

Oct 29, 2025 · Artificial Intelligence

How Amazon Nova’s Multimodal Embedding Model Handles All Modalities in One Go

Amazon Nova, a new multimodal embedding model now available on Amazon Bedrock, unifies text, document, image, video, and audio into a single semantic space, offering up to 8000‑token context, multiple output dimensions, and detailed Python examples for embedding generation, storage, and cross‑modal search.

AWS Bedrock · Amazon Nova · Python SDK

0 likes · 19 min read

Volcano Engine Developer Services

Oct 20, 2025 · Artificial Intelligence

How DiskANN + RaBitQ Supercharges Milvus: 5× Faster, 90% Cheaper Vector Search

This article explains how integrating the disk‑based DiskANN index with the ultra‑compact RaBitQ quantization dramatically boosts Milvus's vector search performance and cuts costs, delivering over five times higher QPS and more than 90% cost reduction for billion‑scale AI workloads.

AI · DiskANN · Milvus

0 likes · 11 min read

JD Tech

Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

Chunking · LLM · Metadata

0 likes · 14 min read

JavaGuide

Oct 8, 2025 · Databases

Is MySQL Now the Runner‑Up to PostgreSQL in the AI Era?

While MySQL has long dominated relational databases with its open‑source stability and massive user base, the rise of AI and PostgreSQL’s extensible ecosystem—highlighted by extensions like pgvector, pg_bm25, TimescaleDB and PostGIS—are shifting developer preference, as shown by the 2025 Stack Overflow survey.

AI · Database Extensibility · PostgreSQL

0 likes · 6 min read

AI Large Model Application Practice

Sep 28, 2025 · Artificial Intelligence

Unlock MindsDB Knowledge Base: Build RAG Pipelines and Data Agents with SQL

This article walks through MindsDB’s Knowledge Base feature, showing how to map data sources, create vector indexes, perform semantic search, combine multiple sources with SQL joins, automate updates via jobs, and construct powerful RAG pipelines and Data Agents for AI‑driven query answering.

AI · Data Agent · MindsDB

0 likes · 14 min read

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking

The article provides a detailed technical analysis of RAGFlow's search engine, covering the Searcher class coordination, adaptive multi‑path retrieval (vector, keyword, and knowledge‑graph), intelligent fusion with weighted scoring, caching, performance monitoring, and both built‑in and model‑driven reranking to achieve high‑precision results.

Performance Optimization · Reranking · Search Engine

0 likes · 32 min read

Aikesheng Open Source Community

Sep 17, 2025 · Artificial Intelligence

How MySQL AI Brings Built‑In Machine Learning and GenAI to Your Database

MySQL AI, introduced for the database's 30th anniversary, integrates auto‑ML, generative AI, LLM‑driven text‑to‑SQL, and a vector engine directly into MySQL Enterprise, enabling developers to build intelligent applications on familiar SQL tools without moving data.

AI · AutoML · GenAI

0 likes · 8 min read

Architecture & Thinking

Sep 10, 2025 · Databases

Redis 8.0 Unveiled: AGPLv3, Vector Search, JSON, and 16× Query Boost

Redis 8.0, released on May 1, 2025, introduces a major license shift to AGPLv3, eight new native data structures—including vector sets, JSON, and time‑series—alongside a rebuilt query engine that delivers up to 16‑fold performance gains, enhanced security, and cloud‑native capabilities.

AGPLv3 · Redis · json

0 likes · 15 min read

DataFunSummit

Sep 4, 2025 · Artificial Intelligence

Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation

This article explores the evolving search demands of the intelligent era, explains dense and sparse vector concepts, details Elasticsearch's vector search capabilities and recent performance breakthroughs, introduces hybrid and relevance‑tuning techniques, and demonstrates RAG principles and real‑world enterprise use cases.

AI · Elasticsearch · Hybrid Search

0 likes · 14 min read

Xiaolei Talks DB

Aug 28, 2025 · Databases

How AI Is Transforming Databases: Highlights from China’s DTCC2025

At DTCC2025 in Beijing, industry leaders showcased AI-driven innovations, vector database advances, RAG techniques, and distributed database performance breakthroughs, illustrating how databases are evolving from passive data stores into intelligent, autonomous systems that boost efficiency, scalability, and business value across sectors.

AI · Cloud · Databases

0 likes · 10 min read

Alibaba Cloud Big Data AI Platform

Aug 7, 2025 · Databases

Unlock Powerful Hybrid Search with Milvus 2.5: Full-Text, BM25, and RAG Guide

This tutorial explains how to use Milvus 2.5's new full‑text, BM25 keyword matching, and hybrid search capabilities—including step‑by‑step setup, schema design, code examples, and RAG integration—to achieve high recall and precision in large‑scale AI vector retrieval scenarios.

Full-Text Search · Hybrid Search · Milvus

0 likes · 13 min read

Mingyi World Elasticsearch

Aug 5, 2025 · Artificial Intelligence

Enterprise Semantic Search: Key Q&A on Scoring, Recall, LSH, Chunking, and Embedding Dimensions

This article answers practical questions about enterprise semantic search, explaining how Reciprocal Rank Fusion normalizes mixed scoring, how to control vector result size, the trade‑offs of LSH parameters, word‑ and sentence‑based chunking strategies with version‑specific defaults, and flexible embedding dimensionality.

Chunking · Elasticsearch · LSH

0 likes · 8 min read

Alibaba Cloud Developer

Aug 5, 2025 · Databases

How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine

This article explains how PolarDB IMCI integrates vector indexing and embedding directly into the database kernel, offering a unified, transactional, and real‑time vector lifecycle management service that lets developers build RAG knowledge bases and AI applications using only standard SQL, dramatically reducing development and operational complexity.

AI · Polardb · RAG

0 likes · 11 min read

Mingyi World Elasticsearch

Jul 30, 2025 · Backend Development

From Keyword Matching to Semantic Understanding: Building an Intelligent E‑Commerce Search Engine

The article analyzes the semantic gap in e‑commerce search, compares traditional keyword matching with vector‑based retrieval, and provides a step‑by‑step implementation using Elasticsearch/Easysearch pipelines, embedding models, and a hybrid search strategy to improve user intent understanding.

Easysearch · Elasticsearch · Hybrid Search

0 likes · 11 min read

Sohu Tech Products

Jul 23, 2025 · Artificial Intelligence

Boosting Video Moderation with Multimodal CLIP and Efficient Vector Search

This article describes how a video review system combines multimodal CLIP models, image‑text feature alignment, and optimized vector‑search databases such as RedisSearch and Elasticsearch to detect prohibited content in real time and perform large‑scale historical recall, while addressing challenges of generalization, storage cost, and inference speed.

AI · CLIP · model fine-tuning

0 likes · 18 min read

Mingyi World Elasticsearch

Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

Elasticsearch · Knowledge Base · Large Language Model

0 likes · 1 min read

DataFunSummit

Jul 15, 2025 · Artificial Intelligence

Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications

This article explains why traditional keyword search falls short, introduces Elasticsearch's vector search and hybrid retrieval capabilities, and shows how combining it with large language models enables Retrieval‑Augmented Generation (RAG) for more accurate, context‑aware AI-driven search across text and multimedia data.

AI · Elasticsearch · RAG

0 likes · 5 min read