Tagged articles
187 articles
Page 1 of 2
DataFunSummit
DataFunSummit
May 20, 2026 · Databases

Apache Doris 4.1: A Unified Data Store and Retrieval Engine for AI & Search

Apache Doris 4.1 introduces a systematic evolution for AI and search workloads, adding low‑cost massive vector storage, unified structured, full‑text and vector search, 100 MB JSON document support, Segment V3 metadata decoupling, sparse column optimizations, lakehouse lifecycle management, and a suite of performance‑boosting features such as aggregate push‑down, condition cache, and spill‑to‑disk, all backed by detailed benchmark results.

AIApache DorisLakehouse
0 likes · 30 min read
Apache Doris 4.1: A Unified Data Store and Retrieval Engine for AI & Search
AI Engineer Programming
AI Engineer Programming
May 20, 2026 · Artificial Intelligence

Why Chunk‑Based RAG Fails and How IdeaBlocks Improve Retrieval

The article argues that the common assumption that text chunks are the proper knowledge unit in RAG pipelines is flawed, leading to versioning, metadata, and redundancy problems, and demonstrates that replacing chunks with structured IdeaBlocks dramatically reduces corpus size, token usage, and improves vector relevance.

IdeaBlockLLMRAG
0 likes · 10 min read
Why Chunk‑Based RAG Fails and How IdeaBlocks Improve Retrieval
DeepHub IMBA
DeepHub IMBA
May 14, 2026 · Artificial Intelligence

How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding

The article explains how Hypothetical Document Embeddings (HyDE) improve Retrieval‑Augmented Generation by generating a synthetic answer before vector search, allowing the system to embed richer semantic intent rather than relying on shallow keyword similarity, and provides a step‑by‑step implementation using LangChain.

HyDELLMLangChain
0 likes · 6 min read
How HyDE Transforms RAG Retrieval from Keyword Matching to Intent Understanding
AI Engineer Programming
AI Engineer Programming
May 8, 2026 · Artificial Intelligence

Is Non-Vector RAG the Next Generation of Retrieval‑Augmented Generation?

The article analyses the relevance and accuracy shortcomings of traditional vector‑based RAG, explains how non‑vector approaches like PageIndex let LLMs navigate document trees for relevance classification and auditability, and evaluates their complexity, latency, metadata risks, and suitable use cases compared with hybrid retrieval.

Hybrid RetrievalLLMRAG
0 likes · 8 min read
Is Non-Vector RAG the Next Generation of Retrieval‑Augmented Generation?
Lao Guo's Learning Space
Lao Guo's Learning Space
May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

Enterprise AIHybrid RetrievalRAG
0 likes · 12 min read
Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide
Spring Full-Stack Practical Cases
Spring Full-Stack Practical Cases
May 3, 2026 · Artificial Intelligence

9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained

This article introduces Retrieval‑Augmented Generation (RAG) and systematically details nine distinct RAG architectures—standard, conversational with memory, corrective (CRAG), adaptive, self‑RAG, fusion, HyDE, agentic, and Graph RAG—highlighting their workflows, real‑world examples, advantages, and trade‑offs.

AI ArchitectureGraphRAGLLM
0 likes · 17 min read
9 Advanced Retrieval‑Augmented Generation (RAG) Architectures Explained
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
May 1, 2026 · Artificial Intelligence

Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play

The article explains how Alibaba Cloud's Milvus Embedding Service eliminates the need for self‑hosted embedding models by integrating model inference, vector generation and Milvus indexing into a managed pipeline, dramatically reducing deployment complexity, operational overhead, and time‑to‑value for semantic search, RAG and multimodal retrieval use cases.

Alibaba CloudEmbeddingMilvus
0 likes · 19 min read
Zero Deployment, Zero Ops: Alibaba Cloud Milvus Embedding Service Makes Vectorization Plug‑and‑Play
DeepHub IMBA
DeepHub IMBA
Apr 30, 2026 · Artificial Intelligence

Why Real RAG Systems Need Both BM25 and Vector Search

The article analyzes how BM25 excels at exact token matching while vector embeddings capture semantic intent, explains their distinct failure modes, and shows that a hybrid retriever—combined with metadata filtering, proper chunking, and reciprocal rank fusion—delivers the most reliable results for RAG pipelines.

BM25EmbeddingHybrid Retrieval
0 likes · 17 min read
Why Real RAG Systems Need Both BM25 and Vector Search
AI Architect Hub
AI Architect Hub
Apr 30, 2026 · Artificial Intelligence

How AI Understands Your Queries: Core Techniques of Semantic Vector Search

The article explains why traditional keyword search often fails when user questions differ from knowledge‑base wording, introduces semantic search that matches queries and documents via vector similarity, details query understanding and rewriting techniques, lists common pitfalls, provides a full Python implementation, and shares best‑practice recommendations.

AIPythonRAG
0 likes · 16 min read
How AI Understands Your Queries: Core Techniques of Semantic Vector Search
Architect's Tech Stack
Architect's Tech Stack
Apr 29, 2026 · Databases

Redis 8.0 Beyond Simple Caching: 16 Powerful Use Cases You Must Try

Redis 8.0 consolidates many previously external modules—JSON, time‑series, vector search, probabilistic data structures, and more—into a single package, and this article walks through 16 concrete scenarios ranging from field‑level cache expiration to AI‑ready vector similarity search, showing exact commands and when to prefer each feature.

Full‑Text SearchRedis 8.0caching
0 likes · 19 min read
Redis 8.0 Beyond Simple Caching: 16 Powerful Use Cases You Must Try
AI Architect Hub
AI Architect Hub
Apr 27, 2026 · Artificial Intelligence

Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes

The article explains why brute‑force vector search is painfully slow, introduces Flat, IVF, and HNSW index structures, compares their speed, memory and accuracy, shows common pitfalls, provides production‑grade Python code, and presents benchmark results that demonstrate HNSW’s superior speed‑accuracy trade‑off.

AIFAISSHNSW
0 likes · 12 min read
Why HNSW Can Speed Up Search 50× Compared to Brute‑Force? A Hands‑On Guide to Building Vector Indexes
The Dominant Programmer
The Dominant Programmer
Apr 27, 2026 · Artificial Intelligence

Building a Private Document Vector Search with SpringBoot, LangChain4j, and Ollama RAG

This guide walks through why Retrieval‑Augmented Generation (RAG) is needed for large language models, explains the three‑step indexing and query workflow, details LangChain4j’s core components, and provides a complete SpringBoot example—including Maven setup, configuration, service code, and troubleshooting—to create a private document‑vector search system powered by Ollama.

EmbeddingLangChain4jOllama
0 likes · 13 min read
Building a Private Document Vector Search with SpringBoot, LangChain4j, and Ollama RAG
dbaplus Community
dbaplus Community
Apr 26, 2026 · Databases

Why PostgreSQL Is the Better Choice in 99% of Scenarios

The article argues that relying on many specialized databases creates operational complexity, higher costs, and maintenance overhead, while PostgreSQL’s extensible ecosystem—offering full‑text search, vector, time‑series, JSONB, and more—delivers comparable or superior algorithms, proven performance, and a simpler, more reliable stack for the vast majority of use cases, especially in AI applications.

AIExtensionsFull‑Text Search
0 likes · 19 min read
Why PostgreSQL Is the Better Choice in 99% of Scenarios
AI Engineer Programming
AI Engineer Programming
Apr 25, 2026 · Artificial Intelligence

Quantization Across Signal Processing, AI Inference, and RAG Vector Search

This article explains how quantization—originating from signal processing—reduces precision to save resources, details its application to neural network weights and activations via PTQ, QAT, GPTQ, AWQ, and SmoothQuant, and shows how vector quantization enables fast, memory‑efficient retrieval in large‑scale RAG systems.

AWQGPTQLLM
0 likes · 19 min read
Quantization Across Signal Processing, AI Inference, and RAG Vector Search
DataFunTalk
DataFunTalk
Apr 24, 2026 · Databases

DM GDMBASE V4.0: HyperRAG, Long‑Term Memory & NL Agents for Graph‑Vector AI

At the 2026 China Database Technology & Industry Conference, DM unveiled GDMBASE V4.0, a graph database that natively fuses vectors and graphs, introduces HyperRAG, long‑term memory, and a natural‑language agent, and delivers sub‑500 ms retrieval, 30% higher recall and 60% lower hallucination rates for AI workloads.

AI integrationHybrid RetrievalHyperRAG
0 likes · 12 min read
DM GDMBASE V4.0: HyperRAG, Long‑Term Memory & NL Agents for Graph‑Vector AI
James' Growth Diary
James' Growth Diary
Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

Hybrid SearchLangChainMilvus
0 likes · 18 min read
Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices
AI Engineer Programming
AI Engineer Programming
Apr 21, 2026 · Artificial Intelligence

From Bag‑of‑Words to Semantic Vectors: Understanding Embeddings and Similarity Search (Part 1)

The article explains how diverse data can be represented as high‑dimensional vectors, describes exact and approximate nearest‑neighbor search, explores vector quantization, product quantization, locality‑sensitive hashing, and HNSW graphs, and analyzes their speed, accuracy, and memory trade‑offs for large‑scale similarity retrieval.

HNSWLSHembeddings
0 likes · 16 min read
From Bag‑of‑Words to Semantic Vectors: Understanding Embeddings and Similarity Search (Part 1)
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Apr 19, 2026 · Industry Insights

ElasticStack 2026: Beyond New Versions, It’s Becoming an Agent Platform

In early 2026 ElasticStack transformed from a traditional search‑log‑visualization stack into an Agent platform, accelerating releases across three lines, elevating Elasticsearch to a context‑engineered infrastructure, unifying ES|QL as a platform‑wide interaction layer, and integrating Workflows, MCP, and vector enhancements to drive autonomous observability and security operations.

Agent PlatformElasticStackElasticsearch
0 likes · 20 min read
ElasticStack 2026: Beyond New Versions, It’s Becoming an Agent Platform
DataFunTalk
DataFunTalk
Apr 18, 2026 · Databases

How Will Apache Doris Evolve in 2026 to Power AI‑Driven Data Workloads?

The article outlines Apache Doris's 2026 roadmap, detailing how the database will shift from pure analytics to a unified AI‑enabled platform with enhanced semi‑structured data support, vector and hybrid search, agent‑focused capabilities, and expanded storage and lakehouse integrations to meet emerging AI workloads.

AI integrationApache DorisData Lake
0 likes · 14 min read
How Will Apache Doris Evolve in 2026 to Power AI‑Driven Data Workloads?
DataFunSummit
DataFunSummit
Apr 17, 2026 · Artificial Intelligence

Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions

This article dissects the hype‑versus‑reality gap of Retrieval‑Augmented Generation in enterprises, exposing low recall, hallucinations, and cost overruns, then offers a systematic diagnosis, hybrid search, reranking, security controls, and advanced GraphRAG and Agentic RAG strategies to achieve reliable production deployments.

Enterprise AILLMRAG
0 likes · 17 min read
Why RAG Projects Fail: Real‑World Pitfalls and Proven Solutions
AI Explorer
AI Explorer
Apr 16, 2026 · Artificial Intelligence

Build an AI Agent Memory Engine with Just Six Lines of Code

The open‑source Cognee project lets developers give AI agents a dynamic, long‑term memory by combining vector search, graph databases and cognitive techniques, and it can be set up with only six lines of Python code, as demonstrated with a quick‑start example.

AI memoryPythoncognee
0 likes · 6 min read
Build an AI Agent Memory Engine with Just Six Lines of Code
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 13, 2026 · Artificial Intelligence

How to Speed Up Bulk Vector Searches with CLI and SDK Concurrency

This guide explains how to dramatically reduce latency for batch semantic search, RAG multi‑path retrieval, and multimodal vector queries by running multiple OSS Vectors embed requests in parallel using CLI‑based, xargs, shell background jobs, Python asyncio, and SDK‑level concurrency techniques.

CLIGoOSS
0 likes · 21 min read
How to Speed Up Bulk Vector Searches with CLI and SDK Concurrency
DeepHub IMBA
DeepHub IMBA
Apr 11, 2026 · Artificial Intelligence

Understanding Vector Similarity Search: Flat Index, IVF, and HNSW

This article explains why vector databases are needed for semantic search of unstructured data and provides a detailed, step‑by‑step comparison of three core vector similarity algorithms—cosine similarity, Flat Index, IVF, and HNSW—highlighting their trade‑offs in accuracy and speed.

HNSWIVFembeddings
0 likes · 10 min read
Understanding Vector Similarity Search: Flat Index, IVF, and HNSW
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 7, 2026 · Artificial Intelligence

Why Hybrid Retrieval Beats Pure Vector Search: BM25, RRF, and Real‑World Experiments

This article dissects the shortcomings of pure vector retrieval, explains how BM25 complements it, compares weighted‑sum and Reciprocal Rank Fusion (RRF) strategies, shows experimental results that identify optimal weight and k values, and provides practical engineering tips for deploying hybrid search in RAG systems.

BM25Hybrid RetrievalParameter Tuning
0 likes · 24 min read
Why Hybrid Retrieval Beats Pure Vector Search: BM25, RRF, and Real‑World Experiments
DataFunTalk
DataFunTalk
Apr 1, 2026 · Industry Insights

How Oracle’s AI‑Powered Database Is Turning Data Sovereignty into a Competitive Edge

Oracle’s 2026 AI database rollout fuses vector search, private AI agents, unified memory, and deep data security directly into the database engine, challenging the cloud‑centric data‑movement paradigm and prompting a market shift that could revive Oracle’s dominance while reshaping strategies for DBAs, AI engineers, and decision makers.

AI DatabaseDatabase ArchitectureOracle
0 likes · 13 min read
How Oracle’s AI‑Powered Database Is Turning Data Sovereignty into a Competitive Edge
Ray's Galactic Tech
Ray's Galactic Tech
Mar 30, 2026 · Artificial Intelligence

From Demo to Production: Building an Enterprise‑Grade RAG System with Spring AI & PGVector

This comprehensive guide explains how to design, implement, and operate a production‑ready Retrieval‑Augmented Generation (RAG) platform using Spring AI and PostgreSQL PGVector, covering architecture, indexing, hybrid retrieval, prompt engineering, scaling, security, observability, deployment, and common pitfalls for enterprise knowledge‑base applications.

Enterprise AIHybrid RetrievalObservability
0 likes · 42 min read
From Demo to Production: Building an Enterprise‑Grade RAG System with Spring AI & PGVector
Open Source Tech Hub
Open Source Tech Hub
Mar 25, 2026 · Artificial Intelligence

How to Build Hybrid Vector and Full‑Text Search with PHPVector in PHP 8.2

This guide introduces PHPVector, a pure‑PHP vector database that combines HNSW‑based approximate nearest‑neighbor search with BM25 full‑text ranking, showing installation, document insertion, vector and text queries, hybrid ranking modes, configuration options, distance metrics, tuning tips, and persistence mechanisms.

AIBM25HNSW
0 likes · 10 min read
How to Build Hybrid Vector and Full‑Text Search with PHPVector in PHP 8.2
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 24, 2026 · Artificial Intelligence

How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs

This article explains how the combination of Hologres, a unified real‑time data warehouse, and Mem0, an open‑source LLM memory framework, overcomes the limited context window of large language models by providing scalable, low‑latency, and cost‑effective long‑term memory for AI applications.

AI InfrastructureHologresLLM
0 likes · 11 min read
How Hologres + Mem0 Deliver Low‑Cost, High‑Performance Long‑Memory for LLMs
Data Party THU
Data Party THU
Mar 23, 2026 · Artificial Intelligence

Boosting RAG Performance: Query Translation & Decomposition Techniques

The article explains two emerging RAG query‑optimization approaches—query translation and query decomposition—detailing fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and shows how combining them can improve relevance and latency in LLM‑augmented systems.

LLMRAGRetrieval Augmented Generation
0 likes · 9 min read
Boosting RAG Performance: Query Translation & Decomposition Techniques
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 19, 2026 · Artificial Intelligence

How Engineering Knowledge Engines Turn AI Coders into Reliable Collaborators

The article analyzes the limitations of current AI coding agents—narrow perception, fragmented knowledge, and missing high‑dimensional context—and presents an Engineering Knowledge Engine that integrates vector retrieval, code and commit graphs, RepoWiki, memory, and Agentic Search to provide structured, evolving context, dramatically improving task success, token efficiency, and code quality.

AICode GraphSoftware Engineering
0 likes · 11 min read
How Engineering Knowledge Engines Turn AI Coders into Reliable Collaborators
Tech Freedom Circle
Tech Freedom Circle
Mar 19, 2026 · Artificial Intelligence

Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need

The article dissects a failed Alibaba second‑round interview where the candidate answered only “vector‑search‑enhanced” for a RAG design, and then presents a systematic, four‑module RAG architecture together with six design principles, detailed indexing, query understanding, multi‑path recall, and context generation techniques to help candidates demonstrate comprehensive technical depth.

AI ArchitectureKnowledge GraphMulti‑Path Recall
0 likes · 22 min read
Failed Alibaba Interview: The 4 RAG Modules and 6 Design Principles You Need
DeepHub IMBA
DeepHub IMBA
Mar 17, 2026 · Artificial Intelligence

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

The article examines how retrieval‑augmented generation suffers from poor query formulation and presents two advanced strategies—query translation, which generates multiple semantically similar variants, and query decomposition, which breaks complex questions into finer sub‑queries—detailing methods such as fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and explains when to combine them.

Hybrid RetrievalLLMQuery Decomposition
0 likes · 9 min read
Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 11, 2026 · Backend Development

How to Achieve One‑Line Semantic Search for Nearby Clean Coffee Shops with Elasticsearch

This article walks through building a practical Elasticsearch demo that lets users type a single query like “nearby clean coffee shop” and get results by combining dense‑vector semantic search, geo filtering, BM25, and a hybrid RRF‑style ranking, with both LLM‑based structuring and a fallback hash‑based embedding.

BM25FlaskHybrid Search
0 likes · 10 min read
How to Achieve One‑Line Semantic Search for Nearby Clean Coffee Shops with Elasticsearch
AI Explorer
AI Explorer
Mar 11, 2026 · Artificial Intelligence

Gemini Embedding 2: Google’s First Native Multimodal Embedding Model

Google’s Gemini Embedding 2 introduces a native multimodal embedding model that maps text, images, video, audio, and documents into a single vector space, offers three configurable dimensions, achieves state‑of‑the‑art benchmarks across modalities, and enables cross‑modal search, RAG, and seamless integration with major vector databases.

AI modelsGemini EmbeddingMatryoshka representation
0 likes · 8 min read
Gemini Embedding 2: Google’s First Native Multimodal Embedding Model
Data STUDIO
Data STUDIO
Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

EmbeddingKnowledge GraphLLM
0 likes · 31 min read
Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 25, 2026 · Artificial Intelligence

How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service

The Taobao‑Tmall customer operations team built an integrated vector‑plus‑full‑text retrieval solution on Hologres, achieving millisecond‑level recall for massive unstructured knowledge bases, boosting intelligent客服, rule comparison, and sentiment analysis across multiple business scenarios.

AI RetrievalFull‑Text SearchHologres
0 likes · 12 min read
How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service
ByteDance Data Platform
ByteDance Data Platform
Feb 11, 2026 · Databases

How ByteHouse Redefines Real‑Time Multimodal Analytics with a Cloud‑Native Data Warehouse

ByteHouse, ByteDance's cloud‑native data warehouse, evolves from a traditional warehouse to a next‑generation AI‑ready platform that handles 800+ PB of data, supports 25,000 nodes, and delivers real‑time, multimodal analytics through a decoupled storage‑compute architecture, AI‑driven query optimization, and native vector search integration.

AI OptimizationCloud NativeReal-time analytics
0 likes · 9 min read
How ByteHouse Redefines Real‑Time Multimodal Analytics with a Cloud‑Native Data Warehouse
SpringMeng
SpringMeng
Feb 7, 2026 · Databases

Redis’s Multithreaded Query Engine Boosts RAG Performance

Redis introduces a multithreaded query engine that keeps average latency under 10 ms while delivering up to 16× higher throughput for vector‑search workloads, enabling faster retrieval‑augmented generation (RAG) applications and outperforming pure vector databases and managed Redis services in benchmark tests.

BenchmarkMultithreaded QueryRAG
0 likes · 6 min read
Redis’s Multithreaded Query Engine Boosts RAG Performance
Architecture and Beyond
Architecture and Beyond
Feb 1, 2026 · Artificial Intelligence

5 High‑ROI Strategies to Supercharge RAG Retrieval Performance

This article outlines five practical engineering strategies—multi‑vector retrieval, manual splitting and labeling, scalar enhancement, context augmentation, and dense‑sparse vector integration—that together address common RAG retrieval bottlenecks and dramatically improve recall stability and answer quality.

BM25EngineeringLLM
0 likes · 17 min read
5 High‑ROI Strategies to Supercharge RAG Retrieval Performance
Tech Musings
Tech Musings
Jan 29, 2026 · Databases

Mastering Redis 8 Vector Search: Indexing, Hybrid Retrieval, and Re‑ranking Techniques

This article explains how to use Redis 8.4.0 for vector recall and keyword filtering, covering index selection (FLAT vs HNSW), schema creation with redisvl, full‑text BM25 search, pure KNN vector queries, hybrid text‑plus‑vector retrieval, query cleaning, score fusion, and optional in‑Redis Lua re‑ranking or TAG‑based filtering extensions.

Pythonindexingvector search
0 likes · 15 min read
Mastering Redis 8 Vector Search: Indexing, Hybrid Retrieval, and Re‑ranking Techniques
PaperAgent
PaperAgent
Jan 28, 2026 · Artificial Intelligence

How Clawdbot Achieves Persistent, Local Memory for LLM Agents

Clawdbot implements a fully local, persistent memory system for LLM agents by storing context and long‑term knowledge in editable Markdown files, indexing them with SQLite‑vec and FTS5, supporting multi‑agent isolation, compression, pruning, and configurable session lifecycles to maintain efficient, cost‑effective interactions.

LLM agentscontext compressionlocal storage
0 likes · 13 min read
How Clawdbot Achieves Persistent, Local Memory for LLM Agents
StarRocks
StarRocks
Jan 15, 2026 · Artificial Intelligence

How AI‑First Lakehouse Redefines Data Platforms for Multimodal Analytics

The article outlines the evolution from traditional OLAP to an AI‑first Lakehouse, detailing unified multimodal storage, CPU/GPU heterogeneous scheduling, native vector search, in‑database AI inference, agent‑centric execution, and self‑evolving platform capabilities that together reshape modern data analytics.

AIAgent ArchitectureBig Data
0 likes · 11 min read
How AI‑First Lakehouse Redefines Data Platforms for Multimodal Analytics
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 29, 2025 · Cloud Native

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

This case study details how a major internet visual platform consolidated its log, keyword, and vector search workloads onto Alibaba Cloud Elasticsearch, eliminating three separate pipelines, reducing write‑costs by 60%, cutting storage expenses over 60%, and achieving multi‑fold performance gains through serverless scaling, FalconSeek engine optimizations, and unified monitoring.

Cost OptimizationElasticsearchRAG
0 likes · 10 min read
How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch
Data STUDIO
Data STUDIO
Dec 23, 2025 · Databases

Is the Vector Database Dead? PostgreSQL’s New pgvector Feature Puts Closed‑Source Solutions on the Spot

The article examines how PostgreSQL’s latest pgvector 0.8.0 release adds iterative index scans and smart query planning, enabling fully free vector search within an existing relational database, compares performance, cost, and architecture against dedicated vector databases like Pinecone, and outlines migration steps and best‑practice guidelines.

AIBenchmarkPostgreSQL
0 likes · 14 min read
Is the Vector Database Dead? PostgreSQL’s New pgvector Feature Puts Closed‑Source Solutions on the Spot
Volcano Engine Developer Services
Volcano Engine Developer Services
Dec 5, 2025 · Artificial Intelligence

Why Vectors Power Scalable AI Search and How S3 Vectors Redefines Storage

This article explains how high‑dimensional vectors enable semantic AI search, compares exact and approximate nearest‑neighbor algorithms, examines the challenges of large‑scale vector storage, and evaluates AWS S3 Vectors' architecture, pricing, and hybrid solutions for cost‑effective, high‑performance retrieval.

AI semanticsANNS3 Vectors
0 likes · 17 min read
Why Vectors Power Scalable AI Search and How S3 Vectors Redefines Storage
Yiche Technology
Yiche Technology
Dec 3, 2025 · Artificial Intelligence

How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search

This article details how an automotive AI assistant migrated from keyword matching to a Milvus‑based vector retrieval system, overcoming semantic gaps, scaling to millions of daily queries, optimizing indexing, introducing multi‑vector and sparse‑vector search, and building a real‑time RAG pipeline with Flink.

AI AssistantMilvusRAG
0 likes · 12 min read
How Milvus Powered a Scalable AI Assistant for Car Queries with Vector Search
Data STUDIO
Data STUDIO
Dec 3, 2025 · Artificial Intelligence

Pixeltable: One Table to Power Multimodal AI with Declarative Python

Pixeltable introduces a unified table abstraction that treats images, text, embeddings and model outputs as columns, enabling declarative multimodal AI pipelines, eliminating glue code, supporting built‑in vector indexing, versioned experiments, extensible custom functions, and a concise 30‑line RAG implementation.

Multimodal AIPixeltablePython
0 likes · 15 min read
Pixeltable: One Table to Power Multimodal AI with Declarative Python
Xiaolei Talks DB
Xiaolei Talks DB
Nov 25, 2025 · Databases

What’s New in MongoDB 8.2? Performance Boosts, AI Features, and Multi‑Cloud Power

The article reviews MongoDB 8.2’s major upgrades, highlighting up to 36% read throughput gains, 59% write speed improvements, 200% faster time‑series aggregation, 50‑fold faster shard rebalancing, enhanced queryable encryption, native vector search, multi‑cloud Atlas support, and AI‑driven capabilities such as hybrid search and the MongoDB AMP platform.

AIMongoDBdatabase
0 likes · 7 min read
What’s New in MongoDB 8.2? Performance Boosts, AI Features, and Multi‑Cloud Power
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI EngineeringLLMRAG
0 likes · 10 min read
How to Build a Multi‑Layer Cache for Dynamic RAG Systems
DevOps Coach
DevOps Coach
Nov 13, 2025 · Databases

Explore ClickHouse 25.10: 20 JOIN Boosts, Vector Search & New SQL

ClickHouse 25.10 introduces a suite of enhancements—including 20 JOIN performance upgrades, lazy column replication, Bloom filter runtime filters, disjunction push‑down, automatic column statistics, the QBit vector type, expanded SQL operators, negative LIMIT/OFFSET, Arrow Flight support, and delayed secondary index materialization—backed by detailed benchmarks and contributor acknowledgments.

ClickHouseJOIN optimizationSQL Extensions
0 likes · 23 min read
Explore ClickHouse 25.10: 20 JOIN Boosts, Vector Search & New SQL
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 6, 2025 · Artificial Intelligence

How GPU‑Accelerated NN‑Descent Boosts Vector Search Speed by Up to 13×

This article explains how unstructured multimedia data is transformed into vectors for similarity search, introduces GPU parallelism and the NN‑Descent algorithm to replace traditional HNSW indexing in OpenSearch, and presents benchmark results showing up to a thirteen‑fold speed improvement while maintaining comparable recall.

GPU AccelerationNN-DescentOpenSearch
0 likes · 12 min read
How GPU‑Accelerated NN‑Descent Boosts Vector Search Speed by Up to 13×
Data STUDIO
Data STUDIO
Nov 4, 2025 · Artificial Intelligence

How to Build a Memory-Enabled AI Agent with SQLite and Vector Search

This article explains how to give AI agents persistent memory, reflection, and goal‑tracking by storing interaction summaries in SQLite, embedding them for semantic retrieval with a vector database, and using LLM‑generated prompts to recall, reflect, and manage objectives across sessions.

AI AgentGoal TrackingLLM
0 likes · 10 min read
How to Build a Memory-Enabled AI Agent with SQLite and Vector Search
JD Tech
JD Tech
Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

LLMRAGchunking
0 likes · 14 min read
What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?
JavaGuide
JavaGuide
Oct 8, 2025 · Databases

Is MySQL Now the Runner‑Up to PostgreSQL in the AI Era?

While MySQL has long dominated relational databases with its open‑source stability and massive user base, the rise of AI and PostgreSQL’s extensible ecosystem—highlighted by extensions like pgvector, pg_bm25, TimescaleDB and PostGIS—are shifting developer preference, as shown by the 2025 Stack Overflow survey.

AIDatabase ExtensibilityPostgreSQL
0 likes · 6 min read
Is MySQL Now the Runner‑Up to PostgreSQL in the AI Era?
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking

The article provides a detailed technical analysis of RAGFlow's search engine, covering the Searcher class coordination, adaptive multi‑path retrieval (vector, keyword, and knowledge‑graph), intelligent fusion with weighted scoring, caching, performance monitoring, and both built‑in and model‑driven reranking to achieve high‑precision results.

Performance OptimizationRAGFlowReranking
0 likes · 32 min read
RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking
DataFunSummit
DataFunSummit
Sep 4, 2025 · Artificial Intelligence

Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation

This article explores the evolving search demands of the intelligent era, explains dense and sparse vector concepts, details Elasticsearch's vector search capabilities and recent performance breakthroughs, introduces hybrid and relevance‑tuning techniques, and demonstrates RAG principles and real‑world enterprise use cases.

AIElasticsearchHybrid Search
0 likes · 14 min read
Unlocking Elasticsearch Vector Search: From Basics to RAG Implementation
Xiaolei Talks DB
Xiaolei Talks DB
Aug 28, 2025 · Databases

How AI Is Transforming Databases: Highlights from China’s DTCC2025

At DTCC2025 in Beijing, industry leaders showcased AI-driven innovations, vector database advances, RAG techniques, and distributed database performance breakthroughs, illustrating how databases are evolving from passive data stores into intelligent, autonomous systems that boost efficiency, scalability, and business value across sectors.

AIDistributed SystemsRAG
0 likes · 10 min read
How AI Is Transforming Databases: Highlights from China’s DTCC2025
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Aug 5, 2025 · Artificial Intelligence

Enterprise Semantic Search: Key Q&A on Scoring, Recall, LSH, Chunking, and Embedding Dimensions

This article answers practical questions about enterprise semantic search, explaining how Reciprocal Rank Fusion normalizes mixed scoring, how to control vector result size, the trade‑offs of LSH parameters, word‑ and sentence‑based chunking strategies with version‑specific defaults, and flexible embedding dimensionality.

ElasticsearchLSHRRF
0 likes · 8 min read
Enterprise Semantic Search: Key Q&A on Scoring, Recall, LSH, Chunking, and Embedding Dimensions
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 5, 2025 · Databases

How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine

This article explains how PolarDB IMCI integrates vector indexing and embedding directly into the database kernel, offering a unified, transactional, and real‑time vector lifecycle management service that lets developers build RAG knowledge bases and AI applications using only standard SQL, dramatically reducing development and operational complexity.

AIPolardbRAG
0 likes · 11 min read
How PolarDB IMCI Unifies Vector Search and Embedding in One SQL Engine
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jul 30, 2025 · Backend Development

From Keyword Matching to Semantic Understanding: Building an Intelligent E‑Commerce Search Engine

The article analyzes the semantic gap in e‑commerce search, compares traditional keyword matching with vector‑based retrieval, and provides a step‑by‑step implementation using Elasticsearch/Easysearch pipelines, embedding models, and a hybrid search strategy to improve user intent understanding.

EasysearchElasticsearchHybrid Search
0 likes · 11 min read
From Keyword Matching to Semantic Understanding: Building an Intelligent E‑Commerce Search Engine
Sohu Tech Products
Sohu Tech Products
Jul 23, 2025 · Artificial Intelligence

Boosting Video Moderation with Multimodal CLIP and Efficient Vector Search

This article describes how a video review system combines multimodal CLIP models, image‑text feature alignment, and optimized vector‑search databases such as RedisSearch and Elasticsearch to detect prohibited content in real time and perform large‑scale historical recall, while addressing challenges of generalization, storage cost, and inference speed.

AICLIPmodel fine-tuning
0 likes · 18 min read
Boosting Video Moderation with Multimodal CLIP and Efficient Vector Search
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

ElasticsearchKnowledge BaseQ&A system
0 likes · 1 min read
Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)
DataFunSummit
DataFunSummit
Jul 15, 2025 · Artificial Intelligence

Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications

This article explains why traditional keyword search falls short, introduces Elasticsearch's vector search and hybrid retrieval capabilities, and shows how combining it with large language models enables Retrieval‑Augmented Generation (RAG) for more accurate, context‑aware AI-driven search across text and multimedia data.

AIElasticsearchRAG
0 likes · 5 min read
Unlocking Semantic Search: Elasticsearch Vector Search & RAG Applications
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 19, 2025 · Artificial Intelligence

Build Efficient Multimodal Text‑Image Search with Alibaba Cloud Milvus

This guide explains how to use Alibaba Cloud Milvus to create a scalable, high‑performance multimodal search system that supports text‑to‑image, image‑to‑image, and cross‑modal queries across various business scenarios, detailing architecture, deployment steps, validation, and resource cleanup.

AIMilvusMultimodal Retrieval
0 likes · 8 min read
Build Efficient Multimodal Text‑Image Search with Alibaba Cloud Milvus
Sohu Tech Products
Sohu Tech Products
Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AIDeepSeekKnowledge Graph
0 likes · 15 min read
How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
May 22, 2025 · Artificial Intelligence

Scalable Overload-Aware Graph-Based Index Construction for 10‑Billion‑Scale Vector Similarity Search (SOGAIC)

The paper introduces SOGAIC, a scalable overload‑aware graph‑based index construction system for billion‑scale vector similarity search that uses adaptive overlapping partitioning and load‑balanced distributed scheduling to cut construction time by 47.3% while maintaining high recall.

ANNDistributed Schedulinggraph index
0 likes · 13 min read
Scalable Overload-Aware Graph-Based Index Construction for 10‑Billion‑Scale Vector Similarity Search (SOGAIC)
Mingyi World Elasticsearch
Mingyi World Elasticsearch
May 16, 2025 · Artificial Intelligence

Easysearch Vector Search: From Theory to Hands‑On Implementation

This article explains the principles of vector search, compares Easysearch's approximate (LSH) and exact kNN APIs, and walks through a complete hands‑on example using Stanford's 50‑dimensional GloVe embeddings to index, import, and query semantically similar words.

Approximate SearchCosine SimilarityEasysearch
0 likes · 9 min read
Easysearch Vector Search: From Theory to Hands‑On Implementation
Architecture & Thinking
Architecture & Thinking
May 15, 2025 · Databases

Redis 8.0 Unveiled: New AGPLv3 License, Vector Search, JSON & More

Redis 8.0, released on May 1 2025, introduces a major license shift to AGPL‑v3, adds eight native data structures—including vector sets, JSON, and time‑series—enhances the query engine with up to 16× performance gains, improves scalability, security, and cloud‑native support, and provides extensive code examples for AI and real‑time analytics.

JSONTime Seriesdatabase
0 likes · 15 min read
Redis 8.0 Unveiled: New AGPLv3 License, Vector Search, JSON & More
Alibaba Cloud Native
Alibaba Cloud Native
May 9, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch

This tutorial walks through building a Retrieval‑Augmented Generation (RAG) system by combining LangChain for document processing, Elasticsearch’s vector store with the ELSER v2 model for semantic search, and Higress as a cloud‑native AI gateway, complete with deployment scripts, code examples, and query testing.

AIHigressLangChain
0 likes · 15 min read
Build a Retrieval‑Augmented Generation (RAG) App with LangChain, Higress, and Elasticsearch
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Apr 24, 2025 · Big Data

Boosting Product Recommendations with Serverless Spark and Milvus: A Real‑World Case Study

蝉妈妈 migrated its recommendation platform to Alibaba Cloud Serverless Spark and Milvus, replacing traditional vector search and Spark clusters, achieving 40% faster offline tasks, 80% lower failure rates, significant cost savings, and scalable, low‑latency similar‑product retrieval for personalized marketing.

Big DataMilvusrecommendation system
0 likes · 8 min read
Boosting Product Recommendations with Serverless Spark and Milvus: A Real‑World Case Study
dbaplus Community
dbaplus Community
Apr 7, 2025 · Databases

How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation

This article presents a hands‑on evaluation of several large language models—including Mistral‑Small, Deepseek‑r1, Llama 3.3 and ChatGPT‑4‑go—on Oracle database bad‑block errors, RAG‑based document retrieval, and log‑driven reasoning, revealing performance gaps, scoring results, and practical DBA implications.

AILLM evaluationOracle
0 likes · 11 min read
How Do LLMs Tackle Oracle Bad Block Errors? A Hands‑On Evaluation
Sohu Tech Products
Sohu Tech Products
Mar 19, 2025 · Databases

Redis Vector Search Technology for AI Applications: Implementation and Best Practices

The article explains how Redis vector search, powered by RedisSearch’s FLAT and HNSW algorithms and supporting various data types and precisions, enables fast AI-driven similarity queries for text, image, and audio, and provides implementation guidance, optimization tips, and a real‑world customer‑service use case.

AI applicationsDatabase OptimizationHNSW
0 likes · 17 min read
Redis Vector Search Technology for AI Applications: Implementation and Best Practices
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 11, 2025 · Backend Development

Master Elasticsearch dense_vector: definition, usage, and kNN search guide

This article explains Elasticsearch's dense_vector field for storing dense vectors, covering its definition, how to define and index vectors, kNN search methods (brute‑force and approximate with HNSW), similarity options, quantization strategies, bit‑vector support, key parameters, and how to update mappings.

Elasticsearchbit vectorsdense_vector
0 likes · 13 min read
Master Elasticsearch dense_vector: definition, usage, and kNN search guide
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 6, 2025 · Artificial Intelligence

Smart Q&A Knowledge Base Powered by Qwen2.5‑14B and Elasticsearch RAG

This article details a smart Q&A knowledge‑base system that integrates the Qwen2.5‑14B large language model with Elasticsearch vector search via RAG, covering data ingestion with FSCrawler, Chinese sentence embedding, Gradio UI, performance tests on a 483‑page book, architecture diagrams, code walkthroughs, and suggested enhancements.

Chinese EmbeddingElasticsearchFSCrawler
0 likes · 11 min read
Smart Q&A Knowledge Base Powered by Qwen2.5‑14B and Elasticsearch RAG
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 4, 2025 · Artificial Intelligence

Deploy a High‑Performance RAG Service with Hologres, DeepSeek, and PAI‑EAS

This guide walks you through building a Retrieval‑Augmented Generation (RAG) system by integrating Alibaba Cloud's Hologres vector store, the Proxima high‑performance vector engine, and DeepSeek large language models via PAI‑EAS, covering prerequisites, deployment steps, configuration, and inference verification.

AI deploymentDeepSeekHologres
0 likes · 12 min read
Deploy a High‑Performance RAG Service with Hologres, DeepSeek, and PAI‑EAS
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 25, 2025 · Artificial Intelligence

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

This step‑by‑step guide shows how to assemble a Retrieval‑Augmented Generation (RAG) system using Alibaba Cloud Milvus vector search, the DeepSeek large language model, and PAI LangStudio, covering instance creation, data upload, model deployment, connection setup, flow design, and service invocation.

AI TutorialDeepSeekLLM
0 likes · 9 min read
Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio
Tencent Technical Engineering
Tencent Technical Engineering
Feb 21, 2025 · Databases

Understanding Vector Storage and Optimization in Elasticsearch 8.16.1

The article explains how Elasticsearch 8.16.1 stores dense and sparse vectors using various file extensions, compares flat and HNSW index formats, shows how disabling doc‑values removes redundant column‑store copies, and demonstrates scalar and binary quantization—including a quantization‑only mode—that can cut storage to roughly 9 percent while preserving search accuracy.

ElasticsearchHNSWIndex Optimization
0 likes · 32 min read
Understanding Vector Storage and Optimization in Elasticsearch 8.16.1
Java Architecture Diary
Java Architecture Diary
Feb 13, 2025 · Artificial Intelligence

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

This guide walks through building a Java RAG system with DeepSeek R1, Milvus, and Spring, covering environment setup, vector model integration via OpenAI protocol, Maven dependencies, data embedding, and a chat endpoint that combines semantic retrieval with LLM generation.

AI integrationDeepSeekMilvus
0 likes · 11 min read
Create a Java RAG System Using DeepSeek R1, Milvus, and Spring