Tagged articles

AI Retrieval

13 articles · Page 1 of 1

May 17, 2026 · Artificial Intelligence

Is Multimodal RAG the Cure for Enterprise Knowledge‑Base Bottlenecks? The ‘Where to Retrieve’ Challenge

The article analyzes how multimodal Retrieval‑Augmented Generation expands retrieval objects beyond text chunks, why the "where to retrieve" problem is as critical as "what to retrieve" in enterprise knowledge bases, and how Google Gemini's File Search and recent industry research illustrate the shift toward verifiable, multimodal evidence.

AI RetrievalDocument AIEnterprise Knowledge Base

0 likes · 7 min read

Is Multimodal RAG the Cure for Enterprise Knowledge‑Base Bottlenecks? The ‘Where to Retrieve’ Challenge

Old Zhang's AI Learning

May 9, 2026 · Artificial Intelligence

Why Gemini’s Multimodal RAG with File Search Is So Compelling

The article analyzes Google Gemini’s File Search tool as a fully managed multimodal RAG solution, detailing its architecture, key features, pricing model, step‑by‑step usage, strengths, limitations, and how it compares with OpenAI Assistants File Search and Vertex AI Search.

AI RetrievalEmbeddingFile Search

0 likes · 14 min read

Why Gemini’s Multimodal RAG with File Search Is So Compelling

AI Architect Hub

Apr 25, 2026 · Artificial Intelligence

How to Feed Massive Documents to an RAG System: Mastering the Art of Text Chunking

This article explains why proper text chunking is critical for Retrieval‑Augmented Generation, illustrates common pitfalls with real‑world examples, compares four chunking strategies (fixed length, recursive, structure‑aware, and code‑aware), and provides practical guidelines for chunk size, overlap, metadata handling, and a production‑ready pipeline.

AI RetrievalLangChainMetadata

0 likes · 21 min read

How to Feed Massive Documents to an RAG System: Mastering the Art of Text Chunking

Wu Shixiong's Large Model Academy

Apr 6, 2026 · Artificial Intelligence

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

This article explains the limitations of Bi‑Encoder retrieval, introduces Cross‑Encoder rerankers, shows how a cascade of recall‑rerank‑generation improves answer quality, and provides concrete code, threshold‑filtering strategies, and domain‑specific fine‑tuning techniques for industrial RAG systems.

AI RetrievalBi-EncoderCross‑Encoder

0 likes · 20 min read

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

Alibaba Cloud Infrastructure

Mar 27, 2026 · Artificial Intelligence

Build a Scalable Multimodal Image Search with Alibaba Cloud OSS Vector Buckets

This guide walks through setting up Alibaba Cloud OSS Vector Buckets, installing the necessary SDKs, uploading image datasets, creating vector indexes, generating embeddings with the Bailei multimodal model, writing vectors, performing semantic searches, and visualizing results via a Gradio web UI.

AI RetrievalCloud StorageGradio UI

0 likes · 27 min read

Build a Scalable Multimodal Image Search with Alibaba Cloud OSS Vector Buckets

Alibaba Cloud Big Data AI Platform

Feb 25, 2026 · Artificial Intelligence

How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service

The Taobao‑Tmall customer operations team built an integrated vector‑plus‑full‑text retrieval solution on Hologres, achieving millisecond‑level recall for massive unstructured knowledge bases, boosting intelligent客服, rule comparison, and sentiment analysis across multiple business scenarios.

AI RetrievalFull-Text SearchHologres

0 likes · 12 min read

How Hologres Powers Fast Vector & Full‑Text Search for AI‑Driven Customer Service

360 Tech Engineering

Dec 25, 2025 · Artificial Intelligence

Choosing Between Vector Knowledge Bases and Knowledge Graphs for RAG

This article explains the definitions, differences, and integration trends of Knowledge Bases and Knowledge Graphs within Retrieval‑Augmented Generation, helping developers decide which technology best fits their AI system requirements.

AI RetrievalGraphRAGHybrid Search

0 likes · 9 min read

Choosing Between Vector Knowledge Bases and Knowledge Graphs for RAG

DataFunSummit

Jul 23, 2025 · Artificial Intelligence

Multimodal RAG: Techniques, Challenges, and Scaling the Future of AI

This article presents a comprehensive overview of multimodal Retrieval‑Augmented Generation (RAG), detailing three implementation paths—semantic extraction, Transformer‑based, and Visual Language Model approaches—along with scaling strategies using tensor indexing, performance comparisons, and guidance on selecting the most suitable technical route.

AI RetrievalDocument ProcessingMultimodal RAG

0 likes · 12 min read

Multimodal RAG: Techniques, Challenges, and Scaling the Future of AI

High Availability Architecture

Jun 18, 2025 · Backend Development

How WeChat Reading Scaled Its Backend Architecture Over a Decade

Marking ten years of WeChat Reading, this article details the backend's evolution from a monolithic service to a multi‑layered, micro‑service architecture with robust storage, RPC frameworks, book data platforms, account system redesign, and AI‑driven content retrieval, highlighting the technical challenges and solutions behind its scalability.

AI Retrievalbackend architecturedata platform

0 likes · 18 min read

How WeChat Reading Scaled Its Backend Architecture Over a Decade

Ma Wei Says

Feb 23, 2025 · Artificial Intelligence

How Microsoft’s PIKE‑RAG Builds Knowledge‑Driven AI Across Four Stages

The article explains Microsoft’s open‑source PIKE‑RAG system, detailing its four progressive stages—from knowledge‑base construction to creative multi‑agent reasoning—while describing the underlying modules, chunking strategies, multi‑granularity retrieval, and code snippets that enable specialized domain understanding and inference.

AI RetrievalKnowledge GraphLLM

0 likes · 11 min read

How Microsoft’s PIKE‑RAG Builds Knowledge‑Driven AI Across Four Stages

NewBeeNLP

Jan 2, 2025 · Artificial Intelligence

Unlocking Multimodal RAG: From Semantic Extraction to Scalable VLM Solutions

This article examines the implementation paths and future prospects of multimodal Retrieval‑Augmented Generation, covering semantic extraction, transformer‑based OCR, visual language models, scaling challenges, tensor indexing, and practical evaluations with tools like Infinity and ColPali.

AI RetrievalInfinity DatabaseMultimodal RAG

0 likes · 12 min read

Unlocking Multimodal RAG: From Semantic Extraction to Scalable VLM Solutions

Alibaba Cloud Native

Oct 22, 2024 · Operations

How to Build and Manage a High‑Quality Enterprise Knowledge Base for AI‑Powered Q&A

This guide explains why a well‑structured, permission‑controlled enterprise knowledge base is essential for AI‑driven Q&A, outlines document format and naming standards, describes multi‑document organization principles, and provides step‑by‑step instructions for creating and managing knowledge bases in the Cloud Native environment.

AI RetrievalDocument StandardsEnterprise Knowledge Base

0 likes · 20 min read

How to Build and Manage a High‑Quality Enterprise Knowledge Base for AI‑Powered Q&A

Volcano Engine Developer Services

Apr 18, 2024 · Databases

How VikingDB Powers AI Retrieval and Scalable Vector Search

VikingDB is a cloud‑native vector database that originated at ByteDance, offering high‑performance ANN search, hybrid dense‑sparse retrieval for Retrieval‑Augmented Generation, extensive scaling and filtering capabilities, and ready‑to‑use SDKs for real‑world AI applications.

AI RetrievalANN indexingRAG

0 likes · 17 min read