Tagged articles
2 articles
Page 1 of 1
AI Engineer Programming
AI Engineer Programming
May 14, 2026 · Artificial Intelligence

RAG Retrieval: Comparing Bi-encoder and Cross-encoder Architectures

The article reviews the three‑step RAG pipeline, explains why retrieval quality hinges on fast, accurate semantic matching, contrasts Bi-encoder’s offline vector indexing and speed with Cross-encoder’s token‑level interaction and higher precision, and discusses hybrid solutions such as ColBERT and LLM rerankers with practical engineering guidelines.

Bi-encoderColBERTCross-Encoder
0 likes · 10 min read
RAG Retrieval: Comparing Bi-encoder and Cross-encoder Architectures
Sohu Tech Products
Sohu Tech Products
Nov 6, 2024 · Artificial Intelligence

RAG2.0 Engine Design Challenges and Implementation

The talk outlines RAG2.0’s design challenges—low vector recall, complex documents, semantic gaps—and presents a two‑stage architecture using deep multimodal understanding and knowledge‑graph‑enhanced retrieval, detailing advanced chunking, multi‑index and multi‑path retrieval, efficient sorting models like ColBERT, and future multi‑modal and memory‑augmented agent directions.

ColBERTDelayed InteractionEnterprise AI
0 likes · 23 min read
RAG2.0 Engine Design Challenges and Implementation