Tagged articles
11 articles
Page 1 of 1
Su San Talks Tech
Su San Talks Tech
May 15, 2026 · Artificial Intelligence

Understanding Rerank in Retrieval‑Augmented Generation (RAG)

The article explains why a reranking step is essential in RAG pipelines, describes how it refines the initial vector‑search results, compares mainstream rerank techniques, discusses practical engineering choices such as candidate set size and model selection, and outlines how to evaluate and tune rerank performance.

Cross-EncoderEvaluation MetricsLLM
0 likes · 15 min read
Understanding Rerank in Retrieval‑Augmented Generation (RAG)
James' Growth Diary
James' Growth Diary
Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

Hybrid SearchLangChainMilvus
0 likes · 18 min read
Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices
AgentGuide
AgentGuide
Apr 6, 2026 · Artificial Intelligence

How to Optimize RAG System Performance: From Evaluation Metrics to Tuning Strategies

The article explains how to improve Retrieval‑Augmented Generation (RAG) systems by interpreting three key metrics—context recall, context precision, and answer correctness—and provides concrete step‑by‑step actions such as checking the knowledge base, upgrading embedding models, rewriting queries, adding a rerank model, and refining prompts and generation parameters.

Evaluation MetricsRAGRerank
0 likes · 7 min read
How to Optimize RAG System Performance: From Evaluation Metrics to Tuning Strategies
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 6, 2026 · Artificial Intelligence

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

This article explains the limitations of Bi‑Encoder retrieval, introduces Cross‑Encoder rerankers, shows how a cascade of recall‑rerank‑generation improves answer quality, and provides concrete code, threshold‑filtering strategies, and domain‑specific fine‑tuning techniques for industrial RAG systems.

AI RetrievalBi-encoderCross-Encoder
0 likes · 20 min read
Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code
Xuanwu Backend Tech Stack
Xuanwu Backend Tech Stack
Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

RAGRerankRetrieval Augmented Generation
0 likes · 10 min read
How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers
Zhihu Tech Column
Zhihu Tech Column
Jan 17, 2025 · Artificial Intelligence

Zhihu Direct Answer: Product Overview and Technical Practices

This article summarizes the key technical insights from Zhihu Direct Answer, an AI-powered search product, covering its product overview, RAG framework, query understanding, retrieval strategies, chunking, reranking, generation techniques, evaluation methods, and engineering optimizations for cost and performance.

AI searchEngineering OptimizationGeneration
0 likes · 13 min read
Zhihu Direct Answer: Product Overview and Technical Practices
AI Large Model Application Practice
AI Large Model Application Practice
Jun 17, 2024 · Artificial Intelligence

Boost Your RAG Pipeline with Cohere and BGE Rerank Models

This guide explains why post‑retrieval reranking is essential for Retrieval‑Augmented Generation, compares the commercial Cohere Rerank service with the open‑source bge‑reranker‑large model, and provides step‑by‑step code for integrating both into LlamaIndex pipelines, including a custom TEI‑based processor.

BGECohereLlamaIndex
0 likes · 11 min read
Boost Your RAG Pipeline with Cohere and BGE Rerank Models
DataFunSummit
DataFunSummit
Apr 28, 2022 · Artificial Intelligence

ReRank: The Backstage of Recommendation Systems and Its Evolution Toward Ecosystem Reshaping

This article explores the role of ReRank in recommendation and advertising pipelines, detailing its algorithmic position, the challenges of diversity versus relevance, evaluation metrics such as DCG/NDCG, the evolution from heuristic methods to deep learning models, and practical insights from industry cases like Airbnb and Alibaba.

AdvertisingDiversityRerank
0 likes · 57 min read
ReRank: The Backstage of Recommendation Systems and Its Evolution Toward Ecosystem Reshaping