Tagged articles

Rerank

13 articles · Page 1 of 1

Jun 25, 2026 · Artificial Intelligence

Why Rerank Is Essential: From 100 Retrieved Docs to the 5 Correct Answers in RAG

Even with a perfectly populated vector database, a RAG pipeline often returns irrelevant answers because the initial Bi‑encoder retrieval only narrows the pool to about 100 candidates, and without a Cross‑encoder rerank step the truly correct document—often buried around rank 37—never reaches the LLM for answering.

Bi-EncoderCross-EncoderEmbedding

0 likes · 9 min read

Why Rerank Is Essential: From 100 Retrieved Docs to the 5 Correct Answers in RAG

Su San Talks Tech

Jun 15, 2026 · Artificial Intelligence

How I Doubled RAG Accuracy with Targeted Optimizations

This article walks through a comprehensive, step‑by‑step analysis of why RAG pipelines often underperform and presents concrete optimizations—including OCR preprocessing, table extraction, metadata enrichment, recursive chunking, embedding fine‑tuning, hybrid vector‑keyword retrieval, reranking, prompt templates, and a production‑grade Java implementation—backed by code snippets, benchmark figures, and evaluation metrics.

ChunkingEmbeddingHybrid Retrieval

0 likes · 36 min read

How I Doubled RAG Accuracy with Targeted Optimizations

Su San Talks Tech

May 15, 2026 · Artificial Intelligence

Understanding Rerank in Retrieval‑Augmented Generation (RAG)

The article explains why a reranking step is essential in RAG pipelines, describes how it refines the initial vector‑search results, compares mainstream rerank techniques, discusses practical engineering choices such as candidate set size and model selection, and outlines how to evaluate and tune rerank performance.

Cross-EncoderEvaluation MetricsLLM

0 likes · 15 min read

Understanding Rerank in Retrieval‑Augmented Generation (RAG)

James' Growth Diary

Apr 22, 2026 · Artificial Intelligence

Boost RAG Performance: Chunking Strategies, Rerank, and Hybrid Retrieval Explained

This article breaks down why RAG pipelines often underperform and shows how proper chunking, overlap settings, hybrid vector‑plus‑BM25 retrieval, and a Rerank step can dramatically improve recall and precision, with concrete code examples and tuning tips.

BM25ChunkingHybrid Retrieval

0 likes · 14 min read

Boost RAG Performance: Chunking Strategies, Rerank, and Hybrid Retrieval Explained

James' Growth Diary

Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

ChunkingHybrid SearchLangChain

0 likes · 18 min read

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

AgentGuide

Apr 6, 2026 · Artificial Intelligence

How to Optimize RAG System Performance: From Evaluation Metrics to Tuning Strategies

The article explains how to improve Retrieval‑Augmented Generation (RAG) systems by interpreting three key metrics—context recall, context precision, and answer correctness—and provides concrete step‑by‑step actions such as checking the knowledge base, upgrading embedding models, rewriting queries, adding a rerank model, and refining prompts and generation parameters.

Evaluation MetricsRAGRerank

0 likes · 7 min read

How to Optimize RAG System Performance: From Evaluation Metrics to Tuning Strategies

Wu Shixiong's Large Model Academy

Apr 6, 2026 · Artificial Intelligence

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

This article explains the limitations of Bi‑Encoder retrieval, introduces Cross‑Encoder rerankers, shows how a cascade of recall‑rerank‑generation improves answer quality, and provides concrete code, threshold‑filtering strategies, and domain‑specific fine‑tuning techniques for industrial RAG systems.

AI RetrievalBi-EncoderCross-Encoder

0 likes · 20 min read

Why Rerank Beats Simple Retrieval in RAG: Practical Tips & Code

Xuanwu Backend Tech Stack

Oct 22, 2025 · Artificial Intelligence

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

This article explains the limitations of basic Retrieval‑Augmented Generation (RAG), introduces Rerank technology as a two‑step refinement process, compares dual‑encoder and cross‑encoder methods, and reviews popular Rerank models to help developers build more precise AI‑driven retrieval systems.

Information RetrievalRAGRerank

0 likes · 10 min read

How Rerank Transforms Retrieval‑Augmented Generation for Accurate AI Answers

Zhihu Tech Column

Jan 17, 2025 · Artificial Intelligence

Zhihu Direct Answer: Product Overview and Technical Practices

This article summarizes the key technical insights from Zhihu Direct Answer, an AI-powered search product, covering its product overview, RAG framework, query understanding, retrieval strategies, chunking, reranking, generation techniques, evaluation methods, and engineering optimizations for cost and performance.

AI SearchChunkingEngineering Optimization

0 likes · 13 min read

Zhihu Direct Answer: Product Overview and Technical Practices

Alibaba Cloud Big Data AI Platform

Dec 17, 2024 · Artificial Intelligence

Build Chinese Vector Search with Alibaba Cloud AI and Elasticsearch Inference APIs

This guide walks you through creating sparse and dense vector inference endpoints on Elasticsearch using Alibaba Cloud AI services, demonstrates how to generate embeddings, perform completion, rerank results, and integrate RAG workflows for accurate Chinese‑language search.

AI SearchCompletionDense Embedding

0 likes · 14 min read

Build Chinese Vector Search with Alibaba Cloud AI and Elasticsearch Inference APIs

JD Tech Talk

Nov 26, 2024 · Artificial Intelligence

Design and Implementation of an Automated Logistics QA Bot Using Retrieval, Rerank, and Data Augmentation Techniques

This article describes a low‑cost, privacy‑preserving chatbot for logistics that combines data cleaning, large‑model‑based data augmentation, BM25 and vector retrieval, a DNN rerank model, and LLM‑driven answer rewriting to deliver accurate, compliant automated responses.

AIBM25Data Augmentation

0 likes · 11 min read

Design and Implementation of an Automated Logistics QA Bot Using Retrieval, Rerank, and Data Augmentation Techniques

AI Large Model Application Practice

Jun 17, 2024 · Artificial Intelligence

Boost Your RAG Pipeline with Cohere and BGE Rerank Models

This guide explains why post‑retrieval reranking is essential for Retrieval‑Augmented Generation, compares the commercial Cohere Rerank service with the open‑source bge‑reranker‑large model, and provides step‑by‑step code for integrating both into LlamaIndex pipelines, including a custom TEI‑based processor.

BGECohereLlamaIndex

0 likes · 11 min read

Boost Your RAG Pipeline with Cohere and BGE Rerank Models

DataFunSummit

Apr 28, 2022 · Artificial Intelligence

ReRank: The Backstage of Recommendation Systems and Its Evolution Toward Ecosystem Reshaping

This article explores the role of ReRank in recommendation and advertising pipelines, detailing its algorithmic position, the challenges of diversity versus relevance, evaluation metrics such as DCG/NDCG, the evolution from heuristic methods to deep learning models, and practical insights from industry cases like Airbnb and Alibaba.

AdvertisingDiversityRerank

0 likes · 57 min read

ReRank: The Backstage of Recommendation Systems and Its Evolution Toward Ecosystem Reshaping