Tagged articles

Chunking

43 articles · Page 1 of 1

Jun 18, 2026 · Artificial Intelligence

RAG Data Governance: Pre‑Ingestion Data Quality Challenges (Part 1)

The article analyzes how RAG systems inherit classic data‑quality problems, explains why clean input is essential for retrieval and generation, outlines historical GIGO lessons, highlights new risks introduced by vectorization and LLMs, and reviews practical chunking and governance strategies to mitigate hidden failures.

Chunking · Data Governance · Data Quality

0 likes · 18 min read

ZhiKe AI

Jun 18, 2026 · User Experience Design

The 7±2 Memory Limit: How Chunking Lets You Overcome Brain Capacity

Everyday forgetfulness—missing phone digits, skipping items on a shopping list—stems from the brain's limited working‑memory capacity of about 7±2 chunks, a constraint that can be mitigated by grouping information into meaningful chunks, a principle widely used in UX design.

Chunking · Miller 7±2 · UX design

0 likes · 5 min read

Su San Talks Tech

Jun 15, 2026 · Artificial Intelligence

How I Doubled RAG Accuracy with Targeted Optimizations

This article walks through a comprehensive, step‑by‑step analysis of why RAG pipelines often underperform and presents concrete optimizations—including OCR preprocessing, table extraction, metadata enrichment, recursive chunking, embedding fine‑tuning, hybrid vector‑keyword retrieval, reranking, prompt templates, and a production‑grade Java implementation—backed by code snippets, benchmark figures, and evaluation metrics.

Chunking · Embedding · Hybrid Retrieval

0 likes · 36 min read

Java Architect Handbook

Jun 13, 2026 · Artificial Intelligence

Why Fixed-Size Chunking Fails in RAG: Interview Insights

The article explains that fixed-size chunking in Retrieval‑Augmented Generation ignores semantic boundaries, causing broken sentences, scattered topics, redundant or missing information, and noisy retrieval, and it evaluates overlap as a partial fix while presenting better alternatives such as recursive, semantic, structural, and agentic chunking along with practical production tips and future trends.

AI interview · Chunking · LangChain

0 likes · 12 min read

DeepHub IMBA

May 31, 2026 · Artificial Intelligence

Chunking Strategies for Video RAG: Pause‑Based, Sliding‑Window, and LLM‑Driven Methods

The article examines how to chunk transcribed video text for Retrieval‑Augmented Generation, comparing pause‑based, overlapping‑window, length‑based fallback, and LLM‑driven topic chunking methods, and shows how combining fine‑grained and thematic chunks yields a multi‑layered pipeline that improves context coverage for both precise and broad queries.

Chunking · LLM · RAG

0 likes · 8 min read

AI Engineer Programming

May 27, 2026 · Artificial Intelligence

MMR for RAG: Low-Cost Chunk Limits Balance Relevance and Diversity

When a long document is split into many highly similar chunks, vector‑based top‑k retrieval tends to return multiple pieces from the same source, causing document dominance; applying a per‑document chunk limit together with Maximal Marginal Relevance (MMR) re‑ranking introduces diversity while preserving relevance, offering a low‑cost way to improve RAG answer quality.

Chunking · DPP · Diversity

0 likes · 17 min read

Su San Talks Tech

May 25, 2026 · Artificial Intelligence

Mastering RAG: Chunking, Embeddings, BM25 & Multi‑Index Retrieval in Python

This tutorial explains Retrieval‑Augmented Generation (RAG) from fundamentals to a full pipeline, covering text chunking strategies, VoyageAI embeddings, vector‑store implementation, BM25 lexical search, and a multi‑index retriever that fuses semantic and lexical results with Reciprocal Rank Fusion.

BM25 · Chunking · Python

0 likes · 48 min read

Architect's Ambition

May 18, 2026 · Artificial Intelligence

Building Enterprise Private Knowledge Bases: End-to-End Crawl, Clean, and RAG Pipeline

The article outlines a complete six‑stage workflow for constructing enterprise‑grade private knowledge bases—starting with targeted web‑crawling and API ingestion, through data cleaning, chunking, embedding generation, vector storage, and finally multi‑stage RAG retrieval optimization—highlighting why early stages set the performance ceiling and offering practical tips from real‑world projects.

AI Agent · Chunking · Embedding

0 likes · 10 min read

Lao Guo's Learning Space

May 6, 2026 · Artificial Intelligence

Why Your RAG Keeps Missing the Mark: Enterprise‑Level Pitfall Guide

This article examines why Retrieval‑Augmented Generation systems that work in demos often fail in production, detailing common pitfalls—from chunking and vector‑database selection to hybrid retrieval and re‑ranking—and offers concrete strategies, configuration tips, and a decision tree to build reliable enterprise‑grade RAG solutions.

Chunking · Enterprise AI · Hybrid Retrieval

0 likes · 12 min read

MaGe Linux Operations

Apr 28, 2026 · Artificial Intelligence

Why Your RAG Performance Is Poor: Common Issues and Optimization Strategies

This article systematically analyzes why Retrieval‑Augmented Generation pipelines often underperform—covering embedding model selection, chunking strategies, hybrid retrieval, reranking, context window waste, evaluation metrics, and a detailed troubleshooting checklist—while providing concrete code examples and best‑practice recommendations for engineers.

Chunking · Embedding · Evaluation

0 likes · 19 min read

James' Growth Diary

Apr 22, 2026 · Artificial Intelligence

Boost RAG Performance: Chunking Strategies, Rerank, and Hybrid Retrieval Explained

This article breaks down why RAG pipelines often underperform and shows how proper chunking, overlap settings, hybrid vector‑plus‑BM25 retrieval, and a Rerank step can dramatically improve recall and precision, with concrete code examples and tuning tips.

BM25 · Chunking · Hybrid Retrieval

0 likes · 14 min read

James' Growth Diary

Apr 21, 2026 · Artificial Intelligence

Boosting RAG Performance with Milvus: Chunking, Hybrid Search, and Rerank Best Practices

This article analyzes why Retrieval‑Augmented Generation often underperforms, then walks through concrete engineering steps—optimal chunking, overlap settings, hybrid vector + BM25 retrieval, RRF fusion, and reranking—while providing code snippets, parameter tables, and a full pipeline diagram to turn a usable RAG system into a high‑quality one.

Chunking · Hybrid Search · LangChain

0 likes · 18 min read

Data Party THU

Apr 17, 2026 · Artificial Intelligence

Mastering Text Chunking: 21 Strategies to Supercharge Your RAG Pipelines

This comprehensive guide presents 21 practical text‑chunking techniques—from simple line‑based splits to advanced embedding‑ and LLM‑driven methods—explaining their implementations, code examples, and ideal use‑cases to help you build efficient Retrieval‑Augmented Generation systems while avoiding common pitfalls.

AI · Chunking · LLM

0 likes · 57 min read

James' Growth Diary

Apr 17, 2026 · Artificial Intelligence

How to Load and Split Documents for RAG: First Step to Building a Knowledge Base

This tutorial explains why document loading and splitting are critical for RAG pipelines, introduces LangChain's Document format, demonstrates loaders for various file types, details the RecursiveCharacterTextSplitter and alternative splitters, and provides practical tips on parameter tuning, metadata preservation, Chinese text handling, and common pitfalls.

AI · Chunking · Document Loader

0 likes · 27 min read

Wu Shixiong's Large Model Academy

Apr 2, 2026 · Artificial Intelligence

How Smart Chunk Splitting Boosts RAG Recall from 67% to 91%

This article examines the critical role of chunk splitting in Retrieval‑Augmented Generation systems, comparing three generations of methods—from fixed‑size token cuts to sentence‑aware and semantic‑aware strategies—showing how refined chunking, overlap tuning, and metadata design raise Recall@5 from 0.67 to 0.91 while addressing table, list, and long‑section challenges.

Chunking · Information Retrieval · LLM

0 likes · 24 min read

Wu Shixiong's Large Model Academy

Mar 17, 2026 · Artificial Intelligence

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Chunk splitting, a critical yet often overlooked step in RAG pipelines, dramatically impacts retrieval recall and LLM output quality; this guide walks through three evolution stages—from naive fixed‑length splits to sentence‑aware overlaps and finally semantic, structure‑driven segmentation—complete with code, experiments, and practical pitfalls.

Chunking · LLM · RAG

0 likes · 15 min read

Programmer's Advance

Jan 15, 2026 · Artificial Intelligence

How Spec‑First, Chunking, and Multi‑Model Strategies Make AI Coding 5× More Effective

The article dissects Addy Osmani’s 2026 AI Coding Workflow, showing how a spec‑first mindset, task chunking, precise context packing, multi‑model collaboration, and human‑in‑the‑loop practices together boost developer efficiency by 30‑50% while reducing bugs and costs.

AI programming · Chunking · Context Packing

0 likes · 24 min read

360 Tech Engineering

Dec 26, 2025 · Artificial Intelligence

15 Chunking Strategies to Supercharge Retrieval‑Augmented Generation

This article presents fifteen practical chunking techniques—ranging from line‑by‑line and fixed‑size chunking to semantic and hierarchical methods—explaining their principles, ideal use‑cases, concrete input examples, chunk outputs, and key advantages or cautions for improving Retrieval‑Augmented Generation with large language models.

AI · Chunking · Data Retrieval

0 likes · 28 min read

JD Tech Talk

Nov 21, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Retrieval‑Augmented Generation

This article explains why effective chunking is crucial for RAG performance, compares seven major chunking strategies—including fixed‑size, semantic, recursive, document‑structure, agent‑driven, sentence, and paragraph methods—and offers practical guidance on selecting and optimizing chunks for real‑world AI applications.

AI · Chunking · RAG

0 likes · 10 min read

JD Cloud Developers

Nov 21, 2025 · Artificial Intelligence

Why Chunking Strategy Makes or Breaks RAG Performance

This article explains how different chunking methods—fixed size, semantic, recursive, document‑based, agent‑driven, sentence‑level, and paragraph‑level—affect Retrieval‑Augmented Generation, offering practical guidelines, metrics, and optimization tips for real‑world deployments.

AI · Chunking · Information Retrieval

0 likes · 9 min read

Data Party THU

Nov 9, 2025 · Artificial Intelligence

Mastering Chunking Strategies for Effective RAG: Fixed, Recursive, Semantic, Structured, and Delayed

This article walks through the core RAG pipeline, explains why chunking is the linchpin of retrieval quality, and provides detailed definitions, trade‑offs, and implementation examples for five chunking techniques—fixed, recursive, semantic, structure‑aware, and delayed—so you can choose the right approach for any document‑heavy AI application.

AI · Chunking · LLM

0 likes · 10 min read

Wu Shixiong's Large Model Academy

Nov 6, 2025 · Artificial Intelligence

How to Optimize RAG Knowledge Base Construction: Parsing, Chunking, and Retrieval

This article explains why building a high‑quality RAG knowledge base is critical, outlines offline parsing techniques for multi‑format documents, presents semantic chunking strategies that preserve structure and context, and shows how to answer interview questions with a robust, production‑ready pipeline.

AI interview · Chunking · Knowledge Base

0 likes · 8 min read

Wu Shixiong's Large Model Academy

Nov 4, 2025 · Artificial Intelligence

Why Financial RAG Fails and How to Solve Its Core Challenges

This article explains why Retrieval‑Augmented Generation (RAG) projects in the financial sector often underperform, highlighting data‑structure complexities, document‑parsing hurdles, chunking strategies, compliance constraints, evaluation metrics, and engineering requirements, and offers practical solutions and code examples.

Chunking · Evaluation · RAG

0 likes · 10 min read

DeWu Technology

Oct 29, 2025 · Artificial Intelligence

Why Chunking Can Make or Break Your RAG System – Practical Strategies & Code

This article explains how proper document chunking—choosing the right chunk size, overlap, and structure‑aware boundaries—directly impacts the relevance, factuality, and efficiency of Retrieval‑Augmented Generation pipelines, and provides multiple Python implementations ranging from simple fixed‑length splits to semantic and hybrid approaches.

Chunking · Embedding · LLM

0 likes · 29 min read

JD Tech

Oct 9, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Boost AI Accuracy?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines external knowledge retrieval with large language models, covering its motivations, data preparation, chunking strategies, vectorization, storage, query processing, retrieval, reranking, prompt engineering, and LLM generation, plus practical optimization tips.

Chunking · LLM · Metadata

0 likes · 14 min read

JD Tech Talk

Sep 28, 2025 · Artificial Intelligence

What Is Retrieval‑Augmented Generation (RAG) and How Does It Power Modern AI?

This article explains Retrieval‑Augmented Generation (RAG), an AI framework that combines traditional information retrieval with large language models, detailing its core workflow—from knowledge preparation, chunking, and embedding to vector database storage and the question‑answering stage—while highlighting key challenges, tools, and optimization strategies.

AI · Chunking · Embedding

0 likes · 15 min read

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

RAGFlow Deep Dive: Data Parsing and Knowledge Graph Construction

This article examines RAGFlow's end‑to‑end pipeline for turning diverse documents into structured knowledge, detailing the TaskExecutor factory, the DeepDoc layout‑aware parser, chunking strategies, embedding and storage mechanisms, and the GraphRAG‑based knowledge‑graph extraction that together enable high‑precision retrieval and reasoning.

Chunking · Data Parsing · DeepDoc

0 likes · 15 min read

Rare Earth Juejin Tech Community

Sep 3, 2025 · Frontend Development

Fast, Resumable Large File Uploads with Vue & Express

This article walks through a complete Vue‑and‑Express solution for uploading massive files, detailing chunked splitting, hash‑based instant upload detection, resumable transfers, concurrency control, manual abort handling, and server‑side merging using streams, providing ready‑to‑use code snippets and performance optimizations.

Chunking · Concurrency Control · Express

0 likes · 18 min read

Alibaba Cloud Developer

Sep 1, 2025 · Artificial Intelligence

Mastering RAG: From Chunking to Hybrid Search for Better AI Retrieval

This article delves into the implementation details and optimization strategies of Retrieval‑Augmented Generation (RAG), covering document chunking, index enhancement, embedding, hybrid search, and re‑ranking, and provides practical code examples to help developers move from quick deployment to deep performance tuning.

AI · Chunking · Embedding

0 likes · 19 min read

DaTaobao Tech

Aug 25, 2025 · Artificial Intelligence

Mastering RAG: From Quick Start to Deep Optimization Strategies

This article dives into the practical implementation of Retrieval‑Augmented Generation (RAG), covering document chunking, semantic and reverse HyDE indexing, embedding, hybrid search, and re‑ranking techniques, and provides concrete code examples and optimization tips for building high‑performance AI applications.

Chunking · Embedding · Hybrid Search

0 likes · 18 min read

Mingyi World Elasticsearch

Aug 5, 2025 · Artificial Intelligence

Enterprise Semantic Search: Key Q&A on Scoring, Recall, LSH, Chunking, and Embedding Dimensions

This article answers practical questions about enterprise semantic search, explaining how Reciprocal Rank Fusion normalizes mixed scoring, how to control vector result size, the trade‑offs of LSH parameters, word‑ and sentence‑based chunking strategies with version‑specific defaults, and flexible embedding dimensionality.

Chunking · Elasticsearch · LSH

0 likes · 8 min read

360 Zhihui Cloud Developer

Jul 16, 2025 · Backend Development

How to Implement Efficient Large File Uploads with Chunking in Vue and Node.js

This guide explains how to overcome large‑file upload limits by splitting files into chunks with Blob.slice, uploading them concurrently from a Vue front‑end, and merging the pieces on a Node.js back‑end using streams, while providing progress tracking and handling Nginx size restrictions.

Chunking · Large File Upload · Node.js

0 likes · 15 min read

Satori Komeiji's Programming Classroom

Jun 3, 2025 · Artificial Intelligence

Everything You Need to Know About Retrieval‑Augmented Generation (RAG)

The article explains Retrieval‑Augmented Generation (RAG) by describing how a programmer, frustrated with oversized prompts for a large language model, discovers that retrieving relevant document fragments, embedding them, and feeding the augmented context to the model yields accurate, fact‑based answers.

AI · Chunking · Embedding

0 likes · 6 min read

Fun with Large Models

Apr 25, 2025 · Artificial Intelligence

Why Your RAG System Underperforms and How to Boost Its Effectiveness by 20%

This article analyzes common shortcomings of RAG pipelines—data preparation, retrieval, and LLM generation—and provides concrete optimization techniques such as advanced chunking, embedding model selection, retrieval parameter tuning, rerank models, and prompt engineering, promising up to a 20% performance gain.

Chunking · Embedding · Prompt engineering

0 likes · 17 min read

Rare Earth Juejin Tech Community

Feb 17, 2025 · Backend Development

Large File Upload with Chunking, Instant Upload, and Resume Using React, Vue and NestJS

This article explains how to implement a large‑file upload system that splits files into chunks, computes MD5 hashes for instant‑upload detection, supports breakpoint resume, and merges the chunks on the server using React or Vue on the frontend and NestJS with TypeScript on the backend.

Chunking · MD5 · NestJS

0 likes · 6 min read

DataFunSummit

Jan 22, 2025 · Artificial Intelligence

RAG2.0 Engine Design Challenges and Implementation

This article presents a comprehensive overview of the RAG2.0 engine design, covering RAG1.0 limitations, effective chunking methods, accurate retrieval techniques, advanced multimodal processing, hybrid search strategies, database indexing choices, and future directions such as agentic RAG and memory‑enhanced models.

Chunking · Hybrid Search · Multimodal

0 likes · 23 min read

Zhihu Tech Column

Jan 17, 2025 · Artificial Intelligence

Zhihu Direct Answer: Product Overview and Technical Practices

This article summarizes the key technical insights from Zhihu Direct Answer, an AI-powered search product, covering its product overview, RAG framework, query understanding, retrieval strategies, chunking, reranking, generation techniques, evaluation methods, and engineering optimizations for cost and performance.

AI Search · Chunking · Engineering Optimization

0 likes · 13 min read

Sohu Tech Products

Nov 27, 2024 · Artificial Intelligence

RAG Technology and Practical Application in Multi-Modal Query: Using Chinese-CLIP and Redis Search

The article explains how Retrieval‑Augmented Generation (RAG) outperforms direct LLM inference by enabling real‑time knowledge updates and lower costs, and demonstrates a practical multi‑modal RAG pipeline that uses Chinese‑CLIP for vector encoding, various chunking strategies, and Redis Search for fast vector storage and retrieval.

Chinese-CLIP · Chunking · LLM

0 likes · 17 min read

AI Large Model Application Practice

Aug 29, 2024 · Artificial Intelligence

8 Essential Indexing Strategies to Boost Enterprise RAG Performance

This article presents eight practical optimization recommendations for the indexing stage of enterprise‑level Retrieval‑Augmented Generation (RAG) applications, covering chunk creation, abbreviation handling, multimodal document processing, semantic enrichment, metadata usage, alternative index types, and embedding model selection.

Chunking · Indexing · Metadata

0 likes · 15 min read

vivo Internet Technology

Jul 3, 2024 · Databases

End-to-End Data Consistency Verification for MySQL in DTS

The Vivo Internet Storage R&D team's article describes an end‑to‑end MySQL data‑consistency verification tool for DTS that uses fixed‑size chunking and CRC32/MD5 fingerprint aggregation to quickly compare source and target tables, pinpoint mismatched rows, and enable automated or manual correction while minimizing impact on replication.

CRC32 · Chunking · DTS

0 likes · 13 min read

Code Ape Tech Column

Mar 20, 2023 · Backend Development

Implementing Large File Upload with Chunking, Resume, and Instant Transfer Using Java RandomAccessFile

This article explains how to handle 2 GB video uploads by splitting files into chunks, using breakpoint resume and instant transfer techniques, and leveraging Java's RandomAccessFile together with Spring Boot and Redis to manage upload state, merge chunks, and store the final file.

Chunking · RandomAccessFile · Redis

0 likes · 12 min read

Code Ape Tech Column

Sep 14, 2021 · Backend Development

Large File Upload with Chunking, Resume, and RandomAccessFile in Java

This article explains how to handle multi‑gigabyte video uploads by splitting files into chunks, using MD5 for identification, implementing resumable and instant uploads with Spring Boot and Redis, and leveraging Java's RandomAccessFile and memory‑mapped I/O for efficient merging.

Chunking · Java · RandomAccessFile

0 likes · 15 min read

Xueersi Online School Tech Team

Aug 6, 2021 · Backend Development

Understanding RTMP Handshake, Chunking, and Nginx‑RTMP Live Streaming Implementation

This article explains the RTMP protocol’s handshake process, chunking and message formats, details command flows such as connect, createStream, publish and play, and describes how the Nginx‑RTMP module implements live streaming, acceleration, authentication, recording and HLS slicing.

Chunking · Live video · Media Server

0 likes · 21 min read
