Tagged articles

custom embeddings

1 articles · Page 1 of 1
Programmer DD
Programmer DD
Jun 18, 2026 · Artificial Intelligence

How Cursor Instantly Understands Massive Codebases

The article dissects Cursor's code‑base indexing pipeline, explaining how semantic vector search, trigram‑based regex filtering, AST‑driven chunking, custom embeddings trained on agent trajectories, Merkle‑tree change detection, and Turbopuffer's namespace‑per‑repo vector store combine to deliver sub‑second, accurate code retrieval even in monorepos with tens of thousands of files.

CursorMerkle treecode indexing
0 likes · 21 min read
How Cursor Instantly Understands Massive Codebases