Tagged articles
7 articles
Page 1 of 1
AI Engineer Programming
AI Engineer Programming
May 6, 2026 · Artificial Intelligence

How to Evaluate and Choose Embedding Models for RAG Systems

This article explains why embedding models are the foundation of RAG pipelines, outlines concrete evaluation metrics such as MTEB v2 scores, latency, throughput and cost, compares a range of commercial and open‑source models, and discusses emerging trends like multimodal and long‑context embeddings.

MTEBModel SelectionRAG
0 likes · 13 min read
How to Evaluate and Choose Embedding Models for RAG Systems
AI Architect Hub
AI Architect Hub
Apr 21, 2026 · Artificial Intelligence

How to Choose the Right Embedding Model for RAG: A Practical Comparison

This article examines the key factors for selecting embedding models in Retrieval‑Augmented Generation, comparing dimensions, context windows, MTEB scores, pricing, and language support across major providers, and offers practical recommendations, cost estimates, and pitfalls to avoid.

AIOpen-sourceRAG
0 likes · 11 min read
How to Choose the Right Embedding Model for RAG: A Practical Comparison
Qborfy AI
Qborfy AI
Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Hybrid SearchLangChainPrompt engineering
0 likes · 16 min read
How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code
AI Algorithm Path
AI Algorithm Path
Jan 11, 2026 · Artificial Intelligence

How Vector Embeddings Enable AI to Understand Anything

This article explains the principle of vector embeddings, shows how they turn words, images, audio and other data into dense numeric vectors, compares them with one‑hot encoding, describes static and contextual models, training methods, similarity metrics, and a wide range of real‑world AI applications.

AI fundamentalsRAGembedding models
0 likes · 15 min read
How Vector Embeddings Enable AI to Understand Anything
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 25, 2024 · Artificial Intelligence

Pinecone Vector Database and Embedding Model Summary from DeepLearning.AI’s AI Course

This article reviews the author’s hands‑on experience with Pinecone’s serverless vector database, various embedding and generation models such as all‑MiniLM‑L6‑v2, text‑embedding‑ada‑002, clip‑ViT‑B‑32, and GPT‑3.5‑turbo‑instruct, and demonstrates how they are applied to semantic search, RAG, recommendation, hybrid, and facial similarity tasks using Python code examples.

AIPineconePython
0 likes · 9 min read
Pinecone Vector Database and Embedding Model Summary from DeepLearning.AI’s AI Course
Kuaishou Tech
Kuaishou Tech
Aug 26, 2023 · Artificial Intelligence

PetPS: A Persistent‑Memory Parameter Server for Large‑Scale Embedding Models

PetPS introduces a persistent‑memory‑based parameter server that redesigns indexing with the PetHash hash table and offloads parameter aggregation to NIC Gathering, achieving up to 1.7× higher throughput and significantly lower latency for industrial‑scale embedding models in recommendation, search, and advertising workloads.

Parameter ServerPerformance OptimizationPersistent Memory
0 likes · 14 min read
PetPS: A Persistent‑Memory Parameter Server for Large‑Scale Embedding Models