Tagged articles

Embedding Models

8 articles · Page 1 of 1

Jun 14, 2026 · Artificial Intelligence

10 RAG Architectures Every AI Engineer Should Master

The article debunks the claim that Retrieval‑Augmented Generation is obsolete, explains why huge context windows are impractical, and systematically presents ten RAG patterns—from basic Naïve RAG to advanced Graph and Multimodal RAG—detailing their trade‑offs, costs, and suitable use cases.

AI ArchitectureEmbedding ModelsRAG

0 likes · 16 min read

10 RAG Architectures Every AI Engineer Should Master

AI Engineer Programming

May 6, 2026 · Artificial Intelligence

How to Evaluate and Choose Embedding Models for RAG Systems

This article explains why embedding models are the foundation of RAG pipelines, outlines concrete evaluation metrics such as MTEB v2 scores, latency, throughput and cost, compares a range of commercial and open‑source models, and discusses emerging trends like multimodal and long‑context embeddings.

Embedding ModelsMTEBRAG

0 likes · 13 min read

How to Evaluate and Choose Embedding Models for RAG Systems

AI Architect Hub

Apr 21, 2026 · Artificial Intelligence

How to Choose the Right Embedding Model for RAG: A Practical Comparison

This article examines the key factors for selecting embedding models in Retrieval‑Augmented Generation, comparing dimensions, context windows, MTEB scores, pricing, and language support across major providers, and offers practical recommendations, cost estimates, and pitfalls to avoid.

AIEmbedding ModelsOpen Source

0 likes · 11 min read

How to Choose the Right Embedding Model for RAG: A Practical Comparison

Qborfy AI

Feb 18, 2026 · Artificial Intelligence

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

This article explains Retrieval‑Augmented Generation (RAG), detailing its offline knowledge‑base construction and online retrieval‑enhanced generation workflow, comparing it with traditional and fine‑tuned models, and providing step‑by‑step LangChain implementations, advanced techniques, and practical use‑case demos.

Embedding ModelsLangChainRAG

0 likes · 16 min read

How Retrieval‑Augmented Generation (RAG) Supercharges LLM Answers – Complete Guide & Code

AI Algorithm Path

Jan 11, 2026 · Artificial Intelligence

How Vector Embeddings Enable AI to Understand Anything

This article explains the principle of vector embeddings, shows how they turn words, images, audio and other data into dense numeric vectors, compares them with one‑hot encoding, describes static and contextual models, training methods, similarity metrics, and a wide range of real‑world AI applications.

AI FundamentalsEmbedding ModelsRAG

0 likes · 15 min read

How Vector Embeddings Enable AI to Understand Anything

Architect's Alchemy Furnace

Jul 7, 2025 · Artificial Intelligence

How to Build an AI‑Powered Knowledge Retrieval Workflow with Dify and Embedding Models

This guide explains how to organize a knowledge base, select embedding and rerank models, clean user queries, and construct a Dify DSL workflow that iteratively retrieves and merges information before handing it to a large language model for answer generation.

AI workflowDifyEmbedding Models

0 likes · 23 min read

How to Build an AI‑Powered Knowledge Retrieval Workflow with Dify and Embedding Models

Rare Earth Juejin Tech Community

Feb 25, 2024 · Artificial Intelligence

Pinecone Vector Database and Embedding Model Summary from DeepLearning.AI’s AI Course

This article reviews the author’s hands‑on experience with Pinecone’s serverless vector database, various embedding and generation models such as all‑MiniLM‑L6‑v2, text‑embedding‑ada‑002, clip‑ViT‑B‑32, and GPT‑3.5‑turbo‑instruct, and demonstrates how they are applied to semantic search, RAG, recommendation, hybrid, and facial similarity tasks using Python code examples.

AIEmbedding ModelsPinecone

0 likes · 9 min read

Pinecone Vector Database and Embedding Model Summary from DeepLearning.AI’s AI Course

Kuaishou Tech

Aug 26, 2023 · Artificial Intelligence

PetPS: A Persistent‑Memory Parameter Server for Large‑Scale Embedding Models

PetPS introduces a persistent‑memory‑based parameter server that redesigns indexing with the PetHash hash table and offloads parameter aggregation to NIC Gathering, achieving up to 1.7× higher throughput and significantly lower latency for industrial‑scale embedding models in recommendation, search, and advertising workloads.

Embedding ModelsParameter ServerPerformance Optimization

0 likes · 14 min read

PetPS: A Persistent‑Memory Parameter Server for Large‑Scale Embedding Models