RAG | BestHub

Collection size

167 articles

Page 8 of 9

Java Architecture Diary

Feb 13, 2025 · Artificial Intelligence

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

This guide walks through building a Java RAG system with DeepSeek R1, Milvus, and Spring, covering environment setup, vector model integration via OpenAI protocol, Maven dependencies, data embedding, and a chat endpoint that combines semantic retrieval with LLM generation.

AI integrationDeepSeekJava

0 likes · 11 min read

Create a Java RAG System Using DeepSeek R1, Milvus, and Spring

Instant Consumer Technology Team

Jun 12, 2025 · Artificial Intelligence

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

This guide walks through using Alibaba's new Qwen3-Embedding and Qwen3-Reranker models to build a two‑stage Retrieval‑Augmented Generation pipeline with Milvus, covering environment setup, data ingestion, vector indexing, reranking, and LLM‑driven answer generation, demonstrating production‑grade performance across multilingual queries.

LLMMilvusPython

0 likes · 19 min read

How to Build a Production-Ready RAG System with Qwen3 Embedding and Reranker Models

Instant Consumer Technology Team

Jun 4, 2025 · Artificial Intelligence

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

This comprehensive article examines Retrieval‑Augmented Generation (RAG), covering its historical evolution, core theory, implementation variants, practical code examples, diverse applications, current controversies, and future research directions within the AI and NLP landscape.

Generative ModelsRAGRetrieval-Augmented Generation

0 likes · 21 min read

Unlocking Retrieval-Augmented Generation: Theory, Practice, and Future Trends

Instant Consumer Technology Team

May 29, 2025 · Artificial Intelligence

API vs GUI Agents: How to Choose the Right LLM Automation Approach

This article examines the evolution of large language model agents, contrasting API‑based agents that use predefined function calls with GUI‑based agents that interact with visual interfaces, and explores hybrid strategies, orchestration tools, RAG techniques, and practical guidelines for selecting the optimal paradigm.

API vs GUIHybrid automationLLM agents

0 likes · 34 min read

API vs GUI Agents: How to Choose the Right LLM Automation Approach

Instant Consumer Technology Team

May 29, 2025 · Artificial Intelligence

How to Build an Agent‑Powered Financial Q&A System with RAG and SQL

This article explains how to construct a financial question‑answering agent that automatically decides between SQL queries and RAG retrieval, covering intent recognition, tool creation, prompt design, agent initialization, and end‑to‑end testing with Python code.

LangChainPythonRAG

0 likes · 13 min read

How to Build an Agent‑Powered Financial Q&A System with RAG and SQL

Qunhe Technology Quality Tech

Aug 14, 2024 · Artificial Intelligence

Should Your Testing Team Build a Private LLM or Use RAG with a General Model?

This article compares the high costs and technical challenges of building a private large language model with the benefits, flexibility, and lower risk of using Retrieval‑Augmented Generation (RAG) on a general LLM, offering practical guidance for testing teams seeking AI assistance.

AIRAGTesting Automation

0 likes · 11 min read

Should Your Testing Team Build a Private LLM or Use RAG with a General Model?

Qunhe Technology Quality Tech

Aug 27, 2024 · Artificial Intelligence

Boosting Test Code Quality: How Large Language Models Transform Code Review

This article explores how mature testing teams can leverage large language models for automated code review, outlining the advantages, challenges, and a practical implementation using FastGPT and GitLab CI to build a low‑cost, AI‑enhanced review system that improves efficiency and feedback quality.

AIFastGPTGitLab CI

0 likes · 10 min read

Boosting Test Code Quality: How Large Language Models Transform Code Review

Data Thinking Notes

Jun 11, 2025 · Artificial Intelligence

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

This case study details how a government‑focused AI system using retrieval‑augmented generation (RAG) and advanced preprocessing algorithms increased data labeling speed by up to five times, raised accuracy above 95%, and produced high‑quality enterprise, spatial, and economic datasets.

AIRAGautomation

0 likes · 5 min read

How RAG‑Powered AI Boosted Government Data Labeling Efficiency by 5×

DaTaobao Tech

Jul 19, 2024 · Artificial Intelligence

Practices and Techniques for Vertical Domain Large Language Models

Vertical domain large language models, fine‑tuned on specialized data, deliver higher expertise and task performance, but require continual knowledge updates and careful alignment; techniques such as BPO‑guided instruction tuning (+1.8% accuracy), Reflexion‑based Text2API (+4% API correctness), advanced RAG preprocessing, and SFT combined with ORPO (+5.2% gain) demonstrate notable improvements while underscoring remaining challenges and collaborative opportunities.

AIRAGSFT

0 likes · 9 min read

Practices and Techniques for Vertical Domain Large Language Models

Tencent Cloud Developer

Jul 18, 2024 · Artificial Intelligence

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

Exploring Large Language Models, this article surveys their core concepts, evolution through Transformers, GPT and BERT, generation challenges, diverse applications such as QA, multimodal creation, summarization and retrieval‑augmented generation, prompt‑engineering frameworks and tools, LangChain‑based pipelines, AI‑driven agents, and future prospects toward domain‑specific use, multimodality, and AGI.

AILLMPrompt Engineering

0 likes · 35 min read

Exploring Large Language Models (LLM): Fundamentals, Applications, and Future Directions

Baidu Geek Talk

Mar 13, 2024 · Artificial Intelligence

Understanding Retrieval-Augmented Generation (RAG) and Building a Personal Knowledge Base with ERNIE SDK and LangChain

The article explains Retrieval-Augmented Generation (RAG), its workflow, advantages, comparison with fine-tuning, and provides a step-by-step implementation using Baidu's ERNIE SDK, LangChain, and ChromaDB to build a personal knowledge base that answers queries with retrieved context.

AIERNIE SDKLangChain

0 likes · 13 min read

Understanding Retrieval-Augmented Generation (RAG) and Building a Personal Knowledge Base with ERNIE SDK and LangChain

Sohu Tech Products

Jun 5, 2024 · Artificial Intelligence

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

The article outlines LLM issues such as hallucination, outdated knowledge, and data privacy, then explains Retrieval‑Augmented Generation—detailing its data‑preparation and query‑time retrieval workflow, demonstrates a full LangChain implementation, and contrasts RAG with fine‑tuning as complementary strategies for up‑to‑date, grounded responses.

LLMLangChainPrompt Engineering

0 likes · 15 min read

Retrieval Augmented Generation (RAG): Concepts, Workflow, and LangChain Implementation

JD Tech

Jun 19, 2024 · Artificial Intelligence

Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs

This article surveys the rapid expansion of large AI models, covering prompt engineering, structured prompts, retrieval‑augmented generation, AI agents, fine‑tuning strategies, vector database technology, knowledge graphs, function calling, and their collective role in moving toward artificial general intelligence.

AIFine-tuningKnowledge Graph

0 likes · 23 min read

Advances in Large AI Models: Prompt Engineering, RAG, Agents, Fine‑Tuning, Vector Databases and Knowledge Graphs

JD Tech Talk

Jun 14, 2024 · Artificial Intelligence

Building a Retrieval‑Augmented Generation (RAG) System with JD Cloud Docs, ClickHouse, LangChain, and FastAPI

This guide explains how to build a Retrieval‑Augmented Generation (RAG) system using JD Cloud documentation as a knowledge base, storing document embeddings in ClickHouse, leveraging LangChain for vector retrieval, and exposing query and answer services via FastAPI and a Gradio UI.

AIClickHouseLangChain

0 likes · 13 min read

Building a Retrieval‑Augmented Generation (RAG) System with JD Cloud Docs, ClickHouse, LangChain, and FastAPI

Qunar Tech Salon

Aug 28, 2024 · Databases

Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL

This article explains the necessity of vector databases for AI workloads, reviews the PostgreSQL ecosystem, compares vector database options, provides detailed PgVector installation and usage steps, shares operational best‑practices, performance tuning tips, and real‑world deployment cases at Qunar and Tujia.

AIPerformance TuningPostgreSQL

0 likes · 24 min read

Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL

Cognitive Technology Team

Feb 6, 2025 · Artificial Intelligence

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

This article presents ten detailed techniques for effectively using DeepSeek's large language models—including mode selection, model comparisons, knowledge updates, prompt engineering, RAG, file uploads, API access, and open‑source resources—while offering concrete examples and code snippets for each feature.

AI APIDeepSeekLarge Language Model

0 likes · 12 min read

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

AntTech

Apr 2, 2025 · Artificial Intelligence

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

The PEAR framework introduces a position‑embedding‑agnostic attention re‑weighting method that detects and suppresses detrimental attention heads in large language models, dramatically improving retrieval‑augmented generation performance without adding any inference overhead, as demonstrated on multiple RAG benchmarks and LLM families.

Attention Re-weightingContext AwarenessLLM

0 likes · 6 min read

PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead

Rare Earth Juejin Tech Community

Apr 12, 2024 · Artificial Intelligence

Typical Business and Technical Architectures for Large Language Model Applications

This article reviews the common business and technical architectures used in large language model (LLM) applications, explains AI Embedded, AI Copilot, and AI Agent modes—including single‑ and multi‑agent systems—and offers guidance on selecting appropriate technology stacks such as prompt‑only, function‑calling agents, RAG, and fine‑tuning.

AI AgentArchitectureFine-tuning

0 likes · 9 min read

Typical Business and Technical Architectures for Large Language Model Applications

Rare Earth Juejin Tech Community

Jan 31, 2024 · Artificial Intelligence

Advanced RAG with Semi‑Structured Data Using LangChain, Unstructured, and ChromaDB

This tutorial demonstrates how to build an advanced Retrieval‑Augmented Generation (RAG) system for semi‑structured PDF data by leveraging LangChain, the unstructured library, ChromaDB vector store, and OpenAI models, covering installation, PDF partitioning, element classification, summarization, and query execution.

AIChromaDBLangChain

0 likes · 11 min read

Advanced RAG with Semi‑Structured Data Using LangChain, Unstructured, and ChromaDB

Qunhe Technology Quality Tech

Sep 19, 2024 · Artificial Intelligence

Deploy FastGPT Locally: Step‑by‑Step Docker & Source Code Guide for RAG AI

This article explains how to set up FastGPT, a Retrieval‑Augmented Generation (RAG) knowledge‑base system powered by large language models, covering both Docker‑compose image deployment and source‑code installation, including environment configuration, database setup, and API usage examples.

AIDockerFastGPT

0 likes · 13 min read

Deploy FastGPT Locally: Step‑by‑Step Docker & Source Code Guide for RAG AI