Tagged articles
13 articles
Page 1 of 1
Architect's Guide
Architect's Guide
Mar 21, 2026 · Artificial Intelligence

Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search

WeKnora is a Tencent‑open‑source LLM‑based document understanding and semantic search framework that extracts structured content from PDFs, Word files and images, offers agent‑driven reasoning, multi‑modal retrieval, and a modular architecture, with step‑by‑step Docker deployment and a web UI for instant querying.

AILLMRAG
0 likes · 7 min read
Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Mar 20, 2026 · Artificial Intelligence

Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It

This article analyzes the fundamental limitations of traditional vector‑based Retrieval‑Augmented Generation, introduces Vectify AI’s reasoning‑driven PageIndex framework, and explains how hierarchical, non‑vector indexing enables more accurate, context‑aware document retrieval for complex, domain‑specific texts.

AILLMPageIndex
0 likes · 15 min read
Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It
Architecture Digest
Architecture Digest
Jan 22, 2026 · Artificial Intelligence

Unlock AI-Powered Document Search with WeKnora: A Hands‑On Guide

WeKnora is an open‑source LLM‑driven framework that transforms complex, multi‑format documents into searchable semantic knowledge, offering features such as Agent mode, hybrid retrieval, secure private deployment, and an easy‑to‑use web UI, with step‑by‑step installation instructions and demo screenshots.

AILLMWeKnora
0 likes · 7 min read
Unlock AI-Powered Document Search with WeKnora: A Hands‑On Guide
Sohu Tech Products
Sohu Tech Products
Jan 14, 2026 · Artificial Intelligence

Build a Zero‑Cost Open‑Source RAG Smart Document Q&A System from Scratch

This guide walks through building an open‑source Retrieval‑Augmented Generation (RAG) system that indexes local files with Everything, uses hybrid BM25‑vector search via Elasticsearch, and answers questions with a local LLM, covering architecture, core techniques, deployment steps, performance tweaks, and common pitfalls.

ElasticsearchLLMPython
0 likes · 11 min read
Build a Zero‑Cost Open‑Source RAG Smart Document Q&A System from Scratch
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Sep 4, 2024 · Artificial Intelligence

Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning

This article introduces Kotaemon, an open‑source Retrieval‑Augmented Generation platform that lets users chat with their documents, offering a self‑hosted web UI, support for local and API LLMs, hybrid retrieval, multimodal question answering, GraphRAG indexing, and advanced reasoning capabilities, along with step‑by‑step installation via App or Docker.

GraphRAGLLMMultimodal QA
0 likes · 6 min read
Hot Open-Source RAG Tool for Document Chat: GraphRAG, Multimodal QA & Complex Reasoning
Sohu Tech Products
Sohu Tech Products
Mar 27, 2024 · Artificial Intelligence

Building a RAG Application with Baidu Vector Database and Qianfan Embedding

This tutorial walks through building a Retrieval‑Augmented Generation application by setting up Baidu’s Vector Database and Qianfan embedding service, configuring credentials, creating a document database and vector table, loading and chunking PDFs, generating embeddings, storing them, and performing scalar, vector and hybrid similarity searches, ready for integration with Wenxin LLM for answer generation.

AI applicationsBaidu QianfanEmbedding
0 likes · 11 min read
Building a RAG Application with Baidu Vector Database and Qianfan Embedding
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Mar 22, 2024 · Artificial Intelligence

Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration

This article explains how traditional keyword search with Elasticsearch often yields inaccurate or incomplete results for document retrieval, introduces vectorization and semantic search using NLP embeddings, and demonstrates a practical workflow that combines these techniques with the Milvus vector database to achieve more accurate and efficient document search.

AIElasticsearchMilvus
0 likes · 13 min read
Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration
ByteDance Web Infra
ByteDance Web Infra
Jun 16, 2023 · Artificial Intelligence

How AIGC Transforms Document Search: Architecture, Techniques, and Future Directions

This article explains how AI‑generated content (AIGC) reshapes document search by combining traditional indexing with modern embedding and prompt‑tuning techniques, reviews key components such as LangChain and Supabase, compares existing AI‑search products, and discusses the future blend of classic and AI‑driven search.

AI searchAIGCEmbedding
0 likes · 15 min read
How AIGC Transforms Document Search: Architecture, Techniques, and Future Directions
Meituan Technology Team
Meituan Technology Team
Aug 5, 2021 · Artificial Intelligence

Overview of Meituan's ACL 2021 Accepted Papers

Meituan’s 2021 ACL contributions comprise seven accepted papers—six long and one short—introducing novel approaches to event argument decoding, cross‑domain slot transfer, contrastive out‑of‑domain detection, novel slot discovery, self‑supervised sentence representation, unsupervised semantic parsing, and pseudo‑query‑enhanced dense retrieval, inviting further research and collaboration.

ACLEvent ExtractionMeituan
0 likes · 22 min read
Overview of Meituan's ACL 2021 Accepted Papers