Tagged articles
19 articles
360 Tech Engineering
Dec 26, 2025 · Artificial Intelligence

15 Chunking Strategies to Supercharge Retrieval‑Augmented Generation

This article presents fifteen practical chunking techniques—ranging from line‑by‑line and fixed‑size chunking to semantic and hierarchical methods—explaining their principles, ideal use‑cases, concrete input examples, chunk outputs, and key advantages or cautions for improving Retrieval‑Augmented Generation with large language models.

AI · Data Retrieval · LLM
28 min read
Alibaba Cloud Developer
Dec 16, 2025 · Artificial Intelligence

How We Built an AI‑Powered Data Agent to Automate Data Retrieval at Scale

This article details the design and implementation of Matra, an AI‑driven data assistant for a large e‑commerce platform, covering the challenges of legacy data assets, knowledge‑base construction, GraphRAG integration, multi‑stage agent frameworks, practical results, and future plans for continuous improvement.

AI · Data Retrieval · Knowledge Graph
22 min read
DaTaobao Tech
Aug 13, 2025 · Artificial Intelligence

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

This article explores the emerging field of prompt engineering, detailing its fundamentals, advanced strategies such as chain‑of‑thought, ReAct, and structured frameworks, and demonstrates practical applications in AI agents for data retrieval, SQL generation, and market insight, offering actionable guidance for developers and business users alike.

AI agents · Data Retrieval · RAG
42 min read
Rare Earth Juejin Tech Community
Feb 13, 2025 · Big Data

Configuring and Using DeepSeek Search Engine in Cursor for Efficient Data Retrieval

This article introduces DeepSeek, a high‑efficiency search engine optimized for large‑scale data, explains how to configure it within the Cursor database tool using code snippets, and demonstrates its applications such as semantic search, content recommendation, intelligent data analysis, and document similarity matching.

Big Data · Configuration · Cursor
6 min read
ITPUB
Feb 1, 2025 · Databases

How to Query Elasticsearch with SQL: A Step‑by‑Step Guide

This tutorial explains how to use Elasticsearch's built‑in SQL interface to replace complex Query DSL statements with familiar SQL syntax, covering environment setup, index creation, data insertion, and a variety of query examples such as filtering, ordering, grouping, aggregation, and date manipulation.

Data Retrieval · Elasticsearch · Query DSL
8 min read
dbaplus Community
Nov 12, 2024 · Databases

How to Query Elasticsearch with SQL: A Step‑by‑Step Guide

This tutorial shows how to use Elasticsearch's built‑in SQL support to run familiar MySQL‑style queries—creating an index, inserting documents, and performing SELECT, WHERE, ORDER BY, GROUP BY, LIMIT, aggregation, date and string functions—making ES data retrieval as easy as relational databases.

Data Retrieval · Elasticsearch · Query DSL
7 min read
ByteDance Data Platform
Sep 25, 2024 · Artificial Intelligence

How LLMs Power the “Find Data Assistant” for Smarter Data Retrieval

This article explains how the Volcano Engine DataLeap team leveraged large‑language models to build the “Find Data Assistant”, detailing its design, challenges, embedding‑and‑reranker enhancements, LLM‑driven semantic search, mixed‑ranking architecture, and practical lessons for improving data asset management and retrieval.

Data Asset Management · Data Retrieval · Embedding
17 min read
DataFunSummit
Sep 21, 2024 · Artificial Intelligence

DataLeap "Find Data Assistant": Leveraging Large Language Models for Data Asset Retrieval and Management

This article details how the DataLeap team applied large language model technology to build the "Find Data Assistant" platform, addressing the challenges of locating and using massive data assets through a hybrid retrieval architecture, enhanced embedding, reranking, mixed ranking, and answer summarization, while sharing practical lessons and future directions.

Data Asset Management · Data Retrieval · Embedding
17 min read
Selected Java Interview Questions
Dec 29, 2022 · Backend Development

Optimizing Large‑Scale Data Retrieval with ClickHouse, Elasticsearch Scroll Scan, ES+HBase, and RediSearch+RedisJSON

This article examines a business requirement to filter up to 100 000 records from a pool of tens of millions, presenting and evaluating four backend solutions—multithreaded ClickHouse pagination, Elasticsearch scroll‑scan, an ES‑HBase hybrid, and RediSearch + RedisJSON—along with performance data and implementation details.

Backend · ClickHouse · Data Retrieval
11 min read
Zuoyebang Tech Team
Apr 7, 2022 · Cloud Native

How Fluid Transforms Large‑Scale Data Retrieval on Kubernetes

This article explains how Zuoyebang redesigned its massive data retrieval platform by separating compute and storage with the Fluid project on Kubernetes, achieving minute‑level hundred‑TB distribution, elastic caching, and improved stability for real‑time educational services.

Compute-Storage Separation · Data Retrieval · Fluid
8 min read
Bilibili Tech
Feb 18, 2022 · Big Data

Evolution of Bilibili's Data Retrieval Services and Lakehouse Architecture

Bilibili’s data retrieval journey progressed from a fragmented, siloed (chimney‑style) pipeline to a unified Flink‑based service layer with the Ark construction system and Akuya SQL engine, and finally to an Iceberg‑driven lakehouse that eliminates data duplication, streamlines cross‑engine optimization, and offers platform‑level, low‑latency analytics.

Big Data · Bilibili · Data Retrieval
14 min read
21CTO
Nov 29, 2021 · Databases

Why Moving Business Logic to the Application Boosts Scalability

The article explains how shifting complex filtering, validation, and join operations from the database to the application layer reduces scarce database resource usage, improves scalability, and enables more flexible resource allocation for modern web applications.

Data Retrieval · Scalability · application architecture
7 min read
ITPUB
Jan 26, 2016 · Backend Development

How Python Powers Star Wars Visual Effects and the Star Wars API

This article explores Python's role in Industrial Light & Magic's visual effects for Star Wars, introduces the Star Wars API, and demonstrates Python scripts that query film data and build a simple Star Wars-themed game.

Backend Development · Data Retrieval · Game Development
5 min read