Tagged articles
29 articles
Architect's Ambition
Apr 22, 2026 · Artificial Intelligence

From Natural Language to Executable SQL: Building an AI‑Powered SQL Generation Engine

The article explains why letting large language models generate SQL directly leads to poor accuracy, and presents a production‑grade engine that combines a semantic knowledge layer, RAG‑enhanced NL‑to‑DSL conversion, and a deterministic DSL‑to‑SQL translator to achieve 85‑90% correctness in real‑world deployments.

DSL2SQL · NL2DSL · RAG
0 likes · 13 min read
Big Data Tech Team
Jan 7, 2026 · Big Data

How AI Is Transforming Data Warehouses: Automation, SQL Generation, and NLQ

This article explores how artificial intelligence enhances data warehouses by automating model design, generating SQL from natural language, optimizing resource scheduling, and enabling business users to converse directly with data, while also reviewing leading tools and future cloud‑native trends.

AI · Automated Modeling · Data Warehouse
0 likes · 7 min read
dbaplus Community
Nov 3, 2025 · Artificial Intelligence

How RAG Turns Natural Language Queries into Accurate SQL for Data Platforms

This article explains how Retrieval‑Augmented Generation (RAG) combines vector databases with large language models to let non‑technical users ask natural‑language questions and receive precise SQL statements, detailing the workflow, architecture, chunking methods, performance gains, and remaining challenges.

Data Platform · LLM · RAG
0 likes · 17 min read
DaTaobao Tech
Aug 13, 2025 · Artificial Intelligence

Unlocking AI Power: A Complete Guide to Prompt Engineering and Advanced Techniques

This article explores the emerging field of prompt engineering, detailing its fundamentals, advanced strategies such as chain‑of‑thought, ReAct, and structured frameworks, and demonstrates practical applications in AI agents for data retrieval, SQL generation, and market insight, offering actionable guidance for developers and business users alike.

AI agents · Data Retrieval · RAG
0 likes · 42 min read
DataFunSummit
Jun 6, 2025 · Artificial Intelligence

Automating High‑Quality NL2SQL Data Synthesis with Intermediate Representations

This work tackles the difficulty of incorporating extensive domain knowledge into in‑domain NL2SQL tasks by proposing an intermediate‑representation‑based data synthesis method that decouples knowledge compliance from SQL generation, enabling automated creation of high‑quality training data with 60× human efficiency and over 97% accuracy.

NL2SQL · SQL generation · data synthesis
0 likes · 2 min read
Youzan Coder
Jun 6, 2025 · Artificial Intelligence

How AI Agents Turn Manual Data Retrieval into Fully Automated Insights

This article examines the challenges of manual data extraction in data‑driven enterprises, explains why large language models alone fall short, and details how the Cursor‑Agent framework automates end‑to‑end querying, knowledge‑base integration, and result validation to become a self‑sufficient "data master" for both technical and non‑technical users.

AI Agent · Cursor Agent · Data Automation
0 likes · 26 min read
Qborfy AI
Mar 28, 2025 · Artificial Intelligence

Master Prompt Engineering: From Basics to Advanced SQL Generation

This article walks readers through the fundamentals of prompt engineering—covering role, context, instruction, examples, and output formatting—then demonstrates a step‑by‑step construction of a sophisticated SQL‑generation prompt, complete with concrete code snippets, best‑practice tips, and reference resources.

AI Prompt Design · Instruction Tuning · Practical Examples
0 likes · 21 min read
Su San Talks Tech
Mar 8, 2025 · Databases

Quickly Design Databases with the Open‑Source DrawDB Tool

This guide introduces the open‑source DrawDB tool, walks through Docker‑based installation, demonstrates designing tables and relationships using the permission module of an e‑commerce mall as an example, and covers SQL import/export, theme switching, and template customization, helping developers create database schemas efficiently.

Database design · Docker · DrawDB
0 likes · 6 min read
DataFunSummit
Feb 3, 2025 · Databases

NL2SQL from a Database Perspective: Overview, History, and Laboratory Projects (GAR, MetaSQL, PURPLE)

This article presents a comprehensive overview of NL2SQL, covering its definition, motivations, application scenarios, key technical components, evaluation metrics, historical development stages, and detailed descriptions of three laboratory projects—GAR, MetaSQL, and PURPLE—along with future research directions and a Q&A session.

AI · MetaSQL · NL2SQL
0 likes · 17 min read
DataFunSummit
Nov 8, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation

ChatDBA, developed by Shanghai Aikesheng, is an AI-driven database operation assistant that leverages large language models and Retrieval‑Augmented Generation to provide fault diagnosis, knowledge learning, SQL generation and optimization, addressing challenges such as vague outputs, complex troubleshooting logic, and memory management through a structured architecture and multi‑modal retrieval strategies.

AI · Fault Diagnosis · RAG
0 likes · 10 min read
Baidu Geek Talk
Oct 22, 2024 · Big Data

How Baidu’s DATAPILOT Uses NVIDIA RAPIDS to Supercharge SQL Analytics

Baidu’s DATAPILOT platform combines natural‑language interaction with GPU‑accelerated Spark‑RAPIDS to turn complex, multi‑table SQL queries into seconds‑fast results, boosting ad‑revenue analysis efficiency by up to five‑fold while reducing infrastructure costs.

Apache Spark · Baidu · Big Data
0 likes · 10 min read
DataFunTalk
Oct 4, 2024 · Artificial Intelligence

Building a Commercial Intelligence Assistant with Baidu's Wenxin Large Model: Methods, Optimizations, and Future Outlook

This article shares the exploration and practice of using Baidu's Wenxin large model to build a commercial intelligence assistant, highlighting its impact on business revenue and user experience, code generation, knowledge graph integration, database query optimization, and visual analytics for enhanced data analysis.

AI for enterprise · Knowledge Graph · SQL generation
0 likes · 17 min read
StarRocks
May 14, 2024 · Artificial Intelligence

How Tencent Games Boosted AI‑Generated SQL Accuracy to 89% with a Lakehouse Architecture

Tencent Games tackled the low accuracy of AI‑generated SQL in production by combining large language models with a StarRocks lakehouse, introducing a semantic layer, asynchronous materialized views, and a multi‑agent framework, ultimately raising one‑shot SQL correctness to 89% and cutting delivery time from 2 hours to 0.33 hours.

AI · LLM · Lakehouse
0 likes · 13 min read
StarRocks
Apr 25, 2024 · Artificial Intelligence

How AI Boosts SQL Accuracy and Performance: Real‑World Demo & AutoMV Insights

The April 16 online meetup by Tencent Game Data and StarRocks explored AI‑generated SQL, tackled NL2SQL challenges, showcased a demo that lifted one‑shot accuracy to 89%, and introduced StarRocks AutoMV technology that automates materialized‑view recommendation and merging to accelerate data‑warehouse queries.

AI · AutoMV · NL2SQL
0 likes · 9 min read
JD Retail Technology
Oct 26, 2023 · Artificial Intelligence

Leveraging Large Language Models for Text-to-SQL: Prompt Design and End-to-End Pipeline

This article explains how large language models can be used to convert natural language queries into SQL statements, describes two main approaches—direct generation and fine‑tuned open‑source models—details prompt engineering techniques, and outlines an end‑to‑end pipeline that executes the generated SQL and summarizes results.

ChatGLM · LLM · Prompt engineering
0 likes · 7 min read
ITPUB
Jun 5, 2023 · Databases

How ChatGPT Could Transform Database Development and Management

This article explores why ChatGPT has become a breakthrough AI technology, reviews past AI‑database integrations, and envisions multiple future use cases—including natural‑language SQL generation, performance tuning, schema and architecture design, data analysis, and governance—showing how developers, DBAs, and analysts might benefit.

AI · ChatGPT · SQL generation
0 likes · 9 min read
Shopee Tech Team
Aug 25, 2022 · Frontend Development

Datav: Building an Efficient Data Visualization Platform for Dashboard Development

Datav, Shopee’s internal low‑code data‑visualization platform, automates dashboard creation by providing a DSL‑driven designer, reusable chart components, and auto‑generated SQL, cutting project cycles from roughly 40 to 20 days and front‑end effort from ten to three person‑days while streamlining data‑source integration and permission management.

Dashboard · Data visualization · Low‑code platform
0 likes · 17 min read
ByteDance Data Platform
Jul 18, 2022 · Big Data

Unlocking Real‑Time Data Quality: ByteDance’s Dynamic Exploration Solution

This article explains how ByteDance’s dynamic data exploration tool improves data quality assurance by replacing time‑consuming SQL validation with real‑time, sample‑based profiling, detailing its problem background, core features, technical architecture, front‑end rendering techniques, operation‑stack management, and future enhancements.

Big Data · SQL generation · data exploration
0 likes · 13 min read
ITPUB
Sep 14, 2020 · Big Data

How Alibaba’s DChain Data Converger Auto‑Generates Real‑Time Wide Tables with SQL Pipelines

This article explains how the ADC (Alibaba DChain Data Converger) project automatically creates large real‑time tables by letting users configure metrics on the front‑end, then generating and publishing SQL through a pipeline that leverages design patterns, priority queues, and tree‑based data structures for efficient cross‑database processing.

Design Patterns · Flink · Real-time analytics
0 likes · 15 min read
Alibaba Cloud Developer
Sep 7, 2020 · Big Data

How Alibaba’s ADC Project Automates Real‑Time SQL Generation with Design Patterns and Priority Queues

This article explains how the Alibaba DChain Data Converger (ADC) automatically creates wide‑table SQL for real‑time cross‑database analytics by using a pipeline architecture, priority‑queue‑driven task scheduling, and specific design patterns to handle metadata, joins, and resource management.

Big Data · SQL generation · priority-queue
0 likes · 13 min read
Meituan Technology Team
Nov 2, 2017 · Big Data

Dashiang Cube: A Multi‑Source BI Reporting Tool with Custom Join Algorithms

Meituan‑Dianping’s Dashiang Cube is a multi‑source BI reporting platform that unifies MySQL, Kylin, Elasticsearch and plain‑text data via a common SQL layer, generates dialect‑specific queries, performs custom back‑tracking inner and left outer joins across heterogeneous sources, supports scripted metric calculations, permission controls, and a reusable UI component library for self‑service reporting.

BI · Data Permissions · Data Warehouse
0 likes · 14 min read
ITPUB
Sep 14, 2017 · Artificial Intelligence

How Salesforce’s Seq2SQL Turns Natural Language into SQL with Reinforcement Learning

Salesforce’s recent research introduces Seq2SQL, a reinforcement‑learning‑driven sequence‑to‑sequence model that translates natural‑language questions into SQL queries, eliminating the need to learn SQL, and includes the large WikiSQL dataset built from crowdsourced NL‑SQL pairs for training and evaluation.

AI · SQL generation · Seq2SQL
0 likes · 6 min read