Tagged articles
17 articles
Ctrip Technology
Apr 16, 2026 · Big Data

How Ray + DuckDB Cut 900M-Row Attribution Queries from 40s to 15s

When attribution analysis on more than 900 million rows slowed to over 40 seconds and threatened cluster stability, Ctrip's smart attribution team rebuilt the architecture with Ray and DuckDB, achieving sub‑15‑second query times, a 160% performance gain, and complete resource isolation.

Attribution Analysis · Big Data · DuckDB
0 likes · 22 min read
IT Services Circle
Apr 3, 2026 · Operations

Turn Millions of Log Lines into Actionable Data with 6 Python Tools in 10 Minutes

This article shows how to replace manual grep searches on massive log files with six Python libraries—pygrok, drain3, datasketch, rapidfuzz, duckdb, and adtk—providing structured parsing, automatic clustering, near‑duplicate detection, fuzzy matching, SQL querying, and time‑series anomaly detection, all illustrated with real code examples and practical tips.

DuckDB · Python · adtk
0 likes · 12 min read
ITPUB
Mar 27, 2026 · Databases

AI’s Impact on Open‑Source Databases: MySQL, PostgreSQL, and AliSQL DuckDB

In 2026 the database ecosystem faces fierce competition between MySQL and PostgreSQL, while AI emerges as a new driver, prompting open‑source projects like AliSQL to release DuckDB integration, vector engines, and an intelligent CLI, and reshaping how relational databases serve both transactional and analytical workloads.

AI · AliSQL · DuckDB
0 likes · 15 min read
Data STUDIO
Mar 27, 2026 · Operations

Struggling with Log Files? 6 Python Libraries That Turn Logs into Actionable Data

This article introduces six Python libraries—pygrok, drain3, datasketch, rapidfuzz, duckdb, and adtk—that transform massive, unstructured log streams into structured, searchable, and analyzable data, showing concrete code examples, performance gains, and practical tips for real‑world troubleshooting.

DuckDB · Python · adtk
0 likes · 12 min read
Alibaba Cloud Developer
Mar 10, 2026 · Artificial Intelligence

Turning AI into an Analyst: OpenClaw Skill System + DuckDB for E‑Commerce Forecasting

This article explains how OpenClaw’s Skill system gives AI executable instructions, combines it with a DuckDB analytical instance, and builds a fully automated e‑commerce behavior forecasting pipeline that iteratively trains, validates, and optimizes models to achieve sub‑10% prediction error.

AI automation · DuckDB · E‑Commerce Forecasting
0 likes · 17 min read
Alibaba Cloud Developer
Feb 27, 2026 · Databases

How DuckDB Compression Supercharges AliSQL Storage and Cuts MySQL Costs

AliSQL integrates DuckDB as its storage engine to achieve high‑density columnar compression and fast analytical scans. The article details DuckDB’s multi‑layer storage format, adaptive compression algorithm selection, performance benchmarks against InnoDB, HBase, ClickHouse, and OceanBase, and the engineering optimizations AliSQL adds for throughput and cost reduction.

AliSQL · Columnar Storage · Database Optimization
0 likes · 12 min read
ITPUB
Feb 7, 2026 · Databases

Why MySQL Excels at Transactions but Falters in Analytics – DuckDB to the Rescue

The article examines why MySQL’s transaction layer remains robust while its analytical capabilities lag, outlines enterprise demands for simultaneous OLTP and OLAP workloads, introduces DuckDB’s lightweight yet powerful engine, describes AliSQL’s integration of DuckDB for full MySQL compatibility, and presents TPC‑H benchmark results showing a dramatic performance boost for analytical queries.

AliSQL · DuckDB · OLAP
0 likes · 23 min read
Data STUDIO
Nov 25, 2025 · Big Data

Why Parquet Is the Faster, Lighter, Safer Alternative to CSV in Python

The article explains why CSV becomes a bottleneck for large‑scale data, demonstrates how Parquet’s columnar, typed, and compressed format dramatically reduces storage, speeds up reads, and improves data safety, and provides step‑by‑step Python code for migrating and benchmarking the switch.

CSV · DuckDB · Parquet
0 likes · 18 min read
Data STUDIO
Nov 12, 2025 · Databases

7 Reusable DuckDB SQL Patterns for Fast Local Data Analysis

This article presents seven practical DuckDB SQL patterns—querying files directly, treating partition folders as tables, deduplicating with QUALIFY, computing rolling metrics with window functions, pivot/unpivot, handling JSON arrays, and exporting results to Parquet—plus tips and a mini case study that show how to turn a notebook into a lightweight OLAP engine without leaving the Python environment.

DuckDB · JSON · Parquet
0 likes · 12 min read
Aikesheng Open Source Community
May 26, 2025 · Information Security

Building a Minimal Offline MySQL CVE Scanning Tool with DuckDB

This article demonstrates how to create a lightweight, offline MySQL CVE scanning and reporting tool by cloning the CVEProject JSON repository, leveraging DuckDB for fast JSON parsing and SQL queries, and providing simple shell commands for data updates, version scanning, and vulnerability matching.

CVE · DuckDB · ShellScript
0 likes · 6 min read
Volcano Engine Developer Services
Mar 5, 2025 · Artificial Intelligence

How DeepSeek Smallpond Powers AI Data Processing with Ray and DuckDB

This article introduces DeepSeek Smallpond, a lightweight yet high‑performance AI data‑processing engine built on Ray and DuckDB, explains its dual DataFrame and LogicalPlan APIs, showcases integration with Volcano Engine's AI Data Lake LAS, and provides practical code examples for distributed processing, multimodal storage, and RAG pipelines.

AI data processing · Data Lake · DuckDB
0 likes · 18 min read
Alibaba Cloud Developer
Feb 27, 2025 · Databases

Boosting PostgreSQL Analytics with DuckDB: Architecture, Optimizations, and Performance Gains

This article explains how integrating DuckDB as an extension for RDS PostgreSQL creates a unified HTAP solution that dramatically accelerates complex analytical queries through columnar storage, vectorized execution, and advanced optimizer techniques, delivering up to hundreds‑fold performance improvements and superior compression.

Analytics · Columnar · Database Optimization
0 likes · 11 min read
dbaplus Community
Sep 5, 2024 · Databases

How to Migrate Data from MongoDB to MySQL Using DuckDB

This guide explains how to export MongoDB collections to JSON, load them into DuckDB, generate compatible table schemas, and then transfer the data efficiently into MySQL using DuckDB as an intermediate processing engine.

Data Migration · DuckDB · ETL
0 likes · 6 min read
Laravel Tech Community
Nov 16, 2022 · Databases

DuckDB New Release Highlights and Feature Changes

The article introduces DuckDB, a high‑performance embedded analytical database, outlines its new release’s storage, performance, and memory improvements, describes its C/C++ integration and build process, and lists key feature changes such as parallel execution, novel compression methods, and enhanced SQL capabilities.

Analytical Database · DuckDB · Embedded Database
0 likes · 3 min read