Tagged articles

FlinkSQL

7 articles · Page 1 of 1

Aug 20, 2024 · Big Data

How FlinkSQL Optimizations Cut CPU Usage by Up to 60% in Streaming Jobs

This article details the FlinkSQL performance enhancements implemented by the streaming team, covering view reuse, redundant shuffle removal, multiple‑input operator redesign, long sliding‑window optimizations, and native JSON format improvements, which together deliver up to 60% CPU savings and massive core‑hour reductions.

Big DataCPU ReductionFlinkSQL

0 likes · 13 min read

How FlinkSQL Optimizations Cut CPU Usage by Up to 60% in Streaming Jobs

DataFunSummit

May 10, 2023 · Big Data

Field-Level Data Lineage Extraction for FlinkSQL Using Apache Calcite

This article explains how to derive field‑level data lineage for FlinkSQL by leveraging Apache Calcite, covering the Calcite framework, FlinkSQL execution stages, the three‑step parsing approach, core source code details, practical Insert/Join examples, and extensions for lookup joins and UDTFs.

Apache CalciteFlinkSQLRelNode

0 likes · 12 min read

Field-Level Data Lineage Extraction for FlinkSQL Using Apache Calcite

Bilibili Tech

Nov 4, 2022 · Big Data

Advancements and Optimizations of FlinkSQL at Bilibili

Bilibili’s FlinkSQL team has enhanced the Flink engine—now based on 1.11 with back‑ported 1.15 features—by adding Delay‑Join, table‑valued functions, projection‑push‑down, UDF and object reuse, automatic mini‑batch/two‑phase aggregation, key‑group skew fixes, connector slot‑groups, real‑time projection with Hudi, and RocksDB state‑performance tweaks, while planning remote state backends and deeper stream‑batch integration.

FlinkSQLPerformance OptimizationReal-time Projection

0 likes · 29 min read

Advancements and Optimizations of FlinkSQL at Bilibili

Big Data Technology Architecture

Nov 2, 2021 · Big Data

Comprehensive Guide to FlinkSQL and Table API: Background, Dependencies, Planners, and Usage

This article provides a detailed introduction to FlinkSQL, covering its background, the Table API, required dependencies, differences between old and Blink planners, various API usage patterns, connector configurations for CSV, Kafka, Elasticsearch, MySQL, and how to convert between DataStream and Table in Flink's unified batch‑stream processing model.

ConnectorDataStreamFlinkSQL

0 likes · 23 min read

Comprehensive Guide to FlinkSQL and Table API: Background, Dependencies, Planners, and Usage

360 Tech Engineering

Jun 25, 2021 · Big Data

Introducing ULTRON: A Real‑Time Data Warehouse Platform Powered by FlinkSQL

ULTRON is a one‑stop real‑time data‑warehouse development platform built on FlinkSQL that unifies data integration, asset management, cluster deployment, modeling, ETL, OLAP analysis and governance, addressing the limitations of traditional batch‑oriented warehouses and simplifying streaming data workflows for developers.

Data GovernanceFlinkSQLReal-Time Data Warehouse

0 likes · 13 min read

Introducing ULTRON: A Real‑Time Data Warehouse Platform Powered by FlinkSQL

Youzan Coder

Jun 23, 2021 · Industry Insights

How Youzan Built a Real‑Time Data Warehouse: Architecture, ETL, and Use Cases

This article details Youzan's end‑to‑end experience building a real‑time data warehouse, covering background, application scenarios, layered architecture, streaming ETL techniques, data validation, recovery processes, and three production projects that illustrate the solution in action.

ClickHouseData ValidationDruid

0 likes · 18 min read

How Youzan Built a Real‑Time Data Warehouse: Architecture, ETL, and Use Cases

Big Data Technology & Architecture

Apr 29, 2019 · Big Data

Understanding Retract Updates in FlinkSQL: Append vs Retract Modes

FlinkSQL's retract updates allow handling of data modifications in streaming queries by using toRetractStream, contrasting with the append-only toAppendStream mode, and this article explains the differences, when each mode applies, and provides illustrative examples and visual diagrams.

Append ModeBig DataFlinkSQL

0 likes · 3 min read

Understanding Retract Updates in FlinkSQL: Append vs Retract Modes