Big Data 22 min read

Upgrading Real-Time Computing Engine from Flink 1.10 to 1.13: Practices and Challenges

Youzan upgraded its real‑time computing engine from Flink 1.10 to 1.13 to meet rising SQL and containerization demands, gaining enhanced SQL syntax, time‑function handling, Window TVF standardization, Hive integration, K8s stability, elastic scaling, richer Kafka and format support, improved metrics and debugging tools, and successfully migrated all custom connectors, UDFs, and SQL jobs to the new Kubernetes‑based platform.

Youzan Coder
Youzan Coder
Youzan Coder
Upgrading Real-Time Computing Engine from Flink 1.10 to 1.13: Practices and Challenges

This article details the process of upgrading the real-time computing engine from Flink 1.10 to Flink 1.13 at Youzan. With the increasing demand for SQL capabilities and the need for containerization, the upgrade was necessary to leverage community support for on K8S capabilities and enhance the engine's functionality.

The article begins by outlining the background and motivation for the upgrade, highlighting the growing demand for SQL capabilities and the need for containerization. It then discusses the benefits of upgrading to Flink 1.13, including improvements in SQL syntax, time zone and time function optimizations, support for Window TVF syntax standardization, and enhanced Flink on Hive capabilities.

The article also covers the benefits of Flink on K8S, such as improved stability and maturity, real-time offline elastic scaling, and enhanced state retention and recovery features. It further discusses the support for upsert Kafka and richer format formats, as well as other benefits like improved memory metrics, backpressure visualization, CPU flame graph viewing, and historical exception support in the Web UI.

The article then delves into the practical aspects of the upgrade process, including the upgrade of custom connectors, UDF compatibility, SQL syntax conversion, and smooth task migration. It provides detailed examples and code snippets to illustrate the challenges and solutions encountered during the upgrade.

Finally, the article concludes by summarizing the successful upgrade of the real-time computing engine to Flink 1.13 and the migration of all Flink SQL tasks to the K8S cluster. It emphasizes the importance of SQL simplification and feature enhancement for future business scenarios and the readiness for elastic scaling.

Performance Optimizationbig dataFlinkSQLContainerizationUpgradereal-time computingK8s
Youzan Coder
Written by

Youzan Coder

Official Youzan tech channel, delivering technical insights and occasional daily updates from the Youzan tech team.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.