Author

Alibaba Cloud Big Data AI Platform

The Alibaba Cloud Big Data AI Platform builds on Alibaba’s leading cloud infrastructure, big‑data and AI engineering capabilities, scenario algorithms, and extensive industry experience to offer enterprises and developers a one‑stop, cloud‑native big‑data and AI capability suite. It boosts AI development efficiency, enables large‑scale AI deployment across industries, and drives business value.

465

Articles

Likes

1.1k

Views

Comments

Latest from Alibaba Cloud Big Data AI Platform

100 recent articles max

Alibaba Cloud Big Data AI Platform

Feb 2, 2026 · Big Data

Real‑Time Analytics with Alibaba Cloud Serverless Spark & Paimon for Taobao Flash Sale

This article details how Alibaba Cloud EMR Serverless Spark combined with the Paimon lakehouse framework enables Taobao Flash Sale’s retail data team to achieve low‑latency, high‑throughput real‑time analytics, batch processing, and feature generation, outlining architecture evolution, performance gains, and practical Spark tuning techniques.

LakehousePaimonPerformance Tuning

0 likes · 18 min read

Real‑Time Analytics with Alibaba Cloud Serverless Spark & Paimon for Taobao Flash Sale

Alibaba Cloud Big Data AI Platform

Feb 2, 2026 · Big Data

How We Built a Scalable Lakehouse Architecture with StarRocks, Paimon, and Flink

This article details the evolution of a data warehouse at RenliJia from a MaxCompute‑centric setup to a modern lakehouse using StarRocks, Paimon, Flink, and Fluss, describing design goals, technical evaluations, implementation steps for offline, OLAP, and real‑time workloads, and the challenges and future plans that emerged.

FlinkLakehousePaimon

0 likes · 25 min read

How We Built a Scalable Lakehouse Architecture with StarRocks, Paimon, and Flink

Alibaba Cloud Big Data AI Platform

Jan 29, 2026 · Cloud Native

How Alibaba Cloud’s MaxCompute Powers Multi‑Modal AI Data Processing for MOSI Intelligence

In the era of rapid AI advancement, MOSI Intelligence faced IDC storage, compute, and network bottlenecks for large‑scale audio‑video pipelines, prompting a partnership with Alibaba Cloud to build a cloud‑native, one‑stop multi‑modal data processing platform using MaxCompute and the custom MaxFrame engine, dramatically improving performance and operational efficiency.

AI Data PlatformCloud NativeMaxCompute

0 likes · 8 min read

How Alibaba Cloud’s MaxCompute Powers Multi‑Modal AI Data Processing for MOSI Intelligence

Alibaba Cloud Big Data AI Platform

Jan 23, 2026 · Backend Development

Why Elasticsearch’s 10,000 Hit Limit Slows Your Cluster and How to Fix It

Elasticsearch defaults to a total hit count of 10,000 after version 7.x, which many developers override with "track_total_hits": true to get exact numbers, but this seemingly harmless change can double CPU usage and increase query latency from 20 ms to 500 ms due to the underlying Block‑Max WAND algorithm and its interaction with aggregations, sorting, and scoring.

Block-Max WANDElasticSearchPerformance

0 likes · 11 min read

Why Elasticsearch’s 10,000 Hit Limit Slows Your Cluster and How to Fix It

Alibaba Cloud Big Data AI Platform

Jan 19, 2026 · Databases

How Hologres Dynamic Table Accelerates Billion‑Row Data Refreshes

The article explains how Hologres Dynamic Table, a cloud‑native materialized‑view‑like feature, supports full and incremental refresh modes, enables minute‑level data freshness for billion‑row price tables, and provides join, aggregation, and partition capabilities while outlining its architecture, limitations, and real‑world performance gains.

Dynamic TableHologresIncremental Refresh

0 likes · 8 min read

How Hologres Dynamic Table Accelerates Billion‑Row Data Refreshes

Alibaba Cloud Big Data AI Platform

Jan 8, 2026 · Big Data

How Gaode Maps Built a Real‑Time Lakehouse for Billion‑Scale Trajectory Data

This article details Gaode Maps' end‑to‑end lakehouse solution for massive, high‑frequency trajectory data, covering the challenges of real‑time visibility, query performance, and storage cost, and explaining how a hot‑warm‑cold tiering architecture built on Apache Flink, Paimon, StarRocks, Redis and Lindorm delivers millisecond‑level queries while cutting storage expenses.

Apache FlinkApache PaimonData Tiering

0 likes · 19 min read

How Gaode Maps Built a Real‑Time Lakehouse for Billion‑Scale Trajectory Data

Alibaba Cloud Big Data AI Platform

Jan 5, 2026 · Big Data

How Xunlei Boosted Data Processing with Alibaba Cloud EMR Serverless Spark

This article details Xunlei's migration from a fixed Hadoop cluster to Alibaba Cloud EMR Serverless Spark, outlining the platform's background, pain points, technical upgrade goals, serverless capabilities, archive data access methods, Kyuubi integration, and the resulting business and technical benefits.

Cloud ComputingEMRKyuubi

0 likes · 11 min read

How Xunlei Boosted Data Processing with Alibaba Cloud EMR Serverless Spark

Alibaba Cloud Big Data AI Platform

Dec 31, 2025 · Big Data

Build a Scalable AI Data Pipeline Using DataWorks, MaxCompute & MaxFrame

This guide walks you through setting up a secure, elastic, and high‑performance AI data processing platform on Alibaba Cloud by combining DataWorks, MaxCompute, and MaxFrame, covering the four essential steps, code examples, best‑practice tips, and common troubleshooting advice.

AICloud ComputingDataWorks

0 likes · 10 min read

Build a Scalable AI Data Pipeline Using DataWorks, MaxCompute & MaxFrame

Alibaba Cloud Big Data AI Platform

Dec 30, 2025 · Big Data

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

StarRocks and Apache Paimon have been progressively integrated across multiple releases, enabling a unified lakehouse architecture that supports multi-source federated analysis, time-travel queries, native readers/writers, distributed planning, and advanced profiling, while delivering performance gains that bring Paimon query speed on par with native StarRocks tables.

Apache PaimonData IntegrationLakehouse

0 likes · 9 min read

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

Alibaba Cloud Big Data AI Platform

Dec 29, 2025 · Cloud Native

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

This case study details how a major internet visual platform consolidated its log, keyword, and vector search workloads onto Alibaba Cloud Elasticsearch, eliminating three separate pipelines, reducing write‑costs by 60%, cutting storage expenses over 60%, and achieving multi‑fold performance gains through serverless scaling, FalconSeek engine optimizations, and unified monitoring.

Cost OptimizationElasticSearchRAG

0 likes · 10 min read