Tagged articles
1 article
Alibaba Cloud Developer
Oct 29, 2024 · Big Data

How We Scaled Billion‑Image Asset Ingestion with Dataworks: Lessons & Tricks

Faced with importing billions of image assets, we redesigned the pipeline around the Dataworks open‑API, clustered tables, data sharding, cube tables, and custom key generation. The redesign gave us faster parallel processing, fault tolerance, and flexible attribute storage. We also share practical insights on scheduling, view parameterization, and output services.

Image Processing · cube table · data ingestion
18 min read