Tagged articles
32 articles
DataFunSummit
Mar 24, 2026 · Industry Insights

How DataWorks Is Transforming Big Data Development with AI Agents

The article outlines DataWorks' evolution from a decade‑long big‑data governance platform to an AI‑driven Copilot and autonomous Agent system, detailing its technical foundations, tool‑adaptation layer, context engineering, security safeguards, and future vision of a professional, open, and intelligent big‑data development ecosystem.

AI Copilot, Big Data, DataWorks
13 min read
Alibaba Cloud Developer
Mar 6, 2026 · Big Data

How DataWorks Turns Data Quality Rules into Code with Data Contracts

This article explains how DataWorks integrates data quality specifications directly into the SQL development workflow using Data Contracts, addressing governance lag, versioning gaps, and trust issues while providing a unified, version‑controlled, and automated quality assurance process for offline data pipelines.

Data Quality, DataWorks, YAML
12 min read
Alibaba Cloud Big Data AI Platform
Aug 13, 2025 · Big Data

How ODPS Evolved Over 15 Years into a Next‑Gen AI‑Ready Big Data Platform

This article chronicles ODPS's 15‑year journey from its exploratory beginnings to a modern, AI‑enabled big data platform, detailing its four development phases, architectural layers, SQL engine upgrades, real‑time processing, lakehouse integration, and the new Data+AI capabilities offered by MaxCompute and DataWorks.

AI integration, Big Data, DataWorks
12 min read
Alibaba Cloud Big Data AI Platform
Feb 24, 2025 · Artificial Intelligence

Unlock Data+AI Fusion: Fine‑Tune Multimodal Models on DataWorks with GPU‑Ready Notebooks

This tutorial shows how to use Alibaba Cloud DataWorks' serverless GPU resource groups together with the open‑source LLaMA‑Factory framework to fine‑tune the Qwen2‑VL‑2B multimodal model for tourism‑domain Q&A, covering environment setup, dataset preparation, parameter configuration, training, and interactive inference.

DataWorks, GPU, LLaMA-Factory
10 min read
Alibaba Cloud Developer
Jan 24, 2025 · Big Data

Master DataWorks Notebook: Interactive SQL & Python for Big Data Development

This guide walks you through setting up a personal DataWorks Notebook, performing interactive SQL development with engines like MaxCompute, creating Python visualizations, building ipywidgets for dynamic queries, and leveraging the AI‑powered Copilot to rewrite, explain, and comment code, all within a unified big‑data platform.

Big Data, Copilot, DataWorks
9 min read
Alibaba Cloud Big Data AI Platform
Jan 13, 2025 · Artificial Intelligence

Build a Kaggle House‑Price Prediction Pipeline with DataWorks

This guide walks you through setting up Alibaba Cloud DataWorks, creating a workspace and personal development environment, and importing a Kaggle house‑price prediction notebook to perform data loading, cleaning, feature engineering, model training, and evaluation—all without writing code from scratch.

DataWorks, Kaggle, Tutorial
6 min read
DataFunSummit
Nov 15, 2023 · Big Data

Alibaba Cloud DataWorks Intelligent Data Modeling: Practices and Insights

This article introduces Alibaba Cloud DataWorks' intelligent data modeling tool, outlines the data demand flow, shares best practices and practical demonstrations of data warehouse modeling, discusses model application and data asset management, and answers common questions while highlighting its commercial availability.

AlibabaCloud, DataGovernance, DataWarehouse
12 min read
DataFunTalk
Oct 7, 2023 · Big Data

Alibaba DataWorks Data Stability Governance: Challenges, Solutions, and Practices

This article presents Alibaba's experience in addressing large‑scale data stability challenges by outlining common problems, governance principles, baseline monitoring, team collaboration methods, practical implementations, and proactive measures to ensure reliable and accurate data production on the DataWorks platform.

Alibaba, Big Data, Data Governance
12 min read
DataFunTalk
Jul 26, 2023 · Big Data

Data Model Governance Practices at Taobao (Alibaba)

This article presents a comprehensive case study of Taobao's data model governance, detailing the background challenges, the four‑pillar solution framework, specific governance practices such as invalid table decommissioning, data handover, public layer operations, incremental control, productization, future plans, and a Q&A session.

Alibaba, DataWorks, metadata
26 min read
Alibaba Cloud Big Data AI Platform
Feb 20, 2023 · Big Data

How Alibaba’s DataWorks Transforms Data Governance for Efficiency, Security, and Cost Savings

This article explores Alibaba's DataWorks platform and its comprehensive data governance practices, covering application efficiency, security controls, cost optimization, organizational structure, and cultural initiatives that together enable scalable, secure, and cost‑effective data management across the enterprise.

Big Data, Cost Optimization, Data Governance
31 min read
Data Thinking Notes
Jan 12, 2023 · Big Data

Mastering Alibaba DataWorks: Data Warehouse Architecture & Modeling Guide

This comprehensive tutorial walks you through Alibaba DataWorks' data warehouse architecture, covering technical stack selection, three‑layer warehouse design (ODS, CDM, ADS), detailed data modeling with DDL examples, storage strategies, dimension and fact table conventions, and best‑practice hierarchical call standards.

DataModeling, DataWarehouse, DataWorks
27 min read
DataFunSummit
Aug 19, 2022 · Big Data

Taobao Data Model Governance: Challenges, Analysis, and Solutions

This article presents a comprehensive overview of Taobao's data model governance, detailing the background and problems of the current data architecture, analyzing root causes, proposing a structured governance framework with DataWorks automation, and outlining future plans to improve efficiency, standardization, and product tooling.

Alibaba, Big Data, Data Governance
13 min read
Alibaba Cloud Big Data AI Platform
Jul 26, 2022 · Big Data

How Alibaba’s Big Data Model Governance Boosted Efficiency and Cut Costs

This report details Alibaba’s large‑scale data model governance initiative for the DaTao ecosystem, analyzing current data issues such as naming inconsistencies, low reuse, and application‑layer inefficiencies, and presents a comprehensive solution—including a model evaluation system, DataWorks co‑development, intelligent modeling, data map enhancements, and future roadmap—to improve data health, reduce costs, and increase operational efficiency.

Big Data, Data Governance, DataWorks
15 min read
DataFunTalk
Jul 25, 2022 · Big Data

Taobao Data Model Governance and Intelligent Modeling with DataWorks

This article summarizes Guo Jinshi's presentation on Taobao's data model governance, covering the current data landscape, identified problems, analysis of root causes, proposed governance solutions—including DataWorks intelligent modeling—and future plans, while also providing a Q&A session on practical implementation.

Alibaba, Big Data, Data Governance
13 min read
DeWu Technology
Jun 13, 2022 · Operations

How to Build a Minute‑Level Order Fulfillment Simulation Platform with DataWorks

This article outlines the design and implementation of a minute‑level order‑fulfillment timeliness simulation platform, detailing its background, objectives, challenges, architecture built on Alibaba Cloud DataWorks, core workflow nodes, domain model, ER diagram, JSON task templates, and future extensions for supply‑chain routing.

Big Data, DataWorks, architecture
11 min read
DaTaobao Tech
May 13, 2022 · Big Data

Taobao Big Data Model Governance and DataWorks Co‑development

Taobao’s rapidly expanding technical data system faced naming inconsistencies, low table reuse, and costly, inefficient data usage, prompting a joint effort with DataWorks to digitize model evaluation, enforce standardized governance, deliver intelligent end‑to‑end modeling tools, and launch a development assistant, resulting in a health‑monitoring dashboard, upgraded data maps, and a roadmap for further automation and architecture refinement.

Big Data, Data Governance, Data Platform
12 min read
Alibaba Cloud Developer
Apr 7, 2022 · Big Data

How Alibaba’s Big Data Model Governance Boosted Efficiency and Cut Costs

This article details Alibaba's large‑scale data model governance initiative, analyzing current data issues, presenting a comprehensive solution—including model digitization, public model sinking, productization, daily governance, and search‑enhancement—and outlining achieved results and future plans to further improve data quality, reuse, and operational efficiency.

Data Governance, DataWorks, Model Scoring
12 min read
BaiPing Technology
Mar 14, 2022 · Big Data

Mastering DataWorks & MaxCompute: A Complete Guide to Big Data Architecture and Governance

DataWorks, Alibaba Cloud’s comprehensive PaaS platform, combined with the serverless MaxCompute data warehouse, offers an integrated solution for data integration, development, quality, and services, while detailed naming and layer conventions ensure scalable, maintainable big‑data architectures and effective governance across ODS, CDM, DWD, DWS, and ADS layers.

Big Data, Data Governance, DataWorks
8 min read
DataFunTalk
Jan 22, 2022 · Big Data

Alibaba Cloud Data Integration (DataX) Architecture, Design Principles, and Solution Overview

This presentation details Alibaba Cloud DataWorks Data Integration (DataX), covering its architecture, core design principles, offline and real‑time synchronization mechanisms, deployment modes, product positioning, use‑case scenarios, and its role within the broader DataWorks ecosystem, highlighting its capabilities for large‑scale data movement and processing.

Alibaba Cloud, Big Data, Data Integration
19 min read
Sohu Tech Products
Apr 7, 2021 · Big Data

Data Warehouse Architecture and Modeling with Alibaba MaxCompute and DataWorks

This tutorial explains how to select a technical architecture, design a three‑layer data warehouse (ODS, CDM, ADS), model tables and dimensions, choose storage strategies, handle slowly changing dimensions, synchronize data with DataWorks, and implement dimensional modeling and fact tables using Alibaba MaxCompute for big‑data analytics.

Big Data, DataWorks, MaxCompute
32 min read
Alibaba Cloud Developer
Dec 7, 2020 · Big Data

How to Build a New‑Retail Data Middle Platform with DataWorks

This article explains how new‑retail companies can design and implement a data middle platform using Alibaba Cloud's DataWorks, covering business model analysis, technical architecture, layer‑by‑layer data modeling, governance, security, and the concrete benefits of turning raw data into actionable business insights.

Big Data Architecture, Data Governance, Data Middle Platform
28 min read
Alibaba Cloud Developer
Feb 26, 2020 · Cloud Native

Reinventing DataWorks: How Microservices and Cloud‑Native Architecture Solve Legacy Pain Points

This article examines the long‑standing technical and operational challenges of Alibaba's DataWorks platform—such as heavy legacy baggage, complex environments, tight coupling, and frequent releases—and explains how adopting a cloud‑native microservice architecture, service mesh, and DevOps practices can transform the platform into a flexible, scalable, and future‑proof data development ecosystem.

DataWorks, DevOps, Microservices
31 min read