Tagged articles

big data integration

6 articles · Page 1 of 1
Programmer XiaoFu
Programmer XiaoFu
Jun 18, 2025 · Big Data

How DataX Boosts Data‑Sync Speed by 200% Across Heterogeneous Sources

This article walks through the challenges of synchronizing 50 million rows between disparate MySQL databases, explains why traditional mysqldump or file‑based methods fail, and then details how the open‑source DataX tool—its 3.0 framework, installation steps, job architecture, and JSON‑based configurations—enables fast full and incremental data transfers with concrete performance metrics.

Data synchronizationDataXMySQL
0 likes · 14 min read
How DataX Boosts Data‑Sync Speed by 200% Across Heterogeneous Sources
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 26, 2024 · Artificial Intelligence

How Alibaba Cloud’s PAI Tackles Large‑Model Training and Inference Challenges in 2024

At the 2024 Yunqi Conference, Alibaba Cloud’s AI Infra experts detailed the latest challenges of large‑model deployment—such as hardware costs, resource management, and software‑hardware coordination—and introduced PAI’s new capabilities, including stability tools, automated distributed training, reinforcement‑learning frameworks, inference optimizations, and integrated big‑data AI solutions.

AI InfraInference Optimizationbig data integration
0 likes · 14 min read
How Alibaba Cloud’s PAI Tackles Large‑Model Training and Inference Challenges in 2024
iQIYI Technical Product Team
iQIYI Technical Product Team
May 31, 2024 · Artificial Intelligence

How Opal Turns iQIYI’s ML Workflow into a Unified AI Platform

Opal is iQIYI's end‑to‑end machine‑learning platform that integrates feature production, sample construction, model training, and deployment with big‑data services, addressing duplicated effort, weak data processing, and fragmented pipelines to boost efficiency across recommendation, advertising, and risk‑control scenarios.

AI OperationsData ValidationMachine Learning Platform
0 likes · 19 min read
How Opal Turns iQIYI’s ML Workflow into a Unified AI Platform

Database Independence Migration for Yanxuan Trading System: Architecture Evolution and Implementation

Yanxuan migrated its monolithic trading system from a shared DDB cluster to an independent database by using Netease Data Canal for real‑time sync, a write‑stop switch with Pandora middleware, account and permission isolation, and extensive testing across three phases to ensure data consistency and minimal business impact.

Data ConsistencyEnterprise DatabaseTransaction Management
0 likes · 15 min read
Database Independence Migration for Yanxuan Trading System: Architecture Evolution and Implementation
Tencent Tech
Tencent Tech
Jun 23, 2022 · Big Data

Why Apache InLong’s Graduation Marks a New Era for Big Data Integration

Apache InLong, originally contributed by Tencent, has graduated to an Apache top‑level project, offering a one‑stop framework for petabyte‑scale data ingestion, processing, and reliable streaming, and is now widely adopted across advertising, payment, social, gaming, and AI industries.

Data StreamingInLongTencent
0 likes · 5 min read
Why Apache InLong’s Graduation Marks a New Era for Big Data Integration
DataFunSummit
DataFunSummit
Feb 4, 2021 · Artificial Intelligence

Full-Stack Machine Learning Platform: Architecture, Key Factors, and Implementation Details

This article examines the evolution of user data, computing power, and models, and presents the design principles, key architectural factors, and practical implementation techniques for building a full‑stack machine learning platform that supports large‑scale data processing, distributed training, and low‑latency online serving.

Machine Learning PlatformResource Schedulingbig data integration
0 likes · 15 min read
Full-Stack Machine Learning Platform: Architecture, Key Factors, and Implementation Details