Master Database Migration to Cloud: Challenges & Solutions with Baidu DTS
This article examines the rapid growth of China's database market, the technical hurdles of moving databases to public cloud—including engine selection, lengthy migration processes, efficiency, disaster recovery, and data consistency—and explains how Baidu Intelligent Cloud's DTS platform offers a smooth, reliable, high‑availability, and high‑performance one‑stop solution with real‑world use cases.
1 Database Migration Challenges
According to the 2023 Database Development Research Report, China's database market grew from 40 billion RMB last year to an expected 54 billion RMB this year and may reach 128 billion RMB by 2027, with a compound annual growth rate of 26%.
The market is shifting toward public‑cloud databases, which now account for about 60% of the total and grow at roughly 50% annually, far outpacing on‑premises deployments.
Key technical challenges of cloud migration include:
Choosing the right cloud engine, architecture, and pricing plan while ensuring compatibility and business‑specific tuning.
A lengthy migration workflow that must handle network bridging, full data transfer, incremental sync, and final cut‑over.
Maintaining efficiency and disaster‑recovery capabilities throughout the process.
Guaranteeing data consistency, as any loss or duplication can severely impact online services.
2 Baidu Intelligent Cloud DTS Solution
Baidu emphasizes that smoothness (usability, compatibility, minimal business impact) and reliability (consistency, rollback, high availability) are core customer requirements.
2.1 Smooth Migration
Usability: out‑of‑the‑box, fully managed migration workflow.
Compatibility: supports heterogeneous source and target engines, architectures, versions, and network environments.
Business impact: enables nonstop migration of accounts, structures, full and incremental data.
2.2 Reliable Migration
Consistency: ensures source and target data are identical and provides verification tools.
Rollback: one‑click reverse sync restores cloud‑down environments for disaster recovery.
High availability: checkpointing, real‑time disaster recovery, and automatic failover during primary‑standby switches.
2.3 Migration Process
DTS divides migration into three phases: pre‑migration (assessment and selection), migration (full‑load, incremental sync, cut‑over), and post‑migration (intelligent dashboard for monitoring and tuning).
Pre‑migration evaluates engine choices, performance, and network connectivity (public network, dedicated line, VPN, or cloud‑native options). Migration moves accounts/roles, schema, full data, and incremental changes without downtime, then validates consistency before traffic cut‑over. After cut‑over, DTS can reverse‑sync traffic back to the on‑premise environment if needed.
2.4 Data Transfer Architecture
The data plane follows an ETL plug‑in design:
Extract (E) : supports full‑load, incremental CDC, and parallel table sharding.
Transform (T) : normalizes source data to a unified abstract format, applies filtering, mapping, and protocol conversion.
Load (L) : parallel batch loading with table‑ or primary‑key‑level concurrency, preserving order for incremental streams.
Consistency verification uses a two‑round recheck to reduce false positives caused by millisecond‑level CDC lag.
2.5 Exactly‑Once Guarantees
At‑Least‑Once guarantees no data loss via monotonic version tracking. Exactly‑Once is achieved by leveraging target‑side unique constraints, UUID tagging for message queues, or command rewriting for key‑value stores.
2.6 High Availability & Performance
Checkpoint‑based resume ensures no data loss across restarts.
Real‑time disaster recovery with automatic split‑brain detection and self‑healing.
Switch‑over self‑healing automatically discovers new nodes and resumes transfer.
High throughput: parallel full‑load, table/primary‑key parallel conversion, batch INSERT merging, and network latency amortization.
Low latency: CDC‑based incremental capture, streaming pipeline, message‑queue buffering, and logical transaction merging.
MySQL benchmarks show up to 200 k rows/s full‑load and 10 k rows/s incremental with millisecond‑level delay.
3 Use Cases
Three real‑world deployments illustrate DTS capabilities:
A large video‑streaming platform migrated over 1.5 W clusters (MySQL, Redis, MongoDB) with nonstop migration, self‑healing, and sub‑4‑day per‑cluster cycles.
A major commercial bank achieved zero‑RPO, sub‑30‑second RTO disaster recovery for 64‑shard workloads with 500 k+ TPS and minute‑level sync latency.
A large language‑model service realized second‑level sync for model traces and real‑time analytics, supporting 470+ continuous tasks.
DTS supports eight typical scenarios: nonstop cloud migration, active‑active, multi‑cloud DR, event‑driven, migration for domestic‑innovation platforms, cache updates, real‑time analytics, and real‑time lake/warehouse ingestion.
Baidu Intelligent Cloud Tech Hub
We share the cloud tech topics you care about. Feel free to leave a message and tell us what you'd like to learn.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
