Baidu Zhidao Cloud Migration Practice: From Legacy OXP to Cloud-Native Architecture

Baidu Zhidao migrated its 18-year-old Q&A platform from a legacy OXP architecture to a cloud-native stack built on Pandora containers and the Zhiyun platform. Despite complex legacy code, high traffic, and a zero-downtime requirement, the team completed a full traffic migration, sustained a 99.99% SLA, reduced latency, and gained elasticity and multi-region disaster recovery.

Baidu Geek Talk

This article details Baidu Zhidao's comprehensive cloud migration journey, addressing the challenges of migrating an 18-year-old Q&A platform to cloud-native infrastructure. The migration was driven by the deprecation of Baidu's internal PaaS platform (ORP) and the need for improved resource elasticity, cost optimization, and enhanced stability.

Key Challenges:

1. Legacy system complexity: Multiple business scenarios, outdated architecture, inconsistent code styles, and high refactoring costs

2. Rapid business iteration: 780+ business requirements delivered annually, all of which had to keep shipping while the migration proceeded with zero downtime

3. High traffic and stability requirements: Over 100 million daily page views (PV) with a 99.99% SLA target

4. Architecture evolution: Achieving multi-region disaster recovery capabilities

Migration Solution:

The team selected Pandora (Baidu's container platform) as the underlying infrastructure and the Zhiyun platform for deployment and operations. Pandora supports large-scale module deployments (up to 2K modules) without requiring significant code restructuring, which made it a good fit for the existing ODP monolithic architecture.

Traffic Switching Implementation:

The migration used Lua scripts in the access layer to split traffic, with splitting strategies that ramped the share of traffic sent to the cloud from 1% to 100%. The business layer was also modified to capture traffic markers, which enabled A/B testing between the cloud and legacy environments.
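
The percentage-based split described above can be sketched as follows. This is a minimal illustration in Python rather than the Lua the team actually ran in the access layer, and it assumes hash-based bucketing on a stable request key, which the article does not confirm. The header name `X-Migration-Target` and the functions `bucket_for`, `choose_target`, and `tag_request` are hypothetical names introduced only for this example.

```python
import zlib

BUCKETS = 100  # one bucket per percentage point

# Hypothetical header name; the article does not say how the marker is carried.
MARKER_HEADER = "X-Migration-Target"


def bucket_for(key: str) -> int:
    """Map a stable request key (e.g. a user or session ID) to a bucket 0..99."""
    return zlib.crc32(key.encode("utf-8")) % BUCKETS


def choose_target(key: str, cloud_percent: int) -> str:
    """Return "cloud" for keys whose bucket falls below cloud_percent, else "legacy"."""
    if not 0 <= cloud_percent <= 100:
        raise ValueError("cloud_percent must be between 0 and 100")
    return "cloud" if bucket_for(key) < cloud_percent else "legacy"


def tag_request(headers: dict, key: str, cloud_percent: int) -> dict:
    """Return a copy of the headers with the routing decision attached as a marker."""
    tagged = dict(headers)
    tagged[MARKER_HEADER] = choose_target(key, cloud_percent)
    return tagged


if __name__ == "__main__":
    # Walk one key through a staged ramp; once it lands on "cloud" it stays there.
    for percent in (1, 10, 50, 100):
        print(percent, tag_request({"Host": "zhidao.baidu.com"}, "user-42", percent))
```

Because the bucket depends only on the key, raising the percentage never moves a key back from cloud to legacy, which keeps A/B comparisons between the two environments stable during the ramp. The marker lets downstream business code tell which environment served a request.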

Architecture Evolution Results:

By March 2023, 100% of Zhidao's traffic had been migrated to the cloud. The platform then achieved three consecutive quarters of 99.99% SLA with zero migration-related issues. Core pages now run in four data centers across three regions, with traffic weighted North China : Central China : South China = 4:3:3, providing N+1 cross-region disaster recovery. End-to-end latency for the mini-program's core interfaces decreased by 12%.
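
To make the 4:3:3 weighting and N+1 claim concrete, the following sketch works out how traffic would be shared if one region failed. It assumes the failed region's traffic is redistributed to the surviving regions in proportion to their weights; the article does not describe the actual failover policy, and the function names here are hypothetical. It also treats each region as a single unit and ignores the four individual data centers.

```python
from fractions import Fraction

# Region weights taken from the article's 4:3:3 split.
WEIGHTS = {"north": 4, "central": 3, "south": 3}


def failover_shares(weights: dict, failed: str) -> dict:
    """Traffic share per surviving region, assuming proportional redistribution."""
    remaining = {r: w for r, w in weights.items() if r != failed}
    total = sum(remaining.values())
    return {r: Fraction(w, total) for r, w in remaining.items()}


def required_headroom(weights: dict) -> dict:
    """Worst-case load per region after any single-region failure, relative to normal load."""
    total = sum(weights.values())
    headroom = {}
    for region, weight in weights.items():
        normal = Fraction(weight, total)
        worst = max(
            failover_shares(weights, failed)[region]
            for failed in weights
            if failed != region
        )
        headroom[region] = worst / normal
    return headroom


if __name__ == "__main__":
    for failed in WEIGHTS:
        shares = failover_shares(WEIGHTS, failed)
        print(failed, "down:", {r: f"{float(s):.1%}" for r, s in shares.items()})
    print({r: f"{float(h):.2f}x" for r, h in required_headroom(WEIGHTS).items()})
```

Under these assumptions, losing North China leaves Central and South China carrying half of the traffic each, up from 30%, so each of those regions would need capacity for roughly 1.67 times its normal load to absorb the failure.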

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contact admin@besthub.dev and we will review it promptly.
