Cloud Native 14 min read

How Tencent Meeting Scaled Rapidly with Cloud‑Native TKE: Architecture & Practices

This article explains how Tencent Meeting leveraged Tencent Cloud’s Kubernetes Engine (TKE) and a suite of cloud‑native extensions—dynamic routing, fixed networking, parallel scaling, and controlled batch upgrades—to achieve rapid, reliable version iteration and massive capacity growth during the COVID‑19 surge.

Tencent Tech
Tencent Tech
Tencent Tech
How Tencent Meeting Scaled Rapidly with Cloud‑Native TKE: Architecture & Practices

Background

Due to the COVID‑19 pandemic, online office applications surged. Tencent Meeting, an enterprise‑grade video‑conferencing product, experienced explosive user growth and needed fast, reliable version upgrades. In 40 days it delivered 14 major releases, illustrating a rapid‑iteration model.

Why Cloud‑Native TKE?

Tencent Cloud Container Service (TKE) offers a stable, secure, high‑performance Kubernetes platform with extensions such as cloud disks, load balancers, and custom operators. It solves environment‑consistency challenges, simplifies large‑scale cluster management, and reduces cost.

Solution Overview

Using TKE, the team built a management platform that adds dynamic routing, fixed networking, elastic scaling, and controllable upgrades. The platform runs on Tencent Cloud CVM, CBS, CLB, VPC and extends TKE via operators for routing (L5‑controller), network (IPAMD), resource management, and auto‑scaling.

Management platform architecture
Management platform architecture

Key Technical Areas

1. Quality Assurance

Ensuring service stability through dynamic routing, anomaly detection, parallel scaling, and business migration.

1.1 Dynamic Routing

Implemented via a custom L5‑controller that watches service changes, triggers callbacks, and updates routes. A primary fast path guarantees second‑level propagation, while a secondary consistency check reconciles routes every minute.

Dynamic routing diagram
Dynamic routing diagram

1.2 Health Checks

livenessProbe

readinessProbe

postStart

preStop

terminationGracePeriodSeconds

These probes isolate unhealthy pods and protect traffic.

1.3 Parallel Scaling

Extended HPA to run detection at high concurrency and allow custom calculation periods down to seconds, enabling immediate scaling under burst load.

1.4 Business Migration

Helm deployment

Namespace packaging and copy

Namespace reuse with custom adjustments

2. Efficiency

Automation of cluster creation, environment initialization, and console integration; CI/CD pipelines across multiple networks; centralized CMDB‑based authorization; and enhanced elastic scaling.

2.1 Process Automation

Cluster provisioning via API

Environment setup scripts

Console integration for batch operations

2.2 CI/CD

All internal services, including Tencent Meeting, use a unified CI/CD workflow that supports cross‑environment deployments.

2.3 Authorization Management

CMDB controller synchronizes pod‑product relationships, enabling automatic credential provisioning.

CMDB synchronization
CMDB synchronization

2.4 Elastic Scaling

Multi‑metric support (Metrics Server, Prometheus, custom)

Parallel workers for real‑time scaling

Customizable thresholds and periods

Elastic scaling architecture
Elastic scaling architecture

3. Controlled Upgrade

Four extensions ensure reliable upgrades: fixed networking, lightweight upgrade, batch rollout, and capacity guarantees.

3.1 Fixed Networking

Pods retain IP after recreation, supporting IP‑based authentication and whitelist scenarios. Implemented via IPAMD Controller, TKE‑ENI‑AGENT, and CNI.

Fixed networking components
Fixed networking components

3.2 StatefulsetPlus Operator

Custom workload extending StatefulSet with fixed IP, gray‑scale batch updates, and in‑place upgrades.

StatefulsetPlus architecture
StatefulsetPlus architecture

3.3 Batch Upgrade

Operators allow pre‑defined batches, manual or automatic progression, rollback, and per‑batch health checks, enabling controlled, region‑aware rollouts.

Batch upgrade flow
Batch upgrade flow

Conclusion

By leveraging cloud‑native technologies on TKE, Tencent Meeting achieved rapid, reliable version iteration and massive scaling, demonstrating the critical role of container platforms in supporting billions of users.

CI/CDcloud-nativeKubernetesnetworkingScalingtke
Tencent Tech
Written by

Tencent Tech

Tencent's official tech account. Delivering quality technical content to serve developers.

0 followers
Reader feedback

How this landed with the community

login Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.