Tagged articles
35 articles
Page 1 of 1
Huolala Tech
Huolala Tech
Apr 22, 2026 · Backend Development

How a Platform‑Based Architecture Turns Logistics Fulfillment from Silos to Scalable Services

The article details a step‑by‑step transformation of a logistics fulfillment system from a duplicated, siloed design to a unified, plug‑in‑driven platform, explaining the underlying domain model, split‑and‑route strategies, state orchestration, code contracts, quantitative benefits, and future intelligent extensions.

MicroservicesScalable Systemslogistics fulfillment
0 likes · 15 min read
How a Platform‑Based Architecture Turns Logistics Fulfillment from Silos to Scalable Services
Ray's Galactic Tech
Ray's Galactic Tech
Apr 16, 2026 · Artificial Intelligence

How to Turn FunASR into a Production‑Ready Real‑Time Speech Platform: From Single‑Node Demo to Million‑Scale Architecture

This article explains how to evolve FunASR from a simple demo into a production‑grade, low‑latency, high‑concurrency streaming speech‑recognition system by addressing model inference, session state, scaling layers, Kubernetes deployment, monitoring, and common pitfalls for real‑world use cases such as call‑center quality inspection.

FunASRProduction ArchitectureReal-time Speech Recognition
0 likes · 38 min read
How to Turn FunASR into a Production‑Ready Real‑Time Speech Platform: From Single‑Node Demo to Million‑Scale Architecture
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Feb 26, 2026 · Artificial Intelligence

How MiniMax’s Forge Architecture Achieves 40× Faster Agent RL Training

The article details MiniMax’s Forge system, an asynchronous native Agent‑RL architecture that standardizes Agent‑LLM interaction, introduces engineering optimizations, novel scheduling, prefix‑tree merging and reward designs, enabling million‑sample daily throughput, stable reward growth and up to 40‑fold training acceleration for the MiniMax M2.5 model.

Agent ArchitectureAsynchronous RLMixed Scheduling
0 likes · 17 min read
How MiniMax’s Forge Architecture Achieves 40× Faster Agent RL Training
Huolala Tech
Huolala Tech
Nov 22, 2024 · Backend Development

Building a Scalable Message Fusion Platform for Billions of User Notifications

This article details how a rapidly growing on‑demand logistics service designed and implemented a high‑performance, extensible message fusion platform that handles massive, diverse user messages through distributed processing, visual configuration, and self‑healing mechanisms, enabling real‑time, personalized communication at scale.

Backend ArchitectureDistributed ProcessingMessage Queue
0 likes · 13 min read
Building a Scalable Message Fusion Platform for Billions of User Notifications
DataFunSummit
DataFunSummit
Sep 12, 2023 · Backend Development

Xiaohongshu Recommendation Engineering Architecture: Graph Architecture, Hot Deployment, and Practices

This article presents Xiaohongshu's evolving recommendation engineering architecture, detailing its modular backend design, graph-based Ark framework, hot deployment mechanisms, and the challenges and solutions for scaling personalized content delivery in a fast‑growing mobile platform.

Backend ArchitectureHot DeploymentScalable Systems
0 likes · 13 min read
Xiaohongshu Recommendation Engineering Architecture: Graph Architecture, Hot Deployment, and Practices
Shopee Tech Team
Shopee Tech Team
Jul 20, 2023 · Cloud Native

Design and Evolution of Shopee Image Service: Architecture, Challenges, and Optimizations

Shopee’s unified Image Service consolidates upload, processing, and delivery into a single platform that overcomes legacy fragmentation and heterogeneous networks through resumable SDKs, adaptive ID schemes, link‑level and HTTP/3 transport optimizations, dynamic CDN scheduling, and device‑aware resizing, boosting upload success to 99.67 %, halving latency, cutting storage by 25 % and bandwidth by 56 % while markedly improving app performance.

Backend ArchitectureCDN optimizationCloud Native
0 likes · 20 min read
Design and Evolution of Shopee Image Service: Architecture, Challenges, and Optimizations
Programmer DD
Programmer DD
Feb 24, 2023 · Artificial Intelligence

How Jeff Dean’s Journey Shaped Google’s AI and Big Data Revolution

Jeff Dean, a Google engineering legend, has mastered over 18 programming languages and pioneered transformative technologies such as MapReduce, Bigtable, Spanner, and TensorFlow, illustrating how his relentless pursuit of scalability and performance has driven the evolution of AI, big data, and modern cloud infrastructure.

Jeff DeanMapReduceScalable Systems
0 likes · 14 min read
How Jeff Dean’s Journey Shaped Google’s AI and Big Data Revolution
vivo Internet Technology
vivo Internet Technology
Feb 22, 2023 · Backend Development

Game Recommendation System: Architecture, Models, Scaling, and Operational Practices

The article details the design, evolution, and operational practices of Vivo’s large‑scale game recommendation platform, covering its initial rule‑based model, layered strategy framework, multi‑level caching, GC tuning, rate‑limiting, fine‑grained A/B testing, multi‑path recall, dynamic exposure control, and future intelligent extensions.

A/B testingBackend ArchitectureGarbage Collection
0 likes · 17 min read
Game Recommendation System: Architecture, Models, Scaling, and Operational Practices
Tencent Cloud Developer
Tencent Cloud Developer
May 31, 2022 · Artificial Intelligence

Scalable Graph Neural Architecture Search System (PaSca) – WWW 2022 Best Student Paper

PaSca, a scalable graph neural architecture search system that separates message aggregation from updates, explores over 150,000 GNN designs with multi‑objective optimization, delivers models that outperform traditional GNNs in accuracy, memory and speed, has been open‑sourced and deployed at Tencent for risk control, recommendation and fraud detection, and earned the WWW 2022 Best Student Paper award.

Big DataNeural Architecture SearchScalable Systems
0 likes · 11 min read
Scalable Graph Neural Architecture Search System (PaSca) – WWW 2022 Best Student Paper
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Feb 26, 2022 · Backend Development

How to Build a Scalable Million‑DAU Backend Architecture with Hybrid Cloud

This article outlines a comprehensive architecture for handling millions of daily active users, covering DNS routing, four‑layer and seven‑layer load balancing, monolithic versus microservice deployment, caching, database sharding, hybrid‑cloud strategies, full‑link elastic scaling, and a three‑tier degradation mechanism to ensure resilience under sudden traffic spikes.

Scalable Systemsdatabase shardinghybrid cloud
0 likes · 11 min read
How to Build a Scalable Million‑DAU Backend Architecture with Hybrid Cloud
Qingyun Technology Community
Qingyun Technology Community
Aug 31, 2021 · Cloud Computing

How to Build Scalable, High‑Availability Real‑Time Audio‑Video Systems

This talk explains the evolution and practical implementation of large‑scale real‑time audio‑video communication, covering common architectures such as direct P2P, MCU, and SFU, network topologies, scalability, high‑availability techniques, edge computing, and emerging technologies like WebRTC, SDN, and AI‑driven enhancements.

Edge ComputingSFUScalable Systems
0 likes · 16 min read
How to Build Scalable, High‑Availability Real‑Time Audio‑Video Systems
Programmer DD
Programmer DD
May 16, 2021 · Backend Development

What Really Drives Scalable Backend Architecture? Lessons from Weibo’s Massive System

This article explores the essence of system architecture, illustrating how large‑scale services like Uber and Weibo handle massive traffic, data storage, service decomposition, multi‑level caching, and monitoring, and offers practical design principles for building robust, high‑performance backend systems.

Backend ArchitectureScalable Systemsdistributed caching
0 likes · 22 min read
What Really Drives Scalable Backend Architecture? Lessons from Weibo’s Massive System
Continuous Delivery 2.0
Continuous Delivery 2.0
Jan 28, 2021 · Operations

Facebook's Scalable Continuous Delivery System

This article explains how Facebook built a highly scalable continuous delivery pipeline for both web and mobile products, evolving from simple trunk‑based releases with cherry‑picks to a quasi‑continuous deployment model that supports thousands of engineers, rapid rollouts, and robust quality monitoring.

Continuous DeliveryDeploymentFacebook
0 likes · 14 min read
Facebook's Scalable Continuous Delivery System
21CTO
21CTO
Jan 2, 2021 · Operations

Designing & Operating Highly Available Scalable Systems: Google’s SRE Secrets

This article presents a comprehensive overview of Site Reliability Engineering (SRE) as shared by Google SRE expert Ramón Medrano Llamas, covering SRE fundamentals, a typical day’s workflow, design principles for massive scale, fault‑tolerant architecture, monitoring, SLI/SLO metrics, redundancy strategies, disaster recovery, and operational best practices.

OperationsSREScalable Systems
0 likes · 13 min read
Designing & Operating Highly Available Scalable Systems: Google’s SRE Secrets
Youzan Coder
Youzan Coder
Apr 7, 2020 · Backend Development

Building Youzan's Enterprise Search Platform: Architecture, Indexing & Scaling

This article explores Youzan's enterprise search middle platform, detailing the challenges of siloed architectures, the concept of cognitive folding, comprehensive index design, write/read mechanisms, configuration-driven routing, monitoring, and practical implementations that enable scalable, reusable search capabilities across diverse business domains.

Backend ArchitectureEnterprise searchScalable Systems
0 likes · 16 min read
Building Youzan's Enterprise Search Platform: Architecture, Indexing & Scaling
JD Retail Technology
JD Retail Technology
Jan 16, 2020 · Backend Development

Architecture and Key Technologies of a Scalable Message Push Platform

The document outlines the design, key components, data flow, and operational strategies of a large‑scale message push platform, detailing its architecture, request handling, long‑connection management, retry mechanisms, data statistics, monitoring, and future expansion plans.

Backend ArchitectureData AnalyticsLong Connections
0 likes · 15 min read
Architecture and Key Technologies of a Scalable Message Push Platform
21CTO
21CTO
Nov 15, 2019 · Operations

How SRE Designs Highly Available Software Systems at Scale

This article presents Google SRE expert Ramón Medrano Llamas’s comprehensive guide on designing, operating, and maintaining large‑scale, highly available software systems, covering SRE fundamentals, daily workflows, scalability strategies, fault‑tolerant architecture, monitoring, and operational best practices.

SREScalable Systemsfault tolerance
0 likes · 13 min read
How SRE Designs Highly Available Software Systems at Scale
WeChat Backend Team
WeChat Backend Team
Jan 17, 2019 · Operations

How We Built DAGOR: A Scalable Overload Control System for Massive Microservices

This article presents DAGOR, a decentralized overload control framework designed for large‑scale microservice architectures like WeChat’s backend, detailing its service‑agnostic design, priority‑based admission policies, adaptive algorithms, and experimental evaluation that demonstrates improved success rates, fairness, and robustness under heavy load.

MicroservicesScalable Systemsload shedding
0 likes · 49 min read
How We Built DAGOR: A Scalable Overload Control System for Massive Microservices
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 28, 2018 · Databases

How Alibaba’s Tair Cache Engine Scaled to 500M QPS for Double 11

Alibaba’s Tair, a high‑performance distributed key/value cache, evolved through multiple versions to support massive traffic during Double 11, employing multi‑region deployment, hotspot hashing, memory merging, user‑space networking, and client optimizations that dramatically cut latency, improve scalability, and reduce operational costs.

AlibabaScalable SystemsTair
0 likes · 14 min read
How Alibaba’s Tair Cache Engine Scaled to 500M QPS for Double 11
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jan 16, 2018 · Backend Development

What We Learned from China’s Top Tech Architects on System Refactoring

Leading architects from companies like Sogou, Ele.me, Xiaomi, Tuniu, Kuaidi, 58.com, and Tencent shared practical insights on progressively refactoring legacy systems, scaling platforms, adopting service‑oriented and streaming architectures, and balancing optimization with new business demands, offering a comprehensive roadmap for modern backend evolution.

MicroservicesScalable Systemsservice-oriented architecture
0 likes · 17 min read
What We Learned from China’s Top Tech Architects on System Refactoring
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 15, 2017 · Artificial Intelligence

Kunpeng: A Scalable Distributed Machine Learning Platform for Billion‑Scale Data

Kunpeng is a unique distributed platform that seamlessly integrates large‑scale system architecture with parallel optimization algorithms, delivering fault‑tolerant, high‑performance machine‑learning capabilities for billions of samples and features, and outperforming Spark, MPI, and XGBoost in real‑world Alibaba applications.

Artificial IntelligenceScalable Systemsdistributed machine learning
0 likes · 11 min read
Kunpeng: A Scalable Distributed Machine Learning Platform for Billion‑Scale Data
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Mar 27, 2017 · Cloud Native

How Microservice Architecture Powers Scalable Smart Campus Platforms

This article explains how a decentralized microservice and SOA architecture, combined with cloud deployment, service registration, gateways, and unified APIs, enables high‑performance, high‑availability, and low‑coupling smart campus systems that support both mobile and PC applications while simplifying development, testing, and operations.

MicroservicesScalable Systemscloud-native
0 likes · 22 min read
How Microservice Architecture Powers Scalable Smart Campus Platforms
High Availability Architecture
High Availability Architecture
Mar 16, 2017 · Operations

Stormcrow: Dropbox’s Scalable Feature‑Flag Platform for Rapid Deployment and A/B Testing

The article describes Dropbox’s Stormcrow system, a configurable feature‑gate platform that enables fast, safe rollout of new functionality across web, desktop, and mobile clients, supports granular A/B testing, leverages custom data fields, and integrates deployment, monitoring, and audit tooling for large‑scale operations.

A/B testingDeploymentScalable Systems
0 likes · 15 min read
Stormcrow: Dropbox’s Scalable Feature‑Flag Platform for Rapid Deployment and A/B Testing
AntTech
AntTech
Feb 28, 2017 · Artificial Intelligence

Key Computing Capabilities Driving the Evolution of Digital Financial Services

The talk outlines nine essential computing capabilities—transaction processing, system robustness, connectivity, decision-making, data insight, intelligent services, biometric authentication, blockchain trust, and immersive integration—that have transformed Ant Financial over the past decade and outlines the challenges and strategies for the next ten years.

Artificial IntelligenceBig DataBlockchain
0 likes · 16 min read
Key Computing Capabilities Driving the Evolution of Digital Financial Services
21CTO
21CTO
May 5, 2016 · Backend Development

How Uber Built the Tincup Microservice: Lessons in Scalable Backend Architecture

This article details Uber's transition to a modular microservice architecture, describing the RFC process for new services, the design and implementation of the Tincup currency service, and the technologies—such as MVCS, UDR, Tornado, TChannel, and Thrift—used to ensure scalability, reliability, and efficient production deployment.

MicroservicesRFC processScalable Systems
0 likes · 10 min read
How Uber Built the Tincup Microservice: Lessons in Scalable Backend Architecture
21CTO
21CTO
Apr 24, 2016 · Backend Development

Inside Sina Weibo’s Third-Gen Backend Architecture for 100M+ Users

To support over 140 million monthly active users, Sina Weibo evolved from a simple LAMP stack to a sophisticated third‑generation architecture featuring layered interfaces, service and resource tiers, custom RPC and messaging frameworks, SSD caching, and a large‑scale distributed tracing system called WatchMan.

Backend ArchitectureDistributed TracingScalable Systems
0 likes · 11 min read
Inside Sina Weibo’s Third-Gen Backend Architecture for 100M+ Users
Big Data and Microservices
Big Data and Microservices
Apr 12, 2016 · Backend Development

WeChat’s Backend Journey: From Zero to Billions with Scalable Architecture

This article chronicles how WeChat’s backend evolved from a simple messaging prototype to a globally distributed, multi‑data‑center system, detailing its message model, unified sync protocol, three‑layer architecture, platformization, disaster‑recovery design, performance tuning, and emerging resource‑scheduling challenges.

Backend ArchitectureScalable SystemsWeChat
0 likes · 30 min read
WeChat’s Backend Journey: From Zero to Billions with Scalable Architecture
Architecture Digest
Architecture Digest
Apr 7, 2016 · Backend Development

Evolution of 58 Bangbang IM System Architecture Across Four Phases

This article chronicles the four-stage evolution of the 58 Bangbang instant messaging platform’s architecture—from a traditional IM system to a merchant management platform, then to a mobile marketing tool, and finally to a high‑throughput mobile push solution—detailing design choices, scaling strategies, and technology stacks used.

Distributed SystemsIM architectureScalable Systems
0 likes · 11 min read
Evolution of 58 Bangbang IM System Architecture Across Four Phases
21CTO
21CTO
Feb 23, 2016 · Backend Development

From Personal Homepage to Billion‑Page Site: Lessons in Scalable Web Architecture

This article shares a 14‑year journey of building and evolving website architectures, from a simple personal homepage to billion‑page platforms, highlighting essential accumulation, knowledge structuring, design principles, infrastructure, software engineering practices, and the nuanced differences across business systems.

Scalable Systemsbackend-developmentsite scaling
0 likes · 31 min read
From Personal Homepage to Billion‑Page Site: Lessons in Scalable Web Architecture
21CTO
21CTO
Jan 25, 2016 · Backend Development

From Personal Homepage to Billion‑Visit Site: Key Lessons in Scalable Architecture

The article shares a veteran engineer’s fourteen‑year journey from a simple personal homepage to a billion‑page‑view e‑commerce platform, outlining essential principles, knowledge structures, design philosophies, infrastructure choices, and operational practices needed to build and maintain large‑scale web systems.

Scalable Systemsbackend-developmentperformance optimization
0 likes · 31 min read
From Personal Homepage to Billion‑Visit Site: Key Lessons in Scalable Architecture
21CTO
21CTO
Nov 6, 2015 · Backend Development

How WeChat & Momo Scale IM: Lessons on Battery, Network, and Custom Protocols

This article analyzes the architectural choices behind WeChat and Momo instant‑messaging services, covering battery and traffic constraints, network reliability, the shift from XMPP to proprietary long/short connections, protocol design with protobuf, and operational strategies for scaling massive user bases.

Backend ArchitectureInstant MessagingProtobuf
0 likes · 12 min read
How WeChat & Momo Scale IM: Lessons on Battery, Network, and Custom Protocols
21CTO
21CTO
Oct 29, 2015 · Backend Development

Scaling Sina's News Comment System: From 3.0 to 5.0 – Key Backend Lessons

This article chronicles the evolution of Sina's news comment platform from a single‑server C++/MySQL prototype to a multi‑node, asynchronous, Python‑based architecture, highlighting the caching, sharding, and queueing techniques that enabled the system to survive massive traffic spikes without downtime.

Backend ArchitectureMessage QueueScalable Systems
0 likes · 21 min read
Scaling Sina's News Comment System: From 3.0 to 5.0 – Key Backend Lessons
21CTO
21CTO
Sep 10, 2015 · Backend Development

Inside 58.com’s Mobile O2O Platform: Architecture, Evolution, and Push Tech

Sun Xuan, a system architect at 58.com, details the evolution of the company's merchant mobile management platform—from its early IM‑centric design through third‑party service integration to a lightweight client architecture—while also dissecting core O2O technologies such as long‑connection push, LBS, and cross‑platform notification strategies.

Backend ArchitectureIMScalable Systems
0 likes · 9 min read
Inside 58.com’s Mobile O2O Platform: Architecture, Evolution, and Push Tech