Tagged articles
19 articles
Architect
Aug 14, 2023 · Backend Development

Load Balancer vs API Gateway: Functions, Differences, and Application Scenarios

This article explains the roles of load balancers and API gateways, compares their layer focus, features, and performance characteristics, and outlines when to use each technology—especially in micro‑service architectures and high‑throughput network entry points.

Load Balancer · Microservices · backend infrastructure
0 likes · 10 min read
Baidu Geek Talk
Jun 19, 2023 · Operations

How Baidu’s Tianyan Log Service Overcomes ELK’s Scaling and Performance Limits

This article examines the challenges of logging in distributed services, compares the traditional ELK stack with Baidu's Tianyan solution, and details Tianyan's architecture (Ingest, Store, Consumer, Elastic Agent, Fleet, APM, Beats, and Disruptor‑based high‑throughput pipelines). It also covers resource isolation and dynamic cleanup, and offers best‑practice recommendations for building a scalable, low‑latency log platform.

Distributed Systems · Elastic Stack · Log Management
0 likes · 26 min read
Open Source Linux
Jun 16, 2023 · Backend Development

How Netflix’s Cloud Gateway Cuts Errors with Adaptive Load Balancing

Netflix’s cloud‑gateway team redesigned its load‑balancing stack—combining client latency, server utilization, and probabilistic choice‑of‑2 algorithms—to dramatically lower error rates, improve request distribution, and enhance fault‑tolerance for millions of requests per second.

Distributed Systems · Netflix · adaptive algorithms
0 likes · 19 min read
JD Cloud Developers
Apr 20, 2023 · Operations

How to Build a Cost‑Effective, High‑Throughput Log Collection System with ClickHouse

This article examines the challenges of scaling log storage and retrieval for high‑traffic services, analyzes the cost and performance limits of traditional ELK‑based pipelines, and presents a streamlined, UDP‑driven architecture using ClickHouse that dramatically reduces hardware expenses while handling hundreds of gigabytes per second.

ClickHouse · Cost Optimization · High Throughput
0 likes · 16 min read
MaGe Linux Operations
Dec 8, 2021 · Operations

Mastering Load Balancing: LVS, Nginx, and HAProxy Explained

This article provides a comprehensive overview of load balancing technologies—including LVS, Nginx, and HAProxy—detailing their architectures, operating modes, advantages, and drawbacks, and explains how to choose and configure them for high‑performance, scalable backend systems.

HAProxy · LVS · Nginx
0 likes · 11 min read
Open Source Linux
Apr 9, 2021 · Operations

How Replit Cut REPL Startup Time from 2 Minutes to 15 Seconds by Fixing Docker Shutdown

Replit engineers discovered that slow Docker container shutdown on preemptible VMs caused REPL sessions to hang for up to a minute, and by bypassing Docker’s kill process and directly terminating container PIDs they reduced error rates from 3% to under 0.5% and cut 99th‑percentile startup time from two minutes to fifteen seconds.

Container Shutdown · Docker · Performance Optimization
0 likes · 12 min read
Top Architect
Mar 25, 2021 · Operations

Improving REPL Container Shutdown Performance at Replit

Replit engineers analyzed why container shutdown on preemptible VMs caused REPL sessions to stall for up to a minute, identified Docker's network‑release bottleneck, and implemented a direct SIGKILL workaround that reduced error rates and startup latency dramatically.

Container Management · Docker · Ops
0 likes · 12 min read
Open Source Linux
Dec 31, 2020 · Backend Development

How ZooKeeper Powers Distributed Coordination: Core Concepts Explained

This article provides a comprehensive overview of ZooKeeper, covering its purpose as a distributed coordination service, design goals, hierarchical data model, session handling, watch mechanism, consistency guarantees, server roles, leader election, workflow of leaders and followers, and the Zab protocol that ensures reliable state replication.

Consensus · ZAB Protocol · ZooKeeper
0 likes · 15 min read
MaGe Linux Operations
Oct 20, 2020 · Backend Development

Mastering Load Balancing: Choose Between LVS, Nginx, and HAProxy

This article explains how modern web services use server clusters and load‑balancing devices, compares the three most popular software balancers—LVS, Nginx, and HAProxy—by describing their architectures, operating modes, advantages, disadvantages, and suitable deployment scenarios for different site scales.

LVS · Nginx · backend infrastructure
0 likes · 13 min read
Full-Stack DevOps & Kubernetes
Sep 19, 2020 · Operations

Understanding LVS, Nginx, and HAProxy: Choosing the Right Load Balancer for Your Web Architecture

This article explains how modern web services rely on server clusters and a front‑end load balancer, compares the three most popular software balancers—LVS, Nginx, and HAProxy—by describing their architecture, operating modes, advantages, and drawbacks, and offers guidance on selecting the appropriate solution for different traffic scales.

HAProxy · LVS · Networking
0 likes · 16 min read
ITPUB
Dec 2, 2019 · Backend Development

How Xiaomi Built Talos: A Scalable, Stateless Message Queue for Billions of Events

This article details Xiaomi's journey from Kafka 0.8 to the home‑grown Talos system, covering business motivations, storage‑compute separation architecture, key challenges such as tail‑read and consistency, and extensive performance, resource, and platform optimizations that enable a high‑throughput, multi‑tenant messaging service.

Distributed Messaging · HDFS · Message Queue
0 likes · 16 min read
Huajiao Technology
Oct 29, 2019 · Backend Development

Building a Scalable Distributed Cron: Google‑Level Design Simplified for Startups

This article examines Google's high‑availability distributed cron design, distills its core requirements and algorithms, and then presents a streamlined implementation for a startup using etcd and Raft, followed by a thoughtful discussion on whether early‑stage companies should adopt a middle‑platform strategy.

Raft · backend infrastructure · distributed cron
0 likes · 10 min read
Didi Tech
Aug 24, 2019 · Backend Development

Levin: Didi's Open‑Source Solution for Fast Loading of Large Static Data

Levin, Didi’s open‑source GitHub project, uses shared‑memory containers and offline‑compiled binary layouts to mmap large, low‑frequency static datasets, cutting service cold‑start times from minutes to seconds, lowering memory overhead, simplifying version switches, and improving stability for high‑scale applications.

Levin · Performance Optimization · backend infrastructure
0 likes · 6 min read
Meituan Technology Team
Jan 12, 2018 · Backend Development

Design and Implementation of Meituan Hotel Full-Chain Log and Trace System

To cope with Meituan Hotel’s exploding micro‑service complexity, the infrastructure team built the Satellite System, which combines MTrace with a selective, zero‑intrusion Log4j2‑based logging pipeline that streams enriched logs through Kafka, Storm, Redis, and Elasticsearch. It delivers trace‑log queries that return within seconds and six‑month retention, dramatically speeding up debugging.

Distributed Tracing · Elasticsearch · Kafka
0 likes · 11 min read
21CTO
Mar 8, 2016 · Backend Development

How Uber Engineers Tailor Core Infrastructure for China’s Ride‑Sharing Market

Uber’s CTO Thuan Pham explains how the company builds a specialized engineering team and core infrastructure in China, uses data‑center support, dynamic pricing, anti‑fraud machine learning, and safety features to adapt its global platform to local market needs.

Ride-sharing · Uber · backend infrastructure
0 likes · 9 min read
21CTO
Dec 20, 2015 · Backend Development

How Twitter Scales Redis to 105 TB RAM and 39 M QPS

This article summarizes Yao Yu's "Scaling Redis at Twitter" talk, detailing why Twitter chose Redis, the massive memory and QPS requirements, custom data models, Hybrid List and BTree extensions, cluster management, and operational lessons for building a high‑performance caching service.

Cluster Management · Twitter · backend infrastructure
0 likes · 21 min read

Load Balancing Solutions: Nginx, LVS, and HAProxy – Advantages, Disadvantages, and Deployment Stages

This article examines how load‑balancing technologies such as Nginx, LVS, and HAProxy are chosen according to website scale, outlines their respective strengths and weaknesses, and describes typical deployment stages from simple single‑point setups to complex multi‑layer architectures.

HAProxy · LVS · Operations
0 likes · 13 min read