Tagged articles
132 articles
Machine Heart
May 10, 2026 · Artificial Intelligence

Stop Fragmenting Long Texts: HiLight Lets AI Highlight Key Points Directly

The HiLight approach inserts lightweight highlight tags into full-length inputs and trains a small Emphasis Actor to score token importance and guide a frozen large language model. This improves performance on tasks such as recommendation and QA without modifying the solver, while keeping latency and training cost low.

LLM · Low latency · evaluation
0 likes · 9 min read
Weekly Large Model Application
May 6, 2026 · Cloud Native

How OpenAI Scales Low-Latency Voice AI with WebRTC: Architecture Deep Dive

The article dissects OpenAI's engineering approach to delivering low‑latency voice AI at scale, explaining why WebRTC was chosen, how a Relay + Transceiver split solves Kubernetes integration challenges, the use of ICE ufrag for deterministic routing, and how global relay and implementation choices reduce perceived latency.

Kubernetes · Low latency · OpenAI
0 likes · 9 min read
Alibaba Cloud Developer
Mar 27, 2026 · Artificial Intelligence

How Tair Powers Sub‑Second AI Agent Memory for Real‑Time Ordering

This article examines how Taobao Flash Sale’s AI Agent uses Alibaba Cloud’s Tair as a high‑performance short‑term memory layer, detailing data model design, latency impact, concurrency control, elastic scaling, bandwidth handling, and TTL‑based cleanup to achieve sub‑second response times during massive traffic spikes.

AI Agent · Low latency · Memory Management
0 likes · 15 min read
Code Wrench
Mar 6, 2026 · Backend Development

Why WebRTC Latency Isn’t About the API: Go, ICE, DTLS, and Scaling

This article breaks down the true bottlenecks of low‑latency WebRTC systems—network models, congestion control, memory layout, and concurrency scheduling—by examining the protocol stack, Go runtime, ICE state machine, DTLS/SRTP security, RTP/RTCP feedback, and practical high‑concurrency tuning strategies.

Go · Low latency · Real-time Media
0 likes · 10 min read
Deepin Linux
Mar 6, 2026 · Backend Development

Unlocking Ultra‑Low Latency: How RDMA Transforms High‑Performance Networking

This article explains the fundamentals of Remote Direct Memory Access (RDMA), its low‑latency, zero‑copy and kernel‑bypass mechanisms, programming interfaces, and real‑world applications in data‑center networks, high‑performance computing, and distributed storage, providing developers with practical guidance and code examples.

High‑performance computing · Low latency · Network programming
0 likes · 31 min read
Code Wrench
Feb 23, 2026 · Backend Development

How Switching from Go to Rust Slashed Latency from 15 ms to 80 µs

In high‑frequency trading and real‑time systems, Go’s garbage‑collector pauses and channel lock contention can inflate P99 latency to dozens of milliseconds, while a disciplined Rust rewrite eliminates GC, reduces lock overhead, and achieves sub‑100‑microsecond latency with far lower CPU usage.

Backend Development · Go · Low latency
0 likes · 7 min read
Old Zhang's AI Learning
Jan 24, 2026 · Artificial Intelligence

Open-Source Qwen3‑TTS: Sub‑100 ms Latency, Runs on 8 GB GPU, and ComfyUI Integration

Qwen3‑TTS, an open‑source text‑to‑speech model from Alibaba, offers sub‑100 ms first‑packet latency, supports voice cloning, natural‑language voice design, and ten languages, can be deployed locally on a GPU with as little as 8 GB VRAM, and integrates with ComfyUI for visual workflow building.

ComfyUI · Low latency · Qwen3-TTS
0 likes · 15 min read
Ray's Galactic Tech
Dec 23, 2025 · Backend Development

How Apache Ignite Powers Low‑Latency Real‑Time Bidding at Scale

This article explains how Apache Ignite's memory‑first architecture, distributed compute grid, and event‑driven streaming enable sub‑100 ms decision making, high throughput, and strong consistency for real‑time bidding platforms, with practical code examples, Spring Boot integration, monitoring tips, and security considerations.

Apache Ignite · In-Memory Data Grid · Low latency
0 likes · 8 min read
Architect Chen
Dec 23, 2025 · Backend Development

How to Reduce Java GC Pauses from 200 ms to 20 ms: A Practical Tuning Guide

This guide explains how to systematically analyze and optimize Java garbage‑collection pauses—cutting typical 200 ms stalls down to around 20 ms—by enabling detailed logs, selecting the right collector, tuning heap and generation settings, minimizing allocation, handling large objects, and balancing GC threads with CPU resources.

JVM · Java · Low latency
0 likes · 5 min read
Alibaba Cloud Big Data AI Platform
Dec 16, 2025 · Artificial Intelligence

How CosyVoice 2.0 Cuts First‑Chunk Latency for High‑Fidelity Voice Cloning

CosyVoice 2.0, Alibaba DAMO Academy's next‑gen high‑fidelity speech synthesis model, introduces architecture decoupling, streaming generation, reference‑audio caching and dynamic load balancing to dramatically reduce first‑packet latency and improve real‑time factor while supporting multi‑language voice cloning.

AI model optimization · Low latency · Streaming Inference
0 likes · 9 min read
DevOps Coach
Dec 14, 2025 · Backend Development

10 Proven Strategies to Slash System Latency for Faster User Experience

This article outlines ten practical techniques—ranging from reducing network hops and caching hot data to optimizing database queries, batching requests, trimming payloads, focusing on critical paths, and proactive scaling—to dramatically lower response times and make applications feel instantly responsive for users.

Backend · Database Optimization · Low latency
0 likes · 8 min read
Architects' Tech Alliance
Nov 9, 2025 · Artificial Intelligence

How SUE Ethernet Redefines AI Cluster Interconnects for Scale‑Up Performance

This article examines Broadcom's Scale Up Ethernet (SUE) framework, detailing how it addresses AI/HPC rack‑scale interconnect challenges by delivering ultra‑high bandwidth, microsecond‑level latency, memory‑semantic operations, and seamless compatibility with existing Ethernet infrastructure for large XPU clusters.

AI interconnect · HPC · High Bandwidth
0 likes · 12 min read
Architects' Tech Alliance
Sep 28, 2025 · Artificial Intelligence

How AI Workloads Are Redefining Network Architecture: Key Requirements and Topologies

The article examines how the rapid growth of AI models and workloads is reshaping network design, highlighting the need for ultra‑high bandwidth, sub‑millisecond latency, reliability, scalable topologies like Fat‑Tree and Dragonfly, and robust security and QoS mechanisms across data‑center, cloud, and edge environments.

AI networking · Data center · Distributed Training
0 likes · 11 min read
Meituan Technology Team
Sep 11, 2025 · Artificial Intelligence

How LongCat-Flash Achieves Ultra-Fast, Low-Cost AI Agent Inference with SGLang

LongCat-Flash, an open‑source Mixture‑of‑Experts model released by Meituan, leverages model‑system co‑design, PD‑disaggregation, SBO scheduling and large‑scale expert parallelism within the SGLang framework to deliver dramatically lower latency, higher throughput and cost‑effective inference for AI agents, with detailed deployment instructions provided.

LongCat-Flash · Low latency · Mixture of Experts
0 likes · 15 min read
JD Tech Talk
Aug 29, 2025 · Fundamentals

Why KCP Beats TCP and UDP for Real‑Time Apps: Low Latency, High Throughput

This article explains the limitations of TCP and UDP for real‑time applications, introduces the KCP protocol as a UDP‑based solution that combines reliability with low latency, outlines its features, working principles, code usage, and real‑world deployment examples in gaming, video conferencing, live streaming, and IoT.

KCP · Low latency · UDP
0 likes · 8 min read
Su San Talks Tech
Jul 6, 2025 · Backend Development

Mastering Disruptor: High‑Performance In‑Memory Queue for Low‑Latency Java Apps

This article introduces the Disruptor in‑memory message queue—its architecture, core concepts such as Ring Buffer, Sequence, Sequencer and Wait Strategies, key features like multicast and gating, tuning options for producers and wait strategies, and provides a complete Java example to help developers build ultra‑low‑latency systems.

Disruptor · In-Memory Queue · Java
0 likes · 10 min read
Baidu Intelligent Cloud Tech Hub
May 16, 2025 · Artificial Intelligence

How Baidu Cloud Achieved 4µs End-to-End Latency for Large-Scale PD Inference

Baidu Intelligent Cloud built a 4µs end-to-end low‑latency HPN cluster, optimized traffic management and communication operators, and introduced dynamic expert balancing to dramatically improve the performance of large‑scale PD‑separated inference services, showcasing the deep integration of network infrastructure with AI workloads.

AI inference · All-to-All · HPN
0 likes · 14 min read
macrozheng
Mar 19, 2025 · Backend Development

Unlock Ultra‑Low Latency with Disruptor: Architecture, Features & Tuning Guide

This article introduces the high‑performance in‑memory queue Disruptor, covering its architecture—including ring buffer, sequence, sequencer, and wait strategies—its key features such as multicast events and lock‑free concurrency, and provides practical tuning advice and a complete Java example.

Disruptor · In-Memory Queue · Low latency
0 likes · 12 min read
FunTester
Jan 22, 2025 · Industry Insights

Aeron vs Chronicle Queue: Which Low‑Latency Messaging Solution Wins?

This article provides an in‑depth comparison of Aeron and Chronicle Queue, examining their architectures, ideal use cases, performance characteristics, scalability, and persistence features to help engineers decide which low‑latency messaging system best fits distributed or single‑process applications.

Aeron · Chronicle Queue · Low latency
0 likes · 9 min read
Code Ape Tech Column
Dec 24, 2024 · Backend Development

Understanding the Disruptor In-Memory Message Queue: Architecture, Features, and Tuning

This article introduces the Disruptor in‑memory message queue, explains its core components such as Ring Buffer, Sequence, Sequencer and Wait Strategies, describes its distinctive features like multicast events and lock‑free concurrency, and provides tuning guidelines and a complete Java example.

Disruptor · In-Memory Queue · Java concurrency
0 likes · 11 min read
FunTester
Dec 5, 2024 · Backend Development

Understanding Aeron: A High‑Performance Messaging Framework and Its Advantages

Aeron is an open‑source, low‑latency, high‑throughput messaging framework that leverages zero‑copy memory, shared‑memory IPC and UDP transport to deliver microsecond‑level latency for finance, gaming, and distributed systems, offering a simple API and powerful performance features.

Aeron · Distributed Systems · High-Performance Messaging
0 likes · 9 min read
BirdNest Tech Talk
Dec 1, 2024 · Fundamentals

Step-by-Step Guide to RDMA Programming with the ibverbs API

This tutorial walks through the complete RDMA programming workflow using the ibverbs API, covering device initialization, memory registration, completion queue and queue pair creation, state transitions, send/receive operations, completion handling, and resource cleanup with concrete C code examples.

C · Low latency · Network programming
0 likes · 5 min read
JavaEdge
Nov 16, 2024 · Backend Development

How Netflix Built a Low‑Latency Distributed Counter Service at Scale

This article explains Netflix's distributed counter abstraction built on their time‑series service, detailing use cases, API design, counter types, implementation methods, control‑plane configuration, performance results, and future work to achieve near‑real‑time, low‑latency counting at massive scale.

Backend Architecture · Distributed Systems · Low latency
0 likes · 25 min read
Java Architecture Stack
Oct 30, 2024 · Operations

Why ZGC Is Revolutionizing Java Memory Management for Low‑Latency Apps

This article explains ZGC’s ultra‑low pause times, massive heap support, colored pointers, concurrent compaction, and generational optimizations, then shows how these features benefit real‑time analytics, high‑performance servers, and online transaction systems, and provides step‑by‑step instructions for enabling and tuning ZGC in Java.

Garbage Collector · Java · Low latency
0 likes · 9 min read
MaGe Linux Operations
Oct 7, 2024 · Operations

Why Choose RocketMQ? Features, Comparisons, and Reliability Explained

This article provides a comprehensive overview of RocketMQ, covering its architecture, key features such as high reliability, low latency and high throughput, comparisons with Kafka, RabbitMQ and ActiveMQ, and detailed mechanisms that ensure message durability, performance, and ordered consumption.

Distributed Systems · Low latency · Message Queue
0 likes · 12 min read
Architects' Tech Alliance
Aug 14, 2024 · Artificial Intelligence

Network Architecture and Performance Requirements for Training Large-Scale Generative AI Models

The article examines the ultra‑large‑scale, high‑bandwidth, low‑latency, and automated network infrastructure needed for training generative AI models, covering custom network designs, congestion control, deterministic RDMA, topology choices such as Fat‑Tree, and emerging deterministic networking technologies.

High Bandwidth · Low latency · RDMA
0 likes · 8 min read
DataFunSummit
Aug 8, 2024 · Artificial Intelligence

GPU Throughput and Low‑Latency Optimization Practices in JD Advertising

This article presents JD Advertising's technical practices for improving GPU throughput and reducing latency in large‑scale recommendation scenarios, covering system challenges, storage and compute optimizations for training, low‑latency inference techniques, and compiler extensions to handle massive sparse models.

AI · Advertising · Low latency
0 likes · 13 min read
High Availability Architecture
Jun 27, 2024 · Backend Development

Understanding and Tuning Z Garbage Collector (ZGC) for Low‑Latency Java Applications

This article explains the principles, features, and workflow of Java's Z Garbage Collector, provides detailed configuration and logging guidance, and shares AutoMQ's practical tuning experiences that achieve sub‑millisecond pause times and improved performance for latency‑sensitive backend services.

AutoMQ · Garbage Collection · Java
0 likes · 26 min read
IT Architects Alliance
Jun 12, 2024 · Cloud Computing

Network Architecture Selection and Comparison for AI Compute Centers

The article analyzes traditional cloud data‑center networking challenges for AI workloads and compares two‑layer and three‑layer fat‑tree architectures, presenting high‑bandwidth, non‑blocking, and low‑latency designs such as AI‑Pool networks and offering practical deployment scales from hundreds to tens of thousands of GPUs.

AI compute · Fat-Tree · High Bandwidth
0 likes · 11 min read
21CTO
Apr 24, 2024 · Backend Development

Can Rust Outperform C++ in Finance? Insights on Safety, Speed, and Hiring Challenges

While Rust’s safety and performance promise significant productivity gains over C++ for low‑latency trading systems, steep learning curves, scarce talent, and lower salaries hinder its adoption in finance, with industry leaders noting both its advantages and the challenges of transitioning legacy codebases.

C++ · Low latency · Rust
0 likes · 7 min read
Architects' Tech Alliance
Apr 18, 2024 · Industry Insights

Why InfiniBand Dominates Modern HPC: Speed, Latency, and Scalability Explained

This article provides a comprehensive technical overview of InfiniBand, covering its rapid adoption in top supercomputers, detailed performance advantages such as ultra‑high bandwidth, CPU offload, sub‑microsecond latency, flexible scalability, QoS, SHARP acceleration, and a comparison with Ethernet, Fibre Channel, and Omni‑Path, while also outlining HDR switch and NIC product families.

Data center · HDR · HPC
0 likes · 20 min read
Java Architecture Diary
Apr 17, 2024 · Backend Development

Unlock Low-Latency IoT Messaging with mica-mqtt: A Java AIO MQTT Solution

mica-mqtt is a lightweight, low‑latency, high‑performance open‑source MQTT component built on Java AIO, offering full MQTT v3.1/v3.1.1/v5 support, WebSocket, REST API, client/server capabilities, clustering via Redis, GraalVM native compilation, Spring Boot starter integration, and extensive customization for IoT and messaging scenarios.

Low latency · Spring Boot · graalvm
0 likes · 6 min read
DataFunSummit
Feb 4, 2024 · Mobile Development

Advanced Mobile Audio Recording Techniques in Quanmin K‑Song: Low Latency, High Fidelity, and Intelligent Audio Processing

The article details how Quanmin K‑Song has built a comprehensive mobile‑focused audio recording system since 2014, covering low‑latency capture, high‑quality sampling, lyric and vocal‑accompaniment alignment, in‑ear monitoring, pitch shifting, vocal enhancement, 3A processing, and AI‑driven scoring to deliver a professional karaoke experience on smartphones.

AI scoring · Audio Processing · Low latency
0 likes · 14 min read
Tencent Music Tech Team
Feb 4, 2024 · Mobile Development

Technical Guidelines for High-Quality Mobile Recording and Audio Processing in Quanmin K Song

Quanmin K Song’s decade‑long mobile‑recording platform combines 48 kHz/16‑bit dry‑signal capture, sub‑70 ms latency via OpenSL ES/AAudio, real‑time clipping and noise detection, lyric‑ and vocal‑accompaniment alignment, pitch‑shifting, adaptive vocal enhancement, 3A DSP/AI processing, and AI‑driven pitch correction to deliver industry‑leading high‑quality mobile singing experiences.

AI · Audio Processing · Low latency
0 likes · 15 min read
DataFunTalk
Jan 29, 2024 · Big Data

Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund

An ultra‑large hedge fund with over $10 billion AUM replaced ksqlDB and Flink with RisingWave, leveraging its PostgreSQL‑compatible streaming SQL to achieve sub‑10 ms latency, lower learning and operational costs, rich connectors, advanced operators, and comprehensive observability for real‑time trade data processing.

Data Integration · Low latency · Quantitative Trading
0 likes · 10 min read
JD Cloud Developers
Sep 12, 2023 · Backend Development

Why Netty’s Direct Memory Stalls on JDK 17: A Deep Dive into Low‑Latency Bottlenecks

An in‑depth analysis of the Tianwang risk‑control Lingji system reveals how JDK 17’s ZGC, Netty’s direct‑memory allocation, and cross‑data‑center channel limits caused severe latency spikes, memory growth, and CPU usage, and outlines the debugging steps and configuration changes that finally resolved the issue.

Low latency · Netty · direct memory
0 likes · 16 min read
Tencent Architect
Aug 21, 2023 · Fundamentals

How Tencent Cloud’s V265/TXAV1 Revolutionizes 8K Live Streaming

This article details Tencent Cloud's V265/TXAV1 live streaming solution, covering its high‑efficiency 8K and low‑latency capabilities, performance gains over X265, MV‑HEVC 3D compression, extensive engineering and algorithmic optimizations, and the resulting speed‑up and quality improvements for ultra‑high‑definition live broadcasts.

8K · Algorithm Optimization · Low latency
0 likes · 18 min read
Architects' Tech Alliance
Aug 8, 2023 · Cloud Computing

Design Principles and Practices for High‑Performance AI Compute Center Networks

The article analyzes the limitations of traditional data‑center networking for AI compute workloads and presents high‑bandwidth, non‑blocking, low‑latency design solutions—including two‑layer and three‑layer fat‑tree architectures, AI‑Pool concepts, and recommended configurations—for building scalable, efficient intelligent computing clusters.

AI compute · Fat-Tree · High Bandwidth
0 likes · 10 min read
Kuaishou Audio & Video Technology
Jun 20, 2023 · Artificial Intelligence

How a Low‑Latency Hierarchical Fusion Network Beats Echoes in Real‑Time Calls

At ICASSP 2023, Kuaishou’s audio team presented a low‑latency hierarchical fusion network for full‑band acoustic echo cancellation, detailing its multi‑stage design, asymmetric windowing, loss functions, training strategy, and achieving second place in the non‑personalized AEC Challenge, with real‑world deployment results.

Acoustic Echo Cancellation · Deep Learning · Hierarchical Fusion Network
0 likes · 13 min read
Alibaba Cloud Big Data AI Platform
Jun 5, 2023 · Artificial Intelligence

How Alibaba’s DGS Enables Real‑Time GNN Inference on Massive Dynamic Graphs

The Dynamic Graph Sampling (DGS) service, built on GraphLearn, delivers sub‑20 ms latency for real‑time GNN inference on large, constantly evolving graphs by separating storage from computation, using event‑driven pre‑sampling, lazy multi‑hop concatenation, and a publish‑subscribe architecture that scales linearly across distributed workers.

Alibaba Cloud · Distributed Systems · GraphLearn
0 likes · 12 min read
Rare Earth Juejin Tech Community
Jan 16, 2023 · Operations

Low‑Latency Video Streaming Optimizations for Douyin During the World Cup

This article details the end‑to‑end low‑latency video streaming architecture, measurement methods, and optimization techniques—including FLV‑2s, RTM, MiniSDP, buffer‑driven speed‑up, and CDN strategies—that Douyin's Volcano Engine video cloud employed to achieve sub‑2‑second latency and high QoE for World Cup live broadcasts.

FLV · Low latency · QoE
0 likes · 21 min read
Bilibili Tech
Jan 13, 2023 · Cloud Computing

Design and Implementation of Bilibili's Low‑Latency Cloud Gaming Platform Using WebRTC

Bilibili built a cross‑platform cloud‑gaming service that leverages WebRTC with tuned jitter buffers, unordered data channels, adaptive input‑report rates, and a custom kernel driver to deliver sub‑100 ms latency, dynamic bitrate control, and haptic feedback, overcoming typical latency, stutter, and flexibility limitations of existing solutions.

Low latency · WebRTC · adaptive bitrate
0 likes · 14 min read
NetEase Smart Enterprise Tech+
Jan 6, 2023 · Cloud Computing

How NetEase’s Ultra‑Low‑Latency Cloud Architecture Powers Remote Excavators

NetEase and China Construction 8th Bureau teamed up to create high‑altitude excavation robots that rely on NetEase Yunxin's ultra‑low‑latency global transmission, edge deployment, intelligent scheduling, WE‑CAN routing, multiplexed signaling, and advanced QoS techniques to achieve sub‑150 ms control and reliable video streaming in challenging network environments.

Edge Computing · Low latency · QoS
0 likes · 12 min read
Zuoyebang Tech Team
Nov 17, 2022 · Artificial Intelligence

Scaling Deep Learning Model Serving: High‑Concurrency, Low‑Latency Solutions

This article examines the challenges of deploying dozens of deep‑learning models at Zuoyebang and compares three serving architectures—Gunicorn + Flask + Transformers, Tornado + PyTorch, and Tornado + Triton—highlighting performance trade‑offs and presenting a final high‑concurrency, low‑latency solution in production.

Deep Learning · Low latency · Model Deployment
0 likes · 11 min read
Bilibili Tech
Nov 11, 2022 · Backend Development

Real-Time Audio/Video System Architecture and Key Technologies Based on WebRTC

The article surveys the evolution of live streaming toward low‑latency, interactive scenarios and details WebRTC‑based real‑time audio/video system design, covering RTP/UDP transport, FEC and ARQ loss recovery, congestion control, jitter buffering, echo cancellation, edge‑node path optimization, and a multi‑layer architecture with signaling, routing, mixing services for scalable, high‑availability PK deployments.

Low latency · Media Server · Real-time Streaming
0 likes · 16 min read
phodal
Oct 24, 2022 · Industry Insights

Unlocking Ultra-Fast Systems: Key Patterns Behind Low‑Latency Architecture

This article provides a comprehensive overview of low‑latency architecture, covering network hardware, system‑level programming strategies, language choices, memory management techniques, event‑driven designs, high‑performance data structures, and visualization approaches for building ultra‑fast computing systems.

Event-Driven Architecture · High‑performance computing · Java performance
0 likes · 10 min read
Refining Core Development Skills
Oct 24, 2022 · Fundamentals

Low‑Latency Network Architecture for High‑Frequency Trading

This article explains how high‑frequency trading firms achieve ultra‑low network latency by combining proximity deployment, dedicated links, microwave transmission, InfiniBand, low‑latency switches, kernel bypass, RDMA, TCP offload engines and FPGA acceleration, and summarizes the impact of each technique on overall request latency.

FPGA · InfiniBand · Kernel Bypass
0 likes · 16 min read
WeChat Backend Team
Aug 5, 2022 · Artificial Intelligence

How WeChat’s Ekko Achieves Ultra‑Low‑Latency Model Updates for Billion‑User Recommendations

At the 16th OSDI conference, Tencent’s WeChat team presented the award‑winning Ekko system, an ultra‑low‑latency model‑update solution for massive recommendation workloads that dramatically speeds up updates, supports trillion‑scale models, and has already boosted user engagement across billions of daily users.

Low latency · Model Update · Recommendation Systems
0 likes · 5 min read
Huawei Cloud Developer Alliance
May 26, 2022 · Cloud Computing

How Huawei SparkRTC Delivers Global Low‑Latency Audio/Video: Network Architecture Secrets

In a May 16 live session, Huawei Cloud’s chief architect MuNa explained the goals of media networking, detailed the SparkRTC global network architecture—including overlay/underlay layers and low‑latency techniques—and revealed nine key strategies such as end‑to‑end transmission, AI‑driven routing, multi‑path access, and edge acceleration, while also reminding developers to register for the ongoing competition.

Low latency · cloud computing · media streaming
0 likes · 5 min read
IT Architects Alliance
May 23, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP for AI and High‑Performance Storage

The article analyzes how the AI boom and high‑performance SSD storage demand sub‑microsecond latency, exposing TCP/IP’s inherent context‑switch and CPU overhead, and explains why RDMA’s kernel‑bypass, zero‑copy design and 1 µs latency make it the preferred network stack for modern data‑center workloads despite challenges in Ethernet deployment.

AI computing · Data Center Network · Low latency
0 likes · 11 min read
Architects' Tech Alliance
May 19, 2022 · Fundamentals

An Introduction to RDMA: Concepts, Advantages, Protocols, and Programming Basics

This article explains the fundamentals of Remote Direct Memory Access (RDMA), comparing it with traditional networking, outlining its core advantages, suitable use cases, the three main RDMA protocols (InfiniBand, RoCE, iWARP), deployment requirements, communication flow, and essential programming concepts.

High‑Performance Networking · Low latency · RDMA
0 likes · 9 min read
ITPUB
May 12, 2022 · Databases

How Aerospike Delivers Millisecond Latency on TB‑Scale Data

This article explains how Aerospike, a high‑performance NoSQL database, achieves millisecond‑level query latency on terabyte‑scale datasets by using a hybrid storage architecture, multi‑level storage tiers, flash optimizations, and a flexible ecosystem that supports diverse real‑time use cases.

Aerospike · Hybrid storage · Low latency
0 likes · 14 min read
High Availability Architecture
Apr 22, 2022 · Artificial Intelligence

BIGO RTC: High‑Quality, Low‑Cost Real‑Time Communication through Core Operators and Scene Adaptation

The article explains how BIGO RTC achieves high‑quality, low‑cost real‑time audio‑video communication by optimizing core video operators such as HEVC encoding, AI‑driven super‑resolution and HDR, and by employing scene‑adaptive techniques like device performance tuning, content‑adaptive encoding and AI‑based pre‑processing to meet diverse latency constraints.

AI Adaptation · HDR · High Quality
0 likes · 9 min read
Zuoyebang Tech Team
Apr 21, 2022 · Backend Development

How to Build a High‑Concurrency, Low‑Latency Live Streaming System for Online Education

This article details the design and implementation of a self‑developed interactive live‑streaming platform that supports massive concurrent users and ultra‑low latency for online education, covering business scenarios, technical abstractions, key low‑latency and high‑concurrency techniques, and real‑world performance results.

Backend Architecture · KCP · Low latency
0 likes · 16 min read
Ops Development Stories
Apr 20, 2022 · Fundamentals

Understanding ZGC: Low‑Latency Garbage Collection in Java

This article introduces ZGC (Z Garbage Collector), a scalable, low‑latency collector introduced in JDK 11 that features sub‑millisecond pause times, a region‑based memory layout, colored pointers, NUMA awareness, and concurrent phases, and it explains the collector's architecture, parameters, triggers, and log analysis in detail.

Garbage Collection · JVM · Java
0 likes · 17 min read
Architects' Tech Alliance
Mar 13, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP in AI-Driven Data Centers

The article analyzes how the AI era’s demand for ultra‑low latency and high throughput exposes fundamental limits of the traditional TCP/IP stack, and explains why RDMA’s kernel‑bypass, zero‑copy design, and emerging congestion‑control algorithms are becoming the preferred network fabric for modern data‑center workloads.

AI Fabric · Data center · Low latency
0 likes · 12 min read
Tencent Architect
Mar 11, 2022 · Cloud Computing

How Tencent Cloud’s Ultra‑Low‑Latency Live Streaming Redefined Real‑Time Video

This article explores how Tencent Cloud leveraged WebRTC and innovative engineering to create an ultra‑low‑latency live streaming solution that outperforms traditional CDN‑based streams, detailing the technical challenges, product decisions, and market impact that have driven rapid adoption across e‑commerce, education, and entertainment sectors.

Low latency · Tencent Cloud · WebRTC
0 likes · 10 min read
IT Architects Alliance
Dec 19, 2021 · Information Security

Understanding the Metaverse from a Network Perspective: Concepts, Technical Implications, and Emerging Challenges

The article analyses the metaverse as a network-driven evolution, explaining its definition, classifying terminal types, and detailing the new security, ultra‑low‑latency, and deterministic quality‑of‑service requirements that challenge existing internet architecture and design principles.

Internet · Low latency · QoS
0 likes · 17 min read
Alibaba Terminal Technology
Dec 6, 2021 · Backend Development

How We Achieved Low‑Latency, High‑Definition Multi‑Angle Live Streaming with WebRTC

This article details the design and implementation of a low‑latency, high‑definition multi‑angle live streaming solution using WebRTC, covering protocol selection, system architecture, edge commands, client integration, performance optimizations, and lessons learned from deploying the feature in a large‑scale live event.

Edge Computing · Low latency · Video Encoding
0 likes · 15 min read
Java Architecture Diary
Nov 29, 2021 · Backend Development

Unlock High‑Performance IoT Messaging with mica‑mqtt: Low‑Latency Java MQTT Framework

The mica‑mqtt library, built on t‑io, offers a simple, low‑latency, high‑performance open‑source MQTT solution for IoT, supporting MQTT v3.1/v3.1.1/v5, WebSocket sub‑protocol, HTTP API, clustering via Redis, Spring Boot starter integration, Prometheus/Grafana monitoring, and provides extensive client and server interfaces with code examples.

Java · Low latency · Spring Boot
0 likes · 10 min read
Tencent Cloud Developer
Nov 26, 2021 · Cloud Computing

5G Remote Real-Time Control: Key Challenges, Enabling Technologies, and System Architectures

5G’s high bandwidth, low latency, and edge offloading enable real‑time remote control of complex equipment. Such control demands synchronized video, audio, and control signals, which the article addresses through real‑time AV communication, control‑signal synchronization, and network optimization. It also traces how system architectures are evolving, from direct‑connection models to integrated fusion designs, to provide scalability, reliability, and minimal latency for applications in mines, ports, logistics, and autonomous vehicles.

5G · IoT · Low latency
0 likes · 12 min read
Java Architecture Diary
Oct 18, 2021 · Backend Development

Unlock Low-Latency MQTT with mica-mqtt: A High-Performance Java Component

This guide introduces mica-mqtt, a simple, low‑latency, high‑performance MQTT open‑source component built on t‑io, outlines its extensive feature set, provides Spring Boot and plain Java integration steps, showcases configuration examples, and explains monitoring and clustering capabilities for IoT back‑end development.

Java · Low latency · Spring Boot
0 likes · 8 min read
Architects' Tech Alliance
Sep 10, 2021 · Artificial Intelligence

FPGA Technology for Compute‑Intensive and Communication‑Intensive Tasks in Data Centers

The article examines how FPGA’s pipeline parallel architecture provides latency‑critical advantages over CPU and GPU for both compute‑intensive workloads such as matrix operations and AI inference, and communication‑intensive tasks like encryption and high‑throughput networking, while also discussing deployment models, power efficiency, eFPGA trends, and the evolving Chinese FPGA market.

AI · Compute Acceleration · Data center
0 likes · 14 min read
Youku Technology
Sep 8, 2021 · Industry Insights

How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback

This article examines Youku's technical team's systematic approach to improving video playback by tackling slow start‑up, error handling, network stalls, audio‑video sync, and overall stability through metric‑driven optimization, adaptive streaming protocols, device capability databases, and forward‑looking features like low latency and high‑frame‑rate streaming.

Low latency · Performance Monitoring · Video Streaming
0 likes · 12 min read
MaGe Linux Operations
Jul 14, 2021 · Cloud Computing

What Is Intelligent Edge Computing and Why It Matters Today

Intelligent edge computing extends traditional edge architectures by embedding AI-driven data analysis and machine learning directly at the network edge, enabling faster, low‑latency decisions for use cases like smart cameras and autonomous vehicles, and is rapidly becoming a core strategy for modern enterprises.

Edge Computing · IoT · Low latency
0 likes · 6 min read
Huawei Cloud Developer Alliance
Jun 21, 2021 · Cloud Computing

How Huawei Cloud Achieves Ultra‑Low Latency for E‑Commerce Live Streaming

This article examines the technical challenges of massive e‑commerce live streams, explains why traditional streaming suffers from high latency, and details Huawei Cloud's low‑latency architecture, protocols, intelligent scheduling and ultra‑fast transcoding that enable sub‑second product link delivery to millions of viewers.

CDN · Low latency · Video Transcoding
0 likes · 13 min read
Tencent Cloud Middleware
Jun 17, 2021 · Fundamentals

How Tencent Kona JDK 11 ZGC Delivers Millisecond‑Level GC Pauses for Real‑Time Services

Tencent's Kona JDK 11 introduces a production‑ready ZGC implementation that reduces Java garbage‑collection stop‑the‑world pauses to under 10 ms, enabling ultra‑low‑latency online services across massive heaps while maintaining acceptable throughput, and the article details its design, tuning, and real‑world deployments.

Garbage Collection · Java performance · Low latency
0 likes · 28 min read
NetEase Smart Enterprise Tech+
Jun 4, 2021 · Backend Development

How WE‑CAN Revolutionizes Global Data Transmission with Ultra‑Low Latency and Cost‑Effective Architecture

WE‑CAN, NetEase Cloud Communication's self‑developed large‑scale distributed transmission network, dramatically improves end‑to‑end data quality, reduces costs, and supports diverse real‑time communication scenarios through intelligent routing, layered decoupling, and global edge deployment, as demonstrated by extensive production metrics.

Distributed Systems · Low latency · cloud networking
0 likes · 15 min read
Kuaishou Large Model
May 28, 2021 · Mobile Development

How to Eliminate High‑Latency Decoding in Moonlight Android for Smooth Game Streaming

This article explains the technical challenges of high‑latency video decoding in Android game streaming with Moonlight, analyzes why it occurs, and provides practical solutions—including API choices, immediate buffer release, frame ordering, and low‑latency vendor configurations—to achieve smoother, low‑delay gameplay.

Android · C language · Low latency
0 likes · 10 min read
Selected Java Interview Questions
Mar 11, 2021 · Fundamentals

Understanding ZGC: A Low‑Latency Garbage Collector for Java

The article explains ZGC, Oracle's low‑latency Java garbage collector introduced in JDK 11, covering its dynamic region layout, colored pointer technique, tri‑color marking, memory multi‑mapping, read barriers, operation phases, performance characteristics, and its main drawback of floating garbage.

Concurrent Marking · Garbage Collection · Java
0 likes · 11 min read
Tencent Cloud Developer
Jan 12, 2021 · Cloud Computing

Key Technologies and Applications of Tencent Real-Time Audio‑Video (TRTC)

Tencent's Real‑Time Audio‑Video (TRTC) platform delivers sub‑second, cross‑platform RTC for up to 300 participants and 100 000 viewers, using UDP‑based NACK/FEC, dynamic QoS, AI‑enhanced video/audio effects, and cloud‑driven QoE optimization to support diverse scenarios such as online meetings, education, live streaming, and industry‑specific applications.

Audio-Video · Cloud Services · Echo Cancellation
0 likes · 16 min read
Architects' Tech Alliance
Jan 10, 2021 · Industry Insights

Why RoCE Is Revolutionizing Data Center Networking: A Deep Dive into RDMA over Ethernet

This article explains the fundamentals of RDMA and RoCE, compares RoCE v1 and v2, outlines deployment steps, highlights performance benefits such as low CPU usage and zero‑copy, and answers common questions about its differences from iWARP and InfiniBand, helping data‑center engineers evaluate the technology.

Data Center Networking · High Bandwidth · Low latency
0 likes · 8 min read
Didi Tech
Dec 21, 2020 · Big Data

HBase Availability and Latency Optimizations: Replication‑Based Multi‑Read and ZGC Adoption

To overcome HBase’s weak availability and GC‑induced latency spikes, the DiDi team introduced a replication‑based client multi‑read (hedged‑read) mechanism and migrated to the Z Garbage Collector, which together dramatically cut maximum and 99.9th‑percentile latencies while keeping services online during region disruptions.

Big Data · HBase · Low latency
0 likes · 12 min read
High Availability Architecture
Nov 5, 2020 · Backend Development

Why We Chose Java for Our High‑Frequency Trading Application

The article explains how a high‑frequency trading firm evaluated Java versus C++ for ultra‑low‑latency trading, discusses the challenges of JVM JIT compilation and garbage‑collection pauses, and shows how Azul Zing’s C4 collector delivers near‑C++ latency while preserving Java’s development productivity.

Azul Zing · Garbage Collection · JVM
0 likes · 11 min read
Amap Tech
Oct 30, 2020 · Mobile Development

Video Streaming Solution for the ARC Car Cloud Control Platform

The ARC Car Cloud Control platform now streams the vehicle’s screen using Android’s Virtual Display and a C++‑based H.264 hardware encoder, sending raw video over a TCP socket to a server that adaptively adjusts bitrate and frame rate, while the web client decodes the fragmented MP4 via MSE, dramatically lowering CPU usage and latency on low‑end head‑units.

Android · H.264 · Low latency
0 likes · 8 min read
Youku Technology
Aug 18, 2020 · Backend Development

How Youku Engineered a High‑Performance, Low‑Latency Marketing Platform

This article details Youku's membership marketing system architecture, covering complex marketing scenarios, high‑availability and low‑latency requirements, rule‑based QL engine, unified marketing framework, multi‑cache storage, multithreaded matching, asynchronous reward distribution, and distributed transaction mechanisms.

Backend · Low latency · System Design
0 likes · 12 min read
High Availability Architecture
Aug 11, 2020 · Operations

Understanding and Optimizing ZGC (Z Garbage Collector) for Low‑Latency Java Services

This article examines the Z Garbage Collector (ZGC) introduced in JDK 11, detailing its low‑pause design goals, underlying concurrent mark‑copy algorithm, colored pointer and read‑barrier techniques, practical tuning parameters, real‑world case studies, and the performance impact of upgrading from CMS/G1 to ZGC in high‑throughput, low‑latency services.

Garbage Collection · JVM · Java
0 likes · 28 min read
Meituan Technology Team
Aug 6, 2020 · Backend Development

ZGC: Principles, Tuning Practices, and Production Upgrade Experience

The article explains how Meituan’s risk‑control platform eliminated frequent 40 ms CMS pauses by adopting JDK 11’s ZGC—detailing its concurrent mark‑copy design, practical tuning parameters, real‑world case fixes, and measured latency reductions of up to 74 % while noting trade‑offs.

Garbage Collection · JDK11 · Java
0 likes · 27 min read
Tencent Tech
Jun 18, 2020 · Backend Development

Scaling Live‑Ecommerce Platforms: Architecture Behind Billions of Users

This article examines the rapid rise of live‑ecommerce during the 618 shopping festival, explains why the “live + ecommerce” model demands robust backend, streaming and CDN infrastructure, and details Tencent Cloud’s architectural solutions—including media processing, low‑latency protocols, bandwidth optimization and anti‑attack measures—to support massive concurrent traffic.

Low latency · architecture · e‑commerce
0 likes · 10 min read
Tencent Tech
Jun 2, 2020 · Cloud Computing

How SRT Enables Low‑Latency, Reliable Live Streaming for Global eSports Events

The article explains how the Secure Reliable Transport (SRT) protocol, combined with Tencent Video Cloud’s optimized infrastructure, overcame latency and packet‑loss challenges to deliver stable, high‑quality live streams for the 2020 LPL Mid‑Season Cup and other large‑scale events, and describes its broader applications through MediaConnect.

Low latency · MediaConnect · Network Transmission
0 likes · 10 min read
Programmer DD
May 22, 2020 · Backend Development

Can ZGC Deliver Sub‑10ms Pauses for Massive Java Heaps?

This article explains the design goals, architecture, key features, tuning options, and version history of Java's Z Garbage Collector (ZGC), highlighting its sub‑10 ms pause times for terabyte‑scale heaps, its use of colored pointers and load barriers, and the trade‑offs in throughput and configuration.

Garbage Collection · JVM · Java
0 likes · 16 min read
Tencent Cloud Developer
May 21, 2020 · Game Development

How Tencent’s Game Server Engine Tackles Low Latency and Cost in Multiplayer Games

This article analyzes the challenges of delivering low‑latency, stable, and cost‑effective online multiplayer games and explains how Tencent's Game Server Engine (GSE) provides elastic scaling, nearby scheduling, stateful scale‑in, multi‑region disaster recovery, and zero‑downtime updates to meet those demands.

Low latency · Tencent GSE · cloud gaming
0 likes · 11 min read
iQIYI Technical Product Team
Apr 24, 2020 · Cloud Computing

Technical Insights into Cloud Gaming Advertising Trials: Low‑Latency RTCDN Solutions by iQIYI Live Cloud

In an interview, iQIYI Live Cloud’s Chen Kunzhong explains how their WebRTC‑based RTCDN reduces encoding and transmission delay to achieve roughly 100 ms end‑to‑end latency for cloud‑gaming ads, supporting cross‑device play, instant click‑to‑play sessions, and future 5G‑enhanced high‑resolution experiences.

5G · Advertising · Low latency
0 likes · 9 min read
Top Architect
Apr 9, 2020 · Backend Development

Low‑Latency and High‑Availability Design of RocketMQ: Evolution, Optimizations, and Capacity Planning

This article reviews the evolution of Alibaba's Aliware message engine, analyzes the low‑latency and high‑availability challenges faced during Double 11, and details the architectural, JVM, memory, rate‑limiting, and multi‑replica solutions that enabled RocketMQ to achieve sub‑millisecond write latency and five‑nines availability.

Distributed Systems · Low latency · RocketMQ
0 likes · 29 min read
dbaplus Community
Apr 7, 2020 · Databases

How Pharos Accelerates HBase Multi‑Condition Queries with Low‑Latency Indexing

This article examines Pharos, Everbright Bank's home‑grown HBase indexing middleware, detailing why existing secondary‑index solutions fall short, the design goals of low latency, simple architecture and non‑intrusiveness, and the concrete storage, pagination, and transaction‑consistency techniques that enable fast complex queries on massive data.

HBase · Low latency · Pharos
0 likes · 14 min read