Tagged articles

low-latency

134 articles · Page 1 of 2

Jun 28, 2026 · Fundamentals

Java Performance Tuning Part 5: Hands‑On GC Optimization from G1 to ZGC

This article walks through Java GC tuning by defining low‑latency and high‑throughput goals, comparing major collectors, presenting G1 and ZGC configuration examples, and demonstrating a real‑world payment system case where pause times were reduced from 150‑200 ms to under 50 ms.

GCJVMJava

0 likes · 8 min read

Java Performance Tuning Part 5: Hands‑On GC Optimization from G1 to ZGC

Machine Heart

May 10, 2026 · Artificial Intelligence

Stop Fragmenting Long Texts: HiLight Lets AI Highlight Key Points Directly

The HiLight approach inserts lightweight highlight tags into full-length inputs, training a small Emphasis Actor to score token importance and guide a frozen large language model, improving performance on tasks like recommendation and QA without modifying the solver, while keeping low latency and training cost.

EvaluationLLMhighlighting

0 likes · 9 min read

Stop Fragmenting Long Texts: HiLight Lets AI Highlight Key Points Directly

Weekly Large Model Application

May 6, 2026 · Cloud Native

How OpenAI Scales Low-Latency Voice AI with WebRTC: Architecture Deep Dive

The article dissects OpenAI's engineering approach to delivering low‑latency voice AI at scale, explaining why WebRTC was chosen, how a Relay + Transceiver split solves Kubernetes integration challenges, the use of ICE ufrag for deterministic routing, and how global relay and implementation choices reduce perceived latency.

OpenAIRelayTransceiver

0 likes · 9 min read

How OpenAI Scales Low-Latency Voice AI with WebRTC: Architecture Deep Dive

Machine Heart

Apr 18, 2026 · Artificial Intelligence

No Prompt Needed: Press Tab Repeatedly to Refactor with the Instruction‑Free NES Framework

The NES framework introduced by Ant Group’s CodeFuse team learns developers’ historical editing patterns to provide instant, instruction‑free edit suggestions, achieving sub‑250 ms latency and improving the flow of frequent refactoring tasks through a simple Tab‑press interaction.

AI code editingIDE assistanceNES

0 likes · 9 min read

No Prompt Needed: Press Tab Repeatedly to Refactor with the Instruction‑Free NES Framework

Amazon Cloud Developers

Apr 15, 2026 · Game Development

Amazon GameLift Player Gateway & Ping Beacons: DDoS Protection and Low‑Latency

Amazon GameLift Servers introduces Player Gateway, an active DDoS mitigation layer that hides server IPs and validates traffic without modifying game servers, and Ping Beacons, a UDP‑based multi‑region latency measurement tool; both are free, SDK‑driven features that simplify secure, low‑latency multiplayer deployment.

Amazon GameLiftC++ SDKDDoS protection

0 likes · 18 min read

Amazon GameLift Player Gateway & Ping Beacons: DDoS Protection and Low‑Latency

Alibaba Cloud Developer

Mar 27, 2026 · Artificial Intelligence

How Tair Powers Sub‑Second AI Agent Memory for Real‑Time Ordering

This article examines how Taobao Flash Sale’s AI Agent uses Alibaba Cloud’s Tair as a high‑performance short‑term memory layer, detailing data model design, latency impact, concurrency control, elastic scaling, bandwidth handling, and TTL‑based cleanup to achieve sub‑second response times during massive traffic spikes.

AI AgentDistributed LockMemory Management

0 likes · 15 min read

How Tair Powers Sub‑Second AI Agent Memory for Real‑Time Ordering

Code Wrench

Mar 6, 2026 · Backend Development

Why WebRTC Latency Isn’t About the API: Go, ICE, DTLS, and Scaling

This article breaks down the true bottlenecks of low‑latency WebRTC systems—network models, congestion control, memory layout, and concurrency scheduling—by examining the protocol stack, Go runtime, ICE state machine, DTLS/SRTP security, RTP/RTCP feedback, and practical high‑concurrency tuning strategies.

GoReal-time MediaWebRTC

0 likes · 10 min read

Why WebRTC Latency Isn’t About the API: Go, ICE, DTLS, and Scaling

Deepin Linux

Mar 6, 2026 · Backend Development

Unlocking Ultra‑Low Latency: How RDMA Transforms High‑Performance Networking

This article explains the fundamentals of Remote Direct Memory Access (RDMA), its low‑latency, zero‑copy and kernel‑bypass mechanisms, programming interfaces, and real‑world applications in data‑center networks, high‑performance computing, and distributed storage, providing developers with practical guidance and code examples.

Distributed storageHigh-performance computingNetwork Programming

0 likes · 31 min read

Unlocking Ultra‑Low Latency: How RDMA Transforms High‑Performance Networking

Code Wrench

Feb 23, 2026 · Backend Development

How Switching from Go to Rust Slashed Latency from 15 ms to 80 µs

In high‑frequency trading and real‑time systems, Go’s garbage‑collector pauses and channel lock contention can inflate P99 latency to dozens of milliseconds, while a disciplined Rust rewrite eliminates GC, reduces lock overhead, and achieves sub‑100‑microsecond latency with far lower CPU usage.

Backend DevelopmentGohigh performance

0 likes · 7 min read

How Switching from Go to Rust Slashed Latency from 15 ms to 80 µs

Old Zhang's AI Learning

Jan 24, 2026 · Artificial Intelligence

Open-Source Qwen3‑TTS: Sub‑100 ms Latency, Runs on 8 GB GPU, and ComfyUI Integration

Qwen3‑TTS, an open‑source text‑to‑speech model from Alibaba, offers sub‑100 ms first‑packet latency, supports voice cloning, natural‑language voice design, and ten languages, can be deployed locally on a GPU with as little as 8 GB VRAM, and integrates with ComfyUI for visual workflow building.

ComfyUIQwen3-TTSText‑to‑Speech

0 likes · 15 min read

Open-Source Qwen3‑TTS: Sub‑100 ms Latency, Runs on 8 GB GPU, and ComfyUI Integration

Ray's Galactic Tech

Dec 23, 2025 · Backend Development

How Apache Ignite Powers Low‑Latency Real‑Time Bidding at Scale

This article explains how Apache Ignite's memory‑first architecture, distributed compute grid, and event‑driven streaming enable sub‑100 ms decision making, high throughput, and strong consistency for real‑time bidding platforms, with practical code examples, Spring Boot integration, monitoring tips, and security considerations.

Apache IgniteDistributed ComputingIn-Memory Data Grid

0 likes · 8 min read

How Apache Ignite Powers Low‑Latency Real‑Time Bidding at Scale

Architect Chen

Dec 23, 2025 · Backend Development

How to Reduce Java GC Pauses from 200 ms to 20 ms: A Practical Tuning Guide

This guide explains how to systematically analyze and optimize Java garbage‑collection pauses—cutting typical 200 ms stalls down to around 20 ms—by enabling detailed logs, selecting the right collector, tuning heap and generation settings, minimizing allocation, handling large objects, and balancing GC threads with CPU resources.

GCJVMJava

0 likes · 5 min read

How to Reduce Java GC Pauses from 200 ms to 20 ms: A Practical Tuning Guide

Alibaba Cloud Big Data AI Platform

Dec 16, 2025 · Artificial Intelligence

How CosyVoice 2.0 Cuts First‑Chunk Latency for High‑Fidelity Voice Cloning

CosyVoice 2.0, Alibaba DAMO Academy's next‑gen high‑fidelity speech synthesis model, introduces architecture decoupling, streaming generation, reference‑audio caching and dynamic load balancing to dramatically reduce first‑packet latency and improve real‑time factor while supporting multi‑language voice cloning.

AI model optimizationStreaming Inferencelow-latency

0 likes · 9 min read

How CosyVoice 2.0 Cuts First‑Chunk Latency for High‑Fidelity Voice Cloning

DevOps Coach

Dec 14, 2025 · Backend Development

10 Proven Strategies to Slash System Latency for Faster User Experience

This article outlines ten practical techniques—ranging from reducing network hops and caching hot data to optimizing database queries, batching requests, trimming payloads, focusing on critical paths, and proactive scaling—to dramatically lower response times and make applications feel instantly responsive for users.

CachingMonitoringbackend

0 likes · 8 min read

10 Proven Strategies to Slash System Latency for Faster User Experience

Architects' Tech Alliance

Nov 9, 2025 · Artificial Intelligence

How SUE Ethernet Redefines AI Cluster Interconnects for Scale‑Up Performance

This article examines Broadcom's Scale Up Ethernet (SUE) framework, detailing how it addresses AI/HPC rack‑scale interconnect challenges by delivering ultra‑high bandwidth, microsecond‑level latency, memory‑semantic operations, and seamless compatibility with existing Ethernet infrastructure for large XPU clusters.

AI interconnectHPCHigh Bandwidth

0 likes · 12 min read

How SUE Ethernet Redefines AI Cluster Interconnects for Scale‑Up Performance

Java Backend Technology

Nov 6, 2025 · Operations

Boost Java Performance with MyPerf4J: High‑Throughput, Low‑Latency Monitoring

MyPerf4J is a high‑throughput, low‑latency Java performance monitoring tool that uses a non‑intrusive JavaAgent to collect real‑time method, memory, GC, and class metrics, offering developers quick bottleneck detection in development and continuous observability in production.

JavaJavaAgentObservability

0 likes · 6 min read

Boost Java Performance with MyPerf4J: High‑Throughput, Low‑Latency Monitoring

Architects' Tech Alliance

Sep 28, 2025 · Artificial Intelligence

How AI Workloads Are Redefining Network Architecture: Key Requirements and Topologies

The article examines how the rapid growth of AI models and workloads is reshaping network design, highlighting the need for ultra‑high bandwidth, sub‑millisecond latency, reliability, scalable topologies like Fat‑Tree and Dragonfly, and robust security and QoS mechanisms across data‑center, cloud, and edge environments.

AI networkingData CenterHigh Bandwidth

0 likes · 11 min read

How AI Workloads Are Redefining Network Architecture: Key Requirements and Topologies

Linux Kernel Journey

Sep 28, 2025 · Fundamentals

Low‑Latency GPU Packet Processing: Techniques, Trade‑offs, and Benchmarks

This article examines how to achieve low‑latency network packet processing on NVIDIA GPUs by comparing CPU and GPU implementations, exploring memory optimizations, batch strategies, stream concurrency, persistent kernels, and CUDA graphs, and presenting detailed performance measurements for each technique.

CUDAGPUPerformance Optimization

0 likes · 12 min read

Low‑Latency GPU Packet Processing: Techniques, Trade‑offs, and Benchmarks

Meituan Technology Team

Sep 11, 2025 · Artificial Intelligence

How LongCat-Flash Achieves Ultra-Fast, Low-Cost AI Agent Inference with SGLang

LongCat-Flash, an open‑source Mixture‑of‑Experts model released by Meituan, leverages model‑system co‑design, PD‑disaggregation, SBO scheduling and large‑scale expert parallelism within the SGLang framework to deliver dramatically lower latency, higher throughput and cost‑effective inference for AI agents, with detailed deployment instructions provided.

LongCat-FlashMixture of ExpertsSGLang

0 likes · 15 min read

How LongCat-Flash Achieves Ultra-Fast, Low-Cost AI Agent Inference with SGLang

JD Tech Talk

Aug 29, 2025 · Fundamentals

Why KCP Beats TCP and UDP for Real‑Time Apps: Low Latency, High Throughput

This article explains the limitations of TCP and UDP for real‑time applications, introduces the KCP protocol as a UDP‑based solution that combines reliability with low latency, outlines its features, working principles, code usage, and real‑world deployment examples in gaming, video conferencing, live streaming, and IoT.

KCPReal‑time communicationUDP

0 likes · 8 min read

Why KCP Beats TCP and UDP for Real‑Time Apps: Low Latency, High Throughput

Big Data Technology Tribe

Jul 18, 2025 · Fundamentals

What Is Spine‑Leaf Architecture and Why It Boosts Data Center Performance

Spine‑Leaf is a two‑layer data‑center network topology that replaces traditional three‑tier designs, reducing latency and hops by connecting every leaf switch to every spine switch, thereby improving redundancy, scalability, and cost efficiency for east‑west traffic.

Data Center NetworkingNetwork ArchitectureSpine‑Leaf

0 likes · 7 min read

What Is Spine‑Leaf Architecture and Why It Boosts Data Center Performance

Su San Talks Tech

Jul 6, 2025 · Backend Development

Mastering Disruptor: High‑Performance In‑Memory Queue for Low‑Latency Java Apps

This article introduces the Disruptor in‑memory message queue—its architecture, core concepts such as Ring Buffer, Sequence, Sequencer and Wait Strategies, key features like multicast and gating, tuning options for producers and wait strategies, and provides a complete Java example to help developers build ultra‑low‑latency systems.

DisruptorIn-Memory QueueJava

0 likes · 10 min read

Mastering Disruptor: High‑Performance In‑Memory Queue for Low‑Latency Java Apps

Baidu Intelligent Cloud Tech Hub

May 16, 2025 · Artificial Intelligence

How Baidu Cloud Achieved 4µs End-to-End Latency for Large-Scale PD Inference

Baidu Intelligent Cloud built a 4µs end-to-end low‑latency HPN cluster, optimized traffic management and communication operators, and introduced dynamic expert balancing to dramatically improve the performance of large‑scale PD‑separated inference services, showcasing the deep integration of network infrastructure with AI workloads.

AI inferenceAll-to-AllHPN

0 likes · 14 min read

How Baidu Cloud Achieved 4µs End-to-End Latency for Large-Scale PD Inference

macrozheng

Mar 19, 2025 · Backend Development

Unlock Ultra‑Low Latency with Disruptor: Architecture, Features & Tuning Guide

This article introduces the high‑performance in‑memory queue Disruptor, covering its architecture—including ring buffer, sequence, sequencer, and wait strategies—its key features such as multicast events and lock‑free concurrency, and provides practical tuning advice and a complete Java example.

DisruptorIn-Memory QueuePerformance Tuning

0 likes · 12 min read

Unlock Ultra‑Low Latency with Disruptor: Architecture, Features & Tuning Guide

FunTester

Jan 22, 2025 · Industry Insights

Aeron vs Chronicle Queue: Which Low‑Latency Messaging Solution Wins?

This article provides an in‑depth comparison of Aeron Queue and Chronicle Queue, examining their architectures, ideal use cases, performance characteristics, scalability, and persistence features to help engineers decide which low‑latency messaging system best fits distributed or single‑process applications.

AeronChronicle QueueMessage Queue

0 likes · 9 min read

Aeron vs Chronicle Queue: Which Low‑Latency Messaging Solution Wins?

Code Ape Tech Column

Dec 24, 2024 · Backend Development

Understanding the Disruptor In-Memory Message Queue: Architecture, Features, and Tuning

This article introduces the Disruptor in‑memory message queue, explains its core components such as Ring Buffer, Sequence, Sequencer and Wait Strategies, describes its distinctive features like multicast events and lock‑free concurrency, and provides tuning guidelines and a complete Java example.

DisruptorIn-Memory QueueJava concurrency

0 likes · 11 min read

Understanding the Disruptor In-Memory Message Queue: Architecture, Features, and Tuning

Su San Talks Tech

Dec 18, 2024 · Backend Development

Mastering Disruptor: High‑Performance In‑Memory Queue for Low‑Latency Java Applications

This article introduces the Disruptor in‑memory message queue, explains its architecture and core concepts such as Ring Buffer, Sequence, Sequencer, and Wait Strategies, and provides tuning guidance and a complete Java example for building low‑latency concurrent systems.

DisruptorIn-Memory QueueJava

0 likes · 11 min read

Mastering Disruptor: High‑Performance In‑Memory Queue for Low‑Latency Java Applications

FunTester

Dec 5, 2024 · Backend Development

Understanding Aeron: A High‑Performance Messaging Framework and Its Advantages

Aeron is an open‑source, low‑latency, high‑throughput messaging framework that leverages zero‑copy memory, shared‑memory IPC and UDP transport to deliver microsecond‑level latency for finance, gaming, and distributed systems, offering a simple API and powerful performance features.

AeronHigh-Performance MessagingJava

0 likes · 9 min read

Understanding Aeron: A High‑Performance Messaging Framework and Its Advantages

Architecture Development Notes

Dec 4, 2024 · Backend Development

How to Build a Zero‑Copy, Low‑Latency Network Protocol in Rust

This article explains how to design and implement a high‑performance custom network protocol in Rust, covering zero‑copy parsing, memory‑mapped packet pools, lock‑free event loops, and an efficient binary packet format to achieve ultra‑low latency and massive concurrency.

high performancelock‑freelow-latency

0 likes · 9 min read

How to Build a Zero‑Copy, Low‑Latency Network Protocol in Rust

BirdNest Tech Talk

Dec 1, 2024 · Fundamentals

Step-by-Step Guide to RDMA Programming with the ibverbs API

This tutorial walks through the complete RDMA programming workflow using the ibverbs API, covering device initialization, memory registration, completion queue and queue pair creation, state transitions, send/receive operations, completion handling, and resource cleanup with concrete C code examples.

C#Network ProgrammingRDMA

0 likes · 5 min read

Step-by-Step Guide to RDMA Programming with the ibverbs API

JavaEdge

Nov 16, 2024 · Backend Development

How Netflix Built a Low‑Latency Distributed Counter Service at Scale

This article explains Netflix's distributed counter abstraction built on their time‑series service, detailing use cases, API design, counter types, implementation methods, control‑plane configuration, performance results, and future work to achieve near‑real‑time, low‑latency counting at massive scale.

Netflixbackend-architecturecounter service

0 likes · 25 min read

How Netflix Built a Low‑Latency Distributed Counter Service at Scale

Java Architecture Stack

Oct 30, 2024 · Operations

Why ZGC Is Revolutionizing Java Memory Management for Low‑Latency Apps

This article explains ZGC’s ultra‑low pause times, massive heap support, colored pointers, concurrent compaction, and generational optimizations, then shows how these features benefit real‑time analytics, high‑performance servers, and online transaction systems, and provides step‑by‑step instructions for enabling and tuning ZGC in Java.

Garbage CollectorJavaMemory Management

0 likes · 9 min read

Why ZGC Is Revolutionizing Java Memory Management for Low‑Latency Apps

MaGe Linux Operations

Oct 7, 2024 · Operations

Why Choose RocketMQ? Features, Comparisons, and Reliability Explained

This article provides a comprehensive overview of RocketMQ, covering its architecture, key features such as high reliability, low latency and high throughput, comparisons with Kafka, RabbitMQ and ActiveMQ, and detailed mechanisms that ensure message durability, performance, and ordered consumption.

Message QueueReliabilityRocketMQ

0 likes · 12 min read

Why Choose RocketMQ? Features, Comparisons, and Reliability Explained

Architects' Tech Alliance

Aug 14, 2024 · Artificial Intelligence

Network Architecture and Performance Requirements for Training Large-Scale Generative AI Models

The article examines the ultra‑large‑scale, high‑bandwidth, low‑latency, and automated network infrastructure needed for training generative AI models, covering custom network designs, congestion control, deterministic RDMA, topology choices such as Fat‑Tree, and emerging deterministic networking technologies.

Generative AIHigh BandwidthRDMA

0 likes · 8 min read

Network Architecture and Performance Requirements for Training Large-Scale Generative AI Models

DataFunSummit

Aug 8, 2024 · Artificial Intelligence

GPU Throughput and Low‑Latency Optimization Practices in JD Advertising

This article presents JD Advertising's technical practices for improving GPU throughput and reducing latency in large‑scale recommendation scenarios, covering system challenges, storage and compute optimizations for training, low‑latency inference techniques, and compiler extensions to handle massive sparse models.

AIAdvertisingTensorFlow

0 likes · 13 min read

GPU Throughput and Low‑Latency Optimization Practices in JD Advertising

High Availability Architecture

Jun 27, 2024 · Backend Development

Understanding and Tuning Z Garbage Collector (ZGC) for Low‑Latency Java Applications

This article explains the principles, features, and workflow of Java's Z Garbage Collector, provides detailed configuration and logging guidance, and shares AutoMQ's practical tuning experiences that achieve sub‑millisecond pause times and improved performance for latency‑sensitive backend services.

AutoMQGarbage CollectionJava

0 likes · 26 min read

Understanding and Tuning Z Garbage Collector (ZGC) for Low‑Latency Java Applications

IT Architects Alliance

Jun 12, 2024 · Cloud Computing

Network Architecture Selection and Comparison for AI Compute Centers

The article analyzes traditional cloud data‑center networking challenges for AI workloads and compares two‑layer and three‑layer fat‑tree architectures, presenting high‑bandwidth, non‑blocking, and low‑latency designs such as AI‑Pool networks and offering practical deployment scales from hundreds to tens of thousands of GPUs.

AI computeFat-TreeHigh Bandwidth

0 likes · 11 min read

Network Architecture Selection and Comparison for AI Compute Centers

21CTO

Apr 24, 2024 · Backend Development

Can Rust Outperform C++ in Finance? Insights on Safety, Speed, and Hiring Challenges

While Rust’s safety and performance promise significant productivity gains over C++ for low‑latency trading systems, steep learning curves, scarce talent, and lower salaries hinder its adoption in finance, with industry leaders noting both its advantages and the challenges of transitioning legacy codebases.

C++financelow-latency

0 likes · 7 min read

Can Rust Outperform C++ in Finance? Insights on Safety, Speed, and Hiring Challenges

Architects' Tech Alliance

Apr 18, 2024 · Industry Insights

Why InfiniBand Dominates Modern HPC: Speed, Latency, and Scalability Explained

This article provides a comprehensive technical overview of InfiniBand, covering its rapid adoption in top supercomputers, detailed performance advantages such as ultra‑high bandwidth, CPU offload, sub‑microsecond latency, flexible scalability, QoS, SHARP acceleration, and a comparison with Ethernet, Fibre Channel, and Omni‑Path, while also outlining HDR switch and NIC product families.

Data CenterHDRHPC

0 likes · 20 min read

Why InfiniBand Dominates Modern HPC: Speed, Latency, and Scalability Explained

Java Architecture Diary

Apr 17, 2024 · Backend Development

Unlock Low-Latency IoT Messaging with mica-mqtt: A Java AIO MQTT Solution

mica-mqtt is a lightweight, low‑latency, high‑performance open‑source MQTT component built on Java AIO, offering full MQTT v3.1/v3.1.1/v5 support, WebSocket, REST API, client/server capabilities, clustering via Redis, GraalVM native compilation, Spring Boot starter integration, and extensive customization for IoT and messaging scenarios.

GraalVMSpring Bootlow-latency

0 likes · 6 min read

Unlock Low-Latency IoT Messaging with mica-mqtt: A Java AIO MQTT Solution

DataFunSummit

Feb 4, 2024 · Mobile Development

Advanced Mobile Audio Recording Techniques in Quanjian K‑Song: Low Latency, High Fidelity, and Intelligent Audio Processing

The article details how Quanjian K‑Song has built a comprehensive mobile‑focused audio recording system since 2014, covering low‑latency capture, high‑quality sampling, lyric and vocal‑accompaniment alignment, ear‑return, pitch shifting, vocal enhancement, 3A processing, and AI‑driven scoring to deliver a professional karaoke experience on smartphones.

AI scoringAudio Processingkaraoke technology

0 likes · 14 min read

Advanced Mobile Audio Recording Techniques in Quanjian K‑Song: Low Latency, High Fidelity, and Intelligent Audio Processing

Tencent Music Tech Team

Feb 4, 2024 · Mobile Development

Technical Guidelines for High-Quality Mobile Recording and Audio Processing in Quanmin K Song

Quanmin K Song’s decade‑long mobile‑recording platform combines 48 kHz/16‑bit dry‑signal capture, sub‑70 ms latency via OpenSL ES/AAudio, real‑time clipping and noise detection, lyric‑ and vocal‑accompaniment alignment, pitch‑shifting, adaptive vocal enhancement, 3A DSP/AI processing, and AI‑driven pitch correction to deliver industry‑leading high‑quality mobile singing experiences.

AIAudio ProcessingMusic App

0 likes · 15 min read

Technical Guidelines for High-Quality Mobile Recording and Audio Processing in Quanmin K Song

DataFunTalk

Jan 29, 2024 · Big Data

Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund

An ultra‑large hedge fund with over $10 billion AUM replaced ksqlDB and Flink with RisingWave, leveraging its PostgreSQL‑compatible streaming SQL to achieve sub‑10 ms latency, lower learning and operational costs, rich connectors, advanced operators, and comprehensive observability for real‑time trade data processing.

Data IntegrationRisingWaveSQL

0 likes · 10 min read

Case Study: Deploying RisingWave for Real-Time Stream Processing in a Large-Scale Quantitative Hedge Fund

JavaEdge

Dec 1, 2023 · Backend Development

Designing a Low‑Latency Typeahead Suggestion System with Trie and Distributed Architecture

This article explains how to build a real‑time typeahead (autocomplete) service that returns the most frequent query suggestions using an in‑memory Trie, sharding, offline aggregation, and caching to achieve sub‑20 ms latency, fault tolerance, and horizontal scalability.

Trieautocompletedistributed system

0 likes · 13 min read

Designing a Low‑Latency Typeahead Suggestion System with Trie and Distributed Architecture

JD Cloud Developers

Sep 12, 2023 · Backend Development

Why Netty’s Direct Memory Stalls on JDK 17: A Deep Dive into Low‑Latency Bottlenecks

An in‑depth analysis of the Tianwang risk‑control Lingji system reveals how JDK 17’s ZGC, Netty’s direct‑memory allocation, and cross‑data‑center channel limits caused severe latency spikes, memory growth, and CPU usage, and outlines the debugging steps and configuration changes that finally resolved the issue.

Nettydirect-memorylow-latency

0 likes · 16 min read

Why Netty’s Direct Memory Stalls on JDK 17: A Deep Dive into Low‑Latency Bottlenecks

Tencent Architect

Aug 21, 2023 · Fundamentals

How Tencent Cloud’s V265/TXAV1 Revolutionizes 8K Live Streaming

This article details Tencent Cloud's V265/TXAV1 live streaming solution, covering its high‑efficiency 8K and low‑latency capabilities, performance gains over X265, MV‑HEVC 3D compression, extensive engineering and algorithmic optimizations, and the resulting speed‑up and quality improvements for ultra‑high‑definition live broadcasts.

8KLive StreamingTXAV1

0 likes · 18 min read

How Tencent Cloud’s V265/TXAV1 Revolutionizes 8K Live Streaming

Architects' Tech Alliance

Aug 8, 2023 · Cloud Computing

Design Principles and Practices for High‑Performance AI Compute Center Networks

The article analyzes the limitations of traditional data‑center networking for AI compute workloads and presents high‑bandwidth, non‑blocking, low‑latency design solutions—including two‑layer and three‑layer fat‑tree architectures, AI‑Pool concepts, and recommended configurations—for building scalable, efficient intelligent computing clusters.

AI computeFat-TreeHigh Bandwidth

0 likes · 10 min read

Design Principles and Practices for High‑Performance AI Compute Center Networks

Kuaishou Audio & Video Technology

Jun 20, 2023 · Artificial Intelligence

How a Low‑Latency Hierarchical Fusion Network Beats Echoes in Real‑Time Calls

At ICASSP 2023, Kuaishou’s audio team presented a low‑latency hierarchical fusion network for full‑band acoustic echo cancellation, detailing its multi‑stage design, asymmetric windowing, loss functions, training strategy, and achieving second place in the non‑personalized AEC Challenge, with real‑world deployment results.

Acoustic Echo CancellationHierarchical Fusion NetworkSignal Processing

0 likes · 13 min read

How a Low‑Latency Hierarchical Fusion Network Beats Echoes in Real‑Time Calls

Alibaba Cloud Big Data AI Platform

Jun 5, 2023 · Artificial Intelligence

How Alibaba’s DGS Enables Real‑Time GNN Inference on Massive Dynamic Graphs

The Dynamic Graph Sampling (DGS) service, built on GraphLearn, delivers sub‑20 ms latency for real‑time GNN inference on large, constantly evolving graphs by separating storage from computation, using event‑driven pre‑sampling, lazy multi‑hop concatenation, and a publish‑subscribe architecture that scales linearly across distributed workers.

Alibaba CloudGraph Neural NetworksGraphLearn

0 likes · 12 min read

How Alibaba’s DGS Enables Real‑Time GNN Inference on Massive Dynamic Graphs

Mike Chen's Internet Architecture

Feb 1, 2023 · Fundamentals

Understanding ZGC: The Next‑Generation Low‑Latency Java Garbage Collector

ZGC, the Z Garbage Collector introduced in JDK 11, is a low‑latency, region‑based JVM garbage collector that uses colored pointers, load barriers, and multi‑mapping to achieve sub‑10 ms pause times across heap sizes up to several terabytes, with concurrent marking, relocation, and three brief STW phases.

Garbage CollectionJVMJava

0 likes · 8 min read

Understanding ZGC: The Next‑Generation Low‑Latency Java Garbage Collector

Rare Earth Juejin Tech Community

Jan 16, 2023 · Operations

Low‑Latency Video Streaming Optimizations for Douyin During the World Cup

This article details the end‑to‑end low‑latency video streaming architecture, measurement methods, and optimization techniques—including FLV‑2s, RTM, MiniSDP, buffer‑driven speed‑up, and CDN strategies—that Douyin's Volcano Engine video cloud employed to achieve sub‑2‑second latency and high QoE for World Cup live broadcasts.

FLVOptimizationQoE

0 likes · 21 min read

Low‑Latency Video Streaming Optimizations for Douyin During the World Cup

Bilibili Tech

Jan 13, 2023 · Cloud Computing

Design and Implementation of Bilibili's Low‑Latency Cloud Gaming Platform Using WebRTC

Bilibili built a cross‑platform cloud‑gaming service that leverages WebRTC with tuned jitter buffers, unordered data channels, adaptive input‑report rates, and a custom kernel driver to deliver sub‑100 ms latency, dynamic bitrate control, and haptic feedback, overcoming typical latency, stutter, and flexibility limitations of existing solutions.

WebRTCadaptive bitratecloud gaming

0 likes · 14 min read

Design and Implementation of Bilibili's Low‑Latency Cloud Gaming Platform Using WebRTC

NetEase Smart Enterprise Tech+

Jan 6, 2023 · Cloud Computing

How NetEase’s Ultra‑Low‑Latency Cloud Architecture Powers Remote Excavators

NetEase and China Construction 8th Bureau teamed up to create high‑altitude excavation robots that rely on NetEase Cloud Sign's ultra‑low‑latency global transmission, edge deployment, intelligent scheduling, WE‑CAN routing, multiplexed signaling, and advanced QoS techniques to achieve sub‑150 ms control and reliable video streaming in challenging network environments.

Cloud NetworkingQoSedge computing

0 likes · 12 min read

How NetEase’s Ultra‑Low‑Latency Cloud Architecture Powers Remote Excavators

Zuoyebang Tech Team

Nov 17, 2022 · Artificial Intelligence

Scaling Deep Learning Model Serving: High‑Concurrency, Low‑Latency Solutions

This article examines the challenges of deploying dozens of deep‑learning models at Zuoyebang and compares three serving architectures—Gunicorn + Flask + Transformers, Tornado + PyTorch, and Tornado + Triton—highlighting performance trade‑offs and presenting a final high‑concurrency, low‑latency solution in production.

High concurrencyModel DeploymentTriton

0 likes · 11 min read

Scaling Deep Learning Model Serving: High‑Concurrency, Low‑Latency Solutions

Bilibili Tech

Nov 11, 2022 · Backend Development

Real-Time Audio/Video System Architecture and Key Technologies Based on WebRTC

The article surveys the evolution of live streaming toward low‑latency, interactive scenarios and details WebRTC‑based real‑time audio/video system design, covering RTP/UDP transport, FEC and ARQ loss recovery, congestion control, jitter buffering, echo cancellation, edge‑node path optimization, and a multi‑layer architecture with signaling, routing, mixing services for scalable, high‑availability PK deployments.

Congestion ControlMedia ServerReal-time Streaming

0 likes · 16 min read

Real-Time Audio/Video System Architecture and Key Technologies Based on WebRTC

phodal

Oct 24, 2022 · Industry Insights

Unlocking Ultra-Fast Systems: Key Patterns Behind Low‑Latency Architecture

This article provides a comprehensive overview of low‑latency architecture, covering network hardware, system‑level programming strategies, language choices, memory management techniques, event‑driven designs, high‑performance data structures, and visualization approaches for building ultra‑fast computing systems.

Event-Driven ArchitectureHigh-performance computingJava performance

0 likes · 10 min read

Unlocking Ultra-Fast Systems: Key Patterns Behind Low‑Latency Architecture

Refining Core Development Skills

Oct 24, 2022 · Fundamentals

Low‑Latency Network Architecture for High‑Frequency Trading

This article explains how high‑frequency trading firms achieve ultra‑low network latency by combining proximity deployment, dedicated links, microwave transmission, InfiniBand, low‑latency switches, kernel bypass, RDMA, TCP offload engines and FPGA acceleration, and summarizes the impact of each technique on overall request latency.

FPGAInfiniBandKernel Bypass

0 likes · 16 min read

Low‑Latency Network Architecture for High‑Frequency Trading

Qunar Tech Salon

Oct 21, 2022 · Backend Development

Root Cause Analysis and ZGC Optimization for a High‑Concurrency Ticket Pricing Service

This article details the investigation of a 2% timeout rate in a billion‑request‑per‑day ticket pricing service, identifies GC‑induced stop‑the‑world pauses as the main cause, and demonstrates how switching from ParNew+CMS to G1 and finally to ZGC dramatically reduces latency and timeout rates.

DubboGarbage CollectionJava

0 likes · 24 min read

Root Cause Analysis and ZGC Optimization for a High‑Concurrency Ticket Pricing Service

WeChat Backend Team

Aug 5, 2022 · Artificial Intelligence

How WeChat’s Ekko Achieves Ultra‑Low‑Latency Model Updates for Billion‑User Recommendations

At the 16th OSDI conference, Tencent’s WeChat team presented the award‑winning Ekko system—a groundbreaking, ultra‑low‑latency model‑update solution for massive recommendation workloads that dramatically speeds up updates, supports over a trillion‑scale models, and has already boosted user engagement across billions of daily users.

Large ScaleRecommendation SystemsWeChat

0 likes · 5 min read

How WeChat’s Ekko Achieves Ultra‑Low‑Latency Model Updates for Billion‑User Recommendations

Huawei Cloud Developer Alliance

May 26, 2022 · Cloud Computing

How Huawei SparkRTC Delivers Global Low‑Latency Audio/Video: Network Architecture Secrets

In a May 16 live session, Huawei Cloud’s chief architect MuNa explained the goals of media networking, detailed the SparkRTC global network architecture—including overlay/underlay layers and low‑latency techniques—and revealed nine key strategies such as end‑to‑end transmission, AI‑driven routing, multi‑path access, and edge acceleration, while also reminding developers to register for the ongoing competition.

Cloud ComputingNetwork ArchitectureReal‑time communication

0 likes · 5 min read

How Huawei SparkRTC Delivers Global Low‑Latency Audio/Video: Network Architecture Secrets

IT Architects Alliance

May 23, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP for AI and High‑Performance Storage

The article analyzes how the AI boom and high‑performance SSD storage demand sub‑microsecond latency, exposing TCP/IP’s inherent context‑switch and CPU overhead, and explains why RDMA’s kernel‑bypass, zero‑copy design and 1 µs latency make it the preferred network stack for modern data‑center workloads despite challenges in Ethernet deployment.

AI computingData Center NetworkDistributed storage

0 likes · 11 min read

Why RDMA Is Replacing TCP/IP for AI and High‑Performance Storage

Architects' Tech Alliance

May 19, 2022 · Fundamentals

An Introduction to RDMA: Concepts, Advantages, Protocols, and Programming Basics

This article explains the fundamentals of Remote Direct Memory Access (RDMA), comparing it with traditional networking, outlining its core advantages, suitable use cases, the three main RDMA protocols (Infiniband, RoCE, iWARP), deployment requirements, communication flow, and essential programming concepts.

RDMARoCEhigh‑performance networking

0 likes · 9 min read

An Introduction to RDMA: Concepts, Advantages, Protocols, and Programming Basics

ITPUB

May 12, 2022 · Databases

How Aerospike Delivers Millisecond Latency on TB‑Scale Data

This article explains how Aerospike, a high‑performance NoSQL database, achieves millisecond‑level query latency on terabyte‑scale datasets by using a hybrid storage architecture, multi‑level storage tiers, flash optimizations, and a flexible ecosystem that supports diverse real‑time use cases.

AerospikeHybrid storageNoSQL

0 likes · 14 min read

How Aerospike Delivers Millisecond Latency on TB‑Scale Data

High Availability Architecture

Apr 22, 2022 · Artificial Intelligence

BIGO RTC: High‑Quality, Low‑Cost Real‑Time Communication through Core Operators and Scene Adaptation

The article explains how BIGO RTC achieves high‑quality, low‑cost real‑time audio‑video communication by optimizing core video operators such as HEVC encoding, AI‑driven super‑resolution and HDR, and by employing scene‑adaptive techniques like device performance tuning, content‑adaptive encoding and AI‑based pre‑processing to meet diverse latency constraints.

AI AdaptationHDRHigh Quality

0 likes · 9 min read

BIGO RTC: High‑Quality, Low‑Cost Real‑Time Communication through Core Operators and Scene Adaptation

Zuoyebang Tech Team

Apr 21, 2022 · Backend Development

How to Build a High‑Concurrency, Low‑Latency Live Streaming System for Online Education

This article details the design and implementation of a self‑developed interactive live‑streaming platform that supports massive concurrent users and ultra‑low latency for online education, covering business scenarios, technical abstractions, key low‑latency and high‑concurrency techniques, and real‑world performance results.

High concurrencyKCPLive Streaming

0 likes · 16 min read

How to Build a High‑Concurrency, Low‑Latency Live Streaming System for Online Education

Ops Development Stories

Apr 20, 2022 · Fundamentals

Understanding ZGC: Low‑Latency Garbage Collection in Java

ZGC (Z Garbage Collector) is a scalable, low‑latency collector introduced in JDK 11, featuring sub‑millisecond pause times, region‑based memory layout, colored pointers, NUMA awareness, and concurrent phases, with detailed explanations of its architecture, parameters, triggers, and log analysis.

Garbage CollectionJVMJava

0 likes · 17 min read

Understanding ZGC: Low‑Latency Garbage Collection in Java

Ops Development Stories

Apr 19, 2022 · Backend Development

Unlock Sub-Millisecond Pauses: Deep Dive into Java’s Z Garbage Collector (ZGC)

ZGC, introduced in JDK 11 and stabilized in JDK 15, is a low‑latency, region‑based, concurrent garbage collector that aims for sub‑millisecond pause times regardless of heap size, leveraging colored pointers, NUMA awareness, multi‑mapping, and read barriers to achieve efficient memory reclamation.

Garbage CollectionJVMJava

0 likes · 17 min read

Unlock Sub-Millisecond Pauses: Deep Dive into Java’s Z Garbage Collector (ZGC)

JD Cloud Developers

Mar 31, 2022 · Fundamentals

Why SRT Is Revolutionizing Low-Latency Live Streaming: A Technical Deep Dive

Secure Reliable Transport (SRT) is an open‑source, UDP‑based protocol that overcomes TCP’s latency and jitter issues in long‑distance live streaming by using ARQ retransmission, FEC, and AES/TLS encryption, enabling high‑quality, low‑delay video delivery even in poor network conditions.

FECLive StreamingSRT

0 likes · 8 min read

Why SRT Is Revolutionizing Low-Latency Live Streaming: A Technical Deep Dive

Architects' Tech Alliance

Mar 13, 2022 · Industry Insights

Why RDMA Is Replacing TCP/IP in AI-Driven Data Centers

The article analyzes how the AI era’s demand for ultra‑low latency and high throughput exposes fundamental limits of the traditional TCP/IP stack, and explains why RDMA’s kernel‑bypass, zero‑copy design, and emerging congestion‑control algorithms are becoming the preferred network fabric for modern data‑center workloads.

AI FabricData CenterDistributed Computing

0 likes · 12 min read

Why RDMA Is Replacing TCP/IP in AI-Driven Data Centers

Tencent Architect

Mar 11, 2022 · Cloud Computing

How Tencent Cloud’s Ultra‑Low‑Latency Live Streaming Redefined Real‑Time Video

This article explores how Tencent Cloud leveraged WebRTC and innovative engineering to create an ultra‑low‑latency live streaming solution that outperforms traditional CDN‑based streams, detailing the technical challenges, product decisions, and market impact that have driven rapid adoption across e‑commerce, education, and entertainment sectors.

Live StreamingTencent CloudWebRTC

0 likes · 10 min read

How Tencent Cloud’s Ultra‑Low‑Latency Live Streaming Redefined Real‑Time Video

vivo Internet Technology

Jan 26, 2022 · Industry Insights

Inside Vivo’s Live Streaming Platform: Architecture, Challenges, and Solutions

The article outlines Vivo's live streaming platform evolution, detailing its business scenarios, underlying architecture, key technical challenges such as beautification, messaging, latency, and cost, and the concrete solutions implemented across entertainment, interactive, and corporate event live streams.

Industry insightInteractiveLive Streaming

0 likes · 19 min read

Inside Vivo’s Live Streaming Platform: Architecture, Challenges, and Solutions

IT Architects Alliance

Dec 19, 2021 · Information Security

Understanding the Metaverse from a Network Perspective: Concepts, Technical Implications, and Emerging Challenges

The article analyses the metaverse as a network-driven evolution, explaining its definition, classifying terminal types, and detailing the new security, ultra‑low‑latency, and deterministic quality‑of‑service requirements that challenge existing internet architecture and design principles.

InternetNetworkQoS

0 likes · 17 min read

Understanding the Metaverse from a Network Perspective: Concepts, Technical Implications, and Emerging Challenges

Alibaba Terminal Technology

Dec 6, 2021 · Backend Development

How We Achieved Low‑Latency, High‑Definition Multi‑Angle Live Streaming with WebRTC

This article details the design and implementation of a low‑latency, high‑definition multi‑angle live streaming solution using WebRTC, covering protocol selection, system architecture, edge commands, client integration, performance optimizations, and lessons learned from deploying the feature in a large‑scale live event.

Live StreamingVideo EncodingWebRTC

0 likes · 15 min read

How We Achieved Low‑Latency, High‑Definition Multi‑Angle Live Streaming with WebRTC

Java Architecture Diary

Nov 29, 2021 · Backend Development

Unlock High‑Performance IoT Messaging with mica‑mqtt: Low‑Latency Java MQTT Framework

The mica‑mqtt library, built on t‑io, offers a simple, low‑latency, high‑performance open‑source MQTT solution for IoT, supporting MQTT v3.1/v3.1.1/v5, WebSocket sub‑protocol, HTTP API, clustering via Redis, Spring Boot starter integration, Prometheus/Grafana monitoring, and provides extensive client and server interfaces with code examples.

JavaSpring Bootlow-latency

0 likes · 10 min read

Unlock High‑Performance IoT Messaging with mica‑mqtt: Low‑Latency Java MQTT Framework

Tencent Cloud Developer

Nov 26, 2021 · Cloud Computing

5G Remote Real-Time Control: Key Challenges, Enabling Technologies, and System Architectures

5G’s high bandwidth, low latency, and edge offloading empower real‑time remote control of complex equipment, demanding synchronized video, audio, and control signals, which are addressed by real‑time AV communication, control‑signal synchronization, and network optimization, while evolving system architectures—from direct‑connection models to integrated fusion designs—ensure scalability, reliability, and minimal latency for applications in mines, ports, logistics, and autonomous vehicles.

5GIoTReal‑time communication

0 likes · 12 min read

5G Remote Real-Time Control: Key Challenges, Enabling Technologies, and System Architectures

Java Architecture Diary

Oct 18, 2021 · Backend Development

Unlock Low-Latency MQTT with mica-mqtt: A High-Performance Java Component

This guide introduces mica-mqtt, a simple, low‑latency, high‑performance MQTT open‑source component built on t‑io, outlines its extensive feature set, provides Spring Boot and plain Java integration steps, showcases configuration examples, and explains monitoring and clustering capabilities for IoT back‑end development.

JavaSpring Bootlow-latency

0 likes · 8 min read

Unlock Low-Latency MQTT with mica-mqtt: A High-Performance Java Component

Architects' Tech Alliance

Sep 10, 2021 · Artificial Intelligence

FPGA Technology for Compute‑Intensive and Communication‑Intensive Tasks in Data Centers

The article examines how FPGA’s pipeline parallel architecture provides latency‑critical advantages over CPU and GPU for both compute‑intensive workloads such as matrix operations and AI inference, and communication‑intensive tasks like encryption and high‑throughput networking, while also discussing deployment models, power efficiency, eFPGA trends, and the evolving Chinese FPGA market.

AICloud ComputingData Center

0 likes · 14 min read

FPGA Technology for Compute‑Intensive and Communication‑Intensive Tasks in Data Centers

Youku Technology

Sep 8, 2021 · Industry Insights

How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback

This article examines Youku's technical team's systematic approach to improving video playback by tackling slow start‑up, error handling, network stalls, audio‑video sync, and overall stability through metric‑driven optimization, adaptive streaming protocols, device capability databases, and forward‑looking features like low latency and high‑frame‑rate streaming.

low-latencymedia engineeringperformance monitoring

0 likes · 12 min read

How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback

MaGe Linux Operations

Jul 14, 2021 · Cloud Computing

What Is Intelligent Edge Computing and Why It Matters Today

Intelligent edge computing extends traditional edge architectures by embedding AI-driven data analysis and machine learning directly at the network edge, enabling faster, low‑latency decisions for use cases like smart cameras and autonomous vehicles, and is rapidly becoming a core strategy for modern enterprises.

IoTedge computingintelligent edge

0 likes · 6 min read

What Is Intelligent Edge Computing and Why It Matters Today

Huawei Cloud Developer Alliance

Jun 21, 2021 · Cloud Computing

How Huawei Cloud Achieves Ultra‑Low Latency for E‑Commerce Live Streaming

This article examines the technical challenges of massive e‑commerce live streams, explains why traditional streaming suffers from high latency, and details Huawei Cloud's low‑latency architecture, protocols, intelligent scheduling and ultra‑fast transcoding that enable sub‑second product link delivery to millions of viewers.

CDNCloud ComputingLive Streaming

0 likes · 13 min read

How Huawei Cloud Achieves Ultra‑Low Latency for E‑Commerce Live Streaming

Tencent Cloud Middleware

Jun 17, 2021 · Fundamentals

How Tencent Kona JDK 11 ZGC Delivers Millisecond‑Level GC Pauses for Real‑Time Services

Tencent's Kona JDK 11 introduces a production‑ready ZGC implementation that reduces Java garbage‑collection stop‑the‑world pauses to under 10 ms, enabling ultra‑low‑latency online services across massive heaps while maintaining acceptable throughput, and the article details its design, tuning, and real‑world deployments.

Garbage CollectionJava performanceTencent Kona JDK

0 likes · 28 min read

How Tencent Kona JDK 11 ZGC Delivers Millisecond‑Level GC Pauses for Real‑Time Services

Alibaba Cloud Developer

Jun 4, 2021 · Backend Development

How to Build a Low‑Latency Timeout Center with Redis: Architecture and Design

This article explains the drawbacks of traditional high‑latency timeout centers and presents a Redis‑based low‑latency design, detailing task storage, scheduling, topic and queue structures, two‑phase consumption, retry control, and the resulting performance and reliability benefits.

Distributed SchedulingRedisbackend

0 likes · 13 min read

How to Build a Low‑Latency Timeout Center with Redis: Architecture and Design

NetEase Smart Enterprise Tech+

Jun 4, 2021 · Backend Development

How WE‑CAN Revolutionizes Global Data Transmission with Ultra‑Low Latency and Cost‑Effective Architecture

WE‑CAN, NetEase Cloud Communication's self‑developed large‑scale distributed transmission network, dramatically improves end‑to‑end data quality, reduces costs, and supports diverse real‑time communication scenarios through intelligent routing, layered decoupling, and global edge deployment, as demonstrated by extensive production metrics.

Cloud NetworkingReal‑time communicationdistributed systems

0 likes · 15 min read

How WE‑CAN Revolutionizes Global Data Transmission with Ultra‑Low Latency and Cost‑Effective Architecture

Kuaishou Large Model

May 28, 2021 · Mobile Development

How to Eliminate High‑Latency Decoding in Moonlight Android for Smooth Game Streaming

This article explains the technical challenges of high‑latency video decoding in Android game streaming with Moonlight, analyzes why it occurs, and provides practical solutions—including API choices, immediate buffer release, frame ordering, and low‑latency vendor configurations—to achieve smoother, low‑delay gameplay.

AndroidC languageMediaCodec

0 likes · 10 min read

How to Eliminate High‑Latency Decoding in Moonlight Android for Smooth Game Streaming

UCloud Tech

Apr 8, 2021 · Fundamentals

What Is RTC? Understanding Real-Time Communications and WebRTC Advantages

This article explains the fundamentals of RTC, distinguishes it from traditional streaming, highlights WebRTC's cross‑platform, plugin‑free benefits, and shows how low‑latency real‑time communication can enhance interactive applications.

RTCReal‑time communicationWebRTC

0 likes · 4 min read

What Is RTC? Understanding Real-Time Communications and WebRTC Advantages

Selected Java Interview Questions

Mar 11, 2021 · Fundamentals

Understanding ZGC: A Low‑Latency Garbage Collector for Java

The article explains ZGC, Oracle's low‑latency Java garbage collector introduced in JDK 11, covering its dynamic region layout, colored pointer technique, tri‑color marking, memory multi‑mapping, read barriers, operation phases, performance characteristics, and its main drawback of floating garbage.

Concurrent MarkingGarbage CollectionJava

0 likes · 11 min read

Understanding ZGC: A Low‑Latency Garbage Collector for Java

Tencent Cloud Developer

Jan 12, 2021 · Cloud Computing

Key Technologies and Applications of Tencent Real-Time Audio‑Video (TRTC)

Tencent's Real‑Time Audio‑Video (TRTC) platform delivers sub‑second, cross‑platform RTC for up to 300 participants and 100 000 viewers, using UDP‑based NACK/FEC, dynamic QoS, AI‑enhanced video/audio effects, and cloud‑driven QoE optimization to support diverse scenarios such as online meetings, education, live streaming, and industry‑specific applications.

Audio-VideoCloud ServicesEcho Cancellation

0 likes · 16 min read

Key Technologies and Applications of Tencent Real-Time Audio‑Video (TRTC)

Architects' Tech Alliance

Jan 10, 2021 · Industry Insights

Why RoCE Is Revolutionizing Data Center Networking: A Deep Dive into RDMA over Ethernet

This article explains the fundamentals of RDMA and RoCE, compares RoCE v1 and v2, outlines deployment steps, highlights performance benefits such as low CPU usage and zero‑copy, and answers common questions about its differences from iWARP and InfiniBand, helping data‑center engineers evaluate the technology.

Data Center NetworkingEthernetHigh Bandwidth

0 likes · 8 min read

Why RoCE Is Revolutionizing Data Center Networking: A Deep Dive into RDMA over Ethernet

Didi Tech

Dec 21, 2020 · Big Data

HBase Availability and Latency Optimizations: Replication‑Based Multi‑Read and ZGC Adoption

To overcome HBase’s weak availability and GC‑induced latency spikes, the DiDi team introduced a replication‑based client multi‑read (hedged‑read) mechanism and migrated to the Z Garbage Collector, which together dramatically cut maximum and 99.9th‑percentile latencies while keeping services online during region disruptions.

Big DataHBaseMulti-Read

0 likes · 12 min read

HBase Availability and Latency Optimizations: Replication‑Based Multi‑Read and ZGC Adoption

High Availability Architecture

Nov 5, 2020 · Backend Development

Why We Chose Java for Our High‑Frequency Trading Application

The article explains how a high‑frequency trading firm evaluated Java versus C++ for ultra‑low‑latency trading, discusses the challenges of JVM JIT compilation and garbage‑collection pauses, and shows how Azul Zing’s C4 collector delivers near‑C++ latency while preserving Java’s development productivity.

Azul ZingGarbage CollectionJVM

0 likes · 11 min read

Why We Chose Java for Our High‑Frequency Trading Application

Amap Tech

Oct 30, 2020 · Mobile Development

Video Streaming Solution for the ARC Car Cloud Control Platform

The ARC Car Cloud Control platform now streams the vehicle’s screen using Android’s Virtual Display and a C++‑based H.264 hardware encoder, sending raw video over a TCP socket to a server that adaptively adjusts bitrate and frame rate, while the web client decodes the fragmented MP4 via MSE, dramatically lowering CPU usage and latency on low‑end head‑units.

AndroidH.264Media Source Extensions

0 likes · 8 min read

Video Streaming Solution for the ARC Car Cloud Control Platform

Youku Technology

Aug 18, 2020 · Backend Development

How Youku Engineered a High‑Performance, Low‑Latency Marketing Platform

This article details Youku's membership marketing system architecture, covering complex marketing scenarios, high‑availability and low‑latency requirements, rule‑based QL engine, unified marketing framework, multi‑cache storage, multithreaded matching, asynchronous reward distribution, and distributed transaction mechanisms.

CachingHigh AvailabilitySystem Design

0 likes · 12 min read

How Youku Engineered a High‑Performance, Low‑Latency Marketing Platform

High Availability Architecture

Aug 11, 2020 · Operations

Understanding and Optimizing ZGC (Z Garbage Collector) for Low‑Latency Java Services

This article examines the Z Garbage Collector (ZGC) introduced in JDK 11, detailing its low‑pause design goals, underlying concurrent marking‑copy algorithm, colored pointer and read‑barrier techniques, practical tuning parameters, real‑world case studies, and the performance impact of upgrading from CMS/G1 to ZGC in high‑throughput, low‑latency services.

Garbage CollectionJVMJava

0 likes · 28 min read

Understanding and Optimizing ZGC (Z Garbage Collector) for Low‑Latency Java Services

Meituan Technology Team

Aug 6, 2020 · Backend Development

ZGC: Principles, Tuning Practices, and Production Upgrade Experience

The article explains how Meituan’s risk‑control platform eliminated frequent 40 ms CMS pauses by adopting JDK 11’s ZGC—detailing its concurrent mark‑copy design, practical tuning parameters, real‑world case fixes, and measured latency reductions of up to 74 % while noting trade‑offs.

Garbage CollectionJDK11Java

0 likes · 27 min read

ZGC: Principles, Tuning Practices, and Production Upgrade Experience

Tencent Tech

Jun 18, 2020 · Backend Development

Scaling Live‑Ecommerce Platforms: Architecture Behind Billions of Users

This article examines the rapid rise of live‑ecommerce during the 618 shopping festival, explains why the “live + ecommerce” model demands robust backend, streaming and CDN infrastructure, and details Tencent Cloud’s architectural solutions—including media processing, low‑latency protocols, bandwidth optimization and anti‑attack measures—to support massive concurrent traffic.

Live Streamingarchitecturee‑commerce

0 likes · 10 min read

Scaling Live‑Ecommerce Platforms: Architecture Behind Billions of Users

Tencent Tech

Jun 2, 2020 · Cloud Computing

How SRT Enables Low‑Latency, Reliable Live Streaming for Global eSports Events

The article explains how the Secure Reliable Transport (SRT) protocol, combined with Tencent Video Cloud’s optimized infrastructure, overcame latency and packet‑loss challenges to deliver stable, high‑quality live streams for the 2020 LPL Mid‑Season Cup and other large‑scale events, and describes its broader applications through MediaConnect.

Live StreamingMediaConnectNetwork Transmission

0 likes · 10 min read

How SRT Enables Low‑Latency, Reliable Live Streaming for Global eSports Events

Programmer DD

May 22, 2020 · Backend Development

Can ZGC Deliver Sub‑10ms Pauses for Massive Java Heaps?

This article explains the design goals, architecture, key features, tuning options, and version history of Java's Z Garbage Collector (ZGC), highlighting its sub‑10 ms pause times for terabyte‑scale heaps, its use of colored pointers and load barriers, and the trade‑offs in throughput and configuration.

Garbage CollectionJVMJava

0 likes · 16 min read

Can ZGC Deliver Sub‑10ms Pauses for Massive Java Heaps?

Tencent Cloud Developer

May 21, 2020 · Game Development

How Tencent’s Game Server Engine Tackles Low Latency and Cost in Multiplayer Games

This article analyzes the challenges of low‑latency, stable, and cost‑effective online multiplayer games and explains how Tencent's Game Server Engine (GSE) provides elastic scaling, near‑by scheduling, stateful shrinkage, multi‑region disaster recovery, and zero‑downtime updates to meet those demands.

Elastic ScalingTencent GSEcloud gaming

0 likes · 11 min read

How Tencent’s Game Server Engine Tackles Low Latency and Cost in Multiplayer Games

Big Data Technology Architecture

May 10, 2020 · Big Data

Understanding Apache Hudi: Incremental Processing and Low‑Latency Data Management on Hadoop

This article explains how Apache Hudi provides an incremental processing framework that enables efficient, low‑latency data ingestion, storage, and query capabilities on Hadoop, detailing its architecture, storage layout, compaction, write and read paths, and support for real‑time and batch analytics.

HadoopHudidata ingestion

0 likes · 15 min read

Understanding Apache Hudi: Incremental Processing and Low‑Latency Data Management on Hadoop

iQIYI Technical Product Team

Apr 24, 2020 · Cloud Computing

Technical Insights into Cloud Gaming Advertising Trials: Low‑Latency RTCDN Solutions by iQIYI Live Cloud

In an interview, iQIYI Live Cloud’s Chen Kunzhong explains how their WebRTC‑based RTCDN reduces encoding and transmission delay to achieve roughly 100 ms end‑to‑end latency for cloud‑gaming ads, supporting cross‑device play, instant click‑to‑play sessions, and future 5G‑enhanced high‑resolution experiences.

5GAdvertisingRTCDN

0 likes · 9 min read

Technical Insights into Cloud Gaming Advertising Trials: Low‑Latency RTCDN Solutions by iQIYI Live Cloud