Tagged articles
142 articles
Page 1 of 2
AI Engineering
AI Engineering
May 8, 2026 · Artificial Intelligence

How GPT‑Realtime‑2 Leverages GPT‑5‑Level Reasoning to Redefine Voice AI Architecture

OpenAI’s GPT‑Realtime‑2 embeds GPT‑5‑class reasoning into a continuous‑audio loop, achieving 96.6% accuracy on Big Bench Audio, offering adjustable inference intensity with latency from 1.12 s to 2.33 s, a 128 K context window, and demonstrable gains in real‑world call success rates, while prompting industry debate over pricing and competitive impact.

GPT-5GPT-Realtime-2Latency
0 likes · 5 min read
How GPT‑Realtime‑2 Leverages GPT‑5‑Level Reasoning to Redefine Voice AI Architecture
Lobster Programming
Lobster Programming
May 6, 2026 · Backend Development

How to Choose the Right MQ: RabbitMQ vs RocketMQ vs Kafka

This article compares RabbitMQ, RocketMQ, and Kafka on throughput, latency, scalability, and reliability, outlining each system's core features and recommending suitable scenarios such as reliable messaging, high‑performance streaming, and large‑scale real‑time data processing.

KafkaLatencyMessage Queue
0 likes · 6 min read
How to Choose the Right MQ: RabbitMQ vs RocketMQ vs Kafka
Raymond Ops
Raymond Ops
Apr 25, 2026 · Databases

How to Reduce MySQL Master‑Slave Replication Lag from 30 seconds to Milliseconds

This article walks through the root causes of MySQL master‑slave replication delay, demonstrates step‑by‑step diagnostics using SHOW SLAVE STATUS, pt‑heartbeat, and binlog comparisons, and provides concrete configuration changes, query rewrites, hardware upgrades, and monitoring scripts that can shrink lag from dozens of seconds to sub‑millisecond levels.

LatencyReplicationmonitoring
0 likes · 23 min read
How to Reduce MySQL Master‑Slave Replication Lag from 30 seconds to Milliseconds
dbaplus Community
dbaplus Community
Apr 22, 2026 · Fundamentals

Is UDP Really Faster Than TCP? A Deep Dive into Their Speed Trade‑offs

This article analytically compares UDP and TCP, detailing UDP’s minimalist design, low‑latency advantages, lack of congestion control and broadcast capabilities, while also explaining TCP’s reliability, congestion‑aware throughput and suitability for large or critical data, helping readers choose the optimal protocol for specific scenarios.

LatencyNetwork ProtocolsTCP
0 likes · 12 min read
Is UDP Really Faster Than TCP? A Deep Dive into Their Speed Trade‑offs
Machine Heart
Machine Heart
Apr 18, 2026 · Artificial Intelligence

Eliminating ‘Think‑Then‑Act’ Stalls: StreamingVLA Boosts VLA Speed by 2.4×

StreamingVLA introduces action‑flow matching and adaptive early observation to parallelize generation, execution, and perception in vision‑language‑action models, cutting per‑action latency from 49.9 ms to 31.6 ms, reducing stall time 6.5‑fold, and achieving up to 2.4× end‑to‑end speedup in LIBERO benchmarks and real‑world robot tests.

LIBEROLatencyParallel Execution
0 likes · 13 min read
Eliminating ‘Think‑Then‑Act’ Stalls: StreamingVLA Boosts VLA Speed by 2.4×
Weekly Large Model Application
Weekly Large Model Application
Mar 17, 2026 · Artificial Intelligence

Essential Features Every Voice Interaction System Must Support

The article provides a comprehensive analysis of core voice interaction system capabilities—including barge‑in, turn‑taking, multi‑turn dialogue, intent recognition, speaker identification, streaming latency, noise robustness, multilingual support, emotion handling, personalization, security, and deployment considerations—highlighting typical scenarios such as smart speakers, in‑car assistants, call centers, and meeting transcription.

ASRLatencyTTS
0 likes · 11 min read
Essential Features Every Voice Interaction System Must Support
Java Tech Enthusiast
Java Tech Enthusiast
Feb 24, 2026 · Fundamentals

Why a Successful Ping Doesn’t Guarantee a Healthy Network

Ping may confirm that a device is reachable, but its simple ICMP test hides many network complexities—such as bandwidth limits, firewall rules, and service failures—so a successful ping alone cannot guarantee overall network health, and understanding its five-step process reveals its inherent limitations.

ICMPLatencyPacket Loss
0 likes · 14 min read
Why a Successful Ping Doesn’t Guarantee a Healthy Network
Programmer DD
Programmer DD
Feb 3, 2026 · Artificial Intelligence

Build Reliable AI Agent Systems: Boost Accuracy 50% While Controlling Cost & Latency

This guide explains how to construct production‑ready AI agent systems by balancing cost, latency, and accuracy, offering a decision framework, concrete techniques such as planner‑executor architecture, chain‑of‑thought prompting, verification agents, parallel agents, and file‑system state management, plus real‑world examples and impact metrics.

AI agentsCost OptimizationLatency
0 likes · 21 min read
Build Reliable AI Agent Systems: Boost Accuracy 50% While Controlling Cost & Latency
NiuNiu MaTe
NiuNiu MaTe
Jan 28, 2026 · Fundamentals

Why a Successful Ping Doesn’t Prove Your Network Is Healthy – A Deep Dive into ICMP Mechanics

This article demystifies the ping command by explaining the ICMP protocol, interpreting TTL, latency and packet‑loss metrics, detailing the five‑step process from DNS lookup to reply, and highlighting ping’s inherent limitations such as its inability to gauge bandwidth, application‑layer issues, or firewall restrictions.

ICMPLatencyNetwork Diagnostics
0 likes · 13 min read
Why a Successful Ping Doesn’t Prove Your Network Is Healthy – A Deep Dive into ICMP Mechanics
FunTester
FunTester
Jan 20, 2026 · Fundamentals

Why Data Movement, Not CPU Speed, Is the Real Performance Bottleneck

Most engineers blame slow CPUs for performance issues, but the true bottleneck is often data latency—from registers and caches to DRAM, NUMA nodes, disks, and networks—so understanding and minimizing data movement is key to reducing tail latency and improving system performance.

LatencyNUMASystems
0 likes · 11 min read
Why Data Movement, Not CPU Speed, Is the Real Performance Bottleneck
Raymond Ops
Raymond Ops
Jan 8, 2026 · Operations

Boost Linux Network Performance: Proven Tips to Increase Bandwidth & Reduce Latency

This article provides a comprehensive guide to Linux network performance tuning, covering key metrics, practical commands for adjusting TCP parameters, congestion control, kernel optimizations, hardware choices, zero‑copy techniques, load balancing, and essential monitoring tools to achieve higher bandwidth and lower latency.

LatencyLinuxTCP Tuning
0 likes · 11 min read
Boost Linux Network Performance: Proven Tips to Increase Bandwidth & Reduce Latency
IT Services Circle
IT Services Circle
Nov 16, 2025 · Fundamentals

Why Optical Communication Beats Electrical: Speed, Latency, Power, Cost & Security

This article explains how optical communication outperforms traditional electrical transmission by offering dramatically higher bandwidth, lower latency, reduced power consumption, stronger interference immunity, lower cost, and enhanced security, all rooted in the physics of light and modern fiber‑optic technologies.

Latencybandwidthcost efficiency
0 likes · 8 min read
Why Optical Communication Beats Electrical: Speed, Latency, Power, Cost & Security
Tech Stroll Journey
Tech Stroll Journey
Oct 30, 2025 · Operations

How to Use fio to Measure Disk IOPS, Throughput, and Latency on Ubuntu

This guide explains how to install fio on Ubuntu 20.04, configure test environments, run IOPS and latency benchmarks with specific parameters, and interpret key metrics such as bandwidth, IOPS, slat, and clat to evaluate storage performance under high‑load and single‑request scenarios.

BenchmarkDisk PerformanceIOPS
0 likes · 7 min read
How to Use fio to Measure Disk IOPS, Throughput, and Latency on Ubuntu
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 22, 2025 · Cloud Native

How to Detect and Fix Kernel‑Level Latency Jitters in Kubernetes

In cloud‑native clusters, resource over‑commit and mixed deployments cause kernel‑level delays such as memory‑reclaim and CPU scheduling latency, which propagate to applications as jitter; this article explains how to visualize, diagnose, and mitigate these issues using the ACK‑SysOM exporter and related monitoring dashboards.

CPU schedulingLatencyMemory reclaim
0 likes · 9 min read
How to Detect and Fix Kernel‑Level Latency Jitters in Kubernetes
Raymond Ops
Raymond Ops
Aug 30, 2025 · Operations

Boosting Internet App Quality: How CDNs Work and Why They Matter

This article explains key internet application quality metrics such as service success rate, latency, and playback smoothness, examines common deployment models, and provides a detailed overview of Content Delivery Networks—including their architecture, operation processes, content injection methods, request scheduling, and distribution strategies—to help improve performance and reliability.

CDNContent DeliveryInternet Quality
0 likes · 11 min read
Boosting Internet App Quality: How CDNs Work and Why They Matter
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Aug 29, 2025 · Backend Development

How Apache Pulsar Achieves Sub‑millisecond Write Latency on NVMe

This article explains Apache Pulsar's architecture, client‑to‑broker and broker‑to‑bookie latency components, data storage model, write path, journal flush strategies, and presents detailed benchmark results showing sub‑millisecond write latency and up to 1.5 million TPS on NVMe storage.

Apache PulsarBackend DevelopmentLatency
0 likes · 10 min read
How Apache Pulsar Achieves Sub‑millisecond Write Latency on NVMe
MaGe Linux Operations
MaGe Linux Operations
Aug 9, 2025 · Operations

Boost Linux Network Bandwidth & Slash Latency with Proven Tuning Techniques

This article explains how operations engineers can dramatically improve Linux network performance by understanding key metrics and applying practical tuning methods—such as adjusting TCP windows, enabling TCP Fast Open, switching to BBR, optimizing kernel parameters, using high‑performance NICs, zero‑copy transfers, load balancing, and monitoring tools—to increase bandwidth and reduce latency for high‑concurrency and real‑time applications.

LatencyLinuxOperations
0 likes · 11 min read
Boost Linux Network Bandwidth & Slash Latency with Proven Tuning Techniques
ITPUB
ITPUB
May 25, 2025 · Databases

MySQL vs PostgreSQL: Which Database Handles High‑Throughput Writes Faster?

A detailed benchmark comparing MySQL 9.0 and PostgreSQL 17.0 evaluates ingestion latency, throughput, saturation, and connection‑pool behavior, revealing that PostgreSQL consistently delivers lower write latency, higher QPS, better CPU and memory efficiency, and superior disk storage efficiency across both write and read workloads.

Connection PoolDatabase IngestionLatency
0 likes · 9 min read
MySQL vs PostgreSQL: Which Database Handles High‑Throughput Writes Faster?
FunTester
FunTester
May 15, 2025 · Operations

Uncovering the Eight Hidden Pitfalls That Can Crash Your Distributed System

This article dissects the classic Eight Fallacies of Distributed Computing, explaining each mistaken assumption about network reliability, latency, bandwidth, security, topology, administration, cost, and homogeneity, and provides real‑world case studies and practical recommendations to help engineers design more resilient distributed systems.

Distributed SystemsFallaciesLatency
0 likes · 16 min read
Uncovering the Eight Hidden Pitfalls That Can Crash Your Distributed System
Architects' Tech Alliance
Architects' Tech Alliance
May 7, 2025 · Industry Insights

What Determines AI Chip Performance? Accuracy, Throughput, Latency & Energy Explained

This article provides a concise technical overview of AI chip key metrics—accuracy, throughput, latency, and energy consumption—explains their impact on hardware design, discusses critical design points such as MAC reduction and processing element optimization, and summarizes practical takeaways for evaluating AI accelerator solutions.

AI ChipEnergy ConsumptionLatency
0 likes · 9 min read
What Determines AI Chip Performance? Accuracy, Throughput, Latency & Energy Explained
Java Tech Enthusiast
Java Tech Enthusiast
Apr 9, 2025 · Databases

MySQL vs PostgreSQL Performance Benchmark: Latency, Throughput, and Saturation

The benchmark comparing MySQL 9.0 and PostgreSQL 17.0 shows PostgreSQL consistently achieving lower insertion latency, roughly double the throughput (≈19 k QPS vs 10 k QPS), using less CPU, memory and disk I/O, while both hit an 80‑connection pool limit and PostgreSQL maintains stable read latency as MySQL’s performance degrades beyond ~5.5 k QPS.

LatencyPostgreSQLThroughput
0 likes · 7 min read
MySQL vs PostgreSQL Performance Benchmark: Latency, Throughput, and Saturation
Bitu Technology
Bitu Technology
Mar 21, 2025 · Backend Development

Optimizing Redis Latency for an Online Feature Store: A Batch Query Case Study

This article describes how Tubi improved the latency of its Redis‑backed online feature store for machine‑learning inference by analyzing query patterns, measuring client‑side bottlenecks, and applying optimizations such as binary Avro encoding, MGET usage, virtual partitioning, and parallel deserialization to meet a sub‑10 ms SLA.

Feature StoreLatencyMLOps
0 likes · 9 min read
Optimizing Redis Latency for an Online Feature Store: A Batch Query Case Study
Architects' Tech Alliance
Architects' Tech Alliance
Mar 13, 2025 · Fundamentals

How Memory Bandwidth and Latency Shape CPU Performance

The article explains how CPU computation latency arises from memory speed, bandwidth, and access delays, detailing the relationships among memory, bandwidth, and latency, and examines key factors such as clock frequency, pipelining, parallelism, cache hit rate, and signal propagation distances that together determine overall system performance.

CPULatencyMemory Bandwidth
0 likes · 9 min read
How Memory Bandwidth and Latency Shape CPU Performance
Architects' Tech Alliance
Architects' Tech Alliance
Mar 12, 2025 · Industry Insights

What Do AI Chip Metrics Really Reveal? A Deep Dive into Latency, Throughput, and Energy

This article provides a technical breakdown of AI chip key performance indicators—accuracy, throughput, latency, and energy consumption—explains how MAC and processing‑element design affect these metrics, and outlines design strategies for maximizing throughput while minimizing latency and power use.

AI chipsIndustry analysisLatency
0 likes · 8 min read
What Do AI Chip Metrics Really Reveal? A Deep Dive into Latency, Throughput, and Energy
Cognitive Technology Team
Cognitive Technology Team
Feb 2, 2025 · Fundamentals

Common Misconceptions in Distributed System Design and Their Solutions

Designing distributed systems often falls prey to misconceptions such as assuming reliable networks, zero latency, unlimited bandwidth, inherent security, static topology, zero transmission cost, and full autonomy, but applying retries, idempotency, message queues, encryption, dynamic discovery, caching, and time protocols can mitigate these issues.

ConsensusDistributed SystemsLatency
0 likes · 5 min read
Common Misconceptions in Distributed System Design and Their Solutions
Architect
Architect
Jan 9, 2025 · Industry Insights

How to Ensure Immediate Reads After Writes in Multi-Active Architectures

This article analyzes the "write‑after‑immediate‑read" challenge in multi‑active disaster‑recovery setups, breaks down solution directions, presents a three‑city five‑center case study, and outlines a four‑step model—distinguish scenarios, mark written data, assess latency, and enable near‑by access—to achieve consistent, low‑latency reads.

BackendData ConsistencyDistributed Systems
0 likes · 15 min read
How to Ensure Immediate Reads After Writes in Multi-Active Architectures
dbaplus Community
dbaplus Community
Jan 1, 2025 · Backend Development

Mastering Multi-Active Data Architecture: Reducing Write Latency and Ensuring High Availability

This article examines the challenges of building multi‑active distributed systems, focusing on the data layer’s role in high availability, write‑latency, sharding, isolation, replication strategies, and routing decisions, and provides concrete architectural patterns and practical guidelines for robust backend design.

Distributed SystemsLatencydata replication
0 likes · 23 min read
Mastering Multi-Active Data Architecture: Reducing Write Latency and Ensuring High Availability
Efficient Ops
Efficient Ops
Dec 24, 2024 · Databases

Why Is My Redis Slowing Down? 10 Common Causes and How to Fix Them

Redis can appear slow for many reasons—including baseline latency differences, high‑complexity commands, big keys, concentrated expirations, memory limits, fork overhead, huge pages, CPU binding, swap usage, memory fragmentation, and AOF configuration—so this guide explains each cause, how to diagnose it, and practical optimization steps.

Database OptimizationLatencyMemory Management
0 likes · 34 min read
Why Is My Redis Slowing Down? 10 Common Causes and How to Fix Them
Fighter's World
Fighter's World
Dec 7, 2024 · Artificial Intelligence

Does Scaling Law Still Hold? Analyzing OpenAI’s 12‑Day Mini Releases and the Future of GPT‑5

The article examines OpenAI’s 12‑day mini‑series, the emergence of o1 and Reinforcement Fine‑Tuning, and uses Epoch AI’s 2024 report to evaluate four critical constraints—power, chip capacity, data scarcity, and latency—that determine whether AI scaling laws can sustain the compute needed for a GPT‑5‑scale model by 2030.

AI scalingLatencychip manufacturing
0 likes · 11 min read
Does Scaling Law Still Hold? Analyzing OpenAI’s 12‑Day Mini Releases and the Future of GPT‑5
Liangxu Linux
Liangxu Linux
Nov 30, 2024 · Operations

Why a New PC Can't Access the Internet: Step‑by‑Step DNS Troubleshooting

A client’s newly purchased computer could connect to the corporate network but could not reach the internet; the guide walks through checking the link light, IP configuration, ping tests, identifying a faulty DNS server, switching to public DNS services, and measuring latency to reveal why foreign DNS isn’t always faster.

CloudflareDNSIP Configuration
0 likes · 5 min read
Why a New PC Can't Access the Internet: Step‑by‑Step DNS Troubleshooting
Tencent Cloud Developer
Tencent Cloud Developer
Nov 27, 2024 · Databases

Analyzing the Write‑After‑Read Consistency Challenge in Multi‑Active Distributed Architectures

The article examines the write‑after‑read consistency problem in multi‑active cross‑region systems, compares single‑write‑single‑read routing, quorum‑based multi‑write‑multi‑read, and read‑while‑copy methods, explains why primary‑secondary replication is preferred, and proposes a four‑step framework—scenario flagging, data marking, latency evaluation, and near‑by asynchronous replication—to meet WAR requirements efficiently.

ConsistencyDatabase ReplicationDistributed Systems
0 likes · 12 min read
Analyzing the Write‑After‑Read Consistency Challenge in Multi‑Active Distributed Architectures
Su San Talks Tech
Su San Talks Tech
Oct 21, 2024 · Databases

How to Diagnose and Fix MySQL Master‑Slave Replication Lag

This article explains why MySQL master‑slave replication can become delayed in high‑traffic order systems, describes the underlying binlog synchronization mechanism, and provides practical steps—including network upgrades, server tuning, transaction reduction, version updates, and slave count limits—to eliminate the lag.

LatencyMaster‑SlaveReplication
0 likes · 7 min read
How to Diagnose and Fix MySQL Master‑Slave Replication Lag
Liangxu Linux
Liangxu Linux
Aug 14, 2024 · Operations

How to Use iPerf for Accurate Network Bandwidth and Latency Testing

This guide walks you through preparing the environment, installing iPerf on Windows or Linux servers, running it in server and client modes, testing TCP and UDP performance, measuring latency with ping, and checking all available options to ensure reliable network bandwidth and latency measurements.

LatencyLinuxTCP
0 likes · 5 min read
How to Use iPerf for Accurate Network Bandwidth and Latency Testing
Code Mala Tang
Code Mala Tang
Jun 20, 2024 · Frontend Development

Essential Latency Numbers Every Web Developer Must Know

This article explores critical latency metrics—from nanosecond-level CPU operations to intercontinental network delays—illustrating how each contributes to web performance, and provides real‑world measurements for various connection types, data processing tasks, and user‑perceived response times to help developers optimize user experience.

LatencyWeb Performancefrontend
0 likes · 6 min read
Essential Latency Numbers Every Web Developer Must Know
Liangxu Linux
Liangxu Linux
Jun 17, 2024 · Fundamentals

Understanding the Four Key Network Performance Metrics: Bandwidth, Latency, Jitter, and Packet Loss

The article explains the four essential network performance indicators—bandwidth, latency, jitter, and packet loss—defining each term, describing how they are measured, illustrating their impact with examples, and clarifying related concepts such as bits versus bytes, processing, queueing, transmission, and propagation delays.

LatencyPacket LossThroughput
0 likes · 7 min read
Understanding the Four Key Network Performance Metrics: Bandwidth, Latency, Jitter, and Packet Loss
Practical DevOps Architecture
Practical DevOps Architecture
Jun 4, 2024 · Databases

Causes and Solutions for MySQL Master‑Slave Replication Lag

This article explains why MySQL master‑slave replication can become delayed—including single‑threaded binlog processing, high load, network latency, and configuration issues—and provides detailed troubleshooting steps and practical solutions such as semi‑synchronous replication, hardware upgrades, and configuration tuning.

Database ArchitectureLatencySemi-synchronous
0 likes · 8 min read
Causes and Solutions for MySQL Master‑Slave Replication Lag
Java Tech Enthusiast
Java Tech Enthusiast
May 17, 2024 · Fundamentals

Understanding Computer Time Units, CPU Cycles and Performance Latency

The article explains that software performance is measured in milliseconds to nanoseconds, describes core hardware components—CPU, caches, and DRAM—shows how cache hierarchy speeds differ, defines a clock cycle as the basic time unit, and provides typical latency figures for operations ranging from a single CPU cycle to a full system reboot.

CPU cyclesLatencyMemory Hierarchy
0 likes · 7 min read
Understanding Computer Time Units, CPU Cycles and Performance Latency
Cognitive Technology Team
Cognitive Technology Team
May 15, 2024 · Fundamentals

The Fallacies of Distributed Systems: Understanding Common Network Assumptions

This article revisits the classic “Fallacies of Distributed Systems” introduced by Peter Deutsch, explaining why assumptions such as reliable networks, zero latency, infinite bandwidth, secure and homogeneous communication are false, and offering practical strategies like retries, caching, batching, and security‑first design to build robust distributed applications.

Distributed SystemsFallaciesLatency
0 likes · 4 min read
The Fallacies of Distributed Systems: Understanding Common Network Assumptions
DataFunSummit
DataFunSummit
Apr 10, 2024 · Artificial Intelligence

Large Language Model Inference Overview and Performance Optimizations

This article presents a comprehensive overview of large language model inference, describing the prefill and decoding stages, key performance metrics such as throughput, latency and QPS, and detailing a series of system-level optimizations—including pipeline parallelism, dynamic batching, KV‑cache quantization, and hardware considerations—to significantly improve inference efficiency on modern GPUs.

GPUInferenceLatency
0 likes · 23 min read
Large Language Model Inference Overview and Performance Optimizations
Linux Cloud Computing Practice
Linux Cloud Computing Practice
Mar 27, 2024 · Cloud Computing

Cloud vs Edge Computing: Key Differences, Benefits, and Use Cases

Cloud computing centralizes resources in large data centers, while edge computing processes data near its source, offering lower latency and better suitability for IoT; this article compares their definitions, characteristics, advantages, and real‑world applications, highlighting how they can complement each other.

Edge ComputingIoTLatency
0 likes · 10 min read
Cloud vs Edge Computing: Key Differences, Benefits, and Use Cases
Liangxu Linux
Liangxu Linux
Mar 21, 2024 · Frontend Development

How to Understand and Reduce Web Latency: Practical Tips for Faster Apps

This article explains what latency is, how it occurs in web interactions, the main factors that contribute to it, common tools for measuring it, and a set of practical optimization techniques such as HTTP/2, CDN usage, caching, and efficient coding practices.

LatencyWeb Performancefrontend
0 likes · 8 min read
How to Understand and Reduce Web Latency: Practical Tips for Faster Apps
DataFunTalk
DataFunTalk
Feb 19, 2024 · Artificial Intelligence

Large Language Model Inference Overview and Performance Optimizations

This article presents a comprehensive overview of large language model inference, detailing the prefill and decoding stages, key performance metrics such as throughput, latency and QPS, and a series of system-level optimizations—including pipeline parallelism, dynamic batching, specialized attention kernels, virtual memory allocation, KV‑cache quantization, and mixed‑precision strategies—to improve GPU utilization and overall inference efficiency.

GPULLMLatency
0 likes · 24 min read
Large Language Model Inference Overview and Performance Optimizations
Sanyou's Java Diary
Sanyou's Java Diary
Dec 14, 2023 · Operations

Why Is My Redis Slowing Down? A Complete Diagnosis and Optimization Guide

This article explains how to determine whether Redis is truly experiencing latency spikes, outlines a step‑by‑step benchmarking process, identifies common causes such as high‑complexity commands, big keys, memory limits, fork overhead, AOF settings, CPU binding, swap usage, and provides concrete configuration and code examples to resolve each issue.

AOFBigKeyLatency
0 likes · 39 min read
Why Is My Redis Slowing Down? A Complete Diagnosis and Optimization Guide
ITPUB
ITPUB
Oct 16, 2023 · Databases

Why Did Our Redis Calls Take 1.2 Seconds? A Deep Dive into JedisPool Tuning

A load test revealed Redis latency soaring to 1.2 seconds, and a systematic investigation traced the root cause to mis‑configured JedisPool parameters, leading to excessive connection wait times and churn, which were fixed by aligning pool settings and enabling proper eviction, dramatically improving latency.

Connection PoolJedisPoolLatency
0 likes · 8 min read
Why Did Our Redis Calls Take 1.2 Seconds? A Deep Dive into JedisPool Tuning
Tencent Cloud Developer
Tencent Cloud Developer
Sep 28, 2023 · Databases

Redis Service Latency Diagnosis and Optimization – A Systematic Approach

The article outlines a systematic three‑step workflow—general service diagnostics, Redis‑specific checks, and reproducible load testing—to pinpoint a hot‑key‑driven CPU bottleneck, then evaluates mitigation options such as read‑write separation, pipelining, and an application‑level cache, ultimately showing the cache’s effectiveness in cutting latency and CPU usage.

CacheLatencyperformance
0 likes · 21 min read
Redis Service Latency Diagnosis and Optimization – A Systematic Approach
Architect's Guide
Architect's Guide
Aug 12, 2023 · Databases

Why Redis Becomes Slow and How to Optimize It

This article explains the common reasons why Redis performance degrades—such as network latency, high‑complexity commands, big keys, concentrated expirations, memory limits, fork overhead, huge pages, AOF settings, CPU binding, swap usage, and memory fragmentation—and provides detailed optimization and troubleshooting steps to restore low latency.

LatencyMemorydatabase
0 likes · 34 min read
Why Redis Becomes Slow and How to Optimize It
Java Architecture Diary
Java Architecture Diary
Jul 11, 2023 · Big Data

Redpanda vs Apache Kafka with KRaft: Why Redpanda Is Up to 10× Faster

This article presents a detailed benchmark comparing Redpanda 23.1 and Apache Kafka 3.4.0 (with and without KRaft) across multiple AWS instance types, showing how Redpanda consistently delivers higher throughput and dramatically lower end‑to‑end latency, often outperforming Kafka by 4‑20× even with extra hardware.

Apache KafkaBig DataKRaft
0 likes · 12 min read
Redpanda vs Apache Kafka with KRaft: Why Redpanda Is Up to 10× Faster
Top Architect
Top Architect
Jun 27, 2023 · Databases

Redis Performance Degradation: Root Causes and Optimization Techniques

This article explains how to benchmark Redis latency, identify common reasons for slowdowns such as high‑complexity commands, big keys, concentrated expirations, memory limits, fork overhead, swap usage, and CPU binding, and provides detailed configuration and operational steps to monitor and resolve each issue.

AOFLatencyMemory
0 likes · 34 min read
Redis Performance Degradation: Root Causes and Optimization Techniques
Architect
Architect
Jun 26, 2023 · Databases

Why Redis Becomes Slow and How to Optimize Its Performance

The article explains common reasons why Redis latency increases, such as intrinsic latency, high‑complexity commands, big keys, expiration spikes, memory limits, fork overhead, huge pages, AOF configuration, CPU binding, swap usage and memory fragmentation, and provides detailed troubleshooting steps and optimization techniques.

LatencyMemoryoptimization
0 likes · 33 min read
Why Redis Becomes Slow and How to Optimize Its Performance
JD Cloud Developers
JD Cloud Developers
May 11, 2023 · Backend Development

How MOSN’s New Latency‑Based Load Balancing Cuts Tail Latency

This article explains MOSN v1.5.0's latency‑based load‑balancing algorithm, analyzes the sources of latency in distributed systems, describes mitigation techniques such as PeakEWMA and P2C, and presents a realistic simulation that shows the algorithm’s superiority over Round Robin and Least Request.

LatencyMOSNP2C
0 likes · 12 min read
How MOSN’s New Latency‑Based Load Balancing Cuts Tail Latency
DeWu Technology
DeWu Technology
Apr 28, 2023 · Operations

Analysis of Live Streaming Latency Causes and Optimization Strategies

The article examines why live‑streaming latency—averaging about five seconds on the studied platform versus three seconds on top e‑commerce services—remains high, pinpointing encoder, transcoding, CDN GOP‑cache, and player buffering as key contributors, and proposes optimizations such as shrinking GOP size, minimizing CDN cache tiers, tightening player buffers, and adopting ultra‑low‑latency protocols like RTS, RTM, or QUIC to achieve sub‑second delays.

CDNGOPLatency
0 likes · 16 min read
Analysis of Live Streaming Latency Causes and Optimization Strategies
Architects' Tech Alliance
Architects' Tech Alliance
Dec 27, 2022 · Fundamentals

Understanding Time and Space Trade‑offs in Software Performance Optimization

The article explains how software performance is limited by the time spent in CPU, memory, storage and network operations and by the space consumed by data structures, JVM object headers, thread stacks and protocol overhead, illustrating the need for careful time‑space trade‑offs to achieve efficient optimization.

LatencyMemorySoftware Engineering
0 likes · 9 min read
Understanding Time and Space Trade‑offs in Software Performance Optimization
Sohu Tech Products
Sohu Tech Products
Dec 7, 2022 · Databases

How to Diagnose and Resolve Redis Performance Issues

This article explains how to identify Redis latency problems, measure baseline performance, monitor slow commands, and address common causes such as network RTT, forked RDB snapshots, transparent huge pages, swap usage, AOF configuration, key expiration bursts, and big keys, providing practical solutions and a checklist for remediation.

Latencydatabaseoptimization
0 likes · 18 min read
How to Diagnose and Resolve Redis Performance Issues
政采云技术
政采云技术
Nov 24, 2022 · Databases

Is Redis Really Slowing Down? A Comprehensive Diagnosis and Optimization Guide

This article explains how to determine whether Redis is truly experiencing latency issues, outlines benchmark testing methods, identifies common causes such as network problems, high‑complexity commands, big keys, slow logs, memory limits, fork overhead, AOF configuration, swap usage, fragmentation, and provides practical troubleshooting and optimization steps.

Latencydatabaseredis
0 likes · 26 min read
Is Redis Really Slowing Down? A Comprehensive Diagnosis and Optimization Guide
Code Ape Tech Column
Code Ape Tech Column
Nov 9, 2022 · Operations

System Performance Optimization: Definitions, Testing, Bottleneck Identification, and Common Strategies

This article explains system performance concepts such as throughput and latency, describes how to design and run performance tests, outlines methods for locating bottlenecks at the OS, code, network, and database levels, and presents practical optimization techniques ranging from algorithmic improvements to I/O and TCP tuning.

Database OptimizationLatencyNetwork Tuning
0 likes · 35 min read
System Performance Optimization: Definitions, Testing, Bottleneck Identification, and Common Strategies
ITPUB
ITPUB
Nov 1, 2022 · Databases

Why RocksDB 7.5.3 Beats 6.2.9: Deep Dive into Performance Optimizations

The new RocksDB 7.5.3 release dramatically reduces write‑stall time, lock contention and CPU usage while improving OPS and latency compared with 6.2.9.x, as shown by detailed memtier benchmark tests on a 4‑CPU, 32 GiB VM with NVMe storage.

BenchmarkLatencyRocksDB
0 likes · 11 min read
Why RocksDB 7.5.3 Beats 6.2.9: Deep Dive into Performance Optimizations
Architect
Architect
Sep 15, 2022 · Big Data

Meituan's Kafka Optimizations: Challenges, Latency Improvements, and Large‑Scale Cluster Management

This article describes how Meituan's massive Kafka deployment—over 15,000 machines and petabytes of daily traffic—faces scalability challenges such as slow nodes, load imbalance, and resource contention, and details the multi‑layer optimizations applied at the application, system, and cluster‑management levels to reduce read/write latency and improve reliability.

KafkaLatencybig-data
0 likes · 22 min read
Meituan's Kafka Optimizations: Challenges, Latency Improvements, and Large‑Scale Cluster Management
MaGe Linux Operations
MaGe Linux Operations
Jul 10, 2022 · Fundamentals

Mastering System Performance: Key Concepts, Metrics, and Optimization Strategies

System performance examines the interplay of hardware and software components, focusing on latency, throughput, and cost reduction, while exploring essential concepts, measurement techniques, trade‑offs, optimization layers, ROI considerations, and practical guidelines for effective performance analysis across diverse computing environments.

LatencySystemsThroughput
0 likes · 14 min read
Mastering System Performance: Key Concepts, Metrics, and Optimization Strategies
JavaEdge
JavaEdge
Jun 3, 2022 · Operations

How to Scale Systems: From Load Metrics to Architecture Strategies

This article explains how to describe current system load, choose appropriate load parameters, analyze Twitter's scaling challenges, compare relational and push‑based timeline designs, clarify latency versus response time, emphasize percentile monitoring, and evaluate vertical versus horizontal scaling and hybrid approaches for handling increasing traffic.

LatencyLoad TestingScalability
0 likes · 15 min read
How to Scale Systems: From Load Metrics to Architecture Strategies
MaGe Linux Operations
MaGe Linux Operations
May 28, 2022 · Fundamentals

The Surprising Origin of Ping: From 1983 Code to Modern Gaming Lag

This article traces the history of the ping utility—from its spontaneous creation by Michael Muuss in 1983, through its adoption across Unix, Windows, macOS and Linux, to its role in measuring network latency for gamers and the technical principles behind ICMP echo requests.

ICMPLatencyNetwork Diagnostics
0 likes · 11 min read
The Surprising Origin of Ping: From 1983 Code to Modern Gaming Lag
IT Services Circle
IT Services Circle
May 10, 2022 · Fundamentals

The Drawbacks of TCP: Upgrade Difficulty, Connection Latency, Head‑of‑Line Blocking, and Migration Overhead

This article examines the inherent shortcomings of the TCP protocol, including the difficulty of upgrading the stack, the latency introduced by its three‑way handshake and TLS, head‑of‑line blocking caused by packet loss, and the high cost of connection migration when network conditions change.

LatencyNetwork ProtocolsQUIC
0 likes · 10 min read
The Drawbacks of TCP: Upgrade Difficulty, Connection Latency, Head‑of‑Line Blocking, and Migration Overhead
Architect's Journey
Architect's Journey
Apr 13, 2022 · Fundamentals

Is Classifying Distributed Systems as CP or AP a False Dichotomy?

The article revisits the CAP theorem, explains linearizable consistency and strict availability, demonstrates with concrete data‑center examples why the CP/AP split is often misleading, and argues that latency concerns and broader failure modes make the binary classification of distributed systems impractical.

AvailabilityCAP theoremCP vs AP
0 likes · 10 min read
Is Classifying Distributed Systems as CP or AP a False Dichotomy?
Sanyou's Java Diary
Sanyou's Java Diary
Mar 14, 2022 · Databases

Why Is My Redis Slowing Down? Comprehensive Diagnosis & Optimization Guide

This article provides a thorough step‑by‑step guide to identify why Redis latency spikes, covering baseline performance testing, slowlog analysis, big‑key handling, expiration patterns, memory limits, fork overhead, huge pages, AOF settings, CPU binding, swap usage, memory fragmentation, network bandwidth, and practical optimization techniques.

Latencydatabaseoptimization
0 likes · 39 min read
Why Is My Redis Slowing Down? Comprehensive Diagnosis & Optimization Guide
IT Services Circle
IT Services Circle
Feb 24, 2022 · Databases

Diagnosing and Solving Redis Performance Issues

This article explains how to detect Redis latency problems, measure baseline performance, monitor slow commands, and address common causes such as network round‑trip delays, fork‑generated RDB snapshots, transparent huge pages, swap usage, AOF settings, key expiration, and big‑key handling, providing practical troubleshooting steps and solutions.

Latencydatabasemonitoring
0 likes · 20 min read
Diagnosing and Solving Redis Performance Issues
Architects' Tech Alliance
Architects' Tech Alliance
Dec 15, 2021 · Fundamentals

Understanding the Metaverse from a Network Access Perspective and Its Implications for Network Technology

The article examines the concept of the Metaverse as an evolution of internet access terminals, analyzes how VR/AR headsets reshape communication‑type devices, and discusses the resulting new security, ultra‑low‑latency, and deterministic quality‑of‑service challenges for network infrastructure.

LatencyMetaverseQoS
0 likes · 16 min read
Understanding the Metaverse from a Network Access Perspective and Its Implications for Network Technology
Programmer DD
Programmer DD
Dec 2, 2021 · Operations

Mastering QoS: How to Ensure Reliable Network Performance for Voice, Video, and Data

Quality of Service (QoS) allocates limited bandwidth among diverse network traffic, guaranteeing end‑to‑end performance for real‑time applications like voice and video, while explaining key metrics, service models, and practical deployment scenarios such as enterprise office, streaming, and hierarchical QoS solutions.

LatencyQoSTraffic Shaping
0 likes · 15 min read
Mastering QoS: How to Ensure Reliable Network Performance for Voice, Video, and Data
Tencent Cloud Developer
Tencent Cloud Developer
Nov 30, 2021 · Industry Insights

Why Edge Computing Is the Next AIoT Explosion Point

This article examines how edge computing, combined with IoT and cloud‑native technologies, addresses latency, bandwidth, connectivity and security challenges of the emerging AIoT era, detailing Tencent Cloud’s IECP platform, its components, and real‑world smart‑water and smart‑factory deployments.

AIoTCloud NativeEdge Computing
0 likes · 14 min read
Why Edge Computing Is the Next AIoT Explosion Point
Cloud Native Technology Community
Cloud Native Technology Community
Nov 25, 2021 · Databases

Why Is My Redis Slowing Down? A Complete Troubleshooting Guide

This article provides a systematic, step‑by‑step methodology for diagnosing Redis latency spikes, covering baseline performance testing, slow‑log analysis, high‑complexity commands, big‑key handling, expiration patterns, memory limits, fork overhead, huge‑page settings, AOF configurations, CPU binding, swap usage, memory fragmentation, network saturation, and practical monitoring tips.

Latencydatabasemonitoring
0 likes · 42 min read
Why Is My Redis Slowing Down? A Complete Troubleshooting Guide
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Nov 24, 2021 · Cloud Computing

How Proactive Link Monitoring Transforms Cloud Network Reliability

This article explains Huawei Cloud Stack's proactive link monitoring system, detailing its point‑line‑plane architecture, golden metrics of packet loss and latency, detection techniques, system components, and key innovations such as strategy optimization, alarm aggregation, and visualized performance dashboards for cloud data‑center networks.

Data centerLatencyNetwork Reliability
0 likes · 13 min read
How Proactive Link Monitoring Transforms Cloud Network Reliability
Open Source Linux
Open Source Linux
Nov 17, 2021 · Fundamentals

Mastering QoS: Ensure Reliable Voice, Video, and Data Performance

This article explains the concept of Quality of Service (QoS), its importance in managing limited bandwidth, key metrics such as bandwidth, latency, jitter, and packet loss, various QoS models like Best‑Effort, IntServ, DiffServ, and practical application scenarios for enterprise networks.

LatencyQoSbandwidth management
0 likes · 15 min read
Mastering QoS: Ensure Reliable Voice, Video, and Data Performance
Architects' Tech Alliance
Architects' Tech Alliance
Nov 7, 2021 · Fundamentals

Understanding QoS Technology: Principles, Metrics, Service Models, and Enterprise Applications

This article explains the origins of QoS, its key performance metrics such as bandwidth, latency, jitter and packet loss, compares the Best‑Effort, IntServ and DiffServ service models, and describes how DiffServ components are deployed in enterprise networks to manage traffic and improve service quality.

LatencyNetworkingPacket Loss
0 likes · 15 min read
Understanding QoS Technology: Principles, Metrics, Service Models, and Enterprise Applications
Liangxu Linux
Liangxu Linux
Oct 10, 2021 · Operations

Mastering MTR: Install, Run, and Decode Network Paths

This guide explains what MTR (My Traceroute) is, how to install it on Windows, Linux, macOS and Android, demonstrates basic commands and key options such as -r, -s, -c, and -n, and provides detailed instructions for interpreting loss, latency and other columns to troubleshoot network routing issues.

LatencyLinuxNetwork Diagnostics
0 likes · 10 min read
Mastering MTR: Install, Run, and Decode Network Paths
Java Architect Essentials
Java Architect Essentials
Aug 30, 2021 · Databases

How to Monitor and Optimize Redis Performance

This article explains how to use Redis INFO commands to track memory usage, command processing, latency, key eviction and fragmentation, and provides practical tips such as adjusting maxmemory, using hash structures, pipelines, and slowlog to diagnose and improve Redis performance.

LatencyMemoryOps
0 likes · 23 min read
How to Monitor and Optimize Redis Performance
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 13, 2021 · Game Development

Improving VR Video Clarity: PPD, Tile Encoding, and Future Directions

VR video clarity suffers because the required pixels‑per‑degree far exceed what 4K or 8K spherical streams can deliver, but tile‑based encoding that decodes only the viewport, combined with low motion‑to‑photon latency, distortion control, advanced codecs and AI‑driven projection, promises sharper, lower‑bitrate 6DoF experiences.

8KLatencyPPD
0 likes · 13 min read
Improving VR Video Clarity: PPD, Tile Encoding, and Future Directions
Open Source Linux
Open Source Linux
Apr 30, 2021 · Backend Development

How to Cut Nginx HTTPS Latency by 30% with TLS Tweaks

This article explains why optimizing Nginx HTTPS latency matters for instant search, describes how TLS handshakes add round‑trip delays, and provides step‑by‑step Nginx TLS configuration changes—such as enabling HTTP/2, adjusting ciphers, enabling OCSP stapling, tuning buffer sizes and session cache—that together reduced request latency by about 30% in a real‑world search service.

BackendHTTPSLatency
0 likes · 13 min read
How to Cut Nginx HTTPS Latency by 30% with TLS Tweaks
Architect's Tech Stack
Architect's Tech Stack
Apr 29, 2021 · Databases

Analyzing Redis Latency Issues and How to Diagnose Them

This article explains common causes of Redis latency spikes—including slow commands, large keys, concentrated expirations, memory limits, fork overhead, CPU binding, AOF settings, swap usage, and network saturation—and provides step‑by‑step diagnostic commands and practical mitigation techniques.

Database operationsLatencyMemory Management
0 likes · 18 min read
Analyzing Redis Latency Issues and How to Diagnose Them
dbaplus Community
dbaplus Community
Mar 8, 2021 · Databases

Why Is My Redis Slowing Down? A Complete Guide to Diagnose and Fix Latency Issues

This comprehensive article walks you through the entire process of identifying why Redis latency spikes, from confirming the slowdown and measuring baseline performance to analyzing slow logs, big keys, expiration patterns, memory limits, fork overhead, AOF settings, CPU binding, swap usage, memory fragmentation, network bandwidth, and finally applying practical optimization techniques.

Latencyredistroubleshooting
0 likes · 40 min read
Why Is My Redis Slowing Down? A Complete Guide to Diagnose and Fix Latency Issues