Tagged articles
29 articles
Page 1 of 1
Code Wrench
Code Wrench
Feb 6, 2026 · Fundamentals

Unlock 95% CPU Utilization in Go: 7 Scheduler Pitfalls and Real‑World Fixes

This article examines why Go programs often suffer from low CPU usage, explores seven common scheduler pitfalls through real production cases, and provides concrete techniques—such as separating I/O from CPU work, tuning GOMAXPROCS, and using worker pools—to boost utilization from 30% to 95% and dramatically improve latency.

Schedulercpu-utilizationgomp
0 likes · 27 min read
Unlock 95% CPU Utilization in Go: 7 Scheduler Pitfalls and Real‑World Fixes
Youzan Coder
Youzan Coder
Jul 18, 2025 · Cloud Native

How Mixed Workloads Boost Kubernetes CPU Utilization by Over 40%

This article explains how Youzan transformed its Kubernetes clusters from static over‑commit scheduling to load‑balanced mixed workloads using Koordinator and the Longxi kernel, achieving higher CPU utilization, lower costs, and better resource management for both online and offline services.

Big DataCloud NativeKoordinator
0 likes · 10 min read
How Mixed Workloads Boost Kubernetes CPU Utilization by Over 40%
Liangxu Linux
Liangxu Linux
Nov 26, 2024 · Fundamentals

What Does the CPU Do When an RTOS Has No Ready Tasks?

When an RTOS has no runnable tasks and all tasks are delayed or waiting for events, the CPU does not stay idle but executes the system's idle task or thread, which may simply loop forever unless the developer adds custom work.

FreeRTOSIdle TaskRTOS
0 likes · 4 min read
What Does the CPU Do When an RTOS Has No Ready Tasks?
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Nov 27, 2023 · Cloud Native

Mixed-Workload Scheduling and Resource Utilization Optimization in Xiaohongshu's Cloud-Native Platform

Xiaohongshu’s cloud‑native platform adopted a four‑stage mixed‑workload scheduling strategy—reusing idle nodes, whole‑machine time‑sharing, normal mixed pools, and a unified scheduler (Tusker) that coordinates CPU, GPU and memory across Kubernetes and YARN—boosting average cluster CPU utilization from under 20 % to over 45 % and delivering millions of low‑cost core‑hours while preserving QoS for latency‑sensitive, mid, and batch jobs.

Big DataKubernetesQoS
0 likes · 19 min read
Mixed-Workload Scheduling and Resource Utilization Optimization in Xiaohongshu's Cloud-Native Platform
JD Cloud Developers
JD Cloud Developers
Oct 11, 2023 · Backend Development

How Many Threads Should You Run? Real‑World Tests Reveal the Truth

This article debunks common thread‑count myths by running Java CPU‑bound and I/O‑bound experiments on a 6‑core/12‑thread Ryzen, showing how core count, context‑switching overhead, and I/O wait time affect CPU utilization and guiding practical thread‑pool sizing.

Java concurrencyPerformance Testingcpu-utilization
0 likes · 12 min read
How Many Threads Should You Run? Real‑World Tests Reveal the Truth
Top Architecture Tech Stack
Top Architecture Tech Stack
Sep 21, 2023 · Backend Development

Determining the Optimal Thread Pool Size Based on CPU Utilization

This article explains how to size a thread pool by understanding CPU core limits, I/O wait effects, empirical testing on a multi‑core machine, and a practical formula, while emphasizing that real‑world workloads require iterative performance testing to find the best thread count.

JavaPerformance Testingcpu-utilization
0 likes · 12 min read
Determining the Optimal Thread Pool Size Based on CPU Utilization
JD Retail Technology
JD Retail Technology
Jun 21, 2023 · Backend Development

Remote Aware Load Balance (RALB) Algorithm for Search Recommendation System: Design, Implementation, and Performance Evaluation

This article presents the design and evaluation of the Remote Aware Load Balance (RALB) algorithm applied to JD’s search‑recommendation architecture, describing its CPU‑centric load‑balancing principles, implementation details, functional verification, throughput and boundary testing, and the observed improvements in CPU utilization and overall system performance.

Performance Testingcpu-utilizationload balancing
0 likes · 13 min read
Remote Aware Load Balance (RALB) Algorithm for Search Recommendation System: Design, Implementation, and Performance Evaluation
Refining Core Development Skills
Refining Core Development Skills
Apr 4, 2023 · Cloud Native

Understanding Container CPU Utilization: Accurate Measurement Methods and the Missing Nice/IRQ/SoftIRQ Metrics

This article explains how to correctly obtain CPU utilization inside containers, compares host and container metrics, describes the use of lxcfs and cgroup files (including cgroup V1/V2) for accurate measurement, and clarifies why container statistics omit nice, irq, and softirq fields.

Cloud NativeContainercgroup
0 likes · 16 min read
Understanding Container CPU Utilization: Accurate Measurement Methods and the Missing Nice/IRQ/SoftIRQ Metrics
dbaplus Community
dbaplus Community
Jan 10, 2022 · Operations

Why Is Linux Load High? Decoding Load Average, CPU Usage & Process States

This article explains Linux process states, how the kernel calculates load average, the relationship between load and CPU utilization, and provides a systematic approach with tools and commands to identify resource bottlenecks, differentiate high‑load/high‑CPU scenarios from high‑load/low‑CPU cases, and pinpoint problematic processes and threads.

LinuxLoad AveragePerformance Monitoring
0 likes · 11 min read
Why Is Linux Load High? Decoding Load Average, CPU Usage & Process States
Baidu Tech Salon
Baidu Tech Salon
Dec 31, 2021 · Industry Insights

How Baidu Boosted CPU Utilization by Up to 80% with Offline Mixed‑Tenant Scheduling

This article analyzes Baidu's offline mixed‑tenant technology that combines online and offline workloads on the same physical servers, detailing the resource‑usage problems, dynamic resource views, priority schemes, isolation mechanisms, high‑performance scheduling, and future directions for cloud‑native clusters.

Cloud NativeKubernetescpu-utilization
0 likes · 18 min read
How Baidu Boosted CPU Utilization by Up to 80% with Offline Mixed‑Tenant Scheduling
Qu Tech
Qu Tech
Jan 25, 2021 · Operations

How to Diagnose and Fix High Load with Low CPU Utilization in Nginx

This article walks through Linux process states, fork/exec creation, CPU vs load metrics, network I/O models, and a real‑world Nginx high‑load case, showing how strace, accept_mutex, and SO_REUSEPORT can resolve the issue.

Linux performanceNginxSO_REUSEPORT
0 likes · 12 min read
How to Diagnose and Fix High Load with Low CPU Utilization in Nginx
Programmer DD
Programmer DD
Nov 30, 2020 · Backend Development

How Many Threads Are Optimal? Mastering Thread Pool Sizing for Maximum Performance

This article explains why multithreading improves program performance, distinguishes latency reduction from throughput increase, and provides practical formulas to determine the ideal number of threads for both I/O‑bound and CPU‑bound workloads, helping you fully utilize hardware resources.

I/O BoundPerformance Optimizationconcurrency
0 likes · 4 min read
How Many Threads Are Optimal? Mastering Thread Pool Sizing for Maximum Performance
dbaplus Community
dbaplus Community
Jul 2, 2020 · Backend Development

Why High Concurrency Is Really About Squeezing CPU Performance

The article explains that high concurrency in distributed systems is measured by QPS, but its essence lies in effectively utilizing CPU resources, discusses control‑variable analysis of the HTTP request flow, compares PHP‑Swoole and Java‑Netty implementations, and shows how coroutine‑based designs can dramatically boost throughput when I/O blocking is present.

Backend PerformanceNettySwoole
0 likes · 15 min read
Why High Concurrency Is Really About Squeezing CPU Performance
Xiao Lou's Tech Notes
Xiao Lou's Tech Notes
Apr 11, 2020 · Backend Development

Why Sentinel Misses Docker CPU Usage and How to Fix It

This article explains Sentinel's role in microservice rate limiting, details a Docker-specific bug where CPU utilization is incorrectly reported, clarifies the difference between CPU load and utilization, and outlines the code fix and its remaining limitations.

DockerJavacpu-utilization
0 likes · 6 min read
Why Sentinel Misses Docker CPU Usage and How to Fix It
Tencent Cloud Developer
Tencent Cloud Developer
Oct 8, 2019 · Cloud Native

TLinux Team's Mixed Deployment Scheme for Improving Whole-Machine CPU Utilization

Tencent’s TLinux team introduced a kernel‑level mixed‑deployment framework that adds an offline scheduling class and load‑balancing algorithm, enabling online tasks to instantly pre‑empt offline work and boosting whole‑machine CPU utilization to as high as 90% while preserving latency‑sensitive service performance.

Linux schedulingPerformance Optimizationcgroup
0 likes · 13 min read
TLinux Team's Mixed Deployment Scheme for Improving Whole-Machine CPU Utilization
Java Captain
Java Captain
Jun 28, 2019 · Backend Development

Understanding High Concurrency: CPU Utilization, Context Switching, and Performance Comparison of PHP Swoole vs Java Netty

This article explains that high concurrency is fundamentally about efficiently squeezing CPU resources, discusses the evolution of concurrency models, presents a control‑variable analysis of HTTP request flow, and compares PHP‑Swoole with Java‑Netty through detailed benchmark results and code examples.

Java NettyPHP SwoolePerformance Testing
0 likes · 13 min read
Understanding High Concurrency: CPU Utilization, Context Switching, and Performance Comparison of PHP Swoole vs Java Netty
Java Backend Technology
Java Backend Technology
May 25, 2019 · Backend Development

How to Size Thread Pools for Better CPU Utilization and QPS

This article explains why simply increasing thread‑pool size often fails, describes how CPU, network and disk I/O affect request handling, introduces the concept of context switching, and provides practical formulas and guidelines for calculating the optimal number of threads in a Java backend service.

BackendJavaQPS
0 likes · 9 min read
How to Size Thread Pools for Better CPU Utilization and QPS
21CTO
21CTO
Apr 18, 2019 · Operations

Why Traditional API Gateways Crash: CPU, Disk, and Network Bottlenecks Explained

The article examines how traditional synchronous and semi‑synchronous API gateways can fail under high load by analyzing CPU utilization, disk I/O, and network latency, and offers practical monitoring metrics and mitigation strategies to prevent cascading failures in large‑scale systems.

Asynchronous ArchitectureDisk I/ONetwork Latency
0 likes · 9 min read
Why Traditional API Gateways Crash: CPU, Disk, and Network Bottlenecks Explained
Efficient Ops
Efficient Ops
Dec 25, 2018 · Fundamentals

Why %CPU Is Misleading: Understanding the Real Meaning of CPU Utilization

This article explains why the traditional %CPU metric can be deceptive, describes how CPU stalls and memory bottlenecks affect perceived utilization, and offers practical guidance on using IPC and performance counters to obtain a more accurate view of system performance.

CPU stallsIPCPerformance Monitoring
0 likes · 10 min read
Why %CPU Is Misleading: Understanding the Real Meaning of CPU Utilization
Qunar Tech Salon
Qunar Tech Salon
Jun 29, 2015 · Fundamentals

Understanding System Load Average and Its Interpretation

The article explains how Unix-like systems calculate load average using an exponentially damped weighted moving average, how the values reflect CPU and I/O contention on single- and multi‑CPU machines, and why different kernel implementations may count processes and threads differently, affecting performance monitoring.

LinuxLoad AverageScheduling
0 likes · 6 min read
Understanding System Load Average and Its Interpretation