Tagged articles
60 articles
Page 1 of 1
Ops Community
Ops Community
Apr 15, 2026 · Operations

Why Do Linux Processes Vanish? A Complete Troubleshooting Guide

This article systematically explains why Linux processes may disappear, covering OOM Killer, signal termination, cgroup limits, systemd timeouts, manual kills, and provides step‑by‑step diagnostic commands and preventive measures for RHEL, AlmaLinux, and Ubuntu environments.

LinuxMemoryOOM killer
0 likes · 26 min read
Why Do Linux Processes Vanish? A Complete Troubleshooting Guide
Ray's Galactic Tech
Ray's Galactic Tech
Oct 23, 2025 · Cloud Native

How to Seamlessly Upgrade Kubernetes from Docker to Containerd

Learn a step‑by‑step process for migrating Kubernetes clusters (v1.24+) from the deprecated Docker runtime to the native Containerd CRI, covering compatibility checks, node preparation, installation, configuration, node draining, kubelet updates, validation, and common pitfalls such as cgroup driver mismatches.

CRICluster MigrationDocker
0 likes · 8 min read
How to Seamlessly Upgrade Kubernetes from Docker to Containerd
Deepin Linux
Deepin Linux
Oct 10, 2025 · Operations

Mastering Linux Memory cgroups: Theory, Configuration, and Real‑World Scripts

This comprehensive guide explains the fundamentals of Linux memory cgroups, walks through their creation and configuration, details core kernel mechanisms such as OOM handling and hierarchical limits, and provides practical Bash scripts for Docker, Kubernetes, and multi‑tenant environments to help engineers reliably control memory usage.

ContainerLinuxResource Isolation
0 likes · 60 min read
Mastering Linux Memory cgroups: Theory, Configuration, and Real‑World Scripts
Raymond Ops
Raymond Ops
May 31, 2025 · Operations

Master Docker Container Management: Run, Stop, Resource Limits & Best Practices

This guide walks through Docker container fundamentals, covering how to run containers with CMD or ENTRYPOINT, keep them alive, attach or exec into them, stop/start/restart, pause/unpause, remove, apply memory, CPU and block I/O limits, and explains the underlying cgroup and namespace technologies that enforce isolation and resource control.

Container ManagementDockerNamespace
0 likes · 18 min read
Master Docker Container Management: Run, Stop, Resource Limits & Best Practices
OPPO Kernel Craftsman
OPPO Kernel Craftsman
May 23, 2025 · Fundamentals

Why Android Threads Stall: Inside Linux CFS, RT, Fair Scheduling and cgroup Limits

This article explains why many Android threads appear idle despite free CPU cycles, by dissecting Linux's five scheduler classes, the distinction between RT and fair scheduling, the role of virtual runtime and weight, and how cgroup cpu.shares can reshape resource distribution, supported by concrete systrace experiments.

Android performanceCFSLinux scheduler
0 likes · 17 min read
Why Android Threads Stall: Inside Linux CFS, RT, Fair Scheduling and cgroup Limits
System Architect Go
System Architect Go
Dec 11, 2024 · Cloud Native

Kubernetes CPU Configuration and Linux CFS Interaction

This article explains how Kubernetes resource requests and limits map to Linux cgroup settings via the CFS scheduler, illustrates the underlying calculations for cpu.shares and cpu.cfs_quota_us, and discusses the impact on programming languages such as Go and Java within containers.

CFSJavaKubernetes
0 likes · 5 min read
Kubernetes CPU Configuration and Linux CFS Interaction
Bilibili Tech
Bilibili Tech
Jun 4, 2024 · Big Data

Improving Resource Utilization and Isolation in Bilibili Big Data Clusters with the Amiya Over‑commit Component

By deploying the self‑developed Amiya over‑commit component together with kernel‑level cgroup memory isolation, explicit task priorities, OOM‑priority killing, and asynchronous reclamation, Bilibili’s big‑data clusters boosted daily resource utilization by about 15 %, eliminated DataNode OOM kills, cut memory‑reclaim latency to zero, and achieved a further 9 % overall efficiency gain.

OOM PriorityPerformance EvaluationResource Management
0 likes · 18 min read
Improving Resource Utilization and Isolation in Bilibili Big Data Clusters with the Amiya Over‑commit Component
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Apr 19, 2024 · Cloud Native

Understanding Docker: Architecture, Core Components, and Underlying Mechanisms

This article explains Docker’s core architecture—including images, containers, and registries—and details how Linux namespaces, cgroups, and UnionFS work together to provide resource isolation, limitation, and lightweight virtualization, while also offering promotional links to extensive architecture and interview collections.

Cloud NativeContainersDocker
0 likes · 6 min read
Understanding Docker: Architecture, Core Components, and Underlying Mechanisms
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Mar 8, 2024 · Operations

Evolution and Mechanisms of Linux I/O Schedulers (Kyber, MQ‑Deadline, BFQ)

From legacy spinning-disk optimizations to modern SSD-focused QoS, Linux’s I/O scheduler landscape has evolved through noop, deadline, cfq and now multiqueue designs such as Kyber, MQ-Deadline, and BFQ, each employing distinct latency, deadline, and budget-fairness algorithms, supporting cgroup/ionice priorities, and complemented by numerous experimental out-of-tree implementations.

BFQI/O schedulerKyber
0 likes · 10 min read
Evolution and Mechanisms of Linux I/O Schedulers (Kyber, MQ‑Deadline, BFQ)
Liangxu Linux
Liangxu Linux
Jan 13, 2024 · Cloud Native

Essential Docker Commands and Architecture: From Basics to Containers

This guide explains Docker's core architecture, lists the most frequently used Docker commands, compares containers with traditional virtual machines, and dives into the underlying Linux technologies—namespaces, cgroups, and union file systems—that make containerization possible.

ContainerDockerNamespace
0 likes · 12 min read
Essential Docker Commands and Architecture: From Basics to Containers
Liangxu Linux
Liangxu Linux
Nov 29, 2023 · Fundamentals

Inside Linux Cgroup CPU Subsystem: How Containers Get CPU Time Controlled

This article provides a detailed, code‑driven explanation of how Linux cgroup’s CPU subsystem manages container CPU usage, covering cgroup creation, limit configuration, kernel object relationships, scheduler integration, bandwidth enforcement, and the role of period and slack timers.

CPU schedulingLinux kernelcfs_bandwidth
0 likes · 27 min read
Inside Linux Cgroup CPU Subsystem: How Containers Get CPU Time Controlled
MaGe Linux Operations
MaGe Linux Operations
Sep 6, 2023 · Cloud Native

Mastering Process Resource Limits in Docker & Kubernetes: ulimit and cgroup Explained

This article explains how Linux ulimit and cgroup mechanisms can be used to restrict file descriptors, memory, and thread counts in containerized environments, compares Docker and Kubernetes configurations, presents experimental results on fd and thread limits, and offers practical recommendations for configuring default‑ulimits, pod‑max‑pids, and system limits.

ContainerDockercgroup
0 likes · 17 min read
Mastering Process Resource Limits in Docker & Kubernetes: ulimit and cgroup Explained
Tencent Cloud Developer
Tencent Cloud Developer
May 31, 2023 · Cloud Computing

TencentOS "Wujing": Server Memory Multi-Level Offloading Solution for Cloud Data Centers

TencentOS “Wujing” provides a server‑memory multi‑level offloading framework that uses kernel‑side reclamation, heat‑aware page classification, SWAP balancing, and CXL promotion to shift cold pages to cheaper storage, cutting data‑center memory use by up to 50 % while preserving performance.

DAMONLinux kernelMemory Optimization
0 likes · 14 min read
TencentOS "Wujing": Server Memory Multi-Level Offloading Solution for Cloud Data Centers
Refining Core Development Skills
Refining Core Development Skills
Apr 4, 2023 · Cloud Native

Understanding Container CPU Utilization: Accurate Measurement Methods and the Missing Nice/IRQ/SoftIRQ Metrics

This article explains how to correctly obtain CPU utilization inside containers, compares host and container metrics, describes the use of lxcfs and cgroup files (including cgroup V1/V2) for accurate measurement, and clarifies why container statistics omit nice, irq, and softirq fields.

Cloud NativeContainercgroup
0 likes · 16 min read
Understanding Container CPU Utilization: Accurate Measurement Methods and the Missing Nice/IRQ/SoftIRQ Metrics
Alibaba Cloud Native
Alibaba Cloud Native
Mar 10, 2023 · Cloud Native

Uncovering the Root Causes of ACK Cluster Network Latency: kubelet, softirq, and cgroup Insights

A detailed post‑mortem explains how excessive cgroup files, kubelet's sys‑CPU usage, soft‑interrupt scheduling delays, and a buggy page‑free routine caused intermittent hundreds‑of‑milliseconds network latency in an Alibaba Cloud ACK cluster, and how targeted CPU binding and kernel patches resolved the issue.

Cloud NativeKernelKubernetes
0 likes · 14 min read
Uncovering the Root Causes of ACK Cluster Network Latency: kubelet, softirq, and cgroup Insights
MaGe Linux Operations
MaGe Linux Operations
Jan 31, 2023 · Cloud Native

Mastering ulimit and cgroup: Limit Files & Threads in Docker/Kubernetes

This article explains how Linux's ulimit and cgroup mechanisms can be used to restrict file descriptors and thread counts in Docker and Kubernetes environments, compares configuration methods, presents experimental results, and offers practical recommendations for setting limits at the container, pod, and host levels.

ContainerKubernetescgroup
0 likes · 17 min read
Mastering ulimit and cgroup: Limit Files & Threads in Docker/Kubernetes
Open Source Linux
Open Source Linux
Jan 17, 2023 · Backend Development

Why Your Java App Gets OOMKilled in Kubernetes and How to Fix It

This article explains why Java applications running in Kubernetes containers are often terminated with OOMKilled (exit code 137), analyzes the underlying JVM memory‑limit mismatches, and provides practical solutions using cgroup‑aware JVM flags and memory‑tuning techniques.

DockerJVMJava
0 likes · 14 min read
Why Your Java App Gets OOMKilled in Kubernetes and How to Fix It
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Jan 6, 2023 · Fundamentals

CFS Group Scheduling: Purpose, Configuration, and Kernel Implementation Details

The article explains why Linux’s Completely Fair Scheduler introduced group scheduling, how Android configures task groups via cpu.shares and Process.java, and details the kernel structures (task_group, sched_entity, cfs_rq) and algorithms for weight calculation, load measurement, propagation, and hierarchical load balancing.

CFS schedulingLinux kernelcgroup
0 likes · 28 min read
CFS Group Scheduling: Purpose, Configuration, and Kernel Implementation Details
Alibaba Cloud Native
Alibaba Cloud Native
Jan 4, 2023 · Cloud Native

Explore Koordinator v1.1: Load‑Aware Scheduling, cgroup v2, and Descheduler Updates

Koordinator v1.1 introduces load‑aware scheduling with workload‑type awareness, percentile‑based resource aggregation, cgroup v2 support, a new LowNodeLoad descheduler plugin for load‑aware rebalancing, expanded performance collectors, ServiceMonitor integration, and detailed configuration examples, aiming to improve latency‑sensitive workloads and overall cluster resource efficiency.

CloudNativeDeschedulerKubernetes
0 likes · 25 min read
Explore Koordinator v1.1: Load‑Aware Scheduling, cgroup v2, and Descheduler Updates
Liangxu Linux
Liangxu Linux
Oct 7, 2022 · Cloud Native

How Docker Leverages chroot, Namespaces, and Cgroups for Secure Containers

A whimsical tale explains how Docker solves application deployment pain by using chroot/pivot_root for filesystem isolation, Linux namespaces to hide host resources, and cgroups to limit CPU, memory, and I/O, illustrating core container technologies with practical code snippets.

ContainersDockerLinux
0 likes · 8 min read
How Docker Leverages chroot, Namespaces, and Cgroups for Secure Containers
Open Source Linux
Open Source Linux
Sep 28, 2022 · Cloud Native

What Really Makes a Kubernetes Pod? Inside Its Isolation, Namespaces & Cgroups

This article explores how Kubernetes Pods differ from simple containers by examining their underlying implementation, shared network and IPC namespaces, cgroup hierarchies, and the role of the pause sandbox, while also demonstrating how similar pod-like behavior can be achieved using Docker and cgroup tools.

ContainerDockerKubernetes
0 likes · 20 min read
What Really Makes a Kubernetes Pod? Inside Its Isolation, Namespaces & Cgroups
37 Interactive Technology Team
37 Interactive Technology Team
Aug 15, 2022 · Operations

Understanding Linux cgroups and Controlling Process CPU Usage

The article introduces Linux cgroups as a kernel feature for limiting resources, explains their terminology and functions, and demonstrates a hands‑on experiment that creates a CPU cgroup, sets cpu.cfs_quota_us to restrict a process to roughly 25 % CPU usage, confirming effective resource control.

CPU LimitingLinuxcgroup
0 likes · 7 min read
Understanding Linux cgroups and Controlling Process CPU Usage
Bilibili Tech
Bilibili Tech
Jun 17, 2022 · Information Security

Container Escape Techniques, Exploits, and Mitigation Strategies

The article explains how attackers can break out of Docker containers by exploiting misconfigurations, vulnerable Docker components, kernel bugs, or Kubernetes RBAC errors, illustrates real‑world exploits such as host‑proc mounts and CVE‑2019‑5736, and provides mitigation steps like limiting privileges, updating software, and securing configurations.

Container SecurityDockerNamespace
0 likes · 15 min read
Container Escape Techniques, Exploits, and Mitigation Strategies
Liangxu Linux
Liangxu Linux
Feb 12, 2022 · Fundamentals

Understanding Linux CGroup Internals: Key Structures and Resource Control

This article walks through the Linux 2.6.25 CGroup implementation by examining core kernel structures such as cgroup, cgroup_subsys_state, mem_cgroup, css_set, and cgroup_subsys, explaining how they form hierarchical resource control, how mounting and task attachment work, and how memory limits are enforced.

MemoryLimitResourceControlcgroup
0 likes · 17 min read
Understanding Linux CGroup Internals: Key Structures and Resource Control
Cloud Native Technology Community
Cloud Native Technology Community
Dec 2, 2021 · Cloud Native

Understanding cgroup and namespace in Linux for Cloud‑Native Containers

This article explains the role of Linux cgroup and namespace technologies in providing resource isolation and security for containers, traces their historical development from early chroot mechanisms to modern Docker and Kubernetes, and details cgroup architecture, core files, migration, delegation, and practical usage examples.

ContainerDockerLinux
0 likes · 17 min read
Understanding cgroup and namespace in Linux for Cloud‑Native Containers
360 Tech Engineering
360 Tech Engineering
Oct 9, 2021 · Cloud Native

Using LXCFS to Provide Accurate /proc Data Inside Containers

This article explains the problem of inaccurate /proc information in container environments, introduces LXCFS as a FUSE‑based solution that maps cgroup data to /proc, and provides step‑by‑step instructions for installing, mounting, and using LXCFS with Docker and Kubernetes.

FUSEKubernetesLXCFS
0 likes · 7 min read
Using LXCFS to Provide Accurate /proc Data Inside Containers
Aikesheng Open Source Community
Aikesheng Open Source Community
Sep 14, 2021 · Databases

Testing MySQL open_files_limit: Which Configuration Takes Effect – my.cnf, systemd Service, or Docker Limits

This article presents a systematic investigation of MySQL's open_files_limit setting across three scenarios—conflict between my.cnf and systemd service, effectiveness when using mysqld_safe, and the impact of Docker's own limits—revealing which source actually controls the maximum number of open files.

cgroupmysqlopen files limit
0 likes · 10 min read
Testing MySQL open_files_limit: Which Configuration Takes Effect – my.cnf, systemd Service, or Docker Limits
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Sep 3, 2021 · Operations

Understanding Linux cgroups: Mechanism, Data Structures, and Core Logic

Linux cgroups are a kernel mechanism that groups processes into hierarchical directories, each subsystem (e.g., freezer, CPU, memory, IO) exposing control files such as cgroup.procs and freezer.state, with core data structures like cgroup_subsys, cgroup, css_set linking threads to multiple subsystems and enabling resource policies, freezing, throttling, and allocation.

KernelLinuxResource Management
0 likes · 6 min read
Understanding Linux cgroups: Mechanism, Data Structures, and Core Logic
Efficient Ops
Efficient Ops
Feb 2, 2021 · Cloud Native

Unlocking Linux Namespaces: How Docker Leverages Classic Isolation Techniques

This article explains how Docker relies on Linux's eight namespace types and cgroups to achieve fine‑grained isolation, demonstrates practical unshare commands for PID, mount, UTS, IPC, user, and network namespaces, and highlights the role of namespaces in container security and resource management.

Dockercgroupcontainer isolation
0 likes · 7 min read
Unlocking Linux Namespaces: How Docker Leverages Classic Isolation Techniques
360 Tech Engineering
360 Tech Engineering
Nov 25, 2019 · Cloud Native

Understanding lxcfs: Isolating /proc and /sys in Containers

This article explains how lxcfs, a FUSE‑based user‑space filesystem, isolates the /proc and /sys virtual files for containers, details its implementation for reading cpuonline and load average, and provides code examples of the core functions that enable per‑container system metric visibility.

FUSELinuxLoad Average
0 likes · 17 min read
Understanding lxcfs: Isolating /proc and /sys in Containers
Node Underground
Node Underground
Oct 14, 2019 · Backend Development

How to Harness Node.js Worker Threads with Shared Memory and CGroup Limits

This article demonstrates creating Node.js Worker Threads to compute Fibonacci numbers, explains message passing via workerData and parentPort, explores shared memory with SharedArrayBuffer, and shows how to isolate and limit thread CPU usage using Linux CGroup controls, including retrieving thread IDs via a custom native addon.

Node.jsSharedArrayBufferWorker Threads
0 likes · 8 min read
How to Harness Node.js Worker Threads with Shared Memory and CGroup Limits
Alibaba Cloud Native
Alibaba Cloud Native
Oct 11, 2019 · Cloud Native

Can Dynamic Cgroup Tweaks Boost Kubernetes Resource Utilization?

This article shares Alibaba Cloud Container Platform's practical experience in improving container resource utilization by dynamically adjusting cgroup limits, describing real‑world challenges, the design of a policy‑engine solution, experimental results, lessons learned, and future directions for cloud‑native workloads.

Dynamic SchedulingKubernetesPolicy Engine
0 likes · 25 min read
Can Dynamic Cgroup Tweaks Boost Kubernetes Resource Utilization?
Tencent Cloud Developer
Tencent Cloud Developer
Oct 8, 2019 · Cloud Native

TLinux Team's Mixed Deployment Scheme for Improving Whole-Machine CPU Utilization

Tencent’s TLinux team introduced a kernel‑level mixed‑deployment framework that adds an offline scheduling class and load‑balancing algorithm, enabling online tasks to instantly pre‑empt offline work and boosting whole‑machine CPU utilization to as high as 90% while preserving latency‑sensitive service performance.

Linux schedulingPerformance Optimizationcgroup
0 likes · 13 min read
TLinux Team's Mixed Deployment Scheme for Improving Whole-Machine CPU Utilization
58 Tech
58 Tech
Sep 4, 2019 · Cloud Computing

Designing a Solution to Limit Container Thread Count in a Private Cloud Platform Using cgroup pids and inotify

This article analyzes the lack of thread‑count limits in a Kubernetes‑based private cloud platform, reproduces the issue with a Python multiprocessing script, and proposes a solution that combines the cgroup pids subsystem with inotify to enforce per‑container thread limits and provide real‑time alerts.

Kubernetescgroupcloud platform
0 likes · 8 min read
Designing a Solution to Limit Container Thread Count in a Private Cloud Platform Using cgroup pids and inotify
dbaplus Community
dbaplus Community
Jul 25, 2019 · Operations

How to Secure Docker Containers with Namespaces and Cgroups

This guide explains Docker's Namespace and Cgroup mechanisms, shows how to configure them to limit resources and isolate containers, and demonstrates practical commands for protecting container security while highlighting their limitations.

Container SecurityDockerLinux
0 likes · 16 min read
How to Secure Docker Containers with Namespaces and Cgroups
DevOps
DevOps
Jul 12, 2017 · Cloud Native

Container Monitoring: Challenges, Metrics Collection, and Best Practices

This article examines the unique challenges of monitoring containers, outlines three categories of metrics to collect, compares host‑centric and layered monitoring architectures, provides detailed methods for gathering CPU, memory, I/O and network data via cgroup files and Docker commands, and shares practical insights, tooling recommendations, and a Q&A session for effective container observability.

DockerOpsPrometheus
0 likes · 18 min read
Container Monitoring: Challenges, Metrics Collection, and Best Practices
21CTO
21CTO
Dec 4, 2015 · Big Data

Building a Cost‑Effective Real‑Time Stream Processing Platform with Storm

This article details how the e‑commerce company 1号店 selected the Storm framework to create a low‑cost, highly available, and easily scalable distributed stream‑processing system, covering architecture design, resource isolation with CGroup, custom UI improvements, and operational lessons for handling massive traffic spikes.

Resource ManagementStormcgroup
0 likes · 9 min read
Building a Cost‑Effective Real‑Time Stream Processing Platform with Storm
Qunar Tech Salon
Qunar Tech Salon
Jun 8, 2015 · Operations

An Introduction to systemd: Features, Concepts, and Usage

This article provides a comprehensive overview of systemd, the modern Linux init system, covering its design goals, compatibility with SysVinit, parallel boot advantages, on‑demand activation, CGroup‑based process tracking, unit types, dependency handling, snapshot and journal capabilities, as well as practical guidance for developers and system administrators.

Linux initService Managementcgroup
0 likes · 31 min read
An Introduction to systemd: Features, Concepts, and Usage