Linux Kernel Journey
Author

Linux Kernel Journey

Linux Kernel Journey

125
Articles
0
Likes
0
Views
0
Comments
Recent Articles

Latest from Linux Kernel Journey

100 recent articles max
Linux Kernel Journey
Linux Kernel Journey
Aug 5, 2025 · Fundamentals

Why Convert madv_dontneed/madv_free to Per‑VMA Locks in Linux

The article explains how the traditional mmap_lock read lock creates severe contention and priority inversion for frequent madvise MADV_DONTNEED/FREE calls, and how a per‑VMA locking redesign eliminates this bottleneck, improves performance, and is slated for the next Linux kernel release.

Linuxkernel memory managementlock contention
0 likes · 5 min read
Why Convert madv_dontneed/madv_free to Per‑VMA Locks in Linux
Linux Kernel Journey
Linux Kernel Journey
Jul 21, 2025 · Fundamentals

Mastering CUDA GPU Performance Analysis and Tracing

This guide walks you through a complete workflow for profiling CUDA applications, covering GPU performance fundamentals, key metrics, NVIDIA Nsight tools, CUPTI programming, example code, common bottlenecks, and best‑practice recommendations to identify and eliminate performance limits.

CUDACUPTIGPU profiling
0 likes · 13 min read
Mastering CUDA GPU Performance Analysis and Tracing
Linux Kernel Journey
Linux Kernel Journey
Jun 29, 2025 · Fundamentals

How Xavier Xia’s Persistent Optimizations Made contpte_ptep_get Faster in All Scenarios

The article chronicles Xavier Xia’s iterative patches to the Linux kernel’s contpte_ptep_get() function, showing how early‑exit logic and subsequent refinements ultimately yielded consistent performance gains across diverse dirty/young page table scenarios, backed by benchmark data that convinced skeptical reviewers.

Linux kernelPerformance OptimizationXavier Xia
0 likes · 4 min read
How Xavier Xia’s Persistent Optimizations Made contpte_ptep_get Faster in All Scenarios
Linux Kernel Journey
Linux Kernel Journey
Jun 17, 2025 · Fundamentals

Master the Three Classic Cache Mapping Strategies: Theory and Practical Implementation

This article explains why cache is critical for computer performance and provides a thorough analysis of the three classic cache mapping strategies—direct‑mapped, fully associative, and set‑associative—detailing their mechanisms, advantages, disadvantages, concrete examples, and guidance on selecting the appropriate method for different system scenarios.

CacheComputer ArchitectureDirect Mapping
0 likes · 24 min read
Master the Three Classic Cache Mapping Strategies: Theory and Practical Implementation
Linux Kernel Journey
Linux Kernel Journey
Jun 16, 2025 · Cloud Computing

How Tencent’s TGW Achieves Seamless Fast Migration and Self‑Healing Fault Recovery

The paper presents Tencent’s TGW cloud gateway architecture, highlighting a 2.9× forwarding performance boost, lossless state migration within 4 seconds, sub‑minute fault detection, multi‑level fault‑tolerance mechanisms, and operational best practices that enable 100 % availability for massive online services.

Cloud GatewayDPDKFault Tolerance
0 likes · 16 min read
How Tencent’s TGW Achieves Seamless Fast Migration and Self‑Healing Fault Recovery
Linux Kernel Journey
Linux Kernel Journey
Jun 12, 2025 · Information Security

In‑Depth Analysis of JuiceFS Permission Management: Full Compatibility with Linux Security Mechanisms

This article thoroughly examines how JuiceFS, a FUSE‑based distributed file system, aligns with Linux's discretionary and mandatory access control models, implements Unix permissions, POSIX ACLs, special permission bits, capability handling, and SELinux integration, while detailing practical configurations such as SDK access, root‑squash, all‑squash, and optimized ACL storage.

CapabilityJuiceFSLinux permissions
0 likes · 16 min read
In‑Depth Analysis of JuiceFS Permission Management: Full Compatibility with Linux Security Mechanisms
Linux Kernel Journey
Linux Kernel Journey
Jun 9, 2025 · Fundamentals

How to Trace CUDA GPU Operations with eBPF

This tutorial explains how to build an eBPF‑based tracing tool that intercepts CUDA runtime API calls via uprobes, captures detailed event data such as memory sizes, transfer directions, kernel launches and errors, and presents it in a readable format for debugging and performance analysis.

CUDAGPU tracingLinux
0 likes · 17 min read
How to Trace CUDA GPU Operations with eBPF