Tagged articles

73 articles

Page 1 of 1

Nov 14, 2025 · Backend Development

How to Install and Use XHProf for PHP Performance Profiling

This guide explains what XHProf is, walks through installing it via PECL or source, configuring php.ini, verifying the installation, integrating it with a middleware class, and accessing the web UI to view detailed performance metrics and visualizations.

PHPXHProfmiddleware

0 likes · 6 min read

How to Install and Use XHProf for PHP Performance Profiling

AI Cyberspace

Sep 28, 2025 · Artificial Intelligence

How to Set Up WSL2 GPU Acceleration and Profile CUDA on Windows 11

This guide walks through configuring Windows 11 with WSL2 and Ubuntu 22.04 for GPU‑accelerated CUDA development, installing NVIDIA drivers and CUDA libraries, setting up SSH and firewall rules, running a CUDA stress‑test program, and using Nsight Systems, Nsight Compute, and NVIDIA DCGM for performance profiling and monitoring.

CUDAGPULinux

0 likes · 39 min read

Alibaba Cloud Developer

Sep 8, 2025 · Fundamentals

How to Profile GPU Kernels with PTX Probes: From CUDA Basics to Custom Instrumentation

This article walks through GPU performance analysis, starting with CUDA architecture fundamentals, demonstrating matrix multiplication optimization, explaining PTX assembly, and introducing the Neutrino framework for programmable GPU probes that enable fine‑grained, custom instrumentation and detailed timing measurements of kernel execution.

CUDAGPUNeutrino

0 likes · 45 min read

Code Wrench

Aug 31, 2025 · Backend Development

Master Go Performance Debugging with pprof and trace: Heap, Goroutine & CPU Profiling

This guide explains why and how to use Go's pprof and trace tools to diagnose memory leaks, goroutine stalls, and CPU bottlenecks, providing step‑by‑step commands, example code, a cheat‑sheet of core commands, and a practical workflow for microservice tracing.

BackendGoMicroservices

0 likes · 8 min read

Master Go Performance Debugging with pprof and trace: Heap, Goroutine & CPU Profiling

Didi Tech

Aug 7, 2025 · Cloud Native

How HUATUO Revolutionizes Cloud‑Native Observability with Zero‑Impact BPF Tracing

HUATUO, Didi's open‑source cloud‑native observability project, leverages BPF‑based low‑overhead kernel tracing, unified metric and event frameworks, automatic flame‑graph generation, and seamless integration with Prometheus, Grafana and Elasticsearch to provide panoramic, zero‑intrusive monitoring and continuous performance profiling for complex production environments.

BPFCloud NativeDistributed Systems

0 likes · 11 min read

How HUATUO Revolutionizes Cloud‑Native Observability with Zero‑Impact BPF Tracing

360 Zhihui Cloud Developer

Jul 9, 2025 · Backend Development

Master Go Performance Profiling with pprof: From Basics to Real‑World Fixes

This comprehensive guide explains how to use Go's built‑in pprof tool to profile CPU, memory, and goroutine behavior, covering integration methods, command‑line analysis, visualization techniques, and step‑by‑step solutions for common performance bottlenecks in production Go services.

Backend DevelopmentCPU optimizationGo

0 likes · 16 min read

Master Go Performance Profiling with pprof: From Basics to Real‑World Fixes

Linux Code Review Hub

Apr 5, 2025 · Operations

Inside Linux Perf: How the Kernel’s Powerful Tracing Tool Works

The article introduces Linux’s built‑in performance analysis tool perf, explains its event‑driven sampling, tracing and profiling capabilities, shows how to install it on various distributions, demonstrates common commands with real code examples, and discusses practical scenarios for locating and optimizing kernel and application performance issues.

BenchmarkingSystem optimizationflamegraph

0 likes · 36 min read

Inside Linux Perf: How the Kernel’s Powerful Tracing Tool Works

JD Tech Talk

Mar 14, 2025 · Backend Development

Analysis of Java Thread States, Flame Graphs, and Performance Optimizations

The article examines Java thread states—WAITING and TIMED_WAITING—through monitoring screenshots, explains their definitions, trigger conditions, and recovery mechanisms, analyzes real‑world code involving concurrent tasks, and presents flame‑graph based performance optimizations for backend services.

Javaflame graphperformance profiling

0 likes · 10 min read

Analysis of Java Thread States, Flame Graphs, and Performance Optimizations

Deepin Linux

Dec 27, 2024 · Fundamentals

Comprehensive Guide to Using Valgrind for Memory Debugging and Performance Analysis

This article provides an in‑depth overview of Valgrind, explaining its architecture, the capabilities of its various tools such as Memcheck, Cachegrind, Callgrind, Helgrind and Massif, and offers step‑by‑step installation, configuration, and practical code examples for detecting memory errors, leaks, and performance bottlenecks in C/C++ programs.

C++Toolchainleak detection

0 likes · 28 min read

Comprehensive Guide to Using Valgrind for Memory Debugging and Performance Analysis

Open Source Linux

Dec 12, 2024 · Fundamentals

Measure Execution Time in C: time(), clock() and gettimeofday()

This guide demonstrates how to use the C standard library functions time(), clock(), and the POSIX gettimeofday() to measure code execution duration, comparing their precision levels and showing sample code snippets that illustrate typical usage and output.

Cclockgettimeofday

0 likes · 4 min read

Measure Execution Time in C: time(), clock() and gettimeofday()

Tencent Cloud Developer

Nov 5, 2024 · Backend Development

Understanding and Optimizing Go Performance with pprof and trace Tools

The article teaches Go developers how to generate and analyze CPU, heap, allocation, and goroutine profiles with pprof and full‑runtime traces, interpret SVG flame‑graphs, top lists, and source views, and apply concrete optimizations—such as buffering channels and using sync.Pool—to dramatically speed up a Mandelbrot generator.

GoGoroutineHeap Analysis

0 likes · 58 min read

Understanding and Optimizing Go Performance with pprof and trace Tools

Alibaba Cloud Developer

Oct 18, 2024 · Backend Development

How to Use Arthas Flamegraph for Java Performance Profiling and Optimization

Learn how to leverage the Arthas flamegraph tool to profile Java applications, interpret CPU usage visualizations, and apply practical optimization techniques illustrated through real-world case studies that reduced CPU consumption by up to 6% and improved system stability during high‑traffic events.

ArthasBackend DevelopmentCPU optimization

0 likes · 7 min read

How to Use Arthas Flamegraph for Java Performance Profiling and Optimization

Liangxu Linux

Oct 6, 2024 · Fundamentals

Master Linux Resource Profiling with the /usr/bin/time Utility

This guide explains how to use the Linux /usr/bin/time command to measure program execution time, memory usage, and other resource metrics, covering syntax, options, custom output formats, and the differences between the built‑in shell time and the external utility.

LinuxResource Monitoringcommand-line tools

0 likes · 8 min read

Master Linux Resource Profiling with the /usr/bin/time Utility

DeWu Technology

Sep 30, 2024 · Backend Development

Automated Performance Profiling for Go Services with Conan

Conan is an automated profiling solution for Go microservices that embeds an SDK to continuously or adaptively sample CPU, memory and goroutine metrics, detects anomalies via user‑defined rules, uploads data to a Pyroscope server, and reports results through Feishu or Pyroscope, delivering sub‑5 % overhead and faster root‑cause analysis.

AutomationGoPyroscope

0 likes · 16 min read

Automated Performance Profiling for Go Services with Conan

FunTester

Sep 30, 2024 · Backend Development

Unlock Java Performance: Essential Profiling Techniques and Tools

This guide introduces Java performance analysis fundamentals, covering profiling tools, key metrics such as CPU, memory, thread behavior, and practical methods to detect and resolve common issues like memory leaks, GC pauses, high CPU usage, I/O bottlenecks, and inefficient collection usage.

CPU analysisGarbage CollectionJava

0 likes · 13 min read

Unlock Java Performance: Essential Profiling Techniques and Tools

Test Development Learning Exchange

Sep 10, 2024 · Fundamentals

Python Basics: Dictionaries, Sets, File Operations, Regular Expressions, Logging, Unit Testing, Profiling, and Third‑Party Libraries

This article introduces core Python concepts—including dictionaries and sets as key‑value and unique collections, file and directory manipulation with the os module, regular expression handling, logging setup, unit testing with unittest, performance profiling via cProfile, and using popular third‑party libraries such as requests and BeautifulSoup.

Data StructuresFile OperationsPython

0 likes · 7 min read

Python Basics: Dictionaries, Sets, File Operations, Regular Expressions, Logging, Unit Testing, Profiling, and Third‑Party Libraries

Alibaba Cloud Developer

Aug 22, 2024 · Backend Development

How to Profile WebAssembly Performance with Code Instrumentation and Flame Graphs

This article explains how to profile WebAssembly code by inserting lightweight instrumentation hooks, optimizing the overhead with a tree‑based call‑stack, generating flame‑graph data, and demonstrates the whole workflow with a sample Fibonacci program compiled to WASM.

JITWAVMWebAssembly

0 likes · 13 min read

Sohu Tech Products

Aug 21, 2024 · Mobile Development

Analyzing and Optimizing Device Overheating Issues Using Trace Data

This article explains how to evaluate device overheating using metrics such as CPU/GPU usage and temperature, collect and visualize Trace data with SmartPerf or DevEco Studio, analyze high‑load threads and frequency scaling, and apply targeted code changes—disabling sensors, releasing Lottie resources, and simplifying components—to reduce heat and restore frame‑rate stability.

CPU optimizationDevEco StudioHarmonyOS

0 likes · 14 min read

Analyzing and Optimizing Device Overheating Issues Using Trace Data

Tencent Cloud Developer

May 22, 2024 · Backend Development

Comprehensive Guide to Go pprof and trace Tools for Performance Analysis

This comprehensive guide teaches Go developers how to generate CPU, memory, and goroutine profiles with pprof, interpret SVG, top, source, and peek visualizations, understand the runtime’s sampling and allocation internals, use the trace tool to analyze events, and apply these techniques to real‑world optimizations such as speeding up a Mandelbrot image generator.

Goperformance profilingpprof

0 likes · 57 min read

Comprehensive Guide to Go pprof and trace Tools for Performance Analysis

Ops Development & AI Practice

May 18, 2024 · Backend Development

Master Go’s pprof: Unlock Every Command for Deep Performance Insights

This guide walks through Go's pprof tool, detailing each command‑line option, its purpose, and example usage so developers can comprehensively profile, visualise, and optimise Go applications for better performance and stability.

DebuggingGocommand-line

0 likes · 24 min read

Master Go’s pprof: Unlock Every Command for Deep Performance Insights

Ops Development & AI Practice

May 18, 2024 · Backend Development

Master Go Performance Profiling with go tool pprof: A Step‑by‑Step Guide

This guide explains how to use Go's built‑in pprof tool and the go tool pprof command to profile CPU and memory usage, generate analysis files, interpret interactive console output, create flame graphs, and apply practical optimizations for faster, more efficient Go applications.

CPU profilingGoflame graph

0 likes · 6 min read

Master Go Performance Profiling with go tool pprof: A Step‑by‑Step Guide

Ops Development & AI Practice

May 7, 2024 · Backend Development

Mastering Go’s addr2line: Map Addresses to Source Lines for Debugging & Profiling

This guide explains Go's addr2line tool, covering its core function of translating memory addresses to source file locations, typical debugging and performance‑analysis scenarios, command‑line usage with examples, and how it streamlines error tracing and profiling in Go applications.

DebuggingGoToolchain

0 likes · 5 min read

Mastering Go’s addr2line: Map Addresses to Source Lines for Debugging & Profiling

OPPO Kernel Craftsman

Apr 12, 2024 · Fundamentals

Stack Unwinding on AArch64: Frame‑Pointer and DWARF .eh_frame/.debug_frame Implementations

The article explains how AArch64 stack unwinding works using both frame‑pointer based methods and DWARF .eh_frame/.debug_frame CFI data, detailing the calling convention, compiler effects, CIE/FDE structures, and their integration into Android’s simpleperf profiler for accurate back‑traces.

AssemblyDWARFELF

0 likes · 12 min read

Stack Unwinding on AArch64: Frame‑Pointer and DWARF .eh_frame/.debug_frame Implementations

Ops Development Stories

Apr 1, 2024 · Backend Development

Master Java Flight Recorder & JProfiler: Low‑Impact Performance Profiling Guide

This article explains how to use Java Flight Recorder (JFR) and JProfiler for low‑overhead, production‑grade performance analysis, covering JFR advantages, process discovery, file generation commands, JProfiler features, memory and CPU profiling techniques, and includes practical code examples.

Backend DevelopmentJFRJProfiler

0 likes · 11 min read

Master Java Flight Recorder & JProfiler: Low‑Impact Performance Profiling Guide

php Courses

Nov 10, 2023 · Backend Development

Effective PHP Debugging: Xdebug, PHPDBG, and Blackfire

This article introduces three powerful PHP debugging and profiling tools—Xdebug, PHPDBG, and Blackfire—explaining their features, configuration steps, and providing concrete code examples to help developers efficiently locate and fix issues while improving performance.

BlackfireDebuggingPHP

0 likes · 4 min read

Effective PHP Debugging: Xdebug, PHPDBG, and Blackfire

JD Retail Technology

Sep 11, 2023 · Backend Development

Diagnosing and Resolving a Java Application Memory Leak During Load Testing

During a load test of a Java 1.6 application on a CentOS server, memory usage climbed from 20% to 100% after an hour of 300‑concurrent requests, prompting a detailed investigation that identified off‑heap leaks caused by GZIP compression in a JimDB client and led to fixes such as upgrading the JDK and avoiding unnecessary compression.

BackendGoogle PerftoolsJDK

0 likes · 7 min read

Diagnosing and Resolving a Java Application Memory Leak During Load Testing

Selected Java Interview Questions

Aug 1, 2023 · Backend Development

Using Arthas to Locate Slow Code Blocks in Java Applications

This article explains how to use the open‑source Java diagnostic tool Arthas to attach to a running JVM, view real‑time metrics, and employ the trace command to pinpoint and analyze time‑consuming methods without inserting manual timing code.

ArthasDebuggingJava

0 likes · 8 min read

Using Arthas to Locate Slow Code Blocks in Java Applications

Liangxu Linux

Jul 26, 2023 · Operations

Master Linux Perf: From Event Subsystem to Flame Graphs

This article provides a comprehensive guide to Linux Perf, covering its architecture, counting and sampling modes, event classifications, the full suite of Perf tools, and how to generate and interpret CPU and Off‑CPU flame graphs for deep performance analysis.

LinuxPerf EventsSystem Tracing

0 likes · 20 min read

Master Linux Perf: From Event Subsystem to Flame Graphs

MaGe Linux Operations

Jul 2, 2023 · Operations

Master Linux Performance: Using perf for Profiling and Optimization

Linux perf is a powerful, flexible profiling tool that lets developers and system administrators monitor hardware and software events, analyze CPU, memory, and I/O performance, generate flame graphs, and troubleshoot bottlenecks across single processes, containers, and multi‑core systems, with extensive commands and advanced techniques.

Linuxflame graphperf

0 likes · 28 min read

Master Linux Performance: Using perf for Profiling and Optimization

SQB Blog

Jun 16, 2023 · Operations

Boost Java Performance: Optimize JFR Analysis with Flame Graphs and Async‑Profiler

This article explores the evolution of continuous performance profiling, explains why traditional tracing falls short, and details a series of optimizations—including batch processing, object‑reference serialization, aggregation insertion, and multi‑chunk handling—to dramatically reduce memory usage and speed up Java Flight Recorder analysis using async‑profiler and flame graphs.

JFRJavaObservability

0 likes · 13 min read

Boost Java Performance: Optimize JFR Analysis with Flame Graphs and Async‑Profiler

TAL Education Technology

Jun 8, 2023 · Frontend Development

Diagnosing and Fixing Web Page Memory Leaks with Chrome DevTools

This article explains what web page memory leaks are, identifies common sources such as timers, event listeners, closures, and DOM references, and demonstrates step‑by‑step how to locate and resolve them using Chrome DevTools' Performance and Memory panels.

Chrome DevToolsJavaScriptfrontend debugging

0 likes · 8 min read

Diagnosing and Fixing Web Page Memory Leaks with Chrome DevTools

Open Source Tech Hub

May 11, 2023 · Backend Development

How to Install and Use XHProf for PHP Performance Profiling

Learn how to install XHProf via PECL or source, configure php.ini, verify the installation, integrate the XhprofMiddleware into your PHP application, and interpret the detailed performance metrics and visual reports, including function call statistics and memory usage, with example commands and code snippets.

InstallationPHPXHProf

0 likes · 6 min read

Alibaba Cloud Native

Mar 7, 2023 · Operations

How ARMS Continuous Profiler Enables Production‑Level Performance Analysis

This article explains the background of continuous performance profiling, demonstrates two real‑world scenarios using ARMS Continuous Profiler to locate CPU and memory hotspots, describes the tool’s design and core components, and shows how the fixes improve application responsiveness and resource usage.

ARMSJavaperformance profiling

0 likes · 9 min read

How ARMS Continuous Profiler Enables Production‑Level Performance Analysis

Python Programming Learning Circle

Feb 15, 2023 · Fundamentals

Methods for Monitoring Python Code Execution Time and Memory Usage

This article introduces four practical techniques for measuring Python code performance, including the built‑in time module, Jupyter’s %%time magic, line_profiler for per‑line timing, and memory_profiler for detailed memory consumption, complete with example code and interpretation of results.

JupyterPythonline_profiler

0 likes · 7 min read

Methods for Monitoring Python Code Execution Time and Memory Usage

Python Programming Learning Circle

Feb 8, 2023 · Fundamentals

Measuring Execution Time and Memory Usage in Python

This article introduces four practical methods for monitoring Python code performance—using the built‑in time module, the %%time IPython magic, line_profiler for per‑line timing, and memory_profiler for detailed memory usage—complete with code examples and interpretation of results.

MemoryPythonline_profiler

0 likes · 7 min read

Measuring Execution Time and Memory Usage in Python

DeWu Technology

Jan 4, 2023 · Backend Development

Diagnosing and Resolving Go Memory Leak with pprof and Prometheus

The article explains how a sudden Go service memory‑usage alert was traced with go tool pprof to a massive allocation in the quantile.newStream function, uncovered a Prometheus metric‑label explosion caused by the START_POINT label, and resolved the leak by disabling that label, while also reviewing typical Go memory‑leak patterns.

BackendGoPrometheus

0 likes · 15 min read

Diagnosing and Resolving Go Memory Leak with pprof and Prometheus

OPPO Kernel Craftsman

Sep 2, 2022 · Mobile Development

Perfetto Trace Analysis Guide: Android Performance Profiling Tool

This guide explains how to install and use Perfetto, Android’s next‑generation tracing tool, covering trace capture via command line or Traceur, UI navigation, key trace elements, and practical SQL queries for analyzing slices, counters, scheduling, thread states, and lock contention to optimize performance.

Android TracingMobile DevelopmentPerfetto

0 likes · 9 min read

Perfetto Trace Analysis Guide: Android Performance Profiling Tool

Huolala Tech

Aug 23, 2022 · Mobile Development

How to Hook objc_msgSend for Precise iOS Method Timing with Assembly

This article explains how to accurately measure the execution time of Objective‑C methods on iOS by implementing a custom objc_msgSend hook in ARM64 assembly, replacing the original implementation with fishhook, and visualizing the results using Chrome tracing, offering a low‑cost, non‑intrusive performance profiling solution.

AssemblyiOSmethod timing

0 likes · 21 min read

How to Hook objc_msgSend for Precise iOS Method Timing with Assembly

Java Architecture Diary

Aug 16, 2022 · Operations

How to Use Pyroscope for Continuous Performance Profiling in Microservices

This guide explains how to integrate the open‑source Pyroscope platform with Rainbond to continuously profile microservices, detect CPU bottlenecks and memory leaks, and visualize performance data through its UI and various view modes.

Continuous ProfilingJava AgentMicroservices

0 likes · 7 min read

How to Use Pyroscope for Continuous Performance Profiling in Microservices

OPPO Kernel Craftsman

Jul 1, 2022 · Operations

Linux Kernel Performance Profiling: A Comprehensive Guide to On-CPU and Off-CPU Analysis

This comprehensive guide explains Linux kernel performance profiling—both on‑CPU and off‑CPU—by stressing the need to target the critical 3 % of code, covering throughput, latency and power metrics, scalability laws, flame‑graph visualizations, perf and eBPF tools, lock‑contention analysis, and further reading recommendations.

Linux kernelThroughputeBPF

0 likes · 27 min read

Linux Kernel Performance Profiling: A Comprehensive Guide to On-CPU and Off-CPU Analysis

NetEase Cloud Music Tech Team

Jun 7, 2022 · Mobile Development

Enable On‑Device Systrace for Android Without a PC and Capture Full Call Traces

This article explains how Android's built‑in Trace infrastructure can be leveraged on the device itself—by modifying native tag variables and hooking write calls—to run systrace without a PC, automatically instrument methods, and export data for Perfetto analysis.

AndroidMethod InstrumentationNative Hook

0 likes · 11 min read

Enable On‑Device Systrace for Android Without a PC and Capture Full Call Traces

DataFunTalk

Apr 22, 2022 · Artificial Intelligence

Inference Optimization Techniques and GPU Parallel Acceleration for Tencent Intelligent Dialogue Models

This article presents a comprehensive overview of inference optimization methods—including model pruning, quantization, knowledge distillation, caching, instruction‑set acceleration, and operator fusion—and details a GPU‑centric parallel acceleration methodology with CUDA basics, performance‑analysis tools, theoretical limits, and practical case studies, all illustrated with real‑world examples from Tencent's intelligent dialogue products.

GPU AccelerationOperator fusioncaching

0 likes · 18 min read

Inference Optimization Techniques and GPU Parallel Acceleration for Tencent Intelligent Dialogue Models

Java Backend Technology

Mar 23, 2022 · Databases

Master MySQL Performance: 10 Proven SQL Optimization Techniques

This article walks through why SQL queries become bottlenecks as data grows and presents a step‑by‑step guide—including slow‑query analysis, EXPLAIN interpretation, profiling, optimizer tracing, index tuning, and scenario‑specific tricks—to dramatically improve MySQL performance.

Explain PlanIndex TuningSQL Optimization

0 likes · 12 min read

Master MySQL Performance: 10 Proven SQL Optimization Techniques

ITPUB

Jan 28, 2022 · R&D Management

How to Blend Open‑Source and Self‑Developed Solutions: Real‑World Lessons

This article shares practical insights on balancing open‑source and self‑developed components in core platform projects, covering advantages, lifecycle stages, performance profiling, code‑base management, and engineering practices that help teams achieve sustainable growth and innovation.

R&D managementSoftware Engineeringperformance profiling

0 likes · 37 min read

How to Blend Open‑Source and Self‑Developed Solutions: Real‑World Lessons

Alibaba Cloud Native

Jan 4, 2022 · Cloud Native

Debug and Optimize Serverless Functions with Alibaba Cloud’s Instance Exec Feature

This article explains the background, design, and usage of Alibaba Cloud Function Compute's Instance Exec capability, showing how developers can securely access live function instances via the console or SDK, and apply it to troubleshooting, performance profiling, and custom runtime optimization.

Alibaba CloudCloud NativeDebugging

0 likes · 9 min read

Debug and Optimize Serverless Functions with Alibaba Cloud’s Instance Exec Feature

DeWu Technology

Dec 14, 2021 · Operations

Online Service Alarm Handling and Performance Profiling in Go

The article outlines a systematic SOP‑driven approach for diagnosing online service alarms and performance issues in Go, detailing a toolbox that includes pprof, trace, goroutine visualizers, perf and eBPF, and recommends application‑level optimizations, system tuning, and continuous profiling to accelerate root‑cause identification and reduce incident frequency.

GoOnline ServiceSOP

0 likes · 11 min read

Online Service Alarm Handling and Performance Profiling in Go

Code Ape Tech Column

Nov 30, 2021 · Operations

Understanding Flame Graphs for Java Performance Analysis

This article introduces flame graphs, explains their visual characteristics and how they help identify performance bottlenecks in Java applications, and provides practical guidance on generating them using Perl scripts and shell commands to process jstack and perf outputs.

Shellflamegraphjstack

0 likes · 9 min read

Understanding Flame Graphs for Java Performance Analysis

macrozheng

Nov 25, 2021 · Operations

Master SkyWalking: End‑to‑End Guide for Distributed Tracing & Monitoring

This article introduces SkyWalking, a Chinese open‑source APM framework, compares it with Spring Cloud Sleuth+Zipkin, explains server and client setup, storage configuration, log collection, performance profiling, and alerting, providing step‑by‑step instructions, code snippets, and screenshots to help developers implement comprehensive distributed tracing.

APMDistributed TracingJava

0 likes · 16 min read

Master SkyWalking: End‑to‑End Guide for Distributed Tracing & Monitoring

Sohu Tech Products

Sep 22, 2021 · Frontend Development

Introducing the React Scheduling Profiler: New Performance Analysis Features for React 18

The React team has open‑sourced the Scheduling Profiler, a new tool that extends the classic Profiler by visualizing component scheduling, priority handling, transitions, and Suspense behavior, helping developers diagnose rendering delays and optimize performance in React 18 applications.

ReactReact 18Scheduling Profiler

0 likes · 4 min read

Introducing the React Scheduling Profiler: New Performance Analysis Features for React 18

Python Programming Learning Circle

Jul 14, 2021 · Backend Development

Implementing a Python Import‑Hook Probe with sys.meta_path and sitecustomize

This article explains how to build a simple Python probe that measures function execution time by leveraging sys.meta_path import hooks and sitecustomize, providing step‑by‑step code examples, command‑line wrapper, and discussion of automatic module loading for profiling purposes.

Pythonbackend-developmentimport-hook

0 likes · 10 min read

Implementing a Python Import‑Hook Probe with sys.meta_path and sitecustomize

New Oriental Technology

May 24, 2021 · Operations

Overview of SkyWalking UI: Dashboard, Topology, Tracing, Profiling, and Alerts

The article provides a comprehensive English overview of SkyWalking UI, detailing its dashboard metrics, topology visualization, trace analysis, performance profiling workflow, and alarm management, illustrating how the tool monitors microservice and cloud‑native environments with metrics such as throughput, latency, Apdex, and JVM statistics.

APMDistributed TracingObservability

0 likes · 11 min read

Overview of SkyWalking UI: Dashboard, Topology, Tracing, Profiling, and Alerts

Java Interview Crash Guide

May 7, 2021 · Operations

Unlock Performance Bottlenecks with Flame Graphs: A Practical Guide

This article explains how flame graphs visualize call‑stack frequency and duration, shows how to generate them from Java jstack dumps using Perl tools, and demonstrates how to interpret their widths and flat tops to pinpoint performance hot spots in production systems.

JavaShellflame graph

0 likes · 11 min read

Unlock Performance Bottlenecks with Flame Graphs: A Practical Guide

Top Architect

Apr 27, 2021 · Operations

Understanding Flame Graphs for Performance Analysis in Java Applications

This article explains the concept, features, and practical usage of flame graphs—including how to generate them from Java thread dumps with Perl scripts—to help developers visualize call‑stack frequencies and quickly identify performance bottlenecks in backend services.

Operationsflamegraphjstack

0 likes · 11 min read

Understanding Flame Graphs for Performance Analysis in Java Applications

DataFunTalk

Mar 25, 2021 · Artificial Intelligence

Optimizing MNN Mobile Neural Network Inference on GPU with OpenCL: Memory Objects, Work‑Group Tuning, and Auto‑Tuning

This article explains how the MNN deep‑learning framework leverages OpenCL to achieve high‑performance inference on mobile, PC and embedded GPUs by diversifying memory objects, aligning data, using local‑memory reductions, selecting optimal work‑group sizes, applying pre‑inference auto‑tuning, caching compiled programs, and providing practical GPU‑friendly model design guidelines.

GPU OptimizationMNNOpenCL

0 likes · 20 min read

Optimizing MNN Mobile Neural Network Inference on GPU with OpenCL: Memory Objects, Work‑Group Tuning, and Auto‑Tuning

Python Programming Learning Circle

Mar 23, 2021 · Backend Development

Understanding Node.js Asynchronous Execution Model and async_hooks

This article explains Node.js’s asynchronous execution model, defines key concepts such as execution frames and continuation points, demonstrates event flow with code examples, and shows how the async_hooks API and AsyncLocalStorage can be used for tracing, debugging, and performance analysis of backend JavaScript applications.

AsyncLocalStorageBackend DevelopmentNode.js

0 likes · 13 min read

Understanding Node.js Asynchronous Execution Model and async_hooks

Liangxu Linux

Dec 17, 2020 · Operations

Mastering strace: Practical Tips for Tracing System Calls and Debugging Linux Processes

This guide explains what strace is, how to use it for filtering system calls, profiling execution time, diagnosing configuration issues, and tracing network activity, providing concrete command examples and interpretation of output to help troubleshoot Linux processes efficiently.

DebuggingLinuxperformance profiling

0 likes · 16 min read

Mastering strace: Practical Tips for Tracing System Calls and Debugging Linux Processes

MaGe Linux Operations

Nov 25, 2020 · Fundamentals

How to Build a Custom Python Import Hook Probe with sys.meta_path and sitecustomize

This article explains the inner workings of Python's import system, demonstrates how to create a simple import‑hook probe using sys.meta_path and sitecustomize, and walks through building a command‑line agent that automatically measures function execution time in any module.

code instrumentationimport-hookperformance profiling

0 likes · 10 min read

How to Build a Custom Python Import Hook Probe with sys.meta_path and sitecustomize

JD Tech Talk

Jul 9, 2020 · Backend Development

Designing a Java Agent for Performance Profiling and Monitoring in Java Applications

This article explains how to build a Java Agent that leverages bytecode instrumentation, class‑loader mechanisms, and AOP techniques to provide lightweight, configurable performance profiling and monitoring for Java services, covering requirements, technical foundations, implementation steps, and testing procedures.

Java Agentaopbytecode instrumentation

0 likes · 12 min read

Designing a Java Agent for Performance Profiling and Monitoring in Java Applications

Senior Brother's Insights

Jun 3, 2020 · Backend Development

How We Traced and Fixed Excessive Native Memory Usage After Migrating to Spring Boot

After moving a project to Spring Boot, the system reported unusually high swap usage despite a 4 GB heap, prompting a detailed investigation using JVM native‑memory tracking, pmap, gperftools, strace, and GDB, which ultimately identified Spring Boot’s JAR scanning and glibc memory arenas as the root causes and led to a configuration fix and an upgrade to resolve the off‑heap memory leak.

JVMLinuxNative Memory

0 likes · 11 min read

How We Traced and Fixed Excessive Native Memory Usage After Migrating to Spring Boot

ITPUB

Apr 5, 2020 · Fundamentals

How a 20‑Second Sleep Slowed My SAP App and What It Teaches About Code Optimization

A massive SAP cloud application suffered a 20‑second pause due to an accidental wait statement, revealing how careless code can cripple performance and illustrating practical, step‑by‑step guidelines for effective, measured code optimization.

Code OptimizationCode reviewCompiler Flags

0 likes · 11 min read

How a 20‑Second Sleep Slowed My SAP App and What It Teaches About Code Optimization

Meituan Technology Team

Oct 10, 2019 · Backend Development

Implementation Principles of JVM CPU Profiler and Dynamic Attach Mechanism

The article explains how JVM CPU profilers work by using Java or native agents—via JVMTI, JMX, or the AsyncGetCallTrace hack—to sample or instrument stack traces, generate flame graphs, and employ the HotSpot Attach API for live, zero‑restart agent loading and diagnostics.

CPU ProfilerJVMJVMTI

0 likes · 30 min read

Implementation Principles of JVM CPU Profiler and Dynamic Attach Mechanism

Didi Tech

Aug 2, 2019 · Backend Development

Using Go pprof for Online Performance Profiling: Case Studies and Lessons

The article demonstrates how Go’s built‑in pprof tools can be used for live performance profiling, walking through two real‑world cases—one where a malformed JSON request caused massive object allocation and CPU spikes, and another where per‑call self‑referencing structs leaked memory—while offering practical tips on input validation, allocation reduction, and GC monitoring.

Backend DevelopmentCPU optimizationGo

0 likes · 16 min read

Using Go pprof for Online Performance Profiling: Case Studies and Lessons

Qunar Tech Salon

Mar 27, 2019 · Artificial Intelligence

Profiling TensorFlow Performance with TensorBoard and Timeline

This article explains how to use TensorBoard and the Timeline tool to monitor TensorFlow GPU utilization, identify operation bottlenecks, and visualize execution times, including code examples and steps for exporting and merging profiling data for repeated runs.

GPU monitoringTensorBoardTensorFlow

0 likes · 7 min read

Profiling TensorFlow Performance with TensorBoard and Timeline

WeChat Client Technology Team

Jan 11, 2019 · Mobile Development

Why Your Android App Stutters: Understanding and Fixing UI Lag with TraceCanary

This article explores the causes of UI stutter in Android apps, explains how frame rate and dropped frames affect perceived smoothness, and details practical profiling solutions—including TraceView, BlockCanary, ArgusAPM, and Matrix‑TraceCanary—along with implementation techniques for accurate lag detection and mitigation.

AndroidDrop FrameFrame Rate

0 likes · 17 min read

Why Your Android App Stutters: Understanding and Fixing UI Lag with TraceCanary

dbaplus Community

Jan 8, 2019 · Backend Development

How to Diagnose Slow OpenAPI Responses with TProfiler Java Agent

This article walks through diagnosing intermittent OpenAPI latency by using Alibaba's open‑source TProfiler Java agent to collect method‑level timing without modifying code, covering setup, command‑line usage, log analysis, and practical performance‑improvement recommendations.

BackendJava AgentOpenAPI

0 likes · 8 min read

How to Diagnose Slow OpenAPI Responses with TProfiler Java Agent

Java Backend Technology

Dec 18, 2018 · Backend Development

How to Diagnose Slow OpenAPI Calls with TProfiler Without Changing Code

This article walks through a real‑world performance problem where an OpenAPI service sometimes responds in milliseconds and other times takes seconds, explains why adding logging is impractical, and demonstrates how to use Alibaba's TProfiler Java agent to profile the call chain, collect method timings, and analyze the results without modifying any source code.

APMJavaOpenAPI

0 likes · 10 min read

How to Diagnose Slow OpenAPI Calls with TProfiler Without Changing Code

Programmer DD

Nov 20, 2018 · Backend Development

How to Diagnose Slow OpenAPI Responses with TProfiler: A Step‑by‑Step Guide

This article walks through diagnosing intermittent OpenAPI latency by reproducing the issue, visualizing the request flow, using Alibaba's TProfiler agent without code changes, configuring and running profiling commands, analyzing the generated logs, and summarizing actionable performance improvements.

BackendJavaOpenAPI

0 likes · 8 min read

How to Diagnose Slow OpenAPI Responses with TProfiler: A Step‑by‑Step Guide

Tencent Cloud Developer

Mar 16, 2018 · Operations

Introduction to Linux Performance Profiling Tools: Perf, gprof, and Valgrind

This article introduces three popular Linux performance profiling tools—Perf, gprof, and Valgrind—explaining their installation, basic command‑line usage, and how to visualize results with flame graphs, call graphs, or KCachegrind, while comparing their intrusiveness, startup methods, and output formats.

Linuxcallgrindflame graph

0 likes · 10 min read

Introduction to Linux Performance Profiling Tools: Perf, gprof, and Valgrind

dbaplus Community

Jul 24, 2017 · Operations

How to Use Linux Perf for Deep Oracle Performance Analysis and Flame Graphs

This article explains how to leverage the Linux perf tool to profile Oracle database workloads, covering basic commands, probe-based tracing of Oracle internal functions, generating various flame graphs, and practical Q&A for integrating perf into performance monitoring workflows.

LinuxOracleflame graph

0 likes · 11 min read

How to Use Linux Perf for Deep Oracle Performance Analysis and Flame Graphs

dbaplus Community

Jul 27, 2016 · Operations

Master Java Performance Debugging: Hprof, pidstat, and Real‑World Memory Leak Fixes

This guide walks Java developers through practical performance analysis using the Hprof agent and pidstat tool, demonstrates step‑by‑step command usage with real code examples, and presents a memory‑leak case study that explains GC overhead limits, dump analysis, and concrete remediation steps.

HprofJavaOperations

0 likes · 9 min read

Master Java Performance Debugging: Hprof, pidstat, and Real‑World Memory Leak Fixes

Efficient Ops

Jun 30, 2016 · Big Data

How Spark Enables Real‑Time Microservice Performance Profiling

This article explains how IBM Research and Cloudinsight use Apache Spark to capture, analyze, and visualize microservice communication in real time, addressing challenges of observability, bottleneck detection, and latency attribution in large‑scale cloud environments.

Operational MonitoringReal-time analyticsSpark

0 likes · 10 min read

How Spark Enables Real‑Time Microservice Performance Profiling

ITPUB

Jun 21, 2016 · Backend Development

How to Debug Rare Core Dumps and Memory Leaks in High‑Concurrency Nginx/OpenSSL Deployments

This article walks through a real‑world investigation of intermittent core dumps and memory leaks in a heavily modified Nginx + OpenSSL stack under extreme QPS, detailing debugging strategies, custom load‑testing tools, and performance‑profiling techniques that helped pinpoint and fix the root causes.

DebuggingNGINXOpenSSL

0 likes · 23 min read

How to Debug Rare Core Dumps and Memory Leaks in High‑Concurrency Nginx/OpenSSL Deployments

Aotu Lab

Feb 25, 2016 · Backend Development

How to Debug Node.js in Production: Performance, Crashes, and Memory Leaks

This guide explains practical techniques for diagnosing Node.js production issues, covering request‑latency analysis, CPU profiling with perf and FlameGraph, crash investigation via Core Dumps and mdb_v8, and memory‑leak detection using gcore and mdb_v8 diff tools.

CPU FlameGraphNode.jscore dump

0 likes · 11 min read

How to Debug Node.js in Production: Performance, Crashes, and Memory Leaks