Tagged articles
431 articles
Page 3 of 5
Java Backend Technology
Java Backend Technology
Jul 15, 2022 · Backend Development

Master Java Performance Timing with Spring and Apache StopWatch

This article explains how to use Spring's org.springframework.util.StopWatch and Apache Commons Lang3 StopWatch to accurately measure and analyze Java code execution time, covering setup, basic usage, advanced features, thread‑safety considerations, and practical code examples for both libraries.

Apache CommonsCode ProfilingJava
0 likes · 8 min read
Master Java Performance Timing with Spring and Apache StopWatch
Efficient Ops
Efficient Ops
Jul 12, 2022 · Operations

Master Linux Performance Troubleshooting in the First 60 Seconds

This guide walks you through the ten essential Linux command‑line tools that Netflix’s performance team uses to quickly assess system health, focusing on error and saturation metrics before utilization, so you can pinpoint and resolve server issues within the critical first minute.

OpsPerformance MonitoringSystem Administration
0 likes · 18 min read
Master Linux Performance Troubleshooting in the First 60 Seconds
Xiao Lou's Tech Notes
Xiao Lou's Tech Notes
Jul 6, 2022 · Backend Development

How to Let Go Programs Profile Themselves Automatically

This article explains why traditional pprof sampling often fails in production, introduces Go's built‑in profiling tools and runtime.pprof, defines practical rules for triggering automatic sampling based on resource spikes, and demonstrates using the open‑source Holmes library and Docker demo to collect self‑diagnostic profiles.

GoPerformance MonitoringProfiling
0 likes · 12 min read
How to Let Go Programs Profile Themselves Automatically
Alibaba Cloud Native
Alibaba Cloud Native
Jul 5, 2022 · Cloud Native

Unlocking eBPF: How Kernel‑Level Observability Powers Modern Cloud‑Native Apps

This article explains what eBPF is, why it was created, its core characteristics, common use cases such as network optimization, fault diagnosis, security control and performance monitoring, and provides practical step‑by‑step guidance, tooling commands, program types, and ecosystem resources for leveraging eBPF in cloud‑native environments.

Cloud NativeKubernetesObservability
0 likes · 20 min read
Unlocking eBPF: How Kernel‑Level Observability Powers Modern Cloud‑Native Apps
Selected Java Interview Questions
Selected Java Interview Questions
Jun 23, 2022 · Backend Development

Using Spring and Apache Commons StopWatch to Monitor Java Code Execution Time

This article explains how to leverage Spring's org.springframework.util.StopWatch and Apache Commons Lang3 StopWatch for precise Java performance measurement, covering setup, usage examples, source code insights, important considerations, and practical code snippets for both libraries.

Apache CommonsCode ProfilingPerformance Monitoring
0 likes · 9 min read
Using Spring and Apache Commons StopWatch to Monitor Java Code Execution Time
dbaplus Community
dbaplus Community
Jun 15, 2022 · Operations

How Havok Enables Realistic Full‑Link Load Testing for Scalable Services

This article explains the background, design, and core components of Havok—a full‑link load‑testing platform that replays production logs, supports traffic scaling, mock services, real‑time monitoring, isolation, and circuit‑breaker protection—to help enterprises evaluate capacity and improve reliability without polluting live data.

Data IsolationLoad TestingPerformance Monitoring
0 likes · 14 min read
How Havok Enables Realistic Full‑Link Load Testing for Scalable Services
Sohu Tech Products
Sohu Tech Products
Jun 8, 2022 · Mobile Development

Practical Uses of RunLoop in iOS: Thread Keep‑Alive, Stopping, Lazy Image Loading, Lag Detection, Crash Protection, and Integration in the Matrix Framework

This article explains how iOS RunLoop can be leveraged for thread keep‑alive, controlled termination, lazy image loading, various lag‑detection techniques, crash protection strategies, and demonstrates its integration within the open‑source Matrix framework, providing concrete Swift and Objective‑C code examples.

Mobile DevelopmentObjective‑CPerformance Monitoring
0 likes · 18 min read
Practical Uses of RunLoop in iOS: Thread Keep‑Alive, Stopping, Lazy Image Loading, Lag Detection, Crash Protection, and Integration in the Matrix Framework
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 8, 2022 · Fundamentals

eBPF Explained: Core Concepts, Use Cases, and Best Practices

eBPF is a kernel‑level sandbox technology that enables safe, high‑performance, programmable instrumentation for networking, security, and observability, and this article answers seven key questions covering its definition, applications, origins, usage steps, implementation details, best practices, and current ecosystem.

Kernel InstrumentationObservabilityPerformance Monitoring
0 likes · 21 min read
eBPF Explained: Core Concepts, Use Cases, and Best Practices
Programmer DD
Programmer DD
May 21, 2022 · Mobile Development

Mastering Android OOM: Thread, File, and Memory Leak Solutions

This technical guide explores Android out‑of‑memory crashes by classifying OOM into thread‑count, file‑descriptor, and heap‑memory issues, then details non‑intrusive thread and thread‑pool optimizations, file‑descriptor and I/O monitoring, image compression strategies, and both Java and native memory‑leak detection techniques.

AndroidOOMPerformance Monitoring
0 likes · 28 min read
Mastering Android OOM: Thread, File, and Memory Leak Solutions
Open Source Linux
Open Source Linux
May 10, 2022 · Operations

Mastering Linux Disk I/O: Key Metrics and Essential Monitoring Tools

This article explains Linux’s storage I/O stack, defines the five core performance metrics—utilization, saturation, IOPS, throughput, and response time—and demonstrates how to monitor them using tools such as iostat, pidstat, and iotop, with practical command examples.

Disk I/OPerformance Monitoringiostat
0 likes · 9 min read
Mastering Linux Disk I/O: Key Metrics and Essential Monitoring Tools
ITPUB
ITPUB
May 9, 2022 · Databases

How Meituan’s Database Autonomy Service Tackles Scale and Reliability Challenges

This article outlines the evolution of Meituan’s Database Autonomy Service (DAS), describing the growing scale‑vs‑operations imbalance, the strategic roadmap for self‑service and AI‑driven diagnostics, detailed architectural designs across data collection, compute/storage, and analysis layers, and the measurable outcomes and future plans for full database autonomy.

AI DiagnosisDatabase AutonomyOperations
0 likes · 19 min read
How Meituan’s Database Autonomy Service Tackles Scale and Reliability Challenges
Meituan Technology Team
Meituan Technology Team
May 5, 2022 · Databases

Database Autonomy Service (DAS): Architecture, Design, and Implementation

The Database Autonomy Service (DAS) is a platform that uses big‑data, machine‑learning, and expert knowledge to automatically collect, compress, and analyze MySQL metrics, providing self‑service fault detection, root‑cause diagnosis, and security management, thereby reducing manual effort, shortening MTTR, and supporting Meituan’s rapid database growth.

AI-driven opsDatabase AutonomyPerformance Monitoring
0 likes · 20 min read
Database Autonomy Service (DAS): Architecture, Design, and Implementation
ByteDance Terminal Technology
ByteDance Terminal Technology
Apr 22, 2022 · Mobile Development

MetricKit Overview: iOS Performance and Diagnostic Monitoring

MetricKit, introduced in iOS 13 and expanded in iOS 14, provides a low‑overhead, system‑driven way to collect app performance, power, and crash diagnostics over the last 24 hours, enabling developers to upload, analyze, and optimize their iOS applications without third‑party SDKs.

MetricKitMobile DevelopmentPerformance Monitoring
0 likes · 18 min read
MetricKit Overview: iOS Performance and Diagnostic Monitoring
HaoDF Tech Team
HaoDF Tech Team
Mar 29, 2022 · Operations

Building an Observability Platform for Mini‑Program Image Uploads Using SRE and Metrics‑Driven Development

The article describes how a three‑day, cross‑team investigation of a mini‑program image‑upload failure led to the design and implementation of an SRE‑driven, metrics‑driven observability platform that quantifies SLIs, automates tracing, and provides dashboards for real‑time and long‑term analysis, ultimately reducing MTTR.

BackendMetrics-Driven DevelopmentMini-Program
0 likes · 17 min read
Building an Observability Platform for Mini‑Program Image Uploads Using SRE and Metrics‑Driven Development
FunTester
FunTester
Mar 17, 2022 · Operations

Turning Manual Performance Monitoring into Automated Multi‑Level Alerts

The author explains how they distinguished test automation from automated testing, identified monitoring pain points, built a custom scraper‑driven alert system with three escalation levels, tackled common pitfalls, and achieved faster, more reliable performance testing alerts.

OperationsPerformance Monitoringalert system
0 likes · 6 min read
Turning Manual Performance Monitoring into Automated Multi‑Level Alerts
DeWu Technology
DeWu Technology
Feb 28, 2022 · Operations

DeWu Tech Salon – Quality Assurance Sessions Summary

The DeWu Tech Salon, co‑hosted by DeWu App Quality Platform and TesterHome, brought senior engineers from Alibaba Cloud, ByteDance, Lagou and DeWu together to share practical QA insights on end‑side monitoring, traffic replay, full‑link stress testing, and industry‑scale chaos engineering, while announcing a PPT collection, a testing‑expert recruitment drive, and a preview of the next wireless‑technology salon.

Performance Monitoringchaos engineeringsoftware reliability
0 likes · 6 min read
DeWu Tech Salon – Quality Assurance Sessions Summary
Laravel Tech Community
Laravel Tech Community
Feb 7, 2022 · Operations

Linux Basic Operations and Command Reference

This article provides a comprehensive reference of essential Linux commands and procedures, covering system shutdown and reboot, hardware and environment inspection, file and directory management, user and permission handling, networking utilities, performance monitoring tools, and common text processing techniques for administrators and developers.

NetworkingPerformance MonitoringShell scripting
0 likes · 18 min read
Linux Basic Operations and Command Reference
Open Source Linux
Open Source Linux
Jan 26, 2022 · Operations

Essential Linux Command Cheat Sheet for System Administration

This article compiles a comprehensive set of Linux command‑line shortcuts covering system shutdown, hardware inspection, user management, networking, file handling, compression, performance monitoring, and many other routine administration tasks, providing a handy reference for developers and sysadmins alike.

BashPerformance MonitoringShell
0 likes · 23 min read
Essential Linux Command Cheat Sheet for System Administration
21CTO
21CTO
Jan 24, 2022 · Operations

Master Linux Performance Monitoring: Essential Tools and Metrics

This guide explains how to monitor Linux server performance using built‑in tools such as top, vmstat, pidstat, iostat, netstat, sar and tcpdump, detailing the metrics each provides and how to interpret them for CPU, memory, disk I/O and network troubleshooting.

Performance MonitoringSystem Administrationiostat
0 likes · 15 min read
Master Linux Performance Monitoring: Essential Tools and Metrics
Liangxu Linux
Liangxu Linux
Jan 23, 2022 · Operations

Master Linux Tracing with bpftrace: Install, Run, and Explore Built‑in Tools

bpftrace is a Linux eBPF‑based tracing tool that compiles scripts to BPF bytecode, offering kernel and user‑space probes; the guide covers its purpose, installation on major distros and Docker, and demonstrates built‑in example scripts for command logging, I/O latency, disk size, CPU usage, and TCP connection tracing.

Linux tracingPerformance Monitoringbpftrace
0 likes · 7 min read
Master Linux Tracing with bpftrace: Install, Run, and Explore Built‑in Tools
ITPUB
ITPUB
Jan 21, 2022 · Operations

Essential Linux Performance Monitoring: Top, vmstat, iostat, netstat & More

This guide explains how to use core Linux tools such as top, vmstat, pidstat, iostat, sar, netstat, and tcpdump to monitor CPU, memory, disk I/O, and network performance, interpret their metrics, and troubleshoot common bottlenecks on servers.

Performance Monitoringiostatlinux
0 likes · 17 min read
Essential Linux Performance Monitoring: Top, vmstat, iostat, netstat & More
Kuaishou Tech
Kuaishou Tech
Jan 14, 2022 · Operations

Analysis of Linux System Crash Caused by Memory Leak and the Role of min_free_kbytes

This article documents a system crash triggered by a memory‑leak in a network‑chip SDK, explains how low free memory and the kswapd process lead to deadlock, and shows how adjusting the kernel parameter min_free_kbytes can prevent the freeze while highlighting the importance of resource monitoring and tuning.

Performance Monitoringkswapdlinux
0 likes · 11 min read
Analysis of Linux System Crash Caused by Memory Leak and the Role of min_free_kbytes
dbaplus Community
dbaplus Community
Jan 10, 2022 · Operations

Why Is Linux Load High? Decoding Load Average, CPU Usage & Process States

This article explains Linux process states, how the kernel calculates load average, the relationship between load and CPU utilization, and provides a systematic approach with tools and commands to identify resource bottlenecks, differentiate high‑load/high‑CPU scenarios from high‑load/low‑CPU cases, and pinpoint problematic processes and threads.

Load AveragePerformance MonitoringProcess States
0 likes · 11 min read
Why Is Linux Load High? Decoding Load Average, CPU Usage & Process States
Alibaba Terminal Technology
Alibaba Terminal Technology
Jan 10, 2022 · Mobile Development

How to Accurately Measure and Optimize Android Frame Rate with APM

This article explains how APM provides frame‑rate data for Android, discusses the challenges of inaccurate FPS, introduces metrics such as scroll FPS, frozen‑frame ratio, scrollHitchRate and frame‑cause analysis, details the rendering pipeline, code implementations, optimization techniques, and integration with AB testing for performance improvement.

APMAndroidFrame Rate
0 likes · 20 min read
How to Accurately Measure and Optimize Android Frame Rate with APM
ByteDance SE Lab
ByteDance SE Lab
Jan 7, 2022 · Mobile Development

Systematic iOS Stability Management: From Crash Classification to Advanced Attribution

This article presents a comprehensive framework for identifying, classifying, and resolving iOS stability issues—covering crash types, governance methodology, deep-dive attribution techniques, real-world case studies, and practical tools such as Zombie monitoring, Coredump, MemoryGraph, and MetricKit—to dramatically improve app reliability.

APMPerformance Monitoringcrash analysis
0 likes · 30 min read
Systematic iOS Stability Management: From Crash Classification to Advanced Attribution
ByteDance Terminal Technology
ByteDance Terminal Technology
Nov 24, 2021 · Mobile Development

Systematic iOS Stability Issue Management: Classification, Methodology, and Root‑Cause Attribution

This article presents a comprehensive guide on systematically managing iOS stability problems, covering issue classification, a governance methodology, detailed root‑cause analysis for crashes, watchdogs, OOM, CPU and disk I/O anomalies, and practical tools and case studies from ByteDance’s APM platform.

APMMobile DevelopmentPerformance Monitoring
0 likes · 27 min read
Systematic iOS Stability Issue Management: Classification, Methodology, and Root‑Cause Attribution
Efficient Ops
Efficient Ops
Nov 14, 2021 · Operations

Master Linux Performance Monitoring: Top, Free, and Vmstat Commands Explained

This comprehensive guide walks you through Linux performance monitoring tools—top, free, and vmstat—detailing their syntax, key options, real‑world output examples, and how to interpret CPU, memory, swap, and process statistics for effective system administration.

FreePerformance MonitoringSystem Administration
0 likes · 32 min read
Master Linux Performance Monitoring: Top, Free, and Vmstat Commands Explained
Liangxu Linux
Liangxu Linux
Nov 14, 2021 · Operations

How to Spot and Diagnose Linux Hardware Bottlenecks with GUI Tools

This guide explains how to identify CPU, memory, storage, USB, network, and GPU bottlenecks on a Linux PC using only open‑source graphical tools such as GNOME System Monitor, HardInfo, atop and nethogs, providing clear visual cues and step‑by‑step instructions.

GNOMEHardInfoHardware Bottleneck
0 likes · 14 min read
How to Spot and Diagnose Linux Hardware Bottlenecks with GUI Tools
Architecture Digest
Architecture Digest
Nov 12, 2021 · Operations

Performance Monitoring with JMeter, InfluxDB, Prometheus, and Grafana

This article explains how to set up end‑to‑end performance monitoring by sending JMeter metrics to InfluxDB via Backend Listener, visualizing them in Grafana, and similarly collecting system metrics with node_exporter and Prometheus, covering configuration, data storage, query examples, and practical visualization techniques.

GrafanaInfluxDBJMeter
0 likes · 16 min read
Performance Monitoring with JMeter, InfluxDB, Prometheus, and Grafana
58 Tech
58 Tech
Nov 11, 2021 · Frontend Development

Cross‑Platform Mini Program Log Collection SDK: Design, Implementation, and Practice

This article introduces a lightweight, cross‑platform SDK for mini‑programs that unifies log collection—including exception, normal, and performance data—by intercepting App, Page and platform‑specific APIs, handling multi‑environment differences, and providing flexible usage via npm or single‑file integration.

Mini ProgramPerformance MonitoringSDK
0 likes · 15 min read
Cross‑Platform Mini Program Log Collection SDK: Design, Implementation, and Practice
Efficient Ops
Efficient Ops
Nov 3, 2021 · Operations

How to Visualize JMeter Performance Data with Grafana, InfluxDB, and Prometheus

This article explains step‑by‑step how to collect JMeter test metrics via Backend Listener, store them in InfluxDB, and display real‑time performance charts—including TPS, response time, and error rates—in Grafana, while also covering node_exporter integration with Prometheus for system‑level monitoring.

GrafanaInfluxDBJMeter
0 likes · 15 min read
How to Visualize JMeter Performance Data with Grafana, InfluxDB, and Prometheus
ByteDance SE Lab
ByteDance SE Lab
Oct 29, 2021 · Mobile Development

Mastering iOS App Startup: Real‑World Monitoring and Optimization Techniques

This article walks through practical iOS startup optimization, covering monitoring strategies, non‑intrusive instrumentation, development‑stage testing, essential tools like Time Profiler and System Trace, and a comprehensive set of best‑practice tactics to shrink launch time and maintain performance.

InstrumentationPerformance MonitoringiOS
0 likes · 28 min read
Mastering iOS App Startup: Real‑World Monitoring and Optimization Techniques
ByteDance Terminal Technology
ByteDance Terminal Technology
Oct 28, 2021 · Mobile Development

MemCorruption Tool for Detecting Memory Corruption Issues in Android Apps

The MemCorruption tool, developed by ByteDance's AppHealth team, provides an online, low‑overhead solution for detecting Use‑After‑Free, Double‑Free, and Heap‑Buffer‑Overflow problems in Android applications by hooking memory allocation functions, sampling allocations, and performing invisible SIGSEGV‑based detection.

AndroidHookNative Debugging
0 likes · 12 min read
MemCorruption Tool for Detecting Memory Corruption Issues in Android Apps
Sohu Tech Products
Sohu Tech Products
Oct 27, 2021 · Operations

Linux Command Cheat Sheet: Basic Operations, System Management, and Monitoring

This article provides a comprehensive collection of essential Linux command-line instructions covering system shutdown and reboot, hardware and process inspection, file manipulation, networking utilities, performance monitoring, and common administrative tasks, all organized for quick reference by developers and system administrators.

NetworkingPerformance MonitoringShell
0 likes · 20 min read
Linux Command Cheat Sheet: Basic Operations, System Management, and Monitoring
Sohu Tech Products
Sohu Tech Products
Oct 14, 2021 · Backend Development

Design and Implementation of Full‑Chain Tracing for Node.js Applications Using Async Hooks

This article explains the necessity, design principles, and concrete implementation of a full‑chain tracing system for Node.js applications, covering architecture choices, industry solutions, core Async Hooks concepts, the zone‑context framework, invoke‑tree construction, garbage collection, and practical demo code.

Full-Chain TracingJavaScriptNode.js
0 likes · 18 min read
Design and Implementation of Full‑Chain Tracing for Node.js Applications Using Async Hooks
Programmer DD
Programmer DD
Sep 23, 2021 · Backend Development

How to Set Up Nginx‑GUI for Visual Nginx Management on Linux

This guide walks you through installing, configuring, and running the open‑source Nginx‑GUI tool on a Linux server, covering download, property file edits, Linux‑specific renaming, permission setup, startup commands, and the features currently supported versus pending.

ConfigurationGUINginx
0 likes · 4 min read
How to Set Up Nginx‑GUI for Visual Nginx Management on Linux
Sohu Tech Products
Sohu Tech Products
Sep 15, 2021 · Backend Development

Design and Implementation of Full-Chain Tracing for Node.js Applications Using Async Hooks

This article explains how to acquire full‑chain tracing information in Node.js applications by leveraging Async Hooks to build an invoke tree, manage context propagation, perform garbage collection, and attach custom metadata, enabling precise debugging of complex asynchronous call graphs.

AsyncHooksDistributed TracingFull-Chain Tracing
0 likes · 16 min read
Design and Implementation of Full-Chain Tracing for Node.js Applications Using Async Hooks
Tencent Cloud Developer
Tencent Cloud Developer
Sep 10, 2021 · Frontend Development

Front-End Performance Monitoring (RUM) and Optimization Strategies

The article introduces Tencent Cloud’s Real‑User‑Monitoring platform, explains key performance timing points and metric calculations, links RUM data to Google Web Vitals, and demonstrates a data‑driven optimization case that cut first‑screen load from 4.8 s to 3.2 s, halved resource load time, and dramatically improved CLS.

Performance MonitoringRUMfrontend
0 likes · 16 min read
Front-End Performance Monitoring (RUM) and Optimization Strategies
Youku Technology
Youku Technology
Sep 8, 2021 · Industry Insights

How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback

This article examines Youku's technical team's systematic approach to improving video playback by tackling slow start‑up, error handling, network stalls, audio‑video sync, and overall stability through metric‑driven optimization, adaptive streaming protocols, device capability databases, and forward‑looking features like low latency and high‑frame‑rate streaming.

Low latencyPerformance MonitoringVideo Streaming
0 likes · 12 min read
How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback
vivo Internet Technology
vivo Internet Technology
Sep 2, 2021 · Backend Development

Full-Link Tracing in Node.js Applications: Async Hooks and Zone-Context Design

The article details a full‑link tracing system for Node.js that leverages experimental async_hooks to monitor asynchronous resource lifecycles, builds an invoke‑tree to map parent‑child relationships, implements garbage collection, and provides a ZoneContext API for propagating custom tracing data across async call chains.

Full‑Link TracingGarbage CollectionNode.js
0 likes · 19 min read
Full-Link Tracing in Node.js Applications: Async Hooks and Zone-Context Design
Liangxu Linux
Liangxu Linux
Aug 31, 2021 · Operations

10 Essential Linux Commands to Diagnose Server Performance in the First Minute

When a Linux server shows performance problems, the first 60 seconds are critical; using ten standard command‑line tools such as uptime, dmesg, vmstat, mpstat, pidstat, iostat, free, sar, and top lets you quickly assess load, memory, I/O, and network health to pinpoint the root cause.

Performance MonitoringServer Diagnosticscommand-line
0 likes · 18 min read
10 Essential Linux Commands to Diagnose Server Performance in the First Minute
IT Architects Alliance
IT Architects Alliance
Aug 30, 2021 · Operations

Which Distributed Tracing Tool Wins? Comparing Zipkin, SkyWalking, Pinpoint

As micro‑service architectures grow, tracing every request across thousands of services becomes essential; this article examines the need for full‑link monitoring, outlines core requirements and functional modules, explains Google Dapper’s Span/Trace model, and provides a detailed performance‑focused comparison of Zipkin, SkyWalking, and Pinpoint.

APMComparisonDistributed Tracing
0 likes · 26 min read
Which Distributed Tracing Tool Wins? Comparing Zipkin, SkyWalking, Pinpoint
Open Source Linux
Open Source Linux
Aug 30, 2021 · Fundamentals

Unlocking Linux File System Secrets: Inodes, VFS, and I/O Performance

This article explains how Linux manages disks and file systems through inodes, directory entries, logical blocks, the Virtual File System layer, various I/O models, and practical performance monitoring commands, providing a comprehensive overview for developers and system engineers.

Operating SystemPerformance Monitoringlinux
0 likes · 19 min read
Unlocking Linux File System Secrets: Inodes, VFS, and I/O Performance
Code Ape Tech Column
Code Ape Tech Column
Aug 28, 2021 · Backend Development

A Curated List of Alibaba Open‑Source Developer Tools for Backend Development

This article introduces a collection of Alibaba‑released open‑source tools—including Arthas, Cloud Toolkit, ChaosBlade, PTS, Druid, and more—detailing their usage scenarios, tutorials, and acquisition methods to help backend developers improve efficiency, debugging, monitoring, and reliability of their services.

AlibabaJavaPerformance Monitoring
0 likes · 14 min read
A Curated List of Alibaba Open‑Source Developer Tools for Backend Development
Baidu Intelligent Testing
Baidu Intelligent Testing
Aug 10, 2021 · Backend Development

Evolution and Architecture of Baidu's Fengjing APM System

This article chronicles the four‑year evolution of Baidu's Fengjing performance‑monitoring platform, detailing its data collection, processing pipelines, successive architectural versions (1.0‑4.0), challenges such as probe intrusion and massive data volume, and the engineering solutions that enabled large‑scale, low‑cost, cloud‑native observability for thousands of Java services.

APMBig DataCloud Native
0 likes · 9 min read
Evolution and Architecture of Baidu's Fengjing APM System
WeChat Client Technology Team
WeChat Client Technology Team
Jul 19, 2021 · Mobile Development

How to Build a Complete ANR Monitoring Solution on Android

This article explains the Android ANR workflow, analyzes the system's appNotResponding logic, and presents a robust monitoring strategy that captures SIGQUIT signals, validates true ANR events, and hooks trace writes to reliably detect and diagnose ANRs in mobile apps.

ANRAndroidPerformance Monitoring
0 likes · 22 min read
How to Build a Complete ANR Monitoring Solution on Android
WeChat Client Technology Team
WeChat Client Technology Team
Jul 13, 2021 · Mobile Development

How to Accurately Detect UI Lag and ANR on Android: Advanced Monitoring Techniques

This article explains the relationship between UI stutter and ANR, critiques common monitoring tools, and presents three robust Android lag‑detection methods—WatchDog polling, Looper Printer replacement, and specialized handlers for IdleHandler, TouchEvent, and SyncBarrier leaks—complete with probability analysis and sample code.

ANRAndroidIdleHandler
0 likes · 17 min read
How to Accurately Detect UI Lag and ANR on Android: Advanced Monitoring Techniques
High Availability Architecture
High Availability Architecture
Jul 8, 2021 · Frontend Development

Interview on Kuaishou Frontend Platform: Node.js Infrastructure, Real‑time Performance Monitoring, and Technical Challenges

The interview with Kuaishou senior engineers Zhang and Fang discusses their ten‑year web development experience, recent Node.js infrastructure and front‑end performance projects, the technical highlights, implementation challenges, architectural considerations, and the topics they will present at the upcoming GIAC conference.

Frontend InfrastructureNode.jsPerformance Monitoring
0 likes · 9 min read
Interview on Kuaishou Frontend Platform: Node.js Infrastructure, Real‑time Performance Monitoring, and Technical Challenges
HelloTech
HelloTech
Jun 11, 2021 · Mobile Development

Lumos Hybrid Stack: A Cross‑Platform Solution to Boost Mobile Development Efficiency

Lumos Hybrid Stack leverages Dart and Flutter to unify iOS and Android business logic, cutting duplicated development effort, accelerating debugging, and saving person‑days while providing a consistent, extensible architecture, performance monitoring, and a communication protocol that paves the way for future centralized state‑management across platforms.

DARTFlutterPerformance Monitoring
0 likes · 10 min read
Lumos Hybrid Stack: A Cross‑Platform Solution to Boost Mobile Development Efficiency
Tencent Cloud Developer
Tencent Cloud Developer
Jun 1, 2021 · Frontend Development

Front‑End Performance Monitoring: Metrics, Types, and Data Collection

Front‑end performance monitoring lets developers detect issues early, compare synthetic and real‑user approaches, track key metrics like LCP, FID, and CLS, and gather data via the web‑vitals library, Performance API, and error‑tracking hooks to systematically improve load speed, interactivity, and visual stability.

LighthousePerformance APIPerformance Monitoring
0 likes · 16 min read
Front‑End Performance Monitoring: Metrics, Types, and Data Collection
Liangxu Linux
Liangxu Linux
May 27, 2021 · Operations

Master Linux Performance Monitoring: Top, Free, and vmstat Commands Explained

This article provides a detailed, step‑by‑step guide to Linux's built‑in performance monitoring utilities—top, free and vmstat—covering their purpose, essential options, output interpretation, interactive shortcuts, and practical examples for effective system administration.

FreeOperationsPerformance Monitoring
0 likes · 33 min read
Master Linux Performance Monitoring: Top, Free, and vmstat Commands Explained
Baidu Geek Talk
Baidu Geek Talk
Apr 26, 2021 · Cloud Native

Evolution and Architecture of Baidu's Fengjing APM System

From its 2016 debut to the present, Baidu’s Fengjing APM system has evolved through four major releases—moving from invasive jar‑based probes to non‑invasive bytecode agents, adding modular hot‑swap plugins, scaling to thousands of containers, handling billions of daily metrics via Kafka, Doris, and SIA TSDB, while solving probe upgrade downtime, data‑ingestion volume, and call‑graph query latency.

APMBackend ArchitectureCloud Native
0 likes · 9 min read
Evolution and Architecture of Baidu's Fengjing APM System
Code Ape Tech Column
Code Ape Tech Column
Apr 12, 2021 · Operations

Which APM Tool Wins? Deep Dive into Zipkin, Pinpoint, and SkyWalking

With micro‑service architectures generating complex call chains across thousands of servers, this article analyzes full‑link monitoring concepts, outlines essential requirements, details core components like spans and traces, and compares three major APM solutions—Zipkin, Pinpoint, and SkyWalking—evaluating probe impact, scalability, and data analysis capabilities.

APMDapperDistributed Tracing
0 likes · 26 min read
Which APM Tool Wins? Deep Dive into Zipkin, Pinpoint, and SkyWalking
Youzan Coder
Youzan Coder
Apr 7, 2021 · Mobile Development

Design and Implementation of a Mobile App Performance Monitoring System

The article describes a two‑part mobile app performance monitoring system that automatically instruments code to capture method execution times, ANR and frame stalls, then processes, cleans, aggregates, and visualizes the data on a backend platform to generate alerts, trend reports, and guide optimization across versions.

APMPerformance Monitoringdata-processing
0 likes · 11 min read
Design and Implementation of a Mobile App Performance Monitoring System
ELab Team
ELab Team
Mar 25, 2021 · Frontend Development

Mastering Front-End Event Tracking: From Data Collection to Performance Metrics

This article explains front‑end event tracking fundamentals, compares passive, code‑based, and SDK solutions, outlines common event attributes and types, details performance data collection via the Performance API, describes error capture methods, and presents reliable data reporting techniques using Beacon and image fallbacks.

Data ReportingError HandlingPerformance Monitoring
0 likes · 20 min read
Mastering Front-End Event Tracking: From Data Collection to Performance Metrics
Node Underground
Node Underground
Mar 12, 2021 · Cloud Native

How Alinode Boosts Node.js Observability and Scheduling in the Cloud‑Native Era

Alinode expands its Node.js performance diagnostics into a full‑stack observability and scheduling platform for serverless workloads, offering traffic monitoring, white‑screen logs, remote debugging, crash analysis, standardized metrics, and a cloud‑native runtime that balances cost and performance.

Cloud NativeNode.jsObservability
0 likes · 11 min read
How Alinode Boosts Node.js Observability and Scheduling in the Cloud‑Native Era
Architect
Architect
Mar 9, 2021 · Operations

Full‑Stack Distributed Tracing and Monitoring: Comparing Zipkin, Pinpoint, and SkyWalking

The article explains the need for full‑link monitoring in micro‑service architectures, describes the core concepts of tracing such as spans and traces, outlines functional modules of APM systems, and provides a detailed comparison of three popular solutions—Zipkin, Pinpoint, and SkyWalking—covering performance impact, scalability, data analysis, developer transparency, and topology visualization.

APMDistributed TracingPerformance Monitoring
0 likes · 27 min read
Full‑Stack Distributed Tracing and Monitoring: Comparing Zipkin, Pinpoint, and SkyWalking
Xianyu Technology
Xianyu Technology
Mar 9, 2021 · Mobile Development

Xianyu Android Log Governance and Performance Monitoring Solution

The Xianyu Android Log Governance solution unifies local console logs, TLog and online logs, adds comprehensive logcat capture and AUS‑OSS retrieval, replaces BlockCanary with low‑overhead frame‑callback lag/ANR detection, and provides dashboards and a batch‑query platform, cutting technical‑opinion ratio from 10.5 % to 4.7 % and boosting upload success.

ANR detectionAndroidLog Management
0 likes · 13 min read
Xianyu Android Log Governance and Performance Monitoring Solution
Architecture Digest
Architecture Digest
Mar 7, 2021 · Operations

Full‑Link Monitoring: Concepts, Requirements, Architecture and Comparative Evaluation of APM Solutions

The article explains the need for full‑link monitoring in microservice architectures, outlines its functional modules and design goals, details the core data structures of Google Dapper (Span, Trace, Annotation) with code examples, and compares three popular APM tools—Zipkin, Pinpoint and SkyWalking—across performance, scalability, analysis depth, transparency and topology features.

APMDistributed TracingMicroservices
0 likes · 25 min read
Full‑Link Monitoring: Concepts, Requirements, Architecture and Comparative Evaluation of APM Solutions
Alibaba Terminal Technology
Alibaba Terminal Technology
Mar 5, 2021 · Frontend Development

Node.js & Frontend Innovations: Monitoring, Full-Stack Hooks, Fast CDN, Visual JS

This article introduces five cutting‑edge solutions for modern web development: an Addon‑based Node.js performance monitoring and fault‑location tool, the full‑stack Midway Hooks framework for zero‑API backend development, the ultra‑fast AirpackCDN component delivery network, the visual‑oriented iMove JavaScript framework, and the versatile SSR framework 5.0 supporting both React and Vue.

CDNFull-Stack FrameworkNode.js
0 likes · 5 min read
Node.js & Frontend Innovations: Monitoring, Full-Stack Hooks, Fast CDN, Visual JS
Sohu Tech Products
Sohu Tech Products
Mar 3, 2021 · Operations

Comprehensive Comparison of Full‑Link Tracing Solutions: Zipkin, SkyWalking, and Pinpoint

This article examines the challenges of monitoring distributed micro‑service architectures, outlines the requirements for a low‑overhead, non‑intrusive tracing system, describes the core concepts of Google Dapper (Span, Trace, Annotation), and provides a detailed functional and performance comparison of three popular APM tools—Zipkin, SkyWalking, and Pinpoint—covering probe impact, collector scalability, data analysis depth, developer transparency, topology visualization, and overall suitability.

APMDistributed TracingMicroservices
0 likes · 23 min read
Comprehensive Comparison of Full‑Link Tracing Solutions: Zipkin, SkyWalking, and Pinpoint
dbaplus Community
dbaplus Community
Feb 25, 2021 · Operations

How Distributed Tracing Solves Microservice Performance Bottlenecks with SkyWalking

This article explains the principles of distributed tracing, the OpenTracing standard, SkyWalking's architecture and sampling strategies, and shares a company's practical customizations—including forced sampling, fine‑grained group sampling, log4j traceId injection, and self‑developed plugins—to help pinpoint performance issues in microservice environments.

Distributed TracingJavaObservability
0 likes · 17 min read
How Distributed Tracing Solves Microservice Performance Bottlenecks with SkyWalking
Youzan Coder
Youzan Coder
Feb 24, 2021 · Mobile Development

Building Offline Mobile Performance Monitoring with AWACS and APM

This article explains how Youzan extended its APM framework with offline monitoring, built the AWACS visual tool, integrated Appium‑driven regression, instrumented method timing and network traffic via Gradle plugins, captured page rendering time, processed data in the backend, and created an issue‑management platform, outlining future enhancements.

APMAndroidMobile Development
0 likes · 13 min read
Building Offline Mobile Performance Monitoring with AWACS and APM
Didi Tech
Didi Tech
Feb 4, 2021 · Backend Development

Inside DiDi’s NodeX: Building a Scalable Node.js Ecosystem for Enterprise

DiDi’s NodeX initiative unites multiple front‑end teams to create a group‑level, professional, efficient, and stable Node.js development ecosystem that provides reusable components, a unified framework, common services, performance‑analysis tools, and documentation platforms, dramatically lowering service‑building barriers and boosting business productivity.

Backend DevelopmentComponent ArchitectureDevOps
0 likes · 12 min read
Inside DiDi’s NodeX: Building a Scalable Node.js Ecosystem for Enterprise
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Jan 15, 2021 · Mobile Development

Android Frame Drop Detection and Monitoring Principles

The article explains Android’s frame‑drop detection and monitoring principles, covering refresh rate, Vsync, Choreographer and FrameInfo mechanisms, how lag is perceived, methods to calculate and retrieve dropped frames, and best practices such as avoiding heavy main‑thread work to ensure smooth UI performance.

AndroidPerformance MonitoringUI optimization
0 likes · 6 min read
Android Frame Drop Detection and Monitoring Principles
21CTO
21CTO
Jan 8, 2021 · Databases

How ICBC Scales MySQL to 10,000 Nodes: Governance, Challenges, and Solutions

This article summarizes ICBC's experience of deploying nearly ten thousand MySQL nodes, detailing the current status, challenges, governance strategies—including standards, containerization, automated monitoring, emergency handling, and post‑incident analysis—and outlines future improvement plans for core‑application databases.

AutomationDatabase GovernanceICBC
0 likes · 17 min read
How ICBC Scales MySQL to 10,000 Nodes: Governance, Challenges, and Solutions
ITPUB
ITPUB
Jan 8, 2021 · Databases

How ICBC Scales MySQL to 10,000 Nodes: Governance, Automation, and Performance Strategies

This article outlines Industrial and Commercial Bank of China's journey from early MySQL adoption to managing nearly ten thousand production nodes, detailing the challenges of high‑capacity, high‑concurrency workloads and the comprehensive governance framework—including standards, pre‑emptive checks, automated incident response, and future self‑healing plans—used to ensure reliable core‑application data storage.

AutomationDatabase GovernanceLarge‑Scale Deployment
0 likes · 17 min read
How ICBC Scales MySQL to 10,000 Nodes: Governance, Automation, and Performance Strategies
Top Architect
Top Architect
Dec 29, 2020 · Operations

Comprehensive Guide to Java Runtime Error Checking: CPU, Disk, Memory, GC, and Network Troubleshooting

This article provides a step‑by‑step guide for diagnosing Java production issues by systematically checking CPU usage, disk health, memory consumption, garbage‑collection behavior, and network problems using common Linux tools and JVM utilities such as ps, top, jstack, jstat, vmstat, iostat, free, jmap, and tcpdump.

CPUGarbage CollectionJava
0 likes · 21 min read
Comprehensive Guide to Java Runtime Error Checking: CPU, Disk, Memory, GC, and Network Troubleshooting
Aikesheng Open Source Community
Aikesheng Open Source Community
Dec 28, 2020 · Operations

Building a Custom MySQL Observation Tool with bcc and eBPF

This tutorial explains how to create a Python‑based eBPF tool using the bcc framework to trace MySQL Group Replication's apply_data_packet function, covering environment setup, BPF program writing, attaching probes, and displaying real‑time thread and timestamp information.

BCCObservabilityPerformance Monitoring
0 likes · 8 min read
Building a Custom MySQL Observation Tool with bcc and eBPF
JD Retail Technology
JD Retail Technology
Dec 9, 2020 · Mobile Development

Building and Optimizing a First‑Screen Performance Monitoring System for Android Apps (JD Mall Case Study)

This article describes how JD Mall’s Android team designed a custom first‑screen latency monitoring system, collected lifecycle, network and rendering metrics via AOP and configuration, and applied a series of optimizations—including plugin preloading, network parallelism, layout inflation tricks, and hardware acceleration—to achieve over 30% faster app start‑up and significantly reduced user churn.

AndroidMobile DevelopmentPerformance Monitoring
0 likes · 17 min read
Building and Optimizing a First‑Screen Performance Monitoring System for Android Apps (JD Mall Case Study)
Code Ape Tech Column
Code Ape Tech Column
Dec 7, 2020 · Operations

How to Determine If Redis Is Actually Slowing Down Using Baseline Performance

The article explains how to verify whether Redis performance degradation is real by measuring response latency, establishing a baseline using the --intrinsic-latency option, and comparing current latency against the baseline, with a rule of roughly twice the baseline indicating a slowdown.

Baseline LatencyIntrinsic LatencyPerformance Monitoring
0 likes · 6 min read
How to Determine If Redis Is Actually Slowing Down Using Baseline Performance
JavaEdge
JavaEdge
Nov 25, 2020 · Operations

How Distributed Tracing Locates Failures and Optimizes Microservice Performance

This article explains the importance of service tracing in micro‑service architectures, describes core concepts such as traceId and spanId, outlines a three‑layer tracing system (collection, processing, visualization), and shows how real‑time and offline processing enable rapid fault isolation and system‑wide performance optimization.

Distributed TracingMicroservicesPerformance Monitoring
0 likes · 10 min read
How Distributed Tracing Locates Failures and Optimizes Microservice Performance
Liangxu Linux
Liangxu Linux
Nov 18, 2020 · Operations

Diagnosing Linux Server Performance in the First 60 Seconds

This guide walks you through ten essential Linux command‑line tools—uptime, dmesg, vmstat, mpstat, pidstat, iostat, free, sar, and top—explaining what each metric means, how to interpret the output, and how to quickly spot utilization, saturation, and error issues within the first minute of investigation.

Performance MonitoringSystem Administrationcommand-line
0 likes · 20 min read
Diagnosing Linux Server Performance in the First 60 Seconds
Java Backend Technology
Java Backend Technology
Nov 8, 2020 · Operations

How Distributed Tracing with SkyWalking Solves Microservice Performance Challenges

This article explains the principles, architecture, and practical adoption of distributed tracing—covering OpenTracing standards, SkyWalking's design, sampling strategies, plugin development, and real‑world company practices—to help engineers pinpoint bottlenecks and improve observability in microservice systems.

Distributed TracingMicroservicesObservability
0 likes · 17 min read
How Distributed Tracing with SkyWalking Solves Microservice Performance Challenges
FunTester
FunTester
Oct 30, 2020 · Operations

Mastering Fiddler: Capture and Analyze HTTP/HTTPS Traffic Like a Pro

This guide explains how to use Fiddler to intercept and inspect HTTP and HTTPS requests, walks through its interface components, shows configuration steps for browsers, Android and iPhone, demonstrates filtering by domain, and outlines key metrics for request performance analysis.

FiddlerHTTP debuggingHTTPS capture
0 likes · 7 min read
Mastering Fiddler: Capture and Analyze HTTP/HTTPS Traffic Like a Pro
Programmer DD
Programmer DD
Oct 16, 2020 · Operations

Mastering JVisualVM: Detect Memory Leaks and Monitor Java Apps

This guide introduces JVisualVM, explains its installation and plugin system, demonstrates how to create a memory‑leak example, use VisualGC and sampling tools to identify leaking objects, and shows step‑by‑step remote monitoring of a Tomcat server via JMX, providing practical screenshots and code snippets.

JVisualVMJava profilingPerformance Monitoring
0 likes · 8 min read
Mastering JVisualVM: Detect Memory Leaks and Monitor Java Apps
MaGe Linux Operations
MaGe Linux Operations
Sep 18, 2020 · Operations

Essential Linux Operations Metrics for Effective Monitoring

This guide enumerates the key Linux system metrics—covering CPU, memory, disk, I/O, network, kernel parameters, RAID, SMART, NTP, and process information—that open-falcon agents collect every minute to enable comprehensive operations monitoring and timely issue detection.

MetricsOpen-FalconOperations
0 likes · 12 min read
Essential Linux Operations Metrics for Effective Monitoring
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Sep 17, 2020 · Operations

Understanding Distributed Tracing and SkyWalking: Principles, Architecture, and Practical Implementation

This article explains the fundamentals of distributed tracing, the OpenTracing standard, and how SkyWalking implements automatic span collection, cross‑process context propagation, unique traceId generation, sampling strategies, performance benchmarks, and real‑world adaptations within a micro‑service environment.

Distributed TracingJavaMicroservices
0 likes · 16 min read
Understanding Distributed Tracing and SkyWalking: Principles, Architecture, and Practical Implementation
360 Tech Engineering
360 Tech Engineering
Aug 10, 2020 · Databases

TiDB Cluster Write‑Write Conflict Investigation and Resolution

This article analyzes a TiDB cluster performance incident where QPS dropped and duration spiked due to write‑write conflicts, detailing the monitoring data, root‑cause investigation of server‑busy and scheduler latch issues, and the attempted mitigation steps such as enabling txn‑local‑latches and adjusting insert statements.

Cluster TroubleshootingDatabase operationsPerformance Monitoring
0 likes · 10 min read
TiDB Cluster Write‑Write Conflict Investigation and Resolution
JD Retail Technology
JD Retail Technology
Jul 22, 2020 · Mobile Development

Flutter Frame‑Rate Monitoring Tool and Rendering Pipeline Analysis in the Sunglasses Platform

This article explains the Flutter rendering pipeline—including layout, paint, and compositing stages—details the widget‑element‑render object hierarchy, describes the task runners involved in each frame, and analyzes the Sunglasses‑integrated frame‑rate monitoring tool that classifies frame timings for performance debugging.

Mobile DevelopmentPerformance MonitoringRendering
0 likes · 9 min read
Flutter Frame‑Rate Monitoring Tool and Rendering Pipeline Analysis in the Sunglasses Platform
Top Architect
Top Architect
Jul 11, 2020 · Operations

7 Key Performance Metrics for Application Monitoring and Their Recommended Tools

The article outlines seven essential performance metrics—response time and throughput, average load, error rate, GC pause time, business indicators, uptime, and log size—explaining their significance for application health and recommending popular monitoring tools for each metric.

Garbage CollectionLog ManagementPerformance Monitoring
0 likes · 6 min read
7 Key Performance Metrics for Application Monitoring and Their Recommended Tools