Tagged articles

431 articles

Page 3 of 5

Jul 15, 2022 · Backend Development

Master Java Performance Timing with Spring and Apache StopWatch

This article explains how to use Spring's org.springframework.util.StopWatch and Apache Commons Lang3 StopWatch to accurately measure and analyze Java code execution time, covering setup, basic usage, advanced features, thread‑safety considerations, and practical code examples for both libraries.

Apache CommonsCode ProfilingJava

0 likes · 8 min read

Master Java Performance Timing with Spring and Apache StopWatch

Efficient Ops

Jul 12, 2022 · Operations

Master Linux Performance Troubleshooting in the First 60 Seconds

This guide walks you through the ten essential Linux command‑line tools that Netflix’s performance team uses to quickly assess system health, focusing on error and saturation metrics before utilization, so you can pinpoint and resolve server issues within the critical first minute.

OpsPerformance MonitoringSystem Administration

0 likes · 18 min read

Master Linux Performance Troubleshooting in the First 60 Seconds

Xiao Lou's Tech Notes

Jul 6, 2022 · Backend Development

How to Let Go Programs Profile Themselves Automatically

This article explains why traditional pprof sampling often fails in production, introduces Go's built‑in profiling tools and runtime.pprof, defines practical rules for triggering automatic sampling based on resource spikes, and demonstrates using the open‑source Holmes library and Docker demo to collect self‑diagnostic profiles.

GoPerformance MonitoringProfiling

0 likes · 12 min read

Alibaba Cloud Native

Jul 5, 2022 · Cloud Native

Unlocking eBPF: How Kernel‑Level Observability Powers Modern Cloud‑Native Apps

This article explains what eBPF is, why it was created, its core characteristics, common use cases such as network optimization, fault diagnosis, security control and performance monitoring, and provides practical step‑by‑step guidance, tooling commands, program types, and ecosystem resources for leveraging eBPF in cloud‑native environments.

Cloud NativeKubernetesObservability

0 likes · 20 min read

Unlocking eBPF: How Kernel‑Level Observability Powers Modern Cloud‑Native Apps

Selected Java Interview Questions

Jun 23, 2022 · Backend Development

Using Spring and Apache Commons StopWatch to Monitor Java Code Execution Time

This article explains how to leverage Spring's org.springframework.util.StopWatch and Apache Commons Lang3 StopWatch for precise Java performance measurement, covering setup, usage examples, source code insights, important considerations, and practical code snippets for both libraries.

Apache CommonsCode ProfilingPerformance Monitoring

0 likes · 9 min read

Using Spring and Apache Commons StopWatch to Monitor Java Code Execution Time

dbaplus Community

Jun 15, 2022 · Operations

How Havok Enables Realistic Full‑Link Load Testing for Scalable Services

This article explains the background, design, and core components of Havok—a full‑link load‑testing platform that replays production logs, supports traffic scaling, mock services, real‑time monitoring, isolation, and circuit‑breaker protection—to help enterprises evaluate capacity and improve reliability without polluting live data.

Data IsolationLoad TestingPerformance Monitoring

0 likes · 14 min read

How Havok Enables Realistic Full‑Link Load Testing for Scalable Services

Sohu Tech Products

Jun 8, 2022 · Mobile Development

Practical Uses of RunLoop in iOS: Thread Keep‑Alive, Stopping, Lazy Image Loading, Lag Detection, Crash Protection, and Integration in the Matrix Framework

This article explains how iOS RunLoop can be leveraged for thread keep‑alive, controlled termination, lazy image loading, various lag‑detection techniques, crash protection strategies, and demonstrates its integration within the open‑source Matrix framework, providing concrete Swift and Objective‑C code examples.

Mobile DevelopmentObjective‑CPerformance Monitoring

0 likes · 18 min read

Practical Uses of RunLoop in iOS: Thread Keep‑Alive, Stopping, Lazy Image Loading, Lag Detection, Crash Protection, and Integration in the Matrix Framework

Alibaba Cloud Developer

Jun 8, 2022 · Fundamentals

eBPF Explained: Core Concepts, Use Cases, and Best Practices

eBPF is a kernel‑level sandbox technology that enables safe, high‑performance, programmable instrumentation for networking, security, and observability, and this article answers seven key questions covering its definition, applications, origins, usage steps, implementation details, best practices, and current ecosystem.

Kernel InstrumentationObservabilityPerformance Monitoring

0 likes · 21 min read

eBPF Explained: Core Concepts, Use Cases, and Best Practices

Programmer DD

May 21, 2022 · Mobile Development

Mastering Android OOM: Thread, File, and Memory Leak Solutions

This technical guide explores Android out‑of‑memory crashes by classifying OOM into thread‑count, file‑descriptor, and heap‑memory issues, then details non‑intrusive thread and thread‑pool optimizations, file‑descriptor and I/O monitoring, image compression strategies, and both Java and native memory‑leak detection techniques.

AndroidOOMPerformance Monitoring

0 likes · 28 min read

Mastering Android OOM: Thread, File, and Memory Leak Solutions

Open Source Linux

May 10, 2022 · Operations

Mastering Linux Disk I/O: Key Metrics and Essential Monitoring Tools

This article explains Linux’s storage I/O stack, defines the five core performance metrics—utilization, saturation, IOPS, throughput, and response time—and demonstrates how to monitor them using tools such as iostat, pidstat, and iotop, with practical command examples.

Disk I/OPerformance Monitoringiostat

0 likes · 9 min read

Mastering Linux Disk I/O: Key Metrics and Essential Monitoring Tools

ITPUB

May 9, 2022 · Databases

How Meituan’s Database Autonomy Service Tackles Scale and Reliability Challenges

This article outlines the evolution of Meituan’s Database Autonomy Service (DAS), describing the growing scale‑vs‑operations imbalance, the strategic roadmap for self‑service and AI‑driven diagnostics, detailed architectural designs across data collection, compute/storage, and analysis layers, and the measurable outcomes and future plans for full database autonomy.

AI DiagnosisDatabase AutonomyOperations

0 likes · 19 min read

How Meituan’s Database Autonomy Service Tackles Scale and Reliability Challenges

Meituan Technology Team

May 5, 2022 · Databases

Database Autonomy Service (DAS): Architecture, Design, and Implementation

The Database Autonomy Service (DAS) is a platform that uses big‑data, machine‑learning, and expert knowledge to automatically collect, compress, and analyze MySQL metrics, providing self‑service fault detection, root‑cause diagnosis, and security management, thereby reducing manual effort, shortening MTTR, and supporting Meituan’s rapid database growth.

AI-driven opsDatabase AutonomyPerformance Monitoring

0 likes · 20 min read

Database Autonomy Service (DAS): Architecture, Design, and Implementation

Open Source Linux

May 5, 2022 · Operations

How to Visualize JMeter Performance Data with Grafana, InfluxDB, and Prometheus

This tutorial walks through the end‑to‑end setup of JMeter performance testing data collection using Backend Listener, sending metrics to InfluxDB, and visualizing real‑time TPS, response time, and error rates in Grafana, as well as monitoring OS metrics with node_exporter, Prometheus, and Grafana.

GrafanaInfluxDBJMeter

0 likes · 15 min read

How to Visualize JMeter Performance Data with Grafana, InfluxDB, and Prometheus

Efficient Ops

Apr 24, 2022 · Operations

Turn JMeter Test Results into Real‑Time Grafana Dashboards with InfluxDB & Prometheus

This article walks through the most common performance‑monitoring stack—JMeter, node_exporter, Prometheus, InfluxDB, and Grafana—explaining how to configure backend listeners, send metrics, store them, and build real‑time dashboards while highlighting code snippets and query examples.

DevOpsGrafanaInfluxDB

0 likes · 16 min read

Turn JMeter Test Results into Real‑Time Grafana Dashboards with InfluxDB & Prometheus

ByteDance Terminal Technology

Apr 22, 2022 · Mobile Development

MetricKit Overview: iOS Performance and Diagnostic Monitoring

MetricKit, introduced in iOS 13 and expanded in iOS 14, provides a low‑overhead, system‑driven way to collect app performance, power, and crash diagnostics over the last 24 hours, enabling developers to upload, analyze, and optimize their iOS applications without third‑party SDKs.

MetricKitMobile DevelopmentPerformance Monitoring

0 likes · 18 min read

MetricKit Overview: iOS Performance and Diagnostic Monitoring

Big Data Technology & Architecture

Apr 14, 2022 · Big Data

Practical Guide to Monitoring Flink Performance, Detecting Backpressure, and Configuring Alerts

This article explains how to use Flink's Web UI, Kafka metrics, and YARN monitoring to observe performance, diagnose backpressure, and set alert thresholds, providing code examples and practical tips for reliable stream processing in production environments.

Big DataFlinkKafka

0 likes · 9 min read

Practical Guide to Monitoring Flink Performance, Detecting Backpressure, and Configuring Alerts

HaoDF Tech Team

Mar 29, 2022 · Operations

Building an Observability Platform for Mini‑Program Image Uploads Using SRE and Metrics‑Driven Development

The article describes how a three‑day, cross‑team investigation of a mini‑program image‑upload failure led to the design and implementation of an SRE‑driven, metrics‑driven observability platform that quantifies SLIs, automates tracing, and provides dashboards for real‑time and long‑term analysis, ultimately reducing MTTR.

BackendMetrics-Driven DevelopmentMini-Program

0 likes · 17 min read

Building an Observability Platform for Mini‑Program Image Uploads Using SRE and Metrics‑Driven Development

FunTester

Mar 17, 2022 · Operations

Turning Manual Performance Monitoring into Automated Multi‑Level Alerts

The author explains how they distinguished test automation from automated testing, identified monitoring pain points, built a custom scraper‑driven alert system with three escalation levels, tackled common pitfalls, and achieved faster, more reliable performance testing alerts.

OperationsPerformance Monitoringalert system

0 likes · 6 min read

Turning Manual Performance Monitoring into Automated Multi‑Level Alerts

DeWu Technology

Feb 28, 2022 · Operations

DeWu Tech Salon – Quality Assurance Sessions Summary

The DeWu Tech Salon, co‑hosted by DeWu App Quality Platform and TesterHome, brought senior engineers from Alibaba Cloud, ByteDance, Lagou and DeWu together to share practical QA insights on end‑side monitoring, traffic replay, full‑link stress testing, and industry‑scale chaos engineering, while announcing a PPT collection, a testing‑expert recruitment drive, and a preview of the next wireless‑technology salon.

Performance Monitoringchaos engineeringsoftware reliability

0 likes · 6 min read

DeWu Tech Salon – Quality Assurance Sessions Summary

Laravel Tech Community

Feb 7, 2022 · Operations

Linux Basic Operations and Command Reference

This article provides a comprehensive reference of essential Linux commands and procedures, covering system shutdown and reboot, hardware and environment inspection, file and directory management, user and permission handling, networking utilities, performance monitoring tools, and common text processing techniques for administrators and developers.

NetworkingPerformance MonitoringShell scripting

0 likes · 18 min read

Linux Basic Operations and Command Reference

Open Source Linux

Jan 26, 2022 · Operations

Essential Linux Command Cheat Sheet for System Administration

This article compiles a comprehensive set of Linux command‑line shortcuts covering system shutdown, hardware inspection, user management, networking, file handling, compression, performance monitoring, and many other routine administration tasks, providing a handy reference for developers and sysadmins alike.

BashPerformance MonitoringShell

0 likes · 23 min read

Essential Linux Command Cheat Sheet for System Administration

21CTO

Jan 24, 2022 · Operations

Master Linux Performance Monitoring: Essential Tools and Metrics

This guide explains how to monitor Linux server performance using built‑in tools such as top, vmstat, pidstat, iostat, netstat, sar and tcpdump, detailing the metrics each provides and how to interpret them for CPU, memory, disk I/O and network troubleshooting.

Performance MonitoringSystem Administrationiostat

0 likes · 15 min read

Master Linux Performance Monitoring: Essential Tools and Metrics

Liangxu Linux

Jan 23, 2022 · Operations

Master Linux Tracing with bpftrace: Install, Run, and Explore Built‑in Tools

bpftrace is a Linux eBPF‑based tracing tool that compiles scripts to BPF bytecode, offering kernel and user‑space probes; the guide covers its purpose, installation on major distros and Docker, and demonstrates built‑in example scripts for command logging, I/O latency, disk size, CPU usage, and TCP connection tracing.

Linux tracingPerformance Monitoringbpftrace

0 likes · 7 min read

Master Linux Tracing with bpftrace: Install, Run, and Explore Built‑in Tools

ITPUB

Jan 21, 2022 · Operations

Essential Linux Performance Monitoring: Top, vmstat, iostat, netstat & More

This guide explains how to use core Linux tools such as top, vmstat, pidstat, iostat, sar, netstat, and tcpdump to monitor CPU, memory, disk I/O, and network performance, interpret their metrics, and troubleshoot common bottlenecks on servers.

Performance Monitoringiostatlinux

0 likes · 17 min read

Essential Linux Performance Monitoring: Top, vmstat, iostat, netstat & More

Efficient Ops

Jan 19, 2022 · Operations

Master Linux Server Performance: Essential Monitoring Tools & How to Use Them

This guide explains how to monitor Linux server performance using built‑in tools such as top, vmstat, pidstat, iostat, netstat, sar and tcpdump, interpreting their output to diagnose CPU, memory, disk I/O and network issues quickly and effectively.

Performance MonitoringSystem Administrationiostat

0 likes · 17 min read

Master Linux Server Performance: Essential Monitoring Tools & How to Use Them

Kuaishou Tech

Jan 14, 2022 · Operations

Analysis of Linux System Crash Caused by Memory Leak and the Role of min_free_kbytes

This article documents a system crash triggered by a memory‑leak in a network‑chip SDK, explains how low free memory and the kswapd process lead to deadlock, and shows how adjusting the kernel parameter min_free_kbytes can prevent the freeze while highlighting the importance of resource monitoring and tuning.

Performance Monitoringkswapdlinux

0 likes · 11 min read

Analysis of Linux System Crash Caused by Memory Leak and the Role of min_free_kbytes

dbaplus Community

Jan 10, 2022 · Operations

Why Is Linux Load High? Decoding Load Average, CPU Usage & Process States

This article explains Linux process states, how the kernel calculates load average, the relationship between load and CPU utilization, and provides a systematic approach with tools and commands to identify resource bottlenecks, differentiate high‑load/high‑CPU scenarios from high‑load/low‑CPU cases, and pinpoint problematic processes and threads.

Load AveragePerformance MonitoringProcess States

0 likes · 11 min read

Why Is Linux Load High? Decoding Load Average, CPU Usage & Process States

Alibaba Terminal Technology

Jan 10, 2022 · Mobile Development

How to Accurately Measure and Optimize Android Frame Rate with APM

This article explains how APM provides frame‑rate data for Android, discusses the challenges of inaccurate FPS, introduces metrics such as scroll FPS, frozen‑frame ratio, scrollHitchRate and frame‑cause analysis, details the rendering pipeline, code implementations, optimization techniques, and integration with AB testing for performance improvement.

APMAndroidFrame Rate

0 likes · 20 min read

How to Accurately Measure and Optimize Android Frame Rate with APM

ByteDance SE Lab

Jan 7, 2022 · Mobile Development

Systematic iOS Stability Management: From Crash Classification to Advanced Attribution

This article presents a comprehensive framework for identifying, classifying, and resolving iOS stability issues—covering crash types, governance methodology, deep-dive attribution techniques, real-world case studies, and practical tools such as Zombie monitoring, Coredump, MemoryGraph, and MetricKit—to dramatically improve app reliability.

APMPerformance Monitoringcrash analysis

0 likes · 30 min read

Systematic iOS Stability Management: From Crash Classification to Advanced Attribution

ByteDance Terminal Technology

Nov 24, 2021 · Mobile Development

Systematic iOS Stability Issue Management: Classification, Methodology, and Root‑Cause Attribution

This article presents a comprehensive guide on systematically managing iOS stability problems, covering issue classification, a governance methodology, detailed root‑cause analysis for crashes, watchdogs, OOM, CPU and disk I/O anomalies, and practical tools and case studies from ByteDance’s APM platform.

APMMobile DevelopmentPerformance Monitoring

0 likes · 27 min read

Systematic iOS Stability Issue Management: Classification, Methodology, and Root‑Cause Attribution

Efficient Ops

Nov 14, 2021 · Operations

Master Linux Performance Monitoring: Top, Free, and Vmstat Commands Explained

This comprehensive guide walks you through Linux performance monitoring tools—top, free, and vmstat—detailing their syntax, key options, real‑world output examples, and how to interpret CPU, memory, swap, and process statistics for effective system administration.

FreePerformance MonitoringSystem Administration

0 likes · 32 min read

Master Linux Performance Monitoring: Top, Free, and Vmstat Commands Explained

Liangxu Linux

Nov 14, 2021 · Operations

How to Spot and Diagnose Linux Hardware Bottlenecks with GUI Tools

This guide explains how to identify CPU, memory, storage, USB, network, and GPU bottlenecks on a Linux PC using only open‑source graphical tools such as GNOME System Monitor, HardInfo, atop and nethogs, providing clear visual cues and step‑by‑step instructions.

GNOMEHardInfoHardware Bottleneck

0 likes · 14 min read

How to Spot and Diagnose Linux Hardware Bottlenecks with GUI Tools

Architecture Digest

Nov 12, 2021 · Operations

Performance Monitoring with JMeter, InfluxDB, Prometheus, and Grafana

This article explains how to set up end‑to‑end performance monitoring by sending JMeter metrics to InfluxDB via Backend Listener, visualizing them in Grafana, and similarly collecting system metrics with node_exporter and Prometheus, covering configuration, data storage, query examples, and practical visualization techniques.

GrafanaInfluxDBJMeter

0 likes · 16 min read

Performance Monitoring with JMeter, InfluxDB, Prometheus, and Grafana

58 Tech

Nov 11, 2021 · Frontend Development

Cross‑Platform Mini Program Log Collection SDK: Design, Implementation, and Practice

This article introduces a lightweight, cross‑platform SDK for mini‑programs that unifies log collection—including exception, normal, and performance data—by intercepting App, Page and platform‑specific APIs, handling multi‑environment differences, and providing flexible usage via npm or single‑file integration.

Mini ProgramPerformance MonitoringSDK

0 likes · 15 min read

Cross‑Platform Mini Program Log Collection SDK: Design, Implementation, and Practice

Efficient Ops

Nov 3, 2021 · Operations

How to Visualize JMeter Performance Data with Grafana, InfluxDB, and Prometheus

This article explains step‑by‑step how to collect JMeter test metrics via Backend Listener, store them in InfluxDB, and display real‑time performance charts—including TPS, response time, and error rates—in Grafana, while also covering node_exporter integration with Prometheus for system‑level monitoring.

GrafanaInfluxDBJMeter

0 likes · 15 min read

ByteDance SE Lab

Oct 29, 2021 · Mobile Development

Mastering iOS App Startup: Real‑World Monitoring and Optimization Techniques

This article walks through practical iOS startup optimization, covering monitoring strategies, non‑intrusive instrumentation, development‑stage testing, essential tools like Time Profiler and System Trace, and a comprehensive set of best‑practice tactics to shrink launch time and maintain performance.

InstrumentationPerformance MonitoringiOS

0 likes · 28 min read

Mastering iOS App Startup: Real‑World Monitoring and Optimization Techniques

ByteDance Terminal Technology

Oct 28, 2021 · Mobile Development

MemCorruption Tool for Detecting Memory Corruption Issues in Android Apps

The MemCorruption tool, developed by ByteDance's AppHealth team, provides an online, low‑overhead solution for detecting Use‑After‑Free, Double‑Free, and Heap‑Buffer‑Overflow problems in Android applications by hooking memory allocation functions, sampling allocations, and performing invisible SIGSEGV‑based detection.

AndroidHookNative Debugging

0 likes · 12 min read

MemCorruption Tool for Detecting Memory Corruption Issues in Android Apps

Sohu Tech Products

Oct 27, 2021 · Operations

Linux Command Cheat Sheet: Basic Operations, System Management, and Monitoring

This article provides a comprehensive collection of essential Linux command-line instructions covering system shutdown and reboot, hardware and process inspection, file manipulation, networking utilities, performance monitoring, and common administrative tasks, all organized for quick reference by developers and system administrators.

NetworkingPerformance MonitoringShell

0 likes · 20 min read

Linux Command Cheat Sheet: Basic Operations, System Management, and Monitoring

Sohu Tech Products

Oct 14, 2021 · Backend Development

Design and Implementation of Full‑Chain Tracing for Node.js Applications Using Async Hooks

This article explains the necessity, design principles, and concrete implementation of a full‑chain tracing system for Node.js applications, covering architecture choices, industry solutions, core Async Hooks concepts, the zone‑context framework, invoke‑tree construction, garbage collection, and practical demo code.

Full-Chain TracingJavaScriptNode.js

0 likes · 18 min read

Design and Implementation of Full‑Chain Tracing for Node.js Applications Using Async Hooks

ByteDance Terminal Technology

Oct 12, 2021 · Mobile Development

Understanding Resource Overflow: Memory OOM and FD Exhaustion in Android Apps

This article explains how resource overflow such as memory out‑of‑memory (OOM) and file‑descriptor (FD) exhaustion occurs in Android applications, demonstrates how to identify the root causes using MARS‑APM (APMPlus), and provides practical steps to resolve these crashes.

APMPlusAndroidMemory

0 likes · 10 min read

Understanding Resource Overflow: Memory OOM and FD Exhaustion in Android Apps

dbaplus Community

Oct 11, 2021 · Operations

How to Visualize JMeter Performance Data with Grafana, InfluxDB, and Prometheus

This guide explains step‑by‑step how to configure JMeter’s Backend Listener to send metrics to InfluxDB, set up Prometheus and node_exporter, and create Grafana dashboards for real‑time TPS, response time, and system resource monitoring.

Backend ListenerGrafanaInfluxDB

0 likes · 15 min read

Programmer DD

Sep 23, 2021 · Backend Development

How to Set Up Nginx‑GUI for Visual Nginx Management on Linux

This guide walks you through installing, configuring, and running the open‑source Nginx‑GUI tool on a Linux server, covering download, property file edits, Linux‑specific renaming, permission setup, startup commands, and the features currently supported versus pending.

ConfigurationGUINginx

0 likes · 4 min read

How to Set Up Nginx‑GUI for Visual Nginx Management on Linux

Sohu Tech Products

Sep 15, 2021 · Backend Development

Design and Implementation of Full-Chain Tracing for Node.js Applications Using Async Hooks

This article explains how to acquire full‑chain tracing information in Node.js applications by leveraging Async Hooks to build an invoke tree, manage context propagation, perform garbage collection, and attach custom metadata, enabling precise debugging of complex asynchronous call graphs.

AsyncHooksDistributed TracingFull-Chain Tracing

0 likes · 16 min read

Tencent Cloud Developer

Sep 10, 2021 · Frontend Development

Front-End Performance Monitoring (RUM) and Optimization Strategies

The article introduces Tencent Cloud’s Real‑User‑Monitoring platform, explains key performance timing points and metric calculations, links RUM data to Google Web Vitals, and demonstrates a data‑driven optimization case that cut first‑screen load from 4.8 s to 3.2 s, halved resource load time, and dramatically improved CLS.

Performance MonitoringRUMfrontend

0 likes · 16 min read

Front-End Performance Monitoring (RUM) and Optimization Strategies

Youku Technology

Sep 8, 2021 · Industry Insights

How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback

This article examines Youku's technical team's systematic approach to improving video playback by tackling slow start‑up, error handling, network stalls, audio‑video sync, and overall stability through metric‑driven optimization, adaptive streaming protocols, device capability databases, and forward‑looking features like low latency and high‑frame‑rate streaming.

Low latencyPerformance MonitoringVideo Streaming

0 likes · 12 min read

How Youku Engineers Accelerate Start‑up, Cut Stalls, and Stabilize Video Playback

vivo Internet Technology

Sep 2, 2021 · Backend Development

Full-Link Tracing in Node.js Applications: Async Hooks and Zone-Context Design

The article details a full‑link tracing system for Node.js that leverages experimental async_hooks to monitor asynchronous resource lifecycles, builds an invoke‑tree to map parent‑child relationships, implements garbage collection, and provides a ZoneContext API for propagating custom tracing data across async call chains.

Full‑Link TracingGarbage CollectionNode.js

0 likes · 19 min read

Full-Link Tracing in Node.js Applications: Async Hooks and Zone-Context Design

Liangxu Linux

Aug 31, 2021 · Operations

10 Essential Linux Commands to Diagnose Server Performance in the First Minute

When a Linux server shows performance problems, the first 60 seconds are critical; using ten standard command‑line tools such as uptime, dmesg, vmstat, mpstat, pidstat, iostat, free, sar, and top lets you quickly assess load, memory, I/O, and network health to pinpoint the root cause.

Performance MonitoringServer Diagnosticscommand-line

0 likes · 18 min read

10 Essential Linux Commands to Diagnose Server Performance in the First Minute

IT Architects Alliance

Aug 30, 2021 · Operations

Which Distributed Tracing Tool Wins? Comparing Zipkin, SkyWalking, Pinpoint

As micro‑service architectures grow, tracing every request across thousands of services becomes essential; this article examines the need for full‑link monitoring, outlines core requirements and functional modules, explains Google Dapper’s Span/Trace model, and provides a detailed performance‑focused comparison of Zipkin, SkyWalking, and Pinpoint.

APMComparisonDistributed Tracing

0 likes · 26 min read

Which Distributed Tracing Tool Wins? Comparing Zipkin, SkyWalking, Pinpoint

Open Source Linux

Aug 30, 2021 · Fundamentals

Unlocking Linux File System Secrets: Inodes, VFS, and I/O Performance

This article explains how Linux manages disks and file systems through inodes, directory entries, logical blocks, the Virtual File System layer, various I/O models, and practical performance monitoring commands, providing a comprehensive overview for developers and system engineers.

Operating SystemPerformance Monitoringlinux

0 likes · 19 min read

Unlocking Linux File System Secrets: Inodes, VFS, and I/O Performance

Code Ape Tech Column

Aug 28, 2021 · Backend Development

A Curated List of Alibaba Open‑Source Developer Tools for Backend Development

This article introduces a collection of Alibaba‑released open‑source tools—including Arthas, Cloud Toolkit, ChaosBlade, PTS, Druid, and more—detailing their usage scenarios, tutorials, and acquisition methods to help backend developers improve efficiency, debugging, monitoring, and reliability of their services.

AlibabaJavaPerformance Monitoring

0 likes · 14 min read

A Curated List of Alibaba Open‑Source Developer Tools for Backend Development

MaGe Linux Operations

Aug 27, 2021 · Operations

Visualizing JMeter Test Results in Real‑Time with Grafana, InfluxDB & Prometheus

This tutorial walks through the end‑to‑end monitoring pipeline—from JMeter’s Backend Listener sending metrics to InfluxDB, through storing and querying those metrics, to configuring Grafana dashboards for real‑time performance visualization, and also covers OS monitoring with node_exporter, Prometheus, and Grafana.

GrafanaInfluxDBJMeter

0 likes · 15 min read

Visualizing JMeter Test Results in Real‑Time with Grafana, InfluxDB & Prometheus

Code Ape Tech Column

Aug 26, 2021 · Operations

Common Linux Commands for Diagnosing Java Backend Performance Issues

This article provides a comprehensive collection of Linux commands and techniques for identifying memory, CPU, network, disk, and application bottlenecks in Java backend services, including practical examples of free, vmstat, top, sar, iostat, iotop, and jstack usage.

BackendJavaPerformance Monitoring

0 likes · 44 min read

Common Linux Commands for Diagnosing Java Backend Performance Issues

Baidu Intelligent Testing

Aug 10, 2021 · Backend Development

Evolution and Architecture of Baidu's Fengjing APM System

This article chronicles the four‑year evolution of Baidu's Fengjing performance‑monitoring platform, detailing its data collection, processing pipelines, successive architectural versions (1.0‑4.0), challenges such as probe intrusion and massive data volume, and the engineering solutions that enabled large‑scale, low‑cost, cloud‑native observability for thousands of Java services.

APMBig DataCloud Native

0 likes · 9 min read

Evolution and Architecture of Baidu's Fengjing APM System

WeChat Client Technology Team

Jul 19, 2021 · Mobile Development

How to Build a Complete ANR Monitoring Solution on Android

This article explains the Android ANR workflow, analyzes the system's appNotResponding logic, and presents a robust monitoring strategy that captures SIGQUIT signals, validates true ANR events, and hooks trace writes to reliably detect and diagnose ANRs in mobile apps.

ANRAndroidPerformance Monitoring

0 likes · 22 min read

How to Build a Complete ANR Monitoring Solution on Android

WeChat Client Technology Team

Jul 13, 2021 · Mobile Development

How to Accurately Detect UI Lag and ANR on Android: Advanced Monitoring Techniques

This article explains the relationship between UI stutter and ANR, critiques common monitoring tools, and presents three robust Android lag‑detection methods—WatchDog polling, Looper Printer replacement, and specialized handlers for IdleHandler, TouchEvent, and SyncBarrier leaks—complete with probability analysis and sample code.

ANRAndroidIdleHandler

0 likes · 17 min read

How to Accurately Detect UI Lag and ANR on Android: Advanced Monitoring Techniques

High Availability Architecture

Jul 8, 2021 · Frontend Development

Interview on Kuaishou Frontend Platform: Node.js Infrastructure, Real‑time Performance Monitoring, and Technical Challenges

The interview with Kuaishou senior engineers Zhang and Fang discusses their ten‑year web development experience, recent Node.js infrastructure and front‑end performance projects, the technical highlights, implementation challenges, architectural considerations, and the topics they will present at the upcoming GIAC conference.

Frontend InfrastructureNode.jsPerformance Monitoring

0 likes · 9 min read

Interview on Kuaishou Frontend Platform: Node.js Infrastructure, Real‑time Performance Monitoring, and Technical Challenges

Alibaba Cloud Developer

Jul 1, 2021 · Operations

Why High Load Doesn’t Always Mean High CPU: Decoding Linux Load Average & CPU Metrics

Understanding Linux load average and CPU utilization, this guide explains process states, how load is calculated, the difference between load and CPU usage, common bottlenecks, and step‑by‑step troubleshooting techniques using tools like top, vmstat, pidstat, iostat, and perf.

Performance Monitoringcpu-utilizationlinux

0 likes · 11 min read

Why High Load Doesn’t Always Mean High CPU: Decoding Linux Load Average & CPU Metrics

Selected Java Interview Questions

Jun 16, 2021 · Operations

Understanding CPU Utilization vs. CPU Load and Troubleshooting High Load Scenarios

This article explains the difference between CPU utilization and CPU load, describes how to interpret these metrics, and provides practical Linux commands and step‑by‑step methods for diagnosing high load with low utilization or vice versa, including common I/O and indexing issues.

CPULoadPerformance Monitoring

0 likes · 12 min read

Understanding CPU Utilization vs. CPU Load and Troubleshooting High Load Scenarios

HelloTech

Jun 11, 2021 · Mobile Development

Lumos Hybrid Stack: A Cross‑Platform Solution to Boost Mobile Development Efficiency

Lumos Hybrid Stack leverages Dart and Flutter to unify iOS and Android business logic, cutting duplicated development effort, accelerating debugging, and saving person‑days while providing a consistent, extensible architecture, performance monitoring, and a communication protocol that paves the way for future centralized state‑management across platforms.

DARTFlutterPerformance Monitoring

0 likes · 10 min read

Lumos Hybrid Stack: A Cross‑Platform Solution to Boost Mobile Development Efficiency

Tencent Cloud Developer

Jun 1, 2021 · Frontend Development

Front‑End Performance Monitoring: Metrics, Types, and Data Collection

Front‑end performance monitoring lets developers detect issues early, compare synthetic and real‑user approaches, track key metrics like LCP, FID, and CLS, and gather data via the web‑vitals library, Performance API, and error‑tracking hooks to systematically improve load speed, interactivity, and visual stability.

LighthousePerformance APIPerformance Monitoring

0 likes · 16 min read

Front‑End Performance Monitoring: Metrics, Types, and Data Collection

Liangxu Linux

May 27, 2021 · Operations

Master Linux Performance Monitoring: Top, Free, and vmstat Commands Explained

This article provides a detailed, step‑by‑step guide to Linux's built‑in performance monitoring utilities—top, free and vmstat—covering their purpose, essential options, output interpretation, interactive shortcuts, and practical examples for effective system administration.

FreeOperationsPerformance Monitoring

0 likes · 33 min read

Refining Core Development Skills

May 12, 2021 · Fundamentals

Linux Network Packet Sending Process: Deep Dive into Kernel Implementation

This comprehensive article provides an in-depth analysis of how Linux kernel sends network packets, covering the complete process from user-space send() call through protocol stack processing to hardware transmission, with detailed source code examination and performance considerations.

DMANetwork StackPacket Transmission

0 likes · 28 min read

Linux Network Packet Sending Process: Deep Dive into Kernel Implementation

MaGe Linux Operations

May 5, 2021 · Operations

Master Linux Performance Monitoring: Top, Free, and Vmstat Commands Explained

This guide provides a comprehensive, step‑by‑step walkthrough of the Linux performance monitoring tools top, free, and vmstat, covering command syntax, useful options, real‑world examples, output interpretation, and practical tips for effective system tuning and troubleshooting.

FreeMemory ManagementPerformance Monitoring

0 likes · 34 min read

Baidu Geek Talk

Apr 26, 2021 · Cloud Native

Evolution and Architecture of Baidu's Fengjing APM System

From its 2016 debut to the present, Baidu’s Fengjing APM system has evolved through four major releases—moving from invasive jar‑based probes to non‑invasive bytecode agents, adding modular hot‑swap plugins, scaling to thousands of containers, handling billions of daily metrics via Kafka, Doris, and SIA TSDB, while solving probe upgrade downtime, data‑ingestion volume, and call‑graph query latency.

APMBackend ArchitectureCloud Native

0 likes · 9 min read

DeWu Technology

Apr 15, 2021 · Frontend Development

How DeWu’s Frontend Platform Built an Effective Performance Monitoring System

This article explains why front‑end performance is critical, outlines the most cost‑effective metrics such as FCP, TTS, and TSP, describes the SDK‑SLS‑Backend architecture, shares key technical decisions, and proposes future enhancements for a robust monitoring solution.

Performance MonitoringSDKSLS

0 likes · 10 min read

How DeWu’s Frontend Platform Built an Effective Performance Monitoring System

Code Ape Tech Column

Apr 12, 2021 · Operations

Which APM Tool Wins? Deep Dive into Zipkin, Pinpoint, and SkyWalking

With micro‑service architectures generating complex call chains across thousands of servers, this article analyzes full‑link monitoring concepts, outlines essential requirements, details core components like spans and traces, and compares three major APM solutions—Zipkin, Pinpoint, and SkyWalking—evaluating probe impact, scalability, and data analysis capabilities.

APMDapperDistributed Tracing

0 likes · 26 min read

Which APM Tool Wins? Deep Dive into Zipkin, Pinpoint, and SkyWalking

Programmer DD

Apr 8, 2021 · Operations

How to Diagnose Linux Memory, CPU, Network, and Disk Bottlenecks with Command‑Line Tools

This guide explains how to use Linux command‑line utilities such as free, vmstat, top, sar, iostat, iotop, netstat, and jstack to identify memory, CPU, network, and disk performance bottlenecks, and provides practical commands for monitoring and troubleshooting each resource.

CPUMemoryPerformance Monitoring

0 likes · 41 min read

How to Diagnose Linux Memory, CPU, Network, and Disk Bottlenecks with Command‑Line Tools

Youzan Coder

Apr 7, 2021 · Mobile Development

Design and Implementation of a Mobile App Performance Monitoring System

The article describes a two‑part mobile app performance monitoring system that automatically instruments code to capture method execution times, ANR and frame stalls, then processes, cleans, aggregates, and visualizes the data on a backend platform to generate alerts, trend reports, and guide optimization across versions.

APMPerformance Monitoringdata-processing

0 likes · 11 min read

Design and Implementation of a Mobile App Performance Monitoring System

ELab Team

Mar 25, 2021 · Frontend Development

Mastering Front-End Event Tracking: From Data Collection to Performance Metrics

This article explains front‑end event tracking fundamentals, compares passive, code‑based, and SDK solutions, outlines common event attributes and types, details performance data collection via the Performance API, describes error capture methods, and presents reliable data reporting techniques using Beacon and image fallbacks.

Data ReportingError HandlingPerformance Monitoring

0 likes · 20 min read

Mastering Front-End Event Tracking: From Data Collection to Performance Metrics

Efficient Ops

Mar 15, 2021 · Operations

Master Linux Performance Monitoring: Top, Free, and Vmstat Commands Explained

This guide provides a comprehensive walkthrough of the Linux performance monitoring tools top, free, and vmstat, detailing their syntax, options, output interpretation, and practical tips for real‑time system analysis and optimization.

Performance MonitoringSystem Administrationfree command

0 likes · 34 min read

Node Underground

Mar 12, 2021 · Cloud Native

How Alinode Boosts Node.js Observability and Scheduling in the Cloud‑Native Era

Alinode expands its Node.js performance diagnostics into a full‑stack observability and scheduling platform for serverless workloads, offering traffic monitoring, white‑screen logs, remote debugging, crash analysis, standardized metrics, and a cloud‑native runtime that balances cost and performance.

Cloud NativeNode.jsObservability

0 likes · 11 min read

How Alinode Boosts Node.js Observability and Scheduling in the Cloud‑Native Era

Architect

Mar 9, 2021 · Operations

Full‑Stack Distributed Tracing and Monitoring: Comparing Zipkin, Pinpoint, and SkyWalking

The article explains the need for full‑link monitoring in micro‑service architectures, describes the core concepts of tracing such as spans and traces, outlines functional modules of APM systems, and provides a detailed comparison of three popular solutions—Zipkin, Pinpoint, and SkyWalking—covering performance impact, scalability, data analysis, developer transparency, and topology visualization.

APMDistributed TracingPerformance Monitoring

0 likes · 27 min read

Full‑Stack Distributed Tracing and Monitoring: Comparing Zipkin, Pinpoint, and SkyWalking

Xianyu Technology

Mar 9, 2021 · Mobile Development

Xianyu Android Log Governance and Performance Monitoring Solution

The Xianyu Android Log Governance solution unifies local console logs, TLog and online logs, adds comprehensive logcat capture and AUS‑OSS retrieval, replaces BlockCanary with low‑overhead frame‑callback lag/ANR detection, and provides dashboards and a batch‑query platform, cutting technical‑opinion ratio from 10.5 % to 4.7 % and boosting upload success.

ANR detectionAndroidLog Management

0 likes · 13 min read

Xianyu Android Log Governance and Performance Monitoring Solution

Architecture Digest

Mar 7, 2021 · Operations

Full‑Link Monitoring: Concepts, Requirements, Architecture and Comparative Evaluation of APM Solutions

The article explains the need for full‑link monitoring in microservice architectures, outlines its functional modules and design goals, details the core data structures of Google Dapper (Span, Trace, Annotation) with code examples, and compares three popular APM tools—Zipkin, Pinpoint and SkyWalking—across performance, scalability, analysis depth, transparency and topology features.

APMDistributed TracingMicroservices

0 likes · 25 min read

Full‑Link Monitoring: Concepts, Requirements, Architecture and Comparative Evaluation of APM Solutions

Alibaba Terminal Technology

Mar 5, 2021 · Frontend Development

Node.js & Frontend Innovations: Monitoring, Full-Stack Hooks, Fast CDN, Visual JS

This article introduces five cutting‑edge solutions for modern web development: an Addon‑based Node.js performance monitoring and fault‑location tool, the full‑stack Midway Hooks framework for zero‑API backend development, the ultra‑fast AirpackCDN component delivery network, the visual‑oriented iMove JavaScript framework, and the versatile SSR framework 5.0 supporting both React and Vue.

CDNFull-Stack FrameworkNode.js

0 likes · 5 min read

Node.js & Frontend Innovations: Monitoring, Full-Stack Hooks, Fast CDN, Visual JS

Sohu Tech Products

Mar 3, 2021 · Operations

Comprehensive Comparison of Full‑Link Tracing Solutions: Zipkin, SkyWalking, and Pinpoint

This article examines the challenges of monitoring distributed micro‑service architectures, outlines the requirements for a low‑overhead, non‑intrusive tracing system, describes the core concepts of Google Dapper (Span, Trace, Annotation), and provides a detailed functional and performance comparison of three popular APM tools—Zipkin, SkyWalking, and Pinpoint—covering probe impact, collector scalability, data analysis depth, developer transparency, topology visualization, and overall suitability.

APMDistributed TracingMicroservices

0 likes · 23 min read

Comprehensive Comparison of Full‑Link Tracing Solutions: Zipkin, SkyWalking, and Pinpoint

dbaplus Community

Feb 25, 2021 · Operations

How Distributed Tracing Solves Microservice Performance Bottlenecks with SkyWalking

This article explains the principles of distributed tracing, the OpenTracing standard, SkyWalking's architecture and sampling strategies, and shares a company's practical customizations—including forced sampling, fine‑grained group sampling, log4j traceId injection, and self‑developed plugins—to help pinpoint performance issues in microservice environments.

Distributed TracingJavaObservability

0 likes · 17 min read

How Distributed Tracing Solves Microservice Performance Bottlenecks with SkyWalking

Youzan Coder

Feb 24, 2021 · Mobile Development

Building Offline Mobile Performance Monitoring with AWACS and APM

This article explains how Youzan extended its APM framework with offline monitoring, built the AWACS visual tool, integrated Appium‑driven regression, instrumented method timing and network traffic via Gradle plugins, captured page rendering time, processed data in the backend, and created an issue‑management platform, outlining future enhancements.

APMAndroidMobile Development

0 likes · 13 min read

Building Offline Mobile Performance Monitoring with AWACS and APM

Aikesheng Open Source Community

Feb 21, 2021 · Databases

Using MySQL Events and Stored Procedures to Monitor InnoDB Lock Waits

This article explains how to replace shell‑script based lock‑wait monitoring with MySQL Event and Stored Procedure mechanisms, showing step‑by‑step creation of a monitoring database, procedure, scheduled event, and how to interpret the generated lock‑wait log table for troubleshooting.

Event SchedulerInnoDBLock Wait

0 likes · 6 min read

Using MySQL Events and Stored Procedures to Monitor InnoDB Lock Waits

Didi Tech

Feb 4, 2021 · Backend Development

Inside DiDi’s NodeX: Building a Scalable Node.js Ecosystem for Enterprise

DiDi’s NodeX initiative unites multiple front‑end teams to create a group‑level, professional, efficient, and stable Node.js development ecosystem that provides reusable components, a unified framework, common services, performance‑analysis tools, and documentation platforms, dramatically lowering service‑building barriers and boosting business productivity.

Backend DevelopmentComponent ArchitectureDevOps

0 likes · 12 min read

Inside DiDi’s NodeX: Building a Scalable Node.js Ecosystem for Enterprise

OPPO Kernel Craftsman

Jan 15, 2021 · Mobile Development

Android Frame Drop Detection and Monitoring Principles

The article explains Android’s frame‑drop detection and monitoring principles, covering refresh rate, Vsync, Choreographer and FrameInfo mechanisms, how lag is perceived, methods to calculate and retrieve dropped frames, and best practices such as avoiding heavy main‑thread work to ensure smooth UI performance.

AndroidPerformance MonitoringUI optimization

0 likes · 6 min read

Android Frame Drop Detection and Monitoring Principles

21CTO

Jan 8, 2021 · Databases

How ICBC Scales MySQL to 10,000 Nodes: Governance, Challenges, and Solutions

This article summarizes ICBC's experience of deploying nearly ten thousand MySQL nodes, detailing the current status, challenges, governance strategies—including standards, containerization, automated monitoring, emergency handling, and post‑incident analysis—and outlines future improvement plans for core‑application databases.

AutomationDatabase GovernanceICBC

0 likes · 17 min read

How ICBC Scales MySQL to 10,000 Nodes: Governance, Challenges, and Solutions

ITPUB

Jan 8, 2021 · Databases

How ICBC Scales MySQL to 10,000 Nodes: Governance, Automation, and Performance Strategies

This article outlines Industrial and Commercial Bank of China's journey from early MySQL adoption to managing nearly ten thousand production nodes, detailing the challenges of high‑capacity, high‑concurrency workloads and the comprehensive governance framework—including standards, pre‑emptive checks, automated incident response, and future self‑healing plans—used to ensure reliable core‑application data storage.

AutomationDatabase GovernanceLarge‑Scale Deployment

0 likes · 17 min read

How ICBC Scales MySQL to 10,000 Nodes: Governance, Automation, and Performance Strategies

Top Architect

Dec 29, 2020 · Operations

Comprehensive Guide to Java Runtime Error Checking: CPU, Disk, Memory, GC, and Network Troubleshooting

This article provides a step‑by‑step guide for diagnosing Java production issues by systematically checking CPU usage, disk health, memory consumption, garbage‑collection behavior, and network problems using common Linux tools and JVM utilities such as ps, top, jstack, jstat, vmstat, iostat, free, jmap, and tcpdump.

CPUGarbage CollectionJava

0 likes · 21 min read

Comprehensive Guide to Java Runtime Error Checking: CPU, Disk, Memory, GC, and Network Troubleshooting

Aikesheng Open Source Community

Dec 28, 2020 · Operations

Building a Custom MySQL Observation Tool with bcc and eBPF

This tutorial explains how to create a Python‑based eBPF tool using the bcc framework to trace MySQL Group Replication's apply_data_packet function, covering environment setup, BPF program writing, attaching probes, and displaying real‑time thread and timestamp information.

BCCObservabilityPerformance Monitoring

0 likes · 8 min read

Building a Custom MySQL Observation Tool with bcc and eBPF

Practical DevOps Architecture

Dec 25, 2020 · Databases

MySQL Command Shortcuts, Help Commands, and Administration Tips

This guide lists useful MySQL client shortcuts, explains the help command usage, provides common mysqladmin administration examples, and shows how to diagnose high CPU usage by tracing the offending SQL statements through process and performance schema queries.

Command-LineDatabase AdministrationPerformance Monitoring

0 likes · 5 min read

MySQL Command Shortcuts, Help Commands, and Administration Tips

JD Retail Technology

Dec 9, 2020 · Mobile Development

Building and Optimizing a First‑Screen Performance Monitoring System for Android Apps (JD Mall Case Study)

This article describes how JD Mall’s Android team designed a custom first‑screen latency monitoring system, collected lifecycle, network and rendering metrics via AOP and configuration, and applied a series of optimizations—including plugin preloading, network parallelism, layout inflation tricks, and hardware acceleration—to achieve over 30% faster app start‑up and significantly reduced user churn.

AndroidMobile DevelopmentPerformance Monitoring

0 likes · 17 min read

Building and Optimizing a First‑Screen Performance Monitoring System for Android Apps (JD Mall Case Study)

Code Ape Tech Column

Dec 7, 2020 · Operations

How to Determine If Redis Is Actually Slowing Down Using Baseline Performance

The article explains how to verify whether Redis performance degradation is real by measuring response latency, establishing a baseline using the --intrinsic-latency option, and comparing current latency against the baseline, with a rule of roughly twice the baseline indicating a slowdown.

Baseline LatencyIntrinsic LatencyPerformance Monitoring

0 likes · 6 min read

How to Determine If Redis Is Actually Slowing Down Using Baseline Performance

JavaEdge

Nov 25, 2020 · Operations

How Distributed Tracing Locates Failures and Optimizes Microservice Performance

This article explains the importance of service tracing in micro‑service architectures, describes core concepts such as traceId and spanId, outlines a three‑layer tracing system (collection, processing, visualization), and shows how real‑time and offline processing enable rapid fault isolation and system‑wide performance optimization.

Distributed TracingMicroservicesPerformance Monitoring

0 likes · 10 min read

How Distributed Tracing Locates Failures and Optimizes Microservice Performance

Liangxu Linux

Nov 18, 2020 · Operations

Diagnosing Linux Server Performance in the First 60 Seconds

This guide walks you through ten essential Linux command‑line tools—uptime, dmesg, vmstat, mpstat, pidstat, iostat, free, sar, and top—explaining what each metric means, how to interpret the output, and how to quickly spot utilization, saturation, and error issues within the first minute of investigation.

Performance MonitoringSystem Administrationcommand-line

0 likes · 20 min read

Diagnosing Linux Server Performance in the First 60 Seconds

Java Backend Technology

Nov 8, 2020 · Operations

How Distributed Tracing with SkyWalking Solves Microservice Performance Challenges

This article explains the principles, architecture, and practical adoption of distributed tracing—covering OpenTracing standards, SkyWalking's design, sampling strategies, plugin development, and real‑world company practices—to help engineers pinpoint bottlenecks and improve observability in microservice systems.

Distributed TracingMicroservicesObservability

0 likes · 17 min read

How Distributed Tracing with SkyWalking Solves Microservice Performance Challenges

FunTester

Oct 30, 2020 · Operations

Mastering Fiddler: Capture and Analyze HTTP/HTTPS Traffic Like a Pro

This guide explains how to use Fiddler to intercept and inspect HTTP and HTTPS requests, walks through its interface components, shows configuration steps for browsers, Android and iPhone, demonstrates filtering by domain, and outlines key metrics for request performance analysis.

FiddlerHTTP debuggingHTTPS capture

0 likes · 7 min read

Mastering Fiddler: Capture and Analyze HTTP/HTTPS Traffic Like a Pro

Programmer DD

Oct 16, 2020 · Operations

Mastering JVisualVM: Detect Memory Leaks and Monitor Java Apps

This guide introduces JVisualVM, explains its installation and plugin system, demonstrates how to create a memory‑leak example, use VisualGC and sampling tools to identify leaking objects, and shows step‑by‑step remote monitoring of a Tomcat server via JMX, providing practical screenshots and code snippets.

JVisualVMJava profilingPerformance Monitoring

0 likes · 8 min read

Mastering JVisualVM: Detect Memory Leaks and Monitor Java Apps

MaGe Linux Operations

Sep 18, 2020 · Operations

Essential Linux Operations Metrics for Effective Monitoring

This guide enumerates the key Linux system metrics—covering CPU, memory, disk, I/O, network, kernel parameters, RAID, SMART, NTP, and process information—that open-falcon agents collect every minute to enable comprehensive operations monitoring and timely issue detection.

MetricsOpen-FalconOperations

0 likes · 12 min read

Essential Linux Operations Metrics for Effective Monitoring

Full-Stack Internet Architecture

Sep 17, 2020 · Operations

Understanding Distributed Tracing and SkyWalking: Principles, Architecture, and Practical Implementation

This article explains the fundamentals of distributed tracing, the OpenTracing standard, and how SkyWalking implements automatic span collection, cross‑process context propagation, unique traceId generation, sampling strategies, performance benchmarks, and real‑world adaptations within a micro‑service environment.

Distributed TracingJavaMicroservices

0 likes · 16 min read

Understanding Distributed Tracing and SkyWalking: Principles, Architecture, and Practical Implementation

IT Architects Alliance

Sep 8, 2020 · Operations

How to Diagnose Linux Server Performance Issues in the First 60 Seconds

This guide walks you through ten essential Linux command‑line tools—such as uptime, vmstat, iostat, and top—showing how Netflix’s performance engineers use them to quickly assess system load, resource saturation, and errors within the first minute of investigation.

OperationsPerformance Monitoringcommand-line

0 likes · 19 min read

How to Diagnose Linux Server Performance Issues in the First 60 Seconds

360 Tech Engineering

Aug 10, 2020 · Databases

TiDB Cluster Write‑Write Conflict Investigation and Resolution

This article analyzes a TiDB cluster performance incident where QPS dropped and duration spiked due to write‑write conflicts, detailing the monitoring data, root‑cause investigation of server‑busy and scheduler latch issues, and the attempted mitigation steps such as enabling txn‑local‑latches and adjusting insert statements.

Cluster TroubleshootingDatabase operationsPerformance Monitoring

0 likes · 10 min read

TiDB Cluster Write‑Write Conflict Investigation and Resolution

Efficient Ops

Jul 27, 2020 · Operations

How to Turn Raw Network Traffic into Actionable Insights: A Step‑by‑Step Guide

This article explains why network traffic is valuable, how to capture it with tools like Wireshark or professional probes, decode packets for performance or business monitoring, and apply the extracted data to network, application, database monitoring and innovative business use cases.

Network TrafficPerformance MonitoringTCP/IP

0 likes · 6 min read

How to Turn Raw Network Traffic into Actionable Insights: A Step‑by‑Step Guide

JD Retail Technology

Jul 22, 2020 · Mobile Development

Flutter Frame‑Rate Monitoring Tool and Rendering Pipeline Analysis in the Sunglasses Platform

This article explains the Flutter rendering pipeline—including layout, paint, and compositing stages—details the widget‑element‑render object hierarchy, describes the task runners involved in each frame, and analyzes the Sunglasses‑integrated frame‑rate monitoring tool that classifies frame timings for performance debugging.

Mobile DevelopmentPerformance MonitoringRendering

0 likes · 9 min read

Flutter Frame‑Rate Monitoring Tool and Rendering Pipeline Analysis in the Sunglasses Platform

Top Architect

Jul 11, 2020 · Operations

7 Key Performance Metrics for Application Monitoring and Their Recommended Tools

The article outlines seven essential performance metrics—response time and throughput, average load, error rate, GC pause time, business indicators, uptime, and log size—explaining their significance for application health and recommending popular monitoring tools for each metric.

Garbage CollectionLog ManagementPerformance Monitoring

0 likes · 6 min read

7 Key Performance Metrics for Application Monitoring and Their Recommended Tools