Tagged articles
180 articles
Page 2 of 2
Code Ape Tech Column
Code Ape Tech Column
Oct 21, 2021 · Backend Development

Integrating Spring Cloud Sleuth and Zipkin for Distributed Tracing in Microservices

This tutorial explains the principles of distributed tracing, why it is needed for microservice architectures, and provides step‑by‑step instructions for adding Spring Cloud Sleuth and Zipkin—including Maven dependencies, configuration, Docker deployment, and Elasticsearch persistence—to a Spring Cloud project.

Distributed TracingDockerElasticsearch
0 likes · 14 min read
Integrating Spring Cloud Sleuth and Zipkin for Distributed Tracing in Microservices
Sohu Tech Products
Sohu Tech Products
Sep 15, 2021 · Backend Development

Design and Implementation of Full-Chain Tracing for Node.js Applications Using Async Hooks

This article explains how to acquire full‑chain tracing information in Node.js applications by leveraging Async Hooks to build an invoke tree, manage context propagation, perform garbage collection, and attach custom metadata, enabling precise debugging of complex asynchronous call graphs.

AsyncHooksDistributed TracingFull-Chain Tracing
0 likes · 16 min read
Design and Implementation of Full-Chain Tracing for Node.js Applications Using Async Hooks
IT Architects Alliance
IT Architects Alliance
Aug 30, 2021 · Operations

Which Distributed Tracing Tool Wins? Comparing Zipkin, SkyWalking, Pinpoint

As micro‑service architectures grow, tracing every request across thousands of services becomes essential; this article examines the need for full‑link monitoring, outlines core requirements and functional modules, explains Google Dapper’s Span/Trace model, and provides a detailed performance‑focused comparison of Zipkin, SkyWalking, and Pinpoint.

APMComparisonDistributed Tracing
0 likes · 26 min read
Which Distributed Tracing Tool Wins? Comparing Zipkin, SkyWalking, Pinpoint
Java Architect Essentials
Java Architect Essentials
Aug 12, 2021 · Cloud Native

Integrating Zipkin with Spring Cloud Sleuth for Distributed Tracing in Microservices

The article explains how to implement distributed tracing in a microservice architecture using Spring Cloud Sleuth and Zipkin, covering the concepts of spans and traces, configuration steps, code examples, deployment methods, sampling considerations, and persistence options such as MySQL and Elasticsearch.

Distributed TracingMicroservicesSpring Boot
0 likes · 25 min read
Integrating Zipkin with Spring Cloud Sleuth for Distributed Tracing in Microservices
Baidu Intelligent Testing
Baidu Intelligent Testing
Aug 10, 2021 · Backend Development

Evolution and Architecture of Baidu's Fengjing APM System

This article chronicles the four‑year evolution of Baidu's Fengjing performance‑monitoring platform, detailing its data collection, processing pipelines, successive architectural versions (1.0‑4.0), challenges such as probe intrusion and massive data volume, and the engineering solutions that enabled large‑scale, low‑cost, cloud‑native observability for thousands of Java services.

APMBig DataCloud Native
0 likes · 9 min read
Evolution and Architecture of Baidu's Fengjing APM System
Tencent Cloud Developer
Tencent Cloud Developer
Jul 22, 2021 · Operations

Observability in Serverless Environments: Monitoring, Logging, Distributed Tracing, and Best Practices

In this talk, Gal Bashan explains how serverless architectures complicate observability and why metrics, logs, and especially distributed tracing with tools like OpenTelemetry, Jaeger, or commercial platforms are essential for gaining end-to-end visibility, automating instrumentation, and maintaining reliable, business-focused services across cloud providers.

Cloud NativeDistributed TracingServerless
0 likes · 12 min read
Observability in Serverless Environments: Monitoring, Logging, Distributed Tracing, and Best Practices
Baidu Geek Talk
Baidu Geek Talk
Jun 30, 2021 · Operations

How Baidu Achieves 5‑9+ Availability: Inside Its Stability Engineering and Observability

This article dissects Baidu Search's ultra‑large micro‑service architecture, detailing the challenges of maintaining five‑nine‑plus availability, the diverse failure modes, and the step‑by‑step evolution of its observability stack—from early log‑only analysis to the kepler1.0/kepler2.0 tracing, full‑log indexing, custom span‑id generation, and compression techniques that together enable rapid root‑cause diagnosis at massive scale.

Baidu SearchDistributed TracingMetrics
0 likes · 21 min read
How Baidu Achieves 5‑9+ Availability: Inside Its Stability Engineering and Observability
Code Ape Tech Column
Code Ape Tech Column
Jun 29, 2021 · Industry Insights

Which Distributed Tracing Tool Wins? A Deep Dive into Dapper, Zipkin, Pinpoint, and SkyWalking

This article examines the challenges of monitoring complex micro‑service architectures, outlines the objectives of full‑link tracing, explains the Span/Trace data model, describes core functional modules, and provides a detailed performance and feature comparison of Google Dapper, Zipkin, Pinpoint, and SkyWalking.

APMDistributed TracingFull‑Link Monitoring
0 likes · 22 min read
Which Distributed Tracing Tool Wins? A Deep Dive into Dapper, Zipkin, Pinpoint, and SkyWalking
Liulishuo Tech Team
Liulishuo Tech Team
Jun 2, 2021 · Backend Development

Understanding Distributed Tracing and Its Use at Liulishuo

This article explains what distributed tracing is, why it is needed alongside logging and metrics for observability, how it works with trace and span IDs, and describes Liulishuo's implementation using OpenTelemetry, W3C Trace Context, and tail‑based sampling to improve backend debugging.

Distributed TracingMicroservicesOpenTelemetry
0 likes · 9 min read
Understanding Distributed Tracing and Its Use at Liulishuo
New Oriental Technology
New Oriental Technology
May 24, 2021 · Operations

Overview of SkyWalking UI: Dashboard, Topology, Tracing, Profiling, and Alerts

The article provides a comprehensive English overview of SkyWalking UI, detailing its dashboard metrics, topology visualization, trace analysis, performance profiling workflow, and alarm management, illustrating how the tool monitors microservice and cloud‑native environments with metrics such as throughput, latency, Apdex, and JVM statistics.

APMDistributed TracingSkyWalking
0 likes · 11 min read
Overview of SkyWalking UI: Dashboard, Topology, Tracing, Profiling, and Alerts
Baidu Geek Talk
Baidu Geek Talk
Apr 26, 2021 · Cloud Native

Evolution and Architecture of Baidu's Fengjing APM System

From its 2016 debut to the present, Baidu’s Fengjing APM system has evolved through four major releases—moving from invasive jar‑based probes to non‑invasive bytecode agents, adding modular hot‑swap plugins, scaling to thousands of containers, handling billions of daily metrics via Kafka, Doris, and SIA TSDB, while solving probe upgrade downtime, data‑ingestion volume, and call‑graph query latency.

APMBackend ArchitectureCloud Native
0 likes · 9 min read
Evolution and Architecture of Baidu's Fengjing APM System
Code Ape Tech Column
Code Ape Tech Column
Apr 12, 2021 · Operations

Which APM Tool Wins? Deep Dive into Zipkin, Pinpoint, and SkyWalking

With micro‑service architectures generating complex call chains across thousands of servers, this article analyzes full‑link monitoring concepts, outlines essential requirements, details core components like spans and traces, and compares three major APM solutions—Zipkin, Pinpoint, and SkyWalking—evaluating probe impact, scalability, and data analysis capabilities.

APMDapperDistributed Tracing
0 likes · 26 min read
Which APM Tool Wins? Deep Dive into Zipkin, Pinpoint, and SkyWalking
Top Architect
Top Architect
Mar 16, 2021 · Operations

Full-Link Monitoring: Concepts, Architecture, and Comparison of Zipkin, SkyWalking, and Pinpoint

This article explains the fundamentals of full‑link (distributed) monitoring, describes its core components such as spans, traces and annotations, outlines typical system architecture, and provides a detailed performance and feature comparison of three popular APM solutions—Zipkin, SkyWalking, and Pinpoint.

APMDistributed TracingFull‑Link Monitoring
0 likes · 22 min read
Full-Link Monitoring: Concepts, Architecture, and Comparison of Zipkin, SkyWalking, and Pinpoint
Architect
Architect
Mar 9, 2021 · Operations

Full‑Stack Distributed Tracing and Monitoring: Comparing Zipkin, Pinpoint, and SkyWalking

The article explains the need for full‑link monitoring in micro‑service architectures, describes the core concepts of tracing such as spans and traces, outlines functional modules of APM systems, and provides a detailed comparison of three popular solutions—Zipkin, Pinpoint, and SkyWalking—covering performance impact, scalability, data analysis, developer transparency, and topology visualization.

APMDistributed TracingPerformance Monitoring
0 likes · 27 min read
Full‑Stack Distributed Tracing and Monitoring: Comparing Zipkin, Pinpoint, and SkyWalking
Architecture Digest
Architecture Digest
Mar 7, 2021 · Operations

Full‑Link Monitoring: Concepts, Requirements, Architecture and Comparative Evaluation of APM Solutions

The article explains the need for full‑link monitoring in microservice architectures, outlines its functional modules and design goals, details the core data structures of Google Dapper (Span, Trace, Annotation) with code examples, and compares three popular APM tools—Zipkin, Pinpoint and SkyWalking—across performance, scalability, analysis depth, transparency and topology features.

APMDistributed TracingMicroservices
0 likes · 25 min read
Full‑Link Monitoring: Concepts, Requirements, Architecture and Comparative Evaluation of APM Solutions
Sohu Tech Products
Sohu Tech Products
Mar 3, 2021 · Operations

Comprehensive Comparison of Full‑Link Tracing Solutions: Zipkin, SkyWalking, and Pinpoint

This article examines the challenges of monitoring distributed micro‑service architectures, outlines the requirements for a low‑overhead, non‑intrusive tracing system, describes the core concepts of Google Dapper (Span, Trace, Annotation), and provides a detailed functional and performance comparison of three popular APM tools—Zipkin, SkyWalking, and Pinpoint—covering probe impact, collector scalability, data analysis depth, developer transparency, topology visualization, and overall suitability.

APMDistributed TracingMicroservices
0 likes · 23 min read
Comprehensive Comparison of Full‑Link Tracing Solutions: Zipkin, SkyWalking, and Pinpoint
dbaplus Community
dbaplus Community
Feb 25, 2021 · Operations

How Distributed Tracing Solves Microservice Performance Bottlenecks with SkyWalking

This article explains the principles of distributed tracing, the OpenTracing standard, SkyWalking's architecture and sampling strategies, and shares a company's practical customizations—including forced sampling, fine‑grained group sampling, log4j traceId injection, and self‑developed plugins—to help pinpoint performance issues in microservice environments.

Distributed TracingOpenTracingPerformance Monitoring
0 likes · 17 min read
How Distributed Tracing Solves Microservice Performance Bottlenecks with SkyWalking
Tencent Cloud Middleware
Tencent Cloud Middleware
Feb 22, 2021 · Operations

Why Distributed Tracing Matters: OpenTracing, OpenTelemetry, and Tencent’s TSW

Tracing has evolved from early log and stack‑trace techniques to modern distributed observability standards like OpenTracing and OpenTelemetry, and Tencent’s Cloud Service Watcher (TSW) demonstrates how cloud providers integrate these protocols to simplify microservice monitoring, performance metrics, and root‑cause analysis.

Cloud NativeDistributed TracingMicroservices
0 likes · 11 min read
Why Distributed Tracing Matters: OpenTracing, OpenTelemetry, and Tencent’s TSW
MaGe Linux Operations
MaGe Linux Operations
Feb 13, 2021 · Operations

Comparing Full‑Link Tracing Tools: Zipkin vs Pinpoint vs SkyWalking

This article examines the challenges of monitoring distributed micro‑service architectures, outlines the requirements for a full‑link tracing system, and provides a detailed comparison of three popular APM solutions—Zipkin, Pinpoint, and SkyWalking—covering performance impact, scalability, data analysis, developer transparency, and topology visualization.

APMDistributed TracingFull‑Link Monitoring
0 likes · 28 min read
Comparing Full‑Link Tracing Tools: Zipkin vs Pinpoint vs SkyWalking
dbaplus Community
dbaplus Community
Feb 1, 2021 · Operations

How to Build a Low‑Cost Distributed Tracing System for Microservices

This article explains the evolution from a monolithic architecture to microservices, outlines the new pain points such as fault isolation, performance bottlenecks and scaling inefficiencies, and presents a practical, low‑cost distributed tracing solution with unified frameworks, components, configuration management, data collection, and visualization.

Configuration ManagementDistributed TracingPerformance debugging
0 likes · 31 min read
How to Build a Low‑Cost Distributed Tracing System for Microservices
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Jan 30, 2021 · Operations

How to Choose the Right Full‑Link Tracing Tool: Zipkin vs Pinpoint vs SkyWalking

This article explains the background of full‑link monitoring in micro‑service architectures, outlines the key requirements for tracing tools, describes core concepts such as spans, traces and annotations, compares Zipkin, Pinpoint and SkyWalking across performance, scalability, data analysis, transparency and topology features, and provides practical deployment guidance to help you select the most suitable solution.

APMDistributed TracingFull‑Link Monitoring
0 likes · 28 min read
How to Choose the Right Full‑Link Tracing Tool: Zipkin vs Pinpoint vs SkyWalking
Zhuanzhuan Tech
Zhuanzhuan Tech
Jan 27, 2021 · Backend Development

IP‑Tag Based Traffic Routing and Distributed Tracing System for Test Environments

This article describes how a microservice architecture uses IP‑tag traffic routing to separate dynamic test environments from stable ones, implements RPC and MQ routing, and introduces a custom Zipkin‑based distributed tracing system called Tianwang to improve deployment efficiency, resource utilization, and debugging in large‑scale backend services.

Distributed TracingMQMicroservices
0 likes · 12 min read
IP‑Tag Based Traffic Routing and Distributed Tracing System for Test Environments
Efficient Ops
Efficient Ops
Jan 26, 2021 · Operations

How Full‑Link Tracing Tools Compare: Zipkin vs SkyWalking vs Pinpoint

This article examines the challenges of monitoring complex micro‑service architectures, outlines the goals and functional modules of full‑link tracing systems, explains Google Dapper’s core concepts such as Span, Trace and Annotation, and provides a detailed performance, scalability and feature comparison of three popular APM solutions—Zipkin, SkyWalking and Pinpoint.

APMDistributed TracingFull‑Link Monitoring
0 likes · 25 min read
How Full‑Link Tracing Tools Compare: Zipkin vs SkyWalking vs Pinpoint
NetEase Media Technology Team
NetEase Media Technology Team
Dec 8, 2020 · Operations

Comprehensive Online Load‑Testing and Stability Assurance Framework

The stability‑assurance squad built an online load‑testing framework that injects global TraceIds via a Java‑agent, records real‑traffic, routes test writes to shadow databases and caches, enforces automatic stop‑rules, and provides a UI platform, reducing cost, improving capacity insight, and enabling safe fault‑injection drills.

Distributed TracingJava AgentLoad Testing
0 likes · 12 min read
Comprehensive Online Load‑Testing and Stability Assurance Framework
JavaEdge
JavaEdge
Nov 25, 2020 · Operations

How Distributed Tracing Locates Failures and Optimizes Microservice Performance

This article explains the importance of service tracing in micro‑service architectures, describes core concepts such as traceId and spanId, outlines a three‑layer tracing system (collection, processing, visualization), and shows how real‑time and offline processing enable rapid fault isolation and system‑wide performance optimization.

Distributed TracingMicroservicesPerformance Monitoring
0 likes · 10 min read
How Distributed Tracing Locates Failures and Optimizes Microservice Performance
vivo Internet Technology
vivo Internet Technology
Nov 18, 2020 · Cloud Native

vivo Distributed Tracing System Agent Technology Principles and Practical Experience

The 2017‑initiated vivo distributed tracing system leverages a JavaAgent‑based micro‑kernel architecture, using ByteBuddy for non‑intrusive bytecode instrumentation, a Disruptor lock‑free queue, and Kafka to capture Trace/Span data—including cross‑thread propagation—while employing sampling, degradation, and JVM metrics to ensure 94% adoption stability.

DisruptorDistributed TracingJavaAgent
0 likes · 23 min read
vivo Distributed Tracing System Agent Technology Principles and Practical Experience
Wukong Talks Architecture
Wukong Talks Architecture
Nov 13, 2020 · Cloud Native

Understanding Distributed Tracing with Spring Cloud Sleuth and Zipkin

This article explains the principles and practical steps for implementing distributed tracing in microservice architectures using Spring Cloud Sleuth and Zipkin, covering why tracing is needed, core concepts like Span and Trace, Maven integration, Docker deployment, Zipkin UI usage, and performance analysis.

Distributed TracingDockerElasticsearch
0 likes · 13 min read
Understanding Distributed Tracing with Spring Cloud Sleuth and Zipkin
Tencent Cloud Developer
Tencent Cloud Developer
Nov 12, 2020 · Big Data

Tencent Tianji Ge Distributed Tracing System: Elasticsearch Optimization Practice

Tencent’s Tianji Ge distributed tracing platform, which combines tracing, metrics and logging for billions of daily records, overcame severe cluster jitter and storage latency by applying tiered Elasticsearch index templates, replica reduction, Transport‑API buffering, pre‑created indices and ILM, cutting write latency from 20 s to 0.32 s, shrinking shards by 70 % and saving 30 % of storage.

Big Data InfrastructureDistributed TracingElasticsearch Optimization
0 likes · 22 min read
Tencent Tianji Ge Distributed Tracing System: Elasticsearch Optimization Practice
Java Backend Technology
Java Backend Technology
Nov 8, 2020 · Operations

How Distributed Tracing with SkyWalking Solves Microservice Performance Challenges

This article explains the principles, architecture, and practical adoption of distributed tracing—covering OpenTracing standards, SkyWalking's design, sampling strategies, plugin development, and real‑world company practices—to help engineers pinpoint bottlenecks and improve observability in microservice systems.

Distributed TracingMicroservicesOpenTracing
0 likes · 17 min read
How Distributed Tracing with SkyWalking Solves Microservice Performance Challenges
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Sep 22, 2020 · Operations

Design and Implementation of a Distributed Call‑Chain Tracing System for Microservices

This article explains how to design a non‑intrusive distributed tracing system for microservices by assigning global TraceIDs, generating hierarchical SpanIDs, using lightweight agents to propagate identifiers via transport headers, and aggregating data in a collector to visualize complete call graphs and diagnose performance issues.

Distributed TracingMicroservicesTrace ID
0 likes · 6 min read
Design and Implementation of a Distributed Call‑Chain Tracing System for Microservices
Full-Stack Internet Architecture
Full-Stack Internet Architecture
Sep 17, 2020 · Operations

Understanding Distributed Tracing and SkyWalking: Principles, Architecture, and Practical Implementation

This article explains the fundamentals of distributed tracing, the OpenTracing standard, and how SkyWalking implements automatic span collection, cross‑process context propagation, unique traceId generation, sampling strategies, performance benchmarks, and real‑world adaptations within a micro‑service environment.

Distributed TracingMicroservicesPerformance Monitoring
0 likes · 16 min read
Understanding Distributed Tracing and SkyWalking: Principles, Architecture, and Practical Implementation
IT Architects Alliance
IT Architects Alliance
Sep 14, 2020 · Operations

Implementation of Service Chain Monitoring and End-to-End Process Monitoring

This article explains how to design and implement service‑chain (APM) monitoring and end‑to‑end process monitoring in distributed systems, covering concepts such as spans and traces, TRACE_ID generation, logging practices, visualisation techniques, and a practical expense‑report use case with code examples.

APMDistributed TracingMicroservices
0 likes · 15 min read
Implementation of Service Chain Monitoring and End-to-End Process Monitoring
Programmer DD
Programmer DD
Aug 4, 2020 · Operations

Why SkyWalking’s Architecture Powers Modern Cloud‑Native Observability

This article, excerpted from Wu Sheng’s "Apache SkyWalking实战", explains SkyWalking’s four‑core component architecture, its protocol‑oriented, modular, and lightweight design principles, and highlights its advantages such as seamless traditional and cloud‑native support, easy maintenance, high performance, and extensibility for secondary development.

APMApache SkyWalkingDistributed Tracing
0 likes · 12 min read
Why SkyWalking’s Architecture Powers Modern Cloud‑Native Observability
Java Captain
Java Captain
May 16, 2020 · Fundamentals

Deep Dive into Java ThreadLocal: Data Structure, Hash Algorithm, Cleanup, and Usage

This article provides an in‑depth analysis of Java’s ThreadLocal mechanism, covering its weak‑reference key, internal ThreadLocalMap structure, hash algorithm, collision resolution, cleanup strategies, expansion logic, source code walkthroughs, and practical usage scenarios such as trace‑ID propagation in distributed systems.

Distributed TracingGarbage CollectionThreadLocal
0 likes · 27 min read
Deep Dive into Java ThreadLocal: Data Structure, Hash Algorithm, Cleanup, and Usage
360 Quality & Efficiency
360 Quality & Efficiency
Mar 13, 2020 · Cloud Native

An Overview of Spring Cloud and Its Core Components

This article provides a comprehensive introduction to Spring Cloud, explaining how it builds on Spring Boot to simplify distributed system development with components for service discovery, configuration management, messaging, monitoring, security, and compares its approach to Dubbo's RPC framework.

Cloud NativeConfiguration ManagementDistributed Tracing
0 likes · 6 min read
An Overview of Spring Cloud and Its Core Components
Youzan Coder
Youzan Coder
Mar 6, 2020 · Backend Development

Full-Link Tracing System: Architecture, Java Agent Integration, Multi-language Support, and Data Processing

Youzan’s full‑link tracing system combines a multi‑language SDK, Java Agent dynamic attachment, transparent upgrades, asynchronous context propagation, and a Spark‑based data pipeline that indexes traces in Elasticsearch and stores them in HBase, enabling real‑time diagnostics, log correlation, and future container‑level tracing expansion.

Distributed TracingJava AgentMicroservices
0 likes · 15 min read
Full-Link Tracing System: Architecture, Java Agent Integration, Multi-language Support, and Data Processing
dbaplus Community
dbaplus Community
Dec 10, 2019 · Databases

Unveiling Sharding-Proxy: Transparent Database Sharding in Apache ShardingSphere

This talk walks through Sharding-Proxy’s role within Apache ShardingSphere, detailing its architecture, SQL lifecycle from protocol decoding to routing, rewriting, execution, and result merging, while highlighting core principles, performance bottlenecks, and practical optimization techniques for high‑throughput, low‑latency database proxying.

Distributed TracingNettySQL Routing
0 likes · 17 min read
Unveiling Sharding-Proxy: Transparent Database Sharding in Apache ShardingSphere
Java Captain
Java Captain
Nov 30, 2019 · Cloud Native

Microservices Architecture Overview: Core Concepts, Benefits, and Implementation Details

This article provides a comprehensive overview of microservices, covering its origin, definition, core principles such as small independent services, process isolation, lightweight communication, deployment and management, while discussing advantages, challenges, suitable organizational structures, and key technical components like service discovery, gateways, configuration centers, monitoring, circuit breaking, containers, and orchestration frameworks.

DevOpsDistributed TracingMicroservices
0 likes · 15 min read
Microservices Architecture Overview: Core Concepts, Benefits, and Implementation Details
Qunhe Technology Quality Tech
Qunhe Technology Quality Tech
Oct 21, 2019 · Frontend Development

Accelerate Web Testing with a Chrome Extension for Hunter Tracing

This article explains how to build a Chrome extension that integrates with the Hunter distributed tracing system to automatically capture failing API calls during web testing, visualize request chains, and improve debugging efficiency, including manifest configuration, background and popup scripts, and implementation details.

Chrome ExtensionDistributed Tracingfrontend development
0 likes · 8 min read
Accelerate Web Testing with a Chrome Extension for Hunter Tracing
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Aug 15, 2019 · Operations

Navigating the Open‑Source Distributed Tracing Landscape: Tools, Features, and How to Choose

This guide surveys the most popular open‑source distributed tracing projects, classifying them by instrumentation, tracer, and analysis capabilities, and explains how they fit into modern microservice observability, helping newcomers understand each tool’s strengths, integrations, and the broader tracing ecosystem.

APMCloud NativeDistributed Tracing
0 likes · 10 min read
Navigating the Open‑Source Distributed Tracing Landscape: Tools, Features, and How to Choose
Sohu Tech Products
Sohu Tech Products
Jul 3, 2019 · Cloud Native

Building a Cloud‑Native Distributed Tracing System with Jaeger

This article explains why Jaeger is a popular cloud‑native tracing solution, describes its architecture, sampling options, and deployment strategies on Kubernetes—including DaemonSet and Sidecar modes—followed by a step‑by‑step Django integration example and guidance on monitoring, alerting, and resource cleanup.

Cloud NativeDistributed TracingDjango
0 likes · 13 min read
Building a Cloud‑Native Distributed Tracing System with Jaeger
Ctrip Technology
Ctrip Technology
Apr 18, 2019 · Operations

Application Monitoring Systems: Necessity, Components, Distributed Tracing, and Design for Developers, Testers, and Operations

The article explains why enterprise application monitoring systems are essential, outlines their core components such as Trace, Log, Metric, and Report, discusses distributed tracing techniques, and describes how these insights are designed to aid developers, testers, and operations engineers in performance tuning and fault diagnosis.

Distributed Tracingapplication monitoringobservability
0 likes · 12 min read
Application Monitoring Systems: Necessity, Components, Distributed Tracing, and Design for Developers, Testers, and Operations
Architecture Talk
Architecture Talk
Mar 10, 2019 · Operations

Mastering Distributed Tracing: From Dapper to Zipkin and OpenTracing

This article explains the fundamentals of distributed tracing, covering the original Dapper concepts, the architecture and data model of Zipkin, sampling strategies, storage mechanisms, and how OpenTracing provides a vendor‑neutral API for integrating tracing into microservice systems.

BackendDapperDistributed Tracing
0 likes · 13 min read
Mastering Distributed Tracing: From Dapper to Zipkin and OpenTracing
Programmer DD
Programmer DD
Feb 26, 2019 · Operations

Mastering Distributed Tracing: From Dapper to Zipkin and OpenTracing

This article explores the fundamentals of distributed tracing, detailing concepts from Google's Dapper paper, the architecture and data model of Zipkin, sampling mechanisms, data propagation, and OpenTracing standards, while providing code examples and practical insights for implementing tracing in microservice environments.

DapperDistributed TracingOpenTracing
0 likes · 14 min read
Mastering Distributed Tracing: From Dapper to Zipkin and OpenTracing
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Dec 11, 2018 · Operations

Why SkyWalking Beats Zipkin and Pinpoint: A Deep Dive into APM Tools

With micro‑service architectures causing requests to span dozens of services across multiple teams and data centers, this article explains APM fundamentals, details Google’s Dapper tracing model, and compares three popular APM solutions—Zipkin, Pinpoint, and SkyWalking—highlighting performance impact, scalability, data analysis depth, developer transparency, topology visualization, and community support.

APMDistributed TracingPerformance Monitoring
0 likes · 15 min read
Why SkyWalking Beats Zipkin and Pinpoint: A Deep Dive into APM Tools
Big Data and Microservices
Big Data and Microservices
Aug 15, 2018 · Operations

What Is APM? A Deep Dive into Application Performance Management and Top Open‑Source Tools

This article explains Application Performance Management (APM), its role in monitoring distributed and micro‑service systems, outlines the five‑dimensional APM model, details core monitoring functions, and reviews leading open‑source APM solutions such as PinPoint, Zipkin, SkyWalking, Prometheus, CAT and Hawkular.

APMDistributed TracingOperations
0 likes · 8 min read
What Is APM? A Deep Dive into Application Performance Management and Top Open‑Source Tools
Architect's Tech Stack
Architect's Tech Stack
Jul 26, 2018 · Operations

Deploying Pinpoint for Distributed Tracing of Dubbo Services

This guide explains how to install, configure, and use the open‑source Pinpoint APM tool to monitor Java‑based Dubbo applications, covering environment preparation, downloading binaries, modifying configuration files, deploying collector and web components, installing agents, and adding startup parameters for both Tomcat and SpringBoot deployments.

APMDistributed TracingDubbo
0 likes · 9 min read
Deploying Pinpoint for Distributed Tracing of Dubbo Services
AntTech
AntTech
May 31, 2018 · Cloud Native

Introducing the Newly Open‑Source SOFA Middleware Components for Cloud‑Native Microservices

This article introduces the recently open‑sourced SOFA middleware components—including SOFATracer, SOFALookout, SOFA‑Boot, SOFA‑RPC (Node and Java), Eggjs integration, and JarsLink—explaining their features, usage scenarios, and providing GitHub links for developers to adopt these financial‑grade cloud‑native solutions.

Distributed TracingMetricsSOFA
0 likes · 9 min read
Introducing the Newly Open‑Source SOFA Middleware Components for Cloud‑Native Microservices
JD Tech
JD Tech
May 9, 2018 · Cloud Native

Deep Dive into JD.com Microservice Component Platform and Its Cloud‑Native Architecture

An extensive overview of JD.com's microservice component platform explains its underlying architecture, communication technologies, service mesh (ContainerMesh), design tools based on DDD, service evaluation, migration challenges, and the platform's evolution toward a cloud‑native, self‑evolving ecosystem supporting distributed tracing, API gateways, and plugin‑based development.

DDDDistributed TracingMicroservices
0 likes · 17 min read
Deep Dive into JD.com Microservice Component Platform and Its Cloud‑Native Architecture
Architecture Digest
Architecture Digest
May 8, 2018 · Backend Development

Design and Comparison of Distributed Tracing Systems

The article explains the concept, functions, design goals, data models, log collection, and deployment considerations of distributed tracing systems, and compares several open‑source and proprietary solutions such as Dapper, Zipkin, Pinpoint, Alibaba Eagle Eye, and JD Hydra to guide the selection of an appropriate tracing platform.

BackendDistributed TracingMicroservices
0 likes · 16 min read
Design and Comparison of Distributed Tracing Systems
Efficient Ops
Efficient Ops
Apr 2, 2018 · Operations

How Bilibili Revamped Its Monitoring Architecture: From Zabbix to Dapper

An in‑depth look at Bilibili’s multi‑layer monitoring overhaul, detailing the shift from a monolithic Zabbix setup to micro‑service‑based ELK, Dapper, Misaka, Traceon and Lancer systems, and how layered observability improves fault detection across business, application, and infrastructure levels.

Distributed TracingMicroservicesOperations
0 likes · 10 min read
How Bilibili Revamped Its Monitoring Architecture: From Zabbix to Dapper
JD Tech
JD Tech
Feb 28, 2018 · Operations

CallGraph: JD.com's Distributed Tracing and Service Governance Platform

CallGraph is JD.com's internally developed distributed tracing and service governance platform that addresses the challenges of monitoring complex microservice architectures by providing low‑intrusion, low‑latency tracing, real‑time analytics, configurable sampling, and integration with JMQ, Storm, Spark, HBase, and JimDB for both operational insight and performance optimization.

Big DataDistributed TracingMicroservices
0 likes · 12 min read
CallGraph: JD.com's Distributed Tracing and Service Governance Platform
Programmer DD
Programmer DD
Feb 23, 2018 · Operations

How Zipkin Collects and Processes Sleuth Tracing Data – Deep Dive into Spans

This article explains Zipkin’s data model, how Spring Cloud Sleuth generates and sends Span and Annotation information, the message‑channel listener that converts Sleuth spans to Zipkin spans, debugging techniques to observe the collected data, and why the number of spans shown in Zipkin’s UI can differ from the raw count.

Distributed TracingMicroservicesSpring Cloud Sleuth
0 likes · 17 min read
How Zipkin Collects and Processes Sleuth Tracing Data – Deep Dive into Spans
Programmer DD
Programmer DD
Feb 20, 2018 · Backend Development

Master Distributed Tracing in Spring Cloud with Sleuth: A Hands‑On Guide

This tutorial explains how Spring Cloud Sleuth creates and propagates Trace‑ID and Span‑ID across microservice calls, shows the required HTTP headers, demonstrates adding the spring‑cloud‑starter‑sleuth dependency, customizing header output, configuring logging levels, and provides runnable code samples with log output for verification.

Distributed TracingMicroservicesSleuth
0 likes · 8 min read
Master Distributed Tracing in Spring Cloud with Sleuth: A Hands‑On Guide
Hujiang Technology
Hujiang Technology
Jan 29, 2018 · Operations

Design and Implementation of a Low‑Impact Distributed Tracing System for Service Calls

This article describes the background, design goals, architecture, implementation details, and lessons learned from building a low‑overhead, low‑intrusion distributed tracing system using Kafka, Elasticsearch, and OpenTracing to monitor microservice interactions and support performance analysis and DevOps decision‑making.

Distributed TracingElasticsearchKafka
0 likes · 9 min read
Design and Implementation of a Low‑Impact Distributed Tracing System for Service Calls
Meituan Technology Team
Meituan Technology Team
Jan 12, 2018 · Backend Development

Design and Implementation of Meituan Hotel Full-Chain Log and Trace System

To cope with Meituan Hotel’s exploding micro‑service complexity, the infrastructure team built the Satellite System—combining MTrace and a selective, zero‑intrusion Log4j2‑based logging pipeline that streams enriched logs through Kafka, Storm, Redis and Elasticsearch, delivering second‑level trace‑log queries and six‑month retention, dramatically speeding up debugging.

Distributed TracingElasticsearchKafka
0 likes · 11 min read
Design and Implementation of Meituan Hotel Full-Chain Log and Trace System
Qunar Tech Salon
Qunar Tech Salon
Oct 26, 2017 · Operations

Evolution of Pinterest's Monitoring System: From Time-Series Metrics to Distributed Tracing

Over seven years, Pinterest’s monitoring team built and refined a three‑pronged observability platform—time‑series metrics, log search, and distributed tracing—scaling from a single‑machine system to handling millions of data points per second across tens of thousands of AWS VMs, while addressing reliability, cost, and usability challenges.

Distributed TracingSREcloud infrastructure
0 likes · 19 min read
Evolution of Pinterest's Monitoring System: From Time-Series Metrics to Distributed Tracing
Efficient Ops
Efficient Ops
Oct 24, 2017 · Operations

How Pinterest Scaled Its Monitoring, Logging, and Tracing Over Seven Years

This article chronicles Pinterest's seven‑year evolution from a single‑machine time‑series monitor to a multi‑component system that integrates metrics, log search, and distributed tracing, sharing architectural choices, scaling challenges, and lessons learned for building reliable, high‑performance operations platforms.

Distributed TracingOperationsSRE
0 likes · 24 min read
How Pinterest Scaled Its Monitoring, Logging, and Tracing Over Seven Years
dbaplus Community
dbaplus Community
Oct 10, 2017 · Operations

How to Build Effective Service Monitoring: Principles, Practices, and Technical Implementation

This article explains why service monitoring is essential for large‑scale microservice environments, outlines design principles, core monitoring components, dependency mapping, call‑chain analysis, capacity planning, root‑cause analysis, and presents a practical technical architecture for implementing robust monitoring solutions.

Distributed TracingOperationscapacity planning
0 likes · 12 min read
How to Build Effective Service Monitoring: Principles, Practices, and Technical Implementation
Qunar Tech Salon
Qunar Tech Salon
Aug 14, 2017 · Backend Development

Introduction to QTracer: An Internal Distributed Tracing System at Qunar

QTracer is Qunar’s internal distributed tracing system that generates a global TraceID for each request, records operations across services, and provides features such as execution chain visualization, log correlation, conditional search, service dependency analysis, database statistics, transparent data propagation, and low‑overhead instrumentation for debugging and performance monitoring.

BackendDistributed TracingQTracer
0 likes · 20 min read
Introduction to QTracer: An Internal Distributed Tracing System at Qunar
Ctrip Technology
Ctrip Technology
Aug 10, 2017 · Operations

QTracer: An In‑Depth Overview of Qunar’s Distributed Tracing System

This article provides a comprehensive technical overview of QTracer, Qunar’s internal distributed tracing platform, covering its architecture, core concepts, key features such as execution‑chain queries, log association, conditional searches, data storage, non‑intrusive instrumentation, bytecode injection, and the QTracer Debug tool for online breakpoint debugging.

BackendDistributed TracingQTracer
0 likes · 19 min read
QTracer: An In‑Depth Overview of Qunar’s Distributed Tracing System
Beike Product & Technology
Beike Product & Technology
Jul 16, 2017 · Industry Insights

How Lianjia Built LTrace: A Low‑Overhead, Scalable Distributed Tracing Platform

This article explains how Lianjia designed and implemented LTrace, a zero‑intrusion, high‑performance distributed tracing system that captures full request chains across heterogeneous services, supports multi‑language environments, offers flexible sampling, and enables rapid fault isolation and performance optimization.

Distributed TracingSamplingarchitecture
0 likes · 12 min read
How Lianjia Built LTrace: A Low‑Overhead, Scalable Distributed Tracing Platform
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Jun 20, 2017 · Cloud Native

How Uber Built Jaeger: From In‑House Tracing to a Cloud‑Native Open‑Source Platform

Uber’s engineering team chronicles the evolution of its distributed tracing system—from the early Merckx pull‑based solution and TChannel integration to the open‑source Jaeger platform—detailing architectural shifts, sampling strategies, multi‑language client libraries, and the move toward a fully cloud‑native, end‑to‑end observability stack.

Cloud NativeDistributed TracingMicroservices
0 likes · 17 min read
How Uber Built Jaeger: From In‑House Tracing to a Cloud‑Native Open‑Source Platform
dbaplus Community
dbaplus Community
Feb 6, 2017 · Operations

How JD’s CallGraph Transforms Distributed Tracing for Real‑Time Operations

CallGraph, JD.com’s in‑house distributed tracing platform, provides low‑intrusion, high‑performance monitoring for micro‑service ecosystems, enabling real‑time call‑graph analysis, TP metrics, flexible configuration, and future extensions such as deep‑learning‑driven insights.

Distributed TracingLog Processingmonitoring
0 likes · 15 min read
How JD’s CallGraph Transforms Distributed Tracing for Real‑Time Operations
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Dec 8, 2016 · Operations

Designing Effective End-to-End Tracing Systems for Distributed Services

This article surveys the design of end‑to‑end tracing systems for large distributed services, explaining core use cases, tracing approaches, metadata propagation, sampling strategies, visualization techniques, and recommended design choices to improve debugging, performance analysis, and resource attribution.

Distributed TracingSamplingSystem Design
0 likes · 44 min read
Designing Effective End-to-End Tracing Systems for Distributed Services
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Dec 1, 2016 · Operations

Why Distributed Tracing Systems Are Essential for Modern Microservices

As microservice architectures grow, service calls become increasingly complex, involving dozens of services and teams, making rapid fault localization and comprehensive data analysis critical; distributed tracing systems address these challenges by providing end‑to‑end visibility, low‑overhead instrumentation, and scalable monitoring across large‑scale applications.

Distributed TracingFault LocalizationMicroservices
0 likes · 8 min read
Why Distributed Tracing Systems Are Essential for Modern Microservices
Architecture Digest
Architecture Digest
Oct 25, 2016 · Backend Development

MTrace: Meituan‑Dianping Distributed Session Tracing System Design and Practice

The article introduces MTrace, Meituan‑Dianping’s large‑scale distributed session tracing system, explaining its call‑chain concept, architecture, data‑embedding SDK, trace and span identifiers, APIs for transparent data propagation, and how it enables bottleneck detection, performance optimization, and comprehensive monitoring across heterogeneous backend services.

Distributed TracingMicroservicesPerformance Monitoring
0 likes · 11 min read
MTrace: Meituan‑Dianping Distributed Session Tracing System Design and Practice
Meituan Technology Team
Meituan Technology Team
Oct 14, 2016 · Backend Development

Large-Scale Service Governance Design and Practice Using MTrace

MTrace, Meituan‑Dianping’s internal distributed tracing platform, assigns a global 64‑bit traceId to each request, instruments RPC, HTTP, database and messaging calls, aggregates data via Kafka into HBase and Hive, and visualizes the full call chain to pinpoint network cross‑datacenter traffic, latency bottlenecks, redundant calls, and correlated exceptions, thereby enabling systematic service‑level optimization.

BackendDistributed TracingMTrace
0 likes · 12 min read
Large-Scale Service Governance Design and Practice Using MTrace
dbaplus Community
dbaplus Community
May 11, 2016 · Operations

Inside Twitter’s Massive Monitoring Stack: Architecture, Metrics, and Lessons Learned

Twitter’s internal monitoring team built a full‑stack observability platform that handles billions of metric writes per minute, supports distributed tracing, log aggregation, visual dashboards, and alerting across data centers and public clouds, and shares the architecture, components, and key lessons learned.

AlertingDistributed TracingMetrics
0 likes · 18 min read
Inside Twitter’s Massive Monitoring Stack: Architecture, Metrics, and Lessons Learned
21CTO
21CTO
Apr 24, 2016 · Backend Development

Inside Sina Weibo’s Third-Gen Backend Architecture for 100M+ Users

To support over 140 million monthly active users, Sina Weibo evolved from a simple LAMP stack to a sophisticated third‑generation architecture featuring layered interfaces, service and resource tiers, custom RPC and messaging frameworks, SSD caching, and a large‑scale distributed tracing system called WatchMan.

Backend ArchitectureDistributed TracingScalable Systems
0 likes · 11 min read
Inside Sina Weibo’s Third-Gen Backend Architecture for 100M+ Users
Architect
Architect
Nov 8, 2015 · Backend Development

Weibo Platform Third‑Generation Architecture Overview

The article details how Weibo evolved from a LAMP‑based first generation to a third‑generation, orthogonal three‑layer (interface, service, resource) and four‑vertical (business, technology, monitoring, governance) architecture that supports massive high‑availability, high‑concurrency workloads with middleware such as Web V4, MCQ, Motan, object store, SSDCache, and the WatchMan distributed tracing system.

Distributed TracingWeibobackend-development
0 likes · 10 min read
Weibo Platform Third‑Generation Architecture Overview
21CTO
21CTO
Aug 15, 2015 · Backend Development

Inside Weibo’s Third‑Generation Backend Architecture: Scalability and High‑Availability

An in‑depth look at Weibo’s evolution to its third‑generation backend system, detailing the orthogonal decomposition model, three‑tier horizontal layering, key middleware such as MCQ, Motan RPC, SSDCache, and the WatchMan tracing platform that together enable high‑availability, massive concurrency, and low‑latency services for billions of users.

Backend ArchitectureDistributed Tracinghigh availability
0 likes · 12 min read
Inside Weibo’s Third‑Generation Backend Architecture: Scalability and High‑Availability
Qunar Tech Salon
Qunar Tech Salon
Nov 17, 2014 · Cloud Native

Service Scalability Challenges and Architectural Solutions in the Cloud Era

In the cloud and mobile era, while many scalability issues can be addressed by cloud platforms, services still face challenges such as remote RPC calls, distributed tracing, configuration management including service discovery and load balancing, and scheduling with lifecycle management, prompting architects to design transparent, pluggable solutions.

Distributed TracingRPCservice discovery
0 likes · 7 min read
Service Scalability Challenges and Architectural Solutions in the Cloud Era