Search

Results

Matches for “observability”

625 results
Operations Mar 1, 2024 Yum! Tech Team

Building an Observability System Traffic Distribution Diagram

This article explains how to design and implement a traffic distribution diagram for an observability system, covering current cloud‑native tooling, data standardization, transformation, traffic‑flow modeling, aggregation, storage with ClickHouse, and visualisation techniques such as Sankey diagrams.

cloud native, observability, data modeling, visualization, traffic diagram
Cloud Native Jan 29, 2024 DaTaobao Tech

Observability: Logging, Metrics, and Tracing in Distributed Systems

Observability in distributed systems combines event logging, aggregated metrics, and request tracing, each with distinct trade-offs in detail, storage, and overhead. The ELK stack dominates log and metric handling, while tracing solutions such as EagleEye and SkyWalking differ by protocol and language, prompting many teams to adopt unified, cloud-native platforms like Alibaba Cloud’s Log Service for lower cost, real-time analysis, and simplified management.

cloud-native, observability, metrics, logging, tracing, ELK, SLS
Operations Jan 22, 2024 Efficient Ops

How New Oriental Standardized Its Observability System to Cut Costs and Boost Efficiency

At the 21st GOPS Global Operations Conference, New Oriental's senior operations manager Qi Chen detailed the demand, technical, and focus pressures that drove a phased, full‑process observability standardization, leveraging OpenTelemetry, Telegraf, Loki and CMDB tagging to achieve cost reduction and higher stability.

monitoring, cloud native, observability, DevOps, OpenTelemetry, SRE, cost reduction
Cloud Native Jan 21, 2024 DataFunTalk

Building a System Observability Framework with YHP: Practices, Challenges, and Integrated Solutions

This article explains how YHP enables cloud-native systems to achieve comprehensive observability. It defines the three core signals (metrics, traces, and logs), addresses common enterprise pain points, and presents an integrated platform that unifies data collection, storage, analysis, and visualization for efficient fault diagnosis and performance monitoring.

Cloud Native, Observability, Metrics, Data Platform, Logs, Traces
Operations Jan 7, 2024 FunTester

Integrating Monitoring and Observability for Effective Application Performance Management

The article explains how combining traditional monitoring with modern observability, supported by data quality practices and unified workflows, enables more reliable, scalable, and insightful application performance management in agile and cloud‑native environments.

monitoring, performance, APM, operations, observability, data quality
Operations Dec 14, 2023 DevOps Cloud Academy

CI/CD Observability via OpenTelemetry at Grafana Labs

The article explains the importance of CI/CD observability, outlines common pipeline problems, introduces Grafana's GraCIe plugin built on OpenTelemetry, and discusses how enhanced visibility can improve reliability, decision‑making, and future standardization across CI/CD platforms.

Monitoring, CI/CD, Observability, DevOps, OpenTelemetry, Grafana
Cloud Native Dec 14, 2023 Xiaohongshu Tech REDtech

Evolution of Xiaohongshu Metrics System: Cloud‑Native Observability, High Availability, and Performance Optimizations

Xiaohongshu’s observability team rebuilt its Prometheus-based metrics platform using vmagent, dual-active HA clusters, query push-down, high-cardinality governance, and a multi-cloud active-active design. The rebuild delivered a ten-fold increase in collection speed, up to 70× query capacity, substantial CPU, memory, and storage savings, and fully automated scaling.

performance, cloud-native, observability, high-availability, metrics, Victoriametrics, time-series
Databases Dec 13, 2023 DataFunTalk

SelectDB Boosts GuanceDB Observability: Architecture Upgrade, Cost Reduction, and Performance Gains

This article details how SelectDB’s inverted‑index, Variant data type, and sampling capabilities were integrated into GuanceDB to replace Elasticsearch, achieving up to 70% storage cost reduction, 2‑4× query speed improvement, and a ten‑fold overall cost‑performance boost for log analytics and observability workloads.

Performance, Cloud Native, Database, Observability, Log Analytics, SelectDB
Backend Development Dec 12, 2023 Qunar Tech Salon

System Slimming at Qunar Travel: Reducing Code and Service Footprint by 50% Using Observability and Automation

This article presents Qunar Travel's "system slimming" project, describing how observability techniques, a two‑stage strategy, and automated tooling were used to identify and remove unused services and code, achieving a 50% reduction in code size, a 26% cut in services, and measurable improvements in reliability and release efficiency.

Java, microservices, automation, observability, backend optimization, code reduction, service slimming
Operations Nov 20, 2023 Ops Development Stories

How eBPF Powers Next‑Gen Observability and Fault Diagnosis in Kubernetes

At KubeCon China 2023, experts Liu Kai and Dong Shandong presented a three‑part deep dive into Kubernetes observability challenges, demonstrating how eBPF enables comprehensive data collection across all stack layers, seamless integration, and intelligent root‑cause analysis through dimension attribution, anomaly bounding, and fault‑tree methods.

Cloud Native, Observability, Kubernetes, eBPF, Fault Diagnosis