Search

Discover articles.

Search across authors, categories, and technical themes. The layout mirrors the editorial references while staying responsive and fast.

Results

Matches for “observability”

668 results
Cloud Native Mar 27, 2024 ByteDance Cloud Native

How ByteDance Optimized Its Metrics Agent for 70% CPU Savings

This article details how ByteDance's cloud‑native observability team tackled performance bottlenecks in their metricserver2 Agent—reducing memory copies, merging tiny packets, applying SIMD for tag parsing, and switching compression libraries—to cut CPU usage by over 10% and memory usage by nearly 20% while handling petabyte‑scale metric data.

performance optimizationobservabilityC++SIMDcompressionmsgpack
Operations Mar 25, 2024 Efficient Ops

Why SRE Exists and How It Solves Modern Reliability Challenges

This article explains why Site Reliability Engineering (SRE) emerged, outlines its core responsibilities, required skill set, and how SRE teams use SLOs, monitoring, and scenario drills to improve system reliability, performance, and observability in complex production environments.

MonitoringOperationsDevOpsSREReliabilitySLO
Backend Development Mar 21, 2024 Tencent Cloud Developer

Backend Refactoring and Architecture Design of Tencent Docs Collection Form Service

Tencent Docs transformed its high‑traffic Collection Form by refactoring a monolithic C++‑style service into 19 loosely‑coupled vertical services with light‑heavy separation, database isolation, async Kafka pipelines, and full observability via Tianji, achieving dramatically improved stability, millisecond‑level sync, reliable export, and faster incident resolution.

backendperformancecloud-nativearchitectureobservabilitymicroservices
Cloud Computing Mar 20, 2024 DevOps

Platform Engineering: Beyond Infrastructure – Core Pillars and Human Collaboration

The article explains that platform engineering extends far beyond basic infrastructure, highlighting its core pillars such as automation, composability, agility, observability, and the essential role of collaboration and culture in creating value‑driven, cloud‑native software delivery.

cloud computingplatform engineeringautomationobservabilitycollaboration
Backend Development Mar 13, 2024 Sohu Tech Products

OpenResty‑Based Interface Authentication, Traffic Control, and Request Tracing in Production

The article shows how OpenResty’s Lua‑based extensions can implement lightweight, version‑controlled API authentication, dynamic traffic‑shaping via shared‑memory peer status, and selective request tracing with batch‑sent logs to Elasticsearch, enabling secure, highly available services and rich observability without sacrificing Nginx performance in production.

Traffic controlLoggingNginxLuaOpenRestyAPI authentication
Operations Mar 10, 2024 DevOps Cloud Academy

Top 10 Open‑Source Monitoring Tools for DevOps in 2024 – Features, Pros and Cons

This article reviews the ten most important open‑source monitoring and observability tools for modern DevOps teams in 2024, outlining each tool's key features, advantages, disadvantages, and how they compare for performance, scalability, cost and ease of use.

MonitoringOperationsObservabilityDevOpsOpen SourceTool Comparison
Frontend Development Mar 4, 2024 DevOps

Building QQ Front-end Unified Access Layer: Architecture, Technical Choices, and Performance Insights

This article shares a decade‑long journey of designing and scaling the QQ front‑end unified access layer, covering business background, overall architecture, solution comparisons, core challenges, observability, and performance optimizations while reflecting on practical lessons for large‑scale front‑end systems.

case studyfrontendperformancearchitectureobservabilitygatewaytRPC
Mobile Development Feb 23, 2024 OPPO Kernel Craftsman

Understanding Perfetto Data Flow Architecture and Reducing Trace Data Loss

Perfetto’s tracing system links multiple producers to a single consumer via shared‑memory buffers, where careful sizing of pages, chunks, and central buffers, along with tuned protobuf encoding and scheduling priorities, mitigates CPU overhead and prevents data loss, enabling reliable observability on Android devices.

PerformanceAndroidObservabilityShared MemoryData FlowtracingPerfetto
Big Data Feb 22, 2024 DataFunTalk

Flink on Kubernetes: Kuaishou’s Practice, Migration, and Future Refactoring

This article details Kuaishou’s five‑year evolution of Flink, covering its background, production refactoring to Kubernetes, migration practices, and future improvements, highlighting architecture layers, resource management, observability, and testing strategies for large‑scale stream processing.

MigrationCloud NativeBig DataFlinkObservabilityKubernetes
Frontend Development Feb 20, 2024 Tencent Cloud Developer

From Frontend to Full‑Stack: Architecture, Challenges, and Practices of the QQ Frontend Unified Access Layer

The veteran front‑end engineer chronicles a decade of building QQ’s large‑scale products, detailing how the new Frontend Unified Access Layer replaced fragmented SDKs with a high‑performance, scalable, secure gateway built on an internal http2rpc framework, while tackling legacy protocol coexistence, observability, alert fatigue, and targeted performance optimizations.

frontendperformancearchitectureobservabilitytRPCfull-stack
Previous Page 19 Next