Alibaba Cloud Observability
Author

Alibaba Cloud Observability

Driving continuous progress in observability technology!

144
Articles
0
Likes
222
Views
0
Comments
Recent Articles

Latest from Alibaba Cloud Observability

100 recent articles max
Alibaba Cloud Observability
Alibaba Cloud Observability
Jan 13, 2025 · Information Security

Why Log Auditing Is Essential for Cloud Security and Compliance

This article explains the importance of centralized log auditing for breaking information silos, meeting legal requirements, and enhancing security insights, and details how Alibaba Cloud's Simple Log Service (SLS) supports VPC flow log collection, multi‑region aggregation, rule configuration, custom analysis, and alerting.

Alibaba CloudLog AuditingVPC flow logs
0 likes · 18 min read
Why Log Auditing Is Essential for Cloud Security and Compliance
Alibaba Cloud Observability
Alibaba Cloud Observability
Jan 13, 2025 · Cloud Native

Alibaba Cloud’s Guide to Stable Large‑Scale Kubernetes After OpenAI Crash

After the OpenAI outage caused massive Kubernetes API overload, Alibaba Cloud’s Container Service and Observability teams detail how they reinforce large‑scale K8s clusters with high‑availability control‑plane design, optimized Prometheus probing, out‑of‑band monitoring, and best‑practice guidelines for capacity planning, safe releases, and rapid incident response.

Alibaba CloudCluster StabilityKubernetes
0 likes · 21 min read
Alibaba Cloud’s Guide to Stable Large‑Scale Kubernetes After OpenAI Crash
Alibaba Cloud Observability
Alibaba Cloud Observability
Jan 6, 2025 · Operations

How Synthetic Monitoring Boosts Network Reliability and User Experience

This article explains the importance of network stability, outlines major real‑world outages, and introduces synthetic monitoring—its functions, advantages, disadvantages, and various types such as protocol, browser, and internal monitoring—while comparing probe point categories and guiding enterprises on selecting the right strategy to improve service reliability and performance.

Network ReliabilitySynthetic Monitoringobservability
0 likes · 12 min read
How Synthetic Monitoring Boosts Network Reliability and User Experience
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 30, 2024 · Operations

Alibaba Cloud’s Mint Tracing Framework and FAMOS Diagnosis Earn Top‑Conference Spot

Alibaba Cloud’s recent research breakthroughs—Mint, a cost‑efficient tracing framework that captures all request flows while drastically cutting storage and network overhead, and FAMOS, a multi‑modal fault‑diagnosis method for microservice systems—have been accepted to the prestigious ASPLOS and ICSE conferences, marking the first top‑conference publications in observability for the company.

Fault DiagnosisTracingcloud computing
0 likes · 6 min read
Alibaba Cloud’s Mint Tracing Framework and FAMOS Diagnosis Earn Top‑Conference Spot
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 30, 2024 · Cloud Native

What Caused OpenAI’s Global Outage? Lessons for Cloud‑Native Observability

The article analyzes the December 11 OpenAI outage, revealing that a newly deployed telemetry service overloaded Kubernetes API servers, breaking DNS resolution and slowing recovery, and compares OpenAI’s approach with LoongCollector/iLogtail’s design to offer stability insights for cloud‑native environments.

API ServerKubernetesOpenAI outage
0 likes · 15 min read
What Caused OpenAI’s Global Outage? Lessons for Cloud‑Native Observability
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 24, 2024 · Operations

How to Achieve Full Observability for Go Apps Without Intrusive Agents

This article compares three Go observability solutions—SDK instrumentation, eBPF‑based monitoring, and compile‑time code injection—explaining their mechanisms, open‑source implementations, trade‑offs, and why Alibaba Cloud's Instgo compile‑time approach offers a low‑overhead, non‑intrusive APM alternative.

GoInstrumentationOpenTelemetry
0 likes · 11 min read
How to Achieve Full Observability for Go Apps Without Intrusive Agents
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 24, 2024 · Cloud Native

How the New SLS SQL Engine Boosts Big Data Queries by Up to 10×

Alibaba Cloud’s SLS SQL engine has been completely rebuilt, leveraging C++ SIMD, compute‑storage fusion, fine‑grained parallel pipelines, and advanced caching, delivering up to three‑fold raw performance gains, halving latency, and dramatically accelerating high‑cardinality, incremental, and join queries across trillion‑row log datasets.

Log Analytics
0 likes · 12 min read
How the New SLS SQL Engine Boosts Big Data Queries by Up to 10×