Alibaba Cloud Observability
Author

Driving continuous progress in observability technology!

144 articles · 0 likes · 223 views · 0 comments
Recent Articles

Apr 1, 2025 · Cloud Native

How We Boosted Multi-line Log Collection Speed from 90 MB/s to 350 MB/s

This article details a real‑world case where massive multi‑line log volumes overwhelmed iLogtail, explains the performance bottlenecks caused by full‑line regex matching, describes the switch to prefix‑only matching and IngestProcessor, and shows how these changes lifted throughput from under 100 MB/s to over 300 MB/s while halving CPU usage.

Performance optimization · iLogtail · log collection
0 likes · 15 min read
Mar 24, 2025 · Artificial Intelligence

Achieving Full Observability for AI Inference Apps with Prometheus

This article explores the observability challenges of AI inference services, outlines a comprehensive Prometheus‑based metric collection strategy, and demonstrates practical monitoring implementations for Ray Serve, vLLM, GPU resources, and custom metrics to build stable, high‑performance inference pipelines.

AI inference · Prometheus · Ray Serve
0 likes · 19 min read
Mar 24, 2025 · Information Security

DeepSeek ClickHouse Leak: AI Data Risks & Cloud Native Log Service Safeguards

An exposed ClickHouse database at DeepSeek revealed over a million sensitive logs, including chats, API keys, and backend details, highlighting gaps in AI data security. Alibaba Cloud's Log Service (SLS) offers comprehensive protection through access control, data masking, fine-grained query limits, and real‑time monitoring.

AI · Log Service · observability
0 likes · 11 min read
Mar 17, 2025 · Cloud Native

How to Master LLM Observability in Cloud‑Native Environments

This article explains the unique observability challenges of large language model (LLM) applications and outlines essential performance, cost, and safety metrics. It then presents a comprehensive cloud‑native solution to ensure reliable, efficient LLM deployments, covering trace, metric, and log collection, domain‑specific dashboards, and step‑by‑step integration with Alibaba Cloud's Python Agent.

AI gateway · LLM Observability · OpenTelemetry
0 likes · 18 min read
Mar 13, 2025 · Databases

How MetricStore 2.0 Redefines Cloud‑Native Time‑Series Storage Performance

MetricStore 2.0 introduces a comprehensive overhaul of memory, file, compute, and transport layers for cloud‑native time‑series data, delivering higher compression, lower latency, multi‑tenant resource control, and support for dynamic schemas, while addressing the scalability limits of its 1.0 predecessor.

cloud-native · observability · time series
0 likes · 21 min read