Tagged articles

log-aggregation

49 articles · Page 1 of 1

Jun 19, 2026 · Operations

Loki + Promtail: A Lightweight, Cost‑Effective Alternative to ELK for Log Management

Loki + Promtail provides a lightweight log aggregation solution that indexes only labels, cutting storage and memory usage to about one‑fifth of ELK, and the article walks through deployment, configuration, best‑practice label design, multi‑tenant setup, performance tuning, and real‑world case studies.

Cloud Nativelog-aggregationloki

0 likes · 40 min read

Loki + Promtail: A Lightweight, Cost‑Effective Alternative to ELK for Log Management

Linux Cloud-Native Ops Stack

Apr 16, 2026 · Operations

Step-by-Step Loki Binary Deployment on Linux (CentOS/Ubuntu)

This guide walks through preparing a Linux host, downloading Loki v3.6.10, configuring its YAML file, setting up a systemd service, deploying Promtail for log collection, and integrating Grafana for visualization, all using a unified /app/soft directory structure.

CentOSGrafanaLinux

0 likes · 10 min read

Step-by-Step Loki Binary Deployment on Linux (CentOS/Ubuntu)

Raymond Ops

Mar 2, 2026 · Cloud Native

ELK vs EFK vs Loki: 2025’s Best Log Solution for Cost, Performance & Simplicity

This comprehensive 2025 guide compares ELK, EFK, and Loki across architecture, deployment complexity, storage cost, query performance, feature completeness, high‑availability, and real‑world case studies, helping teams of any size choose the most cost‑effective and operationally suitable log collection stack.

EFKELKlog-aggregation

0 likes · 37 min read

ELK vs EFK vs Loki: 2025’s Best Log Solution for Cost, Performance & Simplicity

Alibaba Cloud Observability

Dec 29, 2025 · Cloud Native

How to Seamlessly Import Massive S3 Logs into Alibaba Cloud SLS with Real‑Time Analysis

This article explains how to centralize and analyze massive multi‑cloud log data stored in object storage by moving AWS S3 logs into Alibaba Cloud Log Service (SLS) using dual‑mode file discovery, SQS event‑driven import, elastic scaling, and pre‑ingestion processing to achieve low latency, high reliability, and cost efficiency.

AWS S3Elastic ScalingReal-time Processing

0 likes · 12 min read

How to Seamlessly Import Massive S3 Logs into Alibaba Cloud SLS with Real‑Time Analysis

Ops Community

Sep 8, 2025 · Operations

Mastering Distributed Log Architecture: From Flume to ELK and Beyond

This comprehensive guide walks you through the challenges of large‑scale log collection, real‑time processing, storage optimization, and visualization, detailing practical configurations for Flume, Logstash, Elasticsearch, Kibana, Filebeat, Kafka, Kubernetes, and future AIOps integrations to build a reliable, cost‑effective distributed logging system.

ELKFlumeKafka

0 likes · 24 min read

Mastering Distributed Log Architecture: From Flume to ELK and Beyond

Architecture Digest

Jul 10, 2025 · Operations

How to Deploy Graylog with Docker and Integrate It into Spring Boot

This guide explains why log aggregation is needed in micro‑service environments, shows how to set up Graylog using Docker‑Compose, configure its inputs, and integrate it with a Spring Boot application via Logback‑GELF, then demonstrates basic log search queries.

DockerGraylogLogging

0 likes · 8 min read

How to Deploy Graylog with Docker and Integrate It into Spring Boot

Code Ape Tech Column

Jul 7, 2025 · Operations

How to Deploy Graylog with Docker and Integrate It into Spring Boot

Learn step‑by‑step how to set up Graylog using Docker‑Compose, configure its inputs, and connect a Spring Boot application via Logback‑GELF, enabling centralized log aggregation and searchable queries across multiple service instances in a micro‑services environment.

DockerGraylogLogback

0 likes · 8 min read

Liangxu Linux

Jun 10, 2025 · Cloud Native

Why Loki Is the Ideal Cloud‑Native Log Aggregator for Prometheus & Grafana

Loki, an open‑source log aggregation system from Grafana Labs, integrates tightly with Prometheus and Grafana, stores logs efficiently using object storage, offers a simple label‑based model, and provides cost‑effective, high‑performance logging for cloud‑native environments while outlining its architecture, usage, configuration, advantages, limitations, and retention policies.

Cloud NativeGrafanaObservability

0 likes · 10 min read

Why Loki Is the Ideal Cloud‑Native Log Aggregator for Prometheus & Grafana

Top Architect

May 8, 2025 · Operations

Centralized Log Collection with Filebeat and Graylog: Configuration, Deployment, and Integration Guide

This article explains how to use Filebeat for log shipping, configure its YAML files, deploy Graylog with Docker and Elasticsearch, and integrate logging into Spring Boot applications, providing step‑by‑step commands, code examples, and best‑practice recommendations for centralized log management.

DockerElasticsearchGraylog

0 likes · 20 min read

Centralized Log Collection with Filebeat and Graylog: Configuration, Deployment, and Integration Guide

Liangxu Linux

May 7, 2025 · Operations

How to Install and Configure Loki, Promtail, and Grafana for Log Aggregation on Rocky Linux

This step‑by‑step guide shows how to install Loki, configure its YAML file, set up Promtail to ship logs, install Grafana, add Loki as a data source, and use LogQL to query logs—including collecting Nginx JSON logs—on a Rocky Linux system.

GrafanaLogQLObservability

0 likes · 10 min read

How to Install and Configure Loki, Promtail, and Grafana for Log Aggregation on Rocky Linux

Top Architect

Sep 14, 2024 · Operations

Centralized Log Collection with Filebeat and Graylog: Installation, Configuration, and Usage

This article explains why centralized log collection is essential for multi‑environment services, introduces Graylog as a lightweight alternative to ELK, details Filebeat's role and workflow, provides configuration examples, shows how to deploy both Filebeat and Graylog via Docker or packages, and demonstrates integration with Spring Boot and log search techniques.

DockerELKGraylog

0 likes · 20 min read

Centralized Log Collection with Filebeat and Graylog: Installation, Configuration, and Usage

Code Ape Tech Column

Apr 21, 2024 · Cloud Native

Deploying Graylog with Docker‑Compose and Integrating It into a Spring Boot Application

This tutorial explains how to deploy Graylog using Docker‑Compose for centralized log aggregation in a microservices environment and shows step‑by‑step integration of Graylog with a Spring Boot application via Logback‑GELF, including configuration, code examples, and basic log search queries.

Docker ComposeGELFGraylog

0 likes · 8 min read

Deploying Graylog with Docker‑Compose and Integrating It into a Spring Boot Application

ITPUB

Dec 24, 2023 · Backend Development

Why Kafka Is the Backbone of Modern Messaging, Streaming, and Data Pipelines

This article explains how Kafka serves as a high‑throughput, durable messaging system, a reliable storage layer, a log‑aggregation hub, a stream‑processing engine, and a core component for CDC, system migration, monitoring, and event‑sourcing architectures.

CDCEvent SourcingKafka

0 likes · 9 min read

Why Kafka Is the Backbone of Modern Messaging, Streaming, and Data Pipelines

Top Architect

Sep 20, 2023 · Operations

Design and Implementation of a Distributed Log Service: Tianyan vs ELK

This article examines the challenges of building a high‑performance log service for distributed systems, compares the traditional ELK stack with the Tianyan platform, details Tianyan's architecture—including ingest, storage, and consumer components, SDK and Minos collection methods, high‑throughput transmission with Disruptor and Bigpipe, log retrieval, resource isolation, dynamic cleaning, and best‑practice recommendations.

BigpipeDisruptorELK

0 likes · 27 min read

Design and Implementation of a Distributed Log Service: Tianyan vs ELK

Architect

Sep 19, 2023 · Big Data

How Tianyan Beats ELK: Inside a High‑Performance Distributed Log Service

This article analyzes the challenges of logging in distributed services, compares the traditional ELK stack with Baidu's Tianyan platform, and details Tianyan's architecture, data collection, high‑throughput transmission, storage, retrieval, resource isolation, dynamic cleanup, and best‑practice recommendations, complete with code examples and performance insights.

Big DataELKElasticsearch

0 likes · 30 min read

How Tianyan Beats ELK: Inside a High‑Performance Distributed Log Service

Liangxu Linux

Aug 29, 2023 · Cloud Native

Master Real-Time Multi-Pod Log Viewing in Kubernetes with Kubetail & Stern

This guide introduces two lightweight Kubernetes log‑tailing tools, Kubetail and Stern, explains their installation on various platforms, demonstrates common usage patterns and command‑line options, and provides practical examples for aggregating and filtering logs across multiple pods and containers.

Cloud NativeKubernetesdevops

0 likes · 10 min read

Master Real-Time Multi-Pod Log Viewing in Kubernetes with Kubetail & Stern

MaGe Linux Operations

Jul 25, 2023 · Cloud Native

Why Choose Loki for Cloud‑Native Log Management? A Complete Deployment Guide

This article explains why Loki is a lightweight, cloud‑native log aggregation solution, outlines its advantages and supported storage backends, compares log collectors, details Loki's indexing and query mechanisms, and provides step‑by‑step instructions for deploying Loki in Kubernetes with all‑in‑one, read/write, and microservice modes.

Cloud Native MonitoringGrafanaKubernetes

0 likes · 15 min read

Why Choose Loki for Cloud‑Native Log Management? A Complete Deployment Guide

Qunar Tech Salon

Apr 19, 2023 · Operations

Heimdall Exception Statistics System: Architecture, Implementation, and Practice

This article describes the design, implementation, and evolution of Heimdall, an exception‑statistics platform built on Kafka, Flink, and HBase that provides minute‑level anomaly aggregation, stack trace querying, and integration with release and alerting workflows to improve service reliability across thousands of micro‑services.

Exception MonitoringKafkaObservability

0 likes · 14 min read

Heimdall Exception Statistics System: Architecture, Implementation, and Practice

Architecture Digest

Apr 10, 2023 · Operations

Comparison of Common Log Management Tools: Filebeat, Graylog, LogDNA, ELK, Loki, Datadog, Logstash, Fluentd, and Splunk

This article provides a detailed comparison of nine popular log management solutions—Filebeat, Graylog, LogDNA, ELK Stack, Grafana Loki, Datadog, Logstash, Fluentd, and Splunk—covering their core features, pricing models, advantages, and drawbacks to help readers choose the right tool for centralized logging.

CloudELKlog management

0 likes · 13 min read

Comparison of Common Log Management Tools: Filebeat, Graylog, LogDNA, ELK, Loki, Datadog, Logstash, Fluentd, and Splunk

Efficient Ops

Mar 19, 2023 · Cloud Native

Master Real-Time Multi-Pod Logging in Kubernetes with Kubetail & Stern

This guide introduces two lightweight Kubernetes log‑tailing utilities, Kubetail and Stern, explaining their installation on various platforms, core command‑line options, and practical usage examples for aggregating and color‑coding logs from multiple pods and containers, offering a simpler alternative to heavyweight logging stacks.

CLIKuberneteskubetail

0 likes · 10 min read

Master Real-Time Multi-Pod Logging in Kubernetes with Kubetail & Stern

High Availability Architecture

Nov 7, 2022 · Backend Development

Design and Implementation of Meituan's Logan Real-Time Log System

This article describes how Meituan built Logan, a high‑performance, end‑to‑end real‑time logging platform for mobile, web, mini‑programs and IoT, covering its background, architecture, data collection, processing, consumption, monitoring, deployment strategies, achieved results and future roadmap.

ElasticsearchFlinkKafka

0 likes · 15 min read

Design and Implementation of Meituan's Logan Real-Time Log System

Code Ape Tech Column

Oct 12, 2022 · Operations

Deploying Graylog with Docker Compose and Integrating it into a Spring Boot Application

This tutorial explains how to set up Graylog for centralized log aggregation using Docker Compose, configure its inputs, and integrate a Spring Boot application with Graylog via Logback GELF, including search syntax examples for effective log analysis.

DockerGELFGraylog

0 likes · 7 min read

Architecture Digest

May 20, 2022 · Cloud Native

Introduction, Architecture, Deployment and Usage of Grafana Loki Log Aggregation System

This article introduces Grafana Loki, an open‑source, horizontally scalable, highly available log aggregation system optimized for Kubernetes and Prometheus, covering its core concepts, architecture, component roles, deployment steps, configuration examples, and practical usage within Grafana.

GrafanaKubernetesObservability

0 likes · 18 min read

Introduction, Architecture, Deployment and Usage of Grafana Loki Log Aggregation System

Programmer DD

May 16, 2022 · Cloud Native

Master Loki: Scalable Log Aggregation for Kubernetes and Prometheus

This guide introduces Loki, the open‑source, horizontally scalable log aggregation system optimized for Prometheus and Kubernetes, covering its core concepts, architecture, components, deployment steps, Grafana integration, label‑based indexing, and best practices for handling dynamic and high‑cardinality tags.

GrafanaKubernetesObservability

0 likes · 19 min read

Master Loki: Scalable Log Aggregation for Kubernetes and Prometheus

Efficient Ops

Apr 27, 2022 · Operations

Why Choose Loki Over ELK? A Practical Guide to Scalable Log Aggregation

This article explains the motivations for selecting Grafana Loki instead of traditional ELK/EFK stacks, introduces Loki's core concepts and architecture, details component roles, provides step‑by‑step deployment of Promtail and Loki, and demonstrates how to configure and query logs in Grafana while addressing label indexing, dynamic tags, high‑cardinality challenges, and query performance.

GrafanaKubernetesObservability

0 likes · 18 min read

Why Choose Loki Over ELK? A Practical Guide to Scalable Log Aggregation

IT Architects Alliance

Nov 11, 2021 · Operations

Design and Implementation of a TB‑Scale Log Monitoring System Using the ELK Stack

This article explains how to build a terabyte‑level log monitoring platform for micro‑service environments by unifying log collection with FileBeat, enriching observability through Elastic APM, processing streams via Kafka Streams, and visualizing metrics with Grafana and Kibana, while addressing cost‑effective filtering and retention strategies.

ELK StackGrafanaLog Monitoring

0 likes · 8 min read

Design and Implementation of a TB‑Scale Log Monitoring System Using the ELK Stack

Ops Development Stories

Sep 23, 2021 · Operations

Deploy a Production-Ready Loki Cluster with S3 Storage and Redis Cache

This guide walks you through setting up a Loki logging cluster for production, covering the native architecture, key configuration differences, storage with boltdb‑shipper on S3, Redis caching, ruler setup, and adapting the Docker‑Compose deployment to Kubernetes.

ConfigurationKubernetesS3

0 likes · 9 min read

Deploy a Production-Ready Loki Cluster with S3 Storage and Redis Cache

Efficient Ops

Sep 15, 2021 · Cloud Native

Why Loki Beats ELK for Cloud‑Native Log Management

This article explains the motivations behind choosing Grafana Loki over traditional ELK/EFK stacks for container‑cloud logging, detailing its lightweight design, cost advantages, simple architecture, and how its components—Distributor, Ingester, and Querier—work together to provide scalable, efficient log aggregation and querying.

Prometheuslog-aggregationloki

0 likes · 8 min read

Why Loki Beats ELK for Cloud‑Native Log Management

Code Ape Tech Column

Jul 27, 2021 · Cloud Native

Understanding Loki: Advantages, Architecture, Installation, and Query Practices

This article explains Loki's low‑index overhead, concurrent query handling, tag‑based indexing, component roles, read/write paths, step‑by‑step installation of Promtail and Loki, label matching techniques, dynamic‑tag handling, high‑cardinality concerns, and query optimization strategies for cloud‑native log aggregation.

Cloud NativeObservabilitylog-aggregation

0 likes · 13 min read

Understanding Loki: Advantages, Architecture, Installation, and Query Practices

Yuewen Technology

Jul 16, 2021 · Operations

Mastering Log Aggregation: From LogID Generation to Powerful Analysis Tools

This article explores the challenges of log aggregation in micro‑service architectures, introduces a globally unique log identifier (logid) with its required properties, compares various logid generation schemes, and presents end‑to‑end solutions for log distribution, aggregation, and analysis using custom tools such as ylog and watcher.

distributed systemslog analysislog-aggregation

0 likes · 26 min read

Mastering Log Aggregation: From LogID Generation to Powerful Analysis Tools

Programmer DD

Jul 1, 2021 · Operations

Why Loki Beats Elasticsearch: Low Index Overhead, Fast Queries, and Easy Setup

This article explains Loki's advantages over Elasticsearch, including low indexing overhead, concurrent query processing with caching, seamless integration with Prometheus and Grafana, detailed architecture components, installation steps, label handling, high‑cardinality challenges, and best practices for efficient log management.

ElasticsearchGrafanaObservability

0 likes · 15 min read

Why Loki Beats Elasticsearch: Low Index Overhead, Fast Queries, and Easy Setup

Ops Development Stories

May 31, 2021 · Operations

Deploy a Production‑Ready Loki Cluster on Kubernetes with S3 Storage

This guide walks you through setting up a Loki logging cluster in production, covering native configurations, extended storage and cache settings, Kubernetes deployment, and practical code examples to simplify the process for newcomers.

KubernetesS3 storagedocker-compose

0 likes · 8 min read

Deploy a Production‑Ready Loki Cluster on Kubernetes with S3 Storage

Efficient Ops

May 9, 2021 · Big Data

How to Build a Billion-Scale ELK Log Platform with Filebeat, Kafka, and Elasticsearch

Learn step‑by‑step how to design and deploy a billion‑scale log collection and analysis platform using the ELK stack—Filebeat, Kafka, Logstash, Elasticsearch, and Kibana—covering architecture, configuration, installation, and best practices for high‑availability and performance.

ELKElasticsearchKibana

0 likes · 14 min read

How to Build a Billion-Scale ELK Log Platform with Filebeat, Kafka, and Elasticsearch

Java Architecture Diary

Apr 19, 2021 · Operations

Why Loki Is the Lightweight, Scalable Log Solution You Need Over EFK

This article introduces Loki, Grafana’s lightweight, horizontally scalable log aggregation system, compares it with the EFK stack, explains Promtail, LogQL query language, alerting, and how Loki integrates with Grafana and Prometheus for unified metrics and logs, highlighting its low‑resource, cloud‑native advantages.

Cloud NativeObservabilitylog-aggregation

0 likes · 8 min read

Why Loki Is the Lightweight, Scalable Log Solution You Need Over EFK

MaGe Linux Operations

Apr 6, 2021 · Operations

How to Deploy Loki with Docker Compose for Scalable Log Aggregation

This guide walks you through Loki's architecture, advantages, installation via Docker Compose, configuration of Promtail, and step‑by‑step commands to set up a high‑availability, multi‑tenant log aggregation system integrated with Grafana.

Docker ComposeGrafanalog-aggregation

0 likes · 7 min read

How to Deploy Loki with Docker Compose for Scalable Log Aggregation

Programmer DD

Mar 28, 2021 · Big Data

Mastering Apache Flume: Architecture, Components, and Key Features

This article provides a comprehensive overview of Apache Flume, detailing its purpose as a distributed log aggregation system, explaining its core components such as sources, channels, and sinks, and illustrating its architecture, multi‑agent setups, and key features like reliability, scalability, compression, and monitoring.

Flumedata ingestionlog-aggregation

0 likes · 9 min read

Mastering Apache Flume: Architecture, Components, and Key Features

JD Cloud Developers

Dec 17, 2020 · Backend Development

How Loki Cuts Log Storage Costs While Integrating Deeply with Prometheus

This article explains Loki's origins, data model, LogQL query language, low‑cost storage design, and the full read‑write architecture—including Distributor, Ingester, Querier, and QueryFrontend—showing how it solves the shortcomings of traditional Elasticsearch‑based logging solutions and integrates tightly with Prometheus monitoring.

LogQLObservabilityPrometheus

0 likes · 21 min read

How Loki Cuts Log Storage Costs While Integrating Deeply with Prometheus

Programmer DD

Nov 7, 2020 · Operations

Loki 2.0.0 Unveiled: Transforming Log Observability for Kubernetes

Loki 2.0.0 introduces major enhancements such as a revamped LogQL pipeline, native Prometheus‑style alerts, and simplified storage with boltdb‑shipper, delivering a more resource‑efficient, scalable log aggregation solution for Kubernetes environments.

KubernetesLogQLObservability

0 likes · 3 min read

Loki 2.0.0 Unveiled: Transforming Log Observability for Kubernetes

Java Backend Technology

Jul 5, 2020 · Cloud Native

Why Loki Beats ELK for Cloud‑Native Log Management: Architecture and Benefits

This article explains the motivations behind choosing Loki over traditional ELK/EFK stacks for container‑cloud logging, outlines its cost‑effective design, describes its simple architecture and components such as Distributor, Ingester, and Querier, and highlights its scalability and seamless integration with Prometheus.

ELK alternativeObservabilitycloud-native

0 likes · 8 min read

Why Loki Beats ELK for Cloud‑Native Log Management: Architecture and Benefits

Efficient Ops

Apr 2, 2018 · Operations

How Bilibili Revamped Its Monitoring Architecture: From Zabbix to Dapper

An in‑depth look at Bilibili’s multi‑layer monitoring overhaul, detailing the shift from a monolithic Zabbix setup to micro‑service‑based ELK, Dapper, Misaka, Traceon and Lancer systems, and how layered observability improves fault detection across business, application, and infrastructure levels.

Distributed TracingMicroservicesObservability

0 likes · 10 min read

How Bilibili Revamped Its Monitoring Architecture: From Zabbix to Dapper

Meituan Technology Team

Jan 12, 2018 · Backend Development

Design and Implementation of Meituan Hotel Full-Chain Log and Trace System

To cope with Meituan Hotel’s exploding micro‑service complexity, the infrastructure team built the Satellite System—combining MTrace and a selective, zero‑intrusion Log4j2‑based logging pipeline that streams enriched logs through Kafka, Storm, Redis and Elasticsearch, delivering second‑level trace‑log queries and six‑month retention, dramatically speeding up debugging.

Distributed TracingElasticsearchKafka

0 likes · 11 min read

Design and Implementation of Meituan Hotel Full-Chain Log and Trace System

21CTO

Dec 3, 2017 · Operations

Mastering ELK: Choose the Right Log Architecture and Solve Common Issues

This article explains the core components of the ELK stack, compares three typical deployment architectures, and provides practical solutions for multiline log merging, timestamp correction, and module‑based log filtering using Filebeat, Logstash, Kafka, and Kibana.

ELKElasticsearchKibana

0 likes · 10 min read

Mastering ELK: Choose the Right Log Architecture and Solve Common Issues

ITFLY8 Architecture Home

Dec 8, 2016 · Operations

How CAT Enables Scalable Real‑Time Monitoring for Distributed Systems

This article introduces CAT, an open‑source Java‑based distributed real‑time monitoring platform, detailing its design goals, architecture, message processing pipeline, logging instrumentation, API, real‑time analysis, report modeling, storage challenges, and key takeaways for building highly available, scalable monitoring solutions.

Distributed MonitoringOperationslog-aggregation

0 likes · 13 min read

How CAT Enables Scalable Real‑Time Monitoring for Distributed Systems

dbaplus Community

Sep 12, 2016 · Big Data

Apache Flume Quickstart: Log Collection and Kafka Integration

This article introduces Apache Flume, explains its design goals of reliability, scalability, manageability and extensibility, outlines core concepts and architecture, provides step‑by‑step configuration using the first mode, demonstrates integration with Zookeeper, Kafka and a shell script, and shows how to launch and verify the agent.

Apache FlumeBig DataKafka Integration

0 likes · 7 min read

Apache Flume Quickstart: Log Collection and Kafka Integration

MaGe Linux Operations

Aug 4, 2016 · Big Data

How Hadoop 2.0 Collects and Manages Job Logs with YARN

This article explains Hadoop 2.0's built‑in MRv2 log collection mechanism, detailing job‑run and task‑run logs, their generation steps, log aggregation, and the role of the JobHistory Server for centralized analysis.

Big DataHadoopJobHistory

0 likes · 8 min read

How Hadoop 2.0 Collects and Manages Job Logs with YARN

21CTO

May 16, 2016 · Operations

How to Centralize Logs from Dockerized Services Using Flume and Kafka

This article explains a practical architecture for aggregating logs from distributed Docker containers by employing Flume NG as a lightweight log collector, Kafka as a high‑throughput message bus, and custom sinks to store logs per service, module and day with low latency and minimal resource impact.

DockerFlumeKafka

0 likes · 17 min read

How to Centralize Logs from Dockerized Services Using Flume and Kafka

Architect

Mar 21, 2016 · Big Data

Introduction to Apache Flume: Architecture, Core Concepts, Configuration and Usage

This article provides a comprehensive overview of Apache Flume, covering its design goals, core components, deployment architecture, configuration patterns, and step‑by‑step instructions for integrating Flume with Zookeeper and Kafka to collect and forward massive log data.

Apache FlumeKafkaZookeeper

0 likes · 6 min read

Introduction to Apache Flume: Architecture, Core Concepts, Configuration and Usage

21CTO

Mar 17, 2016 · Operations

How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly

This article explains Vipshop’s multi‑layer monitoring architecture, detailing system‑level metrics, application‑level tracing with the Mercury platform, and business‑level KPI dashboards, while describing the data pipelines that collect, process, and alert on distributed logs to ensure reliable operations.

OperationsVipshopdistributed systems

0 likes · 4 min read

How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly

Java High-Performance Architecture

Mar 16, 2016 · Operations

How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly

Vipshop’s three‑tier monitoring system—covering system, application (Mercury), and business layers—collects and analyzes logs from distributed components, providing real‑time metrics, slow‑call detection, error tracing, and configurable alerts to help engineers quickly pinpoint and resolve performance issues.

APMAlertingPerformance

0 likes · 4 min read