Tagged articles
47 articles
Page 1 of 1
Raymond Ops
Raymond Ops
Mar 2, 2026 · Cloud Native

ELK vs EFK vs Loki: 2025’s Best Log Solution for Cost, Performance & Simplicity

This comprehensive 2025 guide compares ELK, EFK, and Loki across architecture, deployment complexity, storage cost, query performance, feature completeness, high‑availability, and real‑world case studies, helping teams of any size choose the most cost‑effective and operationally suitable log collection stack.

EFKELKLoki
0 likes · 37 min read
ELK vs EFK vs Loki: 2025’s Best Log Solution for Cost, Performance & Simplicity
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 29, 2025 · Cloud Native

How to Seamlessly Import Massive S3 Logs into Alibaba Cloud SLS with Real‑Time Analysis

This article explains how to centralize and analyze massive multi‑cloud log data stored in object storage by moving AWS S3 logs into Alibaba Cloud Log Service (SLS) using dual‑mode file discovery, SQS event‑driven import, elastic scaling, and pre‑ingestion processing to achieve low latency, high reliability, and cost efficiency.

AWS S3Real-time Processingalibaba-sls
0 likes · 12 min read
How to Seamlessly Import Massive S3 Logs into Alibaba Cloud SLS with Real‑Time Analysis
Ops Community
Ops Community
Sep 8, 2025 · Operations

Mastering Distributed Log Architecture: From Flume to ELK and Beyond

This comprehensive guide walks you through the challenges of large‑scale log collection, real‑time processing, storage optimization, and visualization, detailing practical configurations for Flume, Logstash, Elasticsearch, Kibana, Filebeat, Kafka, Kubernetes, and future AIOps integrations to build a reliable, cost‑effective distributed logging system.

ELKFlumeKafka
0 likes · 24 min read
Mastering Distributed Log Architecture: From Flume to ELK and Beyond
Liangxu Linux
Liangxu Linux
Jun 10, 2025 · Cloud Native

Why Loki Is the Ideal Cloud‑Native Log Aggregator for Prometheus & Grafana

Loki, an open‑source log aggregation system from Grafana Labs, integrates tightly with Prometheus and Grafana, stores logs efficiently using object storage, offers a simple label‑based model, and provides cost‑effective, high‑performance logging for cloud‑native environments while outlining its architecture, usage, configuration, advantages, limitations, and retention policies.

Cloud NativeGrafanaLoki
0 likes · 10 min read
Why Loki Is the Ideal Cloud‑Native Log Aggregator for Prometheus & Grafana
Top Architect
Top Architect
Sep 14, 2024 · Operations

Centralized Log Collection with Filebeat and Graylog: Installation, Configuration, and Usage

This article explains why centralized log collection is essential for multi‑environment services, introduces Graylog as a lightweight alternative to ELK, details Filebeat's role and workflow, provides configuration examples, shows how to deploy both Filebeat and Graylog via Docker or packages, and demonstrates integration with Spring Boot and log search techniques.

DockerELKFilebeat
0 likes · 20 min read
Centralized Log Collection with Filebeat and Graylog: Installation, Configuration, and Usage
ITPUB
ITPUB
Dec 24, 2023 · Backend Development

Why Kafka Is the Backbone of Modern Messaging, Streaming, and Data Pipelines

This article explains how Kafka serves as a high‑throughput, durable messaging system, a reliable storage layer, a log‑aggregation hub, a stream‑processing engine, and a core component for CDC, system migration, monitoring, and event‑sourcing architectures.

CDCEvent SourcingKafka
0 likes · 9 min read
Why Kafka Is the Backbone of Modern Messaging, Streaming, and Data Pipelines
Top Architect
Top Architect
Sep 20, 2023 · Operations

Design and Implementation of a Distributed Log Service: Tianyan vs ELK

This article examines the challenges of building a high‑performance log service for distributed systems, compares the traditional ELK stack with the Tianyan platform, details Tianyan's architecture—including ingest, storage, and consumer components, SDK and Minos collection methods, high‑throughput transmission with Disruptor and Bigpipe, log retrieval, resource isolation, dynamic cleaning, and best‑practice recommendations.

BigpipeDisruptorDistributed Systems
0 likes · 27 min read
Design and Implementation of a Distributed Log Service: Tianyan vs ELK
Architect
Architect
Sep 19, 2023 · Big Data

How Tianyan Beats ELK: Inside a High‑Performance Distributed Log Service

This article analyzes the challenges of logging in distributed services, compares the traditional ELK stack with Baidu's Tianyan platform, and details Tianyan's architecture, data collection, high‑throughput transmission, storage, retrieval, resource isolation, dynamic cleanup, and best‑practice recommendations, complete with code examples and performance insights.

Big DataDistributed SystemsELK
0 likes · 30 min read
How Tianyan Beats ELK: Inside a High‑Performance Distributed Log Service
Liangxu Linux
Liangxu Linux
Aug 29, 2023 · Cloud Native

Master Real-Time Multi-Pod Log Viewing in Kubernetes with Kubetail & Stern

This guide introduces two lightweight Kubernetes log‑tailing tools, Kubetail and Stern, explains their installation on various platforms, demonstrates common usage patterns and command‑line options, and provides practical examples for aggregating and filtering logs across multiple pods and containers.

Cloud NativeDevOpsKubernetes
0 likes · 10 min read
Master Real-Time Multi-Pod Log Viewing in Kubernetes with Kubetail & Stern
MaGe Linux Operations
MaGe Linux Operations
Jul 25, 2023 · Cloud Native

Why Choose Loki for Cloud‑Native Log Management? A Complete Deployment Guide

This article explains why Loki is a lightweight, cloud‑native log aggregation solution, outlines its advantages and supported storage backends, compares log collectors, details Loki's indexing and query mechanisms, and provides step‑by‑step instructions for deploying Loki in Kubernetes with all‑in‑one, read/write, and microservice modes.

Cloud Native MonitoringGrafanaKubernetes
0 likes · 15 min read
Why Choose Loki for Cloud‑Native Log Management? A Complete Deployment Guide
Qunar Tech Salon
Qunar Tech Salon
Apr 19, 2023 · Operations

Heimdall Exception Statistics System: Architecture, Implementation, and Practice

This article describes the design, implementation, and evolution of Heimdall, an exception‑statistics platform built on Kafka, Flink, and HBase that provides minute‑level anomaly aggregation, stack trace querying, and integration with release and alerting workflows to improve service reliability across thousands of micro‑services.

Exception MonitoringKafkalog aggregation
0 likes · 14 min read
Heimdall Exception Statistics System: Architecture, Implementation, and Practice
Architecture Digest
Architecture Digest
Apr 10, 2023 · Operations

Comparison of Common Log Management Tools: Filebeat, Graylog, LogDNA, ELK, Loki, Datadog, Logstash, Fluentd, and Splunk

This article provides a detailed comparison of nine popular log management solutions—Filebeat, Graylog, LogDNA, ELK Stack, Grafana Loki, Datadog, Logstash, Fluentd, and Splunk—covering their core features, pricing models, advantages, and drawbacks to help readers choose the right tool for centralized logging.

ELKLog Managementcloud
0 likes · 13 min read
Comparison of Common Log Management Tools: Filebeat, Graylog, LogDNA, ELK, Loki, Datadog, Logstash, Fluentd, and Splunk
Efficient Ops
Efficient Ops
Mar 19, 2023 · Cloud Native

Master Real-Time Multi-Pod Logging in Kubernetes with Kubetail & Stern

This guide introduces two lightweight Kubernetes log‑tailing utilities, Kubetail and Stern, explaining their installation on various platforms, core command‑line options, and practical usage examples for aggregating and color‑coding logs from multiple pods and containers, offering a simpler alternative to heavyweight logging stacks.

CLIKuberneteskubetail
0 likes · 10 min read
Master Real-Time Multi-Pod Logging in Kubernetes with Kubetail & Stern
High Availability Architecture
High Availability Architecture
Nov 7, 2022 · Backend Development

Design and Implementation of Meituan's Logan Real-Time Log System

This article describes how Meituan built Logan, a high‑performance, end‑to‑end real‑time logging platform for mobile, web, mini‑programs and IoT, covering its background, architecture, data collection, processing, consumption, monitoring, deployment strategies, achieved results and future roadmap.

Backend ArchitectureElasticsearchFlink
0 likes · 15 min read
Design and Implementation of Meituan's Logan Real-Time Log System
Programmer DD
Programmer DD
May 16, 2022 · Cloud Native

Master Loki: Scalable Log Aggregation for Kubernetes and Prometheus

This guide introduces Loki, the open‑source, horizontally scalable log aggregation system optimized for Prometheus and Kubernetes, covering its core concepts, architecture, components, deployment steps, Grafana integration, label‑based indexing, and best practices for handling dynamic and high‑cardinality tags.

GrafanaKubernetesLoki
0 likes · 19 min read
Master Loki: Scalable Log Aggregation for Kubernetes and Prometheus
Efficient Ops
Efficient Ops
Apr 27, 2022 · Operations

Why Choose Loki Over ELK? A Practical Guide to Scalable Log Aggregation

This article explains the motivations for selecting Grafana Loki instead of traditional ELK/EFK stacks, introduces Loki's core concepts and architecture, details component roles, provides step‑by‑step deployment of Promtail and Loki, and demonstrates how to configure and query logs in Grafana while addressing label indexing, dynamic tags, high‑cardinality challenges, and query performance.

GrafanaKubernetesLoki
0 likes · 18 min read
Why Choose Loki Over ELK? A Practical Guide to Scalable Log Aggregation
IT Architects Alliance
IT Architects Alliance
Nov 11, 2021 · Operations

Design and Implementation of a TB‑Scale Log Monitoring System Using the ELK Stack

This article explains how to build a terabyte‑level log monitoring platform for micro‑service environments by unifying log collection with FileBeat, enriching observability through Elastic APM, processing streams via Kafka Streams, and visualizing metrics with Grafana and Kibana, while addressing cost‑effective filtering and retention strategies.

ELK StackGrafanaLog Monitoring
0 likes · 8 min read
Design and Implementation of a TB‑Scale Log Monitoring System Using the ELK Stack
Efficient Ops
Efficient Ops
Sep 15, 2021 · Cloud Native

Why Loki Beats ELK for Cloud‑Native Log Management

This article explains the motivations behind choosing Grafana Loki over traditional ELK/EFK stacks for container‑cloud logging, detailing its lightweight design, cost advantages, simple architecture, and how its components—Distributor, Ingester, and Querier—work together to provide scalable, efficient log aggregation and querying.

LokiPrometheuslog aggregation
0 likes · 8 min read
Why Loki Beats ELK for Cloud‑Native Log Management
Code Ape Tech Column
Code Ape Tech Column
Jul 27, 2021 · Cloud Native

Understanding Loki: Advantages, Architecture, Installation, and Query Practices

This article explains Loki's low‑index overhead, concurrent query handling, tag‑based indexing, component roles, read/write paths, step‑by‑step installation of Promtail and Loki, label matching techniques, dynamic‑tag handling, high‑cardinality concerns, and query optimization strategies for cloud‑native log aggregation.

Cloud NativeLokiPromtail
0 likes · 13 min read
Understanding Loki: Advantages, Architecture, Installation, and Query Practices
Yuewen Technology
Yuewen Technology
Jul 16, 2021 · Operations

Mastering Log Aggregation: From LogID Generation to Powerful Analysis Tools

This article explores the challenges of log aggregation in micro‑service architectures, introduces a globally unique log identifier (logid) with its required properties, compares various logid generation schemes, and presents end‑to‑end solutions for log distribution, aggregation, and analysis using custom tools such as ylog and watcher.

Distributed Systemslog aggregationlog analysis
0 likes · 26 min read
Mastering Log Aggregation: From LogID Generation to Powerful Analysis Tools
Programmer DD
Programmer DD
Jul 1, 2021 · Operations

Why Loki Beats Elasticsearch: Low Index Overhead, Fast Queries, and Easy Setup

This article explains Loki's advantages over Elasticsearch, including low indexing overhead, concurrent query processing with caching, seamless integration with Prometheus and Grafana, detailed architecture components, installation steps, label handling, high‑cardinality challenges, and best practices for efficient log management.

ElasticsearchGrafanaLoki
0 likes · 15 min read
Why Loki Beats Elasticsearch: Low Index Overhead, Fast Queries, and Easy Setup
Java Architecture Diary
Java Architecture Diary
Apr 19, 2021 · Operations

Why Loki Is the Lightweight, Scalable Log Solution You Need Over EFK

This article introduces Loki, Grafana’s lightweight, horizontally scalable log aggregation system, compares it with the EFK stack, explains Promtail, LogQL query language, alerting, and how Loki integrates with Grafana and Prometheus for unified metrics and logs, highlighting its low‑resource, cloud‑native advantages.

Cloud NativeLokilog aggregation
0 likes · 8 min read
Why Loki Is the Lightweight, Scalable Log Solution You Need Over EFK
Programmer DD
Programmer DD
Mar 28, 2021 · Big Data

Mastering Apache Flume: Architecture, Components, and Key Features

This article provides a comprehensive overview of Apache Flume, detailing its purpose as a distributed log aggregation system, explaining its core components such as sources, channels, and sinks, and illustrating its architecture, multi‑agent setups, and key features like reliability, scalability, compression, and monitoring.

Flumedata ingestionlog aggregation
0 likes · 9 min read
Mastering Apache Flume: Architecture, Components, and Key Features
JD Cloud Developers
JD Cloud Developers
Dec 17, 2020 · Backend Development

How Loki Cuts Log Storage Costs While Integrating Deeply with Prometheus

This article explains Loki's origins, data model, LogQL query language, low‑cost storage design, and the full read‑write architecture—including Distributor, Ingester, Querier, and QueryFrontend—showing how it solves the shortcomings of traditional Elasticsearch‑based logging solutions and integrates tightly with Prometheus monitoring.

LogQLLokiPrometheus
0 likes · 21 min read
How Loki Cuts Log Storage Costs While Integrating Deeply with Prometheus
Programmer DD
Programmer DD
Nov 7, 2020 · Operations

Loki 2.0.0 Unveiled: Transforming Log Observability for Kubernetes

Loki 2.0.0 introduces major enhancements such as a revamped LogQL pipeline, native Prometheus‑style alerts, and simplified storage with boltdb‑shipper, delivering a more resource‑efficient, scalable log aggregation solution for Kubernetes environments.

KubernetesLogQLLoki
0 likes · 3 min read
Loki 2.0.0 Unveiled: Transforming Log Observability for Kubernetes
Java Backend Technology
Java Backend Technology
Jul 5, 2020 · Cloud Native

Why Loki Beats ELK for Cloud‑Native Log Management: Architecture and Benefits

This article explains the motivations behind choosing Loki over traditional ELK/EFK stacks for container‑cloud logging, outlines its cost‑effective design, describes its simple architecture and components such as Distributor, Ingester, and Querier, and highlights its scalability and seamless integration with Prometheus.

ELK alternativeLokicloud-native
0 likes · 8 min read
Why Loki Beats ELK for Cloud‑Native Log Management: Architecture and Benefits
Efficient Ops
Efficient Ops
Apr 2, 2018 · Operations

How Bilibili Revamped Its Monitoring Architecture: From Zabbix to Dapper

An in‑depth look at Bilibili’s multi‑layer monitoring overhaul, detailing the shift from a monolithic Zabbix setup to micro‑service‑based ELK, Dapper, Misaka, Traceon and Lancer systems, and how layered observability improves fault detection across business, application, and infrastructure levels.

Distributed TracingMicroservicesOperations
0 likes · 10 min read
How Bilibili Revamped Its Monitoring Architecture: From Zabbix to Dapper
Meituan Technology Team
Meituan Technology Team
Jan 12, 2018 · Backend Development

Design and Implementation of Meituan Hotel Full-Chain Log and Trace System

To cope with Meituan Hotel’s exploding micro‑service complexity, the infrastructure team built the Satellite System—combining MTrace and a selective, zero‑intrusion Log4j2‑based logging pipeline that streams enriched logs through Kafka, Storm, Redis and Elasticsearch, delivering second‑level trace‑log queries and six‑month retention, dramatically speeding up debugging.

Distributed TracingElasticsearchKafka
0 likes · 11 min read
Design and Implementation of Meituan Hotel Full-Chain Log and Trace System
21CTO
21CTO
Dec 3, 2017 · Operations

Mastering ELK: Choose the Right Log Architecture and Solve Common Issues

This article explains the core components of the ELK stack, compares three typical deployment architectures, and provides practical solutions for multiline log merging, timestamp correction, and module‑based log filtering using Filebeat, Logstash, Kafka, and Kibana.

ELKElasticsearchFilebeat
0 likes · 10 min read
Mastering ELK: Choose the Right Log Architecture and Solve Common Issues
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Dec 8, 2016 · Operations

How CAT Enables Scalable Real‑Time Monitoring for Distributed Systems

This article introduces CAT, an open‑source Java‑based distributed real‑time monitoring platform, detailing its design goals, architecture, message processing pipeline, logging instrumentation, API, real‑time analysis, report modeling, storage challenges, and key takeaways for building highly available, scalable monitoring solutions.

Distributed MonitoringOperationsSystem Architecture
0 likes · 13 min read
How CAT Enables Scalable Real‑Time Monitoring for Distributed Systems
dbaplus Community
dbaplus Community
Sep 12, 2016 · Big Data

Apache Flume Quickstart: Log Collection and Kafka Integration

This article introduces Apache Flume, explains its design goals of reliability, scalability, manageability and extensibility, outlines core concepts and architecture, provides step‑by‑step configuration using the first mode, demonstrates integration with Zookeeper, Kafka and a shell script, and shows how to launch and verify the agent.

Apache FlumeBig DataKafka Integration
0 likes · 7 min read
Apache Flume Quickstart: Log Collection and Kafka Integration
21CTO
21CTO
May 16, 2016 · Operations

How to Centralize Logs from Dockerized Services Using Flume and Kafka

This article explains a practical architecture for aggregating logs from distributed Docker containers by employing Flume NG as a lightweight log collector, Kafka as a high‑throughput message bus, and custom sinks to store logs per service, module and day with low latency and minimal resource impact.

DockerFlumeKafka
0 likes · 17 min read
How to Centralize Logs from Dockerized Services Using Flume and Kafka
21CTO
21CTO
Mar 17, 2016 · Operations

How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly

This article explains Vipshop’s multi‑layer monitoring architecture, detailing system‑level metrics, application‑level tracing with the Mercury platform, and business‑level KPI dashboards, while describing the data pipelines that collect, process, and alert on distributed logs to ensure reliable operations.

Distributed SystemsOperationsVipshop
0 likes · 4 min read
How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly
Java High-Performance Architecture
Java High-Performance Architecture
Mar 16, 2016 · Operations

How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly

Vipshop’s three‑tier monitoring system—covering system, application (Mercury), and business layers—collects and analyzes logs from distributed components, providing real‑time metrics, slow‑call detection, error tracing, and configurable alerts to help engineers quickly pinpoint and resolve performance issues.

APMAlertingDistributed Systems
0 likes · 4 min read
How Vipshop’s Three‑Tier Monitoring System Keeps Services Running Smoothly