Tagged articles
35 articles
Page 1 of 1
MaGe Linux Operations
MaGe Linux Operations
Feb 18, 2026 · Databases

How to Replace Prometheus Local Storage with VictoriaMetrics for High‑Performance Long‑Term Monitoring

This guide explains why Prometheus’s local TSDB struggles at scale, compares alternative remote‑storage solutions, and provides a step‑by‑step walkthrough for deploying VictoriaMetrics (single‑node or clustered), configuring remote_write, tuning performance, handling multi‑tenant use cases, and troubleshooting common issues.

PrometheusTSDBVictoriaMetrics
0 likes · 42 min read
How to Replace Prometheus Local Storage with VictoriaMetrics for High‑Performance Long‑Term Monitoring
Linux Ops Smart Journey
Linux Ops Smart Journey
Nov 5, 2025 · Cloud Native

Why Switch from Prometheus? Deploy a High‑Performance vmagent Cluster with VictoriaMetrics

This article explains the scalability limits of Prometheus, introduces vmagent as a lightweight, high‑performance collector compatible with Prometheus, and provides a step‑by‑step guide—including configuration, systemd service setup, and verification—to deploy a resilient vmagent cluster in production.

DeploymentPrometheusVictoriaMetrics
0 likes · 5 min read
Why Switch from Prometheus? Deploy a High‑Performance vmagent Cluster with VictoriaMetrics
dbaplus Community
dbaplus Community
Sep 1, 2025 · Operations

How to Keep VictoriaMetrics Stable During Sudden Metric Surges

This article outlines practical strategies for protecting VictoriaMetrics storage under bursty metric traffic, covering communication with business teams, splitting deployments, choosing single‑node versus cluster setups, key monitoring metrics, separate storage for self‑monitoring, the VMUI Explore UI, and techniques for discarding high‑cardinality metrics.

MetricsVictoriaMetricsmonitoring
0 likes · 10 min read
How to Keep VictoriaMetrics Stable During Sudden Metric Surges
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Apr 2, 2025 · Databases

How VictoriaMetrics' Distributed Architecture Scales Massive Time‑Series Data

VictoriaMetrics employs a modular, horizontally scalable architecture composed of vmagent, vminsert, vmstorage, vmselect, and vmalert, each handling data collection, ingestion, storage, querying, and alerting, while leveraging consistent hashing, LSM‑tree storage, TSID indexing, and multi‑tenant isolation to efficiently manage large‑scale time‑series workloads.

Time Series DatabaseVictoriaMetricsquery optimization
0 likes · 11 min read
How VictoriaMetrics' Distributed Architecture Scales Massive Time‑Series Data
Ops Development Stories
Ops Development Stories
Mar 19, 2025 · Cloud Native

Unified Multi‑Cluster Monitoring with KubeDoor 1.0: Alerts, Metrics & Best Practices

KubeDoor 1.0 introduces a new architecture for unified multi‑Kubernetes monitoring, offering components for master and agent, flexible deployment options, Helm‑based installation, configurable storage and alerting settings, and detailed guidance on integrating with existing Prometheus/VictoriaMetrics setups while providing automatic peak‑usage data collection.

AlertingCloud NativeKubernetes
0 likes · 14 min read
Unified Multi‑Cluster Monitoring with KubeDoor 1.0: Alerts, Metrics & Best Practices
Soul Technical Team
Soul Technical Team
Jan 24, 2025 · Operations

Migration from Thanos to VictoriaMetrics: Architecture, Plan, Issues, and Benefits

This article details the end‑to‑end migration from Thanos to VictoriaMetrics, covering background analysis, architectural comparison, a phased migration plan, encountered configuration and performance issues, resolution strategies, and the resulting performance, cost, and scalability improvements for the monitoring system.

ThanosTime SeriesVictoriaMetrics
0 likes · 16 min read
Migration from Thanos to VictoriaMetrics: Architecture, Plan, Issues, and Benefits
Efficient Ops
Efficient Ops
Dec 11, 2024 · Operations

Thanos vs VictoriaMetrics: Which Prometheus Storage Solution Wins for Scale and Cost?

This article compares Thanos and VictoriaMetrics as long‑term storage solutions for Prometheus, evaluating their architecture, write and read paths, reliability, consistency, performance, scalability, high‑availability, and hosting costs to help you choose the most suitable option for your monitoring stack.

Long‑term StorageThanosVictoriaMetrics
0 likes · 18 min read
Thanos vs VictoriaMetrics: Which Prometheus Storage Solution Wins for Scale and Cost?
dbaplus Community
dbaplus Community
Sep 23, 2024 · Operations

How Bilibili Scaled Monitoring: From Prometheus to a 2.0 VM‑Flink Architecture

Bilibili rebuilt its monitoring platform to handle explosive metric growth by separating collection, storage, and compute, adopting VictoriaMetrics, zone‑based scheduling, and Flink‑driven pre‑aggregation, which together improved stability, query performance, cloud data quality, and overall observability.

FlinkPrometheusVictoriaMetrics
0 likes · 31 min read
How Bilibili Scaled Monitoring: From Prometheus to a 2.0 VM‑Flink Architecture
Architect
Architect
Sep 12, 2024 · Operations

How Bilibili Scaled Its Monitoring: From Prometheus OOMs to VictoriaMetrics & Flink Pre‑Aggregation

The article details Bilibili's evolution of its monitoring platform, describing the stability and performance challenges of a Prometheus‑Thanos stack, the redesign using VictoriaMetrics, collection‑storage separation, unit‑level disaster recovery, query‑tree auto‑replacement, Flink‑based pre‑aggregation, Grafana upgrades, and future roadmap for observability.

Cloud NativeFlinkMetrics
0 likes · 30 min read
How Bilibili Scaled Its Monitoring: From Prometheus OOMs to VictoriaMetrics & Flink Pre‑Aggregation
Soul Technical Team
Soul Technical Team
Sep 2, 2024 · Databases

Comparative Analysis of VictoriaMetrics and Thanos for Large‑Scale Metric Storage

This article examines the migration from Thanos to VictoriaMetrics for large‑scale metric storage, detailing background challenges, VictoriaMetrics architecture and storage engine, data write and read processes, and a comparative analysis of performance, scalability, and operational costs between the two systems.

ThanosTime Series DatabaseVictoriaMetrics
0 likes · 15 min read
Comparative Analysis of VictoriaMetrics and Thanos for Large‑Scale Metric Storage
ITPUB
ITPUB
Aug 11, 2024 · Operations

Scaling Bilibili’s Metrics Platform with VictoriaMetrics and Flink Pre‑aggregation

This article details how Bilibili redesigned its monitoring system to overcome explosive metric growth by separating collection and storage, adopting VictoriaMetrics, implementing zone‑based scheduling, automating PromQL query replacement, and using Flink for efficient pre‑aggregation, resulting in dramatically lower latency and higher stability.

FlinkPromQLVictoriaMetrics
0 likes · 31 min read
Scaling Bilibili’s Metrics Platform with VictoriaMetrics and Flink Pre‑aggregation
Bilibili Tech
Bilibili Tech
Aug 9, 2024 · Operations

Design and Optimization of Monitoring 2.0 Architecture with VictoriaMetrics and Flink

The new Monitoring 2.0 architecture separates collection, compute and storage, adopts VictoriaMetrics for compact time‑series storage and a zone‑based scheduler, introduces push‑based ingestion, uses Flink for real‑time pre‑aggregation and automatic PromQL rewrite, delivering ten‑fold query speedups, sub‑300 ms p90 latency, and dramatically higher write and query throughput.

FlinkMetricsPrometheus
0 likes · 29 min read
Design and Optimization of Monitoring 2.0 Architecture with VictoriaMetrics and Flink
Efficient Ops
Efficient Ops
Aug 5, 2024 · Operations

Thanos vs VictoriaMetrics: Which Prometheus Long‑Term Storage Wins?

This article compares Thanos and VictoriaMetrics as Prometheus long‑term storage solutions, evaluating their architectures, write and read paths, reliability, data consistency, performance, scalability, high‑availability, and cost to help you choose the best fit for your monitoring stack.

ThanosVictoriaMetricscloud
0 likes · 17 min read
Thanos vs VictoriaMetrics: Which Prometheus Long‑Term Storage Wins?
Sohu Tech Products
Sohu Tech Products
Jul 24, 2024 · Cloud Native

Understanding Helm and Kubernetes Operators

The article explains how Helm simplifies deploying complex Kubernetes applications with a single YAML chart but cannot manage runtime operations, while Kubernetes Operators—built on custom resource definitions and webhook logic—automate tasks such as scaling, upgrades, and side‑car injection, offering higher‑level lifecycle management.

Application DeploymentCRDKubernetes
0 likes · 9 min read
Understanding Helm and Kubernetes Operators
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jan 10, 2024 · Operations

Building Cloud Music's APM Metric Monitoring System Based on VictoriaMetrics

Cloud Music’s middleware team built the Pylon APM monitoring system on VictoriaMetrics, combining exporters, vmagent, Nacos, Flink‑based pre‑aggregation recording rules and vminsert for collection with Grafana, a custom Proxy and vmselect for querying, achieving millisecond‑level latency, metric‑trace correlation, stability improvements, and cost‑effective storage for nearly 700 million active time series.

APM monitoringFlinkMetric Pre-aggregation
0 likes · 12 min read
Building Cloud Music's APM Metric Monitoring System Based on VictoriaMetrics
Weimob Technology Center
Weimob Technology Center
Dec 26, 2023 · Operations

Rebuilding Our APM: Scalable Metrics & Alerts with VictoriaMetrics & VMAlert

This article details the complete redesign of our internal APM system, covering the motivations, architecture choices, metric collection pipeline, integration of VictoriaMetrics and VMAlert, metric and alert design principles, implementation steps, visualizations, performance gains, and future plans for scaling and SaaS‑ification.

APMAlertingMetrics
0 likes · 17 min read
Rebuilding Our APM: Scalable Metrics & Alerts with VictoriaMetrics & VMAlert
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Dec 14, 2023 · Cloud Native

Evolution of Xiaohongshu Metrics System: Cloud‑Native Observability, High Availability, and Performance Optimizations

Xiaohongshu’s observability team rebuilt its Prometheus‑based metrics platform using vmagent, dual‑active HA clusters, query push‑down, high‑cardinality governance and multi‑cloud active‑active design, delivering ten‑fold collection speed, up to 70× query capacity, massive CPU‑memory‑storage savings and fully automated scaling.

MetricsTime SeriesVictoriaMetrics
0 likes · 35 min read
Evolution of Xiaohongshu Metrics System: Cloud‑Native Observability, High Availability, and Performance Optimizations
Didi Tech
Didi Tech
Sep 26, 2023 · Databases

Didi's Time Series Storage Evolution: From InfluxDB to VictoriaMetrics

Facing exponential growth of time‑series data from 2017 to 2023, Didi migrated from InfluxDB to RRDtool, then to an in‑memory cache layer, and finally adopted VictoriaMetrics because its low‑cost commodity‑hardware operation, high write throughput, strong compression, and easy horizontal scaling solved the earlier storage, OOM, and scalability problems.

Performance EvaluationTSDBTime Series Database
0 likes · 13 min read
Didi's Time Series Storage Evolution: From InfluxDB to VictoriaMetrics
Architect
Architect
Sep 7, 2023 · Cloud Native

How Vivo Scaled Container Monitoring with Prometheus, Kafka, and VictoriaMetrics

This article details how Vivo's container platform faced exploding metric volumes, component overload, data gaps, and storage spikes, and explains the step‑by‑step architectural redesign, metric governance, performance tuning, cAdvisor redeployment, and VictoriaMetrics upgrade that restored high‑availability, low‑latency monitoring across a large Kubernetes fleet.

Cloud NativeKubernetesPrometheus
0 likes · 18 min read
How Vivo Scaled Container Monitoring with Prometheus, Kafka, and VictoriaMetrics
vivo Internet Technology
vivo Internet Technology
Aug 16, 2023 · Cloud Native

Building a Scalable Container Monitoring System with Prometheus and VictoriaMetrics at vivo

The vivo Internet Container Team built a scalable, high‑availability container monitoring platform by deploying dual‑replica Prometheus clusters with a custom HA adapter, remoteWrite to VictoriaMetrics, and a Kafka forwarder, while cutting metric cardinality, tuning cAdvisor, and upgrading VictoriaMetrics to eliminate data loss and storage spikes, achieving stable, efficient monitoring.

Cloud NativeContainerKubernetes
0 likes · 16 min read
Building a Scalable Container Monitoring System with Prometheus and VictoriaMetrics at vivo
ITPUB
ITPUB
Dec 4, 2022 · Cloud Native

How Qunar Scaled Container Monitoring with VictoriaMetrics: A Cloud‑Native Case Study

This article details Qunar's migration from a Prometheus‑based monitoring stack to VictoriaMetrics, describing the limitations they faced, the architectural redesign using vmagent, vmcluster, and vmalert, and the resulting performance improvements and operational benefits for large‑scale Kubernetes environments.

Cloud NativeKubernetesPrometheus
0 likes · 14 min read
How Qunar Scaled Container Monitoring with VictoriaMetrics: A Cloud‑Native Case Study
Qunar Tech Salon
Qunar Tech Salon
Nov 29, 2022 · Cloud Native

Qunar’s Experience Replacing Prometheus with VictoriaMetrics for Cloud‑Native Container Monitoring

This article details Qunar’s migration from a traditional Prometheus‑based monitoring stack to VictoriaMetrics, describing the challenges of large‑scale container metrics collection, the architectural redesign using VM‑Cluster, vmagent, and vmalert, and the performance improvements achieved after full replacement.

KubernetesPrometheusTime Series Database
0 likes · 14 min read
Qunar’s Experience Replacing Prometheus with VictoriaMetrics for Cloud‑Native Container Monitoring
dbaplus Community
dbaplus Community
Sep 26, 2022 · Backend Development

How Ctrip Replaced HBase with VictoriaMetrics & ClickHouse for Scalable Metrics Monitoring

Ctrip’s internal Dashboard monitoring platform, originally built on HBase, was redesigned by migrating its core writer and storage components to a hybrid VictoriaMetrics‑ClickHouse solution, delivering faster queries, higher write stability, and full Prometheus compatibility while keeping the user experience unchanged.

DashboardHBaseMetrics
0 likes · 13 min read
How Ctrip Replaced HBase with VictoriaMetrics & ClickHouse for Scalable Metrics Monitoring
360 Smart Cloud
360 Smart Cloud
Sep 8, 2022 · Databases

Integrating TiDB Multi‑Cluster Monitoring with Prometheus, Consul, and VictoriaMetrics

This article presents a step‑by‑step solution for consolidating TiDB multi‑cluster monitoring by deploying Consul for service registration, configuring Prometheus to discover services via Consul, and optionally replacing Prometheus with VictoriaMetrics to achieve unified dashboards, scalable data collection, and easier health inspection across dozens or hundreds of instances.

ConsulGrafanaPrometheus
0 likes · 10 min read
Integrating TiDB Multi‑Cluster Monitoring with Prometheus, Consul, and VictoriaMetrics
dbaplus Community
dbaplus Community
Sep 5, 2022 · Operations

How EyesTSDB Evolved into a Cloud‑Native, Second‑Level Monitoring Platform

This article details the evolution of NetEase's self‑built time‑series database EyesTSDB into a cloud‑native, second‑level monitoring solution, covering its architecture, core features, integration with VictoriaMetrics, custom plugin workflow, CMDB linkage, real‑world use cases, and future challenges.

CMDB integrationMetricsTime Series Database
0 likes · 21 min read
How EyesTSDB Evolved into a Cloud‑Native, Second‑Level Monitoring Platform
Architecture Digest
Architecture Digest
Jun 17, 2022 · Cloud Native

Vivo Container Cluster Monitoring Architecture and Cloud‑Native Practices

This article describes Vivo's practical experience building a cloud‑native monitoring system for large‑scale container clusters, covering the shortcomings of traditional monitoring, the Prometheus‑centric ecosystem, high‑availability architecture, challenges faced, and future directions such as automation and AI‑driven operations.

PrometheusVictoriaMetricsVivo
0 likes · 13 min read
Vivo Container Cluster Monitoring Architecture and Cloud‑Native Practices
vivo Internet Technology
vivo Internet Technology
Jun 15, 2022 · Cloud Native

Vivo Container Cluster Monitoring Architecture and Cloud‑Native Observability Practices

Vivo’s cloud‑native monitoring solution combines high‑availability Prometheus clusters, VictoriaMetrics storage, Grafana visualization, and a custom leader‑election adapter to deduplicate data while forwarding metrics to Kafka and OLAP systems, addressing large‑scale performance, scalability, and integration challenges and paving the way for AI‑driven AIOps.

Cloud Native MonitoringKubernetesPrometheus
0 likes · 18 min read
Vivo Container Cluster Monitoring Architecture and Cloud‑Native Observability Practices
Ops Development Stories
Ops Development Stories
Jan 24, 2022 · Cloud Native

Deploy and Configure vmagent on Kubernetes for Efficient Metrics

This guide explains what vmagent is, its key features, and provides step‑by‑step instructions to install, configure, and verify vmagent on a Kubernetes cluster, including namespace and RBAC setup, custom scrape configs, monitoring endpoints, and troubleshooting tips.

KubernetesMetricsVictoriaMetrics
0 likes · 15 min read
Deploy and Configure vmagent on Kubernetes for Efficient Metrics