DevOps Operations Practice
Author

We share professional insights on cloud-native, DevOps & operations, Kubernetes, observability & monitoring, and Linux systems.

165 articles · 0 likes · 815 views · 0 comments
Recent Articles

DevOps Operations Practice
May 30, 2024 · Operations

Introducing Karma: A Prometheus Alert Dashboard Tool

This article introduces Karma, a Docker-deployed alert dashboard for Prometheus that aggregates alerts from multiple Alertmanager instances. It covers the installation requirements and the key features: visual alert aggregation, label-based grouping, and silence management.

Alert Dashboard · Alertmanager · Docker
0 likes · 4 min read
DevOps Operations Practice
May 20, 2024 · Cloud Computing

Google Cloud Data Deletion Incident at UniSuper: Causes, Impact, and Lessons Learned

Google Cloud mistakenly deleted the data and backups of Australian pension fund UniSuper, leaving more than 600,000 members without access for over a week. The incident highlights the risks of relying on a single provider, the importance of robust backup strategies, and the growing relevance of hybrid and multi-cloud architectures.

Backup · Cloud Computing · Data Loss
0 likes · 5 min read
DevOps Operations Practice
May 19, 2024 · Operations

High‑Availability Solutions for Prometheus Monitoring

Prometheus can be made highly available through several common architectures: dual nodes with external storage, federation with external storage, and multi-node clusters combined with Thanos and object storage. Each provides data persistence and load distribution to improve system stability and performance.

External Storage · High Availability · Thanos
0 likes · 3 min read
DevOps Operations Practice
May 13, 2024 · Cloud Computing

Understanding KVM: Definition, Advantages, and Operation

This article explains what the Kernel-based Virtual Machine (KVM) is, outlines its key benefits (high performance, security, stability, cost-effectiveness, and flexibility), and describes how it runs on Linux systems using kernel modules and supporting packages.

Cloud Computing · KVM · Virtualization
0 likes · 6 min read