Raymond Ops
Author

Raymond Ops

Linux ops automation, cloud-native, Kubernetes, SRE, DevOps, Python, Golang and related tech discussions.

622
Articles
0
Likes
3.0k
Views
0
Comments
Recent Articles

Latest from Raymond Ops

100 recent articles max
Raymond Ops
Raymond Ops
Mar 4, 2026 · Operations

Build an Enterprise‑Grade DevOps CI/CD Pipeline in 7 Days with Ready‑to‑Use Scripts

This guide walks you through constructing a full‑stack, enterprise‑level DevOps pipeline—from environment preparation and tool installation to Jenkins pipeline scripting, Kubernetes deployment, monitoring, security hardening, and cost optimization—providing complete scripts and step‑by‑step instructions to achieve automated, reliable releases within a week.

CI/CDDockerJenkins
0 likes · 27 min read
Build an Enterprise‑Grade DevOps CI/CD Pipeline in 7 Days with Ready‑to‑Use Scripts
Raymond Ops
Raymond Ops
Mar 3, 2026 · Operations

How I Turned a Firefighter Ops Engineer into a High‑Paid Tech Expert in 3 Years

This article chronicles a three‑year journey from a junior operations engineer blamed for outages to a senior technical specialist, detailing the four pivotal turning points, concrete learning plans, automation projects, cost‑optimization strategies, and actionable advice for anyone seeking to advance in modern operations.

careercloud-nativemonitoring
0 likes · 27 min read
How I Turned a Firefighter Ops Engineer into a High‑Paid Tech Expert in 3 Years
Raymond Ops
Raymond Ops
Mar 2, 2026 · Operations

Why Most Alerts Fail and How to Build a Night‑Quiet, High‑Signal Monitoring System

This article examines the root causes of alert fatigue—mis‑configured thresholds, noisy alerts, lack of context, and poor routing—then presents a step‑by‑step guide using golden signals, dynamic baselines, enriched alert payloads, severity‑based routing, and suppression techniques to create an effective, low‑noise monitoring system.

AlertingAlertmanagerSRE
0 likes · 24 min read
Why Most Alerts Fail and How to Build a Night‑Quiet, High‑Signal Monitoring System
Raymond Ops
Raymond Ops
Mar 2, 2026 · Cloud Native

ELK vs EFK vs Loki: 2025’s Best Log Solution for Cost, Performance & Simplicity

This comprehensive 2025 guide compares ELK, EFK, and Loki across architecture, deployment complexity, storage cost, query performance, feature completeness, high‑availability, and real‑world case studies, helping teams of any size choose the most cost‑effective and operationally suitable log collection stack.

EFKELKLoki
0 likes · 37 min read
ELK vs EFK vs Loki: 2025’s Best Log Solution for Cost, Performance & Simplicity
Raymond Ops
Raymond Ops
Mar 1, 2026 · Operations

How I Transitioned from Traditional Ops to SRE/DevOps in 18 Months

This detailed guide shares a step‑by‑step 18‑month roadmap, covering self‑assessment, skill acquisition (Python, Kubernetes, monitoring), project execution, interview preparation, and real‑world outcomes for engineers moving from legacy operations to SRE/DevOps roles.

CI/CDPythonSRE
0 likes · 35 min read
How I Transitioned from Traditional Ops to SRE/DevOps in 18 Months
Raymond Ops
Raymond Ops
Feb 26, 2026 · Operations

What Core Skills Do 500k‑CNY Ops Engineers Master?

This article breaks down the essential technical and soft‑skill competencies—ranging from deep Linux kernel knowledge and database optimization to cloud‑native Kubernetes expertise, observability, automation, cost‑saving architecture, and security—that distinguish high‑salary operations engineers and provides a practical roadmap for achieving them.

DatabaseObservabilityPerformance Optimization
0 likes · 38 min read
What Core Skills Do 500k‑CNY Ops Engineers Master?
Raymond Ops
Raymond Ops
Feb 25, 2026 · Operations

How to Stop 3 AM Alert Wake‑Ups: 5 Smart Monitoring Techniques

Every night engineers are jolted awake by noisy alerts, but by applying five practical techniques—including alert severity tiers, aggregation, dynamic thresholds, intelligent routing, and data‑driven effectiveness analysis—teams can cut daily alerts from over a hundred to fewer than ten and dramatically improve response times.

AlertingAlertmanagerdynamic thresholds
0 likes · 44 min read
How to Stop 3 AM Alert Wake‑Ups: 5 Smart Monitoring Techniques
Raymond Ops
Raymond Ops
Feb 24, 2026 · Cloud Native

Master Enterprise Monitoring: Build a Prometheus + Grafana Observability Platform

This guide details how to design and implement an enterprise‑grade cloud‑native observability platform using Prometheus for metrics collection and Grafana for visualization, covering architecture, high‑availability deployment, alerting, dashboard automation, case studies, best‑practice recommendations, and future trends.

Cloud NativeObservabilitygrafana
0 likes · 24 min read
Master Enterprise Monitoring: Build a Prometheus + Grafana Observability Platform
Raymond Ops
Raymond Ops
Feb 24, 2026 · Operations

Mastering Enterprise CI/CD with Ansible: A Complete Hands‑On Guide

This comprehensive guide explains how to build an enterprise‑grade CI/CD automation platform with Ansible, covering its evolution, core principles, environment setup, dynamic inventory, modular playbooks, GitLab integration, blue‑green deployments, Vault security, custom module development, real‑world case studies, performance tuning, error handling, monitoring, and testing with Molecule.

AnsibleCI/CDConfiguration Management
0 likes · 25 min read
Mastering Enterprise CI/CD with Ansible: A Complete Hands‑On Guide
Raymond Ops
Raymond Ops
Feb 14, 2026 · Operations

How I Cut 80% of Ops Time with an Automated Service Management System

This article details a complete automated operations framework that replaces manual service restarts, log cleaning, and deployment tasks with health‑checks, systemd units, Kubernetes probes, monitoring scripts, fault‑diagnosis tools, auto‑scaling policies, and Ansible playbooks, saving roughly 80% of repetitive work and dramatically improving reliability.

automationmonitoringops
0 likes · 38 min read
How I Cut 80% of Ops Time with an Automated Service Management System