Ops Community
Author

Ops Community

A leading IT operations community where professionals share and grow together.

164
Articles
0
Likes
457
Views
0
Comments
Recent Articles

Latest from Ops Community

100 recent articles max
Ops Community
Ops Community
Oct 26, 2025 · Cloud Native

Master Docker Basics: From Containers to Cloud‑Native Deployment

This article introduces Docker as an open‑source container platform, explains its core concepts of images and containers, highlights benefits such as rapid, lightweight deployment and consistency across environments, and outlines essential topics including Linux container fundamentals, Docker workflow, commands, image management, daily container operations, and data‑volume handling.

Container OperationsDockerImage Management
0 likes · 4 min read
Master Docker Basics: From Containers to Cloud‑Native Deployment
Ops Community
Ops Community
Oct 25, 2025 · Operations

How to Diagnose and Fix CrashLoopBackOff in Kubernetes: 10 Common Causes

This guide explains the CrashLoopBackOff state, provides quick kubectl commands, lists ten typical reasons such as misconfiguration, image errors, health‑probe issues, OOM kills, and offers step‑by‑step fixes, prevention tips, and best practices for reliable pod deployment.

CrashLoopBackOffKubernetesPod troubleshooting
0 likes · 15 min read
How to Diagnose and Fix CrashLoopBackOff in Kubernetes: 10 Common Causes
Ops Community
Ops Community
Oct 23, 2025 · Operations

Zero‑Downtime Nginx Load Balancing: Build a 99.99% HA Architecture

This guide walks through designing and implementing a highly available Nginx load‑balancing solution—covering applicable scenarios, prerequisites, environment matrix, step‑by‑step configuration of Nginx, Keepalived, SSL termination, health checks, monitoring, performance tuning, security hardening, troubleshooting, and a concise list of best‑practice recommendations.

KeepalivedMonitoringSSL
0 likes · 29 min read
Zero‑Downtime Nginx Load Balancing: Build a 99.99% HA Architecture
Ops Community
Ops Community
Oct 22, 2025 · Cloud Native

How to Build a Production‑Ready GitOps Pipeline with ArgoCD and Helm in 10 Minutes

This step‑by‑step guide shows how to set up a full‑stack GitOps workflow using ArgoCD and Helm on Kubernetes, covering prerequisites, environment matrix, a 10‑step implementation checklist, monitoring, performance tuning, security hardening, common troubleshooting, rollback scripts, and best‑practice recommendations.

ArgoCDCI/CDContinuous delivery
0 likes · 32 min read
How to Build a Production‑Ready GitOps Pipeline with ArgoCD and Helm in 10 Minutes
Ops Community
Ops Community
Oct 15, 2025 · Operations

Master Ansible: Complete Playbook Guide for Managing Hundreds of Servers

This comprehensive guide explores Ansible’s architecture, core principles, inventory management, playbook creation, advanced techniques, role usage, variable handling, error handling, idempotency, and real‑world case studies to help engineers efficiently automate and maintain large server fleets.

AnsibleConfiguration ManagementInfrastructure as Code
0 likes · 37 min read
Master Ansible: Complete Playbook Guide for Managing Hundreds of Servers
Ops Community
Ops Community
Oct 15, 2025 · Operations

Unlock Linux Automation: Master Shell Scripting with This Free Handbook

This article introduces a comprehensive, free Shell scripting handbook that covers everything from basic syntax and variables to advanced functions, flow control, and real‑world automation tasks, making it an essential resource for Linux system administrators and developers.

Automationbashscripting
0 likes · 8 min read
Unlock Linux Automation: Master Shell Scripting with This Free Handbook
Ops Community
Ops Community
Oct 14, 2025 · Operations

Mastering Ansible: A Complete Guide to Automated Operations Standards

Discover how to replace chaotic shell scripts with a comprehensive, Ansible‑based automation framework that covers tool selection, architecture design, standardized directory structures, inventory management, variable hierarchy, role development, secure vault usage, real‑world multi‑environment deployments, baseline configurations, monitoring, CI/CD integration, and best‑practice guidelines for modern operations teams.

AnsibleCI/CDInfrastructure as Code
0 likes · 34 min read
Mastering Ansible: A Complete Guide to Automated Operations Standards
Ops Community
Ops Community
Oct 12, 2025 · Operations

When etcd Certificates Expire: How One Failure Crippled an Entire Kubernetes Cluster

A midnight alarm revealed that an expired etcd TLS certificate caused a cascade of failures across a Kubernetes cluster, leading to a full outage that took over half an hour to diagnose, remediate, and restore, highlighting the critical need for proactive certificate management and automated monitoring.

Cluster RecoveryKubernetescertificate expiration
0 likes · 44 min read
When etcd Certificates Expire: How One Failure Crippled an Entire Kubernetes Cluster
Ops Community
Ops Community
Oct 10, 2025 · Operations

From Shell Scripts to Terraform: Mastering Infrastructure as Code

This article traces the evolution of infrastructure automation from ad‑hoc Shell scripts through configuration‑management tools to modern declarative Terraform, highlighting common pitfalls, offering concrete best‑practice recommendations, detailed code samples, and a real‑world migration case study that demonstrates how to transition safely and efficiently.

AWSDevOpsTerraform
0 likes · 31 min read
From Shell Scripts to Terraform: Mastering Infrastructure as Code