Ops Community
Author

Ops Community

A leading IT operations community where professionals share and grow together.

164
Articles
0
Likes
457
Views
0
Comments
Recent Articles

Latest from Ops Community

100 recent articles max
Ops Community
Ops Community
Oct 8, 2025 · Cloud Native

How I Cut My Kubernetes Cloud Bill by 60% in 3 Months – Proven Strategies

Facing a 35‑million‑yuan monthly Kubernetes bill, the author analyzed hidden cost components, implemented five optimization campaigns—including resource request tuning, autoscaling, spot instances, storage tiering, and network consolidation—and reduced monthly expenses by 60% while boosting performance, delivering a detailed, reproducible methodology.

FinOpsKubernetesSpot Instances
0 likes · 33 min read
How I Cut My Kubernetes Cloud Bill by 60% in 3 Months – Proven Strategies
Ops Community
Ops Community
Oct 7, 2025 · Operations

How I Cut API Response Time from 500 ms to 100 ms with Linux Tuning

This article recounts a two‑week Linux system tuning project that reduced a high‑traffic API’s P99 response time from over 500 ms to under 100 ms by systematically diagnosing bottlenecks, applying USE‑based analysis, and tuning kernel, network, file‑descriptor, memory, CPU, and I/O parameters.

API latency reductionLinux TuningTCP Optimization
0 likes · 26 min read
How I Cut API Response Time from 500 ms to 100 ms with Linux Tuning
Ops Community
Ops Community
Oct 6, 2025 · Cloud Computing

Master Terraform for Multi-Cloud Management: From Beginner to Expert

This comprehensive guide walks you through Terraform fundamentals, multi‑cloud support, state management, modular design, environment handling, and real‑world case studies, showing how to automate infrastructure provisioning, improve consistency, and boost operational efficiency across AWS, Azure, GCP and Alibaba Cloud.

Infrastructure AutomationTerraformiac
0 likes · 30 min read
Master Terraform for Multi-Cloud Management: From Beginner to Expert
Ops Community
Ops Community
Oct 5, 2025 · Databases

Master MySQL Slow Query Optimization: Proven Methods & Pitfall Guide

This comprehensive guide walks you through a systematic methodology for diagnosing and fixing MySQL slow queries, covering proper log configuration, EXPLAIN analysis, index design, query rewriting, table restructuring, and practical caching techniques, while highlighting common misconceptions and real‑world case studies.

Index OptimizationMySQLPerformance Tuning
0 likes · 35 min read
Master MySQL Slow Query Optimization: Proven Methods & Pitfall Guide
Ops Community
Ops Community
Oct 4, 2025 · Databases

How to Quickly Diagnose and Fix a Frozen MySQL in Production: 5 Proven Steps

Facing a MySQL that suddenly becomes unresponsive in production? This article walks through the exact five‑step investigative process—checking process status, examining connections, locating lock waits, analyzing slow queries and system bottlenecks, and applying emergency recovery—illustrated with real‑world examples and command‑line snippets.

MySQLProduction Incidentdatabase troubleshooting
0 likes · 19 min read
How to Quickly Diagnose and Fix a Frozen MySQL in Production: 5 Proven Steps
Ops Community
Ops Community
Oct 3, 2025 · Operations

2025 Ops Skill Blueprint: Must‑Learn Technologies Every Top Engineer Is Mastering

This comprehensive guide analyzes the rapid transformation of the operations industry, presents data‑driven evidence of declining traditional roles, and delivers a detailed 2025 skill roadmap—including cloud‑native, programming, observability, automation, database, networking, and soft‑skill competencies—complete with learning paths, practical examples, and verification standards.

0 likes · 42 min read
2025 Ops Skill Blueprint: Must‑Learn Technologies Every Top Engineer Is Mastering
Ops Community
Ops Community
Oct 2, 2025 · Operations

How to Fix Nginx 502 Bad Gateway Errors: A 90% Success Checklist

This article provides a comprehensive, step‑by‑step checklist for diagnosing and resolving Nginx 502 Bad Gateway errors, covering backend service verification, configuration checks, log analysis, resource monitoring, network troubleshooting, special scenarios, and long‑term preventive measures.

502Bad GatewayMonitoring
0 likes · 25 min read
How to Fix Nginx 502 Bad Gateway Errors: A 90% Success Checklist
Ops Community
Ops Community
Oct 1, 2025 · Databases

Why Did Redis Memory Spike 10×? Uncover the Hidden Config Mistake

A sudden Redis memory surge from 2 GB to 20 GB was traced to a misconfigured list-compress-depth parameter, revealing how uncompressed lists and queue backlogs can cause ten‑fold memory growth, and outlining step‑by‑step diagnostics, compression fixes, and long‑term optimization strategies.

ConfigurationList CompressionRedis
0 likes · 24 min read
Why Did Redis Memory Spike 10×? Uncover the Hidden Config Mistake
Ops Community
Ops Community
Sep 30, 2025 · Operations

7 Ready‑to‑Use Shell Scripts for Automated Server Health Checks

This article presents seven production‑grade Bash scripts that automate server health inspections—including system resource monitoring, disk analysis, network diagnostics, process validation, security baseline checks, MySQL health assessment, and a batch scheduler—plus best‑practice guidance for integrating them into an operations workflow.

Server Automationshell scripting
0 likes · 72 min read
7 Ready‑to‑Use Shell Scripts for Automated Server Health Checks
Ops Community
Ops Community
Sep 29, 2025 · Cloud Native

Enterprise Docker Deployment: From Zero to Production – A Complete Guide

This comprehensive guide walks through the evolution of container technology, explains Docker's core mechanisms, and presents enterprise‑grade architecture, deployment strategies, monitoring, security hardening, and real‑world case studies, helping ops engineers build efficient, scalable, and secure production‑ready Docker environments.

ContainerizationDockerEnterprise Deployment
0 likes · 19 min read
Enterprise Docker Deployment: From Zero to Production – A Complete Guide