Ops Community
Author

Ops Community

A leading IT operations community where professionals share and grow together.

164
Articles
0
Likes
457
Views
0
Comments
Recent Articles

Latest from Ops Community

100 recent articles max
Ops Community
Ops Community
Apr 28, 2026 · Operations

How Dangerous Is an HTTPS Certificate Expiration and How Ops Can Prevent It?

When an HTTPS certificate expires, browsers show warnings, users abandon sites, services become unavailable, and security is weakened, so this article explains the TLS fundamentals, the risks of expiration, real‑world outage cases, and provides step‑by‑step guidance on acquisition, deployment, automated renewal, monitoring, and best‑practice procedures for reliable certificate management.

AutomationHTTPSMonitoring
0 likes · 25 min read
How Dangerous Is an HTTPS Certificate Expiration and How Ops Can Prevent It?
Ops Community
Ops Community
Apr 27, 2026 · Operations

10 Essential Linux Commands Every Sysadmin Must Master

This guide walks system administrators through the ten most frequently used Linux commands—top/htop, df/du, free, ss/netstat, ping/traceroute, ps/kill, grep/sed/awk, tail/less, uname/hostname/uptime, and tar/rsync—explaining core options, output interpretation, common pitfalls, and practical troubleshooting scenarios.

Command LineFile ManagementLinux
0 likes · 25 min read
10 Essential Linux Commands Every Sysadmin Must Master
Ops Community
Ops Community
Apr 26, 2026 · Operations

8 Common Shell Script Mistakes Junior Ops Engineers Make (Are You Guilty?)

This article examines the eight most frequent errors junior and mid‑level Linux operations engineers make when writing Bash scripts—such as missing quotes, wrong comparison operators, incomplete file checks, ignoring return codes, mishandling spaces, concurrency issues, lack of error handling, and absent logging—and provides concrete examples, detailed analysis, and corrected code snippets to improve script reliability and maintainability.

Error Handlingbashconcurrency
0 likes · 26 min read
8 Common Shell Script Mistakes Junior Ops Engineers Make (Are You Guilty?)
Ops Community
Ops Community
Apr 22, 2026 · Databases

Is MySQL CPU Spike a Database Issue or an Application Issue? Troubleshooting Guide

When MySQL CPU usage spikes above 80% or hits 100%, this guide walks you through a systematic investigation—from confirming the MySQL process consumes CPU, checking system and MySQL status, analyzing connection counts, slow queries, lock waits, and configuration settings, to applying short‑term mitigations and long‑term architectural fixes.

CPUDatabase OperationsMySQL
0 likes · 17 min read
Is MySQL CPU Spike a Database Issue or an Application Issue? Troubleshooting Guide
Ops Community
Ops Community
Apr 21, 2026 · Artificial Intelligence

How to Tame Unstable LLM Prompts: Causes and Fixes

This article explains why large‑model prompts can yield inconsistent answers, examines the roles of temperature, top‑p/top‑k, tokenization, context windows, position bias, and model randomness, and provides a step‑by‑step debugging workflow and production‑grade best‑practice checklist to achieve stable outputs.

LLM stabilityTemperatureTop‑P
0 likes · 13 min read
How to Tame Unstable LLM Prompts: Causes and Fixes
Ops Community
Ops Community
Apr 19, 2026 · Databases

How to Diagnose and Resolve MySQL CPU Spikes: A Complete Step‑by‑Step Guide

This guide walks you through identifying why MySQL CPU usage jumps, from confirming the MySQL process consumes CPU to checking connection counts, slow queries, lock waits, configuration settings, and business‑level traffic, and then provides short‑term mitigations and long‑term solutions such as read‑write splitting, sharding, and caching.

CPUMonitoringMySQL
0 likes · 17 min read
How to Diagnose and Resolve MySQL CPU Spikes: A Complete Step‑by‑Step Guide
Ops Community
Ops Community
Apr 18, 2026 · Operations

Master Linux Host Monitoring: Prometheus, Node Exporter, Thresholds & Scripts

This comprehensive guide walks you through building a robust Linux host monitoring system with Prometheus and node_exporter, covering CPU, memory, disk, and network metrics, practical threshold formulas, ready‑to‑run Bash scripts, Alertmanager rules, Grafana dashboards, and best‑practice recommendations for reliable operations.

AlertmanagerGrafanaLinux monitoring
0 likes · 49 min read
Master Linux Host Monitoring: Prometheus, Node Exporter, Thresholds & Scripts
Ops Community
Ops Community
Apr 15, 2026 · Operations

Why Do Linux Processes Vanish? A Complete Troubleshooting Guide

This article systematically explains why Linux processes may disappear, covering OOM Killer, signal termination, cgroup limits, systemd timeouts, manual kills, and provides step‑by‑step diagnostic commands and preventive measures for RHEL, AlmaLinux, and Ubuntu environments.

LinuxMemoryOOM killer
0 likes · 26 min read
Why Do Linux Processes Vanish? A Complete Troubleshooting Guide
Ops Community
Ops Community
Apr 10, 2026 · Databases

How to Diagnose and Fix MySQL Too Many Connections Errors in Production

When MySQL reports 'Too many connections', this guide walks you through emergency assessment, step‑by‑step diagnostics, quick mitigation scripts, root‑cause analysis of slow queries, connection leaks, short‑connection spikes, and long‑term solutions including parameter tuning, connection‑pool configuration, and Prometheus‑based monitoring to prevent future outages.

AlertmanagerConnection PoolConnection leak
0 likes · 40 min read
How to Diagnose and Fix MySQL Too Many Connections Errors in Production