Tagged articles
6 articles
MaGe Linux Operations
Oct 6, 2025 · Operations

Avoid the Fatal Ops Mistakes That Could Ruin Your Career – 10 Critical Pitfalls and How to Prevent Them

Drawing on real-world incidents and Gartner 2023 data, this article reveals ten deadly operational pitfalls—from executing untested commands in production to inadequate backups—and offers concrete technical safeguards, process controls, and cultural practices to help engineers avoid costly errors and protect their careers.

Backup · Operations · automation
27 min read
Efficient Ops
Feb 23, 2025 · Information Security

Top 10 Server Ops Mistakes That Can Cripple Your Business – How to Avoid Them

This article presents ten critical server‑operation blunders—from forced power‑offs to neglecting firewall rules—and illustrates each with real‑world incidents, offering concrete best‑practice recommendations to help IT teams prevent costly outages and security breaches.

incident prevention · security best practices · server operations
7 min read
Tech Architecture Stories
Dec 28, 2024 · Operations

Why Preventing Small Issues Is the Key to System Stability

The article explains how early detection and preventive measures—such as comprehensive monitoring, rate limiting, chaos testing, and proper SLOs—are essential for maintaining system stability and avoiding larger incidents, drawing on SRE principles and the incident triangle theory.

Error Budget · Operations · SRE
4 min read
Bilibili Tech
Aug 9, 2024 · Operations

Design and Implementation of Bilibili's Change Control Platform

Bilibili's Change Control Platform consolidates data from over 60 systems to proactively detect and block more than 100 risky changes daily. It reduces change-related incidents through a four-pillar framework of technical support, rollout, cross-domain enablement, and cultural safeguards, and is evolving toward AI-driven, end-to-end change defense.

Bilibili · DevOps · Reliability
20 min read
ITPUB
Apr 15, 2019 · Operations

Essential Practices to Prevent Operational Failures and Boost System Availability

This guide outlines six practical strategies—rollback testing, cautious destructive actions, clear command prompts, verified backups, careful handovers, and proactive monitoring—to help operations teams minimize outages and maintain high system availability.

Availability · Operations · backup verification
6 min read
ITPUB
Mar 9, 2017 · Operations

How the Four‑Eyes Principle Saves IT Ops from Costly Mistakes

The article shares frontline IT operations experiences, emphasizing careful command execution, mandatory operation logs, two‑person verification, and backup strategies to prevent disastrous errors, illustrated by real incidents like a massive Deutsche Bank loss caused by a simple input mistake.

IT best practices · Operations · backup strategy
4 min read