
DBA Nightmares: Real‑World Incident Stories and Hard‑Earned Lessons

A collection of vivid DBA anecdotes reveals common pitfalls—from missed alerts and accidental production restarts to unsafe terminal habits—and distills practical safeguards that any database operator can adopt to avoid costly mishaps.

Efficient Ops

These stories, gathered from Zhihu contributors, illustrate the chaotic side of database administration and the lessons learned after each incident.

Typical mishaps

Alert emails landing in spam folders, leaving critical warnings unnoticed.

Phone service suspension preventing on‑call engineers from receiving alerts.

Criticism from leadership whether or not a failure occurs, which shows the pressure DBAs work under.

Being woken by an alarm at 2 am and then waiting on a laptop that is still booting, which delays the response.

Keeping dozens of SSH windows open and restarting a production database by mistake instead of a test instance.

A screen saver activating on a Windows server, so that a colleague who pressed Enter unintentionally executed a SQL statement.

Relying on outdated backups that fail to restore data after a mistaken operation.

Discovering that a long‑running single‑instance database has no recent backup.

Accidentally typing DROP DATABASE while a colleague watches, underscoring the need for caution.

Hard‑earned safeguards ("mending the fold after the sheep are lost")

Upgrade the on‑call laptop to SSD storage so that boot time does not delay an emergency response.

Separate production and test terminals; open only the windows directly related to the task at hand.

Disable screen savers on servers and prevent the Enter key from triggering commands after wake‑up.

Automate backup verification and keep manual checks frequent to ensure restore readiness.

Maintain up‑to‑date backup policies and regularly test restoration procedures.
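As one way to automate part of the backup checks above, here is a minimal Python sketch that flags a missing, stale, or suspiciously small backup file. The `*.sql.gz` naming pattern, the function names, and the thresholds are illustrative assumptions, not something the original stories specify. A freshness check like this only complements real restore tests; it does not prove that a backup can be restored.

```python
from datetime import datetime, timedelta
from pathlib import Path
from typing import List, Optional


def newest_backup(backup_dir: Path, pattern: str = "*.sql.gz") -> Optional[Path]:
    """Return the most recently modified backup file, or None if there is none."""
    # The "*.sql.gz" pattern is an assumed naming convention.
    candidates = [p for p in backup_dir.glob(pattern) if p.is_file()]
    return max(candidates, key=lambda p: p.stat().st_mtime, default=None)


def check_backup(backup_dir: Path, max_age: timedelta, min_bytes: int,
                 now: Optional[datetime] = None) -> List[str]:
    """Return a list of problems; an empty list means the basic checks passed."""
    now = now or datetime.now()
    latest = newest_backup(backup_dir)
    if latest is None:
        return [f"no backup files found in {backup_dir}"]

    problems = []
    stat = latest.stat()
    age = now - datetime.fromtimestamp(stat.st_mtime)
    if age > max_age:
        problems.append(f"{latest.name} is {age} old (limit {max_age})")
    if stat.st_size < min_bytes:
        problems.append(f"{latest.name} is only {stat.st_size} bytes")
    return problems
```

A scheduled job could call, for example, `check_backup(Path("/var/backups/mysql"), timedelta(hours=26), 1024)` (the path is hypothetical) and raise an alert whenever the returned list is not empty.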

Sample remediation script

0 0 22 03 * mysql xx -e "delete from tablex where create_time > '2006-03-22 00:00:00'"

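The snippet is a crontab entry that runs at 00:00 on March 22 and deletes every row in tablex whose create_time is later than 2006-03-22 00:00:00, that is, rows with a future timestamp at the moment of the first run, a typical fix for mistakenly inserted data. A crontab schedule has no year field, so the entry fires again every March 22 unless it is removed after the one-off fix, and each later run would delete all rows created since that date.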
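To make the schedule in the entry above concrete, the following Python sketch evaluates a five-field cron expression against a timestamp. It is a deliberately small matcher written for illustration: it handles only `*` and plain integers (day of week 0 to 6, Sunday as 0), and the function names are hypothetical. It shows that the schedule `0 0 22 03 *` matches midnight on March 22 in every year, not only 2006.

```python
from datetime import datetime


def _field_matches(field: str, value: int) -> bool:
    """Match one cron field; this sketch supports only '*' and plain integers."""
    return field == "*" or int(field) == value


def cron_matches(expr: str, when: datetime) -> bool:
    """Return True if a five-field cron expression fires at the given minute."""
    minute, hour, dom, month, dow = expr.split()
    if not (_field_matches(minute, when.minute)
            and _field_matches(hour, when.hour)
            and _field_matches(month, when.month)):
        return False
    # cron counts Sunday as 0, while datetime.weekday() counts Monday as 0.
    cron_dow = (when.weekday() + 1) % 7
    dom_ok = _field_matches(dom, when.day)
    dow_ok = _field_matches(dow, cron_dow)
    # Classic cron rule: if both day fields are restricted, either may match.
    if dom != "*" and dow != "*":
        return dom_ok or dow_ok
    return dom_ok and dow_ok


SCHEDULE = "0 0 22 03 *"  # schedule fields copied from the crontab entry
for year in (2006, 2007, 2008):
    print(year, cron_matches(SCHEDULE, datetime(year, 3, 22, 0, 0)))
# 2006 True
# 2007 True
# 2008 True
```

Since the five fields cannot pin a job to a single year, a one-off fix is safer run by hand or removed from the crontab right after it has executed.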

Overall, the anecdotes emphasize proactive monitoring, disciplined workflow, and reliable backup strategies as essential practices for any DBA.

Written by

Efficient Ops

This public account is maintained by Xiaotianguo and friends, regularly publishing widely-read original technical articles. We focus on operations transformation and accompany you throughout your operations career, growing together happily.
