Tagged articles
16 articles
dbaplus Community
Jul 24, 2025 · Operations

How Bilibili Scales Server Fault Management with Automated Detection and Repair

This article details Bilibili's approach to handling explosive growth in server count by classifying faults, identifying the shortcomings of manual processes, and implementing an automated end‑to‑end workflow for detection, rule‑based alerting, and repair that combines in‑band and out‑of‑band data collection to achieve near‑perfect coverage and accuracy.

Data center · fault detection · in‑band
0 likes · 17 min read
Liangxu Linux
May 15, 2025 · Operations

10 Critical Server Ops Mistakes to Avoid and Real-World Lessons

This article outlines ten common server operation pitfalls (forced power‑offs, reckless experiments in production, neglected firewall rules, unknown scripts run as root, database changes made without a backup, weak SSH settings, poor log management, exposed ports, unmonitored changes, and delayed patching), each illustrated with real‑world cases and practical remediation advice.

Backup · Security · System Administration
0 likes · 7 min read
Lin is Dream
May 7, 2025 · Operations

How to Prevent Nginx Log Overgrowth with Automated Log Splitting

This guide explains why default Nginx logging can fill disks, then walks through disabling logging, clearing old files, re‑enabling logging, using a shell script to rotate logs by date, and scheduling the script with cron to keep server storage healthy.

log rotation · server operations
0 likes · 5 min read
ITPUB
May 3, 2025 · Information Security

20 Critical Server Operations You Must Never Do – Real Cases & Fixes

Based on analysis of over 500 enterprise server failure cases, this guide lists 20 absolutely prohibited server actions across six dimensions, each illustrated with a real incident and practical technical measures to prevent recurrence.

DevOps · System Administration · incident response
0 likes · 14 min read
Open Source Linux
Mar 27, 2025 · Operations

10 Critical Server Ops Mistakes to Avoid: Real-World Lessons

This article outlines ten critical server operation mistakes—ranging from forced power cuts to neglecting updates—illustrated with real-world incidents and practical advice, helping engineers adopt safer practices, proper backups, secure configurations, and effective monitoring to prevent costly outages.

best practices · incident management · server operations
0 likes · 6 min read
Efficient Ops
Feb 23, 2025 · Information Security

Top 10 Server Ops Mistakes That Can Cripple Your Business – How to Avoid Them

This article presents ten critical server‑operation blunders—from forced power‑offs to neglecting firewall rules—and illustrates each with real‑world incidents, offering concrete best‑practice recommendations to help IT teams prevent costly outages and security breaches.

incident prevention · security best practices · server operations
0 likes · 7 min read
Liangxu Linux
Nov 28, 2022 · Operations

How to Diagnose and Free Unexpected Linux Disk Space Usage

Learn step‑by‑step Linux techniques to locate hidden disk consumers—using df, du, find, lsof, and tune2fs—to resolve mismatched usage reports, delete lingering open files, and adjust reserved space, ensuring your server’s storage stays under control.

Linux · command-line · disk-management
0 likes · 6 min read
Efficient Ops
Aug 18, 2022 · Operations

How to Quickly Identify Disk Space Hogs on Linux Servers

This guide shows step‑by‑step Linux commands—df, du, find, lsof, and tune2fs—to pinpoint large directories, uncover hidden space used by deleted files, and adjust reserved root space, helping you resolve mysterious disk‑space alerts efficiently.

df · disk usage · du
0 likes · 4 min read
MaGe Linux Operations
Jan 9, 2022 · Operations

How to Quickly Identify Disk Space Hogs on Linux Servers

Learn step-by-step methods to locate large directories and files consuming disk space on Linux servers, using commands like df, du, find, and lsof, plus tips for handling deleted files and adjusting reserved space with tune2fs to reclaim lost storage.

du command · find command · lsof
0 likes · 4 min read
Programmer DD
May 23, 2021 · Operations

How I Cut Full GC Frequency from 40 to 1 in 10 Days: A JVM Tuning Journey

Over a month of systematic investigation, the author reduced Full GC occurrences on a 2‑core, 4 GB Java server cluster from 40 times a day to roughly once every ten days by adjusting heap settings, fixing a memory leak caused by an anonymous listener, and tuning Metaspace and CMS thresholds, ultimately achieving stable performance and lower latency.

Garbage Collection · JVM · Java
0 likes · 11 min read
Open Source Linux
Mar 19, 2020 · Operations

Essential Ops Playbook: Avoid Costly Mistakes in Server Management

This guide shares practical Linux server operation rules, emphasizing thorough testing, careful use of destructive commands, strict access control, regular backups, security hardening, continuous monitoring, and disciplined performance tuning to prevent costly outages and data loss.

Backup · monitoring · performance tuning
0 likes · 13 min read