Boost Ops Efficiency: 5 Python Scripts Every Sysadmin Should Use
This article explains how Python can automate common operations tasks—remote command execution, log parsing, system monitoring with alerts, batch software deployment, and backup/restore—providing code examples and highlighting the benefits for sysadmins.
Many operations engineers use Python scripts to automate routine tasks because Python offers a rich ecosystem of third‑party libraries and strong automation capabilities across various domains.
In the operations field, Python can be used for several automation scenarios:
Connecting to remote servers and executing commands
Parsing log files and extracting useful information
Monitoring system status and sending alerts
Batch deploying software or updating systems
Performing backup and recovery tasks
1. Connect to Remote Servers and Execute Commands
Connecting to remote servers is a frequent task for ops engineers. Python’s
paramikolibrary enables SSH connections.
<code>import paramiko
# Create SSH client
ssh = paramiko.SSHClient()
# Auto‑accept unknown host keys
ssh.set_missing_host_key_policy(paramiko.AutoAddPolicy())
# Connect to the remote server
ssh.connect(hostname='remote.server.com', username='user', password='password')
# Execute a command
stdin, stdout, stderr = ssh.exec_command('ls -l /tmp')
</code>2. Parse Log Files and Extract Information
Log parsing is another common task. The
regexlibrary provides powerful regular‑expression tools.
<code>import regex
# Read log file
with open('log.txt', 'r') as f:
log = f.read()
# Find error messages
errors = regex.findall(r'ERROR:\s+(.*)', log)
# Print each error
for error in errors:
print(error)
</code>More advanced regular expressions or libraries such as
loguruor
python‑logstashcan be used for complex parsing.
3. Monitor System Status and Send Alerts
The
psutillibrary provides system metrics, and
smtplibcan send email alerts.
<code>import psutil
import smtplib
cpu_percent = psutil.cpu_percent()
if cpu_percent > 80:
server = smtplib.SMTP('smtp.example.com')
server.login('user', 'password')
message = f'CPU usage exceeds 80%: {cpu_percent}%'
subject = 'Alert: High CPU Usage'
server.sendmail('[email protected]', '[email protected]', subject, message)
server.quit()
</code>Thresholds and monitored resources can be adjusted, and other tools like
nagios‑apior
sensu‑clientmay be integrated.
4. Batch Deploy Software or Update Systems
The
fabriclibrary allows remote command execution across multiple servers.
<code>from fabric import task
@task
def update_system(c):
c.run('apt-get update')
</code>Similar tasks can be built with
ansible,
puppet, or other automation frameworks.
5. Perform Backup and Recovery Tasks
Python’s
shutilmodule can copy files or entire directories.
<code>import shutil
# Backup a single file
shutil.copy('/path/to/file', '/path/to/backup/file')
# Backup an entire directory
shutil.copytree('/path/to/dir', '/path/to/backup/dir')
</code>Beyond these examples, Python can also be used for automated testing (e.g.,
pytest,
selenium), data analysis and visualization (
numpy,
pandas,
matplotlib), and machine learning (
scikit‑learn,
tensorflow,
nltk).
Overall, Python offers a versatile toolkit for operations engineers to increase efficiency, reduce manual errors, and expand their capabilities.
Efficient Ops
This public account is maintained by Xiaotianguo and friends, regularly publishing widely-read original technical articles. We focus on operations transformation and accompany you throughout your operations career, growing together happily.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.