Topic: monitoring
Collection size: 1674 articles
Efficient Ops
May 11, 2016 · Operations

How to Build an Automated Operations Platform: Insights from Tencent's Experience

This article shares Peng Lihang's practical insights on operations automation, covering the essential trio of configuration, state, and change management, the evolution of ops practices, platform design principles, and concrete steps for building scalable, business‑driven ops platforms.

DevOps, Platform, automation
0 likes · 24 min read
Efficient Ops
Mar 31, 2016 · Operations

Rethinking CMDB: Building Scalable, Automated Configuration Management for Modern Ops

This talk explores the challenges of building and maintaining a CMDB, proposes a goal‑driven, industry‑referenced modeling approach, and outlines practical steps such as tagging, relationship mapping, dynamic attributes, automation, and visualization to create a service‑oriented, scalable configuration management database.

CMDB, IT Operations, Infrastructure
0 likes · 11 min read
Efficient Ops
Mar 21, 2016 · Operations

How to Build a High‑Performance Unified Monitoring & Alerting Platform

This article outlines the design of a high‑performance, unified operations monitoring platform. It describes a six‑layer architecture, explains the roles of the data collection (Ganglia), data extraction, and alerting (Centreon) modules, and provides practical integration tips, deployment diagrams, and a Q&A for large‑scale environments.

Alerting, Centreon, Ganglia
0 likes · 24 min read
Efficient Ops
Feb 3, 2016 · Operations

Mastering Operations Automation: Strategies, Stages, and Common Pitfalls

This article explores the fundamentals of operations automation, outlines its three evolutionary stages, provides practical guidance for implementation, and highlights hidden risks and pitfalls that organizations must address to build reliable, secure, and scalable automation systems.

DevOps, Operations Automation, cloud
0 likes · 17 min read
Efficient Ops
Feb 2, 2016 · Operations

Unlocking Efficient Operations: 7 Secrets to Happy SysAdmins

This article explores why efficient operations are hard to achieve, identifies common pitfalls such as unclear responsibilities, communication gaps, and resource mismatches, and presents a practical framework—including clear roles, professional processes, and a good service interface—to help operations teams become more effective and satisfied.

automation, communication, efficiency
0 likes · 16 min read
Efficient Ops
Jan 28, 2016 · Operations

Unlocking Performance: Practical Strategies for Application and Architecture Optimization

This article explores the benefits and trade‑offs of performance optimization, outlines single‑application and structural optimization approaches, details bottleneck identification methods, common tuning techniques, and illustrates architectural evolution with diagrams to guide effective ops improvements.

Architecture, Performance Optimization, application scaling
0 likes · 6 min read
Efficient Ops
Jan 17, 2016 · Operations

From Telecom to Startup: A Veteran Ops Engineer Shares Career Lessons

Veteran operations engineer Wang Jinyin recounts his journey from telecom system development to leading ops teams at Tencent, YY, and UC, then founding Youwei, offering practical insights on standardization, automation, DevOps integration, and team building for modern IT operations.

DevOps, Infrastructure, automation
0 likes · 17 min read
Efficient Ops
Jan 8, 2016 · Operations

How Monitoring and Template Deployment Supercharge Automated Operations

This article explains why modern IT operations rely on monitoring-driven automation, template-based deployments, and containerized tools to dramatically improve efficiency, reduce manual effort, and pave the way toward intelligent, DevOps-enabled operational platforms.

DevOps, automation, monitoring
0 likes · 8 min read
Efficient Ops
Jan 4, 2016 · Operations

Unlocking the Power of Logs: From Basic Monitoring to Security Insights

This article explores how logs serve as vital diagnostic tools in operations, detailing their definition, practical uses for performance and security monitoring, step‑by‑step analysis techniques, and recommendations for free and commercial log analysis solutions.

ELK, SIEM, log analysis
0 likes · 14 min read
Efficient Ops
Nov 5, 2015 · Cloud Computing

Mastering Virtualization Ops: Monitoring, Disaster Recovery, and Cloud Choices

This article shares practical insights from a seasoned KVM specialist on how to monitor hardware, set up alerts, design disaster‑recovery strategies, choose optimal software and hardware, and evaluate public‑cloud providers when migrating workloads to a virtualized environment.

Cloud Migration, KVM, disaster recovery
0 likes · 8 min read
Efficient Ops
Oct 27, 2015 · Operations

How to Build a Practical Monitoring System for Small and Medium Enterprises

An in‑depth guide walks readers through building a comprehensive monitoring system for small‑to‑medium enterprises, covering hardware, system, application, network, security, traffic analysis, business metrics, log aggregation, automation, visualization, and practical integration with tools like Zabbix, IPMI, ELK, and Smokeping.

Zabbix, automation, log management
0 likes · 18 min read
Efficient Ops
Sep 21, 2015 · Operations

How OWL Redefines Enterprise Monitoring with Dynamic Alerts and Scalable Architecture

This article introduces OWL, a distributed, enterprise‑grade monitoring solution that combines infrastructure and business metrics, offers floating alert rules, customizable dashboards, visual asset management, a resilient Golang‑based agent, and a parallel‑scalable HBase storage backend.

Alerting, Big Data, Distributed Systems
0 likes · 12 min read
Efficient Ops
Jul 4, 2015 · Operations

From Xiaomi to a Trading Exchange: Real‑World Automation Ops Case Studies

This article presents two practical automation operations case studies—Xiaomi's three‑year journey to platform‑managed, self‑scheduling services and a trading exchange's step‑by‑step build from zero automation—highlighting standards, tooling, and cultural challenges for modern ops teams.

DevOps, automation, capacity scaling
0 likes · 9 min read
Efficient Ops
Jun 15, 2015 · Operations

How Dazhong Dianping Scaled Operations: Architecture, Automation, and Lessons Learned

This article summarizes the key insights from Dazhong Dianping's operations talk, covering team organization, multi‑datacenter architecture, comprehensive monitoring, automation workflows, configuration management tools, incident analysis systems, common pitfalls, and future directions such as PaaS and Docker adoption.

DevOps, Infrastructure, Platform
0 likes · 18 min read
Efficient Ops
Jun 1, 2015 · Operations

Unlock Real-Time Log Analysis with ELK: From Basics to Advanced Practices

This article explores how the ELK stack can transform large‑scale log processing into fast, flexible, and interactive analysis for troubleshooting, security auditing, and monitoring, sharing practical examples, common pitfalls, and best‑practice recommendations from real‑world deployments at Sina.

ELK, Elasticsearch, Kibana
0 likes · 13 min read
Linux Ops Smart Journey
Nov 12, 2024 · Databases

Master PostgreSQL Monitoring with Grafana: Step-by-Step Guide

Learn how to deploy postgres_exporter, configure PostgreSQL extensions, set up Prometheus scraping, and create Grafana dashboards for comprehensive PostgreSQL performance monitoring, complete with command-line instructions and tips for verifying data collection and visualizing metrics.

Database, Grafana, PostgreSQL
0 likes · 6 min read
Linux Ops Smart Journey
Oct 20, 2024 · Operations

Master Prometheus: Step-by-Step Deployment and Verification on Kubernetes

This guide walks you through the fundamentals of Prometheus, its architecture, and detailed Helm‑based deployment and validation steps on a Kubernetes cluster, enabling reliable monitoring for cloud‑native environments.

Prometheus, cloud native, helm
0 likes · 6 min read
macrozheng
Feb 21, 2025 · Backend Development

Boost SpringBoot Performance: Monitoring, Profiling, and Optimization Techniques

This guide walks through practical SpringBoot performance improvements, covering metric exposure with Prometheus, flame‑graph profiling via async‑profiler, distributed tracing with SkyWalking, HTTP and Tomcat tuning, and layer‑specific optimizations for controllers, services, and data access.

Java, Performance, SpringBoot
0 likes · 17 min read
macrozheng
Nov 25, 2021 · Operations

Master SkyWalking: End‑to‑End Guide for Distributed Tracing & Monitoring

This article introduces SkyWalking, a Chinese open‑source APM framework, compares it with Spring Cloud Sleuth+Zipkin, explains server and client setup, storage configuration, log collection, performance profiling, and alerting, providing step‑by‑step instructions, code snippets, and screenshots to help developers implement comprehensive distributed tracing.

APM, Distributed Tracing, Java
0 likes · 16 min read