Tagged articles
3281 articles
Page 11 of 33
Open Source Linux
Open Source Linux
Aug 28, 2023 · Operations

Detecting and Resolving Network Loops with Traffic Analysis

This article explains how a large internal network suffered severe slowdown and packet loss due to a routing loop, how traffic analysis revealed massive UDP2425 usage consuming 99% of bandwidth, and the step‑by‑step method used to identify and eliminate the loop.

OperationsTraffic analysisUDP
0 likes · 7 min read
Detecting and Resolving Network Loops with Traffic Analysis
Efficient Ops
Efficient Ops
Aug 27, 2023 · Operations

Why Do CMDB Projects Fail? Lessons from Tech Leaders, Developers, and Ops

This article compiles insights from technical leaders, developers, product managers, and operations staff on building effective CMDBs, highlighting common pitfalls, essential success factors, Huawei’s experience, and practical lessons for improving data modeling, automation, and cross‑team collaboration.

CMDBDevOpsIT Service Management
0 likes · 7 min read
Why Do CMDB Projects Fail? Lessons from Tech Leaders, Developers, and Ops
Liangxu Linux
Liangxu Linux
Aug 27, 2023 · Operations

Essential Linux Commands: uname, hostname, dmesg, du, date, echo and More

This guide presents a concise reference of common Linux command‑line utilities—including uname, hostname, dmesg, stat, du, date, echo, watch, which, whereis, locate and updatedb—showing their typical options, usage examples and visual output to help users quickly retrieve system information, manage files and monitor processes.

LinuxOperationsShell
0 likes · 10 min read
Essential Linux Commands: uname, hostname, dmesg, du, date, echo and More
Top Architect
Top Architect
Aug 24, 2023 · Operations

Blue‑Green, Rolling, and Canary Deployment Strategies Overview

This article explains three common software release strategies—blue‑green deployment, rolling deployment, and canary (gray) deployment—detailing their principles, advantages, potential pitfalls, and practical considerations, while also contrasting them with A/B testing and noting related operational concerns.

A/B testingBlue-GreenCanary
0 likes · 12 min read
Blue‑Green, Rolling, and Canary Deployment Strategies Overview
Efficient Ops
Efficient Ops
Aug 23, 2023 · Operations

How to Diagnose High Load with Low CPU on Linux: Tools & Tips

This guide explains how to analyze Linux load situations—whether CPU and load are both high or CPU is low while load remains high—by using commands like top, vmstat, iostat, sar, and jstack, and provides practical troubleshooting steps for common I/O‑related issues.

CPULoadOperations
0 likes · 11 min read
How to Diagnose High Load with Low CPU on Linux: Tools & Tips
MaGe Linux Operations
MaGe Linux Operations
Aug 23, 2023 · Operations

Master Logwatch: Install and Automate Linux Log Analysis on CentOS

This guide explains why log analysis is essential for Linux system health, walks through installing Logwatch on CentOS, configuring its core settings, automating daily runs via cron, and interpreting sample output for connections, SSH activity, package installs, and disk usage.

CentOSOperationsSystem Administration
0 likes · 8 min read
Master Logwatch: Install and Automate Linux Log Analysis on CentOS
dbaplus Community
dbaplus Community
Aug 22, 2023 · Operations

Designing a Multi‑Cloud Intelligent Monitoring Platform at Huolala: Architecture, Practices, and Future Directions

This article details Huolala's one‑stop monitoring platform called Monitor, covering its multi‑cloud architecture, data collection pipelines, real‑time business monitoring, unified alarm handling, and future AI‑driven enhancements, while sharing concrete metrics, incident case studies, and practical implementation steps for large‑scale observability.

GPTOperationscloud-native
0 likes · 19 min read
Designing a Multi‑Cloud Intelligent Monitoring Platform at Huolala: Architecture, Practices, and Future Directions
JD Retail Technology
JD Retail Technology
Aug 22, 2023 · Operations

Mastering JDK8 GC: Practical Tuning Guide for High‑Performance Applications

This guide summarizes JD.com’s extensive JDK8 garbage‑collector optimization experience, covering version requirements, how to select the right GC, core and collector‑specific parameter settings, logging configuration, and concrete methods to assess GC health for latency‑sensitive and throughput‑oriented workloads.

BackendGC tuningGarbage Collection
0 likes · 5 min read
Mastering JDK8 GC: Practical Tuning Guide for High‑Performance Applications
Huolala Tech
Huolala Tech
Aug 22, 2023 · Operations

How HuoLala Built a Resilient Fault‑Drill Platform to Boost System Reliability

Facing growing microservice complexity, HuoLala designed a comprehensive fault‑drill system—covering management, tooling, and operations—to simulate failures, control blast radius, automate scenarios, and continuously improve resilience, ultimately reducing downtime and enhancing system stability across more than ten business units.

Fault InjectionMicroservicesOperations
0 likes · 12 min read
How HuoLala Built a Resilient Fault‑Drill Platform to Boost System Reliability
Huolala Tech
Huolala Tech
Aug 18, 2023 · Operations

Beyond System Metrics: Building Effective Business Monitoring for Pricing Services

Facing unpredictable software behavior, the article explains why traditional system‑level monitoring often misses critical business issues, especially in complex pricing services, and presents a comprehensive approach that combines result (black‑box) and process (white‑box) monitoring, practical metrics, and actionable recommendations to improve observability and reduce operational risk.

Operationsbusiness metricsmonitoring
0 likes · 14 min read
Beyond System Metrics: Building Effective Business Monitoring for Pricing Services
Alibaba Cloud Developer
Alibaba Cloud Developer
Aug 15, 2023 · Operations

What Is Technical Operations? Insights and Best Practices from an Alibaba Front‑End Veteran

This article explores the emerging role of technical operations for developer‑focused companies, sharing practical frameworks for organizing tech communities, driving content engagement, building technical brands, and defining the core abilities of effective tech ops, all illustrated with real examples from Alibaba.

Operationsbrand buildingcontent strategy
0 likes · 11 min read
What Is Technical Operations? Insights and Best Practices from an Alibaba Front‑End Veteran
dbaplus Community
dbaplus Community
Aug 14, 2023 · Operations

Designing Business‑Focused Monitoring for Banking Systems: Metrics, Alerts, and Implementation Challenges

The article outlines a practical framework for business‑level monitoring in banking systems, describing three evolution stages, key metrics such as transaction success rates and volume spikes, concrete alert rules, and the technical challenges of data collection, standardization, and massive parameter management.

AlertingMetricsOperations
0 likes · 14 min read
Designing Business‑Focused Monitoring for Banking Systems: Metrics, Alerts, and Implementation Challenges
58UXD
58UXD
Aug 11, 2023 · Product Management

How 58租房’s “Housekeeper Talk” Boosted Video Tours and Conversion Rates

This article examines how 58租房 introduced the “Housekeeper Talk” video‑tour product, detailing the advantages of video viewings, the end‑to‑end workflow for creators, the UI enhancements for renters, and the measurable impact on user engagement and connection rates.

OperationsProduct DesignUser experience
0 likes · 8 min read
How 58租房’s “Housekeeper Talk” Boosted Video Tours and Conversion Rates
FunTester
FunTester
Aug 11, 2023 · Operations

Essential Performance Testing Best Practices Every Engineer Should Follow

Performance testing is crucial for ensuring software reliability, and this guide outlines essential best practices—including setting clear goals, selecting appropriate tools, crafting maintainable scripts, using realistic data, running long‑duration loads, and scheduling regular tests—to help engineers achieve stable, high‑performing applications.

Load TestingOperationsPerformance Testing
0 likes · 8 min read
Essential Performance Testing Best Practices Every Engineer Should Follow
Efficient Ops
Efficient Ops
Aug 10, 2023 · Operations

How China’s Telecom Giants Accelerate IT Efficiency with the DevOps Maturity Model

This article explains how the CAICT‑led DevOps Capability Maturity Model guides major Chinese telecom operators—China Mobile, China Telecom, China Unicom, ZTE, Huawei and China Tower—through 31 assessments, showcasing project‑level improvements, integration of resources, and measurable gains in delivery speed, quality and operational efficiency across the industry.

Continuous DeliveryDevOpsIT efficiency
0 likes · 15 min read
How China’s Telecom Giants Accelerate IT Efficiency with the DevOps Maturity Model
Efficient Ops
Efficient Ops
Aug 10, 2023 · Operations

How Chinese Banks Accelerate Digital Transformation with DevOps Maturity Models

Amid a nationwide digital transformation push, leading Chinese banks adopt the CAICT‑led DevOps Capability Maturity Model, using standardized assessments to improve IT efficiency, integrate resources, and support business systems, with detailed case studies from nine city‑commercial and other financial institutions illustrating measurable gains.

BankingDevOpsDigital Transformation
0 likes · 15 min read
How Chinese Banks Accelerate Digital Transformation with DevOps Maturity Models
Architect
Architect
Aug 10, 2023 · Operations

Capacity Management: Goals, Stages, Optimization Techniques, and Scaling Practices

The article explains how capacity management balances cost control and service quality through defined goals, three development stages, detailed resource optimization methods, stress‑testing metrics and standards, and automated scaling to achieve significant cost reductions while maintaining system stability.

OperationsPerformance TestingResource Optimization
0 likes · 10 min read
Capacity Management: Goals, Stages, Optimization Techniques, and Scaling Practices
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Aug 10, 2023 · Operations

How Kubernetes Powers Modern DevOps Automation and Operations

By integrating Kubernetes with DevOps practices, teams can automate deployment pipelines, achieve dynamic resource allocation, centralize monitoring with tools like Prometheus and Grafana, and treat infrastructure as code, resulting in faster, higher-quality software delivery and improved collaboration between development and operations.

DevOpsInfrastructure as CodeKubernetes
0 likes · 7 min read
How Kubernetes Powers Modern DevOps Automation and Operations
Efficient Ops
Efficient Ops
Aug 9, 2023 · Operations

How China’s Leading Banks Boost Digital Transformation with DevOps Maturity

This article examines how major Chinese state‑owned banks adopted the CAICT‑led DevOps Capability Maturity Model, detailing assessment data, case studies of continuous delivery, security, and system‑tool standards, and the measurable improvements in efficiency, risk reduction, and business agility achieved across dozens of projects.

BankingDevOpsDigital Transformation
0 likes · 19 min read
How China’s Leading Banks Boost Digital Transformation with DevOps Maturity
Efficient Ops
Efficient Ops
Aug 9, 2023 · Operations

How Leading Chinese Banks Accelerate IT Efficiency with DevOps Maturity Assessments

This article reviews how seven major Chinese joint‑stock banks adopted the CAICT DevOps Capability Maturity Model, detailing their evaluation results, project implementations, and the operational improvements achieved across continuous delivery, technical operations, security, and BizDevOps practices.

BankingDevOpsDigital Transformation
0 likes · 18 min read
How Leading Chinese Banks Accelerate IT Efficiency with DevOps Maturity Assessments
Efficient Ops
Efficient Ops
Aug 8, 2023 · Operations

How a One‑Stop DevOps Platform Helped China Pacific Insurance Pass Level‑3 Continuous Delivery Assessment

At the 2023 DOIS DevOps International Summit, China Pacific Insurance leveraged a one‑stop R&D efficiency platform to successfully achieve Level‑3 continuous delivery certification for four projects, illustrating how standardized processes, integrated toolchains, and end‑to‑end metrics can accelerate digital transformation in the insurance industry.

Continuous DeliveryDevOpsDigital Transformation
0 likes · 13 min read
How a One‑Stop DevOps Platform Helped China Pacific Insurance Pass Level‑3 Continuous Delivery Assessment
Liangxu Linux
Liangxu Linux
Aug 6, 2023 · Cloud Native

Unlock Hidden kubectl Tricks: Advanced Commands for Kubernetes Mastery

This guide presents a collection of advanced kubectl techniques—including printing API details, filtering and deleting pods by status, counting pods per node, analyzing pod distribution across machines, and leveraging kubectl proxy—providing practical command examples and explanations for experienced Kubernetes users.

CLIDevOpsKubernetes
0 likes · 8 min read
Unlock Hidden kubectl Tricks: Advanced Commands for Kubernetes Mastery
Zuoyebang Tech Team
Zuoyebang Tech Team
Jul 28, 2023 · Operations

How Ops Teams Can Thrive in the Cloud‑Native Era: Strategies and Lessons

This article explores how the rise of cloud‑native technologies forces traditional operations to transform into service‑oriented platforms, detailing new organizational structures, the OPaS model, onion‑layered migration, practical steps, and key lessons for successful ops modernization.

DevOpsOperationsTransformation
0 likes · 19 min read
How Ops Teams Can Thrive in the Cloud‑Native Era: Strategies and Lessons
Programmer DD
Programmer DD
Jul 28, 2023 · Artificial Intelligence

How Shopify’s AI Gamble Triggered Massive Layoffs and Customer Chaos

Shopify’s rapid adoption of ChatGPT‑powered assistants and the later launch of the Sidekick AI tool led to aggressive cost‑cutting, a 20% workforce reduction, deteriorating customer support, rising fraud risks, and widespread criticism from merchants and insiders.

AILayoffsOperations
0 likes · 9 min read
How Shopify’s AI Gamble Triggered Massive Layoffs and Customer Chaos
MaGe Linux Operations
MaGe Linux Operations
Jul 25, 2023 · Operations

Master Linux Process, User Queries and System Hardening with Bash

This guide provides Bash scripts to filter process details by PID or name, retrieve comprehensive user information, and apply a series of system hardening configurations—including password policies, login restrictions, and file attribute locks—to improve Linux server security and manageability.

BashOperationsSystem Hardening
0 likes · 11 min read
Master Linux Process, User Queries and System Hardening with Bash
Tech Architecture Stories
Tech Architecture Stories
Jul 23, 2023 · Operations

Why Every Backend Engineer Should Read Google’s SRE Handbook

The article recommends two essential Google SRE books for backend developers, explains what SRE is, how it differs from traditional operations, and shows how the concepts like SLI/SLO, incident postmortems, and reliability engineering can be applied to improve system availability and stability.

OperationsSRESite Reliability Engineering
0 likes · 4 min read
Why Every Backend Engineer Should Read Google’s SRE Handbook
Architecture and Beyond
Architecture and Beyond
Jul 22, 2023 · Operations

Mastering Production Change Management: Prevent Outages with Proven Processes

This article analyzes high‑profile service outages, defines the production environment and its components, categorizes five types of production changes, and presents a comprehensive change‑management framework—including organizational roles, step‑by‑step procedures, and best‑practice tips—to help teams reduce risk and maintain system stability.

DevOpsOperationschange management
0 likes · 15 min read
Mastering Production Change Management: Prevent Outages with Proven Processes
Efficient Ops
Efficient Ops
Jul 19, 2023 · Operations

How China’s Bank of Communications Achieved Top‑Tier DevOps Maturity

The article details the Bank of Communications' participation in the 2023 XOps Forum, its successful DevOps continuous‑delivery level‑3 assessment for two projects, the resulting efficiency gains, metrics, and future plans for expanding DevOps practices across the organization.

BankingContinuous DeliveryDevOps
0 likes · 12 min read
How China’s Bank of Communications Achieved Top‑Tier DevOps Maturity
MaGe Linux Operations
MaGe Linux Operations
Jul 19, 2023 · Operations

Master Linux System Monitoring: Top, vmstat, pidstat, iostat & More

This guide explains essential Linux monitoring tools—top, vmstat, pidstat, iostat, netstat, sar, and tcpdump—detailing the metrics they expose, how to interpret CPU, memory, disk, and network statistics, and practical command examples for effective server performance troubleshooting.

LinuxOperationsperformance
0 likes · 17 min read
Master Linux System Monitoring: Top, vmstat, pidstat, iostat & More
21CTO
21CTO
Jul 19, 2023 · Operations

Why Unnecessary Meetings Drain Developers' Productivity and How to Fix It

The article examines recent Stack Overflow research and academic studies revealing that a large share of developers consider many meetings unnecessary, explains the clash between Maker's and Manager's schedules, and offers practical ways to reduce meeting waste and boost productivity.

MeetingsOperationsmaker schedule
0 likes · 9 min read
Why Unnecessary Meetings Drain Developers' Productivity and How to Fix It
21CTO
21CTO
Jul 19, 2023 · Operations

Scaling a Fast‑Growing Supply Chain Platform: Architecture and Ops Insights

This article details how a rapidly expanding B2B fresh‑food company restructured its R&D organization, adopted a matrix management model, and built a comprehensive distributed infrastructure—including task scheduling, service discovery, messaging, logging, file storage, CDN, configuration, sharding, search, caching, and monitoring—to support nationwide warehouse operations and future growth.

DevOpsDistributed SystemsOperations
0 likes · 7 min read
Scaling a Fast‑Growing Supply Chain Platform: Architecture and Ops Insights
Efficient Ops
Efficient Ops
Jul 16, 2023 · Operations

Mastering ELK: Deploy Architectures, Multiline Logs, and Kibana Tips

This guide explains the three main ELK deployment architectures, compares Logstash and Filebeat collectors, introduces a cache‑queue option for high‑volume logs, and provides practical solutions for multiline log merging, timestamp correction, and module‑level filtering in Kibana, helping operations teams build efficient log pipelines.

ELKElasticsearchFilebeat
0 likes · 10 min read
Mastering ELK: Deploy Architectures, Multiline Logs, and Kibana Tips
Ziru Technology
Ziru Technology
Jul 13, 2023 · Operations

How to Design Effective Performance Test Scenarios with JMeter

This article explains why website performance directly impacts business goals, outlines a four‑scenario testing framework (baseline, capacity, stability, and exception), and provides practical steps for environment setup, data preparation, parameterization, and execution using JMeter.

JMeterLoad TestingOperations
0 likes · 13 min read
How to Design Effective Performance Test Scenarios with JMeter
Huolala Tech
Huolala Tech
Jul 13, 2023 · Operations

How HuoLaLa Built a 0‑to‑1 Stability Metric System in 2 Years

This article explains how HuoLaLa’s stability team tackled the challenge of proving their work’s value by designing and implementing a comprehensive stability metric system from scratch, detailing the motivations, principles, step‑by‑step construction, data platform, cultural adoption, measurable results, and future plans.

Data-drivenMetricsOperations
0 likes · 18 min read
How HuoLaLa Built a 0‑to‑1 Stability Metric System in 2 Years
FunTester
FunTester
Jul 13, 2023 · Industry Insights

How HuoLala Built a 0‑to‑1 Stability Metric System and Cut Faults by 78%

In this detailed case study, HuoLala's stability leader shares how a two‑year, zero‑to‑one stability metric framework was designed, implemented, and iterated—covering the why, the pain points, the metric definition process, data collection platform, cultural adoption, and the resulting 78% fault reduction and SLA improvement from three to four nines.

OperationsPerformance Monitoringcase study
0 likes · 18 min read
How HuoLala Built a 0‑to‑1 Stability Metric System and Cut Faults by 78%
Qunar Tech Salon
Qunar Tech Salon
Jul 12, 2023 · Operations

Design and Implementation of Qunar's Root Cause Analysis System for Microservice Fault Diagnosis

This article describes Qunar's comprehensive root cause analysis platform, detailing its background, data-driven fault categorization, architecture—including trace, runtime, middleware, and event analysis modules—and demonstrates its high accuracy and practical impact on reducing incident resolution times across microservice services.

DevOpsMicroservicesOperations
0 likes · 20 min read
Design and Implementation of Qunar's Root Cause Analysis System for Microservice Fault Diagnosis
IT Services Circle
IT Services Circle
Jul 7, 2023 · Operations

Implementing Gray Release with Nginx, Docker, and NestJS

This guide explains how to set up a gray‑release (canary) deployment using Nginx as a reverse‑proxy gateway, Docker containers for isolation, and two versions of a NestJS service, with traffic split controlled by cookies and configurable percentages.

AB testingNginxOperations
0 likes · 8 min read
Implementing Gray Release with Nginx, Docker, and NestJS
JD Tech
JD Tech
Jul 7, 2023 · Operations

Practical Applications of Shell Scripting for Test Development and Automation

This article explores common pain points in test development, demonstrates how Shell scripting can automate repetitive Linux tasks, introduces basic commands like copy and concatenate, and presents real‑world case studies such as auto‑comment generation, memory‑usage monitoring, service management, and function encapsulation to boost productivity.

DevOpsLinuxOperations
0 likes · 11 min read
Practical Applications of Shell Scripting for Test Development and Automation
Open Source Linux
Open Source Linux
Jul 4, 2023 · Operations

Master Redis Monitoring, Migration, and Cluster Management with Prometheus and CacheCloud

This guide walks through essential Redis operations, covering real‑time monitoring with the INFO command and Prometheus‑compatible exporters, data migration using Redis‑shake, consistency verification via Redis‑full‑check, and comprehensive cluster management with CacheCloud, providing practical tools for reliable Redis administration.

Data MigrationOperationsPrometheus
0 likes · 11 min read
Master Redis Monitoring, Migration, and Cluster Management with Prometheus and CacheCloud
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jul 3, 2023 · Operations

How GitOps Revolutionizes Cloud‑Native Deployments: Lessons from Horizon CD

This article examines the shortcomings of traditional host‑based deployments, explains the GitOps methodology with declarative configuration and automation tools, and details a real‑world implementation at NetEase Cloud Music using Horizon CD, Argo CD, and Helm to achieve scalable, reliable, and version‑controlled cloud‑native releases.

ArgoCDCloud NativeGitOps
0 likes · 19 min read
How GitOps Revolutionizes Cloud‑Native Deployments: Lessons from Horizon CD
21CTO
21CTO
Jun 30, 2023 · Information Security

How WeChat’s Security Data Warehouse Powers Billions of Daily Feature Reads

This article explains the origins, evolution, and current architecture of WeChat’s security data warehouse, detailing its unified feature storage, data quality guarantees, multi‑IDC synchronization, and operational system that streamlines feature management, analysis, and deployment to support the platform’s massive security strategy.

Big DataFeature ManagementOperations
0 likes · 15 min read
How WeChat’s Security Data Warehouse Powers Billions of Daily Feature Reads
MaGe Linux Operations
MaGe Linux Operations
Jun 30, 2023 · Operations

What Went Wrong When Vipshop Crashed? Lessons on High‑Concurrency Failures

The article examines the March 29 Vipshop data‑center outage that caused over a billion‑yuan loss, explains the cooling‑system failure that triggered a 12‑hour P0 incident, discusses its impact on Tencent services, and analyzes why high‑concurrency crashes remain common, offering availability tier insights and mitigation strategies.

AvailabilityOperationshigh concurrency
0 likes · 7 min read
What Went Wrong When Vipshop Crashed? Lessons on High‑Concurrency Failures
Efficient Ops
Efficient Ops
Jun 29, 2023 · Operations

China Life (Overseas) Boosts DevOps Maturity: OnePartner Platform Success

China Life (Overseas) detailed how its OnePartner insurance marketing platform achieved advanced DevOps continuous‑delivery maturity through CAICT assessment, highlighting the benefits, challenges, and future plans of standardised, tool‑enabled digital transformation for the insurance industry.

Continuous DeliveryDevOpsDigital Transformation
0 likes · 12 min read
China Life (Overseas) Boosts DevOps Maturity: OnePartner Platform Success
Efficient Ops
Efficient Ops
Jun 29, 2023 · Operations

How ICBC Trust Achieved Leading DevOps Maturity: A 3‑Level Continuous Delivery Success

ICBC Trust Fund Management’s Transaction Management Platform passed the CAICT DevOps Continuous Delivery Level 3 assessment, showcasing how standardized DevOps practices, tool empowerment, and cultural change dramatically cut build times, accelerate releases, and boost overall digital transformation efficiency.

Continuous DeliveryDevOpsMaturity Model
0 likes · 14 min read
How ICBC Trust Achieved Leading DevOps Maturity: A 3‑Level Continuous Delivery Success
Efficient Ops
Efficient Ops
Jun 29, 2023 · Operations

How China’s CFFEX Tech Company Achieved Top‑Tier DevOps Continuous Delivery Rating

China Information Communication Research Institute announced that Shanghai Financial Futures Information Technology Co., the tech subsidiary of the China Financial Futures Exchange, passed the DevOps Continuous Delivery Level 3 assessment, marking the first such achievement among domestic securities and futures exchanges and showcasing how standardized DevOps practices can boost digital transformation, quality, and efficiency.

Continuous DeliveryDevOpsDigital Transformation
0 likes · 15 min read
How China’s CFFEX Tech Company Achieved Top‑Tier DevOps Continuous Delivery Rating
Efficient Ops
Efficient Ops
Jun 29, 2023 · Operations

What Do the Latest DevOps Maturity Assessments Reveal About Chinese Enterprises?

The China Academy of Information and Communications Technology released the newest DevOps Capability Maturity Model evaluation results, showing that 78 leading firms across banking, finance, internet, and telecom sectors have collectively completed 224 projects, highlighting the impact of standardization and tool empowerment on enterprise competitiveness.

ChinaDevOpsEnterprise
0 likes · 5 min read
What Do the Latest DevOps Maturity Assessments Reveal About Chinese Enterprises?
Efficient Ops
Efficient Ops
Jun 29, 2023 · Operations

What Do the Latest DevOps Maturity Model Results Reveal About Enterprise Adoption?

The June 2023 release by China’s Academy of Information and Communications Technology details how 78 leading firms across banking, finance, telecom and internet sectors have passed the DevOps Capability Maturity Model assessments, highlighting the impact of standardized pipelines, tool empowerment and industry‑wide adoption on quality, efficiency and competitiveness.

Capability Maturity ModelDevOpsEnterprise Assessment
0 likes · 6 min read
What Do the Latest DevOps Maturity Model Results Reveal About Enterprise Adoption?
Efficient Ops
Efficient Ops
Jun 29, 2023 · Operations

How China Life (Overseas) Reached Advanced DevOps Maturity and Boosted Digital Transformation

China Life (Overseas) passed the CAICT DevOps continuous delivery Level 2 assessment, showcasing how standardized DevOps practices and a one‑stop insurance marketing platform dramatically improved development efficiency, quality, and market competitiveness while highlighting challenges, outcomes, and future plans.

Continuous DeliveryDevOpsDigital Transformation
0 likes · 11 min read
How China Life (Overseas) Reached Advanced DevOps Maturity and Boosted Digital Transformation
Architects' Tech Alliance
Architects' Tech Alliance
Jun 26, 2023 · Fundamentals

Understanding Linux Ext Filesystems, RAID, and LVM

This article explains the structure of Linux Ext (2/3/4) filesystems, detailing superblocks, inode tables and data blocks, then describes block groups, the role of superblocks, and outlines the differences between hardware and software RAID as well as the principles and risks of using LVM for flexible storage management.

FilesystemLVMLinux
0 likes · 5 min read
Understanding Linux Ext Filesystems, RAID, and LVM
Programmer DD
Programmer DD
Jun 26, 2023 · Operations

What’s New in Grafana 10? Explore Correlations, Scenes, and Powerful New Panels

Grafana 10 introduces a suite of enhancements—including Correlations for cross‑data‑source linking, the Scenes front‑end library for building stunning dashboards, new Canvas, Trends, and Datagrid panels, CSV drag‑and‑drop support, sub‑folder organization, and improved data‑source selection—aimed at boosting analysis, collaboration, and efficiency for monitoring teams.

DashboardGrafanaNew Features
0 likes · 7 min read
What’s New in Grafana 10? Explore Correlations, Scenes, and Powerful New Panels
Efficient Ops
Efficient Ops
Jun 25, 2023 · Operations

How to Build a Next‑Gen “Big Operations” System for Reliability and Observability

This article outlines the evolution from manual operations to DevOps and SRE‑driven “big operations,” detailing system reliability and continuity practices, observability concepts, and the development of AIOps maturity standards, offering a comprehensive guide for building stable, efficient, and secure operational frameworks.

DevOpsOperationsSRE
0 likes · 14 min read
How to Build a Next‑Gen “Big Operations” System for Reliability and Observability
Ops Development Stories
Ops Development Stories
Jun 22, 2023 · Operations

How to Write an Ops Resume That Actually Gets You Interviews

The article examines three common resume pitfalls for operations candidates—unclear focus, breadth without depth, and vague personal planning—and offers concrete strategies to highlight strengths, showcase impactful projects, and present a clear career trajectory to attract interview opportunities.

Operationscareer advicejob interview
0 likes · 7 min read
How to Write an Ops Resume That Actually Gets You Interviews
Efficient Ops
Efficient Ops
Jun 20, 2023 · Operations

Mastering SRE: How Error Budgets and SLOs Drive System Reliability

This article explains the fundamentals of Site Reliability Engineering, detailing how SRE combines development and operations to improve stability through metrics like MTBF and MTTR, the roles of SLI/SLO, the VALET selection method, and the practical use of error budgets for quantifying work and guiding alerts.

Error BudgetMTBFOperations
0 likes · 14 min read
Mastering SRE: How Error Budgets and SLOs Drive System Reliability
JD Cloud Developers
JD Cloud Developers
Jun 14, 2023 · Operations

How to Ensure System Stability During Mega Sales Events like 618

This article examines the technical and operational challenges of the 618 shopping festival, presenting data‑driven insights and detailed strategies—including modular deployment, monitoring, logging, fast‑failure, rate limiting, database and cache optimizations, and emergency response plans—to help teams maintain system stability under massive traffic spikes.

OperationsScalabilitylarge‑scale promotion
0 likes · 13 min read
How to Ensure System Stability During Mega Sales Events like 618
DevOps
DevOps
Jun 13, 2023 · Operations

Why DevOps Is Not Dead: The Rise of Platform Engineering and Its Impact on Modern Operations

The article argues that DevOps is still alive, explains the shortcomings of isolated operational practices, introduces platform engineering as the next evolution, and discusses practical considerations such as third‑party software selection, cloud‑native adoption, and the role of internal developer platforms in improving organizational efficiency.

Cloud NativeDevOpsInfrastructure
0 likes · 10 min read
Why DevOps Is Not Dead: The Rise of Platform Engineering and Its Impact on Modern Operations
Architecture and Beyond
Architecture and Beyond
Jun 10, 2023 · Operations

What Is Systemic Risk in Technology and How to Manage It Effectively

The article explains the concept of systemic risk in both economics and technology, compares it with non‑systemic risk, describes how it propagates, lists common sources, outlines its impact on technical teams and business value, and provides a step‑by‑step framework for modeling, identifying, and governing such risks.

Operationsgovernancerisk assessment
0 likes · 23 min read
What Is Systemic Risk in Technology and How to Manage It Effectively
Architecture & Thinking
Architecture & Thinking
Jun 9, 2023 · Backend Development

Why Do Message Queues Get Backlogged and How to Fix It Fast?

This article examines why message queues become backlogged—covering producer overload, broker persistence failures, and consumer bottlenecks—and outlines a step‑by‑step scaling and remediation strategy to restore smooth processing, including temporary queue expansion, load‑balanced forwarding, and post‑recovery cleanup.

BacklogOperationsscaling
0 likes · 6 min read
Why Do Message Queues Get Backlogged and How to Fix It Fast?
Qunar Tech Salon
Qunar Tech Salon
Jun 8, 2023 · Operations

System Complexity Modeling and Anti‑Corruption Governance at Qunar

This article describes how Qunar's technology center defined, measured, and managed system complexity through a custom modeling framework, implemented a dashboard for continuous monitoring, and established an anti‑corruption governance process that limits complexity growth to maintain low maintenance costs across hundreds of applications and systems.

OperationsQunarSoftware Architecture
0 likes · 14 min read
System Complexity Modeling and Anti‑Corruption Governance at Qunar
JD Cloud Developers
JD Cloud Developers
Jun 6, 2023 · Operations

How openKylin’s Community Board Drove Open‑Source Growth and Governance

The second openKylin community board meeting in Beijing detailed governance rules, controlled open‑source initiatives, open‑build infrastructure, innovation projects, ecosystem expansion, and the nomination of new board members, highlighting the community’s rapid growth, extensive SIG groups, and strategic plans for future development.

LinuxOpenKylinOperations
0 likes · 7 min read
How openKylin’s Community Board Drove Open‑Source Growth and Governance
Tongcheng Travel Technology Center
Tongcheng Travel Technology Center
Jun 6, 2023 · Operations

Root Cause Analysis and GC Parameter Optimization for Elasticsearch OOM Issues in the Membership Service

This article details a comprehensive investigation of an out‑of‑memory crash in a critical Elasticsearch cluster, explains how GC logs and heap dumps revealed a to‑space‑exhausted condition, and describes the G1GC tuning parameters that eliminated the nightly spikes and stabilized performance.

BackendElasticsearchOOM
0 likes · 9 min read
Root Cause Analysis and GC Parameter Optimization for Elasticsearch OOM Issues in the Membership Service
dbaplus Community
dbaplus Community
Jun 5, 2023 · Operations

Mastering Production Faults: Diagnose and Fix Network, Server, Database Issues

This guide outlines the most common production failures—including network, server, database, software, security, storage, configuration, and third‑party service issues—and provides step‑by‑step methods to detect, troubleshoot, and resolve each problem, helping maintain system stability and reliability.

OperationsServerdatabase
0 likes · 30 min read
Mastering Production Faults: Diagnose and Fix Network, Server, Database Issues
Open Source Linux
Open Source Linux
May 30, 2023 · Operations

Essential Linux Ops Interview Questions & Answers for Sysadmins

A comprehensive collection of Linux operations interview questions covering topics such as system administration, RAID configurations, load balancing, middleware, MySQL troubleshooting, network monitoring, security, scripting, and best practices for optimizing and maintaining Linux servers.

LinuxLoadBalancingNetworking
0 likes · 38 min read
Essential Linux Ops Interview Questions & Answers for Sysadmins
FunTester
FunTester
May 30, 2023 · Operations

Software Performance Testing: Process, Tools, and Required Skills

The article explains why software performance testing is essential, outlines a comprehensive testing workflow, reviews popular load‑testing tools, offers guidance on selecting the right tool, and lists the technical and analytical skills needed to become an effective performance testing engineer.

Load TestingOperationsPerformance Testing
0 likes · 13 min read
Software Performance Testing: Process, Tools, and Required Skills
dbaplus Community
dbaplus Community
May 29, 2023 · Operations

How Bilibili Built a High‑Availability Multi‑Active Architecture for SRE

This article details Bilibili's SRE team's design and implementation of a high‑availability multi‑active architecture, covering zone types, same‑city and cross‑region deployments, traffic routing, cache consistency, message handling, governance, and practical lessons learned from real‑world incidents.

BilibiliOperationsSRE
0 likes · 20 min read
How Bilibili Built a High‑Availability Multi‑Active Architecture for SRE
Data Thinking Notes
Data Thinking Notes
May 28, 2023 · Operations

Why Do State‑Owned Enterprises Struggle with Digital Transformation? Key Challenges and Solutions

This analysis examines why Chinese state‑owned enterprises face unclear digital‑transformation goals, weak strategic positioning, fragmented data, talent shortages, and inadequate technology ecosystems, and it outlines the root causes, typical case studies, and recommended actions to achieve effective digital change.

Data GovernanceDigital TransformationOperations
0 likes · 16 min read
Why Do State‑Owned Enterprises Struggle with Digital Transformation? Key Challenges and Solutions
Efficient Ops
Efficient Ops
May 28, 2023 · Operations

Essential Linux Ops Tools: Install & Use Nethogs, IOZone, IOTop and More

A concise guide for Linux administrators that introduces thirteen practical monitoring and security tools—ranging from network bandwidth trackers like Nethogs to vulnerability scanners like NMap—complete with installation steps, usage examples, and key configuration tips.

Operationsnetwork-tools
0 likes · 12 min read
Essential Linux Ops Tools: Install & Use Nethogs, IOZone, IOTop and More
360 Tech Engineering
360 Tech Engineering
May 23, 2023 · Operations

Data‑Driven Growth: Underlying Logic, Case Studies, and Essential Factors

The article explains how data‑driven thinking replaces traditional money‑burning growth tactics by establishing logical loops, experimental validation, and concrete case studies in acquisition, activation, and targeting, while outlining the essential collaborative factors needed for successful data‑powered operations.

AnalyticsData-drivenGrowth
0 likes · 10 min read
Data‑Driven Growth: Underlying Logic, Case Studies, and Essential Factors
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
May 23, 2023 · Information Security

How to Seamlessly Migrate and Validate Anti‑Cheat Services Across Environments

This article details the end‑to‑end process of migrating an anti‑cheat service to a new data center, verifying strategy effectiveness, building a real‑sample regression pipeline, and automating integration steps using GoAPI and traffic‑comparison platforms to ensure functional consistency and security.

Operationsanti-cheatenvironment migration
0 likes · 9 min read
How to Seamlessly Migrate and Validate Anti‑Cheat Services Across Environments
Tencent Cloud Developer
Tencent Cloud Developer
May 22, 2023 · Artificial Intelligence

Application of AI Large Language Models in the Full Software Development Lifecycle

The article shows how AI large‑language models such as ChatGPT can support every stage of the software development lifecycle—from extracting requirements and designing solutions to generating code, tests, deployment scripts, and operational diagnostics—while warning about model inaccuracies, hallucinations, intellectual‑property and privacy risks.

AIChatGPTDeployment
0 likes · 8 min read
Application of AI Large Language Models in the Full Software Development Lifecycle
Efficient Ops
Efficient Ops
May 21, 2023 · Operations

From Apollo to Google: How Margaret Hamilton Shaped Modern SRE

This article traces the origins of Site Reliability Engineering from Margaret Hamilton’s pioneering work on the Apollo program, through Google’s formal SRE team creation, and highlights the key differences between SRE and traditional operations practices.

GoogleMargaret HamiltonOperations
0 likes · 7 min read
From Apollo to Google: How Margaret Hamilton Shaped Modern SRE
Wukong Talks Architecture
Wukong Talks Architecture
May 17, 2023 · Operations

Common Production Faults and Their Handling Guide

This guide outlines the most common production failures—including network, server, database, software, security, storage, configuration, and third‑party service issues—and provides detailed steps for detecting, diagnosing, and resolving each type to maintain system stability and reliability.

Operationsfault handlingproduction
0 likes · 30 min read
Common Production Faults and Their Handling Guide
php Courses
php Courses
May 17, 2023 · Operations

Scene Management in RunnerGo: Overview and Usage

This article explains RunnerGo's scene management module, covering the interface between left‑hand navigation and the main scene area, how to create and link interfaces and controllers into executable business scenarios, configure scene settings, debug scenes, and manage test case sets, with links to the project's repositories.

OperationsRunnerGoScene Management
0 likes · 6 min read
Scene Management in RunnerGo: Overview and Usage
Laravel Tech Community
Laravel Tech Community
May 16, 2023 · Operations

Linux System Commands Cheat Sheet

This article presents a comprehensive reference of common Linux/Unix command-line utilities covering system information, date handling, shutdown/reboot, file and directory management, searching, mounting, disk usage, user/group administration, permissions, special attributes, compression, package management, backup, and networking, providing a handy guide for system administrators and developers.

LinuxOperationsShell
0 likes · 37 min read
Linux System Commands Cheat Sheet
Efficient Ops
Efficient Ops
May 14, 2023 · Operations

How China’s Telecom Giants Accelerate IT Efficiency with DevOps Maturity Models

Amid a nationwide digital transformation, leading Chinese telecom operators have leveraged the CAICT‑backed DevOps Capability Maturity Model to evaluate and improve their IT performance, integrating team resources and talent to better support business systems, with detailed case studies and measurable outcomes across dozens of projects.

Continuous DeliveryDevOpsIT efficiency
0 likes · 14 min read
How China’s Telecom Giants Accelerate IT Efficiency with DevOps Maturity Models
Efficient Ops
Efficient Ops
May 10, 2023 · Operations

How Chinese Banks Accelerate Digital Transformation with DevOps Maturity Models

Amid digital transformation, nine Chinese city‑commercial banks and financial institutions adopted the CAICT‑led DevOps Capability Maturity Model, achieving significant IT efficiency gains, integrating resources, and enhancing business support across continuous delivery, technical operation, security, and system tooling, with detailed project case studies and a comprehensive overview of the standard.

BankingDevOpsDigital Transformation
0 likes · 16 min read
How Chinese Banks Accelerate Digital Transformation with DevOps Maturity Models
Efficient Ops
Efficient Ops
May 10, 2023 · Operations

Mastering XOps: From DevOps to FinOps – A Comprehensive Guide

This article presents a systematic overview of the emerging XOps ecosystem—including DevOps, BizDevOps, AIOps, FinOps, and SRE—detailing their relationships, maturity models, standards, and practical guidance for enterprises seeking to achieve efficient, secure, and data‑driven digital transformation.

BizDevOpsDevOpsFinOps
0 likes · 13 min read
Mastering XOps: From DevOps to FinOps – A Comprehensive Guide
JD Retail Technology
JD Retail Technology
May 10, 2023 · Product Management

Using ChatGPT 4.0 to Boost Product Manager Efficiency: Methods, Prompts, and Case Studies

The article outlines how ChatGPT 4.0 can significantly improve product managers' workflow across research, planning, design, project execution, and iteration by providing prompt engineering techniques, practical examples, and actionable recommendations while emphasizing security and information‑risk considerations.

AI prompt engineeringChatGPTOperations
0 likes · 31 min read
Using ChatGPT 4.0 to Boost Product Manager Efficiency: Methods, Prompts, and Case Studies