Tagged articles

346 articles

Page 4 of 4

Mar 24, 2018 · Operations

How Service Degradation and Fault‑Tolerance Keep Large‑Scale Systems Resilient

This article explains how setting low timeouts for non‑core services, decoupling and physically isolating micro‑services, separating light and heavy workloads, and implementing automated configuration checks together enhance system reliability and reduce both technical and human errors in high‑traffic environments.

Configuration Managementfault tolerancesystem reliability

0 likes · 9 min read

How Service Degradation and Fault‑Tolerance Keep Large‑Scale Systems Resilient

DevOpsClub

Mar 23, 2018 · Operations

How to Automate Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD

This article explains why manual VM provisioning and application deployment are bottlenecks, and shows how combining Infrastructure as Code tools like Terraform with configuration‑management solutions such as Chef can create a fully automated, version‑controlled pipeline that speeds up continuous integration, delivery and deployment across development, test and production environments.

AutomationChefConfiguration Management

0 likes · 26 min read

How to Automate Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD

DevOps Coach

Mar 23, 2018 · Operations

Automating Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD

This article explains how to replace manual VM provisioning and application deployment with Infrastructure as Code using Terraform, Chef, and related automation tools, detailing the problems of hand‑crafted processes, layered system management, cookbook design, CI pipelines, and best‑practice deployment workflows.

ChefConfiguration ManagementDevOps

0 likes · 28 min read

Automating Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD

System Architect Go

Jan 3, 2018 · Backend Development

How to Retrieve File Content from GitHub Using the REST API v3

This article explains how to store configuration files in a remote GitHub repository and fetch them via GitHub's REST API v3, covering both public and private repositories, authentication methods, and the steps to create and use a Personal Access Token for secure access.

Backend DevelopmentConfiguration ManagementREST API

0 likes · 4 min read

How to Retrieve File Content from GitHub Using the REST API v3

MaGe Linux Operations

Dec 25, 2017 · Operations

How SaltStack Automates Cloud Operations: Boost Efficiency and Reduce Workload

This article explains how the open‑source automation tool SaltStack can be deployed in a large‑scale cloud environment to centralize management, distribute files, collect server data, and streamline configuration, thereby reducing operational effort and improving efficiency for administrators.

AutomationConfiguration ManagementSaltStack

0 likes · 14 min read

How SaltStack Automates Cloud Operations: Boost Efficiency and Reduce Workload

Programmer DD

Dec 19, 2017 · Backend Development

Key Building Blocks for Microservice Architecture: Config, Registry, API Gateway & More

This guide walks beginners through the fundamental components of microservice infrastructure—configuration management, service registry, API gateway, authentication, message broker, BFF, circuit breaker, and load balancing—explaining their roles, interactions, and practical considerations for building resilient systems.

Backend ArchitectureConfiguration Managementcircuit breaker

0 likes · 18 min read

Key Building Blocks for Microservice Architecture: Config, Registry, API Gateway & More

MaGe Linux Operations

Sep 23, 2017 · Operations

Mastering SaltStack: A Practical Guide to Automated Operations

This article compares popular automation tools, introduces SaltStack’s architecture and features, walks through its installation, configuration, authentication, command usage, targeting methods, common modules, and explains grains, pillars, and state files for effective large‑scale Linux operations.

AutomationConfiguration ManagementSaltStack

0 likes · 19 min read

Mastering SaltStack: A Practical Guide to Automated Operations

Java Backend Technology

Sep 18, 2017 · Fundamentals

Mastering ZooKeeper: Core Concepts, Architecture, and Real-World Use Cases

This article provides a comprehensive overview of ZooKeeper, covering its purpose, design goals, architecture, key features, data flow, components, and common application scenarios such as service naming, configuration management, cluster coordination, distributed locks, and queues.

Configuration ManagementCoordination Servicedistributed-lock

0 likes · 12 min read

Mastering ZooKeeper: Core Concepts, Architecture, and Real-World Use Cases

MaGe Linux Operations

Sep 9, 2017 · Operations

Master Ansible Basics: Essential Modules and Commands for Automation

This guide walks you through Ansible's core architecture, host inventory setup, variable definitions, and the most commonly used modules—including group, user, copy, cron, shell, and ping—showing practical command examples and how to retrieve module help with ansible‑doc.

AnsibleAutomationConfiguration Management

0 likes · 10 min read

Master Ansible Basics: Essential Modules and Commands for Automation

Efficient Ops

Aug 23, 2017 · Operations

Inside Tencent’s DevOps Pipeline: How Continuous Delivery Powers Scalable Operations

Tencent builds a complete DevOps pipeline using four platforms—TAPD, TGit, CIS, and ZhiYun—explaining the eight principles of continuous delivery, the four stages of operability, a three‑layer architecture, and showcasing ZhiYun’s configuration, automation, and self‑healing practices to deliver a systematic operations solution for enterprises.

Configuration ManagementDevOpsscalable architecture

0 likes · 11 min read

Inside Tencent’s DevOps Pipeline: How Continuous Delivery Powers Scalable Operations

Ctrip Technology

Jul 13, 2017 · Operations

Ctrip Architecture Refactoring: From Croller to TARS – A Deployment and Configuration Management Case Study

This article reviews Ctrip's two‑year architecture transformation, describing the limitations of the legacy Croller release system, the design of a new group‑based configuration model, the introduction of seven‑layer load balancing and the TARS deployment platform, and detailing the implementation of a unified configuration management system (CMS) to improve operational efficiency and reliability.

Configuration ManagementDeploymentarchitecture

0 likes · 12 min read

Ctrip Architecture Refactoring: From Croller to TARS – A Deployment and Configuration Management Case Study

Continuous Delivery 2.0

Jun 22, 2017 · Operations

Implementing Periodic Releases and Operational Automation for Small Teams

The article describes how a small development team adopts a three‑week periodic release cadence, improves demand management, resolves operational concerns, and standardizes configuration, environment, deployment, and testing processes to achieve continuous delivery with higher quality and lower coordination cost.

AutomationConfiguration ManagementContinuous Delivery

0 likes · 13 min read

Implementing Periodic Releases and Operational Automation for Small Teams

Java High-Performance Architecture

Jun 21, 2017 · Cloud Native

What Is Spring Cloud? A Complete Overview of Its Features and Modules

Spring Cloud is a toolbox built on Spring Boot that provides a suite of tools for developing distributed systems, offering features such as configuration management, service discovery, routing, load balancing, circuit breaking, global locks, cluster state management, and a variety of sub‑projects like Config, Netflix, Bus, Consul, Security, Sleuth, Data Flow, Stream, and Zookeeper.

Configuration ManagementSpring Cloudservice discovery

0 likes · 4 min read

What Is Spring Cloud? A Complete Overview of Its Features and Modules

MaGe Linux Operations

Jun 21, 2017 · Operations

Master Ansible: From Basics to Advanced Playbooks and Automation

This comprehensive guide walks you through Ansible fundamentals, architecture, installation, core modules, playbook structure, variables, templates, handlers, roles, conditionals, loops, and practical examples, enabling you to automate configuration management across multiple Linux hosts with confidence and precision.

AnsibleAutomationConfiguration Management

0 likes · 23 min read

Master Ansible: From Basics to Advanced Playbooks and Automation

dbaplus Community

Jun 5, 2017 · Operations

Designing an Effective CMDB: Insights from a 2017 GDevOps Talk

This article summarizes Zhang Chuan’s 2017 GDevOps conference presentation on CMDB, covering its definition, common implementations, design principles, automation integration, synchronization strategies, and the business value of a centralized configuration management database.

AutomationCMDBConfiguration Management

0 likes · 12 min read

Designing an Effective CMDB: Insights from a 2017 GDevOps Talk

Qunar Tech Salon

May 2, 2017 · Information Security

Design and Function of the Internal Application Center for Token‑Based Authentication and Resource Access

The internal Application Center, created in 2013, provides a unified system for naming applications, managing permissions, issuing encrypted tokens, and authenticating app identity to securely control access to configuration data and other resources, replacing fragile IP‑based restrictions with logical app‑based authorization.

Configuration ManagementIdentity verificationapplication center

0 likes · 7 min read

Design and Function of the Internal Application Center for Token‑Based Authentication and Resource Access

Continuous Delivery 2.0

May 1, 2017 · Operations

Implementing Periodic Releases: Strategies, Challenges, and Automation in Software Development

The article describes how a development team transitioned to short‑cycle, periodic releases, outlining the goals, benefits, operational concerns, and a comprehensive set of improvements—including testing strategy, configuration and environment management, and automated deployment pipelines—to maintain quality while increasing release frequency.

AutomationConfiguration ManagementContinuous Delivery

0 likes · 14 min read

Implementing Periodic Releases: Strategies, Challenges, and Automation in Software Development

Efficient Ops

Apr 24, 2017 · Operations

15 Essential Open-Source DevOps Tools for Private Cloud Environments

This article reviews fifteen open-source DevOps tools across categories such as configuration management, bare-metal provisioning, orchestration, continuous integration, and monitoring, explaining their functions, typical use cases in private-cloud deployments, and providing example code snippets to help engineers automate, scale, and secure their cloud operations.

Configuration ManagementDevOpsOrchestration

0 likes · 15 min read

15 Essential Open-Source DevOps Tools for Private Cloud Environments

MaGe Linux Operations

Mar 21, 2017 · Operations

Master YAML and Ansible: From Basics to Playbook Automation

This guide introduces YAML syntax and features, explains how to define variables, facts, and loops in Ansible, details the INI‑style inventory file format, and walks through creating playbooks with hosts, tasks, handlers, and tags for reliable automation.

AnsibleConfiguration ManagementDevOps

0 likes · 11 min read

Master YAML and Ansible: From Basics to Playbook Automation

MaGe Linux Operations

Mar 20, 2017 · Operations

Mastering Ansible: A Beginner’s Guide to Operations Automation

This article introduces Ansible as a powerful, agentless automation tool for operations engineers, covering the tasks suitable for batch deployment, its architecture, core concepts like idempotence, essential components, basic command usage, and practical examples such as installing software, managing users, and configuring SSH keys.

AnsibleConfiguration ManagementDevOps

0 likes · 7 min read

Mastering Ansible: A Beginner’s Guide to Operations Automation

Efficient Ops

Feb 25, 2017 · Operations

60+ Must‑Use Open‑Source DevOps Tools to Accelerate Your Delivery

This article compiles more than sixty free, open‑source DevOps utilities—covering version control, build automation, CI/CD, container platforms, configuration management, logging, and monitoring—to help teams speed up development, streamline deployments, and improve operational efficiency.

Configuration ManagementDevOpsci/cd

0 likes · 17 min read

60+ Must‑Use Open‑Source DevOps Tools to Accelerate Your Delivery

MaGe Linux Operations

Jan 23, 2017 · Operations

Mastering Puppet: How Automated Configuration Management Scales Server Ops

This article explains Puppet's architecture, data flow, and practical examples, showing how automated configuration management can efficiently handle large numbers of servers, reduce manual errors, and improve operational reliability in modern IT environments.

AutomationConfiguration ManagementDevOps

0 likes · 8 min read

Mastering Puppet: How Automated Configuration Management Scales Server Ops

MaGe Linux Operations

Jan 8, 2017 · Operations

Master Ansible: From Basics to Advanced Modules for Efficient Operations

This guide introduces Ansible for operations, covering its core features, installation, host preparation, key management, essential modules, playbook structure, YAML syntax, handlers, tags, variables, templates, loops, and conditional execution, with practical command examples and visual illustrations.

AnsibleAutomationConfiguration Management

0 likes · 8 min read

Master Ansible: From Basics to Advanced Modules for Efficient Operations

MaGe Linux Operations

Jan 5, 2017 · Operations

Mastering Puppet: Automate Server Deployment and Configuration

This article explains how Puppet automates large‑scale server provisioning by describing its architecture, workflow, manifest examples, class inheritance, and module structure, helping operations teams reduce manual effort and avoid errors in configuration management.

AutomationConfiguration ManagementInfrastructure as Code

0 likes · 8 min read

Mastering Puppet: Automate Server Deployment and Configuration

MaGe Linux Operations

Nov 14, 2016 · Operations

Master Ansible: From Basics to Advanced Automation without Agents

This comprehensive guide introduces Ansible, explains its agentless architecture, core components, installation, SSH key setup, inventory configuration, essential commands, and common modules, providing a practical roadmap for automating system administration and deployment tasks.

AnsibleAutomationConfiguration Management

0 likes · 17 min read

Master Ansible: From Basics to Advanced Automation without Agents

Art of Distributed System Architecture Design

Nov 1, 2016 · Operations

JEN: JD Extended Nginx Platform for Scalable Management and Automation

The article introduces JEN, JD's extended Nginx platform that centralizes configuration, monitoring, traffic splitting, rate limiting and automated operations through a web console and Ansible integration, addressing the complexity, restart requirements, and scaling challenges of large‑scale Nginx deployments.

AutomationConfiguration ManagementNginx

0 likes · 14 min read

JEN: JD Extended Nginx Platform for Scalable Management and Automation

dbaplus Community

Oct 28, 2016 · Operations

Mastering Full‑Scale Configuration Management for Seamless Continuous Delivery

This article breaks down comprehensive configuration management—covering code and build artifact versioning, application settings, and environment provisioning—explaining why it is the cornerstone of continuous delivery and offering concrete branch strategies, Git practices, artifact handling, injection methods, and automation tools to achieve reliable, low‑risk software releases.

Branch StrategyConfiguration ManagementContinuous Delivery

0 likes · 26 min read

Mastering Full‑Scale Configuration Management for Seamless Continuous Delivery

ITFLY8 Architecture Home

Oct 12, 2016 · Backend Development

How to Implement and Manage Feature Toggles in Java for Scalable Systems

This article explains how to design and operate feature toggles in Java applications, covering single‑instance implementation, cross‑instance synchronization via a meta‑server or Diamond, handling composite switches, avoiding security pitfalls, and automating degradation and upgrade based on runtime metrics.

Auto ScalingConfiguration Managementfeature toggle

0 likes · 8 min read

How to Implement and Manage Feature Toggles in Java for Scalable Systems

MaGe Linux Operations

Sep 21, 2016 · Operations

Master Ansible Playbooks: Core Components, Variables, Loops & Handlers

Ansible playbooks, a powerful configuration management tool, use YAML to define tasks, variables, inventory, conditionals, loops, and handlers; this guide explains core components, syntax, variable scopes, command-line overrides, and provides practical examples for installing services, managing users, and triggering handlers based on changes.

AnsibleAutomationConfiguration Management

0 likes · 7 min read

Master Ansible Playbooks: Core Components, Variables, Loops & Handlers

Architecture Digest

Aug 5, 2016 · Backend Development

Implementation Principles and Architecture of the Diamond Configuration Management System

The article explains Diamond, a simple, reliable, and easy‑to‑use distributed configuration management system used inside Taobao, detailing its features, persistence and disaster‑recovery mechanisms, overall architecture, client‑side subscription code, and the internal processes that keep configuration data synchronized.

BackendConfiguration ManagementJava

0 likes · 10 min read

Implementation Principles and Architecture of the Diamond Configuration Management System

ITPUB

Jun 28, 2016 · Operations

Seamless Tomcat Webapp Migration with Docker and Layered Configuration

This guide explains how to simplify and accelerate Tomcat web application migration by separating static binaries from external configurations, using Docker containers or Juju packages, applying layered configuration, managing persistent data with volumes, and automating deployment, scaling, and rollback operations.

Application MigrationConfiguration ManagementContainers

0 likes · 9 min read

Seamless Tomcat Webapp Migration with Docker and Layered Configuration

Efficient Ops

Jun 26, 2016 · Operations

Avoid CMDB Pitfalls and Achieve Cost‑Effective Configuration Management

This article examines common misconceptions in CMDB implementation, outlines four key success factors, and shares two real‑world case studies to help operations teams balance effectiveness with cost while automating configuration management efficiently.

AutomationCMDBConfiguration Management

0 likes · 12 min read

Avoid CMDB Pitfalls and Achieve Cost‑Effective Configuration Management

Architect

Jun 20, 2016 · Operations

Application Migration and Docker‑Based Deployment Practices

The article explains how to achieve simple, fast, and lossless application migration by separating immutable binaries from external configuration, and how Docker containers and layered images can streamline deployment, scaling, configuration management, and persistent data handling.

Application MigrationConfiguration ManagementDocker

0 likes · 7 min read

Application Migration and Docker‑Based Deployment Practices

Art of Distributed System Architecture Design

May 25, 2016 · Operations

An Introduction to Ansible: Installation, Configuration, and MySQL Replication with Playbooks

This article introduces Ansible, a Python‑based configuration management and deployment tool, explains its installation prerequisites, demonstrates basic ad‑hoc commands, and provides a comprehensive guide to managing MySQL master‑slave replication using Ansible modules, playbooks, and role‑based directory structures.

AnsibleAutomationConfiguration Management

0 likes · 19 min read

An Introduction to Ansible: Installation, Configuration, and MySQL Replication with Playbooks

Efficient Ops

May 11, 2016 · Operations

How to Build an Automated Operations Platform: Insights from Tencent's Experience

This article shares Peng Lihang's practical insights on operations automation, covering the essential trio of configuration, state, and change management, the evolution of ops practices, platform design principles, and concrete steps for building scalable, business‑driven ops platforms.

AutomationConfiguration ManagementOperations

0 likes · 24 min read

How to Build an Automated Operations Platform: Insights from Tencent's Experience

Big Data and Microservices

Apr 20, 2016 · Operations

How to Build an Effective IT Operations Service System: Principles, Architecture & Best Practices

This article outlines the fundamental principles, overall architecture, scope, and detailed components of an IT operations service system, covering policies, processes, organizational structure, platform tools, and management workflows such as incident, problem, change, and configuration management.

Configuration ManagementIT OperationsPlatform Integration

0 likes · 19 min read

How to Build an Effective IT Operations Service System: Principles, Architecture & Best Practices

Baidu Intelligent Testing

Apr 5, 2016 · Operations

Hot Reload: Common Pitfalls and How to Avoid Them

This article examines the hidden risks of hot‑reload mechanisms in web services, illustrates real incidents caused by careless configuration updates, analyzes root causes, and offers practical steps for detecting and fixing such pitfalls to improve operational reliability.

Configuration ManagementSoftware Operationshot-reload

0 likes · 7 min read

Hot Reload: Common Pitfalls and How to Avoid Them

MaGe Linux Operations

Feb 18, 2016 · Operations

Salt vs Ansible: Which Configuration Management Tool Wins for Speed, Security, and Scalability?

After hands‑on experience with both Salt and Ansible, the author compares their terminology, architecture, community support, speed, code structure, security, auditing, deployment, learning curve, and documentation, concluding that Ansible excels in ease of use while Salt offers superior scalability and performance for cloud environments.

AnsibleAutomationConfiguration Management

0 likes · 14 min read

Salt vs Ansible: Which Configuration Management Tool Wins for Speed, Security, and Scalability?

Java High-Performance Architecture

Feb 12, 2016 · Backend Development

Mastering Distributed Configuration Management with Zookeeper

Learn how distributed configuration management works—why large systems need a central store, the role of publish/subscribe, and how Zookeeper provides a robust solution for real‑time feature toggling without service restarts, ensuring high availability and seamless operations across micro‑services.

BackendConfiguration ManagementDevOps

0 likes · 4 min read

Mastering Distributed Configuration Management with Zookeeper

Efficient Ops

Nov 4, 2015 · Operations

From Idea to Published Book: My Journey Writing the Puppet Authority Guide

This article shares the author's personal journey of conceiving, planning, writing, and publishing a technical book on Puppet, detailing the motivations, step‑by‑step process, challenges, case studies, and the professional benefits gained from turning the writing effort into tangible value.

Configuration ManagementOperationsPuppet

0 likes · 15 min read

From Idea to Published Book: My Journey Writing the Puppet Authority Guide

Art of Distributed System Architecture Design

Jul 1, 2015 · Operations

An Introduction to Ansible: Installation, Basic Usage, and MySQL Replication with Playbooks

This article provides a comprehensive guide to Ansible, covering its purpose, installation via pip, ad‑hoc command usage, MySQL replication with the mysql_replication module, role‑based playbook structure, sample YAML files, execution output, and a comparison with other configuration‑management tools.

AnsibleAutomationConfiguration Management

0 likes · 18 min read

An Introduction to Ansible: Installation, Basic Usage, and MySQL Replication with Playbooks

Efficient Ops

May 29, 2015 · Operations

Why Ctrip’s Outage Took Hours to Recover – Lessons for Ops Teams

The article examines Ctrip’s prolonged service restoration after a May 28 incident, analyzing the complexities of SOA‑based architectures, the pitfalls of black‑box operations, and how transitioning to white‑box, DevOps‑aligned practices can prevent similar outages.

Configuration ManagementDevOpsIT Operations

0 likes · 11 min read

Why Ctrip’s Outage Took Hours to Recover – Lessons for Ops Teams

Efficient Ops

May 23, 2015 · Operations

Expert Insights: Puppet vs Ansible, MCollective, and Building Sustainable Startup Ops

This article compiles a Q&A from the InfoQ Efficient Ops Talk, where expert Liu Yu discusses Puppet's limitations, compares automation tools like Ansible and MCollective, and offers practical advice for startups building sustainable release pipelines and managing Windows and cloud environments.

AnsibleConfiguration ManagementMCollective

0 likes · 11 min read

Expert Insights: Puppet vs Ansible, MCollective, and Building Sustainable Startup Ops

Efficient Ops

May 22, 2015 · Operations

Mastering Puppet: From Basics to Advanced Ops Automation and Docker Integration

This article summarizes a comprehensive talk on Puppet covering its evolution, core concepts, architecture, ecosystem, practical use cases such as building a CMDB, automated deployment pipelines, OpenStack deployment, and the interplay with Docker, highlighting how Puppet drives modern operations automation.

Configuration ManagementInfrastructureOperations

0 likes · 13 min read

Mastering Puppet: From Basics to Advanced Ops Automation and Docker Integration

Art of Distributed System Architecture Design

Apr 16, 2015 · Operations

How Puppet Transforms Infrastructure Management and Boosts DevOps Collaboration

This article explains why many IT teams struggle with code that works in development but fails in production, introduces Puppet as a configuration‑management platform, describes its declarative language, resources, types, providers, classes, manifests and modules, and shows how it enables reliable, collaborative DevOps workflows.

AutomationConfiguration ManagementDevOps

0 likes · 17 min read

How Puppet Transforms Infrastructure Management and Boosts DevOps Collaboration

MaGe Linux Operations

Mar 11, 2015 · Operations

What Makes a Great Linux Ops Engineer? 13 Practical Principles

This article shares a former developer‑turned‑ops manager’s 13‑point guide on treating operations as resource‑centric engineering, covering what ops is (and isn’t), writing reliable programs, abstracting resources, configuration management, monitoring, and practical tips for building stable, scalable Linux systems.

Configuration ManagementLinux

0 likes · 11 min read

What Makes a Great Linux Ops Engineer? 13 Practical Principles