Tagged articles
346 articles
Page 4 of 4
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Mar 24, 2018 · Operations

How Service Degradation and Fault‑Tolerance Keep Large‑Scale Systems Resilient

This article explains how setting low timeouts for non‑core services, decoupling and physically isolating micro‑services, separating light and heavy workloads, and implementing automated configuration checks together enhance system reliability and reduce both technical and human errors in high‑traffic environments.

Configuration Managementfault tolerancesystem reliability
0 likes · 9 min read
How Service Degradation and Fault‑Tolerance Keep Large‑Scale Systems Resilient
DevOpsClub
DevOpsClub
Mar 23, 2018 · Operations

How to Automate Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD

This article explains why manual VM provisioning and application deployment are bottlenecks, and shows how combining Infrastructure as Code tools like Terraform with configuration‑management solutions such as Chef can create a fully automated, version‑controlled pipeline that speeds up continuous integration, delivery and deployment across development, test and production environments.

AutomationChefConfiguration Management
0 likes · 26 min read
How to Automate Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD
DevOps Coach
DevOps Coach
Mar 23, 2018 · Operations

Automating Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD

This article explains how to replace manual VM provisioning and application deployment with Infrastructure as Code using Terraform, Chef, and related automation tools, detailing the problems of hand‑crafted processes, layered system management, cookbook design, CI pipelines, and best‑practice deployment workflows.

ChefConfiguration ManagementDevOps
0 likes · 28 min read
Automating Infrastructure with IaC, Terraform, and Chef for Seamless CI/CD
System Architect Go
System Architect Go
Jan 3, 2018 · Backend Development

How to Retrieve File Content from GitHub Using the REST API v3

This article explains how to store configuration files in a remote GitHub repository and fetch them via GitHub's REST API v3, covering both public and private repositories, authentication methods, and the steps to create and use a Personal Access Token for secure access.

Backend DevelopmentConfiguration ManagementREST API
0 likes · 4 min read
How to Retrieve File Content from GitHub Using the REST API v3
Programmer DD
Programmer DD
Dec 19, 2017 · Backend Development

Key Building Blocks for Microservice Architecture: Config, Registry, API Gateway & More

This guide walks beginners through the fundamental components of microservice infrastructure—configuration management, service registry, API gateway, authentication, message broker, BFF, circuit breaker, and load balancing—explaining their roles, interactions, and practical considerations for building resilient systems.

Backend ArchitectureConfiguration Managementcircuit breaker
0 likes · 18 min read
Key Building Blocks for Microservice Architecture: Config, Registry, API Gateway & More
MaGe Linux Operations
MaGe Linux Operations
Sep 23, 2017 · Operations

Mastering SaltStack: A Practical Guide to Automated Operations

This article compares popular automation tools, introduces SaltStack’s architecture and features, walks through its installation, configuration, authentication, command usage, targeting methods, common modules, and explains grains, pillars, and state files for effective large‑scale Linux operations.

AutomationConfiguration ManagementSaltStack
0 likes · 19 min read
Mastering SaltStack: A Practical Guide to Automated Operations
Java Backend Technology
Java Backend Technology
Sep 18, 2017 · Fundamentals

Mastering ZooKeeper: Core Concepts, Architecture, and Real-World Use Cases

This article provides a comprehensive overview of ZooKeeper, covering its purpose, design goals, architecture, key features, data flow, components, and common application scenarios such as service naming, configuration management, cluster coordination, distributed locks, and queues.

Configuration ManagementCoordination Servicedistributed-lock
0 likes · 12 min read
Mastering ZooKeeper: Core Concepts, Architecture, and Real-World Use Cases
MaGe Linux Operations
MaGe Linux Operations
Sep 9, 2017 · Operations

Master Ansible Basics: Essential Modules and Commands for Automation

This guide walks you through Ansible's core architecture, host inventory setup, variable definitions, and the most commonly used modules—including group, user, copy, cron, shell, and ping—showing practical command examples and how to retrieve module help with ansible‑doc.

AnsibleAutomationConfiguration Management
0 likes · 10 min read
Master Ansible Basics: Essential Modules and Commands for Automation
Efficient Ops
Efficient Ops
Aug 23, 2017 · Operations

Inside Tencent’s DevOps Pipeline: How Continuous Delivery Powers Scalable Operations

Tencent builds a complete DevOps pipeline using four platforms—TAPD, TGit, CIS, and ZhiYun—explaining the eight principles of continuous delivery, the four stages of operability, a three‑layer architecture, and showcasing ZhiYun’s configuration, automation, and self‑healing practices to deliver a systematic operations solution for enterprises.

Configuration ManagementDevOpsscalable architecture
0 likes · 11 min read
Inside Tencent’s DevOps Pipeline: How Continuous Delivery Powers Scalable Operations
Ctrip Technology
Ctrip Technology
Jul 13, 2017 · Operations

Ctrip Architecture Refactoring: From Croller to TARS – A Deployment and Configuration Management Case Study

This article reviews Ctrip's two‑year architecture transformation, describing the limitations of the legacy Croller release system, the design of a new group‑based configuration model, the introduction of seven‑layer load balancing and the TARS deployment platform, and detailing the implementation of a unified configuration management system (CMS) to improve operational efficiency and reliability.

Configuration ManagementDeploymentarchitecture
0 likes · 12 min read
Ctrip Architecture Refactoring: From Croller to TARS – A Deployment and Configuration Management Case Study
Continuous Delivery 2.0
Continuous Delivery 2.0
Jun 22, 2017 · Operations

Implementing Periodic Releases and Operational Automation for Small Teams

The article describes how a small development team adopts a three‑week periodic release cadence, improves demand management, resolves operational concerns, and standardizes configuration, environment, deployment, and testing processes to achieve continuous delivery with higher quality and lower coordination cost.

AutomationConfiguration ManagementContinuous Delivery
0 likes · 13 min read
Implementing Periodic Releases and Operational Automation for Small Teams
Java High-Performance Architecture
Java High-Performance Architecture
Jun 21, 2017 · Cloud Native

What Is Spring Cloud? A Complete Overview of Its Features and Modules

Spring Cloud is a toolbox built on Spring Boot that provides a suite of tools for developing distributed systems, offering features such as configuration management, service discovery, routing, load balancing, circuit breaking, global locks, cluster state management, and a variety of sub‑projects like Config, Netflix, Bus, Consul, Security, Sleuth, Data Flow, Stream, and Zookeeper.

Configuration ManagementSpring Cloudservice discovery
0 likes · 4 min read
What Is Spring Cloud? A Complete Overview of Its Features and Modules
MaGe Linux Operations
MaGe Linux Operations
Jun 21, 2017 · Operations

Master Ansible: From Basics to Advanced Playbooks and Automation

This comprehensive guide walks you through Ansible fundamentals, architecture, installation, core modules, playbook structure, variables, templates, handlers, roles, conditionals, loops, and practical examples, enabling you to automate configuration management across multiple Linux hosts with confidence and precision.

AnsibleAutomationConfiguration Management
0 likes · 23 min read
Master Ansible: From Basics to Advanced Playbooks and Automation
dbaplus Community
dbaplus Community
Jun 5, 2017 · Operations

Designing an Effective CMDB: Insights from a 2017 GDevOps Talk

This article summarizes Zhang Chuan’s 2017 GDevOps conference presentation on CMDB, covering its definition, common implementations, design principles, automation integration, synchronization strategies, and the business value of a centralized configuration management database.

AutomationCMDBConfiguration Management
0 likes · 12 min read
Designing an Effective CMDB: Insights from a 2017 GDevOps Talk
Qunar Tech Salon
Qunar Tech Salon
May 2, 2017 · Information Security

Design and Function of the Internal Application Center for Token‑Based Authentication and Resource Access

The internal Application Center, created in 2013, provides a unified system for naming applications, managing permissions, issuing encrypted tokens, and authenticating app identity to securely control access to configuration data and other resources, replacing fragile IP‑based restrictions with logical app‑based authorization.

Configuration ManagementIdentity verificationapplication center
0 likes · 7 min read
Design and Function of the Internal Application Center for Token‑Based Authentication and Resource Access
Continuous Delivery 2.0
Continuous Delivery 2.0
May 1, 2017 · Operations

Implementing Periodic Releases: Strategies, Challenges, and Automation in Software Development

The article describes how a development team transitioned to short‑cycle, periodic releases, outlining the goals, benefits, operational concerns, and a comprehensive set of improvements—including testing strategy, configuration and environment management, and automated deployment pipelines—to maintain quality while increasing release frequency.

AutomationConfiguration ManagementContinuous Delivery
0 likes · 14 min read
Implementing Periodic Releases: Strategies, Challenges, and Automation in Software Development
Efficient Ops
Efficient Ops
Apr 24, 2017 · Operations

15 Essential Open-Source DevOps Tools for Private Cloud Environments

This article reviews fifteen open-source DevOps tools across categories such as configuration management, bare-metal provisioning, orchestration, continuous integration, and monitoring, explaining their functions, typical use cases in private-cloud deployments, and providing example code snippets to help engineers automate, scale, and secure their cloud operations.

Configuration ManagementDevOpsOrchestration
0 likes · 15 min read
15 Essential Open-Source DevOps Tools for Private Cloud Environments
MaGe Linux Operations
MaGe Linux Operations
Mar 21, 2017 · Operations

Master YAML and Ansible: From Basics to Playbook Automation

This guide introduces YAML syntax and features, explains how to define variables, facts, and loops in Ansible, details the INI‑style inventory file format, and walks through creating playbooks with hosts, tasks, handlers, and tags for reliable automation.

AnsibleConfiguration ManagementDevOps
0 likes · 11 min read
Master YAML and Ansible: From Basics to Playbook Automation
MaGe Linux Operations
MaGe Linux Operations
Mar 20, 2017 · Operations

Mastering Ansible: A Beginner’s Guide to Operations Automation

This article introduces Ansible as a powerful, agentless automation tool for operations engineers, covering the tasks suitable for batch deployment, its architecture, core concepts like idempotence, essential components, basic command usage, and practical examples such as installing software, managing users, and configuring SSH keys.

AnsibleConfiguration ManagementDevOps
0 likes · 7 min read
Mastering Ansible: A Beginner’s Guide to Operations Automation
Efficient Ops
Efficient Ops
Feb 25, 2017 · Operations

60+ Must‑Use Open‑Source DevOps Tools to Accelerate Your Delivery

This article compiles more than sixty free, open‑source DevOps utilities—covering version control, build automation, CI/CD, container platforms, configuration management, logging, and monitoring—to help teams speed up development, streamline deployments, and improve operational efficiency.

Configuration ManagementDevOpsci/cd
0 likes · 17 min read
60+ Must‑Use Open‑Source DevOps Tools to Accelerate Your Delivery
MaGe Linux Operations
MaGe Linux Operations
Jan 8, 2017 · Operations

Master Ansible: From Basics to Advanced Modules for Efficient Operations

This guide introduces Ansible for operations, covering its core features, installation, host preparation, key management, essential modules, playbook structure, YAML syntax, handlers, tags, variables, templates, loops, and conditional execution, with practical command examples and visual illustrations.

AnsibleAutomationConfiguration Management
0 likes · 8 min read
Master Ansible: From Basics to Advanced Modules for Efficient Operations
MaGe Linux Operations
MaGe Linux Operations
Jan 5, 2017 · Operations

Mastering Puppet: Automate Server Deployment and Configuration

This article explains how Puppet automates large‑scale server provisioning by describing its architecture, workflow, manifest examples, class inheritance, and module structure, helping operations teams reduce manual effort and avoid errors in configuration management.

AutomationConfiguration ManagementInfrastructure as Code
0 likes · 8 min read
Mastering Puppet: Automate Server Deployment and Configuration
MaGe Linux Operations
MaGe Linux Operations
Nov 14, 2016 · Operations

Master Ansible: From Basics to Advanced Automation without Agents

This comprehensive guide introduces Ansible, explains its agentless architecture, core components, installation, SSH key setup, inventory configuration, essential commands, and common modules, providing a practical roadmap for automating system administration and deployment tasks.

AnsibleAutomationConfiguration Management
0 likes · 17 min read
Master Ansible: From Basics to Advanced Automation without Agents

JEN: JD Extended Nginx Platform for Scalable Management and Automation

The article introduces JEN, JD's extended Nginx platform that centralizes configuration, monitoring, traffic splitting, rate limiting and automated operations through a web console and Ansible integration, addressing the complexity, restart requirements, and scaling challenges of large‑scale Nginx deployments.

AutomationConfiguration ManagementNginx
0 likes · 14 min read
JEN: JD Extended Nginx Platform for Scalable Management and Automation
dbaplus Community
dbaplus Community
Oct 28, 2016 · Operations

Mastering Full‑Scale Configuration Management for Seamless Continuous Delivery

This article breaks down comprehensive configuration management—covering code and build artifact versioning, application settings, and environment provisioning—explaining why it is the cornerstone of continuous delivery and offering concrete branch strategies, Git practices, artifact handling, injection methods, and automation tools to achieve reliable, low‑risk software releases.

Branch StrategyConfiguration ManagementContinuous Delivery
0 likes · 26 min read
Mastering Full‑Scale Configuration Management for Seamless Continuous Delivery
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Oct 12, 2016 · Backend Development

How to Implement and Manage Feature Toggles in Java for Scalable Systems

This article explains how to design and operate feature toggles in Java applications, covering single‑instance implementation, cross‑instance synchronization via a meta‑server or Diamond, handling composite switches, avoiding security pitfalls, and automating degradation and upgrade based on runtime metrics.

Auto ScalingConfiguration Managementfeature toggle
0 likes · 8 min read
How to Implement and Manage Feature Toggles in Java for Scalable Systems
MaGe Linux Operations
MaGe Linux Operations
Sep 21, 2016 · Operations

Master Ansible Playbooks: Core Components, Variables, Loops & Handlers

Ansible playbooks, a powerful configuration management tool, use YAML to define tasks, variables, inventory, conditionals, loops, and handlers; this guide explains core components, syntax, variable scopes, command-line overrides, and provides practical examples for installing services, managing users, and triggering handlers based on changes.

AnsibleAutomationConfiguration Management
0 likes · 7 min read
Master Ansible Playbooks: Core Components, Variables, Loops & Handlers
Architecture Digest
Architecture Digest
Aug 5, 2016 · Backend Development

Implementation Principles and Architecture of the Diamond Configuration Management System

The article explains Diamond, a simple, reliable, and easy‑to‑use distributed configuration management system used inside Taobao, detailing its features, persistence and disaster‑recovery mechanisms, overall architecture, client‑side subscription code, and the internal processes that keep configuration data synchronized.

BackendConfiguration ManagementJava
0 likes · 10 min read
Implementation Principles and Architecture of the Diamond Configuration Management System
ITPUB
ITPUB
Jun 28, 2016 · Operations

Seamless Tomcat Webapp Migration with Docker and Layered Configuration

This guide explains how to simplify and accelerate Tomcat web application migration by separating static binaries from external configurations, using Docker containers or Juju packages, applying layered configuration, managing persistent data with volumes, and automating deployment, scaling, and rollback operations.

Application MigrationConfiguration ManagementContainers
0 likes · 9 min read
Seamless Tomcat Webapp Migration with Docker and Layered Configuration
Architect
Architect
Jun 20, 2016 · Operations

Application Migration and Docker‑Based Deployment Practices

The article explains how to achieve simple, fast, and lossless application migration by separating immutable binaries from external configuration, and how Docker containers and layered images can streamline deployment, scaling, configuration management, and persistent data handling.

Application MigrationConfiguration ManagementDocker
0 likes · 7 min read
Application Migration and Docker‑Based Deployment Practices

An Introduction to Ansible: Installation, Configuration, and MySQL Replication with Playbooks

This article introduces Ansible, a Python‑based configuration management and deployment tool, explains its installation prerequisites, demonstrates basic ad‑hoc commands, and provides a comprehensive guide to managing MySQL master‑slave replication using Ansible modules, playbooks, and role‑based directory structures.

AnsibleAutomationConfiguration Management
0 likes · 19 min read
An Introduction to Ansible: Installation, Configuration, and MySQL Replication with Playbooks
Efficient Ops
Efficient Ops
May 11, 2016 · Operations

How to Build an Automated Operations Platform: Insights from Tencent's Experience

This article shares Peng Lihang's practical insights on operations automation, covering the essential trio of configuration, state, and change management, the evolution of ops practices, platform design principles, and concrete steps for building scalable, business‑driven ops platforms.

AutomationConfiguration ManagementOperations
0 likes · 24 min read
How to Build an Automated Operations Platform: Insights from Tencent's Experience
Big Data and Microservices
Big Data and Microservices
Apr 20, 2016 · Operations

How to Build an Effective IT Operations Service System: Principles, Architecture & Best Practices

This article outlines the fundamental principles, overall architecture, scope, and detailed components of an IT operations service system, covering policies, processes, organizational structure, platform tools, and management workflows such as incident, problem, change, and configuration management.

Configuration ManagementIT OperationsPlatform Integration
0 likes · 19 min read
How to Build an Effective IT Operations Service System: Principles, Architecture & Best Practices
Baidu Intelligent Testing
Baidu Intelligent Testing
Apr 5, 2016 · Operations

Hot Reload: Common Pitfalls and How to Avoid Them

This article examines the hidden risks of hot‑reload mechanisms in web services, illustrates real incidents caused by careless configuration updates, analyzes root causes, and offers practical steps for detecting and fixing such pitfalls to improve operational reliability.

Configuration ManagementSoftware Operationshot-reload
0 likes · 7 min read
Hot Reload: Common Pitfalls and How to Avoid Them
MaGe Linux Operations
MaGe Linux Operations
Feb 18, 2016 · Operations

Salt vs Ansible: Which Configuration Management Tool Wins for Speed, Security, and Scalability?

After hands‑on experience with both Salt and Ansible, the author compares their terminology, architecture, community support, speed, code structure, security, auditing, deployment, learning curve, and documentation, concluding that Ansible excels in ease of use while Salt offers superior scalability and performance for cloud environments.

AnsibleAutomationConfiguration Management
0 likes · 14 min read
Salt vs Ansible: Which Configuration Management Tool Wins for Speed, Security, and Scalability?
Java High-Performance Architecture
Java High-Performance Architecture
Feb 12, 2016 · Backend Development

Mastering Distributed Configuration Management with Zookeeper

Learn how distributed configuration management works—why large systems need a central store, the role of publish/subscribe, and how Zookeeper provides a robust solution for real‑time feature toggling without service restarts, ensuring high availability and seamless operations across micro‑services.

BackendConfiguration ManagementDevOps
0 likes · 4 min read
Mastering Distributed Configuration Management with Zookeeper
Efficient Ops
Efficient Ops
Nov 4, 2015 · Operations

From Idea to Published Book: My Journey Writing the Puppet Authority Guide

This article shares the author's personal journey of conceiving, planning, writing, and publishing a technical book on Puppet, detailing the motivations, step‑by‑step process, challenges, case studies, and the professional benefits gained from turning the writing effort into tangible value.

Configuration ManagementOperationsPuppet
0 likes · 15 min read
From Idea to Published Book: My Journey Writing the Puppet Authority Guide

An Introduction to Ansible: Installation, Basic Usage, and MySQL Replication with Playbooks

This article provides a comprehensive guide to Ansible, covering its purpose, installation via pip, ad‑hoc command usage, MySQL replication with the mysql_replication module, role‑based playbook structure, sample YAML files, execution output, and a comparison with other configuration‑management tools.

AnsibleAutomationConfiguration Management
0 likes · 18 min read
An Introduction to Ansible: Installation, Basic Usage, and MySQL Replication with Playbooks
Efficient Ops
Efficient Ops
May 29, 2015 · Operations

Why Ctrip’s Outage Took Hours to Recover – Lessons for Ops Teams

The article examines Ctrip’s prolonged service restoration after a May 28 incident, analyzing the complexities of SOA‑based architectures, the pitfalls of black‑box operations, and how transitioning to white‑box, DevOps‑aligned practices can prevent similar outages.

Configuration ManagementDevOpsIT Operations
0 likes · 11 min read
Why Ctrip’s Outage Took Hours to Recover – Lessons for Ops Teams
Efficient Ops
Efficient Ops
May 22, 2015 · Operations

Mastering Puppet: From Basics to Advanced Ops Automation and Docker Integration

This article summarizes a comprehensive talk on Puppet covering its evolution, core concepts, architecture, ecosystem, practical use cases such as building a CMDB, automated deployment pipelines, OpenStack deployment, and the interplay with Docker, highlighting how Puppet drives modern operations automation.

Configuration ManagementInfrastructureOperations
0 likes · 13 min read
Mastering Puppet: From Basics to Advanced Ops Automation and Docker Integration

How Puppet Transforms Infrastructure Management and Boosts DevOps Collaboration

This article explains why many IT teams struggle with code that works in development but fails in production, introduces Puppet as a configuration‑management platform, describes its declarative language, resources, types, providers, classes, manifests and modules, and shows how it enables reliable, collaborative DevOps workflows.

AutomationConfiguration ManagementDevOps
0 likes · 17 min read
How Puppet Transforms Infrastructure Management and Boosts DevOps Collaboration
MaGe Linux Operations
MaGe Linux Operations
Mar 11, 2015 · Operations

What Makes a Great Linux Ops Engineer? 13 Practical Principles

This article shares a former developer‑turned‑ops manager’s 13‑point guide on treating operations as resource‑centric engineering, covering what ops is (and isn’t), writing reliable programs, abstracting resources, configuration management, monitoring, and practical tips for building stable, scalable Linux systems.

Configuration ManagementLinux
0 likes · 11 min read
What Makes a Great Linux Ops Engineer? 13 Practical Principles