Ops Development Stories
Author

Ops Development Stories

Maintained by a like‑minded team, covering both operations and development. Topics span Linux ops, DevOps toolchain, Kubernetes containerization, monitoring, log collection, network security, and Python or Go development. Team members: Qiao Ke, wanger, Dong Ge, Su Xin, Hua Zai, Zheng Ge, Teacher Xia.

500
Articles
0
Likes
958
Views
0
Comments
Recent Articles

Latest from Ops Development Stories

100 recent articles max
Ops Development Stories
Ops Development Stories
Nov 10, 2025 · Operations

Build a Low‑Cost Observability Platform with OpenObserve and Vector

This guide walks you through the architecture, deployment, and configuration of the Rust‑based OpenObserve observability platform together with the high‑performance Vector data pipeline, covering log, metric, and trace collection, Docker‑Compose setup, UI usage, and common FAQs for small teams.

TracingVectorcloud-native
0 likes · 11 min read
Build a Low‑Cost Observability Platform with OpenObserve and Vector
Ops Development Stories
Ops Development Stories
Sep 5, 2025 · Backend Development

Simplify Jenkins Role Management with a Go SDK and REST API

This article explains how the Jenkins Role Strategy plugin’s UI can become unresponsive with many users, introduces a Go SDK that wraps its REST API, demonstrates permission template, role, and user/group management code, and shows how to build a simple Gin‑based web UI for role assignment.

GoJenkinsREST API
0 likes · 13 min read
Simplify Jenkins Role Management with a Go SDK and REST API
Ops Development Stories
Ops Development Stories
Sep 4, 2025 · Cloud Native

Why Kubernetes HPA Ignores High CPU Usage and How Tolerance Affects Scaling

This article explains the internal architecture and source‑code flow of Kubernetes Horizontal Pod Autoscaler, detailing how components like HorizontalController and ReplicaCalculator compute desired replicas, why a default 10% tolerance can prevent scaling even when CPU exceeds the target, and how behavior policies and scaling limits influence HPA decisions.

HPAHorizontal Pod AutoscalerKubernetes
0 likes · 16 min read
Why Kubernetes HPA Ignores High CPU Usage and How Tolerance Affects Scaling
Ops Development Stories
Ops Development Stories
Aug 29, 2025 · Cloud Native

How to Build a GPU Spot‑Pool Operator on Kubernetes with Kubebuilder

This guide walks through creating a Kubernetes Operator using Kubebuilder to manage a GPU spot‑pool on Tencent Cloud, covering CRD design, controller logic, code generation, and deployment steps, enabling automated scaling of GPU resources for AI workloads while illustrating core Cloud‑Native concepts.

GPUKubebuilderKubernetes
0 likes · 19 min read
How to Build a GPU Spot‑Pool Operator on Kubernetes with Kubebuilder
Ops Development Stories
Ops Development Stories
Aug 20, 2025 · Databases

Master Elasticsearch: Core Concepts, Architecture, Queries & Performance for Interviews

This comprehensive guide covers Elasticsearch fundamentals—including core concepts, data model, cluster roles, indexing, mapping, inverted index, query DSL, aggregation, pagination, performance tuning, operational monitoring, security, high availability, and real‑world use cases—providing interview‑ready knowledge and practical tips for developers and ops engineers.

ElasticsearchPerformance TuningQuery DSL
0 likes · 15 min read
Master Elasticsearch: Core Concepts, Architecture, Queries & Performance for Interviews
Ops Development Stories
Ops Development Stories
Aug 13, 2025 · Cloud Native

How to Build a Kubernetes Fault‑Diagnosis CLI with AI‑Powered Insights

This article walks through extending the K8s Chat command‑line tool by adding an ‘analyze event’ command that gathers warning‑level events and pod logs, stores them in a map, and sends the information to a large‑language model via OpenAI’s API to receive actionable troubleshooting recommendations, while also suggesting further enhancements such as self‑healing and visualization.

AICLIGo
0 likes · 15 min read
How to Build a Kubernetes Fault‑Diagnosis CLI with AI‑Powered Insights