Ops Development Stories
Author

Ops Development Stories

Maintained by a like‑minded team, covering both operations and development. Topics span Linux ops, DevOps toolchain, Kubernetes containerization, monitoring, log collection, network security, and Python or Go development. Team members: Qiao Ke, wanger, Dong Ge, Su Xin, Hua Zai, Zheng Ge, Teacher Xia.

500
Articles
0
Likes
958
Views
0
Comments
Recent Articles

Latest from Ops Development Stories

100 recent articles max
Ops Development Stories
Ops Development Stories
Apr 30, 2025 · Artificial Intelligence

Unlock Your AI Assistant’s Power: A Step‑by‑Step Guide to Configuring MCP Filesystem

This comprehensive tutorial shows how to use the Model Context Protocol (MCP) to transform AI assistants like Claude, ChatWise, or Cherry Studio into powerful tools that can access your computer’s files, fetch real‑time information, interact with software, and even control smart devices, with clear step‑by‑step instructions, code examples, and troubleshooting tips.

AIClaudeConfiguration
0 likes · 20 min read
Unlock Your AI Assistant’s Power: A Step‑by‑Step Guide to Configuring MCP Filesystem
Ops Development Stories
Ops Development Stories
Apr 15, 2025 · Cloud Native

Boost Kubernetes Management with AI: Introducing the Lightweight k8m Console

This article introduces k8m, a lightweight AI‑enhanced console for Kubernetes that simplifies cluster management, installation, configuration, and daily operations, while offering features such as YAML auto‑translation, AI‑driven event and log diagnostics, command generation, multi‑cluster support, and role‑based access control.

AIDevOpsKubernetes
0 likes · 13 min read
Boost Kubernetes Management with AI: Introducing the Lightweight k8m Console
Ops Development Stories
Ops Development Stories
Mar 24, 2025 · Operations

Why Do Some Ops Teams Face Value Challenges? Insights for CTOs

Operations leaders and CTOs often confront the question of the true value of their teams, and this article explores who asks it, why it matters, typical challenges, and practical ways to define and protect the operational role through unified platforms, processes, and strategic collaboration with development.

CTOSREoperations
0 likes · 13 min read
Why Do Some Ops Teams Face Value Challenges? Insights for CTOs
Ops Development Stories
Ops Development Stories
Mar 19, 2025 · Cloud Native

Unified Multi‑Cluster Monitoring with KubeDoor 1.0: Alerts, Metrics & Best Practices

KubeDoor 1.0 introduces a new architecture for unified multi‑Kubernetes monitoring, offering components for master and agent, flexible deployment options, Helm‑based installation, configurable storage and alerting settings, and detailed guidance on integrating with existing Prometheus/VictoriaMetrics setups while providing automatic peak‑usage data collection.

ClickHouseHelmKubernetes
0 likes · 14 min read
Unified Multi‑Cluster Monitoring with KubeDoor 1.0: Alerts, Metrics & Best Practices
Ops Development Stories
Ops Development Stories
Mar 10, 2025 · Cloud Native

What Are Kubernetes Core Components and How Do They Work?

This article provides a comprehensive overview of Kubernetes fundamentals, covering core control‑plane and node components, key object differences such as Pod vs Deployment, Service types, ConfigMap vs Secret, scheduling, health checks, scaling, security, storage, and troubleshooting techniques.

KubernetesPodService
0 likes · 19 min read
What Are Kubernetes Core Components and How Do They Work?
Ops Development Stories
Ops Development Stories
Mar 4, 2025 · Operations

Master Process Exporter: Deploy, Integrate with Prometheus & Grafana in Kubernetes

This guide walks Kubernetes administrators through the full lifecycle of Process Exporter—from lightweight deployment and RBAC setup, through Prometheus Operator integration and Grafana dashboard creation, to detailed configuration and alerting—enabling precise process‑level monitoring and rapid root‑cause analysis.

DaemonSetGrafanaKubernetes
0 likes · 15 min read
Master Process Exporter: Deploy, Integrate with Prometheus & Grafana in Kubernetes
Ops Development Stories
Ops Development Stories
Feb 13, 2025 · Cloud Native

KubeDoor: AI‑Driven Kubernetes Load‑Aware Scheduling & Capacity Management

KubeDoor is an open‑source platform built with Python and Vue that leverages Kubernetes admission control, AI recommendations, and expert experience to provide load‑aware scheduling, capacity governance, real‑time resource analytics, and automated scaling for microservices, featuring a web UI, Grafana dashboards, and extensible control mechanisms.

AI schedulingAdmission ControllerKubernetes
0 likes · 11 min read
KubeDoor: AI‑Driven Kubernetes Load‑Aware Scheduling & Capacity Management
Ops Development Stories
Ops Development Stories
Feb 6, 2025 · Cloud Native

Automate Java OOM Heapdump Collection with a Kubernetes DaemonSet

This guide explains how to automatically capture Java OOM heapdump files using a DaemonSet that watches for heapdump.prof creation, compresses and uploads them to Alibaba Cloud OSS, and notifies developers via a WeChat bot, providing a scalable, non‑intrusive solution for memory‑leak diagnostics in Kubernetes environments.

GoOOMcloud-native
0 likes · 19 min read
Automate Java OOM Heapdump Collection with a Kubernetes DaemonSet
Ops Development Stories
Ops Development Stories
Jan 23, 2025 · Operations

How SREs Can Boost Their Influence Within Teams

This article explores why influence matters for Site Reliability Engineers, outlines the challenges they face in gaining recognition, and provides practical strategies—enhancing technical expertise, improving communication, quantifying achievements, and sharing knowledge—to elevate their impact within organizations.

SREcommunicationoperations
0 likes · 19 min read
How SREs Can Boost Their Influence Within Teams