Ops Development Stories
Author

Maintained by a like‑minded team covering both operations and development. Topics span Linux operations, the DevOps toolchain, Kubernetes and containerization, monitoring, log collection, network security, and Python and Go development. Team members: Qiao Ke, wanger, Dong Ge, Su Xin, Hua Zai, Zheng Ge, Teacher Xia.

500 articles · 0 likes · 958 views · 0 comments
Recent Articles

Latest from Ops Development Stories

100 recent articles max
Ops Development Stories
Jul 31, 2025 · Cloud Native

Why VitePress + CNB Makes Blog Deployment a Breeze

This article walks through building a lightweight blog with VitePress, using Cloud Native Build (CNB), EdgeOne CDN, and Obsidian plugins for image handling, and provides step‑by‑step configuration and deployment scripts for an automated publishing workflow.

CNB · Obsidian · deployment
0 likes · 8 min read
Ops Development Stories
Jul 29, 2025 · Artificial Intelligence

Master AI Agents with LangGraph: Build Adaptive RAG, Translation, and ReAct Agents

This comprehensive guide explains what an AI Agent is, its core capabilities and design patterns, and walks through step‑by‑step implementations of RAG, Translation, and ReAct agents using LangGraph, complete with code samples, workflow diagrams, and practical tips for building personal ops knowledge‑base agents.

LLM · LangGraph · RAG
0 likes · 64 min read
Ops Development Stories
Jul 25, 2025 · Cloud Native

How Kubernetes 1.33 Enables In‑Place Pod Resizing Without Restarts

Kubernetes 1.33 enables in‑place vertical pod resizing, which lets administrators adjust CPU and memory resources on running containers without restarting pods. This reduces downtime for stateful workloads, improves cost efficiency, and integrates with VPA. The article covers implementation details, supported runtimes, limitations, and practical demos.

Container Runtime · In‑Place Vertical Scaling · Kubelet
0 likes · 18 min read
Ops Development Stories
Jul 14, 2025 · Artificial Intelligence

Mastering AIOps: Prompt Engineering, Function Calling, RAG, Graph RAG, and Local LLM Deployment

This comprehensive guide explores AIOps techniques such as prompt engineering, chat completions, memory management, function calling, fine‑tuning, retrieval‑augmented generation (RAG), graph‑based RAG, and practical steps for deploying open‑source large language models locally, providing code examples and best‑practice recommendations for modern DevOps environments.

Function Calling · Graph RAG · RAG
0 likes · 47 min read
Ops Development Stories
Jul 3, 2025 · Cloud Native

Mastering IaC with Terraform: From Basics to Multi‑Environment Deployments

This article explains the fundamentals of Infrastructure as Code (IaC), its relationship with AIOps, and why Terraform is the preferred tool, then walks through Terraform architecture, core commands, state management, remote backends, project layout, and a complete multi‑environment deployment example on Alibaba Cloud.

DevOps · Infrastructure as Code · Terraform
0 likes · 24 min read
Ops Development Stories
Jul 1, 2025 · Artificial Intelligence

From Lean to AIOps: How AI is Transforming Modern Operations

This comprehensive guide walks through the evolution from Lean and Agile practices to DevOps and finally AIOps, explaining core concepts, key algorithms, the role of large language models, RAG‑based root‑cause analysis, and practical implementation steps for intelligent operations.

Lean · RAG · agile
0 likes · 19 min read
Ops Development Stories
Jun 19, 2025 · Operations

How to Build an Automated Prometheus Inspection System with Go

This article explains how to design and implement an automated inspection platform that leverages Prometheus and Grafana for metric collection, splits inspection tasks, schedules them with cron, generates reports, sends WeChat notifications, and exports results to PDF, all using Go and the gin‑vue‑admin framework.

Automated Inspection · Go · Ops Automation
0 likes · 17 min read
Ops Development Stories
Jun 12, 2025 · Cloud Native

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

This tutorial shows how to use a one‑click script to create a GPU‑enabled Kind Kubernetes cluster, distribute GPU resources evenly across nodes with nvkind, install the necessary drivers and toolkits, deploy a large language model served by vLLM, and verify that it works, in either a local or a cloud environment.

AI Model Deployment · Docker · GPU
0 likes · 23 min read