Tagged articles
19 articles
Page 1 of 1
Wuming AI
Wuming AI
Sep 29, 2025 · Artificial Intelligence

How I Quickly Uncovered Cherry Studio’s New API Server Using AI

In this walkthrough I show how, within minutes, I used AI‑driven code search and simple curl commands to locate, enable, and test the newly added API server feature in the open‑source Cherry Studio project, illustrating the power of an AI‑first workflow.

AIAI-firstAPI Server
0 likes · 6 min read
How I Quickly Uncovered Cherry Studio’s New API Server Using AI
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 30, 2024 · Cloud Native

What Caused OpenAI’s Global Outage? Lessons for Cloud‑Native Observability

The article analyzes the December 11 OpenAI outage, revealing that a newly deployed telemetry service overloaded Kubernetes API servers, breaking DNS resolution and slowing recovery, and compares OpenAI’s approach with LoongCollector/iLogtail’s design to offer stability insights for cloud‑native environments.

API ServerCloud NativeKubernetes
0 likes · 15 min read
What Caused OpenAI’s Global Outage? Lessons for Cloud‑Native Observability
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2024 · Cloud Native

How a New Telemetry Service Overwhelmed OpenAI’s Kubernetes API Server

An in‑depth post‑mortem reveals how OpenAI’s newly deployed telemetry service generated massive Kubernetes API requests, overloading the API server, breaking DNS resolution, and slowing recovery, while contrasting OpenAI’s approach with LoongCollector/iLogtail’s design to minimize API load and improve cluster stability.

API ServerCloud NativeCluster Reliability
0 likes · 15 min read
How a New Telemetry Service Overwhelmed OpenAI’s Kubernetes API Server
System Architect Go
System Architect Go
Dec 23, 2024 · Cloud Native

Mastering Kubernetes API Server Flow Control: APF Explained

This article explains how Kubernetes' API Priority and Fairness (APF) mechanism enhances kube‑apiserver traffic control by introducing FlowSchema and PriorityLevelConfiguration objects, allowing fine‑grained request prioritization, concurrency limits, and queue management beyond the basic inflight throttling flags.

APFAPI ServerCloud Native
0 likes · 7 min read
Mastering Kubernetes API Server Flow Control: APF Explained
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Dec 6, 2024 · Operations

Boost Kubernetes API Server Performance: Tuning max-mutating-requests-inflight & watch-cache-size

This guide explains how to optimize Kubernetes API Server performance by configuring the max-mutating-requests-inflight limit and watch-cache-size, offering recommended values for different cluster sizes, monitoring metrics, and step‑by‑step adjustment strategies for stable, high‑throughput clusters.

API ServerKubernetescluster operations
0 likes · 7 min read
Boost Kubernetes API Server Performance: Tuning max-mutating-requests-inflight & watch-cache-size
Baobao Algorithm Notes
Baobao Algorithm Notes
Dec 1, 2023 · Operations

Deploy Hugging Face Transformers with One Click Using LMDeploy

This article explains how LMDeploy streamlines the deployment of Hugging Face transformer models by adding online conversion, offering an OpenAI‑compatible API server, a Gradio WebUI, and 4‑bit weight‑only quantization with AWQ, providing step‑by‑step commands, code examples, and performance insights.

AI inferenceAPI ServerHugging Face
0 likes · 9 min read
Deploy Hugging Face Transformers with One Click Using LMDeploy
Bilibili Tech
Bilibili Tech
Apr 18, 2023 · Cloud Native

Kubernetes Audit Log Analysis for Container Security

The article explains how to enable Kubernetes audit logging and use its detailed fields—such as userAgent, responseStatus, requestURI, and object references—to detect CDK‑generated attacks and other threats like CVE‑2022‑3172, privilege escalation, and backdoor deployment, offering practical detection examples and security recommendations.

API ServerAudit loggingCDK
0 likes · 15 min read
Kubernetes Audit Log Analysis for Container Security
Open Source Linux
Open Source Linux
Jun 16, 2022 · Cloud Native

Mastering Kubernetes Control Plane: etcd, API Server, Scheduler, and Nodes

This article explains the key Kubernetes control‑plane components—including etcd, the API Server, Controller Manager, Scheduler, as well as worker‑node components like Kubelet, kube‑proxy, and the container runtime—detailing their roles, interactions, and the underlying mechanisms such as Raft consensus and admission control.

API ServerControl PlaneKubernetes
0 likes · 10 min read
Mastering Kubernetes Control Plane: etcd, API Server, Scheduler, and Nodes
Open Source Linux
Open Source Linux
May 12, 2022 · Cloud Native

Mastering Kubernetes Control Plane: etcd, API Server, Scheduler & More

This article explains the core components of the Kubernetes control plane—including etcd, the API Server, Controller Manager, Scheduler—as well as key worker‑node components like Kubelet, kube‑proxy, and the container runtime, detailing their roles, interactions, and essential functions.

API ServerControl PlaneKubernetes
0 likes · 11 min read
Mastering Kubernetes Control Plane: etcd, API Server, Scheduler & More
AntTech
AntTech
Mar 7, 2022 · Cloud Native

Design and Practice of Multi‑Cluster Management in SOFAStack CAFE Using an Extended KubeFed Framework

This article details the architectural background, challenges, and practical solutions implemented in SOFAStack CAFE for cloud‑native multi‑cluster deployment, including a custom multi‑topology CRD, an independent federation API server, enhanced KubeFed controller features, and network proxy integration to support hybrid‑cloud scenarios.

API ServerKubeFedKubernetes
0 likes · 18 min read
Design and Practice of Multi‑Cluster Management in SOFAStack CAFE Using an Extended KubeFed Framework
Alibaba Cloud Native
Alibaba Cloud Native
Feb 14, 2020 · Cloud Native

How Kubernetes Schedules Pods and Secures API Access: A Hands‑On Deep Dive

This article walks through building a simple Go web service, containerizing it, pushing the image to a registry, and then exploring how Kubernetes API Server authenticates clients with mutual TLS and how the scheduler’s pre‑filter and scoring phases decide the optimal node for a pod, complete with code snippets and log analysis.

API ServerDockerGo
0 likes · 16 min read
How Kubernetes Schedules Pods and Secures API Access: A Hands‑On Deep Dive
Alibaba Cloud Developer
Alibaba Cloud Developer
Oct 16, 2019 · Cloud Native

How Alibaba Scaled Kubernetes to 10,000 Nodes: Key Optimizations and Lessons

This article details how Alibaba and Ant Financial tackled the performance and stability challenges of running Kubernetes at massive scale, describing their enhancements to etcd, API Server, scheduler, controller failover, load balancing, and other components that enabled a 10k‑node cluster to support the 2019 Tmall 618 promotion.

API ServerCloud NativeKubernetes
0 likes · 25 min read
How Alibaba Scaled Kubernetes to 10,000 Nodes: Key Optimizations and Lessons
Alibaba Cloud Native
Alibaba Cloud Native
Sep 21, 2019 · Cloud Native

How Alibaba Scaled Kubernetes to 10,000 Nodes: Key Optimizations and Lessons

This article details Alibaba's experience deploying Kubernetes at massive scale, describing the performance bottlenecks encountered in etcd, API server, controller, and scheduler components, and presenting the concrete engineering improvements—such as storage sharding, lease‑based heartbeats, load‑balancing, watch bookmarks, and hot‑standby controllers—that enabled stable operation of clusters with tens of thousands of nodes.

API ServerKubernetesetcd
0 likes · 18 min read
How Alibaba Scaled Kubernetes to 10,000 Nodes: Key Optimizations and Lessons