Tagged articles
3116 articles
Page 8 of 32
Linux Ops Smart Journey
Linux Ops Smart Journey
Jul 30, 2024 · Cloud Native

Unveiling Kubernetes: Inside the Cosmic Architecture Powering Cloud Native Apps

Amid the digital transformation era, Kubernetes has become essential for modern cloud computing, and this article demystifies its inner workings by detailing its master and node components, service discovery, storage orchestration, networking, high availability, flexible resource management, and thriving ecosystem.

Cloud NativeKubernetesarchitecture
0 likes · 5 min read
Unveiling Kubernetes: Inside the Cosmic Architecture Powering Cloud Native Apps
Baidu Geek Talk
Baidu Geek Talk
Jul 29, 2024 · Cloud Native

How Baidu’s Jarvis2.0 Redefined Cloud‑Native Microservice Governance

This article examines Baidu's Jarvis2.0 platform, detailing how its multi‑runtime architecture, unified control plane, and automated deployment pipelines transformed a sprawling microservice ecosystem of over 1,000 services into a highly efficient, cloud‑native system that dramatically reduced release times, improved stability, and enabled seamless multi‑language support.

BaiduCloud NativeJarvis2.0
0 likes · 15 min read
How Baidu’s Jarvis2.0 Redefined Cloud‑Native Microservice Governance
58 Tech
58 Tech
Jul 29, 2024 · Databases

HBase Cloud Migration: Architecture, Challenges, and Solutions

This technical report details the background, architecture, construction, core issues, migration plans, and future roadmap of moving 58's HBase clusters to a cloud‑native environment, highlighting cost reduction, operational automation, and performance optimizations.

Big DataCloud NativeHBase
0 likes · 22 min read
HBase Cloud Migration: Architecture, Challenges, and Solutions
Architect
Architect
Jul 28, 2024 · Backend Development

Why Microservices? Benefits, Principles, and Practical Implementation Guide

This article provides a comprehensive analysis of microservices, covering why they are adopted, the drawbacks of monolithic architectures, core characteristics, splitting principles and strategies, the relationship with Domain‑Driven Design, comparisons with other architectural styles, and a detailed overview of the common technical stack used to build and operate microservice systems.

Backend ArchitectureCloud NativeDDD
0 likes · 22 min read
Why Microservices? Benefits, Principles, and Practical Implementation Guide
MaGe Linux Operations
MaGe Linux Operations
Jul 26, 2024 · Operations

How to Install and Use Velero for Kubernetes Backup and Migration

This guide introduces Velero, an open-source Kubernetes backup and migration tool, explains its architecture and workflow, and provides step-by-step instructions for installing the client and server, configuring object storage with MinIO, and managing backups and restores.

Cloud NativeKubernetesVelero
0 likes · 10 min read
How to Install and Use Velero for Kubernetes Backup and Migration
Linux Ops Smart Journey
Linux Ops Smart Journey
Jul 25, 2024 · Cloud Native

Master Docker & Containerd Image Push/Pull: Secure Registry Configuration

This guide explains how to configure Docker and Containerd to correctly set image push and pull parameters for both HTTP and HTTPS registries, covering insecure registry settings, certificate management, host resolution, and command‑line examples using docker, ctr, crictl, and nerdctl to ensure efficient and secure container image handling.

Cloud NativeDockerImage Registry
0 likes · 12 min read
Master Docker & Containerd Image Push/Pull: Secure Registry Configuration
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 19, 2024 · Cloud Native

Integrating Dubbo Applications with Alibaba Cloud Service Mesh (ASM): Best Practices and Transition Scenarios

This guide explains how to connect Dubbo microservices to Alibaba Cloud Service Mesh (ASM) to obtain mTLS, observability, and traffic management, covering prerequisites, step‑by‑step deployment, code examples, a registry‑based transition approach, and recommended best‑practice configurations.

ASMCloud NativeDubbo
0 likes · 29 min read
Integrating Dubbo Applications with Alibaba Cloud Service Mesh (ASM): Best Practices and Transition Scenarios
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jul 17, 2024 · Operations

How NetEase Cloud Music Automated Massive Service Upgrades with a Custom Platform

This article presents a comprehensive case study of NetEase Cloud Music's automatic upgrade platform, detailing the background challenges, technical architecture, sidecar versus component upgrades, workflow orchestration, operational safeguards, performance metrics, and future roadmap for large‑scale microservice migrations.

Cloud NativeMicroservicesOperations
0 likes · 17 min read
How NetEase Cloud Music Automated Massive Service Upgrades with a Custom Platform
Ctrip Technology
Ctrip Technology
Jul 16, 2024 · Backend Development

Implementation and Optimization of the QUIC Protocol in the Trip.com App

This article details the deployment of QUIC in Trip.com’s mobile app, covering multi‑process architecture, containerized upgrades, service discovery, health monitoring, push‑pull resilience, full‑link tracing, congestion‑control algorithm redesign, and the resulting performance and reliability improvements achieved across global users.

Cloud NativePerformance MonitoringQUIC
0 likes · 25 min read
Implementation and Optimization of the QUIC Protocol in the Trip.com App
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 16, 2024 · Cloud Native

Auto‑Instrumenting Java in GraalVM Native Images: A Practical OTel Solution

This article explains how Alibaba Cloud engineers tackled the challenges of long startup times and high memory usage in Java serverless workloads by designing a static Java Agent instrumentation that works with GraalVM Native Image, preserving OpenTelemetry observability while dramatically improving performance.

Cloud NativeJava AgentOpenTelemetry
0 likes · 13 min read
Auto‑Instrumenting Java in GraalVM Native Images: A Practical OTel Solution
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 16, 2024 · Cloud Native

How to Seamlessly Migrate Your Self‑Hosted Prometheus + Thanos to Alibaba Cloud Managed Prometheus

This guide explains why many users still run self‑built Prometheus + Thanos, outlines the common deployment scenarios and pain points, and provides detailed step‑by‑step migration procedures—including metric collection, visualization, and alerting—for moving to Alibaba Cloud's fully managed Prometheus service across Kubernetes, ECS, and IDC environments.

Alibaba CloudCloud NativePrometheus
0 likes · 14 min read
How to Seamlessly Migrate Your Self‑Hosted Prometheus + Thanos to Alibaba Cloud Managed Prometheus
Top Architecture Tech Stack
Top Architecture Tech Stack
Jul 16, 2024 · Cloud Native

Designing Fault‑Tolerant Microservices Architecture: Patterns and Practices

The article explains how to build reliable microservices by isolating failures, applying graceful degradation, change‑management, health checks, self‑healing, fallback caching, retry strategies, rate limiting, fast‑fail principles, circuit breakers, and failure‑testing to ensure high availability in distributed cloud‑native systems.

Cloud NativeMicroservicesOperations
0 likes · 14 min read
Designing Fault‑Tolerant Microservices Architecture: Patterns and Practices
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jul 5, 2024 · Cloud Native

Koordinator v1.5.0 Release: New Features and Enhancements

Koordinator v1.5.0, the 13th major release since its open‑source debut, introduces pod‑level NUMA alignment, Terway network QoS, core scheduling, and numerous performance and stability improvements, while also being accepted as a CNCF Sandbox project and outlining future roadmap plans.

Cloud NativeCore SchedulingKubernetes
0 likes · 14 min read
Koordinator v1.5.0 Release: New Features and Enhancements
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 4, 2024 · Cloud Native

Mastering AliyunPipelineConfig: A Complete Guide to Modern Log Collection in Cloud Native Environments

This article explains the design, advantages, and detailed configuration of the new AliyunPipelineConfig CRD for Alibaba Cloud Log Service, covering its structure, field specifications, migration from AliyunLogConfig, validation, error handling, and practical examples for Kubernetes log collection.

Alibaba CloudCRDCloud Native
0 likes · 24 min read
Mastering AliyunPipelineConfig: A Complete Guide to Modern Log Collection in Cloud Native Environments
Alibaba Cloud Observability
Alibaba Cloud Observability
Jul 4, 2024 · Cloud Native

How to Install and Use Go Agent for Cloud‑Native Application Monitoring on ACK

This guide explains how Go Agent solves the manual instrumentation problem in Go microservices by injecting monitoring code at compile time, details its implementation using AST analysis and -toolexec, lists supported SDKs and product features, and provides step‑by‑step instructions for installing the ARMS component, compiling Go binaries, granting permissions, and enabling monitoring on Alibaba Cloud ACK clusters.

ARMSCloud NativeGo
0 likes · 16 min read
How to Install and Use Go Agent for Cloud‑Native Application Monitoring on ACK
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 29, 2024 · Backend Development

Evolution of System Architecture: From Monolithic to Cloud‑Native Microservices

This article traces the historical evolution of backend system architecture—from early single‑machine monolithic deployments through database isolation, static‑dynamic separation, front‑back separation, clustering, caching, message queues, sharding, SOA, microservices, containerization, CI/CD, and finally cloud‑native designs—highlighting the motivations, challenges, and key technologies at each stage.

Cloud NativeMicroservicesSystem Architecture
0 likes · 33 min read
Evolution of System Architecture: From Monolithic to Cloud‑Native Microservices
Ops Development Stories
Ops Development Stories
Jun 28, 2024 · Cloud Native

Multi-Cluster Kubernetes: Benefits, Federation, Karmada, and Practical Tips

This article explains why organizations adopt multi‑cluster Kubernetes for high availability, hybrid‑cloud scaling, and fault isolation, outlines the preparatory steps, compares Federation v1 and v2, introduces Karmada as a CNCF project, and shares practical non‑federated deployment, monitoring, traffic management, and migration techniques with code examples.

Cloud NativeDevOpsFederation
0 likes · 18 min read
Multi-Cluster Kubernetes: Benefits, Federation, Karmada, and Practical Tips
TAL Education Technology
TAL Education Technology
Jun 27, 2024 · Cloud Native

Case Study: Integrating the AiFenxi BI Platform with Apache APISIX Gateway for Improved Performance and Stability

This case study details how the AiFenxi business intelligence platform integrated Apache APISIX as a high‑performance API gateway within Tencent Cloud TKE, addressing latency, scalability, and security challenges, and outlines the architectural changes, deployment steps, and resulting performance improvements.

APISIXBI platformCloud Native
0 likes · 7 min read
Case Study: Integrating the AiFenxi BI Platform with Apache APISIX Gateway for Improved Performance and Stability
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 26, 2024 · Cloud Native

Securing LLM Calls with Alibaba Cloud ASM Service Mesh Using a Wasm Plugin

This article demonstrates how to protect large language model (LLM) requests in a cloud‑native environment by using Alibaba Cloud ASM service mesh and a custom Wasm plugin to dynamically inject API keys, enforce custom denial patterns, and optionally route requests through a private LLM for intelligent data‑leak detection.

Cloud NativeKubernetesLLM
0 likes · 13 min read
Securing LLM Calls with Alibaba Cloud ASM Service Mesh Using a Wasm Plugin
Practical DevOps Architecture
Practical DevOps Architecture
Jun 24, 2024 · Backend Development

Comprehensive Backend Development and Cloud Native Training Curriculum

This document outlines a detailed training curriculum covering backend development, cloud-native technologies, microservice frameworks, databases, messaging systems, containerization, orchestration, performance tuning, and related tools, presented as a series of numbered sessions with practical labs and theory.

Cloud NativeDevOpsMicroservices
0 likes · 7 min read
Comprehensive Backend Development and Cloud Native Training Curriculum
DataFunSummit
DataFunSummit
Jun 22, 2024 · Cloud Native

Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: Insights from Uber Presto

This whitepaper examines the industry trend of migrating data‑intensive analytics workloads to cloud‑native environments, revealing how cloud storage’s unique cost model demands finer‑grained performance optimization, and presents Uber Presto case‑study findings that expose fragmented I/O patterns and associated financial impacts.

Cloud NativeCost ModelData Analytics
0 likes · 3 min read
Optimizing I/O for Data-Intensive Analytics in Cloud-Native Environments: Insights from Uber Presto
Architect's Guide
Architect's Guide
Jun 22, 2024 · Databases

Understanding TiDB: NewSQL Features, Architecture, and Comparison with MySQL and NoSQL

This article provides a comprehensive overview of TiDB, a cloud‑native NewSQL database that combines the scalability of NoSQL with the strong consistency of traditional relational databases, detailing its architecture, high‑availability design, HTAP capabilities, MySQL compatibility, and practical application scenarios.

Cloud NativeHTAPMySQL compatibility
0 likes · 23 min read
Understanding TiDB: NewSQL Features, Architecture, and Comparison with MySQL and NoSQL
Alibaba Cloud Observability
Alibaba Cloud Observability
Jun 20, 2024 · Cloud Native

How to Achieve Unified Multi‑Cluster Monitoring with Alibaba Cloud Prometheus and ACK One

This article explains how enterprises can use Alibaba Cloud's ACK One platform together with the Prometheus‑based Observability service to build a unified, cloud‑native monitoring solution for heterogeneous, multi‑region Kubernetes clusters, addressing scalability, cost, and operational challenges.

ACK OneCloud NativeKubernetes
0 likes · 12 min read
How to Achieve Unified Multi‑Cluster Monitoring with Alibaba Cloud Prometheus and ACK One
dbaplus Community
dbaplus Community
Jun 17, 2024 · Cloud Native

Why Terraform Should Retire: Embracing Kubernetes Operator Alternatives

The article critiques Terraform’s architectural quirks, drift‑management challenges, licensing controversy, and community split, then argues that adopting Kubernetes operator‑based infrastructure management offers a more modular, collaborative, and cloud‑agnostic solution for modern DevOps teams.

Cloud NativeInfrastructure as CodeKubernetes
0 likes · 13 min read
Why Terraform Should Retire: Embracing Kubernetes Operator Alternatives
Volcano Engine Developer Services
Volcano Engine Developer Services
Jun 14, 2024 · Operations

How ByteDance Built an EB‑Scale Log Service: Design & Optimization

This article details the evolution of ByteDance's TLS (Tinder Log Service) from a Loki‑based prototype to an EB‑scale, cloud‑native log system, covering its core properties, data organization, architecture, caching, hybrid storage, private codec, ecosystem compatibility, intelligent features, and real‑world case studies.

ByteDanceCloud NativeTLS
0 likes · 24 min read
How ByteDance Built an EB‑Scale Log Service: Design & Optimization
Alibaba Cloud Observability
Alibaba Cloud Observability
Jun 13, 2024 · Cloud Native

Kickstart Your Observability Journey with Alibaba Cloud Monitoring

This guide introduces new Alibaba Cloud users to the fundamentals of cloud observability, explaining the metric‑trace‑log stack, the layered monitoring pyramid, and step‑by‑step how to set up out‑of‑the‑box resource monitoring, dashboards, alerts, and advanced integration options.

Alibaba CloudCloud NativeMetrics
0 likes · 7 min read
Kickstart Your Observability Journey with Alibaba Cloud Monitoring
TAL Education Technology
TAL Education Technology
Jun 11, 2024 · Cloud Native

Containerizing Elasticsearch: Architecture Upgrade, API Gateway Integration, and Cloud‑Native Migration

This article details how a large‑scale Elasticsearch deployment was transformed from physical servers to a Kubernetes‑based, containerized architecture, addressing cost, scalability, API compatibility, security, observability, and multi‑cloud migration using the 极限网关 API gateway.

Cloud NativeElasticsearchKubernetes
0 likes · 11 min read
Containerizing Elasticsearch: Architecture Upgrade, API Gateway Integration, and Cloud‑Native Migration
Alibaba Cloud Native
Alibaba Cloud Native
Jun 9, 2024 · Cloud Native

Deploy a SpringAI Chatbot in Minutes with Alibaba Cloud AppStack

This guide walks you through obtaining free ECS resources and a DashScope API‑key, initializing a SpringAI application with the AppStack template, deploying it via the built‑in pipeline, and verifying the AI chatbot runs successfully, while also showing optional template‑based batch upgrades.

ChatbotCloud NativeSpringAI
0 likes · 6 min read
Deploy a SpringAI Chatbot in Minutes with Alibaba Cloud AppStack
DataFunSummit
DataFunSummit
Jun 8, 2024 · Cloud Native

Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto

This whitepaper examines the industry shift of moving data‑intensive analytics to cloud‑native platforms, analyzes how cloud storage cost models affect performance optimization, and presents Uber Presto case‑study findings that reveal fragmented access patterns and the financial impact of traditional I/O strategies in the cloud.

Cloud NativeCost ModelData Analytics
0 likes · 3 min read
Optimizing I/O for Data‑Intensive Analytics in Cloud‑Native Environments: Insights from Uber Presto
Java Tech Enthusiast
Java Tech Enthusiast
Jun 8, 2024 · Cloud Native

Performance Comparison of Spring Boot and Quarkus for Cloud‑Native Java Applications

The article benchmarks Spring Boot and Quarkus for cloud‑native Java apps, showing Quarkus native images start up twice as fast, use less memory, and produce smaller artifacts than Spring Boot, while Spring Boot JVM offers slightly lower latency, and a migration guide demonstrates API compatibility and code reuse.

Cloud NativeMicroservicesPerformance Testing
0 likes · 13 min read
Performance Comparison of Spring Boot and Quarkus for Cloud‑Native Java Applications
Tencent Cloud Developer
Tencent Cloud Developer
Jun 7, 2024 · Cloud Native

Multi-AZ High‑Availability Architecture of Tencent Cloud TDMQ for Apache Pulsar

Tencent Cloud TDMQ for Apache Pulsar achieves multi‑AZ high availability by containerizing ZooKeeper, BookKeeper and Brokers, using managed ZK, persistent cloud disks and elastic NICs, enforcing quorum and rack‑aware replicas, and planning cross‑region bidirectional replication to ensure seamless disaster recovery and continuous messaging.

Cloud NativeMulti‑AZPulsar
0 likes · 15 min read
Multi-AZ High‑Availability Architecture of Tencent Cloud TDMQ for Apache Pulsar
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 6, 2024 · Cloud Native

Argo Workflows vs Jenkins: Building Cloud‑Native CI/CD Pipelines on ACK One Serverless

Argo Workflows, a cloud‑native Kubernetes job orchestrator, offers superior autoscaling, concurrency, cost efficiency, and seamless integration with the Argo ecosystem compared to Jenkins, and the article provides a detailed tutorial on deploying a Go‑based CI pipeline on ACK One Serverless Argo with BuildKit, NAS caching, and parameterized workflow templates.

Argo WorkflowsBuildKitCloud Native
0 likes · 15 min read
Argo Workflows vs Jenkins: Building Cloud‑Native CI/CD Pipelines on ACK One Serverless
Cloud Native Technology Community
Cloud Native Technology Community
Jun 5, 2024 · Cloud Native

Which Multi‑Cluster Networking Solution Is Right for Your Kubernetes Fleet?

With Kubernetes clusters expanding across multiple environments, choosing the right cross‑cluster networking method is crucial; this article compares six approaches—underlay networks, overlay CNI meshes, Submariner, Skupper, KubeSlice, and others—detailing their core principles, advantages, and limitations to help you select the best fit.

CNICloud NativeKubernetes
0 likes · 11 min read
Which Multi‑Cluster Networking Solution Is Right for Your Kubernetes Fleet?
ByteDance Cloud Native
ByteDance Cloud Native
Jun 3, 2024 · Cloud Native

How to Build High‑Availability Kubernetes Clusters with Volcengine VKE & VCI

This guide explains how Volcengine's VKE (Kubernetes Engine) and VCI (Elastic Container Instance) enable high‑availability, multi‑AZ deployments, covering cluster creation, control‑plane distribution, virtual node configuration, inventory‑aware scheduling, and practical YAML examples for resilient cloud‑native workloads.

Cloud NativeContainer ServiceKubernetes
0 likes · 14 min read
How to Build High‑Availability Kubernetes Clusters with Volcengine VKE & VCI
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
May 31, 2024 · Cloud Native

Best Practices for Deploying AI Model Inference on Knative

This guide explains how to efficiently deploy AI model inference services on Knative by externalizing model data, using Fluid for accelerated loading, configuring secrets, ImageCache, graceful shutdown, probes, autoscaling parameters, mixed ECS/ECI resources, shared GPU scheduling, and observability features to achieve fast scaling, low cost, and high elasticity.

AI Model InferenceCloud NativeGPU
0 likes · 19 min read
Best Practices for Deploying AI Model Inference on Knative
360 Smart Cloud
360 Smart Cloud
May 27, 2024 · Databases

Deployment, Optimization, and Management of TiDB Service in 360 Zhihui Cloud

This article details the product models, usage scenarios, and a series of performance and operational optimizations—including query plan health checks, space reclamation, resource isolation, cloud‑native deployment, cross‑region high availability, and unified monitoring—implemented for the TiDB service operated by 360 Zhihui Cloud since its launch in April 2023.

Cloud NativeKubernetesTiDB
0 likes · 17 min read
Deployment, Optimization, and Management of TiDB Service in 360 Zhihui Cloud
ITPUB
ITPUB
May 26, 2024 · Cloud Native

Containerizing Elasticsearch & ClickHouse on Kubernetes: Bilibili’s Scalable, Low‑Cost Solution

This article details Bilibili’s journey of containerizing Elasticsearch and ClickHouse on Kubernetes, covering the challenges of stateful services, architectural decisions, custom operators, storage and network solutions, deployment steps, observability enhancements, and the resulting cost, quality, and efficiency gains.

Cloud NativeElasticsearchKubernetes
0 likes · 38 min read
Containerizing Elasticsearch & ClickHouse on Kubernetes: Bilibili’s Scalable, Low‑Cost Solution
iQIYI Technical Product Team
iQIYI Technical Product Team
May 24, 2024 · Operations

High Availability and Disaster Recovery Practices of iQIYI's Video Relay Service (VRS)

iQIYI’s Video Relay Service ensures uninterrupted video playback by employing a two‑region, three‑center hybrid cloud architecture, multi‑layer storage, cross‑AZ retry mechanisms, protective rate‑limiting and degradation paths, layered monitoring, and rigorous stress‑testing and chaos engineering to achieve high availability and disaster recovery.

Backend ArchitectureCloud NativeVideo Streaming
0 likes · 18 min read
High Availability and Disaster Recovery Practices of iQIYI's Video Relay Service (VRS)
MaGe Linux Operations
MaGe Linux Operations
May 23, 2024 · Cloud Native

How to Slash Server Load: Optimizing Go Apps on Kubernetes

This article explains how to reduce server load and improve performance of Go applications on Kubernetes by using lightweight base images, fine‑tuning resource limits, adjusting garbage‑collection settings, employing connection pools, configuring health probes, and leveraging autoscaling, load balancing, and caching techniques.

Cloud NativeGoKubernetes
0 likes · 19 min read
How to Slash Server Load: Optimizing Go Apps on Kubernetes
DaTaobao Tech
DaTaobao Tech
May 22, 2024 · Cloud Native

AONE Serverless Quality Assurance: Design, Testing, and Monitoring

The article explains how AONE Serverless separates development and operations domains to enable independent iteration and lower costs, details a QA workflow—functional regression, performance testing, monitoring verification, reverse‑engineered interfaces, automated API traffic replay, and isolated pressure testing— and reports deployment build time cuts of 17% and overall deployment reductions up to 44%, while outlining challenges and future plans for layered automation and plugin‑based extensions.

Cloud NativeServerlessquality assurance
0 likes · 9 min read
AONE Serverless Quality Assurance: Design, Testing, and Monitoring
Huolala Tech
Huolala Tech
May 21, 2024 · Information Security

How Huolala Built a Comprehensive Security Asset Map for Cloud‑Native Environments

Huolala’s Information Security team built a comprehensive security asset library and visualization framework, detailing asset pain points, mapping methodology, detection and drawing modules, and measurable outcomes, to enhance asset visibility, risk assessment, and continuous security operations in a cloud‑native environment.

Cloud NativeGraph DatabaseInformation Security
0 likes · 12 min read
How Huolala Built a Comprehensive Security Asset Map for Cloud‑Native Environments
Continuous Delivery 2.0
Continuous Delivery 2.0
May 21, 2024 · Operations

Platform Engineering: Enhancing Developer Experience and Operational Efficiency

The article explains how platform engineering, as an evolution of DevOps, aims to reduce developers' cognitive load by providing integrated, user‑friendly tools and workflows, discusses real‑world practices such as ByteDance's CI/CD pipeline, and outlines future trends toward process‑orientation, intelligence, and scalability.

Cloud NativeDevOpsDeveloper Experience
0 likes · 4 min read
Platform Engineering: Enhancing Developer Experience and Operational Efficiency
DataFunSummit
DataFunSummit
May 20, 2024 · Big Data

Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture

This article explains how CloudLakehouse’s Multi‑Cluster elastic architecture enables high‑concurrency, low‑latency real‑time analytics on data lakes by addressing storage‑compute separation, dynamic caching, and automated scaling, providing a cost‑effective solution for customer‑facing data products.

Cloud NativeMulti-ClusterReal-time analytics
0 likes · 18 min read
Real-Time High-Performance Analytics on Data Lakes with CloudLakehouse Multi-Cluster Architecture
360 Smart Cloud
360 Smart Cloud
May 20, 2024 · Cloud Native

Design and Implementation of Managed (Dedicated) Kubernetes Clusters

This article introduces the concept of managed Kubernetes clusters, compares them with traditional setups, details the architecture and implementation process, discusses challenges such as cross‑cluster webhook routing, certificate issuance, container runtime integration, and outlines the diverse features of the dedicated cluster solution.

CertificateCloud NativeKubernetes
0 likes · 13 min read
Design and Implementation of Managed (Dedicated) Kubernetes Clusters
DataFunSummit
DataFunSummit
May 19, 2024 · Cloud Native

Design and Implementation of a Cloud‑Native Recommendation System Architecture

This article explains how to design and implement a recommendation system by leveraging a four‑layer cloud‑native stack, covering virtualization, micro‑service migration, service governance, elasticity, cloud‑native business capabilities, and chaos‑engineering‑based stability practices to achieve cost‑effective, high‑performance, and reliable recommendation services.

Cloud NativeMicroservicesVirtualization
0 likes · 10 min read
Design and Implementation of a Cloud‑Native Recommendation System Architecture
Architect's Tech Stack
Architect's Tech Stack
May 18, 2024 · Operations

Graceful Shutdown in Kubernetes and Spring Boot Microservices: Best Practices and Optimizations

This article explains the concept of graceful shutdown, outlines essential steps, examines Kubernetes pod termination and Spring Boot integration with Nacos, and provides practical optimizations—including PreStop hooks, terminationGracePeriod settings, and actuator shutdown—to ensure reliable service termination without data loss.

Cloud NativeGraceful ShutdownKubernetes
0 likes · 11 min read
Graceful Shutdown in Kubernetes and Spring Boot Microservices: Best Practices and Optimizations
DataFunTalk
DataFunTalk
May 18, 2024 · Artificial Intelligence

Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions

This article details the background, goals, and evolution of Tencent's FinTech AI development platform, outlines the technical challenges faced in feature engineering, model training, and inference services, and presents the comprehensive solutions and future plans implemented to improve efficiency, stability, and scalability.

Cloud NativeFinTechInference
0 likes · 13 min read
Tencent FinTech AI Development Platform: Architecture, Challenges, and Solutions
Selected Java Interview Questions
Selected Java Interview Questions
May 16, 2024 · Backend Development

Rethinking Microservices in 2023: From Google’s “Microservices 2.0” to Amazon’s Monolith Shift

In 2023, leading cloud giants like Google and Amazon publicly questioned the value of microservices, presenting new “Microservices 2.0” principles and monolithic alternatives that promise dramatically lower latency, cost, and operational complexity while reshaping backend architecture debates.

Cloud NativeMicroservicesmonolith
0 likes · 12 min read
Rethinking Microservices in 2023: From Google’s “Microservices 2.0” to Amazon’s Monolith Shift
360 Smart Cloud
360 Smart Cloud
May 15, 2024 · Cloud Native

Polefs: A Cloud‑Native Distributed Cache File System for AI Training Workloads

The article outlines the challenges of massive AI training data, defines storage performance requirements, and presents Polefs—a cloud‑native distributed cache file system with unified storage, metadata acceleration, and read/write caching designed to improve GPU utilization and reduce data redundancy.

AICloud NativeDistributed File System
0 likes · 14 min read
Polefs: A Cloud‑Native Distributed Cache File System for AI Training Workloads
Yang Money Pot Technology Team
Yang Money Pot Technology Team
May 14, 2024 · Cloud Native

Optimizing CI/CD Pipeline and Release Strategies for Microservices in a Cloud‑Native Environment

This article details a comprehensive overhaul of a company's CI/CD workflow for Java, Python, Go, and Node.js microservices, introducing automated pipelines, parallel builds, rolling, canary, and blue‑green deployments on Kubernetes with Istio to improve release speed, stability, and observability.

Cloud Nativeci/cdrelease-management
0 likes · 15 min read
Optimizing CI/CD Pipeline and Release Strategies for Microservices in a Cloud‑Native Environment
dbaplus Community
dbaplus Community
May 13, 2024 · Cloud Native

Do You Really Need Kubernetes? Real‑World Dev Opinions and Practical Tips

This article compiles diverse Zhihu answers discussing whether Kubernetes is necessary, weighing its automation benefits and scaling power against configuration complexity, resource costs, and team readiness, while offering concrete kubectl commands and guidance for making an informed adoption decision.

Cloud NativeKubernetesOperations
0 likes · 9 min read
Do You Really Need Kubernetes? Real‑World Dev Opinions and Practical Tips
MaGe Linux Operations
MaGe Linux Operations
May 10, 2024 · Cloud Native

How to Set Up a Secure Kubernetes Dashboard Without RBAC

Learn step-by-step how to download the Kubernetes dashboard YAML, prepare offline images, modify the deployment to use a private registry, configure the API server address, create the service, verify its status, and access the dashboard via a browser, all without RBAC.

Cloud NativeDashboardDocker
0 likes · 3 min read
How to Set Up a Secure Kubernetes Dashboard Without RBAC
360 Tech Engineering
360 Tech Engineering
May 10, 2024 · Cloud Native

Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360

This article shares the architecture design, cluster deployment, storage selection, multi‑region mode, service discovery, performance tuning, monitoring, alerting, and future plans of a production‑grade Apache Pulsar platform running on Kubernetes, providing valuable insights for engineers adopting Pulsar.

Apache PulsarCloud NativeDistributed Messaging
0 likes · 17 min read
Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360
360 Smart Cloud
360 Smart Cloud
May 9, 2024 · Cloud Native

Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360

This article shares 360's practical experience deploying Apache Pulsar on Kubernetes, covering architecture design, storage choices, multi‑cluster modes, service discovery, performance tuning, monitoring, alerting, and future plans, offering valuable guidance for engineers building cloud‑native messaging platforms.

Apache PulsarCloud NativeDistributed Messaging
0 likes · 19 min read
Practical Experience of Deploying and Optimizing Apache Pulsar on Kubernetes at 360
JD Cloud Developers
JD Cloud Developers
May 9, 2024 · Operations

Unlocking IT Architecture Governance: From Theory to JD.com’s Real‑World Success

This article explains the origins and concepts of IT architecture and governance, compares traditional and internet‑era enterprise structures, reviews major standards such as COBIT, ITIL and IT4IT, outlines practical governance methods, and showcases JD.com’s concrete implementation across infrastructure, applications, and organizational processes.

COBITCloud NativeIT Governance
0 likes · 30 min read
Unlocking IT Architecture Governance: From Theory to JD.com’s Real‑World Success
Alibaba Cloud Observability
Alibaba Cloud Observability
May 8, 2024 · Cloud Computing

Mastering FinOps: A Practical Guide to Cloud Cost Management and Optimization

FinOps combines finance and DevOps to create a culture of transparent, predictable cloud spending, guiding enterprises through preparation, strategy alignment, shared responsibility, cost insight, optimization tactics, and ongoing control, enabling them to balance agility with disciplined cost management in cloud-native environments.

Cloud Cost ManagementCloud NativeCost Optimization
0 likes · 12 min read
Mastering FinOps: A Practical Guide to Cloud Cost Management and Optimization
Liangxu Linux
Liangxu Linux
May 7, 2024 · Cloud Native

Unlocking Docker: From Containers to Compose, Swarm, and Kubernetes

This article explains Docker’s role as a container platform, covering basic concepts like images and Dockerfiles, the architecture of Docker Engine, and how Docker Compose, Swarm, and Kubernetes extend container orchestration across multiple services and hosts.

Cloud NativeContainersDocker
0 likes · 13 min read
Unlocking Docker: From Containers to Compose, Swarm, and Kubernetes
Efficient Ops
Efficient Ops
May 6, 2024 · Cloud Native

Why Is My Kubernetes Pod OOMKilled Before Reaching Its Memory Limit?

A Kubernetes pod repeatedly restarted with exit code 137 despite not hitting its memory limit, revealing that node‑level memory pressure and QoS‑based eviction caused the pod to be killed, and outlining how to diagnose and prevent such OOMKill events.

Cloud NativeKubernetesOOMKill
0 likes · 9 min read
Why Is My Kubernetes Pod OOMKilled Before Reaching Its Memory Limit?
DevOps Cloud Academy
DevOps Cloud Academy
May 6, 2024 · Cloud Native

How to Deploy a Highly Available Application on Kubernetes

This article explains key Kubernetes configurations—such as pod replicas, pod anti‑affinity, deployment strategies, graceful termination, probes, resource allocation, scaling, and disruption budgets—to achieve high availability and zero‑downtime deployments for containerized applications in production.

Cloud NativeKubernetesProbes
0 likes · 20 min read
How to Deploy a Highly Available Application on Kubernetes
DataFunTalk
DataFunTalk
May 6, 2024 · Big Data

OPPO Next‑Generation Big Data & AI Integrated Architecture on Functional Cloud

This article presents OPPO’s next‑generation big‑data and AI integrated architecture on functional cloud, detailing a cloud‑native elastic compute framework, a unified data‑lake solution, real‑time feature platforms, machine‑learning data acceleration, and hybrid‑cloud deployments, highlighting performance gains and cost reductions.

Big DataCloud Nativeelastic computing
0 likes · 11 min read
OPPO Next‑Generation Big Data & AI Integrated Architecture on Functional Cloud
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
May 2, 2024 · Cloud Native

Understanding Istio Architecture and Core Principles

This article provides a comprehensive overview of Istio, explaining its role as an open‑source service mesh for cloud‑native microservices, detailing the data plane and control plane components, sidecar proxy functions, traffic management, security, and monitoring while also promoting related learning resources.

Cloud NativeIstioKubernetes
0 likes · 4 min read
Understanding Istio Architecture and Core Principles
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
May 1, 2024 · Cloud Native

Comprehensive Introduction to Service Mesh

This article provides a detailed, English-language overview of Service Mesh, explaining its role in cloud‑native microservice architectures, the need for it, its core principles—including data and control planes—and key Istio components, while also offering promotional links to related resources.

Cloud NativeControl PlaneData Plane
0 likes · 5 min read
Comprehensive Introduction to Service Mesh
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Apr 28, 2024 · Cloud Native

Enhancements in ACK One Serverless Argo Workflows: Large File Upload, Artifact GC, and Streaming Transfer

The article introduces ACK One Serverless Argo Workflows, a fully managed cloud‑native workflow service that adds chunked large‑file uploads, automatic artifact garbage collection, and OSS streaming download to overcome limitations of the open‑source Argo Workflows in high‑volume data scenarios.

Argo WorkflowsArtifactsCloud Native
0 likes · 10 min read
Enhancements in ACK One Serverless Argo Workflows: Large File Upload, Artifact GC, and Streaming Transfer
DevOps
DevOps
Apr 27, 2024 · Cloud Native

How Platform Engineering Can Support API Development

The article explains how platform engineering—a cloud‑native discipline for building self‑service toolchains—enhances API development by improving developer experience, enabling better API integration, discovery, management, and CI/CD pipelines, ultimately providing a more customizable and efficient development environment.

API developmentCloud NativeDeveloper Experience
0 likes · 8 min read
How Platform Engineering Can Support API Development
21CTO
21CTO
Apr 25, 2024 · Cloud Native

Why IBM’s $6.4B HashiCorp Deal Could Redefine Multi‑Cloud Management

IBM’s $6.4 billion acquisition of HashiCorp brings Terraform, Vault and other infrastructure‑as‑code tools under its umbrella, promising tighter hybrid‑cloud integration while sparking debate over open‑source licensing and the future of multi‑cloud strategies.

AcquisitionCloud NativeHashiCorp
0 likes · 6 min read
Why IBM’s $6.4B HashiCorp Deal Could Redefine Multi‑Cloud Management
360 Smart Cloud
360 Smart Cloud
Apr 25, 2024 · Cloud Native

Building High‑Performance RoCE v2 and InfiniBand Networks in a Cloud‑Native Environment for Large‑Model Training

This article explains how to construct high‑performance RoCE v2 and InfiniBand networks within a cloud‑native Kubernetes environment, detailing the underlying technologies, required components, configuration steps, and performance test results that demonstrate significant communication speed improvements for large‑scale AI model training.

AI trainingCloud NativeHigh‑Performance Networking
0 likes · 12 min read
Building High‑Performance RoCE v2 and InfiniBand Networks in a Cloud‑Native Environment for Large‑Model Training
Alibaba Cloud Native
Alibaba Cloud Native
Apr 24, 2024 · R&D Management

Why R&D Standards Fail and How Aone Inspires Better DevOps Practices

This article examines the origins and goals of software development standards, explains why they often break down when enforced through tools, presents concrete examples of such failures, and shares insights from Alibaba's Aone platform that guide the design of a more effective DevOps‑centric R&D workflow.

AoneCloud NativeDevOps
0 likes · 27 min read
Why R&D Standards Fail and How Aone Inspires Better DevOps Practices
Alibaba Cloud Observability
Alibaba Cloud Observability
Apr 24, 2024 · Cloud Native

How to Connect Kibana to Alibaba Cloud SLS via ES Compatibility with Docker‑Compose

This guide shows how to deploy a lightweight Elasticsearch instance, a proxy, and Kibana with Docker‑Compose to connect Kibana to Alibaba Cloud Log Service’s Elasticsearch‑compatible API, configure index patterns, run queries, create visualizations, and troubleshoot common issues.

Cloud NativeDocker ComposeElasticsearch compatibility
0 likes · 9 min read
How to Connect Kibana to Alibaba Cloud SLS via ES Compatibility with Docker‑Compose
Bilibili Tech
Bilibili Tech
Apr 23, 2024 · Cloud Computing

Redesign of Bilibili Edge CDN Architecture: From Legacy to Cloud‑Native Load‑Balancing

Bilibili’s downstream CDN was rebuilt from a tightly‑coupled scheduler model into a cloud‑native architecture featuring separate gateway, cache, back‑origin and control services that provide layer‑4 and layer‑7 load‑balancing, cluster‑wide health checks, centralized routing, and a refactored NGINX slice module, cutting latency by 40 % and dramatically reducing incidents.

CDNCloud NativeEdge Computing
0 likes · 11 min read
Redesign of Bilibili Edge CDN Architecture: From Legacy to Cloud‑Native Load‑Balancing
Huolala Tech
Huolala Tech
Apr 23, 2024 · Operations

How Huolala Automated Full‑Link Load Testing to Boost Efficiency and Cut Costs

This article details Huolala's journey from manual, resource‑intensive full‑link load testing to a fully automated, model‑driven platform that improves peak‑capacity verification, reduces testing time and manpower, ensures safety through circuit‑breaker mechanisms, and delivers measurable cost and performance gains.

Cloud NativeLoad TestingScalability
0 likes · 21 min read
How Huolala Automated Full‑Link Load Testing to Boost Efficiency and Cut Costs
Mike Chen's Internet Architecture
Mike Chen's Internet Architecture
Apr 19, 2024 · Cloud Native

Understanding Docker: Architecture, Core Components, and Underlying Mechanisms

This article explains Docker’s core architecture—including images, containers, and registries—and details how Linux namespaces, cgroups, and UnionFS work together to provide resource isolation, limitation, and lightweight virtualization, while also offering promotional links to extensive architecture and interview collections.

Cloud NativeContainersDocker
0 likes · 6 min read
Understanding Docker: Architecture, Core Components, and Underlying Mechanisms
Architect
Architect
Apr 18, 2024 · Industry Insights

Why 2023 Signals Microservices' Decline: Lessons from Google, Amazon, DHH

2023 saw a wave of high‑profile tech leaders—Google’s engineers, Amazon Prime Video, and DHH—questioning the microservices paradigm, presenting data‑driven case studies that show dramatic latency and cost reductions when moving to monolithic or runtime‑managed architectures, and urging architects to reassess scalability versus complexity trade‑offs.

Cloud NativeCost OptimizationMicroservices
0 likes · 12 min read
Why 2023 Signals Microservices' Decline: Lessons from Google, Amazon, DHH
Cloud Native Technology Community
Cloud Native Technology Community
Apr 18, 2024 · Cloud Native

What’s New in Kubernetes v1.30? 45 Features Reaching GA

Kubernetes v1.30, released in 2024, introduces 45 major updates—including GA for pod scheduling readiness, interactive kubectl delete, CEL‑based admission control, container‑based HPA, new service trafficDistribution, indexed job success policies, volume manager reconstruction, hostIPs list, node log query, default node swap, and recursive read‑only mounts—providing a more conservative yet feature‑rich evolution of the platform.

Cloud NativeGAKubernetes
0 likes · 19 min read
What’s New in Kubernetes v1.30? 45 Features Reaching GA