Tagged articles
14 articles
Page 1 of 1
Shuge Unlimited
Shuge Unlimited
Feb 27, 2026 · Databases

Why Is Milvus, the 43K‑Star Vector Database, So Powerful?

This article analyzes Milvus—its open‑source origins, three deployment modes, four‑layer architecture, eight‑plus indexing algorithms, real‑world case studies, and a detailed comparison with competitors—highlighting its strengths, weaknesses, common pitfalls, and when it’s the right choice for large‑scale AI workloads.

AI workloadsCloud NativeDeployment
0 likes · 15 min read
Why Is Milvus, the 43K‑Star Vector Database, So Powerful?
21CTO
21CTO
Feb 2, 2026 · Databases

Is Oracle’s Promise a New Era for MySQL? Community Reactions and Risks

Oracle claims a new era for MySQL by moving commercial‑only features to the community edition and adding AI‑focused vector functions, but developers question whether these promises are timely or sufficient, fearing continued neglect of the open‑source community.

AI workloadsDatabase CommunityOracle
0 likes · 5 min read
Is Oracle’s Promise a New Era for MySQL? Community Reactions and Risks
HyperAI Super Neural
HyperAI Super Neural
Dec 17, 2025 · Artificial Intelligence

Can cuTile’s Tile Paradigm Disrupt the GPU Programming Landscape and Challenge Triton?

The article analyzes NVIDIA's newly announced cuTile, a tile‑based Python DSL for GPU kernels, examining its technical differences from CUDA's SIMT model, its potential to reshape the GPU programming ecosystem, community reactions, competition with Triton, and the uncertain future that hinges on ecosystem maturity and migration tools.

AI workloadsCUDAGPU programming
0 likes · 12 min read
Can cuTile’s Tile Paradigm Disrupt the GPU Programming Landscape and Challenge Triton?
Full-Stack DevOps & Kubernetes
Full-Stack DevOps & Kubernetes
Oct 30, 2025 · Cloud Native

15 Real-World Kubernetes Use Cases You Need to Know

Explore the 15 most impactful Kubernetes scenarios—from microservices and auto‑scaling to multi‑cloud deployments, AI workloads, edge computing, and compliance—detailing how they boost reliability, efficiency, and cost‑effectiveness, while also highlighting situations where Kubernetes may not be the right choice.

AI workloadsAuto ScalingEdge Computing
0 likes · 11 min read
15 Real-World Kubernetes Use Cases You Need to Know
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jun 26, 2025 · Cloud Native

How Fluid Enables Cloud‑Native Elastic Data for AI Workloads

Fluid introduces a cloud‑native elastic data abstraction that lets AI workloads efficiently access, manage, and accelerate heterogeneous data sources across serverful and serverless environments, offering unified Dataset, Runtime, and DataOperation concepts, and has been recognized by CNCF’s 2024 Technology Radar.

AI workloadsCNCFCloud Native
0 likes · 9 min read
How Fluid Enables Cloud‑Native Elastic Data for AI Workloads
Architects' Tech Alliance
Architects' Tech Alliance
Apr 13, 2025 · Industry Insights

Which NVIDIA GPU Wins for AI? Deep Dive into RTX & A‑Series Performance and Power

This article presents a detailed comparison of major NVIDIA GPUs—including RTX 4090, RTX 4090 D, RTX 3090, A10, A40, A100, and H100—covering memory size, bandwidth, Tensor BF16/FP16/FP32 throughput, FP16/FP32 performance, power draw and release dates, and explains how these specs affect AI workload efficiency.

AI workloadsGPUIndustry analysis
0 likes · 9 min read
Which NVIDIA GPU Wins for AI? Deep Dive into RTX & A‑Series Performance and Power
Architects' Tech Alliance
Architects' Tech Alliance
Jan 18, 2025 · Industry Insights

Why Co‑Packaged Optics Are Redefining Data Center Networks

The article analyzes how Co‑Packaged Optics (CPO) and silicon photonics address exploding data‑center bandwidth demands, reduce power consumption, and enable AI‑driven workloads, while outlining industry roadmaps, major vendor contributions, and future technical challenges.

AI workloadsCo-Packaged OpticsData Center Networking
0 likes · 14 min read
Why Co‑Packaged Optics Are Redefining Data Center Networks
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 13, 2024 · Industry Insights

Why Distributed Cloud‑Native Is the Next Enterprise Cloud Choice – Expert Insights

In an interview, Alibaba Cloud’s distributed cloud‑native platform lead explains how distributed cloud‑native addresses elasticity, high‑availability, and multi‑cluster management challenges, outlines the evolution of ACK One, and forecasts its role in AI and edge computing for modern enterprises.

ACK OneAI workloadsEdge Computing
0 likes · 11 min read
Why Distributed Cloud‑Native Is the Next Enterprise Cloud Choice – Expert Insights
ByteDance Cloud Native
ByteDance Cloud Native
Aug 12, 2024 · Cloud Native

How mGPU Enables Efficient GPU Sharing for AI Workloads in Cloud‑Native Environments

The article explains the mGPU solution from Volcano Engine, detailing its kernel‑level GPU virtualization, container runtime hooks, and scheduling mechanisms that allow multiple containers to share a single NVIDIA GPU with isolated compute and memory resources, achieving near‑lossless performance and up to 50% higher utilization for AI tasks.

AI workloadsGPU Sharingcontainer-runtime
0 likes · 9 min read
How mGPU Enables Efficient GPU Sharing for AI Workloads in Cloud‑Native Environments
Open Source Linux
Open Source Linux
Jul 11, 2024 · Operations

Why Traditional ECMP Fails for AI Workloads and How Modern Load‑Balancing Solves It

The article examines the rapid growth of AI‑driven compute demand, explains why conventional ECMP load balancing struggles with uneven, high‑bandwidth flows in data‑center networks, and compares advanced strategies such as Fat‑Tree design, VoQ, flow‑based, packet‑based, flowlet, and cell‑based approaches, including vendor implementations.

AI workloadsData Center NetworkECMP
0 likes · 13 min read
Why Traditional ECMP Fails for AI Workloads and How Modern Load‑Balancing Solves It
Architects' Tech Alliance
Architects' Tech Alliance
May 7, 2024 · Operations

Why ECMP Struggles in AI‑Driven Data Centers and Better Load‑Balancing Alternatives

As AI workloads push intelligent compute power growth beyond 50% CAGR, data‑center networks face massive parallel paths, making traditional ECMP load‑balancing insufficient and causing severe congestion, while newer granular schemes such as packet‑spraying, flowlet, and cell‑based balancing offer higher bandwidth utilization and fairness.

AI workloadsData Center NetworkingECMP
0 likes · 17 min read
Why ECMP Struggles in AI‑Driven Data Centers and Better Load‑Balancing Alternatives
MaGe Linux Operations
MaGe Linux Operations
Mar 5, 2024 · Cloud Native

How to Run GPU‑Accelerated AI Workloads on Kubernetes

This article explains how Kubernetes supports GPU workloads for AI and machine learning, covering device plugins, pod GPU requests, oversubscription, security isolation, cloud‑provider node setup, and protecting GPU nodes from non‑GPU pods.

AI workloadsCloud NativeDevice Plugin
0 likes · 8 min read
How to Run GPU‑Accelerated AI Workloads on Kubernetes
Baidu Geek Talk
Baidu Geek Talk
Aug 2, 2023 · Cloud Native

Baidu Intelligent Cloud GPU Container Virtualization 2.0: Advancements and Full-Scenario Practices

Baidu Intelligent Cloud’s GPU Container Virtualization 2.0 combines user‑mode and kernel‑mode isolation in a dual‑engine design that unifies scheduling of AI compute, rendering and encoding, supports mixed deployment and multi‑scheduler integration, and boosts GPU utilization across inference, offline tasks, autonomous‑driving simulation, and cloud‑gaming workloads.

AI workloadsGPU virtualizationMulti Scheduler
0 likes · 14 min read
Baidu Intelligent Cloud GPU Container Virtualization 2.0: Advancements and Full-Scenario Practices
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jun 29, 2023 · Artificial Intelligence

How Baidu’s Dual‑Engine GPU Container Virtualization Boosts AI, Rendering, and Cloud Gaming

This article explains Baidu Intelligent Cloud’s GPU container virtualization 2.0, detailing its dual‑engine architecture, resource pooling, and scheduling innovations that isolate AI, rendering, and codec workloads, and showcases real‑world scenarios such as online inference, autonomous‑driving simulation, and cloud gaming to improve GPU utilization.

AI workloadsGPU virtualizationKubernetes scheduling
0 likes · 14 min read
How Baidu’s Dual‑Engine GPU Container Virtualization Boosts AI, Rendering, and Cloud Gaming