Tagged articles
60 articles
Page 1 of 1
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Jan 17, 2025 · Artificial Intelligence

Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid

This article explains how to reduce inference cost and improve performance for large language models on Alibaba Cloud ACK by using Knative's request‑based autoscaling, custom ResourcePolicy priority scheduling, and Fluid data‑caching to achieve elastic scaling, resource pre‑emption, and faster model loading.

FluidInferenceKnative
0 likes · 22 min read
Elastic Scaling of Large Language Model Inference on Alibaba Cloud ACK with Knative, ResourcePolicy, and Fluid
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Nov 8, 2024 · Industry Insights

Unlocking Efficient LLM Inference: Insights from China’s Cloud Computing Conference

The 5th China Cloud Computing Infrastructure Developer Conference in Beijing highlighted cutting‑edge AI inference optimization, Knative‑based serverless acceleration, AMD PMU virtualization, and CDI‑driven GPU management, offering detailed technical insights and real‑world case studies that illustrate how cloud providers are tackling performance and cost challenges of modern workloads.

AI inferenceAMD virtualizationCloud Native
0 likes · 9 min read
Unlocking Efficient LLM Inference: Insights from China’s Cloud Computing Conference
360 Smart Cloud
360 Smart Cloud
Jun 14, 2024 · Cloud Native

Knative Eventing: Concepts, Core Components, and Practical Usage Patterns (Source‑to‑Sink, Channel & Subscription, Broker & Trigger)

This article introduces Knative Eventing concepts and core components, then demonstrates three practical usage patterns—Source‑to‑Sink, Channel and Subscription, and Broker and Trigger—through step‑by‑step examples, Kubernetes manifests, and command‑line operations, highlighting event flow, filtering, and reply handling.

CloudNativeEventDrivenKnative
0 likes · 32 min read
Knative Eventing: Concepts, Core Components, and Practical Usage Patterns (Source‑to‑Sink, Channel & Subscription, Broker & Trigger)
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
May 31, 2024 · Cloud Native

Best Practices for Deploying AI Model Inference on Knative

This guide explains how to efficiently deploy AI model inference services on Knative by externalizing model data, using Fluid for accelerated loading, configuring secrets, ImageCache, graceful shutdown, probes, autoscaling parameters, mixed ECS/ECI resources, shared GPU scheduling, and observability features to achieve fast scaling, low cost, and high elasticity.

AI Model InferenceCloud NativeGPU
0 likes · 19 min read
Best Practices for Deploying AI Model Inference on Knative
Alibaba Cloud Native
Alibaba Cloud Native
Apr 12, 2024 · Cloud Native

How to Run Knative Serverless on Alibaba Cloud Spot Instances for Cost Savings

Learn how to integrate Alibaba Cloud's low‑cost spot (preemptible) instances with Knative serverless workloads, configure pod annotations, handle automatic replacement, monitor eviction events, and apply best‑practice YAML settings to achieve cost‑effective, resilient deployments on a cloud‑native platform.

Alibaba CloudKnativeKubernetes
0 likes · 13 min read
How to Run Knative Serverless on Alibaba Cloud Spot Instances for Cost Savings
Alibaba Cloud Native
Alibaba Cloud Native
Feb 28, 2024 · Cloud Native

Building a Unified Cloud‑Native Serverless Platform Across Public Cloud and IDC with ACK One & Knative

This guide explains how to design and implement a unified cloud‑native serverless platform that runs seamlessly on public clouds and on‑premise IDC clusters using Alibaba Cloud ACK One, Kubernetes, and Knative, covering architecture, key components, deployment steps, and best‑practice recommendations.

ACK OneKnativeKubernetes
0 likes · 11 min read
Building a Unified Cloud‑Native Serverless Platform Across Public Cloud and IDC with ACK One & Knative
Alibaba Cloud Native
Alibaba Cloud Native
Dec 13, 2023 · Cloud Native

Mastering Traffic Management in Knative: Blue‑Green Deployments, Autoscaling, and Monitoring

This article explains how Knative leverages request‑driven traffic management to simplify blue‑green releases, configure multi‑gateway ingress, apply revision garbage‑collection policies, enable custom domains, support multiple protocols, and provide automatic scaling and observability through Prometheus and Grafana.

Blue‑Green deploymentKnativeautoscaling
0 likes · 15 min read
Mastering Traffic Management in Knative: Blue‑Green Deployments, Autoscaling, and Monitoring
Alibaba Cloud Native
Alibaba Cloud Native
Oct 20, 2023 · Cloud Native

How Knative Cuts AI Service Costs by 60% and Halves Deployment Time

This article explains how Shuhe Tech combined Knative with AI workloads to achieve 60% resource cost savings and reduce model deployment cycles from one day to half a day, detailing Knative's architecture, request‑based autoscaling, multi‑version releases, and advanced scaling features.

Cloud NativeKPAKnative
0 likes · 19 min read
How Knative Cuts AI Service Costs by 60% and Halves Deployment Time
Alibaba Cloud Native
Alibaba Cloud Native
Sep 3, 2023 · Cloud Native

Master Knative’s Request‑Based Autoscaling: KPA, Scale‑to‑Zero, and Advanced Strategies

This article explains how Knative implements request‑based autoscaling with KPA, details the scale‑to‑zero mechanism, shows how to handle burst traffic using stable and panic windows, and demonstrates advanced extensions such as resource pools, precise MPA scaling, and predictive AHPA configurations with concrete YAML examples.

Cloud NativeKPAKnative
0 likes · 18 min read
Master Knative’s Request‑Based Autoscaling: KPA, Scale‑to‑Zero, and Advanced Strategies
HelloTech
HelloTech
Apr 19, 2023 · Cloud Native

How FaaS Transforms AI Platforms: Lessons from Haro’s Cloud‑Native Journey

The article analyzes the operational, stability, and cost challenges of Haro’s AI platform, explains why a serverless FaaS architecture—specifically Knative—was selected, and details the implementation steps, performance gains, and future scenarios for AI workloads.

AI PlatformCloud NativeCost Optimization
0 likes · 8 min read
How FaaS Transforms AI Platforms: Lessons from Haro’s Cloud‑Native Journey
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Mar 24, 2023 · Cloud Native

Serverless Adoption at NetEase Cloud Music: Architecture, Migration, and Benefits

NetEase Cloud Music transitioned from public FaaS to a private Knative‑based serverless platform, delivering elastic audio‑video processing, multi‑language support, event‑driven scaling across hybrid private and public clouds, boosting resource utilization and cutting costs, though cold‑starts and container expertise remain challenges.

Cloud NativeEvent-Driven ArchitectureKnative
0 likes · 16 min read
Serverless Adoption at NetEase Cloud Music: Architecture, Migration, and Benefits
Cloud Native Technology Community
Cloud Native Technology Community
Mar 14, 2023 · Cloud Native

How PNC Bank Accelerated Code Deployment and Automated Compliance with Cloud‑Native Knative and TriggerMesh

This case study explains how PNC Bank, one of the largest U.S. banks, used cloud‑native technologies such as Knative, Kubernetes, and TriggerMesh to replace a 30‑day manual compliance process with an automated Policy‑as‑Code system, dramatically shortening deployment cycles and enabling continuous delivery.

Cloud NativeCompliance AutomationDevOps
0 likes · 8 min read
How PNC Bank Accelerated Code Deployment and Automated Compliance with Cloud‑Native Knative and TriggerMesh
Tencent Cloud Developer
Tencent Cloud Developer
Jul 26, 2022 · Cloud Native

Understanding Knative: A Cloud-Native Serverless Framework

Knative is a CNCF‑incubated, cloud‑native serverless framework on Kubernetes that combines Build, Eventing, and Serving components—featuring a Knative Pod Autoscaler that can scale pods to zero—offering improved resource utilization, rapid traffic response, and developer productivity despite modest performance overhead.

Auto ScalingCNCFCloud-Native
0 likes · 16 min read
Understanding Knative: A Cloud-Native Serverless Framework
Open Source Linux
Open Source Linux
Oct 26, 2021 · Cloud Native

How Modern Distributed Apps Evolve on Kubernetes: From Monoliths to Serverless

This article explores the evolution of modern distributed applications, outlining the core primitives required for lifecycle, networking, resource binding, and state management, and examines how Kubernetes, microservices, service meshes, Knative, Dapr, and emerging trends reshape cloud‑native architectures.

Cloud NativeDaprKnative
0 likes · 30 min read
How Modern Distributed Apps Evolve on Kubernetes: From Monoliths to Serverless
Alibaba Cloud Native
Alibaba Cloud Native
Oct 11, 2021 · Cloud Native

Step‑by‑Step Guide to Deploy Knative on Alibaba Cloud

This tutorial introduces Knative, explains its architecture and core components, and provides a detailed, image‑rich walkthrough for deploying Knative on Alibaba Cloud, testing a sample serverless application, and managing the cluster via CloudShell.

Alibaba CloudCloud NativeDeployment
0 likes · 7 min read
Step‑by‑Step Guide to Deploy Knative on Alibaba Cloud
Kuaishou Tech
Kuaishou Tech
Aug 16, 2021 · Backend Development

Design and Challenges of Kuaishou Y‑tech Server‑Side Effects Platform

The article examines Kuaishou Y‑tech's server‑side effects platform, detailing its background, technical challenges, exploration of open‑source workflow and serverless frameworks, architectural adjustments for non‑blocking APIs, and future development directions while highlighting recruitment opportunities.

Backend ArchitectureKnativeKubernetes
0 likes · 8 min read
Design and Challenges of Kuaishou Y‑tech Server‑Side Effects Platform
Architect
Architect
Jul 25, 2021 · Cloud Native

Understanding Serverless: Concepts, Benefits, Limitations, and Knative Implementation

This article explains the motivations behind Serverless, defines its core concepts such as FaaS and BaaS, outlines key characteristics like one‑step deployment, auto‑scaling and event‑driven execution, discusses suitable use cases, current challenges, and introduces Knative as a cloud‑native implementation.

Cloud NativeFaaSKnative
0 likes · 13 min read
Understanding Serverless: Concepts, Benefits, Limitations, and Knative Implementation
Cloud Native Technology Community
Cloud Native Technology Community
Jun 9, 2021 · Cloud Native

Understanding Knative Eventing: Broker & Trigger Architecture and Implementation

This article provides a comprehensive overview of Knative Eventing's Broker and Trigger model, detailing background concepts, event routing patterns, data‑plane and control‑plane workflows, and includes practical YAML and command‑line examples for deploying and managing the components in a Kubernetes environment.

BrokerCloud NativeKnative
0 likes · 11 min read
Understanding Knative Eventing: Broker & Trigger Architecture and Implementation
Alibaba Cloud Native
Alibaba Cloud Native
Jun 7, 2021 · Cloud Native

Master Knative Serverless: Serving, Eventing, and Alibaba Cloud Integration

This article introduces Knative, the open‑source serverless framework built on Kubernetes, explains its core Serving and Eventing components, details Alibaba Cloud’s deep integration and enhanced capabilities such as traffic management, auto‑scaling policies, and provides a step‑by‑step example of a real‑time bullet‑screen application.

Alibaba CloudCloud NativeKnative
0 likes · 8 min read
Master Knative Serverless: Serving, Eventing, and Alibaba Cloud Integration
ITFLY8 Architecture Home
ITFLY8 Architecture Home
Apr 17, 2021 · Cloud Native

How Knative Handles Cold‑Start Traffic: From Activator to Pod

This article explores Knative’s traffic routing and autoscaling mechanisms, detailing how requests are initially directed through the Activator during cold‑start, how VirtualService configurations evolve, and how newer versions shift traffic handling to Kubernetes Service/Endpoint layers, improving performance and decoupling gateway logic.

IstioKnativeKubernetes
0 likes · 14 min read
How Knative Handles Cold‑Start Traffic: From Activator to Pod
Alibaba Cloud Native
Alibaba Cloud Native
Apr 5, 2021 · Cloud Native

How Knative Enables Traffic‑Based Autoscaling and Gray Deployments

This article explains Knative’s traffic‑driven autoscaling and gray‑release capabilities, detailing the request flow architecture, the roles of Service, Configuration, Route and Revision, and walks through built‑in scaling strategies such as KPA, HPA, scheduled‑HPA, event‑gateway and custom plugins, with practical examples.

Cloud NativeGray DeploymentHPA
0 likes · 10 min read
How Knative Enables Traffic‑Based Autoscaling and Gray Deployments
Alibaba Cloud Native
Alibaba Cloud Native
Mar 27, 2021 · Cloud Native

Why Knative? Simplifying Serverless on Kubernetes with ASK Integration

This article explains why Knative is needed to simplify Kubernetes‑based serverless workloads, describes its core modules and traffic‑based gray release capabilities, and shows how Alibaba Cloud's ASK platform integrates with Knative to reduce operational complexity, improve elasticity, and lower costs.

ASKCloud NativeKnative
0 likes · 10 min read
Why Knative? Simplifying Serverless on Kubernetes with ASK Integration
21CTO
21CTO
Mar 20, 2021 · Cloud Native

How Knative Enables Low‑Code Serverless Workflows for Complex Business Logic

This article explores the evolution of microservices, the challenges they introduce, and how Knative’s Cloud‑Native serverless framework—through its Build, Serving, and Eventing modules—can be leveraged to create a low‑code, production‑grade workflow platform that simplifies complex business logic while reducing development overhead.

KnativeKubernetesServerless
0 likes · 14 min read
How Knative Enables Low‑Code Serverless Workflows for Complex Business Logic
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 19, 2021 · Cloud Native

Building a Serverless Workflow Platform with Knative: Architecture and Implementation

The article describes how the team built a production‑grade serverless workflow platform on Knative by creating four modules—a dashboard, API, operator, and syncer—that generate Kubernetes resources, automate CI/CD, monitor via Prometheus, and aim to cut boilerplate while supporting future extensions such as richer constructs, multi‑language support, and synchronous invocations.

Cloud NativeEvent-drivenKnative
0 likes · 12 min read
Building a Serverless Workflow Platform with Knative: Architecture and Implementation
Architects Research Society
Architects Research Society
Feb 1, 2021 · Cloud Native

Understanding Knative: Serving, Configuration, Traffic Splitting, and Service Integration

This article introduces Knative as a Kubernetes‑based serverless platform, explains its three core components, walks through deploying a Hello World service with YAML and kubectl commands, shows how configuration changes create revisions, demonstrates traffic splitting, and illustrates integration with external services such as Twilio.

Cloud NativeDevOpsKnative
0 likes · 8 min read
Understanding Knative: Serving, Configuration, Traffic Splitting, and Service Integration
Architects Research Society
Architects Research Society
Jan 22, 2021 · Cloud Native

Introduction to Knative Serving: Architecture and Core Resources

Knative Serving, built on Kubernetes and Istio, provides middleware primitives for rapid serverless container deployment, automatic scaling to zero, Istio‑based routing, and snapshotting of code and configuration, with key custom resources such as Service, Route, Configuration, and Revision to manage lifecycle and traffic.

Cloud NativeIstioKnative
0 likes · 4 min read
Introduction to Knative Serving: Architecture and Core Resources
Alibaba Cloud Native
Alibaba Cloud Native
Apr 26, 2020 · Cloud Native

How to Build Serverless Cloud‑Native Apps with RocketMQ and Knative

This article explains how to combine RocketMQ with Knative to create cloud‑native, event‑driven serverless applications, covering cloud‑native fundamentals, Knative architecture, RocketMQSource integration, a food‑delivery use case, deployment steps, command‑line examples, and additional scenarios.

Cloud NativeEvent-drivenKnative
0 likes · 11 min read
How to Build Serverless Cloud‑Native Apps with RocketMQ and Knative
Alibaba Cloud Native
Alibaba Cloud Native
Apr 23, 2020 · Cloud Native

Latest Cloud Native Updates: Istio Donation, Kotlin gRPC, Kubernetes Watch Cache, Knative 0.14, and More

This roundup highlights recent cloud‑native developments, including the donation of Istio to an open‑source foundation, Kotlin gRPC becoming open source, dynamic watch‑cache sizing and authentication tweaks in Kubernetes, Knative 0.14 release details, xDS v3 planning, plus curated open‑source project and reading recommendations.

Cloud NativeIstioKnative
0 likes · 6 min read
Latest Cloud Native Updates: Istio Donation, Kotlin gRPC, Kubernetes Watch Cache, Knative 0.14, and More
DevOps Cloud Academy
DevOps Cloud Academy
Jan 23, 2020 · Cloud Native

Serverless Jenkins and Prow: Cloud‑Native CI/CD on Kubernetes

Jenkins X introduces serverless Jenkins, leveraging Prow and Knative on Kubernetes to provide a cloud‑native, highly available CI/CD solution that eliminates single‑point failures, reduces costs, and integrates GitOps, while offering migration guidance, FAQs, and practical Terraform commands for deployment.

Cloud NativeJenkinsKnative
0 likes · 11 min read
Serverless Jenkins and Prow: Cloud‑Native CI/CD on Kubernetes
Alibaba Cloud Native
Alibaba Cloud Native
Dec 12, 2019 · Cloud Native

What’s New in Kubernetes v1.17? Key Features and Improvements Explained

The article summarizes Kubernetes v1.17’s 22 enhancements—including GA cloud‑provider labels, beta VolumeSnapshot, CSI storage migration, numerous feature stabilizations, dual‑stack networking, Knative updates, and curated open‑source and reading recommendations—providing a detailed technical overview for practitioners.

KnativeKubernetesRelease Notes
0 likes · 10 min read
What’s New in Kubernetes v1.17? Key Features and Improvements Explained
Alibaba Cloud Native
Alibaba Cloud Native
Dec 5, 2019 · Cloud Native

How Knative Turns Ordinary Apps into Serverless Cloud‑Native Services

This article explains the motivations behind cloud native computing, introduces Knative’s three core modules—Tekton, Eventing, and Serving—and shows how they enable serverless capabilities, traffic management, gray‑release, and elasticity for modern applications, illustrated with a practical demo and Q&A.

KnativeKubernetesServerless
0 likes · 26 min read
How Knative Turns Ordinary Apps into Serverless Cloud‑Native Services
Alibaba Cloud Native
Alibaba Cloud Native
Oct 18, 2019 · Cloud Native

Deploy WebSocket and gRPC Services on Knative: A Step‑by‑Step Guide

This article walks you through deploying a WebSocket chat service and a gRPC ping service on Knative, covering repository cloning, Docker image preparation, Knative Service YAML configuration, kubectl deployment commands, and browser‑based verification to ensure the services run correctly in a serverless environment.

DockerGoKnative
0 likes · 9 min read
Deploy WebSocket and gRPC Services on Knative: A Step‑by‑Step Guide
Alibaba Cloud Native
Alibaba Cloud Native
Aug 29, 2019 · Cloud Native

Mastering Knative Serving SDK: Build Serverless Apps with Go Context

This article walks through the fundamentals of Golang Context, explains how Knative Serving SDK leverages Context for informer and client initialization, and provides step‑by‑step code examples for building serverless applications, including logging, reconciler implementation, and practical deployment tips.

GoKnativeSDK
0 likes · 11 min read
Mastering Knative Serving SDK: Build Serverless Apps with Go Context
Alibaba Cloud Native
Alibaba Cloud Native
Aug 23, 2019 · Cloud Native

Mastering Knative Sequence: Build Event‑Driven Pipelines with 4 Real‑World Scenarios

This guide explains how Knative Eventing's Sequence CRD enables multi‑step event pipelines, walks through its spec definition, and demonstrates four practical usage patterns—direct service calls, event‑driven processing, cascading sequences, and Broker/Trigger integration—complete with YAML examples and visual diagrams.

KnativeKubernetesPipeline
0 likes · 14 min read
Mastering Knative Sequence: Build Event‑Driven Pipelines with 4 Real‑World Scenarios
Alibaba Cloud Native
Alibaba Cloud Native
Aug 19, 2019 · Cloud Native

Latest Cloud Native Updates: Kubernetes Security Audit, Ephemeral Containers, New Projects

This weekly roundup highlights the CNCF security audit of Kubernetes and related projects, the archiving of rkt, several Kubernetes enhancements such as readonly interfaces and ephemeral containers, Istio's HTTP/2 DDoS fix, Knative v0.8.0 features, the krew plugin manager, Alluxio, and curated reading on microservices, ARM containers, WebAssembly, Helm charts, and a cloud‑native transformation case study.

DevOpsKnativeKubernetes
0 likes · 10 min read
Latest Cloud Native Updates: Kubernetes Security Audit, Ephemeral Containers, New Projects
Alibaba Cloud Native
Alibaba Cloud Native
Aug 12, 2019 · Cloud Native

What’s New in Cloud Native: Kubernetes Security Fixes, Project Updates, and Must‑Read Resources

This week’s cloud‑native roundup covers Mesosphere’s rebranding to D2iQ, two critical Kubernetes CVEs with remediation steps, upcoming CNCF Kubernetes summits, the Knative 0.8 release, curated open‑source project recommendations, and a selection of in‑depth reading on multi‑cluster, serverless, and API‑gateway technologies.

CNCFFluxKnative
0 likes · 11 min read
What’s New in Cloud Native: Kubernetes Security Fixes, Project Updates, and Must‑Read Resources
Alibaba Cloud Native
Alibaba Cloud Native
Aug 5, 2019 · Cloud Native

What’s New in the Cloud‑Native Ecosystem This Week? Kubernetes, Knative, Istio and More

This week’s cloud‑native roundup covers Alibaba Cloud Container Service’s top Forrester ranking, the first production‑ready 1.0 release of Virtual Kubelet, key Kubernetes design proposals, Knative event‑type namespace discussions, Istio multi‑cluster enhancements, and several noteworthy open‑source projects.

Container ServicesKnativeKubernetes
0 likes · 9 min read
What’s New in the Cloud‑Native Ecosystem This Week? Kubernetes, Knative, Istio and More
Alibaba Cloud Native
Alibaba Cloud Native
Jul 19, 2019 · Cloud Native

Master Traffic Splitting and Version Control in Knative Serving

This guide walks through building a Go REST‑API, containerizing it, deploying multiple Knative Service revisions, configuring traffic percentages for gray releases, validating revisions before traffic shift, and rolling back, while comparing Knative’s traffic‑driven model to traditional pod‑based deployments.

Blue‑Green deploymentDockerGo
0 likes · 15 min read
Master Traffic Splitting and Version Control in Knative Serving
Alibaba Cloud Native
Alibaba Cloud Native
Jul 12, 2019 · Cloud Native

How to Deploy Production-Ready Knative Services on Alibaba Cloud Container Service

This guide walks you through deploying a Knative Service on Alibaba Cloud Container Service, configuring logging and monitoring with Log Service, setting up alerting, and enabling distributed tracing, providing step‑by‑step commands, YAML examples, and console screenshots for a production‑grade serverless setup.

Alibaba CloudContainer ServiceKnative
0 likes · 9 min read
How to Deploy Production-Ready Knative Services on Alibaba Cloud Container Service
Alibaba Cloud Native
Alibaba Cloud Native
Jul 5, 2019 · Cloud Native

Mastering Knative Build: Key Features and Practical YAML Examples

This article explains the core concepts of Knative Build, outlines its most important features, and provides detailed YAML examples for steps, templates, sources, service accounts, volumes, and timeout configuration, helping developers create native Kubernetes pipelines without pre‑provisioned build environments.

KnativeKubernetesYAML
0 likes · 9 min read
Mastering Knative Build: Key Features and Practical YAML Examples