Tagged articles
271 articles
Xiaohongshu Tech REDtech
May 12, 2026 · Artificial Intelligence

Treating Automated Testing as AI Coding: Xiaohongshu GUI Agent Real‑World Review

During the 2026 Spring Festival promotion, Xiaohongshu replaced manual UI testing with a three‑layer AI‑driven GUI Agent that executed over 43,000 runs across 106 devices and 128 scenarios, achieving 58% automation, 82% AI‑generated case adoption, 68% bug recall, 98% stability and roughly $1 per test case while drastically cutting token costs.

AI Coding · Automated Testing · Code-as-Action
0 likes · 23 min read
Machine Heart
May 9, 2026 · Artificial Intelligence

Can QuantClaw Cut OpenClaw Costs by 21% and Speed Up Inference by 15%?

QuantClaw, an open‑source plug‑in for the OpenClaw AI agent framework, uses a systematic quantization study to dynamically route tasks to appropriate model precisions, achieving up to 21% cost reduction, 8‑15% latency improvement, and even higher task scores across diverse workloads.

AI agents · Cost Optimization · Model Quantization
0 likes · 8 min read
Lao Guo's Learning Space
May 3, 2026 · Artificial Intelligence

2026 Enterprise Guide to Large Model Fine‑Tuning: Choosing, Training, and Deploying

This comprehensive guide explains why enterprises should fine‑tune large language models instead of using raw APIs or RAG, compares six fine‑tuning techniques (Full, LoRA, QLoRA, AdaLoRA, DoRA, Prompt‑Tuning), evaluates popular toolchains, outlines a step‑by‑step workflow, presents cost analyses, real‑world case studies, and practical best‑practice recommendations for 2026.

Cost Optimization · Enterprise AI · Fine-tuning
0 likes · 18 min read
Old Meng AI Explorer
Apr 23, 2026 · Artificial Intelligence

Zero‑Cost AI Coding: How to Connect Google Gemini Free Tier to Claude Code

Claude Code offers a great AI coding experience but quickly becomes costly, so this guide shows how to route its requests through Google AI Studio’s free Gemini 2.5 Flash model via OpenRouter or an open‑source proxy, compares performance and pricing, and provides step‑by‑step configuration, advanced switching tips, and common pitfalls.

AI Coding · Claude Code · Cost Optimization
0 likes · 14 min read
AI Tech Publishing
Apr 20, 2026 · Artificial Intelligence

How Claude Code Achieves 92% Prompt Cache Hit Rate and Cuts Costs by 81% – A Deep Dive

This article explains the mechanics of prompt‑caching for large language models, breaks down static versus dynamic context, details KV‑cache operation and its pricing, and shows how Claude Code’s 30‑minute programming session reached a 92% cache hit rate that reduced inference costs by 81%, concluding with three production‑grade design rules.

AI agents · Anthropic API · Claude Code
0 likes · 13 min read
Alibaba Cloud Developer
Apr 20, 2026 · Operations

How We Built a 24/7 Autonomous User‑Feedback Pipeline with Qoder CLI

The article details how a growing Qoder product suite prompted the creation of a fully automated, 24‑hour feedback handling pipeline that classifies and clusters feedback, analyzes logs, and even generates fix code using Qoder CLI agents, cutting manual effort from 30 minutes per issue to about two minutes while maintaining human code‑review oversight.

AI automation · Cost Optimization · DevOps
0 likes · 13 min read
AI Engineer Programming
Apr 16, 2026 · Artificial Intelligence

Choosing the Right LLM: A Complete Guide to Selecting from Over 2 Million Models

With more than two million LLMs available, this guide explains how to evaluate functional capabilities, latency, throughput, cost, tool‑calling reliability, context‑window size and compliance, and presents a step‑by‑step framework for picking the most suitable model for each business scenario.

Benchmarking · Context Window · Cost Optimization
0 likes · 25 min read
AI Insight Log
Apr 11, 2026 · Artificial Intelligence

Can Opus + Sonnet Advisor Cut Costs While Raising AI Benchmark Scores?

Anthropic’s new advisor strategy lets the more capable Opus model act as a consultant for the cheaper Sonnet or Haiku models, delivering higher benchmark scores (e.g., SWE‑bench Multilingual up to 74.8% and BrowseComp up to 41.2%) while reducing per‑task cost to about 15% of solo runs. It introduces trade‑offs, such as the need for the executor to recognize when to ask for advice and potential vendor lock‑in.

Anthropic · Claude · Cost Optimization
0 likes · 8 min read
AI Explorer
Apr 10, 2026 · Artificial Intelligence

Achieve Top‑Tier AI Performance at Low Cost with Claude’s Advisor Strategy

Claude’s new Advisor Strategy lets cheaper models like Sonnet or Haiku call the powerful Opus model for guidance, delivering higher benchmark scores and up to 85% cost reduction, while the new advisor tool simplifies integration via a single API call.

AI Model Collaboration · Claude · Cost Optimization
0 likes · 6 min read
Node.js Tech Stack
Apr 10, 2026 · Artificial Intelligence

How Anthropic’s Advisor Strategy Boosts Sonnet Scores by 2.7% While Cutting Costs 12%

Anthropic’s new advisor strategy flips the traditional multi‑agent model by letting a cheap front‑line model call Opus for advice only when needed, delivering a 2.7 percentage‑point score lift on SWE‑bench, a 12% cost reduction, and a simple one‑line API integration, while also outlining its limitations and future implications.

Anthropic · Claude · Cost Optimization
0 likes · 10 min read
Machine Learning Algorithms & Natural Language Processing
Apr 8, 2026 · Artificial Intelligence

Can an Open‑Source Router Cut AI Agent Costs by 60% and Keep Sensitive Data Local?

The article analyzes three major pain points of current AI agents—privacy risk, high cloud cost, and poor local performance—and presents ClawXRouter, an open‑source end‑cloud routing plugin that uses three‑level privacy routing, cost‑aware routing, and dual‑track memory to reduce expenses by 58% while improving performance by 6.3%, all without exposing sensitive data.

ClawXRouter · Cost Optimization · Edge Computing
0 likes · 8 min read
Digital Planet
Apr 8, 2026 · Industry Insights

How Qingdao Beer Turned Shrinking Sales into Profit Growth: Lessons for Channel Managers

Amid a stagnant Chinese beer market, Qingdao Beer’s 2025 report shows modest revenue growth but a sharp profit rise achieved by cutting costs and redesigning channel fee structures, offering a detailed roadmap for channel directors to escape the costly “fee‑vs‑sales” dilemma through precise, data‑driven expense allocation and product‑level value creation.

Beer Industry · Channel Management · Cost Optimization
0 likes · 17 min read
Top Architecture Tech Stack
Apr 5, 2026 · Artificial Intelligence

Which OpenClaw API Saves You Money? 5 Solutions Tested, Up to 55% Savings

Choosing the right API for OpenClaw agents dramatically impacts latency, stability, and monthly costs, and this article evaluates five options across eight weighted criteria, revealing that a mixed strategy using an aggregation platform with DeepSeek as a fallback can cut expenses by up to 55% while maintaining performance.

Cost Optimization · LLM API · OpenClaw
0 likes · 9 min read
Old Meng AI Explorer
Apr 3, 2026 · Artificial Intelligence

Unlock Faster, Cheaper Claude Code with Domestic LLMs: 3 Practical Solutions

Discover three practical ways to replace costly, slow Claude Code API calls with domestic large‑language models (DeepSeek, Alibaba Cloud Bailian, and third‑party relay services), offering lower latency, dramatically reduced fees, step‑by‑step configuration, performance benchmarks, and troubleshooting tips for developers.

AI Coding · Claude Code · Cost Optimization
0 likes · 8 min read
Old Meng AI Explorer
Apr 2, 2026 · Artificial Intelligence

Slash Your AI Coding Costs: Connect Codex with Chinese Large Models in 10 Minutes

This guide shows how the high OpenAI Codex fees can be replaced by domestic large language models—DeepSeek, GLM‑4.7, Qwen3.5 and others—through three practical integration methods, providing step‑by‑step commands, configuration files, performance benchmarks and cost‑saving calculations for individual developers and teams.

AI Coding · Codex integration · Cost Optimization
0 likes · 20 min read
AI Large-Model Wave and Transformation Guide
Apr 2, 2026 · Artificial Intelligence

What Claude Code’s Leaked Source Reveals About Building Production‑Grade AI Agents

An in‑depth analysis of the leaked Claude Code repository uncovers its massive scale, Bun runtime, React‑in‑terminal UI, a 1,729‑line async generator loop, multi‑layer context compression, eight‑layer security, extensive tool families, unreleased features, and engineering patterns that together form a blueprint for constructing robust, cost‑aware AI agents.

AI agents · Cost Optimization · Software Architecture
0 likes · 11 min read
AI Step-by-Step
Apr 1, 2026 · Artificial Intelligence

When to Use Which Model in an Agent: Beyond the “Strongest Model” Myth

The article explains why routing every request to the most powerful LLM hurts cost, speed, and throughput, and presents a three‑layer task decomposition that assigns execution‑level tasks to cheap small models, intermediate tasks to mid‑size models, and high‑risk judgment tasks to large models, with concrete examples and a minimal routing strategy.

Agent Design · Cost Optimization · LLM
0 likes · 8 min read
Lao Guo's Learning Space
Mar 30, 2026 · Artificial Intelligence

Building an AI Dream Team with OpenClaw: A Hands‑On Multi‑Agent Guide

The article explains why single‑agent LLMs struggle with complex tasks and demonstrates how OpenClaw's multi‑agent architecture—featuring persistent, sub‑ and ACP agents, isolated workspaces, and cost‑aware model selection—enables parallel role‑focused collaboration, scalability, and significant efficiency gains.

AI Collaboration · Agent Architecture · Cost Optimization
0 likes · 14 min read
Alibaba Cloud Observability
Mar 30, 2026 · Cloud Native

How a Global Enterprise Cut Log Analytics Costs by 87% with Alibaba Cloud SLS

A large multinational company migrated its multi‑cloud log pipeline from a fragmented AWS stack to Alibaba Cloud Log Service (SLS), achieving unified data processing, query, visualization and alerting while reducing total monthly cost by over 87% and gaining additional free storage and feature benefits.

AWS comparison · Cost Optimization · Log Analytics
0 likes · 21 min read
Architect's Ambition
Mar 25, 2026 · Artificial Intelligence

From Zero to Production: Building AI‑Native Infrastructure for Agents – Local Inference to Full‑Scale Deployment

The article walks through constructing AI‑native infrastructure for agents, covering local inference deployment with vLLM, setting up an AI gateway using LiteLLM, implementing observability with logs, metrics, and tracing, and applying cost‑saving strategies that reduced latency, improved stability, and cut expenses by up to 60%.

AI agents · Cost Optimization · Deployment
0 likes · 13 min read
DataFunSummit
Mar 20, 2026 · Artificial Intelligence

Why OpenClaw v2026.3.7 Is a Game‑Changer for Enterprise AI Agents

OpenClaw v2026.3.7 brings webhook compatibility fixes, private‑message typing feedback, a 33% token‑saving prompt‑cache, smarter model routing, seamless integration of domestic LLMs such as DeepSeek, Doubao and Qwen, and persistent bindings for Docker deployments, dramatically improving stability, cost efficiency and scalability for enterprise AI agents.

Cost Optimization · Feishu · OpenClaw
0 likes · 10 min read
AI Tech Publishing
Mar 20, 2026 · Artificial Intelligence

Why Agent Harnesses and Coding Aren’t the Real Competitive Edge

The article argues that while AI agents can now generate code cheaply, the true competitive advantage lies in lower cost and greater speed, and that elaborate harness engineering and coding optimizations offer little economic benefit compared to solid verification practices like testing, CI, and clear contracts.

AI agents · Cost Optimization · Harness Engineering
0 likes · 8 min read
IT Architects Alliance
Mar 18, 2026 · Cloud Native

Why Serverless Projects Fail in Production and How to Avoid the Pitfalls

The article analyzes common misconceptions and hidden costs of serverless adoption, outlines four critical steps from PoC to production, and presents five enterprise‑grade best practices—including scenario selection, framework usage, observability, security, and cost governance—to ensure reliable, cost‑effective serverless deployments.

Cloud Native · Cost Optimization · Serverless
0 likes · 9 min read
PMTalk Product Manager Community
Mar 17, 2026 · Industry Insights

When Large Models Are Standard, What KPIs Define an AI Product Manager’s Success?

The article examines how AI’s transition to a core infrastructure reshapes the AI product manager role, citing a 42% drop in job openings but a 35% salary rise for senior experts, and offers a decision‑matrix, three‑layer capability model, cost‑control tactics, and actionable methods for thriving in 2026.

AI product management · Cost Optimization · KPIs
0 likes · 11 min read
DataFunTalk
Mar 15, 2026 · Artificial Intelligence

How OpenClaw v2026.3.7 Boosts Enterprise AI Agent Efficiency and Cuts Costs

The OpenClaw v2026.3.7 upgrade introduces webhook compatibility fixes, typing‑feedback support, a 33% prompt‑caching cost reduction, smarter model routing with domestic model integration, and persistent bindings for container deployments, making the platform far more suitable for enterprise AI agent scenarios.

AI agents · Container Deployment · Cost Optimization
0 likes · 10 min read
DeepHub IMBA
Mar 14, 2026 · Artificial Intelligence

Three Proven Multi‑Agent Orchestration Patterns: Supervisor, Pipeline, and Swarm

The article explains why single LLM agents often fail due to context overload, role confusion, and fault propagation, then details three reliable orchestration patterns—Supervisor, Pipeline, and Swarm—along with concrete code examples, communication schemas, error‑handling layers, cost and latency considerations, and best‑practice recommendations for production deployment.

Cost Optimization · Distributed Tracing · LLM agents
0 likes · 15 min read
DevOps Coach
Mar 10, 2026 · Cloud Computing

Why Hybrid Cloud Is the Future: Balancing Agility, Cost, and Security

The article explains how hybrid cloud combines rapid, scalable cloud environments with stable on‑premises systems to cut costs, improve performance, meet compliance, and boost developer velocity, while orchestration platforms like Spacelift, Terraform, and Ansible make this multi‑environment management practical.

Cost Optimization · Infrastructure as Code · Orchestration
0 likes · 10 min read
DevOps Coach
Mar 8, 2026 · Databases

Boosting Performance 25× and Cutting Costs 80%: Our Switch from Redis to DragonflyDB

Facing high memory overhead, operational complexity, and scaling limits of a large Redis cluster, we migrated to DragonflyDB using a three‑stage shadow, dual‑write, and cut‑over process, achieving up to 25‑fold throughput increase, 80% cost reduction, and simpler maintenance while preserving compatibility with existing Redis clients.

Cost Optimization · DragonflyDB · database migration
0 likes · 7 min read
Machine Learning Algorithms & Natural Language Processing
Mar 4, 2026 · Artificial Intelligence

How to Build a 24‑Hour AI Agent Team with OpenClaw – A Real‑World Walkthrough

The author details a month‑long experiment creating a six‑agent AI team with OpenClaw that automates research, content creation, code review and email newsletters, saving 4‑5 hours each day for under $400 per month by using file‑based coordination, a two‑layer memory system, and a gradual rollout plan.

AI agents · Cost Optimization · Memory Management
0 likes · 14 min read
Frontend AI Walk
Mar 4, 2026 · Operations

Choosing Between MaxClaw and Self‑Hosted OpenClaw: A Primary‑Plus‑Secondary Strategy for Small Teams

The article proposes a hybrid solution for individual developers and small teams where MaxClaw handles everyday multi‑agent tasks while a self‑hosted OpenClaw instance is used for model experiments and high‑privilege operations, covering architecture, deployment steps, cost tactics, and security best practices.

Cost Optimization · Deployment · MaxClaw
0 likes · 12 min read
AI Code to Success
Mar 1, 2026 · Artificial Intelligence

How Prompt Caching Supercharges Long‑Running AI Agents: 5 Practical Lessons

This article explains how Claude Code’s Prompt Caching technique dramatically reduces latency and cost for long‑running AI agents, and shares five hard‑won engineering practices—including prompt layout, message‑based updates, avoiding mid‑conversation model or tool changes, and safe context forking—to help developers build efficient, cache‑friendly AI applications.

Cost Optimization · Prompt Caching · System Design
0 likes · 10 min read
AI Architecture Hub
Feb 26, 2026 · Artificial Intelligence

Mastering Anthropic’s Agent Teams: Practical Guide, Pitfalls & Cost Hacks

Anthropic’s experimental Agent Teams lets multiple Claude instances collaborate on complex tasks, but success hinges on clear role definitions, task splitting, communication protocols, and robust integration, with detailed guidance on engineering decisions, common pitfalls, cost management, reusable hooks, and step‑by‑step setup instructions.

Agent Teams · Claude · Cost Optimization
0 likes · 17 min read
ShiZhen AI
Feb 25, 2026 · Artificial Intelligence

Building an AI Agent Orchestrator for 50 Daily Commits at $190/month

Independent developer Elvis built an OpenClaw‑based AI agent orchestration system that lets a Zoe orchestrator manage Codex, Claude Code, and Gemini agents to write code, open PRs, and perform cross‑review, achieving about 50 commits per day for roughly $190 a month while highlighting cost, hardware bottlenecks, and failure‑handling strategies.

AI agents · Claude Code · Codex
0 likes · 13 min read
DevOps Coach
Feb 22, 2026 · Backend Development

Why Go Beats Java Spring Boot for SaaS: Cost, Deployment, and Concurrency Insights

After years of using Java Spring Boot, the author rewrote a SaaS microservice in Go, discovering a 60% AWS cost reduction, simpler deployment, and easier concurrency, while also noting scenarios where Java's rich ecosystem remains preferable, offering practical guidance on when to choose Go for SaaS.

Cost Optimization · Go · SaaS
0 likes · 8 min read
Architect
Feb 13, 2026 · Artificial Intelligence

Cutting Agent Costs: Practical Tips from the ‘Toward Efficient Agents’ Survey

The article analyzes why autonomous LLM agents become expensive, breaks down their cost components, and presents concrete engineering strategies—memory management, tool‑call optimization, and planning constraints—to dramatically reduce token usage and improve reliability while maintaining performance.

Cost Optimization · LLM agents · Planning
0 likes · 19 min read
AI Large Model Application Practice
Feb 10, 2026 · Artificial Intelligence

How OpenClaw Secures Production‑Grade AI Agents with Zero‑Trust Tool Policies

This article dissects OpenClaw’s engineering techniques for building robust, production‑level AI agents, covering zero‑trust tool policies for security, markdown‑based memory management, cost‑aware reasoning levels, and controlled sub‑agent collaboration to ensure safety, efficiency, and reliability.

AI agents · Cost Optimization · Memory Management
0 likes · 12 min read
Old Zhao – Management Systems Only
Feb 3, 2026 · Operations

Scientifically Set Procurement Frequency to Cut Costs and Avoid Stockouts

This guide shows manufacturing and trade managers how to scientifically determine procurement frequency by classifying materials, accounting for hidden ordering and holding costs, applying the Economic Order Quantity model, and adjusting for supply‑chain uncertainty, ultimately using a procurement system to automate and optimize the process.

Cost Optimization · EOQ · inventory management
0 likes · 9 min read
Programmer DD
Feb 3, 2026 · Artificial Intelligence

Build Reliable AI Agent Systems: Boost Accuracy 50% While Controlling Cost & Latency

This guide explains how to construct production‑ready AI agent systems by balancing cost, latency, and accuracy, offering a decision framework, concrete techniques such as planner‑executor architecture, chain‑of‑thought prompting, verification agents, parallel agents, and file‑system state management, plus real‑world examples and impact metrics.

AI agents · Cost Optimization · Latency
0 likes · 21 min read
Alibaba Cloud Infrastructure
Jan 26, 2026 · Cloud Native

How Kimi Scaled AI Agents with Alibaba Cloud’s Elastic Sandbox Architecture

Kimi built a high‑performance, low‑cost AI Agent infrastructure by combining Alibaba Cloud ACK node pools and the ACS Agent Sandbox, addressing challenges of instant sandbox response, state continuity, massive concurrency, cost efficiency, security isolation, and search‑memory integration for production‑grade agents.

AI Agent · Cloud Native · Cost Optimization
0 likes · 18 min read
dbaplus Community
Jan 11, 2026 · Databases

Why Using Only Postgres Can Replace Redis, RabbitMQ, and Elasticsearch

The article argues that a single PostgreSQL instance can handle caching, queuing, full‑text search, and real‑time notifications, eliminating the need for separate services like Redis, RabbitMQ, and Elasticsearch, while reducing cost and complexity.

Cost Optimization · Database Consolidation · Full‑Text Search
0 likes · 12 min read
Alibaba Cloud Developer
Jan 6, 2026 · Artificial Intelligence

How Tair‑KVCache‑HiSim Simulates LLM Inference 390 000× Faster with <5% Error

This article explains the design, challenges, and high‑fidelity architecture of Tair‑KVCache‑HiSim, a simulation tool that models multi‑level KV‑Cache behavior for large‑language‑model inference, predicts latency, throughput and cost under SLO constraints, and validates its predictions against real GPU deployments with sub‑5% error.

AI Infrastructure · Cost Optimization · KVCache
0 likes · 32 min read
Alibaba Cloud Big Data AI Platform
Dec 29, 2025 · Cloud Native

How a Visual Platform Cut Search Costs by 60% with All‑in‑Elasticsearch

This case study details how a major internet visual platform consolidated its log, keyword, and vector search workloads onto Alibaba Cloud Elasticsearch, eliminating three separate pipelines, reducing write costs by 60%, cutting storage expenses by over 60%, and achieving multi‑fold performance gains through serverless scaling, FalconSeek engine optimizations, and unified monitoring.

Cost Optimization · Elasticsearch · RAG
0 likes · 10 min read
dbaplus Community
Dec 22, 2025 · Cloud Computing

How We Cut Kubernetes Costs by 40% Without Switching Platforms

By rethinking resource requests, eliminating unused workloads, downsizing node types, fine‑tuning autoscaling, and trimming log storage, a team reduced their Kubernetes bill by 40% while keeping the same cloud provider, demonstrating that most cost overruns stem from misconfiguration rather than the platform itself.

Cost Optimization · Kubernetes · Prometheus
0 likes · 6 min read
DevOps Coach
Dec 2, 2025 · Cloud Computing

Why CloudFront Missed the Cache and How We Slashed S3 Costs by 80%

After months of puzzling over a $2,400 monthly S3 bill, we discovered a missing Cache‑Control header caused CloudFront to revalidate every request, and by adding the header we boosted cache hits from 12% to 94%, cutting costs to under $500.

AWS · CloudFront · Cost Optimization
0 likes · 5 min read
Baidu Tech Salon
Nov 26, 2025 · Big Data

How Baidu MEG Cut Data Costs: Inside a Big Data Governance Playbook

This article details Baidu's MEG data cost governance practice, covering background challenges, a unified governance framework, health‑score metrics, platform and engine capabilities, concrete compute and storage optimization techniques, achieved results, and future plans for continuous cost reduction.

Cost Optimization · Data Governance
0 likes · 23 min read
Data Party THU
Nov 25, 2025 · Artificial Intelligence

What $47,000 Taught Us About Deploying Multi‑Agent AI Systems

After spending $47,000 running four LangChain agents in production, we reveal the hidden costs of A2A communication and Anthropic’s MCP, expose seven common deployment pitfalls, and argue that dedicated AI infrastructure is essential for scalable multi‑agent systems.

A2A communication · AI Infrastructure · Cost Optimization
0 likes · 13 min read
Architect
Nov 6, 2025 · Operations

Why Most Teams Should Choose Loki Over ELK for Log Management – A Cost‑Effective Guide

This comprehensive guide compares ELK, EFK, and Loki log‑management solutions, analyzing their architecture, performance, cost, and use‑case suitability, and provides a decision framework, real‑world case studies, migration strategies, and optimization tips to help teams select the most efficient logging stack for their needs.

Cost Optimization · ELK · Log Management
0 likes · 36 min read
DataFunTalk
Nov 6, 2025 · Cloud Native

How Tencent Music Cut Kafka Costs by 50% with Cloud‑Native AutoMQ

Tencent Music migrated its massive Kafka streaming infrastructure to the cloud‑native AutoMQ platform, slashing operational costs by over half, achieving second‑level partition migration, and dramatically improving scaling efficiency while maintaining high‑throughput, low‑latency data processing for its music services.

AutoMQ · Cost Optimization · Data Streaming
0 likes · 16 min read
dbaplus Community
Nov 2, 2025 · Databases

How a Simple PgBouncer Switch Saved Us $10 Million in Cloud Costs

When a sudden 38% rise in AWS bills revealed hidden connection‑storm costs in a Kubernetes‑based microservice architecture, the team introduced PgBouncer as a transaction‑pooling proxy, slashing database connections from over 14,000 to under 400 and cutting monthly cloud spend by more than $300,000, ultimately saving $10.8 million over three years.

Connection Pooling · Cost Optimization · Kubernetes
0 likes · 9 min read
DataFunSummit
Oct 29, 2025 · Big Data

How Huolala Scaled to 40PB: Inside Their Evolving Big Data Storage Architecture

Huolala, founded in 2013, runs a massive cross‑cloud hybrid big‑data storage platform of over 40 PB across 3,000+ machines, evolving through four online‑storage phases, robust HA design, performance‑cost optimizations, AI vector storage, and a cost‑governance system that saved more than half of its storage expenses.

AI vector storage · Big Data · Cost Optimization
0 likes · 18 min read
Instant Consumer Technology Team
Oct 28, 2025 · Artificial Intelligence

Turning My AI Development Squad from Goldfish to Command Center: Lessons & Tools

The author recounts how an AI‑driven development pipeline initially seemed promising but quickly ran into costly context‑loss issues, leading to a redesign that introduces a single commanding agent and expert sub‑agents, dramatically reducing token costs and improving workflow efficiency.

AI automation · Agent Management · Cost Optimization
0 likes · 8 min read
Tech Minimalism
Oct 24, 2025 · Operations

Deploy n8n in One Click with Zeabur: Full Step‑by‑Step Guide

This guide shows how to quickly and affordably deploy the n8n automation platform on Zeabur, covering core concepts, one‑click template installation, region selection, domain configuration, version upgrades, cost‑control strategies, and troubleshooting tips for a reliable workflow service.

Cost Optimization · Zeabur · automation
0 likes · 10 min read
IT Architects Alliance
Oct 19, 2025 · Cloud Native

Mastering Cloud‑Native Autoscaling: HPA, VPA, CA, and Cost‑Aware Strategies

This article explores the challenges and best practices of cloud‑native scaling, covering Horizontal and Vertical Pod Autoscalers, Cluster Autoscaler cost optimization, event‑driven scaling with KEDA, traffic‑aware scaling in service meshes, and intelligent cost‑aware strategies backed by monitoring and future AI‑driven trends.

Cost Optimization · Kubernetes · Service Mesh
0 likes · 11 min read
MaGe Linux Operations
Oct 14, 2025 · Cloud Native

How Loki + S3 Cuts Log Storage Costs by Up to 90% at PB Scale

This article explains how the cloud‑native Loki logging system combined with S3 object storage can reduce PB‑level log storage expenses by 80‑90%, while simplifying operations, improving query performance, and meeting compliance requirements through detailed architecture, configuration, deployment, and real‑world case studies.

Cost Optimization · Log Management · Loki
0 likes · 23 min read
AI Info Trend
Oct 13, 2025 · Industry Insights

Why Software Spending Is a Money‑Burning Black Hole and How to Stop It

A new BCG report reveals that software expenses now consume over one‑fifth of enterprise IT budgets, driven by exploding SaaS options, M&A‑induced price pressure, and hidden infrastructure costs, and proposes a three‑pronged strategy—business measures, demand management, and technical optimization—to regain control and fund high‑value AI initiatives.

AI · Cost Optimization · SaaS
0 likes · 8 min read
Ops Community
Oct 8, 2025 · Cloud Native

How I Cut My Kubernetes Cloud Bill by 60% in 3 Months – Proven Strategies

Facing a 35‑million‑yuan monthly Kubernetes bill, the author analyzed hidden cost components, implemented five optimization campaigns—including resource request tuning, autoscaling, spot instances, storage tiering, and network consolidation—and reduced monthly expenses by 60% while boosting performance, delivering a detailed, reproducible methodology.

Cloud Native · Cost Optimization · FinOps
0 likes · 33 min read
DataFunTalk
Oct 7, 2025 · Big Data

How ByteHouse Tackles Data Warehouse Cost and Efficiency Challenges

This article examines the exploding data volumes that pressure modern enterprises, outlines the explicit and hidden cost challenges of data warehouses, and presents ByteHouse’s cloud‑native architecture and features as a solution for reducing expenses while boosting analytical performance.

ByteHouse · Cost Optimization · OLAP
0 likes · 6 min read
DataFunTalk
Sep 29, 2025 · Big Data

How ByteHouse Cuts Data Warehouse Costs While Boosting Performance

This article examines the exploding data volumes that pressure modern enterprises, outlines the explicit (hardware, performance) and implicit (operations, migration) cost challenges of OLAP data warehouses, and presents ByteHouse’s cloud‑native architecture and features as a solution for cost reduction and efficiency gains.

ByteHouse · Cloud Native · Cost Optimization
0 likes · 6 min read
Ops Community
Sep 26, 2025 · Cloud Native

Cut Your Kubernetes Cloud Bill by 50%: Proven Cost‑Optimization Tricks

This article reveals why Kubernetes can become a costly “money‑eater” and provides a step‑by‑step, data‑driven methodology—including resource profiling, Spot instance mixing, HPA/VPA pairing, smart scheduling, and FinOps practices—that can halve your cloud expenses within weeks.

Cloud Native · Cost Optimization · FinOps
0 likes · 14 min read
IT Architects Alliance
Sep 17, 2025 · Artificial Intelligence

How Distributed Scheduling Redefines AI Large-Model Training Architecture

The article examines how the explosive compute, storage, network, and fault‑tolerance demands of AI large‑model training force a fundamental redesign of system architecture, covering layered storage, optimized All‑Reduce communication, elastic resource orchestration, observability, and cost‑saving strategies.

AI Architecture · Compute Scheduling · Cost Optimization
0 likes · 9 min read
DevOps Coach
Sep 14, 2025 · Operations

How We Cut Redis Costs by 95% with AWS ElastiCache Serverless for Argo CD

This article details Kaltura's migration of Argo CD's built‑in Redis to AWS ElastiCache Serverless for Valkey, explaining the cost, performance, and operational challenges of the default setup, the step‑by‑step migration process, and the substantial savings and reliability gains achieved.

Argo CD · Cost Optimization · ElastiCache
0 likes · 11 min read
IT Architects Alliance
Sep 13, 2025 · Cloud Computing

Why Multi‑Cloud Strategies Are Replacing Single‑Vendor Cloud Choices

Enterprises are shifting from single‑vendor cloud adoption to multi‑cloud and hybrid architectures, driven by concerns over vendor lock‑in, compliance, cost, performance, and resilience, with Kubernetes providing a unified layer to orchestrate workloads across AWS, Azure, and Alibaba Cloud.

Cloud Native · Cost Optimization · Kubernetes
0 likes · 10 min read
Sohu Tech Products
Sep 10, 2025 · Artificial Intelligence

Can Kimi K2 Replace Claude’s Brain? A Deep Dive into AI‑Powered Code Agents

This article evaluates whether the domestically‑developed Kimi K2 model can serve as a cost‑effective alternative brain for Claude Code, detailing step‑by‑step integration, performance tests across task accuracy, advanced feature compatibility, memory retrieval, parallel development with Git Worktree, and hook automation, concluding with strengths, limitations, and overall success.

AI model evaluation · Claude Code · Cost Optimization
0 likes · 18 min read
StarRocks
Sep 9, 2025 · Big Data

From Hadoop to StarRocks: Revamping a Government Procurement Data Platform

Facing massive data volumes, complex component dependencies, high TCO, and real‑time processing limits, the Zhengcaiyun (政采云) platform replaced its Hadoop stack with StarRocks’ minimalist, decoupled architecture, achieving lower costs, elastic scaling, faster queries, easier operations, and robust fault tolerance across diverse government procurement workloads.

Cloud Native · Cost Optimization · Hadoop migration
0 likes · 16 min read
Refining Core Development Skills
Sep 3, 2025 · Operations

When Should You Hire a Dedicated Performance Engineering Team?

This article explains why modern enterprises increasingly need specialized performance engineering teams, outlines their ROI through cost savings, latency reduction, scalability, and engineering efficiency, details the engineers' responsibilities, and provides practical hiring guidelines and real‑world case studies.

Cost Optimization · Latency Reduction · Scalability
0 likes · 29 min read
Mingyi World Elasticsearch
Aug 30, 2025 · Operations

INFINI Console FAQ: Enterprise‑Grade Unified Elasticsearch Management

The article introduces INFINI Console, an open‑source, lightweight platform for unified, multi‑cluster and cross‑version Elasticsearch governance, compares it with Kibana, details deployment options, enterprise‑level features such as monitoring, alerting and security, and analyzes cost advantages and practical migration scenarios.

Cluster Management · Cost Optimization · Elasticsearch
0 likes · 13 min read
Qborfy AI
Aug 16, 2025 · Artificial Intelligence

Mastering LLM Tokens: How They Work, Cost, and Choose the Right Model

This article explains what tokens are in large language models, how they are counted and priced, compares tokenization methods across major models, and provides practical guidelines and code examples for optimizing token usage and selecting the appropriate model for different scenarios.

AI · Cost Optimization · LLM
0 likes · 8 min read
Alibaba Cloud Big Data AI Platform
Aug 4, 2025 · Big Data

36 Proven Strategies for Scalable and Efficient Big Data Operations

This article outlines the unique challenges of big‑data platform operations, emphasizing large‑scale infrastructure and a layered service architecture, and presents 36 practical strategies across stability, cost, and efficiency to help engineers build resilient, cost‑effective, and automated big‑data environments.

Cost Optimization · automation · platform management
0 likes · 10 min read
Cloud Native Technology Community
Jul 31, 2025 · Cloud Native

Cut Kubernetes Costs by 30%: Six Proven Automation Strategies

An analysis of recent Kubernetes cost benchmarks reveals chronic over‑provisioning, with up to 40% idle CPU and 57% idle memory, and offers six community‑validated, actionable automation techniques (flexible instance selection, Arm migration, custom autoscaling, bin‑packing, VPA, and safe Spot usage) to dramatically reduce cloud spend.

Cost Optimization · Kubernetes · autoscaling
0 likes · 8 min read
Alibaba Cloud Native
May 28, 2025 · Cloud Native

How Alibaba Cloud Function Compute Cuts MCP Server Idle Costs by Up to 87%

This article explains how Alibaba Cloud Function Compute (FC) provides a serverless, pay‑per‑use environment for MCP Server, introduces the session‑affinity challenges that cause resource idle time, and details the idle‑billing mechanism that can reduce overall MCP Server costs by as much as 87% while preserving performance.

Cost Optimization · Function Compute · Idle Billing
0 likes · 10 min read
Alibaba Cloud Observability
May 26, 2025 · Cloud Native

Why Observability 2.0 Is Redefining Cloud‑Native Monitoring and Data Pipelines

Observability 2.0 unifies logs, metrics and traces into a single platform, leveraging event‑centric wide‑key‑value models, AI‑driven anomaly detection, and cloud‑native elastic architectures to deliver faster system insight, lower downtime, scalable data pipelines, reduced costs, and improved developer experience across SLS services.

Cloud Native · Cost Optimization · performance
0 likes · 12 min read
Alibaba Cloud Native
May 20, 2025 · Cloud Native

How Observability 2.0 Redefines Cloud‑Native Log Pipelines and Cuts Costs by 66%

Observability 2.0 unifies logs, metrics and traces into a single platform, introduces event‑centric Wide Events, and drives a complete redesign of Alibaba Cloud's SLS data pipeline that delivers higher performance, lower latency, richer low‑code SPL processing, and up to a 66.7% reduction in processing costs.

Cost Optimization · SPL · data pipeline
0 likes · 12 min read
Alibaba Cloud Native
May 13, 2025 · Cloud Native

How Alibaba Cloud Serverless Boosted Efficiency and Cut Costs by 60%

This case study details how 海牙湾 (G‑Town) migrated its digital platform to Alibaba Cloud Serverless services, addressing resource waste, elasticity limits, observability gaps, and operational complexity, and achieved over 60% cost savings, 3‑fold deployment speed, and near‑perfect availability through SAE, API Gateway, MSE, and automated CI/CD pipelines.

Alibaba Cloud · Cost Optimization · Serverless
0 likes · 10 min read
Baidu Geek Talk
Apr 28, 2025 · Operations

How Baidu’s Log Platform Cuts Billions in Cost with Full‑Lifecycle Event Governance

This article details Baidu's log platform event‑tracking governance practice, explaining why uncontrolled event logging inflates storage and compute costs, and describing a three‑stage solution (manual, semi‑automatic platform, and full‑lifecycle standardization) that uses anomaly detection, automated workflows, and IM bots to achieve massive PV reduction and annual cost savings.

Cost Optimization · Log Management · Operations
0 likes · 20 min read
Cloud Native Technology Community
Apr 24, 2025 · Cloud Native

Simplifying Kubernetes Application Deployment with Korifi and Optimizing Cloud Costs using OpenCost

Korifi provides a Cloud Foundry‑style abstraction layer that simplifies Kubernetes application deployment and management, while OpenCost offers comprehensive, multi‑cloud cost visibility and optimization; together they enable developers to deploy apps easily, monitor pod‑level expenses, and make data‑driven resource allocation decisions.

Cost Optimization · DevOps · Korifi
0 likes · 7 min read
AI Frontier Lectures
Apr 12, 2025 · Artificial Intelligence

How ByteDance Scales Attn/MoE: Cost Models, Mesh Communication, and Network Hacks

The article analyzes ByteDance's MegaScale‑Infer paper, detailing micro‑batching, M:N Attn‑MoE ratios, cost‑driven constraint search, communication redesign with Mesh All‑2‑All, network latency challenges, and innovative NIC and routing solutions for large‑scale mixture‑of‑experts inference.

AI inference · ByteDance · Cost Optimization
0 likes · 7 min read
iQIYI Technical Product Team
Mar 27, 2025 · Big Data

Cost‑Effective Real‑Time Data Warehouse 2.0: Migrating from Kafka to Iceberg

iQIYI transformed its real‑time data warehouse by replacing a costly Kafka‑based Lambda stack with a unified stream‑batch Iceberg lake, cutting storage expenses by 90%, halving compute costs, extending data retention, and delivering minute‑level freshness for 90% of use cases while preserving second‑level processing where needed.

Cost Optimization · Flink · Iceberg
0 likes · 11 min read
21CTO
Mar 25, 2025 · Databases

How Google Spanner’s New Hierarchical Storage Cuts Costs with HDD Tiering

Google Cloud Spanner now offers a policy‑driven hierarchical storage tier that moves infrequently accessed data to cheap HDDs, reducing costs by up to 80% while keeping SQL queries seamless across SSD and HDD layers and preserving backup consistency.

Cost Optimization · Distributed SQL · Google Cloud
0 likes · 5 min read
DataFunSummit
Feb 27, 2025 · Big Data

Case Study: Migrating Spark Thinking Education's Big Data Architecture from EMR to Serverless

This article details Spark Thinking Education's comprehensive migration from EMR to a serverless big‑data architecture, outlining the challenges of elasticity, cost accounting, and resource contention, the step‑by‑step implementation of serverless compute, storage, and integration services, and the resulting performance, cost, and stability gains.

Big Data · Cost Optimization · Serverless
0 likes · 41 min read
dbaplus Community
Feb 25, 2025 · Cloud Native

Why We Dropped Kubernetes and Boosted DevOps Happiness by 89%

A DevOps team managing 47 Kubernetes clusters across three clouds faced burnout, high costs, and operational chaos, so they gradually replaced Kubernetes with simpler AWS services, cutting infrastructure spend by 58%, speeding deployments by 89%, and dramatically improving team morale and reliability.

Cloud Native · Cost Optimization · DevOps
0 likes · 9 min read
Alibaba Cloud Big Data AI Platform
Feb 19, 2025 · Databases

How Alibaba Cloud Elasticsearch Serverless Cuts Log Costs by Over 70%

This article compares Alibaba Cloud Elasticsearch Serverless with self‑built Elasticsearch clusters for log processing, highlighting cost reductions over 70%, improved performance stability, automatic scaling, and step‑by‑step guidance to activate and configure the serverless service for real‑world workloads.

Cost Optimization · Elasticsearch · Log Analytics
0 likes · 8 min read
Bilibili Tech
Jan 24, 2025 · Operations

Design and Implementation of a CDN Edge‑Node Scheduling System for Bilibili Live Streaming

The paper presents Bilibili’s multi‑layer CDN edge‑node scheduling system, which groups heterogeneous nodes by quality and price, uses cost‑aware and resource‑aware heuristics (including maximum‑flow regional borrowing and contextual‑bandit utilization prediction) to allocate bandwidth per business, achieving a 43% bandwidth reuse increase, 33% coverage boost, and markedly lower stall rates.

Bilibili · CDN · Cost Optimization
0 likes · 10 min read
IT Architects Alliance
Jan 19, 2025 · Cloud Native

How Cloud‑Native Architecture Slashes Costs and Supercharges Enterprise Efficiency

The article examines how adopting a cloud‑native architecture—through precise resource monitoring, automation pipelines, pay‑as‑you‑go scaling, hybrid‑cloud strategies, and container‑based microservices—enables companies to dramatically reduce operational expenses, improve resource utilization, and accelerate innovation in competitive markets.

Cloud Native · Containers · Cost Optimization
0 likes · 9 min read
IT Architects Alliance
Jan 16, 2025 · Cloud Native

How Microservices and Serverless Combine to Transform Modern Applications

Microservices break monoliths into focused services, while serverless offloads infrastructure management to cloud providers; together they boost agility, scalability, cost efficiency, and security, as illustrated by real-world cases from ride‑hailing and e‑commerce, and the article outlines adoption challenges and future opportunities.

Cloud Native · Cost Optimization · Microservices
0 likes · 9 min read
DataFunSummit
Jan 16, 2025 · Big Data

Zhihu Big Data Cost‑Reduction Practices: FinOps, Erasure Coding, ZSTD Compression, Spark Auto‑Tuning, and Remote Shuffle Service

This article details Zhihu's comprehensive cost‑reduction and efficiency‑boosting initiatives for its big‑data platform, covering FinOps‑driven financial operations, hybrid‑cloud architecture, cost allocation models, operational monitoring, and technical optimizations such as erasure coding, ZSTD compression, Spark auto‑tuning, and a remote shuffle service.

Big Data · Cloud Cost Management · Cost Optimization
0 likes · 22 min read
Efficient Ops
Jan 12, 2025 · Operations

Mastering FinOps: The Enterprise IT Resource Operations Maturity Model Explained

This article introduces the FinOps maturity model for enterprise IT resource operations, outlines its six capability domains and five maturity levels, and showcases real‑world assessment cases from leading firms, highlighting how precise cost control and lifecycle management can drive efficiency and high‑quality growth.

Cost Optimization · FinOps · IT resource management
0 likes · 9 min read
AntData
Dec 17, 2024 · Databases

Designing Database Services for Modern Online Business: Scalability, Agility, Security, and Cost Optimization

The article examines how database services must evolve to meet the high‑availability, real‑time response, horizontal scalability, application agility, security compliance, and cost‑optimization demands of modern online businesses, using Ant Group’s multi‑generation architecture and technologies such as distributed middleware, HTAP/HSAP, and polyglot persistence as examples.

Application Agility · Cost Optimization · Database Services
0 likes · 22 min read
Alibaba Cloud Infrastructure
Dec 11, 2024 · Cloud Native

Cut CI/CD Costs with Alibaba Cloud Serverless Containers: A Moka Case Study

This article details how Moka migrated its CI/CD pipeline to Alibaba Cloud's serverless container service (ACS) on an ACK cluster, addressing high concurrency pain points, implementing step‑by‑step runner configuration, and achieving significant cost savings, higher resource utilization, and faster job execution.

Alibaba Cloud · Container · Cost Optimization
0 likes · 9 min read
DataFunSummit
Nov 26, 2024 · Information Security

AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation

This article explains how large‑model AI can be integrated into security operations (AISECOPS) to simplify application integration, improve fault detection, and automate protection across complex north‑south and east‑west network layers, while addressing challenges such as data quality, cost control, model selection, and safety frameworks.

AISECOPS · Cost Optimization · Embedding
0 likes · 22 min read
BirdNest Tech Talk
Nov 17, 2024 · Cloud Computing

How to Host 4K Video on Cloudflare R2 for Under $2 a Month

This article walks through the full process of delivering high‑quality 4K video at minimal cost by comparing storage providers, choosing Cloudflare R2 for its free outbound bandwidth, encoding with FFmpeg, packaging with HLS, uploading via rclone, and playing back with HLS.js or a custom web component.

Cost Optimization · Video Streaming · adaptive bitrate
0 likes · 18 min read
Xiaohongshu Tech REDtech
Oct 23, 2024 · Backend Development

Pulsar vs RocketMQ: Architecture, Cost Benefits, and Migration Strategy for Xiaohongshu Online Messaging

Xiaohongshu replaced its RocketMQ‑based online messaging platform with Apache Pulsar, achieving up to 48% total cost reduction, 43% higher CPU utilization, 30% resource savings, and a latency drop from 20.2 ms to 5.7 ms through cloud‑native, elastic scaling and a phased migration strategy.

Apache Pulsar · Cost Optimization · Message Queue
0 likes · 12 min read