Tagged articles

1919 articles

Page 2 of 20

Jan 6, 2026 · Backend Development

How ChatGPT Analyzes and Optimizes Application Performance

The article shows how ChatGPT examines JMeter load‑test results, identifies CPU, database and I/O bottlenecks, and then proposes concrete optimizations such as adding indexes, tuning HikariCP pool settings, caching, code refactoring, and monitoring to improve login throughput and latency.

ChatGPTHikariCPJMeter

0 likes · 11 min read

How ChatGPT Analyzes and Optimizes Application Performance

Baidu Intelligent Cloud Tech Hub

Jan 6, 2026 · Operations

How vLLM‑Kunlun Plugin Enabled Two‑Day Adaptation of MiMo Flash V2 on Kunlun P800 XPU

In just two days, Baidu Baige and Kunlun's engineers extended the vLLM‑Kunlun Plugin to overcome asymmetric KV dimensions and integrate SWA+Sink attention, achieving lossless, high‑performance inference of the MiMo Flash V2 model on the Kunlun P800 XPU.

Kunlun P800MiMo Flash V2Model Adaptation

0 likes · 8 min read

How vLLM‑Kunlun Plugin Enabled Two‑Day Adaptation of MiMo Flash V2 on Kunlun P800 XPU

java1234

Jan 6, 2026 · Backend Development

Boost API Latency 10× with Spring Boot 3 and a Three‑Level Local Cache Pyramid

The article explains why adding Redis alone often remains slow, introduces a three‑level cache pyramid (L1 Caffeine, L2 Redis, L3 MySQL) built with Spring Boot 3, and shows how this design reduces request latency from 28 ms to 2 ms, cuts CPU usage by 35 % and achieves up to 14‑fold throughput improvement.

CacheCaffeineJava

0 likes · 10 min read

Boost API Latency 10× with Spring Boot 3 and a Three‑Level Local Cache Pyramid

Tech Freedom Circle

Jan 5, 2026 · Artificial Intelligence

A Three‑Step Guide to Mastering RAG Semantic‑Loss Interview Questions

RAG (Retrieval‑Augmented Generation) is a hot interview topic, and many candidates stumble on semantic‑loss issues; this article dissects a real JD interview case, identifies three core shortcomings, and presents a three‑step technical solution—structure restoration, semantic splitting, and hybrid retrieval—plus a ready‑to‑use answer template.

AI InterviewDocument ParsingHybrid Search

0 likes · 25 min read

A Three‑Step Guide to Mastering RAG Semantic‑Loss Interview Questions

php Courses

Jan 5, 2026 · Artificial Intelligence

How AI is Transforming PHP Development: From Smart Coding to Automated Testing

This article explores how AI technologies are reshaping the entire PHP development lifecycle—offering context‑aware code generation, intelligent architecture suggestions, automated test creation, performance optimisation, and end‑to‑end workflow integration—while highlighting real‑world case studies and practical adoption strategies.

AIAutomated TestingDevOps

0 likes · 9 min read

How AI is Transforming PHP Development: From Smart Coding to Automated Testing

Baidu Intelligent Cloud Tech Hub

Jan 5, 2026 · Artificial Intelligence

How Baidu Tianchi Supernodes Supercharge Large‑Model Inference: Architecture, Deployment, and Optimization

This article details Baidu's Tianchi supernode design and software tuning—covering hardware scale‑up, deployment planning, Prefill and Decode stage optimizations, quantization strategies, and communication schemes—to dramatically boost large‑model inference throughput and latency while lowering token‑cost.

AI InfrastructureParallelismPerformance Optimization

0 likes · 20 min read

How Baidu Tianchi Supernodes Supercharge Large‑Model Inference: Architecture, Deployment, and Optimization

JavaScript

Dec 31, 2025 · Frontend Development

Unlock Hidden Performance Gains in Front‑End Apps with Web Workers

Web Workers let JavaScript run heavy computations in background threads, freeing the main UI thread, boosting multi‑core utilization, and improving memory management, which together dramatically enhance front‑end performance for tasks like data processing, image filtering, media handling, and AI inference.

JavaScriptPerformance OptimizationWeb Workers

0 likes · 5 min read

Unlock Hidden Performance Gains in Front‑End Apps with Web Workers

Alibaba Cloud Big Data AI Platform

Dec 30, 2025 · Big Data

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

StarRocks and Apache Paimon have been progressively integrated across multiple releases, enabling a unified lakehouse architecture that supports multi-source federated analysis, time-travel queries, native readers/writers, distributed planning, and advanced profiling, while delivering performance gains that bring Paimon query speed on par with native StarRocks tables.

Apache PaimonData IntegrationLakehouse

0 likes · 9 min read

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

Alibaba Cloud Developer

Dec 29, 2025 · Artificial Intelligence

How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management

This article details the architecture and implementation of Tair KVCache Manager, an enterprise‑grade service that centralises KVCache metadata, decouples inference engines from storage, provides elastic scaling, multi‑tenant isolation, high availability, and performance‑optimised cache management for large‑scale LLM inference workloads.

Cache ManagementKVCacheLLM

0 likes · 28 min read

How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management

PMTalk Product Manager Community

Dec 29, 2025 · Artificial Intelligence

Essential GPU Selection Tips for AI Model Training (Why Nvidia Dominates)

This guide explains how product managers can choose the right GPU and complementary hardware for AI model training, covering GPU memory, cores, architecture, budget, CPU role, RAM, storage, cooling, and other factors, with real‑world examples and practical trade‑offs.

AI hardwareDeep LearningGPU selection

0 likes · 9 min read

Essential GPU Selection Tips for AI Model Training (Why Nvidia Dominates)

DeWu Technology

Dec 25, 2025 · Frontend Development

Build a High‑Performance H5 PAG Player: SDK, Image Editing, Batch Synthesis

This guide details how to implement a full‑stack H5 PAG player for the “Use Basketball to Know Me” activity, covering SDK loading, canvas‑based image manipulation (drag, scale, rotate), dynamic layer and text replacement, real‑time preview synchronization, snapshot export, batch synthesis, performance tuning, and fallback strategies.

Batch ProcessingCanvasPAG

0 likes · 30 min read

Build a High‑Performance H5 PAG Player: SDK, Image Editing, Batch Synthesis

Ray's Galactic Tech

Dec 24, 2025 · Backend Development

Master Production-Ready Nginx Server Configurations: Templates & Best Practices

This guide explains why most Nginx server setups fail, presents a core server template, walks through five real‑world scenarios with complete configuration snippets, and adds performance, security, and monitoring tips so you can copy a production‑grade Nginx configuration instantly.

Performance OptimizationSecurityServer Configuration

0 likes · 8 min read

Master Production-Ready Nginx Server Configurations: Templates & Best Practices

JakartaEE China Community

Dec 23, 2025 · Backend Development

Performance Optimization Best Practice #3: Implementing Caching in Jakarta EE

This article explains the different cache layers—client, application, database, and distributed—provides best‑practice guidelines, and shows how to configure client‑side headers, application‑side caches with Hazelcast, JPA second‑level caching, and JMS‑based cache coordination in GlassFish.

GlassFishHazelcastPerformance Optimization

0 likes · 9 min read

Performance Optimization Best Practice #3: Implementing Caching in Jakarta EE

Architect's Guide

Dec 23, 2025 · Databases

How to Implement Conditional Queries and Pagination in Redis Using ZSet and Hash

This guide explains how to achieve pagination and multi‑condition fuzzy queries in Redis by leveraging ZSet for ordered paging and Hash with HSCAN for pattern matching, detailing command usage, combination strategies, and performance optimizations such as temporary set expiration and data freshness handling.

CacheFuzzy QueryHash

0 likes · 10 min read

How to Implement Conditional Queries and Pagination in Redis Using ZSet and Hash

Ray's Galactic Tech

Dec 22, 2025 · Databases

Mastering Redis: Choosing the Right Data Structure for High‑Performance Systems

Redis offers five core data types—String, Hash, List, Set, and ZSet—each acting as a high‑performance concurrent data structure that determines system throughput, latency, and stability; this guide explains their characteristics, optimal use‑cases, anti‑patterns, and practical code examples for robust architecture design.

Backend DevelopmentPerformance Optimizationcaching

0 likes · 6 min read

Mastering Redis: Choosing the Right Data Structure for High‑Performance Systems

Alibaba Cloud Infrastructure

Dec 22, 2025 · Artificial Intelligence

Boost LLM Inference with KV‑Cache‑Aware Routing on Alibaba Cloud ACK GIE

This article explains why KV‑Cache hit rate is critical for large‑model inference, describes vLLM's automatic prefix caching, outlines the distributed cache challenges, and provides a step‑by‑step guide to deploying Alibaba Cloud ACK Gateway with Inference Extension's precise‑mode prefix‑cache‑aware routing, backed by benchmark results.

Alibaba CloudInferenceKV cache

0 likes · 18 min read

Boost LLM Inference with KV‑Cache‑Aware Routing on Alibaba Cloud ACK GIE

Linux Code Review Hub

Dec 21, 2025 · Fundamentals

Achieving Up to 3× Speedup for dma_map_sg, dma_unmap_sg and DMA Sync on Arm64

The article explains how batching cache‑sync operations for dma_map_sg, dma_unmap_sg and dma_sync_sg on arm64 can cut their execution time by up to three‑fold, details the kernel patches introduced, and presents benchmark results on Dimensity 9500 and RK3588 platforms.

ARM64DMALinux kernel

0 likes · 5 min read

Achieving Up to 3× Speedup for dma_map_sg, dma_unmap_sg and DMA Sync on Arm64

IT Services Circle

Dec 20, 2025 · Operations

How Windows Server 2025’s Native NVMe Boosts IOPS by Up to 80% and Cuts CPU Overhead

Microsoft’s Windows Server 2025 introduces native NVMe support that can increase IOPS by up to 80%, reduce latency, and save roughly 45% of CPU cycles on 4K random reads, with a manual registry tweak to enable the feature.

DirectStorageIOPSNVMe

0 likes · 5 min read

How Windows Server 2025’s Native NVMe Boosts IOPS by Up to 80% and Cuts CPU Overhead

Code Ape Tech Column

Dec 19, 2025 · Backend Development

Boost SpringBoot Log Management: Step‑by‑Step Integration with Hera

This article explains why traditional SpringBoot logging falls short, introduces the Hera log platform’s three core benefits, outlines a layered integration architecture, and provides a detailed five‑step guide—including Maven dependencies, YAML configuration, custom field providers, log output, traceability, and console usage—plus performance, high‑availability, security tips and common pitfalls.

Distributed SystemsHeraLog Management

0 likes · 14 min read

Boost SpringBoot Log Management: Step‑by‑Step Integration with Hera

Sohu Tech Products

Dec 17, 2025 · Artificial Intelligence

How We Cut Vision Transformer Inference Latency from 53 ms to 8 ms

Facing 53.64 ms per‑image latency in a Flask‑served Vision Transformer classifier, we iteratively optimized the pipeline—switching to ONNX Runtime, leveraging TensorRT, replacing Pillow with OpenCV, eliminating URL downloads, and finally batching requests—reducing average server‑side processing to 8.34 ms, a 6.4× speedup.

BatchingFlaskONNX

0 likes · 28 min read

How We Cut Vision Transformer Inference Latency from 53 ms to 8 ms

Baidu Geek Talk

Dec 17, 2025 · Artificial Intelligence

Accelerate LLM Deployment on Baidu Kunlun XPU with the Open‑Source vLLM‑Kunlun Plugin

The vLLM‑Kunlun Plugin, jointly released by Baidu Baige and Kunlun Chip, provides a high‑performance, zero‑intrusion solution for deploying open‑source large language models on domestic Kunlun XPU hardware, includes fused operators, precision‑validation and profiling tools, and supports over twenty mainstream and multimodal models.

Kunlun XPUModel DeploymentPerformance Optimization

0 likes · 7 min read

Accelerate LLM Deployment on Baidu Kunlun XPU with the Open‑Source vLLM‑Kunlun Plugin

Alibaba Cloud Developer

Dec 17, 2025 · Cloud Native

How 3FS Powers High‑Performance KVCache for AI Inference: Architecture, Optimizations, and Cloud‑Native Deployment

This article details the design and engineering of the 3FS distributed file system as a scalable KVCache backend for large‑language‑model inference, covering its architecture, performance tuning, reliability fixes, integration with SGLang/vLLM, and cloud‑native Kubernetes operator deployment.

3FSAI inferenceCloud Native

0 likes · 30 min read

How 3FS Powers High‑Performance KVCache for AI Inference: Architecture, Optimizations, and Cloud‑Native Deployment

Ray's Galactic Tech

Dec 15, 2025 · Databases

Mastering Database Design: From Core Principles to Modern Distributed Practices

This comprehensive guide walks you through fundamental database design goals, a step‑by‑step lifecycle, nine essential strategies—including normalization, indexing, and security—plus modern distributed and NoSQL considerations, performance tuning, high‑availability tactics, and practical tools for robust data governance.

Data GovernanceDatabase designNoSQL

0 likes · 11 min read

Mastering Database Design: From Core Principles to Modern Distributed Practices

Java Architect Handbook

Dec 12, 2025 · Backend Development

Master Spring Task: Build, Optimize, and Scale Your Scheduled Jobs

This guide walks you through what Spring Task is, how to enable it in three simple steps, decode cron expressions, explore six real‑world use cases, unlock four advanced features, avoid common pitfalls, and apply performance‑tuning and future‑proofing techniques for robust Java scheduling.

Distributed SchedulingJavaPerformance Optimization

0 likes · 11 min read

Master Spring Task: Build, Optimize, and Scale Your Scheduled Jobs

Raymond Ops

Dec 11, 2025 · Operations

Master Container Networking: From Basics to Advanced Kubernetes Practices

This comprehensive guide explores container networking fundamentals, Docker network modes, Kubernetes CNI plugins, network security policies, monitoring, troubleshooting, and performance optimization, providing practical commands and configuration examples for operations engineers.

CNIDockerKubernetes

0 likes · 20 min read

Master Container Networking: From Basics to Advanced Kubernetes Practices

Su San Talks Tech

Dec 11, 2025 · Databases

How to Scale a 900 M‑Row Message Table to 60 B Rows Without Downtime

This article details a comprehensive sharding strategy for a high‑traffic message table, explaining why early partitioning is essential, how to design language‑based vertical shards combined with hash‑based horizontal shards, calculate table counts and virtual nodes, and execute a zero‑downtime migration using dual‑writes and gradual traffic rollout.

Performance Optimizationhorizontal partitioningmysql

0 likes · 16 min read

How to Scale a 900 M‑Row Message Table to 60 B Rows Without Downtime

Alibaba Cloud Observability

Dec 9, 2025 · Cloud Native

Unlocking System Insights with Graph Queries in Cloud‑Native Observability

This article explains how integrating graph‑based data models into cloud‑native observability platforms transforms isolated metric monitoring into a relational view, enabling powerful queries such as graph‑match and Cypher to perform fault impact analysis, root‑cause tracing, and security audits across services, pods, and infrastructure.

CypherObservabilityPerformance Optimization

0 likes · 29 min read

Unlocking System Insights with Graph Queries in Cloud‑Native Observability

Linux Kernel Journey

Dec 7, 2025 · Fundamentals

CUDA Optimization Basics: Understanding GPU Architecture and Warp Scheduling

This article explains the fundamentals of CUDA performance tuning, covering GPU architectures from Kepler to Volta, the role of SMX, warp schedulers, registers and memory hierarchies, and provides practical guidance on launch configuration, latency hiding, and thread‑block sizing to maximize throughput.

CUDAGPU architecturePerformance Optimization

0 likes · 21 min read

CUDA Optimization Basics: Understanding GPU Architecture and Warp Scheduling

Deepin Linux

Dec 7, 2025 · Fundamentals

How CPU Context Switching Powers Multitasking: From Theory to Code

This article explains the fundamentals of CPU context switching, covering the hardware and kernel mechanisms, step‑by‑step saving and restoring of registers, scheduling algorithms, different types of switches, performance costs, optimization techniques, and includes a complete C++ simulation example.

CPUKernelMultitasking

0 likes · 42 min read

How CPU Context Switching Powers Multitasking: From Theory to Code

Alibaba Cloud Big Data AI Platform

Dec 5, 2025 · Big Data

How EMR Serverless Spark Cut Batch Processing Time by Over 50% for a 600M‑User Platform

This case study details how Qimao leveraged Alibaba Cloud EMR Serverless Spark with Fusion and Celeborn to overcome multi‑business‑line data‑processing challenges, achieving more than 50% faster batch jobs, significant cost reductions, and improved operational flexibility across its 600 million‑user ecosystem.

Data WarehousePerformance OptimizationServerless Spark

0 likes · 9 min read

How EMR Serverless Spark Cut Batch Processing Time by Over 50% for a 600M‑User Platform

Selected Java Interview Questions

Dec 5, 2025 · Databases

Unlock MySQL 8.0’s Hidden Optimization Tricks to Supercharge Your Queries

Discover eight powerful, lesser‑known MySQL 8.0 features—including window functions, descending indexes, generated columns, invisible indexes, hints, resource groups, LATERAL JOIN, and multi‑valued JSON indexes—that can dramatically improve query performance, simplify code, and give you an edge in interviews and production environments.

Generated ColumnsJSONLATERAL JOIN

0 likes · 12 min read

Unlock MySQL 8.0’s Hidden Optimization Tricks to Supercharge Your Queries

Data STUDIO

Dec 5, 2025 · Big Data

Why Parquet Is the Default Choice for Big Data Storage

The article explains how Apache Parquet’s columnar layout, multi‑level row‑group structure, projection and predicate push‑down, and advanced compression and encoding make it the high‑performance, space‑efficient storage format that powers modern big‑data ecosystems and tools like Spark, Python pandas, and ClickHouse.

Big DataClickHouseColumnar Storage

0 likes · 11 min read

Why Parquet Is the Default Choice for Big Data Storage

vivo Internet Technology

Dec 3, 2025 · Mobile Development

How Parallel Loading Supercharges H5 Instant SDK: Challenges, Early Designs, and Optimized Solutions

This article explains the parallel loading technique used in the H5 instant SDK to accelerate page startup, details the three resource‑handoff scenarios that cause synchronization challenges, reviews the early simple design and its drawbacks, and presents an optimized producer‑consumer model with fair locks and bridge streams to eliminate wasteful waiting and memory consumption.

H5 instant SDKJava concurrencyParallel Loading

0 likes · 31 min read

How Parallel Loading Supercharges H5 Instant SDK: Challenges, Early Designs, and Optimized Solutions

Full-Stack Cultivation Path

Dec 2, 2025 · Frontend Development

Why React Introduced a Compiler to Automate Performance Optimization

React Compiler 1.0, released in 2023, automatically eliminates the need for manual useMemo, useCallback, and memo by statically analyzing code to cache computations, generate update conditions, and replace these hooks, offering a safer, more comprehensive performance boost across Vite, Webpack, Next.js, and React Native projects.

Next.jsPerformance OptimizationReact

0 likes · 7 min read

Why React Introduced a Compiler to Automate Performance Optimization

Raymond Ops

Nov 28, 2025 · Databases

Essential DBA Guide to Enterprise MySQL Architecture, Optimization & Ops

This comprehensive guide equips DBAs with enterprise‑level MySQL strategies, covering master‑slave replication, InnoDB cluster setup, performance tuning parameters, index design, backup and recovery methods, monitoring scripts, security hardening, and emergency response procedures to ensure a stable, high‑performance database environment.

Database AdministrationPerformance Optimizationbackup and recovery

0 likes · 15 min read

Essential DBA Guide to Enterprise MySQL Architecture, Optimization & Ops

Big Data Technology & Architecture

Nov 28, 2025 · Big Data

What’s New in Apache Paimon 2025? Core Performance, AI Integration & Real‑Time Lakehouse Updates

The 2025 Apache Paimon release brings major performance boosts, AI‑centric multimodal storage, deeper streaming‑batch integration, and broader engine compatibility, detailing query and write optimizations, memory management tweaks, and a unified lake format for structured and unstructured data.

AI integrationApache PaimonBig Data

0 likes · 6 min read

What’s New in Apache Paimon 2025? Core Performance, AI Integration & Real‑Time Lakehouse Updates

Ctrip Technology

Nov 27, 2025 · Big Data

How Ctrip Cut Query Latency by 85% with StarRocks’ Compute‑Storage Separation

Ctrip migrated its massive User Behavior Tracking system from ClickHouse to a compute‑storage separated StarRocks cluster on Kubernetes, achieving millisecond‑level query latency, halving storage usage, reducing node count, and sustaining millions‑of‑rows‑per‑second write throughput while simplifying scaling and operations.

Big DataClickHouseCompute-Storage Separation

0 likes · 15 min read

How Ctrip Cut Query Latency by 85% with StarRocks’ Compute‑Storage Separation

Baidu Intelligent Cloud Tech Hub

Nov 25, 2025 · Artificial Intelligence

Why DeepSeek‑V3.2‑Exp Lost Performance and How a Simple RoPE Fix Restored It

The Baidu Baige team discovered that DeepSeek‑V3.2‑Exp’s long‑context performance lagged behind the official report, traced the issue to a subtle RoPE layout mismatch in the open‑source inference demo, collaborated with DeepSeek to fix it, and verified that the model’s speed and accuracy fully recovered across multiple benchmarks.

AI InfrastructureDeepSeekLLM inference

0 likes · 9 min read

Why DeepSeek‑V3.2‑Exp Lost Performance and How a Simple RoPE Fix Restored It

Tencent TDS Service

Nov 25, 2025 · Mobile Development

What HarmonyOS App Quality Reveals: Crash & Freeze Insights for 2025

The 2025 Bugly HarmonyOS Quality Report analyzes system version adoption, device distribution, crash and freeze rates across industries, identifies top stability issues such as memory leaks and main‑thread blocking, and offers concrete optimization recommendations for developers seeking better app performance.

App StabilityBuglyFreeze Analysis

0 likes · 10 min read

What HarmonyOS App Quality Reveals: Crash & Freeze Insights for 2025

Big Data Tech Team

Nov 24, 2025 · Big Data

Avoid the 5 Common DWS Design Traps and Build Scalable Data Warehouses

This article analyzes the five typical pitfalls when designing DWS aggregation tables—from chimney‑style schemas to performance blind spots—explains their consequences, and provides concrete, production‑ready recommendations, code examples, and design principles to create reusable, efficient data‑warehouse layers.

DWS DesignData WarehousePerformance Optimization

0 likes · 10 min read

Avoid the 5 Common DWS Design Traps and Build Scalable Data Warehouses

Wu Shixiong's Large Model Academy

Nov 22, 2025 · Artificial Intelligence

Why Your RAG System Slows Down Over Time and How to Fix It

The article explains why a production Retrieval‑Augmented Generation (RAG) system becomes slower as it runs—due to growing embedding costs, expanding vector databases, heavier re‑ranking, and larger prompts—and provides concrete engineering optimizations such as batching, async concurrency, caching, partitioned retrieval, HNSW tuning, replica scaling, answer caching, and prompt sparsification to keep performance stable.

AI EngineeringPerformance OptimizationRAG

0 likes · 10 min read

Why Your RAG System Slows Down Over Time and How to Fix It

Baidu Intelligent Cloud Tech Hub

Nov 19, 2025 · Artificial Intelligence

Boost LLM Inference Speed with Token‑Level Two‑Chunk Overlap

Token‑level Two‑Chunk Overlap replaces traditional batch‑level Two‑Batch Overlap, dynamically splitting sequences into balanced token chunks, enabling near‑equal compute and communication times, improving GPU utilization and achieving up to 30% throughput gains in heterogeneous request workloads, with zero accuracy loss.

Batch schedulingGPU utilizationLLM inference

0 likes · 9 min read

Boost LLM Inference Speed with Token‑Level Two‑Chunk Overlap

php Courses

Nov 18, 2025 · Backend Development

Can PHP Swap Variables Like Reusable Resources? Exploring Performance Gains

The article examines how treating PHP variables as interchangeable reusable resources could reduce memory usage, boost execution speed, and simplify code, while outlining current limitations, possible implementation strategies, challenges, and practical scenarios where such a feature would be beneficial.

Backend DevelopmentPHPPerformance Optimization

0 likes · 6 min read

Can PHP Swap Variables Like Reusable Resources? Exploring Performance Gains

Tech Freedom Circle

Nov 18, 2025 · Backend Development

Optimizing Internal HTTP Calls: From Head‑of‑Line Blocking to High‑Performance Microservices

This article dissects the hidden technical debt of internal HTTP APIs, explains why HTTP/1.1 causes head‑of‑line blocking, redundant headers and serialization overhead, and walks through a step‑by‑step, data‑driven optimization roadmap—including HTTP/2, Protobuf, request aggregation, connection pooling, compression, caching, async processing, observability, and safe gray‑release deployment—backed by concrete benchmarks and code samples.

Connection PoolingHTTP/2Microservices

0 likes · 32 min read

Optimizing Internal HTTP Calls: From Head‑of‑Line Blocking to High‑Performance Microservices

IT Services Circle

Nov 17, 2025 · Backend Development

Why JSON Slows Down High‑Performance APIs and Which Binary Formats Can Save You Time

The article explains why JSON becomes a bottleneck in latency‑sensitive applications, compares four high‑performance binary serialization formats—Protocol Buffers, FlatBuffers, MessagePack, and CBOR—and provides practical guidance on selecting and deploying the right format to dramatically reduce latency, payload size, and CPU usage.

CBORFlatBuffersMessagePack

0 likes · 11 min read

Why JSON Slows Down High‑Performance APIs and Which Binary Formats Can Save You Time

Sohu Smart Platform Tech Team

Nov 13, 2025 · Cloud Native

How We Tuned Nacos Config Center to Eliminate Timeouts and QPS Limits

This article explains how Nacos, an open‑source dynamic naming and configuration service, was used in a micro‑service project, the two performance problems encountered—configuration fetch timeouts and server‑side QPS throttling—and the step‑by‑step optimizations (memory caching, fallback values, pre‑fetching and listener registration, and limit adjustments) that resolved them.

Cloud NativeJavaMicroservices

0 likes · 16 min read

How We Tuned Nacos Config Center to Eliminate Timeouts and QPS Limits

vivo Internet Technology

Nov 12, 2025 · Fundamentals

Linux Kernel Innovations Powering the AI Agent Era – Highlights from China’s 20th CLK

The 20th China Linux Kernel Developers Conference, hosted by vivo, presented eleven technical talks covering AI‑driven kernel challenges, memory‑compression techniques, heterogeneous compression, async file‑cache management, uncached I/O, direct I/O for compressed files, parallel writeback, host‑initiated defragmentation, zoned storage, energy‑efficient I/O, and eBPF‑based CPU idle policies, each with concrete performance results and implementation details.

AIFile SystemsLinux kernel

0 likes · 12 min read

Linux Kernel Innovations Powering the AI Agent Era – Highlights from China’s 20th CLK

JD Retail Technology

Nov 11, 2025 · Backend Development

JIMDB’s Big-Hot Key Solution: Optimizing Distributed Cache Performance

JIMDB, a high‑performance Redis‑based distributed cache, introduces the “Big‑Hot Key” concept to dynamically identify keys that strain CPU or bandwidth, and implements a multi‑layer active governance framework—including real‑time detection, server‑side caching, circuit‑breaker, and client‑side consistency—to dramatically reduce resource consumption and boost throughput.

JimdbPerformance OptimizationResource Management

0 likes · 41 min read

JIMDB’s Big-Hot Key Solution: Optimizing Distributed Cache Performance

DeWu Technology

Nov 10, 2025 · Databases

How TiDB v7 Upgrade Boosted Performance and Solved Real‑World Issues

This article details the TiDB upgrade journey from version 5.3.3 to 7.5.x, covering architecture, new v7 features, migration strategies, encountered problems with full‑table scans and inaccurate aggregation plans, the solutions applied, and the measurable performance and stability gains achieved after the upgrade.

Database UpgradeDistributed SQLNewSQL

0 likes · 15 min read

How TiDB v7 Upgrade Boosted Performance and Solved Real‑World Issues

Baidu Geek Talk

Nov 10, 2025 · Cloud Native

How Polar‑TCP Breaks Kernel Network Bottlenecks for Cloud‑Native High‑Performance Services

This article explains how traditional kernel network stacks struggle with high‑concurrency, low‑latency cloud data‑center workloads and introduces Baidu Intelligent Cloud’s Polar solution—Polar‑TCP and Polar‑RDMA—which combine user‑space DPDK drivers, a lightweight TCP stack, and an industrial RPC framework to achieve near‑RDMA performance while preserving compatibility with existing TCP ecosystems.

DPDKNetwork StackPerformance Optimization

0 likes · 23 min read

How Polar‑TCP Breaks Kernel Network Bottlenecks for Cloud‑Native High‑Performance Services

Architect's Guide

Nov 10, 2025 · Artificial Intelligence

Build a Scalable, High‑Performance OCR Invoice Pipeline with Spring Boot & Tesseract

This article details a complete, production‑grade OCR invoice processing pipeline that combines a distributed Spring Boot microservice architecture, deep Tesseract optimizations, ML‑based data validation, GPU acceleration, Kubernetes deployment, and extensive performance and security strategies to achieve million‑scale daily throughput with high accuracy.

OCRPerformance OptimizationSpring Boot

0 likes · 16 min read

Build a Scalable, High‑Performance OCR Invoice Pipeline with Spring Boot & Tesseract

Practical DevOps Architecture

Nov 10, 2025 · Backend Development

Master Nginx: From Basics to Advanced Architecture and Real-World Scenarios

This comprehensive Nginx course covers everything from introductory concepts, installation, and configuration syntax to advanced topics like load balancing, security, performance tuning, Lua scripting, gRPC integration, and smooth upgrade strategies, providing practical scenario demos throughout.

NginxPerformance OptimizationSecurity

0 likes · 12 min read

Master Nginx: From Basics to Advanced Architecture and Real-World Scenarios

AndroidPub

Nov 9, 2025 · Mobile Development

How to Diagnose and Fix Jetpack Compose Performance Pitfalls

Learn how to identify and resolve performance issues in Jetpack Compose by using Layout Inspector, Stability Reports, and configuration files, understanding stable vs unstable parameters, applying strong skipping, and leveraging annotations and wrapper classes to achieve efficient UI recomposition.

AndroidJetpack ComposePerformance Optimization

0 likes · 12 min read

How to Diagnose and Fix Jetpack Compose Performance Pitfalls

Ops Community

Nov 8, 2025 · Operations

Mastering Nginx Reverse Proxy & Load Balancing: Best Practices for High‑Performance Deployments

This comprehensive guide walks you through Nginx reverse proxy and load balancing fundamentals, key features, suitable scenarios, environment prerequisites, step‑by‑step installation, core configuration, performance tuning, security hardening, high‑availability designs, troubleshooting, monitoring, backup strategies, real‑world case studies, and advanced learning paths for production‑grade deployments.

Performance OptimizationSecurityhigh availability

0 likes · 56 min read

Mastering Nginx Reverse Proxy & Load Balancing: Best Practices for High‑Performance Deployments

MaGe Linux Operations

Nov 8, 2025 · Backend Development

Mastering Redis Cache: Prevent Penetration, Breakdown, and Avalanche with Proven Solutions

This comprehensive guide explains the three major Redis cache issues—penetration, breakdown, and avalanche—detailing their causes, impacts, and production‑ready solutions such as Bloom filters, distributed locks, logical expiration, random TTL, multi‑level caching, high‑availability setups, monitoring, backup, and best‑practice recommendations.

Performance OptimizationSpring Bootbloom-filter

0 likes · 56 min read

Mastering Redis Cache: Prevent Penetration, Breakdown, and Avalanche with Proven Solutions

Top Architect

Nov 8, 2025 · Backend Development

Mastering High‑Throughput Thread Pools: Strategies for 100k QPS in Java

This article analyzes a high‑traffic scenario where 100,000 QPS requests each require 100 ms processing, explains why a naïve fixed thread pool would exhaust resources, and presents practical optimization goals, strategies, and Spring‑Boot code examples—including custom pool parameters, rejection handling, batch processing, and advanced techniques like Disruptor and rate‑limiting—to build a stable, high‑performance task execution system.

Performance OptimizationSpring BootThreadPool

0 likes · 8 min read

Mastering High‑Throughput Thread Pools: Strategies for 100k QPS in Java

Sohu Tech Products

Nov 5, 2025 · Cloud Native

How We Optimized Nacos Config Center to Eliminate Timeouts and QPS Limits

This article explains Nacos's role as a dynamic service discovery and configuration platform, describes two real‑world performance problems encountered in production, and details the step‑by‑step code‑level optimizations—memory caching with fallback and pre‑fetching with listeners—that resolved timeout and rate‑limit issues.

Cloud NativeConfiguration ManagementMicroservices

0 likes · 16 min read

How We Optimized Nacos Config Center to Eliminate Timeouts and QPS Limits

Architect

Nov 4, 2025 · Operations

How to Accurately Track API Calls per Minute: 5 Proven Monitoring Strategies

This article explores why precise per‑minute API call statistics are essential for performance bottleneck detection, capacity planning, security alerts, billing, and troubleshooting, and presents five practical implementations—including fixed‑window counters, sliding windows, AOP‑based interception, Redis time‑series storage, and Micrometer‑Prometheus integration—along with their trade‑offs and capacity‑planning guidelines.

API monitoringJavaMetrics

0 likes · 25 min read

How to Accurately Track API Calls per Minute: 5 Proven Monitoring Strategies

Model Perspective

Nov 4, 2025 · Fundamentals

Can Math Predict a Joke’s Success? A Modeling Guide to Humor

This article applies mathematical modeling—including surprise indices, timing dynamics, density optimization, and risk assessment—to dissect how stand‑up comedy works, illustrating each concept with real‑world jokes and offering practical formulas for crafting more effective humor.

Performance Optimizationcomedy analysishumor

0 likes · 16 min read

Can Math Predict a Joke’s Success? A Modeling Guide to Humor

DevOps Coach

Oct 31, 2025 · Backend Development

How Netflix’s Maestro Engine Gained a 100× Speed Boost with a New Actor‑Based Architecture

Netflix’s Maestro workflow orchestrator was redesigned with a lightweight, stateful actor model and Java virtual threads, cutting engine overhead from seconds to milliseconds, delivering a hundred‑fold performance increase while preserving scalability, reliability, and strong execution guarantees for massive data and ML pipelines.

Distributed SystemsJava virtual threadsNetflix Maestro

0 likes · 28 min read

How Netflix’s Maestro Engine Gained a 100× Speed Boost with a New Actor‑Based Architecture

Java Backend Technology

Oct 31, 2025 · Databases

10 Essential MySQL Tricks to Boost Performance and Simplify Queries

This article presents ten practical MySQL techniques—from using JSON columns for flexible data to leveraging CTEs, window functions, and advanced clauses like WITH ROLLUP and INSERT IGNORE—each illustrated with real‑world scenarios and ready‑to‑run SQL code to improve query efficiency and developer productivity.

CTEDatabase QueriesJSON

0 likes · 11 min read

10 Essential MySQL Tricks to Boost Performance and Simplify Queries

JavaScript

Oct 30, 2025 · Frontend Development

Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps

Web Workers, an often‑overlooked browser API, enable JavaScript to run heavy computations in background threads, freeing the main UI thread, boosting multi‑core utilization, improving memory management, and powering use‑cases such as big data processing, image filtering, AI inference, and encryption, dramatically enhancing frontend performance.

JavaScriptPerformance OptimizationWeb Workers

0 likes · 5 min read

Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps

JD Tech Talk

Oct 30, 2025 · Backend Development

How a Massive Cache Key Crashed a System and How to Prevent It

This article examines a real-world incident where a massive cache key and cache penetration during a high‑traffic promotion caused Redis overload and a system outage, then details root‑cause analysis, mitigation steps like serialization changes, compression, lock‑based fallback, and preventive best practices.

Big KeyPerformance Optimizationcache-penetration

0 likes · 8 min read

How a Massive Cache Key Crashed a System and How to Prevent It

Eric Tech Circle

Oct 30, 2025 · Frontend Development

How to Slash Mini Program Package Size with Smart Sub‑Package Strategies

This article explains why WeChat Mini Programs hit the 2 MB package limit, analyzes common causes of oversized main bundles, and provides a step‑by‑step guide—including pre‑slimming tactics, subpackage configuration, code examples, and performance tips—to reduce the main package below the limit and improve load speed.

Package SizePerformance OptimizationSubpackage

0 likes · 12 min read

How to Slash Mini Program Package Size with Smart Sub‑Package Strategies

JD Cloud Developers

Oct 29, 2025 · Mobile Development

How JD Finance Accelerated iOS Chart Rendering with ROMA and ECharts

This article details JD Finance's iOS app chart rendering challenges, the integration of Apache ECharts via a custom ROMA WebView, cache design, data communication between Native and WebView, and performance gains achieved through automated retries and WebView pooling.

EChartsPerformance OptimizationROMA

0 likes · 8 min read

How JD Finance Accelerated iOS Chart Rendering with ROMA and ECharts

Top Architect

Oct 28, 2025 · Backend Development

How I Cut XML‑to‑MySQL Import Time from 300 s to 4 s

This article details a step‑by‑step performance overhaul for importing 60,000+ XML records into MySQL, covering baseline measurements, MySQL batch processing, asynchronous writes with Disruptor, XML parsing optimizations, and tuning MySQL buffers, ultimately reducing total runtime from 300 seconds to just four seconds.

Batch ProcessingDisruptorJava

0 likes · 14 min read

How I Cut XML‑to‑MySQL Import Time from 300 s to 4 s

Raymond Ops

Oct 27, 2025 · Operations

Master Linux Network Tuning for High-Concurrency: Practical Guide

This guide walks through a real‑world high‑concurrency Linux scenario, diagnosing TCP state bottlenecks, analyzing default kernel parameters, and providing step‑by‑step sysctl tweaks, queue and buffer adjustments, monitoring scripts, and stress‑test recommendations to dramatically improve connection handling and throughput.

LinuxNetwork TuningPerformance Optimization

0 likes · 9 min read

Master Linux Network Tuning for High-Concurrency: Practical Guide

Code Mala Tang

Oct 26, 2025 · Databases

Why Parameterized Queries Supercharge SQL Performance (And Prevent Injection)

This article explains how using bound (parameterized) queries in SQL Server improves performance by enabling execution plan reuse, reducing compilation overhead and memory usage, while also preventing SQL injection, and discusses potential pitfalls like parameter sniffing and how to mitigate them with OPTION (RECOMPILE).

Parameterized QueriesPerformance OptimizationSQL

0 likes · 6 min read

Why Parameterized Queries Supercharge SQL Performance (And Prevent Injection)

Programmer DD

Oct 23, 2025 · Backend Development

Why Jackson Serialization Can Drain Your CPU and How to Fix It

The article reveals how default Jackson JSON serialization can become a hidden CPU bottleneck in high‑traffic Spring Boot services, explains why full entity objects inflate serialization time, and presents practical optimizations such as @JsonView, custom ObjectMapper settings, hand‑written serializers, and alternative data formats to dramatically cut latency and cloud costs.

JacksonJavaPerformance Optimization

0 likes · 13 min read

Why Jackson Serialization Can Drain Your CPU and How to Fix It

Alipay Experience Technology

Oct 22, 2025 · Mobile Development

How MYKMP Enables One-Code-Three-Platform Development for Alipay’s Mobile Apps

MYKMP is Alipay’s native cross‑platform solution built on Kotlin Multiplatform and Compose, enabling a single codebase to run on Android, iOS and HarmonyOS, with detailed architecture, engineering practices, GC optimizations, and integration guidelines presented for billion‑scale applications.

Cross‑platform developmentGarbage CollectionHarmonyOS

0 likes · 20 min read

How MYKMP Enables One-Code-Three-Platform Development for Alipay’s Mobile Apps

Architect Chen

Oct 22, 2025 · Big Data

How to Eliminate Kafka Message Backlog with Practical Optimizations

This guide presents concrete techniques for improving Kafka consumer and producer performance, scaling clusters, tuning broker settings, and designing asynchronous buffering layers to prevent message accumulation and boost overall throughput.

Big DataKafkaPerformance Optimization

0 likes · 5 min read

How to Eliminate Kafka Message Backlog with Practical Optimizations

Tech Freedom Circle

Oct 20, 2025 · Backend Development

Mastering JVM Tuning: Real-World Enterprise Case Study for Interview Success

The article walks through a high‑traffic video service that suffered GC spikes, details a systematic diagnosis of three JVM configuration flaws, evaluates four GC tuning schemes across load scenarios, resolves CMS‑related pauses, and presents concrete performance gains with metrics, code snippets, and visual charts.

CMSGC tuningJVM

0 likes · 28 min read

Mastering JVM Tuning: Real-World Enterprise Case Study for Interview Success

Alibaba Cloud Observability

Oct 20, 2025 · Artificial Intelligence

How We Boosted Embedding Throughput 16× and Cut Vector Index Costs in a Cloud‑Native Setup

This article examines the high cost and low throughput of embedding vectors in log‑processing scenarios, analyzes the performance bottlenecks of inference frameworks, and details a series of cloud‑native optimizations—including switching to vLLM, deploying multiple model replicas with Triton, decoupling tokenization, and priority queuing—that together raise throughput by 16× and reduce per‑token pricing by two orders of magnitude.

EmbeddingGPU inferencePerformance Optimization

0 likes · 9 min read

How We Boosted Embedding Throughput 16× and Cut Vector Index Costs in a Cloud‑Native Setup

Code Ape Tech Column

Oct 20, 2025 · Backend Development

Unlock Redis Performance: Master Lua Scripting in Spring Boot

This tutorial explains how to integrate Lua scripts with Spring Boot and Redis, covering Lua fundamentals, advantages, real‑world use cases, step‑by‑step implementation in Spring Boot, performance gains, error handling, security measures, and best practices for reliable backend development.

Backend DevelopmentLua scriptingPerformance Optimization

0 likes · 21 min read

Unlock Redis Performance: Master Lua Scripting in Spring Boot

ITPUB

Oct 18, 2025 · Backend Development

How We Eliminated GC Pauses in a 100k QPS Service: Deep Dive into JVM Tuning

This article details the step‑by‑step investigation and JVM‑level optimizations—including early tenuring, parameter tuning, ZGC migration, and an Eden‑pre‑heat trick—that transformed a high‑concurrency, low‑latency system from 95% to 99.995% success during massive index switches.

GC tuningJVMJava

0 likes · 22 min read

How We Eliminated GC Pauses in a 100k QPS Service: Deep Dive into JVM Tuning

macrozheng

Oct 16, 2025 · Backend Development

How to Turn a 3‑Second Category Tree into 30 ms: A Real‑World Backend Optimization

This article analyzes a severe N+1 query performance disaster in a Java Spring Boot project, explains why the traditional recursive approach is slow, and presents a production‑tested solution that reduces database calls to one, uses O(n) tree construction, and adds multi‑level caching to achieve a 100‑fold speedup.

Java backendN+1 QueryPerformance Optimization

0 likes · 24 min read

How to Turn a 3‑Second Category Tree into 30 ms: A Real‑World Backend Optimization

Ray's Galactic Tech

Oct 14, 2025 · Backend Development

15 Proven Spring Boot Performance Hacks to Supercharge Your Apps

This guide presents fifteen practical Spring Boot performance techniques—from lazy initialization and context indexing to JVM tuning, connection‑pool tweaks, caching strategies, JSON serialization, and native image compilation—providing concrete configurations, code snippets, and best‑practice advice for faster startup, lower memory usage, and higher throughput.

JVMJavaPerformance Optimization

0 likes · 11 min read

15 Proven Spring Boot Performance Hacks to Supercharge Your Apps

StarRocks

Oct 14, 2025 · Big Data

How Ctrip Scaled UBT Analytics by Migrating from ClickHouse to StarRocks

Ctrip's User Behavior Tracking (UBT) system, handling 30 TB of daily data, moved from ClickHouse to StarRocks' compute‑storage separated architecture, cutting average query latency from 1.4 seconds to 203 ms, halving storage, reducing nodes from 50 to 40, and boosting write throughput to 3 million rows per second.

Big DataClickHouseData Migration

0 likes · 15 min read

How Ctrip Scaled UBT Analytics by Migrating from ClickHouse to StarRocks

Java Tech Enthusiast

Oct 14, 2025 · Backend Development

How to Build a High‑Performance MyBatis Log Analyzer with Spring Boot

This article describes the design and implementation of a high‑performance log‑analysis system for massive MyBatis logs, covering problem analysis, technology selection, architecture, multithreaded parsing, flexible regex templates, performance monitoring, pitfalls, optimization practices, results, and future plans.

JavaMyBatisPerformance Optimization

0 likes · 13 min read

How to Build a High‑Performance MyBatis Log Analyzer with Spring Boot

JavaScript

Oct 14, 2025 · Frontend Development

Boost JavaScript Async Performance by Up to 80% with New Promise Techniques

While async/await simplifies JavaScript code, it can introduce significant overhead in high‑frequency or compute‑heavy scenarios; this article introduces alternative async patterns—optimized Promise chaining, parallel Promise.all, batch processing, and pooling—that can reduce context switches and deliver performance gains of up to 80%.

JavaScriptParallelismPerformance Optimization

0 likes · 5 min read

Boost JavaScript Async Performance by Up to 80% with New Promise Techniques

Baidu Geek Talk

Oct 13, 2025 · Big Data

How Baidu Scaled Its Data Warehouse to Handle Billions of PVs and Petabytes

This article details Baidu APP's massive data‑warehouse overhaul, describing the two‑step strategy that stabilized log cleaning, modernized the ETL framework, introduced wide‑table architectures, and implemented tiered storage to dramatically improve processing speed, reliability, and cost efficiency for petabyte‑scale workloads.

Big DataData WarehouseETL

0 likes · 25 min read

How Baidu Scaled Its Data Warehouse to Handle Billions of PVs and Petabytes

Deepin Linux

Oct 9, 2025 · Fundamentals

Why Memory Alignment Is Critical for Linux Performance and Stability

This article explains how memory alignment in Linux—driven by hardware granularity, kernel allocation policies, and compiler optimizations—affects struct layout, cross‑platform compatibility, and execution speed, and provides practical techniques and testing methods to ensure optimal performance and reliability.

C programmingLinuxPerformance Optimization

0 likes · 37 min read

Why Memory Alignment Is Critical for Linux Performance and Stability

Code Ape Tech Column

Oct 8, 2025 · Databases

Boost Your Data Ingestion: A High‑Performance Java Stream Load Architecture for Doris

This article presents a complete Java‑based architecture for high‑throughput Doris stream loading, covering project structure, Maven dependencies, configuration properties, field‑mapping annotations, automatic mapper utilities, a robust parallel loader with retry and compression, plus performance tuning recommendations.

Annotation MappingJavaPerformance Optimization

0 likes · 23 min read

Boost Your Data Ingestion: A High‑Performance Java Stream Load Architecture for Doris

ITPUB

Oct 5, 2025 · Backend Development

How to Clear a 10‑Million‑Message Queue in 5 Hours: A Five‑Step Rescue Plan

When a flash‑sale causes a 10 million‑message backlog and consumers only process 200 messages per second, this guide shows a five‑step, 5‑hour strategy—horizontal scaling, message downgrade, flow control, temporary dump, and parallel blasting—to restore throughput and prevent system collapse.

Distributed SystemsKafkaPerformance Optimization

0 likes · 6 min read

How to Clear a 10‑Million‑Message Queue in 5 Hours: A Five‑Step Rescue Plan

IT Architects Alliance

Oct 3, 2025 · Backend Development

Designing High‑Performance API Gateways for Microservices: Best Practices & Code Samples

This article explores why API gateways are essential in microservice architectures, outlines core design functions such as routing, load balancing, authentication, rate limiting, and protocol translation, and provides practical code examples, performance‑tuning strategies, technology comparisons, and deployment guidelines for robust backend systems.

Backend DevelopmentJavaKubernetes

0 likes · 11 min read

Designing High‑Performance API Gateways for Microservices: Best Practices & Code Samples

IT Services Circle

Oct 3, 2025 · Databases

Why Does PostgreSQL’s B‑Tree Index Behave Like a B+ Tree?

PostgreSQL’s index system, officially called a B‑Tree, actually implements B+‑tree functionality, storing only TIDs in leaf nodes, using linked leaf pages for efficient range queries, and includes optimizations like deduplication, index‑only scans, and reverse‑key indexes to boost performance.

B+TreeB-TreePerformance Optimization

0 likes · 7 min read

Why Does PostgreSQL’s B‑Tree Index Behave Like a B+ Tree?

ITPUB

Oct 3, 2025 · Big Data

How Qunar Travel Cut 2000 CPU Cores by Optimizing Kafka Production

This case study details how Qunar Travel's engineering team analyzed Kafka production bottlenecks during peak traffic, added targeted monitoring, tuned thread and batch parameters, and validated the changes through gray‑scale tests, ultimately saving about 2000 CPU cores across three clusters while reducing request volume and improving network and disk utilization.

Big DataCPU SavingsKafka

0 likes · 14 min read

How Qunar Travel Cut 2000 CPU Cores by Optimizing Kafka Production

JavaScript

Oct 2, 2025 · Frontend Development

Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps

This article explains why Web Workers, an often‑overlooked browser API, can free the main thread, leverage multi‑core CPUs, and improve memory management, offering concrete scenarios and code examples to dramatically boost JavaScript performance in modern front‑end applications.

JavaScriptPerformance OptimizationWeb Workers

0 likes · 5 min read

Python Programming Learning Circle

Sep 30, 2025 · Fundamentals

How to Supercharge Your Python Code: Proven Performance Optimization Techniques

This comprehensive guide walks you through Python performance optimization, covering profiling, algorithmic improvements, data‑structure choices, code‑level tricks, concurrency, memory management, third‑party libraries and a practical checklist to ensure your programs run faster and more efficiently.

Memory ManagementPerformance OptimizationProfiling

0 likes · 18 min read

How to Supercharge Your Python Code: Proven Performance Optimization Techniques

Huawei Cloud Developer Alliance

Sep 30, 2025 · Artificial Intelligence

Boost AI Model Performance: Master Host‑Device Scheduling on Ascend Platforms

This article explains how CPUs and Ascend AI processors cooperate as host and device, compares sink and host scheduling modes, defines Host‑Bound and Device‑Bound models, and presents optimization techniques such as tiling cache, multi‑core concurrency, and small‑shape operator handling that dramatically improve AI model execution efficiency.

AIModel SchedulingPerformance Optimization

0 likes · 12 min read

Boost AI Model Performance: Master Host‑Device Scheduling on Ascend Platforms

Kuaishou Frontend Engineering

Sep 28, 2025 · Frontend Development

Inside Kuaishou’s Low‑Code ‘Jimu’ Platform: Solving Large‑Scale Event Page Challenges

This technical article examines Kuaishou’s low‑code “Jimu” platform, detailing its architecture, performance bottlenecks, and optimization strategies that enable efficient, high‑quality construction of massive activity pages such as the 2025 CNY campaign.

Component ArchitecturePerformance Optimizationlow-code

0 likes · 19 min read

Inside Kuaishou’s Low‑Code ‘Jimu’ Platform: Solving Large‑Scale Event Page Challenges

Linux Kernel Journey

Sep 28, 2025 · Fundamentals

Low‑Latency GPU Packet Processing: Techniques, Trade‑offs, and Benchmarks

This article examines how to achieve low‑latency network packet processing on NVIDIA GPUs by comparing CPU and GPU implementations, exploring memory optimizations, batch strategies, stream concurrency, persistent kernels, and CUDA graphs, and presenting detailed performance measurements for each technique.

CUDAGPULow latency

0 likes · 12 min read

Low‑Latency GPU Packet Processing: Techniques, Trade‑offs, and Benchmarks

Tech Freedom Circle

Sep 25, 2025 · Artificial Intelligence

RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking

The article provides a detailed technical analysis of RAGFlow's search engine, covering the Searcher class coordination, adaptive multi‑path retrieval (vector, keyword, and knowledge‑graph), intelligent fusion with weighted scoring, caching, performance monitoring, and both built‑in and model‑driven reranking to achieve high‑precision results.

Performance OptimizationRAGFlowReranking

0 likes · 32 min read

RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking

Mingyi World Elasticsearch

Sep 24, 2025 · Big Data

How 3 Simple Tweaks Doubled Elasticsearch Scan Performance on 40M Docs

The article details a real‑world case of scanning over 40 million Elasticsearch documents, identifies four performance bottlenecks, and presents three concrete optimizations—_source filtering, precise index targeting, and batch‑size tuning—that together cut processing time in half and raise CPU utilization from 25% to 85%.

Batch Size TuningElasticsearchIncremental Sync

0 likes · 8 min read

How 3 Simple Tweaks Doubled Elasticsearch Scan Performance on 40M Docs

Deepin Linux

Sep 23, 2025 · Fundamentals

Mastering Linux Memory Allocation: When to Use brk vs mmap

This article explains the low‑level mechanisms of brk and mmap in Linux, compares their characteristics, shows why malloc selects one over the other based on allocation size, and provides practical code examples, performance tips, and common pitfalls for developers.

LinuxMemory ManagementPerformance Optimization

0 likes · 29 min read

Mastering Linux Memory Allocation: When to Use brk vs mmap

MaGe Linux Operations

Sep 22, 2025 · Operations

10 Proven Nginx Tweaks to Turn Your Web Server from Slow to Lightning Fast

This guide walks through ten practical Nginx optimization techniques—from worker process tuning and connection handling to gzip compression, static file caching, load‑balancing, security hardening, logging, memory tuning, HTTP/2/SSL tweaks, monitoring scripts, and common pitfalls—helping you dramatically boost throughput, reduce latency, and improve stability in high‑traffic environments.

NginxPerformance OptimizationWeb server

0 likes · 14 min read

10 Proven Nginx Tweaks to Turn Your Web Server from Slow to Lightning Fast

php Courses

Sep 22, 2025 · Fundamentals

Boost C++ Performance: Proven Memory Management Techniques You Must Use

This article explores why efficient memory management is crucial for C++ performance and presents practical strategies—including stack allocation, RAII, smart pointers, custom memory pools, optimal containers, move semantics, and diagnostic tools—to write faster, more robust programs.

CPerformance OptimizationRAII

0 likes · 9 min read

Boost C++ Performance: Proven Memory Management Techniques You Must Use

Su San Talks Tech

Sep 22, 2025 · Backend Development

Mastering Pagination in Billion‑Row Sharded Databases: Interview‑Ready Strategies

This article dissects the challenges of implementing pagination on billion‑row sharded tables, explains common sharding strategies, compares SDK, Proxy, and Sidecar architectures, and presents practical solutions such as global query, infinite scroll, two‑phase queries, index tables, and external storage to help engineers ace interview questions.

Performance Optimizationbackend interviewdistributed databases

0 likes · 24 min read

Mastering Pagination in Billion‑Row Sharded Databases: Interview‑Ready Strategies

MaGe Linux Operations

Sep 20, 2025 · Databases

Master Redis Memory Troubleshooting: From Basics to Advanced Solutions

This comprehensive guide walks you through diagnosing and resolving Redis memory issues, covering the underlying architecture, common pitfalls such as memory leaks and fragmentation, practical diagnostic commands, automated monitoring scripts, and optimization techniques to prevent costly outages and improve performance.

Memory ManagementPerformance Optimizationredis

0 likes · 26 min read

Master Redis Memory Troubleshooting: From Basics to Advanced Solutions

Model Perspective

Sep 19, 2025 · Fundamentals

Unlocking Your Body’s Clock: How Biological Rhythms Shape Performance

This article explains how human physiological, emotional, and intellectual abilities follow predictable biological rhythms, describes sinusoidal mathematical models for these cycles, and shows practical applications such as athlete scheduling and student study planning, while also discussing model extensions and limitations.

Performance Optimizationbiological rhythmschronobiology

0 likes · 8 min read