Tagged articles
1919 articles
Page 2 of 20
Woodpecker Software Testing
Woodpecker Software Testing
Jan 6, 2026 · Backend Development

How ChatGPT Analyzes and Optimizes Application Performance

The article shows how ChatGPT examines JMeter load‑test results, identifies CPU, database and I/O bottlenecks, and then proposes concrete optimizations such as adding indexes, tuning HikariCP pool settings, caching, code refactoring, and monitoring to improve login throughput and latency.

ChatGPTHikariCPJMeter
0 likes · 11 min read
How ChatGPT Analyzes and Optimizes Application Performance
java1234
java1234
Jan 6, 2026 · Backend Development

Boost API Latency 10× with Spring Boot 3 and a Three‑Level Local Cache Pyramid

The article explains why adding Redis alone often remains slow, introduces a three‑level cache pyramid (L1 Caffeine, L2 Redis, L3 MySQL) built with Spring Boot 3, and shows how this design reduces request latency from 28 ms to 2 ms, cuts CPU usage by 35 % and achieves up to 14‑fold throughput improvement.

CacheCaffeineJava
0 likes · 10 min read
Boost API Latency 10× with Spring Boot 3 and a Three‑Level Local Cache Pyramid
Tech Freedom Circle
Tech Freedom Circle
Jan 5, 2026 · Artificial Intelligence

A Three‑Step Guide to Mastering RAG Semantic‑Loss Interview Questions

RAG (Retrieval‑Augmented Generation) is a hot interview topic, and many candidates stumble on semantic‑loss issues; this article dissects a real JD interview case, identifies three core shortcomings, and presents a three‑step technical solution—structure restoration, semantic splitting, and hybrid retrieval—plus a ready‑to‑use answer template.

AI InterviewDocument ParsingHybrid Search
0 likes · 25 min read
A Three‑Step Guide to Mastering RAG Semantic‑Loss Interview Questions
php Courses
php Courses
Jan 5, 2026 · Artificial Intelligence

How AI is Transforming PHP Development: From Smart Coding to Automated Testing

This article explores how AI technologies are reshaping the entire PHP development lifecycle—offering context‑aware code generation, intelligent architecture suggestions, automated test creation, performance optimisation, and end‑to‑end workflow integration—while highlighting real‑world case studies and practical adoption strategies.

AIAutomated TestingDevOps
0 likes · 9 min read
How AI is Transforming PHP Development: From Smart Coding to Automated Testing
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Jan 5, 2026 · Artificial Intelligence

How Baidu Tianchi Supernodes Supercharge Large‑Model Inference: Architecture, Deployment, and Optimization

This article details Baidu's Tianchi supernode design and software tuning—covering hardware scale‑up, deployment planning, Prefill and Decode stage optimizations, quantization strategies, and communication schemes—to dramatically boost large‑model inference throughput and latency while lowering token‑cost.

AI InfrastructureParallelismPerformance Optimization
0 likes · 20 min read
How Baidu Tianchi Supernodes Supercharge Large‑Model Inference: Architecture, Deployment, and Optimization
JavaScript
JavaScript
Dec 31, 2025 · Frontend Development

Unlock Hidden Performance Gains in Front‑End Apps with Web Workers

Web Workers let JavaScript run heavy computations in background threads, freeing the main UI thread, boosting multi‑core utilization, and improving memory management, which together dramatically enhance front‑end performance for tasks like data processing, image filtering, media handling, and AI inference.

JavaScriptPerformance OptimizationWeb Workers
0 likes · 5 min read
Unlock Hidden Performance Gains in Front‑End Apps with Web Workers
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 30, 2025 · Big Data

How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine

StarRocks and Apache Paimon have been progressively integrated across multiple releases, enabling a unified lakehouse architecture that supports multi-source federated analysis, time-travel queries, native readers/writers, distributed planning, and advanced profiling, while delivering performance gains that bring Paimon query speed on par with native StarRocks tables.

Apache PaimonData IntegrationLakehouse
0 likes · 9 min read
How StarRocks and Apache Paimon Unite to Build a True Lakehouse Native Engine
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 29, 2025 · Artificial Intelligence

How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management

This article details the architecture and implementation of Tair KVCache Manager, an enterprise‑grade service that centralises KVCache metadata, decouples inference engines from storage, provides elastic scaling, multi‑tenant isolation, high availability, and performance‑optimised cache management for large‑scale LLM inference workloads.

Cache ManagementKVCacheLLM
0 likes · 28 min read
How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management
DeWu Technology
DeWu Technology
Dec 25, 2025 · Frontend Development

Build a High‑Performance H5 PAG Player: SDK, Image Editing, Batch Synthesis

This guide details how to implement a full‑stack H5 PAG player for the “Use Basketball to Know Me” activity, covering SDK loading, canvas‑based image manipulation (drag, scale, rotate), dynamic layer and text replacement, real‑time preview synchronization, snapshot export, batch synthesis, performance tuning, and fallback strategies.

Batch ProcessingCanvasPAG
0 likes · 30 min read
Build a High‑Performance H5 PAG Player: SDK, Image Editing, Batch Synthesis
Ray's Galactic Tech
Ray's Galactic Tech
Dec 24, 2025 · Backend Development

Master Production-Ready Nginx Server Configurations: Templates & Best Practices

This guide explains why most Nginx server setups fail, presents a core server template, walks through five real‑world scenarios with complete configuration snippets, and adds performance, security, and monitoring tips so you can copy a production‑grade Nginx configuration instantly.

Performance OptimizationSecurityServer Configuration
0 likes · 8 min read
Master Production-Ready Nginx Server Configurations: Templates & Best Practices
JakartaEE China Community
JakartaEE China Community
Dec 23, 2025 · Backend Development

Performance Optimization Best Practice #3: Implementing Caching in Jakarta EE

This article explains the different cache layers—client, application, database, and distributed—provides best‑practice guidelines, and shows how to configure client‑side headers, application‑side caches with Hazelcast, JPA second‑level caching, and JMS‑based cache coordination in GlassFish.

GlassFishHazelcastPerformance Optimization
0 likes · 9 min read
Performance Optimization Best Practice #3: Implementing Caching in Jakarta EE
Ray's Galactic Tech
Ray's Galactic Tech
Dec 22, 2025 · Databases

Mastering Redis: Choosing the Right Data Structure for High‑Performance Systems

Redis offers five core data types—String, Hash, List, Set, and ZSet—each acting as a high‑performance concurrent data structure that determines system throughput, latency, and stability; this guide explains their characteristics, optimal use‑cases, anti‑patterns, and practical code examples for robust architecture design.

Backend DevelopmentPerformance Optimizationcaching
0 likes · 6 min read
Mastering Redis: Choosing the Right Data Structure for High‑Performance Systems
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Dec 22, 2025 · Artificial Intelligence

Boost LLM Inference with KV‑Cache‑Aware Routing on Alibaba Cloud ACK GIE

This article explains why KV‑Cache hit rate is critical for large‑model inference, describes vLLM's automatic prefix caching, outlines the distributed cache challenges, and provides a step‑by‑step guide to deploying Alibaba Cloud ACK Gateway with Inference Extension's precise‑mode prefix‑cache‑aware routing, backed by benchmark results.

Alibaba CloudInferenceKV cache
0 likes · 18 min read
Boost LLM Inference with KV‑Cache‑Aware Routing on Alibaba Cloud ACK GIE
Code Ape Tech Column
Code Ape Tech Column
Dec 19, 2025 · Backend Development

Boost SpringBoot Log Management: Step‑by‑Step Integration with Hera

This article explains why traditional SpringBoot logging falls short, introduces the Hera log platform’s three core benefits, outlines a layered integration architecture, and provides a detailed five‑step guide—including Maven dependencies, YAML configuration, custom field providers, log output, traceability, and console usage—plus performance, high‑availability, security tips and common pitfalls.

Distributed SystemsHeraLog Management
0 likes · 14 min read
Boost SpringBoot Log Management: Step‑by‑Step Integration with Hera
Sohu Tech Products
Sohu Tech Products
Dec 17, 2025 · Artificial Intelligence

How We Cut Vision Transformer Inference Latency from 53 ms to 8 ms

Facing 53.64 ms per‑image latency in a Flask‑served Vision Transformer classifier, we iteratively optimized the pipeline—switching to ONNX Runtime, leveraging TensorRT, replacing Pillow with OpenCV, eliminating URL downloads, and finally batching requests—reducing average server‑side processing to 8.34 ms, a 6.4× speedup.

BatchingFlaskONNX
0 likes · 28 min read
How We Cut Vision Transformer Inference Latency from 53 ms to 8 ms
Baidu Geek Talk
Baidu Geek Talk
Dec 17, 2025 · Artificial Intelligence

Accelerate LLM Deployment on Baidu Kunlun XPU with the Open‑Source vLLM‑Kunlun Plugin

The vLLM‑Kunlun Plugin, jointly released by Baidu Baige and Kunlun Chip, provides a high‑performance, zero‑intrusion solution for deploying open‑source large language models on domestic Kunlun XPU hardware, includes fused operators, precision‑validation and profiling tools, and supports over twenty mainstream and multimodal models.

Kunlun XPUModel DeploymentPerformance Optimization
0 likes · 7 min read
Accelerate LLM Deployment on Baidu Kunlun XPU with the Open‑Source vLLM‑Kunlun Plugin
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 17, 2025 · Cloud Native

How 3FS Powers High‑Performance KVCache for AI Inference: Architecture, Optimizations, and Cloud‑Native Deployment

This article details the design and engineering of the 3FS distributed file system as a scalable KVCache backend for large‑language‑model inference, covering its architecture, performance tuning, reliability fixes, integration with SGLang/vLLM, and cloud‑native Kubernetes operator deployment.

3FSAI inferenceCloud Native
0 likes · 30 min read
How 3FS Powers High‑Performance KVCache for AI Inference: Architecture, Optimizations, and Cloud‑Native Deployment
Ray's Galactic Tech
Ray's Galactic Tech
Dec 15, 2025 · Databases

Mastering Database Design: From Core Principles to Modern Distributed Practices

This comprehensive guide walks you through fundamental database design goals, a step‑by‑step lifecycle, nine essential strategies—including normalization, indexing, and security—plus modern distributed and NoSQL considerations, performance tuning, high‑availability tactics, and practical tools for robust data governance.

Data GovernanceDatabase designNoSQL
0 likes · 11 min read
Mastering Database Design: From Core Principles to Modern Distributed Practices
Java Architect Handbook
Java Architect Handbook
Dec 12, 2025 · Backend Development

Master Spring Task: Build, Optimize, and Scale Your Scheduled Jobs

This guide walks you through what Spring Task is, how to enable it in three simple steps, decode cron expressions, explore six real‑world use cases, unlock four advanced features, avoid common pitfalls, and apply performance‑tuning and future‑proofing techniques for robust Java scheduling.

Distributed SchedulingJavaPerformance Optimization
0 likes · 11 min read
Master Spring Task: Build, Optimize, and Scale Your Scheduled Jobs
Raymond Ops
Raymond Ops
Dec 11, 2025 · Operations

Master Container Networking: From Basics to Advanced Kubernetes Practices

This comprehensive guide explores container networking fundamentals, Docker network modes, Kubernetes CNI plugins, network security policies, monitoring, troubleshooting, and performance optimization, providing practical commands and configuration examples for operations engineers.

CNIDockerKubernetes
0 likes · 20 min read
Master Container Networking: From Basics to Advanced Kubernetes Practices
Su San Talks Tech
Su San Talks Tech
Dec 11, 2025 · Databases

How to Scale a 900 M‑Row Message Table to 60 B Rows Without Downtime

This article details a comprehensive sharding strategy for a high‑traffic message table, explaining why early partitioning is essential, how to design language‑based vertical shards combined with hash‑based horizontal shards, calculate table counts and virtual nodes, and execute a zero‑downtime migration using dual‑writes and gradual traffic rollout.

Performance Optimizationhorizontal partitioningmysql
0 likes · 16 min read
How to Scale a 900 M‑Row Message Table to 60 B Rows Without Downtime
Alibaba Cloud Observability
Alibaba Cloud Observability
Dec 9, 2025 · Cloud Native

Unlocking System Insights with Graph Queries in Cloud‑Native Observability

This article explains how integrating graph‑based data models into cloud‑native observability platforms transforms isolated metric monitoring into a relational view, enabling powerful queries such as graph‑match and Cypher to perform fault impact analysis, root‑cause tracing, and security audits across services, pods, and infrastructure.

CypherObservabilityPerformance Optimization
0 likes · 29 min read
Unlocking System Insights with Graph Queries in Cloud‑Native Observability
Linux Kernel Journey
Linux Kernel Journey
Dec 7, 2025 · Fundamentals

CUDA Optimization Basics: Understanding GPU Architecture and Warp Scheduling

This article explains the fundamentals of CUDA performance tuning, covering GPU architectures from Kepler to Volta, the role of SMX, warp schedulers, registers and memory hierarchies, and provides practical guidance on launch configuration, latency hiding, and thread‑block sizing to maximize throughput.

CUDAGPU architecturePerformance Optimization
0 likes · 21 min read
CUDA Optimization Basics: Understanding GPU Architecture and Warp Scheduling
Deepin Linux
Deepin Linux
Dec 7, 2025 · Fundamentals

How CPU Context Switching Powers Multitasking: From Theory to Code

This article explains the fundamentals of CPU context switching, covering the hardware and kernel mechanisms, step‑by‑step saving and restoring of registers, scheduling algorithms, different types of switches, performance costs, optimization techniques, and includes a complete C++ simulation example.

CPUKernelMultitasking
0 likes · 42 min read
How CPU Context Switching Powers Multitasking: From Theory to Code
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 5, 2025 · Big Data

How EMR Serverless Spark Cut Batch Processing Time by Over 50% for a 600M‑User Platform

This case study details how Qimao leveraged Alibaba Cloud EMR Serverless Spark with Fusion and Celeborn to overcome multi‑business‑line data‑processing challenges, achieving more than 50% faster batch jobs, significant cost reductions, and improved operational flexibility across its 600 million‑user ecosystem.

Data WarehousePerformance OptimizationServerless Spark
0 likes · 9 min read
How EMR Serverless Spark Cut Batch Processing Time by Over 50% for a 600M‑User Platform
Selected Java Interview Questions
Selected Java Interview Questions
Dec 5, 2025 · Databases

Unlock MySQL 8.0’s Hidden Optimization Tricks to Supercharge Your Queries

Discover eight powerful, lesser‑known MySQL 8.0 features—including window functions, descending indexes, generated columns, invisible indexes, hints, resource groups, LATERAL JOIN, and multi‑valued JSON indexes—that can dramatically improve query performance, simplify code, and give you an edge in interviews and production environments.

Generated ColumnsJSONLATERAL JOIN
0 likes · 12 min read
Unlock MySQL 8.0’s Hidden Optimization Tricks to Supercharge Your Queries
Data STUDIO
Data STUDIO
Dec 5, 2025 · Big Data

Why Parquet Is the Default Choice for Big Data Storage

The article explains how Apache Parquet’s columnar layout, multi‑level row‑group structure, projection and predicate push‑down, and advanced compression and encoding make it the high‑performance, space‑efficient storage format that powers modern big‑data ecosystems and tools like Spark, Python pandas, and ClickHouse.

Big DataClickHouseColumnar Storage
0 likes · 11 min read
Why Parquet Is the Default Choice for Big Data Storage
vivo Internet Technology
vivo Internet Technology
Dec 3, 2025 · Mobile Development

How Parallel Loading Supercharges H5 Instant SDK: Challenges, Early Designs, and Optimized Solutions

This article explains the parallel loading technique used in the H5 instant SDK to accelerate page startup, details the three resource‑handoff scenarios that cause synchronization challenges, reviews the early simple design and its drawbacks, and presents an optimized producer‑consumer model with fair locks and bridge streams to eliminate wasteful waiting and memory consumption.

H5 instant SDKJava concurrencyParallel Loading
0 likes · 31 min read
How Parallel Loading Supercharges H5 Instant SDK: Challenges, Early Designs, and Optimized Solutions
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Dec 2, 2025 · Frontend Development

Why React Introduced a Compiler to Automate Performance Optimization

React Compiler 1.0, released in 2023, automatically eliminates the need for manual useMemo, useCallback, and memo by statically analyzing code to cache computations, generate update conditions, and replace these hooks, offering a safer, more comprehensive performance boost across Vite, Webpack, Next.js, and React Native projects.

Next.jsPerformance OptimizationReact
0 likes · 7 min read
Why React Introduced a Compiler to Automate Performance Optimization
Raymond Ops
Raymond Ops
Nov 28, 2025 · Databases

Essential DBA Guide to Enterprise MySQL Architecture, Optimization & Ops

This comprehensive guide equips DBAs with enterprise‑level MySQL strategies, covering master‑slave replication, InnoDB cluster setup, performance tuning parameters, index design, backup and recovery methods, monitoring scripts, security hardening, and emergency response procedures to ensure a stable, high‑performance database environment.

Database AdministrationPerformance Optimizationbackup and recovery
0 likes · 15 min read
Essential DBA Guide to Enterprise MySQL Architecture, Optimization & Ops
Big Data Technology & Architecture
Big Data Technology & Architecture
Nov 28, 2025 · Big Data

What’s New in Apache Paimon 2025? Core Performance, AI Integration & Real‑Time Lakehouse Updates

The 2025 Apache Paimon release brings major performance boosts, AI‑centric multimodal storage, deeper streaming‑batch integration, and broader engine compatibility, detailing query and write optimizations, memory management tweaks, and a unified lake format for structured and unstructured data.

AI integrationApache PaimonBig Data
0 likes · 6 min read
What’s New in Apache Paimon 2025? Core Performance, AI Integration & Real‑Time Lakehouse Updates
Ctrip Technology
Ctrip Technology
Nov 27, 2025 · Big Data

How Ctrip Cut Query Latency by 85% with StarRocks’ Compute‑Storage Separation

Ctrip migrated its massive User Behavior Tracking system from ClickHouse to a compute‑storage separated StarRocks cluster on Kubernetes, achieving millisecond‑level query latency, halving storage usage, reducing node count, and sustaining millions‑of‑rows‑per‑second write throughput while simplifying scaling and operations.

Big DataClickHouseCompute-Storage Separation
0 likes · 15 min read
How Ctrip Cut Query Latency by 85% with StarRocks’ Compute‑Storage Separation
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 25, 2025 · Artificial Intelligence

Why DeepSeek‑V3.2‑Exp Lost Performance and How a Simple RoPE Fix Restored It

The Baidu Baige team discovered that DeepSeek‑V3.2‑Exp’s long‑context performance lagged behind the official report, traced the issue to a subtle RoPE layout mismatch in the open‑source inference demo, collaborated with DeepSeek to fix it, and verified that the model’s speed and accuracy fully recovered across multiple benchmarks.

AI InfrastructureDeepSeekLLM inference
0 likes · 9 min read
Why DeepSeek‑V3.2‑Exp Lost Performance and How a Simple RoPE Fix Restored It
Tencent TDS Service
Tencent TDS Service
Nov 25, 2025 · Mobile Development

What HarmonyOS App Quality Reveals: Crash & Freeze Insights for 2025

The 2025 Bugly HarmonyOS Quality Report analyzes system version adoption, device distribution, crash and freeze rates across industries, identifies top stability issues such as memory leaks and main‑thread blocking, and offers concrete optimization recommendations for developers seeking better app performance.

App StabilityBuglyFreeze Analysis
0 likes · 10 min read
What HarmonyOS App Quality Reveals: Crash & Freeze Insights for 2025
Big Data Tech Team
Big Data Tech Team
Nov 24, 2025 · Big Data

Avoid the 5 Common DWS Design Traps and Build Scalable Data Warehouses

This article analyzes the five typical pitfalls when designing DWS aggregation tables—from chimney‑style schemas to performance blind spots—explains their consequences, and provides concrete, production‑ready recommendations, code examples, and design principles to create reusable, efficient data‑warehouse layers.

DWS DesignData WarehousePerformance Optimization
0 likes · 10 min read
Avoid the 5 Common DWS Design Traps and Build Scalable Data Warehouses
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 22, 2025 · Artificial Intelligence

Why Your RAG System Slows Down Over Time and How to Fix It

The article explains why a production Retrieval‑Augmented Generation (RAG) system becomes slower as it runs—due to growing embedding costs, expanding vector databases, heavier re‑ranking, and larger prompts—and provides concrete engineering optimizations such as batching, async concurrency, caching, partitioned retrieval, HNSW tuning, replica scaling, answer caching, and prompt sparsification to keep performance stable.

AI EngineeringPerformance OptimizationRAG
0 likes · 10 min read
Why Your RAG System Slows Down Over Time and How to Fix It
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Nov 19, 2025 · Artificial Intelligence

Boost LLM Inference Speed with Token‑Level Two‑Chunk Overlap

Token‑level Two‑Chunk Overlap replaces traditional batch‑level Two‑Batch Overlap, dynamically splitting sequences into balanced token chunks, enabling near‑equal compute and communication times, improving GPU utilization and achieving up to 30% throughput gains in heterogeneous request workloads, with zero accuracy loss.

Batch schedulingGPU utilizationLLM inference
0 likes · 9 min read
Boost LLM Inference Speed with Token‑Level Two‑Chunk Overlap
php Courses
php Courses
Nov 18, 2025 · Backend Development

Can PHP Swap Variables Like Reusable Resources? Exploring Performance Gains

The article examines how treating PHP variables as interchangeable reusable resources could reduce memory usage, boost execution speed, and simplify code, while outlining current limitations, possible implementation strategies, challenges, and practical scenarios where such a feature would be beneficial.

Backend DevelopmentPHPPerformance Optimization
0 likes · 6 min read
Can PHP Swap Variables Like Reusable Resources? Exploring Performance Gains
Tech Freedom Circle
Tech Freedom Circle
Nov 18, 2025 · Backend Development

Optimizing Internal HTTP Calls: From Head‑of‑Line Blocking to High‑Performance Microservices

This article dissects the hidden technical debt of internal HTTP APIs, explains why HTTP/1.1 causes head‑of‑line blocking, redundant headers and serialization overhead, and walks through a step‑by‑step, data‑driven optimization roadmap—including HTTP/2, Protobuf, request aggregation, connection pooling, compression, caching, async processing, observability, and safe gray‑release deployment—backed by concrete benchmarks and code samples.

Connection PoolingHTTP/2Microservices
0 likes · 32 min read
Optimizing Internal HTTP Calls: From Head‑of‑Line Blocking to High‑Performance Microservices
IT Services Circle
IT Services Circle
Nov 17, 2025 · Backend Development

Why JSON Slows Down High‑Performance APIs and Which Binary Formats Can Save You Time

The article explains why JSON becomes a bottleneck in latency‑sensitive applications, compares four high‑performance binary serialization formats—Protocol Buffers, FlatBuffers, MessagePack, and CBOR—and provides practical guidance on selecting and deploying the right format to dramatically reduce latency, payload size, and CPU usage.

CBORFlatBuffersMessagePack
0 likes · 11 min read
Why JSON Slows Down High‑Performance APIs and Which Binary Formats Can Save You Time
Sohu Smart Platform Tech Team
Sohu Smart Platform Tech Team
Nov 13, 2025 · Cloud Native

How We Tuned Nacos Config Center to Eliminate Timeouts and QPS Limits

This article explains how Nacos, an open‑source dynamic naming and configuration service, was used in a micro‑service project, the two performance problems encountered—configuration fetch timeouts and server‑side QPS throttling—and the step‑by‑step optimizations (memory caching, fallback values, pre‑fetching and listener registration, and limit adjustments) that resolved them.

Cloud NativeJavaMicroservices
0 likes · 16 min read
How We Tuned Nacos Config Center to Eliminate Timeouts and QPS Limits
vivo Internet Technology
vivo Internet Technology
Nov 12, 2025 · Fundamentals

Linux Kernel Innovations Powering the AI Agent Era – Highlights from China’s 20th CLK

The 20th China Linux Kernel Developers Conference, hosted by vivo, presented eleven technical talks covering AI‑driven kernel challenges, memory‑compression techniques, heterogeneous compression, async file‑cache management, uncached I/O, direct I/O for compressed files, parallel writeback, host‑initiated defragmentation, zoned storage, energy‑efficient I/O, and eBPF‑based CPU idle policies, each with concrete performance results and implementation details.

AIFile SystemsLinux kernel
0 likes · 12 min read
Linux Kernel Innovations Powering the AI Agent Era – Highlights from China’s 20th CLK
JD Retail Technology
JD Retail Technology
Nov 11, 2025 · Backend Development

JIMDB’s Big-Hot Key Solution: Optimizing Distributed Cache Performance

JIMDB, a high‑performance Redis‑based distributed cache, introduces the “Big‑Hot Key” concept to dynamically identify keys that strain CPU or bandwidth, and implements a multi‑layer active governance framework—including real‑time detection, server‑side caching, circuit‑breaker, and client‑side consistency—to dramatically reduce resource consumption and boost throughput.

JimdbPerformance OptimizationResource Management
0 likes · 41 min read
JIMDB’s Big-Hot Key Solution: Optimizing Distributed Cache Performance
DeWu Technology
DeWu Technology
Nov 10, 2025 · Databases

How TiDB v7 Upgrade Boosted Performance and Solved Real‑World Issues

This article details the TiDB upgrade journey from version 5.3.3 to 7.5.x, covering architecture, new v7 features, migration strategies, encountered problems with full‑table scans and inaccurate aggregation plans, the solutions applied, and the measurable performance and stability gains achieved after the upgrade.

Database UpgradeDistributed SQLNewSQL
0 likes · 15 min read
How TiDB v7 Upgrade Boosted Performance and Solved Real‑World Issues
Baidu Geek Talk
Baidu Geek Talk
Nov 10, 2025 · Cloud Native

How Polar‑TCP Breaks Kernel Network Bottlenecks for Cloud‑Native High‑Performance Services

This article explains how traditional kernel network stacks struggle with high‑concurrency, low‑latency cloud data‑center workloads and introduces Baidu Intelligent Cloud’s Polar solution—Polar‑TCP and Polar‑RDMA—which combine user‑space DPDK drivers, a lightweight TCP stack, and an industrial RPC framework to achieve near‑RDMA performance while preserving compatibility with existing TCP ecosystems.

DPDKNetwork StackPerformance Optimization
0 likes · 23 min read
How Polar‑TCP Breaks Kernel Network Bottlenecks for Cloud‑Native High‑Performance Services
Architect's Guide
Architect's Guide
Nov 10, 2025 · Artificial Intelligence

Build a Scalable, High‑Performance OCR Invoice Pipeline with Spring Boot & Tesseract

This article details a complete, production‑grade OCR invoice processing pipeline that combines a distributed Spring Boot microservice architecture, deep Tesseract optimizations, ML‑based data validation, GPU acceleration, Kubernetes deployment, and extensive performance and security strategies to achieve million‑scale daily throughput with high accuracy.

OCRPerformance OptimizationSpring Boot
0 likes · 16 min read
Build a Scalable, High‑Performance OCR Invoice Pipeline with Spring Boot & Tesseract
AndroidPub
AndroidPub
Nov 9, 2025 · Mobile Development

How to Diagnose and Fix Jetpack Compose Performance Pitfalls

Learn how to identify and resolve performance issues in Jetpack Compose by using Layout Inspector, Stability Reports, and configuration files, understanding stable vs unstable parameters, applying strong skipping, and leveraging annotations and wrapper classes to achieve efficient UI recomposition.

AndroidJetpack ComposePerformance Optimization
0 likes · 12 min read
How to Diagnose and Fix Jetpack Compose Performance Pitfalls
Ops Community
Ops Community
Nov 8, 2025 · Operations

Mastering Nginx Reverse Proxy & Load Balancing: Best Practices for High‑Performance Deployments

This comprehensive guide walks you through Nginx reverse proxy and load balancing fundamentals, key features, suitable scenarios, environment prerequisites, step‑by‑step installation, core configuration, performance tuning, security hardening, high‑availability designs, troubleshooting, monitoring, backup strategies, real‑world case studies, and advanced learning paths for production‑grade deployments.

Performance OptimizationSecurityhigh availability
0 likes · 56 min read
Mastering Nginx Reverse Proxy & Load Balancing: Best Practices for High‑Performance Deployments
MaGe Linux Operations
MaGe Linux Operations
Nov 8, 2025 · Backend Development

Mastering Redis Cache: Prevent Penetration, Breakdown, and Avalanche with Proven Solutions

This comprehensive guide explains the three major Redis cache issues—penetration, breakdown, and avalanche—detailing their causes, impacts, and production‑ready solutions such as Bloom filters, distributed locks, logical expiration, random TTL, multi‑level caching, high‑availability setups, monitoring, backup, and best‑practice recommendations.

Performance OptimizationSpring Bootbloom-filter
0 likes · 56 min read
Mastering Redis Cache: Prevent Penetration, Breakdown, and Avalanche with Proven Solutions
Top Architect
Top Architect
Nov 8, 2025 · Backend Development

Mastering High‑Throughput Thread Pools: Strategies for 100k QPS in Java

This article analyzes a high‑traffic scenario where 100,000 QPS requests each require 100 ms processing, explains why a naïve fixed thread pool would exhaust resources, and presents practical optimization goals, strategies, and Spring‑Boot code examples—including custom pool parameters, rejection handling, batch processing, and advanced techniques like Disruptor and rate‑limiting—to build a stable, high‑performance task execution system.

Performance OptimizationSpring BootThreadPool
0 likes · 8 min read
Mastering High‑Throughput Thread Pools: Strategies for 100k QPS in Java
Sohu Tech Products
Sohu Tech Products
Nov 5, 2025 · Cloud Native

How We Optimized Nacos Config Center to Eliminate Timeouts and QPS Limits

This article explains Nacos's role as a dynamic service discovery and configuration platform, describes two real‑world performance problems encountered in production, and details the step‑by‑step code‑level optimizations—memory caching with fallback and pre‑fetching with listeners—that resolved timeout and rate‑limit issues.

Cloud NativeConfiguration ManagementMicroservices
0 likes · 16 min read
How We Optimized Nacos Config Center to Eliminate Timeouts and QPS Limits
Architect
Architect
Nov 4, 2025 · Operations

How to Accurately Track API Calls per Minute: 5 Proven Monitoring Strategies

This article explores why precise per‑minute API call statistics are essential for performance bottleneck detection, capacity planning, security alerts, billing, and troubleshooting, and presents five practical implementations—including fixed‑window counters, sliding windows, AOP‑based interception, Redis time‑series storage, and Micrometer‑Prometheus integration—along with their trade‑offs and capacity‑planning guidelines.

API monitoringJavaMetrics
0 likes · 25 min read
How to Accurately Track API Calls per Minute: 5 Proven Monitoring Strategies
Model Perspective
Model Perspective
Nov 4, 2025 · Fundamentals

Can Math Predict a Joke’s Success? A Modeling Guide to Humor

This article applies mathematical modeling—including surprise indices, timing dynamics, density optimization, and risk assessment—to dissect how stand‑up comedy works, illustrating each concept with real‑world jokes and offering practical formulas for crafting more effective humor.

Performance Optimizationcomedy analysishumor
0 likes · 16 min read
Can Math Predict a Joke’s Success? A Modeling Guide to Humor
DevOps Coach
DevOps Coach
Oct 31, 2025 · Backend Development

How Netflix’s Maestro Engine Gained a 100× Speed Boost with a New Actor‑Based Architecture

Netflix’s Maestro workflow orchestrator was redesigned with a lightweight, stateful actor model and Java virtual threads, cutting engine overhead from seconds to milliseconds, delivering a hundred‑fold performance increase while preserving scalability, reliability, and strong execution guarantees for massive data and ML pipelines.

Distributed SystemsJava virtual threadsNetflix Maestro
0 likes · 28 min read
How Netflix’s Maestro Engine Gained a 100× Speed Boost with a New Actor‑Based Architecture
Java Backend Technology
Java Backend Technology
Oct 31, 2025 · Databases

10 Essential MySQL Tricks to Boost Performance and Simplify Queries

This article presents ten practical MySQL techniques—from using JSON columns for flexible data to leveraging CTEs, window functions, and advanced clauses like WITH ROLLUP and INSERT IGNORE—each illustrated with real‑world scenarios and ready‑to‑run SQL code to improve query efficiency and developer productivity.

CTEDatabase QueriesJSON
0 likes · 11 min read
10 Essential MySQL Tricks to Boost Performance and Simplify Queries
JavaScript
JavaScript
Oct 30, 2025 · Frontend Development

Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps

Web Workers, an often‑overlooked browser API, enable JavaScript to run heavy computations in background threads, freeing the main UI thread, boosting multi‑core utilization, improving memory management, and powering use‑cases such as big data processing, image filtering, AI inference, and encryption, dramatically enhancing frontend performance.

JavaScriptPerformance OptimizationWeb Workers
0 likes · 5 min read
Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps
JD Tech Talk
JD Tech Talk
Oct 30, 2025 · Backend Development

How a Massive Cache Key Crashed a System and How to Prevent It

This article examines a real-world incident where a massive cache key and cache penetration during a high‑traffic promotion caused Redis overload and a system outage, then details root‑cause analysis, mitigation steps like serialization changes, compression, lock‑based fallback, and preventive best practices.

Big KeyPerformance Optimizationcache-penetration
0 likes · 8 min read
How a Massive Cache Key Crashed a System and How to Prevent It
Eric Tech Circle
Eric Tech Circle
Oct 30, 2025 · Frontend Development

How to Slash Mini Program Package Size with Smart Sub‑Package Strategies

This article explains why WeChat Mini Programs hit the 2 MB package limit, analyzes common causes of oversized main bundles, and provides a step‑by‑step guide—including pre‑slimming tactics, subpackage configuration, code examples, and performance tips—to reduce the main package below the limit and improve load speed.

Package SizePerformance OptimizationSubpackage
0 likes · 12 min read
How to Slash Mini Program Package Size with Smart Sub‑Package Strategies
JD Cloud Developers
JD Cloud Developers
Oct 29, 2025 · Mobile Development

How JD Finance Accelerated iOS Chart Rendering with ROMA and ECharts

This article details JD Finance's iOS app chart rendering challenges, the integration of Apache ECharts via a custom ROMA WebView, cache design, data communication between Native and WebView, and performance gains achieved through automated retries and WebView pooling.

EChartsPerformance OptimizationROMA
0 likes · 8 min read
How JD Finance Accelerated iOS Chart Rendering with ROMA and ECharts
Top Architect
Top Architect
Oct 28, 2025 · Backend Development

How I Cut XML‑to‑MySQL Import Time from 300 s to 4 s

This article details a step‑by‑step performance overhaul for importing 60,000+ XML records into MySQL, covering baseline measurements, MySQL batch processing, asynchronous writes with Disruptor, XML parsing optimizations, and tuning MySQL buffers, ultimately reducing total runtime from 300 seconds to just four seconds.

Batch ProcessingDisruptorJava
0 likes · 14 min read
How I Cut XML‑to‑MySQL Import Time from 300 s to 4 s
Raymond Ops
Raymond Ops
Oct 27, 2025 · Operations

Master Linux Network Tuning for High-Concurrency: Practical Guide

This guide walks through a real‑world high‑concurrency Linux scenario, diagnosing TCP state bottlenecks, analyzing default kernel parameters, and providing step‑by‑step sysctl tweaks, queue and buffer adjustments, monitoring scripts, and stress‑test recommendations to dramatically improve connection handling and throughput.

LinuxNetwork TuningPerformance Optimization
0 likes · 9 min read
Master Linux Network Tuning for High-Concurrency: Practical Guide
Code Mala Tang
Code Mala Tang
Oct 26, 2025 · Databases

Why Parameterized Queries Supercharge SQL Performance (And Prevent Injection)

This article explains how using bound (parameterized) queries in SQL Server improves performance by enabling execution plan reuse, reducing compilation overhead and memory usage, while also preventing SQL injection, and discusses potential pitfalls like parameter sniffing and how to mitigate them with OPTION (RECOMPILE).

Parameterized QueriesPerformance OptimizationSQL
0 likes · 6 min read
Why Parameterized Queries Supercharge SQL Performance (And Prevent Injection)
Programmer DD
Programmer DD
Oct 23, 2025 · Backend Development

Why Jackson Serialization Can Drain Your CPU and How to Fix It

The article reveals how default Jackson JSON serialization can become a hidden CPU bottleneck in high‑traffic Spring Boot services, explains why full entity objects inflate serialization time, and presents practical optimizations such as @JsonView, custom ObjectMapper settings, hand‑written serializers, and alternative data formats to dramatically cut latency and cloud costs.

JacksonJavaPerformance Optimization
0 likes · 13 min read
Why Jackson Serialization Can Drain Your CPU and How to Fix It
Alipay Experience Technology
Alipay Experience Technology
Oct 22, 2025 · Mobile Development

How MYKMP Enables One-Code-Three-Platform Development for Alipay’s Mobile Apps

MYKMP is Alipay’s native cross‑platform solution built on Kotlin Multiplatform and Compose, enabling a single codebase to run on Android, iOS and HarmonyOS, with detailed architecture, engineering practices, GC optimizations, and integration guidelines presented for billion‑scale applications.

Cross‑platform developmentGarbage CollectionHarmonyOS
0 likes · 20 min read
How MYKMP Enables One-Code-Three-Platform Development for Alipay’s Mobile Apps
Architect Chen
Architect Chen
Oct 22, 2025 · Big Data

How to Eliminate Kafka Message Backlog with Practical Optimizations

This guide presents concrete techniques for improving Kafka consumer and producer performance, scaling clusters, tuning broker settings, and designing asynchronous buffering layers to prevent message accumulation and boost overall throughput.

Big DataKafkaPerformance Optimization
0 likes · 5 min read
How to Eliminate Kafka Message Backlog with Practical Optimizations
Tech Freedom Circle
Tech Freedom Circle
Oct 20, 2025 · Backend Development

Mastering JVM Tuning: Real-World Enterprise Case Study for Interview Success

The article walks through a high‑traffic video service that suffered GC spikes, details a systematic diagnosis of three JVM configuration flaws, evaluates four GC tuning schemes across load scenarios, resolves CMS‑related pauses, and presents concrete performance gains with metrics, code snippets, and visual charts.

CMSGC tuningJVM
0 likes · 28 min read
Mastering JVM Tuning: Real-World Enterprise Case Study for Interview Success
Alibaba Cloud Observability
Alibaba Cloud Observability
Oct 20, 2025 · Artificial Intelligence

How We Boosted Embedding Throughput 16× and Cut Vector Index Costs in a Cloud‑Native Setup

This article examines the high cost and low throughput of embedding vectors in log‑processing scenarios, analyzes the performance bottlenecks of inference frameworks, and details a series of cloud‑native optimizations—including switching to vLLM, deploying multiple model replicas with Triton, decoupling tokenization, and priority queuing—that together raise throughput by 16× and reduce per‑token pricing by two orders of magnitude.

EmbeddingGPU inferencePerformance Optimization
0 likes · 9 min read
How We Boosted Embedding Throughput 16× and Cut Vector Index Costs in a Cloud‑Native Setup
Code Ape Tech Column
Code Ape Tech Column
Oct 20, 2025 · Backend Development

Unlock Redis Performance: Master Lua Scripting in Spring Boot

This tutorial explains how to integrate Lua scripts with Spring Boot and Redis, covering Lua fundamentals, advantages, real‑world use cases, step‑by‑step implementation in Spring Boot, performance gains, error handling, security measures, and best practices for reliable backend development.

Backend DevelopmentLua scriptingPerformance Optimization
0 likes · 21 min read
Unlock Redis Performance: Master Lua Scripting in Spring Boot
ITPUB
ITPUB
Oct 18, 2025 · Backend Development

How We Eliminated GC Pauses in a 100k QPS Service: Deep Dive into JVM Tuning

This article details the step‑by‑step investigation and JVM‑level optimizations—including early tenuring, parameter tuning, ZGC migration, and an Eden‑pre‑heat trick—that transformed a high‑concurrency, low‑latency system from 95% to 99.995% success during massive index switches.

GC tuningJVMJava
0 likes · 22 min read
How We Eliminated GC Pauses in a 100k QPS Service: Deep Dive into JVM Tuning
macrozheng
macrozheng
Oct 16, 2025 · Backend Development

How to Turn a 3‑Second Category Tree into 30 ms: A Real‑World Backend Optimization

This article analyzes a severe N+1 query performance disaster in a Java Spring Boot project, explains why the traditional recursive approach is slow, and presents a production‑tested solution that reduces database calls to one, uses O(n) tree construction, and adds multi‑level caching to achieve a 100‑fold speedup.

Java backendN+1 QueryPerformance Optimization
0 likes · 24 min read
How to Turn a 3‑Second Category Tree into 30 ms: A Real‑World Backend Optimization
Ray's Galactic Tech
Ray's Galactic Tech
Oct 14, 2025 · Backend Development

15 Proven Spring Boot Performance Hacks to Supercharge Your Apps

This guide presents fifteen practical Spring Boot performance techniques—from lazy initialization and context indexing to JVM tuning, connection‑pool tweaks, caching strategies, JSON serialization, and native image compilation—providing concrete configurations, code snippets, and best‑practice advice for faster startup, lower memory usage, and higher throughput.

JVMJavaPerformance Optimization
0 likes · 11 min read
15 Proven Spring Boot Performance Hacks to Supercharge Your Apps
StarRocks
StarRocks
Oct 14, 2025 · Big Data

How Ctrip Scaled UBT Analytics by Migrating from ClickHouse to StarRocks

Ctrip's User Behavior Tracking (UBT) system, handling 30 TB of daily data, moved from ClickHouse to StarRocks' compute‑storage separated architecture, cutting average query latency from 1.4 seconds to 203 ms, halving storage, reducing nodes from 50 to 40, and boosting write throughput to 3 million rows per second.

Big DataClickHouseData Migration
0 likes · 15 min read
How Ctrip Scaled UBT Analytics by Migrating from ClickHouse to StarRocks
Java Tech Enthusiast
Java Tech Enthusiast
Oct 14, 2025 · Backend Development

How to Build a High‑Performance MyBatis Log Analyzer with Spring Boot

This article describes the design and implementation of a high‑performance log‑analysis system for massive MyBatis logs, covering problem analysis, technology selection, architecture, multithreaded parsing, flexible regex templates, performance monitoring, pitfalls, optimization practices, results, and future plans.

JavaMyBatisPerformance Optimization
0 likes · 13 min read
How to Build a High‑Performance MyBatis Log Analyzer with Spring Boot
JavaScript
JavaScript
Oct 14, 2025 · Frontend Development

Boost JavaScript Async Performance by Up to 80% with New Promise Techniques

While async/await simplifies JavaScript code, it can introduce significant overhead in high‑frequency or compute‑heavy scenarios; this article introduces alternative async patterns—optimized Promise chaining, parallel Promise.all, batch processing, and pooling—that can reduce context switches and deliver performance gains of up to 80%.

JavaScriptParallelismPerformance Optimization
0 likes · 5 min read
Boost JavaScript Async Performance by Up to 80% with New Promise Techniques
Baidu Geek Talk
Baidu Geek Talk
Oct 13, 2025 · Big Data

How Baidu Scaled Its Data Warehouse to Handle Billions of PVs and Petabytes

This article details Baidu APP's massive data‑warehouse overhaul, describing the two‑step strategy that stabilized log cleaning, modernized the ETL framework, introduced wide‑table architectures, and implemented tiered storage to dramatically improve processing speed, reliability, and cost efficiency for petabyte‑scale workloads.

Big DataData WarehouseETL
0 likes · 25 min read
How Baidu Scaled Its Data Warehouse to Handle Billions of PVs and Petabytes
Deepin Linux
Deepin Linux
Oct 9, 2025 · Fundamentals

Why Memory Alignment Is Critical for Linux Performance and Stability

This article explains how memory alignment in Linux—driven by hardware granularity, kernel allocation policies, and compiler optimizations—affects struct layout, cross‑platform compatibility, and execution speed, and provides practical techniques and testing methods to ensure optimal performance and reliability.

C programmingLinuxPerformance Optimization
0 likes · 37 min read
Why Memory Alignment Is Critical for Linux Performance and Stability
Code Ape Tech Column
Code Ape Tech Column
Oct 8, 2025 · Databases

Boost Your Data Ingestion: A High‑Performance Java Stream Load Architecture for Doris

This article presents a complete Java‑based architecture for high‑throughput Doris stream loading, covering project structure, Maven dependencies, configuration properties, field‑mapping annotations, automatic mapper utilities, a robust parallel loader with retry and compression, plus performance tuning recommendations.

Annotation MappingJavaPerformance Optimization
0 likes · 23 min read
Boost Your Data Ingestion: A High‑Performance Java Stream Load Architecture for Doris
ITPUB
ITPUB
Oct 5, 2025 · Backend Development

How to Clear a 10‑Million‑Message Queue in 5 Hours: A Five‑Step Rescue Plan

When a flash‑sale causes a 10 million‑message backlog and consumers only process 200 messages per second, this guide shows a five‑step, 5‑hour strategy—horizontal scaling, message downgrade, flow control, temporary dump, and parallel blasting—to restore throughput and prevent system collapse.

Distributed SystemsKafkaPerformance Optimization
0 likes · 6 min read
How to Clear a 10‑Million‑Message Queue in 5 Hours: A Five‑Step Rescue Plan
IT Architects Alliance
IT Architects Alliance
Oct 3, 2025 · Backend Development

Designing High‑Performance API Gateways for Microservices: Best Practices & Code Samples

This article explores why API gateways are essential in microservice architectures, outlines core design functions such as routing, load balancing, authentication, rate limiting, and protocol translation, and provides practical code examples, performance‑tuning strategies, technology comparisons, and deployment guidelines for robust backend systems.

Backend DevelopmentJavaKubernetes
0 likes · 11 min read
Designing High‑Performance API Gateways for Microservices: Best Practices & Code Samples
IT Services Circle
IT Services Circle
Oct 3, 2025 · Databases

Why Does PostgreSQL’s B‑Tree Index Behave Like a B+ Tree?

PostgreSQL’s index system, officially called a B‑Tree, actually implements B+‑tree functionality, storing only TIDs in leaf nodes, using linked leaf pages for efficient range queries, and includes optimizations like deduplication, index‑only scans, and reverse‑key indexes to boost performance.

B+TreeB-TreePerformance Optimization
0 likes · 7 min read
Why Does PostgreSQL’s B‑Tree Index Behave Like a B+ Tree?
ITPUB
ITPUB
Oct 3, 2025 · Big Data

How Qunar Travel Cut 2000 CPU Cores by Optimizing Kafka Production

This case study details how Qunar Travel's engineering team analyzed Kafka production bottlenecks during peak traffic, added targeted monitoring, tuned thread and batch parameters, and validated the changes through gray‑scale tests, ultimately saving about 2000 CPU cores across three clusters while reducing request volume and improving network and disk utilization.

Big DataCPU SavingsKafka
0 likes · 14 min read
How Qunar Travel Cut 2000 CPU Cores by Optimizing Kafka Production
JavaScript
JavaScript
Oct 2, 2025 · Frontend Development

Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps

This article explains why Web Workers, an often‑overlooked browser API, can free the main thread, leverage multi‑core CPUs, and improve memory management, offering concrete scenarios and code examples to dramatically boost JavaScript performance in modern front‑end applications.

JavaScriptPerformance OptimizationWeb Workers
0 likes · 5 min read
Unlock Hidden Performance: How Web Workers Supercharge Frontend Apps
Python Programming Learning Circle
Python Programming Learning Circle
Sep 30, 2025 · Fundamentals

How to Supercharge Your Python Code: Proven Performance Optimization Techniques

This comprehensive guide walks you through Python performance optimization, covering profiling, algorithmic improvements, data‑structure choices, code‑level tricks, concurrency, memory management, third‑party libraries and a practical checklist to ensure your programs run faster and more efficiently.

Memory ManagementPerformance OptimizationProfiling
0 likes · 18 min read
How to Supercharge Your Python Code: Proven Performance Optimization Techniques
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Sep 30, 2025 · Artificial Intelligence

Boost AI Model Performance: Master Host‑Device Scheduling on Ascend Platforms

This article explains how CPUs and Ascend AI processors cooperate as host and device, compares sink and host scheduling modes, defines Host‑Bound and Device‑Bound models, and presents optimization techniques such as tiling cache, multi‑core concurrency, and small‑shape operator handling that dramatically improve AI model execution efficiency.

AIModel SchedulingPerformance Optimization
0 likes · 12 min read
Boost AI Model Performance: Master Host‑Device Scheduling on Ascend Platforms
Kuaishou Frontend Engineering
Kuaishou Frontend Engineering
Sep 28, 2025 · Frontend Development

Inside Kuaishou’s Low‑Code ‘Jimu’ Platform: Solving Large‑Scale Event Page Challenges

This technical article examines Kuaishou’s low‑code “Jimu” platform, detailing its architecture, performance bottlenecks, and optimization strategies that enable efficient, high‑quality construction of massive activity pages such as the 2025 CNY campaign.

Component ArchitecturePerformance Optimizationlow-code
0 likes · 19 min read
Inside Kuaishou’s Low‑Code ‘Jimu’ Platform: Solving Large‑Scale Event Page Challenges
Tech Freedom Circle
Tech Freedom Circle
Sep 25, 2025 · Artificial Intelligence

RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking

The article provides a detailed technical analysis of RAGFlow's search engine, covering the Searcher class coordination, adaptive multi‑path retrieval (vector, keyword, and knowledge‑graph), intelligent fusion with weighted scoring, caching, performance monitoring, and both built‑in and model‑driven reranking to achieve high‑precision results.

Performance OptimizationRAGFlowReranking
0 likes · 32 min read
RAGFlow Search Engine Deep Dive: Multi‑Path Retrieval, Fusion, and Reranking
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Sep 24, 2025 · Big Data

How 3 Simple Tweaks Doubled Elasticsearch Scan Performance on 40M Docs

The article details a real‑world case of scanning over 40 million Elasticsearch documents, identifies four performance bottlenecks, and presents three concrete optimizations—_source filtering, precise index targeting, and batch‑size tuning—that together cut processing time in half and raise CPU utilization from 25% to 85%.

Batch Size TuningElasticsearchIncremental Sync
0 likes · 8 min read
How 3 Simple Tweaks Doubled Elasticsearch Scan Performance on 40M Docs
Deepin Linux
Deepin Linux
Sep 23, 2025 · Fundamentals

Mastering Linux Memory Allocation: When to Use brk vs mmap

This article explains the low‑level mechanisms of brk and mmap in Linux, compares their characteristics, shows why malloc selects one over the other based on allocation size, and provides practical code examples, performance tips, and common pitfalls for developers.

LinuxMemory ManagementPerformance Optimization
0 likes · 29 min read
Mastering Linux Memory Allocation: When to Use brk vs mmap
MaGe Linux Operations
MaGe Linux Operations
Sep 22, 2025 · Operations

10 Proven Nginx Tweaks to Turn Your Web Server from Slow to Lightning Fast

This guide walks through ten practical Nginx optimization techniques—from worker process tuning and connection handling to gzip compression, static file caching, load‑balancing, security hardening, logging, memory tuning, HTTP/2/SSL tweaks, monitoring scripts, and common pitfalls—helping you dramatically boost throughput, reduce latency, and improve stability in high‑traffic environments.

NginxPerformance OptimizationWeb server
0 likes · 14 min read
10 Proven Nginx Tweaks to Turn Your Web Server from Slow to Lightning Fast
php Courses
php Courses
Sep 22, 2025 · Fundamentals

Boost C++ Performance: Proven Memory Management Techniques You Must Use

This article explores why efficient memory management is crucial for C++ performance and presents practical strategies—including stack allocation, RAII, smart pointers, custom memory pools, optimal containers, move semantics, and diagnostic tools—to write faster, more robust programs.

CPerformance OptimizationRAII
0 likes · 9 min read
Boost C++ Performance: Proven Memory Management Techniques You Must Use
Su San Talks Tech
Su San Talks Tech
Sep 22, 2025 · Backend Development

Mastering Pagination in Billion‑Row Sharded Databases: Interview‑Ready Strategies

This article dissects the challenges of implementing pagination on billion‑row sharded tables, explains common sharding strategies, compares SDK, Proxy, and Sidecar architectures, and presents practical solutions such as global query, infinite scroll, two‑phase queries, index tables, and external storage to help engineers ace interview questions.

Performance Optimizationbackend interviewdistributed databases
0 likes · 24 min read
Mastering Pagination in Billion‑Row Sharded Databases: Interview‑Ready Strategies
MaGe Linux Operations
MaGe Linux Operations
Sep 20, 2025 · Databases

Master Redis Memory Troubleshooting: From Basics to Advanced Solutions

This comprehensive guide walks you through diagnosing and resolving Redis memory issues, covering the underlying architecture, common pitfalls such as memory leaks and fragmentation, practical diagnostic commands, automated monitoring scripts, and optimization techniques to prevent costly outages and improve performance.

Memory ManagementPerformance Optimizationredis
0 likes · 26 min read
Master Redis Memory Troubleshooting: From Basics to Advanced Solutions
Model Perspective
Model Perspective
Sep 19, 2025 · Fundamentals

Unlocking Your Body’s Clock: How Biological Rhythms Shape Performance

This article explains how human physiological, emotional, and intellectual abilities follow predictable biological rhythms, describes sinusoidal mathematical models for these cycles, and shows practical applications such as athlete scheduling and student study planning, while also discussing model extensions and limitations.

Performance Optimizationbiological rhythmschronobiology
0 likes · 8 min read
Unlocking Your Body’s Clock: How Biological Rhythms Shape Performance