Tagged articles
211 articles
Page 1 of 3
Java Architect Essentials
Java Architect Essentials
May 16, 2026 · Game Development

How 128 KB Contra Managed a Long Storyline on the NES

Despite being limited to just 128 KB of ROM, the 1987 NES game Contra achieved a surprisingly long storyline by exploiting tile‑based graphics, tightly packed audio data generated by the Ricoh 2A03 chip, and minimalist code that leveraged the console’s built‑in hardware functions.

Audio SynthesisContraMemory Optimization
0 likes · 6 min read
How 128 KB Contra Managed a Long Storyline on the NES
21CTO
21CTO
May 11, 2026 · Artificial Intelligence

How jcode Runs 10‑20 AI Agents on an 8 GB Laptop with Rust

jcode, a Rust‑based AI agent framework, uses only 27.8 MB per agent and 14 ms startup time, enabling 10‑20 concurrent agents on an 8 GB laptop, outperforming Claude Code, GitHub Copilot CLI and other Python‑based solutions in memory, speed, and scalability.

AI agentsMemory OptimizationMulti-Agent
0 likes · 11 min read
How jcode Runs 10‑20 AI Agents on an 8 GB Laptop with Rust
Deepin Linux
Deepin Linux
May 5, 2026 · Fundamentals

Master Linux Memory Performance with HugePages

Linux’s default 4 KB pages cause massive page tables and TLB misses in high‑memory workloads; this article explains the HugePage mechanism, its types, how it reduces page‑table entries, improves TLB hit rates, lowers fragmentation, and provides step‑by‑step configuration for static and transparent huge pages in production.

LinuxMemory OptimizationTLB
0 likes · 39 min read
Master Linux Memory Performance with HugePages
Data Party THU
Data Party THU
Apr 20, 2026 · Artificial Intelligence

How MemPO Uses Reinforcement Learning to Turn Agent Memory into a Trainable Policy

MemPO introduces a self‑memory policy optimization framework that lets long‑horizon LLM agents autonomously manage and refine their memory via reinforcement learning, using global‑trajectory and informative‑memory advantage estimates, achieving up to 25.98% F1 gain and 73% token reduction on benchmark tasks.

BenchmarkLLMLong-Horizon Agents
0 likes · 8 min read
How MemPO Uses Reinforcement Learning to Turn Agent Memory into a Trainable Policy
Deepin Linux
Deepin Linux
Apr 19, 2026 · Fundamentals

How Linux Fork Uses Copy‑On‑Write to Boost Process Creation Efficiency

This article explains the fork system call in the Linux kernel, details the copy‑on‑write (COW) mechanism that underpins its efficiency, provides code examples, and explores practical scenarios and performance implications for process creation, memory usage, and concurrent server programming.

C programmingCOWCopy-on-Write
0 likes · 36 min read
How Linux Fork Uses Copy‑On‑Write to Boost Process Creation Efficiency
SuanNi
SuanNi
Apr 12, 2026 · Artificial Intelligence

How MemPO Gives AI Agents Long‑Term Memory and Cuts Costs by 70%

The paper introduces MemPO, a self‑memory strategy optimization algorithm that lets large language model agents actively manage their memory, dramatically improving accuracy on complex multi‑step tasks while reducing token consumption by up to 73%, and validates the approach with extensive experiments and analysis.

AILong-term MemoryMemory Optimization
0 likes · 11 min read
How MemPO Gives AI Agents Long‑Term Memory and Cuts Costs by 70%
ShiZhen AI
ShiZhen AI
Apr 2, 2026 · Artificial Intelligence

How KV Cache Works and Why Large Model Outputs Cost Five Times More Than Inputs

The article explains the KV Cache mechanism that stores previously computed key/value vectors to avoid redundant Transformer calculations, delivering roughly a 5× speedup, while also detailing why generating output tokens is far more expensive than processing input tokens due to serial generation and memory trade‑offs.

KV cacheLLM inferenceMemory Optimization
0 likes · 9 min read
How KV Cache Works and Why Large Model Outputs Cost Five Times More Than Inputs
Data STUDIO
Data STUDIO
Mar 20, 2026 · Fundamentals

Save Up to 80% Memory in Python with 5 Built‑In Tricks

The article shows how to diagnose and dramatically cut Python’s memory usage by using built‑in tools such as sys.getsizeof, psutil, __slots__, generator expressions, memory‑mapped files (mmap) and string interning, providing concrete code examples, benchmarks and practical tips to avoid common pitfalls.

Memory OptimizationPythonString Interning
0 likes · 15 min read
Save Up to 80% Memory in Python with 5 Built‑In Tricks
Data Party THU
Data Party THU
Feb 28, 2026 · Artificial Intelligence

How MIT’s Attention Matching Turns Linear Regression into Fast KV Compression

The article explains MIT’s Attention Matching technique that reformulates large‑model context compression as a linear regression problem, detailing its theoretical foundations, three‑step gradient‑free implementation, architectural adaptations, non‑uniform budgeting, and extensive evaluations showing orders‑of‑magnitude speed gains with minimal accuracy loss.

Attention MatchingKV compressionMemory Optimization
0 likes · 10 min read
How MIT’s Attention Matching Turns Linear Regression into Fast KV Compression
IT Services Circle
IT Services Circle
Feb 13, 2026 · Fundamentals

5 Proven Python Memory‑Optimization Patterns to Slash RAM Usage

Learn five practical Python techniques—streaming large files, using generator pipelines, leveraging __slots__, avoiding temporary objects in loops, and reusing buffers—that together can reduce memory consumption by up to 70% and dramatically improve performance when processing gigabyte‑scale datasets.

GeneratorsMemory OptimizationPython
0 likes · 9 min read
5 Proven Python Memory‑Optimization Patterns to Slash RAM Usage
HyperAI Super Neural
HyperAI Super Neural
Feb 11, 2026 · Artificial Intelligence

Reduce Memory by 75% Using D‑CHAG’s Cross‑Channel Hierarchical Aggregation

Researchers at Oak Ridge National Laboratory introduced D‑CHAG, a distributed cross‑channel hierarchical aggregation method that cuts memory consumption by up to 75% and more than doubles throughput when training massive multi‑channel foundation models on up to 1,024 AMD GPUs, as demonstrated on hyperspectral imaging and weather‑forecasting workloads.

D-CHAGDistributed TrainingMemory Optimization
0 likes · 14 min read
Reduce Memory by 75% Using D‑CHAG’s Cross‑Channel Hierarchical Aggregation
Data STUDIO
Data STUDIO
Feb 9, 2026 · Fundamentals

5 Python Memory‑Optimization Patterns That Cut Usage by 70%

The article walks through five concrete Python techniques—streaming file reads, generator expressions, __slots__, avoiding temporary objects in loops, and reusing buffers—showing code examples and measured memory reductions that together lowered overall RAM consumption by about 70%.

GeneratorsMemory OptimizationProfiling
0 likes · 9 min read
5 Python Memory‑Optimization Patterns That Cut Usage by 70%
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 12, 2026 · Artificial Intelligence

How CodeGenius Re‑engineered Memory to Tame AI Agent Context Bloat

This article explains how the rapid evolution of AI agents caused context explosion, why the original fixed‑window memory failed, and how CodeGenius introduced a layered memory system that unloads stale data, deduplicates files, generates structural summaries, and dynamically compresses dialogue to keep prompts stable, reduce token cost, and improve task continuity.

AI AgentContext managementLLM cost reduction
0 likes · 18 min read
How CodeGenius Re‑engineered Memory to Tame AI Agent Context Bloat
Deepin Linux
Deepin Linux
Dec 25, 2025 · Operations

How to Optimize Linux Thread Stack Memory for High‑Concurrency Services

This article explains the fundamentals of Linux thread stack memory, identifies why default stack sizes can cause waste or overflow in high‑concurrency scenarios, and provides practical techniques—including stack‑size tuning, code refactoring, and memory‑mapping—to reduce memory usage and improve service stability.

CLinuxMemory Optimization
0 likes · 34 min read
How to Optimize Linux Thread Stack Memory for High‑Concurrency Services
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Dec 21, 2025 · Artificial Intelligence

Why KV Caching Is Critical for Efficient LLM Inference

The article breaks down the principles of KV caching in large language models, explaining how Q/K/V behavior differs between training and inference, the role of prompts, cache size trade‑offs, and the complexities of concurrent inference, all backed by concrete examples and references.

Concurrent InferenceKV cacheLLM inference
0 likes · 7 min read
Why KV Caching Is Critical for Efficient LLM Inference
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Dec 11, 2025 · Artificial Intelligence

Fine‑Grained Activation Offloading: Cutting Memory Use While Preserving LLM Throughput

The article introduces a fine‑grained activation offloading technique implemented in Megatron‑Core that offloads module‑level activations to CPU, overlaps transfer with computation, and remains compatible with pipeline and virtual pipeline parallelism, dramatically reducing peak GPU memory for large language models while incurring minimal throughput loss.

LLMMegatronMemory Optimization
0 likes · 18 min read
Fine‑Grained Activation Offloading: Cutting Memory Use While Preserving LLM Throughput
Top Architect
Top Architect
Dec 8, 2025 · Backend Development

How Much Heap Memory Does Each SpringBoot HTTP Request Really Use?

This article explains how to measure the exact heap memory consumption of individual SpringBoot HTTP and RPC requests by calculating required memory from request size, configuring JVM GC parameters, creating a JMeter test plan, collecting GC logs, and analyzing results to identify optimization opportunities.

JMeterJVMMemory Optimization
0 likes · 10 min read
How Much Heap Memory Does Each SpringBoot HTTP Request Really Use?
Liangxu Linux
Liangxu Linux
Nov 13, 2025 · Fundamentals

Boost Embedded Performance: 10 Proven C Code Optimization Tricks

This article presents a collection of practical embedded‑system optimization techniques, covering time‑efficiency improvements such as avoiding floating‑point arithmetic and inlining functions, space‑efficiency strategies like choosing appropriate data types, using unions and flexible arrays, as well as loop unrolling, bit‑field usage, and data‑type selection to maximize performance on resource‑constrained devices.

Memory Optimizationembeddedlow‑level
0 likes · 10 min read
Boost Embedded Performance: 10 Proven C Code Optimization Tricks
Xiao Liu Lab
Xiao Liu Lab
Oct 28, 2025 · Operations

How to Safely Optimize Tomcat Memory for Stable Production

This guide walks you through why Tomcat's default JVM settings are insufficient, common pitfalls when manually editing catalina.sh, and the recommended, maintainable approach using setenv.sh to configure heap, Metaspace, and GC options for reliable, high‑performance server operation.

JVMMemory OptimizationTomcat
0 likes · 7 min read
How to Safely Optimize Tomcat Memory for Stable Production
MaGe Linux Operations
MaGe Linux Operations
Sep 20, 2025 · Operations

Master Tomcat Memory Tuning: Boost Performance Up to 300%

This comprehensive guide shows how a seasoned operations engineer can dramatically improve Tomcat stability and throughput by understanding JVM memory structures, configuring optimal JVM and connector parameters, selecting the right garbage collector, and implementing robust monitoring and diagnostics to prevent crashes and latency.

JVMMemory OptimizationServer Configuration
0 likes · 15 min read
Master Tomcat Memory Tuning: Boost Performance Up to 300%
macrozheng
macrozheng
Sep 18, 2025 · Fundamentals

How Bitmaps and Bloom Filters Slash Memory Usage for Massive Datasets

This article explains how using a bitmap can reduce the memory needed to store billions of integers from about 15 GB to under 500 MB, describes the bitmap concept, introduces Bloom filters, outlines their principles, advantages, common use cases, and provides Java and Redis code examples for implementation.

BitmapData StructuresJava
0 likes · 11 min read
How Bitmaps and Bloom Filters Slash Memory Usage for Massive Datasets
Python Crawling & Data Mining
Python Crawling & Data Mining
Sep 17, 2025 · Fundamentals

Unlock Massive Memory Savings in Python with __slots__: A Complete Guide

This article explains how Python's default class memory overhead can be dramatically reduced using the __slots__ magic attribute, providing detailed code examples, memory and speed benchmarks, practical use cases, limitations, and best‑practice recommendations for developers who need high‑performance, memory‑efficient objects.

Memory OptimizationPython__slots__
0 likes · 12 min read
Unlock Massive Memory Savings in Python with __slots__: A Complete Guide
Su San Talks Tech
Su San Talks Tech
Sep 11, 2025 · Databases

Why Redis Handles Millions of Requests: Deep Dive into Its High‑Performance Architecture

This article explains how Redis achieves ultra‑high concurrency by combining a single‑threaded event‑driven core, highly optimized in‑memory data structures, efficient I/O multiplexing, smart memory management, asynchronous persistence, and scalable clustering, allowing it to comfortably serve hundreds of thousands of simultaneous requests.

Event-drivenMemory OptimizationPersistence
0 likes · 16 min read
Why Redis Handles Millions of Requests: Deep Dive into Its High‑Performance Architecture
Data Party THU
Data Party THU
Sep 4, 2025 · Artificial Intelligence

How MXFP4 Quantization Lets a 1200‑Billion‑Parameter LLM Run on a Single 80GB GPU

This article analyzes the memory bottleneck of massive language models, explains the mathematical modeling of memory requirements, evaluates traditional sharding limits, and details how GPT‑OSS’s MXFP4 quantization combined with Mixture‑of‑Experts reduces memory, bandwidth, and compute demands enough to fit a 1200‑billion‑parameter model onto an 80 GB GPU with minimal accuracy loss.

FP4LLMMXFP4
0 likes · 11 min read
How MXFP4 Quantization Lets a 1200‑Billion‑Parameter LLM Run on a Single 80GB GPU
21CTO
21CTO
Aug 19, 2025 · Backend Development

How Generators Slash PHP Memory Usage When Processing Massive CSV Files

This article explains how PHP lazy evaluation using generators and the Iterator API can dramatically reduce memory consumption when loading huge CSV files, provides side‑by‑side code examples and benchmarks, and offers guidance on when to choose each approach for real‑world data processing tasks.

GeneratorsLazy EvaluationMemory Optimization
0 likes · 7 min read
How Generators Slash PHP Memory Usage When Processing Massive CSV Files
ITPUB
ITPUB
Jul 29, 2025 · Big Data

How to Deduplicate 4 Billion QQ IDs Using a Bitmap Within 1 GB Memory

Learn how to efficiently remove duplicates from 4 billion QQ numbers using a memory‑friendly Bitmap approach that fits within a 1 GB limit, including calculations, step‑by‑step implementation, Java code, and a discussion of its advantages and drawbacks.

Big DataBitmapData Structures
0 likes · 9 min read
How to Deduplicate 4 Billion QQ IDs Using a Bitmap Within 1 GB Memory
Deepin Linux
Deepin Linux
Jul 24, 2025 · Fundamentals

Boosting Large-Scale std::vector Performance: Memory, Moves, and SIMD

This article examines why std::vector can become a bottleneck when handling millions of elements, analyzes memory consumption, insertion/deletion costs, and cache behavior, and presents practical optimizations such as pre‑allocation, move semantics, SIMD vectorization, and cache‑friendly designs illustrated with real‑world case studies and code examples.

CMemory OptimizationSIMD
0 likes · 21 min read
Boosting Large-Scale std::vector Performance: Memory, Moves, and SIMD
AntData
AntData
Jul 8, 2025 · Artificial Intelligence

How RaBitQ Achieves 32× Vector Compression Without Sacrificing Accuracy

This article explains the challenges of high‑dimensional vector retrieval, introduces quantization techniques—especially the binary RaBitQ method and its MRQ extension—detailing their compression ratios, speed gains, compatibility with search indexes, and real‑world performance results in the VSAG system.

AI embeddingsMRQMemory Optimization
0 likes · 15 min read
How RaBitQ Achieves 32× Vector Compression Without Sacrificing Accuracy
macrozheng
macrozheng
Jul 3, 2025 · Game Development

How 128KB Powered Classic Games Like Contra – Secrets of Retro Game Tech

This article explores why early 1980s games such as Contra could deliver rich graphics, music, and gameplay within just 128KB of memory, explaining tile‑based rendering, limited audio chip capabilities, and ultra‑compact code techniques that made such feats possible.

Game DevelopmentGraphicsMemory Optimization
0 likes · 7 min read
How 128KB Powered Classic Games Like Contra – Secrets of Retro Game Tech
Tencent Cloud Developer
Tencent Cloud Developer
Jul 1, 2025 · Game Development

Mastering Memory Optimization for Unity Mobile Games: Practical Steps and Tools

This article explains why memory optimization is critical for Unity mobile games, outlines a three‑step workflow of testing, analysis, and fixing, details practical testing methods, data‑analysis toolchains, and concrete optimization techniques such as resource LOD, array pre‑allocation, string deduplication, compression, and atlas management, and concludes with a concise checklist for effective memory management.

Memory OptimizationToolchainUnity
0 likes · 16 min read
Mastering Memory Optimization for Unity Mobile Games: Practical Steps and Tools
Tencent Cloud Developer
Tencent Cloud Developer
May 21, 2025 · Operations

How Tencent Docs Cut Desktop Memory Usage by 60% with Tombstone and Page‑Freeze Techniques

This article details the memory‑bloat problem of the Electron‑based Tencent Docs desktop client, outlines several optimization ideas, explains the implemented tombstone mechanism and Windows‑level memory cleaning plus Chromium page‑freezing, and presents benchmark results showing a roughly 60% reduction in RAM consumption.

Desktop ApplicationElectronMemory Optimization
0 likes · 11 min read
How Tencent Docs Cut Desktop Memory Usage by 60% with Tombstone and Page‑Freeze Techniques
Architect
Architect
May 18, 2025 · Artificial Intelligence

How Much GPU Memory Can One Model Use? A Deep Dive into Transformer Memory Accounting

This article breaks down GPU memory consumption for large Transformer models, explains how to estimate each component—parameters, optimizer state, activations, gradients—and shows how parallelism, mixed precision, and recomputation strategies can dramatically reduce the footprint.

AI trainingGPU MemoryMemory Optimization
0 likes · 14 min read
How Much GPU Memory Can One Model Use? A Deep Dive into Transformer Memory Accounting
Code Mala Tang
Code Mala Tang
May 17, 2025 · Backend Development

How Lazy Imports Can Slash Python Startup Time by 80%

Learn how Python's eager import mechanism can cause slow startup in large projects and discover practical lazy import techniques—including function-level imports, custom LazyLoader classes, context manager approaches, and standard library solutions—that dramatically reduce launch time and memory usage while maintaining IDE support.

Lazy ImportMemory OptimizationStartup Time
0 likes · 11 min read
How Lazy Imports Can Slash Python Startup Time by 80%
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Apr 27, 2025 · Databases

Why MySQL Memory Stays High and How to Optimize It

This article explains MySQL's memory architecture, why memory usage often stays high after spikes, and provides practical steps—including connection checks, slow query analysis, workload scaling, and switching to jemalloc—to diagnose and reduce memory consumption on 360's internal cloud platform.

Connection ManagementDatabase PerformanceMemory Optimization
0 likes · 7 min read
Why MySQL Memory Stays High and How to Optimize It
macrozheng
macrozheng
Apr 7, 2025 · Big Data

How to Deduplicate 4 Billion QQ Numbers Using a Bitmap Under 1 GB

This article explains how to efficiently remove duplicates from 4 billion QQ numbers within a 1 GB memory limit by replacing the naïve HashSet approach with a memory‑saving Bitmap data structure, complete with calculations, algorithm steps, Java code, and a discussion of its pros and cons.

BitmapJavaMemory Optimization
0 likes · 9 min read
How to Deduplicate 4 Billion QQ Numbers Using a Bitmap Under 1 GB
php Courses
php Courses
Mar 27, 2025 · Fundamentals

Understanding Python List Comprehensions and Generator Expressions

This article explores Python's list comprehensions and generator expressions, detailing their syntax, performance characteristics, memory usage, multi‑level nesting, and practical tips such as dictionary/set comprehensions and integration with functional programming, helping developers choose the appropriate tool for efficient data processing.

Memory OptimizationPythondata-processing
0 likes · 6 min read
Understanding Python List Comprehensions and Generator Expressions
Cognitive Technology Team
Cognitive Technology Team
Mar 1, 2025 · Databases

Understanding and Mitigating Redis Large‑Key Issues

The article explains what constitutes a Redis large key, outlines its performance and stability risks, describes common scenarios and root causes, and provides practical detection commands, mitigation techniques such as splitting, compression, proper data modeling, and monitoring strategies to prevent future issues.

Memory Optimizationdatabaselarge key
0 likes · 6 min read
Understanding and Mitigating Redis Large‑Key Issues
DataFunSummit
DataFunSummit
Feb 2, 2025 · Artificial Intelligence

BladeDISC++: A Dynamic‑Shape AI Compiler for Memory‑Peak Optimization in Deep Learning Training

The article introduces BladeDISC++, a dynamic‑shape AI compiler from Alibaba Cloud PAI, explains the memory‑peak challenges of dynamic‑shape deep‑learning workloads, describes its symbolic‑shape graph, joint compile‑time/runtime optimizations such as operation fusion, scheduling and just‑in‑time rematerialization, and presents Llama2 experiments showing significant GPU memory savings and throughput gains.

AI compilerBladeDISCLlama2
0 likes · 15 min read
BladeDISC++: A Dynamic‑Shape AI Compiler for Memory‑Peak Optimization in Deep Learning Training
php Courses
php Courses
Jan 24, 2025 · Backend Development

Using PHP Generator Functions to Create Infinite Iterable Objects

This article explains PHP generator functions introduced in PHP 5.5 and enhanced in PHP 7, demonstrating how to implement infinite iterable objects such as a Fibonacci sequence, and discusses their advantages for processing large data sets, memory efficiency, and typical use cases like log handling and pagination.

Backend DevelopmentLarge Data ProcessingMemory Optimization
0 likes · 5 min read
Using PHP Generator Functions to Create Infinite Iterable Objects
DeWu Technology
DeWu Technology
Jan 22, 2025 · Operations

How We Cut Video Detection Memory Usage by 78% with WebAssembly and WorkerFS

This article details the challenges of video corruption detection on a creator platform, analyzes existing server‑side and client‑side approaches, and presents a WebAssembly‑based solution using ffmpeg, WorkerFS, and memory‑growth tuning that reduces memory consumption by up to 78% while speeding up large‑file processing.

Memory OptimizationVideo processingWeb Worker
0 likes · 13 min read
How We Cut Video Detection Memory Usage by 78% with WebAssembly and WorkerFS
Xuanwu Backend Tech Stack
Xuanwu Backend Tech Stack
Jan 18, 2025 · Backend Development

How Java’s String.intern() Boosts Memory Efficiency and Performance

Java’s String.intern() method adds a string to the constant pool, enabling memory sharing and faster reference comparisons; this article explains its behavior, demonstrates usage with code examples, and discusses performance benefits and cautions for optimal use in real‑world applications.

JavaMemory OptimizationString
0 likes · 5 min read
How Java’s String.intern() Boosts Memory Efficiency and Performance
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Jan 17, 2025 · Artificial Intelligence

How BladeDISC++ Cuts Memory Peaks for Dynamic‑Shape Deep Learning Models

This article explains the challenges of dynamic‑shape deep learning workloads and introduces BladeDISC++, an AI compiler that uses symbolic shape graphs, operation scheduling, and just‑in‑time auto‑rematerialization to dramatically reduce GPU memory peaks while maintaining training throughput.

AI compilerBladeDISC++LLM training
0 likes · 16 min read
How BladeDISC++ Cuts Memory Peaks for Dynamic‑Shape Deep Learning Models
Radish, Keep Going!
Radish, Keep Going!
Dec 6, 2024 · Backend Development

How BigCache Achieves Ultra‑Fast In‑Memory Caching in Go

This article explains how the Go library BigCache uses sharding, efficient hash functions, and memory‑optimised data structures to deliver high‑concurrency, low‑latency caching, while avoiding GC bottlenecks and providing practical code examples and performance benchmarks.

GoMemory Optimizationbigcache
0 likes · 12 min read
How BigCache Achieves Ultra‑Fast In‑Memory Caching in Go
Python Programming Learning Circle
Python Programming Learning Circle
Nov 23, 2024 · Fundamentals

Useful Python Tricks and Advanced Techniques

This article presents a collection of lesser‑known Python tricks—including string cleaning with translate, iterator slicing via itertools.islice, skipping header lines, keyword‑only functions, custom context managers, memory‑saving __slots__, resource limits, import control, and total_ordering—to help developers write cleaner, more efficient code.

Memory OptimizationPythoncontext manager
0 likes · 10 min read
Useful Python Tricks and Advanced Techniques
BirdNest Tech Talk
BirdNest Tech Talk
Oct 4, 2024 · Fundamentals

How Go’s New unique Package Enables Efficient Interning and Memory Savings

Go 1.23 introduces the unique package, which provides generic interning for comparable values, allowing canonicalization of strings and structs, reducing memory usage and speeding up equality checks; the article walks through a simple map‑based implementation, its limitations, the advanced Handle[T] design, real‑world netip usage, and future prospects.

Garbage CollectionMemory Optimizationcanonicalization
0 likes · 10 min read
How Go’s New unique Package Enables Efficient Interning and Memory Savings
360 Smart Cloud
360 Smart Cloud
Jul 17, 2024 · Artificial Intelligence

Parallelism and Memory‑Optimization Techniques for Distributed Large‑Scale Transformer Training

This article reviews the principles and practical implementations of data, pipeline, tensor, sequence, and context parallelism together with memory‑saving strategies such as recomputation and ZeRO, and demonstrates how the QLM framework leverages these techniques to accelerate large‑model training and fine‑tuning on multi‑GPU clusters.

GPUMegatron-LMMemory Optimization
0 likes · 18 min read
Parallelism and Memory‑Optimization Techniques for Distributed Large‑Scale Transformer Training
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 14, 2024 · Artificial Intelligence

Boost LLM Speed: How KV Cache Quantization Cuts Memory While Preserving Quality

This article explains Hugging Face's KV cache quantization technique, detailing how it reduces memory usage for long‑context LLM generation, the underlying quantization methods, implementation steps in 🤗 Transformers, benchmark results versus fp16, and the trade‑offs between speed, memory, and accuracy.

LLMMemory OptimizationTransformers
0 likes · 15 min read
Boost LLM Speed: How KV Cache Quantization Cuts Memory While Preserving Quality
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
May 15, 2024 · Artificial Intelligence

How Baidu’s AIAK‑LLM Supercharges Large‑Model Training and Inference

The article explains the scaling challenges of ever‑larger LLMs, introduces the MFU performance metric, surveys industry parallelism and memory‑saving techniques, and details Baidu’s AIAK‑LLM suite—including resource, component and acceleration layers—as well as concrete training and inference optimizations that raise MFU by 30‑60% and cut deployment costs.

AI InfrastructureLarge ModelMFU
0 likes · 25 min read
How Baidu’s AIAK‑LLM Supercharges Large‑Model Training and Inference
DeWu Technology
DeWu Technology
Apr 8, 2024 · Operations

Analyzing and Optimizing ZooKeeper WatchManager Memory Usage

By replacing ZooKeeper’s default WatchManager hash‑set tables with concurrent maps and bitmap‑based structures, the authors cut watch‑related heap usage from several gigabytes to under 12 MB, lowered lock contention, and achieved 5‑6× latency gains, delivering up to 91 % memory reduction and ten‑fold SLA improvement in production clusters.

JavaMemory OptimizationPerformance Testing
0 likes · 13 min read
Analyzing and Optimizing ZooKeeper WatchManager Memory Usage
NewBeeNLP
NewBeeNLP
Mar 18, 2024 · Artificial Intelligence

Mastering RAG and LLM Techniques: From Retrieval to Fine‑Tuning

This article provides a comprehensive technical guide on Retrieval‑Augmented Generation (RAG), open‑source large language models such as LLaMA, fine‑tuning methods, evaluation metrics, memory‑optimization tricks, and attention‑related optimizations for modern AI systems.

LLMLangChainMemory Optimization
0 likes · 19 min read
Mastering RAG and LLM Techniques: From Retrieval to Fine‑Tuning
Python Programming Learning Circle
Python Programming Learning Circle
Jan 25, 2024 · Fundamentals

Python Memory Optimization Techniques: __slots__, Generators, mmap, Data Types, and String Interning

This article explains practical Python memory‑saving methods—including inspecting memory usage, using __slots__, generators, memory‑mapped files, choosing efficient data structures, and string interning—providing code examples and performance comparisons to help developers write more memory‑efficient programs.

Data StructuresMemory OptimizationPython
0 likes · 11 min read
Python Memory Optimization Techniques: __slots__, Generators, mmap, Data Types, and String Interning
macrozheng
macrozheng
Jan 25, 2024 · Backend Development

How to Slash Spring Boot Memory Usage by Over 40% for Local Development

This guide explains why Spring Boot applications consume memory, shows how to configure the JVM and Tomcat settings, and demonstrates using Docker‑compose to apply these tweaks, enabling developers to reduce local memory usage by more than 40% and speed up development cycles.

ConfigurationDockerJVM
0 likes · 7 min read
How to Slash Spring Boot Memory Usage by Over 40% for Local Development
Open Source Linux
Open Source Linux
Jan 24, 2024 · Game Development

How 128KB Made a Classic Game: Secrets of Retro Graphics, Audio, and Code

This article explains how 1980s game developers squeezed full‑screen scrolling, rich music, and extensive gameplay into just 128 KB by using tile‑based graphics, specialized audio chips, and ultra‑compact code, revealing the technical tricks behind classic titles like Contra.

Audio SynthesisGame DevelopmentMemory Optimization
0 likes · 6 min read
How 128KB Made a Classic Game: Secrets of Retro Graphics, Audio, and Code
Liangxu Linux
Liangxu Linux
Dec 11, 2023 · Fundamentals

How to Squeeze Extra RAM on a Resource‑Constrained Embedded Linux Device

This article examines memory usage on a Junzheng T31ZC embedded Linux board, breaks down physical and kernel allocations, and presents practical techniques—such as trimming kernel symbols, stripping ELF files, optimizing rootfs, and manually dropping caches—to reclaim valuable RAM without adding hardware.

LinuxMemory OptimizationRootFS
0 likes · 13 min read
How to Squeeze Extra RAM on a Resource‑Constrained Embedded Linux Device
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Nov 2, 2023 · Industry Insights

OPPO’s Kernel Tweaks: Dynamic HugePages and Scheduler Boosts for Smoother ColorOS

At the 18th China Linux Kernel Developer Conference, OPPO unveiled memory and scheduler optimizations—including a dynamic 64 KB huge‑page scheme, LRU reclamation improvements, and a latency‑aware scheduler balance—that collectively raise memory access performance by over 10 %, reduce lock contention, and lower frame‑drop and audio‑stutter rates, demonstrating the impact of programmable kernel technologies on Android smoothness.

Linux kernelMemory OptimizationMobile OS
0 likes · 7 min read
OPPO’s Kernel Tweaks: Dynamic HugePages and Scheduler Boosts for Smoother ColorOS
php Courses
php Courses
Oct 31, 2023 · Backend Development

Using Generators in PHP 7 for Efficient Data Processing

This article explains PHP 7 generators, describing their concept, memory‑saving benefits, and how to implement them with a practical example that reads a large file and yields squared numbers, demonstrating efficient data processing and lazy loading in backend development.

Backend DevelopmentMemory OptimizationPHP
0 likes · 5 min read
Using Generators in PHP 7 for Efficient Data Processing
Selected Java Interview Questions
Selected Java Interview Questions
Oct 14, 2023 · Backend Development

Configuring and Optimizing Memory Settings in Spring Boot

This article explains Java memory concepts, demonstrates how to set JVM memory parameters for Spring Boot via startup scripts and configuration files, lists common memory flags, and provides practical tips for reducing memory consumption and improving application performance.

Backend DevelopmentConfigurationJVM
0 likes · 7 min read
Configuring and Optimizing Memory Settings in Spring Boot
OPPO Kernel Craftsman
OPPO Kernel Craftsman
Sep 22, 2023 · Mobile Development

Advances in Android Memory Optimization and Linux Kernel Techniques Presented at CCF ESTC 2023

At the 2023 CCF ESTC conference, OPPO showcased its 64 KB dynamic large‑page technology that can increase memory‑access bandwidth up to sixteenfold and improve app startup and frame rates, while also co‑hosting a Linux‑kernel forum on scheduler, lock contention, memory defragmentation and hot‑cold page optimization, underscoring the company’s push for industry‑academic collaboration and future OS research.

AndroidDynamic Large PagesMemory Optimization
0 likes · 8 min read
Advances in Android Memory Optimization and Linux Kernel Techniques Presented at CCF ESTC 2023
21CTO
21CTO
Sep 21, 2023 · Operations

How Desktop QQ Tamed Memory Bloat: Deep Dive into Electron Optimization

This article details the systematic memory‑usage analysis and multi‑stage optimization performed on the desktop QQ Electron client, covering background challenges, target metrics, tooling, code and resource slimming, rendering layer tuning, Lottie handling, online monitoring, and the resulting stable sub‑300 MB memory footprint.

Desktop ApplicationElectronMemory Optimization
0 likes · 24 min read
How Desktop QQ Tamed Memory Bloat: Deep Dive into Electron Optimization
Tencent Cloud Developer
Tencent Cloud Developer
Sep 21, 2023 · Frontend Development

Memory Optimization of Desktop QQ: A Stage Summary

To curb Desktop QQ’s excessive Electron memory consumption, the team applied comprehensive profiling, code and resource slimming, thumbnail image generation, visible‑only DOM rendering, layer merging, Lottie animation tuning, API caching, and leak removal, achieving average usage around 228 MB and keeping all six processes below 300 MB with ongoing monitoring.

Desktop ApplicationElectronMemory Optimization
0 likes · 24 min read
Memory Optimization of Desktop QQ: A Stage Summary
Sohu Tech Products
Sohu Tech Products
Sep 20, 2023 · Mobile Development

Memory Optimization and GPUImage‑Based Gaussian Blur for iOS Live‑Stream Images

The iOS live‑stream app was crashing from out‑of‑memory due to high‑resolution images and CPU‑based Gaussian blur, so the solution reduced download size, skipped downloads during fast scrolling, limited cached blurred images, and switched to GPUImage‑accelerated blur with a custom composite shader, cutting memory use and processing time.

GPUImageGaussian BlurImage Processing
0 likes · 12 min read
Memory Optimization and GPUImage‑Based Gaussian Blur for iOS Live‑Stream Images
Architecture Digest
Architecture Digest
Sep 12, 2023 · Fundamentals

Understanding Java Integer Caching and the == Operator

This article explains how Java caches Integer objects in the range -128 to 127, why the == operator behaves differently for small and large integers, and demonstrates how to inspect and manipulate the cache using reflection.

JavaMemory OptimizationReference Equality
0 likes · 4 min read
Understanding Java Integer Caching and the == Operator
High Availability Architecture
High Availability Architecture
Aug 30, 2023 · Backend Development

Diagnosing and Optimizing JVM Memory Issues in a Core Service

This article details the identification, analysis, and resolution of JVM memory problems in a core music metadata service, covering GC tuning, large‑object handling, fault‑tolerance strategies, custom Dubbo codec monitoring, and non‑intrusive memory object tracking to improve performance and stability.

DubboJVMMemory Optimization
0 likes · 14 min read
Diagnosing and Optimizing JVM Memory Issues in a Core Service
政采云技术
政采云技术
Aug 15, 2023 · Databases

Understanding and Handling Redis Bigkey Issues

This article explains what Redis bigkeys are, their impact on performance and memory, how they are generated, methods to detect and analyze them—including built‑in commands, SCAN, memory usage, and RDB file inspection—and practical steps to mitigate their effects.

BigKeyMemory OptimizationRDB analysis
0 likes · 12 min read
Understanding and Handling Redis Bigkey Issues
ITPUB
ITPUB
Aug 9, 2023 · Operations

Why Is My Elasticsearch Cluster Using 15 GB Heap? A Deep Dive into Memory Bottlenecks

The article examines a 7‑node Elasticsearch cluster with 500 million documents, uncovering excessive heap usage, high OS memory pressure, numerous deleted documents, large translog, low query‑cache hit rate, and an over‑sharded design, then offers concrete tuning and redesign recommendations to restore performance.

ElasticsearchMemory Optimizationcluster operations
0 likes · 16 min read
Why Is My Elasticsearch Cluster Using 15 GB Heap? A Deep Dive into Memory Bottlenecks
dbaplus Community
dbaplus Community
Aug 1, 2023 · Operations

Why Is My Elasticsearch Cluster Consuming 15 GB Heap? A Deep Dive into Memory, Sharding, and Performance Bottlenecks

A 7‑node Elasticsearch cluster handling 500 million documents shows excessive heap usage, many deleted documents, high translog size, saturated OS memory, and inefficient sharding, prompting a detailed analysis of stats, root‑cause identification, and concrete recommendations for mapping, shard design, and JVM tuning.

Cluster MonitoringJVMMemory Optimization
0 likes · 16 min read
Why Is My Elasticsearch Cluster Consuming 15 GB Heap? A Deep Dive into Memory, Sharding, and Performance Bottlenecks
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jul 17, 2023 · Artificial Intelligence

How MindSpore’s Auto Parallel Tech Simplifies Large-Model Training

During a livestream titled “Solving the ‘Development Difficulty’ of Large Models with MindSpore Auto Parallel”, Huawei’s MindSpore experts explained how the framework’s distributed training techniques—including data, model, and pipeline parallelism as well as memory‑saving strategies—enable efficient pre‑training of trillion‑parameter models across diverse AI domains.

Data ParallelDistributed TrainingMemory Optimization
0 likes · 6 min read
How MindSpore’s Auto Parallel Tech Simplifies Large-Model Training
Programmer DD
Programmer DD
Jul 15, 2023 · Operations

Why Is My Elasticsearch Cluster Using 99% Memory? Sharding, Translog & JVM Insights

This article analyzes a 7‑node Elasticsearch cluster with 500 million documents, revealing excessive shard count, high heap and OS memory usage, large translog, low query‑cache hit rate, and heavy I/O, and offers concrete recommendations on mapping, sharding, JVM tuning, and resource management to restore performance.

Cluster MonitoringElasticsearchJVM
0 likes · 17 min read
Why Is My Elasticsearch Cluster Using 99% Memory? Sharding, Translog & JVM Insights
Python Programming Learning Circle
Python Programming Learning Circle
Jun 27, 2023 · Fundamentals

Useful Python Tricks: String Cleaning, Iterator Slicing, Keyword‑Only Arguments, Context Managers, __slots__, Resource Limits, __all__, and Total Ordering

This article presents a collection of lesser‑known Python tricks—including string normalization, iterator slicing, keyword‑only functions, custom context managers, memory‑saving __slots__, CPU/memory limits, import control with __all__, and simplified ordering with total_ordering—to help developers write cleaner and more efficient code.

IteratorMemory OptimizationPython
0 likes · 10 min read
Useful Python Tricks: String Cleaning, Iterator Slicing, Keyword‑Only Arguments, Context Managers, __slots__, Resource Limits, __all__, and Total Ordering
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Jun 16, 2023 · Fundamentals

The Semantics of nil in Swift

The article explains that Swift’s nil differs from Objective‑C’s null pointer by representing ‘no value’ for any type through optionals, details how various optionals (Bool?, String?, class?, enum?) are encoded in memory, shows the extra byte cost for Int?, and advises minimizing optional Int fields in structs, using 0 when appropriate.

ClassMemory OptimizationSwift
0 likes · 6 min read
The Semantics of nil in Swift
Tencent Cloud Developer
Tencent Cloud Developer
May 31, 2023 · Cloud Computing

TencentOS "Wujing": Server Memory Multi-Level Offloading Solution for Cloud Data Centers

TencentOS “Wujing” provides a server‑memory multi‑level offloading framework that uses kernel‑side reclamation, heat‑aware page classification, SWAP balancing, and CXL promotion to shift cold pages to cheaper storage, cutting data‑center memory use by up to 50 % while preserving performance.

DAMONLinux kernelMemory Optimization
0 likes · 14 min read
TencentOS "Wujing": Server Memory Multi-Level Offloading Solution for Cloud Data Centers
DataFunSummit
DataFunSummit
Apr 11, 2023 · Artificial Intelligence

OneFlow Coop: Joint Optimization of Dynamic‑Graph Recomputation and Memory Allocation

This article introduces OneFlow Coop, a memory‑optimization technique that jointly optimizes dynamic‑graph recomputation strategies and GPU memory allocation by analyzing existing DTR limitations, proposing recomputable in‑place, op‑guided tensor allocation, and layout‑aware eviction modules, and demonstrating superior experimental results.

Deep LearningDynamic GraphGPU Memory
0 likes · 18 min read
OneFlow Coop: Joint Optimization of Dynamic‑Graph Recomputation and Memory Allocation
Python Programming Learning Circle
Python Programming Learning Circle
Apr 7, 2023 · Fundamentals

Lesser‑Known Python Tricks and Techniques

This article introduces a collection of lesser‑known Python tricks—including string cleaning with translate, iterator slicing via itertools, skipping header lines, keyword‑only functions, custom context managers, memory‑saving __slots__, resource limits, import control with __all__, and total_ordering—to help developers write cleaner, more efficient code.

Memory Optimizationcode snippetscontext manager
0 likes · 10 min read
Lesser‑Known Python Tricks and Techniques
21CTO
21CTO
Mar 31, 2023 · Backend Development

Boost Go Performance: 6 Proven Techniques for Faster, Leaner Apps

This article presents six practical Go performance optimizations—including GOMAXPROCS tuning for Kubernetes, struct field ordering, garbage‑collection limits, zero‑copy unsafe conversions, jsoniter usage, and sync.Pool pooling—that together can dramatically lower CPU, memory, and latency in production services.

Garbage CollectionGoKubernetes
0 likes · 9 min read
Boost Go Performance: 6 Proven Techniques for Faster, Leaner Apps
Tencent Cloud Developer
Tencent Cloud Developer
Mar 22, 2023 · Artificial Intelligence

How AngelPTM Cuts Large Model Training Costs with ZeRO-Cache Optimizations

This article analyzes Tencent's AngelPTM framework, detailing its ZeRO-Cache strategy, unified storage management, multi‑stream async execution, SSD tiered storage, and performance benchmarks that show up to 95% larger model capacity and over 44% speedup compared to community solutions.

AI InfrastructureGPU AccelerationMemory Optimization
0 likes · 12 min read
How AngelPTM Cuts Large Model Training Costs with ZeRO-Cache Optimizations
Tencent Advertising Technology
Tencent Advertising Technology
Mar 10, 2023 · Artificial Intelligence

Optimizing Large-Scale Model Training with Tencent's AngelPTM and ZeRO-Cache

This article presents Tencent's latest advancements in large‑scale model training, detailing the AngelPTM framework and its ZeRO‑Cache optimization techniques that reduce memory and storage costs, improve hardware utilization, and achieve high‑performance training for trillion‑parameter AI models across various applications.

AI modelsAngelPTMLarge-Scale Training
0 likes · 14 min read
Optimizing Large-Scale Model Training with Tencent's AngelPTM and ZeRO-Cache
Ctrip Technology
Ctrip Technology
Mar 9, 2023 · Backend Development

Optimizing Hotel Query Service Memory Usage: GC Tuning, Native Memory Management, and Migration to jemalloc

This article details the systematic reduction of memory consumption for Ctrip's hotel query service by halving container memory, evaluating and tuning modern garbage collectors, diagnosing off‑heap leaks, and ultimately replacing the default ptmalloc allocator with jemalloc to achieve stable performance and lower resource costs.

Backend PerformanceGarbage CollectionJVM
0 likes · 22 min read
Optimizing Hotel Query Service Memory Usage: GC Tuning, Native Memory Management, and Migration to jemalloc
NiuNiu MaTe
NiuNiu MaTe
Mar 9, 2023 · Databases

Understanding Redis Ziplist and Listpack: Memory‑Efficient List Implementations

This article explains Redis's compressed list structures—ZIPLIST and its successor LISTPACK—detailing their layouts, encoding schemes, query and update complexities, and how LISTPACK resolves ZIPLIST's chain‑update performance issues to provide a more memory‑efficient list implementation.

Backend DevelopmentData StructuresListpack
0 likes · 10 min read
Understanding Redis Ziplist and Listpack: Memory‑Efficient List Implementations