Backend Development 35 min read

Deep Dive into Caffeine Cache: W‑TinyLFU Design, High‑Performance Architecture, and Code Analysis

This article provides a comprehensive analysis of the Caffeine Java caching library, covering its performance‑focused W‑TinyLFU eviction algorithm, detailed source‑code walkthroughs of frequency sketching, read/write buffers, timer‑wheel expiration, and practical usage examples compared with Guava Cache.

Architect

Jan 30, 2021

Caffeine is a high‑performance, low‑memory local cache for Java that improves upon Guava Cache by adopting a sophisticated W‑TinyLFU eviction strategy, asynchronous read/write buffers, and a hierarchical timer wheel for flexible expiration.

Comparison with Guava Cache – Caffeine retains the familiar Guava API (Cache, LoadingCache, CacheBuilder) while offering significantly better hit rates and throughput, which is why Spring 5 switched its default local cache to Caffeine.

Basic Usage – Creating a cache is straightforward:

private static LoadingCache<String, String> cache = Caffeine.newBuilder()
    .maximumSize(256L)
    .initialCapacity(1)
    .expireAfterAccess(2, TimeUnit.DAYS)
    .expireAfterWrite(2, TimeUnit.HOURS)
    .refreshAfterWrite(1, TimeUnit.HOURS)
    .recordStats()
    .writer(new CacheWriter<Object, Object>() {
        @Override public void write(@NonNull Object key, @NonNull Object value) {
            log.info("key={}, CacheWriter write", key);
        }
        @Override public void delete(@NonNull Object key, @Nullable Object value, @NonNull RemovalCause cause) {
            log.info("key={}, cause={}, CacheWriter delete", key, cause);
        }
    })
    .build(new CacheLoader<String, String>() {
        @Override public String load(@NonNull String key) throws Exception {
            return "value_" + key;
        }
        @Override public String reload(@NonNull String key, @NonNull String oldValue) throws Exception {
            return "value_" + key;
        }
    });

W‑TinyLFU Core Design – The algorithm combines a small Window cache (admission space) with a main cache split into protected and probation segments using a Segmented LRU (SLRU). Frequency is tracked by a compact FrequencySketch based on a Count‑Min Sketch stored in a single 64‑bit word, limiting each counter to 15 and halving all counters periodically to age old entries.

public void increment(@NonNull E e) {
    if (isNotInitialized()) return;
    int hash = spread(e.hashCode());
    int start = (hash & 3) << 2;
    int index0 = indexOf(hash, 0);
    int index1 = indexOf(hash, 1);
    int index2 = indexOf(hash, 2);
    int index3 = indexOf(hash, 3);
    boolean added = incrementAt(index0, start);
    added |= incrementAt(index1, start + 1);
    added |= incrementAt(index2, start + 2);
    added |= incrementAt(index3, start + 3);
    if (added && (++size == sampleSize)) {
        reset();
    }
}

The evictEntries method decides whether to keep a candidate from the window or evict the victim from the main space based on their frequencies, with a small random tie‑breaker to mitigate hash‑collision attacks.

boolean admit(K candidateKey, K victimKey) {
    int victimFreq = frequencySketch().frequency(victimKey);
    int candidateFreq = frequencySketch().frequency(candidateKey);
    if (candidateFreq > victimFreq) return true;
    if (candidateFreq <= 5) return false;
    int random = ThreadLocalRandom.current().nextInt();
    return ((random & 127) == 0);
}

Adaptive Window Size – The climb routine monitors hit‑rate changes and dynamically adjusts the window proportion (default 1 % of total capacity) to optimise for workloads with bursty or steady access patterns.

Asynchronous Read/Write Buffers – Caffeine uses a striped, non‑blocking BoundedBuffer (read buffer) and an MPSC MpscGrowableArrayQueue (write buffer) to batch maintenance work such as statistics updates, expiration, and eviction, reducing lock contention on hot keys.

Timer Wheel Expiration – When custom per‑key expiration is required via .expireAfter(Expiry), Caffeine employs a hierarchical timer wheel to place entries into buckets based on their absolute expiration time, avoiding costly sorted structures.

Additional optimisations include soft/weak references, false‑sharing elimination, and optional CompletableFuture ‑based asynchronous loading.

Overall, Caffeine combines modern caching theory (W‑TinyLFU, Count‑Min Sketch, adaptive window sizing) with practical engineering (striped buffers, timer wheel) to deliver near‑optimal hit rates with minimal memory overhead.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

backend Java Performance Cache Caffeine W‑TinyLFU

Written by

Architect

Professional architect sharing high‑quality architecture insights. Topics include high‑availability, high‑performance, high‑stability architectures, big data, machine learning, Java, system and distributed architecture, AI, and practical large‑scale architecture case studies. Open to ideas‑driven architects who enjoy sharing and learning.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.