Fundamentals 28 min read

Unveiling HashMap's Inner Mechanics: Design, Source Code Walkthrough, and Best Practices

This article dissects Java's HashMap by first explaining the collection framework's design goals—generality, scalability, performance, interoperability, readability, and thread safety—then walks through the core source code of HashMap (hash, put, resize, get, remove, treeify, etc.), illustrates LRU cache implementation, and finally offers practical usage tips, performance considerations, and version‑specific changes up to JDK 9.

IT Niuke

Dec 24, 2023

Unveiling HashMap's Inner Mechanics: Design, Source Code Walkthrough, and Best Practices

Design Principles of the Java Collections Framework

The framework is built around several key ideas: Generality —a reusable set of interfaces and classes for any object type; Scalability —developers can extend it with custom collections; Performance —choice of data structures such as ArrayList and LinkedList optimises insertion and deletion; Interoperability —interfaces work together, allowing easy conversion; Readability & Simplicity —clear APIs make the framework easy to understand; and Thread Safety —some classes like Hashtable and Vector are synchronized, while ConcurrentHashMap provides high‑performance concurrency.

Deep Dive into HashMap Source Code

The article presents a line‑by‑line analysis of the most important methods.

hash(key) – returns 0 for null keys, otherwise computes key.hashCode() ^ (key.hashCode() >>> 16) to spread bits.

put(key, value) – delegates to putVal after hashing the key.

putVal(int hash, K key, V value, boolean onlyIfAbsent, boolean evict) – initialises the table if needed, finds the bucket index, handles empty buckets, resolves collisions via linked‑list traversal, upgrades to a red‑black tree when the bucket length exceeds TREEIFY_THRESHOLD, and finally inserts or updates the node, adjusting modCount and size and triggering resize() when the threshold is crossed.

treeifyBin – converts a long bucket list into a red‑black tree; if the table is too small ( MIN_TREEIFY_CAPACITY) it first expands the table.

resize() – doubles the capacity when the load factor (default 0.75) is exceeded, rehashes existing entries, and handles special cases for tree nodes.

get(key) – hashes the key, locates the bucket, and searches either a tree node ( TreeNode) or a linked list.

remove(key) – finds the target node, unlinks it, updates modCount and size, and calls afterNodeRemoval for LinkedHashMap support.

All code snippets are shown in <pre><code> blocks with generic types escaped (e.g., Map<String, Integer>).

LRU Cache Example Using HashMap

The article demonstrates how to build an LRU cache by combining a doubly‑linked list (to record access order) with a HashMap (for O(1) look‑ups). Insertion moves the accessed node to the head; when the cache is full, the tail node is evicted and the map entry removed.

Use a doubly‑linked list where the head represents the most recently used entry and the tail the least.

Store key → node mappings in a HashMap for constant‑time retrieval.

On a cache hit, move the node to the head; on a miss, load the data, insert a new node at the head, and evict the tail if necessary.

Practical Usage Recommendations and Pitfalls

Initial capacity : Provide a suitable initial capacity (e.g., new HashMap<>(16)) to reduce rehashing.

Load factor : Default is 0.75; adjust it (e.g., new HashMap<>(16, 0.8f)) based on memory vs. speed trade‑offs.

Hash function : Ensure custom key classes correctly implement hashCode() and equals().

Thread safety : Use ConcurrentHashMap in multithreaded scenarios.

Iteration : Prefer entrySet() over separate keySet() and values() to avoid repeated hash calculations.

Null checks : Verify a key exists with containsKey() before calling get() to prevent NullPointerException.

Capacity tuning : Manual capacity adjustments can mitigate the cost of automatic resizing in performance‑critical code.

Avoid overuse : In scenarios requiring ordering, consider LinkedHashMap or TreeMap instead of plain HashMap.

JDK Version Evolution

JDK 8 : Introduced red‑black tree buckets to improve lookup performance for heavily colliding keys.

JDK 9 : Buckets are now ordered by insertion within the tree, providing deterministic iteration order.

Conclusion

HashMap remains a cornerstone of Java's collection utilities, offering O(1) average‑case performance for insertion, lookup, and removal while providing extensibility through custom hash functions, configurable load factor, and optional treeification. Understanding its internal mechanisms—hash calculation, bucket management, tree conversion, and resizing—enables developers to write more efficient, thread‑safe, and maintainable code.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

Java Performance concurrency JDK HashMap Data Structures

Written by

IT Niuke

Focused on IT technology sharing, original and innovative content. IT Niuke, we grow together.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.