
Understanding Jump Consistent Hash: Principles, Implementation, and Performance

This article explains the principle of jump consistent hashing, compares it with traditional ring‑based methods, shows a step‑by‑step implementation with Go code, and discusses its performance advantages and potential optimizations for distributed key‑value storage systems.

Aikesheng Open Source Community

Consistent hashing distributes key‑value pairs across multiple nodes, but naive modulo‑based hashing (hash(key) mod n) remaps almost every key when a node is added. Jump consistent hash, introduced by Google, is minimal in this respect: when the cluster grows from n to n+1 nodes, only about 1/(n+1) of the keys move, and all of them move to the new node.

The basic implementation starts node numbering from 1 instead of 0. For a single node, all keys map to node 1 (ch(key, 1) = 1). When a second node is added, half of the keys are randomly moved to node 2; when a third node is added, one‑third of the keys are randomly moved to node 3, and so on, ensuring that each addition only moves a small fraction of data.

The algorithm uses a reproducible random sequence: the key itself is used as the seed for a random number generator, so the same key follows the same random decisions across different runs.
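As a minimal sketch of this property (the function name here is illustrative, not from the article), two generators seeded with the same key always produce the same sequence, so the key fully determines every "random" decision:

```go
package main

import (
	"fmt"
	"math/rand"
)

// seededSequence returns the first n values from a generator seeded with
// key, illustrating that the sequence is fully determined by the key.
func seededSequence(key int64, n int) []float64 {
	r := rand.New(rand.NewSource(key))
	out := make([]float64, n)
	for i := range out {
		out[i] = r.Float64()
	}
	return out
}

func main() {
	a := seededSequence(7, 3)
	b := seededSequence(7, 3)
	// Same key, same sequence: the comparison prints true.
	fmt.Println(a[0] == b[0] && a[1] == b[1] && a[2] == b[2])
}
```

This determinism is what lets every client compute the same placement for a key without any coordination.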

Mathematical induction describes the placement: for n nodes, a key stays on its current node unless a random value is less than 1/(n+1), in which case it jumps to the newly added node. This logic directly maps to a recursive function.

Example Go implementation:

func ch(r *rand.Rand, k int, i int) int {
    if i == 1 {
        // base case: with a single node, every key maps to node 1
        return 1
    }
    // inductive case: place the key among the first i-1 nodes,
    // then jump to node i with probability 1/i
    b := ch(r, k, i-1)
    if r.Float64() < 1.0/float64(i) {
        return i
    }
    return b
}

func chWrapper(k int, i int) int {
    r := rand.New(rand.NewSource(int64(k))) // use the key as the random seed
    return ch(r, k, i)
}

In production code the recursion is usually replaced by an iterative loop for efficiency.
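A sketch of that unrolling, assuming the same key‑seeded generator as above: walk from node 2 up to n, letting the key jump to node j with probability 1/j.

```go
package main

import (
	"fmt"
	"math/rand"
)

// chLoop is an iterative equivalent of the recursive ch: the draws
// happen in the same order (j = 2, 3, ..., n), so it returns the same
// node as the recursive version for a given key.
func chLoop(key int64, n int) int {
	r := rand.New(rand.NewSource(key)) // key as seed, as in the article
	b := 1
	for j := 2; j <= n; j++ {
		if r.Float64() < 1.0/float64(j) {
			b = j
		}
	}
	return b
}

func main() {
	fmt.Println(chLoop(42, 10))
}
```

Because the generator replays the same draws for the first n nodes, growing from n to n+1 either leaves a key in place or moves it to node n+1, never to an existing node.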

Performance‑wise, the naive implementation walks through all n nodes, so each lookup is O(n). The original paper presents an O(log n) method that, instead of testing every node, directly samples the index of the next node the key jumps to according to the appropriate probability distribution, further reducing the computational cost.
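The paper's published O(log n) version replaces the explicit generator with an inline 64‑bit linear congruential step and computes each jump destination directly (note it numbers buckets from 0, unlike the 1‑based convention above):

```go
package main

import "fmt"

// JumpHash is the O(log n) algorithm from "A Fast, Minimal Memory,
// Consistent Hash Algorithm". It returns a bucket in [0, numBuckets).
func JumpHash(key uint64, numBuckets int32) int32 {
	var b int64 = -1 // last bucket the key landed in
	var j int64 = 0  // candidate for the key's next jump
	for j < int64(numBuckets) {
		b = j
		// Inline 64-bit linear congruential generator seeded by the key.
		key = key*2862933555777941757 + 1
		// Sample the next bucket the key would jump to; on average this
		// advances geometrically, giving O(log n) iterations.
		j = int64(float64(b+1) * (float64(int64(1)<<31) / float64((key>>33)+1)))
	}
	return int32(b)
}

func main() {
	fmt.Println(JumpHash(123456, 10), JumpHash(123456, 11))
}
```

The minimal‑movement property is preserved: when the bucket count grows from n to n+1, a key either stays put or moves to the new bucket n.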

Further details and the original paper "A Fast, Minimal Memory, Consistent Hash Algorithm" are linked at the end of the article.

distributed systems · algorithm · Go · Consistent Hash · jump hash
Written by

Aikesheng Open Source Community

The Aikesheng Open Source Community provides stable, enterprise‑grade MySQL open‑source tools and services, releases a premium open‑source component each year (1024), and continuously operates and maintains them.
