Implementation of the enhanced double hashing technique based on the Bloom Filters in Probabilistic Verification paper (Dillinger and Manolios, 2004).
This crate is very simple: given a key key, one can hash it to a sequence of k hashes
[hash_1, hash_2, .., hash_k], instead of a single hash.
This is useful for implementing many hash-based algorithms, such as:
- Hash tables with open addressing, where double hashing is used to resolve collisions.
- Bloom filters, since as shown in
Less Hashing, Same Performance: Building a Better Bloom Filter
(Kirsch and Mitzenmacher, 2006) instead of using
kdifferent hashers, we can rely on double hashing to producekfilter positions. - Consistent hashing, where double hashing is used to map keys to a ring of servers (for example in Multi-probe consistent hashing).
Create and configure a hasher either by directly calling constructors of DoubleHashHasher or by
using the builder object DoubleHashBuilder.
When we are relying on default parameters (for seed values, max hash value etc), and do not need to
keep builder around (which is normally the case, as a single generated hasher is often enough), we
can use the DoubleHashHasher constructor directly:
use hash_iter::{DoubleHashHasher, HashIterHasher};
let hasher = DoubleHashHasher::new();
// Hash each key to 3 hash points.
let hashes = hasher.hash_iter(&"foo", 3).collect::<Vec<_>>();
let hashes = hasher.hash_iter(&"bar", 3).collect::<Vec<_>>();There are several optional parameters that can be configured:
n: the maximum hash value producible (by default it isusize::MAX, so that array indexing is safe).seed1andseed2: seeds for the two hash functions (by default they are12345and67890respectively).
The DoubleHashBuilder allows you to configure how hash iterators are produced:
use hash_iter::{BuildHashIterHasher, DoubleHashHasher, DoubleHashBuilder, HashIterHasher};
// Specify default values explicitly.
let hasher = DoubleHashBuilder::new()
.with_seed1(12345)
.with_seed2(67890)
.with_n(usize::MAX as u64)
.build_hash_iter_hasher();
let hashes = hasher.hash_iter(&"hello", 3).collect::<Vec<_>>();One can specify which hash functions to use when creating the first two hash values used to produce the sequence.
All you need to do is to supply DoubleHashHasher::with_hash_builders() function with two structs
that implement hash::BuildHasher:
use hash_iter::{DoubleHashHasher, HashIterHasher};
use xxhash_rust::xxh3::Xxh3Builder;
let hasher = DoubleHashHasher::with_hash_builders(
Xxh3Builder::new().with_seed(12345),
Xxh3Builder::new().with_seed(67890),
usize::MAX, // n
);
let hashes = hasher.hash_iter(&"hello", 3).collect::<Vec<_>>();