Less hashing, same performance: Building a better Bloom filter
Abstract
Abstract A standard technique from the hashing literature is to use two hash functions h 1 ( x ) and h 2 ( x ) to simulate additional hash functions of the form g i ( x ) = h 1 ( x ) + i h 2 ( x ). We demonstrate that this technique can be usefully applied to Bloom filters and related data structures. Specifically, only two hash functions are necessary to effectively implement a Bloom filter without any loss in the asymptotic false positive probability. This leads to less computation and potentially less need for randomness in practice. © 2008 Wiley Periodicals, Inc. Random Struct. Alg., 2008
Code References
postgres/postgres
1 file
src/backend/access/brin/brin_bloom.c
1
L53
* Applied Sciences, Cambridge, Massachusetts [DOI 10.1002/rsa.20208]
Link copied to clipboard!