Less hashing, same performance: Building a better Bloom filter

Adam Kirsch, Michael Mitzenmacher

2006

2 references

Abstract

Abstract A standard technique from the hashing literature is to use two hash functions h 1 ( x ) and h 2 ( x ) to simulate additional hash functions of the form g i ( x ) = h 1 ( x ) + i h 2 ( x ). We demonstrate that this technique can be usefully applied to Bloom filters and related data structures. Specifically, only two hash functions are necessary to effectively implement a Bloom filter without any loss in the asymptotic false positive probability. This leads to less computation and potentially less need for randomness in practice. © 2008 Wiley Periodicals, Inc. Random Struct. Alg., 2008

View Paper PDF DOI

💾 Databases

2 repositories

2 references

Code References

▶ postgres/postgres

1 file

▶ src/backend/access/brin/brin_bloom.c

L53

* Applied Sciences, Cambridge, Massachusetts [DOI 10.1002/rsa.20208]

▶ yugabyte/yugabyte-db

1 file

▶ src/postgres/src/backend/access/brin/brin_bloom.c

L53

* Applied Sciences, Cambridge, Massachusetts [DOI 10.1002/rsa.20208]

Link copied to clipboard!