bloomfilter-go

A Bloom filter is a space-efficient probabilistic data structure, conceived by Burton Howard Bloom in 1970, that is used to test whether an element is a member of a set. False positive matches are possible, but false negatives are not – in other words, a query returns either "possibly in set" or "definitely not in set". (from wikipedia)

In short, if a bloom filter return false for a specific element, then the element is definitely not in this bloom filter.

If a bloom filter return true for a specific element, then it's possible that the element is not in this bloom filter. The probability (error rate) is based on the number of the hash functions, the total number of bit, and the bit per element.

NewBloomFilter provides two parameters. The first one is entries that indicate your expected max number of element. The second one is err that indicate the false positive rate (error rate) you allows. Based on these two parameters, this package could find the optimal number for hash function and bit per element.

Install

Install this package through go get.

go get github.com/jdxyw/bloomfilter-go

Usage

The usage is quite simple.

package main

import (
	"fmt"
	bf "github.com/jdxyw/bloomfilter-go"
)

func main() {
    // The first parameter is your expected elements or expected max number of elements.
    // The second parameter is your expected collision error rate you can accept.
	b, _ := bf.NewBloomFilter(100000, 0.01)

	b.Set([]byte("Java"))
	b.Set([]byte("Python"))
	b.Set([]byte("Go"))
	b.Set([]byte("C++"))

	if b.Check([]byte("Python")) == true {
		fmt.Println("The Python is in this bloomfilter.")
	}
}

Benchmark

go test -bench=. ./benchmark/

BenchmarkEntries1000000Err005-8          8743465               155 ns/op
BenchmarkEntries2000000Err005-8          8170458               152 ns/op
BenchmarkEntries5000000Err005-8          6824294               183 ns/op
BenchmarkEntries5000000Err001-8          5236119               237 ns/op
BenchmarkEntries10000000Err005-8         5822541               218 ns/op
BenchmarkEntries50000000Err001-8         3174663               369 ns/op

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github/workflows		.github/workflows
benchmark		benchmark
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
bloomfilter.go		bloomfilter.go
bloomfilter_test.go		bloomfilter_test.go
go.mod		go.mod
murmur.go		murmur.go
murmur_test.go		murmur_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bloomfilter-go

Install

Usage

Benchmark

About

Releases 1

Packages

Languages

License

jdxyw/bloomfilter-go

Folders and files

Latest commit

History

Repository files navigation

bloomfilter-go

Install

Usage

Benchmark

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages