Roaring bitmaps in Go (golang)

Roaring bitmaps: A better compressed bitset

Last update: Jan 8, 2023

Comments: 17

roaring

This is a go version of the Roaring bitmap data structure.

Roaring bitmaps are used by several major systems such as Apache Lucene and derivative systems such as Solr and Elasticsearch, Apache Druid (Incubating), LinkedIn Pinot, Netflix Atlas, Apache Spark, OpenSearchServer, Cloud Torrent, Whoosh, Pilosa, Microsoft Visual Studio Team Services (VSTS), and eBay's Apache Kylin. The YouTube SQL Engine, Google Procella, uses Roaring bitmaps for indexing.

Roaring bitmaps are found to work well in many important applications:

Use Roaring for bitmap compression whenever possible. Do not use other bitmap compression methods (Wang et al., SIGMOD 2017)

The roaring Go library is used by

This library is used in production in several systems, it is part of the Awesome Go collection.

There are also Java and C/C++ versions. The Java, C, C++ and Go version are binary compatible: e.g, you can save bitmaps from a Java program and load them back in Go, and vice versa. We have a format specification.

This code is licensed under Apache License, Version 2.0 (ASL2.0).

When should you use a bitmap?

Sets are a fundamental abstraction in software. They can be implemented in various ways, as hash sets, as trees, and so forth. In databases and search engines, sets are often an integral part of indexes. For example, we may need to maintain a set of all documents or rows (represented by numerical identifier) that satisfy some property. Besides adding or removing elements from the set, we need fast functions to compute the intersection, the union, the difference between sets, and so on.

To implement a set of integers, a particularly appealing strategy is the bitmap (also called bitset or bit vector). Using n bits, we can represent any set made of the integers from the range [0,n): the ith bit is set to one if integer i is present in the set. Commodity processors use words of W=32 or W=64 bits. By combining many such words, we can support large values of n. Intersections, unions and differences can then be implemented as bitwise AND, OR and ANDNOT operations. More complicated set functions can also be implemented as bitwise operations.

When the bitset approach is applicable, it can be orders of magnitude faster than other possible implementation of a set (e.g., as a hash set) while using several times less memory.

However, a bitset, even a compressed one is not always applicable. For example, if the you have 1000 random-looking integers, then a simple array might be the best representation. We refer to this case as the "sparse" scenario.

When should you use compressed bitmaps?

An uncompressed BitSet can use a lot of memory. For example, if you take a BitSet and set the bit at position 1,000,000 to true and you have just over 100kB. That is over 100kB to store the position of one bit. This is wasteful even if you do not care about memory: suppose that you need to compute the intersection between this BitSet and another one that has a bit at position 1,000,001 to true, then you need to go through all these zeroes, whether you like it or not. That can become very wasteful.

This being said, there are definitively cases where attempting to use compressed bitmaps is wasteful. For example, if you have a small universe size. E.g., your bitmaps represent sets of integers from [0,n) where n is small (e.g., n=64 or n=128). If you are able to uncompressed BitSet and it does not blow up your memory usage, then compressed bitmaps are probably not useful to you. In fact, if you do not need compression, then a BitSet offers remarkable speed.

The sparse scenario is another use case where compressed bitmaps should not be used. Keep in mind that random-looking data is usually not compressible. E.g., if you have a small set of 32-bit random integers, it is not mathematically possible to use far less than 32 bits per integer, and attempts at compression can be counterproductive.

How does Roaring compares with the alternatives?

Most alternatives to Roaring are part of a larger family of compressed bitmaps that are run-length-encoded bitmaps. They identify long runs of 1s or 0s and they represent them with a marker word. If you have a local mix of 1s and 0, you use an uncompressed word.

There are many formats in this family:

Oracle's BBC is an obsolete format at this point: though it may provide good compression, it is likely much slower than more recent alternatives due to excessive branching.
WAH is a patented variation on BBC that provides better performance.
Concise is a variation on the patented WAH. It some specific instances, it can compress much better than WAH (up to 2x better), but it is generally slower.
EWAH is both free of patent, and it is faster than all the above. On the downside, it does not compress quite as well. It is faster because it allows some form of "skipping" over uncompressed words. So though none of these formats are great at random access, EWAH is better than the alternatives.

There is a big problem with these formats however that can hurt you badly in some cases: there is no random access. If you want to check whether a given value is present in the set, you have to start from the beginning and "uncompress" the whole thing. This means that if you want to intersect a big set with a large set, you still have to uncompress the whole big set in the worst case...

Roaring solves this problem. It works in the following manner. It divides the data into chunks of 2¹⁶ integers (e.g., [0, 2¹⁶), [2¹⁶, 2 x 2¹⁶), ...). Within a chunk, it can use an uncompressed bitmap, a simple list of integers, or a list of runs. Whatever format it uses, they all allow you to check for the present of any one value quickly (e.g., with a binary search). The net result is that Roaring can compute many operations much faster than run-length-encoded formats like WAH, EWAH, Concise... Maybe surprisingly, Roaring also generally offers better compression ratios.

References

Daniel Lemire, Owen Kaser, Nathan Kurz, Luca Deri, Chris O'Hara, François Saint-Jacques, Gregory Ssi-Yan-Kai, Roaring Bitmaps: Implementation of an Optimized Software Library, Software: Practice and Experience 48 (4), 2018 arXiv:1709.07821
Samy Chambi, Daniel Lemire, Owen Kaser, Robert Godin, Better bitmap performance with Roaring bitmaps, Software: Practice and Experience 46 (5), 2016. http://arxiv.org/abs/1402.6407 This paper used data from http://lemire.me/data/realroaring2014.html
Daniel Lemire, Gregory Ssi-Yan-Kai, Owen Kaser, Consistently faster and smaller compressed bitmaps with Roaring, Software: Practice and Experience 46 (11), 2016. http://arxiv.org/abs/1603.06549

Dependencies

Dependencies are fetched automatically by giving the -t flag to go get.

they include

github.com/willf/bitset
github.com/mschoch/smat
github.com/glycerine/go-unsnap-stream
github.com/philhofer/fwd
github.com/jtolds/gls

Note that the smat library requires Go 1.6 or better.

Installation

go get -t github.com/RoaringBitmap/roaring

Example

Here is a simplified but complete example:

package main

import (
    "fmt"
    "github.com/RoaringBitmap/roaring"
    "bytes"
)


func main() {
    // example inspired by https://github.com/fzandona/goroar
    fmt.Println("==roaring==")
    rb1 := roaring.BitmapOf(1, 2, 3, 4, 5, 100, 1000)
    fmt.Println(rb1.String())

    rb2 := roaring.BitmapOf(3, 4, 1000)
    fmt.Println(rb2.String())

    rb3 := roaring.New()
    fmt.Println(rb3.String())

    fmt.Println("Cardinality: ", rb1.GetCardinality())

    fmt.Println("Contains 3? ", rb1.Contains(3))

    rb1.And(rb2)

    rb3.Add(1)
    rb3.Add(5)

    rb3.Or(rb1)

    // computes union of the three bitmaps in parallel using 4 workers  
    roaring.ParOr(4, rb1, rb2, rb3)
    // computes intersection of the three bitmaps in parallel using 4 workers  
    roaring.ParAnd(4, rb1, rb2, rb3)


    // prints 1, 3, 4, 5, 1000
    i := rb3.Iterator()
    for i.HasNext() {
        fmt.Println(i.Next())
    }
    fmt.Println()

    // next we include an example of serialization
    buf := new(bytes.Buffer)
    rb1.WriteTo(buf) // we omit error handling
    newrb:= roaring.New()
    newrb.ReadFrom(buf)
    if rb1.Equals(newrb) {
    	fmt.Println("I wrote the content to a byte stream and read it back.")
    }
    // you can iterate over bitmaps using ReverseIterator(), Iterator, ManyIterator()
}

If you wish to use serialization and handle errors, you might want to consider the following sample of code:

	rb := BitmapOf(1, 2, 3, 4, 5, 100, 1000)
	buf := new(bytes.Buffer)
	size,err:=rb.WriteTo(buf)
	if err != nil {
		t.Errorf("Failed writing")
	}
	newrb:= New()
	size,err=newrb.ReadFrom(buf)
	if err != nil {
		t.Errorf("Failed reading")
	}
	if ! rb.Equals(newrb) {
		t.Errorf("Cannot retrieve serialized version")
	}

Given N integers in [0,x), then the serialized size in bytes of a Roaring bitmap should never exceed this bound:

8 + 9 * ((long)x+65535)/65536 + 2 * N

That is, given a fixed overhead for the universe size (x), Roaring bitmaps never use more than 2 bytes per integer. You can call BoundSerializedSizeInBytes for a more precise estimate.

64-bit Roaring

By default, roaring is used to stored unsigned 32-bit integers. However, we also offer an extension dedicated to 64-bit integers. It supports roughly the same functions:

package main

import (
    "fmt"
    "github.com/RoaringBitmap/roaring/roaring64"
    "bytes"
)


func main() {
    // example inspired by https://github.com/fzandona/goroar
    fmt.Println("==roaring64==")
    rb1 := roaring64.BitmapOf(1, 2, 3, 4, 5, 100, 1000)
    fmt.Println(rb1.String())

    rb2 := roaring64.BitmapOf(3, 4, 1000)
    fmt.Println(rb2.String())

    rb3 := roaring64.New()
    fmt.Println(rb3.String())

    fmt.Println("Cardinality: ", rb1.GetCardinality())

    fmt.Println("Contains 3? ", rb1.Contains(3))

    rb1.And(rb2)

    rb3.Add(1)
    rb3.Add(5)

    rb3.Or(rb1)



    // prints 1, 3, 4, 5, 1000
    i := rb3.Iterator()
    for i.HasNext() {
        fmt.Println(i.Next())
    }
    fmt.Println()

    // next we include an example of serialization
    buf := new(bytes.Buffer)
    rb1.WriteTo(buf) // we omit error handling
    newrb:= roaring64.New()
    newrb.ReadFrom(buf)
    if rb1.Equals(newrb) {
    	fmt.Println("I wrote the content to a byte stream and read it back.")
    }
    // you can iterate over bitmaps using ReverseIterator(), Iterator, ManyIterator()
}

Only the 32-bit roaring format is standard and cross-operable between Java, C++, C and Go. There is no guarantee that the 64-bit versions are compatible.

Documentation

Current documentation is available at http://godoc.org/github.com/RoaringBitmap/roaring and http://godoc.org/github.com/RoaringBitmap/roaring64

Goroutine safety

In general, it should not generally be considered safe to access the same bitmaps using different goroutines--they are left unsynchronized for performance. Should you want to access a Bitmap from more than one goroutine, you should provide synchronization. Typically this is done by using channels to pass the *Bitmap around (in Go style; so there is only ever one owner), or by using sync.Mutex to serialize operations on Bitmaps.

Coverage

We test our software. For a report on our test coverage, see

https://coveralls.io/github/RoaringBitmap/roaring?branch=master

Benchmark

Type

     go test -bench Benchmark -run -

To run benchmarks on Real Roaring Datasets run the following:

go get github.com/RoaringBitmap/real-roaring-datasets
BENCH_REAL_DATA=1 go test -bench BenchmarkRealData -run -

Iterative use

You can use roaring with gore:

go get -u github.com/motemen/gore
Make sure that $GOPATH/bin is in your $PATH.
go get github.com/RoaringBitmap/roaring

$ gore
gore version 0.2.6  :help for help
gore> :import github.com/RoaringBitmap/roaring
gore> x:=roaring.New()
gore> x.Add(1)
gore> x.String()
"{1}"

Fuzzy testing

You can help us test further the library with fuzzy testing:

     go get github.com/dvyukov/go-fuzz/go-fuzz
     go get github.com/dvyukov/go-fuzz/go-fuzz-build
     go test -tags=gofuzz -run=TestGenerateSmatCorpus
     go-fuzz-build github.com/RoaringBitmap/roaring
     go-fuzz -bin=./roaring-fuzz.zip -workdir=workdir/ -timeout=200

Let it run, and if the # of crashers is > 0, check out the reports in the workdir where you should be able to find the panic goroutine stack traces.

Alternative in Go

There is a Go version wrapping the C/C++ implementation https://github.com/RoaringBitmap/gocroaring

For an alternative implementation in Go, see https://github.com/fzandona/goroar The two versions were written independently.

Mailing list/discussion group

https://groups.google.com/forum/#!forum/roaring-bitmaps

Owner

Roaring bitmaps: A better compressed bitset

Roaring bitmaps are compressed bitmaps. They can be hundreds of times faster. (Picture credit: tambako)

https://github.com/RoaringBitmap/roaring http://roaringbitmap.org/

Comments

RunContainer32 implements a run-length

(interval based) uint32 set. Intersect and Union are complete and tested.

There is more to do to integrate with roaring.Bitmap, but this is a nice chunk of functionality. A good start.
Add allocation-free iterators

Currently, iterators are implemented by instantiating sub-iterators that run over the containers. This approach requires us to create many more temporary objects than necessary.

A much simpler approach is the one taken by the CRoaring library where no memory allocation is done... https://github.com/RoaringBitmap/CRoaring/blob/master/src/roaring.c#L1156

This should reduce the amount of code and reduce the pressure on the garbage collector, which should improve performance in real-life systems.
Implement hashing (#198)

This is a proof of concept for hashing roaring bitmaps.
It works by writing to a customizable hash.Hash, for each key, a sequence composed of the given key as an LE 16 bit number followed by the values represented by its matching container in the same format, followed by a 0x0000 delimiter.
Since no empty containers are kept in the bitmap there's no conflict with a container starting with the 0 value, because a separator following a key is simply impossible.
Because we store the key and the the values following it is impossible for two containers corresponding to different sets but with the same lower parts to end up passing the same input for the hasher, so any collisions that could happen are caused by the hashing algorithm itself, rather than by two different sets creating the same data.
Add 64 bit implementation of roaring.Bitmap
Fixes #136

This PR is very much a work-in-progress. I've tried to stay with the original 32 bit method signatures as much as possible. Serialization is explicitly not part of this PR.

[x] fix failing tests

[x] augment tests with 64 bit values

[x] verify copy-on-write
some perf tuning

faster newRunContainer32FromBitmapContainer; countTrailingZeros is done in assembly, replacing numberOfTrailingZeros.

there's more too do, but this is worthwhile to merge; it speeds up the newRunContainer constructor by 4x.
Unification of bitmap unserialization by using auxiliary byteInput approach
This pull request is not aimed to be merged, it should be treated as a proposal for enhancement.

Currently, bitmap package has two cases that I would like to consider:

Methods readFrom/fromBuffer are responsible for restoring bitmap's state from io.Reader and byte slice. They have pretty the same code base and each of them has own cons.

fromBuffer makes a user keep (to hold reference) the provided byte slice and also to leave it constant.

readFrom loses the performance due to the necessity to create objects from the read byte data.

This pull request tries to solve these issues. For unification was implemented the auxiliary class byteInput for io utils. Also, in roaringArray was added special field sentry that holds the references of allocated/type converted slices (here I am not sure about this approach, in theory, it should work and all we need to ask a user to keep the provided buf immutable).
Implement run containers

For better compression when there are long runs of consecutive values, the Java version has run containers (as of version 0.5). They should be implemented :

https://github.com/lemire/RoaringBitmap/blob/master/src/main/java/org/roaringbitmap/RunContainer.java

These can be created by the "runOptimize" function...

https://github.com/lemire/RoaringBitmap/blob/master/src/main/java/org/roaringbitmap/RoaringBitmap.java#L1286

These containers can improve the compression ratio in some cases.

Faster ParOr

This commit introduces a faster ParOr. The algorithm is described informally in #140.

Benchmark data shows significant improvement compared to previous algorithm.

Benchmarks run on 15' 2017 MacBook Pro Intel(R) Core(TM) i7-7700HQ CPU @ 2.80GHz

benchmark                                           old ns/op     new ns/op     delta
BenchmarkRealDataParOr/census-income_srt-8          269673        194486        -27.88%
BenchmarkRealDataParOr/census-income-8              395797        310505        -21.55%
BenchmarkRealDataParOr/census1881_srt-8             723821        402359        -44.41%
BenchmarkRealDataParOr/census1881-8                 718247        573550        -20.15%
BenchmarkRealDataParOr/dimension_003-8              7466247       2197374       -70.57%
BenchmarkRealDataParOr/dimension_008-8              3062717       833368        -72.79%
BenchmarkRealDataParOr/dimension_033-8              474489        302080        -36.34%
BenchmarkRealDataParOr/uscensus2000-8               1912128       1092854       -42.85%
BenchmarkRealDataParOr/weather_sept_85_srt-8        370585        178577        -51.81%
BenchmarkRealDataParOr/weather_sept_85-8            1272372       1142709       -10.19%
BenchmarkRealDataParOr/wikileaks-noquotes_srt-8     290843        123167        -57.65%
BenchmarkRealDataParOr/wikileaks-noquotes-8         330745        152330        -53.94%

The previous algorithm is exposed in the public API as ParHeapOr somehow resembling the convention in fastaggregation.go.

Resolves #140

It's the second revision of the pull request. It fixed bugs found in #164

I've compared the cardinalities on /Real data/ dataset https://gist.github.com/maciej/add052d10c5edad2eb74eceda7484f07 semin-manually. They are the same, so I'm leaving #169 for later

Support CRoaring's frozen format for interoperability

Doing this would mean the pure Go implementation can be used to create bitmaps to later be used by a more restricted environment, such as one using Python or C, while still being able to memory map the contents. Using the pure Go version introduces several advantages versus using GoCRoaring, such as better memory accounting and parallel operations, as well as the ability to use scratch-images when deploying in Docker. Is there any interest in this functionality? I could try to implement it, I just want to know if it'll be rejected in principle.

Note this isn't about performance and there will be a penalty; there would be a comment in the documentation noting that this should only be used for interoperability, never where performance matters.
runContainer16/32 and msgpack2/snappy serialization
https://github.com/RoaringBitmap/roaring/pull/70 had the problem that the tests wouldn't all pass because the serialization wasn't done for runContainer16. In this serialization_msgp branch I finished it off using the serialization methods (msgpack2 for serialization and snappy for compression) discussed in https://github.com/RoaringBitmap/roaring/issues/76.

Fixes #37 Fixes #76

All tests are green. From the checklist on #70:

[x] serialization of Bitmaps including the runContainer is done in this PR.

[x] rangeOfOnes should generate a run container (Done: https://github.com/glycerine/roaring/pull/1)

[x] Implement RunOptimize

[x] Add isFull method; and

[ ] convert full bitmaps to run containers when they are full

[ ] Upgrade unit tests so that they call RunOptimize, including the smat tests

[ ] Make sure we pass the updated unit tests

these routines still need tests for the runContainer implementation

[ ] fillLeastSignificant16bits()

[ ] getShortIterator()

[ ] getSizeInBytes()

[ ] iaddRange()

[ ] iremoveRange()

I open this as a distinct branch in case nobody wants the msgpack2/snappy serialization.

Of course I think it's great. But if there are back-wards compatibility concerns then it may be a non-starter.

Use Go 1.9 math/bits package

This is a quick PR replacing bodies of popcount and countTrailingZeros functions to use Go 1.9 math/bits. Approaches fixing #101.

@lemire are there any other functions you had in mind to replace?

I'll post results of running go test -bench Benchmark -run - on relevant (of the Roaring functions that use the popcnt and countTrailingZeros) benchmarks from the current master and the PR branch soon. Do you think it's sufficient or should I write some specific benchmarks?

I have not inlined the popcount and countTrailingZeros functions themselves.

popcount gets inlined in popcntSlice:

TEXT %22%22.popcntSlice(SB) gofile../Users/maciejb/gowork/src/github.com/RoaringBitmap/roaring/popcnt.go
  popcnt.go:10		0x4b630			65488b0c2500000000	MOVQ GS:0, CX			[5:9]R_TLS_LE
  popcnt.go:10		0x4b639			483b6110		CMPQ 0x10(CX), SP		
  popcnt.go:10		0x4b63d			0f8685000000		JBE 0x4b6c8			
  popcnt.go:10		0x4b643			4883ec30		SUBQ $0x30, SP			
  popcnt.go:10		0x4b647			48896c2428		MOVQ BP, 0x28(SP)		
  popcnt.go:10		0x4b64c			488d6c2428		LEAQ 0x28(SP), BP		
  popcnt.go:10		0x4b651			31c0			XORL AX, AX			
  popcnt.go:10		0x4b653			488b4c2438		MOVQ 0x38(SP), CX		
  popcnt.go:10		0x4b658			31d2			XORL DX, DX			
  popcnt.go:12		0x4b65a			eb0a			JMP 0x4b666			
  popcnt.go:12		0x4b65c			4883c108		ADDQ $0x8, CX			
  popcnt.go:12		0x4b660			48ffc0			INCQ AX				
  popcnt.go:13		0x4b663			4801f2			ADDQ SI, DX			
  popcnt.go:12		0x4b666			488b5c2440		MOVQ 0x40(SP), BX		
  popcnt.go:12		0x4b66b			4839d8			CMPQ BX, AX			
  popcnt.go:12		0x4b66e			7d49			JGE 0x4b6b9			
  popcnt.go:12		0x4b670			488b31			MOVQ 0(CX), SI			
  popcnt.go:7		0x4b673			0fb63d00000000		MOVZX 0(IP), DI			[3:7]R_PCREL:runtime.support_popcnt
  popcnt.go:7		0x4b67a			4084ff			TESTL DI, DI			
  popcnt.go:7		0x4b67d			7407			JE 0x4b686			
  popcnt.go:7		0x4b67f			f3480fb8f6		POPCNT SI, SI			
  popcnt.go:7		0x4b684			ebd6			JMP 0x4b65c			
  popcnt.go:7		0x4b686			4889442418		MOVQ AX, 0x18(SP)		
  popcnt.go:7		0x4b68b			4889542410		MOVQ DX, 0x10(SP)		
  popcnt.go:7		0x4b690			48894c2420		MOVQ CX, 0x20(SP)		
  popcnt.go:7		0x4b695			48893424		MOVQ SI, 0(SP)			
  popcnt.go:7		0x4b699			e800000000		CALL 0x4b69e			[1:5]R_CALL:math/bits.OnesCount64
  popcnt.go:7		0x4b69e			488b742408		MOVQ 0x8(SP), SI		
  popcnt.go:7		0x4b6a3			488b442418		MOVQ 0x18(SP), AX		
  popcnt.go:7		0x4b6a8			488b4c2420		MOVQ 0x20(SP), CX		
  popcnt.go:7		0x4b6ad			488b542410		MOVQ 0x10(SP), DX		
  popcnt.go:7		0x4b6b2			488b5c2440		MOVQ 0x40(SP), BX		
  popcnt.go:7		0x4b6b7			eba3			JMP 0x4b65c			
  popcnt.go:15		0x4b6b9			4889542450		MOVQ DX, 0x50(SP)		
  popcnt.go:15		0x4b6be			488b6c2428		MOVQ 0x28(SP), BP		
  popcnt.go:15		0x4b6c3			4883c430		ADDQ $0x30, SP			
  popcnt.go:15		0x4b6c7			c3			RET				
  popcnt.go:10		0x4b6c8			e800000000		CALL 0x4b6cd			[1:5]R_CALL:runtime.morestack_noctxt
  popcnt.go:10		0x4b6cd			e95effffff		JMP %22%22.popcntSlice(SB)

countTrailingZeros gets inlined even "deeper" with no calls to either countTrailingZeros or bits.TrailingZeros64 in the output assembly:

TEXT %22%22.(*bitmapContainer).minimum(SB) gofile../Users/maciejb/gowork/src/github.com/RoaringBitmap/roaring/bitmapcontainer.go
  bitmapcontainer.go:50	0x3f291			488b442408		MOVQ 0x8(SP), AX	
  bitmapcontainer.go:50	0x3f296			31c9			XORL CX, CX		
  bitmapcontainer.go:51	0x3f298			eb03			JMP 0x3f29d		
  bitmapcontainer.go:51	0x3f29a			48ffc1			INCQ CX			
  bitmapcontainer.go:51	0x3f29d			488b5008		MOVQ 0x8(AX), DX	
  bitmapcontainer.go:51	0x3f2a1			488b5810		MOVQ 0x10(AX), BX	
  bitmapcontainer.go:51	0x3f2a5			4839d9			CMPQ BX, CX		
  bitmapcontainer.go:51	0x3f2a8			7d27			JGE 0x3f2d1		
  bitmapcontainer.go:52	0x3f2aa			488b14ca		MOVQ 0(DX)(CX*8), DX	
  bitmapcontainer.go:53	0x3f2ae			4885d2			TESTQ DX, DX		
  bitmapcontainer.go:53	0x3f2b1			74e7			JE 0x3f29a		
  ctz.go:40		0x3f2b3			480fbcc2		BSFQ DX, AX		
  bitmapcontainer.go:55	0x3f2b7			48c1e106		SHLQ $0x6, CX		
  ctz.go:40		0x3f2bb			480fbcd2		BSFQ DX, DX		
  ctz.go:40		0x3f2bf			ba40000000		MOVL $0x40, DX		
  ctz.go:40		0x3f2c4			480f44c2		CMOVE DX, AX		
  bitmapcontainer.go:55	0x3f2c8			4801c8			ADDQ CX, AX		
  bitmapcontainer.go:55	0x3f2cb			6689442410		MOVW AX, 0x10(SP)	
  bitmapcontainer.go:55	0x3f2d0			c3			RET			
  bitmapcontainer.go:58	0x3f2d1			66c7442410ffff		MOVW $-0x1, 0x10(SP)	
  bitmapcontainer.go:58	0x3f2d8			c3			RET

External-memory roaring data structure

Hello. I know that FromBuffer/FrozenView methods allow read from MMAP-file. But how to create such MMAP-file in roaring format: i f my bitmap > RAM? (or if I would like enforce some RAM usage limit)
Add CodeQL workflow for GitHub code scanning
Hi RoaringBitmap/roaring!

This is a one-off automatically generated pull request from LGTM.com :robot:. You might have heard that we’ve integrated LGTM’s underlying CodeQL analysis engine natively into GitHub. The result is GitHub code scanning!

With LGTM fully integrated into code scanning, we are focused on improving CodeQL within the native GitHub code scanning experience. In order to take advantage of current and future improvements to our analysis capabilities, we suggest you enable code scanning on your repository. Please take a look at our blog post for more information.

This pull request enables code scanning by adding an auto-generated codeql.yml workflow file for GitHub Actions to your repository — take a look! We tested it before opening this pull request, so all should be working :heavy_check_mark:. In fact, you might already have seen some alerts appear on this pull request!

Where needed and if possible, we’ve adjusted the configuration to the needs of your particular repository. But of course, you should feel free to tweak it further! Check this page for detailed documentation.

Questions? Check out the FAQ below!

FAQ

Click here to expand the FAQ section

How often will the code scanning analysis run?

By default, code scanning will trigger a scan with the CodeQL engine on the following events:

On every pull request — to flag up potential security problems for you to investigate before merging a PR.

On every push to your default branch and other protected branches — this keeps the analysis results on your repository’s Security tab up to date.

Once a week at a fixed time — to make sure you benefit from the latest updated security analysis even when no code was committed or PRs were opened.

What will this cost?

Nothing! The CodeQL engine will run inside GitHub Actions, making use of your unlimited free compute minutes for public repositories.

What types of problems does CodeQL find?

The CodeQL engine that powers GitHub code scanning is the exact same engine that powers LGTM.com. The exact set of rules has been tweaked slightly, but you should see almost exactly the same types of alerts as you were used to on LGTM.com: we’ve enabled the security-and-quality query suite for you.

How do I upgrade my CodeQL engine?

No need! New versions of the CodeQL analysis are constantly deployed on GitHub.com; your repository will automatically benefit from the most recently released version.

The analysis doesn’t seem to be working

If you get an error in GitHub Actions that indicates that CodeQL wasn’t able to analyze your code, please follow the instructions here to debug the analysis.

How do I disable LGTM.com?

If you have LGTM’s automatic pull request analysis enabled, then you can follow these steps to disable the LGTM pull request analysis. You don’t actually need to remove your repository from LGTM.com; it will automatically be removed in the next few months as part of the deprecation of LGTM.com (more info here).

Which source code hosting platforms does code scanning support?

GitHub code scanning is deeply integrated within GitHub itself. If you’d like to scan source code that is hosted elsewhere, we suggest that you create a mirror of that code on GitHub.

How do I know this PR is legitimate?

This PR is filed by the official LGTM.com GitHub App, in line with the deprecation timeline that was announced on the official GitHub Blog. The proposed GitHub Action workflow uses the official open source GitHub CodeQL Action. If you have any other questions or concerns, please join the discussion here in the official GitHub community!

I have another question / how do I get in touch?

Please join the discussion here to ask further questions and send us suggestions!

[Draft] Add support for custom allocators

TODO: Its error prone to require allocator to be non-nil. Could just do a nil check on each usage so that users instantiating bitmap like &Bitmap{} aren't broken. Will address if we decide to move forward with this approach.

Benchmark Before

goos: darwin
goarch: amd64
pkg: github.com/RoaringBitmap/roaring
cpu: Intel(R) Core(TM) i7-8569U CPU @ 2.80GHz
BenchmarkRepeatedSparseSerialization-8   	 3732640	       316.4 ns/op	      96 B/op	       5 allocs/op
PASS
ok  	github.com/RoaringBitmap/roaring	1.849s

Benchmark After

goos: darwin
goarch: amd64
pkg: github.com/RoaringBitmap/roaring
cpu: Intel(R) Core(TM) i7-8569U CPU @ 2.80GHz
BenchmarkRepeatedSparseSerialization-8   	 5236890	       220.4 ns/op	      24 B/op	       1 allocs/op
PASS
ok  	github.com/RoaringBitmap/roaring	1.575s

Optimize generating many bitsets for serialization
One of our heaviest workloads uses this library to index a lot of ingested data in real time. In one part of the workload we basically have dozens of goroutines doing something like the following in a loop:

postings := roaring.NewBitmap() for _, thing := range things { postings.Clear() for _, row := range $MATCHING_ROWS { postings.Add(row) } postings.WriteTo() }

We've already heavily optimized the code that generates things to be very efficient and avoid allocations/pointers, however, when its time to actually serialize the index we're building we have to go through this library and it allocates like crazy even though we're trying to reuse a single datastructure.

I've provided a benchmark that demonstrates this along with some improvements in this P.R: https://github.com/RoaringBitmap/roaring/pull/364

I realize that adding pooling of internal datastructures may be a big change, but I tried to structure the P.R so that its completely "opt in" via the new ClearRetainDatastructures() API.

EDIT: I can't provide screenshots of profiling for obvious reasons, but I just went back and checked and the allocations removed by this P.R represent ~ 56% of all allocations in our workload so this will have a very substantial performance impact for us (and others I expect who use the new API).

Optimize repeatedly serializing array bitmaps

Benchmark before

goos: darwin
goarch: amd64
pkg: github.com/RoaringBitmap/roaring
cpu: Intel(R) Core(TM) i7-8569U CPU @ 2.80GHz
BenchmarkRepeatedSparseSerialization-8   	 3732640	       316.4 ns/op	      96 B/op	       5 allocs/op
PASS
ok  	github.com/RoaringBitmap/roaring	1.849s

Benchmark after

goos: darwin
goarch: amd64
pkg: github.com/RoaringBitmap/roaring
cpu: Intel(R) Core(TM) i7-8569U CPU @ 2.80GHz
BenchmarkRepeatedSparseSerialization-8   	 6733712	       178.6 ns/op	       0 B/op	       0 allocs/op
PASS
ok  	github.com/RoaringBitmap/roaring	1.760s

So a little less than 2x as fast, but zero allocation now as well.

Fault When Creating Frozen Bitmap From MMapped
We have been trying to use Frozen Bitmaps for our internal system and were met with unanticipated faults. After working with @a392n328 we were able to write up this minimal failure, which is in #362. The actual error is

=== RUN TestFrozenCases unexpected fault address 0x15fb008 fatal error: fault [signal SIGBUS: bus error code=0x2 addr=0x15fb008 pc=0x11ed56a]

I can't figure out what exactly could be causing this.

Sroar - 64-bit Roaring Bitmaps in Go

sroar: Serialized Roaring Bitmaps This is a fork of dgraph-io/sroar, being maint

Dec 8, 2022

CLRS study. Codes are written with golang.

algorithms CLRS study. Codes are written with golang. go version: 1.11 Heap BinaryHeap on array BinaryHeap on linkedlist LeftistHeap FibonacciHeap Tre

Dec 26, 2022

Golang string comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

Go-edlib : Edit distance and string comparison library Golang string comparison and edit distance algorithms library featuring : Levenshtein, LCS, Ham

Dec 20, 2022

Roaring bitmaps in Go (golang)

roaring

When should you use a bitmap?

When should you use compressed bitmaps?

How does Roaring compares with the alternatives?

References

Dependencies

Installation

Example

64-bit Roaring

Documentation

Goroutine safety

Coverage

Benchmark

Iterative use

Fuzzy testing

Alternative in Go

Mailing list/discussion group

Owner

Roaring bitmaps: A better compressed bitset

Comments

RunContainer32 implements a run-length

Add allocation-free iterators

Implement hashing (#198)

Add 64 bit implementation of roaring.Bitmap

some perf tuning

Unification of bitmap unserialization by using auxiliary byteInput approach

Implement run containers

Faster ParOr

Support CRoaring's frozen format for interoperability

runContainer16/32 and msgpack2/snappy serialization

Use Go 1.9 math/bits package

External-memory roaring data structure

Add CodeQL workflow for GitHub code scanning

FAQ

How often will the code scanning analysis run?

What will this cost?

What types of problems does CodeQL find?

How do I upgrade my CodeQL engine?

The analysis doesn’t seem to be working

How do I disable LGTM.com?

Which source code hosting platforms does code scanning support?

How do I know this PR is legitimate?

I have another question / how do I get in touch?

[Draft] Add support for custom allocators

Optimize generating many bitsets for serialization

Optimize repeatedly serializing array bitmaps

Fault When Creating Frozen Bitmap From MMapped

Related tags

Sroar - 64-bit Roaring Bitmaps in Go

CLRS study. Codes are written with golang.

Golang string comparison and edit distance algorithms library, featuring : Levenshtein, LCS, Hamming, Damerau levenshtein (OSA and Adjacent transpositions algorithms), Jaro-Winkler, Cosine, etc...

Gota: DataFrames and data wrangling in Go (Golang)

A simple Set data structure implementation in Go (Golang) using LinkedHashMap.

skiplist for golang

An in-memory string-interface{} map with various expiration options for golang

High-performance minimalist queue implemented using a stripped-down lock-free ringbuffer, written in Go (golang.org)

Double-ARray Trie System for golang

Golang implementation of Radix trees

Golang library for reading and writing Microsoft Excel™ (XLSX) files.

HyperLogLog and HyperLogLog++ implementation in Go/Golang.

Golang library for querying and parsing OFX

Go (golang) library for reading and writing XLSX files.

Golang FrodoKEM implementation

Golang LRU cache

Meow hash for Golang

A Golang lock-free thread-safe HashMap optimized for fastest read access.

Trie data structure implementation in Golang 🌳