Releases: apache/datasketches-cpp
datasketches-cpp-3.3.0
- several fixes with respect to allocations using a provided allocator instance
- fixes and improvements in cmake files for including DataSketches as dependency in other projects
- Tuple sketch serial version 3 for compatibility with Java
- support for older serialization versions of Theta sketch
- added reset() method in Theta and Tuple sketch and union
- minor changes to some corner cases of Theta and Tuple intersection and a-not-b operations
Known problems:
- support for older serialization versions of Theta sketch is incomplete: deserialize from bytes does not handle old versions
- REQ sketch get_PMF() has undefined behavior for empty sketches (can crash). Check is_empty() before calling get_PMF()
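The workaround for the get_PMF() problem can be wrapped in a small helper. This is a minimal sketch, not library code: it assumes the sketch type exposes is_empty() and a get_PMF(const T*, uint32_t) overload returning a vector-like container of doubles, which should be checked against the headers of the release in use. The names pmf_or_empty and fake_sketch are hypothetical; fake_sketch is a stand-in so the example compiles without the library.

```cpp
#include <cstdint>
#include <vector>

// Guard for the known problem: never call get_PMF() on an empty sketch.
// Assumption: Sketch provides is_empty() and get_PMF(const T*, uint32_t).
template <typename Sketch, typename T>
std::vector<double> pmf_or_empty(const Sketch& sketch,
                                 const std::vector<T>& split_points) {
  if (sketch.is_empty()) {
    return {};  // empty result instead of undefined behavior
  }
  const auto pmf = sketch.get_PMF(split_points.data(),
                                  static_cast<uint32_t>(split_points.size()));
  // Copy so the helper works even if the sketch uses a custom allocator.
  return std::vector<double>(pmf.begin(), pmf.end());
}

// Hypothetical stand-in type, used only to exercise the helper here.
struct fake_sketch {
  bool empty;
  bool is_empty() const { return empty; }
  std::vector<double> get_PMF(const float*, uint32_t size) const {
    // Uniform mass over size + 1 intervals, purely illustrative.
    return std::vector<double>(size + 1, 1.0 / (size + 1));
  }
};
```

With a real REQ sketch the call site would pass the sketch object in place of fake_sketch; an empty returned vector then signals that there was no data to query.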
3.2.0
This version includes the following changes:
- Fix issue #236, a serialization bug in the KLL sketch
- Refactored Python to remove pybind11 as a submodule. It is now a dependency only for building the package
- Updated LICENSE file to reflect how pybind11 is used
- Added convenience binaries for Python available from https://pypi.org/project/datasketches
datasketches-cpp-3.1.0
- Kolmogorov-Smirnov test for KLL sketch
- custom seed support in Theta Jaccard similarity
- Theta union bug fix
- added get_max_serialized_size_bytes method for KLL and CPC sketches
- added wrapped_compact_theta_sketch to avoid some cost of deserialization
- massive code cleanup to avoid compiler warnings
- iterator fix in KLL sketch
- iterator fix in REQ sketch
- exception safety fix in theta_update_sketch_base
- misaligned access fix in MurmurHash3
datasketches-cpp-3.0.0
- Introduction of new Relative Error Quantiles Sketch
- Added Tuple sketch and rewrote Theta sketch to share the same base
- Performance improvement of HLL sketch
- Removed serialization of Update Theta sketch and Union, and HLL Union
- Added support for passing instances of allocators
Apache Release 2.1.0-incubating
- fixed potential crash when querying KLL with complex types
- added vector_of_kll to Python
- added help text to all Python methods
Apache Release 2.0.0-incubating
- header-only library
- fully allocator-aware
- exception-safe
- varopt sampling added
- API changes for consistency
Apache Release 1.0.0-incubating
The first release.
- KLL quantiles sketch
- Frequent items sketch
- CPC distinct-counting sketch
- Theta distinct-counting sketch with set operations
- HLL distinct-counting sketch