Releases: valeriansaliou/sonic
Releases · valeriansaliou/sonic
Sonic v1.3.5
- Rolled back
rocksdb
version, as the latest version does not link properly in--release
mode [@valeriansaliou].
Sonic v1.3.4
- Dependencies have been bumped to latest versions (namely:
rocksdb
,clap
,regex
) [@valeriansaliou].
Sonic v1.3.3
- Dependencies have been bumped to latest versions (namely:
hashbrown
,whatlang
,regex
) [@valeriansaliou]. - Moved the release pipeline to GitHub Actions [@valeriansaliou].
- The language detection system is now about 2x faster (due to the upgrade of
whatlang
pastv0.14.0
) [@valeriansaliou]. - Added Armenian stopwords [@valeriansaliou].
- Added Georgian stopwords [@valeriansaliou].
- Added Gujarati stopwords [@valeriansaliou].
- Added Tagalog stopwords [@valeriansaliou].
Sonic v1.3.2
- Fixed Norwegian stopwords [@valeriansaliou, #239].
- Code has been formatted according to
clippy
recommendations. This does not change the way Sonic behaves [@pleshevskiy, #233]. - Added support for Chinese word segmentation in tokenizer (note that as this adds quite some size overhead to the final binary size, the feature
tokenizer-chinese
can be disabled when building Sonic) [@vincascm, #209].
Sonic v1.3.1
- Apple Silicon is now supported [@valeriansaliou].
- Added Norwegian stopwords [@mikalv, #236].
- Added Catalan stopwords [@coopanio, #227].
- Dependencies have been bumped to latest versions (namely:
rocksdb
,fst-levenshtein
,fst-regex
,hashbrown
,whatlang
,byteorder
,rand
) [@valeriansaliou]. - A few rarely-used languages have been removed, following
whatlang
v0.12.0
release, see the notes here [@valeriansaliou, 940d3c3].
Sonic v1.3.0
- Added support for Slovak, which is now auto-detected from terms [@valeriansaliou, 19412ce].
- Added Slovak stopwords [@valeriansaliou, 19412ce].
- Dependencies have been bumped to latest versions (namely:
whatlang
) [@valeriansaliou, 19412ce].
Sonic v1.2.4
- Fixed multiple deadlocks, which where not noticed in practice by running Sonic at scale, but that are still theoretically possible [@BurtonQin, #213, #211].
- Added support for Latin, which is now auto-detected from terms [@valeriansaliou, e6c5621].
- Added Latin stopwords [@valeriansaliou, e6c5621].
- Dependencies have been bumped to latest versions (namely:
rocksdb
,radix
,hashbrown
,whatlang
) [@valeriansaliou]. - Added a release script, with cross-compilation capabilities (currently for the
x86_64
architecture, dynamically linked against GNU libraries) [@valeriansaliou, 961bab9].
Sonic v1.2.3
- RocksDB compression algorithm has been changed from LZ4 to Zstandard, for a slightly better compression ratio, and much better read/write performance; this will be used for new SST files only [@valeriansaliou, cd4cdfb].
- Dependencies have been bumped to latest versions (namely:
rocksdb
) [@valeriansaliou, cd4cdfb].
Sonic v1.2.2
- Fixed a regression on optional configuration values not working anymore, due to an issue in the environment variable reading system introduced in
v1.2.1
[@valeriansaliou, #155]. - Optimized some aspects of FST consolidation and pending operations management [@valeriansaliou, #156].
Sonic v1.2.1
- FST graph consolidation is now able to ignore new words when the graph is over configured limits, which are set with the new
store.fst.graph.max_size
andstore.fst.graph.max_words
configuration variables [@valeriansaliou, 53db9c1]. - An integration testing infrastructure has been added to the Sonic automated test suite [@vilunov, #154].
- Configuration values can now be sourced from environment variables, using the
${env.VARIABLE}
syntax inconfig.cfg
[@perzanko, #148]. - Dependencies have been bumped to latest versions (namely:
rand
,radix
andhashbrown
) [@valeriansaliou, c1b1f54].