Skip to content

BNLP 3.1.0

Compare
Choose a tag to compare
@sagorbrur sagorbrur released this 24 Apr 13:42
5d34e0a

New Features

  • Updated word2vec for gensim 4.0 from gensim 3.8.3
  • Updated word2vec training modules with gensim identical parameter passing while training
  • Added new word2vec pre-trained model with vector size 100
  • Added pre-training function for resume training of word2vec
  • Updated fasttext training module with fasttext identical parameter passing while training
  • Added bin2vec function in fasttext for generating vector file from fasttext bin model
  • Updated corpus class with bengali letter, punctuation, digits
  • added stale app

Bug fixed

  • fixed Bengali nltk sentence tokenizer issue.
    PR-19 Bengali NLTK sentence tokenizer was wrongly tokenizing for . punctuation.

New Contributor