Releases: alexandrainst/danlp
v0.1.2: Models for Hate Speech detection and classification
✨ Features
- Add BERT HateSpeech model (detection of offensive language & classification of hate speech)
- Add ELECTRA Offensive model (detection of offensive language)
v0.1.1: Datasets and model for Named Entity Disambiguation
✨ Features
- Add DaNED dataset
- Add DaWikiNED dataset
- Add XLM-R NED model
⚠️ Upgrades
- upgraded package versions for dependencies:
  - gensim==3.8.3
  - flair==0.9
v0.1.0: Model for HateSpeech detection and new datasets
✨ Features
- Add DaUnimorph dataset
- Add AngryTweets dataset
- Add BERT Offensive model
⚠️ Upgrades
- default Python version for development is now 3.8
- upgraded package versions for dependencies:
  - torch==1.7.1
  - flair==0.8
  - transformers==4.3.3
  - allennlp==2.5.0
v0.0.12: New dataset for Hate Speech and new model for coreference resolution
✨ Features
- Add DKHate dataset for hate speech detection
- Add a feature for finding domains with DanNet
- Add a model for coreference resolution
🐛 Bugfixes
- #108 Fix download of datasets/models to a custom directory
- Fix unclosed-file warning in DaCoref
- Fix UserWarning "Maximum length exceeded" with BERT
v0.0.11: New datasets and added features for using BERT model
✨ Features
- Add DaCoref dataset for coreference resolution
- Add the DanNet wordnet to make it easy to find e.g. synonyms
- Add a pre-trained BERT model to use for embeddings, masked-word prediction and next-sentence prediction
- Update BERT NER predict methods to provide the tags combined
🐛 Bugfixes
- #80 Fix bug with missing argument when loading BERT model
- Update requirements to fix mistakes in Flair
v.0.0.10: Permanent URL for storage host and new spaCy models
✨ New features
- Add spaCy sentiment model with the tags "positive", "neutral" and "negative", trained using hard distillation of the "Tone BERT" model
- Add spaCy function for noun phrase chunking using the dependency model in spaCy
- Add predict proba method in BERT models for tone and emotions
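As a rough illustration of the hard distillation mentioned for the spaCy sentiment model: the teacher's predicted probabilities are reduced to one label per text, and the student is then trained on those labels as if they were annotations. The sketch below is not DaNLP code. The function name `hard_labels` and the probability values are hypothetical and made up for the example.

```python
# Hypothetical sketch of hard-label distillation; not part of the DaNLP API.
LABELS = ["positive", "neutral", "negative"]


def hard_labels(teacher_probs, labels=LABELS):
    """Reduce each row of teacher probabilities to its single most likely label."""
    result = []
    for row in teacher_probs:
        best = max(range(len(row)), key=lambda i: row[i])
        result.append(labels[best])
    return result


# Made-up teacher outputs for three texts (one probability per label).
teacher_probs = [
    [0.80, 0.15, 0.05],
    [0.10, 0.70, 0.20],
    [0.05, 0.25, 0.70],
]

# These labels would serve as training targets for the student model.
student_targets = hard_labels(teacher_probs)
print(student_targets)
```

The student never sees the teacher's full probability distribution, only the chosen label, which is what distinguishes hard distillation from the soft variant.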
🔍 Minor updates
- Made it possible to choose any static embeddings from DaNLP with the Flair embeddings
- Allowed sequences longer than 512 in BERT sentiment models, and added a warning for doing so
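The "allow but warn" behaviour for long sequences might look roughly like the sketch below. It is not the DaNLP implementation: the helper `check_length` is hypothetical, and it assumes the 512 limit counts tokens, which the release note does not specify.

```python
import warnings

MAX_LENGTH = 512  # limit named in the release note; assumed here to count tokens


def check_length(tokens, max_length=MAX_LENGTH):
    """Return the tokens unchanged, emitting a UserWarning if they exceed max_length."""
    if len(tokens) > max_length:
        warnings.warn(
            f"Sequence length {len(tokens)} exceeds {max_length}.",
            UserWarning,
        )
    return tokens


# A short input passes silently; a long one is still accepted but triggers a warning.
check_length(["hej"] * 10)
check_length(["hej"] * 600)
```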
🏨 Storage
- Updated to a permanent URL for storage of models and datasets: http://danlp-downloads.alexandra.dk
v0.0.9: Change hosting storage of models and datasets
🏨 Changes in storage
- The hosting storage is moved from Amazon to local hosting
v0.0.8: More sentiment models and datasets
✨ Features
- Add a first version of the BERT Tone model, which includes a model for polarity (positive, neutral, negative) and a model for analytic (objective, subjective)
- Add a small Twitter dataset annotated with sentiment, and an integration to download tweets from Twitter by ID
- Add a small dataset from Europarl annotated with sentiment
v0.0.6: NER support
✨ Features
- Add the updated DaNE dataset
- Add spaCy model for DEP, NER and POS
- Add finetuned M-BERT and Danish BERT for NER
- Add DSL Reddit word embeddings
🐛 Bugfixes
- Fix bug #13 loading wikiann as spaCy dataset.
- Add a new Flair POS model to support the newest Flair and PyTorch
v0.0.5: More word embeddings
✨ Features
- Add the word similarity datasets DSD and the Danish WS353
- Add Sketch Engine fastText word embeddings