Bangla Sentiment Dataset (Labeled)

Description

A corpus of 300,000 (full dataset) Bangla sentences labeled according to their sentiment (positive, negative). Currently, only 4000 sentences are available in this repository. If you need the full version, please don't hesitate to drop us an email. The sentences were collected from social media sites, blogs and news portals. It can be used to train Sentiment Analysis systems. It has a separate training set (200,000 sentences) and test set (100,000 sentences). This dataset is large enough to apply Deep Learning in Bangla Sentiment Analysis, as Deep Learning models inherently require more data to be useful in practice.

Data Fromat

The corpus is released in plain text format. It has two seperate files (bangla.pos, bangla.neg) for positive and negative labeled sentences

How To Get The Full Version

If you need the full version, we can arrange a way to send the dataset to you. Please email at contact@socian.ai

License

The corpus is licensed under GNU GPLv3, making it very easy to anyone to use the data for any purpose.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
bangla.neg		bangla.neg
bangla.pos		bangla.pos
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bangla Sentiment Dataset (Labeled)

Description

Data Fromat

How To Get The Full Version

License

About

Releases

Packages

Contributors 2

socian-ai/socian-bangla-sentiment-dataset-labeled

Folders and files

Latest commit

History

Repository files navigation

Bangla Sentiment Dataset (Labeled)

Description

Data Fromat

How To Get The Full Version

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages