Banglish Sentiment Dataset (Unlabeled)

Description

A corpus of 300,000 (full dataset) Banglish sentences (eg. 'আমার দেশ' writtern as 'amar desh'). Currently, only 50,000 sentences are available in this repository. If you need the full version, please don't hesitate to drop us an email. The sentences were collected from social media sites, blogs and news portal comments. It can be used to train Sentiment Analysis systems. This dataset can be used to train unsupervised learning algorithms.

Data Fromat

The corpus is released in excel and csv format.

How To Get The Full Version

If you need the full version, we can arrange a way to send the dataset to you. Please email at contact@socian.ai

License

The corpus is licensed under GNU GPLv3, making it very easy to anyone to use the data for any purpose.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

readme.md

Banglish Sentiment Dataset (Unlabeled)

Description

Data Fromat

How To Get The Full Version

License

Files

readme.md

Latest commit

History

readme.md

File metadata and controls

Banglish Sentiment Dataset (Unlabeled)

Description

Data Fromat

How To Get The Full Version

License