A tool for modeliing documents containing emojis.
This project partially depends on python, R and emoji2vec, which has been added as a git submodule in this project. In order to install all dependencies, you need to:
- Have mini conda installed and executable your command line prompt.
- Create the
feiwen
python virtual environment:conda env create -f environment.yml
- git submodule update --init --recursive (This pulls emoji2vec into libs/emoji2vec)
- Get some data, the format should be
{
"id": STRING,
"text": "....",
"sentiment": "POS" || "NEG",
"emoji_class": "+" || "-"
}
- Split the data by emoji class
- Get emoji2vec vectors
- Generate doc2vec based on (2) and (3)
- Visualise the data in 2 by something like tsne