- Pre-processing: lowercasing, frequency of word count to limit vocabulary, NLTK Snowball stemming
- Unigram Feature
- Unigram and Bigram Features
- two layer with tanh activation layer, with sigmoid activation for final output (used BCE loss)
- Bi-directional LSTM with "max pooling" activation to a linear fully connected layer. Then, this is fed to final Sigmoid activation.
- Bi-directional LSTM with convolution and max pooling.