This project includes Python scripts for analysis privacy policy documents. The following text analysis methods have been used in this protect.
- Topic modeling
- Latent Dirichlet Allocation (LDA).
- Information extraction
- Semantic Role Labeling (SRL).
- Dependency Parsing.
- Named Entity Recognition (NER).
- Part-Of-Speech Tagger (POS Tagger).
- Text classification
- Universal Sentence Encodrder.
- Bidirectional Encoder Representations from Transformers (BERT).
- Universal Language Model Fine-tuning (ULMFiT).
- Support Vector Machine (SVM).
- Naive Bayes.