Skip to content

[Data Science Project] Indonesian Social Media Post Toxicity Classification

Notifications You must be signed in to change notification settings

christianwbsn/netifier

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Netifier

Overview

Indonesian Social Media Text Toxicity Classification. In summary, our contributions are:

  1. Created Indonesian social media post toxicity dataset (with 4 labels: pornography, racism, radicalism, and hate speech)
  2. Performed exploratory data analysis, data preprocessing, and modelling for toxicity content classification task
  3. Compared various machine learning model performance on this task

Project Organization

├── LICENSE
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- The original, immutable data dump.
└── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
 						  and short description

Contributors

  • Ahmad Izzan
  • Christian Wibisono
  • Ilham Firdausi Putra

About

[Data Science Project] Indonesian Social Media Post Toxicity Classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%