Aims to build and test classification models to predict salaries from the text contained in the job description.
This project will create classification models to predict high (75th percentile and above) or low (below 75th percentile) salary from the text contained in the job descriptions, with the usage of Naïve Bayes classifier.
Steps:
- Build a classification model with text (full job description) as the predictor.
- Calculate the model accuracy.
- Show the top 10 words (excluding stopwords) that are most indicative of (i) high salary, and (ii) low salary.