"Feed the Machine - Language Analysis" (Project 3)

Jenni Davis, David Jimenez, Elizabeth Conway, Austin Olea, Susan Farago, & Catherine Poirier (June 2021)

For more information and detailed results, please refer to the Project Final Report!

Problem to Solve

Can we use machine learning to accurately and effectively evaluate a trainer’s responses to essay-style questions in order to predict a trainer’s training and facilitation skills?

Results / Findings

Based on the dataset we used, we were unable to reliably predict a trainer’s training and facilitation skills based on evaluating the trainer’s response to essay-style questions. The data was run against three different machine learning models: Scikit-Learn, Linear Regression, and Random Forest Regressor. Next steps: Reevaluate questions in order to gain better responses. Rerun models and analyze results. It's not the model, it's the data.

Database Information

File #1
Extracted from SalesForce & cleaned utilizing Tableau.
Essay responses from 2,388 trainers to twenty open-ending questions from June 2020 - May 2021, resulting in 42,998 rows of data.
File #2
Extracted from SalesForce & cleaned utilizing Tableau.
Student scoring on the respective trainer’s training and facilitation skills.
912 trainers taught courses of those 486 received scoring.
Final cleaned data included 312 trainers and 6,240 records.

Tools Used

Tableau
Python Pandas
Python NLTK
Vader Sentiment Analysis
Matplotlib
Machine Learning Models: -- Scikit-learn -- Naive Bayes Classifier -- GaussianNB -- Linear Regression -- Random Forest Regressor
HTML / CSS / Bootstrap
GitHub Pages

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
Machine Learning		Machine Learning
Sentiment_Preprocessing		Sentiment_Preprocessing
Sentiment_Vader		Sentiment_Vader
images		images
.DS_Store		.DS_Store
11 - Average Training Score vs. Compount Sentiment Polarity Score.png		11 - Average Training Score vs. Compount Sentiment Polarity Score.png
12 - Average Training Score vs. Compount Sentiment Polarity Score.png		12 - Average Training Score vs. Compount Sentiment Polarity Score.png
13 - Average Training Score vs. Compount Sentiment Polarity Score (1).png		13 - Average Training Score vs. Compount Sentiment Polarity Score (1).png
13 - Average Training Score vs. Compount Sentiment Polarity Score.png		13 - Average Training Score vs. Compount Sentiment Polarity Score.png
14 - Average Training Score vs. Compount Sentiment Polarity Score.png		14 - Average Training Score vs. Compount Sentiment Polarity Score.png
15 - Average Training Score vs. Compount Sentiment Polarity Score.png		15 - Average Training Score vs. Compount Sentiment Polarity Score.png
Average Knowledge Score.png		Average Knowledge Score.png
Average Training and Facilitation Score.png		Average Training and Facilitation Score.png
Filtered Word Count.png		Filtered Word Count.png
Project3_Feed_The_Machine_June2021.pdf		Project3_Feed_The_Machine_June2021.pdf
README.md		README.md
Tokenized Word Count.png		Tokenized Word Count.png
index.html		index.html
material-dashboard.css		material-dashboard.css
production ID_5192067.mp4		production ID_5192067.mp4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

"Feed the Machine - Language Analysis" (Project 3)

Problem to Solve

Results / Findings

Database Information

Tools Used

About

Releases

Packages

Contributors 4

Languages

svfarago/Project_3

Folders and files

Latest commit

History

Repository files navigation

"Feed the Machine - Language Analysis" (Project 3)

Problem to Solve

Results / Findings

Database Information

Tools Used

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages