This project aims at predicting the attractiveness of a household device.
I provide the environment used to run this code.
The files contains the following main documents:
- Data Preparation (train)
- Data Visualizations and Data Wrangling
- Data Preparation (test)
- Data Preparation Train data (2)
- Algorithm selection
- Feature Selection, Algorithm fine tunning, Generalization evidence
- Running a prediction (inference
File 2 clearly shows through certain visualizations the relationships between variables and the label, so we can see how 3 clearly distinct product types behave according to the variable.
File number 5 presents the results of running different algorithms.
File number 6 takes the best algorithm from file 5 and improves it, how? By selecting the best features selection for this model (hybrid method) and then tuning its parameters.
The author has done his best to produce an unbiased algorithm, however the algorithm is based on data, and data may be biased based on its collection process.
This project has been undertaken complying with a code of ethics
This project is under Copyright © 2019 Josep Maria Niubo. It is free software, and may be redistributed under the terms specified in the LICENSE file.