Skip to content

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

Notifications You must be signed in to change notification settings

Wa-lead/ML485_blind_preprocessing_prediction_comp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ML485_blind_preprocessing_prediction_comp

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

Data Preprocessing

To gain an understanding of the data structure and the separation of classes, the initial step involved using UMAP (Uniform Manifold Approximation and Projection). Additionally, the data was imputed, and generic data engineering techniques were applied. Finally, feature selection was performed using mutual information.

Prediction

Several models were evaluated, and XGBoost emerged as the top-performing model for prediction.

Gain Further Insights into the Data

To gain insights into the weaknesses of the model, various techniques were employed to understand the reasons behind incorrect predictions (details can be found in the "main.ipynb" file).

Please refer to the "main.ipynb" file for a more comprehensive analysis and implementation details.

" Won first place btw 🥱 "

About

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published