ML485_blind_preprocessing_prediction_comp

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

Data Preprocessing

To gain an understanding of the data structure and the separation of classes, the initial step involved using UMAP (Uniform Manifold Approximation and Projection). Additionally, the data was imputed, and generic data engineering techniques were applied. Finally, feature selection was performed using mutual information.

Prediction

Several models were evaluated, and XGBoost emerged as the top-performing model for prediction.

Gain Further Insights into the Data

To gain insights into the weaknesses of the model, various techniques were employed to understand the reasons behind incorrect predictions (details can be found in the "main.ipynb" file).

Please refer to the "main.ipynb" file for a more comprehensive analysis and implementation details.

" Won first place btw 🥱 "

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
.gitattributes		.gitattributes
README.md		README.md
data_preprocessing.ipynb		data_preprocessing.ipynb
model.ipynb		model.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML485_blind_preprocessing_prediction_comp

Data Preprocessing

Prediction

Gain Further Insights into the Data

About

Releases

Packages

Languages

Wa-lead/ML485_blind_preprocessing_prediction_comp

Folders and files

Latest commit

History

Repository files navigation

ML485_blind_preprocessing_prediction_comp

Data Preprocessing

Prediction

Gain Further Insights into the Data

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages