higgs_boson

A supervised classification model was used to predict a binary variable (described only as ‘class’) using 31 contextless features (‘f1’, ‘f2’, …, ‘f31’). The development was approached in four parts, often overlapping.

First, learners were compared using ‘area under the ROC curve’ (AUC) on a test partition of the data. Second, features (as-given, transformed, and engineered) were selected using forward/backward stepwise search and Top-N. Third, learner parameters were tuned using grid search. Fourth, logistic regression was used as a meta-algorithm to stack and blend models.

The top model uses XGBoost and achieved an AUC of 0.80939 on the Kaggle public leaderboard.

View the report here .

View the code here .

View the original challenge here .

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.Rhistory		.Rhistory
R Code.Rmd		R Code.Rmd
README.md		README.md
The Higgs boson machine learning challenge.pdf		The Higgs boson machine learning challenge.pdf
report.pdf		report.pdf
testdata_nolabels.csv		testdata_nolabels.csv
traindata.csv		traindata.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

higgs_boson

About

Releases

Packages

irecasens/higgs_boson

Folders and files

Latest commit

History

Repository files navigation

higgs_boson

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages