Gradient boosted trees based predictor for MHC Class I epitope binding prediction. MHCBoost is also part of BERT, a very powerful deimmunization workflow.
Python 3.7 is a requirement! Support with 3.6 or lower is experimental! If you do get any errors about utf-8 you're likely not running MHCBoost with python 3.7.
Please also make sure that you're also installing MHCBoost using python 3.7 in a new virtual environment! Anaconda may help you with this. If you do get any errors about missing modules you're likely not installing this tool correctly. We tested the release thoroughly - it works perfectly fine!
If you're still having trouble with the setup, please write an e-mail to igem@ifib.uni-tuebingen.de . We're happy to help!
git clone https://github.com/igemsoftware2018/Team_Tuebingen_MHCBoost
python setup.py install
mhc-1
> mhc-1
-p, --dataset_to_predict_path <arg> file to perform prediction on OR peptide sequence
-o, --predicted_dataset_path <arg> filepath to save the predicted output file to
-a, --allele <arg> allele to perform prediction on
-l, --peptide_length <arg> epitope peptide length - usually 9
optional -t, --training_dataset_path <arg> file for classifier training
optional -s, --silent suppresses learning output
Simply provide the answers to the questions asked by our tool.
> mhc-1
Alternatively, provide input parameters when starting the tool.
> mhc-1 -p examples/example_input.txt -o /home/mypc/Desktop/output.txt -a A*02:01 -l 9
MHCBoost supports 65 alleles.
HLA-A01:01
HLA-A02:01
HLA-A02:02
HLA-A02:03
HLA-A02:06
HLA-A02:11
HLA-A02:12
HLA-A02:16
HLA-A02:19
HLA-A02:50
HLA-A03:01
HLA-A11:01
HLA-A23:01
HLA-A24:02
HLA-A24:03
HLA-A25:01
HLA-A26:01
HLA-A26:02
HLA-A26:03
HLA-A29:02
HLA-A30:01
HLA-A30:02
HLA-A31:01
HLA-A32:01
HLA-A32:07
HLA-A32:15
HLA-A33:01
HLA-A66:01
HLA-A68:01
HLA-A68:02
HLA-A68:23
HLA-A69:01
HLA-A80:01
HLA-B07:02
HLA-B08:01
HLA-B08:02
HLA-B08:03
HLA-B14:02
HLA-B15:01
HLA-B15:02
HLA-B15:03
HLA-B15:09
HLA-B15:17
HLA-B18:01
HLA-B27:05
HLA-B35:01
HLA-B38:01
HLA-B39:01
HLA-B40:01
HLA-B40:02
HLA-B44:02
HLA-B44:03
HLA-B45:01
HLA-B46:01
HLA-B48:01
HLA-B51:01
HLA-B53:01
HLA-B57:01
HLA-B58:01
HLA-B73:01
HLA-B83:01
HLA-C05:01
HLA-C06:02
HLA-C14:02
HLA-C15:02
MHCBoost has an 5-fold crossvalidated average AUC of 0.899 on the IEDB dataset. The performance on each allele was compared to the state of the art NetMHCPan. Please refer to Results
MIT
Team iGEM 2018 Tübingen
Lukas Heumos
Steffen Lemke
Alexander Röhl