Skip to content

Latest commit

 

History

History
73 lines (54 loc) · 2.07 KB

README.md

File metadata and controls

73 lines (54 loc) · 2.07 KB

lasagne4bio

This repository provides code examples to train neural networks for 3 biological sequence analysis problems:

  • subcellular localization
  • secondary structure
  • peptide binding to MHCII molecules

Please find detailed instructions in the respective directories.

Data sets

All data sets are either included in the repositroy or links are provided to download them.

Jupyter notebooks

In the directory subcellular_localization there are four tutorials on how to train four different types of neural networks for protein subcellular localization prediction:

  • Feedforward neural network
  • Convolutional neural network
  • Convolutional LSTM neural network
  • Convolutional LSTM neural network with attention mechanism

The dataset used for this tutorial is a reduced version of the original one, only with proteins shorter than 400 amino acids. This is done to save computational time, as here the main focus is to show how the network is built.

There is an additional tutorial on how to load the trained models and a comparison of their performances.

Dependencies

All code was written in python programming language version 2.7. Neural networks are implemented using the lasagne library, please find installation instructions here: https://lasagne.readthedocs.io/en/latest/user/installation.html.

The libraries used in this code are:

  • argparse
  • cPickle
  • csv
  • datetime
  • gc
  • glob
  • gzip
  • importlib
  • itertools
  • lasagne
  • math
  • matplotlib
  • numpy
  • operator
  • os
  • platform
  • scipy
  • sklearn
  • string
  • subprocess
  • sys
  • theano
  • time

Citation

Please cite the following when using our code as template: ...to be added...

Contributors

Vanessa Isabell Jurtz, DTU Bioinformatics
Alexander Rosenberg Johansen, DTU Compute
Morten Nielsen, DTU Bioinformatics
Jose Juan Almagro Armenteros, DTU Bioinformatics
Henrik Nielsen, DTU Bioinformatics
Casper Kaae Sønderby, University of Copenhagen
Ole Winther, DTU Compute
Søren Kaae Sønderby, University of Copenhagen