GitHub - Kortemme-Lab/covariation: A covariation benchmark capture containing the benchmark dataset and benchmarked protocol captures.

Sequence profile and covariation benchmark

Evolutionary pressures on protein structure and function have shaped the amino acid sequences of today's naturally occurring proteins. Consequently, the sequences of natural proteins are nearly optimal for their structures. Natural protein sequences therefore provide valuable information for evaluating the accuracy of computational protein design. The purpose of this benchmark is to evaluate the extent to which protein design can recapitulate properties of naturally occurring proteins, including amino acid sequence preferences (“sequence profiles”) and patterns of amino acid covariation.

This benchmark includes:

a set of 40 diverse protein domains with representative crystal structures and sequence alignments
command line arguments for running fixed backbone and flexible backbone design methods in Rosetta
analysis scripts that compare sequence profiles and patterns of amino acid covariation between natural and designed sequences

This protocol capture is based on a benchmark developed by Ollikainen & Kortemme and referenced below.

Licensing

The novel content in this repository is licensed according to LICENSE.

Downloading the benchmark

The benchmark is hosted on GitHub. The most recent version can be checked out using the git command-line tool:

git clone https://github.com/Kortemme-Lab/covariation.git

Directories in this archive

This archive contains the following directories:

input : contains the input files for the benchmark. Input files specific to a particular protocol are in a subdirectory named after the protocol. The input files are described in more detail in input/README.rst.
output : these directories are empty by default. This is the default output location for protocols if they are run on the local machine.
output/sample : contains sample output data that can be used to test the analysis script.
analysis : contains the analysis scripts used to analyze the output of a prediction run. All protocols are expected to produce output that will work with the analysis scripts.
protocols : contains the scripts needed to run a job. The scripts for a protocol are provided in a specific subdirectory.
hpc : contains scripts that can be used to run the entire benchmark using specific cluster architectures. For practical reasons, a limited number of cluster systems are supported. Please feel free to provide scripts which run the benchmark for your particular cluster system.

Protocols

This repository contains one protocol which can be used to run the benchmark. We welcome the inclusion of more protocols. Please contact support@kortemmelab.ucsf if you wish to contribute towards the repository.

Each protocol is accompanied by specific documentation in its protocol directory.

Protocol 1: Fixed backbone design

Software suite: Rosetta

Protocol directory: protocols/fixed_backbone

Protocol 2: Flexible backbone design using backrub ensembles

Software suite: Rosetta

Protocol directory: protocols/backrub

References

The latest release of this repository:

Computational protein design quantifies structural constraints on amino acid covariation. 2013. Ollikainen N, Kortemme T. PLoS Comput Biol 9(11):e1003313. doi: 10.1371/journal.pcbi.1003313. Epub 2013 Nov 14.

Analysis

The same set of analysis scripts is used by all protocols. Conceptually, the analysis scripts should be a black box that is separated from the output of each protocol by an interface.

The analysis scripts calculate sequence profile similarity and covariation similarity metrics which can be used to evaluate the results of the design simulations. The scripts are described in more detail in analysis/README.rst.

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
analysis		analysis
hpc		hpc
input		input
libraries		libraries
output		output
protocols/rosetta		protocols/rosetta
.gitignore		.gitignore
LICENSE		LICENSE
LICENSE-3RD-PARTY		LICENSE-3RD-PARTY
README.rst		README.rst

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sequence profile and covariation benchmark

Licensing

Downloading the benchmark

Directories in this archive

Protocols

Protocol 1: Fixed backbone design

Protocol 2: Flexible backbone design using backrub ensembles

References

Analysis

About

Releases 1

Packages

Contributors 2

Languages

License

Kortemme-Lab/covariation

Folders and files

Latest commit

History

Repository files navigation

Sequence profile and covariation benchmark

Licensing

Downloading the benchmark

Directories in this archive

Protocols

Protocol 1: Fixed backbone design

Protocol 2: Flexible backbone design using backrub ensembles

References

Analysis

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages