To create a comprehensive and easy-to-use database of structures and affinities of TCR-pMHC recognition and mutants at this interface.
- Go to IMGT site
- Click on IMGT/3Dstructure-DB and IMGT/2Dstructure-DB under IMGT Database
- Search Species – human, and TRAV under IMGT group
- You will see PDBids, release date, etc.
- Compare last entered PDBid in dataset to PDBids on site
- Find where database ends and new entries can be explored
- Click on PDBid
- Go to literature and search through paper to see if binding experiments were completed. If so find rest of information. If not, move on. Use PDB site as a second option/check validity of data.
Name of the wild type TCR. The names of mutants are not needed. However, in some cases a series of experiments was performed using a particular mutant as a template (with its structure solved). If this is the case, list the name of that mutant, as it will serve as the wild type template that the other mutants in that series of binding experiments will be compared to
MHC Allele. Most of the time the MHCname / MHCname_PDB will be identical. Some experiments were done where the MHC allele was changed from mutant to mutant, but the complex's structure was only solved for one of the MHC alleles. When this happens, list the allele that the experiment was performed with in the MHCname column. In the MHCname_PDB column, record the MHC allele found in the template structure and list all mutations from the structure allele to the allele the experiment was performed with in the MHC_mut column.
Record all mutations between the MHC the experiment was performed with and the MHC found in the template structure.
For MHC Class I molecules this is the alpha chain, 'A'. For MHC Class II, record whether the mutations occur in the alpha or beta chain, 'A' or 'B'. The chains should also be listed in the same order their corresponding MHC mutations are listed
Record all mutations between the TCR the experiment was performed with and the TCR in the template structure. For residue positions ALWAYS use PDB numbering, which can be found on both PDB and on IMGT.
Record the equilibrium binding constant
Record
Record
Ratio of the
If given in the article, use those values (make sure it's in kcal). If not, calculate using
Temperature in Kelvin the experiments were performed at. If the temperature is not 25°C / 298K, make sure you use the correct temperature in calculating
Sequence of the peptide
Record any mutations needed to get from the peptide in the template structure to the peptide used in the experiment. Number the residue positions as they are in the peptide fragment, not as they are in the uncleaved protein.
*At this point if a residue is added or deleted, do not add this data to ATLAS
Record the PDB ID if applicable. A structure has a true PDB ID if there exists a solved structure of the MHC, TCR, and peptide complex with no mutations. Make sure that the structure contains all parts. This means it will have 5 (MHC Class I) or 6 (MHC class II) sequence chains: HLA (Class II will have an alpha and beta chain), beta-2-microglobulin, peptide, TCR alpha chain, and TCR beta chain.
The method used to determine the crystal structure of the complex. This can be found in the box on the right side of the PDB main page for each entry. This is only recorded if a true PDB ID is recorded.
These can all be found in the box on the right side of the PDB main page for each entry. These are only recorded if a true PDB ID is recorded.
This is the PDB structure complex that can be mutated to obtain each particular mutant. It is possible for an entry that is based off a template structure to also have its own true PDB ID, however, if an entry doesn't have a true PDB ID, it MUST have a template PDB ID in order to be modeled.
Record the structure of the peptide-MHC complex if available. Don't worry about the structure method, resolution, or R-values for these.
Record whether the mutations to the TCR from the template structure occur in the alpha or beta chains. If there are multiple mutations in a chain, that chain should be listed as many times as there are mutations. The chains should also be listed in the same order their corresponding TCR mutations are listed
Record the CDR (1, 2 or 3, no a or b needed) that is mutated to form the mutant TCR from the template structure. If there are multiple mutations in a single CDR, that CDR should be listed as many times as there are mutations in that loop, in the same order their corresponding TCR mutations are listed.
List the sequences of the CDRs from the template structure that must be mutated to obtain the mutant. The sequences should be listed as many times as there are mutations in the corresponding loop, in the same order the corresponding TCR mutations are listed.
Record the chain ID (D for alpha chain, E for beta chain) the TCR mutations occur in. If there are multiple mutations in a chain, that chain ID should be listed as many times as there are mutations. The chain IDs should be listed in the same order their corresponding TCR mutations are listed
Record the seven or eight digit number the reference paper can be found under in Pubmed.
Record the method of determining experimental
Record the sensor chip used in Biacore experiments if applicable.
This is the ligand (TCR or pMHC complex) that is bound to the sensor chip and immobilized in SPR experiments.
Record whether the immobilized ligand was bound to the chip directly or indirectly. Direct coupling methods include amine coupling and thiol coupling. Indirect methods include using biotin/streptavidin, antibody capture, or any other method that puts another molecule between the sensor chip and the immobilized ligand.
This is the ligand (TCR or pMHC complex) that is flowed over the sensor chip in SPR experiments.
In ATLAS, "wild type" does not always refer to the traditional wild type, but instead refers to the template structure that serves as a basis of comparison for a particular entry or set of entries. ATLAS is designed to test and train scoring functions. So when parsing the data, the computer does the following:
- Check to see if a true_PDB structure exists
- If TRUE, it will use that structure for scoring
- If FALSE, it will use the template_PDB structure and make all mutations specified in the TCR_mut, MHC_mut, and PEP_mut columns.
This means that the only mutations that should ever be listed are mutations that MUST be made to the template_PDB structure. This also means that a traditional wild type structure may require mutations to match the template. For example, when looking at the DMF5-HLA-A*02:01-ELAGIGILTV structure 3QDG, WT should be recorded in the pep_mut column despite the ELA peptide being a mutant of the EAA peptide. A mutant containing EAAGIGILTV should record L2A in the pep_mut column, even though EAA is the traditional wild type peptide.