This repository includes JSON schemas for data models representing a dataset of protein thermodynamic data.
Copyright (C) 2024 Emidio Capriotti and Maria Paola Turina
This program and all programs in this package are free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
- Title: Protein Variants Dataset Schema
- Description: Defines the structure for a dataset containing information about protein variants.
- Properties:
name
: Name of the dataset.description
: Description of the dataset.version
: Version of the dataset.creation_date
: Date when the dataset was created.modification_date
: Latest date when the dataset was modified.additional_info
: Additional information about the dataset.author
: Author of the dataset.records
: Array of protein variants.record
: Protein variant object._id
: Identifier.cluster_id
: Cluster identifier.cluster_info
: Information about the clustering procedure.protein_variant
: Protein variant object.
reference
: Information about the publication reference.
- Required Fields:
name
,records
,reference
- Title: Protein Variant Schema
- Description: Defines the structure for a protein variant.
- Properties:
pdb_variant
: Protein Data Bank variant information.variations
: List of Protein Data Bank amino acid variants.
predicted_structure_variant
: Predicted structure variant information.variations
: List of Predicted structure amino acid variants
uniprot_variant
: UniProt information.variations
: List of UniProt aminoacid variants.
alternative_sequence_variant
: Alternative sequence database variant information.variations
: List of Alternative sequence database amino acid variants.
thermodynamic_data
: Thermodynamic data related to the protein variant.
- Required Fields: Depends on the variant type.
- Title: Amino Acid Variant Schema
- Description: Defines the structure for an amino acid variant.
- Properties:
uniprot_variant
: UniProt variant information.alternative_sequence_database_variant
: Alternative sequence database variant information.pdb_variant
: Protein Data Bank variant information.predicted_structure_variant
: Predicted structure variant information.
- Required Fields: Depends on the variant type.
- Title: Thermodynamic Data Schema
- Description: Defines the structure for thermodynamic data.
- Properties:
- Various properties including
Tm
,unfolding_percentage
,dG_H2O
, etc.
- Various properties including
- Required Fields:
experiment
and at least one ofTm
,dG_H2O
,ddG_H2O
,dG
,ddG
- Title: Experiment Schema
- Description: Defines the structure for experimental data.
- Properties:
method
: Method used for the experiment.conditions
: Experimental conditions such as temperature (T) and pH.T
: Temperature at which the experiment was performed.pH
: pH at which the experiment was performed.
reference
: Information about the publication reference.metadata
: Additional metadata about the experiment.
- Required Fields:
method
,conditions
,reference
These schemas provide a standardized format for organizing and validating data related to protein variants, experimental data, and thermodynamic properties. Each schema is tailored to capture specific aspects of protein data analysis, ensuring consistency and compatibility across datasets.