forked from EpistasisLab/PAGER

PAGER is an efficient genotype encoding strategy designed to improve the detection of non-additive genetic variation in complex trait association studies and epistasis investigation. PAGER dynamically encodes genetic variants or multi-locus genotypes (MLG) by normalizing mean phenotypic differences between genotype/MLG classes.

pfreda/PAGER

 
 


PAGER

PAGER: Phenotype Adjusted Genotype Encoding and Ranking

PAGER is a genotype encoding method designed to improve the detection of deviations from additivity in genotype-phenotype associations and to support the investigation of epistasis. It provides a computationally efficient way to capture a variety of inheritance models, including additive, dominant, recessive, and heterotic, without requiring multiple encodings or added computational cost. By normalizing the relative phenotypic differences between genotypic classes or multi-locus genotypes (MLGs), PAGER can reveal new insights into genetic associations and potential sources of missing heritability.

Key Features

  • Flexible Genotype Encoding: Automatically adjusts to capture a range of inheritance models (additive, dominant, recessive, heterotic) on a SNP-by-SNP basis.
  • Phenotype-Driven: Uses phenotypic information to generate genotype encodings, reflecting the inheritance pattern observed in the data for each variant.
  • Computational Efficiency: PAGER uses only simple mathematical operations, so it can be implemented in any programming language, parallelized, and accelerated on GPUs.
  • Broad Applicability: Supports both binary (case/control) and continuous phenotypes, as well as multi-locus epistasis investigations.
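As a rough illustration of the efficiency and binary-phenotype points above, the following sketch encodes every SNP in a genotype matrix at once with NumPy. It is not the code shipped in the Scripts directory. The function name `pager_encode_matrix`, the 0/1/2 genotype coding (0 = AA, 1 = Aa, 2 = aa), and per-SNP min-max scaling are assumptions made for illustration, and the sketch assumes every genotype class is present at every SNP.

```python
import numpy as np

def pager_encode_matrix(G, y):
    """Hypothetical helper: PAGER-style encoding of all SNPs at once.

    G: (n_samples, n_snps) integer genotypes coded 0 (AA), 1 (Aa), 2 (aa).
    y: (n_samples,) phenotype; for case/control data use 0/1 so that
       class means are proportions of cases.
    """
    G = np.asarray(G)
    y = np.asarray(y, dtype=float)
    # class_means[k, j] = mean phenotype of samples carrying genotype k at SNP j
    masked = np.stack([np.where(G == k, y[:, None], np.nan) for k in range(3)])
    class_means = np.nanmean(masked, axis=1)          # shape (3, n_snps)
    raw = class_means - class_means[0]                # AA is the anchor (0)
    lo, hi = raw.min(axis=0), raw.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)            # avoid division by zero
    norm = (raw - lo) / span                          # min-max scale per SNP
    # Replace each genotype with the encoding of its class at that SNP
    return np.take_along_axis(norm, G, axis=0)

# Toy case/control data: 6 samples, 2 SNPs
G = [[0, 0], [0, 1], [1, 1], [1, 2], [2, 2], [2, 0]]
y = [0, 0, 1, 1, 1, 1]
encoded = pager_encode_matrix(G, y)
print(encoded)
# SNP 1 behaves as dominant (classes encode to 0, 1, 1);
# SNP 2 behaves as recessive (classes encode to 0, 0, 1).
```

Because each SNP is handled independently, the same computation can be split across workers or moved to a GPU array library; that choice is left open here.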

Methodology

PAGER encodes genetic variants based on normalized mean phenotypic differences between the three genotype classes (AA, Aa, aa) for each SNP, which reflects the inheritance pattern at each locus.

The encoding is computed as follows:

Let:

  • x̄_AA, x̄_Aa, and x̄_aa represent the mean phenotype values (proportion of cases [p̂] in case/control studies) for the genotype classes AA, Aa, and aa, respectively, at each SNP.

  • The AA class is used as an anchor point for encoding (set to 0).

For each SNP, PAGER calculates the following encoding values (diallelic, non-epistatic example):

  • PAGER Encoding_AA = x̄_AA − x̄_AA = 0 (anchor)

  • PAGER Encoding_Aa = x̄_Aa − x̄_AA

  • PAGER Encoding_aa = x̄_aa − x̄_AA
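The raw (pre-normalization) encodings above can be sketched for a single SNP as follows. This is a minimal illustration rather than the repository's implementation: the function name `raw_pager_encoding` and the 0/1/2 genotype coding (0 = AA, 1 = Aa, 2 = aa) are assumptions, and the anchor class must be present in the data.

```python
import numpy as np

def raw_pager_encoding(genotypes, phenotype, anchor=0):
    """Hypothetical helper: class mean phenotype minus the anchor class mean."""
    genotypes = np.asarray(genotypes)
    phenotype = np.asarray(phenotype, dtype=float)
    anchor_mean = phenotype[genotypes == anchor].mean()
    return {
        int(g): float(phenotype[genotypes == g].mean() - anchor_mean)
        for g in np.unique(genotypes)
    }

# Toy data: class means are 2.0 (AA), 5.0 (Aa), and 6.0 (aa)
genotypes = [0, 0, 1, 1, 2, 2]
phenotype = [1.0, 3.0, 4.0, 6.0, 5.0, 7.0]
raw = raw_pager_encoding(genotypes, phenotype)
print(raw)  # {0: 0.0, 1: 3.0, 2: 4.0}
```

For a case/control phenotype coded 0/1, the same class means are the proportions of cases.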

Normalization

These raw encodings are then normalized using min-max scaling to fall between 0 and 1 for easier interpretability. The normalization ensures that the differences between genotype classes are comparable across all SNPs.
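A minimal sketch of the min-max step, assuming scaling is applied per SNP across its genotype classes. The helper name `min_max_normalize` and the handling of the all-equal case are illustrative choices, not taken from the repository. The examples show how different inheritance patterns map to different normalized encodings.

```python
def min_max_normalize(raw):
    """Hypothetical helper: scale raw encodings of one SNP to [0, 1]."""
    lo, hi = min(raw.values()), max(raw.values())
    if hi == lo:
        # Assumption: with no phenotypic difference between classes,
        # every class is encoded as 0.
        return {g: 0.0 for g in raw}
    return {g: (v - lo) / (hi - lo) for g, v in raw.items()}

# Keys are genotype classes: 0 = AA, 1 = Aa, 2 = aa
additive  = min_max_normalize({0: 0.0, 1: 2.0, 2: 4.0})
dominant  = min_max_normalize({0: 0.0, 1: 4.0, 2: 4.0})
recessive = min_max_normalize({0: 0.0, 1: 0.0, 2: 4.0})
heterotic = min_max_normalize({0: 0.0, 1: 2.0, 2: 0.0})

print(additive)   # {0: 0.0, 1: 0.5, 2: 1.0}
print(dominant)   # {0: 0.0, 1: 1.0, 2: 1.0}
print(recessive)  # {0: 0.0, 1: 0.0, 2: 1.0}
print(heterotic)  # {0: 0.0, 1: 1.0, 2: 0.0}
```

Note that when a class has a lower mean phenotype than the anchor, the raw anchor value of 0 is no longer the minimum, so the anchor class does not map to 0 after scaling.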

Key Advantages

  1. Phenotype-Specific Adjustments: By directly calculating the phenotypic differences between genotype classes, PAGER dynamically adjusts to the specific inheritance pattern observed in the data, whether additive, dominant, recessive, or heterotic.
  2. Scalability: The simple mathematical operations make PAGER computationally efficient, particularly for large-scale studies. GPU support and parallelization further accelerate computation.
  3. Extensibility: The method can be applied beyond single-locus analysis to multi-locus genotype interactions (epistasis). PAGER can also handle multi-allelic systems and variants. Extension is achieved by using one genotype/MLG as the anchor and computing the relative differences for others.
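The multi-locus extension described in point 3 can be sketched for a pair of SNPs as follows. This is an illustration under stated assumptions, not the repository's epistasis script: the function name `pager_encode_mlg`, the choice of the (0, 0) MLG as anchor, and min-max scaling across the observed MLGs are all assumptions, and the anchor MLG must occur in the data.

```python
from collections import defaultdict

def pager_encode_mlg(snp1, snp2, phenotype, anchor=(0, 0)):
    """Hypothetical helper: PAGER-style encoding of two-locus genotypes."""
    # Group phenotype values by multi-locus genotype (MLG)
    groups = defaultdict(list)
    for g1, g2, y in zip(snp1, snp2, phenotype):
        groups[(g1, g2)].append(y)
    means = {mlg: sum(v) / len(v) for mlg, v in groups.items()}
    # Difference of each MLG mean from the anchor MLG mean
    raw = {mlg: m - means[anchor] for mlg, m in means.items()}
    # Min-max scale across the observed MLGs
    lo, hi = min(raw.values()), max(raw.values())
    span = hi - lo
    return {mlg: (r - lo) / span if span else 0.0 for mlg, r in raw.items()}

# Toy interaction: the phenotype is elevated only when both loci carry genotype 1
snp1      = [0, 0, 1, 1, 0, 0, 1, 1]
snp2      = [0, 0, 0, 0, 1, 1, 1, 1]
phenotype = [1, 1, 1, 1, 1, 1, 5, 5]
mlg_encoding = pager_encode_mlg(snp1, snp2, phenotype)
print(mlg_encoding)
# {(0, 0): 0.0, (1, 0): 0.0, (0, 1): 0.0, (1, 1): 1.0}
```

In this toy example neither locus has an effect on its own, yet the (1, 1) MLG receives a distinct encoding, which is the kind of non-additive signal the encoding is meant to expose.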

Usage and Examples

The Scripts directory contains univariate and epistatic usage examples of PAGER in both Python and R. The data needed to run these examples are in the Data directory.

Relevant Publications

If you use PAGER in scientific publications, please cite the following:

Freda, Philip J., Attri Ghosh, Priyanka Bhandary, Nicholas Matsumoto, Apurva S. Chitre, Jiayan Zhou, Molly A. Hall, Abraham A. Palmer, Tayo Obafemi-Ajayi, and Jason H. Moore. 2024. PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies. BioData Mining, 17(1), 41.

Link to publication.

@article{freda2024pager,
  title={PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies},
  author={Freda, Philip J and Ghosh, Attri and Bhandary, Priyanka and Matsumoto, Nicholas and Chitre, Apurva S and Zhou, Jiayan and Hall, Molly A and Palmer, Abraham A and Obafemi-Ajayi, Tayo and Moore, Jason H},
  journal={BioData Mining},
  volume={17},
  number={1},
  pages={41},
  year={2024},
  publisher={Springer}
}
