Poster for the 2020 TAGC conference with audio guide

Abstract:

Polygenic risk scores (PRS) summarise the genetic information spread across several genetic variants into one single number. This number can be used to predict an individual’s phenotype or - more realistically - to place an individual in risk groups according to their PRS. Many different steps are involved in calculating a PRS that is meaningful. For instance, the quality of the summary statistics is paramount. The choice of which variants to include in the PRS is also far from trivial and can involve simple ad hoc approaches (such as clumping and threshholding) or model-based approaches such as LDpred. Finally, there are the issues of validation and overfitting in the validation set, as well as the even more complicated issue of transferability of polygenic risk scores across ancestries, which is the focus of this talk. A major barrier to the use of PRS in genomic medicine applications is that the majority of GWAS come from cohorts of European ancestry. The predictive power of PRS constructed from these studies is substantially lower in non-European ancestry cohorts, although the reasons for this are unclear. To address this question, we investigate the performance of PRS for height in cohorts with admixed African and European ancestry, allowing us to evaluate ancestry-related differences in PRS predictive accuracy while controlling for environment and cohort differences. We evaluate the roles of recombination and LD, allelic frequencies, genetic and phenotypic variance, and marginal effect sizes, as well as the potential for linear combinations of ancestry-specific PRSs to improve predictions in admixed populations.

Here are the slides.