| Literature DB >> 32376789 |
Chantal Babb de Villiers1, Mark Kroese2, Sowmiya Moorthie2.
Abstract
The use of genomic information to better understand and prevent common complex diseases has been an ongoing goal of genetic research. Over the past few years, research in this area has proliferated with several proposed methods of generating polygenic scores. This has been driven by the availability of larger data sets, primarily from genome-wide association studies and concomitant developments in statistical methodologies. Here we provide an overview of the methodological aspects of polygenic model construction. In addition, we consider the state of the field and implications for potential applications of polygenic scores for risk estimation within healthcare. © Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.Entities:
Keywords: clinical genetics; genetic epidemiology; genome-wide; getting research into practice; prevention
Mesh:
Year: 2020 PMID: 32376789 PMCID: PMC7591711 DOI: 10.1136/jmedgenet-2019-106763
Source DB: PubMed Journal: J Med Genet ISSN: 0022-2593 Impact factor: 6.318
Figure 1Polygenic score calculation. This calculation aggregates the SNPs and their weights selected for a polygenic score. Common diseases are thought to be influenced by many genetic variants with small individual effect sizes, such that meaningful risk prediction necessitates examining the aggregated impact of these multiple variants including their weightings. PGS, polygenic score.
Figure 2Construction of a polygenic score. In the process of developing a polygenic score, numerous models are tested and then compared. The model that performs best (as determined by one or more measures) is then selected for validation in the external data set. GWAS, genome-wide association studies.
Figure 3Example distribution of polygenic scores across a population. Thresholds can be set to stratify risk as low (some), average (most) and high (some).