| Literature DB >> 30024900 |
Miguel Chagnon1,2, Jennifer O'Loughlin1,2, James C Engert3, Igor Karp2,4, Marie-Pierre Sylvestre1,2.
Abstract
Using a genetic risk score (GRS) to predict a phenotype in a target sample can be complicated by missing data on the single nucleotide polymorphisms (SNPs) that comprise the GRS. This is usually addressed by imputation, omission of the SNPs or by replacing the missing SNPs with proxy SNPs. To assess the impact of the omission and proxy approaches on effect size estimation and predictive ability of weighted and unweighted GRS with small numbers of SNPs, we simulated a dichotomous phenotype conditional on real genotype data. We considered scenarios in which the proportion of missing SNPs ranged from 20-70%. We assessed the impact of omitting or replacing missing SNPs on the association between the GRS and phenotype, the corresponding statistical power and the area under the receiver operating curve. Omission resulted in a larger bias towards the null value of the effect size, a smaller predictive ability and greater loss of statistical power than proxy approaches. The predictive ability of a weighted GRS that includes SNPs with large weights depends of the availability of these large-weight SNPs.Entities:
Mesh:
Year: 2018 PMID: 30024900 PMCID: PMC6053141 DOI: 10.1371/journal.pone.0200630
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic overview of the algorithm used for simulation of weighted GRS.
Fig 2Impact of omission of the unavailable SNPs or replacement by proxy SNPs on the correlation between the GRS and gold standard GRS.
Results for unweighted GRS and weighted GRS correspond to different scenarios. The interpretation must be made separately and not contrast.
Fig 3Impact of omission of the unavailable SNPs or replacement by proxy SNPs on effect size estimation and predictive ability of the unweighted GRS for the phenotype.
Fig 4Impact of omission of the unavailable SNPs or replacement by proxy SNPs on effect size estimation and predictive ability of the weighted GRS for the phenotype.
Fig 5Impact of the unavailability of SNPs corresponding to extreme weights on effect size estimation and predictive ability of the weighted GRS for the phenotype.
Fig 6Impact of omission of unavailable SNPs or replacement by proxy SNPs on statistical power to detect the effect size of the weighted GRS for the phenotype.