Literature DB >> 32077750

Efficient Estimation and Applications of Cross-Validated Genetic Predictions to Polygenic Risk Scores and Linear Mixed Models.

Joel Mefford1, Danny Park2, Zhili Zheng3, Arthur Ko4, Mika Ala-Korpela5,6,7,8, Markku Laakso9, Päivi Pajukanta4, Jian Yang3, John Witte10, Noah Zaitlen1.   

Abstract

Large-scale cohorts with combined genetic and phenotypic data, coupled with methodological advances, have produced increasingly accurate genetic predictors of complex human phenotypes called polygenic risk scores (PRSs). In addition to the potential translational impacts of identifying at-risk individuals, PRS are being utilized for a growing list of scientific applications, including causal inference, identifying pleiotropy and genetic correlation, and powerful gene-based and mixed-model association tests. Existing PRS approaches rely on external large-scale genetic cohorts that have also measured the phenotype of interest. They further require matching on ancestry and genotyping platform or imputation quality. In this work, we present a novel reference-free method to produce a PRS that does not rely on an external cohort. We show that naive implementations of reference-free PRS either result in substantial overfitting or prohibitive increases in computational time. We show that our algorithm avoids both of these issues and can produce informative in-sample PRSs over a single cohort without overfitting. We then demonstrate several novel applications of reference-free PRSs, including detection of pleiotropy across 246 metabolic traits and efficient mixed-model association testing.

Entities:  

Keywords:  BLUP; PCA; PRS; linear mixed model; polygenic risk score

Mesh:

Year:  2020        PMID: 32077750      PMCID: PMC7185352          DOI: 10.1089/cmb.2019.0325

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  39 in total

1.  Correction for hidden confounders in the genetic analysis of gene expression.

Authors:  Jennifer Listgarten; Carl Kadie; Eric E Schadt; David Heckerman
Journal:  Proc Natl Acad Sci U S A       Date:  2010-09-01       Impact factor: 11.205

2.  Genomewide rapid association using mixed model and regression: a fast and simple method for genomewide pedigree-based quantitative trait loci association analysis.

Authors:  Yurii S Aulchenko; Dirk-Jan de Koning; Chris Haley
Journal:  Genetics       Date:  2007-07-29       Impact factor: 4.562

3.  Analysis of case-control association studies with known risk variants.

Authors:  Noah Zaitlen; Bogdan Pasaniuc; Nick Patterson; Samuela Pollack; Benjamin Voight; Leif Groop; David Altshuler; Brian E Henderson; Laurence N Kolonel; Loic Le Marchand; Kevin Waters; Christopher A Haiman; Barbara E Stranger; Emmanouil T Dermitzakis; Peter Kraft; Alkes L Price
Journal:  Bioinformatics       Date:  2012-05-03       Impact factor: 6.937

4.  Efficient control of population structure in model organism association mapping.

Authors:  Hyun Min Kang; Noah A Zaitlen; Claire M Wade; Andrew Kirby; David Heckerman; Mark J Daly; Eleazar Eskin
Journal:  Genetics       Date:  2008-03       Impact factor: 4.562

5.  Rapid variance components-based method for whole-genome association analysis.

Authors:  Gulnara R Svishcheva; Tatiana I Axenovich; Nadezhda M Belonogova; Cornelia M van Duijn; Yurii S Aulchenko
Journal:  Nat Genet       Date:  2012-09-16       Impact factor: 38.330

6.  Regional adiposity patterns in relation to lipids, lipoprotein cholesterol, and lipoprotein subfraction mass in men.

Authors:  R B Terry; P D Wood; W L Haskell; M L Stefanick; R M Krauss
Journal:  J Clin Endocrinol Metab       Date:  1989-01       Impact factor: 5.958

7.  Advantages and pitfalls in the application of mixed-model association methods.

Authors:  Jian Yang; Noah A Zaitlen; Michael E Goddard; Peter M Visscher; Alkes L Price
Journal:  Nat Genet       Date:  2014-02       Impact factor: 38.330

8.  Polygenic modeling with bayesian sparse linear mixed models.

Authors:  Xiang Zhou; Peter Carbonetto; Matthew Stephens
Journal:  PLoS Genet       Date:  2013-02-07       Impact factor: 5.917

9.  Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations.

Authors:  Amit V Khera; Mark Chaffin; Krishna G Aragam; Mary E Haas; Carolina Roselli; Seung Hoan Choi; Pradeep Natarajan; Eric S Lander; Steven A Lubitz; Patrick T Ellinor; Sekar Kathiresan
Journal:  Nat Genet       Date:  2018-08-13       Impact factor: 38.330

10.  Breast Cancer Risk From Modifiable and Nonmodifiable Risk Factors Among White Women in the United States.

Authors:  Paige Maas; Myrto Barrdahl; Amit D Joshi; Paul L Auer; Mia M Gaudet; Roger L Milne; Fredrick R Schumacher; William F Anderson; David Check; Subham Chattopadhyay; Laura Baglietto; Christine D Berg; Stephen J Chanock; David G Cox; Jonine D Figueroa; Mitchell H Gail; Barry I Graubard; Christopher A Haiman; Susan E Hankinson; Robert N Hoover; Claudine Isaacs; Laurence N Kolonel; Loic Le Marchand; I-Min Lee; Sara Lindström; Kim Overvad; Isabelle Romieu; Maria-Jose Sanchez; Melissa C Southey; Daniel O Stram; Rosario Tumino; Tyler J VanderWeele; Walter C Willett; Shumin Zhang; Julie E Buring; Federico Canzian; Susan M Gapstur; Brian E Henderson; David J Hunter; Graham G Giles; Ross L Prentice; Regina G Ziegler; Peter Kraft; Montse Garcia-Closas; Nilanjan Chatterjee
Journal:  JAMA Oncol       Date:  2016-10-01       Impact factor: 31.777

View more
  7 in total

1.  Protein-coding repeat polymorphisms strongly shape diverse human phenotypes.

Authors:  Ronen E Mukamel; Robert E Handsaker; Maxwell A Sherman; Alison R Barton; Yiming Zheng; Steven A McCarroll; Po-Ru Loh
Journal:  Science       Date:  2021-09-23       Impact factor: 47.728

Review 2.  Post-GWAS knowledge gap: the how, where, and when.

Authors:  Steven E Pierce; Alix Booms; Jordan Prahl; Edwin J C van der Schans; Trevor Tyson; Gerhard A Coetzee
Journal:  NPJ Parkinsons Dis       Date:  2020-09-09

3.  A model and test for coordinated polygenic epistasis in complex traits.

Authors:  Brooke Sheppard; Nadav Rappoport; Po-Ru Loh; Stephan J Sanders; Noah Zaitlen; Andy Dahl
Journal:  Proc Natl Acad Sci U S A       Date:  2021-04-13       Impact factor: 11.205

4.  Placental genomics mediates genetic associations with complex health traits and disease.

Authors:  Rebecca C Fry; Hudson P Santos; Arjun Bhattacharya; Anastasia N Freedman; Vennela Avula; Rebeca Harris; Weifang Liu; Calvin Pan; Aldons J Lusis; Robert M Joseph; Lisa Smeester; Hadley J Hartwell; Karl C K Kuban; Carmen J Marsit; Yun Li; T Michael O'Shea
Journal:  Nat Commun       Date:  2022-02-04       Impact factor: 17.694

5.  Strengthening Causal Inference in Exposomics Research: Application of Genetic Data and Methods.

Authors:  Christy L Avery; Annie Green Howard; Anna F Ballou; Victoria L Buchanan; Jason M Collins; Carolina G Downie; Stephanie M Engel; Mariaelisa Graff; Heather M Highland; Moa P Lee; Adam G Lilly; Kun Lu; Julia E Rager; Brooke S Staley; Kari E North; Penny Gordon-Larsen
Journal:  Environ Health Perspect       Date:  2022-05-09       Impact factor: 11.035

6.  Explainable deep transfer learning model for disease risk prediction using high-dimensional genomic data.

Authors:  Long Liu; Qingyu Meng; Cherry Weng; Qing Lu; Tong Wang; Yalu Wen
Journal:  PLoS Comput Biol       Date:  2022-07-15       Impact factor: 4.779

7.  GBAT: a gene-based association test for robust detection of trans-gene regulation.

Authors:  Xuanyao Liu; Joel A Mefford; Andrew Dahl; Yuan He; Meena Subramaniam; Alexis Battle; Alkes L Price; Noah Zaitlen
Journal:  Genome Biol       Date:  2020-08-24       Impact factor: 13.583

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.