Literature DB >> 26699465

Limitations of GCTA as a solution to the missing heritability problem.

Siddharth Krishna Kumar1, Marcus W Feldman2, David H Rehkopf3, Shripad Tuljapurkar2.   

Abstract

Genome-wide association studies (GWASs) seek to understand the relationship between complex phenotype(s) (e.g., height) and up to millions of single-nucleotide polymorphisms (SNPs). Early analyses of GWASs are commonly believed to have "missed" much of the additive genetic variance estimated from correlations between relatives. A more recent method, genome-wide complex trait analysis (GCTA), obtains much higher estimates of heritability using a model of random SNP effects correlated between genotypically similar individuals. GCTA has now been applied to many phenotypes from schizophrenia to scholastic achievement. However, recent studies question GCTA's estimates of heritability. Here, we show that GCTA applied to current SNP data cannot produce reliable or stable estimates of heritability. We show first that GCTA depends sensitively on all singular values of a high-dimensional genetic relatedness matrix (GRM). When the assumptions in GCTA are satisfied exactly, we show that the heritability estimates produced by GCTA will be biased and the standard errors will likely be inaccurate. When the population is stratified, we find that GRMs typically have highly skewed singular values, and we prove that the many small singular values cannot be estimated reliably. Hence, GWAS data are necessarily overfit by GCTA which, as a result, produces high estimates of heritability. We also show that GCTA's heritability estimates are sensitive to the chosen sample and to measurement errors in the phenotype. We illustrate our results using the Framingham dataset. Our analysis suggests that results obtained using GCTA, and the results' qualitative interpretations, should be interpreted with great caution.

Entities:  

Keywords:  GCTA; GWAS; SNP; heritability; singular value decomposition

Mesh:

Year:  2015        PMID: 26699465      PMCID: PMC4711841          DOI: 10.1073/pnas.1520109113

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  29 in total

1.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

2.  Efficient methods to compute genomic predictions.

Authors:  P M VanRaden
Journal:  J Dairy Sci       Date:  2008-11       Impact factor: 4.034

3.  Personal genomes: The case of the missing heritability.

Authors:  Brendan Maher
Journal:  Nature       Date:  2008-11-06       Impact factor: 49.962

4.  The Third Generation Cohort of the National Heart, Lung, and Blood Institute's Framingham Heart Study: design, recruitment, and initial examination.

Authors:  Greta Lee Splansky; Diane Corey; Qiong Yang; Larry D Atwood; L Adrienne Cupples; Emelia J Benjamin; Ralph B D'Agostino; Caroline S Fox; Martin G Larson; Joanne M Murabito; Christopher J O'Donnell; Ramachandran S Vasan; Philip A Wolf; Daniel Levy
Journal:  Am J Epidemiol       Date:  2007-03-19       Impact factor: 4.897

5.  Common SNPs explain a large proportion of the heritability for human height.

Authors:  Jian Yang; Beben Benyamin; Brian P McEvoy; Scott Gordon; Anjali K Henders; Dale R Nyholt; Pamela A Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

6.  REFINING GENETICALLY INFERRED RELATIONSHIPS USING TREELET COVARIANCE SMOOTHING.

Authors:  Andrew Crossett; Ann B Lee; Lambertus Klei; Bernie Devlin; Kathryn Roeder
Journal:  Ann Appl Stat       Date:  2013-06-27       Impact factor: 2.083

7.  Separation of the largest eigenvalues in eigenanalysis of genotype data from discrete subpopulations.

Authors:  Katarzyna Bryc; Wlodek Bryc; Jack W Silverstein
Journal:  Theor Popul Biol       Date:  2013-08-20       Impact factor: 1.570

8.  Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs.

Authors:  S Hong Lee; Teresa R DeCandia; Stephan Ripke; Jian Yang; Patrick F Sullivan; Michael E Goddard; Matthew C Keller; Peter M Visscher; Naomi R Wray
Journal:  Nat Genet       Date:  2012-02-19       Impact factor: 38.330

9.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

10.  Estimating quantitative genetic parameters in wild populations: a comparison of pedigree and genomic approaches.

Authors:  Camillo Bérénos; Philip A Ellis; Jill G Pilkington; Josephine M Pemberton
Journal:  Mol Ecol       Date:  2014-06-26       Impact factor: 6.185

View more
  34 in total

Review 1.  Complex-Trait Prediction in the Era of Big Data.

Authors:  Gustavo de Los Campos; Ana Ines Vazquez; Stephen Hsu; Louis Lello
Journal:  Trends Genet       Date:  2018-08-20       Impact factor: 11.639

2.  Genome-wide Association Studies of Posttraumatic Stress Disorder in 2 Cohorts of US Army Soldiers.

Authors:  Murray B Stein; Chia-Yen Chen; Robert J Ursano; Tianxi Cai; Joel Gelernter; Steven G Heeringa; Sonia Jain; Kevin P Jensen; Adam X Maihofer; Colter Mitchell; Caroline M Nievergelt; Matthew K Nock; Benjamin M Neale; Renato Polimanti; Stephan Ripke; Xiaoying Sun; Michael L Thomas; Qian Wang; Erin B Ware; Susan Borja; Ronald C Kessler; Jordan W Smoller
Journal:  JAMA Psychiatry       Date:  2016-07-01       Impact factor: 21.596

3.  Reply to Yang et al.: GCTA produces unreliable heritability estimates.

Authors:  Siddharth Krishna Kumar; Marcus W Feldman; David H Rehkopf; Shripad Tuljapurkar
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-25       Impact factor: 11.205

4.  GCTA-GREML accounts for linkage disequilibrium when estimating genetic variance from genome-wide SNPs.

Authors:  Jian Yang; S Hong Lee; Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-25       Impact factor: 11.205

5.  Opportunities and challenges of big data for the social sciences: The case of genomic data.

Authors:  Hexuan Liu; Guang Guo
Journal:  Soc Sci Res       Date:  2016-04-21

6.  Concepts, estimation and interpretation of SNP-based heritability.

Authors:  Jian Yang; Jian Zeng; Michael E Goddard; Naomi R Wray; Peter M Visscher
Journal:  Nat Genet       Date:  2017-08-30       Impact factor: 38.330

7.  On Negative Heritability and Negative Estimates of Heritability.

Authors:  David Steinsaltz; Andy Dahl; Kenneth W Wachter
Journal:  Genetics       Date:  2020-04-14       Impact factor: 4.562

8.  Correlations between relatives: From Mendelian theory to complete genome sequence.

Authors:  Elizabeth A Thompson
Journal:  Genet Epidemiol       Date:  2019-05-02       Impact factor: 2.135

9.  An Exome-Wide Association Study Identifies New Susceptibility Loci for Age of Smoking Initiation in African- and European-American Populations.

Authors:  Keran Jiang; Zhongli Yang; Wenyan Cui; Kunkai Su; Jennie Z Ma; Thomas J Payne; Ming D Li
Journal:  Nicotine Tob Res       Date:  2019-05-21       Impact factor: 4.244

10.  A meta-analytic framework for detection of genetic interactions.

Authors:  Yulun Liu; Yong Chen; Paul Scheet
Journal:  Genet Epidemiol       Date:  2016-08-15       Impact factor: 2.135

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.