Literature DB >> 28238293

The Weighting is the Hardest Part: On the Behavior of the Likelihood Ratio Test and the Score Test Under a Data-Driven Weighting Scheme in Sequenced Samples.

Camelia C Minică1, Giulio Genovese2, Christina M Hultman3, René Pool1, Jacqueline M Vink4, Michael C Neale1, Conor V Dolan1, Benjamin M Neale2.   

Abstract

Sequence-based association studies are at a critical inflexion point with the increasing availability of exome-sequencing data. A popular test of association is the sequence kernel association test (SKAT). Weights are embedded within SKAT to reflect the hypothesized contribution of the variants to the trait variance. Because the true weights are generally unknown, and so are subject to misspecification, we examined the efficiency of a data-driven weighting scheme. We propose the use of a set of theoretically defensible weighting schemes, of which, we assume, the one that gives the largest test statistic is likely to capture best the allele frequency-functional effect relationship. We show that the use of alternative weights obviates the need to impose arbitrary frequency thresholds. As both the score test and the likelihood ratio test (LRT) may be used in this context, and may differ in power, we characterize the behavior of both tests. The two tests have equal power, if the weights in the set included weights resembling the correct ones. However, if the weights are badly specified, the LRT shows superior power (due to its robustness to misspecification). With this data-driven weighting procedure the LRT detected significant signal in genes located in regions already confirmed as associated with schizophrenia - the PRRC2A (p = 1.020e-06) and the VARS2 (p = 2.383e-06) - in the Swedish schizophrenia case-control cohort of 11,040 individuals with exome-sequencing data. The score test is currently preferred for its computational efficiency and power. Indeed, assuming correct specification, in some circumstances, the score test is the most powerful test. However, LRT has the advantageous properties of being generally more robust and more powerful under weight misspecification. This is an important result given that, arguably, misspecified models are likely to be the rule rather than the exception in weighting-based approaches.

Entities:  

Keywords:  MAF thresholding; SKAT; power; robustness; schizophrenia; variable weighting

Mesh:

Substances:

Year:  2017        PMID: 28238293      PMCID: PMC5357183          DOI: 10.1017/thg.2017.7

Source DB:  PubMed          Journal:  Twin Res Hum Genet        ISSN: 1832-4274            Impact factor:   1.587


  46 in total

1.  Are rare variants responsible for susceptibility to complex diseases?

Authors:  J K Pritchard
Journal:  Am J Hum Genet       Date:  2001-06-12       Impact factor: 11.025

2.  SIFT: Predicting amino acid changes that affect protein function.

Authors:  Pauline C Ng; Steven Henikoff
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Sequence variations in PCSK9, low LDL, and protection against coronary heart disease.

Authors:  Jonathan C Cohen; Eric Boerwinkle; Thomas H Mosley; Helen H Hobbs
Journal:  N Engl J Med       Date:  2006-03-23       Impact factor: 91.245

4.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  Am J Hum Genet       Date:  2008-08-07       Impact factor: 11.025

5.  Most rare missense alleles are deleterious in humans: implications for complex disease and association studies.

Authors:  Gregory V Kryukov; Len A Pennacchio; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2007-03-08       Impact factor: 11.025

Review 6.  Common vs. rare allele hypotheses for complex diseases.

Authors:  Nicholas J Schork; Sarah S Murray; Kelly A Frazer; Eric J Topol
Journal:  Curr Opin Genet Dev       Date:  2009-05-28       Impact factor: 5.578

7.  Young cases of schizophrenia identified in a national inpatient register--are the diagnoses valid?

Authors:  Ch Dalman; J Broms; J Cullberg; P Allebeck
Journal:  Soc Psychiatry Psychiatr Epidemiol       Date:  2002-11       Impact factor: 4.328

Review 8.  The distribution of fitness effects of new mutations.

Authors:  Adam Eyre-Walker; Peter D Keightley
Journal:  Nat Rev Genet       Date:  2007-08       Impact factor: 53.242

9.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  3 in total

1.  Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data.

Authors:  Zihuai He; Bin Xu; Seunggeun Lee; Iuliana Ionita-Laza
Journal:  Am J Hum Genet       Date:  2017-08-24       Impact factor: 11.025

2.  Convex combination sequence kernel association test for rare-variant studies.

Authors:  Daniel C Posner; Honghuang Lin; James B Meigs; Eric D Kolaczyk; Josée Dupuis
Journal:  Genet Epidemiol       Date:  2020-02-26       Impact factor: 2.135

3.  Epigenetic signatures relating to disease-associated genotypic burden in familial risk of bipolar disorder.

Authors:  Sonia Hesam-Shariati; Bronwyn J Overs; Gloria Roberts; Claudio Toma; Oliver J Watkeys; Melissa J Green; Kerrie D Pierce; Howard J Edenberg; Holly C Wilcox; Emma K Stapp; Melvin G McInnis; Leslie A Hulvershorn; John I Nurnberger; Peter R Schofield; Philip B Mitchell; Janice M Fullerton
Journal:  Transl Psychiatry       Date:  2022-08-03       Impact factor: 7.989

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.