Literature DB >> 16646856

Computing asymptotic power and sample size for case-control genetic association studies in the presence of phenotype and/or genotype misclassification errors.

Fei Ji1, Yaning Yang, Chad Haynes, Stephen J Finch, Derek Gordon.   

Abstract

It is well established that phenotype and genotype misclassification errors reduce the power to detect genetic association. Resampling a subset of the data (e.g, double-sampling) of genotype and/or phenotype with a gold standard measurement is one method to address this issue. We derive the non-centrality parameter (NCP) for the recently published Likelihood Ratio Test Allowing for Error (LRTae) in the presence of random phenotype and genotype errors. With the NCP, power and sample size can be analytically determined at any significance level. We verify analytic power with simulations using a 2**k factorial design given high and low settings of: case and control genotype frequencies, phenotype and genotype misclassification probabilities, total sample size, ratio of cases to controls, and proportions of phenotype and/or genotype double-samples. We also perform example applications of our method assuming equal costs for the LRTae method and the standard method that does not use double-sample information (LRTstd) to determine if power gain due to double-sampling a proportion of samples outweighs the reduction in sample size due to additional costs in obtaining double-samples. Our results showed a median difference of at most 0.01 between analytic and simulation power for the factorial design settings, with maximum difference of 0.054. For our cost/benefits analysis calculations, results for genotype errors are that double-sampling appears most beneficial (in terms of power gain) when cost of double-sampling is relatively low, irrespective of the proportion of individuals double-sampled. In the presence of phenotype error, there is always power gain using the LRTae method for the parameter settings considered. We have freely available software that performs power and sample size calculations for the LRTae method and cost/benefits analyses comparing power for LRTae and LRTstd methods assuming equal costs.

Year:  2006        PMID: 16646856     DOI: 10.2202/1544-6115.1184

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  15 in total

1.  Impact of phenotype definition on genome-wide association signals: empirical evaluation in human immunodeficiency virus type 1 infection.

Authors:  Evangelos Evangelou; Jacques Fellay; Sara Colombo; Javier Martinez-Picado; Niels Obel; David B Goldstein; Amalio Telenti; John P A Ioannidis
Journal:  Am J Epidemiol       Date:  2011-04-13       Impact factor: 4.897

Review 2.  Informatics and machine learning to define the phenotype.

Authors:  Anna Okula Basile; Marylyn DeRiggi Ritchie
Journal:  Expert Rev Mol Diagn       Date:  2018-02-16       Impact factor: 5.225

3.  Genotyping error detection in samples of unrelated individuals without replicate genotyping.

Authors:  Nianjun Liu; Dabao Zhang; Hongyu Zhao
Journal:  Hum Hered       Date:  2008-12-15       Impact factor: 0.444

4.  Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error.

Authors:  Wonkuk Kim; Douglas Londono; Lisheng Zhou; Jinchuan Xing; Alejandro Q Nato; Anthony Musolf; Tara C Matise; Stephen J Finch; Derek Gordon
Journal:  Hum Hered       Date:  2013-04-11       Impact factor: 0.444

5.  The cost effectiveness of duplicate genotyping for testing genetic association.

Authors:  Nathan Tintle; Derek Gordon; Dirk Van Bruggen; Stephen Finch
Journal:  Ann Hum Genet       Date:  2009-03-25       Impact factor: 1.670

6.  TDT-HET: a new transmission disequilibrium test that incorporates locus heterogeneity into the analysis of family-based association data.

Authors:  Douglas Londono; Steven Buyske; Stephen J Finch; Swarkar Sharma; Carol A Wise; Derek Gordon
Journal:  BMC Bioinformatics       Date:  2012-01-20       Impact factor: 3.169

7.  LRTae: improving statistical power for genetic association with case/control data when phenotype and/or genotype misclassification errors are present.

Authors:  Sandra Barral; Chad Haynes; Millicent Stone; Derek Gordon
Journal:  BMC Genet       Date:  2006-04-27       Impact factor: 2.797

8.  Are molecular haplotypes worth the time and expense? A cost-effective method for applying molecular haplotypes.

Authors:  Mark A Levenstien; Jürg Ott; Derek Gordon
Journal:  PLoS Genet       Date:  2006-06-28       Impact factor: 5.917

9.  Testing for linkage and association across the dihydrolipoyl dehydrogenase gene region with Alzheimer's disease in three sample populations.

Authors:  Abraham M Brown; Derek Gordon; Hsinhwa Lee; Fabienne Wavrant-De Vrièze; Elena Cellini; Silvia Bagnoli; Benedetta Nacmias; Sandro Sorbi; John Hardy; John P Blass
Journal:  Neurochem Res       Date:  2007-03-07       Impact factor: 3.996

10.  Computing power and sample size for case-control association studies with copy number polymorphism: application of mixture-based likelihood ratio test.

Authors:  Wonkuk Kim; Derek Gordon; Jonathan Sebat; Kenny Q Ye; Stephen J Finch
Journal:  PLoS One       Date:  2008-10-22       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.