Rui Chen1, Qiang Wei1, Xiaowei Zhan1, Xue Zhong1, James S Sutcliffe1, Nancy J Cox1, Edwin H Cook1, Chun Li1, Wei Chen2, Bingshan Li1. 1. Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA. 2. Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA Department of Molecular Physiology and Biophysics, Vanderbilt University, TN, 37221, USA, Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, Dallas, TX, USA, Center for Quantitative Sciences, Vanderbilt University, TN, 37221, USA, Department of Medicine, University of Chicago, Chicago, IL, USA, Department of Psychiatry, University of Illinois at Chicago, Chicago, IL, USA, Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA and Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA.
Abstract
MOTIVATION: A major focus of current sequencing studies for human genetics is to identify rare variants associated with complex diseases. Aside from reduced power of detecting associated rare variants, controlling for population stratification is particularly challenging for rare variants. Transmission/disequilibrium tests (TDT) based on family designs are robust to population stratification and admixture, and therefore provide an effective approach to rare variant association studies to eliminate spurious associations. To increase power of rare variant association analysis, gene-based collapsing methods become standard approaches for analyzing rare variants. Existing methods that extend this strategy to rare variants in families usually combine TDT statistics at individual variants and therefore lack the flexibility of incorporating other genetic models. RESULTS: In this study, we describe a haplotype-based framework for group-wise TDT (gTDT) that is flexible to encompass a variety of genetic models such as additive, dominant and compound heterozygous (CH) (i.e. recessive) models as well as other complex interactions. Unlike existing methods, gTDT constructs haplotypes by transmission when possible and inherently takes into account the linkage disequilibrium among variants. Through extensive simulations we showed that type I error was correctly controlled for rare variants under all models investigated, and this remained true in the presence of population stratification. Under a variety of genetic models, gTDT showed increased power compared with the single marker TDT. Application of gTDT to an autism exome sequencing data of 118 trios identified potentially interesting candidate genes with CH rare variants. AVAILABILITY AND IMPLEMENTATION: We implemented gTDT in C++ and the source code and the detailed usage are available on the authors' website (https://medschool.vanderbilt.edu/cgg). CONTACT: bingshan.li@vanderbilt.edu or wei.chen@chp.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: A major focus of current sequencing studies for human genetics is to identify rare variants associated with complex diseases. Aside from reduced power of detecting associated rare variants, controlling for population stratification is particularly challenging for rare variants. Transmission/disequilibrium tests (TDT) based on family designs are robust to population stratification and admixture, and therefore provide an effective approach to rare variant association studies to eliminate spurious associations. To increase power of rare variant association analysis, gene-based collapsing methods become standard approaches for analyzing rare variants. Existing methods that extend this strategy to rare variants in families usually combine TDT statistics at individual variants and therefore lack the flexibility of incorporating other genetic models. RESULTS: In this study, we describe a haplotype-based framework for group-wise TDT (gTDT) that is flexible to encompass a variety of genetic models such as additive, dominant and compound heterozygous (CH) (i.e. recessive) models as well as other complex interactions. Unlike existing methods, gTDT constructs haplotypes by transmission when possible and inherently takes into account the linkage disequilibrium among variants. Through extensive simulations we showed that type I error was correctly controlled for rare variants under all models investigated, and this remained true in the presence of population stratification. Under a variety of genetic models, gTDT showed increased power compared with the single marker TDT. Application of gTDT to an autism exome sequencing data of 118 trios identified potentially interesting candidate genes with CH rare variants. AVAILABILITY AND IMPLEMENTATION: We implemented gTDT in C++ and the source code and the detailed usage are available on the authors' website (https://medschool.vanderbilt.edu/cgg). CONTACT: bingshan.li@vanderbilt.edu or wei.chen@chp.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev Journal: Am J Hum Genet Date: 2010-05-13 Impact factor: 11.025
Authors: Stephen F Schaffner; Catherine Foo; Stacey Gabriel; David Reich; Mark J Daly; David Altshuler Journal: Genome Res Date: 2005-11 Impact factor: 9.043
Authors: Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich Journal: Nat Genet Date: 2006-07-23 Impact factor: 38.330
Authors: Juan R González; Josep L Carrasco; Frank Dudbridge; Lluís Armengol; Xavier Estivill; Victor Moreno Journal: Genet Epidemiol Date: 2008-04 Impact factor: 2.135
Authors: Hyun Min Kang; Jae Hoon Sul; Susan K Service; Noah A Zaitlen; Sit-Yee Kong; Nelson B Freimer; Chiara Sabatti; Eleazar Eskin Journal: Nat Genet Date: 2010-03-07 Impact factor: 38.330
Authors: Weizhen Ji; Jia Nee Foo; Brian J O'Roak; Hongyu Zhao; Martin G Larson; David B Simon; Christopher Newton-Cheh; Matthew W State; Daniel Levy; Richard P Lifton Journal: Nat Genet Date: 2008-04-06 Impact factor: 38.330
Authors: William M Brandler; Danny Antaki; Madhusudan Gujral; Amina Noor; Gabriel Rosanio; Timothy R Chapman; Daniel J Barrera; Guan Ning Lin; Dheeraj Malhotra; Amanda C Watts; Lawrence C Wong; Jasper A Estabillo; Therese E Gadomski; Oanh Hong; Karin V Fuentes Fajardo; Abhishek Bhandari; Renius Owen; Michael Baughn; Jeffrey Yuan; Terry Solomon; Alexandra G Moyzis; Michelle S Maile; Stephan J Sanders; Gail E Reiner; Keith K Vaux; Charles M Strom; Kang Zhang; Alysson R Muotri; Natacha Akshoomoff; Suzanne M Leal; Karen Pierce; Eric Courchesne; Lilia M Iakoucheva; Christina Corsello; Jonathan Sebat Journal: Am J Hum Genet Date: 2016-03-24 Impact factor: 11.025
Authors: Karen Y He; Heming Wang; Brian E Cade; Priyanka Nandakumar; Ayush Giri; Erin B Ware; Jeffrey Haessler; Jingjing Liang; Jennifer A Smith; Nora Franceschini; Thu H Le; Charles Kooperberg; Todd L Edwards; Sharon L R Kardia; Xihong Lin; Aravinda Chakravarti; Susan Redline; Xiaofeng Zhu Journal: PLoS Genet Date: 2017-03-27 Impact factor: 5.917
Authors: Rui Chen; Lea K Davis; Stephen Guter; Qiang Wei; Suma Jacob; Melissa H Potter; Nancy J Cox; Edwin H Cook; James S Sutcliffe; Bingshan Li Journal: Mol Autism Date: 2017-03-21 Impact factor: 7.509
Authors: Julian Hecker; F William Townes; Priyadarshini Kachroo; Cecelia Laurie; Jessica Lasky-Su; John Ziniti; Michael H Cho; Scott T Weiss; Nan M Laird; Christoph Lange Journal: Bioinformatics Date: 2020-12-26 Impact factor: 6.937