Xiang Zhan1, Santhosh Girirajan2,3, Ni Zhao1, Michael C Wu1, Debashis Ghosh4. 1. Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. 2. Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA. 3. Department of Anthropology, Pennsylvania State University, University Park, PA 16802, USA and. 4. Department of Biostatistics and Informatics, University of Colorado, Aurora, CO 80045, USA.
Abstract
MOTIVATION: Copy number variants (CNVs) have been implicated in a variety of neurodevelopmental disorders, including autism spectrum disorders, intellectual disability and schizophrenia. Recent advances in high-throughput genomic technologies have enabled rapid discovery of many genetic variants including CNVs. As a result, there is increasing interest in studying the role of CNVs in the etiology of many complex diseases. Despite the availability of an unprecedented wealth of CNV data, methods for testing association between CNVs and disease-related traits are still under-developed due to the low prevalence and complicated multi-scale features of CNVs. RESULTS: We propose a novel CNV kernel association test (CKAT) in this paper. To address the low prevalence, CNVs are first grouped into CNV regions (CNVR). Then, taking into account the multi-scale features of CNVs, we first design a single-CNV kernel which summarizes the similarity between two CNVs, and next aggregate the single-CNV kernel to a CNVR kernel which summarizes the similarity between two CNVRs. Finally, association between CNVR and disease-related traits is assessed by comparing the kernel-based similarity with the similarity in the trait using a score test for variance components in a random effect model. We illustrate the proposed CKAT using simulations and show that CKAT is more powerful than existing methods, while always being able to control the type I error. We also apply CKAT to a real dataset examining the association between CNV and autism spectrum disorders, which demonstrates the potential usefulness of the proposed method. AVAILABILITY AND IMPLEMENTATION: A R package to implement the proposed CKAT method is available at http://works.bepress.com/debashis_ghosh/ CONTACTS: xzhan@fhcrc.org or debashis.ghosh@ucdenver.eduSupplementary information: Supplementary data are available at Bioinformatics online.
MOTIVATION: Copy number variants (CNVs) have been implicated in a variety of neurodevelopmental disorders, including autism spectrum disorders, intellectual disability and schizophrenia. Recent advances in high-throughput genomic technologies have enabled rapid discovery of many genetic variants including CNVs. As a result, there is increasing interest in studying the role of CNVs in the etiology of many complex diseases. Despite the availability of an unprecedented wealth of CNV data, methods for testing association between CNVs and disease-related traits are still under-developed due to the low prevalence and complicated multi-scale features of CNVs. RESULTS: We propose a novel CNV kernel association test (CKAT) in this paper. To address the low prevalence, CNVs are first grouped into CNV regions (CNVR). Then, taking into account the multi-scale features of CNVs, we first design a single-CNV kernel which summarizes the similarity between two CNVs, and next aggregate the single-CNV kernel to a CNVR kernel which summarizes the similarity between two CNVRs. Finally, association between CNVR and disease-related traits is assessed by comparing the kernel-based similarity with the similarity in the trait using a score test for variance components in a random effect model. We illustrate the proposed CKAT using simulations and show that CKAT is more powerful than existing methods, while always being able to control the type I error. We also apply CKAT to a real dataset examining the association between CNV and autism spectrum disorders, which demonstrates the potential usefulness of the proposed method. AVAILABILITY AND IMPLEMENTATION: A R package to implement the proposed CKAT method is available at http://works.bepress.com/debashis_ghosh/ CONTACTS: xzhan@fhcrc.org or debashis.ghosh@ucdenver.eduSupplementary information: Supplementary data are available at Bioinformatics online.
Authors: Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin Journal: Am J Hum Genet Date: 2010-06-11 Impact factor: 11.025
Authors: Christian R Marshall; Abdul Noor; John B Vincent; Anath C Lionel; Lars Feuk; Jennifer Skaug; Mary Shago; Rainald Moessner; Dalila Pinto; Yan Ren; Bhooma Thiruvahindrapduram; Andreas Fiebig; Stefan Schreiber; Jan Friedman; Cees E J Ketelaars; Yvonne J Vos; Can Ficicioglu; Susan Kirkpatrick; Rob Nicolson; Leon Sloman; Anne Summers; Clare A Gibbons; Ahmad Teebi; David Chitayat; Rosanna Weksberg; Ann Thompson; Cathy Vardy; Vicki Crosbie; Sandra Luscombe; Rebecca Baatjes; Lonnie Zwaigenbaum; Wendy Roberts; Bridget Fernandez; Peter Szatmari; Stephen W Scherer Journal: Am J Hum Genet Date: 2008-01-17 Impact factor: 11.025
Authors: Joseph T Glessner; Kai Wang; Guiqing Cai; Olena Korvatska; Cecilia E Kim; Shawn Wood; Haitao Zhang; Annette Estes; Camille W Brune; Jonathan P Bradfield; Marcin Imielinski; Edward C Frackelton; Jennifer Reichert; Emily L Crawford; Jeffrey Munson; Patrick M A Sleiman; Rosetta Chiavacci; Kiran Annaiah; Kelly Thomas; Cuiping Hou; Wendy Glaberson; James Flory; Frederick Otieno; Maria Garris; Latha Soorya; Lambertus Klei; Joseph Piven; Kacie J Meyer; Evdokia Anagnostou; Takeshi Sakurai; Rachel M Game; Danielle S Rudd; Danielle Zurawiecki; Christopher J McDougle; Lea K Davis; Judith Miller; David J Posey; Shana Michaels; Alexander Kolevzon; Jeremy M Silverman; Raphael Bernier; Susan E Levy; Robert T Schultz; Geraldine Dawson; Thomas Owley; William M McMahon; Thomas H Wassink; John A Sweeney; John I Nurnberger; Hilary Coon; James S Sutcliffe; Nancy J Minshew; Struan F A Grant; Maja Bucan; Edwin H Cook; Joseph D Buxbaum; Bernie Devlin; Gerard D Schellenberg; Hakon Hakonarson Journal: Nature Date: 2009-04-28 Impact factor: 49.962
Authors: Jonathan Sebat; B Lakshmi; Dheeraj Malhotra; Jennifer Troge; Christa Lese-Martin; Tom Walsh; Boris Yamrom; Seungtai Yoon; Alex Krasnitz; Jude Kendall; Anthony Leotta; Deepa Pai; Ray Zhang; Yoon-Ha Lee; James Hicks; Sarah J Spence; Annette T Lee; Kaija Puura; Terho Lehtimäki; David Ledbetter; Peter K Gregersen; Joel Bregman; James S Sutcliffe; Vaidehi Jobanputra; Wendy Chung; Dorothy Warburton; Mary-Claire King; David Skuse; Daniel H Geschwind; T Conrad Gilliam; Kenny Ye; Michael Wigler Journal: Science Date: 2007-03-15 Impact factor: 47.728
Authors: Santhosh Girirajan; Megan Y Dennis; Carl Baker; Maika Malig; Bradley P Coe; Catarina D Campbell; Kenneth Mark; Tiffany H Vu; Can Alkan; Ze Cheng; Leslie G Biesecker; Raphael Bernier; Evan E Eichler Journal: Am J Hum Genet Date: 2013-01-31 Impact factor: 11.025
Authors: Xiang Zhan; Ni Zhao; Anna Plantinga; Timothy A Thornton; Karen N Conneely; Michael P Epstein; Michael C Wu Journal: Genetics Date: 2017-06-22 Impact factor: 4.562