Literature DB >> 28211093

gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.

Nicholas B Larson1, Shannon McDonnell1, Lisa Cannon Albright2, Craig Teerlink2, Janet Stanford3, Elaine A Ostrander4, William B Isaacs5, Jianfeng Xu6, Kathleen A Cooney2,7,8, Ethan Lange9, Johanna Schleutker10, John D Carpten11, Isaac Powell12, Joan E Bailey-Wilson13, Olivier Cussenot14, Geraldine Cancel-Tassin14, Graham G Giles15,16, Robert J MacInnis15,16, Christiane Maier17, Alice S Whittemore18, Chih-Lin Hsieh19, Fredrik Wiklund20, William J Catalona21, William Foulkes22,23, Diptasri Mandal24, Rosalind Eeles25, Zsofia Kote-Jarai25,26, Michael J Ackerman27, Timothy M Olson27, Christopher J Klein28, Stephen N Thibodeau29, Daniel J Schaid1.   

Abstract

Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results.
© 2017 WILEY PERIODICALS, INC.

Entities:  

Keywords:  gene set; next-generation sequencing; pathway; rare variation

Mesh:

Year:  2017        PMID: 28211093      PMCID: PMC5397327          DOI: 10.1002/gepi.22036

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  32 in total

1.  A new measure of the effective number of tests, a practical tool for comparing families of non-independent significance tests.

Authors:  Nicholas W Galwey
Journal:  Genet Epidemiol       Date:  2009-11       Impact factor: 2.135

2.  Correction for multiple testing in a gene region.

Authors:  Audrey E Hendricks; Josée Dupuis; Mark W Logue; Richard H Myers; Kathryn L Lunetta
Journal:  Eur J Hum Genet       Date:  2013-07-10       Impact factor: 4.246

Review 3.  Genetic evidence for a role of the SREBP transcription system and lipid biosynthesis in schizophrenia and antipsychotic treatment.

Authors:  Vidar M Steen; Silje Skrede; Tatiana Polushina; Miguel López; Ole A Andreassen; Johan Fernø; Stephanie Le Hellard
Journal:  Eur Neuropsychopharmacol       Date:  2016-08-01       Impact factor: 4.600

4.  A description of the Molecular Signatures Database (MSigDB) Web site.

Authors:  Arthur Liberzon
Journal:  Methods Mol Biol       Date:  2014

5.  A rare variant in MYH6 is associated with high risk of sick sinus syndrome.

Authors:  Hilma Holm; Daniel F Gudbjartsson; Patrick Sulem; Gisli Masson; Hafdis Th Helgadottir; Carlo Zanon; Olafur Th Magnusson; Agnar Helgason; Jona Saemundsdottir; Arnaldur Gylfason; Hrafnhildur Stefansdottir; Solveig Gretarsdottir; Stefan E Matthiasson; Gu Mundur Thorgeirsson; Aslaug Jonasdottir; Asgeir Sigurdsson; Hreinn Stefansson; Thomas Werge; Thorunn Rafnar; Lambertus A Kiemeney; Babar Parvez; Raafia Muhammad; Dan M Roden; Dawood Darbar; Gudmar Thorleifsson; G Bragi Walters; Augustine Kong; Unnur Thorsteinsdottir; David O Arnar; Kari Stefansson
Journal:  Nat Genet       Date:  2011-03-06       Impact factor: 38.330

6.  Pathogenic Variants in PIGG Cause Intellectual Disability with Seizures and Hypotonia.

Authors:  Periklis Makrythanasis; Mitsuhiro Kato; Maha S Zaki; Hirotomo Saitsu; Kazuyuki Nakamura; Federico A Santoni; Satoko Miyatake; Mitsuko Nakashima; Mahmoud Y Issa; Michel Guipponi; Audrey Letourneau; Clare V Logan; Nicola Roberts; David A Parry; Colin A Johnson; Naomichi Matsumoto; Hanan Hamamy; Eamonn Sheridan; Taroh Kinoshita; Stylianos E Antonarakis; Yoshiko Murakami
Journal:  Am J Hum Genet       Date:  2016-03-17       Impact factor: 11.025

Review 7.  Common and rare variants in multifactorial susceptibility to common diseases.

Authors:  Walter Bodmer; Carolina Bonilla
Journal:  Nat Genet       Date:  2008-06       Impact factor: 38.330

8.  PID: the Pathway Interaction Database.

Authors:  Carl F Schaefer; Kira Anthony; Shiva Krupa; Jeffrey Buchoff; Matthew Day; Timo Hannay; Kenneth H Buetow
Journal:  Nucleic Acids Res       Date:  2008-10-02       Impact factor: 16.971

9.  The Reactome pathway knowledgebase.

Authors:  David Croft; Antonio Fabregat Mundo; Robin Haw; Marija Milacic; Joel Weiser; Guanming Wu; Michael Caudy; Phani Garapati; Marc Gillespie; Maulik R Kamdar; Bijay Jassal; Steven Jupe; Lisa Matthews; Bruce May; Stanislav Palatnik; Karen Rothfels; Veronica Shamovsky; Heeyeon Song; Mark Williams; Ewan Birney; Henning Hermjakob; Lincoln Stein; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2013-11-15       Impact factor: 16.971

10.  Biological insights from 108 schizophrenia-associated genetic loci.

Authors: 
Journal:  Nature       Date:  2014-07-22       Impact factor: 49.962

View more
  3 in total

1.  An optimal kernel-based multivariate U-statistic to test for associations with multiple phenotypes.

Authors:  Y Wen; Qing Lu
Journal:  Biostatistics       Date:  2022-07-18       Impact factor: 5.279

2.  Stepwise approach to SNP-set analysis illustrated with the Metabochip and colorectal cancer in Japanese Americans of the Multiethnic Cohort.

Authors:  John Cologne; Lenora Loo; Yurii B Shvetsov; Munechika Misumi; Philip Lin; Christopher A Haiman; Lynne R Wilkens; Loïc Le Marchand
Journal:  BMC Genomics       Date:  2018-07-09       Impact factor: 3.969

3.  Multi-Set Testing Strategies Show Good Behavior When Applied to Very Large Sets of Rare Variants.

Authors:  Ruby Fore; Jaden Boehme; Kevin Li; Jason Westra; Nathan Tintle
Journal:  Front Genet       Date:  2020-11-09       Impact factor: 4.599

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.