Jaehoon Lee1, Young Jin Kim2, Juyoung Lee2, Bong-Jo Kim2, Seungyeoun Lee3, Taesung Park1. 1. Department of Statistics, Seoul National University, Seoul 151-742, Korea. 2. Division of Structural and Functional Genomics, Korean National Institute of Health, Osong, Chungchungbuk-Do 363-951, Korea. 3. Department of Mathematics and Statistics, Sejong University, Seoul 143-747, Korea.
Abstract
MOTIVATION: Recently, many methods have been developed for conducting rare-variant association studies for sequencing data. These methods have primarily been based on gene-level associations but have not been proven to be as effective as expected. Gene-set-level tests have shown great advantages over gene-level tests in terms of power and robustness, because complex diseases are often caused by multiple genes that comprise of biological gene sets. RESULTS: Here, we propose several novel gene-set tests that employ rapid and efficient dimensionality reduction. The performance of these tests was investigated using extensive simulations and application to 1058 whole-exome sequences from a Korean population. We identified some known pathways and novel pathways whose rare or common variants are associated with elevated liver enzymes and replicated the results in an independent cohort. AVAILABILITY AND IMPLEMENTATION: Source R code for our algorithm is freely available at http://statgen.snu.ac.kr/software/QTest CONTACT: tspark@stats.snu.ac.kr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Recently, many methods have been developed for conducting rare-variant association studies for sequencing data. These methods have primarily been based on gene-level associations but have not been proven to be as effective as expected. Gene-set-level tests have shown great advantages over gene-level tests in terms of power and robustness, because complex diseases are often caused by multiple genes that comprise of biological gene sets. RESULTS: Here, we propose several novel gene-set tests that employ rapid and efficient dimensionality reduction. The performance of these tests was investigated using extensive simulations and application to 1058 whole-exome sequences from a Korean population. We identified some known pathways and novel pathways whose rare or common variants are associated with elevated liver enzymes and replicated the results in an independent cohort. AVAILABILITY AND IMPLEMENTATION: Source R code for our algorithm is freely available at http://statgen.snu.ac.kr/software/QTest CONTACT: tspark@stats.snu.ac.kr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev Journal: Am J Hum Genet Date: 2010-05-13 Impact factor: 11.025
Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205
Authors: Chun Sun; Lu Sun; Kai Jiang; Dong-Mei Gao; Xiao-Nan Kang; Cun Wang; Shu Zhang; Shan Huang; Xue Qin; Yan Li; Yin-Kun Liu Journal: Int J Biochem Cell Biol Date: 2013-03-07 Impact factor: 5.085
Authors: Phoebe C R Parrish; Delong Liu; Russell H Knutsen; Charles J Billington; Robert P Mecham; Yi-Ping Fu; Beth A Kozel Journal: Hum Mol Genet Date: 2020-07-29 Impact factor: 6.150