Literature DB >> 35198092

VCSEL: PRIORITIZING SNP-SET BY PENALIZED VARIANCE COMPONENT SELECTION.

Juhyun Kim1, Judong Shen2, Anran Wang2, Devan V Mehrotra2, Seyoon Ko1, Jin J Zhou3, Hua Zhou1.   

Abstract

Single nucleotide polymorphism (SNP) set analysis aggregates both common and rare variants and tests for association between phenotype(s) of interest and a set. However, multiple SNP-sets, such as genes, pathways, or sliding windows are usually investigated across the whole genome in which all groups are tested separately, followed by multiple testing adjustments. We propose a novel method to prioritize SNP-sets in a joint multivariate variance component model. Each SNP-set corresponds to a variance component (or kernel), and model selection is achieved by incorporating either convex or nonconvex penalties. The uniqueness of this variance component selection framework, which we call VCSEL, is that it naturally encompasses multivariate traits (VCSEL-M) and SNP-set-treatment or -environment interactions (VCSEL-I). We devise an optimization algorithm scalable to many variance components, based on the majorization-minimization (MM) principle. Simulation studies demonstrate the superiority of our methods in model selection performance, as measured by the area under the precision-recall (PR) curve, compared to the commonly used marginal testing and group penalization methods. Finally, we apply our methods to a real pharmacogenomics study and a real whole exome sequencing study. Some top ranked genes by VCSEL are detected as insignificant by the marginal test methods which emphasizes formal inference of individual genes with a strict significance threshold. This provides alternative insights for biologists to prioritize follow-up studies and develop polygenic risk score models.

Entities:  

Keywords:  Rare variants; group selection; majorization-minimization (MM); multiple phenotypes; nonconvex penalties; penalized estimation; restricted maximum likelihood (REML); variance components model

Year:  2021        PMID: 35198092      PMCID: PMC8863365          DOI: 10.1214/21-aoas1491

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  61 in total

1.  Optimal tests for rare variant effects in sequencing association studies.

Authors:  Seunggeun Lee; Michael C Wu; Xihong Lin
Journal:  Biostatistics       Date:  2012-06-14       Impact factor: 5.899

2.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  Am J Hum Genet       Date:  2008-08-07       Impact factor: 11.025

3.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

4.  A Statistical Approach for Testing Cross-Phenotype Effects of Rare Variants.

Authors:  K Alaine Broadaway; David J Cutler; Richard Duncan; Jacob L Moore; Erin B Ware; Min A Jhun; Lawrence F Bielak; Wei Zhao; Jennifer A Smith; Patricia A Peyser; Sharon L R Kardia; Debashis Ghosh; Michael P Epstein
Journal:  Am J Hum Genet       Date:  2016-03-03       Impact factor: 11.025

5.  Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9.

Authors:  Jonathan Cohen; Alexander Pertsemlidis; Ingrid K Kotowski; Randall Graham; Christine Kim Garcia; Helen H Hobbs
Journal:  Nat Genet       Date:  2005-01-16       Impact factor: 38.330

Review 6.  Statistical analysis of rare sequence variants: an overview of collapsing methods.

Authors:  Carmen Dering; Claudia Hemmelmann; Elizabeth Pugh; Andreas Ziegler
Journal:  Genet Epidemiol       Date:  2011       Impact factor: 2.135

7.  Multivariate phenotype association analysis by marker-set kernel machine regression.

Authors:  Arnab Maity; Patrick F Sullivan; Jun-Ying Tzeng
Journal:  Genet Epidemiol       Date:  2012-08-16       Impact factor: 2.135

8.  MM Algorithms For Variance Components Models.

Authors:  Hua Zhou; Liuyi Hu; Jin Zhou; Kenneth Lange
Journal:  J Comput Graph Stat       Date:  2019-03-09       Impact factor: 2.302

9.  Kernel Approach for Modeling Interaction Effects in Genetic Association Studies of Complex Quantitative Traits.

Authors:  K Alaine Broadaway; Richard Duncan; Karen N Conneely; Lynn M Almli; Bekh Bradley; Kerry J Ressler; Michael P Epstein
Journal:  Genet Epidemiol       Date:  2015-04-17       Impact factor: 2.135

10.  GEMINI: integrative exploration of genetic variation and genome annotations.

Authors:  Umadevi Paila; Brad A Chapman; Rory Kirchner; Aaron R Quinlan
Journal:  PLoS Comput Biol       Date:  2013-07-18       Impact factor: 4.475

View more
  1 in total

1.  VCSEL: PRIORITIZING SNP-SET BY PENALIZED VARIANCE COMPONENT SELECTION.

Authors:  Juhyun Kim; Judong Shen; Anran Wang; Devan V Mehrotra; Seyoon Ko; Jin J Zhou; Hua Zhou
Journal:  Ann Appl Stat       Date:  2021-12-21       Impact factor: 2.083

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.