Literature DB >> 25196635

Meta-analysis based variable selection for gene expression data.

Quefeng Li1, Sijian Wang, Chiang-Ching Huang, Menggang Yu, Jun Shao.   

Abstract

Recent advance in biotechnology and its wide applications have led to the generation of many high-dimensional gene expression data sets that can be used to address similar biological questions. Meta-analysis plays an important role in summarizing and synthesizing scientific evidence from multiple studies. When the dimensions of datasets are high, it is desirable to incorporate variable selection into meta-analysis to improve model interpretation and prediction. According to our knowledge, all existing methods conduct variable selection with meta-analyzed data in an "all-in-or-all-out" fashion, that is, a gene is either selected in all of studies or not selected in any study. However, due to data heterogeneity commonly exist in meta-analyzed data, including choices of biospecimens, study population, and measurement sensitivity, it is possible that a gene is important in some studies while unimportant in others. In this article, we propose a novel method called meta-lasso for variable selection with high-dimensional meta-analyzed data. Through a hierarchical decomposition on regression coefficients, our method not only borrows strength across multiple data sets to boost the power to identify important genes, but also keeps the selection flexibility among data sets to take into account data heterogeneity. We show that our method possesses the gene selection consistency, that is, when sample size of each data set is large, with high probability, our method can identify all important genes and remove all unimportant genes. Simulation studies demonstrate a good performance of our method. We applied our meta-lasso method to a meta-analysis of five cardiovascular studies. The analysis results are clinically meaningful.
© 2014, The International Biometric Society.

Keywords:  Gene selection; High dimension; Meta-analysis; Weak oracle property

Mesh:

Year:  2014        PMID: 25196635     DOI: 10.1111/biom.12213

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  14 in total

1.  Integrative linear discriminant analysis with guaranteed error rate improvement.

Authors:  Quefeng Li; Lexin Li
Journal:  Biometrika       Date:  2018-10-22       Impact factor: 2.445

2.  Statistical Methods in Integrative Genomics.

Authors:  Sylvia Richardson; George C Tseng; Wei Sun
Journal:  Annu Rev Stat Appl       Date:  2016-04-18       Impact factor: 5.810

3.  A Statistical Framework for Pathway and Gene Identification from Integrative Analysis.

Authors:  Quefeng Li; Menggang Yu; Sijian Wang
Journal:  J Multivar Anal       Date:  2017-01-21       Impact factor: 1.473

4.  GRIA: Graphical Regularization for Integrative Analysis.

Authors:  Changgee Chang; Jihwan Oh; Qi Long
Journal:  Proc SIAM Int Conf Data Min       Date:  2020

5.  Circulating microRNA trafficking and regulation: computational principles and practice.

Authors:  Juan Cui; Jiang Shu
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

6.  Simultaneous Covariance Inference for Multimodal Integrative Analysis.

Authors:  Yin Xia; Lexin Li; Samuel N Lockhart; William J Jagust
Journal:  J Am Stat Assoc       Date:  2019-06-28       Impact factor: 5.033

7.  Modeling Between-Study Heterogeneity for Improved Replicability in Gene Signature Selection and Clinical Prediction.

Authors:  Naim U Rashid; Quefeng Li; Jen Jen Yeh; Joseph G Ibrahim
Journal:  J Am Stat Assoc       Date:  2019-10-29       Impact factor: 5.033

8.  Meta-analytic support vector machine for integrating multiple omics data.

Authors:  SungHwan Kim; Jae-Hwan Jhong; JungJun Lee; Ja-Yong Koo
Journal:  BioData Min       Date:  2017-01-26       Impact factor: 2.522

9.  Dynamic and Modularized MicroRNA Regulation and Its Implication in Human Cancers.

Authors:  Jiang Shu; Bruno Vieira Resende E Silva; Tian Gao; Zheng Xu; Juan Cui
Journal:  Sci Rep       Date:  2017-10-17       Impact factor: 4.379

10.  Meta-Analyzing Multiple Omics Data With Robust Variable Selection.

Authors:  Zongliang Hu; Yan Zhou; Tiejun Tong
Journal:  Front Genet       Date:  2021-07-05       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.