Literature DB >> 16772399

Nonparametric pathway-based regression models for analysis of genomic data.

Zhi Wei1, Hongzhe Li.   

Abstract

High-throughout genomic data provide an opportunity for identifying pathways and genes that are related to various clinical phenotypes. Besides these genomic data, another valuable source of data is the biological knowledge about genes and pathways that might be related to the phenotypes of many complex diseases. Databases of such knowledge are often called the metadata. In microarray data analysis, such metadata are currently explored in post hoc ways by gene set enrichment analysis but have hardly been utilized in the modeling step. We propose to develop and evaluate a pathway-based gradient descent boosting procedure for nonparametric pathways-based regression (NPR) analysis to efficiently integrate genomic data and metadata. Such NPR models consider multiple pathways simultaneously and allow complex interactions among genes within the pathways and can be applied to identify pathways and genes that are related to variations of the phenotypes. These methods also provide an alternative to mediating the problem of a large number of potential interactions by limiting analysis to biologically plausible interactions between genes in related pathways. Our simulation studies indicate that the proposed boosting procedure can indeed identify relevant pathways. Application to a gene expression data set on breast cancer distant metastasis identified that Wnt, apoptosis, and cell cycle-regulated pathways are more likely related to the risk of distant metastasis among lymph-node-negative breast cancer patients. Results from analysis of other two breast cancer gene expression data sets indicate that the pathways of Metalloendopeptidases (MMPs) and MMP inhibitors, as well as cell proliferation, cell growth, and maintenance are important to breast cancer relapse and survival. We also observed that by incorporating the pathway information, we achieved better prediction for cancer recurrence.

Entities:  

Mesh:

Year:  2006        PMID: 16772399     DOI: 10.1093/biostatistics/kxl007

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  29 in total

1.  Likelihood-based selection and sharp parameter estimation.

Authors:  Xiaotong Shen; Wei Pan; Yunzhang Zhu
Journal:  J Am Stat Assoc       Date:  2012-06-11       Impact factor: 5.033

2.  Sparse combinatorial inference with an application in cancer biology.

Authors:  Sach Mukherjee; Steven Pelech; Richard M Neve; Wen-Lin Kuo; Safiyyah Ziyad; Paul T Spellman; Joe W Gray; Terence P Speed
Journal:  Bioinformatics       Date:  2008-11-27       Impact factor: 6.937

3.  Identification of differential gene pathways with principal component analysis.

Authors:  Shuangge Ma; Michael R Kosorok
Journal:  Bioinformatics       Date:  2009-02-17       Impact factor: 6.937

4.  Integrating biological knowledge with gene expression profiles for survival prediction of cancer.

Authors:  Xi Chen; Lily Wang
Journal:  J Comput Biol       Date:  2009-02       Impact factor: 1.479

5.  U-statistics-based tests for multiple genes in genetic association studies.

Authors:  Zhi Wei; Mingyao Li; Timothy Rebbeck; Hongzhe Li
Journal:  Ann Hum Genet       Date:  2008-08-06       Impact factor: 1.670

6.  Pathway-Structured Predictive Model for Cancer Survival Prediction: A Two-Stage Approach.

Authors:  Xinyan Zhang; Yan Li; Tomi Akinyemiju; Akinyemi I Ojesina; Phillip Buckhaults; Nianjun Liu; Bo Xu; Nengjun Yi
Journal:  Genetics       Date:  2016-11-09       Impact factor: 4.562

7.  A Selective Review of Group Selection in High-Dimensional Models.

Authors:  Jian Huang; Patrick Breheny; Shuangge Ma
Journal:  Stat Sci       Date:  2012       Impact factor: 2.901

8.  Identification of cancer-associated gene clusters and genes via clustering penalization.

Authors:  Shuangge Ma; Jian Huang; Shihao Shen
Journal:  Stat Interface       Date:  2009-01-01       Impact factor: 0.582

9.  Survival associated pathway identification with group Lp penalized global AUC maximization.

Authors:  Zhenqiu Liu; Laurence S Magder; Terry Hyslop; Li Mao
Journal:  Algorithms Mol Biol       Date:  2010-08-16       Impact factor: 1.405

10.  Incorporating pathway information into boosting estimation of high-dimensional risk prediction models.

Authors:  Harald Binder; Martin Schumacher
Journal:  BMC Bioinformatics       Date:  2009-01-13       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.