Literature DB >> 16698769

Averaged gene expressions for regression.

Mee Young Park1, Trevor Hastie, Robert Tibshirani.   

Abstract

Although averaging is a simple technique, it plays an important role in reducing variance. We use this essential property of averaging in regression of the DNA microarray data, which poses the challenge of having far more features than samples. In this paper, we introduce a two-step procedure that combines (1) hierarchical clustering and (2) Lasso. By averaging the genes within the clusters obtained from hierarchical clustering, we define supergenes and use them to fit regression models, thereby attaining concise interpretation and accuracy. Our methods are supported with theoretical justifications and demonstrated on simulated and real data sets.

Mesh:

Year:  2006        PMID: 16698769     DOI: 10.1093/biostatistics/kxl002

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  38 in total

1.  Module-based prediction approach for robust inter-study predictions in microarray data.

Authors:  Zhibao Mi; Kui Shen; Nan Song; Chunrong Cheng; Chi Song; Naftali Kaminski; George C Tseng
Journal:  Bioinformatics       Date:  2010-08-17       Impact factor: 6.937

2.  Sparse regression and marginal testing using cluster prototypes.

Authors:  Stephen Reid; Robert Tibshirani
Journal:  Biostatistics       Date:  2015-11-27       Impact factor: 5.899

3.  Integrating biological knowledge with gene expression profiles for survival prediction of cancer.

Authors:  Xi Chen; Lily Wang
Journal:  J Comput Biol       Date:  2009-02       Impact factor: 1.479

4.  Simultaneous supervised clustering and feature selection over a graph.

Authors:  Xiaotong Shen; Hsin-Cheng Huang; Wei Pan
Journal:  Biometrika       Date:  2012-10-18       Impact factor: 2.445

5.  Pathway-Structured Predictive Model for Cancer Survival Prediction: A Two-Stage Approach.

Authors:  Xinyan Zhang; Yan Li; Tomi Akinyemiju; Akinyemi I Ojesina; Phillip Buckhaults; Nianjun Liu; Bo Xu; Nengjun Yi
Journal:  Genetics       Date:  2016-11-09       Impact factor: 4.562

6.  Radiomics in nuclear medicine: robustness, reproducibility, standardization, and how to avoid data analysis traps and replication crisis.

Authors:  Alex Zwanenburg
Journal:  Eur J Nucl Med Mol Imaging       Date:  2019-06-25       Impact factor: 9.236

7.  The Cluster Elastic Net for High-Dimensional Regression With Unknown Variable Grouping.

Authors:  Daniela M Witten; Ali Shojaie; Fan Zhang
Journal:  Technometrics       Date:  2014-02-20

8.  Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context.

Authors:  Gad Abraham; Adam Kowalczyk; Sherene Loi; Izhak Haviv; Justin Zobel
Journal:  BMC Bioinformatics       Date:  2010-05-25       Impact factor: 3.169

9.  Predictive response-relevant clustering of expression data provides insights into disease processes.

Authors:  Lisa E M Hopcroft; Martin W McBride; Keith J Harris; Amanda K Sampson; John D McClure; Delyth Graham; Graham Young; Tessa L Holyoake; Mark A Girolami; Anna F Dominiczak
Journal:  Nucleic Acids Res       Date:  2010-06-22       Impact factor: 16.971

10.  Pathway index models for construction of patient-specific risk profiles.

Authors:  Kevin H Eng; Sijian Wang; William H Bradley; Janet S Rader; Christina Kendziorski
Journal:  Stat Med       Date:  2012-10-16       Impact factor: 2.373

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.