
Robust biclustering by sparse singular value decomposition incorporating stability selection.

Martin Sill, Sebastian Kaiser, Axel Benner, Annette Kopp-Schneider.

Abstract

MOTIVATION: Over the past decade, several biclustering approaches have been published in the field of gene expression data analysis. Despite the huge diversity in the mathematical concepts behind the different biclustering methods, many of them can be related to the singular value decomposition (SVD). Recently, a sparse SVD approach (SSVD) has been proposed to reveal biclusters in gene expression data. In this article, we propose to incorporate stability selection to improve this method. Stability selection is a subsampling-based variable selection method that allows control of Type I error rates. The proposed S4VD algorithm incorporates this subsampling approach to find stable biclusters and to estimate the selection probabilities with which genes and samples belong to the biclusters.
RESULTS: The S4VD method is, so far, the first biclustering approach that takes cluster stability under perturbations of the data into account. Application of the S4VD algorithm to a lung cancer microarray dataset revealed biclusters that correspond to coregulated genes associated with cancer subtypes. Marker genes for different lung cancer subtypes showed high selection probabilities for the corresponding biclusters. Moreover, the genes associated with the biclusters belong to significantly enriched cancer-related Gene Ontology categories. In a simulation study, the S4VD algorithm outperformed the SSVD algorithm and two other SVD-related biclustering methods in recovering artificial biclusters and in being robust to noisy data.
AVAILABILITY: R code for the S4VD algorithm, together with documentation, can be found at http://s4vd.r-forge.r-project.org/.
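
ILLUSTRATION: The idea of combining a sparse rank-1 SVD with subsampling can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the authors' S4VD implementation (which is distributed as R code): the function names, the relative threshold `frac`, the 50% column subsampling, and the 0.8 probability cutoff are all hypothetical choices made for the example, and the sketch applies subsampling around the whole sparse SVD fit rather than inside each update step.

```python
import numpy as np


def soft_threshold(z, lam):
    """Shrink each entry of z toward zero by lam (lasso-type thresholding)."""
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)


def sparse_rank1(X, frac=0.5, n_iter=20):
    """Rank-1 sparse SVD layer via alternating soft-thresholded updates.

    `frac` (assumed, 0 <= frac < 1) sets the threshold to frac * max|z|,
    so the largest entry always survives when X is non-zero.
    """
    U, _, Vt = np.linalg.svd(X, full_matrices=False)
    u, v = U[:, 0], Vt[0]  # start from the leading singular vectors
    for _ in range(n_iter):
        z = X.T @ u
        v = soft_threshold(z, frac * np.abs(z).max())
        v /= np.linalg.norm(v)
        z = X @ v
        u = soft_threshold(z, frac * np.abs(z).max())
        u /= np.linalg.norm(u)
    return u, v


def row_selection_probabilities(X, n_subsamples=100, frac=0.5, seed=0):
    """Fraction of column subsamples in which each row gets a non-zero loading."""
    rng = np.random.default_rng(seed)
    n_rows, n_cols = X.shape
    counts = np.zeros(n_rows)
    for _ in range(n_subsamples):
        # draw half of the columns (samples) without replacement
        cols = rng.choice(n_cols, size=n_cols // 2, replace=False)
        u, _ = sparse_rank1(X[:, cols], frac=frac)
        counts += (u != 0)
    return counts / n_subsamples


# Toy data: Gaussian noise with one planted bicluster (rows 0-9, columns 0-7).
rng = np.random.default_rng(1)
X = rng.normal(size=(50, 20))
X[:10, :8] += 5.0

probs = row_selection_probabilities(X)
stable_rows = np.flatnonzero(probs >= 0.8)  # rows selected in >= 80% of runs
print(stable_rows)
```

Selection probabilities for the columns (samples) can be estimated in the same way by passing `X.T`, which subsamples the rows instead. Keeping only rows and columns with a high selection probability is what makes the resulting bicluster insensitive to which part of the data happened to be observed.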

Year:  2011        PMID: 21636597     DOI: 10.1093/bioinformatics/btr322

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  Subject-specific functional parcellation via prior based eigenanatomy.

Authors:  Paramveer S Dhillon; David A Wolk; Sandhitsu R Das; Lyle H Ungar; James C Gee; Brian B Avants
Journal:  Neuroimage       Date:  2014-05-20       Impact factor: 6.556

2.  Biclustering via sparse clustering.

Authors:  Erika S Helgeson; Qian Liu; Guanhua Chen; Michael R Kosorok; Eric Bair
Journal:  Biometrics       Date:  2019-10-14       Impact factor: 2.571

3.  Sparse principal component analysis in cancer research.

Authors:  Ying-Lin Hsu; Po-Yu Huang; Dung-Tsa Chen
Journal:  Transl Cancer Res       Date:  2014-06       Impact factor: 1.241

4.  Eigenanatomy improves detection power for longitudinal cortical change.

Authors:  Brian Avants; Paramveer Dhillon; Benjamin M Kandel; Philip A Cook; Corey T McMillan; Murray Grossman; James C Gee
Journal:  Med Image Comput Comput Assist Interv       Date:  2012

5.  Generalized Co-Clustering Analysis via Regularized Alternating Least Squares.

Authors:  Gen Li
Journal:  Comput Stat Data Anal       Date:  2020-05-04       Impact factor: 1.681

6.  A new analysis approach of epidermal growth factor receptor pathway activation patterns provides insights into cetuximab resistance mechanisms in head and neck cancer.

Authors:  Silvia von der Heyde; Tim Beissbarth
Journal:  BMC Med       Date:  2012-05-01       Impact factor: 8.775

7.  Configurable pattern-based evolutionary biclustering of gene expression data.

Authors:  Beatriz Pontes; Raúl Giráldez; Jesús S Aguilar-Ruiz
Journal:  Algorithms Mol Biol       Date:  2013-02-23       Impact factor: 1.405

8.  Analysis of regulatory network involved in mechanical induction of embryonic stem cell differentiation.

Authors:  Xinan Zhang; Maria Jaramillo; Satish Singh; Prashant Kumta; Ipsita Banerjee
Journal:  PLoS One       Date:  2012-04-27       Impact factor: 3.240

9.  A composite model for subgroup identification and prediction via bicluster analysis.

Authors:  Hung-Chia Chen; Wen Zou; Tzu-Pin Lu; James J Chen
Journal:  PLoS One       Date:  2014-10-27       Impact factor: 3.240

10.  MultiFacTV: module detection from higher-order time series biological data.

Authors:  Xutao Li; Yunming Ye; Michael Ng; Qingyao Wu
Journal:  BMC Genomics       Date:  2013-10-01       Impact factor: 3.969
