Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Overlapping clustering of gene expression data using penalized weighted normalized cut.

Literature DB >> 30302823

Overlapping clustering of gene expression data using penalized weighted normalized cut.

Sebastian J Teran Hidalgo¹, Tingyu Zhu², Mengyun Wu^1,3, Shuangge Ma¹.

Abstract

Clustering has been widely conducted in the analysis of gene expression data. For complex diseases, it has played an important role in identifying unknown functions of genes, serving as the basis of other analysis, and others. A common limitation of most existing clustering approaches is to assume that genes are separated into disjoint clusters. As genes often have multiple functions and thus can belong to more than one functional cluster, the disjoint clustering results can be unsatisfactory. In addition, due to the small sample sizes of genetic profiling studies and other factors, there may not be sufficient evidence to confirm the specific functions of some genes and cluster them definitively into disjoint clusters. In this study, we develop an effective overlapping clustering approach, which takes account into the multiplicity of gene functions and lack of certainty in practical analysis. A penalized weighted normalized cut (PWNCut) criterion is proposed based on the NCut technique and an <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>L</mml:mi> <mml:mn>2</mml:mn></mml:msub> </mml:math> norm constraint. It outperforms multiple competitors in simulation. The analysis of the cancer genome atlas (TCGA) data on breast cancer and cervical cancer leads to biologically sensible findings which differ from those using the alternatives. To facilitate implementation, we develop the function pwncut in the R package NCutYX.

Entities: Chemical Disease Gene Species

Keywords: NCut; gene expression data; overlapping clustering; penalization

Mesh：

Year: 2018 PMID： 30302823 PMCID： PMC6239939 DOI： 10.1002/gepi.22164

Source DB: PubMed Journal: Genet Epidemiol ISSN： 0741-0395 Impact factor: 2.135

22 in total

1. CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

Authors: E P Xing; R M Karp
Journal: Bioinformatics Date: 2001 Impact factor: 6.937

2. Fuzzy C-means method for clustering microarray data.

Authors: Doulaye Dembélé; Philippe Kastner
Journal: Bioinformatics Date: 2003-05-22 Impact factor: 6.937

3. Graph-based consensus clustering for class discovery from gene expression data.

Authors: Zhiwen Yu; Hau-San Wong; Hongqiang Wang
Journal: Bioinformatics Date: 2007-09-14 Impact factor: 6.937

4. Combinatorial optimization with use of guided evolutionary simulated annealing.

Authors: P C Yip; Y H Pao
Journal: IEEE Trans Neural Netw Date: 1995

5. A roadmap of clustering algorithms: finding a match for a biomedical application.

Authors: Bill Andreopoulos; Aijun An; Xiaogang Wang; Michael Schroeder
Journal: Brief Bioinform Date: 2009-02-24 Impact factor: 11.622

6. Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data.

Authors: Christoph Bartenhagen; Hans-Ulrich Klein; Christian Ruckert; Xiaoyi Jiang; Martin Dugas
Journal: BMC Bioinformatics Date: 2010-11-18 Impact factor: 3.169

1. Multi-cancer samples clustering via graph regularized low-rank representation method under sparse and symmetric constraints.

Authors: Juan Wang; Cong-Hai Lu; Jin-Xing Liu; Ling-Yun Dai; Xiang-Zhen Kong
Journal: BMC Bioinformatics Date: 2019-12-30 Impact factor: 3.169

1 in total

Overlapping clustering of gene expression data using penalized weighted normalized cut.

1. CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

2. Fuzzy C-means method for clustering microarray data.

3. Graph-based consensus clustering for class discovery from gene expression data.

4. Combinatorial optimization with use of guided evolutionary simulated annealing.

5. A roadmap of clustering algorithms: finding a match for a biomedical application.

6. Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data.

7. Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data.

8. Integrative analysis of multiple cancer genomic datasets under the heterogeneity model.

9. Assisted clustering of gene expression data using ANCut.

10. Clustering multilayer omics data using MuNCut.

1. Multi-cancer samples clustering via graph regularized low-rank representation method under sparse and symmetric constraints.