Literature DB >> 30302823

Overlapping clustering of gene expression data using penalized weighted normalized cut.

Sebastian J Teran Hidalgo1, Tingyu Zhu2, Mengyun Wu1,3, Shuangge Ma1.   

Abstract

Clustering has been widely conducted in the analysis of gene expression data. For complex diseases, it has played an important role in identifying unknown functions of genes, serving as the basis of other analysis, and others. A common limitation of most existing clustering approaches is to assume that genes are separated into disjoint clusters. As genes often have multiple functions and thus can belong to more than one functional cluster, the disjoint clustering results can be unsatisfactory. In addition, due to the small sample sizes of genetic profiling studies and other factors, there may not be sufficient evidence to confirm the specific functions of some genes and cluster them definitively into disjoint clusters. In this study, we develop an effective overlapping clustering approach, which takes account into the multiplicity of gene functions and lack of certainty in practical analysis. A penalized weighted normalized cut (PWNCut) criterion is proposed based on the NCut technique and an <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>L</mml:mi> <mml:mn>2</mml:mn></mml:msub> </mml:math> norm constraint. It outperforms multiple competitors in simulation. The analysis of the cancer genome atlas (TCGA) data on breast cancer and cervical cancer leads to biologically sensible findings which differ from those using the alternatives. To facilitate implementation, we develop the function pwncut in the R package NCutYX.
© 2018 Wiley Periodicals, Inc.

Entities:  

Keywords:  NCut; gene expression data; overlapping clustering; penalization

Mesh:

Year:  2018        PMID: 30302823      PMCID: PMC6239939          DOI: 10.1002/gepi.22164

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  22 in total

1.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

Authors:  E P Xing; R M Karp
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

2.  Fuzzy C-means method for clustering microarray data.

Authors:  Doulaye Dembélé; Philippe Kastner
Journal:  Bioinformatics       Date:  2003-05-22       Impact factor: 6.937

3.  Graph-based consensus clustering for class discovery from gene expression data.

Authors:  Zhiwen Yu; Hau-San Wong; Hongqiang Wang
Journal:  Bioinformatics       Date:  2007-09-14       Impact factor: 6.937

4.  Combinatorial optimization with use of guided evolutionary simulated annealing.

Authors:  P C Yip; Y H Pao
Journal:  IEEE Trans Neural Netw       Date:  1995

5.  A roadmap of clustering algorithms: finding a match for a biomedical application.

Authors:  Bill Andreopoulos; Aijun An; Xiaogang Wang; Michael Schroeder
Journal:  Brief Bioinform       Date:  2009-02-24       Impact factor: 11.622

6.  Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data.

Authors:  Christoph Bartenhagen; Hans-Ulrich Klein; Christian Ruckert; Xiaoyi Jiang; Martin Dugas
Journal:  BMC Bioinformatics       Date:  2010-11-18       Impact factor: 3.169

7.  Incorporating gene ontology into fuzzy relational clustering of microarray gene expression data.

Authors:  Animesh Kumar Paul; Pintu Chandra Shill
Journal:  Biosystems       Date:  2017-11-04       Impact factor: 1.973

8.  Integrative analysis of multiple cancer genomic datasets under the heterogeneity model.

Authors:  Jin Liu; Jian Huang; Shuangge Ma
Journal:  Stat Med       Date:  2013-03-21       Impact factor: 2.373

9.  Assisted clustering of gene expression data using ANCut.

Authors:  Sebastian J Teran Hidalgo; Mengyun Wu; Shuangge Ma
Journal:  BMC Genomics       Date:  2017-08-16       Impact factor: 3.969

10.  Clustering multilayer omics data using MuNCut.

Authors:  Sebastian J Teran Hidalgo; Shuangge Ma
Journal:  BMC Genomics       Date:  2018-03-14       Impact factor: 3.969

View more
  1 in total

1.  Multi-cancer samples clustering via graph regularized low-rank representation method under sparse and symmetric constraints.

Authors:  Juan Wang; Cong-Hai Lu; Jin-Xing Liu; Ling-Yun Dai; Xiang-Zhen Kong
Journal:  BMC Bioinformatics       Date:  2019-12-30       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.