Literature DB >> 15737073

Tight clustering: a resampling-based approach for identifying stable and tight patterns in data.

George C Tseng1, Wing H Wong.   

Abstract

In this article, we propose a method for clustering that produces tight and stable clusters without forcing all points into clusters. The methodology is general but was initially motivated from cluster analysis of microarray experiments. Most current algorithms aim to assign all genes into clusters. For many biological studies, however, we are mainly interested in identifying the most informative, tight, and stable clusters of sizes, say, 20-60 genes for further investigation. We want to avoid the contamination of tightly regulated expression patterns of biologically relevant genes due to other genes whose expressions are only loosely compatible with these patterns. "Tight clustering" has been developed specifically to address this problem. It applies K-means clustering as an intermediate clustering engine. Early truncation of a hierarchical clustering tree is used to overcome the local minimum problem in K-means clustering. The tightest and most stable clusters are identified in a sequential manner through an analysis of the tendency of genes to be grouped together under repeated resampling. We validated this method in a simulated example and applied it to analyze a set of expression profiles in the study of embryonic stem cells.

Mesh:

Year:  2005        PMID: 15737073     DOI: 10.1111/j.0006-341X.2005.031032.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  75 in total

1.  Module-based prediction approach for robust inter-study predictions in microarray data.

Authors:  Zhibao Mi; Kui Shen; Nan Song; Chunrong Cheng; Chi Song; Naftali Kaminski; George C Tseng
Journal:  Bioinformatics       Date:  2010-08-17       Impact factor: 6.937

2.  Combined systemic elimination of MET and epidermal growth factor receptor signaling completely abolishes liver regeneration and leads to liver decompensation.

Authors:  Shirish Paranjpe; William C Bowen; Wendy M Mars; Anne Orr; Meagan M Haynes; Marie C DeFrances; Silvia Liu; George C Tseng; Anastasia Tsagianni; George K Michalopoulos
Journal:  Hepatology       Date:  2016-10-01       Impact factor: 17.425

3.  Merging K-means with hierarchical clustering for identifying general-shaped groups.

Authors:  Anna D Peterson; Arka P Ghosh; Ranjan Maitra
Journal:  Stat (Int Stat Inst)       Date:  2018-01-17

4.  A data-mining scheme for identifying peptide structural motifs responsible for different MS/MS fragmentation intensity patterns.

Authors:  Yingying Huang; George C Tseng; Shinsheng Yuan; Ljiljana Pasa-Tolic; Mary S Lipton; Richard D Smith; Vicki H Wysocki
Journal:  J Proteome Res       Date:  2007-12-04       Impact factor: 4.466

5.  An efficient method to identify differentially expressed genes in microarray experiments.

Authors:  Huaizhen Qin; Tao Feng; Scott A Harding; Chung-Jui Tsai; Shuanglin Zhang
Journal:  Bioinformatics       Date:  2008-05-03       Impact factor: 6.937

6.  Bayesian model-based tight clustering for time course data.

Authors:  Yongsung Joo; G Casella; J Hobert
Journal:  Comput Stat       Date:  2010-03       Impact factor: 1.000

7.  Identification of homogeneous genetic architecture of multiple genetically correlated traits by block clustering of genome-wide associations.

Authors:  Mayetri Gupta; Ching-Lung Cheung; Yi-Hsiang Hsu; Serkalem Demissie; L Adrienne Cupples; Douglas P Kiel; David Karasik
Journal:  J Bone Miner Res       Date:  2011-06       Impact factor: 6.741

8.  Combined Systemic Disruption of MET and Epidermal Growth Factor Receptor Signaling Causes Liver Failure in Normal Mice.

Authors:  Anastasia Tsagianni; Wendy M Mars; Bharat Bhushan; William C Bowen; Anne Orr; John Stoops; Shirish Paranjpe; George C Tseng; Silvia Liu; George K Michalopoulos
Journal:  Am J Pathol       Date:  2018-07-20       Impact factor: 4.307

9.  Distinct signaling pathways after higher or lower doses of radiation in three closely related human lymphoblast cell lines.

Authors:  Tzu-Pin Lu; Liang-Chuan Lai; Be-I Lin; Li-Han Chen; Tzu-Hung Hsiao; Howard L Liber; John A Cook; James B Mitchell; Mong-Hsun Tsai; Eric Y Chuang
Journal:  Int J Radiat Oncol Biol Phys       Date:  2010-01-01       Impact factor: 7.038

10.  Spectral clustering strategies for heterogeneous disease expression data.

Authors:  Grace T Huang; Kathryn I Cunningham; Panayiotis V Benos; Chakra S Chennubhotla
Journal:  Pac Symp Biocomput       Date:  2013
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.