Literature DB >> 12235383

Density of points clustering, application to transcriptomic data analysis.

Nicolas Wicker1, Doulaye Dembele, Wolfgang Raffelsberger, Olivier Poch.   

Abstract

With the increasing amount of data produced by high-throughput technologies in many fields of science, clustering has become an integral step in exploratory data analysis in order to group similar elements into classes. However, many clustering algorithms can only work properly if aided by human expertise. For example, one parameter which is crucial and often manually set is the number of clusters present in the analyzed set. We present a novel stopping rule to find the optimal number of clusters based on the comparison of the density of points inside the clusters and between them. The method is evaluated on synthetic as well as on real transcriptomic data and compared with two current methods. Finally, we illustrate its usefulness in the analysis of the expression profiles of promyelocytic cells before and after treatment with all-trans retinoic acid. Simultaneous clustering for gene regulation and absolute initial expression levels allowed the identification of numerous genes associated with signal transduction revealing the complexity of retinoic acid signaling.

Entities:  

Mesh:

Year:  2002        PMID: 12235383      PMCID: PMC137097          DOI: 10.1093/nar/gkf511

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  14 in total

1.  Systematic determination of genetic network architecture.

Authors:  S Tavazoie; J D Hughes; M J Campbell; R J Cho; G M Church
Journal:  Nat Genet       Date:  1999-07       Impact factor: 38.330

2.  Identifying splits with clear separation: a new class discovery method for gene expression data.

Authors:  A von Heydebreck; W Huber; A Poustka; M Vingron
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  Secator: a program for inferring protein subfamilies from phylogenetic trees.

Authors:  N Wicker; G R Perrin; J C Thierry; O Poch
Journal:  Mol Biol Evol       Date:  2001-08       Impact factor: 16.240

4.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

Authors:  E P Xing; R M Karp
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

5.  Statistical estimation of cluster boundaries in gene expression profile data.

Authors:  K Horimoto; H Toh
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

6.  Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters.

Authors:  A V Lukashin; R Fuchs
Journal:  Bioinformatics       Date:  2001-05       Impact factor: 6.937

7.  CLICK: a clustering algorithm with applications to gene expression analysis.

Authors:  R Sharan; R Shamir
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

8.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

Authors:  P M Sharp; W H Li
Journal:  Nucleic Acids Res       Date:  1987-02-11       Impact factor: 16.971

9.  Quality indicators increase the reliability of microarray data.

Authors:  Wolfgang Raffelsberger; Doulaye Dembélé; Mike G Neubauer; Marco M Gottardis; Hinrich Gronemeyer
Journal:  Genomics       Date:  2002-10       Impact factor: 5.736

10.  Retinoic acid-induced apoptosis in leukemia cells is mediated by paracrine action of tumor-selective death ligand TRAIL.

Authors:  L Altucci; A Rossin; W Raffelsberger; A Reitmair; C Chomienne; H Gronemeyer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

View more
  8 in total

1.  PipeAlign: A new toolkit for protein family analysis.

Authors:  Frédéric Plewniak; Laurent Bianchetti; Yann Brelivet; Annaick Carles; Frédéric Chalmel; Odile Lecompte; Thiebaut Mochel; Luc Moulinier; Arnaud Muller; Jean Muller; Veronique Prigent; Raymond Ripp; Jean-Claude Thierry; Julie D Thompson; Nicolas Wicker; Olivier Poch
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Signature of the oligomeric behaviour of nuclear receptors at the sequence and structural level.

Authors:  Yann Brelivet; Sabrina Kammerer; Natacha Rochel; Olivier Poch; Dino Moras
Journal:  EMBO Rep       Date:  2004-04       Impact factor: 8.807

3.  Sequence and comparative genomic analysis of actin-related proteins.

Authors:  Jean Muller; Yukako Oma; Laurent Vallar; Evelyne Friederich; Olivier Poch; Barbara Winsor
Journal:  Mol Biol Cell       Date:  2005-09-29       Impact factor: 4.138

4.  The disruption of the rod-derived cone viability gene leads to photoreceptor dysfunction and susceptibility to oxidative stress.

Authors:  T Cronin; W Raffelsberger; I Lee-Rivera; C Jaillard; M-L Niepon; B Kinzel; E Clérin; A Petrosian; S Picaud; O Poch; J-A Sahel; T Léveillard
Journal:  Cell Death Differ       Date:  2010-02-05       Impact factor: 15.828

5.  MSV3d: database of human MisSense Variants mapped to 3D protein structure.

Authors:  Tien-Dao Luu; Alin-Mihai Rusu; Vincent Walter; Raymond Ripp; Luc Moulinier; Jean Muller; Thierry Toursel; Julie D Thompson; Olivier Poch; Hoan Nguyen
Journal:  Database (Oxford)       Date:  2012-04-03       Impact factor: 3.451

6.  Current awareness on comparative and functional genomics.

Authors: 
Journal:  Comp Funct Genomics       Date:  2003

7.  RETINOBASE: a web database, data mining and analysis platform for gene expression data on retina.

Authors:  Ravi Kiran Reddy Kalathur; Nicolas Gagniere; Guillaume Berthommier; Laetitia Poidevin; Wolfgang Raffelsberger; Raymond Ripp; Thierry Léveillard; Olivier Poch
Journal:  BMC Genomics       Date:  2008-05-05       Impact factor: 3.969

8.  The Annotation, Mapping, Expression and Network (AMEN) suite of tools for molecular systems biology.

Authors:  Frédéric Chalmel; Michael Primig
Journal:  BMC Bioinformatics       Date:  2008-02-06       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.