Literature DB >> 19296858

Bayesian clustering and feature selection for cancer tissue samples.

Pekka Marttinen1, Samuel Myllykangas, Jukka Corander.   

Abstract

BACKGROUND: The versatility of DNA copy number amplifications for profiling and categorization of various tissue samples has been widely acknowledged in the biomedical literature. For instance, this type of measurement techniques provides possibilities for exploring sets of cancerous tissues to identify novel subtypes. The previously utilized statistical approaches to various kinds of analyses include traditional algorithmic techniques for clustering and dimension reduction, such as independent and principal component analyses, hierarchical clustering, as well as model-based clustering using maximum likelihood estimation for latent class models.
RESULTS: While purely algorithmic methods are usually easily applicable, their suboptimal performance and limitations in making formal inference have been thoroughly discussed in the statistical literature. Here we introduce a Bayesian model-based approach to simultaneous identification of underlying tissue groups and the informative amplifications. The model-based approach provides the possibility of using formal inference to determine the number of groups from the data, in contrast to the ad hoc methods often exploited for similar purposes. The model also automatically recognizes the chromosomal areas that are relevant for the clustering.
CONCLUSION: Validatory analyses of simulated data and a large database of DNA copy number amplifications in human neoplasms are used to illustrate the potential of our approach. Our software implementation BASTA for performing Bayesian statistical tissue profiling is freely available for academic purposes at (http://web.abo.fi/fak/mnf/mate/jc/software/basta.html).

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19296858      PMCID: PMC2679022          DOI: 10.1186/1471-2105-10-90

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  12 in total

Review 1.  The discovery of receptor tyrosine kinases: targets for cancer therapy.

Authors:  Andreas Gschwind; Oliver M Fischer; Axel Ullrich
Journal:  Nat Rev Cancer       Date:  2004-05       Impact factor: 60.716

2.  DNA copy number amplification profiling of human neoplasms.

Authors:  S Myllykangas; J Himberg; T Böhling; B Nagy; J Hollmén; S Knuutila
Journal:  Oncogene       Date:  2006-06-05       Impact factor: 9.867

3.  Simultaneous feature selection and clustering using mixture models.

Authors:  Martin H C Law; Mário A T Figueiredo; Anil K Jain
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2004-09       Impact factor: 6.226

4.  Bayesian search of functionally divergent protein subgroups and their function specific residues.

Authors:  Pekka Marttinen; Jukka Corander; Petri Törönen; Liisa Holm
Journal:  Bioinformatics       Date:  2006-07-26       Impact factor: 6.937

Review 5.  Comparing antibody and small-molecule therapies for cancer.

Authors:  Kohzoh Imai; Akinori Takaoka
Journal:  Nat Rev Cancer       Date:  2006-09       Impact factor: 60.716

6.  Bayesian identification of admixture events using multilocus molecular markers.

Authors:  Jukka Corander; Pekka Marttinen
Journal:  Mol Ecol       Date:  2006-09       Impact factor: 6.185

7.  Random partition models and exchangeability for Bayesian identification of population structure.

Authors:  Jukka Corander; Mats Gyllenberg; Timo Koski
Journal:  Bull Math Biol       Date:  2006-11-04       Impact factor: 1.758

8.  Targeting tyrosine kinases in cancer: the second wave.

Authors:  Jose Baselga
Journal:  Science       Date:  2006-05-26       Impact factor: 47.728

9.  Language production: Methods and methodologies.

Authors:  K Bock
Journal:  Psychon Bull Rev       Date:  1996-12

10.  Classification of human cancers based on DNA copy number amplification modeling.

Authors:  Samuel Myllykangas; Jarkko Tikka; Tom Böhling; Sakari Knuutila; Jaakko Hollmén
Journal:  BMC Med Genomics       Date:  2008-05-14       Impact factor: 3.063

View more
  5 in total

1.  Efficient Bayesian approach for multilocus association mapping including gene-gene interactions.

Authors:  Pekka Marttinen; Jukka Corander
Journal:  BMC Bioinformatics       Date:  2010-09-02       Impact factor: 3.169

2.  Restricted gene flow among hospital subpopulations of Enterococcus faecium.

Authors:  Rob J L Willems; Janetta Top; Willem van Schaik; Helen Leavis; Marc Bonten; Jukka Sirén; William P Hanage; Jukka Corander
Journal:  mBio       Date:  2012-07-17       Impact factor: 7.867

3.  Clustering gene expression data with a penalized graph-based metric.

Authors:  Ariel E Bayá; Pablo M Granitto
Journal:  BMC Bioinformatics       Date:  2011-01-04       Impact factor: 3.169

4.  Bayesian semi-supervised classification of bacterial samples using MLST databases.

Authors:  Lu Cheng; Thomas R Connor; David M Aanensen; Brian G Spratt; Jukka Corander
Journal:  BMC Bioinformatics       Date:  2011-07-26       Impact factor: 3.169

5.  K-Pax2: Bayesian identification of cluster-defining amino acid positions in large sequence datasets.

Authors:  Alberto Pessia; Yonatan Grad; Sarah Cobey; Juha Santeri Puranen; Jukka Corander
Journal:  Microb Genom       Date:  2015-07-15
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.