Literature DB >> 17573363

Microarray learning with ABC.

Dhammika Amaratunga1, Javier Cabrera, Vladimir Kovtun.   

Abstract

Standard clustering algorithms when applied to DNA microarray data often tend to produce erroneous clusters. A major contributor to this divergence is the feature characteristic of microarray data sets that the number of predictors (genes) in such data far exceeds the number of samples by many orders of magnitude, with only a small percentage of predictors being truly informative with regards to the clustering while the rest merely add noise. An additional complication is that the predictors exhibit an unknown complex correlational configuration embedded in a small subspace of the entire predictor space. Under these conditions, standard clustering algorithms fail to find the true clusters even when applied in tandem with some sort of gene filtering or dimension reduction to reduce the number of predictors. We propose, as an alternative, a novel method for unsupervised classification of DNA microarray data. The method, which is based on the idea of aggregating results obtained from an ensemble of randomly resampled data (where both samples and genes are resampled), introduces a way of tilting the procedure so that the ensemble includes minimal representation from less important areas of the gene predictor space. The method produces a measure of dissimilarity between each pair of samples that can be used in conjunction with (a) a method like Ward's procedure to generate a cluster analysis and (b) multidimensional scaling to generate useful visualizations of the data. We call the dissimilarity measures ABC dissimilarities since they are obtained by aggregating bundles of clusters. An extensive comparison of several clustering methods using actual DNA microarray data convincingly demonstrates that classification using ABC dissimilarities offers significantly superior performance.

Entities:  

Mesh:

Year:  2007        PMID: 17573363     DOI: 10.1093/biostatistics/kxm017

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  3 in total

1.  Herpesviruses and their genetic diversity in the blood virome of healthy individuals: effect of aging.

Authors:  Arttu Autio; Jalmari Kettunen; Tapio Nevalainen; Bryn Kimura; Mikko Hurme
Journal:  Immun Ageing       Date:  2022-03-12       Impact factor: 6.400

2.  ABC gene-ranking for prediction of drug-induced cholestasis in rats.

Authors:  Yauheniya Cherkas; Michael K McMillian; Dhammika Amaratunga; Nandini Raghavan; Jennifer C Sasaki
Journal:  Toxicol Rep       Date:  2016-01-18

3.  Aging-associated patterns in the expression of human endogenous retroviruses.

Authors:  Tapio Nevalainen; Arttu Autio; Binisha Hamal Mishra; Saara Marttila; Marja Jylhä; Mikko Hurme
Journal:  PLoS One       Date:  2018-12-04       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.