Literature DB >> 15700408

Clustering binary fingerprint vectors with missing values for DNA array data analysis.

Andres Figueroa1, James Borneman, Tao Jiang.   

Abstract

Oligonucleotide fingerprinting is a powerful DNA array-based method to characterize cDNA and ribosomal RNA gene (rDNA) libraries and has many applications including gene expression profiling and DNA clone classification. We are especially interested in the latter application. A key step in the method is the cluster analysis of fingerprint data obtained from DNA array hybridization experiments. Most of the existing approaches to clustering use (normalized) real intensity values and thus do not treat positive and negative hybridization signals equally (positive signals are much more emphasized). In this paper, we consider a discrete approach. Fingerprint data are first normalized and binarized using control DNA clones. Because there may exist unresolved (or missing) values in this binarization process, we formulate the clustering of (binary) oligonucleotide fingerprints as a combinatorial optimization problem that attempts to identify clusters and resolve the missing values in the fingerprints simultaneously. We study the computational complexity of this clustering problem and a natural parameterized version and present an efficient greedy algorithm based on MINIMUM CLIQUE PARTITION on graphs. The algorithm takes advantage of some unique properties of the graphs considered here, which allow us to efficiently find the maximum cliques as well as some special maximal cliques. Our preliminary experimental results on simulated and real data demonstrate that the algorithm runs faster and performs better than some popular hierarchical and graph-based clustering methods. The results on real data from DNA clone classification also suggest that this discrete approach is more accurate than clustering methods based on real intensity values in terms of separating clones that have different characteristics with respect to the given oligonucleotide probes.

Entities:  

Mesh:

Year:  2004        PMID: 15700408     DOI: 10.1089/cmb.2004.11.887

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  4 in total

1.  Improving oligonucleotide fingerprinting of rRNA genes by implementation of polony microarray technology.

Authors:  Paul M Ruegger; Elizabeth Bent; Wei Li; Daniel R Jeske; Xinping Cui; Jonathan Braun; Tao Jiang; James Borneman
Journal:  J Microbiol Methods       Date:  2012-05-25       Impact factor: 2.363

2.  Bacteria and bacterial rRNA genes associated with the development of colitis in IL-10(-/-) mice.

Authors:  Jingxiao Ye; Jimmy W Lee; Laura L Presley; Elizabeth Bent; Bo Wei; Jonathan Braun; Neal L Schiller; Daniel S Straus; James Borneman
Journal:  Inflamm Bowel Dis       Date:  2008-08       Impact factor: 5.325

3.  Detection and Investigation of Soil Biological Activity against Meloidogyne incognita.

Authors:  E Bent; A Loffredo; M V McKenry; J O Becker; J Borneman
Journal:  J Nematol       Date:  2008-06       Impact factor: 1.402

4.  The role of coral-associated bacterial communities in Australian Subtropical White Syndrome of Turbinaria mesenterina.

Authors:  Scott Godwin; Elizabeth Bent; James Borneman; Lily Pereg
Journal:  PLoS One       Date:  2012-09-06       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.