Literature DB >> 12499289

A divide-and-conquer approach to fragment assembly.

Hasan H Otu1, Khalid Sayood.   

Abstract

MOTIVATION: One of the major problems in DNA sequencing is assembling the fragments obtained by shotgun sequencing. Most existing fragment assembly techniques follow the overlap-layout-consensus approach. This framework requires extensive computation in each phase and becomes inefficient with increasing number of fragments.
RESULTS: We propose a new algorithm which solves the overlap, layout, and consensus phases simultaneously. The fragments are clustered with respect to their Average Mutual Information (AMI) profiles using the k-means algorithm. This removes the unnecessary burden of considering the collection of fragments as a whole. Instead, the orientation and overlap detection are solved efficiently, within the clusters. The algorithm has successfully reconstructed both artificial and real data. AVAILABILITY: Available on request from the authors.

Mesh:

Substances:

Year:  2003        PMID: 12499289     DOI: 10.1093/bioinformatics/19.1.22

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Data Compression Concepts and Algorithms and their Applications to Bioinformatics.

Authors:  O U Nalbantog̃lu; D J Russell; K Sayood
Journal:  Entropy (Basel)       Date:  2010-01-01       Impact factor: 2.524

2.  Use of average mutual information for studying changes in HIV populations.

Authors:  Khalid Sayood; Federico Hoffman; Charles Wood
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2009

3.  Establishment of a pipeline to analyse non-synonymous SNPs in Bos taurus.

Authors:  Michael A Lee; Orla M Keane; Belinda C Glass; Tim R Manley; Neil G Cullen; Ken G Dodds; Alan F McCulloch; Chris A Morris; Mark Schreiber; Jonathan Warren; Amonida Zadissa; Theresa Wilson; John C McEwan
Journal:  BMC Genomics       Date:  2006-11-26       Impact factor: 3.969

4.  Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes.

Authors:  Wen-Chao Li; Zhe-Jin Zhong; Pan-Pan Zhu; En-Ze Deng; Hui Ding; Wei Chen; Hao Lin
Journal:  Front Microbiol       Date:  2014-11-18       Impact factor: 5.640

5.  ARK: Aggregation of Reads by K-Means for Estimation of Bacterial Community Composition.

Authors:  David Koslicki; Saikat Chatterjee; Damon Shahrivar; Alan W Walker; Suzanna C Francis; Louise J Fraser; Mikko Vehkaperä; Yueheng Lan; Jukka Corander
Journal:  PLoS One       Date:  2015-10-23       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.