Literature DB >> 18631023

Efficiently identifying max-gap clusters in pairwise genome comparison.

Xu Ling1, Xin He, Dong Xin, Jiawei Han, Jaiwei Han.   

Abstract

The spatial clustering of genes across different genomes has been used to study important problems in comparative genomics, from identification of operons to detection of homologous regions. A set of formal models and algorithms of so-called max-gap clusters have been proposed recently. These algorithms guarantee the completeness of the results, and the simplicity of the model enables a rigorous statistical test of significance. These features overcome the weakness of many previous methods, which are often heuristic in nature. We developed a very efficient algorithm to compute max-gap clusters in pairwise genome comparison. Our algorithm is an order-of-magnitude faster than the previous algorithm based on the same model under a number of different settings. In our evaluation on two bacterial genomes, we showed that our method could identify known operons as well as some novel structures in the genome. We also demonstrated that the current framework for conserved spatial clustering of genes can be used to detect homologous regions in higher organisms, through the comparison of human and mouse genomes.

Entities:  

Mesh:

Year:  2008        PMID: 18631023     DOI: 10.1089/cmb.2008.0010

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  Statistics for approximate gene clusters.

Authors:  Katharina Jahn; Sascha Winter; Jens Stoye; Sebastian Böcker
Journal:  BMC Bioinformatics       Date:  2013-12-13       Impact factor: 3.169

2.  Finding approximate gene clusters with Gecko 3.

Authors:  Sascha Winter; Katharina Jahn; Stefanie Wehner; Leon Kuchenbecker; Manja Marz; Jens Stoye; Sebastian Böcker
Journal:  Nucleic Acids Res       Date:  2016-09-26       Impact factor: 16.971

3.  G-NEST: a gene neighborhood scoring tool to identify co-conserved, co-expressed genes.

Authors:  Danielle G Lemay; William F Martin; Angie S Hinrichs; Monique Rijnkels; J Bruce German; Ian Korf; Katherine S Pollard
Journal:  BMC Bioinformatics       Date:  2012-09-28       Impact factor: 3.169

4.  Bacterial syntenies: an exact approach with gene quorum.

Authors:  Yves-Pol Deniélou; Marie-France Sagot; Frédéric Boyer; Alain Viari
Journal:  BMC Bioinformatics       Date:  2011-05-24       Impact factor: 3.169

5.  Bidirectional best hit r-window gene clusters.

Authors:  Melvin Zhang; Hon Wai Leong
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.