Literature DB >> 19188192

Accurate identification of orthologous segments among multiple genomes.

Tsuyoshi Hachiya1, Yasunori Osana, Kris Popendorf, Yasubumi Sakakibara.   

Abstract

MOTIVATION: The accurate detection of orthologous segments (also referred to as syntenic segments) plays a key role in comparative genomics, as it is useful for inferring genome rearrangement scenarios and computing whole-genome alignments. Although a number of algorithms for detecting orthologous segments have been proposed, none of them contain a framework for optimizing their parameter values.
METHODS: In the present study, we propose an algorithm, named OSfinder (Orthologous Segment finder), which uses a novel scoring scheme based on stochastic models. OSfinder takes as input the positions of short homologous regions (also referred to as anchors) and explicitly discriminates orthologous anchors from non-orthologous anchors by using Markov chain models which represent respective geometric distributions of lengths of orthologous and non-orthologous anchors. Such stochastic modeling makes it possible to optimize parameter values by maximizing the likelihood of the input dataset, and to automate the setting of the optimal parameter values.
RESULTS: We validated the accuracies of orthology-mapping algorithms on the basis of their consistency with the orthology annotation of genes. Our evaluation tests using mammalian and bacterial genomes demonstrated that OSfinder shows higher accuracy than previous algorithms. AVAILABILITY: The OSfinder software was implemented as a C++ program. The software is freely available at http://osfinder.dna.bio.keio.ac.jp under the GNU General Public License. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2009        PMID: 19188192     DOI: 10.1093/bioinformatics/btp070

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

1.  Computational methods for Gene Orthology inference.

Authors:  David M Kristensen; Yuri I Wolf; Arcady R Mushegian; Eugene V Koonin
Journal:  Brief Bioinform       Date:  2011-06-19       Impact factor: 11.622

2.  Synonymous codon usage, GC(3), and evolutionary patterns across plastomes of three pooid model species: emerging grass genome models for monocots.

Authors:  Gaurav Sablok; Kinshuk Chandra Nayak; Franck Vazquez; Tatiana V Tatarinova
Journal:  Mol Biotechnol       Date:  2011-10       Impact factor: 2.695

3.  Whole genome assembly of a natto production strain Bacillus subtilis natto from very short read data.

Authors:  Yukari Nishito; Yasunori Osana; Tsuyoshi Hachiya; Kris Popendorf; Atsushi Toyoda; Asao Fujiyama; Mitsuhiro Itaya; Yasubumi Sakakibara
Journal:  BMC Genomics       Date:  2010-04-16       Impact factor: 3.969

4.  Murasaki: a fast, parallelizable algorithm to find anchors from multiple genomes.

Authors:  Kris Popendorf; Hachiya Tsuyoshi; Yasunori Osana; Yasubumi Sakakibara
Journal:  PLoS One       Date:  2010-09-24       Impact factor: 3.240

5.  Efficient large-scale protein sequence comparison and gene matching to identify orthologs and co-orthologs.

Authors:  Khalid Mahmood; Geoffrey I Webb; Jiangning Song; James C Whisstock; Arun S Konagurthu
Journal:  Nucleic Acids Res       Date:  2011-12-30       Impact factor: 16.971

6.  Identification and profiling of novel microRNAs in the Brassica rapa genome based on small RNA deep sequencing.

Authors:  Bumjin Kim; Hee-Ju Yu; Sin-Gi Park; Ja Young Shin; Mijin Oh; Namshin Kim; Jeong-Hwan Mun
Journal:  BMC Plant Biol       Date:  2012-11-19       Impact factor: 4.215

7.  Analyses of genome architecture and gene expression reveal novel candidate virulence factors in the secretome of Phytophthora infestans.

Authors:  Sylvain Raffaele; Joe Win; Liliana M Cano; Sophien Kamoun
Journal:  BMC Genomics       Date:  2010-11-16       Impact factor: 3.969

Review 8.  Positional orthology: putting genomic evolutionary relationships into context.

Authors:  Colin N Dewey
Journal:  Brief Bioinform       Date:  2011-06-24       Impact factor: 11.622

9.  SyMAP v3.4: a turnkey synteny system with application to plant genomes.

Authors:  Carol Soderlund; Matthew Bomhoff; William M Nelson
Journal:  Nucleic Acids Res       Date:  2011-03-11       Impact factor: 16.971

10.  Detecting long tandem duplications in genomic sequences.

Authors:  Eric Audemard; Thomas Schiex; Thomas Faraut
Journal:  BMC Bioinformatics       Date:  2012-05-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.