Literature DB >> 14660689

Empirical models for substitution in ribosomal RNA.

Andrew D Smith1, Thomas W H Lui, Elisabeth R M Tillier.   

Abstract

Empirical models of substitution are often used in protein sequence analysis because the large alphabet of amino acids requires that many parameters be estimated in all but the simplest parametric models. When information about structure is used in the analysis of substitutions in structured RNA, a similar situation occurs. The number of parameters necessary to adequately describe the substitution process increases in order to model the substitution of paired bases. We have developed a method to obtain substitution rate matrices empirically from RNA alignments that include structural information in the form of base pairs. Our data consisted of alignments from the European Ribosomal RNA Database of Bacterial and Eukaryotic Small Subunit and Large Subunit Ribosomal RNA ( Wuyts et al. 2001. Nucleic Acids Res. 29:175-177; Wuyts et al. 2002. Nucleic Acids Res. 30:183-185). Using secondary structural information, we converted each sequence in the alignments into a sequence over a 20-symbol code: one symbol for each of the four individual bases, and one symbol for each of the 16 ordered pairs. Substitutions in the coded sequences are defined in the natural way, as observed changes between two sequences at any particular site. For given ranges (windows) of sequence divergence, we obtained substitution frequency matrices for the coded sequences. Using a technique originally developed for modeling amino acid substitutions ( Veerassamy, Smith, and Tillier. 2003. J. Comput. Biol. 10:997-1010), we were able to estimate the actual evolutionary distance for each window. The actual evolutionary distances were used to derive instantaneous rate matrices, and from these we selected a universal rate matrix. The universal rate matrices were incorporated into the Phylip Software package ( Felsenstein 2002. http://evolution.genetics.washington.edu/phylip.html), and we analyzed the ribosomal RNA alignments using both distance and maximum likelihood methods. The empirical substitution models performed well on simulated data, and produced reasonable evolutionary trees for 16S ribosomal RNA sequences from sequenced Bacterial genomes. Empirical models have the advantage of being easily implemented, and the fact that the code consists of 20 symbols makes the models easily incorporated into existing programs for protein sequence analysis. In addition, the models are useful for simulating the evolution of RNA sequence and structure simultaneously.

Mesh:

Substances:

Year:  2003        PMID: 14660689     DOI: 10.1093/molbev/msh029

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  10 in total

1.  Size-variable zone in V3 region of 16S rRNA.

Authors:  Francisco Vargas-Albores; Luis Enrique Ortiz-Suárez; Enrique Villalpando-Canchola; Marcel Martínez-Porchas
Journal:  RNA Biol       Date:  2017-05-23       Impact factor: 4.652

2.  The effects of alignment quality, distance calculation method, sequence filtering, and region on the analysis of 16S rRNA gene-based studies.

Authors:  Patrick D Schloss
Journal:  PLoS Comput Biol       Date:  2010-07-08       Impact factor: 4.475

3.  BlastR--fast and accurate database searches for non-coding RNAs.

Authors:  Giovanni Bussotti; Emanuele Raineri; Ionas Erb; Matthias Zytnicki; Andreas Wilm; Emmanuel Beaudoing; Philipp Bucher; Cedric Notredame
Journal:  Nucleic Acids Res       Date:  2011-05-30       Impact factor: 16.971

4.  Diversity measures in environmental sequences are highly dependent on alignment quality--data from ITS and new LSU primers targeting basidiomycetes.

Authors:  Dirk Krüger; Danuta Kapturska; Christiane Fischer; Rolf Daniel; Tesfaye Wubet
Journal:  PLoS One       Date:  2012-02-21       Impact factor: 3.240

5.  Using the nucleotide substitution rate matrix to detect horizontal gene transfer.

Authors:  Micah Hamady; M D Betterton; Rob Knight
Journal:  BMC Bioinformatics       Date:  2006-10-26       Impact factor: 3.169

6.  4SALE--a tool for synchronous RNA sequence and secondary structure alignment and editing.

Authors:  Philipp N Seibel; Tobias Müller; Thomas Dandekar; Jörg Schultz; Matthias Wolf
Journal:  BMC Bioinformatics       Date:  2006-11-13       Impact factor: 3.169

7.  Evolutionary models for insertions and deletions in a probabilistic modeling framework.

Authors:  Elena Rivas
Journal:  BMC Bioinformatics       Date:  2005-03-21       Impact factor: 3.169

8.  What an rRNA secondary structure tells about phylogeny of fungi in Ascomycota with emphasis on evolution of major types of ascus.

Authors:  Wen-Ying Zhuang; Chao-Yang Liu
Journal:  PLoS One       Date:  2012-10-26       Impact factor: 3.240

9.  Visualizing differences in phylogenetic information content of alignments and distinction of three classes of long-branch effects.

Authors:  Johann Wolfgang Wägele; Christoph Mayer
Journal:  BMC Evol Biol       Date:  2007-08-28       Impact factor: 3.260

10.  Probabilistic phylogenetic inference with insertions and deletions.

Authors:  Elena Rivas; Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2008-09-19       Impact factor: 4.475

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.