| Literature DB >> 18387202 |
Ernesto Picardi1, Carla Quagliariello.
Abstract
BACKGROUND: In plant mitochondria, the post-transcriptional RNA editing process converts C to U at a number of specific sites of the mRNA sequence and usually restores phylogenetically conserved codons and the encoded amino acid residues. Sites undergoing RNA editing evolve at a higher rate than sites not modified by the process. As a result, editing sites strongly affect the evolution of plant mitochondrial genomes, representing an important source of sequence variability and potentially informative characters. To date no clear and convincing evidence has established whether or not editing sites really affect the topology of reconstructed phylogenetic trees. For this reason, we investigated here the effect of RNA editing on the tree building process of twenty different plant mitochondrial gene sequences and by means of computer simulations.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18387202 PMCID: PMC2323663 DOI: 10.1186/1471-2105-9-S2-S14
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Methodology overview. Schematic overview of the methodology to in silico evaluate the effect of RNA editing on reconstructed tree topologies.
Accuracy of data sets generated along a 12 taxa tree (JC model).
Accuracy of ML inferred trees from data sets generated along a 12 taxa tree and under a growing percentage of editing positions. Background sites evolved according to JC model [17]. Results from multiple alignments without editing sites are also shown as control.
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.73 | 0.73 | 0.71 | 0.70 | 0.71* | |||
| 1 | 0.72 | 0.66 | 0.61 | 0.60 | 0.57 | |||
| 1 | 0.73 | 0.72 | 0.69 | 0.63 | 0.65 | |||
| 1 | 0.91 | 0.91 | 0.89* | 0.89* | 0.88* | |||
| 1 | 0.9 | 0.85 | 0.78 | 0.76 | 0.71 | |||
| 1 | 0.91 | 0.88 | 0.88 | 0.90 | 0.89 | |||
| 1 | 0.99 | 0.99 | 0.98 | 0.99* | 0.96* | |||
| 1 | 0.99 | 0.97 | 0.94 | 0.91 | 0.86 | |||
| 1 | 0.99 | 0.99 | 0.98 | 0.97 | 0.96 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
| 1 | 0.99 | 0.99 | 0.97 | 0.96 | 0.95 | |||
| 1 | 0.99 | 0.99 | 0.98 | 0.99 | 0.99 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.97 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
* indicates that the comparison between genomic and cDNA accuracy is significant (Pχ21 < 0.05)
Accuracy of data sets generated along a 12 taxa tree (HKY model).
Accuracy of ML inferred trees from data sets generated along a 12 taxa tree and under a growing percentage of editing positions. Background sites evolved according to HKY model [18]. Results from multiple alignments without editing sites are also shown as control.
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.92 | 0.92 | 0.84 | 0.82* | 0.80* | |||
| 1 | 0.89 | 0.85 | 0.80 | 0.74 | 0.72 | |||
| 1 | 0.90 | 0.91 | 0.83 | 0.80 | 0.79 | |||
| 1 | 0.96 | 0.95 | 0.94 | 0.92* | 0.90* | |||
| 1 | 0.93 | 0.93 | 0.91 | 0.83 | 0.77 | |||
| 1 | 0.96 | 0.95 | 0.94 | 0.92 | 0.90 | |||
| 1 | 0.98 | 0.98 | 0.98 | 0.97 | 0.97 | |||
| 1 | 0.98 | 0.98 | 0.96 | 0.95 | 0.92 | |||
| 1 | 0.98 | 0.98 | 0.98 | 0.97 | 0.96 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.98 | 0.97 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.98 | 0.98 | |||
| 1 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |||
* indicates that the comparison between genomic and cDNA accuracy is significant (Pχ21 < 0.05)
Accuracy of data sets generated along a 18 taxa tree (JC and HKY models).
Accuracy of ML inferred trees from data sets of 1,000 bp generated along a 18 taxa tree and under a growing percentage of editing positions. Background sites evolved according both JC and HKY models [17,18]. Results from multiple alignments without editing sites are also shown as control.
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.91 | 0.90* | 0.93* | 0.92* | 0.88* | |||
| 1 | 0.86 | 0.75 | 0.65 | 0.52 | 0.46 | |||
| 1 | 0.91 | 0.88 | 0.94 | 0.89 | 0.88 | |||
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.93 | 0.96* | 0.96* | 0.94* | 0.93* | |||
| 1 | 0.87 | 0.86 | 0.85 | 0.73 | 0.71 | |||
| 1 | 0.91 | 0.90 | 0.95 | 0.83 | 0.83 | |||
* indicates that the comparison between genomic and cDNA accuracy is significant (Pχ21< 0.05)
Accuracy of data sets generated along a 24 taxa tree (JC and HKY models).
Accuracy of ML inferred trees from data sets of 1,000 bp generated along a 24 taxa tree and under a growing percentage of editing positions. Background sites evolved according both JC and HKY models [17,18]. Results from multiple alignments without editing sites are also shown as control.
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.92 | 0.86 | 0.88* | 0.80* | 0.79* | |||
| 1 | 0.88 | 0.82 | 0.77 | 0.64 | 0.58 | |||
| 1 | 0.95 | 0.88 | 0.91 | 0.90 | 0.86 | |||
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.94 | 0.94 | 0.91* | 0.90* | 0.90* | |||
| 1 | 0.93 | 0.92 | 0.83 | 0.78 | 0.73 | |||
| 1 | 0.94 | 0.90 | 0.87 | 0.81 | 0.70 | |||
* indicates that the comparison between genomic and cDNA accuracy is significant (Pχ21< 0.05)
Accuracy of data sets generated along a 18 taxa tree (JC model).
Accuracy of ML inferred trees from data sets of 1,500 bp generated along a 18 taxa tree and under a growing percentage of editing positions. Background sites evolved according to the JC model [17]. Results from multiple alignments lacking editing sites are also shown as control.
| 0 | 1 | 3 | 5 | 7 | 10 | |||
| 1 | 0.98* | 0.95* | 0.95* | 0.95* | 0.93* | |||
| 1 | 0.89 | 0.81 | 0.77 | 0.65 | 0.56 | |||
| 1 | 0.97 | 0.95 | 0.97 | 0.96 | 0.91 | |||
* indicates that the comparison between genomic and cDNA accuracy is significant (Pχ21< 0.05)
Plant mitochondrial genes used in this study and ratioDt values.
For each gene, the number of sequences (N), the mean length (L), the editing percentage (E), the number of variable sites (Vg for genomic and Vc for cDNA), the evolutionary model (Mg for genomic and Mc for cDNA) and the ratioDt are shown.
| Gene | N | L | E | Vg | Vc | Mg | Mc | |
| 7 | 1527.86 | 0.85 | 199 | 202 | HKY+I | HKY+I | 0 | |
| 8 | 2023.50 | 1.24 | 360 | 356 | K80+G | K80+G | 0 | |
| 8 | 504.00 | 2.58 | 143 | 142 | HKY+G | HKY+G | 0 | |
| 7 | 1670.14 | 2.93 | 483 | 472 | GTR+I | GTR+I | 0 | |
| 7 | 980.57 | 4.90 | 71 | 63 | HKY+I | HKY | 0 | |
| 7 | 735.43 | 6.66 | 116 | 107 | HKY+I | HKY+I | 0 | |
| 9 | 962.00 | 3.43 | 510 | 485 | HKY+G+I | HKY+G+I | 0.16 | |
| 9 | 802.67 | 3.86 | 71 | 70 | GTR+I | GTR+G | 0.16 | |
| 14 | 377.79 | 4.24 | 79 | 74 | HKY+G | HKY+G | 0.18 | |
| 7 | 2012.14 | 1.99 | 139 | 124 | HKY+I | HKY+I | 0.25 | |
| 8 | 568.13 | 2.46 | 186 | 182 | GTR+I | HKY+G | 0.25 | |
| 7 | 602.57 | 3.65 | 153 | 157 | HKY+G | HKY+G | 0.25 | |
| 7 | 642.43 | 4.05 | 124 | 119 | F81+I | F81+I | 0.25 | |
| 8 | 350.25 | 2.86 | 45 | 45 | F81+G | F81+G | 0.4 | |
| 10 | 1183.50 | 3.72 | 135 | 129 | HKY+I | HKY+I | 0.43 | |
| 16 | 774.75 | 4.26 | 176 | 160 | HKY+G | HKY+G | 0.5 | |
| 7 | 621.00 | 11.27 | 75 | 72 | HKY | HKY+G | 0.5 | |
| 19 | 363.00 | 14.05 | 86 | 72 | GTR+G | GTR+G | 0.5 | |
| 16 | 234.19 | 6.41 | 81 | 72 | K80+G | HKY+G | 0.54 | |
| 9 | 606.00 | 2.81 | 58 | 52 | HKY+I | HKY+G | 0.6 |
Notes. K80, Kimura; F81, Felsenstein 1981; HKY, Hasegawa-Kishino-Yano; GTR, General time reversible; I, invariant sites; G, gamma correction.
The editing percentage is calculated for each multiple alignment as the proportion of site patterns containing at least one editing event over the total number of site patterns.
Figure 2Examples of genomic and cDNA trees from plant mitochondrial genes. Genomic and cDNA tree topologies for three plant mitochondrial genes with different ratioDt values. A, B) Genomic and cDNA inferred trees for the atp1 gene where ratioDt is equal to 0; C, D) genomic and cDNA topologies deduced for the atp4 gene where there is a moderate editing bias (ratioDt equal to 0.25); E, F) genomic and cDNA trees for the ccb2 gene where there is a strong editing effect (ratioDt equal to 0.50).