| Literature DB >> 23637852 |
Argelia Cuenca1, Gitte Petersen, Ole Seberg.
Abstract
In order to study the evolution of mitochondrial genomes in the early branching lineages of the monocotyledons, i.e., the Acorales and Alismatales, we are sequencing complete genomes from a suite of key taxa. As a starting point the present paper describes the mitochondrial genome of Butomus umbellatus (Butomaceae) based on next-generation sequencing data. The genome was assembled into a circular molecule, 450,826 bp in length. Coding sequences cover only 8.2% of the genome and include 28 protein coding genes, four rRNA genes, and 12 tRNA genes. Some of the tRNA genes and a 16S rRNA gene are transferred from the plastid genome. However, the total amount of recognized plastid sequences in the mitochondrial genome is only 1.5% and the amount of DNA transferred from the nucleus is also low. RNA editing is abundant and a total of 557 edited sites are predicted in the protein coding genes. Compared to the 40 angiosperm mitochondrial genomes sequenced to date, the GC content of the Butomus genome is uniquely high (49.1%). The overall similarity between the mitochondrial genomes of Butomus and Spirodela (Araceae), the closest relative yet sequenced, is low (less than 20%), and the two genomes differ in size by a factor 2. Gene order is also largely unconserved. However, based on its phylogenetic position within the core alismatids Butomus will serve as a good reference point for subsequent studies in the early branching lineages of the monocotyledons.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23637852 PMCID: PMC3634813 DOI: 10.1371/journal.pone.0061552
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of complete mitochondrial genomes of selected seed plants.
| Species | Refseq | Length | GC1 | Coding2 | Protein3 | tRNA4 | rRNA | Introns5 | Plastid5 | Nuclear5 | Repeats6 | Edited |
| (bp) | (%) | (%) | genes | genes | genes | total ( | (%) | (%) | (%) | Sites7 | ||
|
| NC_010303 | 414,903 | 46.9 | 10.0 | 41 | 22 | 3 | 25 (5) | 4.4 | – | 15.1 | 10848 |
|
| KC208619 | 450,826 | 49.1 | 8.2 | 28 | 12 | 49 | 21 (5) | – | 0 | – | 5578 |
|
| NC_017840 | 228,493 | 45.7 | 16.2 | 35 | 18 | 3 | 21 (6) | 4.1 | 0 | – | 5408 |
|
| NC_016740 | 715,001 | 45.1 | 6.3 | 38 | 22 | 3 | 24 (4) | 10.3 | – | 2.3 | 5928 |
|
| EU365401 | 509,941 | 44.1 | 7.8 | 35 | 18 | 3 | 22 (6) | – | – | – | – |
|
| NC_007886 | 491,515 | 43.8 | 10.2 | 35 | 21 | 3 | 23 (6) | 6.3 | 13.4 | 26 | 491 |
|
| NC_007579 | 452,528 | 44.4 | 11.4 | 33 | 16 | 3 | 22 (6) | 3.0 | – | 10.1 | – |
|
| NC_008360 | 468,628 | 43.7 | 8.2 | 32 | 19 | 3 | 22 (6) | – | – | – | – |
|
| NC_008362 | 704,100 | 43.9 | 5.9 | 32 | 17 | 3 | 22 (6) | – | – | – | – |
|
| NC_008333 | 539,368 | 43.9 | 7.2 | 32 | 16 | 3 | 22 (6) | – | – | – | – |
|
| NC_007982 | 569,630 | 43.9 | 8.4 | 32 | 18 | 3 | 22 (7) | 4.5 | – | 22.9 | – |
|
| NC_012119 | 773,279 | 44.1 | 6.3 | 38 | 23 | 3 | 25 (5) | 8.8 | – | 6.84 | 40110 |
|
| NC_001284 | 366,924 | 44.8 | 11.5 | 31 | 17 | 3 | 23 (5) | 1 | 4 | 7 | 441 |
|
| NC_008285 | 221,853 | 45.2 | 17.4 | 32 | 18 | 3 | 24 (5) | 3.6 | – | 5.5 | 427 |
|
| NC_012116 | 476,890 | 45.1 | 8.6 | 39 | 20 | 3 | 24 (5) | – | – | – | – |
|
| NC_014043 | 379,236 | 45.1 | 10.3 | 38 | 20 | 3 | 24 (5)+111 | 6 | 6.4 | 10 | 463 |
|
| NC_014050 | 982,833 | 42.8 | 3.9 | 38 | 26 | 3 | 24 (5) | 11.5 | 2.3 | 38 | 444 |
|
| NC_016005 | 1,555,935 | 44.3 | 2.9 | 37 | 22 | 3 | 23 (5)+111 | 4.6 | 1.3 | 36 | – |
|
| NC_016743 | 380,861 | 45.4 | 9.9 | 31 | 16 | 3 | 22 (5) | – | – | – | – |
|
| NC_016742 | 425,718 | 45.0 | 9.7 | 33 | 17 | 3 | 23 (5) | – | – | – | – |
|
| NC_015121 | 401,262 | 45.1 | 8.8 | 31 | 14 | 3 | 22 (5) | 0.5 | – | – | – |
|
| NC_006581 | 430,597 | 45.0 | 10.1 | 37 | 21 | 3 | 23 (6) | 2.5 | – | >8.1 | 5148 |
|
| NC_002511 | 368,801 | 43.9 | 11.3 | 30 | 21 | 3 | 20 (6) | 2.1 | 3.3 | 10.3 | 357 |
|
| NC_014487 | 253,413 | 42.6 | 14.5 | 25 | 9 | 3 | 19 (6) | 1 | – | – | 287 |
|
| NC_017855 | 281,132 | 45.4 | 18.4 | 34 | 17 | 3 | 24 (5) | 2 | – | 46 | – |
|
| NC_016741 | 510,519 | 43.3 | 7.6 | 30 | 23 | 613 | 23 (5) | 10.5 | – | 1.5 | – |
|
| NC_015141 | 502,773 | 45.0 | 7.6 | 37 | 16 | 3 | 24 (5) | – | – | – | – |
Gene content of the mitochondrial genome of Butomus umbellatus.
|
| |
| Complex I (NADH dehydrogenase) |
|
| Complex II (succinate dehydrogenase) | – |
| Complex III (cytochrome C reductase) |
|
| Complex IV (cytochrome C oxidase) |
|
| Complex V (ATP Synthase) |
|
| Cytochrome C biogenesis |
|
| Other genes |
|
| Large subunit ribosomal proteins | – |
| Small subunit ribosomal proteins |
|
| Transfer RNAs |
|
| Ribosomal RNAs |
|
| Pseudogenes, partial |
|
|
| |
| Ribosomal RNAs |
|
| Transfer RNAs |
|
Figure 1Circular map of the mitochondrial genome of Butomus umbellatus.
Known protein coding, rRNA, and tRNA genes are shown and color coded as indicated in the lower left corner. Genes shown outside and inside of the circle are transcribed clockwise and counter clockwise, respectively. Repeats longer than 1 kb are shown as colored bars named RR1-RR3. The location of repeat units shown inside or outside of the circle is purely practical; information on directionality is given in Table 4. The inner circle shows GC content. The figure was created using OGDRAW [59].
Long repeats in the mitochondrial genome of Butomus umbellatum.
| Repeat | Size (kb) | % Identity | E value | Directionality |
| RR1 | 9.0 | 100 | 0 | +/+ |
| RR2 | 6.3 | 100 | 0 | +/− |
| RR3 | 2.8 | 100 | 0 | +/− |
| RR4 | 0.75 | 97 | 0 | +/+/+/− |
| RR5 | 0.4 | 100 | 0 | +/− |
| RR6 | 0.3 | 88 | 1.00×10−92 | +/− |
| RR7 | 0.25 | 96 | 1.00×10−113 | +/+ |
| RR8 | 0.25 | 100 | 4.00×10−128 | +/+ |
| RR9 | 0.25 | 100 | 6.00×10−126 | +/+ |
| RR10 | 0.2 | 88 | 1.00×10−57 | +/− |
Only repeats >0.2 Kb and identity score >80% are shown. RR1 to RR10 refer to the nomenclature used in Fig. 1. RR4 has four copies, the remaining has only two. Direct and inverted repeats are indicates as+and –.
Sequences of plastid origin in mitochondrial genome of Butomus umbellatus.
| Start | End | Length (bp) | Sequence characteristics |
| 34.784 | 34.889 | 106 |
|
| 47.701 | 48.603 | 903 |
|
| 62.015 | 62.099 | 85 |
|
| 171.200 | 171.359 | 160 |
|
| 215.497 | 215.653 | 157 |
|
| 249.922 | 249.992 | 71 |
|
| 299.456 | 299540 | 85 |
|
| 368.045 | 372.941 | 4.897 |
|
| 434.652 | 434.714 | 63 |
|
| 438.532 | 438.932 | 401 |
|
|
|
Repeated sequence.
trnP(ugg) in CP genomes, trnK(uuu) in Butomus.
Figure 2Gene content in selected seed plant mitochondrial genomes.
Dark squares indicate presence of assumed intact, functional genes, lighter gray squares indicate putative pseudogenes, and white squares indicate absence of genes. Numbers below columns are the total numbers of assumed intact, functional genes. The table is modified and expanded from Sloan et al. [2].
Figure 3Phylogeny of 26 seed plants inferred from 24 protein coding mitochondrial genes.
Phylogenetic tree based on Maximum Likelihood with branch lengths proportional to substitution per site.
Figure 4Phylogenetic analysis of substitution rates in three mitochondrial rRNA genes.
Analyses are based on a constraint tree using the topology from analysis of 24 protein coding genes (Fig. 3). Branch lengths are proportional to substitutions per site except the Silene branch in the rrn5 tree having been reduced to ca. 1/3 of its actual length.