| Literature DB >> 22534162 |
Jeffrey P Mower1, Andrea L Case, Eric R Floro, John H Willis.
Abstract
Despite intense investigation for over 25 years, the in vivo structure of plant mitochondrial genomes remains uncertain. Mapping studies and genome sequencing generally produce large circular chromosomes, whereas electrophoretic and microscopic studies typically reveal linear and multibranched molecules. To more fully assess the structure of plant mitochondrial genomes, the complete sequence of the monkeyflower (Mimulus guttatus DC. line IM62) mitochondrial DNA was constructed from a large (35 kb) paired-end shotgun sequencing library to a high depth of coverage (~30×). The complete genome maps as a 525,671 bp circular molecule and exhibits a fairly conventional set of features including 62 genes (encoding 35 proteins, 24 transfer RNAs, and 3 ribosomal RNAs), 22 introns, 3 large repeats (2.7, 9.6, and 29 kb), and 96 small repeats (40-293 bp). Most paired-end reads (71%) mapped to the consensus sequence at the expected distance and orientation across the entire genome, validating the accuracy of assembly. Another 10% of reads provided clear evidence of alternative genomic conformations due to apparent rearrangements across large repeats. Quantitative assessment of these repeat-spanning read pairs revealed that all large repeat arrangements are present at appreciable frequencies in vivo, although not always in equimolar amounts. The observed stoichiometric differences for some arrangements are inconsistent with a predominant master circular structure for the mitochondrial genome of M. guttatus IM62. Finally, because IM62 contains a cryptic cytoplasmic male sterility (CMS) system, an in silico search for potential CMS genes was undertaken. The three chimeric open reading frames (ORFs) identified in this study, in addition to the previously identified ORFs upstream of the nad6 gene, are the most likely CMS candidate genes in this line.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22534162 PMCID: PMC3381676 DOI: 10.1093/gbe/evs042
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FDepth of read coverage across the genome. All reads in the Lib35k sublibrary were mapped to the mitochondrial consensus sequence and read depth was calculated in several ways. Top panel: read depth of TRs (red line) that mapped to the mitochondrial consensus sequence and CPs (brown line) that mapped in the proper orientation and distance. Middle panel: read depth of inconsistent read pairs mapping in a forward (IP-F, green line) or reverse (IP-R, cyan line) orientation. Lower panel: read depth of reads whose mate pair did not map to the mitochondrial genome; these unpaired reads were also split into forward (UR-F, purple line) and reverse (UR-R, pink line) mapping orientations. Above the coverage plots, a linear representation of the genome is given that shows the position of all direct repeats (white boxes), inverted repeats (black boxes), and inserted chloroplast DNA (green boxes) >1 kb in length.
FCircular genome map. The outer circle shows sequence and orientation of known protein coding, rRNA, and tRNA genes. Genes are color coded as shown at top left. Genes on the outside and inside of the outer circle are transcribed clockwise and counterclockwise, respectively. The inner circle corresponds to the C1 conformation in figure 3, showing six single-copy regions (gradients of color) separated by the three sets of large repeats R1–R3 (black arrows). Dark green boxes denote chloroplast inserts in the genome.
FAlternative repeat arrangements and mitochondrial genomic conformations. (A) Each of the three large repeats (R1, R2, and R3) is shown in all four possible arrangements (a, b, c, and d). Coexisting arrangements found within particular genomic conformations are paired together as recombination products (a + b and c + d). (B) Eight complete conformations (C1–C8) are possible for the Mimulus mitochondrial genome as the result of homologous recombination between one of the three large repeat pairs (direction of black arrows on genome maps indicates orientation between repeat copies). Recombination between inverted repeats leads to inversions of genomic segments in different conformations (gray arrows), whereas recombination between direct repeats causes genomic fission or fusion events (brown arrows). The repeat facilitating a particular recombination event is labeled on each arrow. The six single-copy genomic regions are shown in gradients of color; the same color for each single-copy region was used in all conformations.
Mimulus Protein-Coding Gene Content Compared with Selected Angiosperms
| Gene | ||||||||
| 28 genes | • | • | • | • | • | • | • | • |
| Ψ | • | ○ | • | • | • | • | Ψ | |
| • | • | ○ | ○ | • | • | Ψ | ○ | |
| • | • | ○ | • | • | • | • | • | |
| ○ | • | ○ | ○ | • | • | • | • | |
| ○ | ○ | ○ | ○ | ○ | ○ | • | • | |
| ○ | ○ | • | • | • | • | • | • | |
| • | • | ○ | ○ | • | • | ○ | ○ | |
| ○ | ○ | ○ | ○ | ○ | ○ | Ψ | ○ | |
| • | • | • | ○ | • | • | • | • | |
| • | Ψ | ○ | Ψ | Ψ | • | Ψ | Ψ | |
| ○ | • | ○ | Ψ | • | • | • | Ψ | |
| • | • | ○ | ○ | • | • | ○ | ○ | |
| • | • | Ψ | Ψ | • | • | ○ | ○ | |
| • (present) | 35 | 37 | 30 | 31 | 38 | 39 | 35 | 33 |
| Ψ (pseudo) | 1 | 1 | 1 | 3 | 1 | 0 | 3 | 3 |
| ○ (absent) | 5 | 3 | 10 | 7 | 2 | 2 | 3 | 5 |
The 28 genes include atp[1, 4, 6, 8, 9], ccm[B, C, Fc, Fn], cob, cox[1, 2, 3], matR, mttB, nad[1, 2, 3, 4, 4L, 5, 6, 7, 9], rpl5, and rps[3, 4, 12]. Although Beta vulgaris ccmC is widely reported to be a pseudogene, it is transcribed, edited, and translated (Mower and Palmer 2006; Kitazaki et al. 2009) and is scored as functionally present here. Arabidopsis ccmFn is split into two genes, but the two halves are counted as a single gene here. Although mttB was reported to be a pseudogene in Vitis due to an absence of a conserved start codon (Goremykin et al. 2009), it is probably translated from an alternative start codon as suggested for other species (Sunkel et al. 1994) and is scored as present here.
The rpl10 gene was recently identified in a wide variety of streptophytes (Mower and Bonen 2009; Kubo and Arimura 2010).
Mimulus Intron Content Compared with Selected Angiosperms
| Intron | ||||||||
| 12 | • | • | • | • | • | • | • | • |
| 5 | ⊖ | ⊖ | ⊖ | ⊖ | ⊖ | ⊖ | ⊖ | ⊖ |
| ○ | • | • | ○ | ○ | • | • | • | |
| • | ○ | ○ | • | • | • | ○ | ○ | |
| ⊖ | ⊖ | ⊖ | • | • | • | ⊖ | ⊖ | |
| • | • | ○ | • | • | • | • | • | |
| ○ | ○ | • | • | • | • | • | • | |
| ○ | • | × | • | • | • | • | × | |
| • | • | ○ | • | • | • | • | • | |
| • | • | × | × | • | • | × | × | |
| • ( | 16 | 17 | 14 | 18 | 19 | 20 | 17 | 16 |
| ⊖ ( | 6 | 6 | 6 | 5 | 5 | 5 | 6 | 6 |
| ○ (intron absent) | 3 | 2 | 3 | 1 | 1 | 0 | 1 | 1 |
| × (gene absent) | 0 | 0 | 2 | 1 | 0 | 0 | 1 | 2 |
The 12 cis-spliced introns include ccmFc-i829, nad1-i477, nad2-[i156, i709, i1282], nad4-[i461, i1399], nad5-[i230, i1872], and nad7-[i140, i209, i917].
The five trans-spliced introns include nad1-[i394, i669], nad2-i542, and nad5-[i1455, i1477].
Read Pair Counts for Alternative Repeat Conformations
| Spanning Read Pairs | ||||
| Region ID | Region Environment | All | Fixed Distance | Compatible Conformations |
| A. Repeat R1 (28,763 bp) | ||||
| R1a | Violet-R1-Green | 72 | 58 | C1, C3, C4, C7 |
| R1b | Cyan-R1-Red | 59 | 53 | C1, C3, C4, C7 |
| R1c | Violet-R1-Red | 67 | 58 | C2, C5, C6, C8 |
| R1d | Cyan-R1-Green | 72 | 61 | C2, C5, C6, C8 |
| 230 | All (×2) | |||
| B. Repeat R2 (2,742 bp) | ||||
| R2a | Green-R2-Red | 283 | 54 | C1, C2, C4, C6 |
| R2b | Yellow-R2-Pink | 356 | 81 | C1, C2, C4, C6 |
| R2c | Green-R2-Pink | 202 | 33 | C3, C5, C7, C8 |
| R2d | Yellow-R2-Red | 196 | 37 | C3, C5, C7, C8 |
| 205 | All (×2) | |||
| C. Repeat R3 (9,620 bp) | ||||
| R3a | Violet-R3-Yellow | 130 | 27 | C1, C2, C3, C5 |
| R3b | Pink-R3-Cyan | 241 | 65 | C1, C2, C3, C5 |
| R3c | Violet-R3-Cyan | 217 | 52 | C4, C6, C7, C8 |
| R3d | Pink-R3-Yellow | 265 | 72 | C4, C6, C7, C8 |
| 216 | All (×2) | |||
| D. Segment R2 + R3 (29,250 bp) | ||||
| R3a + R2b | Violet-R3-Yellow-R2-Pink | 22 | 19 | C1, C2 |
| R3a + R2d | Violet-R3-Yellow-R2-Red | 16 | 14 | C3, C5 |
| R3d + R2b | Pink-R3-Yellow-R2-Pink | 54 | 48 | C4, C6 |
| R3d + R2d | Pink-R3-Yellow-R2-Red | 23 | 19 | C7, C8 |
| 100 | All | |||
| E. Single-copy regions | ||||
| SC1 | Red (40 kb) | 517 | 76 | All |
| SC2 | Red (110 kb) | 555 | 109 | All |
| SC3 | Green (190 kb) | 604 | 115 | All |
| SC4 | Violet (275 kb) | 529 | 91 | All |
| SC5 | Violet (355 kb) | 565 | 110 | All |
| SC6 | Pink (460 kb) | 646 | 136 | All |
Colors listed in the region environments correspond to the single-copy regions in figures 2 and 3.
Numbers in parentheses indicate genomic position in the C1 conformation.
FCandidate CMS genes. (A) Genomic map surrounding the ORFs upstream of the nad6 gene that were previously shown to be associated with the CMS phenotype (Case and Willis 2008). The atp1 promoter region is shown in blue. The beginning is estimated to be 1736 bp upstream of the start of nad6 (Case and Willis 2008). (B) Genomic map surrounding the three ORFs (orf387, orf112, and orf56) identified in this study that may be alternative or additional CMS genes due to their chimeric nature. Small repeats that generate the chimeric portions of the ORFs are color coded according to the genome map in figure 2. Other small repeats are shown on the map in purple. Below each small repeat, the number of read pairs that indicate substoichiometric rearrangements with the other repeat copy is shown, and the genomic position of the other repeat copy (or copies) is given in parentheses. The large repeat R1 is shown in black. A plastid insertion site is shown in green. Additional ORFs not considered candidate CMS genes are shown as shorter boxes with hatched borders.
Chimeric Mitochondrial Genes
| Chimeric ORF | Gene fragment (nt position in ORF) | Fragment size (nt) |
| 71, 326 | ||
| 170, 52 | ||
| 55 | ||
| 44 | ||
| 44 | ||
| 37 | ||
| 34 | ||
| 33 | ||
| 30 |
Gene fragments in bold are in proper orientation and reading frame. rc = reverse complement.