Literature DB >> 16260473

Structural dynamics of cereal mitochondrial genomes as revealed by complete nucleotide sequencing of the wheat mitochondrial genome.

Yasunari Ogihara1, Yukiko Yamazaki, Koji Murai, Akira Kanno, Toru Terachi, Takashi Shiina, Naohiko Miyashita, Shuhei Nasuda, Chiharu Nakamura, Naoki Mori, Shigeo Takumi, Minoru Murata, Satoshi Futo, Koichiro Tsunewaki.   

Abstract

The application of a new gene-based strategy for sequencing the wheat mitochondrial genome shows its structure to be a 452 528 bp circular molecule, and provides nucleotide-level evidence of intra-molecular recombination. Single, reciprocal and double recombinant products, and the nucleotide sequences of the repeats that mediate their formation have been identified. The genome has 55 genes with exons, including 35 protein-coding, 3 rRNA and 17 tRNA genes. Nucleotide sequences of seven wheat genes have been determined here for the first time. Nine genes have an exon-intron structure. Gene amplification responsible for the production of multicopy mitochondrial genes, in general, is species-specific, suggesting the recent origin of these genes. About 16, 17, 15, 3.0 and 0.2% of wheat mitochondrial DNA (mtDNA) may be of genic (including introns), open reading frame, repetitive sequence, chloroplast and retro-element origin, respectively. The gene order of the wheat mitochondrial gene map shows little synteny to the rice and maize maps, indicative that thorough gene shuffling occurred during speciation. Almost all unique mtDNA sequences of wheat, as compared with rice and maize mtDNAs, are redundant DNA. Features of the gene-based strategy are discussed, and a mechanistic model of mitochondrial gene amplification is proposed.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16260473      PMCID: PMC1275586          DOI: 10.1093/nar/gki925

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The mitochondrial genome is important in plant development, as well as in productivity (1–3), and extensive studies have been done on its functions (4). Although the complete nucleotide sequence has been determined for seven land plant species (5–11), the genomic makeup is not well understood (11–13) because of the multipartite structure of the genome (14–16). With a new gene-based strategy for sequencing the wheat mitochondrial genome, we obtained a number of recombinant molecules, analyses of which for the first time have provided proof, at the nucleotide sequence level, of the mechanism that produces multipartite molecules in the mitochondrial genome. Moreover, we demonstrate by gene map comparison that thorough gene shuffling occurred during the speciation of three cereals (wheat, rice and maize), leading to remarkable changes in their mitochondrial genome structures, as previously shown by the restriction fragment mapping of maize mitochondrial DNA (mtDNAs) (17) and by MultiPipMaker analysis of several sequenced plant mitochondrial genomes (10). Based on this information, we propose a new method for quantifying genome-wide molecular changes in mitochondrial genomes, which result in ontogenetic as well as phylogenetic variability of the cereal mitochondrial genomes. In the wheat complex, Triticum (wheat) and Aegilops (goat grass), inter- as well as intra-specific molecular diversity of both the chloroplast and the mitochondrial genomes were studied in order to clarify the phylogenetic relationships of various taxa of the complex, including the origin of wheat (18,19). Diversity among plasmons of their phenotypic effects on various wheat characters also was investigated [for review see (20)]. However, we have not studied the functional relationships between molecular variation and differential phenotypic effects. We determined recently the complete nucleotide sequence and gene content of the wheat chloroplast genome (21). Here we report those of the mitochondrial genome. The information obtained provides a basis for future studies on the linkage of molecular diversity and phenotypic variability in the wheat complex.

MATERIALS AND METHODS

Plant material

The common wheat, Triticum aestivum cv. Chinese Spring, was the source of the mtDNA studied here, that was obtained from mitochondria of 14-day-old etiolated seedlings (22) and was purified before use (23).

MtDNA library construction and clone sequencing

An mtDNA library was constructed by the SuperCos1 in vitro packaging method (Stratagene, LaJolla) from partially digested wheat mtDNA with Sau3AI. From this library, 232 clones were randomly selected and dot-blotted with 32 mitochondrial genes as probes (24). All probe genes, except rps13, hybridized with 7 or more clones, from which 23 clones were selected to cover 31 probe genes and were sequenced by the shotgun method. Sequenced fragments were aligned using BLASTn (25) to determine the entire sequence of each clone.

Sequence assembly and gene analysis

Alignment of the 23 clones gave two linear molecules of ∼350 and 76 kb. Two additional clones, #194 and #204, whose ends hybridized to one end each of the two linear molecules, were selected and sequenced. Phrap (), BLASTn and blast2sequences programs were used for the primary assembly of all the clones. Manual fine tuning was done to generate the final master circle (MC). Repeat sequences were analyzed by in-house script, window size 8 bp, and represented as a dot-plot image. Open reading frames (ORFs) were identified by a Genome Gambler (Xanagen Co.) and ORFfinder (). tRNA genes were searched for by tRNAscan-SE (26). The annotated rice and maize mitochondrial genes, BA000029 and AY506529, respectively, as well as individual wheat mitochondrial genes submitted to the DNA databank, were compared with our sequence data to annotate all the genes. Sequences homologous to known cereal transposable elements were searched for, referring to the TIGR grass transposable elements database after Clifton et al. (10).

Gene nomenclature and nucleotide position

The nomenclature of Clifton et al. (10) for maize mitochondrial genes was adopted, except for the designation of exons, for which ex-1 to ex-5 of a given gene are indicated by a to e, affixed to its gene symbol. Positions of a forward-strand nucleotide in the MC molecule and in a gene or repeat sequence, respectively, are shown as the ‘MC coordinate’ and ‘gene or repeat coordinate’.

RESULTS

Sequencing of individual clones and their alignment

Twenty-five wheat mtDNA clones were sequenced (Table 1). Their sizes ranged from 27 to 44 kb, except for two (#27 and #39) ∼16 kb in size. The average size was 34 898 bp, and the total size was 872 455 bp. Alignment showed a single 452 528 bp MC molecule (Figure 1). Fifteen clones occupied single locations in the genome (‘intact clone’), while the remaining 10 were split into two or three segments located in different parts of the genome, tentatively called the ‘recombinant clone’. Quetier et al. (15) estimated size of the wheat mitochondrial genome to be ∼430 kb, based on its SalI restriction map. Their estimate is very close to the size, 452 528 bp, determined by the present sequencing work.
Table 1

Wheat mtDNA clones sequenced, showing their size, type, marker genes used and genes other than probe genes identified by sequencing

CloneSize (bp)TypeaProbe genes usedbAdditional genes found by sequencingc
#137 129R(S)nad1a, nad7, rrn5/18mttB, trnfM, trnP, trnS
#533 266IccmFC, rrn26(p)trnK, trnQ
#638 445Inad4, nad5detrnP
#2435 670R(D)cob, rps7, rrn5/18, rrn26trnF, trnfM, trnM, trnS
#2715 896R(S)nad1accmFC
#3135 843R(S)cox2, nad3, nad9, nad2cde, rps12orf173, orf349, rps2, trnD, trnS, trnY
#3916 661Icox2, nad2cde, nad9, atp4orf349, rps2, trnD, trnY
#5134 696R(S)rps7, rrn5/18, rrn26trnF, trnfM, trnS
#6335 769Icox1, rrn26trnK, trnQ
#6636 206Inad7nad4L, rps19(p), trnD, trnfM, trnI, trnK, trnM, trnN, trnS
#7434 458Inad1a, nad1d, nad5ab, nad6, rrn5/18rpl2(p), rps4, trnfM, trnP
#7535 217R(S)atp6, nad1bc, nad5deccmFCa, orf194, orf359, rps13, rpl16, rps1, rps3, trnC
#9237 038Iatp6, cox3, matR, nad1bc, nad1e, nad5c, nad7rpl5, rps13, trnE
#9427 319R(S)nad1d, nad6, rrn26rpl2(p), rps4, trnK, trnQ
#9644 184R(S)atp6, cob, cox3, rrn5/18ccmFN, trnE, trnfM
#10237 709Iatp6, nad5deccmFCa, ccmFCb, orf194, orf359, rpl16, rps3, trnC, trnP
#11039 360Iatp6, cob, ccmFN, rrn5/18, rps1trnfM
#12634 832Icob, cox1, cox2, atp8trnD
#14636 595Icob, matR, nad1e, nad5c, ccmFN, rrn5/18, rps1rpl5, trnfM
#16036 135Inad2ab, nad2cde, nad9, atp4ccmB, orf349, trnK, trnQ, trnY
#16238 872R(S)atp1, atp6, atp9, nad1bcccmFCa, ccmFCb, orf194, orf359, rps13, rpl16, rps3, trnC
#19039 803Icox2, nad3, atp8, rps12mttB, trnS
#19438 416I(None)atp1, nad4L, rps19(p), trnD, trnfM, trnI, trnK, trnM, trnN, trnW
#20434 585Irps7, rrn26trnF, trnS
#22438 351R(D)cob, cox3, nad1a, rrn5/18mttB, rpl5, trnfM, trnP
Total872 455(Average size = 34 898 bp)

aI: intact clone; R(S): single-recombinant clone; S(D): double recombinant clone.

bUnderlined: probe genes not detected by sequencing.

c(p): partial gene.

Figure 1

Alignment of 25 mtDNA clones in the 452 528 bp MC molecule of the wheat mitochondrial genome. Broad, light-green bar shows the MC molecule cleaved between MC coordinates 452 528 and 1. Numbers on the MC molecule show the MC coordinates of the ends of all the clones, their segments and repeat sequences. Rectangle with projection in the broad bar: R1–R9 repeat pairs involved in recombinant clone formation. DRs are dark green, IRs dark brown. The projection shows the direction of each repeat copy. Slender bar: individual clones; light blue, yellow and orange represent intact, single recombinant and double recombinant clones, respectively. L, C or R affixed to clone numbers: Left, central and right segments of a recombinant clone. Note that L and R segments of a single-recombinant clone have the same repeat copy at the end connecting two segments; head-to-tail for DRs, and head-to-head or tail-to-tail for IRs. The double-recombinant clone has a copy of one repeat pair at one end each of its L and C segments and a copy of another repeat pair at the other end of C and at one end of R that, respectively, connect the L and C segments and the C and R segments by recombination.

Intra-molecular recombination and site of recombination

Of the 10 recombinant clones, 8 were split into two segments. The other two (#24 and #224) were cleaved into three segments. Without exception, there was a pair of direct repeats (DRs) or inverted repeats (IRs) at the split site (Figure 1). All the recombinant clones carried a completely or nearly identical copy of the same repeat at the recombination site (details in the next paragraph). DRs connected split fragments head-to-tail, whereas IRs connected them head-to-head or tail-to-tail. These facts indicate that the split clones were produced by intra-molecular recombination between the relevant repeats. In sum, nine repeat pairs, R1 to R9, were responsible for the production of all of the recombinant clones (Table 2). The production of clones #24 and #51 was mediated by the same R7 repeats, whereas #75 (#162 as well) and #96 were reciprocal products of recombination of R8 repeats (Figure 2A). Two clones, #24 and #224, were double recombinants (Figure 2B and C). The former was produced by recombination between two DR pairs, R3 and R7, and the latter recombination between two IR pairs, R2 and R6. Seven additional repeats, R10 to R16, larger than 100 bp were present in the genome (Table 2). Three repeats, R1, R7 and R10, shared a 1634 bp sequence in common, containing a part of rrn26. Similarly, three other repeats, R2, R3 and R4, shared a 4430 bp common sequence that carried trnfM, rrn18 and rrn5. In addition, small repeats of 30–100 bp in size were detected in a dot-matrix image, of which 35 were the direct and 38 were the inverted types. All those repeats are shown in Figure 3, in which R1 to R16 are marked by arrows. We need to search for whether all of them serve as recombination sites or not, although our results showed that a repeat pair as small as 197 bp in size (= R9) mediated recombination.
Table 2

Repeats involved in intra-molecular recombination, and other repeats larger than 100 bp found in the wheat mitochondrial genome

No.TypeaSize (bp)Gene in repeatsbMC coordinatescDifference between copiesRecombinant clone/fragment
Copy-1Copy-2Copy-3
R1DR9882rrn26(p)-trnQ-trnK170 632–180 513262 529–272 409copy-2 1bp def.#94R/L
R2IR6064rrn5-rrn18-trnfM54 623–60 686304 973–298 9101bp mismatch#224C/R
R3DR5469rrn5-rrn18-trnfM53 584–59 052390 552–396 020identical#24C/L
R4IR4430rrn5-rrn18-trnfM304 973–300 544391 591–396 0201bp mismatch#1L/R
R5DR2463No gene159 586–162 048358 521–360 983identical#27L/R
R6IR2045No gene32 869–34 913326 589–324 545identical#224C/L
R7DR1634rrn26(p)170 632–172 265262 529–264 162374 267–375 900identical#24C/R; #51L/Re
R8DR1341atp619 042–20 38284 918–86 257copy-2 1bp def.#75L/R; #96L/R;
+7 bp mismatch#162L/R
R9DR197No gene224 888–225 084d340 161–340 3571 bp mismatch#31L/R
R10DR7035rrn26257 128–264 162368 866–375 900identicalNone
R11DR493atp8233 393–233 885338 884–339 3765 bp mismatchNone
R12DR385trnK178 466–178 850270 362–270 746442 527–442 911copy-3 1 bp dif.None
R13DR207No gene63 737–63 943233 154–233 3601 bp mismatchNone
R14DR190trnP117 643–117 832305 046–305 2354 bp mismatchNone
R15DR186trnD222 071–222 256429 332–429 517identicalNone
R16DR104No gene20 239–20 34286 115–86 218197 109–197 212identicalNone

aDR and IR: direct and inverted repeats.

brrn26(p): partial 422 bp sequence of rrn26.

cBoldface: IR copy.

dIncluded in cox2a.

eBoth recombinations occurred between copy-1 and -3.

Figure 2

Origins of four recombinant clones obtained by recombination mediated by different repeat pairs. Rectangle: MC molecule. Arrows: DR or IR pairs. Broken line: fusion of separate segments by recombination. Thick and thin lines: cloned DNA segment and remaining part of the recombinant molecule not included in the clone. Numbers on MC molecules: MC coordinates at the ends of repeats and the cloned molecule. Note: DRs should be drawn in the same direction by folding the MC molecule with a 180° twist. This was omitted to simplify the figure. (A) Clones #75 and #96 as reciprocal products of R8-mediated recombination. They are part of two subgenomic molecules; (B) clone #24 is the product of double recombination at two DR pairs, R3 and R7; (C) clone #224 is the product of double recombination at two IR pairs, R2 and R6.

Figure 3

Dot matrix of the MC molecule of wheat mitochondrial genome, showing direct (blue) and inverted (orange) repeat pairs of larger than 30 bp. Sixteen repeat pairs, R1–R16, of larger than 100 bp are marked by arrows (Table 2).

We tried to identify the recombination site in each repeat pair. Four, R3, R5, R6 and R7, had identical copies. Four others, R1, R2, R4 and R9, had only 1 nt difference between the two copies, located at the extreme end of the repeat (Table 2). Identification of the recombination site therefore was informative only for repeat pair R8, which was involved in the production of three recombinant clones; #75, #96 and #162 (Figure 4). Two copies of this repeat, R8-1 and -2, which carried atp6-1 and atp6-2 at the same R8 coordinates, 91–1251, had 8 nt differences; one at R8 coordinate 6, the others between coordinates 1301 and 1316 (Figure 4A). Nucleotide sequences of the two R8 copies and their 5′- and 3′-flanking segments were compared with those of the three recombinants. As for the 5′-flanking sequence and sixth nucleotide of R8, #75 and #162 were the same as the R8-1 copy, whereas #96 was the same as R8-2. As for the 3′-flanking sequence and seven variable nucleotides at R8 coordinates 1301–1316, #75 and #162 were the same as the R8-2 copy, whereas #96 was identical to the R8-1 copy. These findings indicate that all three recombinant clones were produced by recombination in the same 1294 bp segment of the R8 repeat (Figure 4A). Previously, Bonen and Bird (27) sequenced wheat mtDNA segments flanking atp6, and found that there are two molecular forms at both the 5′ (‘downstream’ in their designation) and 3′ (‘upstream’) borders of the gene. Their nucleotide sequences were in complete agreement with ours, except for a 1 bp deletion in our R8-2 copy between MC coordinates 86 217 and 86 218. Their sequences 3 and 2 correspond to the 5′ and 3′ borders of the R8-1 copy, and the sequences 4 and 1 to the 5′ and 3′ borders of the R8-2 copy (Figure 4B). They located a 6 bp insertion in sequence 3, extending the homologous region between sequences 3 and 4 by 22 bp downstream (toward the 5′ end), which was confirmed by our findings. The sequence comparison (Figure 4B) indicated that the three recombinant clones were produced by recombination in the same 1291 bp segment (3 bp smaller, comparing with the alignment in Figure 4A). Because this segment occupies ∼95% of the R8 repeat, it is not surprising that three independent recombination events occurred within this segment.
Figure 4

(A and B) Recombination site in R8 repeats which produced the three recombinant clones, #75, #96 and #162. Nucleotide sequences in pink, light green and yellow backgrounds, respectively, are sequences homologous to an R8 copy (R8-1) and its flanking regions, sequences homologous to the other R8 copy (R8-2) and its flanking regions, and the recombination site sequence. In this figure, the forward strands are shown, which are antisense relative to the apt6-coding sequence. Numbers outside and inside the R8 or R8′ repeat: MC coordinates of the nucleotides flanked respectively by two R8 copies and the R8 or R8′ coordinates of the variable nucleotides between them. Capital and lower case letter: Consensus and unique nucleotide between two R8 copies and their flanking regions. Asterisk: deficient nucleotide.

Bonen and Bird (27) also reported the presence of short DRs in three of the above four sequences, corresponding to the present 5′ and 3′ borders of the R8-1 copy and the 5′ border of the R8-2 copy (Figure 4B), where ‘border’ means the boundary between a repeat end and its flanking sequence. We examined 60 bp sequences around the 5′ and 3′ borders (30 bp on both sides of each border) of all repeats shown in Table 2. The complete border sequences are given in Supplementary Table 1. Of 70 border sequences of the 35 repeat copies, 22 contained straight, DRs (no gap, no mismatch) of 3–7 bp while additional 24 possessed aberrant 4–10 bp DRs, having a mismatched nucleotide or a few nucleotides intervening between the repeats, and the remaining 24 did not have short DRs (Table 3). Fourteen repeats had short DRs at both ends, which did not show any sequence similarity, homologous or complementary, to each other. Thus, we conclude that the majority of the repeat ends are associated with short DRs, although their functional role is unknown.
Table 3

Short DRs found in the 5′- and 3′-borders of 16 repeats, R1–R16, in the wheat mitochondrial genome

Repeat5′-Border3′-Border
R1-1TGGt/gg
R1-2C/CCC+cccc
R2-1AA/Aaaa
R2-2AGGTagg/tAA/Gaag
R3-1TTT/TCT+++ttttct
R3-2CCC/T++ccct
R4-1AA/Gaag
R4-2ACATA+++acata/
R5-1ACCTAa/ccta
R5-2CCTA+/ccta
R6-1GCAA/gcaaATTTC+++att/tc
R6-2CAAc/aaCTTGC/ATC++++cttgcttc
R7-1AT/CCCaccccTATTTCA+t/atttaa
R7-2C/CCC+ccccTATTTCA+t/atttaa
R7-3C/CCC+cccc
R8′-1ATCTACA/atctacaACGAAac/gaa
R8′-2ATCT/atct
R9-1
R9-2
R10-1TGCTTTCTTC+++t/tctttcttcTATTTCA+t/atttaa
R10-2/TCTTTCTtctttct
R11-1TCA/tcaAAG/Aaaga
R11-2AAATAAG/+aaaaaag
R12-1AGATCaga/tc
R12-2AGATCaga/tc
R12-3A/GTagt
R13-1
R13-2TAT/TC+tattc
R14-1GCGCTgc/ggt
R14-2GGT/ggtAGGCagg/c
R15-1AAGA/aagaT/ATAtata
R15-2GAAGA+g/aagaTTTCTT/tttctt
R16-1/AATAGCA+++aatagcaGAAAGga/a*g
R16-2/AATAGCA+++aatagca/GA*CTgatct
R16-3/AATAGCA+++aatagcaGAAAGga/a*g

Slash, border; plus, intervening nucleotide between short repeat sequences; asterisk, deficient nucleotide; capital and lower-case letters, short DR sequences; underlined, mismatched nucleotide; sequences of R8′-1 and -2, first identified by Bonen and Bird (27), and confirmed here.

Stern and Palmer (28) indicated that rrn18 and rrn26 often are contained in recombination sites of the wheat mitochondrial genome. Our results confirmed this because 6 of the 12 recombination events detected are mediated by repeat pairs containing those genes (Table 2).

Genes and the genetic map of the wheat mitochondrial genome

In all, 55 genes and their exons were identified (Table 4) and mapped on the MC molecule (Figure 5). All the protein-, rRNA- and tRNA-coding genes known for wheat (24), rice (8) and maize (10) were present, i.e. 9 Complex I genes, 1 Complex III gene, 3 Complex IV genes, 5 Complex V genes, 4 cytochrome c biogenesis genes, 11 ribosomal protein genes, 2 other protein-coding genes, 3 rRNA genes and 17 tRNA genes. Nucleotide sequences of seven wheat genes, rpl16, rps3, rps4, mttB, trnA, trnI and trnM, were determined here for the first time. Three genes, rpl2-p, rps19-p, and rrn26-p (the third rrn26 copy), were truncated. The first two are functional in rice but missing in maize (10). Nine genes, nad1, nad2, nad4, nad5, nad7, cox2, ccmFC, rps3 and trnA (chloroplast origin), had the exon–intron structure. The chloroplast counterpart of trnA also has an intron (21). All exons of nad4 (exons a-d), nad7 (a-e), cox2 (a,b), ccmC (a,b), rps3 (a,b) and trnA (5′-,3′-ex) were cis-spliced, whereas some exons of nad1, nad2 and nad5, were trans-spliced (the slash indicating trans-spliced exons) as follows: nad1a/nad1b,c/nad1d/nad1e; nad2a,b/nad2c-e; and nad5a,b/nad5c/nad5d,e.
Table 4

Genes in the wheat mitochondrial genome

GeneaSize (bp)MC coordinatesStrandbNo. of amino acidsPrevious accession no.c
FromTo
I. Complex I genes
    nad1a386306 345306 730+X57968
    nad1b8217 60217 683X57967
    nad1c19215 98816 179X57967
    nad1d59282 283282 341X57966
    nad1e25943 39443 652X57965
    nad1978×325
    nad2a153182 361182 513Y14433
    nad2b392181 155181 546Y14433
    nad2c161210 083210 243Y14434
    nad2d573207 093207 665Y14434
    nad2e188205 502205 689Y14434
    nad21467488
    nad3357341 099341 455118X59153
    nad4a461135 541136 001+X57164
    nad4b515137 026137 540+X57164
    nad4c423140 981141 403+X57164
    nad4d89143 059143 147+X57164
    nad41488+495
    nad4L303421 629421 931100AJ295996
    nad5a231295 842296 072M74157
    nad5b1216293 764294 979M74157
    nad5c2143 01043 030M74158
    nad5d395111 614112 008+M74159
    nad5e150112 942113 091+M74159
    nad52013×670
    nad6744280 549281 292247X62100
    nad7a143409 840409 982+X75036
    nad7b69410 796410 864+X75036
    nad7c467412 177412 643+X75036
    nad7d244413 642413 885+X75036
    nad7e262415 585415 846+X75036
    nad71185+394
    nad9864211 710212 573287X69720
II. Complex III & IV genes
    cob119763 12264 318398X02352
    cox11575245 285246 859+524Y00417
    cox2a390224 812225 201X01108
    cox2b393223 200223 592X01108
    cox2783260
    cox379828 05328 850265X15944
III. Complex V genes
    atp1153068328361+509X15918
    atp4579196 584197 162192X54311
    atp6-1116119 13220 292386M24084
    atp6-2116185 00886 168386M24084
    atp8-1471338 884339 354156X59153
    atp8-2471233 393233 863156X59153
    atp924388249066+80X15919
IV. Cytochrome c biogenesis genes
    ccmB621185 578186 198206AF082025
    ccmC723156 957157 679+240X79609
    ccmFCa75599 514100 268+AY500223
    ccmFCb559101 280101 838+AY500223
    ccmFC1314+437
    ccmFN177050 11251 881589X69205
V. Ribosomal protein genes
    rpl2-p169283 450283 61856+1/3AJ295995
    rpl557029 77830 347189AJ535507
    rpl1655890 16890 725185New
    rps152549 34149 865174X69205
    rps21083215 091216 173+360Y13920
    rps3a7493 92593 998New
    rps3b161290 57492 185New
    rps31686561
    rps41074273 586274 659357New
    rps7447379 580380 026148X67242
    rps12378340 677341 054125X59153
    rps1335118 62818 978116Y00520
    rps19-p198422 830423 02766AJ295996
VI. Other protein coding genes
    matR203744 17246 208678X57965
    mttB816314 992315 807+271New
VII. rRNA genes
    rrn5-1122302 949303 070+Z14078
    rrn5-2122393 494393 615Z14078
    rrn5-312256 52656 647Z14078
    rrn18-11955300 880302 834+Z14078
    rrn18-21955393 730395 684Z14078
    rrn18-3195556 76258 716Z14078
    rrn26-13467371 222374 688Z11889
    rrn26-23467259 484262 950Z11889
    rrn26-p422170 632171 053Z11889
VIII. tRNA genes
    trnA 5′-ex*3874 73874 775+New
    trnA 3′-ex*3575 58175 615+New
    trnA*73+
    trnC*7197 42097 490+X15119
    trnD-174429 341429 414X15379
    trnD-274222 080222 153X15379
    trnE7227 05027 121X14698
    trnF*73382 956383 028X15118
    trnfM-174300 805300 878+Z14078
    trnfM-274395 686395 759Z14078
    trnfM-37458 71858 791Z14078
    trnI74430 118430 191New
    trnK-173270 536270 608+X15236
    trnK-273442 701442 773+X15236
    trnK-373178 640178 712+X15236
    trnM73436 154436 226New
    trnN*72428 634428 705X15379
    trnP-175305 095305 169+Z14078
    trnP-275117 692117 766+Z14078
    trnQ-172266 806266 877+X15140
    trnQ-272174 909174 980+X15140
    trnQ-372193 282193 353+X06902
    trnS-188341 968342 055X13245
    trnS-287408 505408 591+X15118
    trnS-3*87383 444383 530X15118
    trnW*74445 613445 686+X05602
    trnY83210 880210 962Y14434

aBoldface, sum of all exons; lower-case letters, exons of a protein-coding gene; hyphenated, copies of the same gene; asterisk: probable chloroplast origin.

bPlus and minus, coded by the forward and reverse strand; ×, trans-spliced gene.

cNew, gene or exon whose nucleotide sequence is first reported for wheat.

Figure 5

Genetic map of the wheat mitochondrial genome showing the location of all the genes and their exons in the outer-most circle, of ORFs larger than 300 bp in the central circle, and of chloroplast-derived DNA segments in the inner-most circle. The broad, outer-most circle represents the MC molecule, in which the nine repeat pairs, R1–R9, that mediate production of all the recombinant clones and an additional 7035 bp repeat pair, R10, are shown. Genes and exons coded by the forward and reverse DNA strands are shown outside and inside the MC molecule, respectively.

Ten genes were present in multi-copy: atp6, atp8, rrn26, trnD and trnP were duplicated and rrn5, rrn18, trnfM, trnK and trnQ triplicated. In addition, three trnS genes were found, but they greatly differed each other in nucleotide sequence and therefore were considered different genes, confirming the results of two previous works (29,30). Restriction fragment analyses of wheat mtDNA revealed the presence of seven molecular forms of the rrn18-rrn5 cluster (31,32). We identified three copies, Copy-1, -2 and -3, of a three-gene cluster, trnfM-rrn18-rrn5, in the MC molecule, all of which were included in three repeats, R2, R3 and R4 (Table 2). Figure 6 illustrates the production of two recombinant forms of this gene cluster from recombination between Copy-1 and -2 (pathway [A]) and Copy-2 and -3 (pathway [B]). Because recombination also occurs between Copy-1 and -3, six recombinants are expected altogether. We obtained three of them, which were produced by recombination between Copy-1 and -2 (#1L/R), Copy-1 and -3 (#224C/R) and Copy-2 and -3 (#24C/L) (Table 2). None of their reciprocal products was obtained, probably as a matter of chance owing to the small number of the clones examined, because the fourth recombinant molecule is reported by Lejeune et al. (32). As for rrn26, two molecular forms of its 5′ end, and three forms of the 3′ end had been predicted previously (15,32). This prediction was verified by the present findings confirming two complete and one partial copy (422 bp 3′ end) of rrn26.
Figure 6

Production of various molecular forms from the MC molecule by intra-molecular recombination between different repeat pairs. Copy-1, -2 and -3 are three copies of the trnfM-rrn18-rrn5 gene cluster. Copy-2 and -3 are inverted relative to Copy-1. R5 and R6 represent a DR and an IR pair, respectively. A/B, C/D and E/F are PCR primer pairs to mark the 5′- and 3′-flanking regions of Copy-1, -2 and -3, respectively. [A]: production of an isomer (flop form) of the MC molecule (flip form) by recombination between an IR pair, Copy-1 and -2. [B] and [C]: production of two complementary subgenomic molecules by recombination between a DR pair, Copy-2 and -3, and two R5 copies, respectively. [D]: production of an aberrant MC molecule having extra copies of the three-gene cluster (Copy-3/2) and R5 repeat by recombination between R6 repeats in two subgenomic molecules, II and III.

Two copies of atp8 had five mismatched nucleotide pairs scattered within the 471 bp gene region. Sequence analyses of recombinant molecules supposedly produced by recombination between the R11 repeats containing this gene (Table 2) might be useful in specifying recombination site(s) within the repeat. In addition to those genes, 179 ORFs larger than 300 bp were found (Supplementary Table 2). Their total size amounted to 75 465 bp, occupying ∼16.7% of the entire genome. This number greatly exceeds the 121 ORFs of comparable size reported for maize (10), in spite of the fact that the wheat mitochondrial genome is much smaller than the maize genome. Functional analysis of those ORFs will be an important problem in the future mitochondrial genomics.

MtDNA sequences homologous to ctDNA

Homology search using the blast2sequence program revealed that the wheat mitochondrial genome has 55 sequences homologous (mostly with 80% or higher homology on the nucleotide basis) to the corresponding sequences of the wheat chloroplast genome (Table 5). Exceptions were nine sequences question-marked in the last column of Table 5, which were mosaic of highly conserved and variable sequences, showing segmental differentiation of the sequences. Sizes of individual sequences vary between 27 bp for the smallest and 4239 bp for the largest. The total size, 26 264 bp, corresponds to 5.80% of the entire genome.
Table 5

Wheat mtDNA sequences showing homology to ctDNA sequences

MtDNA sequenceHomologous ctDNA sequenceNucleotide sequence homology (%)c
MC coordinatesaSize (bp)Mt gene locatedbCtDNA coordinatesSize (bp)Ct gene locatedb
995–1157163No62 057–62 218162psbE*92.0
O7309–7889581atp1*35 143–35 696554atpA* (copy-3)?
53 841–54 9501110No36 037–34 9181120atpA# (copy-4)97.2
55 179–55 20527No48 098–48 07227trnF*100.0
O56 769–58 7001932rrn18-392 532–91 0611472rrn16(copy-1)?
74 171–76 0031833No93 226–95 0591834trnI 3′-ex, trnA, rrn23*99.8
79 301–79 405105No63 441–63 336106petL95.3
97 417–97 542126trnC18 754–18 628127trnC91.3
97 779–97 86183No18 367–18 28286No81.4
98 764–99 133370No77 395–77 024372rpl14#81.6
99 254–99 373120No76 833–76 715119rps8*86.7
117 697–117 76064trnP-2#64 131–64 06963trnP#82.8
119 524–120 020497No35 542–36 072531atpA* (copy-5)?
146 989–147 05567No109 596–109 52671ndhG*84.5
154 459–154 51254No84 016–83 96354No98.1
157 714–157 74532trnI-p82 976–82 94532trnI*96.9
162 458–162 51558No34 309–34 36658atpF 3′-ex*96.6
O170 827–170 89569rrn26-p*95 063–94 99569rrn23* (copy-3)79.7
174 918–174 97154trnQ-2#6749–669654trnQ*83.3
242 709–243 106398No110 973–111 387415ndhA 3′-ex#?
249 838–249 89053No51 175–51 22551trnV 3′-ex*88.7
O259 648–262 7923145rrn26-297 615–94 9952 621rrn23(copy-1)?
266 815–266 86854trnQ-1#6749–669654trnQ*83.3
294 426–294 49065nad5b*102 531–102 59565ndhF*81.5
O300 896–302 8271932rrn18-191 061–92 5321472rrn16(copy-2)?
304 391–304 41727No48 072–48 09827trnF*100.0
304 646–304 973328No34 918–35 245328atpA* (copy-1)95.7
305 100–305 16364trnP-1#64 131–64 06963trnP#82.8
316 034–316 10269No21 268–21 34376rpoB*86.8
324 416–324 550135No75 643–75 509135rps11*94.1
343 407–343 573167No111 572–111 403170No88.4
349 029–349 06436No33 284–33 31936atpF 5′-ex*91.7
358 076–358 521446No41 098–40 653446psaA*99.8
O371 386–374 5303145rrn26-197 615–94 9952621rrn23(copy-2)?
378 941–379 02888No44 149–44 06288No94.3
379 044–379 150107No44 039–43 932108No88.0
380 703–380 885183No68 074–68 256183clpP*95.6
382 951–383 089139trnF48 133–47 995139trnF94.2
383 203–383 27876No47 851–47 77676No82.9
383 342–383 42281No47 702–47 62182trnL 3′-ex91.5
383 409–383 603195trnS45 160–44 967194trnS90.8
388 154–388 18229No34 485–34 51329atpF 3′-ex100.0
390 809–391 9181 110No36 037–34 9181120atpA* (copy-2)97.2
392 147–392 17327No48 098–48 07227trnF*100.0
O393 737–395 6681932rrn18-292 532–91 0611472rrn16(copy-3)?
400 525–400 55632No49 041–49 07232ndhJ*96.9
408 510–408 58576trnS-2#11 655–11 57977trnS#80.5
417 240–421 4784239No89 049–84 7804270ndhB, rps7, rps12 ex-2, -3?
421 513–421 55846No84 756–84 71146No97.8
428 633–428 71886trnN98 896–98 81186trnN98.8
O436 153–436 22573trnM52 107–52 03573trnM94.5
445 372–445 41645No64 290–64 24645No88.9
445 455–445 48834No64 084–64 05134trnP*100.0
445 609–445 69082trnW63 927–63 84682trnW96.3
452 168–452 356189No61 844–62 032189psbF*, psbE*88.4
Total26 264
Total excluding O-marked sequences: 13 455 bp

CtDNA sequences present in one IR, IRB, are shown, omitting those in the other copy (IRA), because of the same gene set present in two copies. Total size of 26 264 bp is 14 bp smaller than the sum of all the segments because a 14 bp sequence overlaps between two segments of the mtDNA coordinates 383 342–383 422 and 383 409–383 603.

aO: native mtDNA sequence.

b# and asterisk: genes, of which a large portion and only a small portion are located in the respective DNA sequences. Gene in boldface: complete or nearly complete gene sequence included in the respective DNA sequences. No: no gene present.

c?: undetermined because of segmental differentiation of the sequence within the gene.

Of the above 55 wheat mtDNA sequences, 8 carried native (not chloroplast-derived) mitochondrial genes, atp1, rrn18-1, -2, -3, rrn26-1, -2, -p and trnM, whose total size amounted to 12 809 bp. They showed homology to the ctDNA sequences carrying the corresponding chloroplast genes, atpA, rrn16, rrn23 and trnM (marked by circles in Table 5). Each of the gene pairs, atp1/atpA, rrn18/rrn16, rrn26/rrn23 and mt-trnM/ct-trnM, is assumed to have originated from a common prokaryotic gene, being homoeologous to each other (evidence will be reported elsewhere). The total size of the mtDNA sequences of real chloroplast origin therefore was estimated as 13 455 bp; 2.97% of the wheat mitochondrial genome, compared with 22 593 bp (6.3%) and 25 281 bp (4.4%) reported, respectively, for rice and maize (8,10). Thus, both the total size and proportion of the chloroplast-derived sequences relative to the entire genome were smallest in wheat, comparing with rice and maize.

Gene shuffling in the cereal mitochondrial genome

We compared mitochondrial gene maps of wheat, rice (8) and maize (10), excluding tRNA genes, pseudogenes and ORFs (Figure 7). Five exons of nad7, nad7a to e, showed a common arrangement in the three cereals. This gene was used to mark the common map origin, and the arrangement of nad7a to nad7e to mark the common map direction. A syntenic gene/exon arrangement, then, should appear as a row of genes/exons parallel to either diagonal line. Only a few gene/exon clusters of the three cereals showed synteny. One 5-gene cluster, ccmFN-rps1-matR-nad1e-nad5c, and five 2-gene clusters, rps13-nad1bc, rrn18-rrn5, rps3-rpl16, nad9-nad2cde and nad3-rps12, showed synteny. The third and fourth ones are shown as 3-gene clusters in Figure 7, because maize has an extra copy of both rps3a and nad2de and, for this, rps3a and rps3bcd, and nad2c and nad2de were shown separately. Similarly, nad4abc and nad4d were shown as a 2-gene cluster because rice has two extra copies of nad4d. In addition, three 2-gene clusters, rps19(p)-nad4L, ccmB-nad2ab and nad5ab-rpl2(p), of wheat and rice conserved synteny, and two 2-gene clusters (cox1-rrn26 and nad6-rps4) of wheat and maize preserved synteny. No synteny was detected for any other gene combinations, indicative that frequent gene shuffling occurred during cereal speciation, resulting in remarkable structural differences in the cereal's mitochondrial genomes. Fauron et al. (17) showed by the physical map comparison that mitochondrial genome restructuring has taken place between three maize cytotypes, and Clifton et al. (10) demonstrated by MultiPipMaker analysis that little sequence similarity exists between mitochondrial genomes of six plant species. Those results agree with ours of the above cereal gene map comparison.
Figure 7

Correlation of gene order between the mitochondrial gene maps of wheat and rice (A) and wheat and maize (B). All the protein- and rRNA-coding genes and the former's trans-spliced exons are arranged from top to bottom for wheat, and from left to right for rice and maize, based on their order in the respective gene maps. Genes of rice and maize are indicated by code numbers given to the corresponding wheat genes in the left margin of figures. Duplicate genes carry the same number.

DISCUSSION

Features of the gene-based strategy for sequencing plant mitochondrial genomes

Two principal strategies have been used to sequence plant mitochondrial genomes; the physical map-based (5,7–9), and the genome shotgun strategies (6,10,11). We used a new gene-based strategy for wheat, facilitated by the fact that many wheat mitochondrial genes are available as probes (24) for selecting wheat mtDNA clones for sequencing. Use of this strategy gave a complete picture of the wheat mitochondrial genome by sequencing the 872.5 kb mtDNA, less than twice the genome size, 452 528 bp. Comparative values for the genome shotgun strategy are ∼4, 8 and >20 times for Arabidopsis, tobacco and maize (6,11,10), indicating apparent high genome sequencing efficiency of the gene-based strategy. However, application of this strategy requires construction of a cosmid mtDNA library and selection of mtDNA clones covering known mitochondrial genes by dot hybridization. The overall efficiency of the gene-based strategy, compared with that of the genome shotgun strategy, is not clear. The advantage of the gene-based, compared with the physical map-based strategy, is that no physical map construction is required. This is difficult with some plants because of the multipartite structure of the mitochondrial genome. Based on the physical map of the mitochondrial genome of a common wheat cultivar, Capitole (15), Lejeune and Quetier [cited from (24)] constructed the first gene map of the wheat mitochondrial genome, to which 36 genes were allocated. Their map completely matches ours for five local gene maps: (i) rrn18/rrn5–cobatp6–nad5de–nad4abcd–nad2ab–orf25 (= atp4)–nad2cde–nad9–cox2ab–cox1–rrn26, (ii) rps7– rrn18/rrn5–nad7abcde–atp1atp9–nad1bc–rps13atp6cox3, (iii) nad1a–rrn18/rrn5–nad5ab–nad1d–nad6–rrn26, (iv) nad3rps12–orf156 (= atp8) and (v) matRnad1enad5c. The arrangement of these five gene groups within the genome, however, differ both in order and direction. Their order in Lejeune and Quetier's map is that shown above, whereas in our map it is (i)–(iii, reverted)–(iv, reverted)–(ii)–(v, reverted). Whether this discrepancy is due to the different mtDNA sources, or to problems in the physical map they relied upon, needs to be clarified. The gene-based strategy for complete mitochondrial genome sequencing can not achieve its goal by itself if the genome contains large gene-free region(s) of >35 kb (average insert size of the vector) when Cosmid mtDNA clones are used in sequencing. To cover such regions of the genome, we need to perform sequencing of some additional clones which do not carry any probe genes. In fact, it was necessary for us to sequence a probe gene-free clone, #194, to complete sequencing the wheat mitochondrial genome. The MC molecule obtained successfully integrated all sequenced mtDNAs in it without leaving any pieces out. Blast search on the sequence homology between the present wheat and previously reported rice and maize mitochondrial genomes (8,10) provided supporting evidence that the present MC molecule represents the wheat mitochondrial genome. The rice and maize mitochondrial genomes were divided into successive 30 kb sections (sizes of the end sections were somewhat different), and sequences homologous to wheat mtDNA were investigated for each section (Table 6). All the sections contained homologous sequences of ∼3 kb or larger (up to 15 kb) to wheat mtDNA. MtDNA sequences conserved between wheat and rice, and between wheat and maize were distributed all over the rice and maize genomes, with no large conserved sequence-free regions (larger than 10 kb; detailed data omitted) in the genomes. This fact indicates that rice and maize mitochondrial genomic sequences are well represented in the wheat MC molecule.
Table 6

Rice and maize mtDNA sequences homologous to wheat mtDNA in different sections of the genome

Rice mtDNAMaize mtDNA
Genome sectionaSection size (bp)Homologous section (bp)bGenome sectionaSection size (bp)Homologous section (bp)b
130 00012 778130 0006518
230 0007676230 00012 814
330 00015 569330 00012 460
430 00013 001430 00012 185
530 0005465530 00010 010
630 00010 057630 0006241
730 00010 989730 0007782
830 00012 804830 0004308
930 0008527930 00014 730
1030 0006 8811030 0009739
1130 00014 8981130 00011 500
1230 00010 9851230 0008014
1330 00013 1821330 0004667
1430 00011 0091430 0005216
1530 00011 7481530 0007642
1640 52081121630 0002842
Total490 520173 6911730 0005079
1830 0004233
1929 6306776
Total569 630152 756

aRice and maize mitochondrial genomes are divided into successive 30 kb sections, the last one being the remaining part of the respective genome.

bTotal size of wheat mtDNA sequences of larger than 30 bp which are homologous to the rice or maize mtDNA sequences.

The most essential feature of the present gene-based strategy is that it facilitated the recovery of recombinant molecules. Restriction fragment mapping of plant mtDNA shows a multipartite structure of the mitochondrial genome, consisting of isomeric as well as subgenomic molecules produced by intra-molecular recombination (12,14–17,33). None of the previous works on complete sequencing of flowering plant mitochondrial genomes, by use of either the genome shotgun or physical map-based strategies, has recovered recombinant molecules. This is why recombination events have not been analyzed at the nucleotide sequence level. By virtue of the gene-based strategy, we obtained 10 recombinant clones among 25 examined, determined their nucleotide sequences, and identified repeat sequences responsible for their formation.

Structural features of the wheat mitochondrial genome

The wheat mitochondrial genome was assumed to be a 452 528 bp MC molecule (Figure 1), that was ∼92 and 79% the size of the rice and maize mitochondrial genomes, and possessed all the protein-, rRNA- and tRNA-coding genes known to be present in rice and maize (8,10). These facts indicate that wheat has the most compact mitochondrial genome among the three cereals. Multicopy mitochondrial genes were compared between wheat, rice and maize (Table 7). Gene amplification in general was species-specific. All of the multicopy wheat genes were located in the repeated sequences (Table 2). With the exceptions of atp8 and trnQ, multicopies of all the wheat genes had identical nucleotide sequences. As for trnQ, two copies were identical, whereas the third copy differed from them by a single nucleotide. These facts suggest their recent amplification, comparing with the divergence time of three cereals. One alternative possibility is copy correction through homologous recombination, which is known to occur in the case of chloroplast IRs (34).
Table 7

Copy numbers of mitochondrial genes that differ in number in wheat, rice and maize: gene fragments, pseudogenes and chloroplast-derived genes are excluded

GeneWheatRiceaMaizea
(1) Protein-coding gene
    atp1122
    atp4121
    atp6211
    atp8211
    cox3121
    nad1a122
    nad2c121
    nad2d, e122
    nad4d131
    nad5a,b121
    nad9121
    rpl2030
    rpl5120
    rps2112
    rps3a112
    rps7111
(2) RNA gene
    rrn5321
    rrn18321
    rrn26221
    trnD212
    trnE112
    trnfM311
    trnI112
    trnK311
    trnM110
    trnN011
    trnP212
    trnQ311

aAfter Notsu et al. (8) for rice and Clifton et al. (10) for maize.

To account for the observed species-specific gene amplification, a mechanistic model can be proposed. Recombination between the same repeat sequences in two subgenomic molecules produced by recombination between different repeat pairs will give rise to an aberrant MC molecule having a duplicate segment. Figure 6 illustrates an example, using a simplified MC molecule, in which only three copies (Copy-1, -2 and -3) of the trnfM-rrn18-rrn5 cluster and two repeat pairs, R5 and R6, are shown. Recombination between the R6 sequences in two subgenomic molecules, II and III, which are produced in pathways [B] and [C], gives a new MC molecule with an extra copy of the trnfM-rrn18-rrn5 cluster and R5 repeat together with their flanking regions. The size of the duplication corresponds to the sum of two segments, one between the recombination breakpoints in Copy-2 and one R5 copy, and the other between those in Copy-3 and the other R5 copy. Search for transposable element sequences in the wheat mitochondrial genome revealed presence of five sequences, three of which were different partial sequences of the wheat Sabrina retrotransposon, and two others were a part of a rice Tos-14 retrotransposon and wheat Tar1 retrotransposon. Total size of the five sequences was 805 bp, being ∼0.2% of the mitochondrial genome. Comparable figures for rice and maize were 20 sequences (total size 7003 bp, 14.3% of the genome) and 4 sequences (total size 641 bp, 0.1% of the genome), respectively (8,10). In this respect, wheat mitochondrial genome is similar to maize than to rice mitochondrial genome. It is important to know what kinds of sequences were involved in the observed mitochondrial genome differentiation. For this purpose, the MC coordinates of all unique wheat sequences larger than 100 bp, comparing with both the rice and maize mtDNA sequences were enumerated (Supplementary Table 3). In total, 227 unique sequences distributed throughout the genome were identified. Comparison between their positions and those of all mitochondrial genes in the genome indicated that almost all unique sequences corresponded to intergenic spaces. The exceptions were nine sequences carrying partial sequence of a gene. Of those, six sequences carried 3–97 bp of the highly variable 3′ end of the sense strand of cob, nad6, rpl2-p, rrn5-1, rrn5-2 and rrn5-3. Two sequences contained a 324 bp segment of atp6-1 and -2, that is located in the 3′-terminal region of these genes. The last sequence carried a 28 bp 5′ end of nad9, that is variable among the three cereals. These facts taken together demonstrate that the mtDNA sequences diversified in the three cereals are mostly redundant DNAs. In a summary, the wheat mtDNA sequences were partitioned into six categories, genic (including introns), ORF, repetitive, chloroplast-derived, retro-element and unique sequences (Table 8). This partition was not orthogonal, because some sequences were enumerated in more than one category. Sizes of the genic, ORF, repetitive, chloroplast-derived and unique sequences were obtained from the data presented in Table 4, Supplementary Table 2, Table 2, Table 5 and Supplementary Table 3, respectively.
Table 8

Classification of wheat mtDNA sequences into different categories

CategoryNo. sequencesTotal size (bp)Proportion (%)Source
Entire genome452 528100.0Figure 1
Genic, including introns7671 84815.9Table 4
ORFs (larger than 300 bp), including those in repeats17975 46516.7Supplementary Table 2
Repetitive (repeat sequences larger than 100 bp)2668 96015.2Table 2
Chloroplast origin4713 4553.0Table 5
Retro elements58050.2Text
Unique (larger than 100 bp), comparing with rice and maize277257 76257.0Supplementary Table 3

Classification of the sequences is not orthogonal, because some sequences are enumerated in more than one category.

Structural dynamics of the mitochondrial genome in ontogeny

Arrieta-Montiel et al. (35) reported on the structural dynamics of the common bean mitochondrial genome, which was revealed by studying a single mtDNA segment carrying the cms-associated pvs-orf239 sequence. Using the gene-based strategy, we isolated 10 recombinant mtDNA molecules, and determined the repeat sequences responsible for their production. Many other repeat pairs also were characterized (Table 2 and Figure 3), which are potential sites for additional recombination. Based on the entire wheat mitochondrial genome sequence (DNA Database accession no. AP008982) and the map positions of all repeat pairs larger than 100 bp (Table 2), we may prepare DNA primers for the sequences flanking both ends of those repeats. Their use in long-range PCR will allow efficient screening of recombinant molecules produced by recombination between the marked repeat pairs and quantification of isomeric as well as subgenomic molecules, as proved by Sugiyama et al. (11) in tobacco. They also demonstrated that long-range PCR works for a distance as long as 23 kb between two primers, which is sufficient to cover all repeats present in the wheat mitochondrial genome (Table 2). The same method may also facilitate finding the difference in recombinational activity among various repeat pairs as well as the equality or inequality of the reciprocal recombination products. The methodological details for such studies are as follows: recombination between an IR pair will produce an isomer (flop form) of the MC molecule (flip form; Figure 6, pathway [A]). This event is detected by long-range PCR using four primer pairs, A/B, C/D, A/D and C/B. If either the A/B or C/D primer pair gives an amplified product in PCR, the template clone is regarded as the original MC molecule, whereas if either the A/D or C/B pair gives an amplified product, the template clone is regarded as the flop configuration of MC, so far as the marked IR is concerned. The ratio of the latter to the former clones in number gives the molar ratio of the recombinant to the non-recombinant clones. Similarly, recombination between a DR pair will produce two subgenomic molecules (Figure 6, pathway [B]), whose production is detected by successful DNA amplification by use of the C/F or E/D primer pair. Quantification of the subgenomic molecules over the non-recombinants is achieved in the same way as described above. Such studies targeted to different repeat pairs enumerated in Table 2, using Cosmid clones of wheat mtDNA extracted from different organs or different ages of the plant as the template, will disclose structural dynamics of the mitochondrial genome in plant development.

Evolutionary change in the mitochondrial genome structures of cereals

The chloroplast genomes of rice, maize and wheat have identical gene arrangements (36,37,21), evidence of the structure's evolutionary stability. In contrast, the mitochondrial genome structure differs markedly in the three cereals (Figure 7) although the kinds of genes present essentially are the same [(8,10), present findings]. We showed that a variety of mtDNA molecules are produced in somatic tissues by intra-molecular recombination mediated by different repeat pairs. The structural differences of several mitochondrial genes in wheat and rice are suspected to be caused by short repeat pairs (data to be published elsewhere). We postulate that the same mechanism operates in germ cell lines, creating structural diversity in the mitochondrial genomes of different plant phylogenies. Another possible factor for high phylogenetic variability of the mitochondrial genome, compared with the chloroplast genome, is high DNA redundancy in the former than in the latter genome. The ratio of the genic sequences, including all exons and cis-introns, and excluding the sequences of chloroplast origin and pseudogenes, to the total mitochondrial genome size is 18.0% for rice (8), 11.7% for maize (10) and 15.9% for wheat (Table 8). Comparable values for the chloroplast genome are 58.8% for rice (36) and 60.4% for wheat (21), indicative of the presence of a much larger amount of redundant DNAs in the mitochondrial than in the chloroplast genome.

The MC molecule may represent the intact wheat mitochondrial genome

All previous works on complete sequencing of flowering plant mitochondrial genomes are based upon the MC molecule hypothesis (6–11). Because of the multipartite structure of the genome and the lack of direct electron-microscopic evidence, however, the existence of the MC molecule is still a matter of debate (11–13). After Andre et al. (12), we suspected reality of the MC molecule in wheat and upon this suspicion we adopted the gene-based sequencing strategy. It turns out, however, that analysis of the 10 recombinant clones obtained has given support to the existence of the MC molecule. If we consider the MC molecule to be a flip configuration of the genome, then recombination between either of the three IR pairs (Table 2) will produce its flop (= isomeric) molecule, as shown in the pathway [A] of Figure 6, whereas recombination between either of the DR pairs produces two complementary, subgenomic molecules (pathways [B] and [C] in Figure 6), where ‘complementary’ means that a complete gene set is shared by two or more molecules (15). The origin of eight recombinant clones can be explained by a single recombination event, while the remaining two double-recombination events occurred in the MC molecule. However, if the genome were in any other configuration, most of the recombinant clones obtained could not have been produced by simple recombination events (Table 9). Consider the following: if the genome existed in the flop configuration of MC (Table 9, Case 1–4), then recombination between any pair of the present DRs should produce double-flop configurations of the genome (Case 1 and 2), and recombination between IRs should produce two subgenomic molecules (Case 3 and 4). Similarly, if the genome consisted of two subgenomic molecules (Case 5–8), recombination between the repeat sequences in two separated molecules should produce the MC molecule (Case 5 and 6), or its double-flop configuration (Case 7 and 8). In all eight postulated cases, the expected recombination products do not match the ones we actually obtained. This fact supports the hypothesis that the MC molecule serves as the basic wheat mitochondrial genome structure.
Table 9

Expected and actual products of recombination when the mitochondrial genome has alternative configurations

Alternative genome configurationType of affected repeatbExpected product of recombination at the affected repeatscActual recombinant produced (clone obtained)d
CaseConfigurationRecombination siteaRepeatBefore recombinationAfter recombination
1Flop configuration at R2R2 (IR)R5DRIRDouble-flop configuration at R2 and R5Subgenomic molecule (#27)
2Flop configuration at R2R2 (IR)R7DRIRDouble-flop configuration at R2 and R7Subgenomic molecule (#24*)
3Flop configuration at R4R4 (IR)R6IRDRSubgenomic moleculesFlop configuration at R6 (#224*)
4Flop configuration at R6R6 (IR)R4IRDRSubgenomic moleculesFlop configuration at R4 (#1)
5Subgenomic moleculesR1 (DR)R9DRSeparatedPresent MC configurationSubgenomic molecule (#31)
6Subgenomic moleculesR3 (DR)R3DRSeparatedPresent MC configurationSubgenomic molecule (#75, #96, #162)
7Subgenomic moleculesR7 (DR)R4IRSeparatedDouble-flop configuration at R7 and R4Flop configuration at R4 (#1)
8Subgenomic moleculesR5 (DR)R2IRSeparatedDouble flop configuration at R2 and R5Flop config. at R2 (#224*)

aRepeat mediating recombination that results in respective genome configuration. DR and IR, direct and inverted repeats.

bOne of several repeat pairs, of which type is changed by altered genome configuration. Separated, two repeat sequences are separated to different subgenomic molecules.

cThe present MC molecule is considered as the flip configuration of the genome. Single- or double-flop configuration is caused by single or double recombination at the indicated repeats.

dAsterisk: double recombinant clone.

A possible alternative is that the wheat mitochondrial genome contains all kinds of isomeric as well as subgenomic molecules (13). Lonsdale et al. (16) and Fauron et al. (17) showed that 5–14 subgenomic molecules are produced from the MC molecule of sugar beet and maize by intra-molecular recombination. In our study we prepared wheat mtDNA from 2-week-old seedlings. Now, if a seedling consists of ∼106 cells, it means that 19 successive cell divisions, on the average, occurred before DNA extraction. We do not know how many replication origins exist in the wheat mitochondrial MC molecule. An electron-microscopic study of mtDNA replication in Chenopodium indicates only a few, if not just one, origins in its mtDNA (38). Considering this fact, together with information on the single replication origin of bacterial chromosomes, it is hard to believe that all kinds of subgenomic molecules have replication origins necessary for their maintenance through many cell cycles. This is further support for the presence of the MC molecule.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  28 in total

Review 1.  Small repeated sequences and the structure of plant mitochondrial genomes.

Authors:  C André; A Levy; V Walbot
Journal:  Trends Genet       Date:  1992-04       Impact factor: 11.639

2.  A mitochondrial protein associated with cytoplasmic male sterility in the T cytoplasm of maize.

Authors:  R E Dewey; D H Timothy; C S Levings
Journal:  Proc Natl Acad Sci U S A       Date:  1987-08       Impact factor: 11.205

3.  The mitochondrial genome of Arabidopsis thaliana contains 57 genes in 366,924 nucleotides.

Authors:  M Unseld; J R Marienfeld; P Brandt; A Brennicke
Journal:  Nat Genet       Date:  1997-01       Impact factor: 38.330

4.  The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants.

Authors:  Y Notsu; S Masood; T Nishikawa; N Kubo; G Akiduki; M Nakazono; A Hirai; K Kadowaki
Journal:  Mol Genet Genomics       Date:  2002-11-01       Impact factor: 3.291

5.  The complete nucleotide sequence of the mitochondrial genome of sugar beet (Beta vulgaris L.) reveals a novel gene for tRNA(Cys)(GCA).

Authors:  T Kubo; S Nishizawa; A Sugawara; N Itchoda; A Estiati; T Mikami
Journal:  Nucleic Acids Res       Date:  2000-07-01       Impact factor: 16.971

Review 6.  The maize mitochondrial genome: dynamic, yet functional.

Authors:  C Fauron; M Casper; Y Gao; B Moore
Journal:  Trends Genet       Date:  1995-06       Impact factor: 11.639

7.  Recombination sequences in plant mitochondrial genomes: diversity and homologies to known mitochondrial genes.

Authors:  D B Stern; J D Palmer
Journal:  Nucleic Acids Res       Date:  1984-08-10       Impact factor: 16.971

8.  Ebb and flow of the chloroplast inverted repeat.

Authors:  S E Goulding; R G Olmstead; C W Morden; K H Wolfe
Journal:  Mol Gen Genet       Date:  1996-08-27

9.  The complete sequence of the rice (Oryza sativa) chloroplast genome: intermolecular recombination between distinct tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals.

Authors:  J Hiratsuka; H Shimada; R Whittier; T Ishibashi; M Sakamoto; M Mori; C Kondo; Y Honji; C R Sun; B Y Meng
Journal:  Mol Gen Genet       Date:  1989-06

10.  Genes for tRNA(Asp), tRNA (Pro), tRNA (Tyr) and two tRNAs (Ser) in wheat mitochondrial DNA.

Authors:  P B Joyce; D F Spencer; L Bonen; M W Gray
Journal:  Plant Mol Biol       Date:  1988-05       Impact factor: 4.076

View more
  84 in total

1.  Genes and processed paralogs co-exist in plant mitochondria.

Authors:  Argelia Cuenca; Gitte Petersen; Ole Seberg; Anne Hoppe Jahren
Journal:  J Mol Evol       Date:  2012-04-07       Impact factor: 2.395

2.  Impact of genomic environment on mitochondrial rps7 mRNA features in grasses.

Authors:  Evan Byers; Jennifer Rueger; Linda Bonen
Journal:  Mol Genet Genomics       Date:  2010-07-21       Impact factor: 3.291

3.  Plant mitochondrial recombination surveillance requires unusual RecA and MutS homologs.

Authors:  Vikas Shedge; Maria Arrieta-Montiel; Alan C Christensen; Sally A Mackenzie
Journal:  Plant Cell       Date:  2007-04-27       Impact factor: 11.277

4.  Frequent, phylogenetically local horizontal transfer of the cox1 group I Intron in flowering plant mitochondria.

Authors:  M Virginia Sanchez-Puerta; Yangrae Cho; Jeffrey P Mower; Andrew J Alverson; Jeffrey D Palmer
Journal:  Mol Biol Evol       Date:  2008-06-03       Impact factor: 16.240

5.  Developmentally-specific transcripts from the ccmFN-rps1 locus in wheat mitochondria.

Authors:  Sophie Calixte; Linda Bonen
Journal:  Mol Genet Genomics       Date:  2008-09-03       Impact factor: 3.291

6.  Recent insertion of a 52-kb mitochondrial DNA segment in the wheat lineage.

Authors:  Juncheng Zhang; Jizeng Jia; James Breen; Xiuying Kong
Journal:  Funct Integr Genomics       Date:  2011-07-16       Impact factor: 3.410

7.  Discovery of global genomic re-organization based on comparison of two newly sequenced rice mitochondrial genomes with cytoplasmic male sterility-related genes.

Authors:  Sota Fujii; Tomohiko Kazama; Mari Yamada; Kinya Toriyama
Journal:  BMC Genomics       Date:  2010-03-29       Impact factor: 3.969

8.  A complete mitochondrial genome of wheat (Triticum aestivum cv. Chinese Yumai), and fast evolving mitochondrial genes in higher plants.

Authors:  Peng Cui; Huitao Liu; Qiang Lin; Feng Ding; Guoyin Zhuo; Songnian Hu; Dongcheng Liu; Wenlong Yang; Kehui Zhan; Aimin Zhang; Jun Yu
Journal:  J Genet       Date:  2009-12       Impact factor: 1.166

9.  A trans-splicing group I intron and tRNA-hyperediting in the mitochondrial genome of the lycophyte Isoetes engelmannii.

Authors:  Felix Grewe; Prisca Viehoever; Bernd Weisshaar; Volker Knoop
Journal:  Nucleic Acids Res       Date:  2009-06-23       Impact factor: 16.971

10.  Ribosomal protein L10 is encoded in the mitochondrial genome of many land plants and green algae.

Authors:  Jeffrey P Mower; Linda Bonen
Journal:  BMC Evol Biol       Date:  2009-11-16       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.