Nan Zhao1, Yumei Wang2, Jinping Hua3. 1. Laboratory of Cotton Genetics, Genomics and Breeding/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education, College of Agronomy and Biotechnology , China Agricultural University, Beijing 100193, China. Nan_Zhao@cau.edu.cn. 2. Institute of Cash Crops, Hubei Academy of Agricultural Sciences, Wuhan 430064, China. yumeiwang001@126.com. 3. Laboratory of Cotton Genetics, Genomics and Breeding/Key Laboratory of Crop Heterosis and Utilization of Ministry of Education, College of Agronomy and Biotechnology , China Agricultural University, Beijing 100193, China. jinping_hua@cau.edu.cn.
Abstract
Intergenomic gene transfer (IGT) is continuous in the evolutionary history of plants. In this field, most studies concentrate on a few related species. Here, we look at IGT from a broader evolutionary perspective, using 24 plants. We discover many IGT events by assessing the data from nuclear, mitochondrial and chloroplast genomes. Thus, we summarize the two roles of the mitochondrion: a source and a pool. That is, the mitochondrion gives massive sequences and integrates nuclear transposons and chloroplast tRNA genes. Though the directions are opposite, lots of likenesses emerge. First, mitochondrial gene transfer is pervasive in all 24 plants. Second, gene transfer is a single event of certain shared ancestors during evolutionary divergence. Third, sequence features of homologies vary for different purposes in the donor and recipient genomes. Finally, small repeats (or micro-homologies) contribute to gene transfer by mediating recombination in the recipient genome.
Intergenomic gene transfer (IGT) is continuous in the evolutionary history of plants. In this field, most studies concentrate on a few related species. Here, we look at IGT from a broader evolutionary perspective, using 24 plants. We discover many IGT events by assessing the data from nuclear, mitochondrial and chloroplast genomes. Thus, we summarize the two roles of the mitochondrion: a source and a pool. That is, the mitochondrion gives massive sequences and integrates nuclear transposons and chloroplast tRNA genes. Though the directions are opposite, lots of likenesses emerge. First, mitochondrial gene transfer is pervasive in all 24 plants. Second, gene transfer is a single event of certain shared ancestors during evolutionary divergence. Third, sequence features of homologies vary for different purposes in the donor and recipient genomes. Finally, small repeats (or micro-homologies) contribute to gene transfer by mediating recombination in the recipient genome.
A billion years ago, a host cell engulfed a dependent bacteria, α-proteobacteria, which turned into a semi-autonomic organelle, a mitochondrion [1]. It delivers energy to the eukaryotic host cell in the diversifying evolution [2]. Meanwhile, mitochondrial sequences transferred among intracellular genomes [3,4,5,6,7], which is intergenomic gene transfer (IGT). On the one hand, nuclear transposons and chloroplast tRNA genes transferred into the mitochondrial genomes in most seed plants [5,7,8,9,10,11,12]. Nuclear sequences contributed to mitogenome expansion, contributing almost half in melons [10]. Among the nuclear-like sequences in the mitochondrial genome, the long terminal repeat retrotransposons (LTR-retro) ranked first [4,7,8,9,10,11,12]. In addition, chloroplast-like genes promoted the translation in mitochondrial genome [13,14,15]. The sequence states of chloroplast genes changed in the donor (chloroplast) and receptor (mitochondrion) genomes [16,17], which concerned their later roles [18]. Besides, DNA sequence microhomology played an important role in chloroplast DNA inserting into the mitochondrion, which might be the microhomology-mediated break-induced replication (MMBIR) [19] or non-homologous end joining (NHEJ) [20].On the other hand, a large-scale of mitochondrial genes moved into the nucleus and chloroplast [21,22,23,24,25]. The prokaryotic genes (mitochondrial genes) converted to eukaryotic genes (nuclear genes) [26] to engage in sexual recombination [27]. Besides, RNA could mediate mitochondrion-to-nucleus transfers [28,29]. The mitochondrial genes preferentially inserted in the open nuclear chromosome regions [30]. These nuclear integrants of mitochondrial genes (numts) would gradually decay or transform to nuclear sequences [31]. A few numts received nuclear promoters and transit peptides [2,32] that guided their products to the mitochondrion [33,34]. Few nuclear homologies of organellar DNA could transcribe successfully [35]. However, mitochondrion-to-chloroplast transfer only occurred in a few angiosperms [16,17,18,36,37,38,39,40,41]. Perhaps because plastids were conservative [17,36] and lacked efficient DNA uptake setups [42]. During the evolution, mitochondrial sequences moved to the chloroplast genomes of the shared ancestors of certain relative species [17,18,38,39]. They preferentially inserted into the intergenic spacer [16,17,41] or the large single copy (LSC) region of the chloroplast genomes [38]. The insertion accompanied DNA repair by homologous recombination [17]. However, most chloroplast homologies of mitochondrial genes had low transcriptional levels [17]. Environmental stresses could promote chloroplast [43] and nucleus [44] to absorb exogenous DNA. Meanwhile, the loss of mitochondrial membrane proteins could facilitate the export of the mitochondrial genes [2].Recently, rapid development of genomic sequencing technologies has made it feasible to approach more IGT events in plants. It enables us to look into the details of intergenomic gene transfer. In this paper, we unveil the IGT events related to the mitochondrion based on 24 sets of nuclear, mitochondrial and chloroplast genomic sequences in plants (Table S1). We expect these results will lay the foundation for further exploration of genome evolution.
2. Results and Discussion
2.1. The Role of Mitochondrion as a Gene Source: Intergenomic Gene Transfer from Mitochondrion
2.1.1. Intergenomic Gene Transfer from Mitochondrion to Nucleus
There exist a number of conserved genes during the mitochondrial genome evolution [45,46]. In the present study, we use 67 essential genes to study the gene loss and transfer about the mitochondrial genome. As a result, genes encoding complex II and ribosomal subunits have been lost massively in most of the higher plants (Figure 1, yellow cells). Genes encoding complexes III and V display much greater conservation. These gene losses are parts of the mitochondrial genome variations in plants. Our next goal is to elucidate where the lost genes transferred. Two of the main detectable destinations are nuclear and chloroplast genomes.
Figure 1
Genes identified to transfer in and out of the mitochondrial genome or genes lost from the mitochondrial genome of 21 land plants. The first two columns are mitochondrial protein-encoding genes (the second column) and their functional categories (the first column). The first line lists the names of plant species. The red and green cells represent mitochondrial full-length intact homologs and pseudogenes in nuclear genomes, respectively. The white and yellow cells represent no mitochondrial homologs in nuclear genomes and genes lost from mitochondrial genomes, respectively.
Transferred genes exist in two forms: remnants left in the mitochondrial genome [47] and fragments inserted into the nuclear genome (numts) [48,49]. Few numts’ products returned to the mitochondrion and played a role [33,34]. Researchers have achieved the mitochondrion-to-nucleus transfer by experiments, whose flow was as follows: (1) introduce a silent selectable marker gene with a nuclear promoter and transit peptide-encoding sequence into the mitochondrial genome; (2) transform this recombinant mitochondrion into a new cell; (3) detect the phenotype related to the marker gene. This approach has been successful in the unicellular green alga Chlamydomonas reinhardtii [27]. However, there is no experimental report on the real mitochondrion-to-nucleus IGT. Mitochondrial genes transferred with prokaryotic signals, which needed a long time or a favorable evolutionary event to turn into the eukaryotic ones.In present research, to identify possible numts, we carry on non-experimental analyses by performing the genome alignment between conserved mitochondrial genes and nuclear genomes in above 21 land plants. First, we find extensive gene transfer and gene loss in these plants (Figure 1). Second, the gene transfer is more popular in eudicots and monocots than that in bryophytes. Specifically, the latter is merely 1/20 of the former (Figure 2). Since the bryophytes with few mitochondria could not survive after vast transfer [24]. Third, we identify a number of full-length mitochondrial-like protein-coding genes in the nuclear genome (Figure 1, red cells), which may be useful candidate genes. Fourth, there are also mitochondrion-like truncated genes, which we define as pseudogenes (Figure 1, green cells).
Figure 2
Length of mitochondrial-derived fragments in plant nuclear genomes. The plant species are arrayed on the horizontal axis. The total lengths of mitochondrial-to-nuclear sequences are along the vertical axis. The bars represent the lengths of sequences transferring from the mitochondrion to the nucleus in plant species. The error bars stand for the positive and negative deviations of 5.0%.
Genes integrated by nuclear genome have different endings. Nearly all lost their original roles and became a part of new nuclear sequences [31]. A few could re-gain function by receiving nuclear promoter and transit peptide [2,32]. Others would suffer from irreversible decay with accumulating an increasing number of unfavorable mutations. These events allow prokaryotic gene(s) to turn into eukaryotic gene(s) [26] to join in sexual recombination [27]. As for the transferred forms, early studies displayed RNA-mediated gene transfers from the mitochondrion to the nucleus [28,29], while DNA-mediated gene transfer was rare in plants. In addition, the lack of integral mitochondrial membrane proteins could hasten the gene export from the mitochondrion [2].To dissect the mechanism of mitochondrion-to-nucleus gene transfers, we analyze the repeats in nuclear genomes of 22 land plants. The ratios of the repeat size to the genome size of the four species, including two bryophytes (M. polymorpha and P. patens) and two angiosperms (A. thaliana and S. polyrhiza), are less than 20% (Table 1). Meanwhile, these four species contain fewer numts than other species (Figure 1). And there is a positive correlation between numts and the repeats in the nuclear genome (R2 = 0.6321) (Figure 3). The weak correlation may due to limited number of plant species used in present research. So, we consider that numts may become parts of the nuclear repeats to take part in repeat-mediated sexual recombination for a greater genetic diversity.
Table 1
Variation of repeats in nuclear genomes of 22 land plants.
Species
Repeat Sizes (Mb)
Genome Sizes (Mb)
Repeat/Genome (%)
References
Spermatophytes
Eudicots
B. rapa
191.63
284.13
67.44
[50]
B. napus
441.77
930.51
47.48
[51]
B. oleracea
185.43
539.91
34.34
[52]
A. thaliana1
23.58
119.67
19.70
[53]
C. papaya
316.53
369.78
85.60
[54]
R. communis
176.00
350.62
50.20
[55]
C. lanatus
159.80
321.05
49.77
[56]
G. max
587.10
978.97
59.97
[57]
V. radiata
216.17
548.08
39.44
[58]
S. latifolia
244.82
665.28
36.80
[59]
D. carota
193.70
473.00
40.95
[60]
N. tabacum
3479.49
4500.00
77.32
[61]
V. vinifera
185.35
487.00
38.06
[62]
Monocots
S. polyrhiza1
19.43
132.01
14.72
[63]
P. dactylifera
214.34
558.02
38.41
[64]
O. sativa japonica
188.00
374.42
50.21
[65]
O. sativa indica
148.14
374.25
39.58
[66]
S. bicolor
231.28
739.15
31.29
[67]
Z. mays
1757.48
2067.62
85.00
[68]
Basal Angiosperms
A. trichopoda
407.43
706.50
57.67
[69]
Bryophytes
M. polymorpha1
12.48
304.37
4.10
[70]
P. patens1
79.37
477.95
16.61
[71]
1 notes repeat content less than 20%.
Figure 3
Correlation between the length of mitochondrial sequences transferring to the nucleus and repeat sizes of the nuclear genome in 22 land plants. Each dot represents a length value (X, Y). X refers to the size of the repeats in nuclear genomes of one species (based on the horizontal axis). Y means the length of mitochondrial-to-nuclear sequences in its corresponding species (based on the vertical axis). The slash represents the linear regression function of the distribution tendency of the dots. R2 is the regression coefficient.
2.1.2. Intergenomic Gene Transfer from Mitochondrion to Chloroplast
Given the prevailing mitochondrion-to-nucleus IGT, similar transfers into the chloroplast might be expected. However, mitochondrion-to-chloroplast IGT happened only in three angiosperms, Apiaceae [16,18,36,37,38,39], Apocynaceae [17] and Poaceae [40,41]. The first two families belong to the eudicots and the last to the monocots.The existing forms of gene sequences in and out of both donor and receptor genomes altered after the transfer. D. carota Mitochondrial Plastid sequence (DcMP)—presented three fragment sequences (DcMP 1, −2 and −3 +4) in the plastid genome. The split probably arose from new DNA recombination that happened after one copy of DcMP migrated into the mitochondrial genome [16]. Besides, mitochondrial-like rpl2 only contained an exon in the plastid genome and two homologies in different regions of the mitochondrial genome in A. syriaca [17]. In addition, the traits of gene sequences in the plastid genome (recipient genomes) might affect their specialized roles. DcMP inserted into two short direct repeats in the plastid genome, which suggested that it served as non-LTR retrotransposon [18]. For those mitochondrial-derived pseudogenes in the plastid, they contained nonsense mutations that would lead to a premature stop codon, which was consistent with the low transcriptional level of the plastid copy rpl2 in A. syriaca [17].From an evolutionary perspective, mitochondrion-to-chloroplast transfer occurred in the earlier common ancestor of certain relative species as a single event. For example, the homolog of mitochondrial gene, DcMP, existed in the plastid genomes of Daucus and their close relative Cuminum [18]. Further studies showed that DcMP moved to the shared ancestor of Daucinae Dumort and Torilidinae Dumort subtribes after they diverged from their ancestral tribe, Scandiceae Spreng [38,39]. Also, in Apocynaceae, mitochondrial rpl2 transferred to the plastid genome of the common ancestor of the Asclepiadeae and Eustegia [17].Mitochondrial sequences preferentially inserted into the intergenic spacer of plastid genomes. For instance, DcMP inserted in the rps12-trnV intergenic spacer in the D. carota plastid genome [16]. There were also mitochondrial insertions in the rps2-rpoC2 intergenic spacer of the plastid genome in A. syriaca [17] and in the rpl23-ndhB intergenic spacer of the plastid genome of Parianinae (Eremitis sp. and Pariana radiciflora) [41]. Besides, another mitochondrial-to-nuclear transfer appeared in the large single copy (LSC) region between the junction with inverted repeat A (IRA) and tRNA-His (GUG) (trnH-GUG) in limited Apiaceae species [38]. Additionally, insertion locations implied the roles of the transferred genes. DcMP was regarded as a non-LTR retrotransposon targeting tRNA-coding regions because it moved to the upstream of the trnV gene in the plastid genome. Otherwise, DcMP worked as three new promoters (P1–P3) that substituted two original promoters of the trnV gene (P4 and P5) [18]. More importantly, insertion typically came with DNA repair of a double-stranded break by homologous recombination. To create homologies, the plastid gene rpoC2 preferentially inserted into the mitochondrial genome, just near the mitochondrial-native gene rpl2, then intact mitochondrial rpl2 and part of rpoC2 transferred together to the plastid of A. syriaca [17].
2.2. The Role of Mitochondrion as a Gene Pool: Intergenomic Gene Transfer into Mitochondrion
2.2.1. Intergenomic Gene Transfer from Nucleus to Mitochondrion
Compared with the conservative chloroplast genome, the mitochondrial genome diversified among plant species. The primary drivers of genome variations might be repetitive sequences and nuclear-derived DNA, which represented 42% and 47% of the total sequences in melon, respectively [10]. In present study, we analyze the nucleus-to-mitochondrion sequences of 23 plants. First, nuclear-derived sequences are widespread in all mitochondrial genomes of 23 plants (Figure 4). Second, among spermatophytes, total nuclear sequences in mitochondrial genomes range from a low of 7960 bp in S. latifolia to a high of 36,123 bp in V. vinifera (Table S2). Third, the nucleus-to-mitochondrion transferred sequences are less in bryophytes than in spermatophytes, 4249 bp and 4814 bp in P. patens and M. polymorpha, respectively (Figure 4).
Figure 4
Length of nuclear-derived sequences in plant mitochondrial genomes. The plant species are arrayed on the horizontal axis. The total lengths of nuclear-to-mitochondrial sequences are along the vertical axis. The bars represent the lengths of sequences transferring from the nucleus to the mitochondrion in plant species. The error bars stand for the positive and negative deviations of 5.0%.
According to the different degrees of the matching and annotation, these nuclear-to-mitochondrial repetitive sequences fall into seven categories: copia, gypsy, low complexity, long terminal repeat retrotransposons (LTR-retro), simple repeat, transposable element (TE) and unspecified (Table S2). Copia and gypsy represent two main classes of LTR-retrotransposons that belong to Class 1 transposable elements [72]. Low-complexity DNA primarily include poly-purine/poly-pyrimidine stretches and regions of extremely high AT or GC content. First, the mean of each type in 21 spermatophytes is significantly larger than that in 2 bryophytes (Figure 5), which show most nucleus-to-mitochondrion transfers occurred after the differentiation of seed plants and bryophytes, at least, for the analyzed 2 bryophytes species. Second, the first three are LTR-retro, gypsy and copia in 23 plants (Figure 6, Table S2). This result conforms to the early discoveries in a number of plants, including the gymnosperm Cycas taitungensis [9], the monocot Oryza sativa [8] and the eudicotsArabidopsis thaliana, Cucumis melo and Cucumis sativus [4,7,10,11,12]. Third, the total length of transferred sequences correlates with the mitogenome size (Figure 7). This result supports the import of promiscuous DNA is a core mechanism for mitochondrial genome expansion in land plants [73].
Figure 5
Mean value of the length of different nuclear sequences transferring to the mitochondrial genomes of spermatophytes and bryophytes. ** p < 0.01. The seven categories of repeats are arrayed on the horizontal axis. The total lengths of nuclear-to-mitochondrial repetitive sequences are along the vertical axis. The dark gray and light gray bars represent the mean values of repeats transferring from the nucleus to the mitochondrion in 21 spermatophytes and 2 bryophytes, respectively. The error bars stand for the positive and negative deviations of 5.0%.
Figure 6
Percentages of each kind of repeats from all nuclear-to-mitochondrial repetitive sequences in 23 plants. The 23 circles represent the whole nuclear-to-mitochondrial repeats of 23 plants inside and out. The boxes in different colors on the right are the symbols of seven kinds of repetitive sequences. (From top to bottom) Light blue: copia; Orange: gypsy; Gray: low complex; Yellow: LTR-retro (long terminal repeat retrotransposons); Middle blue: simple repeat; Green: TE (transposable element); dark blue: un-specific.
Figure 7
Correlation between the length of nuclear sequences transferring to the mitochondrion and the size of the mitochondrial genome in 23 land plants. Each dot represents a length value (X, Y). X refers to the length of the mitochondrial genome of one species (based on the horizontal axis). Y means the length of nuclear-to-mitochondrial sequences in this corresponding species (based on the vertical axis). The slash represents the linear regression function of the distribution tendency of the dots. R2 is the regression coefficient.
2.2.2. Intergenomic Gene Transfer from Chloroplast to Mitochondrion
As with mitochondrial genomes, chloroplast genomes also contain a minimum set of largely conserved protein-encoding, rRNA and tRNA genes [21,74,75]. In contrast to the extensive gene loss of mitochondrial genomes, only few chloroplast-encoded genes have been lost in chloroplast genomes of specific plants (Figure S1, yellow cells). For example, three genes (accD, ycf1 and ycf2) are lost in the grasses (O. sativa japonica, O. sativa indica, S. bicolor, Z. mays), another three genes (ccsA, rpoA and rpl16) are lost in the moss P. patens (Figure S1, yellow cells). Compared to a few gene loss, chloroplast genes transferring to nucleus and mitochondrion are richer (Figures S1 and S2). In our study, we unearth the enormous chloroplast-to-mitochondrion gene transfers in 24 land plants. Similar gene copies exist in two contemporary intracellular genomes simultaneously (Figure S1, the red and green cells). In two bryophytes, the total lengths of integrated sequences are close, 1.05 kb in M. polymorpha and 1.99 kb in P. patens (Figure 8). In addition, the variation range is greater in 22 seed plants, from 1.67 kb in S. latifolia to 130 kb in A. trichopoda (Figure 8). Besides, the chloroplast-to-mitochondrion fragments of most seed plants are more than that in bryophytes (Figure 8).
Figure 8
Length of chloroplast-derived fragments in plant mitochondrial genomes. The plant species are arrayed on the horizontal axis. The total lengths of chloroplast-to-mitochondrial sequences are along the vertical axis. The bars represent the lengths of sequences transferring from the chloroplast to the mitochondrion in species. The error bars stand for the positive and negative deviations of 5.0%.
Large parts of chloroplast tRNA genes immigrated into plant mitochondrial genomes [5,9]. These transfers were essential to the translation of the mitochondrial genes [13,14,15]. Here, we identify the chloroplast-like tRNA genes in the mitochondrial genome of 24 plants species using blast. And then we build a phylogenetic tree to elucidate the evolutionary implications. First, there is no chloroplast-derived tRNA gene in mitochondrial genomes of two bryophytes (Figure S2). Second, single or multiple chloroplast genes immigrated to the mitochondrial genomes of spermatophytes, at least, for the analyzed 21 angiosperms and 1 gymnosperms. For example, (1) chloroplast-like trnM gene appears in the mitochondrial genomes of all studied seed plants except Z. mays, which suggests that chloroplast trnM lost only in Z. mays during or after transferring to the mitochondrion and this transfer happened with spermatophytes and bryophytes diverging; (2) chloroplast trnH gene transferred to the mitochondrial genomes of most spermatophytes but lost in P. dactylifera, T. aestivum and G. biloba, which might be the random loss; (3) trnN, trnP, trnS and trnW transferred merely in angiosperms, despite parts of these four genes lost in a few species; (4) chloroplast trnD gene moved into the mitochondrion only in eudicots, which shows that trnD transferred when eudicots and monocots diverged; (5) chloroplast-like trnC gene and trnF gene transferred to the mitochondrion simply in Gramineae crops of monocots; (6) ten chloroplast-to-mitochondrion genes (trnD, trnE, trnG, trnI, trnK, trnL, trnP, trnR, trnT and trnY) transferred together in V. vinifera (Figure S2).To infer the mechanism of chloroplast tRNA genes inserting into mitochondria, we analyze the flanking nucleotide sequences in insertion sites of mitochondrial genomes. trnH transferred in most spermatophytes (Figure 9). trnD moved specifically in eudicots (Figure S3). trnC and trnF migrated only in Gramineae crops (Figure S4). Taking together, we notice the micro-homologies (1 to 4 bp) among plant species in the breakpoint sequences of chloroplast-mitochondrial DNA fusion. The micro-homologies are the same adenine-thymine (AT) on the right of trnH in spermatophytes. But on the left are four short tandems Guanine (G) in eudicots, two repeated Guanine (G) in monocots and no microhomology in gymnosperms (Figure 9). Therefore, we confer that DNA sequence microhomology plays an important role in chloroplast DNA inserting into the mitochondrion, which may be the microhomology-mediated break-induced replication (MMBIR) [19] or non-homologous end joining (NHEJ) [20].
Figure 9
Nucleotide-resolution analysis on flanking sequences of the chloroplast-derived trnH gene in mitochondrial genomes of most spermatophytes. cpDNA and mtDNA are the abbreviations of chloroplast DNA and mitochondrial DNA. The yellow-green-yellow strip represents the fusion sequence of mtDNA-cpDNA-mtDNA. The sequences under the two yellow strips on the left and right are the flanking sequences of inserted chloroplast-like tRNA gene in the mitochondrial genomes. The red capital English letters close to cpDNA indicate the nucleotides of micro-homologies among the different species. The species in the blue, green, yellow and red boxes belong to eudicots, monocots, basal angiosperms and gymnosperms, respectively.
On top of it all, we infer the repeats in mitochondrial genomes have the potential to mediate DNA recombination, which contributes to gene transfer and reuse of the transferred genes in target genomes. Therefore, we analyze the repeats variation in recipient genomes (the mitochondrial genomes) of land plants (Table 2) to explain various rates of gene transfer to some extent. First, plants with smaller values of repeat size, repeat number (>1 kb) and repeat number (>100 bp) contain less gene transfer, among which the most obvious is a bryophyte P. patens (Table 2 and Figure 8). Second, small repeats (>100 bp) are more favorable to gene transfer than large repeats (>1 kb) (Table 2).
Table 2
Variation of repeats in mitochondrial genomes from 22 land plants.
Species
Mitochondrial Genome
Repeat Size (Kb)
Repeat Number (>1 kb)
Repeat Number (>100 bp)
Spermatophytes
Eudicots
B. rapa
3.80
1
9
B. napus
4.62
1
17
B. oleracea
152.00
2
24
A. thaliana
15.63
2
25
C. papaya
13.43
1
13
R. communis
5.80
6
6
G. max
60.67
13
68
V. radiata
1.02
0
6
S. latifolia
23.27
15
17
D. carota
71.09
4
19
N. tabacum
42.07
3
22
V. vinifera
5.77
0
26
Monocots
S. polyrhiza
1.58
0
5
P. dactylifera
3.03
1
12
O. sativa japonica
141.19
12
39
O. sativa indica
141.76
11
27
S. bicolor
58.56
5
18
Z. mays
51.94
4
19
Basal Angiosperms
A. trichopoda
266.14
1
1811
Gymnosperms
C. taitungensis
62.65
2
5070
Bryophytes
M. polymorpha
2.08
0
13
P. patens
0
0
0
3. Materials and Methods
3.1. Availability of Chloroplast, Mitochondrial and Nuclear Genomes
We download all the chloroplast, mitochondrial and nuclear genome sequences and gene annotations from NCBI database. And then we list all the accession numbers in Table S1.
3.2. Detection of Total Intergenomic-Transfer DNA Sequences
For 24 land plants, we align the sequences of chloroplast and mitochondrial genomes to nuclear chromosomes to detect nuclear insertions of chloroplast DNA (nupts) and nuclear insertions of mitochondrial DNA (numts) using the BLAST program. We set e-value to 1e−5 [76]. The minimum length of an exact match (95%) is 100 bp. While identifying mitochondrial insertions of chloroplast DNAs (mtpts) by local BLASTN (version 2.2.23) [76], we set the minimum length of an exact match to be 50-bp.
3.3. Identification of Intergenomic-Transfer Homologies
Taking a set of essential chloroplast or mitochondrial genes as references (Table S3), we gain their copies in the donor and recipient genomes using the BLAST program with the same parameters above [76]. If there is no counterpart in the donor genomes (chloroplast or mitochondrial genomes), we would consider them as the lost genes (Figure 1 and Figure S1, the yellow cells). To those presented in the donor genomes but absent in the recipient genomes, we consider that they did not transfer between two genomes (Figure 1 and Figure S1, the white cells). For those appearing concurrently in both donor and recipient genomes, we consider that their copies moved into another genome after duplication in the original genome (Figure 1 and Figure S1, the red and green cells). Further, we define the full-length copies of the transferred genes in the recipient genomes as the intact homologies (Figure 1 and Figure S1, the red cells). Otherwise, we recognize the truncated copies as pseudogenes (Figure 1 and Figure S1, the green cells).
3.4. Detection of the Repeats in Mitochondrial Genomes
We detect nuclear-derived repetitive transposons using online software RepeatMasker (http://www.repeatmasker.org) in 24 land species and a custom repeats database. And then we use two-tailed t-tests to evaluate the significant difference of repeats between spermatophytes and bryophytes.
3.5. NHEJ Analysis
We perform the NHEJ analysis as previously described [77,78]. In short, nupts, numts or mtpts are inserted by NHEJ, like micro-homology or blunt end repair. If nucleotides close to the fusion point are similar in different land species, we would regard them as micro-homology. Otherwise, we would consider no micro-homology as blunt-end repair.
3.6. Phylogenetic Analysis
The phylogenetic analysis involves nucleotide sequences of 17 mitochondrial genes (nad1–nad6, nad9, cob, cox1–cox3, atp1, atp4, atp6, atp8 and atp9). We use the maximum likelihood (ML) method with the model GTR + G + I in MEGA5.05 [79]. And then we perform phylogenetic analyses according to the same methods in previous studies [80,81].
4. Conclusions
With the rapid development of genomic sequencing technologies, nuclear and organellar genomes data became available for many plants. Here, based on 24 sets of genome data, we detect and analyze intergenomic gene transfers (IGT) related to the mitochondrion. Meanwhile, we review the research advances of intergenomic gene transfer. As a summary, we find mitochondrion mainly plays two essential roles in gene transfer: Source and pool. From the source perspective, massive mitochondrial genes transfer into nuclear and chloroplast genomes. For the role of the pool, the mitochondrion integrates enormous genes from the other two genomes. Except for the disparate orientation, a lot of likenesses emerge when bringing them together. First, gene transfer related to mitochondrial genomes is prevalent in plants, though few genes flow from the mitochondrion to the chloroplast. Second, specific IGT is a single event of certain shared ancestors, which is consistent with the divergence clade. Third, an intact gene usually changes existing forms after transferring in and out of both donor and recipient genomes, which agrees with their consequent roles, such as, functioning like before, reusing for new loci or decaying gradually. Fourth, most exogenous DNA preferentially inserts into the intergenic region. Besides, small repeats (or micro-homologies) may contribute to gene transfers by mediating recombination in the recipient genomes. In a word, mitochondrial gene transfers dedicate to the genome variation and evolutionary diversity.
Authors: Radim Cegan; Boris Vyskot; Eduard Kejnovsky; Zdenek Kubat; Hana Blavet; Jan Šafář; Jaroslav Doležel; Nicolas Blavet; Roman Hobza Journal: PLoS One Date: 2012-02-29 Impact factor: 3.240
Authors: Shannon C K Straub; Richard C Cronn; Christopher Edwards; Mark Fishbein; Aaron Liston Journal: Genome Biol Evol Date: 2013 Impact factor: 3.416
Authors: Ibrahim S Al-Mssallem; Songnian Hu; Xiaowei Zhang; Qiang Lin; Wanfei Liu; Jun Tan; Xiaoguang Yu; Jiucheng Liu; Linlin Pan; Tongwu Zhang; Yuxin Yin; Chengqi Xin; Hao Wu; Guangyu Zhang; Mohammed M Ba Abdullah; Dawei Huang; Yongjun Fang; Yasser O Alnakhli; Shangang Jia; An Yin; Eman M Alhuzimi; Burair A Alsaihati; Saad A Al-Owayyed; Duojun Zhao; Sun Zhang; Noha A Al-Otaibi; Gaoyuan Sun; Majed A Majrashi; Fusen Li; Jixiang Wang; Quanzheng Yun; Nafla A Alnassar; Lei Wang; Meng Yang; Rasha F Al-Jelaify; Kan Liu; Shenghan Gao; Kaifu Chen; Samiyah R Alkhaldi; Guiming Liu; Meng Zhang; Haiyan Guo; Jun Yu Journal: Nat Commun Date: 2013 Impact factor: 14.919
Authors: Nicolas Sierro; James N D Battey; Sonia Ouadi; Nicolas Bakaher; Lucien Bovet; Adrian Willig; Simon Goepfert; Manuel C Peitsch; Nikolai V Ivanov Journal: Nat Commun Date: 2014-05-08 Impact factor: 14.919
Authors: In-Su Choi; Erika N Schwarz; Tracey A Ruhlman; Mohammad A Khiyami; Jamal S M Sabir; Nahid H Hajarah; Mernan J Sabir; Samar O Rabah; Robert K Jansen Journal: BMC Plant Biol Date: 2019-10-25 Impact factor: 4.215