Literature DB >> 24515269

Horizontal transfer and gene conversion as an important driving force in shaping the landscape of mitochondrial introns.

Baojun Wu1, Weilong Hao.   

Abstract

Group I introns are highly dynamic and mobile, featuring extensive presence-absence variation and widespread horizontal transfer. Group I introns can invade intron-lacking alleles via intron homing powered by their own encoded homing endonuclease gene (HEG) after horizontal transfer or via reverse splicing through an RNA intermediate. After successful invasion, the intron and HEG are subject to degeneration and sequential loss. It remains unclear whether these mechanisms can fully address the high dynamics and mobility of group I introns. Here, we found that HEGs undergo a fast gain-and-loss turnover comparable with introns in the yeast mitochondrial 21S-rRNA gene, which is unexpected, as the intron and HEG are generally believed to move together as a unit. We further observed extensively mosaic sequences in both the introns and HEGs, and evidence of gene conversion between HEG-containing and HEG-lacking introns. Our findings suggest horizontal transfer and gene conversion can accelerate HEG/intron degeneration and loss, or rescue and propagate HEG/introns, and ultimately result in high HEG/intron turnover rate. Given that up to 25% of the yeast mitochondrial genome is composed of introns and most mitochondrial introns are group I introns, horizontal transfer and gene conversion could have served as an important mechanism in introducing mitochondrial intron diversity, promoting intron mobility and consequently shaping mitochondrial genome architecture.

Entities:  

Keywords:  LSU rRNA; gene conversion; group I intron; horizontal transfer; intron mobility; mitochondrial genome; ω intron

Mesh:

Substances:

Year:  2014        PMID: 24515269      PMCID: PMC4059233          DOI: 10.1534/g3.113.009910

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Group I introns are a special kind of self-splicing ribozyme widely found in protist nuclear ribosomal (r)DNA genes, fungal and plant organellar genomes, bacteria, and viruses (Haugen ). In eukaryotes, group I introns are highly dynamic, featuring extensive presence-absence variation (Skelly and Maleszka 1991) and widespread horizontal transfer (Colleaux ; Vaughn ; Goddard and Burt 1999; Rot ; Fukami ). Unrelated group I introns share little conservation at the sequence level, but group I introns mostly contain 10 conserved helices with a structurally conserved catalytic core (Nielsen and Johansen 2009), which is crucial for self-splicing (Adams ). Group I introns are generally considered neutral to their hosts (Haugen ) and spread widely thanks to their mobility and to horizontal transfer (Cho ; Goddard and Burt 1999; Bhattacarya ; Haugen ; Sanchez-Puerta , 2011). Two main mechanisms are currently recognized for group I intron mobility. One powerful mechanism for intron mobility is intron homing, powered by active intron-encoded homing endonuclease gene (HEG) (Jacquier and Dujon 1985; Chevalier and Stoddard 2001; Haugen ; Nielsen and Johansen 2009). During intron homing, HEG initiates a double-strand break repair pathway (Belfort and Roberts 1997; Chevalier and Stoddard 2001) and creates a break at the specific HEG recognition site (around 14−45 nucleotides) within the invaded intronless allele (Colleaux ). After cleavage, the break is repaired using the intron-containing allele as a template, and the intron and the HEG get integrated as a unit (Lambowitz and Belfort 1993). During group I intron homing, exonic regions immediately flanking the insertion site often engage in a gene conversion process that replaces part of the host exonic sequence and creates a co-conversion tract (CCT) (Mueller ; Palmer ; Sanchez-Puerta ). Goddard and Burt (1999) have proposed a model of the group I intron life cycle that begins with intron homing into an intronless allele followed by intron/HEG degeneration. Once the intron is fixed in the population, the HEG and intron are believed to be on an evolutionary path to loss, and new intron cycles will then start with intron homing in new intronless populations. This intron life cycle model gained support in other genome-wide surveys (Wikmark ; Milstein ). Another mechanism of intron invasion is reverse splicing through an RNA intermediate (Roman ). A short sequence (4−6 nt) called the internal guide sequence is required for the group I intron to recognize the target region that is complementary to the internal guide sequence, followed by insertion into the transcribed RNA. The group I intron is then incorporated into the genome through reverse transcription of the intron RNA and genomic integration of the resulting cDNA (Roman ). With much less-specific recognizing nucleotides (4−6 nt) compared with those (14−45 nt) of intron homing, the reverse splicing pathway could permit group I introns to invade not only homologous sites but also heterologous sites (Bhattacharya , 2005). The yeast mitochondrial 21S large subunit (LSU) rRNA contains a group I intron known as the ω intron (Coen ). The presence of the ω intron within the mitochondrial LSU rRNA has been found to be highly variable among yeast species (Skelly and Maleszka 1991; Goddard and Burt 1999). In this study, we used quantitative and phylogenetic approaches to examine the gain and loss of the ω intron and its HEG sequences from 29 strains in the Saccharomyces complex. We have examined the gain and loss of the HEG only among the intron-containing strains, since no existing theories suggest a high frequency of gains and losses of the HEG within the ω intron, and the turnover rate of HEG within the intron would be expected to be very low compared with that of the mobile intron based on the intron-homing model. To our surprise, we found fast turnover rates of the HEG comparable with those of the well-known mobile intron. We further found extensive mosaic sequences in both the HEG and intron sequences, which is inconsistent with any available mechanisms for group I intron mobility, but suggests recurrent horizontal transfer and gene conversion. This mechanism is believed to play important roles in not only introducing intron sequence diversity but also introducing intron content variation and promoting intron mobility by altering the HEG function during the evolution of group I introns.

Materials and Methods

Strains and sequence data

Ten Torulaspora yeast strains from four species were obtained from the National Center of Agricultural Utilization Research (Peoria, IL) and Dr. Matthew Goddard (The University of Auckland). Genomic DNA of each strain was extracted from overnight cultures following the procedure described in Lee . Polymerase chain reaction amplification was performed to obtain the LSU rRNA ω intron and HEG sequences. The nuclear internal transcribed spacer region (ITS1-5.8S-ITS2), 26S rRNA D1/D2, mitochondrial small subunit rRNA and sequences were also obtained to infer organismal relationships. The GenBank accession numbers for the sequences used in the study are listed in Supporting Information, Table S1, and the primers used for PCR amplification and Sanger sequencing are listed in Table S2.

Phylogenetic analyses

Nucleotide sequences were aligned using a combination of MUSCLE (Edgar 2004) and PRANK (Loytynoja and Goldman 2005); sequence alignments were edited manually with SEAVIEW (Gouy ). Phylogenetic trees were reconstructed using the RAxML program (Stamatakis 2006) under a GTR+Γ+I substitution model. In each phylogenetic reconstruction, 100 bootstrap iterations were performed. Phylogenetic incongruence was examined by the approximately unbiased test (Shimodaira 2002) implemented in the CONSEL program (Shimodaira and Hasegawa 2001). The two nuclear gene regions (ITS1-5.8S-ITS2, 26S rRNA D1/D2) and two mitochondrial genes (small subunit rRNA, and ) were used to reconstruct the organismal relationship of the Saccharomyces complex. The detailed relationship among the three S. cerevisiae genomes [288C (Goffeau ), YJM789 (Wei ), and No7 (Akao )] was determined by the core genomic regions (3380 genes, 477,080 characters) shared by the published nuclear genomes, using the Saccharomyces mikatae genome (Kellis ) as an outgroup.

Estimation of the gain and loss rates of introns and HEGs

We sought to model the gain and loss of an intron or an HEG in homologous sites across organisms related by a phylogeny using a two-state continuous-time Markov process, with states 0 (absence) and 1 (presence). In the Saccharomyces complex, to which the Torulaspora and Saccharomyces genera belong, the phyletic pattern (presence and absence) of the intron and HEG of the LSU rRNA was available in 29 strains. The relationship of these 29 strains (Figure 1) was constructed using the nuclear ITS1-5.8S-ITS2, 26S rDNA D1/D2, mitochondrial small subunit rRNA, and genes. The gain and loss rates were estimated using a maximum likelihood estimation implemented in the ACE (ancestral character estimation) function of the APE (analysis of phylogenetics and evolution) package (Paradis ) in R and BayesTraits (Pagel ). The estimation was performed separately on the pattern of intron presence/absence within the LSU rRNA of all 29 strains and on the pattern of HEG presence/absence within the intron among the 22 intron-containing strains. The rates of gain and loss in the ω intron and HEG were estimated based on the tree branch lengths and are therefore relative to nucleotide substitution with the unit as the number of gains/losses per site per one nucleotide substitution. Such a concept was developed in Hao and Golding (2006) and has been well received in modeling the rates of gene gain/loss during bacterial genome evolution (Cohen ; Spencer and Sangaralingam 2009; Hao and Golding 2010; Cohen ). We found that the two-parameter model separating the gain and loss rates does not significantly outperform the one-parameter model that constrains the rates of gains and losses to be the same on either the intron or HEG data (2ΔlnL < 2.0, P > 0.10, df = 1 for either the intron or HEG data, as 2ΔlnL follows approximately a chi-square distribution). Here, we only present the estimations using the simplistic model by constraining the rates of gains and losses to be the same. Furthermore, we have used custom R scripts to compute the likelihood values of given turnover rates in the same way as using the ACE and to conduct likelihood ratio tests between different turnover rates.
Figure 1

Sporadic distribution of the ω intron and homing endonuclease gene (HEG) in the Saccharomyces complex. The phylogenetic relationship was constructed using the concatenated sequences of cox2, small subunit rRNA, 26S rRNA D1/D2, and ITS1-5.8S-ITS2. A total of 100 bootstrap iterations were performed, bootstrap values when >70 are shown. The detailed relationship among the three S. cerevisiae strains was determined using their core nuclear genomic regions shared with S. mikatae. The Saccharomyces and Torulaspora clades, each of which contains intron or HEG variation among closely related strains, are shaded. The intron-HEG organization of each strain is illustrated after the strain name. Gray boxes and open boxes represent intron and HEG, respectively, whereas crosses stand for intron absence.

Sporadic distribution of the ω intron and homing endonuclease gene (HEG) in the Saccharomyces complex. The phylogenetic relationship was constructed using the concatenated sequences of cox2, small subunit rRNA, 26S rRNA D1/D2, and ITS1-5.8S-ITS2. A total of 100 bootstrap iterations were performed, bootstrap values when >70 are shown. The detailed relationship among the three S. cerevisiae strains was determined using their core nuclear genomic regions shared with S. mikatae. The Saccharomyces and Torulaspora clades, each of which contains intron or HEG variation among closely related strains, are shaded. The intron-HEG organization of each strain is illustrated after the strain name. Gray boxes and open boxes represent intron and HEG, respectively, whereas crosses stand for intron absence.

Results and Discussion

Fast intron turnover and even faster HEG turnover

The presence-absence pattern of the ω intron in the mitochondrial LSU rRNA gene and its encoded HEG was determined in 29 strains within the Saccharomyces complex (Figure 1). There are three main intron-HEG organizations: intron with HEG; intron with no HEG; and no intron at all. The HEG sequence in Torulaspora delbrueckii CBS3003 was disrupted by a 32-nucleotide high-GC insertion, and the HEG in T. delbrueckii CBS5448 was disrupted by a 38-nucleotide high-GC insertion (Figure S1 and Figure S2). The disrupted HEG open reading frames (ORFs) in T. delbrueckii were not attributable to sequencing error, as each type of the T. delbrueckii sequences represents sequences from at least three different T. delbrueckii strains examined in this study (Table S1). The HEGs with an intact ORF were classified as putative functional HEGs, and the ones with a disrupted ORF due to either premature stop codon or indel-associated frame-shift were classified as nonfunctional HEGs. Identical intron-HEG patterns can be found between distantly related strains, whereas very closely related strains may have different intron-HEG organizations. For instance, the ω intron is absent from the S. cerevisiae strain No7, whereas both the intron and HEG are present in the other two S. cerevisiae strains 288C and YJM789. The T. delbrueckii strains contain a copy of non-functional HEG in the intron, while all other Torulaspora species only contain an intron with no HEG. To gain a better understanding of the presence-absence variation of the ω intron within the LSU rRNA and the HEG within the ω intron, we sought a quantitative measure for the intron turnover rate and HEG turnover rate. Our analysis assumed equal rates of gains and losses, since the likelihood value of letting the gain and loss rates vary is not significantly better than that of constraining the gain and loss rates to be the same for either the intron or HEG data. Analyses using the ACE function in the Analyses of Phylogenetics and Evolution package (Paradis ) and BaysianTraits (Pagel ) gave essentially identical results; here we only present the results from ACE. The intron turnover rate (± SE) was estimated as μ = 3.51 ± 1.52 (Figure 2A), suggesting a faster turnover process of introns than nucleotide substitution. We further calculated the likelihood ratio 2ΔlnL between μ = 3.51 and μ = 1. The latter equals the rate of nucleotide substitution (see Materials and Methods), and the estimated intron turnover rate 3.51 is significantly faster than the nucleotide substitution rate (2ΔlnL = 4.26, P < 0.05, df = 1). This is in good agreement with the well-known high mobility of group I introns, as ribozymes often are powered by HEGs. Both the intron and its encoded-HEG target the same recognition sequence to promote the mobility of intron/HEG as a unit. If the HEG always goes together with the intron and is lost slowly through mutation accumulation and degeneration, one would expect a very slow HEG turnover rate within the intron. We compared the HEG turnover rate against the intron turnover rate. Please note the null hypothesis here is not that the HEG rate equals the intron turnover rate; instead, the null hypothesis would be that the HEG turnover rate is close to zero if no mechanism promotes the high mobility of HEG within the intron. Surprisingly, however, the turnover rate of HEGs was estimated as μ = 21.88 ± 16.21 (Figure 2B) and is significantly larger than the rate of nucleotide substitution, as 2ΔlnL between μ = 21.88 and μ = 1 is 14.68 (P < 0.001, df = 1). On the HEG tree (Figure 2B), the estimated HEG turnover rate of 21.88 is significantly greater than 3.51, the estimated intron turnover rate (2ΔlnL = 4.86, P < 0.05, df = 1). A likelihood ratio test between μ = 3.51 and μ = 21.88 was also conducted on the intron tree (Figure 2A), but the likelihood ratio was not significant (2ΔlnL=2.68, P > 0.05, df = 1). The different results of the two likelihood ratio tests are likely attributable to the different shapes in their likelihood surfaces (Figure S3). Nevertheless, our results suggest that both the intron and HEG undergo rapid turnover that is faster than the nucleotide substitution rate. The observed trend that the HEG might undergo faster turnover rates than the intron is subject to further investigation in larger datasets.
Figure 2

The gain and loss rates of the intron (A) and homing endonuclease gene (HEG) (B) during the evolution of the Saccharomyces complex. The phyletic pattern of the intron or HEG is shown in parentheses for each strain. Pie charts illustrate the relative likelihoods (local estimators) of the two possible states (presence or absence) at each ancestral node. An equal rate has been assumed for gains and losses.

The gain and loss rates of the intron (A) and homing endonuclease gene (HEG) (B) during the evolution of the Saccharomyces complex. The phyletic pattern of the intron or HEG is shown in parentheses for each strain. Pie charts illustrate the relative likelihoods (local estimators) of the two possible states (presence or absence) at each ancestral node. An equal rate has been assumed for gains and losses.

High ω intron mobility via intron homing and the unexpectedly high HEG mobility demand an explanation

The HEG within the ω intron has been documented to promote intron insertion by recognizing a specific 18-nucleotide exon sequence (5′-TAGGGATAACAGGGTAAT-3′) (Dujon 1989). We have carefully examined the HEG and intron sequences at a number of subgenic regions (Figure 3, A−D). Among the 22 strains with available exon/intron boundary sequences, 20 strains contain the identical 18-nucleotide HEG recognition sequences (14 strains are shown in Figure 3B). Nakaseomyces bacillisporus and Candida castellii differed by one nucleotide out of the 18 HEG recognition nucleotides from the rest of the strains (not shown). The highly conserved HEG recognition sequence could have served as an excellent precondition for active ω intron invasion promoted by HEG, which is believed to be highly specific on the target sites (Dujon 1989).
Figure 3

Mosaic intron and homing endonuclease gene (HEG) sequences. (A) An illustration of the large subunit rRNA sequence region examined in the study. Seven small subregions (E1, I1, H1, H2, H3, I2, and E2) were arbitrarily chosen to demonstrate the mosaic nature in detailed sequence alignment. (B) Selected exon regions showing no evidence of gene transfer. Dots indicate identities relative to the S. cerevisiae 288C sequence, whereas letters represent nucleotide differences. The 18-nucleotide HEG reorganization sites are underlined. The dendrogram at left was derived from Figure 1. Intron presence is shown as [+], whereas intron absence is shown as [ ]. C) Chimeric structure of the intron sequence. Introns containing HEG are shown as [+], whereas introns lacking HEG are shown as [ ]. Sequences identical with the S. cerevisiae 288C sequence are highlighted in gray. (D) Mosaic structure of the HEG sequence. Sequences identical with or differing by only one nucleotide from the S. cerevisiae 288C sequence are highlighted in gray.

Mosaic intron and homing endonuclease gene (HEG) sequences. (A) An illustration of the large subunit rRNA sequence region examined in the study. Seven small subregions (E1, I1, H1, H2, H3, I2, and E2) were arbitrarily chosen to demonstrate the mosaic nature in detailed sequence alignment. (B) Selected exon regions showing no evidence of gene transfer. Dots indicate identities relative to the S. cerevisiae 288C sequence, whereas letters represent nucleotide differences. The 18-nucleotide HEG reorganization sites are underlined. The dendrogram at left was derived from Figure 1. Intron presence is shown as [+], whereas intron absence is shown as [ ]. C) Chimeric structure of the intron sequence. Introns containing HEG are shown as [+], whereas introns lacking HEG are shown as [ ]. Sequences identical with the S. cerevisiae 288C sequence are highlighted in gray. (D) Mosaic structure of the HEG sequence. Sequences identical with or differing by only one nucleotide from the S. cerevisiae 288C sequence are highlighted in gray. CCTs are considered the footprints of previous invading introns. In this study, the putative CCT spans the nucleotides G and C at sites +15 and +37 downstream from the intron insertion site, whereas the two intron-lacking strains contain nucleotides A and T at the corresponding sites. Group I intron invasion can also take place in the absence of the CCT (Sanchez-Puerta ). In fact, S. paradoxus CBS432 contains the intron but does not have the signature CCT nucleotides G+15 and C+37 (Figure 3B). In conclusion, intron homing has occurred in most strains in the Saccharomyces complex and made important contributions to the high turnover of introns. Intron homing might well explain the high turnover rate of introns, but it cannot explain the apparently high turnover rate of HEGs. If the mobility of HEGs was primarily due to HEG degeneration and loss by mutation accumulation, the HEG would be present in most introns, as mutation accumulation is a much slower process than the turnover of introns and the HEG turnover rate would be expected to be very low or close to zero as HEG turnover was estimated only among the intron-containing taxa. We then investigated the possibility that the unexpectedly fast turnover rate of HEG could be introduced by artifacts in the analysis. First, an intron/HEG presence-absence polymorphism within a defined species (e.g., S. cerevisiae in Figure 1) could result in high estimates for the turnover rate. However, only an intron presence-absence polymorphism was observed among the three S. cerevisiae strains, and no HEG presence-absence polymorphism was observed between the two intron-containing S. cerevisiae strains. Second, we noticed that our obtained “organismal” relationship based on the concatenated four-gene sequences in Figure 1 slightly differs from the topology published by Kurtzman (2003) (mostly on the low-bootstrap-support branches). To minimize the concern that possible phylogenetic uncertainty might affect rate estimation, we performed maximum likelihood estimation on the Kurtzman-topology by using the same set of taxa (Figure S4). In this case, a similarly fast turnover rate of HEG was observed. We then considered whether any previously recognized mechanisms could explain the high HEG mobility. HEGs themselves have been suggested to be mobile elements independent of a host intron (Sellem and Belcour 1997). There has been evidence of phylogenetic incongruence between the intron and the intron-encoded-HEG trees among fungal nuclear rDNA, involving intron-independent mobility of HEGs into both homologous and heterologous positions within a group I intron (Haugen and Bhattacharya 2004; Haugen ). Recent studies have suggested that the intron-encoded HEGs were once free-standing endonucleases and the introns and their HEGs evolved separately to target the same highly conserved sequences, uniting afterward to create a composite mobile element (Bonocora and Shub 2009; Zeng ). The specificity of the HEG recognition sequence can serve as a guide for detecting intron-independent HEG movements. That is, the flanking sequence at the recent insertion site of a group I intron-encoded HEG is expected to be very similar to the HEG recognition sequence (Loizos ). In this study, however, no sequences flanking the HEG in the ω intron were found to be similar to the HEG recognition sequence (see Figure S2 for details). The observed high HEG mobility, therefore, could not be explained by its own recent cleavage activity. HEG degeneration and sequential loss caused by mutation accumulation can contribute to HEG turnover (Goddard and Burt 1999). Our analysis, however, observed HEG turnover rates significantly greater than the nucleotide substitution rate, which could not be explained solely by the mutation-initiated HEG degeneration and loss. Furthermore, reverse splicing is believed to have little impact on the fast turnover of HEG, because the reverse splicing pathway generally involves intron and HEG moving together as a unit (Haugen and Bhattacharya 2004). A satisfactory explanation for the very fast HEG turnover, therefore, demands mechanisms involving genetic changes that are more sudden and/or substantial than mutations.

Mosaic structure of the ω intron and HEG implicates horizontal transfer and gene conversion

Two T. pretoriensis strains, CBS2187 and CBS5080, respectively, contain remarkably different intron sequences. T. pretoriensis CBS2187 is very similar to the two T. delbrueckii strains and S. cerevisiae. T. pretoriensis CBS5080 is similar to K. thermotolerans and L. mirantina in different genera. Significant phylogenetic incongruence was observed between the intron tree and organismal tree (P < 0.001, approximately unbiased test; Figure 4), indicating horizontal transfer of the intron.
Figure 4

Maximum likelihood tree of the Saccharomyces, Torulaspora, and Lachancea intron sequences in the large subunit rRNA gene. Phylogenetic analysis was based on the sequence alignment shown in Figure S1. Bootstrap values when >60% are shown. The two T. pretoriensis strains located at remarkably different phylogenetic positions are boxed. As in Figure 1, gray boxes and open boxes represent intron and homing endonuclease gene, respectively.

Maximum likelihood tree of the Saccharomyces, Torulaspora, and Lachancea intron sequences in the large subunit rRNA gene. Phylogenetic analysis was based on the sequence alignment shown in Figure S1. Bootstrap values when >60% are shown. The two T. pretoriensis strains located at remarkably different phylogenetic positions are boxed. As in Figure 1, gray boxes and open boxes represent intron and homing endonuclease gene, respectively. Mosaic intron sequences were found in a number of strains, i.e., different regions within an intron were of different evolutionary origins, which is inconsistent with intron homing or reverse splicing. For instance, T. pretoriensis CBS2187 and the two T. delbrueckii strains are identical to two S. cerevisiae strains 288C and YJM789 in intron region I1 (Figure 3C). In intron region I2, however, they are highly similar (with no more than two nucleotides different) to all other Torulaspora strains except T. pretoriensis CBS5080, but differed from the S. cerevisiae strains by 17 or 18 nucleotide substitutions plus a two-nucleotide indel. The chimeric intron sequences in T. pretoriensis CBS2187 and T. delbrueckii are believed to be the result of gene conversion after the horizontal transfer of the intron. On the basis of the I1 and I2 regions, T. pretoriensis CBS2187 was found to be remarkably similar to the two T. delbrueckii strains. T. pretoriensis CBS5080 was strikingly different from T. pretoriensis CBS2187 and other Torulaspora strains but similar to K. thermotolerans and L. mirantina. However, these relationships do not hold for the entire intron sequences. For instance, the beginning of the T. pretoriensis CBS5080 (alignment positions 169−245 in Figure S1) was identical with T. delbrueckii CBS5448, but differed from either K. thermotolerans or L. mirantina by at least 12 nucleotides plus one indel. In T. pretoriensis CBS2187, at least two intron regions (alignment positions 212−304, and 1257−1287 in Figure S2) were different from T. delbrueckii but identical with the two S. cerevisiae strains. These findings suggest that horizontal transfer and gene conversion can take place recurrently in the intron sequences with multiple donor strains and can be at a fine-scale. We have previously demonstrated that gene conversion between foreign and native homologs can significantly confound phylogenetic analysis (Hao and Palmer 2011), it is not unreasonable to believe that the true evolutionary history of these group I introns could be much more complex than we have inferred. Like the mosaic intron sequences, the HEG sequences are also highly mosaic (Figure 3D). The H1 (86 nucleotide in length) and H3 (75 nucleotide in length) regions in T. delbrueckii were found to differ by only one nucleotide from the two S. cerevisiae strains, but the H2 region (50 nucleotides in length) in T. delbrueckii differs from the two S. cerevisiae strains by eight nucleotides, all of which could be found in either K. thermotolerans or L. mirantina. It is important to mention that along the T. delbrueckii HEG sequence there are additional regions highly similar or identical with S. cerevisiae, and some other regions that are significantly different from S. cerevisiae but for which the donor species could not be successfully identified (Figure S1). These findings strongly suggest that frequent gene conversion has taken place within the HEG sequence. We used the PHI (pairwise homoplasy index) package (Bruen ) to statistically examine the significance of gene conversion. Significant recombination signals (P < 0.001, PHI test) were found in both the intron and HEG sequences. In a contrast, no significant recombination signal was detected in the exon sequences, nor was significant phylogenetic incongruence observed between the exon tree and organismal tree (Figure S5). In this study, we focused primarily on the demonstration of the highly mosaic nature of the intron and HEG sequences and did not attempt to identify all the possible gene conversion breakpoints. For illustration purposes, the regions in Figure 3 were arbitrarily chosen, as our previous studies have shown that the recombination breakpoints cannot all be correctly detected by existing recombination detection programs, especially when recombination is frequent and recurrent, (Hao 2010; Hao ).

Horizontal transfer and gene conversion alter HEG content and function

All the Torulaspora strains, except T. pretoriensis CBS5080, share nearly identical intron sequences in the I2 regions, but only the T. delbrueckii introns contain HEG (Figure 3C). It is possible that T. delbrueckii once had a native HEG-lacking intron just like other Torulaspora species. We tend to disfavor the explanation that intron homing introduced HEG within a chimeric intron in T. delbrueckii, since if the T. delbrueckii ancestor, like other Torulaspora species, already had an HEG-lacking intron, the intron would interrupt the HEG recognition sequence and prevent the homing process. Unlike intron homing, gene conversion can explain the gain of HEG associated with the mosaic intron sequence in T. delbrueckii (Figure 2C and Figure S2). It is very likely that the T. delbrueckii HEG resulted from gene conversion between a native HEG-lacking intron and a foreign S. cerevisiae-like HEG-containing intron. It is also noteworthy that the HEG-containing intron in S. cerevisiae itself is likely of foreign origin, as the intron sequences in the two S. cerevisiae strains are not clustered with their sister species, S. paradoxus, S. cariocanus, and S. mikatae, which formed a phylogenetic clade (Figure 4). The overall intron sequence in T. pretoriensis CBS2187 is remarkably similar to the two T. delbrueckii strains, whereas the overall intron sequence in T. pretoriensis CBS5080 is similar to K. thermotolerans and L. mirantina. The K. thermotolerans, L. mirantina and T. delbrueckii strains all contain an HEG in their introns, whereas the two T. pretoriensis strains lack the HEG. The mosaic intron sequences within T. pretoriensis (Figure S1 and Figure S2) do not support the scenario of HEG loss after homing of an HEG-containing intron because intron homing would have introduced the HEG and intron as a unit from the very same donor. One possible explanation is that horizontal transfer of an HEG-containing intron has taken place independently in each of the two T. pretoriensis strains; each foreign HEG-containing intron (K. thermotolerans-L. mirantina-like or T. delbrueckii-like) has been separately converted to an HEG-lacking intron by the presumably native HEG-lacking intron via gene conversion. Gene conversion occurs within group I introns independently of the HEG-recognition sequence and HEG function, which therefore increases the chance of genetic exchange. As a consequence, group I introns can show high sequence diversity and high HEG turnover rates. Furthermore, gene conversion also takes place within the HEG sequence, which could potentially alter HEG function in two ways. (1) Gene conversion could pseudogenize a previously functional HEG open reading frame, leading to HEG degeneration, HEG loss, and ultimately intron loss. (2) Gene conversion could rescue a degenerated HEG back to an intact and functional HEG and promote further intron invasion. Together with the gene conversion between the HEG-containing and HEG-lacking introns, gene conversion can be a powerful force driving HEG and intron evolution (Figure 5).
Figure 5

Rapid alteration of mitochondrial intron content via horizontal transfer and gene conversion. (A) Gene conversion that takes place in the exon region can result in the gain or loss of the whole intron. This has been previously reported in the loss of the cox2 intron in plants (Hepburn ). (B) Gene conversion that takes place in the intron region can result in chimeric introns, the gain and loss of homing endonuclease gene (HEG) as per Figure 3. (C) Gene conversion that takes place within HEG can result in chimeric HEG, and the alteration between functional HEG and nonfunctional HEG as per Figure 3.

Rapid alteration of mitochondrial intron content via horizontal transfer and gene conversion. (A) Gene conversion that takes place in the exon region can result in the gain or loss of the whole intron. This has been previously reported in the loss of the cox2 intron in plants (Hepburn ). (B) Gene conversion that takes place in the intron region can result in chimeric introns, the gain and loss of homing endonuclease gene (HEG) as per Figure 3. (C) Gene conversion that takes place within HEG can result in chimeric HEG, and the alteration between functional HEG and nonfunctional HEG as per Figure 3.

On the possibility of gene conversion at the exon region

A growing body of evidence has shown that, after horizontal transfer of foreign homologs, gene conversion takes place between the foreign and native homologs to introduce sequence diversity, sequence content variation, and even the gain and loss of adjacent dispensable genes (Bergthorsson ; Barkman ; Hao ; Mower ; Hepburn ; Kong ). It is therefore not unreasonable to suspect that gene conversion can take place at the exon region of a group I intron following horizontal transfer and directly alter intron presence or absence. This study has only discovered evidence of frequent gene conversion in the intron and HEG sequences, not in the exon sequences. We tend to believe that gene conversion occurs at a much higher frequency in the intron and HEG regions than in the exon region, largely because group I introns are likely under less functional constraint than native protein coding sequences. The possibility of gene conversion at the exon regions can only reinforce the importance of horizontal transfer and gene conversion on the turnover of group I introns. Group I intron invasion is generally believed to introduce the intron as a whole into an intron-less allele. Up to date, few studies questioned the presumption that the whole intron is of a single origin. In this study, our maximum likelihood analysis supports that HEGs undergo faster or at least comparable turnover compared with the intron in the mitochondrial LSU rRNA gene from the Saccharomyces complex, which seemed to be incompatible with current working theories on the movement of group I introns. Our sequence analysis discovered evidence of recurrent gene conversion within the intron and HEG following horizontal transfer. These findings suggest that frequent horizontal transfer and gene conversion can alter HEG content within a group I intron (Figure 5), rescue the nonfunctional HEG, and avoid the ultimate fate of HEG loss and intron loss. Thus, horizontal transfer and gene conversion can play an important role in promoting group I intron mobility via the change of HEG content and HEG sequence. Given the abundance of group I introns in fungal mitochondrial genomes, horizontal transfer and gene conversion would play a significant role in shaping mitochondrial genome architecture. Our results are consistent with the increasingly appreciated role of gene conversion on mitochondrial genome evolution. For instance, gene conversion has recently been shown to take place between the two ends of linear mitochondrial genomes and shapes linear mitochondrial genome architecture (Smith and Keeling 2013). Our conclusions on a mitochondrial group I intron in this study could have a broad implication that gene conversion within a mobile intron can alter the presence/absence and the function of an endonuclease or a retrotranscriptase and ultimately promote the gain and loss of the mobile intron. This might not only be true in organellar genomes, but might also be true in nuclear genomes (e.g., group I, II introns, transposable elements). Considering the wide spread of mobile introns and elements, horizontal transfer and gene conversion could have a significant impact on eukaryotic genomes. All of this could be tested using fast growing genomic data.
  62 in total

1.  CONSEL: for assessing the confidence of phylogenetic tree selection.

Authors:  H Shimodaira; M Hasegawa
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

2.  An approximately unbiased test of phylogenetic tree selection.

Authors:  Hidetoshi Shimodaira
Journal:  Syst Biol       Date:  2002-06       Impact factor: 15.683

3.  The evolution of homing endonuclease genes and group I introns in nuclear rDNA.

Authors:  Peik Haugen; Valérie Reeb; François Lutzoni; Debashish Bhattacharya
Journal:  Mol Biol Evol       Date:  2003-10-31       Impact factor: 16.240

4.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

Review 5.  Group I introns: Moving in new directions.

Authors:  Henrik Nielsen; Steinar D Johansen
Journal:  RNA Biol       Date:  2009-09-23       Impact factor: 4.652

6.  Explosive invasion of plant mitochondria by a group I intron.

Authors:  Y Cho; Y L Qiu; P Kuhlman; J D Palmer
Journal:  Proc Natl Acad Sci U S A       Date:  1998-11-24       Impact factor: 11.205

7.  Exon coconversion biases accompanying intron homing: battle of the nucleases.

Authors:  J E Mueller; D Smith; M Belfort
Journal:  Genes Dev       Date:  1996-09-01       Impact factor: 11.361

8.  Group I intron lateral transfer between red and brown algal ribosomal RNA.

Authors:  D Bhattacarya; J J Cannone; R R Gutell
Journal:  Curr Genet       Date:  2001-08       Impact factor: 3.886

Review 9.  Life with 6000 genes.

Authors:  A Goffeau; B G Barrell; H Bussey; R W Davis; B Dujon; H Feldmann; F Galibert; J D Hoheisel; C Jacq; M Johnston; E J Louis; H W Mewes; Y Murakami; P Philippsen; H Tettelin; S G Oliver
Journal:  Science       Date:  1996-10-25       Impact factor: 47.728

10.  A likely pathway for formation of mobile group I introns.

Authors:  Richard P Bonocora; David A Shub
Journal:  Curr Biol       Date:  2009-02-10       Impact factor: 10.834

View more
  22 in total

1.  Multilayered horizontal operon transfers from bacteria reconstruct a thiamine salvage pathway in yeasts.

Authors:  Carla Gonçalves; Paula Gonçalves
Journal:  Proc Natl Acad Sci U S A       Date:  2019-10-14       Impact factor: 11.205

Review 2.  Maternal transmission, sex ratio distortion, and mitochondria.

Authors:  Steve J Perlman; Christina N Hodson; Phineas T Hamilton; George P Opit; Brent E Gowen
Journal:  Proc Natl Acad Sci U S A       Date:  2015-04-13       Impact factor: 11.205

3.  Mitochondrial genome and diverse inheritance patterns in Pleurotus pulmonarius.

Authors:  Li-Yun Ye; You-Jin Deng; Irum Mukhtar; Guo-Liang Meng; Yan-Jiao Song; Bing Cheng; Jin-Bing Hao; Xiao-Ping Wu
Journal:  J Microbiol       Date:  2020-01-29       Impact factor: 3.422

4.  A Dynamic Mobile DNA Family in the Yeast Mitochondrial Genome.

Authors:  Baojun Wu; Weilong Hao
Journal:  G3 (Bethesda)       Date:  2015-04-20       Impact factor: 3.154

5.  DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood.

Authors:  Tane Kim; Weilong Hao
Journal:  BMC Bioinformatics       Date:  2014-09-27       Impact factor: 3.169

6.  Genetic Drift and Indel Mutation in the Evolution of Yeast Mitochondrial Genome Size.

Authors:  Shujie Xiao; Duong T Nguyen; Baojun Wu; Weilong Hao
Journal:  Genome Biol Evol       Date:  2017-11-01       Impact factor: 3.416

7.  Origin and Spread of Spliceosomal Introns: Insights from the Fungal Clade Zymoseptoria.

Authors:  Baojun Wu; Allison I Macielog; Weilong Hao
Journal:  Genome Biol Evol       Date:  2017-10-01       Impact factor: 3.416

8.  The diversity of mtDNA rns introns among strains of Ophiostoma piliferum, Ophiostoma pluriannulatum and related species.

Authors:  Iman M Bilto; Georg Hausner
Journal:  Springerplus       Date:  2016-08-24

9.  The mitochondrial genome of the plant-pathogenic fungus Stemphylium lycopersici uncovers a dynamic structure due to repetitive and mobile elements.

Authors:  Mario Emilio Ernesto Franco; Silvina Marianela Yanil López; Rocio Medina; César Gustavo Lucentini; Maria Inés Troncozo; Graciela Noemí Pastorino; Mario Carlos Nazareno Saparrat; Pedro Alberto Balatti
Journal:  PLoS One       Date:  2017-10-03       Impact factor: 3.240

10.  Deletion of the sex-determining gene SXI1α enhances the spread of mitochondrial introns in Cryptococcus neoformans.

Authors:  Zhun Yan; Zhimin Li; Li Yan; Yongting Yu; Yi Cheng; Jia Chen; Yunyun Liu; Chunsheng Gao; Liangbin Zeng; Xiangping Sun; Litao Guo; Jianping Xu
Journal:  Mob DNA       Date:  2018-07-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.