Literature DB >> 30018871

Mitochondrial introgression and interspecies recombination in the Fusarium fujikuroi species complex.

Gerda Fourie1, Nicolaas A Van der Merwe1, Brenda D Wingfield1, Mesfin Bogale1, Michael J Wingfield1, Emma T Steenkamp1.   

Abstract

The Fusarium fujikuroi species complex (FFSC) is an economically important monophyletic lineage in the genus Fusarium. Incongruence observed among mitochondrial gene trees, as well as the multiple non-orthologous copies of the internal transcribed spacer region of the ribosomal RNA genes, suggests that the origin and history of this complex likely involved interspecies gene flow. Based on this hypothesis, the mitochondrial genomes of non-conspecific species should harbour signatures of introgression or introgressive hybridization. The aim of this study was therefore to search for recombination between the mitochondrial genomes of different species in the FFSC. Using methods based on mt genome sequence similarity, five significant recombinant regions in both gene and intergenic regions were detected. Using coalescent-based methods and the sequences for individual mt genes, various ancestral recombination events between different lineages of the FFSC were also detected. These findings suggest that interspecies gene flow and introgression are likely to have played key roles in the evolution of the FFSC at both ancient and more recent time scales.

Entities:  

Keywords:  FFSC; evolutionary history; heteroplasmy-associated mitochondrial recombination; hybridization; introgression; species concepts

Year:  2018        PMID: 30018871      PMCID: PMC6048563          DOI: 10.5598/imafungus.2018.09.01.04

Source DB:  PubMed          Journal:  IMA Fungus        ISSN: 2210-6340            Impact factor:   3.515


INTRODUCTION

The Fusarium fujikuroi species complex (FFSC, previously referred to as the Gibberella fujikuroi species complex) is one of several monophyletic assemblages in the genus Fusarium (phylum Ascomycota, order Hypocreales) (Geiser ). This complex is well-known for the many well-documented plant pathogens and mycotoxin producers it includes (Kvas ). Previous work suggests that the FFSC likely emerged during the middle-to-late Miocene (O’Donnell ) and that its evolutionary history could have involved interspecies gene flow (O’Donnell & Cigelnik 1997). Such interspecies interactions have also been described from other Fusarium species (e.g. F. oxysporum and F. graminearum species complexes) (Ma , O’Donnell ) and, in the FFSC, was suggested to explain the existence of multiple non-orthologous copies of the internal transcribed spacer region of the ribosomal RNA genes (O’Donnell & Cigelnik 1997). Interspecies gene flow is typically associated with hybridization and introgression (Stuckenbrock 2016). Hybridization is the production of viable and recombinant offspring by non-conspecific individuals. Introgression occurs when short-lived hybrids backcross with individuals from the parental species, allowing incorporation of new genetic material into the genome of that parental species. The process of introgressing new genetic material into the gene pool of a species is referred to as “introgressive hybridization” (Anderson & Hubricht 1938). In nature, interspecies gene flow is generally thought to be limited by species isolation mechanisms, such as vegetative incompatibility, pheromone-receptor recognition, intersterility and post-zygotic nuclear-cytoplasmic incompatibility systems that restrict or prevent the exchange of genetic material between species (Giraud et al 2008). In many Fusarium species, including those in the FFSC, laboratory-based mating studies have shown that the level of reproductive isolation is not complete and that various species are capable of interbreeding (Desjardins , Leslie ). The mitochondrial (mt) genome is potentially a valuable tool for studying hybridization and introgression in fungi. Fungal mitochondria are mostly inherited from the maternal parent (Taylor 1986), but cross-species interactions would often lead to a short-lived heteroplasmic state in which the hybrid individual would harbour mt haplotypes from both parents (Ballard & Whitlock 2004, Barr ). Recombination between the different haplotypes would cause the introduction and/or replacement of new genes/regions on one or both mt genomes. Such signatures of the ancestral cross-species interactions thus would be retained in the species’ mt genomes, despite the fact that one of the mt haplotypes would typically be purged from subsequent populations (Rand 2001). In fish, for example, Wilson & Bernatchez (1998) described an ancient introgession in Salvelinus namaycush (trout) due to the presence of a single mt haplotype belonging to S. alpines (arctic char) in the S. namaycush population. In plants, Jaramillo-Correa & Bousquet (2005) described mitochondrial recombination between Picea mariana (black spruce) and P. rebens (red spruce) as a result of introgressive hybridization in the zone of contact between these conifers in north-eastern North America. Examples from fungi are still limited, but Fourie hypothesized that the incongruence observed among gene trees inferred from mitochondrial genes could have resulted from recombination between the mt genomes of non-conspecific species. In this study we considered the hypothesis that introgression or introgressive hybridization occurred in the history of the FFSC. Our first aim was to identify and characterize regions in the mt genomes of extant FFSC species that potentially originate from such interspecies gene flow events. For this purpose, the mt genomes for two FFSC species (i.e. F. mangiferae and F. sterilihyphosum) were determined and used to complement those (F. circinatum, F. verticillioides and F. fujikuroi) already in the public domain (Al-Reedy , Fourie ). These genomes were then subjected to the recombination detection method (Martin & Rybicki 2000), Bootscan (Martin ), Geneconv (Padidam ) and Maximum χ2 (Smith 1992) analyses that were designed for detecting interspecific recombination (Martin ). The second aim of this study was to utilize a coalescent-based approach for detecting ancestral recombination in the mt genes of extant FFSC species (Price & Carbone 2005). We purposefully did not employ phylogenetic methods given the low sequence diversity observed in the mt genes of the FFSC and other fungi (e.g. Seifert , Huang , Fourie ). For these coalescent analyses, the sequences for five mt genes (atp6, cox2, nad3, nad5, and nad6), previously shown to support incongruent phylogenetic histories (Fourie ), and from a collection of species spanning the diversity of the FFSC, were utilized. To assess the potential effects of false negatives and/or systematic errors (i.e. artefacts that arise from failure to fully account for the properties of these data) (Delsuc ) in the analyses, the degree to which selection and substitution rate heterogeneity affected the individual mt gene datasets were also evaluated.

MATERIAL AND METHODS

Fungal isolates

Twenty-seven Fusarium isolates representing three, four and five species in the respective “African”, “Asian” and “American” clades of the FFSC (O’Donnell , 2000b), were used (Table 1). This collection included the standard mating type tester strains for the nine mating populations (i.e. MP-A to MP-I) or biological species of the FFSC (Leslie & Summerell 2006, Kvas ), as well as representatives of F. mangiferae and F. sterilihyphosum.
Table 1.

Isolate information for the Fusarium fujikuroi complex species used in this study.

SpeciesMPaCladebIsolate numbercOther numberdAccession numbereReference
F. circinatumHAmericanCMWF 350MRC 7870/Fsp34JX910419*Fourie et al. 2013
F. circinatumHAmericanCMWF 497MRC 7488This study
F. circinatumHAmericanCMWF 498MRC 6213This study
F. verticillioidesAAfricanNRRL 29056JN041210*Al-Reedy et al. 2012
F. verticillioidesAAfricanCMWF 1196MRC 8559This study
F. verticillioidesAAfricanCMWF 1197MRC 8560This study
F. fujikuroiCAsianCMWF 1220IMI58289JX910420*Fourie et al. 2013
F. fujikuroiCAsianCMWF 1200MRC 8532This study
F. fujikuroiCAsianCMWF 1201MRC 8534This study
F. subglutinansEAmericanCMWF 1204MRC 8553/6483This study
F. subglutinansEAmericanCMWF 1205MRC 8554/6512This study
F. temperatumEAmericanCMWF 1206MRC 1084This study
F. temperatumEAmericanCMWF 389MRC 7828KP742837*This study
F. mangiferaeN/AAsianCMWF 1213MRC 8092/8093This study
F. mangiferaeN/AAsianCMWF 1214MRC 7559KP742838*This study
F. sacchariBAsianCMWF 1198MRC 8552This study
F. sacchariBAsianCMWF 1199MRC 8551This study
F. proliferatumDAsianCMWF 1202MRC 8549This study
F. proliferatumDAsianCMWF 1203MRC 8550This study
F. thapsinumFAfricanCMWF 1207MRC 8558This study
F. thapsinumFAfricanCMWF 1208MRC 8557This study
F. nygamaiGAfricanCMWF 1209MRC 8546This study
F. nygamaiGAfricanCMWF 1210MRC 8547This study
F. konzumIAmericanCMWF 1211MRC 8545This study
F. konzumIAmericanCMWF 1212MRC 8544This study
F. sterilihyphosumN/AAmericanCMWF 1215MRC 2802This study
F. sterilihyphosumN/AAmericanCMWF 1216MRC 8105This study

aMating populations (i.e. MP-A to MP-I) or biological species of the FFSC (Leslie & Summerell 2006, Kvas ).

b”African”, “Asian” and “American” clade designation of the Fusarium species based on their associated plant hosts (O’Donnell ).

cAll isolates used in this study are maintained in the Fusarium Culture Collection of Mike Wingfield, FABI, University of Pretoria, South Africa.

dAdditional culture collections: MRC = The South African Medical Research Council, NRRL = ARS Culture Collection, USDA, IMI = CABI Biosciences, Egham, UK.

eAccession numbers of the mitochondrial genome and gene sequences used in this study. Whole mt genome sequences are indicated with asterisks. Individual genes were deposited to the European Nucleotide Archive (http://www.ebi.ac.uk/ena/data/view/EMBL) under the accession numbers LN8111335-LN811269 in the gene order atp6, cox2, nad3, nad5 and nad6 and isolate order similar to Table 1.

Mt genome sequencing and assembly

To determine the mt genome sequence for F. temperatum isolate CMWF 389 and F. mangiferae isolate CMWF 1214 (Table 1), total genomic DNA was extracted as described previously (Groenewald ) and subjected to pyrosequencing at Inqaba Biotechnologies (Pretoria, South Africa) on a single lane using the GS-FLX platform (Roche 454 system, Life Sciences, CT). After exclusion of low quality reads, those encoding mt sequences were identified using BLAST comparison to the available FFSC mt genomes (Al-Reedy , Fourie ). The mt reads for the two species were subsequently assembled de novo with the CLC Genomics Workbench software version 6.0 (CLC bio, Århus, Denmark). The order and orientation of contigs were determined using F. circinatum, F. verticillioides, and F. fujikuroi as reference genomes. Gaps between contigs were filled manually by Sanger sequencing. Protein coding and tRNA mt genes were identified with MFANNOT and RNAweasel (http://megasun.bch.umontreal.ca) (Lang ), as well as tRNAscan-SE (Lowe & Eddy 1997). Gene identities were confirmed with BLASTp comparisons against NCBI.

Mt genome-based recombination analysis (RDP, Bootscan, Geneconv and Maximum χ2)

The Recombination Detection Program (RDP) package version 3.44 (Martin ) was used to screen for possible recombination events in the five FFSC mt genomes and the individual gene datasets using RDP (Martin & Rybicki 2000), Bootscan (Martin ), Geneconv (Padidam ) and Maximum χ2 (Smith 1992). Since these tools differ with regards to their power to detect recombination (Wiuf , Posada 2002), results from all four recombination detection methods were compared and only recombination identified by all four methods were considered. For these analyses (see below), the five mt genomes and the individual gene datasets were aligned using the CLC Genomics Workbench software. The RDP method identifies potential recombinant segments by plotting the pair-wise percentage identity values of all combinations of three sequences/isolates within the given dataset. A potential recombinant region is subsequently identified as the region where the pair-wise percentage identity of sequence A to C or B to C is higher than that of A to B given that A and B are more closely related to one another than to C. The probability that the potential recombinant occurred by chance is then approximated using the binomial distribution (Martin & Rybicki 2000). Bootscan identifies potential recombination segments by constructing pair-wise distances and bootstrap replicates within overlapping sequence blocks. High degrees of bootstrap support for different tree topologies suggest potential recombinant regions (Martin ). Geneconv detects recombination by identifying aligned sequenced pairs, where a match between two sequences is given +1 and a mismatch is awarded a penalty -m. The mismatch penalty depends on the density (ratio) of polymorphic sites between the sequences and the mismatch intensity parameter (G-scale), which is proportional to the total number of site differences (i.e. polymorphic sites) between the two isolates (Padidam ). The Maximum χ2 test searches for recombination break points by comparing the number of segregating sites on both sides of a putative recombination break point and calculating 2 × 2 χ2 values as an expression of the difference on either side of the central partition (Smith 1992). The p-value was set to 0.05 for all methods employed.

Mt gene sequencing

Total genomic DNA was extracted from week-old cultures (Table 1) incubated on half strength potato dextrose agar (PDA; Biolab Diagnostics, Wadeville, South Africa) at 25 °C. For amplification of mt genes, Primer3 (http://primer3.sourceforge.net/) was used to design primers that target nad3, nad5 and nad6 encoding the respective nicotinamide adenine dinucleotide (NADH) dehydrogenase subunits, apt6 that encodes adenosine triphosphate (ATP) synthase subunit 6, and cox2 that encodes cytochrome c oxidase subunit II (Supplementary Table S1). PCR reaction mixtures were adjusted to 25μl with sterile distilled water and contained ca. 5 ng/ml DNA, 0.5 mM of each primer, 250 mM dNTPs (Fermentas, Nunningen, Switzerland), 0.04 U/ml Taq DNA polymerase (Roche Molecular Biochemicals, Manheim, Germany) and PCR buffer with MgCl2 (Roche). PCR cycling conditions consisted of an initial denaturation step at 94 °C for 5 min, followed by 35 cycles at 94 °C for 35 s, each primer pair specific annealing temperature (Supplementary Table S1) for 35 s, and 72 °C for 90 s with a final extension step at 72 °C for 5 min. Amplification products were precipitated and purified with polyethylene glycol (Hartley & Bowen 2003) and sequenced in both directions using the original primers, the BigDye® terminator v3.1 cycle sequencing kit (Applied Biosystems, Foster City, CA) and anABI PRISM®377 DNA sequencer (Applied Biosystems).

Coalescent-based detection of recombination in mt genes

Individual sequence alignments were collapsed into binary matrices by excluding segregating sites and indels using SNAP MAP (Aylor ). This was done in order to assume the infinite-sites model of mutation where at most one mutation event can occur at each site (Kimura 1969). The minimum number of recombination events (Rm) within each binary matrix (gene dataset) was determined using RECMIN (Myers & Griffiths 2002) in SNAP Workbench (Price & Carbone 2005). Rm is based on the four-gamete test of Hudson & Kaplan (1985) that infers recombination between pairs of loci at which all four possible gametic types are present. Finally, minimal ancestral recombination graphs (ARG) were reconstructed using the BEAGLE branch and bound algorithm (Lyngsø ) in SNAP Workbench. The sequence data for whole mt genomes were not used in these coalescent-based analyses. This is because the high sequence diversity of the intergenic and/or intron regions (Al-Reedy , Fourie ) would increase the false positive recombination events detected under the infinite-site model of mutation (McVean ). Conversely, the individual mt gene datasets were not subjected to the analytical tools included in RDP3. This is because of these tools have limited value for detecting recombination in highly conserved regions (Posada 2002, Tsaousis ) such as the five mt gene datasets examined here (see below).

Evaluating possible sources of systematic error and/or false positives

The ability to detect recombination in DNA sequences depends on the genetic diversity of the data as well as among site rate variation (Posada 2002). Little genetic diversity within the dataset could obscure the signal for recombination whereas rate variation could allow recombination to be detected incorrectly. Nucleotide diversity, sequence divergence and rate heterogeneity of each of the individual mt gene datasets were, therefore, estimated. For each dataset, DNAsp ver. 5 (Librado & Rozas 2009) was used to determine π, which is the average number of nucleotide differences per site between two sequences (Nei 1987). This software package was also used to determine the sequence divergence estimates Dxy and Da, which respectively are the average and net numbers of nucleotide substitutions per site between species (Nei 1987). Dxy and Da were used to estimate divergence between species within the FFSC where F. oxysporum was used as the outgroup (Cunnington 2007, Pantou ). For comparative purposes, π, Dxy and Da values were converted to percentages. jModeltest was used to evaluate the pattern of among-site rate heterogeneity for all gene datasets by estimating the shape parameter (α) of the gamma distribution, where smaller α values indicate strong rate variation (Yang 1996, Posada 2008). Signals of recombination might also be obscured by other evolutionary phenomena such as directional selection acting on the target genes and/or analytical artefacts arising from factors such as substitution saturation. Substitution saturation results in homoplasy (Rubinoff & Holland 2005), which can incorrectly point towards recombination because recurrent mutations (i.e. mutation hot spots) and recombination can generate similar patterns of genetic variability (Eyre-Walker , Hagelberg 2003, Galtier ). In addition, recurrent mutation could also result from selection pressure acting on the target genes or selection pressures acting on specific regions of the target genes (Nielsen 2005, Reed & Tishkoff 2006). We, therefore, tested if positive selection acted on the mt gene datasets and determined the level of substitution saturation in the mt gene datasets. Specific sites under positive or negative selection were identified using three codon based maximum likelihood methods. These included the Fixed Effect Likelihood (FEL), Random Effect Likelihood (REL) and Single Likelihood Ancestor Counting (SLAC) methods from Datamonkey (http://www.datamonkey.org/) developed by Kosakovsky . FEL estimates ω for each site in a sequence alignment. REL allows rate variation in both non-synonymous and synonymous rates and a general underlying nucleotide substitution model. SLAC reconstructs ancestral sequences using the joint likelihood reconstruction method in the codon-state space (Kosakovsky ). Results arising from all methods were compared and only codons identified as being under selection by all methods were considered. The level of substitution saturation was measured by calculating the information entropy-based index of substitution saturation (Xia ) with DAMBE5 (Xia 2013). This is a tree-based approach where substitution saturation (ISS) can be determined by testing if the observed entropy at site i is significantly smaller than the expected entropy under full substitution saturation. We compared ISS to the critical ISS value, ISS.C, where the latter depends on the topology of the tree, the number of taxa, the sequence length, the nucleotide frequency, and the transition/transversion ratio, all of which are studied and compared through simulations of an experimental set of topologies given the alignment (Xia , Xia & Lemey 2009). Since the third codon is more variable due to the wobble effect of the genetic code (Spencer & Barral 2012) and thus likely to experience more substitutions, substitution saturation was determined for the first and second codons separately from the third codon of each gene dataset.

RESULTS

Pyrosequencing together with Sanger sequencing allowed for the assembly of the mt genomes of Fusarium temperatum and F. mangiferae. In both cases, the sequences spanned the entire replicon, except for a gap containing the large subunit ribosomal RNA gene and three of the clusters of tRNA genes, i.e. tRNA gene clusters 2, 3 and 4 (Fourie ). None of the pyrosequencing reads mapped to the corresponding region in the mt genomes of F. circinatum, F. fujikuroi, and F verticillioides, in which this sequence has been determined. Also, the repetitive nature of these regions (Fourie ) precluded their amplification and sequencing, despite various attempts using multiple primerpairs. The mt genome sequences of F. temperatum (GenBank KP 742837) and F. mangiferae (GenBank KP 742838) contained the 14 known mt protein coding genes (Fig. 1), the products of which are involved in oxidative phosphorylation. Within these protein coding genes, 12 and four group 1 introns were, respectively, found in the two mt genomes. Fusarium temperatum and F. mangiferae both contained an intron in their cob gene, while the F. temperatum cox1 gene contained 8 introns and the cox3, nad1 and nad2 genes each contained one intron, as opposed to the three introns in cox1of F. mangiferae as well as cox3 and nad2 that were free of introns (Table 2).
Fig. 1.

Annotated map for the mitochondrial fragment of Fusarium temperatum (KP742837). The genome fragments encode the 14 protein coding genes of the oxidative phosphorylation pathway (blue = entire gene; yellow = coding sequence), one rRNA (red), 5tRNA (red) and tRNA cluster 1.

Table 2.

The number of introns identified in the mitochondrial protein coding genes of the Fusarium species used in this study.

nad2cobcox1nad1cox3
F. circinatum14711
F. temperatum11811
F. fujikuroi01000
F. mangiferae00310
F. verticillioides00210
Although the presence of introns within protein coding genes varied greatly among and within the species examined, comparison of the five mt genomes suggested a possible link between intron abundance and the FFSC clade of the species (Table 2). For example, the mt genomes of the “American” clade species F. circinatum and F. temperatum both contained 14 introns as opposed to the one and four introns found in the mt genes of F. fujikuroi and F. mangiferae, respectively, that reside in the “Asian” clade and the three introns in the mt genes of the “African” clade species F. verticillioides. Regarding tRNA genes outside clusters 2, 3, and 4 (fragment not sequenced), the mt genome sequences of both F. temperatum and F. mangiferae contained tRNA cluster 1 which encodes four tRNA genes, as well as nine individual tRNA genes (Fig. 1). In both assemblies, all protein coding and sequenced tRNA genes were located in the same gene order and orientation (Fig. 1), similar to what has been described for other FFSC mt genomes (Al-Reedy , Fourie ).

Mt genome-based recombination analysis

All four of the recombination detection tools identified recombinant regions within the FFSC genomes examined in this study (Supplementary Table S2). The consensus of the four detection methods suggested five significant recombinant regions in both gene and intergenic regions (Table 3). For example, recombinant regions were detected within the intergenic region between the tRNA gene for cysteine and the cox1 gene, as well as between cob and the tRNA gene for arginine. Recombination was also detected within the atp6, atp9 and cox2 genes. For all five of the detected recombination events, RDP, Bootscan, Geneconv, and Maximum χ2 suggested events in which F. circinatum, F. mangiferae or F. temperatum were predicted to be the daughter of the recombination event, although the major and minor parents could not always be identified (Table 3).
Table 3.

The recombinant regions detected by RDP, Geneconv, Bootscan and Maximum χ2 of the Fusarium mitochondrial genomes.

RecombinantaRecombinant regionbParentscP-valuesd
BeginEndGenes includedRDPGeneConvBootscanMax χ2
F. circinatum20 36620934intergenic between tRNA.cys and cox1F. mangiferae x unknown6.89E-043.96E-024.54E-036.25E-05
F. mangiferae28624530atp9 to cox2F. fujkuroi x F. circinatum2.50E-452.29E-482.46E-468.77E-19
F. mangiferae1062110820intergenic between cob and tRNA.cysF. fujikuroi x unknown1.03E-132.59E-118.52E-134.14E-05
F. mangiferae2329624471atp6F. fujikuroi x F. circinatum3.90E-115.08E-082.29E-111.16E-06
F. temperatum1114311771tRNA.argF. mangiferae x unknown1.78E-222.54E-249.66E-272.47E-11

aThe FFC species or daughter in which the recombinant region was detected.

bThe position of the recombinant region according to the mt genome of each specific FFSC species, F. mangiferae (KP742837), F. temperatum (KP742838), and F. circinatum (JX910419).

cThe suggested origin of the recombinant region according to RDP, Geneconv, Bootscan and Maximum χ2 implemented in RDP version 3.44 (Martin ). Unknown = the parent of this recombination event is not present within the dataset.

dThe P-values for each of the recombination detection methods RDP, Geneconv, Bootscan and Maximum χ2 implemented in RDP version 3.44 (Martin ). Only those events detected with all four methods are shown (see Supplementary Table S3 for the results for all individual tests). RDP identifies recombinant segments via pair-wise percentage identity values (Martin & Rybicki 2000), Geneconv detects recombinant segments as aligned pairs that are unusually long and sufficiently similar (Padidam ), Bootscan identifies recombinant segments as high degree of bootstrap support for different phylogenies (Martin ) and Maximum χ2 detects recombinant sections by comparing the number of segregating sites on both sides of the recombinant breakpoint (Smith 1992).

Within the five mt gene sequence datasets examined for the 27 Fusarium isolates included in this study, 17 recombination events were detected using RECMIN. The estimated minimum number of recombination events needed to explain the incompatibilities in the individual datasets were 0, 0, 1, 3, 3, and 10 for the nad3, cox2, atp6, nad6 and nad5 datasets, respectively. This suggested that recombination occurred in all the datasets examined with the exception of nad3 and cox2, and that recombination was extensive in nad5. SNAP MAP collapsed the 27 sequences for each mt dataset into their respective haplotypes. There were six haplotypes in each of the nad3 and cox2 datasets, 10 in each of the nad6 and atp6 datasets and 15 in the nad5 dataset (Supplementary Table S3). Each haplotype typically comprised of the representatives for each species, or the representatives of closely related/sister species combined into a single haplotype (Supplementary Table S3). Consistent with that suggested by the minimum number of recombination events, the ARG analysis suggested extensive recombination in the sequences of nad5, with some recombination in the sequences of atp6 and nad6 and no recombination in the sequences of nad3 and cox2 (Fig. 2). The ARG analysis also allowed identification of the recombinant region within each mt gene dataset. For example, it identified a recombination region within the atp6 dataset at nucleotide position 65 (Fig. 2, Supplementary Table S4).
Fig. 2.

Minimum ancestral recombination graphs (ARGs) inferred from the FFSC datasets for the atp6, nad5 and nad6 using the BEAGLE branch and bound algorithm in SNAP Workbench 2.0 (Price & Carbone 2005). Haplotypes are colour coded according to the clade within the FFSC complex to which the species belong (“African” = red, “American” = blue, “Asian” = green). Circles with numbers represent recombination events and the number within a circle represents the nucleotide position of the recombination event for each dataset. After a recombination event, the two sequences are replaced with a recombinant consisting of a prefix (P) from one sequence and a suffix (S) of the other sequence. The numbers on the branches suggest the number of mutation events before the coalescence of the specific haplotypes.

The ARGs were also used to determine the relative order in which recombination occurred and to evaluate the contribution of mutations and coalescent events. Overall, the “American” clade haplotypes were associated with the deepest recombination events in the ARGs inferred from nad5 (position 1629 and position 1026) and nad6 (position 332) datasets. In addition, the ARGs of atp6, nad6 and nad5 suggested that recombination events had also occurred more recently in the evolutionary history of the FFSC based on their emergence towards the tips of the ARGs, for example, recombinant position 65 of the atp6 dataset and/or positions 503 and 515 of nad6 dataset. Finally, the ARGs of nad6 and nad5 also suggested that recombination occurred between the clades of the FFSC. For example, recombination event position 1629 of the nad5 dataset (haplotype H4) resulted from ancestral individuals that became haplotype H5 and H6 and therefore haplotypes that represent both the “African” and “Asian” clades (Fig. 2). To evaluate the effect of systematic error on our coalescent-based analyses, various additional parameters were estimated and examined. This is because failure to appropriately account for the complex properties of the individual gene datasets could lead to false detection or non-detection of ancestral recombination events. In other words, these analyses provided an indication of the robustness of the conclusions drawn from the RECMIN and ARG results. Indeed, based on our analyses, neither nucleotide diversity nor substitution saturation (homoplasy) appeared to represent significant sources of such systematic errors. The average nucleotide diversity (π) estimated for all of the mt datasets (see Table 1 for EMBL nucleotide sequence database numbers) was low and ranged from 0.4–1 % (Table 4). For these datasets, the sequence divergence estimates Dxy and Da between FFSC and F. oxysporum were also low and ranged between 0.9–2 %, and 0.3–1.5 %, respectively (Table 4). In terms of substitution saturation, the observed entropy ISS was compared to ISS.C. For the five gene datasets used in this study, as well as for the datasets respectively containing first plus second codon position and third codon positions only, the ISS values were significantly smaller than the ISS.C values (Table 5), which suggested negligible substitution saturation.
Table 4.

Gene diversity, divergence and rate heterogeneity of the mitochondrial genes (nad3, nad5, nad6, atp6 and cox2) examined in this study.

GeneDiversity and DivergenceaRate heterogeneityb
ππ (%)DxyDxy (%)DaDa (%)
atp60.005690.50.0203220.0142810.01
nad30.009340.90.0111610.003180.30.58
nad50.004290.40.010210.006960.60.30
nad60.0101510.009670.90.00460.40.46
cox20.006270.60.018821.80.015691.50.01

aSequence diversity (Nei 1987) and divergence (Nei 1987) for each dataset were determined using DNAsp ver. 5 (Librado & Rozas 2009). π = the average number of nucleotide differences per site between two sequences, Dxy = the average number of nucleotide substitutions per site between populations, Da = the number of net nucleotide substitutions per site between populations. π, Dxy, and Da values were converted to percentages for comparative purposes.

bThe shape of the gamma distribution which indicate the pattern of among-site rate heterogeneity for all gene datasets were inferred by with jModeltest (Posada 2008).

Table 5.

Index of substitution saturation as well as the critical value of the index of substitution saturation used to measure the level of substitution saturation within the FFSC mitochondrial genes (nad3, nad5, nad6, atp6, and cox2).

GenesSaturation (1+2 position)Saturation (3 position)
Iss.criticalbIssaIss.criticalbIssa
atp60.70090.01730.61110.0197
nad30.73410.01020.79220.0341
nad50.77580.00170.72830.0263
nad60.69410.00430.64290.045
cox20.68860.15820.68860.2665

aIss = information entropy-based index of substitution saturation determined with DAMBE5 (Xia 2013) by testing if the observed entropy at site i is significantly smaller than the expected entropy under full substitution saturation.

bIss.critical = critical ISS value estimated with DAMBE 5 (Xia 2013). The critical ISS depend on the topology of the true tree, the number of OTUs, the sequence length, nucleotide frequency and the transition/transversion ratio (Xia , Xia & Lemey 2009).

Selection analyses with FEL, REL and SLAC suggested that no codons were under positive selection. In contrast, the FEL, REL and SLAC analyses identified a number of codons under negative/purifying selection for all the datasets included (Supplementary Table S4). For example, 13, 10, 6, 31 and 20 codons of the atp6, cox2, nad3, nad5, and nad6 gene datasets, respectively, were identified to be under negative selection by one and/or two of the methods used. The consensus results, however, suggested that only nad5 and nad6, respectively, had three [(codon 139; methionine), (287; phenylalanine), (356; leucine)] and one (codon 116; serine) codon under negative selection (significant p-values FEL and SLAC = 0.1; REL baysian factor = 50). However, none of the consensus codons in the nad5 and nad6 genes were shown to be subjected to negative selection overlapped with the recombination events suggested from the ARGs. The only potential source of systematic error in our coalescent-based analyses was among site rate heterogeneity. The gamma distribution shape parameter varied among the different mt gene datasets (Table 4), but was particularly low for the atp6 and cox2 datasets (i.e. α-values of 0.01 as opposed to 3 0.3 for the other three datasets). This indicated strong substitution rate variation among sites that could lead to the detection of false positive recombination events. The RECMIN and ARG results did not predict any recombination events in the cox2 dataset, but the single recombination event predicted in the atp6 dataset likely represents an analytical artefact (Posada 2002).

DISCUSSION

This aim of this study was to find evidence of heteroplasmy-associated recombination between mt genomes of species in the FFSC. Both direct and coalescent-based method allowed for the detection of heteroplasmy-associated recombination, which support previous suggestions that gene flow or introgressive hybridization occurred in the history of the FFSC (O’Donnel & Cigelnik 1997, Fourie ). In addition, the detection of recombination by both methods also provided evidence that introgressive hybridization occurred at ancient and more recent time scales. Both of the approaches used in this study allowed for the identification of interspecies recombination events in the ancestry of the FFSC. The methods implemented in the RDP3 package infer recombination events directly from the sequence information provided (Martin ), while the ARGs provide information on the order in which recombination and mutation occurred over evolutionary time (Lyngsø ). In other words, ARGs represent statistical descriptions of the genealogical history of each mt gene sequences backwards in time to the most recent common ancestor (Griffiths 1999, McVean , Lyngsø ). Although examples in fungi are limited, RDP3’s direct recombination detection methods detected recombination and hybridization between free-ranging Australian lizards (Ujvari ), between scorpion species in the family Buthidae (Gantenbein ) and between divergent populations of the nematode species Globodera pallida (Hoolahan ), while ARG analysis showed introgression or hybridization in organisms such hydrothermal vent mussels (Faure ). We performed extensive analyses to ensure that the putative ancestral recombination events detected in this study were not due to a failure to account for the inherent evolutionary complexity of the data (Possada 2002, Delsuc ). Despite nucleotide substitution saturation possibly being common in mt genomes (Spencer & Barral 2012, Gaillardin ), little evidence for it was found as was expected since the FFSC mt datasets were highly conserved. However, as previously described (Dowling , Rand 2001, Stewart , Soares ), the various FFSC mt gene datasets contained evidence of selection at specific codons. But we only detected purifying selection (codons under positive or diversifying selection were not detected) and none of the affected consensus sites (i.e. identified by all methods) occurred in the recombinant regions identified. Also as expected (Excoffier & Yang 1999, Ingman ), strong among-site rate variation was detected in some of the mt genes examined, which may be linked to the function/structure of their products (Yang 1996). Although the infinite-sites model of substitution (Kimura 1969) utilized in the ARG analyses likely excluded the effects of this phenomenon, only one putative recombination event was detected in a dataset associated with strong among-site rate variation. Taken together, these results thus indicated that the putative recombination events identified do not represent analytical artefacts, because the sources of systematic error in the various FFSC mt gene datasets were limited. One concern that could not be fully eliminated in the current study is that the DNA sequence signatures of introgression or hybridization are not readily distinguishable from those of incomplete lineage sorting or deep coalescence (Maddison 1997, Degnan & Rosenberg 2009). Incomplete lineage sorting typically manifests as polymorphisms that persist through several speciation events; i.e. divergence and drift-associated random sorting of an ancestral polymorphism did not lead to its differential fixation in the resulting species (Maddison 1997). In ancestral recombination analyses, incompletely sorted polymorphisms would thus “behave” in a similar manner to those originating from interspecies gene flow (Degnan & Rosenberg 2009). Although we could not rule out its involvement in our analyses, incomplete lineage sorting is unlikely to have affected all of the recombinant sites/regions identified. This is especially true for the long stretches of recombinant sequences (199–1668 bp) detected among the genomes of the “Asian” and “American” clade species included (see Table 3). Future studies should however investigate the role of incomplete lineage sorting in the evolution of FFSC by employing statistical approaches to distinguish gene flow and incomplete lineage sorting based on whole genome sequence data (Joly ). Mechanisms that would allow for recombination between mt genomes of different FFSC species is unknown. It is currently hypothesized that recombination between different mt genomes can occur via the dispersed repeat elements they harbour, exchange between highly conserved regions or via intron homing (Basse 2010, Galtier 2011). Recombination between dispersed repeat elements is common among plant mitochondria in which the repeats serve as crossover points for homologous recombination (Palmer & Herbon 1988). Recombination via intron homing occurs when LAGLIDADG or GIY-YIG endonucleases that are encoded in fungal mito-chondrial introns move into previously intron-less genes (Goddard & Burt 1999, Haugen , Stoddard 2006). Given the overall gene order conservation of the intron-rich mt genomes of the FFSC described here and previously (Al-Reedy , Fourie ), recombination via intron homing and/or exchange between conserved regions is potentially more likely as recombination via dispersed repeats would allow for gene order rearrangements. In general, however, mt recombination in fungi is expected to employ mechanisms that are markedly different from those inferred for animals where mt genomes typically lack introns and dispersed repetitive elements (Rokas , Piganeau ). Overall, the findings presented here indicate that interspecies or heteroplasmy-associated gene flow and recombination occurred at both ancient and recent timescales during the evolution of the FFSC. The results of the ARG analyses presented here (especially the nad5 and nad6 ARGs) provide evidence for older and/or ancient recombination within the FFSC. It is conceivable that such recombination and subsequent introgression events could have occurred in the Miocene, prior to biogeographic separation of the clades (O’Donnell ), during which diversification of the complex coincided with the radiation of grasses and eudicots that use C4 photosynthesis to fix carbon (Edwards , Christin , O’Donnell ). During this time period, environmental conditions likely influenced the distribution of the ancestral FFSC members, thus providing the opportunity for introgressive hybridization to occur (Olson & Stenlid 2002, Schardl & Craven 2003). Recombination events that occurred at more recent timescales were revealed by the tools implemented in RDP3 (Martin ). These all detect recombination by identifying regions of sequence similarity between individuals that are unusually high in comparison to the overall sequence similarity of these individuals as estimated from the entire region and/or genome in question. However, post-recombination mutations (i.e. those that accumulate over evolutionary time after the original interspecies gene flow event) would obscure the distinction of these recombinant regions from other background mutations (Posada 2002). Accordingly, recombinant regions detected by these methods most likely represent sites at which the signatures of recombination have not yet been eroded away by normal mutational processes. The five significant regions of recombination detected here (see Table 3), were thus the result of interspecies gene flow events that occurred relatively recently in the history of the FFSC, although information regarding the geographic contact points of these events are lacking. Also, many FSSC species are inter-fertile under laboratory conditions (Desjardins , Leslie ) and a natural hybrid has been described from native tall-grass prairie south of Manhattan in Kansas (Leslie , 2007). Overall the results of this study showed that interspecies gene flow and introgressive hybridization have played an important role in the evolution of the FFSC and will likely continue to do so. However, the extent to which these phenomena would influence the evolution of the complex and at what point new species will emerge remains to be determined.
  76 in total

1.  A broad survey of recombination in animal mitochondria.

Authors:  Gwenaël Piganeau; Michael Gardner; Adam Eyre-Walker
Journal:  Mol Biol Evol       Date:  2004-09-01       Impact factor: 16.240

Review 2.  Analyzing the mosaic structure of genes.

Authors:  J M Smith
Journal:  J Mol Evol       Date:  1992-02       Impact factor: 2.395

Review 3.  Homing endonuclease structure and function.

Authors:  Barry L Stoddard
Journal:  Q Rev Biophys       Date:  2005-12-09       Impact factor: 5.318

4.  Widespread recombination in published animal mtDNA sequences.

Authors:  A D Tsaousis; D P Martin; E D Ladoukakis; D Posada; E Zouros
Journal:  Mol Biol Evol       Date:  2005-01-12       Impact factor: 16.240

Review 5.  Gene tree discordance, phylogenetic inference and the multispecies coalescent.

Authors:  James H Degnan; Noah A Rosenberg
Journal:  Trends Ecol Evol       Date:  2009-03-21       Impact factor: 17.712

6.  How clonal are human mitochondria?

Authors:  A Eyre-Walker; N H Smith; J M Smith
Journal:  Proc Biol Sci       Date:  1999-03-07       Impact factor: 5.349

7.  Gene genealogies reveal global phylogeographic structure and reproductive isolation among lineages of Fusarium graminearum, the fungus causing wheat scab.

Authors:  K O'Donnell; H C Kistler; B K Tacke; H H Casper
Journal:  Proc Natl Acad Sci U S A       Date:  2000-07-05       Impact factor: 11.205

8.  The application of high-throughput AFLP's in assessing genetic diversity in Fusarium oxysporum f. sp. cubense.

Authors:  Susan Groenewald; Noëlani Van Den Berg; Walter F O Marasas; Altus Viljoen
Journal:  Mycol Res       Date:  2006-02-14

9.  The complete mitochondrial genome of Fusarium oxysporum: insights into fungal mitochondrial evolution.

Authors:  Malena P Pantou; Vassili N Kouvelis; Milton A Typas
Journal:  Gene       Date:  2008-04-27       Impact factor: 3.688

10.  The intriguing evolutionary dynamics of plant mitochondrial DNA.

Authors:  Nicolas Galtier
Journal:  BMC Biol       Date:  2011-09-27       Impact factor: 7.431

View more
  8 in total

1.  Panorama of intron dynamics and gene rearrangements in the phylum Basidiomycota as revealed by the complete mitochondrial genome of Turbinellus floccosus.

Authors:  Jie Cheng; Qing Luo; Yuanhang Ren; Zhou Luo; Wenlong Liao; Xu Wang; Qiang Li
Journal:  Appl Microbiol Biotechnol       Date:  2021-02-08       Impact factor: 4.813

2.  Exploring Mitogenomes Diversity of Fusarium musae from Banana Fruits and Human Patients.

Authors:  Luca Degradi; Valeria Tava; Anna Prigitano; Maria Carmela Esposto; Anna Maria Tortorano; Marco Saracchi; Andrea Kunova; Paolo Cortesi; Matias Pasquali
Journal:  Microorganisms       Date:  2022-05-28

3.  Diversity of Mobile Genetic Elements in the Mitogenomes of Closely Related Fusarium culmorum and F. graminearum sensu stricto Strains and Its Implication for Diagnostic Purposes.

Authors:  Tomasz Kulik; Balazs Brankovics; Anne D van Diepeningen; Katarzyna Bilska; Maciej Żelechowski; Kamil Myszczyński; Tomasz Molcan; Alexander Stakheev; Sebastian Stenglein; Marco Beyer; Matias Pasquali; Jakub Sawicki; Joanna Wyrȩbek; Anna Baturo-Cieśniewska
Journal:  Front Microbiol       Date:  2020-05-25       Impact factor: 5.640

4.  The 287,403 bp Mitochondrial Genome of Ectomycorrhizal Fungus Tuber calosporum Reveals Intron Expansion, tRNA Loss, and Gene Rearrangement.

Authors:  Xiaolin Li; Lijiao Li; Zhijie Bao; Wenying Tu; Xiaohui He; Bo Zhang; Lei Ye; Xu Wang; Qiang Li
Journal:  Front Microbiol       Date:  2020-12-09       Impact factor: 5.640

5.  Characterization and phylogenetic analysis of the complete mitochondrial genome of the pathogenic fungus Ilyonectria destructans.

Authors:  Piotr Androsiuk; Adam Okorski; Łukasz Paukszto; Jan Paweł Jastrzębski; Sławomir Ciesielski; Agnieszka Pszczółkowska
Journal:  Sci Rep       Date:  2022-02-11       Impact factor: 4.379

Review 6.  Promising Perspectives for Detection, Identification, and Quantification of Plant Pathogenic Fungi and Oomycetes through Targeting Mitochondrial DNA.

Authors:  Tomasz Kulik; Katarzyna Bilska; Maciej Żelechowski
Journal:  Int J Mol Sci       Date:  2020-04-10       Impact factor: 5.923

7.  The complete mitochondrial genome of medicinal fungus Taiwanofungus camphoratus reveals gene rearrangements and intron dynamics of Polyporales.

Authors:  Xu Wang; Lihua Jia; Mingdao Wang; Hao Yang; Mingyue Chen; Xiao Li; Hanyu Liu; Qiang Li; Na Liu
Journal:  Sci Rep       Date:  2020-10-05       Impact factor: 4.379

8.  Evidence for Persistent Heteroplasmy and Ancient Recombination in the Mitochondrial Genomes of the Edible Yellow Chanterelles From Southwestern China and Europe.

Authors:  Ying Zhang; Shaojuan Wang; Haixia Li; Chunli Liu; Fei Mi; Ruirui Wang; Meizi Mo; Jianping Xu
Journal:  Front Microbiol       Date:  2021-07-14       Impact factor: 5.640

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.