| Literature DB >> 25527836 |
Denis Beaudet1, Ivan Enrique de la Providencia1, Manuel Labridy1, Alice Roy-Bolduc1, Laurence Daubois1, Mohamed Hijri2.
Abstract
Arbuscular mycorrhizal fungi (AMF) are multinucleated and coenocytic organisms, in which the extent of the intraisolate nuclear genetic variation has been a source of debate. Conversely, their mitochondrial genomes (mtDNAs) have appeared to be homogeneous within isolates in all next generation sequencing (NGS)-based studies. Although several lines of evidence have challenged mtDNA homogeneity in AMF, extensive survey to investigate intraisolate allelic diversity has not previously been undertaken. In this study, we used a conventional polymerase chain reaction -based approach on selected mitochondrial regions with a high-fidelity DNA polymerase, followed by cloning and Sanger sequencing. Two isolates of Rhizophagus irregularis were used, one cultivated in vitro for several generations (DAOM-197198) and the other recently isolated from the field (DAOM-242422). At different loci in both isolates, we found intraisolate allelic variation within the mtDNA and in a single copy nuclear marker, which highlighted the presence of several nonsynonymous mutations in protein coding genes. We confirmed that some of this variation persisted in the transcriptome, giving rise to at least four distinct nad4 transcripts in DAOM-197198. We also detected the presence of numerous mitochondrial DNA copies within nuclear genomes (numts), providing insights to understand this important evolutionary process in AMF. Our study reveals that genetic variation in Glomeromycota is higher than what had been previously assumed and also suggests that it could have been grossly underestimated in most NGS-based AMF studies, both in mitochondrial and nuclear genomes, due to the presence of low-level mutations.Entities:
Keywords: NGS and Sanger sequencing; arbuscular mycorrhizal fungi; gene variants coexpression; heteroplasmy; mitochondria; polymorphism
Mesh:
Year: 2014 PMID: 25527836 PMCID: PMC4316628 DOI: 10.1093/gbe/evu275
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Allelic diversity analysis features at all investigated loci in two R. irregularis isolates
| Locus | Isolate DAOM/sequence length | Number of clones | Number of alleles | Dominant allele frequency | Polymorphic sites | Chao-1 estimator (lci-hci) | Good’s coverage (%) |
|---|---|---|---|---|---|---|---|
| 197198 (514 bp) | 47 | 1 | 47 | 0 | 1,00 (1,00–1,00) | 100 | |
| 242422 (514 bp) | 56 | 8 | 49 | 12 | 28,62 (6,54–50,71) | 88 | |
| 197198 (306 bp) | 40 | 5 | 36 | 4 | 10,85 (4,21–17,49) | 90 | |
| 242422 (306 bp) | 51 | 5 | 46 | 4 | 9,41 (0,92–17,89) | 94 | |
| 197198 (655 bp) | 51 | 3 | 49 | 2 | 3,98 (2,91–5,04) | 96 | |
| 242422 (683 bp) | 54 | 5 | 50 | 4 | 10,89 (4,19–17,59) | 93 | |
| 197198 (672 bp) | 46 | 3 | 43 | 2 | 3,49 (0,83–6,15) | 98 | |
| 242422 (680 bp) | 37 | 3 | 35 | 2 | 3,97 (2,82–5,12) | 95 | |
| 197198 (541–619 bp) | 40 | 20 | 16 | 22 | 56,56 (−13,76 to 126,89) | 63 | |
| 242422 (619 bp) | 49 | 2 | 48 | 1 | 2,00 (1,06–2,94) | 98 |
aLength of the sequences analyzed at each loci, differences between or within isolates are explained by the presence of indels.
bIndels ≥2 nucleotides were counted as a single polymorphic site.
cThe lower confidence interval (lci) and higher confidence interval (hci) values are the bounds on the upper and lower 95% confidence intervals for the average Chao-1 values. In other words, the observed richness was between those two numbers in 950 of the 1,000 bootstrap iterations (Chao 1984).
dThe estimated sample coverage, which is a measure of sample completeness (Good 1953).
eOut of the 12 SNPs found, 11 were nonsynonymous mutations in the nad1 CDS.
FRarefaction curves inferred with the allelic diversity present at each investigated locus. (A) The nad1 gene and (B) the nuclear single copy rps2 gene were used as background variation controls (shaded in gray) and compared with three mitochondrial intergenic regions. (C) The cob-nad4, (D) cox2-atp8 and (E) nad4-nad1 intergenic regions were investigated. The coverage based rarefaction (solid lines) and an extrapolation up to 100 clones (dashed lines) are shown. A confidence interval of 95% (shaded area, based on a bootstrap method with 1,000 replicates) of the allelic diversity from cloning sequencing (filled circles) is represented. The comparison was done with R. irregularis model isolate DAOM-197198 (blue line) and the first in vitro generation of R. irregularis isolate DAOM-242422 (red line).
FSchematic representation of the intra-isolate mitochondrial allelic diversity found in the nad4-nad1 intergenic region in the model AMF isolate R. irregularis DAOM-197198. (A) The seven structural variant found in the nad4-nad1 intergenic region are shown in a nucleotide alignment. (B) The four nad4 C-terminal protein variants are presented in an amino acid alignment. (C) Electrophoresis gel of the RT-PCR reactions performed on R. irregularis DAOM-197198 cDNA showing the expression of the four nad4 C-terminal allelic variants. (L) low range ladder, (a–d) mRNA expression of all four nad4 length variants, (e, f) reverse transcription negative control in a mitochondrial intergenic region on cDNA and DNA templates, respectively, (g, h) reverse transcription positive control in the nad1 gene performed on RNA and cDNA templates, respectively. Further, Sanger sequencing was used to confirm the transcripts identity.
FExperimental confirmation of a nuclear mitochondrial DNA (numt) copy. (A) Schematic representation of the R. irregularis DAOM-197198 nuclear contig harboring the putative nad4-nad1 numt. (B) Electrophoresis gel of syntenic PCR amplifications performed on the flanking regions of the nad4-nad1 intergenic region nuclear localization. Different combinations of primers were used in order to confirm the presence of the complete intergenic region on the nuclear contig (a–c). A mitochondrial positive control was done (ctrl + mt), with primer spanning from nad4 to nad1. The nuclear positive control (ctrl + nuclear) was performed with the rps2 primers, while the negative control (ctrl −) was done using the same primers without DNA template. All amplifications corresponded to the expected length and were further corroborated by Sanger sequencing (results not shown).
BLASTn survey of all putative mtDNA mitochondrial loci investigated in this study, present in the nuclear genome assemblies of R. irregularis DAOM-197198
| Nuclear assembly | mt-DNA copies in the nuclear genome (numts) | ||||
|---|---|---|---|---|---|
| DNA1 | ○ | JARA01006962 | JARA01006115 | ○ | ○ |
| DNA2 | ○ | JARB01006781 | JARB01003323 | ○ | JARB01002805 |
| N31 | ○ | JAQW01003701 | JAQW01006974 | ○ | ○ |
| N33 | JAQX01002384 | ○ | ○ | ○ | |
| JAQX01002367 | |||||
| N36 | JAQY01000696 | JAQY01000696 | ○ | ○ | ○ |
| JAQY01007470 | |||||
| N6 | ○ | JAQZ01003472 | JAQZ01005991 | ○ | ○ |
| ○ | AUPC01008815 | ○ | ○ | ○ | |
anumts >100 nucleotides were taken into account.
bOnly partial coding gene sequences were found in the nuclear genome.
cThe reference genome available on GenBank, assembled based on the reads used for genome assemblies in JAQW00000000, JAQX00000000, JAQY00000000, JAQZ00000000, JARA00000000 and JARB00000000, was not used in the analysis.
dA bona fide complete nad4-nad1 intergenic region found on a nuclear contig. Its presence was confirmed by syntenic PCR amplifications (supplementary fig. S4, see Supplementary Material online).