| Literature DB >> 30588394 |
Balázs Brankovics1,2,3, Tomasz Kulik4, Jakub Sawicki4, Katarzyna Bilska4, Hao Zhang5, G Sybren de Hoog2,3, Theo Aj van der Lee1, Cees Waalwijk1, Anne D van Diepeningen1,2.
Abstract
There is a gradual shift from representing a species' genome by a single reference genome sequence to a pan-genome representation. Pan-genomes are the abstract representations of the genomes of all the strains that are present in the population or species. In this study, we employed a pan-genomic approach to analyze the intraspecific mitochondrial genome diversity of Fusarium graminearum. We present an improved reference mitochondrial genome for F. graminearum with an intron-exon annotation that was verified using RNA-seq data. Each of the 24 studied isolates had a distinct mitochondrial sequence. Length variation in the F. graminearum mitogenome was found to be largely due to variation of intron regions (99.98%). The "intronless" mitogenome length was found to be quite stable and could be informative when comparing species. The coding regions showed high conservation, while the variability of intergenic regions was highest. However, the most important variable parts are the intron regions, because they contain approximately half of the variable sites, make up more than half of the mitogenome, and show presence/absence variation. Furthermore, our analyses show that the mitogenome of F. graminearum is recombining, as was previously shown in F. oxysporum, indicating that mitogenome recombination is a common phenomenon in Fusarium. The majority of mitochondrial introns in F. graminearum belongs to group I introns, which are associated with homing endonuclease genes (HEGs). Mitochondrial introns containing HE genes may spread within populations through homing, where the endonuclease recognizes and cleaves the recognition site in the target gene. After cleavage of the "host" gene, it is replaced by the gene copy containing the intron with HEG. We propose to use introns unique to a population for tracking the spread of the given population, because introns can spread through vertical inheritance, recombination as well as via horizontal transfer. We demonstrate how pooled sequencing of strains can be used for mining mitogenome data. The usage of pooled sequencing offers a scalable solution for population analysis and for species level comparisons studies. This study may serve as a basis for future mitochondrial genome variability studies and representations.Entities:
Keywords: Comparative genomics; Mitogenome; Mitogenomics; Pan-genome; Pool sequencing
Year: 2018 PMID: 30588394 PMCID: PMC6304159 DOI: 10.7717/peerj.5963
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
List of Fusarium strains analysed in this study
| CBS123657 (PH-1) NRRL31084 | USA | maize | 1996 | both | |
| CBS119173 | USA | wheat head | 2005 | individually | |
| CBS139513 | Argentina | barley | 2011 | individually | |
| CBS139514 | Argentina | barley | 2010 | individually | |
| CBS119799 | South Africa | wheat kernel | 1987 | individually | |
| CBS119800 | South Africa | maize | 1990 | individually | |
| CBS110263 | Iran | maize | 1968 | individually | |
| CBS123688 | Sweden | oats | unknown | individually | |
| CBS128539 | Belgium | wheat kernel | 2007 | individually | |
| CBS138561 | Poland | wheat kernel | 2010 | individually | |
| CBS138562 | Poland | wheat kernel | 2010 | individually | |
| CBS138563 | Poland | wheat kernel | 2003 | individually | |
| CBS104.09 | unknown | unknown | 1909 | individually | |
| CBS185.32 | unknown | maize | 1932 | individually | |
| CS3005 | Australia | barley | 2001 | individually | |
| HN9-1 | China | wheat | 2002 | individually | |
| HN-Z6 | China | wheat | 2012 | individually | |
| INRA-156 | France | wheat | 2001 | individually | |
| INRA-159 | France | wheat | 2001 | individually | |
| INRA-164 | France | wheat | 2002 | individually | |
| INRA-171 | France | wheat | 2001 | individually | |
| INRA-181 | France | wheat | 2002 | individually | |
| INRA-195 | France | wheat | 2002 | individually | |
| YL-1 | China | wheat | 2012 | individually | |
| bfb0999_1 | China | barley | 2005 | pooled | |
| 68D2 | Netherlands | wheat | 2001 | pooled | |
| CHG013 | China | maize | 2005 | pooled | |
| CHG157 | China | barley | 2005 | pooled | |
| CBS123666 | USA | wheat head | 2000 | individually |
Figure 1The mitogenome of F. graminearum strain PH-1.
Green blocks: tRNA coding genes, blue arrows: genes or ORFs (no labels added for short ORFs), yellow arrows: protein coding sequences, red arrows: rDNA coding sequence, purple arrows: intron encoded homing endonuclease genes, gray box: the large variable (LV) region with orf1931 (LV-uORF).
Mitochondrial genome variation of the Fusarium graminearum strains.
| CBS123657 (PH-1) |
| 95,638 | 34 | 49,429 | 46,209 |
| CBS185.32 |
| 96,300 | 34 | 50,120 | 46,180 |
| CBS110263 |
| 97,364 | 35 | 51,165 | 46,199 |
| CBS119173 |
| 100,342 | 37 | 54,130 | 46,212 |
| CBS119799 |
| 96,005 | 35 | 49,919 | 46,086 |
| CBS119800 |
| 97,462 | 35 | 51,280 | 46,182 |
| CBS123688 |
| 95,035 | 34 | 48,837 | 46,198 |
| CBS128539 |
| 96,134 | 35 | 49,996 | 46,138 |
| CBS138561 |
| 95,034 | 34 | 48,837 | 46,197 |
| CBS138562 |
| 99,062 | 36 | 52,865 | 46,197 |
| CBS138563 |
| 99,068 | 36 | 52,865 | 46,203 |
| CBS139514 |
| 96,167 | 35 | 49,980 | 46,187 |
| CBS139513 |
| 95,041 | 34 | 48,837 | 46,204 |
| CBS104.09 |
| 97,460 | 35 | 51,280 | 46,180 |
| CS3005 |
| 93,560 | 33 | 47,381 | 46,179 |
| HN9-1 |
| 96,307 | 35 | 51,567 | 44740 |
| HN-Z6 |
| 97,767 | 34 | 50,120 | 47647 |
| INRA-156 |
| 101,424 | 37 | 55,243 | 46181 |
| INRA-159 |
| 96,199 | 35 | 49,996 | 46203 |
| INRA-164 |
| 99,678 | 37 | 53,476 | 46202 |
| INRA-171 |
| 96,199 | 35 | 49,996 | 46203 |
| INRA-181 |
| 96,187 | 35 | 49,996 | 46191 |
| INRA-195 |
| 97,358 | 35 | 51,165 | 46193 |
| YL-1 |
| 97,996 | 36 | 51,777 | 46219 |
Notes.
Core stands for the total mitogenome length minus the length of the intron regions.
Distribution of variation in the intron and intergenic regions within and between species.
| Intraspecies | Interspecies | |||||
|---|---|---|---|---|---|---|
| Coding | 21,572 | 4 | 0.02% | 21,572 | 5 | 0.02% |
| Intron | 59,091 | 399 | 0.68% | 59,091 | 419 | 0.71% |
| Intergenic | 18,982 | 310 | 1.63% | 189,82 | 436 | 2.30% |
Figure 2Pan-genomic representation of the presence/absence variation of introns in the mitochondrial genomes of the 24 F. graminearum strains.
In the figure, the thick orange lines highlight intron sequences in the alternative sequences. (SNPs and short indels are not indicated.) (A) The insertion of nad2-i1632; (B) the insertion of cox2-i228, cox2-i318 and cox2-i552; (C) the insertion of cob-i159 and cob-i201; (D) longer variant of cob-i490; (E) intron insertion in the HEG located in cox1-i906; and (F) the insertion of cox1-1287.
List of single nucleotide polymorphisms identified in the pooled dataset of Fusarium graminearum strains.
Positions are aligned positions between the PH-1 reference sequence and the pooled sequences (“short” and “long”). “Reference” refers to the nucleotide found in the given reference sequence used for mapping, while “Alternative” refers to the nucleotide suggested by the mapped reads. Position 90,636 shows unusual ratios: in both mappings the reference nucleotide (C or A) has a frequency of 70% and the alternative nucleotide has 30%. This is due to an adjacent indel that affects the mapping results.
| PH-1 | Pooled | |||
|---|---|---|---|---|
| Position | Reference | Alternative | Reference | Alternative |
| 2,337 | A (0.77) | G (0.23) | A (0.77) | G (0.23) |
| 6,288 | C (0.41) | A (0.59) | A (0.61) | C (0.39) |
| 6,355 | T (0.42) | C (0.58) | C (0.60) | T (0.40) |
| 13,540 | C (0.78) | A (0.22) | C (0.78) | A (0.22) |
| 37,126 | C (0.75) | T (0.25) | C (0.75) | T (0.25) |
| 37,773 | A (0.75) | G (0.25) | A (0.75) | G (0.25) |
| 44,773 | A (0.62) | G (0.38) | A (0.62) | G (0.38) |
| 64,776 | G (0.53) | A (0.47) | G (0.53) | A (0.47) |
| 70,827 | A (0.62) | G (0.38) | A (0.62) | G (0.38) |
| 89,194 | G (0.57) | A (0.43) | G (0.57) | A (0.43) |
| 90,636 | C (0.70) | A (0.30) | A (0.70) | C (0.30) |
| 95,918 | A (0.43) | C (0.57) | C (0.59) | A (0.41) |
| 99,784 | A (0.40) | G (0.60) | G (0.62) | A (0.38) |
| 100,362 | C (0.42) | A (0.58) | A (0.59) | C (0.41) |
| 100,538 | G (0.42) | A (0.58) | A (0.61) | G (0.39) |