| Literature DB >> 35756050 |
Niloofar Vaghefi1,2, Stefan Kusch3, Márk Z Németh4, Diána Seress4, Uwe Braun5, Susumu Takamatsu6, Ralph Panstruga3, Levente Kiss1,4,7.
Abstract
Powdery mildew fungi (Erysiphaceae), common obligate biotrophic pathogens of many plants, including important agricultural and horticultural crops, represent a monophyletic lineage within the Ascomycota. Within the Erysiphaceae, molecular phylogenetic relationships and DNA-based species and genera delimitations were up to now mostly based on nuclear ribosomal DNA (nrDNA) phylogenies. This is the first comprehensive genome-scale phylogenetic analysis of this group using 751 single-copy orthologous sequences extracted from 24 selected powdery mildew genomes and 14 additional genomes from Helotiales, the fungal order that includes the Erysiphaceae. Representative genomes of all powdery mildew species with publicly available whole-genome sequencing (WGS) data that were of sufficient quality were included in the analyses. The 24 powdery mildew genomes included in the analysis represented 17 species belonging to eight out of 19 genera recognized within the Erysiphaceae. The epiphytic genera, all but one represented by multiple genomes, belonged each to distinct, well-supported lineages. Three hemiendophytic genera, each represented by a single genome, together formed the hemiendophytic lineage. Out of the 14 other taxa from the Helotiales, Arachnopeziza araneosa, a saprobic species, was the only taxon that grouped together with the 24 genome-sequenced powdery mildew fungi in a monophyletic clade. The close phylogenetic relationship between the Erysiphaceae and Arachnopeziza was revealed earlier by a phylogenomic study of the Leotiomycetes. Further analyses of powdery mildew and Arachnopeziza genomes may discover signatures of the evolutionary processes that have led to obligate biotrophy from a saprobic way of life. A separate phylogeny was produced using the 18S, 5.8S, and 28S nrDNA sequences of the same set of powdery mildew specimens and compared to the genome-scale phylogeny. The nrDNA phylogeny was largely congruent to the phylogeny produced using 751 orthologs. This part of the study has revealed multiple contamination and other quality issues in some powdery mildew genomes. We recommend that the presence of 28S, internal transcribed spacer (ITS), and 18S nrDNA sequences in powdery mildew WGS datasets that are identical to those determined by Sanger sequencing should be used to assess the quality of assemblies, in addition to the commonly used Benchmarking Universal Single-Copy Orthologs (BUSCO) values.Entities:
Keywords: contaminated genomes; contaminating sequences; metagenomes; obligate biotrophs; phylogenomics; plant–microbe interactions; single-copy orthologs; whole-genome sequencing
Year: 2022 PMID: 35756050 PMCID: PMC9218914 DOI: 10.3389/fmicb.2022.903024
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
The current classification of the Erysiphaceae.
|
|
|
|
| One genus: |
|
|
| Subtribe |
| One genus: |
| Subtribe |
| One genus: |
| Subtribe |
| One genus: |
|
|
| Subtribe |
| Two genera: |
| Subtribe |
| One genus: |
|
|
| Four genera: |
|
|
| One genus: |
|
|
| One genus: |
|
|
|
|
List of powdery mildew genomes assessed for completeness through identification of Benchmarking Universal Single-Copy Orthologs (BUSCO) using BUSCO v.5.2.2 (Simão et al., 2015) and Leotiomycetes dataset Odb10.
| Powdery mildew species | Isolate/specimen ID | Reference | BUSCO results | ||||
| Genome completeness (%) | CS | CD | F | M | |||
| A6 |
| 84.7 | 2,728 | 11 | 193 | 302 | |
|
|
| 94.7 | 3,042 | 21 | 74 | 97 | |
| K1 |
| 94.3 | 3,023 | 25 | 77 | 109 | |
|
|
| 94.6 | 3,041 | 21 | 73 | 99 | |
|
|
| 94.8 | 3,043 | 22 | 69 | 100 | |
|
|
| 94.0 | 3,035 | 7 | 85 | 107 | |
| 94202 |
| 93.3 | 3,007 | 10 | 103 | 114 | |
|
| 94.5 | 3,038 | 18 | 70 | 108 | ||
| JIW2 |
| 91.9 | 2,963 | 9 | 124 | 138 | |
|
|
|
| 95.4 | 2,322 | 762 | 60 | 90 |
|
|
|
| 93.4 | 3,017 | 2 | 80 | 135 |
|
|
| 93.3 | 3,015 | 2 | 83 | 134 | |
|
|
| 93.2 | 3,014 | 1 | 82 | 137 | |
| Ranch9 |
| 93.0 | 3,009 | 2 | 84 | 139 | |
|
|
| 93.0 | 3,000 | 6 | 90 | 138 | |
|
|
| Unpublished JGI | 92.4 | 2,939 | 47 | 107 | 141 |
| – | Unpublished, NCBI Acc. GCA_000208805.1 | 56.2 | 1,818 | 1 | 550 | 865 | |
| – | Unpublished, NCBI Acc. GCA_000214055.1 | 63.8 | 2,064 | 1 | 464 | 705 | |
|
|
|
| 91.7 | 2,950 | 17 | 130 | 137 |
|
|
| 92.8 | 2,990 | 10 | 100 | 134 | |
|
|
|
| 91.1 | 2,926 | 18 | 76 | 214 |
|
|
| 91.3 | 2,932 | 20 | 76 | 206 | |
|
|
|
| 92.2 | 2,692 | 290 | 71 | 181 |
|
| MGH1 | Unpublished, JGI Genome MGH1 v4.0 | 89.9 | 648 | 2,261 | 103 | 222 |
|
|
|
| 82.4 | 2,661 | 3 | 83 | 487 |
|
|
|
| 93.0 | 2,991 | 15 | 27 | 201 |
|
|
|
| 71.3 | 2,306 | 0 | 236 | 692 |
|
|
|
| 76.6 | 2,465 | 12 | 93 | 664 |
|
|
| Unpublished, NCBI Acc. GCA_018398735.1 | 91.9 | 2,962 | 9 | 87 | 176 |
|
|
|
| 91.6 | 2,952 | 9 | 99 | 174 |
|
|
|
| 90.9 | 2,899 | 41 | 67 | 227 |
|
|
| 91.6 | 2,811 | 151 | 66 | 206 | |
Genomes of the isolates/specimens that were selected for the genome-scale phylogenetic analysis are indicated in bold.
List of non-powdery mildew Leotiomycete genomes included in the genome-scale phylogenetic analysis.
| Species | Strain | Accession number | Database | References |
|
| ATCC 22711 |
| NCBI |
|
|
| KUC3009 |
| NCBI |
|
|
| ICMP 21731 |
| NCBI | Unpublished |
|
| NRRL 50072 |
| NCBI |
|
|
| DSM 107184 |
| NCBI |
|
|
| ATCC 20868 |
| NCBI |
|
| MB_m1 |
| NCBI |
| |
|
| ICMP 18395 |
| NCBI | Unpublished |
|
| Zn |
| NCBI |
|
|
| CBS 120377 |
| NCBI |
|
|
| UAMH 11012 |
| NCBI |
|
|
| UK7 |
| NCBI |
|
|
| 02CH4-6a.1 |
| NCBI |
|
|
| SwB9 |
| NCBI |
|
List of powdery mildew specimens included in the genome-scale and nuclear ribosomal (nrDNA) phylogenetic analyses.
| Powdery mildew species | Isolate/specimen ID | GenBank assembly accession | Database | nrDNA accession no. | ||
| 28S | 5.8S | 18S | ||||
| DH14 |
| NCBI |
|
|
| |
| RACE1 |
| NCBI |
|
|
| |
| THUN-12 |
| NCBI |
|
|
| |
| 70 |
| NCBI |
|
|
| |
| 96224 |
| NCBI |
|
|
| |
|
| MS_42D | – | – |
|
|
|
|
| C |
| NCBI |
|
|
|
| e1-101 |
| NCBI |
|
|
| |
| Lodi |
| NCBI |
|
|
| |
| UMSG2 |
| NCBI | n.d. |
|
| |
|
| Palampur-1 | – | JGI |
|
|
|
|
| TENN-F-071826 |
| NCBI |
|
| n.d. |
| HO-73 |
| NCBI |
|
|
| |
|
| UCSC1 |
| NCBI | n.d. |
|
|
| UMSG3 |
| NCBI | n.d. |
|
| |
|
| FPH2017-1 |
| NCBI | n.d. | n.d. | n.d. |
|
| HNHM-MYC-006405 | PRJEB36538 | ENA |
|
|
|
|
| – | PRJEB29715 | ENA |
|
|
|
|
| HMJAU-PM91933 |
| NCBI |
|
|
|
|
| HAL3440 F |
| NCBI |
|
|
|
|
| MH |
| NCBI |
|
|
|
|
| PuE-3 |
| NCBI | n.d. |
| n.d. |
|
| 2086 |
| NCBI |
|
|
|
| Wanju2017 |
| NCBI |
|
|
| |
If available as a result of Sanger sequencing, 28S, 5.8S and 18S nrDNA sequences were obtained from the NCBI GenBank nucleotide database (accession numbers shown in
Comparison of 28S, ITS and 18S sequences extracted from whole genome sequencing (WGS) datasets (genome assembly accession numbers shown in Table 4) to sequences determined by Sanger sequencing available at NCBI GenBank database.
| Powdery mildew species | Isolate/specimen ID | Sanger sequencing (GenBank accession numbers) | WGS (Contigs/scaffolds containing nrDNA sequence fragments) | Number of nucleotide position differences | ||||||
|
|
|
| ||||||||
| 28S | ITS | 18S | 28S | ITS | 18S | 28S | ITS | 18S | ||
| HO-73 | KJ868175 | KJ868176 | KP171512 | QVIK010007 | QVIK01007 | QVIK010012 | 0/328 | – | – | |
| and QVIK010012 | and QVIK010021 | – | – | – | ||||||
| and QVIK010050 | and QVIK010112 | – | – | – | ||||||
|
| UMSG2 | n.a. | KX776199 | KX776199 | n.d. | n.d. | n.d. | – | – | – |
|
| TENN-F-071826 | n.a. | MH766898 | n.a. | n.d. | PEDP01005487.1 | n.d. | – | – | – |
|
| UCSC1 | n.a. | AF031282 | AF031282 | n.d. | n.d. | n.d. | – | – | – |
| UMSG3 | n.a. | KR611314 | KR611314 | n.d. | n.d. | n.d. | – | – | – | |
|
| HNHM-MYC-006405 | OM906815 | MT125856 | OM906851 | scaffold_175 | scaffold_04895 | scaffold_04895 | – | 41/585 | 894/1,797 |
| and scaffold_17559 | – | – | – | |||||||
|
| - | OM906816 | OM906197 | OM906852 | NODE_26095 | NODE_26095 | NODE_26095 | 3/813 | 0/566 | 0/423 |
|
| HMJAU-PM91933 | MZ540403 | MZ541088 | n.a. | JAHY | JAHYSQ010042711.1 | n.d. | 0/721 | 0/563 | – |
|
| HAL3440 F | OM906817 | MZ661116 | OM906853 | JAHY | JAHYSP010014424.1 | JAHYSP010014424.1 | 0/241 | 0/445 | 0/1,736 |
| and JAHYSP010004388.1 | – | – | 32/427 | |||||||
|
| PuE-3 | n.a. | MT180425 | n.a. | n.d. | n.d. | n.d. | – | – | – |
|
| 2086 | MK225554 | n.a. | MK225523 | JACSEY010000300.1 | n.d. | JACSEY010000300.1 | 28/3,019 | – | 4/899 |
FIGURE 1Phylogeny of powdery mildew species and closely related Leotiomycetes based on 751 orthologous protein sequences. Maximum likelihood phylogeny was inferred from a concatenated alignment of amino acid sequences using RAxML-NG v.1.0.1 (Kozlov et al., 2019) under the JTT+I+G4+F substitution model. Taxon labels include species names followed by the specimen/strain accession numbers, except for Parauncinula polyspora, for which a herbarium specimen is not available (Frantzeskakis et al., 2019a). Bootstrap support values greater than 70% are shown at the edges. The tree is rooted to Sclerotinia trifoliorum strain SwB9 (Kusch et al., 2022b). The scale bar represents nucleotide substitutions per site.
FIGURE 2Maximum likelihood phylogeny based on the concatenated sequences of the 5.8S, 18S, and 28S regions of the nuclear ribosomal DNA of representative powdery mildew taxa. Bootstrap values greater than 70% are shown above or below the branches. The tree is rooted to Parauncinula polyspora specimen sequenced by Frantzeskakis et al. (2019a). Taxon labels include species names followed by the specimen accession numbers, except for Parauncinula polyspora, for which a herbarium specimen is not available (Frantzeskakis et al., 2019a). The letters on the right side of the tree indicate source of the 28S, 5.8S, and 18S nrDNA sequences as G (extracted from the genome) or S (produced through Sanger sequencing or obtained from NCBI GenBank database). Dashes indicate missing loci. The scale bar represents nucleotide substitutions per site.
FIGURE 3Alignment of 18S (GenBank accession no. OM906851) and ITS (GenBank accession no. MT125856) sequences of Leveillula taurica HNHM-MYC-006405 produced via Sanger sequencing against the sequence of Scaffold-04895 extracted from the genome assembly of the same strain (Assembly project PRJEB36538 from ENA database). nrDNA annotations are depicted by arrowed boxes. Black and gray colors indicate dissimilar and identical nucleotides, respectively, with gaps depicted by horizontal lines.