| Literature DB >> 32714773 |
Robert Lücking1,2, M Catherine Aime2,3, Barbara Robbertse4, Andrew N Miller2,5, Hiran A Ariyawansa2,6, Takayuki Aoki2,7, Gianluigi Cardinali8, Pedro W Crous2,9,10, Irina S Druzhinina2,11,12, David M Geiser13, David L Hawksworth2,14,15,16,17, Kevin D Hyde2,18,19,20,21, Laszlo Irinyi22, Rajesh Jeewon23, Peter R Johnston2,24, Paul M Kirk25, Elaine Malosso2,26, Tom W May2,27, Wieland Meyer22, Maarja Öpik2,28, Vincent Robert8,9, Marc Stadler2,29, Marco Thines2,30, Duong Vu9, Andrey M Yurkov2,31, Ning Zhang2,32, Conrad L Schoch2,4.
Abstract
True fungi (Fungi) and fungus-like organisms (e.g. Mycetozoa, Oomycota) constitute the second largest group of organisms based on global richness estimates, with around 3 million predicted species. Compared to plants and animals, fungi have simple body plans with often morphologically and ecologically obscure structures. This poses challenges for accurate and precise identifications. Here we provide a conceptual framework for the identification of fungi, encouraging the approach of integrative (polyphasic) taxonomy for species delimitation, i.e. the combination of genealogy (phylogeny), phenotype (including autecology), and reproductive biology (when feasible). This allows objective evaluation of diagnostic characters, either phenotypic or molecular or both. Verification of identifications is crucial but often neglected. Because of clade-specific evolutionary histories, there is currently no single tool for the identification of fungi, although DNA barcoding using the internal transcribed spacer (ITS) remains a first diagnosis, particularly in metabarcoding studies. Secondary DNA barcodes are increasingly implemented for groups where ITS does not provide sufficient precision. Issues of pairwise sequence similarity-based identifications and OTU clustering are discussed, and multiple sequence alignment-based phylogenetic approaches with subsequent verification are recommended as more accurate alternatives. In metabarcoding approaches, the trade-off between speed and accuracy and precision of molecular identifications must be carefully considered. Intragenomic variation of the ITS and other barcoding markers should be properly documented, as phylotype diversity is not necessarily a proxy of species richness. Important strategies to improve molecular identification of fungi are: (1) broadly document intraspecific and intragenomic variation of barcoding markers; (2) substantially expand sequence repositories, focusing on undersampled clades and missing taxa; (3) improve curation of sequence labels in primary repositories and substantially increase the number of sequences based on verified material; (4) link sequence data to digital information of voucher specimens including imagery. In parallel, technological improvements to genome sequencing offer promising alternatives to DNA barcoding in the future. Despite the prevalence of DNA-based fungal taxonomy, phenotype-based approaches remain an important strategy to catalog the global diversity of fungi and establish initial species hypotheses.Entities:
Keywords: COX1; COX2; Oxford Nanopore technologies; PacBio; RPB2; Read placement; Species concepts; TEF1
Year: 2020 PMID: 32714773 PMCID: PMC7353689 DOI: 10.1186/s43008-020-00033-z
Source DB: PubMed Journal: IMA Fungus ISSN: 2210-6340 Impact factor: 3.515
Fig. 1The diversity of Fungi and fungal-like organisms is staggering, with between 2.2 to 3.8 million species predicted (Hawksworth and Lücking 2018). Identification tools specifically tailored to each group are indispensable to deal with such richness. A–B, Oomycota; C–D, Mycetozoa; E, Mucoromycota; F–U, Ascomycota; V–AE, Basidiomycota. A, Albugo candida (on Capsella bursa-pastoris). B, Hyaloperonospora thlaspeos-perfoliati (on Microthlaspi erraticum); for Oomycota, COX1 and COX2 have been proposed as alternative DNA barcodes (Choi et al. 2015). C, Arcyria denudata. D, unidentified slime mold plasmodium; a portion of the nuSSU, in combination with COX1 and TEF1, has been shown to provide good resolution to delimit species (Schnittler et al. 2017). E, Phycomyces blakesleeanus (mating). F, Helicoma taenia (conidium). G, Sorokina caeruleogrisea (ascomata). H, Fusarium duofalcatisporum (conidia); secondary DNA barcodes, such as TEF1, have been proposed to delimit species in this plant-pathogenic genus (O'Donnell et al. 2015; Al-Hatmi et al. 2016; Xia et al. 2019). I, Placomaronea candelarioides (thallus). J, Xylaria polymorpha (stromata bearing ascomata). K, Rhytidhysteron columbiense (ascomata); this conspicuous saprotrophic genus contains numerous unrecognized species based on ITS (Soto-Medina and Lücking 2017). L, Neocosmospora vasinfecta (perithecia); this genus is one example of competing solutions to ranking clades in Fusarium s.lat. at genus level (Summerell 2019; Sandoval-Denis et al. 2019), a problem that is not resolvable by phylogeny alone (Lücking 2019), but which affects nomenclature of economically important fungi. M, Ophiocordyceps curculionum (stroma growing out of a weevil). N, Cookeina tricholoma (ascomata). O, basidiomycetous yeast (various members of Cystofilobasidiales) efflux on tree stump (Yurkov et al. 2020). P, Aspergillus sydowii (culture); fungi of this genus can cause aspergillosis in humans and are identified through a combination of DNA barcoding (TUB2) and high-resolution melting (HRM) assay (Fidler et al. 2017). Q, Pyrenula subpraelucida (ascospore). R, Pseudopestalotiopsis ixorae (conidium); this is another genus for which secondary DNA barcodes (TEF1, TUB2) have been proposed (Maharachchikumbura et al. 2012, 2014). S, Rhytisma acerinum (tar spot on Acer); recently, a separate, near-cryptic North American species was discovered integrating ITS and biological data (Hudler et al. 1998). T, Macgarvieomyces juncicola (conidiophore with conidia). U, Batistia annulipes (stromata). V, Thelephora terrestris (basidioma). W, Cora imi (thallus); until recently, this genus was believed to include a single species, but integrative taxonomy combining the ITS barcoding marker and morpho-anatomical and ecological characters recognizes nearly 200 (Lücking et al. 2014, 2017). X, Cyathus striatus (basidiomata). Y, Ramaria formosa (basidiomata). Z, Campanella caesia (basidiomata); based on ITS barcoding data, this presumably European taxon is subcosmopolitan, being also found in North America including Mexico, South America (Colombia; photograph), and Africa (Kenya). AA, Coprinellus disseminatus (basidiomata). AB, Aseroe rubra (basidioma). AC, Tremella mesenterica (basidioma). AD, Schizophyllum commune (basidiomata); this industrially important taxon includes geographically separated clades based on the IGS (James et al. 2001). AE. Amanita muscaria (basidioma); according to a three-marker study (ITS, nuLSU, TUB2; Geml et al. 2006), this well-known mushroom comprises several cryptic species
Fig. 2The dependence of fungal identification on species concepts, delimitation and recognition approaches, and the importance of the verification process. Taxonomic specialists typically elaborate the first four steps up to the production of identification tools, whereas taxonomic users apply identification tools and perform verification. The verification process is generally neglected but is of crucial importance for accurate identifications
Fig. 3Comparison of BLAST-based (pairwise alignment) vs. tree-based (multiple alignment) identification of a target fungal ITS sequence (DB42771, Vietnam; see Lücking et al. 2020). BLAST (both blastn and megablast) initially suggested Trametes cubensis or Leiotrametes lactinea to be the most likely identification: the label ‘cf. cubensis’ had the three highest named BLAST hits and appeared six times among the top ten named hits. Yet, multiple alignment-based phylogenetic analysis placed the target sequence in a clade corresponding to T. menziesii, described from Indonesia. Apart from demonstrating the shortcomings of BLAST identifications, this example illustrates numerous problems with reference sequence labeling, including wrongly identified sequences and confusion about species and even genus concepts and nomenclature (Lücking et al. 2020). A user not aware of such issues would not be able to obtain a reliable identification using BLAST only, whereas the alignment-based phylogenetic approach followed by a verification process provided an accurate result in this case. Notably, two remedies would substantially improve BLAST identification results: (1) correct labeling of the reference sequences through third-party annotations (middle column), plus (2) sorting BLAST results by percentage identity (highlighted values)
Fig. 4Proportion of species with sequence data compared to total number of species per genus known in fungal genera, based on integration of the NBCI taxonomy and Species Fungorum. The mean proportion varies between 40% in species-poor genera and as little as 20% in species-rich genera. At least some species-poor to moderately diverse genera have all species sequenced, whereas many others are devoid of sequenced species. In more diverse genera, the maximum proportion of sequenced species sharply drops as a function of species richness, but also the minimum proportion increases, meaning that all large genera have at least some species sequenced but are consistently incomplete
DNA Barcoding markers proposed for fungi, their recommended nomenclature and selected examples (see also Stielow et al. 2015; Xu 2016)
| DNA barcoding marker | Acronym | Examples | References |
|---|---|---|---|
| Internal transcribed spacer | ITS | universal, | Schoch et al. |
| Intergenic spacer | IGS | James et al. | |
| β-tubulin II | Geml et al. | ||
| DNA-directed RNA polymerase II subunit A | Matheny | ||
| DNA-directed RNA polymerase II subunit B | universal, | Matheny | |
| Translation elongation factor 1 alpha | universal, | Buyck and Hofstetter | |
| hypothetical protein | Stielow et al. | ||
| Phosphoglycerate kinase | Al-Hatmi et al. | ||
| DNA topoisomerase I | Al-Hatmi et al. | ||
| Cytochrome c oxidase subunit I | Pino-Bodas et al. | ||
| Cytochrome c oxidase subunit II | Choi et al. |