| Literature DB >> 30035303 |
Kessy Abarenkov1, Panu Somervuo2, R Henrik Nilsson3,4, Paul M Kirk5, Tea Huotari6, Nerea Abrego6, Otso Ovaskainen2,7.
Abstract
Incompleteness of reference sequence databases and unresolved taxonomic relationships complicates taxonomic placement of fungal sequences. We developed Protax-fungi, a general tool for taxonomic placement of fungal internal transcribed spacer (ITS) sequences, and implemented it into the PlutoF platform of the UNITE database for molecular identification of fungi. With empirical data on root- and wood-associated fungi, Protax-fungi reliably identified (with at least 90% identification probability) the majority of sequences to the order level but only around one-fifth of them to the species level, reflecting the current limited coverage of the databases. Protax-fungi outperformed the Sintax and Rdb classifiers in terms of increased accuracy and decreased calibration error when applied to data on mock communities representing species groups with poor sequence database coverage. We applied Protax-fungi to examine the internal consistencies of the Index Fungorum and UNITE databases. This revealed inconsistencies in the taxonomy database as well as mislabelling and sequence quality problems in the reference database. The according improvements were implemented in both databases. Protax-fungi provides a robust tool for performing statistically reliable identifications of fungi in spite of the incompleteness of extant reference sequence databases and unresolved taxonomic relationships.Keywords: annotation; data quality; environmental sequencing; fungi; identification tool; internal transcribed spacer (ITS); molecular species identification; probabilistic taxonomic assignment
Mesh:
Substances:
Year: 2018 PMID: 30035303 DOI: 10.1111/nph.15301
Source DB: PubMed Journal: New Phytol ISSN: 0028-646X Impact factor: 10.151