| Literature DB >> 28228107 |
Fernanda Badotti1, Francislon Silva de Oliveira2, Cleverson Fernando Garcia1, Aline Bruna Martins Vaz3,4, Paula Luize Camargos Fonseca3, Laila Alves Nahum2,5, Guilherme Oliveira2,6, Aristóteles Góes-Neto7.
Abstract
BACKGROUND: Fungi are among the most abundant and diverse organisms on Earth. However, a substantial amount of the species diversity, relationships, habitats, and life strategies of these microorganisms remain to be discovered and characterized. One important factor hindering progress is the difficulty in correctly identifying fungi. Morphological and molecular characteristics have been applied in such tasks. Later, DNA barcoding has emerged as a new method for the rapid and reliable identification of species. The nrITS region is considered the universal barcode of Fungi, and the ITS1 and ITS2 sub-regions have been applied as metabarcoding markers. In this study, we performed a large-scale analysis of all the available Basidiomycota sequences from GenBank. We carried out a rigorous trimming of the initial dataset based in methodological principals of DNA Barcoding. Two different approaches (PCI and barcode gap) were used to determine the performance of the complete ITS region and sub-regions.Entities:
Keywords: Barcode gap; Basidiomycota; ITS; ITS1; ITS2; Probable correct identification
Mesh:
Substances:
Year: 2017 PMID: 28228107 PMCID: PMC5322588 DOI: 10.1186/s12866-017-0958-x
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Fig. 1Pie charts represent abundance (number) of sequences (a) and species (b) for the three subphyla represented in the dataset used in this study. The histograms show the number of species and sequences distributed for genera belonging to Agaricomycotina (c), Pucciniomycotina (d) and Ustilagomycotina (e) phylum
Probable Correct Identification (PCI) values (%) for all of the Basidiomycota genera from our trimmed dataset. The PCI values were estimated for the three genomic regions studied, the complete ITS region (ITS1 + 5.8S + ITS2) and the sub-regions ITS1 and ITS2
| Genera | ITS (ITS1 + 5.8S+ ITS2) | ITS1 | ITS2 |
|---|---|---|---|
|
| 100 | 100 | 100 |
|
| 44 | 33 | 33 |
|
| 24 | 27 | 27 |
|
| 25 | 25 | 25 |
|
| 100 | 100 | 100 |
|
| 70 | 70 | 70 |
|
| 75 | 50 | 75 |
|
| 20 | 20 | 20 |
|
| 80 | 40 | 60 |
|
| 32 | 26 | 47 |
|
| 67 | 33 | 67 |
|
| 0 | 0 | 0 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 50 | 50 | 50 |
|
| 100 | 100 | 100 |
|
| 33 | 33 | 33 |
|
| 50 | 50 | 50 |
|
| 50 | 50 | 50 |
|
| 36 | 45 | 37 |
|
| 50 | 50 | 50 |
|
| 50 | 50 | 33 |
|
| 100 | 100 | 100 |
|
| 0 | 50 | 0 |
|
| 100 | 100 | 100 |
|
| 100 | 86 | 93 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 67 |
|
| 100 | 50 | 75 |
|
| 50 | 25 | 50 |
|
| 100 | 100 | 100 |
|
| 43 | 43 | 43 |
|
| 100 | 67 | 67 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 50 |
|
| 67 | 67 | 60 |
|
| 42 | 37 | 32 |
|
| 100 | 75 | 100 |
|
| 0 | 0 | 0 |
|
| 56 | 56 | 56 |
|
| 50 | 50 | 50 |
|
| 0 | 0 | 0 |
|
| 67 | 67 | 33 |
|
| 100 | 100 | 100 |
|
| 60 | 60 | 80 |
|
| 100 | 100 | 100 |
|
| 0 | 0 | 0 |
|
| 30 | 19 | 28 |
|
| 0 | 0 | 13 |
|
| 37 | 37 | 41 |
|
| 50 | 50 | 50 |
|
| 43 | 43 | 29 |
|
| 57 | 57 | 71 |
|
| 92 | 83 | 75 |
|
| 100 | 100 | 100 |
|
| 90 | 100 | 80 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 67 |
|
| 50 | 50 | 33 |
|
| 83 | 83 | 33 |
|
| 40 | 40 | 20 |
|
| 38 | 38 | 25 |
|
| 70 | 80 | 50 |
|
| 0 | 0 | 0 |
|
| 22 | 22 | 33 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 100 | 67 | 67 |
|
| 0 | 0 | 33 |
|
| 50 | 50 | 50 |
|
| 0 | 0 | 0 |
|
| 67 | 67 | 67 |
|
| 100 | 33 | 67 |
|
| 86 | 86 | 71 |
|
| 50 | 50 | 50 |
|
| 0 | 0 | 0 |
|
| 29 | 43 | 14 |
|
| 27 | 27 | 23 |
|
| 100 | 100 | 86 |
|
| 100 | 100 | 100 |
|
| 50 | 50 | 50 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 100 |
|
| 43 | 43 | 43 |
|
| 100 | 100 | 100 |
|
| 33 | 33 | 33 |
|
| 100 | 100 | 100 |
|
| 27 | 27 | 27 |
|
| 100 | 100 | 100 |
|
| 100 | 100 | 50 |
|
| 38 | 27 | 36 |
|
| 86 | 86 | 86 |
|
| 33 | 33 | 33 |
|
| 33 | 0 | 33 |
|
| 80 | 80 | 20 |
|
| 67 | 67 | 67 |
|
| 83 | 83 | 83 |
|
| 100 | 100 | 100 |
|
| 0 | 0 | 0 |
|
| 100 | 33 | 100 |
|
| 25 | 50 | 25 |
|
| 42 | 42 | 42 |
|
| 43 | 57 | 29 |
|
| 100 | 100 | 100 |
|
| 67 | 0 | 67 |
|
| 33 | 33 | 33 |
|
| 100 | 50 | 100 |
|
| 100 | 100 | 50 |
Fig. 2Pairwise correlations (a, ITS X ITS1, b, ITS X ITS2 and c, ITS1 X ITS2) between PCI values of all genera from our dataset
Fig. 3Examples of the barcode gap performance classifications used in this study. a. Clear barcode gap (identification performance classified as good) for the genera Agaricus, b. Intermediate separation between the intra- and interspecific distances for Hebeloma and c. A poor barcode gap for Lactarius
Grouping of Basidiomycota genera based on the barcode gap analyses (See Additional file 5 for more details)
| Group 1 | Group 2 | Group 3 | Group 4 |
|---|---|---|---|
| Genera for which all the three genetic regions are | Genera for which one or two genetic regions showed a clearer barcode gap and are recommended over the other (s) | Genera for which most of the genetic regions showed | Genera for which all three genetic regions are |
|
|
|
|
|
Fig. 4PCI values for the genera classified in the Group 1 (a), Group 2 (b), Group 3 (c) and Group 4 (d) are represented for the ITS, ITS1 and ITS2 genomic region