| Literature DB >> 21949797 |
Leho Tedersoo1, Kessy Abarenkov, R Henrik Nilsson, Arthur Schüssler, Gwen-Aëlle Grelet, Petr Kohout, Jane Oja, Gregory M Bonito, Vilmar Veldre, Teele Jairus, Martin Ryberg, Karl-Henrik Larsson, Urmas Kõljalg.
Abstract
Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21949797 PMCID: PMC3174234 DOI: 10.1371/journal.pone.0024940
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Scheme of the metadata annotation workflow.
Shaded boxes indicate procedures performed and/or saved over the PlutoF workbench (http://plutof.ut.ee/).
Quality and metadata annotations of fungal ITS sequences by mycorrhiza types.
| EcM fungi | AM fungi | ErM fungi | OM fungi | All fungi | ||
| Studies | INSD original | 1242 | 127 | 78 | 93 | 12004 |
| Fungal ITS sequences | INSD original | 28791 | 3176 | 1457 | 2267 | 183208 |
| Chimeric sequences | INSD original | nd | nd | nd | nd | nd |
| Annotated | nd | nd | 46 | 11 | 677 | |
| Low-quality sequences | INSD original | nd | nd | nd | nd | nd |
| Annotated | 1970 | 25 | 56 | 121 | 2572 | |
| Sequences annotated for country | INSD original | 16390 | 2319 | 751 | 1150 | 104426 |
| Annotated | 21891 | 2603 | 1253 | 2053 | 111647 | |
| Sequences annotated for geocode | INSD original | 2276 | 271 | 163 | 91 | 11331 |
| Annotated | 9566 | 990 | 518 | 696 | 30702 | |
| Sequences annotated for interacting taxon (host) | INSD original | 6272 | 835 | 1093 | 1608 | 43126 |
| Annotated | 8890 | 2337 | 1455 | 2165 | 52551 | |
| Sequences annotated for isolation source | INSD original | 10891 | 2274 | 991 | 1335 | 30030 |
| Annotated | 27686 | 3000 | 1456 | 2209 | 79353 | |
| Sequences/isolates with experimental evidence for function | INSD original | nd | nd | nd | nd | nd |
| Annotated | nd | nd | 226 | 140 | nd |
all sequences belonging to EcM lineages regardless of isolation source;
all sequences belonging to Glomeromycota, except Geosiphon;
all sequences derived from roots (with or without a culturing step) of the respective host plants;
Chimeric sequences consist of two or more fragments of fungal sequences and are therefore not assigned below the kingdom level;
nd, not determined;
Annotated–sum of original annotations and metadata provided in the course of this study;
formation of coils in ErM, formation of pelotons, stimulation of germination or development in OM.
Contribution of different isolation sources to the number of INSD sequences in the most common ectomycorrhizal fungal lineages.
| Lineage | Fruit-body | Ectomycorrhiza | Soil and mycelium | Total |
| /russula-lactarius | 1772 | 1638 | 1238 | 5023 |
| /cortinarius | 2714 | 1001 | 363 | 4241 |
| /tomentella-thelephora | 310 | 1454 | 478 | 2840 |
| /inocybe | 1126 | 470 | 399 | 2139 |
| /tuber-helvella | 1410 | 285 | 37 | 1828 |
| /suillus-rhizopogon | 744 | 610 | 119 | 1552 |
| /boletus | 697 | 213 | 33 | 984 |
| /hebeloma-alnicola | 422 | 174 | 97 | 864 |
| /tricholoma | 466 | 172 | 125 | 862 |
| /amanita | 532 | 118 | 41 | 732 |
| /pisolithus-scleroderma | 458 | 77 | 43 | 702 |
| /piloderma | 14 | 390 | 227 | 657 |
| /sebacina | 29 | 358 | 50 | 592 |
| /cenococcum | 0 | 408 | 45 | 515 |
| /amphinema-tylospora | 2 | 284 | 163 | 468 |
| /clavulina | 58 | 159 | 108 | 414 |
| /wilcoxina | 2 | 119 | 249 | 404 |
| /cantharellus | 123 | 193 | 25 | 361 |
| /laccaria | 83 | 158 | 71 | 352 |
| /ramaria-gautieria | 214 | 33 | 7 | 275 |
| /hydnellum-sarcodon | 219 | 13 | 5 | 238 |
| /terfezia-peziza depressa | 104 | 69 | 29 | 221 |
| /meliniomyces | 0 | 144 | 31 | 218 |
| /paxillus-gyrodon | 72 | 45 | 5 | 206 |
| /genea-humaria | 74 | 101 | 5 | 199 |
| /geopora | 124 | 35 | 7 | 180 |
| /albatrellus | 132 | 20 | 3 | 156 |
| /hygrophorus | 67 | 27 | 48 | 146 |
| /pseudotomentella | 16 | 88 | 12 | 129 |
| /phellodon-bankera | 119 | 2 | 6 | 128 |
| /tomentellopsis | 31 | 57 | 12 | 109 |
| All lineages | 12465 | 9315 | 4214 | 28791 |
Figure 2Screenshot of the UNITE search page (http://unite.ut.ee/).
This search tool allows queries of sequences based on taxonomic information and associated metadata.