| Literature DB >> 32661237 |
Tomáš Větrovský1, Daniel Morais1, Petr Kohout1, Clémentine Lepinay1, Camelia Algora1, Sandra Awokunle Hollá1, Barbara Doreen Bahnmann1, Květa Bílohnědá1, Vendula Brabcová1, Federica D'Alò2, Zander Rainier Human1, Mayuko Jomura3, Miroslav Kolařík1, Jana Kvasničková1, Salvador Lladó1, Rubén López-Mondéjar1, Tijana Martinović1, Tereza Mašínová1, Lenka Meszárošová1, Lenka Michalčíková1, Tereza Michalová1, Sunil Mundra4,5, Diana Navrátilová1, Iñaki Odriozola1, Sarah Piché-Choquette1, Martina Štursová1, Karel Švec1, Vojtěch Tláskal1, Michaela Urbanová1, Lukáš Vlk1, Jana Voříšková1, Lucia Žifčáková1, Petr Baldrian6.
Abstract
Fungi are key players in vital ecosystem services, spanning carbon cycling, decomposition, symbiotic associations with cultivated and wild plants and pathogenicity. The high importance of fungi in ecosystem processes contrasts with the incompleteness of our understanding of the patterns of fungal biogeography and the environmental factors that drive those patterns. To reduce this gap of knowledge, we collected and validated data published on the composition of soil fungal communities in terrestrial environments including soil and plant-associated habitats and made them publicly accessible through a user interface at https://globalfungi.com . The GlobalFungi database contains over 600 million observations of fungal sequences across > 17 000 samples with geographical locations and additional metadata contained in 178 original studies with millions of unique nucleotide sequences (sequence variants) of the fungal internal transcribed spacers (ITS) 1 and 2 representing fungal species and genera. The study represents the most comprehensive atlas of global fungal distribution, and it is framed in such a way that third-party data addition is possible.Entities:
Mesh:
Year: 2020 PMID: 32661237 PMCID: PMC7359306 DOI: 10.1038/s41597-020-0567-7
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Map of locations of samples contained in the GlobalFungi database. Each point represents one or several samples where fungal community composition was reported using high-throughput-sequencing methods targeting the ITS1 or ITS2 marker of fungi. The map was created using the ‘leaflet’ package that uses an open-source JavaScript library for mobile-friendly interactive maps (Leaflet 1.6.0, GNU General Public License).
Fig. 2Processing of raw sequencing data for the GlobalFungi database. Workflow of processing of sequencing data included in the GlobalFungi database.
List of metadata contained in the GlobalFungi database.
| Metadata identifier | Unit | Description of content | Source |
|---|---|---|---|
| Sample ID | unique identifier | generated | |
| Longitude | degrees | Geographical longitude | original paper |
| Latitude | degrees | Geographical latitude | original paper |
| Continent | One of the following: Africa/Antarctica/Asia/Australia/Europe/North America/South America | original paper | |
| Sample type | One of the following: soil/rhizosphere soil/litter/litter + humus/deadwood/lichen/shoot/root | original paper | |
| Biome | One of the following: forest biome/woodland biome/shrubland biome/grassland biome/desert biome/tundra biome/mangrove biome/anthropogenic terrestrial biome/marine biome/freshwater biome/polar desert biome | original paper | |
| Sampling year | Year of sample collection | original paper | |
| Primers | Primers used | original paper | |
| pH | pH | original paper | |
| ITS total | Number of full ITS sequences extracted | generated | |
| MAT (°C) | °C | Mean annual temperature from CHELSA database | CHELSA |
| MAP (mm) | mm | Mean annual precipitation from CHELSA database | CHELSA |
The table lists identifiers, units and sources of metadata contained in the database with the description of their content. The data source “original paper” may also represent additional metadata provided by the authors of the paper.
List of identifiers and source database of the raw sequencing datasets used.
| Database | Accession Identifiers (in superscripts, respectively: dataset reference, study reference(s)) |
|---|---|
| National Center for Biotechnology Information Sequence Read Archive | SRP001058[ |
| European Nucleotide Archive Sequence Read Archive | ERP001713[ |
| DNA Data Bank of Japan | DRA000926[ |
| Dryad Digital Repository | 10.5061/dryad.2fc32[ |
| GenBank | KAYV00000000.1[ |
| Australian Antarctic Data Center database | 10.4225/15/526f42ada05b1[ |
| Supplemental Data | Hartmann et al. (2012)Supplementary_Data2,[ |
List of primer pairs to amplify general fungal communities in the samples included in this Data Descriptor[7,17].
| Primer pair | Sequence |
| 1737F / 2043 R | GGAAGTAAAAGTCGTAACAAGG / GCTGCGTTCTTCATCGATGC |
| 18S-F / 5.3S-R | GTAAAAGTCGTAACAAGGTTTC / GTTCAAAGAYTCGATGATTCAC |
| 58A2F / ITS4 | ATCGATGAAGAACGCAG / TCCTCCGCTTATTGATATGC |
| fITS7 / ITS4 | GTGARTCATCGAATCTTTG / TCCTCCGCTTATTGATATGC |
| fITS9 / ITS4 | GAACACAGCGAAATGTGA / TCCTCCGCTTATTGATATGC |
| gITS7 / ITS4 | GTGARTCATCGARTCTTTG / TCCTCCGCTTATTGATATGC |
| gITS7 / ITS4ngs | GTGARTCATCGARTCTTTG / TTCCTSCGCTTATTGATATGC |
| ITS1 / ITS2 | TCCGTAGGTGAACCTGCGG / GCTGCGTTCTTCATCGATGC |
| ITS1 / ITS4 | TCCGTAGGTGAACCTGCGG / TCCTCCGCTTATTGATATGC |
| ITS1F / 58A2R | CTTGGTCATTTAGAGGAAGTAA / CTGCGTTCTTCATCGAT |
| ITS1F / gITS7 | CTTGGTCATTTAGAGGAAGTAA / GTGARTCATCGARTCTTTG |
| ITS1F / ITS2 | CTTGGTCATTTAGAGGAAGTAA / GCTGCGTTCTTCATCGATGC |
| ITS1F / ITS3R | CTTGGTCATTTAGAGGAAGTAA / TCCTCCGCTTATTGATATGC |
| ITS1F / ITS4 | CTTGGTCATTTAGAGGAAGTAA / TCCTCCGCTTATTGATATGC |
| ITS1F_KYO1 / ITS2_KYO1 | CTHGGTCATTTAGAGGAASTAA / CTRYGTTCTTCATCGDT |
| ITS1F_KYO1 / ITS2_KYO2 | CTHGGTCATTTAGAGGAASTAA / TTYRCTRCGTTCTTCATC |
| ITS1F_KYO2 / ITS2_KYO2 | TAGAGGAAGTAAAAGTCGTAA / TTYRCTRCGTTCTTCATC |
| ITS1F_KYO2 / LR3 | TAGAGGAAGTAAAAGTCGTAA / CCGTGTTTCAAGACGGG |
| ITS1FI2 / 5.8 S | GAACCWGCGGARGGATCA / CGCTGCGTTCTTCATCG |
| ITS1FI2 / ITS2 | GAACCWGCGGARGGATCA / GCTGCGTTCTTCATCGATGC |
| ITS1Fngs / ITS2 | GGTCATTTAGAGGAAGTAA / GCTGCGTTCTTCATCGATGC |
| ITS1ngs / ITS2 | TCCGTAGGTGAACCTGC / GCTGCGTTCTTCATCGATGC |
| ITS1-OF / ITS4 | AACT(C)GG(C)CATTTAGAGGAAGT / TCCTCCGCTTATTGATATGC |
| ITS2 / ITS4 | GCTGCGTTCTTCATCGATGC / TCCTCCGCTTATTGATATGC |
| ITS2 / ITS5 | GCTGCGTTCTTCATCGATGC / GGAAGTAAAAGTCGTAACAAGG |
| ITS2F / ITS2R | GCATCGATGAAGAACGC / CCTCCGCTTATTGATATGC |
| ITS3 / ITS4 | GCATCGATGAAGAACGCAGC / TCCTCCGCTTATTGATATGC |
| ITS3_KYO2 / ITS4 | GATGAAGAACGYAGYRAA / TCCTCCGCTTATTGATATGC |
| ITS3_KYO2 / ITS4F | GATGAAGAACGYAGYRAA / CGCTTATTRATATGCTTAAXGXT |
| ITS3_KYO2 / LR_KYO1b | GATGAAGAACGYAGYRAA / MGCWGCATTCCCAAACWA |
| ITS3ngs mix / ITS4ngr mix + ARCH-ITS4-F1 | CA(T)CGATGAAGAACG(C)(A)G / TCCT(S)(C)(G)CTTATTGATATGC |
| ITS3ngs1 to ITS3ngs11 / ITS4ngs | (C)(A)(T)CGATGAAGA(A)CG(C)(A)G / TCCTSCGCTTATTGATATGC |
| ITS3tagmix1 to ITS3tagmix5 / ITS4ngs | CTAGACTCGTCA(T)CGATGAAGAACG(CA)G / TTCCTSCGCTTATTGATATGC |
| ITS4 / fITS9 | TCCTCCGCTTATTGATATGC / GAACACAGCGAAATGTGA |
| ITS4 / ITS3_KYO2 | TCCTCCGCTTATTGATATGC / GATGAAGAACGYAGYRAA |
| ITS4 / ITS5 | TCCTCCGCTTATTGATATGC / GGAAGTAAAAGTCGTAACAAGG |
| ITS4 / ITS9 | TCCTCCGCTTATTGATATGC / GCATTAGAAACTGCTCGTAATG |
| ITS5 / 5.8S_fungi | GGAAGTAAAAGTCGTAACAAGG / CAAGAGATCCGTTGTTGAAAGTT |
| ITS5 / ITS2 | GGAAGTAAAAGTCGTAACAAGG / GCTGCGTTCTTCATCGATGC |
| ITS5 / ITS2_KYO2 | GGAAGTAAAAGTCGTAACAAGG / TTYRCTRCGTTCTTCATC |
| ITS5 / ITS4 | GGAAGTAAAAGTCGTAACAAGG / TCCTCCGCTTATTGATATGC |
| ITS7o / ITS4 | GTGAATCATCRAATYTTTG / TCCTCCGCTTATTGATATGC |
| ITS86F / ITS4 | GTGAATCATCGAATCTTTGAA / TCCTCCGCTTATTGATATGC |
| ITS9 / ITS4 | GCATTAGAAACTGCTCGTAATG / TCCTCCGCTTATTGATATGC |
| ITS9F / ITS4 | CCATCTCATCCCTGCGTGTCTCCGACTCAG / TCCTCCGCTTATTGATATGC |
Fig. 3User interface to access the GlobalFungi database.
| Measurement(s) | internal_transcribed_spacer_region |
| Technology Type(s) | digital curation |
| Factor Type(s) | geographic location • sample type • biome • sampling year |
| Sample Characteristic - Organism | Fungi |
| Sample Characteristic - Environment | terrestrial biome |