| Literature DB >> 32463383 |
Irene Martorelli1,2, Leon S Helwerda1, Jesse Kerkvliet2, Sofia I F Gomes2, Jorinde Nuytinck2, Chivany R A van der Werff1, Guus J Ramackers1, Alexander P Gultyaev1, Vincent S F T Merckx2,3, Fons J Verbeek1.
Abstract
Fungi have crucial roles in ecosystems, and are important associates for many organisms. They are adapted to a wide variety of habitats, however their global distribution and diversity remains poorly documented. The exponential growth of DNA barcode information retrieved from the environment is assisting considerably the traditional ways for unraveling fungal diversity and detection. The raw DNA data in association to environmental descriptors of metabarcoding studies are made available in public sequence read archives. While this is potentially a valuable source of information for the investigation of Fungi across diverse environmental conditions, the annotation used to describe environment is heterogenous. Moreover, a uniform processing pipeline still needs to be applied to the available raw DNA data. Hence, a comprehensive framework to analyses these data in a large context is still lacking. We introduce the MycoDiversity DataBase, a database which includes public fungal metabarcoding data of environmental samples for the study of biodiversity patterns of Fungi. The framework we propose will contribute to our understanding of fungal biodiversity and aims to become a valuable source for large-scale analyses of patterns in space and time, in addition to assisting evolutionary and ecological research on Fungi.Entities:
Keywords: DNA barcoding; biogeography; data integration; environmental samples; fungal biodiversity
Mesh:
Year: 2020 PMID: 32463383 PMCID: PMC7734503 DOI: 10.1515/jib-2019-0046
Source DB: PubMed Journal: J Integr Bioinform ISSN: 1613-4516
Figure 1:Data acquisition workflow for retrieving data from Pubmed and SRA repositories. Once the data is retrieved, the final step of the data acquisition involves the curation of the two data types, the metadata retrieved from PubMed and SRA (Metadata Curation), and of the raw HTS data of the SRA fastq files (Sequencing Data Curation).
Regular expressions used for the detection of the sequence archive prefixes in the text of publications. For each regular expression, we provide the total number of publications for which a match is detected.
| Regular | INSDC DB | Number of Publications |
|---|---|---|
| SRA | SRA | 5 |
| SRP | SRA | 8 |
| SRS | SRA | 2 |
| SRX | SRA | 1 |
| PRJNA | BioProject | 5 |
| PRJEB | BioProject | 1 |
| ERP | ENA | 0 |
| ERS | ENA | 0 |
Figure 2:Components and Dependencies of the fungal metabarcoding data integration.
Figure 3:UML data model of the MDDB.
The results corresponding to the Query Tests conducted on different DBMSs. We have performed five runs for each query and taken the mean and standard deviation (SD) value. The results are displayed in seconds (unit).
| Query tests conducted on different DBMS | |||
|---|---|---|---|
| SQLite | MySQL | MonetDB | |
| DBMS size (MB) | 89 | 136 | 63 |
| Query | |||
| Q1: Mean | 0.242 | 0.238 | 0.018 |
| Q1: SD | 0.009 | 0.493 | 0.007 |
| Q2: Mean | 0.858 | 8.606 | 0.018 |
| Q2: SD | 0.005 | 0.005 | 0.004 |
| Q3: Mean | 0.918 | 0.116 | 0.015 |
| Q3: SD | 0.003 | 0.005 | 0.001 |
Post processing update steps for the incorporation of additional FASTA files containing processed ZOTU sequence. Each update includes 100 FASTA files (average 426 ZOTUs/file) obtained from 100 distinct samples of a study. The size displayed are for csv format of the Reference ZOTU and Contain datasets while the execution time is displayed in HH:MM:SS format.
| Post processing PROFUNGIS | ||||||
|---|---|---|---|---|---|---|
| Run | Input | Output | Execution | |||
| ZOTUs in FASTA | ZOTUs in Reference | Records in contains | ZOTUs in Reference | Records in contains | ||
| Update 1 | 47529 | empty | empty | 35981 | 48086 | 00:01:01 |
| 12,5 MB | 9.9 MB | 13,6 MB | ||||
| Update 2 | 43481 | 35981 | 48086 | 73699 | 91567 | 00:02:05 |
| 11,4 MB | 9.9 MB | 13,6 MB | 20,2 MB | 26 MB | ||
| Update 3 | 42508 | 73699 | 91567 | 110289 | 134074 | 00:04:36 |
| 11,2 MB | 20,2 MB | 26 MB | 30,2 MB | 38 MB | ||
| Update 4 | 42374 | 110289 | 134074 | 144193 | 177005 | 00:05:20 |
| 11,1 MB | 30,2 MB | 38 MB | 39,5 MB | 50,2 MB | ||
| Update 5 | 37535 | 144193 | 177005 | 172463 | 213984 | 00:06:43 |
| 9.9 MB | 39,5 MB | 50,2 MB | 47,3 MB | 60,7 MB | ||
Type and amount of data contained in MDDB.
| Data type | Total |
|---|---|
| Articles | 25 |
| SRA submissions | 21 |
| SRA Studies (SRP) | 21 |
| 454 GS | 16 |
| Illumina | 3 |
| Ion Torrent | 2 |
| SRA Experiments (SRX) | 4065 |
| SRA Run Files (SRR) | 4470 (37.65 Gb) |
| SRR processed | 511 (2.67 Gb) |
| Raw sequences processed | 3037390 |
| ZOTUs generated | 172463 |
| ZOTUs assigned to Fungi | 110910 |
| SRA Samples (SRS) | 3997 |
| SRA SRS curated | 512 |
Amount of ZOTUs assigned for each Phylum in every continental region.
| Phylum | Africa | Asia | Europe | North America | Oceania | South America |
|---|---|---|---|---|---|---|
| Basidiomycota | 3940 | 10573 | 20462 | 3865 | 8404 | 4117 |
| Ascomycota | 3862 | 9499 | 15563 | 3049 | 9542 | 6783 |
| Mortierellomycota | 207 | 1244 | 5331 | 505 | 1000 | 708 |
| Unidentified | 135 | 176 | 139 | 68 | 186 | 103 |
| Mucoromycota | 104 | 423 | 2479 | 338 | 804 | 269 |
| Chytridiomycota | 32 | 91 | 91 | 13 | 93 | 30 |
| Rozellomycota | 28 | 117 | 173 | 62 | 111 | 51 |
| Glomeromycota | 12 | 13 | 14 | 12 | 31 | 0 |
| Other | 8 | 41 | 48 | 22 | 28 | 6 |
Figure 4:Major phylum ZOTU representatives for each continent.
Figure 5:Diversity of ZOTUs belonging to the Russulaceae family and their distribution display in a spatial range. Map display by using Marker GeoCharts (http://developers.google.com/chart/interactive/docs/gallery/geochart).
ZOTUs representatives for each continent and common ZOTUs among continents.
| Amount | Globe | Africa | Asia | Europe | North America | Oceania | South America |
|---|---|---|---|---|---|---|---|
| Number of countries | 38 | 5 | 11 | 10 | 5 | 3 | 4 |
| Number of samples | 512 | 35 | 85 | 217 | 28 | 88 | 59 |
| Number of ZOTUs | 172463 | 13355 | 33178 | 68566 | 11414 | 31247 | 19324 |
| Number of ZOTUs shared betweeen continents | 497 | 2192 | 2191 | 926 | 1306 | 929 |
Figure 6:Overlap of ZOTUs among continents.
Selection of publications describing metabarcoding studies.
| Article | Year | PMID | Study | SRA Study Title | PMID C.R. | SRA C.R. |
|---|---|---|---|---|---|---|
| 1. Fungal biogeography. Global diversity and geography of soil fungi. | 2014 | 25430773 | SRP043706 | Global fungal diversity | Y | Null |
| DOI: | ||||||
| 2. New primers to amplify the fungal ITS2 region–evaluation by 454-sequencing of artificial and natural communities. | 2012 | 22738186 | SRP012868 | Design and test new primers to be used to amplify the fungal ITS2 region by targeting sites in the 5.8S encoding gene | Null | Null |
| DOI: | ||||||
| 3. Strong altitudinal partitioning in the distributions of ectomycorrhizal fungi along a short (300 m) elevation gradient. | 2015 | 25655082 | SRP043982 | Root associated fungi Metagenome | Y | Null |
| DOI: | ||||||
| 4. Roots and associated fungi drive long-term carbon sequestration in boreal forest. | 2013 | 23539604 | SRP016090 | The Island Project – Fungal communities in boreal forest soils | Null | Null |
| DOI: | ||||||
| 5. ITS1 versus ITS2 as DNA metabarcodes for fungi. | 2013 | 23350562 | SRP026239 | Analysis of ITS1 and ITS2 regions for barcoding fungal specimens | Null | Y |
| DOI: | ||||||
| 6. Dispersal in microbes: fungi in indoor air are dominated by outdoor air and show dispersal limitation at short distances. | 2013 | 23426013 | SRP015917 | Fungal ITS amplicons from airborne household dust | Null | Null |
| DOI: | ||||||
| 7. Quantifying microbial communities with 454 pyrosequencing: does read abundance count? | 2010 | 21050295 | SRP001800 | Pyrosequencing of Global House Dust | Null | Y |
| DOI: | ||||||
| 8. Indoor fungal composition is geographically patterned and morediverse n temperate zones than in the tropics. | 2010 | 20616017 | SRP001800 | Pyrosequencing of Global House Dust | Null | Y |
| DOI: | ||||||
| 9. Spatio-temporal dynamics of soil bacterial communities as a function of Amazon forest phenology. | 2018 | 29531240 | GENBANK | Null | Null | Null |
| DOI: | ||||||
| 10. Phylogenetic relatedness explains highly interconnected and nested symbiotic networks of woody plants and arbuscular mycorrhizal fungi in a Chinese subtropical forest. | 2017 | 28207957 | SRP042134 | Plant roots in subtropical forest Metagenome | Y | Null |
| DOI: | ||||||
| 11. Convergence and contrast in the community structure of Bacteria, Fungi and Archaea along a tropical elevation-climate gradient. | 2017 | 28402397 | SRP102378 | Hawaii Soils Raw sequence reads | Null | Null |
| DOI: | ||||||
| 12. FUNGAL SYMBIONTS. Global assessment of arbuscular mycorrhizal fungus diversity reveals very low endemism. | 2015 | 26315436 | ERP010906 | Global assessment of arbuscular mycorrhizal fungus diversity reveals very low endemism | Null | Null |
| DOI: | ||||||
| 13. Arbuscular mycorrhizal interactions of mycoheterotrophic Thismia are more specialized than in autotrophic plants. | 2017 | 27739593 | SRP083901 | Arbuscular mycorrhizal interactions of mycoheterotrophic Thismia are more specialized than autotrophic plants | Null | Null |
| DOI: | ||||||
| 14. A phosphorus threshold for mycoheterotrophic plants in tropical forests. | 2017 | 28148744 | SRP076949 | Study of fertilization, litter addition, and soil phosphorus effects on fungal and arbuscular mycorrhizal fungal communities in Panamanian tropical forest | Null | Null |
| DOI: | ||||||
| 15. Fungal-host diversity among mycoheterotrophic plants increases proportionally to their fungal-host overlap. | 2017 | 28515898 | SRP082976 | Host diversity increases proportionally to host overlap among mycoheterotrophic plants | Null | Null |
| DOI: | ||||||
| 16. Consistent responses of soil microbial communities to elevated nutrient inputs in grasslands across the globe. | 2015 | 26283343 | SRP052716 | Soil marker gene sequences across the Nutrient Network | Y | Null |
| DOI: | ||||||
| 17. Severe plant invasions can increase mycorrhizal fungal abundance and diversity. | 2013 | 23486251 | SRA037764 | Not Accessible | Null | Null |
| DOI: | ||||||
| 18. Host identity is a dominant driver of mycorrhizal fungal community composition during ecosystem development. | 2015 | 25640965 | SRP045608 | Host preference drives mycorrhizal fungal niche differentiation throughout ecosystem development | Y | Null |
| DOI: | ||||||
| 19. Fungal community analysis by large-scale sequencing of environmental samples. | 2005 | 16151147 | GENBANK | Null | Null | Null |
| DOI: | ||||||
| 20. Comparison and Validation of Some ITS Primer Pairs Useful for Fungal Metabarcoding Studies | 2014 | 24933453 | SRP026207 | Pioneer pine forest soil Targeted Locus (Loci) | Null | Null |
| DOI: | ||||||
| 21. Lack of host specificity leads to independent assortment of dipterocarps and ectomycorrhizal fungi across a soil fertility gradient. | 2015 | 26032408 | SRP057798 | Root associated fungi Targeted loci environmental | Null | Y |
| DOI: | ||||||
| 22. Meta-analysis of deep-sequenced fungal communities indicates limited taxon sharing between studies and the presence of biogeographic patterns. | 2017 | 24111803 | Multi | Null | Null | Null |
| DOI: | ||||||
| 23. Sequence Depth, Not PCR Replication, Improves Ecological Inference from Next Generation DNA Sequencing. | 2014 | 24587293 | SRP035367 | DOB_RepEx Targeted Locus (Loci) | Null | Y |
| DOI: | ||||||
| 24. Strong linkage between plant and soil fungal communities along a successional coastal dune system. | 2016 | 27411980 | SRP059280 | Ectomycorrhizal fungal communities in a relic foredune plain | Null | Null |
| DOI: | ||||||
| 25. Fungal endophyte communities reflect environmental structuring across a Hawaiian landscape. | 2012 | 22837398 | SRP013695 | Fungal ITS1 amplicon library from environmental samples Targeted Locus (Loci) | Null | Null |
| DOI: |