| Literature DB >> 35891785 |
Zixuan Xie1,2, Chaysavanh Manichanh1,2.
Abstract
While analysis of the bacterial microbiome has become routine, that of the fungal microbiome is still hampered by the lack of robust databases and bioinformatic pipelines. Here, we present FunOMIC, a pipeline with built-in taxonomic (1.6 million marker genes) and functional (3.4 million non-redundant fungal proteins) databases for the identification of fungi. Applied to more than 2,600 human metagenomic samples, the tool revealed fungal species associated with geography, body sites, and diseases. Correlation network analysis provided new insights into inter-kingdom interactions. With this pipeline and two of the most comprehensive fungal databases, we foresee a fast-growing resource for mycobiome studies.Entities:
Keywords: CD, Crohn’s disease; ESRD, End-stage renal disease; FDR, False discovery rate; Fungal databases; GS, Gallstones; HC, Healthy control; HTS, High throughput sequencing; ITS, internal transcribed spacer; Inter-kingdom interactions; Mycobiome; NA, Not applicable; PLWH, People live with HIV; PSO, Psoriasis; SCFA, Short chain fatty acid; SCZ, Schizophrenia; Shotgun metagenomics; T1D, Type 1 diabetes; T2D, Type 2 diabetes; TB, Tuberculosis; Taxonomy and functions; UC, Ulcerative colitis
Year: 2022 PMID: 35891785 PMCID: PMC9293737 DOI: 10.1016/j.csbj.2022.07.010
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1Workflow of the construction of the FunOMIC database and its application in metagenomic analysis. (A) Recovery of fungal single-copy marker genes from fungal draft genomes and Candida isolate sequencing reads downloaded from NCBI and JGI. A.1) Distribution of the fungal draft genomes at the phylum and species levels in FunOMIC-T (Taxonomy). A.2) Distribution of Candida assemblies at the species level. (B) Fungal and bacterial taxonomic and functional profiling of the 2,679 metagenomic datasets downloaded from NCBI. B.1) Geographical location of the collected human metagenomes. B.2) Proportions of the collected human metagenomes by body sites. B.3) Proportions of human metagenomes by disease type (HIV = human immunodeficiency virus; T2D = type 2 diabetes; CD = Crohn’s disease; UC = ulcerative colitis; ESRD = end-stage renal disease; SCZ = schizophrenia). B.4) Distribution of the collected human metagenomes by gender.
Summary of the characteristics of the 1,950 human metagenomes.
| Body site | Country | Health status | Number of samples | Mechanical Lysis |
|---|---|---|---|---|
| USA | Filariasis | 1 | no | |
| Lyme disease | 1 | no | ||
| France | Infections | 24 | no | |
| China | HC | 100 | no | |
| Australia | NA | 8 | no | |
| Australia | HC | 56 | yes | |
| T1D | 60 | yes | ||
| Belgium | CD | 92 | yes | |
| Canada | PLWH | 10 | na | |
| China | HC | 204 | yes | |
| CD | 38 | yes | ||
| ESRD | 208 | yes | ||
| T2D | 89 | yes | ||
| NA | 15 | NA | ||
| Denmark | HC | 165 | no | |
| Israel | NA | 20 | na | |
| Italy | HC | 18 | yes | |
| Spain | HC | 63 | yes | |
| CD | 50 | yes | ||
| UC | 69 | yes | ||
| Sweden | T2D | 10 | yes | |
| USA | HC | 11 | no | |
| CD | 13 | na | ||
| HIV | 3 | na | ||
| PSO | 24 | no | ||
| UC | 10 | na | ||
| Infant-preterm | 140 | na | ||
| NA | 272 | na | ||
| Chile | HC | 6 | no | |
| Asthma | 5 | no | ||
| South Africa | TB | 4 | na | |
| USA | NA | 61 | na | |
| Italy | HC | 3 | yes | |
| Singapore | NA | 30 | na | |
| South Africa | TB | 10 | no | |
| USA | HC | 16 | no | |
| SCZ | 14 | no | ||
| Italy | HC | 12 | yes | |
| USA | mock communities | 15 | na |
PLWH = People live with HIV patients, PSO = Psoriasis, TB = Tuberculosis.
Fig. 2Fungal taxonomic profiling of several human body sites based on the 1950 shotgun metagenomic data using the FunOMIC-T database. Taxonomic profiling is displayed at the phylum, genus, and species levels. Only the mean relative abundance of the genera and species summing 90 % of the sequence data is exhibited. Gut taxonomic profiling was performed for diseases including Crohn’s disease (CD, n = 193; from the USA, Europe, and Asia), ulcerative colitis (UC, n = 79 from Europe and the USA), end-stage renal disease (ESRD, n = 208, from Asia), type 1 diabetes (T1D, n = 60 from Australia), and type 2 diabetes (T2D, n = 99 from Asia). 468 faecal samples did not have health status information in the metadata files. Health status and geo-localization of conjunctiva, nasal, and saliva samples are described in Table 1.
Fig. 3Effect size of variables on the mycobiome community. The impact of the covariates on mycobiome composition (A) and function (B) was tested by performing a univariate analysis (adonis2) on the 1,950 metagenomes. The effect was considered significant when FDR < 0.05.
Core fungal species of different body sites.
| Bodysite | Health status | Core fungal species (>50 % prevalence) |
|---|---|---|
| HC (n = 262) | ||
| CD (n = 109) | ||
| ESRD (n = 106) | ||
| UC (n = 55) | ||
| T1D (n = 40) | ||
| T2D (n = 50) | ||
| PSO (n = 16) | ||
| PLWH (n = 7) | ||
| Infant (n = 14) | ||
| HC (n = 5) | ||
| HC (n = 76) | ||
| NA (n = 38) | ||
| HC (n = 14) | ||
| SCZ (n = 12) | ||
| Infant (n = 8) | ||
| BJIs (n = 24) | ||
| GS (n = 8) |
Core fungal species of different countries.
| HC (n = 46) | ||
| T1D (n = 40) | ||
| CD (n = 76) | ||
| ESRD (n = 106) | ||
| T2D (n = 49) | ||
| PLWH (n = 7) | ||
| HC (n = 118) | ||
| Infant (n = 14) | ||
| HC (n = 57) | ||
| CD (n = 38) | ||
| UC (n = 52) | ||
| PSO (n = 16) |
Fig. 4Interaction of fungal and bacterial communities in gut microbiome under healthy conditions. Correlation network between the relative abundance of fungal and bacterial species in the gut mycobiome under healthy conditions from Spain (A) and Denmark (B) using the SparCC algorithm. Each node represents a fungal/bacterial/archaeal species and their sizes are determined by relative abundances. The colours of the edges connecting two nodes represent positive (red) and negative (blue) correlations. For a better visual effect, only correlations with p-values<0.001 and an absolute correlation coefficient over 0.05 are represented. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)