| Literature DB >> 36220843 |
Kiana A West1, Xiaochen Yin1, Erica M Rutherford1, Brendan Wee1, Jinlyung Choi1, Brianna S Chrisman2, Kaiti L Dunlap3, Roberta L Hannibal1, Wiputra Hartono1, Michelle Lin1, Edward Raack1, Kayleen Sabino1, Yonggan Wu1,4, Dennis P Wall3,5,6, Maude M David7,8, Karim Dabbagh1, Todd Z DeSantis9, Shoko Iwai1.
Abstract
Observational studies have shown that the composition of the human gut microbiome in children diagnosed with Autism Spectrum Disorder (ASD) differs significantly from that of their neurotypical (NT) counterparts. Thus far, reported ASD-specific microbiome signatures have been inconsistent. To uncover reproducible signatures, we compiled 10 publicly available raw amplicon and metagenomic sequencing datasets alongside new data generated from an internal cohort (the largest ASD cohort to date), unified them with standardized pre-processing methods, and conducted a comprehensive meta-analysis of all taxa and variables detected across multiple studies. By screening metadata to test associations between the microbiome and 52 variables in multiple patient subsets and across multiple datasets, we determined that differentially abundant taxa in ASD versus NT children were dependent upon age, sex, and bowel function, thus marking these variables as potential confounders in case-control ASD studies. Several taxa, including the strains Bacteroides stercoris t__190463 and Clostridium M bolteae t__180407, and the species Granulicatella elegans and Massilioclostridium coli, exhibited differential abundance in ASD compared to NT children only after subjects with bowel dysfunction were removed. Adjusting for age, sex and bowel function resulted in adding or removing significantly differentially abundant taxa in ASD-diagnosed individuals, emphasizing the importance of collecting and controlling for these metadata. We have performed the largest (n = 690) and most comprehensive systematic analysis of ASD gut microbiome data to date. Our study demonstrated the importance of accounting for confounding variables when designing statistical comparative analyses of ASD- and NT-associated gut bacterial profiles. Mitigating these confounders identified robust microbial signatures across cohorts, signifying the importance of accounting for these factors in comparative analyses of ASD and NT-associated gut profiles. Such studies will advance the understanding of different patient groups to deliver appropriate therapeutics by identifying microbiome traits germane to the specific ASD phenotype.Entities:
Mesh:
Year: 2022 PMID: 36220843 PMCID: PMC9554176 DOI: 10.1038/s41598-022-21327-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Summary of datasets.
| Dataset | Publication | Technology | Sequencing instrument | 16S region | ASD (n) | NT (n) | Minimum age (years) | Maximum age (years) | Median age (years) | Male (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| DS1 | Kang et al. 2013 | 16S HTS | 454 Genome Sequencer FLX Titanium | V2V3 | 20 | 20 | 3 | 16 | 6 | 87.5 |
| DS2 | Kang et al. 2018 | 16S HTS | 454 Genome Sequencer FLX Titanium | V2V3 | 23 | 21 | 4 | 17 | 9 | 84.1 |
| DS3 | Kang et al. 2017 Kang et al. 2019 | 16S HTS | Illumina MiSeq | V4 | 18 | 20 | 7.1 | 16.7 | 11.05 | 89.5 |
| DS4 | Li et al. 2019 | 16S HTS | Illumina HiSeq | V1V2 | 59 | 30 | 2 | 10 | Not reported | Not reported |
| DS5 | Strati et al. 2017 | 16S HTS | 454 Genome Sequencer FLX | V3V5 | 39 | 39 | Not reported | Not reported | Not reported | Not reported |
| DS6 | Coretti et al. 2018 | 16S HTS | Illumina MiSeq | V3V4 | 11 | 14 | 2 | 4 | Not reported | Not reported |
| DS7 | Wang et al. 2019 | MTG | Illumina HiSeq 4000 | – | 43 | 31 | 2 | 8 | 4 | 73.0 |
| DS8 | Pulikkan et al. 2018 | 16S HTS | Illumina NextSeq 500 | V3 | 29 | 24 | 3 | 16 | 9 | 75.5 |
| DS9 | Averina et al. 2020 | MTG | Illumina HiSeq 4000 | – | 29 | 20 | 2 | 9 | 3 | 79.6 |
| DS10 | 16S HTS | Illumina MiSeq | V3V4 | 15 | 5 | 2 | 9 | 3.5 | 70.0 | |
| DS11 | Internal | 16S HTS | Illumina MiSeq | V4 | 107 | 92 | 2 | 11.83 | 4.75 | 72.4 |
| DS12 | PhyloChip | Thermo-Fisher Gene Titan | V1V9 | 96 | 93 | 2 | 11.83 | 5.17 | 72.0 | |
| DS13 | MTG | Illumina NextSeq | – | 96 | 91 | 2 | 11.83 | 5.17 | 73.3 |
HTS high-throughput sequencing, MTG metagenomics.
Figure 1Concordance of metadata variables across datasets. Each bar represents the number of common variables between datasets and solid black circles below the bar indicate datasets that contain the intersected set of variables. The “Total variables” inset displays the total number of metadata variables for each dataset. The top (present in at least 6 datasets) intersected variables are labelled while the remaining variables are listed in Supplementary file 1: Table S2.
Figure 2Variables associated with beta-diversity in multiple datasets. (a) Variables/combinations significantly associated with changes in bacterial community structure in at least 2 datasets are plotted. Each bar represents the total number of datasets in which a variable (or variable combination) was tested and the red fraction of the bar denotes the number of datasets where the difference in beta-diversity was significant (PERMANOVA, P < 0.05). The first bar in each panel represents the variable “Subset: None; Autism Spectrum Disorder—FALSE over TRUE.” Remaining variables are listed in Supplementary file 1: Table S3. (b) Proportions of datasets with significant tests (non-aggregated data) for each variable combination are plotted. Columns are used to arrange subset values while rows segregate test variables. Results from aggregated data (genus and species levels) are presented in Supplementary file 2 (Figs. S4, S5).
Figure 3Features associated with ASD at different taxonomic levels. Effect sizes and q-values from random-effects models (meta-analysis) are plotted. Each model represents an association between the abundance of a specific taxon and ASD (negative direction) or NT (positive direction). The color, transparency, and size of each point denotes the taxonomic rank of the taxon, the significance of the model, and the number of datasets included in the model, respectively. Horizontal dotted line indicates significance threshold (q = 0.05). Significant models are labelled by the taxon investigated. Strain-level results are reported in Table S4.
Figure 4Bowel dysfunction confounds relationships linking ASD to differential bacterial taxa abundances. Taxa significantly associated with NT (positive direction) or ASD (negative direction) by meta-analysis are plotted by phylum. Taxonomic label prefixes indicate the taxon’s rank where t__, s__, g__ or f__ are strain, species, genus or family, respectively. ASD-associated signatures are different when children with bowel dysfunction are included (left panels) compared to when only children with normal bowel function are considered (right panels). Missing data points indicate taxa absent from the remaining datasets and individuals. Effect sizes and significance were calculated using random-effects models.
Figure 5ASD-associated taxa abundance depends on the period of life studied. Effect sizes and q-values from random-effects models investigating the associations between different taxa and NT (positive direction) or ASD (negative direction) at different stages of childhood are plotted. The color, transparency, and size of each point denotes the taxonomic rank of the taxon, the significance of the model, and the number of datasets included in the model, respectively. Horizontal dotted line indicates significance threshold (q = 0.05). Significant models are labelled by the taxon investigated except for strain-level results. These models are reported in Supplementary file 1: Table S6.
Figure 6Sex-dependent differences in ASD v. NT populations. (a) Effect sizes and q-values from random-effects models investigating the associations between different taxa and sex (female: positive direction, male: negative direction) in the ASD compared to NT groups are plotted. (b) Models investigating ASD-associated taxa (NT: positive direction, ASD: negative direction) in either males or females are also plotted. The color, transparency, and size of each point denotes the taxonomic rank of the taxon, the significance of the model, and the number of datasets included in the model, respectively. Horizontal dotted line indicates significance threshold (q = 0.05). Significant models are labelled by the taxon investigated except for strain-level results. These models are reported in Supplementary file 1: Table S7.