| Literature DB >> 31856911 |
Jostein Gohli1, Kari Oline Bøifot2,3, Line Victoria Moen2, Paulina Pastuszek4, Gunnar Skogan2, Klas I Udekwu5, Marius Dybwad2,3.
Abstract
BACKGROUND: Mass transit environments, such as subways, are uniquely important for transmission of microbes among humans and built environments, and for their ability to spread pathogens and impact large numbers of people. In order to gain a deeper understanding of microbiome dynamics in subways, we must identify variables that affect microbial composition and those microorganisms that are unique to specific habitats.Entities:
Keywords: 16S rRNA gene; Aerosol; Air; Amplicon sequencing; Microbiome; Seasonal variation; Subway
Mesh:
Substances:
Year: 2019 PMID: 31856911 PMCID: PMC6924074 DOI: 10.1186/s40168-019-0772-9
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Top 20 phyla, families, and genera in air samples (N = 69) and surface samples (N = 177). Dots indicate that a group is also represented in the top 20 set from the other sample matrix. Prevalence is the sum of prevalence of all ASVs within a taxonomic group
| Air samples | Surface samples | ||||||
|---|---|---|---|---|---|---|---|
| Phylum | Prevalence | Abundance (%) | Phylum | Prevalence | Abundance (%) | ||
| | • | 32638 | 42.92 | • | 59693 | 31.32 | |
| | • | 27216 | 23.88 | • | 61648 | 27.47 | |
| | • | 12494 | 11.97 | • | 28790 | 16.65 | |
| | • | 13390 | 8.10 | • | 34972 | 10.99 | |
| | • | 3988 | 6.29 | • | 10775 | 7.88 | |
| | • | 2731 | 1.53 | • | 3184 | 1.37 | |
| | • | 2451 | 1.10 | • | 4904 | 1.05 | |
| | • | 1445 | 1.06 | • | 3591 | 0.67 | |
| | • | 1976 | 0.68 | • | 1557 | 0.59 | |
| | • | 294 | 0.66 | • | 3321 | 0.51 | |
| | • | 1335 | 0.53 | • | 2281 | 0.37 | |
| | • | 998 | 0.34 | • | 2086 | 0.33 | |
| | • | 910 | 0.33 | • | 1667 | 0.26 | |
| | • | 410 | 0.17 | • | 958 | 0.18 | |
| | • | 381 | 0.15 | • | 773 | 0.12 | |
| | • | 250 | 0.10 | • | 162 | 0.11 | |
| | • | 185 | 0.06 | • | 155 | 0.03 | |
| | • | 65 | 0.03 | • | 172 | 0.02 | |
| | • | 56 | 0.02 | • | 111 | 0.02 | |
| | • | 28 | 0.01 | • | 69 | 0.01 | |
| Family | Prevalence | Abundance (%) | Family | Prevalence | Abundance (%) | ||
| | • | 19657 | 15.01 | • | 42005 | 14.23 | |
| | • | 3092 | 10.49 | • | 5787 | 7.21 | |
| | • | 3354 | 4.78 | • | 8367 | 5.84 | |
| | • | 1965 | 4.26 | • | 4280 | 5.00 | |
| | • | 5034 | 4.06 | • | 3810 | 4.53 | |
| | 1143 | 3.52 | • | 10578 | 4.42 | ||
| | • | 4187 | 3.35 | • | 9178 | 3.84 | |
| | • | 2899 | 3.24 | • | 3982 | 3.35 | |
| | • | 1689 | 2.96 | • | 4365 | 3.34 | |
| | • | 2555 | 2.60 | • | 5957 | 2.64 | |
| | • | 1578 | 2.49 | • | 4933 | 2.29 | |
| | • | 1498 | 2.30 | • | 2947 | 2.20 | |
| | 1755 | 2.14 | • | 3413 | 2.06 | ||
| | • | 1558 | 2.07 | • | 3654 | 1.93 | |
| | • | 969 | 2.06 | • | 4970 | 1.89 | |
| | • | 1604 | 1.80 | • | 2498 | 1.52 | |
| | • | 1224 | 1.71 | • | 1747 | 1.46 | |
| | • | 955 | 1.43 | • | 3227 | 1.28 | |
| | • | 1543 | 1.27 | 1474 | 1.16 | ||
| | • | 1476 | 1.13 | 2604 | 1.15 | ||
| Genus | Prevalence | Abundance (%) | Genus | Prevalence | Abundance (%) | ||
| | • | 38724 | 26.51 | • | 82052 | 24.72 | |
| | • | 377 | 3.97 | • | 3208 | 4.57 | |
| | • | 1445 | 3.83 | • | 3184 | 4.23 | |
| | 1143 | 3.52 | • | 2430 | 3.96 | ||
| | • | 1260 | 3.37 | • | 8696 | 3.75 | |
| | • | 3850 | 3.19 | • | 3278 | 2.70 | |
| | • | 999 | 2.99 | • | 1904 | 2.07 | |
| | • | 1248 | 2.13 | • | 915 | 1.99 | |
| | • | 1867 | 2.06 | • | 594 | 1.82 | |
| | • | 453 | 1.49 | • | 1106 | 1.65 | |
| | 494 | 1.38 | • | 3355 | 1.54 | ||
| | • | 383 | 1.26 | • | 2985 | 1.43 | |
| | • | 658 | 1.22 | 523 | 1.23 | ||
| | 753 | 1.10 | • | 1459 | 1.16 | ||
| | 378 | 0.92 | • | 2528 | 1.14 | ||
| | • | 907 | 0.92 | 1738 | 1.01 | ||
| | 315 | 0.84 | 1545 | 0.92 | |||
| | • | 1025 | 0.80 | 1596 | 0.84 | ||
| | • | 629 | 0.80 | 1076 | 0.83 | ||
| | • | 593 | 0.80 | • | 1432 | 0.78 | |
Fig. 1Taxonomic overview. a Relative abundances of the top 15 phyla. b Heatmap of most abundant families (relative abundance ≥ 0.01), color coded by phylum following the legend in panel a. Particularly differentiated features are highlighted with arrows, where green indicate seasonal variation and red variation between air and surface samples
Fig. 2The distribution of amplicon sequence variants (ASVs) across seasons and sample matrices. Panel a includes all ASVs and panel b only ASVs with prevalence > 4 and abundance > 10
Random forest classification models of season
| Season | |||||
|---|---|---|---|---|---|
| Out-of-bag estimate of error rate: 8.94% | |||||
| Confusion matrix | Autumn | Spring | Summer | Winter | Class error (%) |
| Autumn | 53 | 1 | 5 | 1 | 11.7 |
| Spring | 0 | 54 | 1 | 1 | 3.6 |
| Summer | 9 | 0 | 63 | 0 | 12.5 |
| Winter | 2 | 2 | 0 | 54 | 6.9 |
| Most important genera in sample classification | |||||
| Family:genera | Autumn | Spring | Summer | Winter | MDA |
| | 0.026 | 0.039 | 0.040 | 0.069 | 0.043 |
| | 0.022 | 0.057 | 0.011 | 0.004 | 0.022 |
| | 0.009 | 0.012 | 0.021 | 0.040 | 0.020 |
| | 0.020 | 0.024 | 0.011 | 0.013 | 0.016 |
| | 0.024 | 0.003 | 0.002 | 0.014 | 0.010 |
| | 0.009 | 0.012 | 0.007 | 0.009 | 0.009 |
| | 0.000 | 0.011 | 0.002 | 0.018 | 0.008 |
| | 0.001 | 0.015 | 0.003 | 0.012 | 0.007 |
| | 0.008 | 0.015 | 0.001 | 0.002 | 0.006 |
| | 0.000 | 0.018 | 0.003 | 0.004 | 0.006 |
| | 0.001 | 0.001 | 0.009 | 0.010 | 0.005 |
| | 0.000 | 0.016 | 0.001 | 0.006 | 0.005 |
| | 0.001 | 0.006 | 0.003 | 0.012 | 0.005 |
| | 0.001 | 0.009 | 0.002 | 0.009 | 0.005 |
| | 0.012 | 0.007 | 0.002 | 0.000 | 0.005 |
| | 0.003 | 0.017 | 0.000 | 0.001 | 0.005 |
| | 0.003 | 0.000 | 0.010 | 0.004 | 0.005 |
| | 0.001 | 0.000 | 0.006 | 0.011 | 0.004 |
| | 0.001 | 0.006 | 0.002 | 0.009 | 0.004 |
| | 0.000 | 0.010 | 0.002 | 0.004 | 0.004 |
Confusion matrices show the classification of samples and the associated class error. The mean decrease in model accuracy (MDA; from removing the genus in question) and mean Z-scores are given for the 20 most important genera for classifying samples
Random forest classification models of air/surface
| Air/surface | |||
|---|---|---|---|
| Out-of-bag estimate of error rate: 6.1% | |||
| Confusion matrix | Air | Surface | Class error (%) |
| Air | 56 | 13 | 18.8 |
| Surface | 2 | 175 | 1.1 |
| Most important genera in sample classification | |||
| Family:Genera | Air | Surface | MDA |
| | 0.027 | 0.010 | 0.015 |
| | 0.020 | 0.006 | 0.010 |
| | 0.018 | 0.006 | 0.009 |
| | 0.015 | 0.004 | 0.007 |
| | 0.012 | 0.004 | 0.006 |
| | 0.013 | 0.003 | 0.006 |
| | 0.011 | 0.004 | 0.006 |
| | 0.011 | 0.004 | 0.006 |
| | 0.008 | 0.004 | 0.005 |
| | 0.012 | 0.003 | 0.005 |
| | 0.012 | 0.002 | 0.005 |
| | 0.008 | 0.004 | 0.005 |
| | 0.009 | 0.003 | 0.005 |
| | 0.009 | 0.003 | 0.005 |
| | 0.007 | 0.003 | 0.004 |
| | 0.008 | 0.002 | 0.004 |
| | 0.007 | 0.002 | 0.003 |
| | 0.007 | 0.002 | 0.003 |
| | 0.007 | 0.002 | 0.003 |
| | 0.006 | 0.002 | 0.003 |
Confusion matrices show the classification of samples and the associated class error. The mean decrease in model accuracy (MDA; from removing the genus in question) and mean Z-scores are given for the 20 most important genera for classifying samples
Fig. 3Analysis of Shannon’s diversity index. The four significant predictors of within-sample diversity (see Table 4)
The best-fit model for Shannon’s diversity index score, which explained 27% of within-sample diversity variance and had a p value of 1.04 × 10−09. Slopes are given for continuous predictor variables and interactions between continuous and categorical predictors with two levels. Observed trends, from low to high average Shannon’s diversity scores, are given for the categorical predictors
| Predictor | DF | Sum Sq. | Mean Sq. | Slope/trend | ||
|---|---|---|---|---|---|---|
| Temperature | 1 | 0.089 | 0.089 | 17.62 | < 0.001 | 0.0006 |
| Air/surface | 1 | 0.041 | 0.041 | 8.20 | 0.005 | Surface > air |
| Season | 3 | 0.102 | 0.034 | 6.75 | < 0.001 | Winter > autumn > summer > spring |
| Humidity | 1 | 0.029 | 0.029 | 5.74 | 0.017 | − 0.0017 |
| Humidity SD | 1 | 0.005 | 0.005 | 1.04 | 0.309 | − 0.1415 |
| Temperature SD | 1 | 0.002 | 0.002 | 0.47 | 0.493 | − 0.1018 |
| Time of day | 1 | 0.002 | 0.002 | 0.43 | 0.514 | − 0.0001 |
| Indoor/Outdoor | 1 | 0.000 | 0.000 | 0.000 | 1.000 | Outdoor > indoor |
| Temperature SD: temperature | 1 | 0.097 | 0.097 | 19.11 | < 0.001 | − 0.0130 |
| Temperature: air/surface | 1 | 0.051 | 0.051 | 10.02 | 0.002 | − 0.0047 |
| Time: indoor/outdoor | 1 | 0.029 | 0.029 | 5.75 | 0.017 | 0.0005 |
| Humidity SD: season | 3 | 0.061 | 0.020 | 4.04 | 0.008 | |
| Time: season | 3 | 0.034 | 0.011 | 2.24 | 0.085 | |
| Season: indoor/outdoor | 3 | 0.029 | 0.010 | 1.91 | 0.130 | |
| Humidity SD: humidity | 1 | 0.005 | 0.005 | 1.01 | 0.316 | 0.0011 |
| Temperature SD: season | 3 | 0.006 | 0.002 | 0.37 | 0.773 | |
| Humidity SD: temperature | 1 | 0.000 | 0.000 | 0.08 | 0.772 | 0.0060 |
| Temperature SD: time of day | 1 | 0.000 | 0.000 | 0.00 | 0.990 | 0.0004 |
| Residuals | 209 | 1.056 | 0.005 |
The best-fit PERMANOVA model, which explained 56% of among-sample diversity (Bray–Curtis dissimilarity)
| Predictor | DF | Sum Sq. | Mean Sq. | |||
|---|---|---|---|---|---|---|
| Air/surface | 1 | 2.484 | 2.484 | 14.16 | 0.04 | 0.001 |
| Season | 3 | 6.593 | 2.198 | 12.52 | 0.11 | 0.001 |
| Subway station | 15 | 9.134 | 0.609 | 3.47 | 0.15 | 0.001 |
| Temperature | 1 | 0.475 | 0.475 | 2.71 | 0.01 | 0.001 |
| Sequence run | 3 | 1.063 | 0.354 | 2.02 | 0.02 | 0.001 |
| Time of day | 1 | 0.239 | 0.239 | 1.36 | < 0.01 | 0.050 |
| Season: air/surface | 3 | 0.853 | 0.284 | 1.620 | 0.01 | 0.001 |
| Subway station: air/surface | 15 | 3.382 | 0.225 | 1.28 | 0.06 | 0.001 |
| Season: subway station | 44 | 9.792 | 0.223 | 1.27 | 0.16 | 0.001 |
| Residuals | 151 | 26.500 | 0.175 | 0.44 |
Fig. 4Analysis of Bray–Curtis dissimilarity distances. PCoA plots of among-sample diversity with significant predictors from the PERMANOVA model (see Table 5). Dashed circles represent 95% CI for each cluster