| Literature DB >> 31666099 |
Chengsheng Zhu1, Maximilian Miller2,3,4, Nick Lusskin2, Yannick Mahlich2,3,4,5, Yanran Wang2, Zishuo Zeng2, Yana Bromberg6,7.
Abstract
BACKGROUND: Accumulating evidence suggests that the human microbiome impacts individual and public health. City subway systems are human-dense environments, where passengers often exchange microbes. The MetaSUB project participants collected samples from subway surfaces in different cities and performed metagenomic sequencing. Previous studies focused on taxonomic composition of these microbiomes and no explicit functional analysis had been done till now.Entities:
Keywords: Function analysis; Machine learning; MetaSUB; Microbiome; mi-faser
Mesh:
Year: 2019 PMID: 31666099 PMCID: PMC6822482 DOI: 10.1186/s13062-019-0252-y
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 1The city origins of the subway metagenomic samples. In a), the colored samples are from the known and known-unknown sets; the white samples are from the unknown and mix sets. Note that b) the known set and c) the known-unknown set are similarly dominated by NYC and Porto
Fig. 2The functional profiles of the same city cluster together in the t-SNE plot [26]
Assignment performance based on the eight city models
|
|
|
| ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Top hit | Second hit | Top hit | Second hit | Top hit | Second hit | |||||||||||||
| Ta | Fa | % | Ta | Fa | % | Ta | Fa | % | Ta | Fa | % | Ta | Fa | % | Ta | Fa | % | |
|
| 5 | 10 | 33 | 8 | 7 | 53 | 5 | 10 | 33 | 14 | 1 | 93 | 8 | 7 | 53 | 12 | 3 | 80 |
|
| 11 | 5 | 69 | 13 | 3 | 81 | 3 | 13 | 19 | 9 | 7 | 56 | 7 | 9 | 44 | 14 | 2 | 88 |
|
| 113 | 13 | 90 | 121 | 5 | 96 | 114 | 12 | 90 | 123 | 3 | 98 | 95 | 31 | 75 | 120 | 6 | 95 |
|
| 11 | 8 | 58 | 17 | 2 | 89 | 14 | 5 | 74 | 18 | 1 | 95 | 15 | 4 | 79 | 19 | 0 | 100 |
|
| 45 | 15 | 75 | 57 | 3 | 95 | 51 | 9 | 85 | 59 | 1 | 98 | 43 | 17 | 72 | 56 | 4 | 93 |
|
| 26 | 8 | 76 | 27 | 7 | 79 | 31 | 3 | 91 | 33 | 1 | 97 | 31 | 3 | 91 | 32 | 2 | 94 |
|
| 17 | 3 | 85 | 17 | 3 | 85 | 17 | 3 | 85 | 19 | 1 | 95 | 18 | 2 | 90 | 19 | 1 | 95 |
|
| 15 | 5 | 75 | 18 | 2 | 90 | 15 | 5 | 75 | 20 | 0 | 100 | 19 | 1 | 95 | 20 | 0 | 100 |
|
| 243 | 67 | 78 | 278 | 32 | 90 | 250 | 60 | 81 | 295 | 15 | 95 | 236 | 74 | 76 | 292 | 18 | 94 |
aAssignments are correct (T, true) if the sample provenance matches either of the two predicted cities, and incorrect (F, false) otherwise
Fig. 3Distribution of prediction scores from the city predictors trained on the selected 20 ECs. a AKL (Auckland); b HAM (Hamilton); c NYC (New York City); d OFA (Ofa); e PXO (Porto); f SAC (Sacramento); g SCL (Santiago); h TOK (Tokyo). Positive (P) and negative (N) score distributions for raw-select models were less obvious to their resampled model (balance-select) versions (ReP and ReN)
Final model scores for the known-unknown set
Boldface indicates correct hits. Shading indicates samples for which top hits are wrong and second hits are right
The top two city with highest normalized score (final-unbalanced) for the known-unknown set
Boldface indicates correct hits. Shading indicates samples for which top hits are wrong and second hits are right
Fig. 4Distribution of top-match scores from final-SVM. The columns from the left are: known set, random set, SAND set, Ilorin samples from unknown set, Lisbon samples from unknown set, Boston samples from unknown set and mix set. The black dash line indicates 0.65, the cutoff below which the samples are likely to be random, i.e., the sample is not from any of the eight cities with which we trained our model
Fig. 5Venn diagrams [29] of city subway microbiome signature overlaps between a) AKL (Auckland) and HAM (Hamilton), and b) NYC (New York City), TOK (Tokyo) and PXO (Porto)
The microbial functional signatures shared between AKL and HAM
| EC | Annotation |
|---|---|
| 1.8.1.15 | mycothione reductase |
| 1.8.7.1 | assimilatory sulfite reductase (ferredoxin) |
| 1.10.3.10 | ubiquinol oxidase (H + -transporting) |
| 1.2.2.3 | formate dehydrogenase (cytochrome-c-553) |
| 2.4.1.288 | galactofuranosylgalactofuranosylrhamnosyl-N-acetylglucosaminyl-diphospho-decaprenol beta-1,5/1,6-galactofuranosyltransferase |
| 5.4.3.5 | D-ornithine 4,5-aminomutase |
The microbial functional signatures shared between NYC and TOK
| EC | Annotation |
|---|---|
| 1.14.19.6 | Delta(12)-fatty-acid desaturase |
| 1.2.1.80a | Long-chain acyl-[acyl-carrier-protein] reductase |
| 1.3.5.5a | 15-cis-phytoene desaturase |
| 1.3.7.5a | Phycocyanobilin:ferredoxin oxidoreductase |
| 2.5.1.115a | Homogentisate phytyltransferase (HPT) |
| 3.4.15.6a | Cyanophycinase |
| 4.1.99.5a | Aldehyde decarbonylase (AD) |
aPhotosynthesis-related functions
The microbial functional signatures shared between NYC and PXO
| EC | Annotation |
|---|---|
| 1.10.3.9a | Photosystem II protein |
| 1.4.1.1 | Alanine dehydrogenase |
| 1.97.1.12a | Photosystem I iron-sulfur center |
| 3.2.1.31a | Beta-glucuronidase (GUS) |
| 4.1.1.39a | Ribulose bisphosphate carboxylase (RuBisCO) |
| 6.6.1.1a | Magnesium-chelatase 38 kDa subunit |
aPhotosynthesis-related functions