| Literature DB >> 31554845 |
Abhijeet R Sonawane1, Liang Tian1,2,3, Chin-Yi Chu4, Xing Qiu5, Lu Wang5, Jeanne Holden-Wiltse5, Alex Grier6, Steven R Gill6, Mary T Caserta4, Ann R Falsey7, David J Topham6, Edward E Walsh7, Thomas J Mariani4, Scott T Weiss1, Edwin K Silverman1, Kimberly Glass8, Yang-Yu Liu9,10.
Abstract
Respiratory syncytial virus (RSV) is a major cause of lower respiratory tract infections and hospital visits during infancy and childhood. Although risk factors for RSV infection have been identified, the role of microbial species in the respiratory tract is only partially known. We aimed to understand the impact of interactions between the nasal microbiome and host transcriptome on the severity and clinical outcomes of RSV infection. We used 16 S rRNA sequencing to characterize the nasal microbiome of infants with RSV infection. We used RNA sequencing to interrogate the transcriptome of CD4+ T cells obtained from the same set of infants. After dimension reduction through principal component (PC) analysis, we performed an integrative analysis to identify significant co-variation between microbial clade and gene expression PCs. We then employed LIONESS (Linear Interpolation to Obtain Network Estimates for Single Samples) to estimate the clade-gene association patterns for each infant. Our network-based integrative analysis identified several clade-gene associations significantly related to the severity of RSV infection. The microbial taxa with the highest loadings in the implicated clade PCs included Moraxella, Corynebacterium, Streptococcus, Haemophilus influenzae, and Staphylococcus. Interestingly, many of the genes with the highest loadings in the implicated gene PCs are encoded in mitochondrial DNA, while others are involved in the host immune response. This study on microbiome-transcriptome interactions provides insights into how the host immune system mounts a response against RSV and specific infectious agents in nasal microbiota.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31554845 PMCID: PMC6761288 DOI: 10.1038/s41598-019-50217-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1An overview of the data and analyses performed in this study.
Demographic and clinical information for the 58 infants included in our analysis.
| RSV Severity Group | P Value | ||
|---|---|---|---|
| Mild n (%) | Severe n (%) | ||
|
| |||
| Female | 13 (44.83%) | 16 (55.17%) | 0.592 |
| Male | 10 (34.48%) | 19 (65.52%) | |
|
| |||
| Hispanic or Latino | 6 (60%) | 4 (40%) | 0.173 |
| Non-Hispanic or Non-Latino | 17 (35.42%) | 31 (64.58%) | |
|
| |||
| White | 14 (36.84%) | 24 (63.16%) | 0.789 |
| Black or African American | 5 (45.45%) | 6 (54.55%) | |
| Other | 4 (44.44%) | 5 (55.56%) | |
|
| |||
| Not Hospitalized | 23 | 0 | |
| Hospitalized (>24 h) | 1 | 34 | |
|
|
| P Value | |
| Severity score | 1.32 (0.20) | 6.37 (0.29) | <0.001 |
| Age at Visit 1 | 3.81 (0.45) | 2.98 (0.37) | 0.155 |
| Days since onset relative to Visit 1 | 4.87 (0.50) | 5.21 (0.38) | 0.595 |
Figure 2(A) Overview of study timelines, including when each type of data was collected. All 58 infants included in this study had both transcriptomic and microbiome data collected for at least one time point. (B) Venn diagrams showing the overlap in the infants that had either transcriptomic or microbiome data collected during the acute visit (57 of the 58 infants), the overlap in the transcriptomic data collected at either the initial (Visit 1) or follow-up visit (Visit 3), and the overlap in the infants microbiome data collected at either the initial (Visit 1) or follow-up visit (Visit 2). Numbers are shown separately for mild and severe groups, with the total number in each category indicated below the Venn diagram. (C) Differential gene expression analysis comparing Visit 1 versus Visit 2 RNA-Seq samples yields 27 significantly differentially expressed genes (FDR < 0.05). The log2 of the expression levels of these genes are shown as boxplots combining mild and severe samples. The genes on the left side of the panel have higher mean expression during the acute visit and those on the right side of the panel have higher expression levels in the follow-up visit (D,E) The log2 expression levels for EZH1 comparing (D) data from infants with severe versus mild RSV infection and (E) data from the acute and follow-up visit. The reported FDR p-values are based on limma analysis. (F,G) Abundance profiles of nasal microbiota comparing groups of mild and severe infants at each of the two visits, including (F) observed diversity and (G) Shannon diversity index.
Figure 3(A) Principal component analysis was performed on Visit 1 samples for which we had both transcriptome and microbiome data to reduce the transcriptome into gPCs (top) and the microbiome into cPCs (bottom). (B) Cumulative distribution of the amount of variance explained by the top gPCs and cPCs. The vertical and horizontal lines indicate the number of PCs which explain 95% of variance and which were included in our network analysis. (C) The coordinates of the gene and clade principal components across the 40 analyzed samples from Visit 1. (D) Spearman correlation across the loadings of the top 13 gPCs and the top 10 cPCs.
Figure 4(A) The weights of edges predicted for each of the 40 sample-specific networks obtained from the LIONESS analysis. Columns are grouped based on each infant’s infection severity, sex, and race. Rows are sorted based on the significance of edges. (B) The significance of each edge as defined from multivariate linear analysis comparing edge-weights between two groups, mild and severe, and corrected for sex and race. The top significant edges with nominal p-value less than 0.05 are labeled.
Microbial composition and loadings of the top clades in each of the cPCs identified as associated with significant edges (with gPCs in parentheses) in the sample-specific network analysis. X = unclassified.
| Family | Genus | Species | OTU ID | PC loading | |
|---|---|---|---|---|---|
|
| Streptococcaceae |
|
| 4479601 | −0.834726 |
| Corynebacteriaceae |
|
| 4474764 | 0.378128 | |
| Aerococcaceae |
|
| 886735 | 0.333977 | |
| Staphylococcaceae |
|
| 1060737 | 0.147457 | |
| Neisseriaceae |
|
| 861327 | −0.074763 | |
| Moraxellaceae |
|
| 932707 | 0.071822 | |
| Pasteurellaceae |
|
| 3125352 | 0.067097 | |
| Moraxellaceae |
|
| 893147 | 0.064341 | |
| Moraxellaceae |
|
| 1080004 | 0.044229 | |
| Streptococcaceae |
|
| 131778 | −0.038727 | |
|
| Oxalobacteraceae |
|
| 550644 | 0.4919892 |
| Corynebacteriaceae |
|
| 4474764 | −0. 4735642 | |
| Neisseriaceae |
|
| 861327 | −0. 4097664 | |
| Corynebacteriaceae |
|
| 997086 | 0.3448678 | |
| Aerococcaceae |
|
| 886735 | 0. 2542127 | |
| Streptococcaceae |
|
| 1076969 | −0.2107604 | |
| Moraxellaceae |
|
| 932707 | −0.2008934 | |
| Pseudomonadaceae |
|
| 350105 | 0.1744260 | |
| Streptococcaceae |
|
| 4479601 | − 0.1421117 | |
| Staphylococcaceae |
|
| 1060737 | −0.1144590 | |
|
| Corynebacteriaceae |
|
| 997086 | 0.81347 |
| Oxalobacteraceae |
|
| 550644 | −0.460545 | |
| Aerococcaceae |
|
| 886735 | −0.208054 | |
| Staphylococcaceae |
|
| 1060737 | −0.175109 | |
| Corynebacteriaceae |
|
| 4474764 | 0.152976 | |
| Neisseriaceae |
|
| 861327 | −0.086017 | |
| Moraxellaceae |
|
| 893147 | −0.062479 | |
| Streptococcaceae |
|
| 4455633 | −0.058702 | |
| Corynebacteriaceae |
|
| 484315 | 0.053777 | |
| Pseudomonadaceae |
|
| 350105 | 0.050156 | |
|
| Aerococcaceae |
|
| 886735 | −0.56923761 |
| Oxalobacteraceae |
|
| 550644 | 0.55397649 | |
| Corynebacteriaceae |
|
| 4474764 | 0.52806963 | |
| Neisseriaceae |
|
| 861327 | −0.18306144 | |
| Staphylococcaceae |
|
| 1060737 | −0.15290246 | |
| Moraxellaceae |
|
| 893147 | −0.13401397 | |
| Streptococcaceae |
|
| 4455633 | −0. 08041118 | |
| Pseudomonadaceae |
|
| 350105 | 0.04756214 | |
| Pasteurellaceae |
|
| 3125352 | 0.04140956 | |
| Streptococcaceae |
|
| 4479601 | −0.02649437 | |
|
| Moraxellaceae |
|
| 1080004 | −0.95754551 |
| Staphylococcaceae |
|
| 1060737- | 0.15074314 | |
| Corynebacteriaceae |
|
| 4474764 | 0.11808617 | |
| Neisseriaceae |
|
| 861327 | 0.08584030 | |
| Aerococcaceae |
|
| 886735 | 0.08026760 | |
| Moraxellaceae |
|
| 1053321 | −0.07771730 | |
| Pasteurellaceae |
|
| 3125352 | 0.07023736 | |
| Streptococcaceae |
|
| 4479601 | 0.06943155 | |
| Moraxellaceae |
|
| 932707 | 0.06696839 | |
| Moraxellaceae |
|
| 893147 | 0.06376892 | |
|
| Pasteurellaceae |
|
| 3125352 | 0.68452895 |
| Staphylococcaceae |
|
| 1060737 | −0.54778068 | |
| Neisseriaceae |
|
| 861327 | 0.32793288 | |
| Moraxellaceae |
|
| 893147 | −0.20623383 | |
| Moraxellaceae |
|
| 932707 | −0.19733937 | |
| Streptococcaceae |
|
| 4479601 | −0.11957387 | |
| Corynebacteriaceae |
| 4474764 | −0.09267271 | ||
|
| Aerococcaceae |
|
| 886735 | 0.07434957 |
| Streptococcaceae |
|
| 4455633 | −0.05788193 | |
| Pasteurellaceae |
|
| 2243354 | 0.05635886 | |
Top 20 genes with positive loadings in the gPCs (with associated cPCs in parentheses) identified as associated with significant edges in the sample-specific network analysis.
| gPC1 (cPC2) | gPC3 (cPC10) | gPC8 (cPC8 and cPC4) | gPC12 (cPC1) | gPC13 (cPC9) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Genes | loadings | Genes | loadings | Genes | loadings | Genes | loadings | Genes | loadings |
| MT-ATP8 | 0.2631 | MT-ATP8 | 0.4266 | CCR7 | 0.2988 | RPL10 | 0.2631 | MT-CO2 | 0.3230 |
| MT-CO2 | 0.1894 | MALAT1 | 0.3832 | MT-ND4L | 0. 2391 | TMSB4X | 0.1894 | B2M | 0.2056 |
| ACTB | 0.1122 | MT-CO2 | 0.1134 | RPS3 | 0. 2103 | MT-ATP8 | 0.1122 | MT-CO3 | 0.2044 |
| MT-ND4 | 0.1053 | MT-ATP6 | 0.0846 | RPL3 | 0. 1979 | RPL30 | 0.1053 | UCP2 | 0.1937 |
| MT-CYB | 0.0937 | MT-CYB | 0.0814 | RPL10 | 0. 1977 | RPS3 | 0.0937 | RPL10 | 0.1893 |
| MT-ATP6 | 0.0865 | IFITM2 | 0.0738 | RPL21 | 0. 1921 | TXNIP | 0.0865 | RPL30 | 0.1619 |
| CCR7 | 0.0665 | DDX5 | 0.070 | MALAT1 | 0.1468 | RPL14 | 0.0665 | CD52 | 0.1411 |
| GNB2L1 | 0.0653 | MT-CO1 | 0.0659 | GNB2L1 | 0.1464 | RPS28 | 0.0653 | MT-ND1 | 0.1297 |
| MT-ND4L | 0.0653 | MT-ND4 | 0.0605 | MT-CO2 | 0.1342 | CD52 | 0.0653 | RPLP1 | 0.1181 |
| UCP2 | 0.0557 | IFITM1 | 0.0543 | ARHGDIB | 0.1308 | RPL19 | 0.0557 | RPS4Y1 | 0.1076 |
| SELL | 0.0521 | ACTB | 0.0496 | UCP2 | 0.1234 | B2M | 0.0521 | EIF1 | 0.1014 |
| MT-CO3 | 0.0513 | MT-CO3 | 0.0482 | PABPC1 | 0.1123 | RPL18 | 0.0513 | MALAT1 | 0.0967 |
| TRAC | 0.0506 | SELL | 0.0312 | TRAC | 0.1123 | IL7R | 0.0506 | H3F3B | 0.0945 |
| RPL3 | 0.0424 | MT-ND5 | 0.0307 | IL7R | 0.1017 | TRAC | 0.0424 | RPLP0 | 0.09342 |
| ARHGDIB | 0.0420 | MT-ND4L | 0.0297 | MT-CO3 | 0.0928 | PTMA | 0.0420 | RPL36 | 0.0822 |
| MT-CO1 | 0.0410 | MT-ND2 | 0.0295 | RPS28 | 0.0895 | RPS20 | 0.0410 | RPL23A | 0.08148 |
| RPL10 | 0.0397 | H3F3B | 0.0282 | MTND1P23 | 0.0886 | RPL27A | 0.0397 | GAPDH | 0.0814 |
| MT-ND1 | 0.03553 | CCR7 | 0.0281 | HIST1H4C | 0.0874 | MT-ND6 | 0.03553 | MT-ND4 | 0.0778 |
| HSPA8 | 0.0321 | IFI44L | 0.0281 | SNORD13 | 0.0874 | PABPC1 | 0.0321 | RPL18A | 0.07362 |
| MT-ND5 | 0.0301 | IFIT3 | 0.0275 | RPS4X | 0.0834 | TRBV25-1 | 0.0301 | ACTB | 0.0721 |
Figure 5GO term enrichment analysis for the top 100 genes based on the loadings of (A) gPC1, (B) gPC8, and (C) gPC3. These bubble plots include the 25 most significant GO terms identified in each analysis based on the FDR significance. All terms shown are significant at an FDR < 0.05. Size of each bubble indicates the number of genes annotated to the respective GO term and the color indicates the percentage of the top 100 genes annotated to that term. The top 25 GO terms enriched in the other identified gPCs are shown in Supplemental Fig. 6A,B and all GO terms enriched at an FDR < 0.05 are shown in Supplemental Fig. 6C–G.