| Literature DB >> 36042194 |
Michimasa Fujiogi1, Yoshihiko Raita2, Marcos Pérez-Losada3,4, Robert J Freishtat5,6,7, Juan C Celedón8, Jonathan M Mansbach9, Pedro A Piedra10, Zhaozhong Zhu2, Carlos A Camargo2, Kohei Hasegawa2.
Abstract
Bronchiolitis is a leading cause of infant hospitalizations but its immunopathology remains poorly understood. Here we present data from 244 infants hospitalized with bronchiolitis in a multicenter prospective study, assessing the host response (transcriptome), microbial composition, and microbial function (metatranscriptome) in the nasopharyngeal airway, and associate them with disease severity. We investigate individual associations with disease severity identify host response, microbial taxonomical, and microbial functional modules by network analyses. We also determine the integrated relationship of these modules with severity. Several modules are significantly associated with risks of positive pressure ventilation use, including the host-type I interferon, neutrophil/interleukin-1, T cell regulation, microbial-branched-chain amino acid metabolism, and nicotinamide adenine dinucleotide hydrogen modules. Taken together, we show complex interplays between host and microbiome, and their contribution to disease severity.Entities:
Mesh:
Year: 2022 PMID: 36042194 PMCID: PMC9427849 DOI: 10.1038/s41467-022-32323-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Analytic flow of integrated-omics analysis.
This flowchart presents a brief overview of the main analytical steps in the current study. The steps are shown in order from top to bottom (A to C). For each of 1) host response (transcriptome), 2) microbial composition (metatranscriptome), and 3) microbial function (metatranscriptome) data elements, we individually performed the analysis in steps A and B. Then, we subsequently integrated these omics data in step C. A We examined the relationship of each omics data element with the risk of PPV use at the individual data level. B To reduce the dimensions of the host response, microbial composition, and microbial function data, we performed a weighted gene co-expression network analysis and identified distinct networks (modules). In each omics element, we selected the top five modules with the highest correlation of PPV use and biological significance for the subsequent integrated analyses. C Finally, to determine the integrated relationships of these dual-transcriptome modules with the risk of PPV use, we constructed a logistic regression model with ridge regularization. To uncover the causal relationship structure between these dual-transcriptome modules, we also applied a causal structural learning approach. Abbreviations: FDR, false discovery rate; PPV, positive pressure ventilation; WGCNA, weighted gene co-expression network analysis.
Patient characteristics of 244 infants hospitalized for bronchiolitis
| Overall ( | |
|---|---|
| Age (month), median (IQR) | 3.1 (1.7–6.2) |
| Female sex | 98 (40) |
| Race/ethnicity | |
| Non-Hispanic white | 102 (42) |
| Non-Hispanic black | 57 (23) |
| Hispanic | 76 (31) |
| Other or unknown | 9 (4) |
| Maternal smoking during pregnancy | 34 (14) |
| C-section delivery | 84 (34) |
| Prematurity (<37 weeks) | 47 (19) |
| Mostly breastfed for the first 3 months of age | 115 (47) |
| Previous breathing problems before the index hospitalizationa | |
| 1 episode | 30 (12) |
| ≥2 episodes | 10 (4) |
| History of eczema | 31 (13) |
| Ever attended daycare | 71 (29) |
| Corticosteroid use before the index hospitalization | 18 (7) |
| Lifetime history of systemic antibiotic use | 79 (32) |
| Body weight at presentation (kg), median (IQR) | 6.1 (4.6–8.0) |
| Respiratory rate at presentation (per minute), median (IQR) | 48 (40–60) |
| Oxygen saturation at presentation | |
| <90% | 18 (7) |
| 90–93% | 29 (12) |
| ≥94% | 190 (78) |
| Any RSV | 222 (91) |
| Any rhinovirus | 51 (21) |
| RSV/rhinovirus coinfection | 29 (12) |
| Other coinfection pathogensb | 47 (19) |
| Positive pressure ventilation during hospitalizationc | 18 (7) |
| Intensive care use during hospitalizationd | 42 (17) |
| Hospital length of stay (day), median (IQR) | 2 (1–3) |
IQR interquartile range, RSV respiratory syncytial virus.
Data are n (%) of infants unless otherwise indicated. Percentages may not equal 100 because of rounding and missingness.
aDefined as an infant having a cough that wakes him or her at night or causes emesis, or when the child has wheezing or shortness of breath without cough.
bAdenovirus, bocavirus, Bordetella pertussis, enterovirus, human coronavirus NL63, OC43, 229E, or HKU1, human metapneumovirus, influenza A or B virus, Mycoplasma pneumoniae, and parainfluenza virus 1–3.
cDefined as use of invasive and/or non-invasive mechanical ventilation (e.g., continuous positive airway pressure ventilation).
dDefined as use of positive pressure ventilation and/or intensive care unit admission.
Fig. 2Differential gene expression analysis of host transcriptome data with regard to the use of positive pressure ventilation in infants hospitalized for bronchiolitis.
A Volcano plot of differentially expressed genes (transcriptome). The threshold of log2 fold change is |0.58| (i.e., ≥|1.5|-fold change), and that of FDR is <0.05. There were 197 differentially expressed transcripts that met these criteria. B Gene set enrichment analysis (transcriptome). We showed 30 host pathways (GO biological process) with the most significant FDR in the gene set enrichment analysis (GSEA) with downregulated pathways on the left side and upregulated pathways on the right side. We also showed the absolute normalized enrichment score, FDR, and the gene ratio for the corresponding pathways. Abbreviations: FDR false discovery rate, GO gene ontology, GSEA gene set enrichment analysis, PPV positive pressure ventilation.
Fig. 3Relationship of abundant microbial species with the risk of higher severity in infants hospitalized for bronchiolitis.
A Phylogenetic plot of top 20 most abundant microbial species in the nasopharyngeal airway of infants hospitalized for bronchiolitis. The colors in the inner circle annotate the six major phyla. The colors in the two internal rings represent the magnitude of the association between the relative abundance of each species and higher severity (PPV use and intensive care use) outcomes. Greyscale bars on the outside of the circular graph are proportional to the microbial species’ mean relative abundance. B The pirate plots show the comparison of the distribution of ten most abundant species in the nasopharyngeal microbiome in infants hospitalized for bronchiolitis, according to the PPV use. Each point represents each infant. The gray bar and rectangle represent the mean and 95% confidence interval. In the violin plots, the width represents the probability that infants take on a specific relative abundance. The between-group differences in the abundance were tested by fitting Poisson regression models. n = 244 biologically independent samples. Abbreviations: FC fold change, FDR false discovery rate, PPV positive pressure ventilation.
Fig. 4Differential gene expression analysis of microbial function data with regard to the use of positive pressure ventilation in infants hospitalized for bronchiolitis.
A Volcano plot of differentially expressed microbial transcripts (metatranscriptome). The threshold of log2 fold change is |0.58| (i.e., ≥|1.5|-fold change), and that of FDR is <0.05. There were 129 differentially expressed microbial transcripts that met these criteria. B Gene set enrichment analysis (GSEA) of the metatranscriptome data. We showed 30 microbial functional pathways (GO biological process) with the most significant FDR in the gene set enrichment analysis (GSEA). Downregulated pathways were not detected. We also showed the normalized enrichment score, FDR, and the gene ratio for the corresponding pathways. Abbreviations: FDR false discovery rate; GO gene ontology; GSEA gene set enrichment analysis.
Fig. 5Integrated associations of the dual-transcriptome modules with the use of positive pressure ventilation in infants hospitalized for bronchiolitis.
A Heatmap of the median eigenvalues (the first principal component) for the corresponding modules in each outcome group. The areas of circles and colors represent the median value of the corresponding eigenvalue. The between-group differences tested using two-tailed t-test s, accounting for multiple comparisons by applying Benjamini–Hochberg false discovery rate (FDR). Asterisks indicate statistical significance (FDR < 0.05). The exact P values and FDR are the following: In PPV use, T-cell regulation, P value = 7.3 × 10−5, FDR = 0.002; Neutrophil/IL-1, P value = 6.7 × 10−3, FDR = 0.014; GPCR, P value = 1.1 × 10−2, FDR = 0.018; Type I IFN; P value = 7.2 × 10−5, FDR = 0.014; HR-1, P value = 2.5 × 10−2, FDR = 0.034; S. pneumonia/S. aureus, P value = 1.3 × 10−2, FDR = 0.020; MC-1, P value = 1.6 × 10−1, FDR = 0.197; Moraxella, P value = 2.3 × 10−1, FDR = 0.244; Streptococcus, P value = 2.0 × 10−1, FDR = 0.226; Haemophilus, P value = 3.8 × 10−1, FDR = 0.379; Plasma membrane, P value = 1.1 × 10−17, FDR < 0.001; mRNA metabolism, P value = 2.1 × 10−4, FDR = 0.001; BCAA metabolism, P value = 6.0 × 10−3, FDR = 0.014; Oxidative stress response, P value = 6.4 × 10−5, FDR < 0.001; and NADH, P value = 5.5 × 10−5, FDR < 0.001. In intensive care use, T-cell regulation, P value = 2.0 × 10−3, FDR = 0.030; Neutrophil/IL-1, P value = 6.5 × 10−3, FDR = 0.036; GPCR, P value = 1.3 × 10−2, FDR = 0.036; Type I IFN, P value = 8.7 × 10−3, FDR = 0.036; HR-1, P value = 3.4*10−2, FDR = 0.064; S. pneumonia/S. aureus, P value = 3.4 × 10−2, FDR = 0.064; MC-1, P value = 6.2 × 10−1, FDR = 0.659; Moraxella, P value = 3.4 × 10−1, FDR = 0.422; Streptococcus, P value = 3.9 × 10−1, FDR = 0.448; Haemophilus, P value = 7.4 × 10−1, FDR = 0.739; Plasma membrane, P value = 1.4 × 10−2, FDR = 0.036; mRNA metabolism, P value = 5.0 × 10−2, FDR = 0.083; BCAA metabolism, P value = 1.4 × 10−1, FDR = 0.211; Oxidative stress response, P value = 2.0 × 10−1, FDR = 0.278; and NADH, P value = 1.4 × 10−2, FDR = 0.036. B Integrated relationship of the dual-transcriptome modules with the risk of PPV use in infants hospitalized for bronchiolitis. The adjusted odds ratio for the outcome was estimated per one unit increased in the eigenvalue (the first principal component) of the corresponding module by fitting a multivariable logistic regression model with ridge regularization. The 95% CIs were estimated by a bootstrap method with 2000 replicates. In the model, we adjusted for age, sex, and respiratory virus. Statistically significant modules are in bold. C Causal structural learning is applied to the dual-transcriptomics data. It identifies an underlying causal relationship between these host immune response (blue), microbial species (pink), and microbial function (orange) modules in the niche, and demonstrates it as a directed acyclic graph (DAG). This approach is distinctly different from a co-occurrence network, which can reparent only correlations between variables and is agnostic about their underlying causal relationships. For example, the S. pneumoniae/S. aureus module has direct effects on the microbial-mRNA metabolism module and the host neutrophil/IL-1 and type I IFN modules, which have a subsequent effect on the PPV use. Abbreviations: BCAA branched-chain amino acid, FDR false discovery rate, GPCR G-protein-coupled receptor, HR host response, IFN interferon, IL interleukin, NADH nicotinamide adenine dinucleotide hydrogen, MC microbial composition, PPV positive pressure ventilation.