| Literature DB >> 27403523 |
Deborah Chasman1, Kevin B Walters2, Tiago J S Lopes2, Amie J Eisfeld2, Yoshihiro Kawaoka2,3,4,5, Sushmita Roy1,6,7.
Abstract
Mammalian host response to pathogenic infections is controlled by a complex regulatory network connecting regulatory proteins such as transcription factors and signaling proteins to target genes. An important challenge in infectious disease research is to understand molecular similarities and differences in mammalian host response to diverse sets of pathogens. Recently, systems biology studies have produced rich collections of omic profiles measuring host response to infectious agents such as influenza viruses at multiple levels. To gain a comprehensive understanding of the regulatory network driving host response to multiple infectious agents, we integrated host transcriptomes and proteomes using a network-based approach. Our approach combines expression-based regulatory network inference, structured-sparsity based regression, and network information flow to infer putative physical regulatory programs for expression modules. We applied our approach to identify regulatory networks, modules and subnetworks that drive host response to multiple influenza infections. The inferred regulatory network and modules are significantly enriched for known pathways of immune response and implicate apoptosis, splicing, and interferon signaling processes in the differential response of viral infections of different pathogenicities. We used the learned network to prioritize regulators and study virus and time-point specific networks. RNAi-based knockdown of predicted regulators had significant impact on viral replication and include several previously unknown regulators. Taken together, our integrated analysis identified novel module level patterns that capture strain and pathogenicity-specific patterns of expression and helped identify important regulators of host response to influenza infection.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27403523 PMCID: PMC4942116 DOI: 10.1371/journal.pcbi.1005013
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 5Multi-task Group LASSO (MTG-LASSO) structured-sparsity approach for integration of protein data with expression-based regulatory module networks.
A. Illustration of MTG-LASSO framework for predicting protein regulators for one module (Methods). Horizontal separations in the and data boxes represent different virus time course. Rows of and represent time points. Columns of correspond to proteins and columns of correspond to mRNA levels of genes in a module. B. Comparison of number of nonzero regression weights identified by MTG-LASSO and LASSO. Each dot (MTG-LASSO) or plus sign (LASSO) represents the number of non-zero regression weights for one setting of λ (the sparsity parameter) for one module. Number of nonzero weights is averaged over 10 folds of cross-validation. C. Comparison of cross-validation predictive quality between MTG-LASSO and LASSO. Results are shown for λ = {0.10, 0.75}; results for other settings are in In each scatterplot, there is one point per module. Left two plots compare methods based on Pearson correlation (ρ) of predicted to actual expression values; right plots compare on the basis of root mean squared error (RMSE). Inset ρ gives Pearson correlation between MTG-LASSO and LASSO scores. Diagonal line is shown for comparison. D/E. Examples of curves used to select λ for individual modules; human (D) and mouse (E). Y-axis gives Pearson correlation (cross-validation predictive quality); X-axis gives the average number of nonzero regression weights for that module; this value is higher than the final number of high-confidence, high-weight regulators. Stars indicate the chosen value of λ for the example modules.
Catalog of MERLIN modules for human and mouse systems.
| Host system | Module size ( | Number of modules | Number of genes |
|---|---|---|---|
| 2 | 3,272 | ||
| 10 | 39 | 1,541 | |
| 2,387 | |||
| 2 | 1,526 | ||
| 10 | 54 | 1,418 | |
| 4,296 |
"Module size" refers to the number of genes in a module. 'Number of genes' gives the number of genes covered by modules of a given size.
Summary of results of siRNA validation study of twenty regulators predicted by MERLIN.
| Result for four siRNAs | Regulator(s) |
|---|---|
| BOLA1, HCLS1, HOXA7 | |
| IRAK3, FGFR3, YTHDC1 | |
| ≥ 2 reduce, none induce | KCNIP3, TCEB1 |
| ≥ 2 reduce, one induces | DDX20, FIG4, MET, PTPRF, SAG, TP53BP2, WDR81 |
| ≥ 2 induce, one reduces | IGBP1 |
| Other | ANKRD2, FKBP15, PHF3, TNS53 |
Four siRNAs were used per gene. siRNA results are reported for significant changes in viral titer as assessed by T-test compared to negative control. Hits are defined as those that show a strong, significant and consistent effect across multiple siRNAs. Full data and results are available in S7 Table.
Consensus human protein regulators identified by MTG-LASSO.
| Category | Gene (Protein name/family) | Module(s) |
|---|---|---|
| PRPF31 | 1487 | |
| MAGED2 (Melanoma antigen) | 1549 | |
| THBS1 (Thrombospondin) | 1434, 1472, 1482, 1485, 1502, 1540, 1543, 1549, 1596 | |
| APP | 1484, 1501, 1549 | |
| SERPINA3 (Serpin peptidase inhibitor) | 1484 | |
| ISG15 | 1484, 1501, 1549 | |
| DDX50 (ATP-dependent RNA helicase) | 1482, 1487 | |
| EHD4 (EH-domain containing) | 1482 | |
| STMN4 (Stathmin) | 1472, 1540 | |
| COLGALT1 (Collagen beta(1–0) galactosyltransferase) | 1549 | |
| ITGB4 | 1484 | |
| PMM2 (Phosphomannomutase) | 1487 | |
| IARS2 (Isoleucil-tRNA synthetase) | 1543 | |
| NFS1 (Cystein desulfurase) | 1484 | |
| YME1L (ATP-ase) | 1487 | |
| HIST1H1B (Histone protein) | 1472, 1540 | |
| KLHL33 (Kelch-like family) | 1549 |
Genes are categorized by major biological annotation according to NCBI Entrez, GeneCards, or UniProt. The Module column lists the MTG-LASSO predicted target modules of a protein regulator. Genes that have been identified as relevant to influenza by a screening or literature study are marked with a superscript: Brass et al. 2009B; Karlas et al. 2010K, König et al. 2010O; Shapira et al. 2009S.
Top consensus mouse protein regulators identified by MTG-LASSO.
| Category | Gene (Protein name/family) | Module(s) |
|---|---|---|
| Sumo2 | 3135, 3147, 3280 | |
| C3 (Complement protein) | 2810, 2975, 3208,3210 | |
| Serpina3k (Serpin peptidase inhibitor) | 3029, 3206 | |
| Serpina3m (Serpin peptidase inhibitor) | 3029, 3159, 3181, 3184, 3186, 3192, 3206, 3207 | |
| S100-A8 | 3141, 3154 | |
| Hp (Haptoglobin) | 3207, 3210 | |
| Hpx (Hemopexin) | 2899, 2976, 3159, 3184, 3192, 3198, 3207 | |
| Letmd1 (LETM1-domain containing) | 2950, 3047, 3072, 3135, 3139, 3179, 3187 | |
| Snrpf | 3056, 3134, 3135, 3139, 3181, 3193 | |
| Fgb (Fibrinogen bet-chain) | 2810, 2977, 3056, 3156 | |
| Hopx (HOP homeobox) | 3154, 3280 | |
| 2310036022Rik | 3070, 3159, 3199, 3206, 3249 |
Genes are categorized by major biological annotation according to NCBI Entrez, GeneCards, or UniProt. The Module column lists the MTG-LASSO predicted target modules of a protein regulator. Because there were 33 total, we limit presentation here to twelve regulators that were linked to at least two modules. A full list of consensus proteins is available in . Genes for which human homologs have been identified as relevant to influenza by a screening or literature study are marked with a superscript: Karlas et al. 2010; König et al.2010; Watanabe et al. 2014.
Summary of results of siRNA validation study of seven human protein regulators predicted by MTG-LASSO.
| Regulator | Majority viral phenotype |
|---|---|
| APP | Anti |
| HIST1H1B | Pro |
| ISG15 | Pro |
| ITGB4 | Anti |
| PRPF31 | Pro |
| SERPINA3 | Anti |
| THBS1 | Pro |
Four siRNAs were used per gene. All genes had multiple siRNAs with significant effects, and all had at least one at least ten-fold. However, for most genes, different siRNAs resulted in different effects on virus titer; therefore we characterize each by its majority viral phenotype among the significant siRNAs. Full data and results are available in .
Characterization of time- and virus-specific subnetworks.
| Cluster | Timing | In-cluster viruses | Out-of-cluster viruses | Enriched pathways | Enriched regulators |
|---|---|---|---|---|---|
| Sustained | H1N1 CA04 and NL; H5N1 NS1trunc | H5N1 WT, PB1-F2del, PB2-627E | JAK-STAT signaling | FOSL1, IRF7, ISG15p, LMO2, MAFF, MLKL, NFS1p, NMI, NUPR1, SP110, SSTR2, STAT1, TRIM21, TRIM5 | |
| Sustained | H1N1 CA04 and NL; H5N1 WT, PB1-F2del, PB2-627E | NS1trunc | GPCR signaling | SAG, YTHDC1 | |
| Sustained | H5N1 WT, PB1-F2del, PB2-627E | H1N1 CA04 and NL; H5N1 NS1trunc | Muscle contraction | DDR2, HCLS1, HOXA7, SAG, TAL2, YTHDC1 | |
| Mid-late | H5N1 WT, NS1trunc, PB1-F2del, PB2-627E | H1N1 CA04 and NL | PPARA | CCRN4L, CDKL2, CDKL3, CITED2, ETV3, MXD1, NR4A2, PAPOLG, PPP1R10, SNAI1, TCEB1, TOB1, WNK4 | |
| Very late | H5N1 WT, NS1trunc, PB1-F2del, PB2-627E | H1N1 CA04 and NL | Circadian clock, Wnt signaling | ACVR1B, ERCC2, FBXL19, FOXH1, NKX2-5, PPP1R11, SAP30BP, TLX2 |
"Timing" describes the general timing under which the cluster is active; a value of 'sustained means that the cluster appears active throughout every time course. "In-cluster" lists the viruses represented in the cluster. 'Out-of-cluster' lists the viruses mostly absent from the cluster. "Enriched pathways" lists major enriched pathway terms
* indicates that the pathway was enriched only among regulators of the cluster network. "Enriched regulators" are the subset of cluster regulators for which their targets are statistically enriched in the cluster relative to all clusters.