| Literature DB >> 34508139 |
Cristina Cheroni1,2,3, Lara Manganaro1, Lorena Donnici1, Valeria Bevilacqua1, Raoul J P Bonnal4,5, Riccardo L Rossi6, Raffaele De Francesco7,8.
Abstract
Interferons (IFNs) are key cytokines involved in alerting the immune system to viral infection. After IFN stimulation, cellular transcriptional profile critically changes, leading to the expression of several IFN stimulated genes (ISGs) that exert a wide variety of antiviral activities. Despite many ISGs have been already identified, a comprehensive network of coding and non-coding genes with a central role in IFN-response still needs to be elucidated. We performed a global RNA-Seq transcriptome profile of the HCV permissive human hepatoma cell line Huh7.5 and its parental cell line Huh7, upon IFN treatment, to define a network of genes whose coordinated modulation plays a central role in IFN-response. Our study adds molecular actors, coding and non-coding genes, to the complex molecular network underlying IFN-response and shows how systems biology approaches, such as correlation networks, network's topology and gene ontology analyses can be leveraged to this aim.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34508139 PMCID: PMC8433181 DOI: 10.1038/s41598-021-97258-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Experimental design.
| Cell line | Treatment (# of samples) | Control (# of samples) |
|---|---|---|
| Huh7 | IFN alpha (4) | Mock (3) |
| IFN beta (3) | Mock (3) | |
| IFN gamma (4) | Mock (4) | |
| Huh7.5 | IFN alpha (4) | Mock (4) |
| IFN beta (3) | Mock (3) | |
| IFN gamma (4) | Mock (4) |
The 6 biological comparisons (3 treatments in each cell line) are described with the number of biological replicates (from 3 to 4) for each condition in parenthesis.
Figure 1Differential expression analysis on RNASeq transcriptome profiling. (a) Number of differentially expressed genes that are significantly modulated in each treatment compared to mock-treated cells after DESeq2 analysis with FDR < 0.05 and absolute Log2FC > 1. Over expressed (blue) and downregulated genes (rust red): dark colours for protein coding genes (actual numbers in bold) and light colours for non-coding genes (actual numbers in brackets). (b–d) Venn diagrams illustrating the overlap of modulated genes across treatments. Panel (b) shows the overlap between IFNα and IFNβ treatments in the Huh7 line, while panel (c) shows the same results for Huh7.5. In both lines, the genes modulated by IFNλ are entirely included in the overlapping region of other two Interferons. Panel (d) shows how all genes in (b) and (c) overlap, showing a group of 185 genes modulated in all cell lines and by both type I IFN perturbations.
Figure 2Generation of the gene co-expression network and selection of modules associated with IFN treatment. (a) Gene dendrogram generated on the Topological Overlap Dissimilarity matrix: each branch of the dendrogram corresponds to a module of highly interconnected genes, for a total of 12 modules identified. The color row below the dendrogram represents the module assignment. (b) Identified modules with relative colour “names”: numbers beside each colour block indicates the number of genes assigned to each of them. The heat map shows the correlation values (Pearson correlation degree “Cor” and statistical significance “Pval”) between each module eigengene and the induction of IFIT2 and MX1 transcripts measured by real-time PCR. Negative correlations are in blue, positive correlations are in red. Only Green and Tan modules show a significant positive correlation with the ISGs expression.
Figure 3Functional characterization of Green and Tan modules. (a,b) Results of gene ontology enrichment analysis performed on Green (a) and Tan (b) sub-networks; bar plots depict the –log p-values for the top-15 Biological Process terms. (c) Tabular results of the analysis of regulatory motifs performed on Green and Tan sub-networks by IRegulon. The motif clusters identified as significantly enriched for each module (NES > 5) are reported. (d,e) Circular plots illustrating the regulatory relationship among each motif cluster and its putative target genes for Green and Tan modules. The thickness of each line and correspondent gene sector represents the number of motifs in the cluster found as putative regulators of the gene.
Figure 4Green and Tan sub networks of putative new ISGs. (a) The method for extraction of putative, previously unknown IFN sensitive genes, using network pruning by centrality metrics cutoffs and subtractive gene ontology annotations. (b) Eigenvectors and closeness of genes in the Green module. These two metrics are linearly correlated above a certain threshold: higher eigenvector values correspond to higher closeness and identify hub genes. Nodes with eigenvector < 0.02 are defined as peripheral and depicted as transparent. (c) Eigenvectors and closeness of genes in the Tan module. (d) Cytoscape-generated sub-network for the Green module. Gene ontology annotation of surviving nodes after subtractive gene ontology annotation are displayed, while nodes (genes) associated to immune-related biological processes are not, allowing focus on non-peripheral novel players. Eigenvector is proportional to node size, closeness to color gradient. (e) Similar cytoscape-generated sub-network for the Tan module.
Novel IFN sensitive genes.
| Source module | Gene name | Ensembl Gene ID | Biotype | Description (Biomart) |
|---|---|---|---|---|
| Green | SIDT1 | ENSG00000072858 | protein_coding | SID1 transmembrane family member 1 |
| Green | CYTH1 | ENSG00000108669 | protein_coding | cytohesin 1 |
| Green | BLZF1 | ENSG00000117475 | protein_coding | basic leucine zipper nuclear factor 1 |
| Green | DNPEP | ENSG00000123992 | protein_coding | aspartyl aminopeptidase |
| Green | RARRES3 | ENSG00000133321 | protein_coding | phospholipase A and acyltransferase 4 |
| Green | MOB3C | ENSG00000142961 | protein_coding | MOB kinase activator 3C |
| Green | NME7 | ENSG00000143156 | protein_coding | NME/NM23 family member 7 |
| Green | GRIP2 | ENSG00000144596 | protein_coding | glutamate receptor interacting protein 2 |
| Green | GPR158 | ENSG00000151025 | protein_coding | G protein-coupled receptor 158 |
| Green | MFSD12 | ENSG00000161091 | protein_coding | major facilitator superfamily domain containing 12 |
| Green | KCNT2 | ENSG00000162687 | protein_coding | potassium sodium-activated channel subfamily T member 2 |
| Green | PLEKHA7 | ENSG00000166689 | protein_coding | pleckstrin homology domain containing A7 |
| Green | GRINA | ENSG00000178719 | protein_coding | glutamate ionotropic receptor NMDA type subunit associated protein 1 |
| Green | ZNF107 | ENSG00000196247 | protein_coding | zinc finger protein 107 |
| Green | LAP3P2 | ENSG00000213500 | processed_pseudogene | leucine aminopeptidase 3 pseudogene 2 |
| Green | SMTNL1 | ENSG00000214872 | protein_coding | smoothelin like 1 |
| Green | AC009948.5 | ENSG00000223960 | antisense | cholesterol induced regulator of metabolism RNA |
| Green | PSME2P2 | ENSG00000225131 | processed_pseudogene | proteasome activator subunit 2 pseudogene 2 |
| Green | AC009950.2 | ENSG00000225963 | antisense | novel transcript, antisense to SP110 |
| Green | AP001610.5 | ENSG00000228318 | antisense | novel transcript |
| Green | PNPT1P1 | ENSG00000229241 | processed_pseudogene | polyribonucleotide nucleotidyltransferase 1 pseudogene 1 |
| Green | PATL2 | ENSG00000229474 | protein_coding | PAT1 homolog 2 |
| Green | NAMPTL | ENSG00000229644 | processed_pseudogene | nicotinamide phosphoribosyltransferase pseudogene 1 |
| Green | HLA-K | ENSG00000230795 | unprocessed_pseudogene | major histocompatibility complex, class I, K (pseudogene) |
| Green | RP11-274E7.2 | ENSG00000238000 | processed_pseudogene | proteasome activator subunit 2 pseudogene 1 |
| Green | RP11-54O7.17 | ENSG00000272512 | lincRNA | novel transcript |
| Tan | STOML1 | ENSG00000067221 | protein_coding | stomatin like 1 |
| Tan | NANS | ENSG00000095380 | protein_coding | N-acetylneuraminate synthase |
| Tan | ATP10B | ENSG00000118322 | protein_coding | ATPase phospholipid transporting 10B (putative) |
| Tan | PRRG4 | ENSG00000135378 | protein_coding | proline rich and Gla domain 4 |
| Tan | NIPSNAP3A | ENSG00000136783 | protein_coding | nipsnap homolog 3A |
| Tan | ATHL1 | ENSG00000142102 | protein_coding | protein-glucosylgalactosylhydroxylysine glucosidase |
| Tan | GPR128 | ENSG00000144820 | protein_coding | adhesion G protein-coupled receptor G7 |
| Tan | TMCO4 | ENSG00000162542 | protein_coding | transmembrane and coiled-coil domains 4 |
| Tan | AGBL2 | ENSG00000165923 | protein_coding | ATP/GTP binding protein like 2 |
| Tan | PLEKHN1 | ENSG00000187583 | protein_coding | pleckstrin homology domain containing N1 |
| Tan | AC009299.3 | ENSG00000227403 | lincRNA | long intergenic non-protein coding RNA 1806 |
| Tan | RNF144A-AS1 | ENSG00000228203 | processed_transcript | glycosaminoglycan regulatory associated long non-coding RNA |
| Tan | RP11-395B7.7 | ENSG00000260336 | Sense_overlapping | |
| Tan | CADPS2 | ENSG00000081803 | protein_coding | calcium dependent secretion activator 2 |
The 40 novel putative IFN sensitive genes (ISGs) annotated with the source module they were extracted from (Green or Tan modules), gene names, Ensembl gene IDs, biotype and description as retrieved from Ensembl Biomart.
Figure 5New IFN-stimulated signature is confirmed as ISGs with GSEA. (a) GSEA running sums showing the enrichment of custom geneset obtained by the combination of the Green and Tan modules in the Boolen et al. 2014 datasets (GSE48400) after 12 h of induction with IFN alpha, beta and lambda1. (b) Venn diagram of overlapping genes belonging to the leading edges of the three GSEA analyses in the previous panel. Seven genes (DNPEP, REC8, GRINA, CYTH1, PCK2, NME7, SIDT1) have been consistently found upregulated after different IFNs induction. (c) Normalized enrichment scores (NES) and relative GSEA p-Values for all sets of performed GSEAs (for datasets with more time points the 12 h point was selected and plotted). Dashed horizontal line is the standard GSEA significant cutoff: enrichments whose p-Value is above the dashed line are thus significant. Green, Tan and Total refer to genesets from Green module, Tan module and the combination of the two, respectively.