| Literature DB >> 27307624 |
Siddhartha Jain1, Joel Arrais2, Narasimhan J Venkatachari3, Velpandi Ayyavoo3, Ziv Bar-Joseph4.
Abstract
MOTIVATION: Most methods for reconstructing response networks from high throughput data generate static models which cannot distinguish between early and late response stages.Entities:
Mesh:
Year: 2016 PMID: 27307624 PMCID: PMC4908338 DOI: 10.1093/bioinformatics/btw254
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Overview of TimePath. (Left) Several time series and static datasets are used as input. (Top right) Based on these inputs an initial set of pathways is selected such that each starts at a source (a host protein interacting with a virus protein), ends in a target (a DE gene for one of the time points) and contains PPI and Protein–DNA edges. (Middle right) Next, Integer Programming (IP) is performed to select a subset of these pathways. (Bottom right) The resulting pathways explain the dynamics of the observed response including the different expression phases observed for different genes
Fig. 2.Dynamic signaling and regulatory network for HIV-1 immune response. The red nodes are the host proteins that interact with the HIV-1 proteins (selected sources). Blue nodes are intermediate signaling proteins and green nodes are the TFs that are predicted to directly up/down-regulate the differential expression of target genes (targets not shown in figure, but the average levels of the regulated targets for each TF is presented by the yellow nodes while the size of each of the yellow nodes indicates how many genes belong to the cluster represented by the node). The figure displays the top predicted nodes for each of the three phases and also demonstrates is directly linked to the sources via the signaling proteins and DE genes in earlier phases. Diamond shaped nodes were identified as supported RNAi screen hits (text) and rectangular nodes are targets for the phase they are in. Nodes with bold blue border represent proteins we experimentally tested. Note that some intermediate proteins may also be TFs. The functional role in the network figure is based on the location of the protein in the selected paths based on the IP
Phase ranking for top genes
| Phase | Gene | R1 | R2 | R3 | Expression change direction |
|---|---|---|---|---|---|
| 1 | EP300 | 1 | 2 | 2 | Up |
| 1 | TP53 | 2 | 5 | 4 | Up |
| 1 | HDAC1 | 3 | 6 | 6 | Up |
| 1 | RELA | 4 | 20 | 10 | Up |
| 1 | RB1 | 5 | 4 | 3 | Up |
| 1 | BRCA1 | 6 | 8 | 11 | Up |
| 1 | PCNA | 7 | 11 | 9 | Up |
| 1 | SUMO1 | 8 | 9 | 8 | Up |
| 1 | HDAC2 | 9 | 22 | 14 | Up |
| 1 | CEBPB | 10 | 21 | 12 | Up |
| 1 | DNMT1 | 15 | 23 | 15 | Up |
| 1 | STAT1 | 27 | 25 | 33 | Up |
| 1 | RAF1 | 28 | 66 | 39 | Up |
| 1 | CDK2 | 29 | 59 | 41 | Up |
| 2 | JUN | NP | 1 | 1 | Up |
| 2 | ATF2 | 143 | 7 | 5 | No change |
| 2 | CALM3 | 127 | 10 | 106 | Up |
| 2 | CD4 | 136 | 12 | 109 | Up |
| 2 | STAT5B | 105 | 13 | 86 | Up |
| 2 | CCND3 | 91 | 14 | 100 | Up |
| 2 | SMARCB1 | 92 | 15 | 97 | Up |
| 2 | AP1B1 | 124 | 16 | 114 | Up |
| 2 | SKI | NP | 17 | 147 | Up |
| 2 | AP2B1 | 138 | 18 | 130 | Up |
| 3 | FOS | NP | NP | 7 | Up |
| 3 | PSMA4 | NP | NP | 23 | Up |
| 3 | DDIT3 | NP | NP | 25 | Up |
| 3 | GTF2H1 | NP | NP | 26 | No change |
| 3 | SGTA | NP | NP | 36 | Down |
| 3 | JUNB | 224 | 148 | 38 | Down |
| 3 | JUND | 276 | NP | 40 | Down |
| 3 | GNB2L1 | 118 | NP | 46 | No change |
| 3 | UBB | 113 | 155 | 47 | Down |
| 3 | VAV1 | 112 | 170 | 49 | Down |
| 3 | LCK | NP | NP | 291 | Down |
R1/2/3 indicates the rank of the gene in phase 1/2/3. If the rank is ‘NP’, that means the gene was not found to be present in the phase. Genes tested experimentally are colored red (see Supplementary Tables S1–S3 for complete rankings). For later phases we focused on genes that were ranked high for these phases compared to their rank in an earlier phase. Genes with absolute log fold change expression <2 are designated as not being differential expressed.
Overlap between RNAi screen hits and top 100 genes for the different dynamic network reconstruction methods and between edge list from Reactome (1265 edges in network) and the edges extracted by the different methods
| Method | Overlap with screen hits | Overlap with Reactome edges | ||
|---|---|---|---|---|
| TimePath | 23 | 101/3203 | ||
| SDREM | 21 | 74/3203 | ||
| TimeXnet | 16 | 54/2585 | ||
| DE ranking | 5 | 0.23 | NA | NA |
Comparison with a baseline ranking of the differentially expression (DE) genes is also presented.
GO comparison
| Method | % of immune-related categories | |
|---|---|---|
| TimePath | 11.16 (72/645) | |
| SDREM | 8.04 (71/883) | 0.077 |
| TimeXnet | 10.44 (66/632) |
We give the % of immune-related categories as well as the absolute number of immune related categories and total categories enriched for in parenthesis. The P-value cutoff for all categories was 0.05. The GO enrichment was performed on the top 100 genes as ranked by path flow (Section 2) using the FuncAssociate tool (Berriz ).
Overlap with HIV screen hits at various stages of the algorithm
| Stage | Overlap | Overlap % |
|---|---|---|
| Pre-algorithm | 364/16 671 | 2.1 |
| Unexpressed genes filtered | 246/6604 | 3.7 |
| After pathway search | 144/1374 | 10.4 |
| After IP | 85/607 | 14.0 |
‘Pre-algorithm’ is the initial overlap for all genes in the network. ‘Unexpressed genes filtered’ is when we remove all genes from our interaction network that are unexpressed. ‘After pathway search’ is that stage that uses all genes included in the initial top scoring set of pathways. ‘After IP’ is the final stage after the IP (and thus the whole algorithm) has run. As can be seen, the IP step seems to further improve the resulting set of genes indicating that the selection process indeed identifies HIV response pathways.
Validation for the time constraint.
| Method | Overlap | |
|---|---|---|
| TimePath | 101/3203 | |
| TimePath without time constraint | 37/3203 |
Fig. 3.Experimental validations. Relative infection after treatment with inhibitors. Significant changes in infection are highlighted with a *. The inhibitor names are given on the X axis and the target proteins of the inhibitors are given in parenthesis. See also supporting Figure S3 for the full list of targets for each inhibitor