| Literature DB >> 27599550 |
Maryam Nazarieh1,2, Andreas Wiese3, Thorsten Will1,2, Mohamed Hamed1,4, Volkhard Helms5.
Abstract
BACKGROUND: Identifying the gene regulatory networks governing the workings and identity of cells is one of the main challenges in understanding processes such as cellular differentiation, reprogramming or cancerogenesis. One particular challenge is to identify the main drivers and master regulatory genes that control such cell fate transitions. In this work, we reformulate this problem as the optimization problems of computing a Minimum Dominating Set and a Minimum Connected Dominating Set for directed graphs.Entities:
Keywords: Gene regulatory network; Heuristic algorithm; Integer linear programming; Minimum connected dominating set; Minimum dominating set
Mesh:
Year: 2016 PMID: 27599550 PMCID: PMC5011974 DOI: 10.1186/s12918-016-0329-5
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1A graphical representation that illustrates the MDS and MCDS solutions of an example network. The network can be controlled by MDS and MCDS nodes. In the case of a GRN, directed arcs symbolize that a transcription factor regulates a target gene. In panel a, the MDS nodes {A,B} are the dominators of the network. Together, they regulate all other nodes of the network (C, E, D). Panel b visualizes the respective set of MCDS nodes (black and gray). Here, node C is added in order to preserve the connection between the two dominators A and B to form an MCDS
Fig. 2A graphical representation that illustrates the concept of MDS on a toy network. In addition, the MCDS nodes are colored black on three types of components (LSCC, LCC of the underlying directed graph and LCC of the underlying undirected graph) in the toy network. The above toy network includes 14 nodes and 14 edges as shown in yellow in panel (a). The nodes {J, B, C, H, L} are the dominators of the network obtained by computing a MDS (right panel). The nodes colored blue in panel b, make up the largest connected component of the underlying undirected graph. MCDS nodes for this component are {J, D, B, C, G, H}. Green colored nodes in panel c are elements of the largest connected component underlying the directed graph. The two nodes {B, C} form the MCDS for this component. The nodes colored orange in panel d show the LSCC in the network. Here, the node A is the only element of the MCDS
Fig. 3Tightly interwoven network of 17 TFs and target genes that organize the cell cycle of S. cerevisiae. Shown on the circumference of the outer circle are 164 target genes that are differentially expressed during the cell cycle. The inner circle consists of the 14 TFs from the heuristic MCDS and of 123 other target genes that are regulated by at least two of these TFs
Fig. 4Connectivity among TFs in the heuristic MCDS of the largest strongly connected component of a GRN for mouse ESCs. The red circle borders mark the 7 TFs belonging to the set of master regulatory genes identified experimentally in [39]
Fig. 5Percentage overlap of the genes of the MDS and MCDS with the list of top genes (same size as MCDS) according to 3 centrality measures. Shown is the percentage of genes in the MDS or MCDS that also belong to the list of top genes with respect to degree, betweenness and closeness centrality
Identified genes in the MDS and MCDS (ILP) for 10 modules of the breast cancer network
| Method | Module | Network size | Result size | Key driver genes |
|---|---|---|---|---|
| MCDS | Black | 41 | 5 | ZNF254, KIAA1632, ZNF681, SEC24B, ZNF615 |
| MDS | 5 | ZNF254, KIAA1632, ZNF681, SEC24B, ZNF615 | ||
| MCDS | Blue | 247 | 3 | FAM54A, |
| MDS | 2 |
| ||
| MCDS | Brown | 195 | 1 |
|
| MDS | 1 |
| ||
| MCDS | Green | 110 | 18 |
|
| MDS | 17 |
| ||
| MCDS | Magenta | 26 | 4 | ILF2, |
| MDS | 4 | ILF2, | ||
| MCDS | Pink | 30 | 5 | TCEB1, RAB2A, ZNF706, TMEM70, |
| MDS | 5 | TCEB1, RAB2A, TMEM70, TCEA1, | ||
| MCDS | Red | 93 | 13 | SIX4, |
| MDS | 13 | LSM11, SIX4, PCGF1, SUMF2, EPN3, ZNRF2, GTF3A, RAP1B, FHL3, RPS3A, | ||
| MCDS | Tur quoise | 295 | 1 |
|
| MDS | 1 |
| ||
| MCDS | Yellow | 132 | 20 |
|
| MDS | 20 |
|
The genes, whose protein products are known to be targeted by drugs, are marked in bold
Fig. 6Number of MCDS genes determined by the heuristic approach or by the ILP formulation and in the MDS. Shown are the results for 9 modules of the breast cancer network
Overlapping genes between the heuristic and optimal solutions of MCDS for modules of the breast cancer network. The names of the modules were introduced in the original ref. [9]
| Module | Shared genes | Count |
|---|---|---|
| black | SEC24B, ZNF254, ZNF681 | 3 |
| green | UTP14A, LTBR, SH3GLB2, | |
| OS9, CDC34, CDC37, AKT1 | 7 | |
| magenta | BGLAP, ATF6, ILF2 | 3 |
| pink | ZNF706, TCEB1, TMEM70 | 3 |
| red | FHL3, SUMF2, RPS3A, PCGF1, | |
| EPN3, GTF3A, ATP1B1 | 7 | |
| yellow | FUT4, SPI1, DFNA5, CASP10, PAG1, | |
| HDAC11, LCP2, TRAF3IP3, HTRA4, TSPAN2, GZMK | 9 | |
| blue | ACAN, FAM54A | 2 |
The modules brown and turquoise have only 1 mcds gene and give 100 % overlap