| Literature DB >> 29232861 |
Jialu Hu1,2, Xuequn Shang3.
Abstract
Network motifs are patterns of complex networks occurring significantly more frequently than those in random networks. They have been considered as fundamental building blocks of complex networks. Therefore, the detection of network motifs in transcriptional regulation networks is a crucial step in understanding the mechanism of transcriptional regulation and network evolution. The search for network motifs is similar to solving subgraph searching problems, which has proven to be NP-complete. To quickly and effectively count subgraphs of a large biological network, we propose a novel graph canonization algorithm based on resolving sets. This method has been implemented in a command line interface (CLI) program sgip using the SeqAn library. Comparing to Babai's algorithm, this approach has a tighter complexity bound, o ( exp ( n log 2 n + 4 log n ) ) , on strongly regular graphs. Results on several simulated datasets and transcriptional regulation networks indicate that sgip outperforms nauty on many graph cases. The source code of sgip is freely accessible in https://github.com/seqan/seqan/tree/master/apps/sgip and the binary code in http://packages.seqan.de/sgip/.Entities:
Keywords: algorithms; graph canonization; network motif
Mesh:
Year: 2017 PMID: 29232861 PMCID: PMC6150038 DOI: 10.3390/molecules22122194
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Metric dimension of the Petersen graph. Checking each node, we find that is one possible resolving set, and no smaller resolving set exists. Hence, the metric dimension of the Petersen graph is .
Figure 2Performance comparison of sgip and nauty on different types of graph. Each experiment was tested on 100 couples of isomorphic graphs. (a,b) Horizontal axis represents the density of tested graphs (density is the ratio of the number of existing directed edges and n(n − 1)), vertical axis represents the running time of the responding tool. (c,d,e) Horizontal axis represents the number of vertices of the tested graphs, and the vertical axis is run-time on mesh graphs.
Statistically significant patterns appearing in transcriptional regulation networks. Pattern A refers to a coherent feed-forward loop whose connections are x→y→z and x→z. Pattern B refers to these subgraphs with single input module (>10 nodes). Pattern C refers to pairs of operons controlled by the same two transcription factors.
| Patterns | Appearances in Real Network | Appearances in Randomized Network | ||||
|---|---|---|---|---|---|---|
| Yeast | Yeast | Yeast | ||||
| A | 65 | 34 | 10.2 ± 5 | 4.5 ± 2 | 0 | 0 |
| B | 122 | 79 | 33.5 ± 12 | 30 ± 6 | 8.2 × 10−14 | 1.1 × 10−16 |
| C | 396 | 203 | 108 ± 29 | 55 ± 10 | 0 | 0 |