| Literature DB >> 28938012 |
Maryam Shahdoust1, Hamid Pezeshk2, Hossein Mahjub1, Mehdi Sadeghi3.
Abstract
The Common topological features of related species gene regulatory networks suggest reconstruction of the network of one species by using the further information from gene expressions profile of related species. We present an algorithm to reconstruct the gene regulatory network named; F-MAP, which applies the knowledge about gene interactions from related species. Our algorithm sets a Bayesian framework to estimate the precision matrix of one species microarray gene expressions dataset to infer the Gaussian Graphical model of the network. The conjugate Wishart prior is used and the information from related species is applied to estimate the hyperparameters of the prior distribution by using the factor analysis. Applying the proposed algorithm on six related species of drosophila shows that the precision of reconstructed networks is improved considerably compared to the precision of networks constructed by other Bayesian approaches.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28938012 PMCID: PMC5609748 DOI: 10.1371/journal.pone.0184795
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Overview of F-MAP algorithm.
Fig 2The phylogenetic tree of species.
The graph is reproduced with the permission of Joshi et al. (2015).
Number of target genes for 12 transcription factors (TFs).
| TF | zD | twi | slp1 | Sna | run | prd | mad | kr | hb | dl | da | cad |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number | 1166 | 1164 | 212 | 291 | 158 | 313 | 40 | 518 | 358 | 1503 | 795 | 273 |
These TF constitute the gold standard network. The gold standard network includes 6791 edges.
Measures of diagnostic accuracy of reconstructed networks for six species.
| Main species | approach | Related species | Edges | True positive | Precision | Recall | Accuracy | Specificity |
|---|---|---|---|---|---|---|---|---|
| ana | amel | 810 | 340 | 0.42 | 0.05 | 0.72 | 0.97 | |
| sim | 856 | 324 | 0.38 | 0.05 | 0.71 | 0.97 | ||
| per | 1285 | 509 | 0.40 | 0.07 | 0.71 | 0.96 | ||
| pse | 1036 | 474 | 0.46 | 0.07 | 0.72 | 0.97 | ||
| vir | 1721 | 609 | 0.35 | 0.09 | 0.70 | 0.94 | ||
| 1635 | 590 | 0.36 | 0.08 | 0.70 | 0.94 | |||
| 2230 | 742 | 0.33 | 0.11 | 0.69 | 0.92 | |||
| 480 | 167 | 0.35 | 0.02 | 0.71 | 0.98 | |||
| amel | ana | 860 | 393 | 0.45 | 0.06 | 0.72 | 0.97 | |
| Sim | 976 | 472 | 0.48 | 0.07 | 0.72 | 0.97 | ||
| per | 1183 | 517 | 0.44 | 0.08 | 0.72 | 0.96 | ||
| pse | 1001 | 513 | 0.51 | 0.07 | 0.72 | 0.97 | ||
| vir | 1604 | 612 | 0.38 | 0.09 | 0.71 | 0.94 | ||
| 1647 | 738 | 0.45 | 0.11 | 0.72 | 0.95 | |||
| 1736 | 758 | 0.44 | 0.11 | 0.71 | 0.94 | |||
| 390 | 207 | 0.53 | 0.03 | 0.72 | 0.99 | |||
| sim | amel | 802 | 349 | 0.43 | 0.05 | 0.72 | 0.97 | |
| ana | 819 | 303 | 0.37 | 0.04 | 0.71 | 0.97 | ||
| per | 1246 | 478 | 0.38 | 0.07 | 0.71 | 0.96 | ||
| pse | 984 | 445 | 0.45 | 0.06 | 0.72 | 0.97 | ||
| vir | 1739 | 633 | 0.36 | 0.09 | 0.70 | 0.94 | ||
| 1550 | 574 | 0.37 | 0.08 | 0.70 | 0.94 | |||
| 2461 | 968 | 0.39 | 0.14 | 0.70 | 0.9 | |||
| 619 | 274 | 0.44 | 0.04 | 0.72 | 0.98 | |||
| per | ana | 1595 | 710 | 0.45 | 0.10 | 0.72 | 0.95 | |
| sim | 1556 | 707 | 0.45 | 0.10 | 0.72 | 0.95 | ||
| amel | 1438 | 678 | 0.47 | 0.10 | 0.72 | 0.96 | ||
| pse | 1761 | 823 | 0.47 | 0.11 | 0.72 | 0.95 | ||
| vir | 2014 | 791 | 0.39 | 0.11 | 0.71 | 0.93 | ||
| 2389 | 994 | 0.42 | 0.14 | 0.71 | 0.92 | |||
| 1980 | 770 | 0.39 | 0.11 | 0.70 | 0.93 | |||
| 423 | 179 | 0.42 | 0.03 | 0.72 | 0.99 | |||
| pse | ana | 1318 | 624 | 0.47 | 0.09 | 0.72 | 0.96 | |
| sim | 1304 | 600 | 0.46 | 0.09 | 0.72 | 0.96 | ||
| per | 1608 | 696 | 0.43 | 0.10 | 0.71 | 0.95 | ||
| amel | 1162 | 590 | 0.50 | 0.08 | 0.72 | 0.97 | ||
| vir | 1959 | 793 | 0.40 | 0.11 | 0.71 | 0.93 | ||
| 2143 | 932 | 0.43 | 0.13 | 0.71 | 0.93 | |||
| 1859 | 600 | 0.45 | 0.14 | 0.70 | 0.93 | |||
| 432 | 186 | 0.43 | 0.02 | 0.72 | 0.99 | |||
| Vir | ana | 1951 | 890 | 0.46 | 0.13 | 0.72 | 0.94 | |
| sim | 2031 | 915 | 0.45 | 0.13 | 0.71 | 0.94 | ||
| per | 2094 | 964 | 0.46 | 0.14 | 0.71 | 0.94 | ||
| pse | 2117 | 997 | 0.47 | 0.15 | 0.72 | 0.94 | ||
| amel | 1881 | 873 | 0.46 | 0.13 | 0.72 | 0.94 | ||
| 2622 | 1181 | 0.45 | 0.17 | 0.71 | 0.92 | |||
| 2138 | 976 | 0.46 | 0.14 | 0.72 | 0.93 | |||
| 246 | 144 | 0.58 | 0.02 | 0.72 | 0.99 |
F-MAP, Ledoit and Wolf (Ledoit), Kuismin and Sillanpää(Kuismin), Graphical Lasso (Glasso).
(*): represents the species with highest impact on the network.
Fig 3Sub-networks for ana.
The graphs represent the interactions among 100 genes. The F-MAP network was constructed by using the information of species pse. The blue and grey nodes indicate the TFs and their target genes, respectively. The red and green lines indicate the false and true edges, respectively.
Fig 4Common edges.
The charts represent the number of true positive edges for each reconstructed network (black column) and the number of common edges (gray column) with the network which has the highest precision for each species in Table 2. The names of species on the horizontal axes indicate the species which its information is used as external hints for F-MAP approach. Ledoit and Kuismin represent the networks reconstructed by Ledoit and Wolf and Kuismin and Sillanpää approaches, respectively. GLASSO indicates the networks constructed by GLASSO.
The average of diagnostic accuracy measures of reconstructed networks for simulated data of pse.
| Main species | approach | Related species | Edges (SD) | True positive (SD) | Precision (SD) | Recall (SD) | Accuracy (SD) | Specificity (SD) |
|---|---|---|---|---|---|---|---|---|
| amel | 990(150) | 447(93) | 0.45(0.03) | 0.06(0.01) | 0.72(0.002) | 0.97(0.003) | ||
| sim | 971(189) | 421(111) | 0.43(0.04) | 0.06(0.02) | 0.72(0.004) | 0.97(0.004) | ||
| per | 1560(139) | 639(91) | 0.41(0.03) | 0.09(0.01) | 0.71(0.003) | 0.95(0.003) | ||
| ana | 1054(151) | 469(92) | 0.44(0.03) | 0.07(0.01) | 0.72(0.002) | 0.97(0.004) | ||
| vir | 1948(102) | 752(54) | 0.38(0.01) | 0.11(0.007) | 0.70(0.002) | 0.93(0.003) | ||
| 1809(371) | 704(170) | 0.39(0.03) | 0.10(0.02) | 0.71(0.005) | 0.94(0.01) | |||
| 1920(320) | 614(130) | 0.32(0.02) | 0.09(0.02) | 0.70(0.003) | 0.93(0.01) | |||
| 373(160) | 134(64) | 0.35(0.05) | 0.02(0.009) | 0.72(0.002) | 0.98(0.005) |
Simulated data generated via 100 times sampling with replacement from pse data. SD is standard deviation of measures in 100 simulated datasets.