| Literature DB >> 16563163 |
Anton Yuryev1, Zufar Mulyukov, Ekaterina Kotelnikova, Sergei Maslov, Sergei Egorov, Alexander Nikitin, Nikolai Daraselia, Ilya Mazo.
Abstract
BACKGROUND: Scientific literature is a source of the most reliable and comprehensive knowledge about molecular interaction networks. Formalization of this knowledge is necessary for computational analysis and is achieved by automatic fact extraction using various text-mining algorithms. Most of these techniques suffer from high false positive rates and redundancy of the extracted information. The extracted facts form a large network with no pathways defined.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16563163 PMCID: PMC1435941 DOI: 10.1186/1471-2105-7-171
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Statistics for the ResNet database as of August 21, 2005 generated by Medscan technology version 1.7 before and after automatic curation.
| Number of proteins with links | 11,000 | 10,845 |
| Number of chemicals with links | 37,904 | 37,904 |
| Number of cell process | 897 | 897 |
| 59,149 | 43,678 | |
| 12,426 | 11,961 | |
| 4,054 | 3,875 | |
| 760,621 | 509,105 | |
| 68,711 | 52,842 | |
| 28,265 | 20,781 | |
| 63,713 | 55,567 | |
| 0 | 16,772 |
Feed-forward and coherent loops statistics. The coherent loops were converted only if the relation between the transcription factor and a target was PromoterBinding. Table shows the number of loops with different relation types between the regulator node and the transcription factor node (Figure 1). Coherent loops are feed-forward loops with coherent regulation effects. Because many loops shared the same Expression relation between regulator and target, the number of Expression relations converted to regulation is smaller than the total number of coherent loops.
| Relation from Regulator to TF | Number of unique | Number of loops | Number of unique | Number of loops |
| 3242 | 4849 | 1109 | 1345 | |
| 8911 | 13468 | 3638 | 4600 | |
| 231 | 233 | 95 | 96 | |
| 591 | 767 | 124 | 133 | |
| 10592 | 17246 | 4359 | 5646 | |
| 26689 | 40588 | 7411 | 12023 | |
Figure 2Distribution of node overlaps between manually curated pathway and equivalent regulome pathway with the same ligand-receptor pair. X axis – every point represents a pair of manually curated and automatically built pathways, Y axis – number of nodes in common between two pathways.
Figure 4Automatically built pathway. Nodes and links in common with manually curated IL-1 pathway shown on Figure 3 are highlighted in blue. Note that the set of proteins unique to automatically built pathway represents a classical MAP kinase cascade. It has been suggested only recently that the IL-1 receptor appears to activate a MAP kinase cascade by interaction with other members of the Toll-like receptor superfamily [17]. Obviously, older review articles used for construction of the manually curated IL-1 pathway did not mention this information. For graph legend see figure 3.
Figure 5Distribution of number of Regulome pathways build for 79 tissues based on the tissues gene expression profile. The tissue-specific pathway construction is described in the Materials and Methods section.
Figure 6Increase in the number of pathways predicted for tissue-specific ligands in different tissue types. The number of pathways is normalized to the number of tissues in every tissue type. Twenty-one immune tissues names listed in [16]: Appendix, BM-CD105+Endothelial, BM-CD33+Myeloid, BM-CD34+, BM-CD71+Early Erythroid, Bone marrow, Burkitts-Daudi lymphoma, Burkitts-Raji lymphoma, Leukemia chronic myelogenous(k562), Leukemia lymphoblastic(molt4), Leukemia promyelocytic(hl60), Lymphnode, Lymphoblasts, PB-BDCA4+Dentritic, PB-CD14+Monocytes, PB-CD19+Bcells, PB-CD4+Tcells, PB-CD56+NKCells, PB-CD8+Tcells, Thymus, Whole blood. Twenty-three CNS tissues names listed in [16]: Amygdala, Caudate nucleus, Cerebellum, Cerebellum peduncles, Ciliary ganglion, Cingulate cortex, DRG, Fetal brain, Globus pallidus, Hypothalamus, Medulla oblongata, Occipital lobe, Olfactory bulb, Parietal lobe, Pituitary, Pons, Prefrontal cortex, Subthalamic nucleus, Superior Cervical Ganglion, Temporal lobe, Thalamus, Trigeminal ganglion, Whole Brain. 8 CNS specific ligands from ResNet database: BDNF (brain derived neurotrophic factor), CNTF (ciliary neurotrophic factor), GDNF (glial cell line derived neurotrophic factor), GPI autocrine motility factor, leptin, NGFB (nerve growth factor beta), NPTX1 (neuronal pentraxin I), NTF3 (neurotrophin 3). List of 36 immunological ligands is not shown.
Figure 1Examples of the coherent loops used for automatic ResNet curation. Node TF represents a transcription factor. Green arrows – PromoterBinding relations, blue arrow – Expression relation, grey arrow – Regulation, Brown arrow – MolSynthesis. The coherent loop conversion rule converts the expression link from Regulator node to Target node into Regulation link. Arrow with "+" indicate positive regulation and negative regulation is shown as "---|". A. Coherent loop with all relation having positive effect. Regulator->TF: positive; TF->Target: positive; Regulator->Target: positive; B. Coherent loop with negative effects. Regulator->TF: positive; TF->Target: negative. Regulator->Target: negative. Other two types of coherent loops have following configurations: Regulator->TF: negative; TF->Target: positive; Regulator->Target: negative. Regulator->TF: negative; TF->Target: negative; Regulator->Target: positive.