| Literature DB >> 26072489 |
Zongliang Yue1, Madhura M Kshirsagar1, Thanh Nguyen1, Chayaporn Suphavilai1, Michael T Neylon1, Liugen Zhu1, Timothy Ratliff1, Jake Y Chen2.
Abstract
In this article, we described a new database framework to perform integrative "gene-set, network, and pathway analysis" (GNPA). In this framework, we integrated heterogeneous data on pathways, annotated list, and gene-sets (PAGs) into a PAG electronic repository (PAGER). PAGs in the PAGER database are organized into P-type, A-type and G-type PAGs with a three-letter-code standard naming convention. The PAGER database currently compiles 44 313 genes from 5 species including human, 38 663 PAGs, 324 830 gene-gene relationships and two types of 3 174 323 PAG-PAG regulatory relationships-co-membership based and regulatory relationship based. To help users assess each PAG's biological relevance, we developed a cohesion measure called Cohesion Coefficient (CoCo), which is capable of disambiguating between biologically significant PAGs and random PAGs with an area-under-curve performance of 0.98. PAGER database was set up to help users to search and retrieve PAGs from its online web interface. PAGER enable advanced users to build PAG-PAG regulatory networks that provide complementary biological insights not found in gene set analysis or individual gene network analysis. We provide a case study using cancer functional genomics data sets to demonstrate how integrative GNPA help improve network biology data coverage and therefore biological interpretability. The PAGER database can be accessible openly at http://discovery.informatics.iupui.edu/PAGER/.Entities:
Mesh:
Year: 2015 PMID: 26072489 PMCID: PMC4553834 DOI: 10.1093/bioinformatics/btv265
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.PAG ID standard naming convention and three types of PAGs
Fig. 2.Example of calculation of each Cohesion measure
Basic statistics of the PAGER database
| In PAGER DB | In PAGER (Human) | |
|---|---|---|
| 44 313 | 40 476 | |
| 324 830 | 306 066 | |
| Molecular association | 205 185 | 190 226 |
| PPI | 93 713 | 93 713 |
| Gene regulation | 25 932 | 25 932 |
| 38 379 | 35 897 | |
| Singleton ( | 19 772 | 19 772 |
| Regular ( | 18 607 | 16 125 |
| with CoCo scores ( | 14 701 | 12 496 |
| with CoCo score ≥1 | 13 856 | 11 784 |
| | 3 101 499 | 2 230 614 |
| | 72 824 | 65 983 |
| sPAG to mPAG | 7250 | 7022 |
| mPAG to mPAG | 39 253 | 32 966 |
| sPAG to sPAG | 23 842 | 23 842 |
| mPAG to sPAG | 2479 | 2153 |
Fig. 3.Cohesions (CoI, CoT and CoCo) performance: (a) ROC curves, (b) comparison boxplot. CoI+, CoI in the true PAGs; CoI−, CoI in the random PAGs; CoT+, CoT in the true PAGs; CoT−, CoT in the random PAGs
AUC performance comparison between CoI and CoT for small and large PAGs
| CoI | 0.953 | |
| CoT | 0.979 |
The statistic for gene regulatory network and PAG regulatory network for MDSCs gene expression data
| Gene regulatory network | PAG regulatory network | |
|---|---|---|
| Up-regulated ( | 110 | 489 |
| Down-regulated ( | 146 | 483 |
| | 0 | 91 |
| Up-regulated | 0 | 13 |
| Down-regulated | 0 | 78 |
| Gene regulations (gene-gene) | 501 | 0 |
| PAG regulations (mPAG-mPAG) | 0 | 94 |
| PAG regulations (sPAG-mPAG) | 0 | 42 |
Fig. 4.The PAG regulatory network constructed for myeloid-derived suppressor cells gene expression data