| Literature DB >> 23046449 |
Abstract
BACKGROUND: Next-Generation Sequencing (NGS) technologies and Genome-Wide Association Studies (GWAS) generate millions of reads and hundreds of datasets, and there is an urgent need for a better way to accurately interpret and distill such large amounts of data. Extensive pathway and network analysis allow for the discovery of highly significant pathways from a set of disease vs. healthy samples in the NGS and GWAS. Knowledge of activation of these processes will lead to elucidation of the complex biological pathways affected by drug treatment, to patient stratification studies of new and existing drug treatments, and to understanding the underlying anti-cancer drug effects. There are approximately 141 biological human pathway resources as of Jan 2012 according to the Pathguide database. However, most currently available resources do not contain disease, drug or organ specificity information such as disease-pathway, drug-pathway, and organ-pathway associations. Systematically integrating pathway, disease, drug and organ specificity together becomes increasingly crucial for understanding the interrelationships between signaling, metabolic and regulatory pathway, drug action, disease susceptibility, and organ specificity from high-throughput omics data (genomics, transcriptomics, proteomics and metabolomics).Entities:
Mesh:
Year: 2012 PMID: 23046449 PMCID: PMC3439721 DOI: 10.1186/1471-2105-13-S15-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Data Integration Process. The whole data integration process was divided into three steps: 1) associations of molecule-pathway, molecule-disease, molecule-drug, molecule-organ; 2) inter-association analysis between pathway, disease, drug and organ; and 3) enrichment analysis and inter-association analysis: 3a) self-validation and 3b) identification of previously undiscovered components by the enrichment analysis.
Current Statistics of Database
| Total Number | Count |
|---|---|
| genes | 22,498 GeneIDs |
| proteins | 25,469 UniProtIDs |
| Pathways | 1956 (BioCarta:310,KEGG:247, NCI-Nature curated:222, Reactome:1177) |
| Molecules in Pathway | 11663 |
| Diseases | 6704(CTD:5892, PharmGKB:812) |
| Molecules in Disease | 17925 |
| Drugs | 5615(DrugBank:4604, PharmGKB:1011) |
| Molecules in Drug | 3735 |
| Organs | 52 |
| Molecules in Organ | 5599 |
A Comparison of Human Pathways in IPAD against Several Common Pathway Data Sources
| IPAD | |||||
|---|---|---|---|---|---|
| Pathway coverage | 310 | 247 | 222 | 1177 | 1956 |
| Molecule coverage | 1372 | 9238 | 2561 | 5668 | 11663 |
| Last Updated | 2010 | Mar 2012 | July 2010 | Jan 2011 | Mar 2012 |
| Curation Type | Manual | Manual | Manual | Manual | Integrated |
| Disease Association | No | Yes | No | No | Yes |
| Drug Association | No | Yes | No | No | Yes |
| Organ Specificity Association | No | No | No | No | Yes |
| Inter-associations Quantitative | No | No | No | No | Yes |
| Enrichment Score Quantitative | No | No | No | No | Yes |
| Similarity | No | No | No | No | Yes |
Figure 2p-value Distribution of Inter-association. The y-axis is the scaled density of p-value which is calculated by Fisher Exact test. (a) p-value Distribution of Inter-association between pathway-pathway, disease-disease, drug-drug, and organ-organ. (b) p-value Distribution of Inter-association between pathway, disease, drug, and organ.
Figure 3Heatmap of Inter-associations between 52 Organs. x-axis and y-axis are both 52 organs. The degrees of redness and blackness in each cell represent increase of association between organs. The legend above the heatmap indicates the range of association between organs. The association between organs is expressed by 1 minus log10 of p-value. It is nonlinear color scale from white to red to black, correspondingly, indicating the value of 1-log10(p-value) scales from 1 to 7.6.
Figure 4Web Interface Structure. a) Query by genes or proteins. For example, UniGene IDs, Entrez gene IDs, gene names, UniProt IDs, UniProt Accessions or IPI IDs are all supported. To enter multiple values, delimit them by comma, semi-colon, line or space. b,c,d,e) search result. In the enrichment analysis table, it shows Pathway ID (Disease ID, Drug ID, Organ ID), Pathway Name (Disease Name, Drug Name, Organ Name), Molecule, significance (AE, RE, N, MJI, p-value), and inter-association. For each enriched component, users can further browse the inter-association between its molecule and pathway, disease, drug and organ by clicking on the link in the column of molecule, and its inter-association between pathway, disease, drug and organ by clicking on the inter-association icon in the last column. f) molecule inter-association. It shows molecule, Gene Symbol, Pathway ID (Disease ID, Drug ID, Organ ID), and Pathway Name (Disease Name, Drug Name, Organ Name). g,h,i,j) inter-association between pathway, disease, drug and organ. It shows associations between Pathway-Pathway, Pathway-Disease, Pathway-Drug, Pathway-Organ, Disease-Pathway, Disease-Disease, Disease-Drug, Disease-Organ, Drug-Pathway, Drug-Disease, Drug-Drug, Drug-Organ, Organ-Pathway, Organ-Disease, Organ-Drug, Organ-Organ, and their significance (AE, RE, MJI, p-value).
Enrichment Analysis of Breast Cancer Related Markers
| PathwayID | PathwayName | N | P-value | AE | RE | MJI |
|---|---|---|---|---|---|---|
| hsa05212 | Pancreatic cancer | 72 | 3.74E-08 | 7 | 75.59 | 0.2819 |
| hsa05213 | Endometrial cancer | 53 | 1.07E-07 | 6 | 88.02 | 0.2566 |
| hsa05215 | Prostate cancer | 100 | 1.07E-07 | 7 | 54.43 | 0.2683 |
| hsa05223 | Non-small cell lung cancer | 60 | 1.60E-07 | 6 | 77.75 | 0.25 |
| hsa05218 | Melanoma | 72 | 3.55E-07 | 6 | 64.79 | 0.2417 |
| hsa05200 | Pathways in cancer | 348 | 1.02E-06 | 9 | 20.11 | 0.3129 |
| hsa05219 | Bladder cancer | 42 | 1.02E-06 | 5 | 92.56 | 0.2262 |
| h_RacCycDPathway | Influence of Ras and Rho proteins on G1 to S Transition | 26 | 6.66E-06 | 4 | 119.62 | 0.2103 |
| hsa05214 | Glioma | 69 | 6.66E-06 | 5 | 56.34 | 0.2029 |
| hsa05220 | Chronic myeloid leukemia | 74 | 7.91E-06 | 5 | 52.54 | 0.2005 |
| hsa05166 | HTLV-I infection | 272 | 1.71E-05 | 7 | 20.01 | 0.2462 |
| hsa05222 | Small cell lung cancer | 90 | 1.71E-05 | 5 | 43.2 | 0.1944 |
| 200124 | E-cadherin signaling in the nascent adherens junction | 39 | 2.10E-05 | 4 | 79.75 | 0.1846 |
| 200141 | FOXM1 transcription factor network | 41 | 2.37E-05 | 4 | 75.86 | 0.1821 |
| 200190 | a6b1 and a6b4 Integrin signaling | 46 | 3.40E-05 | 4 | 67.61 | 0.1768 |
| hsa04110 | Cell cycle | 124 | 5.80E-05 | 5 | 31.35 | 0.1868 |
| hsa05210 | Colorectal cancer | 62 | 8.23E-05 | 4 | 50.16 | 0.1656 |
| 200041 | Signaling events mediated by Hepatocyte Growth Factor Receptor (c-Met) | 80 | 1.67E-05 | 4 | 38.88 | 0.1583 |
| MESH:D002528 | Cerebellar Neoplasms | 332 | 6.34E-07 | 9 | 32.39 | 0.3136 |
| MESH:D020967 | Myotonic Disorders | 278 | 1.87E-05 | 7 | 30.09 | 0.2459 |
| MESH:D042883 | Choledocholithiasis | 157 | 1.87E-05 | 6 | 45.67 | 0.2191 |
| MESH:D002282 | Adenocarcinoma, Bronchiolo-Alveolar | 339 | 3.44E-05 | 7 | 24.68 | 0.2437 |
| MESH:D009134 | Muscular Atrophy, Spinal | 1119 | 3.44E-05 | 11 | 11.75 | 0.3716 |
| MESH:D016510 | Corneal Neovascularization | 669 | 3.44E-05 | 9 | 16.08 | 0.3067 |
| MESH:D044483 | Intestinal Polyposis | 120 | 4.42E-05 | 5 | 49.79 | 0.1875 |
| PA443756 | Colonic Neoplasms | 122 | 4.42E-05 | 5 | 48.98 | 0.1872 |
| PA445062 | Neoplasms | 237 | 4.42E-05 | 6 | 30.25 | 0.2127 |
| MESH:D007972 | Leukoplakia, Oral | 238 | 4.42E-05 | 6 | 30.13 | 0.2126 |
| MESH:D003123 | Colorectal Neoplasms, Hereditary Nonpolyposis | 126 | 4.42E-05 | 5 | 47.42 | 0.1865 |
| MESH:D046152 | Gastrointestinal Stromal Tumors | 148 | 8.01E-05 | 5 | 40.37 | 0.1836 |
| PA451581 | tamoxifen | 74 | 1.83E-03 | 5 | 18.03 | 0.2124 |
| PA131301952 | gefitinib | 39 | 1.83E-03 | 4 | 27.36 | 0.1941 |
| PA152241907 | lapatinib | 14 | 1.83E-03 | 3 | 57.17 | 0.2143 |
| PA449383 | docetaxel | 77 | 9.81E-03 | 4 | 13.86 | 0.1688 |
| PA449509 | estrogens | 79 | 9.89E-03 | 4 | 13.51 | 0.1682 |
| larynx | larynx | 88 | 1.97E-2 | 2 | 25.45 | 0.2114 |
Figure 5Assessment of Different Associations. The bar plot shows sensitivity, specificity, prediction, accuracy, and F_measure of pathway-drug, drug-drug, pathway-disease, organ-disease, drug-disease, disease-disease, and all associations as a whole.
Enrichment Analysis of Brain-Specific Markers
| Rank | PathwayID | PathwayName | N | Pvalue | AE | RE | MJI |
|---|---|---|---|---|---|---|---|
| 1 | 112315 | Transmission across Chemical Synapses | 190 | 0 | 27 | 17.26 | 0.2117 |
| 2 | 112316 | Neuronal System | 283 | 0 | 32 | 13.74 | 0.2232 |
| 3 | hsa04723 | Retrograde endocannabinoid signaling | 116 | 4.68E-14 | 18 | 18.85 | 0.1713 |
| 4 | 112314 | Neurotransmitter Receptor Binding And Downstream Transmission In The Postsynaptic Cell | 136 | 4.48E-13 | 18 | 16.08 | 0.1599 |
| 5 | hsa04727 | GABAergic synapse | 98 | 5.48E-13 | 16 | 19.84 | 0.165 |
| 6 | hsa04080 | Neuroactive ligand-receptor interaction | 401 | 4.42E-12 | 26 | 7.88 | 0.1678 |
| 7 | 977441 | GABA A receptor activation | 12 | 7.38E-09 | 7 | 70.87 | 0.3281 |
| 8 | 975298 | Ligand-gated ion channel transport | 25 | 1.41E-08 | 8 | 38.88 | 0.2017 |
| 9 | 977443 | GABA receptor activation | 53 | 1.30E-07 | 9 | 20.63 | 0.1318 |
| 10 | hsa04724 | Glutamatergic synapse | 134 | 2.62E-07 | 12 | 10.88 | 0.1073 |
| 11 | 983712 | Ion channel transport | 61 | 3.20E-07 | 9 | 17.92 | 0.1206 |
| 12 | 420499 | Class C/3 (Metabotropic glutamate/pheromone receptors) | 15 | 5.19E-07 | 6 | 48.6 | 0.2313 |
| 13 | 888590 | GABA synthesis, release, reuptake and degradation | 19 | 1.52E-06 | 6 | 38.37 | 0.1891 |
| 14 | 399719 | Trafficking of AMPA receptors | 30 | 1.33E-05 | 6 | 24.3 | 0.1313 |
| 15 | 399721 | Glutamate Binding, Activation of AMPA Receptors and Synaptic Plasticity | 30 | 1.33E-05 | 6 | 24.3 | 0.1313 |
| 16 | 112310 | Neurotransmitter Release Cycle | 36 | 3.03E-05 | 6 | 20.25 | 0.1146 |
| 1 | MESH:D001764 | Blepharospasm | 699 | 0 | 45 | 5.52 | 0.1398 |
| 2 | MESH:D012563 | Schizophrenia, Paranoid | 649 | 3.18E-12 | 40 | 5.29 | 0.1265 |
| 3 | MESH:D002385 | Cataplexy | 723 | 1.33E-11 | 41 | 4.86 | 0.1264 |
| 4 | MESH:D020187 | REM Sleep Behavior Disorder | 506 | 1.44E-11 | 34 | 5.76 | 0.1149 |
| 5 | MESH:D020821 | Dystonic Disorders | 837 | 4.52E-10 | 41 | 4.2 | 0.1226 |
| 6 | MESH:D015877 | Miosis | 1000 | 1.94E-09 | 44 | 3.77 | 0.1273 |
| 7 | MESH:D001925 | Brain Damage, Chronic | 1732 | 1.20E-08 | 59 | 2.92 | 0.1582 |
| 8 | MESH:D000341 | Affective Disorders, Psychotic | 700 | 1.50E-08 | 34 | 4.17 | 0.1056 |
| 9 | MESH:D007415 | Intestinal Obstruction | 1293 | 2.33E-08 | 48 | 3.18 | 0.1334 |
| 10 | MESH:D011681 | Pupil Disorders | 1612 | 2.61E-08 | 55 | 2.93 | 0.1486 |
| 1 | DB01595 | Nitrazepam | 20 | 2.92E-08 | 10 | 29.64 | 0.3294 |
| 2 | DB00349 | Clobazam | 19 | 4.26E-08 | 9 | 28.08 | 0.3083 |
| 3 | DB00475 | Chlordiazepoxide | 19 | 4.26E-08 | 9 | 28.08 | 0.3083 |
| 4 | DB00683 | Midazolam | 19 | 4.26E-08 | 9 | 28.08 | 0.3083 |
| 5 | DB00690 | Flurazepam | 19 | 4.26E-08 | 9 | 28.08 | 0.3083 |
| 6 | DB00842 | Oxazepam | 19 | 4.26E-08 | 9 | 28.08 | 0.3083 |
| 7 | DB01558 | Bromazepam | 19 | 4.26E-08 | 9 | 28.08 | 0.3083 |
| 1 | brain | brain | 363 | 0 | 260 | 15.42 | 0.8581 |
Figure 6A Circular View of the Inter-association Analysis of 369 Brain-Specific Markers. The text colors for the four components: pathway, disease, drug, and organ are palegreen, Chocolate, royalblue, and magenta, respectively. Links with variable thickness represent the extent of association between two components which is 1 minus log10 of p-value. The direction of association (A->B) is represented by a ribbon's end touching A and its other end not touching B.
Identification of Previously Undiscovered Components by IPAD
| PathwayID | PathwayName | p-value | AE | RE | MJI | C |
|---|---|---|---|---|---|---|
| 1430728 | Metabolism | 4.32E-35 | 525 | 2.19 | 0.34 | 93 |
| 556833 | Metabolism of lipids and lipoproteins | 1.00E-13 | 175 | 2.25 | 0.27 | 91 |
| 453279 | Mitotic G1-G1/S phases | 2.41E-42 | 47 | 27.44 | 0.33 | 87 |
| 200137 | AP-1 transcription factor network | 2.36E-06 | 44 | 2.97 | 0.32 | 87 |
| 453279 | Mitotic G1-G1/S phases | 2.77E-09 | 78 | 2.73 | 0.30 | 87 |
| 200120 | Direct p53 effectors | 6.09E-07 | 70 | 2.4 | 0.27 | 85 |
| 69278 | Cell Cycle, Mitotic | 1.81E-14 | 166 | 2.37 | 0.28 | 82 |
| 1640170 | Cell Cycle | 1.51E-12 | 183 | 2.11 | 0.26 | 80 |
| 535734 | Fatty acid, triacylglycerol, and ketone body metabolism | 1.38E-07 | 81 | 2.35 | 0.26 | 79 |
| 71291 | Metabolism of amino acids and derivatives | 7.54E-15 | 120 | 2.9 | 0.33 | 76 |
| DiseaseID | DiseaseName | p-value | AE | RE | MJI | C |
| MESH:D015228 | Hypertriglyceridemia | 1.60E-192 | 4573 | 2.09 | 0.77 | 79 |
| MESH:D009468 | Neuromuscular Diseases | 4.34E-07 | 117 | 2.04 | 0.41 | 74 |
| MESH:D009468 | Neuromuscular Diseases | 2.16E-293 | 4573 | 2.52 | 0.82 | 74 |
| MESH:D052016 | Mucositis | 4.99E-199 | 4572 | 2.11 | 0.77 | 72 |
| MESH:D002543 | Cerebral Hemorrhage | 7.21E-128 | 4572 | 1.8 | 0.73 | 72 |
| MESH:D006463 | Hemolytic-Uremic Syndrome | 3.65E-215 | 4570 | 2.18 | 0.78 | 72 |
| MESH:D020246 | Venous Thrombosis | 6.59E-106 | 4573 | 1.71 | 0.72 | 70 |
| MESH:D013923 | Thromboembolism | 1.78E-123 | 4573 | 1.79 | 0.73 | 70 |
| DrugID | DrugName | p-value | AE | RE | MJI | C |
| PA449383 | Docetaxel | 3.08E-02 | 52 | 1.89 | 0.36 | 24 |
| PA449780 | Glutathione | 1.74E-10 | 20 | 10.48 | 0.24 | 13 |
| PA131301952 | Gefitinib | 8.52E-17 | 29.88 | 10.84 | 0.395 | 8 |
| PA451283 | Rosiglitazone | 7.73E-22 | 41.4 | 10.58 | 0.386 | 5 |
| PA448803 | Carboplatin | 2.57E-15 | 27 | 11.36 | 0.397 | 5 |
| OrganID | OrganName | p-value | AE | RE | MJI | C |
| liver | Liver | 1.82E-17 | 179 | 2.72 | 0.40 | 84 |
Thresholds for Inter-association Analysis in IPAD
| Typea | Typeb | p-value≤ | AE≥ | RE≥ | MJI≥ |
|---|---|---|---|---|---|
| pathway | Pathway | 2.13E-19 | 9.000 | 3.131 | 0.328 |
| Pathway | Disease | 1.28E-06 | 3.000 | 1.268 | 0.127 |
| Pathway | Disease | 5.73E-04 | 2.168 | 2.133 | 0.193 |
| Pathway | Organ | 5.00E-02 | 1.132 | 1.970 | 0.109 |
| Disease | Pathway | 1.02E-05 | 3.000 | 1.254 | 0.140 |
| Disease | Disease | 4.19E-72 | 73.538 | 1.370 | 0.393 |
| Disease | Drug | 5.00E-02 | 2.000 | 1.422 | 0.171 |
| Disease | Organ | 5.00E-02 | 1.000 | 1.313 | 0.121 |
| Drug | Pathway | 1.60E-05 | 2.666 | 2.468 | 0.141 |
| Drug | Disease | 6.51E-03 | 1.000 | 1.358 | 0.133 |
| Drug | Drug | 1.59E-05 | 3.000 | 3.391 | 0.333 |
| Drug | Organ | 5.00E-02 | 2.678 | 3.856 | 0.201 |
| Organ | Pathway | 4.01E-02 | 1.000 | 2.018 | 0.056 |
| Organ | Disease | 6.27E-03 | 2.000 | 1.384 | 0.085 |
| Organ | Drug | 5.00E-02 | 2.206 | 3.093 | 0.155 |
| Organ | Organ | 5.00E-02 | 7.000 | 4.279 | 0.095 |
A Comparison of the Five Quantile Thresholds
| # Associations In Pathway | #Associations In Disease | #Associations In Drug | #Associations In Organ | #total | F_measure | |
|---|---|---|---|---|---|---|
| Quantile 3% | 111374 | 594647 | 119170 | 3627 | 828818 | 60.75% |
| Quantile 4% | 148455 | 786699 | 124467 | 4471 | 1064092 | 68.58% |
| Quantile 5% | 185474 | 984366 | 130029 | 4471 | 1304340 | 68.56% |
| Quantile 6% | 222175 | 1176166 | 135915 | 4471 | 1538727 | 68.18% |
| Quantile 7% | 259592 | 1367923 | 143947 | 4471 | 1775933 | 66.92% |
A Comparison of the Four Sigma Thresholds
| # Associations In Pathway | #Associations In Disease | #Associations In Drug | #Associations In Organ | #total | F_measure | |
|---|---|---|---|---|---|---|
| 0.5 Sigma | 117535 | 644957 | 58579 | 2222 | 823293 | 60.58% |
| 1 Sigma | 185474 | 984366 | 130029 | 4471 | 1304340 | 68.56% |
| 2 Sigma | 223000 | 1215652 | 156337 | 5329 | 1600318 | 67.78% |
| 3 Sigma | 223000 | 1215652 | 156337 | 5329 | 1600318 | 67.78% |
Figure 7Relational Metadata Model. The datasets derived by the data generation pipeline are filled in gray.