| Literature DB >> 24917541 |
Toshihide Ono, Satoru Kuhara1.
Abstract
BACKGROUND: Understanding the molecular mechanisms involved in disease is critical for the development of more effective and individualized strategies for prevention and treatment. The amount of disease-related literature, including new genetic information on the molecular mechanisms of disease, is rapidly increasing. Extracting beneficial information from literature can be facilitated by computational methods such as the knowledge-discovery approach. Several methods for mining gene-disease relationships using computational methods have been developed, however, there has been a lack of research evaluating specific disease candidate genes.Entities:
Mesh:
Year: 2014 PMID: 24917541 PMCID: PMC4068192 DOI: 10.1186/1471-2105-15-179
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Lists of a) gene ontology terms and b) mammalian phenotype terms that were used to create pain gene set 1
| a) | |
| GO:0019233 | Sensory perception of pain |
| GO:0050968 | Detection of chemical stimulus involved in sensory perception of pain |
| GO:0050967 | Detection of electrical stimulus involved in sensory perception of pain |
| GO:0050966 | Detection of mechanical stimulus involved in sensory perception of pain |
| GO:0050965 | Detection of temperature stimulus involved in sensory perception of pain |
| GO:0051930 | Regulation of sensory perception of pain |
| GO:0044465 | Modulation of sensory perception of pain in another organism |
| GO:0019234 | Sensory perception of fast pain |
| GO:0019235 | Sensory perception of slow pain |
| GO:0048265 | Response to pain |
| GO:0048266 | Behavioral response to pain |
| GO:0061366 | Behavioral response to chemical pain |
| GO:0061367 | Behavioral response to acetic acid induced pain |
| GO:0061368 | Behavioral response to formalin induced pain |
| b) | |
| MP:0001491 | Unresponsive to tactile stimuli |
| MP:0001968 | Abnormal touch/nociception |
| MP:0001970 | Abnormal pain threshold |
| MP:0001973 | Increased thermal nociceptive threshold |
| MP:0001980 | Abnormal chemically-elicited antinociception |
| MP:0001981 | Increased chemically-elicited antinociception |
| MP:0001982 | Decreased chemically-elicited antinociception |
| MP:0002733 | Abnormal thermal nociception |
| MP:0002734 | Abnormal mechanical nociception |
| MP:0002735 | Abnormal chemical nociception |
| MP:0002736 | Abnormal nociception after inflammation |
| MP:0002738 | Hyperresponsive to tactile stimuli |
| MP:0003043 | Hypoalgesia |
| MP:0003177 | Allodynia |
| MP:0003998 | Decreased thermal nociceptive threshold |
| MP:0004270 | Analgesia |
| MP:0005316 | Abnormal response to tactile stimuli |
| MP:0005407 | Hyperalgesia |
| MP:0005498 | Hyporesponsive to tactile stimuli |
| MP:0008531 | Increased chemical nociceptive threshold |
| MP:0008532 | Decreased chemical nociceptive threshold |
Lists of a) gene ontology terms and b) mammalian phenotype terms that were used to create AD gene set 1
| a) | |
| GO:0001540 | Beta-amyloid binding |
| GO:0034205 | Beta-amyloid formation |
| GO:0034231 | Slet amyloid polypeptide processing |
| GO:0042982 | Amyloid precursor protein metabolic process |
| GO:0042983 | Amyloid precursor protein biosynthetic process |
| GO:0042984 | Regulation of amyloid precursor protein biosynthetic process |
| GO:0042987 | Amyloid precursor protein catabolic process |
| GO:0044548 | S100 protein binding |
| GO:0048152 | S100 beta biosynthetic process |
| GO:0048153 | S100 alpha biosynthetic process |
| GO:0048156 | Tau protein binding |
| GO:0050435 | Beta-amyloid metabolic process |
| GO:0097242 | Beta-amyloid clearance |
| GO:1900221 | Regulation of beta-amyloid clearance |
| GO:1902003 | Regulation of beta-amyloid formation |
| GO:1990000 | Amyloid fibril formation |
| b) | |
| MP:0000604 | Amyloidosis |
| MP:0008493 | Alpha-synuclein inclusion body |
| MP:0003214 | Neurofibrillary tangles |
| MP:0004250 | Tau protein deposits |
Figure 1Flowchart of the method for gathering and prioritizing pain candidate genes. We initially created a set of MeSH terms for comprehensive retrieval of disease-related publications (left). Next, specific disease candidate genes were obtained from disease-related publications, which were searched for with a set of disease-related MeSH terms. Finally, the prioritizing score was calculated based on the weighted literature score (right).
Example of pain-related MeSH terms obtained on the basis of the cosine similarity
| Pain | C, F, G | 1.000 | x | x | x | x | x | x | x | x | x | x |
| Pain measurement | E | 0.340 | | x | x | x | x | x | x | x | x | x |
| Nociceptors | A | 0.268 | | | x | x | x | x | x | x | x | x |
| Pain threshold | F, G | 0.259 | | | | x | x | x | x | x | x | x |
| Hyperalgesia | C | 0.225 | | | | | x | x | x | x | x | x |
| Posterior horn cells | A | 0.180 | | | | | | x | x | x | x | x |
| Ganglia, spinal | A | 0.147 | | | | | | | x | x | x | x |
| Injections, spinal | E | 0.147 | | | | | | | | x | x | x |
| Physical stimulation | E | 0.146 | | | | | | | | | x | x |
| Formaldehyde | D | 0.145 | | | | | | | | | | x |
| Recall | 0.669 | 0.692 | 0.693 | 0.693 | 0.703 | 0.700 | 0.668 | 0.667 | 0.661 | 0.650 | ||
Figure 2Comparison of prioritization score performance. A) Pain case study B) AD case study.
Figure 3Summary of the recall of prioritized genes resulting from each set of MeSH terms constructed using various similarity measures. A) Recall was calculated against pain gene set 1. “Pain” is located in the leftmost means the recall of the list of genes constructed by searching for the keyword “pain [Mesh:NoExp]” B) The recall was calculated against AD gene set 1. “Alzheimer Disease” is located in the leftmost means the recall of the list of genes obtained by searching for the keyword “Alzheimer Disease [Mesh:NoExp]”. The red bars indicate the set of MeSH terms with the highest recall.
Figure 4Comparison of pain candidate gene prioritization performance. Precision-recall plots show the performance of our method and of a simple approach based on the ranking of genes according to the number of gene related publications, resulting from the search term “pain” or “pain [MeSH]”.
The top-20 ranked pain candidate genes gathered by our method
| 140.92 | Trpv1 | Transient receptor potential cation channel, subfamily V, member 1 |
| 81.89 | Oprm1 | Opioid receptor, mu 1 |
| 55.04 | Trpa1 | Transient receptor potential cation channel, subfamily A, member 1 |
| 39.56 | Tacr1 | Tachykinin receptor 1 |
| 38.78 | Comt | Catechol-O-methyltransferase |
| 31.04 | P2rx3 | Purinergic receptor P2X, ligand-gated ion channel, 3 |
| 30.96 | Bdnf | Brain-derived neurotrophic factor |
| 27.58 | Scn9a | Sodium channel, voltage-gated, type IX, alpha |
| 27.48 | Ptgs2 | Prostaglandin-endoperoxide synthase 2 |
| 26.33 | Cnr1 | Cannabinoid receptor 1 (brain) |
| 23.67 | Ngf | Nerve growth factor (beta polypeptide) |
| 23.49 | Tnf | Tumor necrosis factor |
| 23.3 | Asic3 | Acid-sensing (proton-gated) ion channel 3 |
| 23.09 | Scn10a | Sodium channel, voltage-gated, type X, alpha subunit |
| 22.97 | Tac1 | Tachykinin, precursor 1 |
| 20.58 | Fos | FBJ osteosarcoma oncogene |
| 19.83 | Gal | Galanin/GMAP prepropeptide |
| 19.72 | Calca | Calcitonin-related polypeptide alpha |
| 19.41 | Il6 | Interleukin 6 |
| 19.06 | Grin1 | Glutamate receptor, ionotropic, N-methyl D-aspartate 1 |
Figure 5Comparison of pain candidate gene prioritization performance with other publicly available tools. Precision-recall plots show the performance of our method and of other publicly available tools. The precision and recall of other publicly available tools were calculated by the number of genes resulting from the use of the keyword “pain” using default parameters.
Summary of the maximum F-measures for pain candidate genes from our method and those from publicly available tools
| Our method | 1101 | 0.61 | 381 |
| Genotator | 892 | 0.23 | 544 |
| Gene prospector | 603 | 0.26 | 278 |
| LEGENDA | 601 | 0.19 | 601 |
| PolySerach | 83 | 0.14 | 83 |
The top-20 ranked AD candidate genes gathered by our method
| 1477.54 | App | Amyloid beta (A4) precursor protein |
| 1469.69 | Apoe | Apolipoprotein E |
| 794.48 | Mapt | Microtubule-associated protein tau |
| 560.37 | Psen1 | Presenilin 1 |
| 150.82 | Bace1 | Beta-site APP cleaving enzyme 1 |
| 105.30 | Psen2 | Presenilin 2 |
| 66.48 | Snca | Synuclein, alpha (non A4 component of amyloid precursor) |
| 62.84 | Gsk3b | Glycogen synthase kinase 3 beta |
| 58.73 | Prnp | Prion protein |
| 52.96 | Bdnf | Brain-derived neurotrophic factor |
| 46.54 | Aplp2 | Amyloid beta (A4) precursor-like protein 2 |
| 45.49 | Serpine2 | Serpin peptidase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), member 2 |
| 43.20 | Mme | Membrane metallo-endopeptidase |
| 39.21 | Ide | Insulin degrading enzyme |
| 38.82 | Ncstn | Nicastrin |
| 37.61 | Cdk5 | Cyclin-dependent kinase 5 |
| 37.08 | Sorl1 | Sortilin-related receptor, LDLR class A repeats-containing |
| 36.96 | Ace | Angiotensin I converting enzyme |
| 35.06 | Apbb1 | Amyloid beta (A4) precursor protein-binding, family B, member 1 (Fe65) |
| 32.46 | Clu | Clusterin |
Figure 6Comparison of AD candidate gene prioritization performance with other publicly available tools. Precision-recall plots show the performance of our method and of other publicly available tools. The precision and recall of other publicly available tools were calculated by the number of genes resulting from the use of the keyword “Alzheimer’s disease” using default parameters.
Summary of the maximum F-measures for AD candidate genes from our method and those from publicly available tools
| Our method | 2810 | 0.24 | 166 |
| Genotator | 2110 | 0.15 | 145 |
| Gene prospector | 1587 | 0.16 | 216 |
| LEGENDA | 1440 | 0.14 | 142 |
| PolySearch | 180 | 0.12 | 101 |