| Literature DB >> 19208197 |
Peter L Elkin1, Mark S Tuttle, Brett E Trusko, Steven H Brown.
Abstract
BioProspecting is a novel approach that enabled our team to mine data related to genetic markers from the New England Journal of Medicine (NEJM) utilizing SNOMED CT and the Human Gene Onotology (HUGO). The Biomedical Informatics Research Collaborative was able to link genes and disorders using the Multi-threaded Clinical Vocabulary Server (MCVS) and natural language processing engine, whose output creates an ontology-network using the semantic encodings of the literature that is organized by these two terminologies. We identified relationships between (genes or proteins) and (diseases or drugs) as linked by metabolic functions and identified potentially novel functional relationships between, for example, genes and diseases (e.g. Article #1 ([Gene - IL27] = > {Enzyme - Dipeptidyl Carboxypeptidase 1}) and Article #2 ({Enzyme - Dipeptidyl Carboxypeptidase 1} < = [Disorder - Type II DM]) showing a metabolic link between IL27 and Type II DM). In this manuscript we describe our method for developing the database and its content as well as its potential to assist in the discovery of novel markers and drugs.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19208197 PMCID: PMC2646243 DOI: 10.1186/1471-2105-10-S2-S9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Links between genes and disorders through a metabolic function
| GENE [HUGO HGNC Database] – DISORDER (14,293,089) | GENE [HUGO] (2,244) | DISORDER (8,076) |
| GENE [20889005] – DISORER (16,135) | GENE [20889005] (3) | DISORDER (7,728) |
| GENE [67271001] – DISORDER (151,329) | GENE [67271001] (24) | DISORDER (8,064) |
| GENE [8116006] – DISORDER (72,082) | GENE [8116006] (13) | DISORDER (8,059) |
| Total | - | 14,532,635 |
Links between drugs and disorders through a metabolic function
| DRUG-DISORDER (11,312,477) | DRUGS (1,783) | DISORDER (8,076) |
| PROTEIN [41175001] – DISORDER (5,576,445) | PROTEIN [41175001] (890) | DISORDER (8,076) |
| PROTEIN [61321005] – DISORDER (144,741) | PROTEIN [61321005] (24) | DISORDER (8,057) |
| PROTEIN [88878007] – DISORDER (5,694,011) | PROTEIN [88818007] (878) | DISORDER (8,076) |
| Total – 11,415,197 | ||
Links between proteins and drugs through a metabolic function
| PROTEIN [41175001] – DRUG (1,270,905) | PROTEIN [41175001] (890) | DRUG (1,783) |
| PROTEIN [61321005] – DRUG (33,652) | PROTEIN [61321005] (24) | DRUG (1,780) |
| PROTEIN [88878007] – DRUG (1,315,141) | PROTEIN [88818007] (878) | DRUG (1,783) |
| Total – 14,034,895 | ||
Links between genes and drugs through a metabolic function
| GENE [HGNC]-DRUG (3,314,327) | GENE [HUGO] (2,244) | DRUG (1,783) |
| GENE [20889005]-DRUG (3,709) | GENE [20889005] (3) | DRUG (1,749) |
| GENE [67271001]-DRUG (35,274) | GENE [67271001] (24) | DRUG (1,782) |
| GENE [8116006]-DRUG (15,931) | GENE [8116006] (13) | DRUG (1,782) |
| Total – 3,369,241 | ||
Grand Total – 54,664,445
Known relationships identified within NEJM articles
Summary data including novel and non-novel links
| DRUG_DISORDER | (115,602,787) |
| GENE_DISORDER | (51,833,042) |
| GENE_DRUG | (5,831,303) |
| PROTEIN_DISORDER | (213,884,964) |
| PROTEIN_DRUG | |
| Total | |
Figure 1NMR study of the SH3 domain from FYN proto-oncogene tyrosine kinase complexed with the synthetic peptide P2L corresponding to residues 91–104 of the P85 subunit of the PI3-kinase, family of 25 structures. [13]
Figure 2Illustrates the application parsing two journal articles, tying them to a common metabolic function and then "linking" the two articles together.
Figure 3This screen depicts genes with a functional relationship to Type 2 Diabetes Mellitus. Clicking on the Gene searches pubmed with the gene name. Clicking on the disease name searches the Online Mendelian Inheritance of Man (OMIM) for that disease. Clicking on the function brings up the list of functions that link the Gene indicated with that Disorder of Interest (See Figure 4).
Figure 4This screen shows the gene IL27 linked to the Disorder Diabetes Mellitus by metabolic functions such as Dipeptidyl carboxypeptidase I.