| Literature DB >> 19891799 |
Paul Breimyer1, Nathan Green, Vinay Kumar, Nagiza F Samatova.
Abstract
BACKGROUND: Publication databases in biomedicine (e.g., PubMed, MEDLINE) are growing rapidly in size every year, as are public databases of experimental biological data and annotations derived from the data. Publications often contain evidence that confirm or disprove annotations, such as putative protein functions, however, it is increasingly difficult for biologists to identify and process published evidence due to the volume of papers and the lack of a systematic approach to associate published evidence with experimental data and annotations. Natural Language Processing (NLP) tools can help address the growing divide by providing automatic high-throughput detection of simple terms in publication text. However, NLP tools are not mature enough to identify complex terms, relationships, or events.Entities:
Mesh:
Year: 2009 PMID: 19891799 PMCID: PMC2773920 DOI: 10.1186/1472-6947-9-S1-S5
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Figure 1The database, BioDEAL and publication feedback loop.
Figure 2The framework architecture - dotted lines indicate work-in-progress.
Figure 3The BioDEAL Browser Frontend.
Figure 4An example annotation file in CSV format [8]. The file may contain any number of columns, but must contain Gene and Gene name.
Figure 5A gene card for SO 2426 gene in Shewanella oneidensis MR-1.
Figure 6Search page results showing different genes and the publication IDs.