MOTIVATION: The rate at which gene-related findings appear in the scientific literature makes it difficult if not impossible for biomedical scientists to keep fully informed and up to date. The importance of these findings argues for the development of automated methods that can find, extract and summarize this information. This article reports on methods for determining the molecular function claims that are being made in a scientific article, specifically those that are backed by experimental evidence. RESULTS: The most significant result is that for molecular function claims based on direct assays, our methods achieved recall of 70.7% and precision of 65.7%. Furthermore, our methods correctly identified in the text 44.6% of the specific molecular function claims backed up by direct assays, but with a precision of only 0.92%, a disappointing outcome that led to an examination of the different kinds of errors. These results were based on an analysis of 1823 articles from the literature of Saccharomyces cerevisiae (budding yeast). AVAILABILITY: The annotation files for S.cerevisiae are available from ftp://genome-ftp.stanford.edu/pub/yeast/data_download/literature_curation/gene_association.sgd.gz. The draft protocol vocabulary is available by request from the first author.
MOTIVATION: The rate at which gene-related findings appear in the scientific literature makes it difficult if not impossible for biomedical scientists to keep fully informed and up to date. The importance of these findings argues for the development of automated methods that can find, extract and summarize this information. This article reports on methods for determining the molecular function claims that are being made in a scientific article, specifically those that are backed by experimental evidence. RESULTS: The most significant result is that for molecular function claims based on direct assays, our methods achieved recall of 70.7% and precision of 65.7%. Furthermore, our methods correctly identified in the text 44.6% of the specific molecular function claims backed up by direct assays, but with a precision of only 0.92%, a disappointing outcome that led to an examination of the different kinds of errors. These results were based on an analysis of 1823 articles from the literature of Saccharomyces cerevisiae (budding yeast). AVAILABILITY: The annotation files for S.cerevisiae are available from ftp://genome-ftp.stanford.edu/pub/yeast/data_download/literature_curation/gene_association.sgd.gz. The draft protocol vocabulary is available by request from the first author.
Authors: Patricia L Whetzel; Ryan R Brinkman; Helen C Causton; Liju Fan; Dawn Field; Jennifer Fostel; Gilberto Fragoso; Tanya Gray; Mervi Heiskanen; Tina Hernandez-Boussard; Norman Morrison; Helen Parkinson; Philippe Rocca-Serra; Susanna-Assunta Sansone; Daniel Schober; Barry Smith; Robert Stevens; Christian J Stoeckert; Chris Taylor; Joe White; Andrew Wood Journal: OMICS Date: 2006
Authors: Michael Chang; Mohammed Bellaoui; Chaoying Zhang; Ridhdhi Desai; Pavel Morozov; Lissette Delgado-Cruzata; Rodney Rothstein; Greg A Freyer; Charles Boone; Grant W Brown Journal: EMBO J Date: 2005-05-12 Impact factor: 11.598
Authors: Maria C Costanzo; Julie Park; Rama Balakrishnan; J Michael Cherry; Eurie L Hong Journal: Database (Oxford) Date: 2011-03-15 Impact factor: 3.451
Authors: Julie Park; Maria C Costanzo; Rama Balakrishnan; J Michael Cherry; Eurie L Hong Journal: Database (Oxford) Date: 2012-03-20 Impact factor: 3.451
Authors: Elizabeth T Hobbs; Stephen M Goralski; Ashley Mitchell; Andrew Simpson; Dorjan Leka; Emmanuel Kotey; Matt Sekira; James B Munro; Suvarna Nadendla; Rebecca Jackson; Aitor Gonzalez-Aguirre; Martin Krallinger; Michelle Giglio; Ivan Erill Journal: Front Res Metr Anal Date: 2021-07-13