Satya S Sahoo1, Joshua Valdez2, Michael Rueschman2, Matthew Kim2. 1. Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH, USA. 2. Department of Medicine, Brigham and Women's Hospital and Beth Israel Deaconess Medical Center, Harvard Medical School, Harvard University, Boston, MA, USA.
Abstract
OBJECTIVE: To characterize the scientific reproducibility of biomedical research studies by query and analysis of semantic provenance graphs generated from provenance metadata terms extracted from PubMed articles. METHODS: We develop a new semantic provenance graph generation algorithm that uses a provenance ontology developed as part of the Provenance for Clinical and Health Research (ProvCaRe) project. The ProvCaRe project has processed and extracted provenance metadata from more than 1.6 million full text articles from the PubMed database. RESULTS: The semantic provenance graph generation algorithm is evaluated using provenance terms extracted from 75 selected articles describing sleep medicine research studies. In addition, we use eight provenance queries to evaluate the quality of semantic provenance graphs generated by the new algorithm. CONCLUSION: The ProvCaRe project has created a unique resource to characterize the reproducibility of biomedical research studies and the semantic provenance graph generation algorithm enables users to effectively query and analyze the provenance metadata in the ProvCaRe knowledge repository.
OBJECTIVE: To characterize the scientific reproducibility of biomedical research studies by query and analysis of semantic provenance graphs generated from provenance metadata terms extracted from PubMed articles. METHODS: We develop a new semantic provenance graph generation algorithm that uses a provenance ontology developed as part of the Provenance for Clinical and Health Research (ProvCaRe) project. The ProvCaRe project has processed and extracted provenance metadata from more than 1.6 million full text articles from the PubMed database. RESULTS: The semantic provenance graph generation algorithm is evaluated using provenance terms extracted from 75 selected articles describing sleep medicine research studies. In addition, we use eight provenance queries to evaluate the quality of semantic provenance graphs generated by the new algorithm. CONCLUSION: The ProvCaRe project has created a unique resource to characterize the reproducibility of biomedical research studies and the semantic provenance graph generation algorithm enables users to effectively query and analyze the provenance metadata in the ProvCaRe knowledge repository.
Entities:
Keywords:
Biomedical Ontologies; Health Information Systems; Knowledge Bases
Authors: Ida Sim; Samson W Tu; Simona Carini; Harold P Lehmann; Brad H Pollock; Mor Peleg; Knut M Wittkowski Journal: J Biomed Inform Date: 2013-11-13 Impact factor: 6.317
Authors: Dennis A Dean; Ary L Goldberger; Remo Mueller; Matthew Kim; Michael Rueschman; Daniel Mobley; Satya S Sahoo; Catherine P Jayapandian; Licong Cui; Michael G Morrical; Susan Surovec; Guo-Qiang Zhang; Susan Redline Journal: Sleep Date: 2016-05-01 Impact factor: 5.849
Authors: Story C Landis; Susan G Amara; Khusru Asadullah; Chris P Austin; Robi Blumenstein; Eileen W Bradley; Ronald G Crystal; Robert B Darnell; Robert J Ferrante; Howard Fillit; Robert Finkelstein; Marc Fisher; Howard E Gendelman; Robert M Golub; John L Goudreau; Robert A Gross; Amelie K Gubitz; Sharon E Hesterlee; David W Howells; John Huguenard; Katrina Kelner; Walter Koroshetz; Dimitri Krainc; Stanley E Lazic; Michael S Levine; Malcolm R Macleod; John M McCall; Richard T Moxley; Kalyani Narasimhan; Linda J Noble; Steve Perrin; John D Porter; Oswald Steward; Ellis Unger; Ursula Utz; Shai D Silberberg Journal: Nature Date: 2012-10-11 Impact factor: 49.962
Authors: George T O'Connor; Brian Caffo; Anne B Newman; Stuart F Quan; David M Rapoport; Susan Redline; Helaine E Resnick; Jonathan Samet; Eyal Shahar Journal: Am J Respir Crit Care Med Date: 2009-03-05 Impact factor: 21.405