Sergey Kozhenkov (1), Michael Baitaluk. (1) San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA.
Abstract
MOTIVATION: Pathway diagrams from PubMed and the World Wide Web (WWW) contain valuable, highly curated information that is difficult to reach without tools specifically designed and customized for the biological semantics and high content density of the images. There is currently no search engine or tool that can analyze pathway images, extract their pathway components (molecules, genes, proteins, organelles, cells, organs, etc.) and indicate their relationships.
RESULTS: Here, we describe a resource of pathway diagrams retrieved from article and web-page images through optical character recognition, in conjunction with data mining and data integration methods. The recognized pathways are integrated into the BiologicalNetworks research environment, which links them to the wealth of data available in the BiologicalNetworks knowledgebase; the knowledgebase integrates data from >100 public data sources and the biomedical literature. Multiple search and analytical tools allow the recognized cellular pathways, molecular networks and cell/tissue/organ diagrams to be studied in the context of integrated knowledge, experimental data and the literature.
AVAILABILITY: BiologicalNetworks software and the pathway repository are freely available at www.biologicalnetworks.org.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.