Literature DB >> 15033866

Extracting human protein interactions from MEDLINE using a full-sentence parser.

Nikolai Daraselia1, Anton Yuryev, Sergei Egorov, Svetalana Novichkova, Alexander Nikitin, Ilya Mazo.   

Abstract

MOTIVATION: The living cell is a complex machine that depends on the proper functioning of its numerous parts, including proteins. Understanding protein functions and how they modify and regulate each other is the next great challenge for life-sciences researchers. The collective knowledge about protein functions and pathways is scattered throughout numerous publications in scientific journals. Bringing the relevant information together becomes a bottleneck in a research and discovery process. The volume of such information grows exponentially, which renders manual curation impractical. As a viable alternative, automated literature processing tools could be employed to extract and organize biological data into a knowledge base, making it amenable to computational analysis and data mining.
RESULTS: We present MedScan, a completely automated natural language processing-based information extraction system. We have used MedScan to extract 2976 interactions between human proteins from MEDLINE abstracts dated after 1988. The precision of the extracted information was found to be 91%. Comparison with the existing protein interaction databases BIND and DIP revealed that 96% of extracted information is novel. The recall rate of MedScan was found to be 21%. Additional experiments with MedScan suggest that MEDLINE is a unique source of diverse protein function information, which can be extracted in a completely automated way with a reasonably high precision. Further directions of the MedScan technology improvement are discussed. AVAILABILITY: MedScan is available for commercial licensing from Ariadne Genomics, Inc.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15033866     DOI: 10.1093/bioinformatics/btg452

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  66 in total

Review 1.  Chinese medicine pattern diagnosis could lead to innovation in medical sciences.

Authors:  Ai-Ping Lu; Ke-Ji Chen
Journal:  Chin J Integr Med       Date:  2011-11-06       Impact factor: 1.978

2.  Effectiveness of lexico-syntactic pattern matching for ontology enrichment with clinical documents.

Authors:  K Liu; W W Chapman; G Savova; C G Chute; N Sioutos; R S Crowley
Journal:  Methods Inf Med       Date:  2010-11-08       Impact factor: 2.176

3.  Using prior knowledge and rule induction methods to discover molecular markers of prognosis in lung cancer.

Authors:  Lewis Frey; Mary E Edgerton; Douglas H Fisher; Lianhong Tang; Zhihua Chen
Journal:  AMIA Annu Symp Proc       Date:  2005

4.  Quantitative assessment of dictionary-based protein named entity tagging.

Authors:  Hongfang Liu; Zhang-Zhi Hu; Manabu Torii; Cathy Wu; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

Review 5.  The cognitive phenotype of Down syndrome: insights from intracellular network analysis.

Authors:  Avi Ma'ayan; Katheleen Gardiner; Ravi Iyengar
Journal:  NeuroRx       Date:  2006-07

6.  Domain adaption of parsing for operative notes.

Authors:  Yan Wang; Serguei Pakhomov; James O Ryan; Genevieve B Melton
Journal:  J Biomed Inform       Date:  2015-02-07       Impact factor: 6.317

7.  Extraction of protein interaction data: a comparative analysis of methods in use.

Authors:  Hena Jose; Thangavel Vadivukarasi; Jyothi Devakumar
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

8.  HIGH-PRECISION BIOLOGICAL EVENT EXTRACTION: EFFECTS OF SYSTEM AND OF DATA.

Authors:  K Bretonnel Cohen; Karin Verspoor; Helen L Johnson; Chris Roeder; Philip V Ogren; William A Baumgartner; Elizabeth White; Hannah Tipney; Lawrence Hunter
Journal:  Comput Intell       Date:  2011-11       Impact factor: 2.330

9.  PathText: a text mining integrator for biological pathway visualizations.

Authors:  Brian Kemper; Takuya Matsuzaki; Yukiko Matsuoka; Yoshimasa Tsuruoka; Hiroaki Kitano; Sophia Ananiadou; Jun'ichi Tsujii
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

10.  Biomedical discovery acceleration, with applications to craniofacial development.

Authors:  Sonia M Leach; Hannah Tipney; Weiguo Feng; William A Baumgartner; Priyanka Kasliwal; Ronald P Schuyler; Trevor Williams; Richard A Spritz; Lawrence Hunter
Journal:  PLoS Comput Biol       Date:  2009-03-27       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.