Protein structures and information extraction from biological texts: the PASTA system.

R Gaizauskas, G Demetriou, P J Artymiuk, P Willett.

Abstract

MOTIVATION: The rapid increase in volume of protein structure literature means useful information may be hidden or lost in the published literature and the process of finding relevant material, sometimes the rate-determining factor in new research, may be arduous and slow.
RESULTS: We describe the Protein Active Site Template Acquisition (PASTA) system, which addresses these problems by performing automatic extraction of information relating to the roles of specific amino acid residues in protein molecules from online scientific articles and abstracts. Both the terminology recognition and extraction capabilities of the system have been extensively evaluated against manually annotated data and the results compare favourably with state-of-the-art results obtained in less challenging domains. PASTA is the first information extraction (IE) system developed for the protein structure domain and one of the most thoroughly evaluated IE systems operating on biological scientific text to date.
AVAILABILITY: PASTA makes its extraction results available via a browser-based front end: http://www.dcs.shef.ac.uk/nlp/pasta/. The evaluation resources (manually annotated corpora) are also available through the website: http://www.dcs.shef.ac.uk/nlp/pasta/results.html.

Year:  2003        PMID: 12499303     DOI: 10.1093/bioinformatics/19.1.135

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  Semantic relations asserting the etiology of genetic diseases.

Authors:  Thomas C Rindflesch; Bisharah Libbus; Dimitar Hristovski; Alan R Aronson; Halil Kilicoglu
Journal:  AMIA Annu Symp Proc       Date:  2003

2.  PCfun: a hybrid computational framework for systematic characterization of protein complex function.

Authors:  Varun S Sharma; Andrea Fossati; Rodolfo Ciuffa; Marija Buljan; Evan G Williams; Zhen Chen; Wenguang Shao; Patrick G A Pedrioli; Anthony W Purcell; María Rodríguez Martínez; Jiangning Song; Matteo Manica; Ruedi Aebersold; Chen Li
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  What the papers say: text mining for genomics and systems biology. (Review)

Authors:  Nathan Harmston; Wendy Filsell; Michael P H Stumpf
Journal:  Hum Genomics       Date:  2010-10       Impact factor: 4.639

4.  Text mining improves prediction of protein functional sites.

Authors:  Karin M Verspoor; Judith D Cohn; Komandur E Ravikumar; Michael E Wall
Journal:  PLoS One       Date:  2012-02-29       Impact factor: 3.240

5.  Connecting the dots between PubMed abstracts.

Authors:  M Shahriar Hossain; Joseph Gresock; Yvette Edmonds; Richard Helm; Malcolm Potts; Naren Ramakrishnan
Journal:  PLoS One       Date:  2012-01-03       Impact factor: 3.240

6.  Improving the extraction of complex regulatory events from scientific text by using ontology-based inference.

Authors:  Jung-Jae Kim; Dietrich Rebholz-Schuhmann
Journal:  J Biomed Semantics       Date:  2011-10-06

7.  Applications of natural language processing in biodiversity science.

Authors:  Anne E Thessen; Hong Cui; Dmitry Mozzherin
Journal:  Adv Bioinformatics       Date:  2012-05-22

8.  Argument-predicate distance as a filter for enhancing precision in extracting predications on the genetic etiology of disease.

Authors:  Marco Masseroli; Halil Kilicoglu; François-Michel Lang; Thomas C Rindflesch
Journal:  BMC Bioinformatics       Date:  2006-06-08       Impact factor: 3.169

9.  myGRN: a database and visualisation system for the storage and analysis of developmental genetic regulatory networks.

Authors:  Jamil Bacha; James S Brodie; Matthew W Loose
Journal:  BMC Dev Biol       Date:  2009-06-06       Impact factor: 1.978

10.  Mining clinical relationships from patient narratives.

Authors:  Angus Roberts; Robert Gaizauskas; Mark Hepple; Yikun Guo
Journal:  BMC Bioinformatics       Date:  2008-11-19       Impact factor: 3.169
