PNAD-CSS: a workbench for constructing a protein name abbreviation dictionary.

M Yoshida, K Fukuda, T Takagi.

Abstract

MOTIVATION: Since their initial development, the integration and construction of databases for molecular-level data have progressed. Although biological molecules are related to each other and form a complex system, the information about them is scattered across the vast archives of the literature and diverse databases. There is no unified naming convention for biological objects, and biological terms may be ambiguous or polysemic. This makes the integration and interoperation of databases difficult. Machine-readable natural language resources appear to be quite promising for addressing these problems. We have developed a workbench for building a protein name abbreviation dictionary (PNAD).
RESULTS: We have developed the PNAD Construction Support System (PNAD-CSS), which offers facilities that reduce the cost of constructing a protein name abbreviation dictionary whose entries are collected from the abstracts of biomedical papers. The system lets users concentrate on higher-level interpretation by taking over troublesome tasks such as managing abstracts and extracting protein names and their abbreviations. To extract pairs of protein names and abbreviations, we have developed a hybrid system composed of the PROPER System and the PNAD System. The PNAD System can extract the pairs from parenthetical paraphrases involving protein names, and the PROPER System identified these pairs with 98.95% precision, 95.56% recall and 97.58% complete precision.
AVAILABILITY: The PROPER System is freely available from http://www.hgc.inc.u-tokyo.ac.jp/service/tooldoc/KeX/intro.html. The other software is also available on request from the authors.
CONTACT: mikio@ims.u-tokyo.ac.jp
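
The abstract does not describe how the PNAD System processes parenthetical paraphrases, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a simple initial-letter heuristic: a short form in parentheses is paired with the preceding words when their first letters spell it out. The names `SHORT_FORM` and `extract_pairs` are hypothetical and introduced purely for illustration.

```python
import re

# Assumption: a short form is a single alphanumeric token of 2-10 characters
# enclosed in parentheses, e.g. "(EGF)". Multi-word parentheticals are ignored.
SHORT_FORM = re.compile(r"\(([A-Za-z][A-Za-z0-9]{1,9})\)")


def extract_pairs(text):
    """Return (long form, abbreviation) pairs found by an initial-letter match."""
    pairs = []
    for match in SHORT_FORM.finditer(text):
        abbr = match.group(1)
        # Words that precede the opening parenthesis.
        words = re.findall(r"[A-Za-z0-9]+", text[:match.start()])
        # Candidate long form: as many preceding words as the abbreviation has letters.
        candidate = words[-len(abbr):]
        initials = "".join(word[0] for word in candidate)
        if len(candidate) == len(abbr) and initials.lower() == abbr.lower():
            pairs.append((" ".join(candidate), abbr))
    return pairs


if __name__ == "__main__":
    sentence = ("Epidermal growth factor (EGF) and tumor necrosis factor (TNF) "
                "were measured (see Methods).")
    for long_form, abbr in extract_pairs(sentence):
        print(f"{abbr}\t{long_form}")
```

A heuristic this simple misses abbreviations that skip words or take letters from inside a word, which is one reason a real system would likely combine it with a dedicated protein name recognizer, as the hybrid design in the abstract suggests.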

Year:  2000        PMID: 10842739     DOI: 10.1093/bioinformatics/16.2.169

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

