
Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts.

A M Cohen, W R Hersh, C Dubay, K Spackman.

Abstract

BACKGROUND: Text-mining can assist biomedical researchers in reducing information overload by extracting useful knowledge from large collections of text. We developed a novel text-mining method based on analyzing the network structure created by symbol co-occurrences as a way to extend the capabilities of knowledge extraction. The method was applied to the task of automatic gene and protein name synonym extraction.
RESULTS: Performance was measured on a test set consisting of about 50,000 abstracts from one year of MEDLINE. Synonyms retrieved from curated genomics databases were used as a gold standard. The system obtained a maximum F-score of 22.21% (23.18% precision and 21.36% recall), with high efficiency in the use of seed pairs.
CONCLUSION: The method performs comparably with other studied methods, does not rely on sophisticated named-entity recognition, and requires little initial seed knowledge.
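
The abstract describes the approach only at a high level, so the following is a minimal sketch of the general idea rather than the authors' algorithm. It builds a symbol co-occurrence network from per-abstract symbol lists and scores a candidate pair by how much their network neighborhoods overlap. The Jaccard neighbor-overlap measure, the function names, and the toy data are all assumptions chosen for illustration; the balanced F-score helper assumes the usual harmonic-mean definition, which the abstract does not spell out.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_network(abstracts):
    """Map each symbol to the set of symbols that share an abstract with it."""
    neighbors = defaultdict(set)
    for symbols in abstracts:
        # Each unordered pair of distinct symbols in one abstract is an edge.
        for a, b in combinations(sorted(set(symbols)), 2):
            neighbors[a].add(b)
            neighbors[b].add(a)
    return neighbors


def neighbor_overlap(neighbors, a, b):
    """Jaccard overlap of two symbols' neighbor sets, ignoring each other.

    Illustrative assumption: symbols that keep the same company in the
    network are treated as more likely to be synonyms.
    """
    na = neighbors.get(a, set()) - {b}
    nb = neighbors.get(b, set()) - {a}
    union = na | nb
    return len(na & nb) / len(union) if union else 0.0


def f_score(precision, recall):
    """Balanced F-score: the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy input (hypothetical): one list of extracted symbols per abstract.
toy_abstracts = [
    ["PTEN", "MMAC1", "AKT1"],
    ["PTEN", "AKT1", "TP53"],
    ["MMAC1", "AKT1", "TP53"],
    ["KEGG", "pathway"],
]

network = build_cooccurrence_network(toy_abstracts)
synonym_like = neighbor_overlap(network, "PTEN", "MMAC1")
unrelated = neighbor_overlap(network, "PTEN", "KEGG")
```

In this toy network the pair PTEN and MMAC1 shares every neighbor, while PTEN and KEGG share none, so the first pair scores higher. A real system would still need thresholds, seed pairs, and a gold standard to measure precision and recall, none of which this sketch attempts.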

Year: 2005   PMID: 15847682   PMCID: PMC1090552   DOI: 10.1186/1471-2105-6-103

Source DB: PubMed   Journal: BMC Bioinformatics   ISSN: 1471-2105   Impact factor: 3.169


  14 in total

1.  Characterizing the sublanguage of online breast cancer forums for medications, symptoms, and emotions.

Authors:  Noémie Elhadad; Shaodian Zhang; Patricia Driscoll; Samuel Brody
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

Review 2.  Network integration and graph analysis in mammalian molecular systems biology.

Authors:  A Ma'ayan
Journal:  IET Syst Biol       Date:  2008-09       Impact factor: 1.615

Review 3.  Empirical distributional semantics: methods and biomedical applications.

Authors:  Trevor Cohen; Dominic Widdows
Journal:  J Biomed Inform       Date:  2009-02-14       Impact factor: 6.317

4.  PubMedMiner: Mining and Visualizing MeSH-based Associations in PubMed.

Authors:  Yucan Zhang; Indra Neil Sarkar; Elizabeth S Chen
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

5.  mspecLINE: bridging knowledge of human disease with the proteome.

Authors:  Jeremy Handcock; Eric W Deutsch; John Boyle
Journal:  BMC Med Genomics       Date:  2010-03-10       Impact factor: 3.063

6.  Googling social interactions: web search engine based social network construction.

Authors:  Sang Hoon Lee; Pan-Jun Kim; Yong-Yeol Ahn; Hawoong Jeong
Journal:  PLoS One       Date:  2010-07-21       Impact factor: 3.240

7.  Translating clinical findings into knowledge in drug safety evaluation--drug induced liver injury prediction system (DILIps).

Authors:  Zhichao Liu; Qiang Shi; Don Ding; Reagan Kelly; Hong Fang; Weida Tong
Journal:  PLoS Comput Biol       Date:  2011-12-15       Impact factor: 4.475

8.  Term identification methods for consumer health vocabulary development.

Authors:  Qing T Zeng; Tony Tse; Guy Divita; Alla Keselman; Jon Crowell; Allen C Browne; Sergey Goryachev; Long Ngo
Journal:  J Med Internet Res       Date:  2007-02-28       Impact factor: 5.428

9.  Semi-Supervised Learning to Identify UMLS Semantic Relations.

Authors:  Yuan Luo; Ozlem Uzuner
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2014-04-07

10.  Synonym extraction and abbreviation expansion with ensembles of semantic spaces.

Authors:  Aron Henriksson; Hans Moen; Maria Skeppstedt; Vidas Daudaravičius; Martin Duneld
Journal:  J Biomed Semantics       Date:  2014-02-05
