Literature DB >> 15360821

Text mining functional keywords associated with genes.

Ying Liu1, Martin Brandon, Shamkant Navathe, Ray Dingledine, Brian J Ciliax.   

Abstract

Modern experimental techniques provide the ability to gather vast amounts of biological data in a single experiment (e.g. DNA microarray experiment), making it extremely difficult for the researcher to interpret the data and form conclusions about the functions of the genes. Current approaches provide useful information that organizes or relates genes, but a major shortcoming is they either do not address specific functions of the genes or are constrained by functions predefined in other databases, which can be biased, incomplete, or out-of-date. We extended Andrade and Valencia's method [1] to statistically mine functional keywords associated with genes from MEDLINE abstracts. The MEDLINE abstracts are analyzed statistically to score and rank keywords for each gene using a background set of words for baseline frequencies. We generally got very good functional keyword information about the genes we tested, which was confirmed by searching for the individual keywords in context. The keywords extracted by our algorithm reveal a wealth of potential functional concepts, which were not represented in existing public databases. We feel that this approach is general enough to apply to medical and biological literature to find other relationships: drugs vs. genes, risk-factors vs. genes, etc.

Entities:  

Mesh:

Year:  2004        PMID: 15360821

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  6 in total

1.  Fine-grained indexing of the biomedical literature: MeSH subheading attachment for a MEDLINE indexing tool.

Authors:  Aurélie Névéol; Sonya E Shooshan; James G Mork; Alan R Aronson
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

2.  eFIP: a tool for mining functional impact of phosphorylation from literature.

Authors:  Cecilia N Arighi; Amy Y Siu; Catalina O Tudor; Jules A Nchoutmboube; Cathy H Wu; Vijay K Shanker
Journal:  Methods Mol Biol       Date:  2011

3.  eGIFT: mining gene information from the literature.

Authors:  Catalina O Tudor; Carl J Schmidt; K Vijay-Shanker
Journal:  BMC Bioinformatics       Date:  2010-08-09       Impact factor: 3.169

4.  A recent advance in the automatic indexing of the biomedical literature.

Authors:  Aurélie Névéol; Sonya E Shooshan; Susanne M Humphrey; James G Mork; Alan R Aronson
Journal:  J Biomed Inform       Date:  2008-12-30       Impact factor: 6.317

5.  Click-words: learning to predict document keywords from a user perspective.

Authors:  Rezarta Islamaj Doğan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2010-09-01       Impact factor: 6.937

6.  A combined approach to data mining of textual and structured data to identify cancer-related targets.

Authors:  Pavel Pospisil; Lakshmanan K Iyer; S James Adelstein; Amin I Kassis
Journal:  BMC Bioinformatics       Date:  2006-07-20       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.