Automated extraction of information in molecular biology.

M A Andrade, P Bork.

Abstract

We review data mining techniques in molecular biology, specifically those that extract information from the scientific literature itself. As more of the biological literature is published electronically, there is an opportunity, and even a need, to summarize the literature automatically in a customized way, for example by associating keywords with a topic. These keywords can be extracted from relevant publications. Keyword extraction can be automated and optimized to keep literature pointers up-to-date or to filter relevant information from the literature. To illustrate these points, OMIM (Online Mendelian Inheritance in Man), a database of human inherited diseases, was linked to the literature, and keywords were derived that covered distinct aspects: genetic information on the one hand, and disease-specific protein and phenotypic information on the other. These keywords were used to extract information that is helpful for keeping disease entries up-to-date.
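
The idea of deriving keywords for a topic from its relevant publications can be illustrated with a minimal sketch. It is not the method used in the paper, which the abstract does not specify; it assumes a simple scoring scheme in which a word counts as a keyword when it appears in a larger share of topic-related abstracts than of background abstracts. The function names, the smoothing constants, and the toy sentences are all hypothetical.

```python
from collections import Counter
import math
import re


def tokenize(text):
    # Lowercase the text and keep word-like tokens of two or more characters.
    return re.findall(r"[a-z][a-z0-9-]+", text.lower())


def document_frequency(docs):
    # Count, for each word, the number of documents that contain it.
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    return df


def topic_keywords(topic_docs, background_docs, top_n=5):
    # Score each word by the log-ratio of its smoothed document frequency
    # in the topic set versus the background set (an assumed scoring rule).
    topic_df = document_frequency(topic_docs)
    background_df = document_frequency(background_docs)
    scores = {}
    for word, count in topic_df.items():
        p_topic = (count + 1) / (len(topic_docs) + 2)
        p_background = (background_df[word] + 1) / (len(background_docs) + 2)
        scores[word] = math.log(p_topic / p_background)
    # Highest score first; ties are broken alphabetically for stable output.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]


# Toy data: invented sentences standing in for abstracts linked to one
# disease entry (topic) and for unrelated abstracts (background).
topic_docs = [
    "Mutations in the CFTR gene cause cystic fibrosis.",
    "Cystic fibrosis patients carry CFTR mutations affecting chloride transport.",
]
background_docs = [
    "The gene encodes a kinase involved in cell cycle control.",
    "Patients in the study showed mutations in the kinase gene.",
    "Transport of proteins in the cell depends on the kinase.",
]

print(topic_keywords(topic_docs, background_docs, top_n=3))
```

On this toy input the three top-ranked words are "cftr", "cystic" and "fibrosis", while common words such as "the" and "in" score below zero because they are at least as frequent in the background set. Rerunning such a procedure as new abstracts are linked to an entry is one way the keyword list, and the literature pointers derived from it, could be kept current.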

Year:  2000        PMID: 10878241     DOI: 10.1016/s0014-5793(00)01661-6

Source DB:  PubMed          Journal:  FEBS Lett        ISSN: 0014-5793            Impact factor:   4.124


  26 in total

1.  Predictome: a database of putative functional links between proteins.

Authors:  Joseph C Mellor; Itai Yanai; Karl H Clodfelter; Julian Mintseris; Charles DeLisi
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

2.  Discovering protein similarity using natural language processing.

Authors:  Indra N Sarkar; Thomas C Rindflesch
Journal:  Proc AMIA Symp       Date:  2002

3.  NLP-based information extraction for managing the molecular biology literature.

Authors:  Bisharah Libbus; Thomas C Rindflesch
Journal:  Proc AMIA Symp       Date:  2002

4.  Dragon TF Association Miner: a system for exploring transcription factor associations through text-mining.

Authors:  Hong Pan; Li Zuo; Vidhu Choudhary; Zhuo Zhang; Shoi Houi Leow; Fui Teen Chong; Yingliang Huang; Victor Wui Siong Ong; Bijayalaxmi Mohanty; Sin Lam Tan; S P T Krishnan; Vladimir B Bajic
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

5.  A combined strategy of "in silico" transcriptome analysis and web search engine optimization allows an agile identification of reference genes suitable for normalization in gene expression studies.

Authors:  Primetta Faccioli; Gian Paolo Ciceri; Paolo Provero; Antonio Michele Stanca; Caterina Morcia; Valeria Terzi
Journal:  Plant Mol Biol       Date:  2006-12-02       Impact factor: 4.076

6.  Literature based discovery of gene clusters using phylogenetic methods.

Authors:  Indra Neil Sarkar; Abha Agrawal
Journal:  AMIA Annu Symp Proc       Date:  2006

7.  Dragon Plant Biology Explorer. A text-mining tool for integrating associations between genetic and biochemical entities with genome annotation and biochemical terms lists.

Authors:  Vladimir B Bajic; Merlin Veronika; Pardha Sarathi Veladandi; Archana Meka; Mok-Wei Heng; Kanagasabai Rajaraman; Hong Pan; Sanjay Swarup
Journal:  Plant Physiol       Date:  2005-08       Impact factor: 8.340

8.  Textpresso: an ontology-based information retrieval and extraction system for biological literature.

Authors:  Hans-Michael Müller; Eimear E Kenny; Paul W Sternberg
Journal:  PLoS Biol       Date:  2004-09-21       Impact factor: 8.029

9.  Deafness mutation mining using regular expression based pattern matching.

Authors:  Christopher M Frenz
Journal:  BMC Med Inform Decis Mak       Date:  2007-10-25       Impact factor: 2.796

10.  Automatic extraction of mutations from Medline and cross-validation with OMIM.

Authors:  Dietrich Rebholz-Schuhmann; Stephane Marcel; Sylvie Albert; Ralf Tolle; Georg Casari; Harald Kirsch
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971
