Literature DB >> 21687783

Mining metabolites: extracting the yeast metabolome from the literature.

Chikashi Nobata, Paul D Dobson, Syed A Iqbal, Pedro Mendes, Jun'ichi Tsujii, Douglas B Kell, Sophia Ananiadou.   

Abstract

Text mining methods have added considerably to our capacity to extract biological knowledge from the literature. Recently the field of systems biology has begun to model and simulate metabolic networks, requiring knowledge of the set of molecules involved. While genomics and proteomics technologies are able to supply the macromolecular parts list, the metabolites are less easily assembled. Most metabolites are known and reported through the scientific literature, rather than through large-scale experimental surveys. Thus it is important to recover them from the literature. Here we present a novel tool to automatically identify metabolite names in the literature, and associate structures where possible, to define the reported yeast metabolome. With ten-fold cross validation on a manually annotated corpus, our recognition tool generates an f-score of 78.49 (precision of 83.02) and demonstrates greater suitability in identifying metabolite names than other existing recognition tools for general chemical molecules. The metabolite recognition tool has been applied to the literature covering an important model organism, the yeast Saccharomyces cerevisiae, to define its reported metabolome. By coupling to ChemSpider, a major chemical database, we have identified structures for much of the reported metabolome and, where structure identification fails, been able to suggest extensions to ChemSpider. Our manually annotated gold-standard data on 296 abstracts are available as supplementary materials. Metabolite names and, where appropriate, structures are also available as supplementary materials. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-010-0251-6) contains supplementary material, which is available to authorized users.

Entities:  

Year:  2010        PMID: 21687783      PMCID: PMC3111869          DOI: 10.1007/s11306-010-0251-6

Source DB:  PubMed          Journal:  Metabolomics        ISSN: 1573-3882            Impact factor:   4.290


  37 in total

1.  Analysis of biomedical text for chemical names: a comparison of three methods.

Authors:  W J Wilbur; G F Hazard; G Divita; J G Mork; A R Aronson; A C Browne
Journal:  Proc AMIA Symp       Date:  1999

2.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  The ENZYME database in 2000.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  Improving the performance of dictionary-based approaches in protein name recognition.

Authors:  Yoshimasa Tsuruoka; Jun'ichi Tsujii
Journal:  J Biomed Inform       Date:  2004-12       Impact factor: 6.317

Review 5.  Mining chemical structural information from the drug literature.

Authors:  Debra L Banville
Journal:  Drug Discov Today       Date:  2006-01       Impact factor: 7.851

6.  Journal club. A systems biologist ponders how disparate ideas can sometimes come together beautifully.

Authors:  Douglas Kell
Journal:  Nature       Date:  2009-08-06       Impact factor: 49.962

7.  Text mining meets workflow: linking U-Compare with Taverna.

Authors:  Yoshinobu Kano; Paul Dobson; Mio Nakanishi; Jun'ichi Tsujii; Sophia Ananiadou
Journal:  Bioinformatics       Date:  2010-08-12       Impact factor: 6.937

8.  Detection of IUPAC and IUPAC-like chemical names.

Authors:  Roman Klinger; Corinna Kolárik; Juliane Fluck; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

9.  From genomics to chemical genomics: new developments in KEGG.

Authors:  Minoru Kanehisa; Susumu Goto; Masahiro Hattori; Kiyoko F Aoki-Kinoshita; Masumi Itoh; Shuichi Kawashima; Toshiaki Katayama; Michihiro Araki; Mika Hirakawa
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Overview of BioCreAtIvE: critical assessment of information extraction for biology.

Authors:  Lynette Hirschman; Alexander Yeh; Christian Blaschke; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

View more
  20 in total

1.  Cognitive analysis of metabolomics data for systems biology.

Authors:  Erica L-W Majumder; Elizabeth M Billings; H Paul Benton; Richard L Martin; Amelia Palermo; Carlos Guijas; Markus M Rinschen; Xavier Domingo-Almenara; J Rafael Montenegro-Burke; Bradley A Tagtow; Robert S Plumb; Gary Siuzdak
Journal:  Nat Protoc       Date:  2021-01-22       Impact factor: 13.491

2.  MetaboListem and TABoLiSTM: Two Deep Learning Algorithms for Metabolite Named Entity Recognition.

Authors:  Cheng S Yeung; Tim Beck; Joram M Posma
Journal:  Metabolites       Date:  2022-03-22

Review 3.  Integrating bioinformatics approaches for a comprehensive interpretation of metabolomics datasets.

Authors:  Dinesh Kumar Barupal; Sili Fan; Oliver Fiehn
Journal:  Curr Opin Biotechnol       Date:  2018-02-06       Impact factor: 9.740

Review 4.  How close are we to complete annotation of metabolomes?

Authors:  Mark R Viant; Irwin J Kurland; Martin R Jones; Warwick B Dunn
Journal:  Curr Opin Chem Biol       Date:  2017-01-21       Impact factor: 8.822

5.  A text-mining system for extracting metabolic reactions from full-text articles.

Authors:  Jan Czarnecki; Irene Nobeli; Adrian M Smith; Adrian J Shepherd
Journal:  BMC Bioinformatics       Date:  2012-07-23       Impact factor: 3.169

6.  Context-based resolution of semantic conflicts in biological pathways.

Authors:  Seyeol Yoon; Jinmyung Jung; Hasun Yu; Mijin Kwon; Sungji Choo; Kyunghyun Park; Dongjin Jang; Sangwoo Kim; Doheon Lee
Journal:  BMC Med Inform Decis Mak       Date:  2015-05-20       Impact factor: 2.796

7.  An analysis of a 'community-driven' reconstruction of the human metabolic network.

Authors:  Neil Swainston; Pedro Mendes; Douglas B Kell
Journal:  Metabolomics       Date:  2013-07-12       Impact factor: 4.290

8.  Stringent response of Escherichia coli: revisiting the bibliome using literature mining.

Authors:  Sónia Carneiro; Anália Lourenço; Eugénio C Ferreira; Isabel Rocha
Journal:  Microb Inform Exp       Date:  2011-12-30

Review 9.  Metabolomics and systems pharmacology: why and how to model the human metabolic network for drug discovery.

Authors:  Douglas B Kell; Royston Goodacre
Journal:  Drug Discov Today       Date:  2013-07-26       Impact factor: 7.851

10.  Processing biological literature with customizable Web services supporting interoperable formats.

Authors:  Rafal Rak; Riza Theresa Batista-Navarro; Jacob Carter; Andrew Rowley; Sophia Ananiadou
Journal:  Database (Oxford)       Date:  2014-07-08       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.