Literature DB >> 23532071

The potential of text mining in data integration and network biology for plant research: a case study on Arabidopsis.

Sofie Van Landeghem1, Stefanie De Bodt, Zuzanna J Drebert, Dirk Inzé, Yves Van de Peer.   

Abstract

Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein-protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23532071      PMCID: PMC3634689          DOI: 10.1105/tpc.112.108753

Source DB:  PubMed          Journal:  Plant Cell        ISSN: 1040-4651            Impact factor:   11.277


  59 in total

1.  BIND: the Biomolecular Interaction Network Database.

Authors:  Gary D Bader; Doron Betel; Christopher W V Hogue
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

Review 2.  Cell cycle regulation in plant development.

Authors:  Dirk Inzé; Lieven De Veylder
Journal:  Annu Rev Genet       Date:  2006       Impact factor: 16.830

3.  BANNER: an executable survey of advances in biomedical named entity recognition.

Authors:  Robert Leaman; Graciela Gonzalez
Journal:  Pac Symp Biocomput       Date:  2008

Review 4.  Web-queryable large-scale data sets for hypothesis generation in plant biology.

Authors:  Siobhan M Brady; Nicholas J Provart
Journal:  Plant Cell       Date:  2009-04-28       Impact factor: 11.277

5.  Enrichment and aggregation of topological motifs are independent organizational principles of integrated interaction networks.

Authors:  Tom Michoel; Anagha Joshi; Bruno Nachtergaele; Yves Van de Peer
Journal:  Mol Biosyst       Date:  2011-08-23

6.  PETAL LOSS is a boundary gene that inhibits growth between developing sepals in Arabidopsis thaliana.

Authors:  Edwin R Lampugnani; Aydin Kilinc; David R Smyth
Journal:  Plant J       Date:  2012-06-14       Impact factor: 6.417

7.  Evidence for network evolution in an Arabidopsis interactome map.

Authors: 
Journal:  Science       Date:  2011-07-29       Impact factor: 47.728

8.  Graph-based analysis and visualization of experimental results with ONDEX.

Authors:  Jacob Köhler; Jan Baumbach; Jan Taubert; Michael Specht; Andre Skusa; Alexander Rüegg; Chris Rawlings; Paul Verrier; Stephan Philippi
Journal:  Bioinformatics       Date:  2006-03-13       Impact factor: 6.937

9.  Motifs, themes and thematic maps of an integrated Saccharomyces cerevisiae interaction network.

Authors:  Lan V Zhang; Oliver D King; Sharyl L Wong; Debra S Goldberg; Amy H Y Tong; Guillaume Lesage; Brenda Andrews; Howard Bussey; Charles Boone; Frederick P Roth
Journal:  J Biol       Date:  2005-06-01

10.  A systems approach reveals regulatory circuitry for Arabidopsis trichome initiation by the GL3 and GL1 selectors.

Authors:  Kengo Morohashi; Erich Grotewold
Journal:  PLoS Genet       Date:  2009-02-27       Impact factor: 5.917

View more
  14 in total

Review 1.  Root systems biology: integrative modeling across scales, from gene regulatory networks to the rhizosphere.

Authors:  Kristine Hill; Silvana Porco; Guillaume Lobet; Susan Zappala; Sacha Mooney; Xavier Draye; Malcolm J Bennett
Journal:  Plant Physiol       Date:  2013-10-18       Impact factor: 8.340

2.  RLIMS-P 2.0: A Generalizable Rule-Based Information Extraction System for Literature Mining of Protein Phosphorylation Information.

Authors:  Manabu Torii; Cecilia N Arighi; Gang Li; Qinghua Wang; Cathy H Wu; K Vijay-Shanker
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2015 Jan-Feb       Impact factor: 3.710

3.  What Is Stress? Dose-Response Effects in Commonly Used in Vitro Stress Assays.

Authors:  Hannes Claeys; Sofie Van Landeghem; Marieke Dubois; Katrien Maleux; Dirk Inzé
Journal:  Plant Physiol       Date:  2014-04-07       Impact factor: 8.340

4.  Leaf responses to mild drought stress in natural variants of Arabidopsis.

Authors:  Pieter Clauw; Frederik Coppens; Kristof De Beuf; Stijn Dhondt; Twiggy Van Daele; Katrien Maleux; Veronique Storme; Lieven Clement; Nathalie Gonzalez; Dirk Inzé
Journal:  Plant Physiol       Date:  2015-01-20       Impact factor: 8.340

5.  Machine learning-based differential network analysis: a study of stress-responsive transcriptomes in Arabidopsis.

Authors:  Chuang Ma; Mingming Xin; Kenneth A Feldmann; Xiangfeng Wang
Journal:  Plant Cell       Date:  2014-02-11       Impact factor: 11.277

6.  Computational drug repositioning through heterogeneous network clustering.

Authors:  Chao Wu; Ranga C Gudivada; Bruce J Aronow; Anil G Jegga
Journal:  BMC Syst Biol       Date:  2013-12-09

7.  Cross-species Conservation of context-specific networks.

Authors:  Robert Pesch; Ralf Zimmer
Journal:  BMC Syst Biol       Date:  2016-08-17

8.  CARFMAP: A Curated Pathway Map of Cardiac Fibroblasts.

Authors:  Hieu T Nim; Milena B Furtado; Mauro W Costa; Hiroaki Kitano; Nadia A Rosenthal; Sarah E Boyd
Journal:  PLoS One       Date:  2015-12-16       Impact factor: 3.240

9.  Manually curated database of rice proteins.

Authors:  Pratibha Gour; Priyanka Garg; Rashmi Jain; Shaji V Joseph; Akhilesh K Tyagi; Saurabh Raghuvanshi
Journal:  Nucleic Acids Res       Date:  2013-11-07       Impact factor: 16.971

10.  Application of the EVEX resource to event extraction and network construction: Shared Task entry and result analysis.

Authors:  Kai Hakala; Sofie Van Landeghem; Tapio Salakoski; Yves Van de Peer; Filip Ginter
Journal:  BMC Bioinformatics       Date:  2015-10-30       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.