Literature DB >> 20375451

Enhancing data integration with text analysis to find proteins implicated in plant stress response.

Keywan Hassani-Pak1, Roxane Legaie, Catherine Canevet, Hugo A van den Berg, Jonathan D Moore, Christopher J Rawlings.   

Abstract

High throughput genomic studies can identify large numbers of potential candidate genes, which must be interpreted and filtered by investigators to select the best ones for further analysis. Prioritization is generally based on evidence that supports the role of a gene product in the biological process being investigated. The two most important bodies of information providing such evidence are bioinformatics databases and the scientific literature. In this paper we present an extension to the Ondex data integration framework that uses text mining techniques over Medline abstracts as a method for accessing both these bodies of evidence in a consistent way. In an example use case, we apply our method to create a knowledge base of Arabidopsis proteins implicated in plant stress response and use various scoring metrics to identify key protein-stress associations. In conclusion, we show that the additional text mining features are able to highlight proteins using the scientific literature that would not have been seen using data integration alone. Ondex is an open-source software project and can be downloaded, together with the text mining features described here, from www.ondex.org.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20375451     DOI: 10.2390/biecoll-jib-2010-121

Source DB:  PubMed          Journal:  J Integr Bioinform        ISSN: 1613-4516


  5 in total

1.  Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis.

Authors:  Artem Lysenko; Michael Defoin-Platel; Keywan Hassani-Pak; Jan Taubert; Charlie Hodgman; Christopher J Rawlings; Mansoor Saqi
Journal:  BMC Bioinformatics       Date:  2011-05-25       Impact factor: 3.169

2.  Interactive exploration of integrated biological datasets using context-sensitive workflows.

Authors:  Fabian Horn; Martin Rittweger; Jan Taubert; Artem Lysenko; Christopher Rawlings; Reinhard Guthke
Journal:  Front Genet       Date:  2014-02-20       Impact factor: 4.599

3.  Developing integrated crop knowledge networks to advance candidate gene discovery.

Authors:  Keywan Hassani-Pak; Martin Castellote; Maria Esch; Matthew Hindle; Artem Lysenko; Jan Taubert; Christopher Rawlings
Journal:  Appl Transl Genom       Date:  2016-11-02

4.  DES-TOMATO: A Knowledge Exploration System Focused On Tomato Species.

Authors:  Adil Salhi; Sónia Negrão; Magbubah Essack; Mitchell J L Morton; Salim Bougouffa; Rozaimi Razali; Aleksandar Radovanovic; Benoit Marchand; Maxat Kulmanov; Robert Hoehndorf; Mark Tester; Vladimir B Bajic
Journal:  Sci Rep       Date:  2017-07-20       Impact factor: 4.379

Review 5.  Knowledge Discovery in Biological Databases for Revealing Candidate Genes Linked to Complex Phenotypes.

Authors:  Keywan Hassani-Pak; Christopher Rawlings
Journal:  J Integr Bioinform       Date:  2017-06-13
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.