Literature DB >> 25088781

How to learn about gene function: text-mining or ontologies?

Theodoros G Soldatos1, Nelson Perdigão2, Nigel P Brown3, Kenneth S Sabir4, Seán I O'Donoghue5.   

Abstract

As the amount of genome information increases rapidly, there is a correspondingly greater need for methods that provide accurate and automated annotation of gene function. For example, many high-throughput technologies--e.g., next-generation sequencing--are being used today to generate lists of genes associated with specific conditions. However, their functional interpretation remains a challenge and many tools exist trying to characterize the function of gene-lists. Such systems rely typically in enrichment analysis and aim to give a quick insight into the underlying biology by presenting it in a form of a summary-report. While the load of annotation may be alleviated by such computational approaches, the main challenge in modern annotation remains to develop a systems form of analysis in which a pipeline can effectively analyze gene-lists quickly and identify aggregated annotations through computerized resources. In this article we survey some of the many such tools and methods that have been developed to automatically interpret the biological functions underlying gene-lists. We overview current functional annotation aspects from the perspective of their epistemology (i.e., the underlying theories used to organize information about gene function into a body of verified and documented knowledge) and find that most of the currently used functional annotation methods fall broadly into one of two categories: they are based either on 'known' formally-structured ontology annotations created by 'experts' (e.g., the GO terms used to describe the function of Entrez Gene entries), or--perhaps more adventurously--on annotations inferred from literature (e.g., many text-mining methods use computer-aided reasoning to acquire knowledge represented in natural languages). Overall however, deriving detailed and accurate insight from such gene lists remains a challenging task, and improved methods are called for. In particular, future methods need to (1) provide more holistic insight into the underlying molecular systems; (2) provide better follow-up experimental testing and treatment options, and (3) better manage gene lists derived from organisms that are not well-studied. We discuss some promising approaches that may help achieve these advances, especially the use of extended dictionaries of biomedical concepts and molecular mechanisms, as well as greater use of annotation benchmarks.
Copyright © 2014 Elsevier Inc. All rights reserved.

Keywords:  Benchmarks; Functional annotation; GO term enrichment; Keyword enhancement; Systems biology; Text mining

Mesh:

Year:  2014        PMID: 25088781     DOI: 10.1016/j.ymeth.2014.07.004

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  10 in total

1.  Finding Gene Associations by Text Mining and Annotating it with Gene Ontology.

Authors:  Oviya Ramalakshmi Iyyappan; Sharanya Manoharan
Journal:  Methods Mol Biol       Date:  2022

2.  DNA Methylation, Deamination, and Translesion Synthesis Combine to Generate Footprint Mutations in Cancer Driver Genes in B-Cell Derived Lymphomas and Other Cancers.

Authors:  Igor B Rogozin; Abiel Roche-Lima; Kathrin Tyryshkin; Kelvin Carrasquillo-Carrión; Artem G Lada; Lennard Y Poliakov; Elena Schwartz; Andreu Saura; Vyacheslav Yurchenko; David N Cooper; Anna R Panchenko; Youri I Pavlov
Journal:  Front Genet       Date:  2021-05-19       Impact factor: 4.599

3.  An application of MeSH enrichment analysis in livestock.

Authors:  G Morota; F Peñagaricano; J L Petersen; D C Ciobanu; K Tsuyuzaki; I Nikaido
Journal:  Anim Genet       Date:  2015-06-02       Impact factor: 3.169

4.  Public Adverse Event Data Insights into the Safety of Pembrolizumab in Melanoma Patients.

Authors:  Anne Schaefer; Christos Sachpekidis; Francesca Diella; Anja Doerks; Anne-Sophie Kratz; Christian Meisel; David B Jackson; Theodoros G Soldatos
Journal:  Cancers (Basel)       Date:  2020-04-19       Impact factor: 6.639

5.  Examining Socioeconomic and Computational Aspects of Vaccine Pharmacovigilance.

Authors:  Vasiliki Soldatou; Anastasios Soldatos; Theodoros Soldatos
Journal:  Biomed Res Int       Date:  2019-02-19       Impact factor: 3.411

Review 6.  ThermoScan: Semi-automatic Identification of Protein Stability Data From PubMed.

Authors:  Paola Turina; Piero Fariselli; Emidio Capriotti
Journal:  Front Mol Biosci       Date:  2021-03-25

Review 7.  Advancing drug safety science by integrating molecular knowledge with post-marketing adverse event reports.

Authors:  Theodoros G Soldatos; Sarah Kim; Stephan Schmidt; Lawrence J Lesko; David B Jackson
Journal:  CPT Pharmacometrics Syst Pharmacol       Date:  2022-02-20

8.  The research on gene-disease association based on text-mining of PubMed.

Authors:  Jie Zhou; Bo-Quan Fu
Journal:  BMC Bioinformatics       Date:  2018-02-07       Impact factor: 3.169

9.  Knowledge-based biomedical Data Science.

Authors:  Lawrence E Hunter
Journal:  EPJ Data Sci       Date:  2017-12-08       Impact factor: 3.184

10.  Radioimmunotherapy in Non-Hodgkin's Lymphoma: Retrospective Adverse Event Profiling of Zevalin and Bexxar.

Authors:  Christos Sachpekidis; David B Jackson; Theodoros G Soldatos
Journal:  Pharmaceuticals (Basel)       Date:  2019-09-20
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.