Literature DB >> 30809637

Statistical principle-based approach for recognizing and normalizing microRNAs described in scientific literature.

Hong-Jie Dai1, Chen-Kai Wang2, Nai-Wen Chang3,4, Ming-Siang Huang4, Jitendra Jonnagaddala5, Feng-Duo Wang6, Wen-Lian Hsu4.   

Abstract

The detection of MicroRNA (miRNA) mentions in scientific literature facilitates researchers with the ability to find relevant and appropriate literature based on queries formulated using miRNA information. Considering most published biological studies elaborated on signal transduction pathways or genetic regulatory information in the form of figure captions, the extraction of miRNA from both the main content and figure captions of a manuscript is useful in aggregate analysis and comparative analysis of the studies published. In this study, we present a statistical principle-based miRNA recognition and normalization method to identify miRNAs and link them to the identifiers in the Rfam database. As one of the core components in the text mining pipeline of the database miRTarBase, the proposed method combined the advantages of previous works relying on pattern, dictionary and supervised learning and provided an integrated solution for the problem of miRNA identification. Furthermore, the knowledge learned from the training data was organized in a human-interpretable manner to understand the reason why the system considers a span of text as a miRNA mention, and the represented knowledge can be further complemented by domain experts. We studied the ambiguity level of miRNA nomenclature to connect the miRNA mentions to the Rfam database and evaluated the performance of our approach on two datasets: the BioCreative VI Bio-ID corpus and the miRNA interaction corpus by extending the later corpus with additional Rfam normalization information. Our study highlights and also proposes a better understanding of the challenges associated with miRNA identification and normalization in scientific literature and the research gap that needs to be further explored in prospective studies.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30809637      PMCID: PMC6391575          DOI: 10.1093/database/baz030

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  26 in total

1.  An in silico analysis of microRNAs: mining the miRNAome.

Authors:  B Stuart Murray; Sung E Choe; Matthew Woods; Terence E Ryan; Wei Liu
Journal:  Mol Biosyst       Date:  2010-06-11

2.  Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes.

Authors:  Scott Baskerville; David P Bartel
Journal:  RNA       Date:  2005-03       Impact factor: 4.942

3.  Combinatorial microRNA target predictions.

Authors:  Azra Krek; Dominic Grün; Matthew N Poy; Rachel Wolf; Lauren Rosenberg; Eric J Epstein; Philip MacMenamin; Isabelle da Piedade; Kristin C Gunsalus; Markus Stoffel; Nikolaus Rajewsky
Journal:  Nat Genet       Date:  2005-04-03       Impact factor: 38.330

Review 4.  miRBase: the microRNA sequence database.

Authors:  Sam Griffiths-Jones
Journal:  Methods Mol Biol       Date:  2006

5.  Isolation and genetic characterization of cell-lineage mutants of the nematode Caenorhabditis elegans.

Authors:  H R Horvitz; J E Sulston
Journal:  Genetics       Date:  1980-10       Impact factor: 4.562

6.  miRCancer: a microRNA-cancer association database constructed by text mining on literature.

Authors:  Boya Xie; Qin Ding; Hongjin Han; Di Wu
Journal:  Bioinformatics       Date:  2013-01-16       Impact factor: 6.937

7.  MicroRNA gene expression deregulation in human breast cancer.

Authors:  Marilena V Iorio; Manuela Ferracin; Chang-Gong Liu; Angelo Veronese; Riccardo Spizzo; Silvia Sabbioni; Eros Magri; Massimo Pedriali; Muller Fabbri; Manuela Campiglio; Sylvie Ménard; Juan P Palazzo; Anne Rosenberg; Piero Musiani; Stefano Volinia; Italo Nenci; George A Calin; Patrizia Querzoli; Massimo Negrini; Carlo M Croce
Journal:  Cancer Res       Date:  2005-08-15       Impact factor: 12.701

8.  Detecting miRNA Mentions and Relations in Biomedical Literature.

Authors:  Shweta Bagewadi; Tamara Bobić; Martin Hofmann-Apitius; Juliane Fluck; Roman Klinger
Journal:  F1000Res       Date:  2014-08-28

9.  BioC: a minimalist approach to interoperability for biomedical text processing.

Authors:  Donald C Comeau; Rezarta Islamaj Doğan; Paolo Ciccarese; Kevin Bretonnel Cohen; Martin Krallinger; Florian Leitner; Zhiyong Lu; Yifan Peng; Fabio Rinaldi; Manabu Torii; Alfonso Valencia; Karin Verspoor; Thomas C Wiegers; Cathy H Wu; W John Wilbur
Journal:  Database (Oxford)       Date:  2013-09-18       Impact factor: 3.451

10.  miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.

Authors:  Samir Gupta; Karen E Ross; Catalina O Tudor; Cathy H Wu; Carl J Schmidt; K Vijay-Shanker
Journal:  J Biomed Semantics       Date:  2016-04-29
View more
  1 in total

1.  Posterior cingulate cortex reveals an expression profile of resilience in cognitively intact elders.

Authors:  Christy M Kelley; Stephen D Ginsberg; Winnie S Liang; Scott E Counts; Elliott J Mufson
Journal:  Brain Commun       Date:  2022-06-21
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.