Literature DB >> 31725857

Biomarker identification of hepatocellular carcinoma using a methodical literature mining strategy.

Nai-Wen Chang1,2, Hong-Jie Dai3,4, Yung-Yu Shih2, Chi-Yang Wu2, Mira Anne C Dela Rosa5, Rofeamor P Obena5, Yu-Ju Chen5, Wen-Lian Hsu2, Yen-Jen Oyang1.   

Abstract

Hepatocellular carcinoma (HCC), one of the most common causes of cancer-related deaths, carries a 5-year survival rate of 18%, underscoring the need for robust biomarkers. In spite of the increased availability of HCC related literatures, many of the promising biomarkers reported have not been validated for clinical use. To narrow down the wide range of possible biomarkers for further clinical validation, bioinformaticians need to sort them out using information provided in published works. Biomedical text mining is an automated way to obtain information of interest within the massive collection of biomedical knowledge, thus enabling extraction of data for biomarkers associated with certain diseases. This method can significantly reduce both the time and effort spent on studying important maladies such as liver diseases. Herein, we report a text mining-aided curation pipeline to identify potential biomarkers for liver cancer. The curation pipeline integrates PubMed E-Utilities to collect abstracts from PubMed and recognize several types of named entities by machine learning-based and pattern-based methods. Genes/proteins from evidential sentences were classified as candidate biomarkers using a convolutional neural network. Lastly, extracted biomarkers were ranked depending on several criteria, such as the frequency of keywords and articles and the journal impact factor, and then integrated into a meaningful list for bioinformaticians. Based on the developed pipeline, we constructed MarkerHub, which contains 2128 candidate biomarkers extracted from PubMed publications from 2008 to 2017. Database URL: http://markerhub.iis.sinica.edu.tw.
© The Author(s) 2017. Published by Oxford University Press.

Entities:  

Year:  2017        PMID: 31725857      PMCID: PMC7243925          DOI: 10.1093/database/bax082

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  77 in total

Review 1.  Metabolite profiling: from diagnostics to systems biology.

Authors:  Alisdair R Fernie; Richard N Trethewey; Arno J Krotzky; Lothar Willmitzer
Journal:  Nat Rev Mol Cell Biol       Date:  2004-09       Impact factor: 94.444

2.  DisGeNET: a Cytoscape plugin to visualize, integrate, search and analyze gene-disease networks.

Authors:  Anna Bauer-Mehren; Michael Rautschka; Ferran Sanz; Laura I Furlong
Journal:  Bioinformatics       Date:  2010-09-21       Impact factor: 6.937

Review 3.  Key cancer cell signal transduction pathways as therapeutic targets.

Authors:  Roberto Bianco; Davide Melisi; Fortunato Ciardiello; Giampaolo Tortora
Journal:  Eur J Cancer       Date:  2006-01-11       Impact factor: 9.162

4.  TMT-HCC: a tool for text mining the biomedical literature for hepatocellular carcinoma (HCC) biomarkers identification.

Authors:  Rania A Abul Seoud; Mai S Mabrouk
Journal:  Comput Methods Programs Biomed       Date:  2013-08-23       Impact factor: 5.428

5.  Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?

Authors:  Wouter G Touw; Jumamurat R Bayjanov; Lex Overmars; Lennart Backus; Jos Boekhorst; Michiel Wels; Sacha A F T van Hijum
Journal:  Brief Bioinform       Date:  2012-07-10       Impact factor: 11.622

6.  Next-generation human genetics.

Authors:  Jay Shendure
Journal:  Genome Biol       Date:  2011-09-14       Impact factor: 13.583

7.  UniProt: a hub for protein information.

Authors: 
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

8.  PubTator: a web-based text mining tool for assisting biocuration.

Authors:  Chih-Hsuan Wei; Hung-Yu Kao; Zhiyong Lu
Journal:  Nucleic Acids Res       Date:  2013-05-22       Impact factor: 16.971

9.  AFP, PIVKAII, GP3, SCCA-1 and follisatin as surveillance biomarkers for hepatocellular cancer in non-alcoholic and alcoholic fatty liver disease.

Authors:  Gary Beale; Dipankar Chattopadhyay; Joe Gray; Stephen Stewart; Mark Hudson; Christopher Day; Paolo Trerotoli; Gianluigi Giannelli; Derek Manas; Helen Reeves
Journal:  BMC Cancer       Date:  2008-07-18       Impact factor: 4.430

10.  LiverCancerMarkerRIF: a liver cancer biomarker interactive curation system combining text mining and expert annotations.

Authors:  Hong-Jie Dai; Johnny Chi-Yang Wu; Wei-San Lin; Aaron James F Reyes; Mira Anne C Dela Rosa; Shabbir Syed-Abdul; Richard Tzong-Han Tsai; Wen-Lian Hsu
Journal:  Database (Oxford)       Date:  2014-08-27       Impact factor: 3.451

View more
  2 in total

1.  Identification of most influential co-occurring gene suites for gastrointestinal cancer using biomedical literature mining and graph-based influence maximization.

Authors:  Charles C N Wang; Jennifer Jin; Jan-Gowth Chang; Masahiro Hayakawa; Atsushi Kitazawa; Jeffrey J P Tsai; Phillip C-Y Sheu
Journal:  BMC Med Inform Decis Mak       Date:  2020-09-03       Impact factor: 2.796

2.  Real-world biomarker testing rate and positivity rate in NSCLC in Spain: Prospective Central Lung Cancer Biomarker Testing Registry (LungPath) from the Spanish Society of Pathology (SEAP).

Authors:  Clara Salas; Javier Martín-López; Antonio Martínez-Pozo; Teresa Hernández-Iglesias; David Carcedo; Lucia Ruiz de Alda; J Francisco García; Federico Rojo
Journal:  J Clin Pathol       Date:  2021-03-15       Impact factor: 3.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.