Literature DB >> 30295718

Document triage for identifying protein-protein interactions affected by mutations: a neural network ensemble approach.

Ling Luo1, Zhihao Yang1, Hongfei Lin1, Jian Wang1.   

Abstract

The precision medicine (PM) initiative promises to identify individualized treatment depending on a patients' genetic profile and their related responses. In order to help health professionals and researchers in the PM endeavor, BioCreative VI organized a PM Track to mine protein-protein interactions (PPI) affected by genetic mutations from the biomedical literature. In this paper, we present a neural network ensemble approach to identify relevant articles describing PPI affected by mutations. In this approach, several neural network models are used for document triage, and the ensemble performs better than any individual model. In the official runs, our best submission achieves an F-score of 69.04% in the BioCreative VI PM document triage task. After post-challenge analysis, to address the problem of the limited size of training set, a PPI pre-trained module is incorporated into our approach to further improve the performance. Finally, our best ensemble method achieves an F-score of 71.04% on the test set.

Entities:  

Mesh:

Year:  2018        PMID: 30295718      PMCID: PMC6147215          DOI: 10.1093/database/bay097

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


  16 in total

1.  Automatic early stopping using cross validation: quantifying the criteria.

Authors:  Lutz Prechelt
Journal:  Neural Netw       Date:  1998-06

2.  An Overview of BioCreative II.5.

Authors:  Florian Leitner; Scott A Mardis; Martin Krallinger; Gianni Cesareni; Lynette A Hirschman; Alfonso Valencia
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Jul-Sep       Impact factor: 3.710

3.  Feature generation and representations for protein-protein interaction classification.

Authors:  Man Lan; Chew Lim Tan; Jian Su
Journal:  J Biomed Inform       Date:  2009-07-16       Impact factor: 6.317

4.  Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

Authors: 
Journal:  Neural Comput       Date:  1998-09-15       Impact factor: 2.026

5.  tmVar: a text mining approach for extracting sequence variants in biomedical literature.

Authors:  Chih-Hsuan Wei; Bethany R Harris; Hung-Yu Kao; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-04-05       Impact factor: 6.937

6.  Prioritizing PubMed articles for the Comparative Toxicogenomic Database utilizing semantic information.

Authors:  Sun Kim; Won Kim; Chih-Hsuan Wei; Zhiyong Lu; W John Wilbur
Journal:  Database (Oxford)       Date:  2012-11-17       Impact factor: 3.451

7.  The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

Authors:  Martin Krallinger; Miguel Vazquez; Florian Leitner; David Salgado; Andrew Chatr-Aryamontri; Andrew Winter; Livia Perfetto; Leonardo Briganti; Luana Licata; Marta Iannuccelli; Luisa Castagnoli; Gianni Cesareni; Mike Tyers; Gerold Schneider; Fabio Rinaldi; Robert Leaman; Graciela Gonzalez; Sergio Matos; Sun Kim; W John Wilbur; Luis Rocha; Hagit Shatkay; Ashish V Tendulkar; Shashank Agarwal; Feifan Liu; Xinglong Wang; Rafal Rak; Keith Noto; Charles Elkan; Zhiyong Lu; Rezarta Islamaj Dogan; Jean-Fred Fontaine; Miguel A Andrade-Navarro; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

8.  Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

Authors:  Ayush Singhal; Michael Simmons; Zhiyong Lu
Journal:  PLoS Comput Biol       Date:  2016-11-30       Impact factor: 4.475

9.  The BioGRID interaction database: 2017 update.

Authors:  Andrew Chatr-Aryamontri; Rose Oughtred; Lorrie Boucher; Jennifer Rust; Christie Chang; Nadine K Kolas; Lara O'Donnell; Sara Oster; Chandra Theesfeld; Adnane Sellam; Chris Stark; Bobby-Joe Breitkreutz; Kara Dolinski; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2016-12-14       Impact factor: 16.971

10.  Overview of the protein-protein interaction annotation extraction task of BioCreative II.

Authors:  Martin Krallinger; Florian Leitner; Carlos Rodriguez-Penagos; Alfonso Valencia
Journal:  Genome Biol       Date:  2008-09-01       Impact factor: 13.583

View more
  2 in total

1.  Using deep learning to identify translational research in genomic medicine beyond bench to bedside.

Authors:  Yi-Yu Hsu; Mindy Clyne; Chih-Hsuan Wei; Muin J Khoury; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

2.  Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.

Authors:  Rezarta Islamaj Dogan; Sun Kim; Andrew Chatr-Aryamontri; Chih-Hsuan Wei; Donald C Comeau; Rui Antunes; Sérgio Matos; Qingyu Chen; Aparna Elangovan; Nagesh C Panyam; Karin Verspoor; Hongfang Liu; Yanshan Wang; Zhuang Liu; Berna Altinel; Zehra Melce Hüsünbeyi; Arzucan Özgür; Aris Fergadis; Chen-Kai Wang; Hong-Jie Dai; Tung Tran; Ramakanth Kavuluru; Ling Luo; Albert Steppi; Jinfeng Zhang; Jinchan Qu; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.