Literature DB >> 34160596

LSTM-PHV: prediction of human-virus protein-protein interactions by LSTM with word2vec.

Sho Tsukiyama1, Md Mehedi Hasan2, Satoshi Fujii3, Hiroyuki Kurata3.   

Abstract

Viral infection involves a large number of protein-protein interactions (PPIs) between human and virus. The PPIs range from the initial binding of viral coat proteins to host membrane receptors to the hijacking of host transcription machinery. However, few interspecies PPIs have been identified, because experimental methods including mass spectrometry are time-consuming and expensive, and molecular dynamic simulation is limited only to the proteins whose 3D structures are solved. Sequence-based machine learning methods are expected to overcome these problems. We have first developed the LSTM model with word2vec to predict PPIs between human and virus, named LSTM-PHV, by using amino acid sequences alone. The LSTM-PHV effectively learnt the training data with a highly imbalanced ratio of positive to negative samples and achieved AUCs of 0.976 and 0.973 and accuracies of 0.984 and 0.985 on the training and independent datasets, respectively. In predicting PPIs between human and unknown or new virus, the LSTM-PHV learned greatly outperformed the existing state-of-the-art PPI predictors. Interestingly, learning of only sequence contexts as words is sufficient for PPI prediction. Use of uniform manifold approximation and projection demonstrated that the LSTM-PHV clearly distinguished the positive PPI samples from the negative ones. We presented the LSTM-PHV online web server and support data that are freely available at http://kurata35.bio.kyutech.ac.jp/LSTM-PHV.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  LSTM; SARS-CoV2; deep learning; human-virus protein–protein interaction; word2vec

Mesh:

Substances:

Year:  2021        PMID: 34160596      PMCID: PMC8574953          DOI: 10.1093/bib/bbab228

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  26 in total

1.  DeNovo: virus-host sequence-based protein-protein interaction prediction.

Authors:  Fatma-Elzahraa Eid; Mahmoud ElHefnawi; Lenwood S Heath
Journal:  Bioinformatics       Date:  2015-12-16       Impact factor: 6.937

2.  The IntAct molecular interaction database in 2012.

Authors:  Samuel Kerrien; Bruno Aranda; Lionel Breuza; Alan Bridge; Fiona Broackes-Carter; Carol Chen; Margaret Duesbury; Marine Dumousseau; Marc Feuermann; Ursula Hinz; Christine Jandrasits; Rafael C Jimenez; Jyoti Khadake; Usha Mahadevan; Patrick Masson; Ivo Pedruzzi; Eric Pfeiffenberger; Pablo Porras; Arathi Raghunath; Bernd Roechert; Sandra Orchard; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2011-11-24       Impact factor: 16.971

3.  BioGRID: a general repository for interaction datasets.

Authors:  Chris Stark; Bobby-Joe Breitkreutz; Teresa Reguly; Lorrie Boucher; Ashton Breitkreutz; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

4.  Identifying antimicrobial peptides using word embedding with deep recurrent neural networks.

Authors:  Md-Nafiz Hamid; Iddo Friedberg
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

5.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

6.  The landscape of human proteins interacting with viruses and other pathogens.

Authors:  Matthew D Dyer; T M Murali; Bruno W Sobral
Journal:  PLoS Pathog       Date:  2008-02-08       Impact factor: 6.823

Review 7.  Deciphering protein-protein interactions. Part I. Experimental techniques and databases.

Authors:  Benjamin A Shoemaker; Anna R Panchenko
Journal:  PLoS Comput Biol       Date:  2007-03-30       Impact factor: 4.475

8.  HPIDB 2.0: a curated database for host-pathogen interactions.

Authors:  Mais G Ammari; Cathy R Gresham; Fiona M McCarthy; Bindu Nanduri
Journal:  Database (Oxford)       Date:  2016-07-03       Impact factor: 3.451

9.  Protein-Protein Interactions Prediction Using a Novel Local Conjoint Triad Descriptor of Amino Acid Sequences.

Authors:  Jun Wang; Long Zhang; Lianyin Jia; Yazhou Ren; Guoxian Yu
Journal:  Int J Mol Sci       Date:  2017-11-08       Impact factor: 5.923

10.  Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method.

Authors:  Xiaodi Yang; Shiping Yang; Qinmengge Li; Stefan Wuchty; Ziding Zhang
Journal:  Comput Struct Biotechnol J       Date:  2019-12-26       Impact factor: 7.271

View more
  7 in total

1.  PRIP: A Protein-RNA Interface Predictor Based on Semantics of Sequences.

Authors:  You Li; Jianyi Lyu; Yaoqun Wu; Yuewu Liu; Guohua Huang
Journal:  Life (Basel)       Date:  2022-02-18

2.  Hierarchical representation for PPI sites prediction.

Authors:  Michela Quadrini; Sebastian Daberdaku; Carlo Ferrari
Journal:  BMC Bioinformatics       Date:  2022-03-20       Impact factor: 3.169

3.  Comprehensive characterization of human-virus protein-protein interactions reveals disease comorbidities and potential antiviral drugs.

Authors:  Si Li; Weiwei Zhou; Donghao Li; Tao Pan; Jing Guo; Haozhe Zou; Zhanyu Tian; Kongning Li; Juan Xu; Xia Li; Yongsheng Li
Journal:  Comput Struct Biotechnol J       Date:  2022-03-07       Impact factor: 7.271

4.  Decoding the protein-ligand interactions using parallel graph neural networks.

Authors:  Carter Knutson; Mridula Bontha; Jenna A Bilbrey; Neeraj Kumar
Journal:  Sci Rep       Date:  2022-05-10       Impact factor: 4.996

5.  Deep Learning-Powered Prediction of Human-Virus Protein-Protein Interactions.

Authors:  Xiaodi Yang; Shiping Yang; Panyu Ren; Stefan Wuchty; Ziding Zhang
Journal:  Front Microbiol       Date:  2022-04-15       Impact factor: 6.064

6.  Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings.

Authors:  Sumit Madan; Victoria Demina; Marcus Stapf; Oliver Ernst; Holger Fröhlich
Journal:  Patterns (N Y)       Date:  2022-07-31

Review 7.  Protein-protein interaction prediction with deep learning: A comprehensive review.

Authors:  Farzan Soleymani; Eric Paquet; Herna Viktor; Wojtek Michalowski; Davide Spinello
Journal:  Comput Struct Biotechnol J       Date:  2022-09-19       Impact factor: 6.155

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.