
COVID-DeepPredictor: Recurrent Neural Network to Predict SARS-CoV-2 and Other Pathogenic Viruses.

Indrajit Saha, Nimisha Ghosh, Debasree Maity, Arjit Seal, Dariusz Plewczynski.

Abstract

COVID-19, the disease caused by the novel coronavirus SARS-CoV-2, has become a global pandemic. The high transmission rate of this pathogenic virus demands early prediction and proper identification so that treatment can follow. However, the polymorphic nature of the virus allows it to adapt to and persist in different environments, which makes it difficult to predict. Other pathogens such as SARS-CoV-1, MERS-CoV, Ebola, Dengue, and Influenza also exist, so a predictor that distinguishes among them using their genomic information is needed. To address this problem, this work proposes COVID-DeepPredictor, a deep learning framework for identifying an unknown sequence as one of these pathogens. COVID-DeepPredictor uses a Long Short-Term Memory (LSTM) Recurrent Neural Network for the underlying prediction, together with an alignment-free technique. Specifically, the k-mer technique is applied to create Bag-of-Descriptors (BoDs), from which Bag-of-Unique-Descriptors (BoUDs) are generated as the vocabulary, and an embedded representation is then prepared for the given virus sequences. The predictor is validated not only on the dataset using K-fold cross-validation but also on unseen test datasets of SARS-CoV-2 sequences and sequences from other viruses. To verify its efficacy, COVID-DeepPredictor is compared with other state-of-the-art prediction techniques based on Linear Discriminant Analysis, Random Forests, and the Gradient Boosting Method. COVID-DeepPredictor achieves 100% prediction accuracy on the validation dataset, while on the test datasets the accuracy ranges from 99.51 to 99.94%. It also outperforms the other prediction techniques.
In addition, the accuracy and runtime of COVID-DeepPredictor are considered together to determine the value of k in k-mer. A comparative study among k values in k-mer, Bag-of-Descriptors (BoDs), and Bag-of-Unique-Descriptors (BoUDs), and a comparison between COVID-DeepPredictor and Nucleotide BLAST, have also been performed. The code, training, and test datasets used for COVID-DeepPredictor are available at http://www.nitttrkol.ac.in/indrajit/projects/COVID-DeepPredictor/.
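The alignment-free preprocessing described in the abstract (k-mers, then BoDs, then BoUDs as vocabulary, then an integer encoding that an embedding layer could consume) can be illustrated with a minimal Python sketch. This is not the authors' released code. The function names `kmers`, `build_vocabulary`, and `encode`, the overlapping stride-1 window, the sorted vocabulary order, and the use of id 0 for padding and unknown descriptors are all assumptions made for illustration; the toy sequences are invented.

```python
from typing import Dict, List


def kmers(sequence: str, k: int) -> List[str]:
    """Return the overlapping k-mers of a sequence (assumed stride of 1).

    The resulting list plays the role of one Bag-of-Descriptors (BoD).
    """
    seq = sequence.upper()
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def build_vocabulary(bods: List[List[str]]) -> Dict[str, int]:
    """Collect the unique descriptors across all BoDs (the BoUDs).

    Ids start at 1; id 0 is reserved here for padding and unknown
    descriptors (an assumption, not taken from the paper).
    """
    unique = sorted({descriptor for bod in bods for descriptor in bod})
    return {descriptor: idx + 1 for idx, descriptor in enumerate(unique)}


def encode(bod: List[str], vocab: Dict[str, int], length: int) -> List[int]:
    """Turn one BoD into a fixed-length list of ids, truncating or zero-padding."""
    ids = [vocab.get(descriptor, 0) for descriptor in bod][:length]
    return ids + [0] * (length - len(ids))


# Toy example with two invented nucleotide fragments and k = 3.
sequences = ["ATGCGATG", "ATGCCC"]
bods = [kmers(s, 3) for s in sequences]
vocab = build_vocabulary(bods)
encoded = [encode(bod, vocab, length=6) for bod in bods]
print(encoded)
```

The fixed-length integer lists are the kind of input an embedding layer followed by an LSTM would typically accept. Over the alphabet A, C, G, T the vocabulary can hold at most 4^k descriptors, which is one reason the choice of k trades accuracy against runtime.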
Copyright © 2021 Saha, Ghosh, Maity, Seal and Plewczynski.

Keywords:  SARS-CoV-2; genomic information; long short-term memory; sequence analysis; virus prediction

Year:  2021        PMID: 33643375      PMCID: PMC7906283          DOI: 10.3389/fgene.2021.569120

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  23 in total

1.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

2.  Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China.

Authors:  Y Guan; B J Zheng; Y Q He; X L Liu; Z X Zhuang; C L Cheung; S W Luo; P H Li; L J Zhang; Y J Guan; K M Butt; K L Wong; K W Chan; W Lim; K F Shortridge; K Y Yuen; J S M Peiris; L L M Poon
Journal:  Science       Date:  2003-09-04       Impact factor: 47.728

3.  An open-source k-mer based machine learning tool for fast and accurate subtyping of HIV-1 genomes.

Authors:  Stephen Solis-Reyes; Mariano Avino; Art Poon; Lila Kari
Journal:  PLoS One       Date:  2018-11-14       Impact factor: 3.240

4.  Recent Advances of Deep Learning in Bioinformatics and Computational Biology. (Review)

Authors:  Binhua Tang; Zixiang Pan; Kang Yin; Asif Khateeb
Journal:  Front Genet       Date:  2019-03-26       Impact factor: 4.599

5.  Automated detection of COVID-19 cases using deep neural networks with X-ray images.

Authors:  Tulin Ozturk; Muhammed Talo; Eylul Azra Yildirim; Ulas Baran Baloglu; Ozal Yildirim; U Rajendra Acharya
Journal:  Comput Biol Med       Date:  2020-04-28       Impact factor: 4.589

6.  Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus.

Authors:  Yushun Wan; Jian Shang; Rachel Graham; Ralph S Baric; Fang Li
Journal:  J Virol       Date:  2020-03-17       Impact factor: 5.103

7.  A benchmark study of k-mer counting methods for high-throughput sequencing.

Authors:  Swati C Manekar; Shailesh R Sathe
Journal:  Gigascience       Date:  2018-12-01       Impact factor: 6.524

8.  Deep-learning-based Prediction of Late Age-Related Macular Degeneration Progression.

Authors:  Qi Yan; Daniel E Weeks; Hongyi Xin; Anand Swaroop; Emily Y Chew; Heng Huang; Ying Ding; Wei Chen
Journal:  Nat Mach Intell       Date:  2020-02-14

9.  Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses.

Authors:  Michael Letko; Andrea Marzi; Vincent Munster
Journal:  Nat Microbiol       Date:  2020-02-24       Impact factor: 17.745

10.  Potential inhibitors against 2019-nCoV coronavirus M protease from clinically approved medicines.

Authors:  Xin Liu; Xiu-Jie Wang
Journal:  J Genet Genomics       Date:  2020-02-13       Impact factor: 4.275

  5 in total

1.  Integrated COVID-19 Predictor: Differential expression analysis to reveal potential biomarkers and prediction of coronavirus using RNA-Seq profile data.

Authors:  Naiyar Iqbal; Pradeep Kumar
Journal:  Comput Biol Med       Date:  2022-06-03       Impact factor: 6.698

2.  Conserved molecular signatures in the spike protein provide evidence indicating the origin of SARS-CoV-2 and a Pangolin-CoV (MP789) by recombination(s) between specific lineages of Sarbecoviruses.

Authors:  Bijendra Khadka; Radhey S Gupta
Journal:  PeerJ       Date:  2021-11-12       Impact factor: 2.984

3.  Sequencing meets machine learning to fight emerging pathogens: A preview.

Authors:  Artur Yakimovich
Journal:  Patterns (N Y)       Date:  2022-02-11

4.  Exploring the Lethality of Human-Adapted Coronavirus Through Alignment-Free Machine Learning Approaches Using Genomic Sequences.

Authors:  Rui Yin; Zihan Luo; Chee Keong Kwoh
Journal:  Curr Genomics       Date:  2021-12-31       Impact factor: 2.689

5.  Mapping Data to Deep Understanding: Making the Most of the Deluge of SARS-CoV-2 Genome Sequences.

Authors:  Bahrad A Sokhansanj; Gail L Rosen
Journal:  mSystems       Date:  2022-03-21       Impact factor: 7.324
