Literature DB >> 34812411

Genome-wide identification and prediction of SARS-CoV-2 mutations show an abundance of variants: Integrated study of bioinformatics and deep neural learning.

Md Shahadat Hossain1, A Q M Sala Uddin Pathan2, Md Nur Islam1, Mahafujul Islam Quadery Tonmoy1, Mahmudul Islam Rakib2, Md Adnan Munim1, Otun Saha3, Atqiya Fariha1, Hasan Al Reza4, Maitreyee Roy5, Newaz Mohammed Bahadur6, Md Mizanur Rahaman3.   

Abstract

Genomic data analysis is a fundamental system for monitoring pathogen evolution and the outbreak of infectious diseases. Based on bioinformatics and deep learning, this study was designed to identify the genomic variability of SARS-CoV-2 worldwide and predict the impending mutation rate. Analysis of 259044 SARS-CoV-2 isolates identified 3334545 mutations with an average of 14.01 mutations per isolate. Globally, single nucleotide polymorphism (SNP) is the most prevalent mutational event. The prevalence of C > T (52.67%) was noticed as a major alteration across the world followed by the G > T (14.59%) and A > G (11.13%). Strains from India showed the highest number of mutations (48) followed by Scotland, USA, Netherlands, Norway, and France having up to 36 mutations. D416G, F106F, P314L, UTR:C241T, L93L, A222V, A199A, V30L, and A220V mutations were found as the most frequent mutations. D1118H, S194L, R262H, M809L, P314L, A8D, S220G, A890D, G1433C, T1456I, R233C, F263S, L111K, A54T, A74V, L183A, A316T, V212F, L46C, V48G, Q57H, W131R, G172V, Q185H, and Y206S missense mutations were found to largely decrease the structural stability of the corresponding proteins. Conversely, D3L, L5F, and S97I were found to largely increase the structural stability of the corresponding proteins. Multi-nucleotide mutations GGG > AAC, CC > TT, TG > CA, and AT > TA have come up in our analysis which are in the top 20 mutational cohort. Future mutation rate analysis predicts a 17%, 7%, and 3% increment of C > T, A > G, and A > T, respectively in the future. Conversely, 7%, 7%, and 6% decrement is estimated for T > C, G > A, and G > T mutations, respectively. T > G\A, C > G\A, and A > T\C are not anticipated in the future. Since SARS-CoV-2 is mutating continuously, our findings will facilitate the tracking of mutations and help to map the progression of the COVID-19 intensity worldwide.
© 2021 Published by Elsevier Ltd.

Entities:  

Keywords:  COVID-19; Genomic data; Mutation; Mutation rate; SARS-CoV-2

Year:  2021        PMID: 34812411      PMCID: PMC8598266          DOI: 10.1016/j.imu.2021.100798

Source DB:  PubMed          Journal:  Inform Med Unlocked        ISSN: 2352-9148


  51 in total

1.  Fast algorithms for large-scale genome alignment and comparison.

Authors:  Arthur L Delcher; Adam Phillippy; Jane Carlton; Steven L Salzberg
Journal:  Nucleic Acids Res       Date:  2002-06-01       Impact factor: 16.971

2.  Multiple alignment of DNA sequences with MAFFT.

Authors:  Kazutaka Katoh; George Asimenos; Hiroyuki Toh
Journal:  Methods Mol Biol       Date:  2009

3.  A method and server for predicting damaging missense mutations.

Authors:  Ivan A Adzhubei; Steffen Schmidt; Leonid Peshkin; Vasily E Ramensky; Anna Gerasimova; Peer Bork; Alexey S Kondrashov; Shamil R Sunyaev
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

4.  COVID-19 prediction using LSTM Algorithm: GCC Case Study.

Authors:  Kareem Kamal A Ghany; Hossam M Zawbaa; Heba M Sabri
Journal:  Inform Med Unlocked       Date:  2021-04-06

5.  SIFT web server: predicting effects of amino acid substitutions on proteins.

Authors:  Ngak-Leng Sim; Prateek Kumar; Jing Hu; Steven Henikoff; Georg Schneider; Pauline C Ng
Journal:  Nucleic Acids Res       Date:  2012-06-11       Impact factor: 16.971

6.  Time series forecasting of COVID-19 transmission in Canada using LSTM networks.

Authors:  Vinay Kumar Reddy Chimmula; Lei Zhang
Journal:  Chaos Solitons Fractals       Date:  2020-05-08       Impact factor: 5.944

7.  Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020.

Authors:  K Roosa; Y Lee; R Luo; A Kirpich; R Rothenberg; J M Hyman; P Yan; G Chowell
Journal:  Infect Dis Model       Date:  2020-02-14

8.  SARS-CoV-2 viral load is associated with increased disease severity and mortality.

Authors:  Jesse Fajnzylber; James Regan; Kendyll Coxen; Heather Corry; Colline Wong; Alexandra Rosenthal; Daniel Worrall; Francoise Giguel; Alicja Piechocka-Trocha; Caroline Atyeo; Stephanie Fischinger; Andrew Chan; Keith T Flaherty; Kathryn Hall; Michael Dougan; Edward T Ryan; Elizabeth Gillespie; Rida Chishti; Yijia Li; Nikolaus Jilg; Dusan Hanidziar; Rebecca M Baron; Lindsey Baden; Athe M Tsibris; Katrina A Armstrong; Daniel R Kuritzkes; Galit Alter; Bruce D Walker; Xu Yu; Jonathan Z Li
Journal:  Nat Commun       Date:  2020-10-30       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.