Literature DB >> 33584824

Computational Method for Classification of Avian Influenza A Virus Using DNA Sequence Information and Physicochemical Properties.

Fahad Humayun1, Fatima Khan2, Nasim Fawad3, Shazia Shamas4, Sahar Fazal2, Abbas Khan1, Arif Ali1, Ali Farhan5, Dong-Qing Wei1.   

Abstract

Accurate and fast characterization of the subtype sequences of Avian influenza A virus (AIAV) hemagglutinin (HA) and neuraminidase (NA) depends on expanding diagnostic services and is embedded in molecular epidemiological studies. A new approach for classifying the AIAV sequences of the HA and NA genes into subtypes using DNA sequence data and physicochemical properties is proposed. This method simply requires unaligned, full-length, or partial sequences of HA or NA DNA as input. It allows for quick and highly accurate assignments of HA sequences to subtypes H1-H16 and NA sequences to subtypes N1-N9. For feature extraction, k-gram, discrete wavelet transformation, and multivariate mutual information were used, and different classifiers were trained for prediction. Four different classifiers, Naïve Bayes, Support Vector Machine (SVM), K nearest neighbor (KNN), and Decision Tree, were compared using our feature selection method. This comparison is based on the 30% dataset separated from the original dataset for testing purposes. Among the four classifiers, Decision Tree was the best, and Precision, Recall, F1 score, and Accuracy were 0.9514, 0.9535, 0.9524, and 0.9571, respectively. Decision Tree had considerable improvements over the other three classifiers using our method. Results show that the proposed feature selection method, when trained with a Decision Tree classifier, gives the best results for accurate prediction of the AIAV subtype.
Copyright © 2021 Humayun, Khan, Fawad, Shamas, Fazal, Khan, Ali, Farhan and Wei.

Entities:  

Keywords:  Avian influenza A Virus; K-nearest neighbor; Naïve Bayes; decision tree; discrete wavelet transform; k-gram; multivariate mutual information; support vector machine

Year:  2021        PMID: 33584824      PMCID: PMC7877484          DOI: 10.3389/fgene.2021.599321

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  23 in total

1.  Protein classification artificial neural system.

Authors:  C Wu; G Whitson; J McLarty; A Ermongkonchai; T C Chang
Journal:  Protein Sci       Date:  1992-05       Impact factor: 6.725

Review 2.  The emergence of pandemic influenza viruses.

Authors:  Yi Guan; Dhanasekaran Vijaykrishna; Justin Bahl; Huachen Zhu; Jia Wang; Gavin J D Smith
Journal:  Protein Cell       Date:  2010-02-07       Impact factor: 14.870

Review 3.  The evolving threat of influenza viruses of animal origin and the challenges in developing appropriate diagnostics.

Authors:  Polly W Y Mak; Shanthi Jayawardena; Leo L M Poon
Journal:  Clin Chem       Date:  2012-09-11       Impact factor: 8.327

4.  An ensemble of K-local hyperplanes for predicting protein-protein interactions.

Authors:  Loris Nanni; Alessandra Lumini
Journal:  Bioinformatics       Date:  2006-02-15       Impact factor: 6.937

Review 5.  Overview of influenza viruses.

Authors:  Stephan Pleschka
Journal:  Curr Top Microbiol Immunol       Date:  2013       Impact factor: 4.291

6.  Protein sequence classification using feature hashing.

Authors:  Cornelia Caragea; Adrian Silvescu; Prasenjit Mitra
Journal:  Proteome Sci       Date:  2012-06-21       Impact factor: 2.480

7.  ClassyFlu: classification of influenza A viruses with Discriminatively trained profile-HMMs.

Authors:  Sandra Van der Auwera; Ingo Bulla; Mario Ziller; Anne Pohlmann; Timm Harder; Mario Stanke
Journal:  PLoS One       Date:  2014-01-03       Impact factor: 3.240

8.  An Ameliorated Prediction of Drug-Target Interactions Based on Multi-Scale Discrete Wavelet Transform and Network Features.

Authors:  Cong Shen; Yijie Ding; Jijun Tang; Xinying Xu; Fei Guo
Journal:  Int J Mol Sci       Date:  2017-08-16       Impact factor: 5.923

9.  Data mining and model-predicting a global disease reservoir for low-pathogenic Avian Influenza (A) in the wider pacific rim using big data sets.

Authors:  Marina Gulyaeva; Falk Huettmann; Alexander Shestopalov; Masatoshi Okamatsu; Keita Matsuno; Duc-Huy Chu; Yoshihiro Sakoda; Alexandra Glushchenko; Elaina Milton; Eric Bortz
Journal:  Sci Rep       Date:  2020-10-08       Impact factor: 4.379

10.  A decision support framework for prediction of avian influenza.

Authors:  Samira Yousefinaghani; Rozita A Dara; Zvonimir Poljak; Shayan Sharif
Journal:  Sci Rep       Date:  2020-11-04       Impact factor: 4.379

View more
  1 in total

1.  Influenza virus genotype to phenotype predictions through machine learning: a systematic review.

Authors:  Laura K Borkenhagen; Martin W Allen; Jonathan A Runstadler
Journal:  Emerg Microbes Infect       Date:  2021-12       Impact factor: 7.163

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.