Literature DB >> 32259469

Predicting the short-term success of human influenza virus variants with machine learning.

Maryam Hayati1, Priscila Biller2, Caroline Colijn2,3.   

Abstract

Seasonal influenza viruses are constantly changing and produce a different set of circulating strains each season. Small genetic changes can accumulate over time and result in antigenically different viruses; this may prevent the body's immune system from recognizing those viruses. Due to rapid mutations, in particular, in the haemagglutinin (HA) gene, seasonal influenza vaccines must be updated frequently. This requires choosing strains to include in the updates to maximize the vaccines' benefits, according to estimates of which strains will be circulating in upcoming seasons. This is a challenging prediction task. In this paper, we use longitudinally sampled phylogenetic trees based on HA sequences from human influenza viruses, together with counts of epitope site polymorphisms in HA, to predict which influenza virus strains are likely to be successful. We extract small groups of taxa (subtrees) and use a suite of features of these subtrees as key inputs to the machine learning tools. Using a range of training and testing strategies, including training on H3N2 and testing on H1N1, we find that successful prediction of future expansion of small subtrees is possible from these data, with accuracies of 0.71-0.85 and a classifier 'area under the curve' 0.75-0.9.

Entities:  

Keywords:  influenza; machine learning; phylogenetics; prediction; tree shape statistics

Mesh:

Substances:

Year:  2020        PMID: 32259469      PMCID: PMC7209065          DOI: 10.1098/rspb.2020.0319

Source DB:  PubMed          Journal:  Proc Biol Sci        ISSN: 0962-8452            Impact factor:   5.349


  32 in total

1.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

2.  Simultaneous amino acid substitutions at antigenic sites drive influenza A hemagglutinin evolution.

Authors:  Arthur Chun-Chieh Shih; Tzu-Chang Hsiao; Mei-Shang Ho; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-29       Impact factor: 11.205

3.  Taming the BEAST-A Community Teaching Material Resource for BEAST 2.

Authors:  Joëlle Barido-Sottani; Veronika Bošková; Louis Du Plessis; Denise Kühnert; Carsten Magnus; Venelin Mitov; Nicola F Müller; Julija PecErska; David A Rasmussen; Chi Zhang; Alexei J Drummond; Tracy A Heath; Oliver G Pybus; Timothy G Vaughan; Tanja Stadler
Journal:  Syst Biol       Date:  2018-01-01       Impact factor: 15.683

4.  Universal or Specific? A Modeling-Based Comparison of Broad-Spectrum Influenza Vaccines against Conventional, Strain-Matched Vaccines.

Authors:  Rahul Subramanian; Andrea L Graham; Bryan T Grenfell; Nimalan Arinaminpathy
Journal:  PLoS Comput Biol       Date:  2016-12-15       Impact factor: 4.475

5.  PHYLOSCANNER: Inferring Transmission from Within- and Between-Host Pathogen Genetic Diversity.

Authors:  Chris Wymant; Matthew Hall; Oliver Ratmann; David Bonsall; Tanya Golubchik; Mariateresa de Cesare; Astrid Gall; Marion Cornelissen; Christophe Fraser
Journal:  Mol Biol Evol       Date:  2018-03-01       Impact factor: 16.240

6.  Nextstrain: real-time tracking of pathogen evolution.

Authors:  James Hadfield; Colin Megill; Sidney M Bell; John Huddleston; Barney Potter; Charlton Callender; Pavel Sagulenko; Trevor Bedford; Richard A Neher
Journal:  Bioinformatics       Date:  2018-12-01       Impact factor: 6.931

7.  How the dynamics and structure of sexual contact networks shape pathogen phylogenies.

Authors:  Katy Robinson; Nick Fyson; Ted Cohen; Christophe Fraser; Caroline Colijn
Journal:  PLoS Comput Biol       Date:  2013-06-20       Impact factor: 4.475

8.  GenBank.

Authors:  Dennis A Benson; Mark Cavanaugh; Karen Clark; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Eric W Sayers
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

9.  PhyloTempo: A Set of R Scripts for Assessing and Visualizing Temporal Clustering in Genealogies Inferred from Serially Sampled Viral Sequences.

Authors:  Melissa M Norström; Mattia C F Prosperi; Rebecca R Gray; Annika C Karlsson; Marco Salemi
Journal:  Evol Bioinform Online       Date:  2012-06-11       Impact factor: 1.625

10.  Phylogenetic tree shapes resolve disease transmission patterns.

Authors:  Caroline Colijn; Jennifer Gardy
Journal:  Evol Med Public Health       Date:  2014-06-09
View more
  3 in total

Review 1.  12 Plagues of AI in Healthcare: A Practical Guide to Current Issues With Using Machine Learning in a Medical Context.

Authors:  Stephane Doyen; Nicholas B Dadario
Journal:  Front Digit Health       Date:  2022-05-03

2.  Deep clustering of bacterial tree images.

Authors:  Maryam Hayati; Leonid Chindelevitch; David Aanensen; Caroline Colijn
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2022-08-22       Impact factor: 6.671

Review 3.  Utility of Artificial Intelligence Amidst the COVID 19 Pandemic: A Review.

Authors:  Agam Bansal; Rana Prathap Padappayil; Chandan Garg; Anjali Singal; Mohak Gupta; Allan Klein
Journal:  J Med Syst       Date:  2020-08-01       Impact factor: 4.460

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.