nextflu: real-time tracking of seasonal influenza virus evolution in humans.

Richard A Neher, Trevor Bedford.

Abstract

Seasonal influenza viruses evolve rapidly, allowing them to evade immunity in their human hosts and reinfect previously infected individuals. Similarly, vaccines against seasonal influenza need to be updated frequently to protect against an evolving virus population. We have thus developed a processing pipeline and browser-based visualization that allows convenient exploration and analysis of the most recent influenza virus sequence data. This web-application displays a phylogenetic tree that can be decorated with additional information such as the viral genotype at specific sites, sampling location and derived statistics that have been shown to be predictive of future virus dynamics. In addition, mutation, genotype and clade frequency trajectories are calculated and displayed.
AVAILABILITY AND IMPLEMENTATION: Python and JavaScript source code is freely available from https://github.com/blab/nextflu, while the web-application is live at http://nextflu.org.
CONTACT: tbedford@fredhutch.org.
© The Author 2015. Published by Oxford University Press.

Year:  2015        PMID: 26115986      PMCID: PMC4612219          DOI: 10.1093/bioinformatics/btv381

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1. Introduction

Every year, seasonal influenza infects between 10 and 20% of the global population, resulting in substantial human morbidity and mortality (World Health Organization, 2009). Vaccination remains the most effective public health measure to combat seasonal epidemics. However, influenza viruses constantly evolve and thereby undergo antigenic drift, allowing drifted viruses to reinfect individuals with acquired immunity to previously circulating strains. Owing to antigenic drift, the seasonal influenza vaccine needs frequent updating to remain effective. In any given year, the particular choice of vaccine strain plays a major role in determining vaccine efficacy and so it is of critical importance to develop tools to analyze the ongoing evolution of the influenza virus population in order to aid vaccine strain selection. The program nextflu presents a near real-time display of genetic relationships among influenza viruses and allows investigation of currently available sequence data. By visualizing many different genetic and epidemiological features, we hope that nextflu will help vaccine strain selection. Currently, nextflu tracks all four circulating lineages of seasonal influenza: A/H3N2, A/H1N1pdm, B/Victoria and B/Yamagata.

In implementation, nextflu consists of a processing pipeline written in Python called augur that analyzes virus sequence data and a JavaScript-based browser visualization called auspice that displays this processed information. As input, augur requires a FASTA file of sequences with header labels containing relevant information such as strain name, sampling date and passage history. For this purpose, influenza sequence data for the hemagglutinin (HA) gene is downloaded from the GISAID EpiFlu database (Bogner ), which contains the most up-to-date collection of seasonal influenza viruses. The first step in the processing pipeline is to automatically select a subset of representative viruses.
Here, viruses without complete date or geographic information, viruses passaged in eggs and sequences shorter than 987 bases are removed. In addition, local outbreaks are filtered by keeping only one instance of identical sequences sampled at the same location on the same day. Following filtering, viruses are subsampled to achieve a more equitable temporal and geographic distribution. For our standard display period of 3 years and 32 viruses per month, this typically results in ∼1200 viruses, for which we align full-length HA sequences where available and partial sequences otherwise, using MAFFT (Katoh and Standley, 2013). Once aligned, the set of virus sequences is further cleaned by removing insertions relative to the outgroup to enforce canonical HA site numbering, by removing sequences that show either too much or too little divergence relative to the expectation given their sampling date, and by removing known reassortant clusters, such as the triple-reassortant swine influenza viruses that have sporadically circulated since 2009 (Bastien et al., 2010). As outgroup for each viral lineage, we chose a well-characterized virus without insertions relative to the canonical amino-acid numbering and a sampling date a few years before the time interval of interest.

From the filtered and cleaned alignment, augur builds a phylogenetic tree using FastTree (Price et al., 2009), which is then further refined using RAxML (Stamatakis, 2014). Next, the state of every internal node of the tree is inferred using a marginal maximum likelihood method and missing sequence data at phylogeny tips is filled with the nearest ancestral sequence at these sites. Internal branches without mutations are collapsed into polytomies. The final tree is decorated with the attributes to be displayed in the browser.

In addition to the phylogenetic tree, augur estimates the frequency trajectories of mutations, genotypes and clades in the tree.
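
The filtering, de-duplication and subsampling steps at the start of the pipeline can be sketched as follows. This is a minimal illustration and not the augur source: the `strain|date|region|passage` header layout, the dictionary keys, the function names and the round-robin scheme for balancing regions are all assumptions made for the example. Only the 987-base cutoff and the 32-viruses-per-month target come from the text.

```python
import random
from collections import defaultdict
from datetime import datetime

MIN_LENGTH = 987        # minimum sequence length given in the text
VIRUSES_PER_MONTH = 32  # standard subsampling target given in the text


def parse_header(header):
    """Split a hypothetical 'strain|date|region|passage' FASTA header."""
    fields = [f.strip() for f in header.lstrip(">").split("|")]
    strain, date, region, passage = fields
    return {"strain": strain, "date": date, "region": region, "passage": passage}


def parse_date(date_string):
    """Return a datetime for a complete YYYY-MM-DD date, else None."""
    try:
        return datetime.strptime(date_string, "%Y-%m-%d")
    except ValueError:
        return None


def passes_filters(virus):
    """Require a complete date, a region, no egg passage and enough bases."""
    return (
        parse_date(virus["date"]) is not None
        and bool(virus["region"])
        and "egg" not in virus["passage"].lower()
        and len(virus["seq"]) >= MIN_LENGTH
    )


def drop_local_duplicates(viruses):
    """Keep one copy of identical sequences from the same place and day."""
    seen = set()
    kept = []
    for virus in viruses:
        key = (virus["seq"], virus["region"], virus["date"])
        if key not in seen:
            seen.add(key)
            kept.append(virus)
    return kept


def subsample(viruses, per_month=VIRUSES_PER_MONTH, seed=0):
    """Pick up to per_month viruses per month, cycling through regions."""
    rng = random.Random(seed)
    by_month = defaultdict(lambda: defaultdict(list))
    for virus in viruses:
        date = parse_date(virus["date"])
        by_month[(date.year, date.month)][virus["region"]].append(virus)

    selected = []
    for month in sorted(by_month):
        # One shuffled pool per region; take one virus from each in turn.
        pools = [rng.sample(pool, len(pool))
                 for _, pool in sorted(by_month[month].items())]
        picked = []
        while pools and len(picked) < per_month:
            for pool in pools:
                if len(picked) < per_month:
                    picked.append(pool.pop())
            pools = [pool for pool in pools if pool]
        selected.extend(picked)
    return selected
```

A typical call chain under these assumptions would be `subsample(drop_local_duplicates([v for v in viruses if passes_filters(v)]))`, where each `v` combines the parsed header fields with a `seq` entry. Cycling through regions is one simple way to even out geographic sampling; the text does not specify the scheme actually used.
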
Frequencies are determined by maximizing the likelihood of sampling the observed set of virus sequences. In addition, we impose a smoothing that penalizes rapid changes in frequency and in the frequency derivative. augur estimates frequencies with up to 1-month resolution. The result is similar to the ‘allele dynamics’ plots in Steinbrück and McHardy (2011), but provides frequencies of clades in the tree in addition to point mutations. The augur pipeline is run every 3–7 days in response to sequence updates in the GISAID database. At the end of the augur pipeline, JSON files are exported containing the annotated phylogenetic tree, sequence data and frequency trajectories.

These JSON files are then visualized by auspice using D3 (Bostock et al., 2011) and a phylogenetic tree is displayed with branches scaled according to evolutionary distance across all sites (Fig. 1). The user can explore the data interactively by selecting viruses from different dates or by coloring the tree by attributes such as:
Fig. 1.

The nextflu website with the user interface on the left and the phylogenetic tree on the right

epitope mutations at sites generally associated with antibody binding that have been suggested to be predictive of future clade success (Łuksza and Lässig, 2014); receptor binding mutations at seven positions close to the receptor binding site that have been shown to be responsible for major antigenic transitions in the past decades (Koel et al., 2013); local branching index, indicating the exponentially weighted tree length surrounding a node, which is associated with rapid branching and expansion of clades (Neher et al., 2014); and HA genotype, which directly colors the tree by genotype at specific amino acid positions.

The display can also be restricted to different geographic regions. The frequency plot below the tree (Fig. 2) displays the frequency trajectory of clades in the tree whenever the mouse hovers above the branch defining the clade. Furthermore, trajectories of individual mutations, combinations of two mutations and predefined clades such as 3c3.a can be plotted. A second plot shows the variability of the alignment. When the user clicks a variable position in this plot, auspice colors the tree by amino acid at this position and plots its mutation frequencies.
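
The text does not say how the variability of the alignment is measured. The sketch below assumes per-site Shannon entropy, a common choice, and also shows how the residue frequencies at a selected position could be tabulated. Function names and the treatment of gaps and unknown residues are illustrative assumptions, not taken from the nextflu source.

```python
import math
from collections import Counter

IGNORED = "-X"  # assumed placeholders for gaps and unknown residues


def site_entropy(column):
    """Shannon entropy (in nats) of one alignment column."""
    counts = Counter(residue for residue in column if residue not in IGNORED)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log(n / total) for n in counts.values())


def alignment_variability(sequences):
    """Entropy of every column of a set of equal-length aligned sequences."""
    if len({len(seq) for seq in sequences}) > 1:
        raise ValueError("aligned sequences must have equal length")
    return [site_entropy(column) for column in zip(*sequences)]


def residue_frequencies(sequences, position):
    """Fraction of each residue at a zero-based alignment position."""
    counts = Counter(seq[position] for seq in sequences
                     if seq[position] not in IGNORED)
    total = sum(counts.values())
    return {residue: n / total for residue, n in counts.items()}
```

Under this assumption, a fully conserved column has entropy zero and a column split evenly between two residues has entropy ln 2, so peaks in the list returned by `alignment_variability` mark the positions a user would most likely select.
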
Fig. 2.

The frequency diagram allows geography-specific plotting of frequencies of individual mutations, pairs of mutations and clades in the tree
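
The trajectories in this diagram are described in the text as maximum-likelihood estimates with a smoothing penalty. A minimal sketch of that idea is given below, under stated assumptions: samples are binned by month, each bin contributes a binomial log-likelihood, frequencies are parameterized on the logit scale, and the penalty is the squared difference between neighboring months. The penalty form, the `stiffness` parameter and the plain gradient-ascent optimizer are illustrative choices, not the augur implementation.

```python
import math


def sigmoid(y):
    """Map a logit-scale value to a frequency between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-y))


def estimate_frequencies(carrier_counts, total_counts,
                         stiffness=1.0, iterations=2000):
    """Smoothed monthly frequencies of a mutation or clade.

    carrier_counts[k] is the number of sampled viruses in month k that
    carry the variant and total_counts[k] is the number sampled that month.
    """
    n_bins = len(total_counts)
    y = [0.0] * n_bins  # logit frequencies, starting at 50%
    # Step size from an upper bound on the curvature of the objective.
    step = 1.0 / (max(total_counts) / 4.0 + 8.0 * stiffness + 1e-9)
    for _ in range(iterations):
        gradient = []
        for k in range(n_bins):
            # Binomial log-likelihood term for month k.
            g = carrier_counts[k] - total_counts[k] * sigmoid(y[k])
            # Smoothing terms pulling month k toward its neighbors.
            if k > 0:
                g -= 2.0 * stiffness * (y[k] - y[k - 1])
            if k < n_bins - 1:
                g += 2.0 * stiffness * (y[k + 1] - y[k])
            gradient.append(g)
        y = [yk + step * gk for yk, gk in zip(y, gradient)]
    return [sigmoid(yk) for yk in y]
```

For example, with 10 viruses sampled in each of five months and 0, 1, 5, 9 and 10 of them carrying a mutation, `estimate_frequencies([0, 1, 5, 9, 10], [10] * 5)` returns a rising trajectory whose endpoints are pulled away from the raw values of 0 and 1. Larger `stiffness` values flatten the curve, and months with few samples borrow more strength from their neighbors.
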

We built nextflu to facilitate the analysis and exploration of seasonal influenza sequence data collected by laboratories around the world. By using the most recent data and integrating phylogenies with frequency trajectories and predictors of successful clades, we hope that nextflu can inform the choice of strains used in seasonal influenza vaccines. nextflu was designed to be readily adapted to other rapidly evolving viruses and we see significant room for future developments in this area.

Funding

This work was supported by the ERC through Stg-260686 and by the NIH through U54 GM111274.

Conflict of Interest: none declared.

References

1.  D³: Data-Driven Documents.

Authors:  Michael Bostock; Vadim Ogievetsky; Jeffrey Heer
Journal:  IEEE Trans Vis Comput Graph       Date:  2011-12       Impact factor: 4.579

2.  Predicting evolution from the shape of genealogical trees.

Authors:  Richard A Neher; Colin A Russell; Boris I Shraiman
Journal:  Elife       Date:  2014-11-11       Impact factor: 8.140

3.  A predictive fitness model for influenza.

Authors:  Marta Luksza; Michael Lässig
Journal:  Nature       Date:  2014-02-26       Impact factor: 49.962

4.  Human infection with a triple-reassortant swine influenza A(H1N1) virus containing the hemagglutinin and neuraminidase genes of seasonal influenza virus.

Authors:  Nathalie Bastien; Nick A Antonishyn; Ken Brandt; Christine E Wong; Khami Chokani; Niki Vegh; Greg B Horsman; Shaun Tyler; Morag R Graham; Frank A Plummer; Paul N Levett; Yan Li
Journal:  J Infect Dis       Date:  2010-04-15       Impact factor: 5.226

5.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

6.  Substitutions near the receptor binding site determine major antigenic change during influenza virus evolution.

Authors:  Björn F Koel; David F Burke; Theo M Bestebroer; Stefan van der Vliet; Gerben C M Zondag; Gaby Vervaet; Eugene Skepner; Nicola S Lewis; Monique I J Spronken; Colin A Russell; Mikhail Y Eropkin; Aeron C Hurt; Ian G Barr; Jan C de Jong; Guus F Rimmelzwaan; Albert D M E Osterhaus; Ron A M Fouchier; Derek J Smith
Journal:  Science       Date:  2013-11-22       Impact factor: 47.728

7.  Allele dynamics plots for the study of evolutionary dynamics in viral populations.

Authors:  Lars Steinbrück; Alice Carolyn McHardy
Journal:  Nucleic Acids Res       Date:  2010-10-18       Impact factor: 16.971

8.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

9.  FastTree: computing large minimum evolution trees with profiles instead of a distance matrix.

Authors:  Morgan N Price; Paramvir S Dehal; Adam P Arkin
Journal:  Mol Biol Evol       Date:  2009-04-17       Impact factor: 16.240
