| Literature DB >> 35746515 |
Hui Chen1, Junqiu Wang2,3, Yunsong Liu2,4, Ivy Quek Ee Ling5, Chih Chuan Shih5, Dafei Wu2, Zhiyan Fu6, Raphael Tze Chuen Lee7, Miao Xu8, Vincent T Chow9, Sebastian Maurer-Stroh7,10,11,12, Da Zhou3, Jianjun Liu1,13, Weiwei Zhai1,2,14.
Abstract
Seasonal Influenza H3N2 virus poses a great threat to public health, but its vaccine efficacy remains suboptimal. One critical step in influenza vaccine production is the viral passage in embryonated eggs. Recently, the strength of egg passage adaptation was found to be rapidly increasing with time driven by convergent evolution at a set of functionally important codons in the hemagglutinin (HA1). In this study, we aim to take advantage of the negative correlation between egg passage adaptation and vaccine effectiveness (VE) and develop a computational tool for selecting the best candidate vaccine virus (CVV) for vaccine production. Using a probabilistic approach known as mutational mapping, we characterized the pattern of sequence evolution driven by egg passage adaptation and developed a new metric known as the adaptive distance (AD) which measures the overall strength of egg passage adaptation. We found that AD is negatively correlated with the influenza H3N2 vaccine effectiveness (VE) and ~75% of the variability in VE can be explained by AD. Based on these findings, we developed a computational package that can Measure the Adaptive Distance and predict vaccine Effectiveness (MADE). MADE provides a powerful tool for the community to calibrate the effect of egg passage adaptation and select more reliable strains with minimum egg-passaged changes as the seasonal A/H3N2 influenza vaccine.Entities:
Keywords: adaptive evolution; egg passage adaptation; influenza H3N2 virus; vaccine effectiveness; vaccine production
Year: 2022 PMID: 35746515 PMCID: PMC9227319 DOI: 10.3390/vaccines10060907
Source DB: PubMed Journal: Vaccines (Basel) ISSN: 2076-393X
Figure 1Statistical approaches for characterizing egg passage adaptation. (A) Numbers of nonsynonymous and synonymous changes at HA1 codons responsible for egg passage adaptation. The codons showing statistical significance from the enrichment and convergent tests (q-value < 0.05) are labelled in yellow and purple, respectively. Codons located in functional domains such as receptor binding sites (RBS), antigenic epitope A, B and D will be labeled in red, blue, violet and orange color respectively. (B) Enrichment scores across codons responsible for the egg passage adaptation. Alleles with enrichment scores higher than 20 are labeled. (C) Principal component analysis of the 17-dimensional space of ES scores for all the sequences. Discrete subgroups of sequences carrying different passage-related alleles distribute in clusters away from the major cluster. Pie charts display major adaptive alleles in different groups. For example, 194P and 186V are the dominant adaptive alleles observed in cluster 2 and 3, respectively. Adaptive distance (AD) is defined as the distance between the target strain (e.g., CVV) and the centroid of the major cluster for most of the non-egg sequences (i.e., group 1). (D) Correlation between the adaptive distance (AD) and vaccine efficacy (VE) between influenza seasons from 2010 to 2015. The predicted VEad is drawn as the dashed line. (E) Proportion of different alleles with enrichment score >10 in the cluster 2 of the PCA map (panel (C)). (F) Proportion of different alleles with enrichment score >10 in the cluster 3 of the PCA map (panel (C)).
Figure 2Schematic workflow of MADE. For any candidate vaccine virus (CVV), enrichment scores (ES) of all the alleles (amino acids) at the 17 positively selected HA1 codons were computed. These high dimensional vectors calculated at the 17 codons serve as allelic barcodes for each strain. Principal component analysis of the ES scores for all existing sequences was conducted, and the adaptive distance (AD) for the CVV was computed. Subsequently, MADE predicted the VEad of the input CVV based on the linear relationship between AD and vaccine effectiveness (VE). In addition, a machine learning algorithm can be applied to classify whether the input sequence with unknown passage history has been grown in embryonated eggs or other passage mediums (e.g., MDCK).