| Literature DB >> 28165025 |
Yousong Peng1, Dayan Wang2, Jianhong Wang3, Kenli Li3, Zhongyang Tan1, Yuelong Shu2, Taijiao Jiang4,5.
Abstract
Rapid determination of the antigenicity of influenza A virus could help identify the antigenic variants in time. Currently, there is a lack of computational models for predicting antigenic variants of some common hemagglutinin (HA) subtypes of influenza A viruses. By means of sequence analysis, we demonstrate here that multiple HA subtypes of influenza A virus undergo similar mutation patterns of HA1 protein (the immunogenic part of HA). Further analysis on the antigenic variation of influenza A virus H1N1, H3N2 and H5N1 showed that the amino acid residues' contribution to antigenic variation highly differed in these subtypes, while the regional bands, defined based on their distance to the top of HA1, played conserved roles in antigenic variation of these subtypes. Moreover, the computational models for predicting antigenic variants based on regional bands performed much better in the testing HA subtype than those did based on amino acid residues. Therefore, a universal computational model, named PREDAV-FluA, was built based on the regional bands to predict the antigenic variants for all HA subtypes of influenza A viruses. The model achieved an accuracy of 0.77 when tested with avian influenza H9N2 viruses. It may help for rapid identification of antigenic variants in influenza surveillance.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28165025 PMCID: PMC5292743 DOI: 10.1038/srep42051
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Position-dependent entropy.
Moving average position information entropy (MAPIE) was calculated with a window size of 11 for HA1 protein of influenza A virus H1 (blue), H3 (red) and H5 (green). The amino acid positions are numbered according to influenza A virus H3.
Figure 2The defined regional bands and their contribution to antigenic variation.
(A) Visualization of the ten regional bands in HA1 protein defined as described in the methods. Each regional band is differently colored in the 3-D model. (B) Pearson correlation coefficients (PCC) between the antigenic variation and the changes in each regional band, for subtype H1N1 (triangles), H3N2 (circles) and H5N1 (diamonds). (C) Correlations between the regional band’s contribution to the antigenic variation and their distances from the top of HA1. “SCC”, Spearman correlation coefficient.
Performance of regional band-based and residue-based computational models trained and tested with influenza A virus subtypes H3N2, H1N1 and H5N1.
| Subtype (training) | Subtype (testing) | Regional band-based model | Residue-based model | ||||
|---|---|---|---|---|---|---|---|
| Accu | Sen | Spe | Accu | Sen | Spe | ||
| H3N2 | H3N2 | 0.79 | 0.75 | 0.84 | 0.86 | 0.83 | 0.89 |
| H1N1 | 0.67 | 0.57 | 0.78 | 0.60 | 0.40 | 0.81 | |
| H5N1 | 0.76 | 0.68 | 0.89 | 0.62 | 0.61 | 0.64 | |
| H1N1 | H3N2 | 0.78 | 0.78 | 0.78 | 0.57 | 0.41 | 0.71 |
| H1N1 | 0.74 | 0.69 | 0.79 | 0.83 | 0.83 | 0.83 | |
| H5N1 | 0.75 | 0.68 | 0.86 | 0.73 | 0.70 | 0.77 | |
| H5N1 | H3N2 | 0.76 | 0.83 | 0.68 | 0.66 | 0.61 | 0.70 |
| H1N1 | 0.72 | 0.69 | 0.76 | 0.61 | 0.43 | 0.80 | |
| H5N1 | 0.83 | 0.85 | 0.81 | 0.86 | 0.88 | 0.84 | |
*Performance in five-fold cross-validations. “Accu”, accuracy; “Sen”, sensitivity; “Spe”, specificity.
Figure 3The Receiver Operating Characteristic (ROC) curve of PREDAV-FluA in five-fold cross-validations.
The table under the curve summarizes the Area under Curve (AUC) data of the shown curve.
Performance on the antigenic data of avian influenza H9N2 viruses for the regional band-based models built on the antigenic data of H1N1, H3N2, H5N1 and in combination.
| Subtypes for training models | Accuracy | Sensitivity | Specificity |
|---|---|---|---|
| Combination of H1N1, H3N2 and H5N1 | 0.77 | 0.95 | 0.26 |
| H1N1 | 0.80 | 0.94 | 0.39 |
| H3N2 | 0.75 | 0.93 | 0.26 |
| H5N1 | 0.79 | 0.93 | 0.39 |