| Literature DB >> 27498613 |
Jingxuan Qiu1, Tianyi Qiu1, Yiyan Yang1, Dingfeng Wu1, Zhiwei Cao1.
Abstract
The rapid and consistent mutation of influenza requires frequent evaluation of antigenicity variation among newly emerged strains, during which several in-silico methods have been reported to facilitate the assays. In this paper, we designed a structure-based antigenicity scoring model instead of those sequence-based previously published. Protein structural context was adopted to derive the antigenicity-dominant positions, as well as the physic-chemical change of local micro-environment in correlation with antigenicity change. Then a position specific scoring matrix (PSSM) profile and local environmental change over above positions were integrated to predict the antigenicity variance. Independent testing showed a high accuracy of 0.875, and sensitivity of 0.986, with a significant ability to discover antigenic-escaping strains. When applying this model to the historical data, global and regional antigenic drift events can be successfully detected. Furthermore, two well-known vaccine failure events were clearly suggested. Therefore, this structure-context model may be particularly useful to identify those to-be-failed vaccine strains, in addition to suggest potential new vaccine strains.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27498613 PMCID: PMC4976332 DOI: 10.1038/srep31156
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of model.
(1) Data collection of HI assay results and the corresponding HA sequence; (2) Structure information as well as the mutation frequency were both considered to select a set of combinational positions contributing to antigenic variance. 47 positions containing 5 spatial clusters were derived at HA protein surface through iterative screening; (3) Descriptors used to build the model include PSSM score describing the sequence variance at 47 positions and micro-environmental change for 5 spatial clusters of antigenicity-dominant positions; (4) Antigenicity variation can be calculated for any queried HA pairs.
The performance of peering methods on 120 independent testing HA pairs.
| Methods | Accuracy | MCC | F-score | Sensitivity | Number of positions |
|---|---|---|---|---|---|
| 0.575 | 0.274 | 0.514 | 0.375 | 131 | |
| 0.725 | 0.516 | 0.723 | 0.597 | 241 | |
| 0.725 | 0.411 | 0.792 | 0.875 | 38 | |
| 0.767 | 0.544 | 0.788 | 0.722 | 17 | |
| 0.800 | 0.595 | 0.854 | 0.972 | 20 | |
Figure 2Detailed performance of representative methods on independent testing data of 120 pairs.
Panel (A–E) demonstrate the correlation between predicted (Y-axis) and experimental D (X-axis). In each panel, blue cross indicates those correctly classified pairs, with true negatives in quadrant 1 and true positives in quadrant 3. Red ones represent misclassified pairs, with false negatives in quadrant 2 and false positives in quadrant 4. Dotted line indicates the fuzzy regions. The fuzzy region I (D ∈[1, 3], exclude D = 2) was colored in gray. For better illustration, overlapped point was rearranged by slight deviation randomly without changing the overall classification. Lee and Chen’s method was a qualitative method and does not provide antigenic distance, thus was not compared in here.
Figure 3Vaccine coverage in northern temperate and three continents for the recent 20 years.
In subgraph (A–D), X-axis represents years from 1994 to 2015 and Y-axis represent vaccine coverage of each year. Each line refers to the antigenicity coverage of a vaccine strain from its emerging year to two years after being replaced by updated vaccine strain. Red stars indicate the recommendation years of the vaccine strain, while black dots labels the year being updated. Red box labeled the detected vaccine failure in seasons of 2003–2004 and 2014–2015.