| Literature DB >> 22532796 |
Lars Steinbrück1, Alice Carolyn McHardy.
Abstract
Distinguishing mutations that determine an organism's phenotype from (near-) neutral 'hitchhikers' is a fundamental challenge in genome research, and is relevant for numerous medical and biotechnological applications. For human influenza viruses, recognizing changes in the antigenic phenotype and a strains' capability to evade pre-existing host immunity is important for the production of efficient vaccines. We have developed a method for inferring 'antigenic trees' for the major viral surface protein hemagglutinin. In the antigenic tree, antigenic weights are assigned to all tree branches, which allows us to resolve the antigenic impact of the associated amino acid changes. Our technique predicted antigenic distances with comparable accuracy to antigenic cartography. Additionally, it identified both known and novel sites, and amino acid changes with antigenic impact in the evolution of influenza A (H3N2) viruses from 1968 to 2003. The technique can also be applied for inference of 'phenotype trees' and genotype-phenotype relationships from other types of pairwise phenotype distances.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22532796 PMCID: PMC3330098 DOI: 10.1371/journal.pcbi.1002492
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Antigenic tree for influenza A (H3N2) viruses.
Branch lengths represent antigenic distances (maximum of up- and down-weights for each branch) inferred from a maximum likelihood tree of 258 hemagglutinin sequences of seasonal influenza A (H3N2) virus isolates and serological data. (A) Colored edges show antigenic type transitions, with internal branches with high average antigenic weights (≥1.0 antigenic units). Gray-blue edges represent high weight branches leading to a subtree with three isolates or less, representing low abundance types. (B) Isolates are color-coded by antigenic clusters according to Smith et al. (2004). Three isolates (A/Christchurch/4/85, A/Hong Kong/34/90 and A/Netherlands/172/96) are only present as antisera and were not assigned a cluster label.
Internal branches with high average antigenic weights (≥1.0 antigenic units) and according antigenic types in comparison to antigenic clusters identified by antigenic cartography (branches leading to three or less isolates are excluded).
| Type transition | Branch amino acid changes | Weights (up/down/avg) | Trunk | Additional amino acid changes | Weights (up/down/avg) | Trunk | Smith |
| HK68–EN72 |
| 2.6/0.4/1.5 | x | L3F, | 0.9/0.2/0.5 | x | 3.4 |
| EN72–VI75 |
| 0.6/2.4/1.5 | x |
| 0.0/1.0/0.5 | 4.4 | |
| VI75–TX77 |
| 0.6/2.8/1.7 | x |
| 1.0/-/0.5 | 3.4 | |
| TX77–BA79 |
| 1.4/0.0/0.7 | x | 3.3 | |||
| D2N, | 0.0/0.3/0.2 | x | |||||
| BA79–SI87 |
| 0.2/3.3/1.7 | x | 4.9 | |||
| SI87–BE89 | G135E, | 2.0/0.0/1.0 | 4.6 | ||||
| BE89–BE92 | I214T | 1.4/1.1/1.3 | x |
| 1.0/0.0/0.5 | x | 7.8 |
|
| 0.0/0.4/0.2 | x | |||||
| BE92–WU95 | K135T, | 1.5/1.1/1.3 | x | 4.6 | |||
| WU95–SY97 |
| 2.5/2.6/2.6 | x | 4.7 | |||
| SY97–FU02 |
| 1.8/3.2/2.5 | x | 3.5 |
Branch amino acid changes indicate the corresponding branches, where changes in bold were also found by Smith et al. (2004), and weights give the respective up, down and average branch weights. Multiple branches that can be mapped to a single antigenic type are separated by dashed lines. Additional amino acid changes indicate branches that carry further mutations found to be cluster transition substitutions by Smith et al. (2004). For some branches, the down-weight was not defined, as no antiserum was in the respective subtree. Branches that can be mapped to multiple type transitions are shown at the first mapping only. Smith et al. (2004) present average distances between consecutive antigenic clusters, whereas average antigenic branch weights give a minimum distance between consecutive antigenic types. Note that on branches with multiple changes not all changes have to contribute to the antigenic weight, though their individual impacts could not be resolved with the dataset (unsampled viral isolates).
Figure 2Schematic drawing demonstrating the up/down tree concept.
For the two taxa t and t, no antiserum is present, and thus, b and b only have up-weights. A path from t to t would use the up-weights of branch b and b, and the down-weights of branch b and b5. Similarly, the path from t to t would use the up-weight of branch b and the down-weight of branch b. Notably, the path from t to t, namely the antigenic distance from antigen t to the antiserum raised against strain t, would use the up-weight and the down-weight of branch b.