| Literature DB >> 31443542 |
Raphael T C Lee1, Hsiao-Han Chang2, Colin A Russell3, Marc Lipsitch2, Sebastian Maurer-Stroh4,5,6.
Abstract
Animal studies aimed at understanding influenza virus mutations that change host specificity to adapt to replication in mammalian hosts are necessarily limited in sample numbers due to high cost and safety requirements. As a safe, higher-throughput alternative, we explore the possibility of using readily available passage bias data obtained mostly from seasonal H1 and H3 influenza strains that were differentially grown in mammalian (MDCK) and avian cells (eggs). Using a statistical approach over 80,000 influenza hemagglutinin sequences with passage information, we found that passage bias sites are most commonly found in three regions: (i) the globular head domain around the receptor binding site, (ii) the region that undergoes pH-dependent structural changes and (iii) the unstructured N-terminal region harbouring the signal peptide. Passage bias sites were consistent among different passage cell types as well as between influenza A subtypes. We also find epistatic interactions of site pairs supporting the notion of host-specific dependency of mutations on virus genomic background. The sites identified from our large-scale sequence analysis substantially overlap with known host adaptation sites in the WHO H5N1 genetic changes inventory suggesting information from passage bias can provide candidate sites for host specificity changes to aid in risk assessment for emerging strains.Entities:
Keywords: adaptation; hemagglutinin; host specificity; influenza; mutations; pandemic; passage
Mesh:
Substances:
Year: 2019 PMID: 31443542 PMCID: PMC6770435 DOI: 10.3390/cells8090958
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Summary of HA sequences used in this study classified into four passage categories.
| Subtype | EGG | MDCK | SIAT | ORI | SUM |
|---|---|---|---|---|---|
| H1 pandemic (≥2009) | 1312 | 18,059 | 2103 | 13,004 | 34,478 |
| H1 seasonal (<2009) | 606 | 2610 | 145 | 342 | 3703 |
| H3 seasonal | 1405 | 9663 | 10,423 | 17,641 | 39,132 |
| H5 (human) | 234 | 36 | 0 | 14 | 284 |
| H5 (avian) | 1439 | 16 | 0 | 186 | 1641 |
| SUM | 4996 | 30,384 | 12,671 | 31,187 | 79,238 |
Figure 1Overlap of passage bias sites between different influenza subtypes.
Figure 2Approach and data types for identification of common passage bias sites. (A) Direct evidence from same strain with different passage (SSDP), (B) geometric mean of odds ratios >5 from large statistical analysis over two time periods, (C) Venn diagram of overlap between SSDP and ODDS sets from two time periods. 54 sites are found in at least 2 of these sets of analyses, (D) Structural positions of dominant passage bias sites in HA. Black circle indicates receptor binding head region (i). Blue circle shows region undergoing pH dependent structural changes during fusion (ii). A third identified passage region is not shown as it is in the unstructured signal peptide region (iii).
Dominant common passage bias sites in HA. Passage bias sites were identified using (i) same strain different passage (SSDP) analysis, (ii) large-scale analysis of sequence and passage data until May 2013, and (iii) large-scale analysis for sequence and passage data from June 2013 to May 2019. The 54 passage bias sites listed in this table are found in at least 2 out of 3 of the above mentioned analyses. The 19 passage bias sites that are found in all 3 analyses are underlined.
| Structural Regions | Sites (H3 Numbering, Starting After Signal Peptide) | Sites (H1pdm Absolute Numbering, Starting at Leading Methionine) | # Occurrence in 18 Possible Subtype + Cell Combinations (4 H1p, 4 H1s, 4 H3, 3 H5 Avian and 3 H5 Human) | Geometric Mean of Odds Ratios from all Subtype/Cell Combinations |
|---|---|---|---|---|
| Globular Head (RBS) | ||||
| Globular Head (Around RBS) | ||||
| Globular Head (Others) | 77 80 81 82 | 85 88 89 91 | 12 17 14 15 | 5.9 35.95 6.27 18.62 18.65 24.19 12.41 3.97 |
| pH Structural Change site | 45 46 | 52 53 | 13 14 | 7.08 9.02 |
| N-terminal (Signal Peptide) |
* 54a (H3 numbering) represents the gap between positions 54 and 55 in H3 viruses. The corresponding position in H1 viruses is position 62 (H1pdm absolute numbering).
Figure 3Pairs of sites with significantly different association between passages after Bonferroni correction mapped to HA monomeric structure (PDB 3UBN) in context of common passage bias sites. Structural positions of dominant passage bias sites and sites undergoing epistatic interactions that are under the influence of passage bias adaptations are highlighted in the Figure. The ligands (sialic acid and N-acetyl glucosamine) are shown in pink ball representation. Epistatic sites are highlighted in orange. Passage bias sites (with high geometric mean of odds ratio across subtype and cell combinations) in the receptor binding region are colored in red and sites 208 and 239 which overlap with epistatic sites are colored in yellow (i). The rest of the passage bias site in the globular head region are colored in blue. Sites 158 and 173 which overlap with epistatic sites are highlighted in cyan. Sites colored in green lie at a region undergoing pH dependent structural changes during fusion (ii). A third identified passage region is not shown as it is in the unstructured signal peptide region (iii).
Pairs of sites with significantly different association between passages after Bonferroni correction.
| Position 1 | Position 2 | Log Odds Ratio | |
|---|---|---|---|
| 13 (13) | 101 (101) | 1.69 | 1.11 × 10−10 |
| 19 (gap) | 176 (174) | 1.20 | 2.22 × 10−15 |
| 19 (gap) | 210 (208) | −1.98 | 4.17 × 10−8 |
| 154 (152) | 158 (156) | 1.31 | 5.70 × 10−6 |
| 160 (158) | 176 (174) | 1.99 | 0 |
| 160 (158) | 210 (208) | −3.26 | 4.81 × 10−6 |
| 175 (173) | 176 (174) | 0.90 | 8.23 × 10−11 |
| 19 (gap) | 176 (174) | 3.00 | 0 |
| 19 (gap) | 210 (208) | −1.87 | 5.75 × 10−8 |
| 19 (gap) | 241 (239) | 1.57 | 2.82 × 10−7 |
| 19 (gap) | 327 (326) | 1.55 | 5.88 × 10−7 |
| 19 (gap) | 505 (504) | 1.78 | 5.69 × 10−7 |
| 130 (127) | 176 (174) | 5.85 | 9.10 × 10−6 |
| 144 (141) | 176 (174) | −0.69 | 5.79 × 10−7 |
| 154 (152) | 158 (156) | 4.15 | 5.41 × 10−8 |
| 154 (152) | 160 (158) | −1.01 | 1.32 × 10−7 |
| 154 (152) | 176 (174) | −0.95 | 4.60 × 10−8 |
| 154 (152) | 327 (326) | −0.99 | 6.46 × 10−8 |
| 158 (156) | 176 (174) | −0.76 | 2.02 × 10−9 |
| 158 (156) | 327 (326) | −0.68 | 4.27 × 10−7 |
| 160 (158) | 176 (174) | 4.18 | 0 |
| 160 (158) | 210 (208) | −2.85 | 1.14 × 10−6 |
| 160 (158) | 241 (239) | 3.11 | 1.38 × 10−6 |
| 175 (173) | 176 (174) | 2.27 | 0 |
| 175 (173) | 210 (208) | −1.23 | 4.46 × 10−6 |
| 175 (173) | 327 (326) | 1.12 | 7.11 × 10−6 |
| 175 (173) | 505 (504) | 1.28 | 2.13 × 10−6 |
Figure 4Overlap of different sets of passage sites with host specificity sites in the H5 genetic changes inventory.