| Literature DB >> 25391151 |
David F Burke1, Derek J Smith1.
Abstract
Comparisons of residues between sub-types of influenza virus is increasingly used to assess the zoonotic potential of a circulating strain and for comparative studies across subtypes. An analysis of N-terminal cleavage sites for thirteen subtypes of influenza A hemagglutinin (HA) sequences, has previously been described by Nobusawa and colleagues. We have expanded this analysis for the eighteen known subtypes of influenza. Due to differences in the length of HA, we have included strains from multiple clades of H1 and H5, as well as strains of H5 and H7 subtypes with both high and low pathogenicity. Analysis of known structures of influenza A HA enables us to define amino acids which are structurally and functionally equivalent across all HA subtypes using a numbering system based on the mature HA sequence. We provide a list of equivalences for amino acids which are known to affect the phenotype of the virus.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25391151 PMCID: PMC4229193 DOI: 10.1371/journal.pone.0112302
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Predicted signal peptide cleavage sites for all HA subtypes.
| Subtype | Representative strain | Signal Peptide | N-terminal sequence of mature protein |
| H1 | A/United Kingdom/1/1933 | MKARLLVLLCALAATDA | DTICIGYHANNS |
| H2 | A/Singapore/1/1957 | MAIIYLILLFTAVRG | DQICIGYHANNS |
| H3 | A/Aichi/2/1968 | MKTIIALSYIFCLPLG | QDLPGNDNSTATLCLGHHAVPN |
| H4 | A/swine/Ontario/01911–2/1999 | MLSIAILFLLIAEGSS | QNYTGNPVICLGHHAVSN |
| H5 | A/Vietnam/1203/2004 | MEKIVLLFAIVSLVKS | DQICIGYHANNS |
| H6 | A/chicken/Taiwan/0705/1999 | MIAIIVIATLAAAGKS | DKICIGYHANNS |
| H7 | A/Netherlands/219/2003 | MNTQILVFALVASIPTNA | DKICLGHHAVSN |
| H8 | A/turkey/Ontario/6118/1968 | MEKFIAIAMLLASTNA | YDRICIGYQSNNS |
| H9 | A/swine/Hong Kong/9/1998 | MEAASLITILLVVTASNA | DKICIGYQSTNS |
| H10 | A/mallard/bavaria/3/2006 | MYKIVVIIALLGAVKG | LDKICLGHHAVAN |
| H11 | A/duck/England/1/1956 | MEKTLLFAAIFLCVKA | DEICIGYLSNNS |
| H12 | A/duck/Alberta/60/1976 | MEKFIILSTVLAASFA | YDKICIGYQTNNS |
| H13 | A/gull/Maryland/704/1977 | MALNVIATLTLISVCVHA | DRICVGYLSTNS |
| H14 | A/mallard/Astrakhan/263/1982 | MIALILVALALSHTAYS | QITNGTTGNPIICLGHHAVEN |
| H15 | A/duck/Australia/341/1983 | MNTQIIVILVLGLSMVRS | DKICLGHHAVAN |
| H16 | A/black-headed-gull/Turkmenistan/13/1976 | MMIKVLYFLIIVLGRYSKA | DKICIGYLSNNS |
| H17 | A/little-yellow-shouldered bat/Guatemala/060/2010 | MELIILLILLNPYTFVLG | DRICIGYQANQN |
| H18 | A/flat-faced bat/Peru/033/2010 | MITILILVLPIVVG | DQICIGYHSNNS |
The N-terminal signal peptide cleavage site of HA was predicted using the signalP [7] for all HA subtypes. Most subtypes are cleaved close to a highly conserved aspartic acid. Three subtypes lacking this aspartic acid are cleaved at a glutamine resulting in a longer HA sequence.
Figure 1Sequence alignment of HA for known sub-types.
Alignment of mature HA sequence for all known HA sub-types. Additional strains have been included for sub-types which show variation in the length of HA. Sequences are ordered according to their phylogenetic classification as group 1 (magenta bar) or group 2 (orange bar) HA. The protein secondary structure elements, α-helices and β-strands, are highlighted with red bars and cyan arrows, respectively. A blue box highlights regions which have high structural variation across all subtypes. Amino acids within these regions should not be defined as equivalent between all sub-types. Each amino acid is coloured according to clustalx2 rules [11]. Briefly, glycine and proline are coloured orange and yellow, respectively. Conserved positively charged residues and negatively charged residues are coloured red and magenta, respectively. Conserved cysteines are coloured pink while conserved serine or threonine residues are in green. The remaining amino acids, if conserved are coloured blue. The sequences representative of each subtype are as follows: H1(A/United Kingdom/1/1933); H1pdm(A/California/04/2009); H2(A/Singapore/1/1957); H3(A/Aichi/2/1968); H4(A/swine/Ontario/01911/2/1999); H5(A/Vietnam/1203/2004); H5c221(A/chicken/Egypt/0915-NLQP/2009); H6(A/chicken/Taiwan/0705/1999); H7(A/Netherlands/219/2003); H8(A/turkey/Ontario/6118/1968); H9(A/swine/HongKong/9/1998); H10(A/mallard/bavaria/3/2006); H11(A/duck/England/1/1956); H12(A/duck/Alberta/60/1976); H13(A/gull/Maryland/704/1977); H14(A/mallard/Astrakhan/263/1982); H15(A/duck/Australia/341/1983); H16(A/black-headed-gull/Turkmenistan/13/1976); H17(A/little-yellow-shouldered-bat/Guatemala/060/2010); H18(A/flat-facedbat/Peru/033/2010).
Equivalent amino acid numbering for subtypes currently circulating in humans or have pandemic potential.
| Mutation | H1pdm | H3 | H5 | H7 | H9 | Phenotype | Reference |
| Tyr → His | 7 | 17 | 7 | 7 | 7 | Increase in fusion pH |
|
| His → Gln | 8 | 18 | 8 | 8 | 8 | Decrease in fusion pH; increased stability |
|
| Asn → Any | 11 | 21 | 11 | 11 | 11 | Loss of N-glycosylation; increased virulence |
|
| Glu→Lys | 75 | 83 | 75 | 73 | 75 | Increased virus binding to α2-6 glycans |
|
| His → Tyr | 103 | 110 | 103 | 100 | 103 | Increased stability |
|
| Ser→ Asn | 122 | 126 | 121 | 116 | 121 | Increased virus binding to α2-6 glycans |
|
| Ser→Pro | 124 | 128 | 123 | 118 | 123 | Increased virus binding to α2-6 glycans |
|
| Ala → Δ | 130 | Δ | 129 | Δ | Δ | Increased virus binding to α2-6 glycans |
|
| Ser → Ala | 134 | 137 | 133 | 127 | 131 | Increased virus binding to α2-6 glycans |
|
| Ala→ Val | 135 | 138 | 134 | 128 | 132 | Increased infectivity in SIAT Cells |
|
| Gly→Arg | 140 | 143 | 139 | 132 | Δ | Increased virus binding to α2-6 glycans |
|
| Ile→ Thr | 152 | 155 | 151 | 144 | 145 | Increased virus binding to α2-6 glycans |
|
| Asn→ Asp | 155 | 158 | 154 | 147 | 148 | Loss of N-glycosylation; increased binding and transmission |
|
| Thr→ Ala | 157 | 160 | 156 | 151 | 150 | Loss of N-glycosylation; increased binding and transmission |
|
| Asn→Lys | 183 | 186 | 182 | 177 | 176 | Increased virus binding to α2-6 glycans |
|
| Asp→Gly | 184 | 187 | 183 | 178 | 177 | Increased virus binding to α2-6 glycans |
|
| Glu→Gly | 187 | 190 | 186 | 181 | 180 | Increased virus binding to α2-6 glycans |
|
| Thr→Ile | 189 | 192 | 188 | 183 | 182 | Increased virus binding to α2-6 glycans |
|
| Lys→Arg | 190 | 193 | 189 | 184 | 183 | Increased virus binding to α2-6 glycans |
|
| Gln→Arg/His | 193 | 196 | 192 | 187 | 186 | Increased virus binding to α2-6 glycans |
|
| Asn→Lys | 194 | 197 | 193 | 188 | 187 | Increased virus binding to α2-6 glycans |
|
| Val → Ile | 211 | 214 | 210 | 205 | 204 | Increased virus binding to α2-6 glycans |
|
| Gln→Leu | 223 | 226 | 222 | 217 | 216 | Increased virus binding to α2-6 glycans |
|
| Ser→Asn | 224 | 227 | 223 | 218 | 217 | Increased virus binding to α2-6 glycans |
|
| Gly→Ser | 225 | 228 | 224 | 219 | 218 | Increased virus binding to α2-6 glycans |
|
| Pro→Ser | 236 | 239 | 235 | 230 | 229 | Increased virus binding to α2-6 glycans |
|
| Glu→Lys | 252 | 255 | 251 | 246 | 245 | Increased virus binding to α2-6 glycans |
|
| Thr→Ile | 316 | 318 | 315 | 309 | 309 | Increase in fusion pH |
|
| Insertion of Arg or Lys | 327 | 329 | 326 | 321 | 320 | Poly-basic cleavage; increased pathogenicity |
|
| Lys → Ile | 385 | 387 | 384 | 379 | 378 | Increase in fusion pH; increased stability |
|
| Asn → Lys | 441 | 443 | 440 | 435 | 434 | Increase in fusion pH; decreased stability |
|
| Asn → Asp | 444 | 446 | 443 | 438 | 437 | Increase in fusion pH |
|
| Arg → Lys | 494 | 496 | 493 | 488 | 487 | Increased virus binding to α2-6 glycans |
|
Residue numbering is based on the mature sequence of HA1 across all subtypes for a set mutations shown to cause phenotypic differences. Positions where there is a deletion relative to other subtypes are represented by a “Δ”.