| Literature DB >> 20482883 |
Jason W H Wong1, Alexander B Schwahn, Kevin M Downard.
Abstract
BACKGROUND: High resolution mass spectrometry has been employed to rapidly and accurately type and subtype influenza viruses. The detection of signature peptides with unique theoretical masses enables the unequivocal assignment of the type and subtype of a given strain. This analysis has, to date, required the manual inspection of mass spectra of whole virus and antigen digests.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20482883 PMCID: PMC3098065 DOI: 10.1186/1471-2105-11-266
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic overview of the FluTyper algorithm.
Figure 2Performance of naïve Bayes classifier relative to protein coverage in terms of generating conclusive results for typing (A) and subtyping (B) of the influenza virus and in terms of false positive rates for typing (C) and subtyping (D). In all subfigures, the black, red, green and blue lines represent NP, M1, HA and NA respectively.
Figure 3High resolution mass spectrum of (A) whole virus digest of influenza type B strain B/Victoria/504/2000, (B) whole virus tryptic digest of influenza type A (H3N2) strain A/Brisbane/10/2007, and (C) in-gel tryptic digest of the nucleoprotein from type A (H1N1) strain A/Solomon Islands/03/06.
Identified peptides from a mass spectrum (Figure 3A) of a whole virus digest of type B influenza strain B/Victoria/504/2000
| Po | ||||||||
|---|---|---|---|---|---|---|---|---|
| Sequence | [M+H]+ | Start-End | Δppm | Uniqueness(in ppm) | FluA | FluB | Protein | Type signature |
| GGFVHQR | 800.41620 | 354-360 | -0.2449 | -1.6716 | 0.0000 | NA | * | |
| SHFANLK | 816.43626 | 69-75 | 0.2094 | 12.1210 | 0.0000 | HA | * | |
| QLPNLLR | 853.52541 | 136-142 | -0.3046 | -55.8003 | 0.0000 | HA | * | |
| GLILAERK | 899.56728 | 95-102 | -0.9505 | 4.4710 | 0.0001 | M1 | ||
| TIYFSPIR | 996.55129 | 163-170 | -0.1806 | 3.3917 | 0.0001 | NP | ||
| AGLNDDMER | 1020.44149 | 97-105 | 0.3116 | 32.3536 | 0.0000 | NP | * | |
| LQFWAPMTR | 1149.58736 | 439-447 | -0.5106 | 2.1677 | 0.0000 | NP | * | |
| QTIPNFFFGR | 1226.63167 | 540-549 | 0.1753 | 4.7871 | 0.0000 | NP | * | |
| SMVVVRPSVASK | 1259.71402 | 320-331 | -0.3271 | -11.0477 | 0.0000 | NP | * | |
| LNVETDTAEIR | 1260.64302 | 309-319 | 0.4815 | -1.9918 | 0.0000 | NA | * | |
| NLIQNAHAVER | 1264.67566 | 106-116 | -0.2001 | -0.8042 | 0.0000 | NP | * | |
| SKPYYTGEHAK | 1280.62696 | 336-346 | -0.4123 | 2.6393 | 0.0000 | HA | * | |
| LGEFYNQMMVK | 1359.64355 | 86-96 | -1.3173 | 8.2625 | 0.0000 | NP | * | |
| VLSALTGTEFKPR | 1418.80019 | 399-411 | 0.4130 | -6.9756 | 0.0000 | NP | * | |
| LAEELQSNIGVLR | 1441.80092 | 201-213 | -1.2249 | 7.7910 | 0.0000 | M1 | * | |
| EFDLDSALEWIK | 1465.72094 | 36-47 | -0.2681 | -2.7440 | 0.0000 | M1 | * | |
| REMQMVSAMNTAK | 1496.70182 | 175-187 | -0.3554 | 2.1066 | 0.0001 | M1 | ||
| NPGIADIEDLTLLAR | 1610.87481 | 305-319 | -1.1441 | 6.9732 | 0.0000 | NP | * | |
Naïve Bayes probabilities: P(FluB) = 1.000
Identified peptides from a mass spectrum (Figure 3B) of a whole virus digest of type A (H3N2) influenza strain A/Brisbane/10/2007
| Po | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | [M+H]+ | Start-End | Δppm | Uniqueness (in ppm) | H1N1 | H5N1 | P2009 | Protein | Type signature | Subtype sig/ind | |
| HENR | 555.26338 | 175-178 | 0.0432 | 52.5444 | 1.0000 | 1.0000 | 1.0000 | M1 | * | ||
| NFWR | 622.30961 | 211-214 | 0.2025 | 15.9037 | 1.0000 | 0.9885 | 1.0000 | NP | * | ||
| RIWR | 630.38344 | 124-127 | 0.7773 | -2.1209 | 0.8377 | 0.0235 | 0.0000 | NP | |||
| LLSFIR | 748.47159 | 349-354 | -2.2232 | -15.0079 | 0.0319 | 0.0000 | 0.0000 | NP | * | ||
| SGYWAIR | 852.43626 | 389-395 | 0.4282 | -5.2567 | 0.0000 | 0.0000 | 0.0000 | NP | * | ||
| QMVQAMR | 863.42261 | 211-217 | 0.9335 | 3.6413 | 0.9474 | 0.9730 | 0.0000 | M1 | * | ||
| EITFHGAK | 902.47304 | 106-113 | 1.7441 | 3.7342 | 0.9794 | 0.6944 | 0.9841 | M1 | |||
| MVLSAFDER | 1067.51901 | 72-80 | 1.1213 | -6.9338 | 1.0000 | 1.0000 | 0.9877 | NP | * | ||
| TRPILSPLTK | 1125.69902 | 48-57 | -0.6476 | 19.9583 | 1.0000 | 1.0000 | 1.0000 | M1 | * | ||
| GINDRNFWR | 1177.58612 | 206-214 | 0.2675 | -1.1430 | 0.8883 | 0.0345 | 0.9726 | NP | |||
| MVLSAFDERR | 1223.62012 | 72-81 | -0.9578 | 5.3252 | 0.9628 | 0.0000 | 0.9877 | NP | |||
| GIGTM*VMELIR | 1235.64864 | 191-201 | -0.5722 | -2.7281 | 0.0106 | 0.0116 | 0.0000 | NP | |||
| MMEGAKPEEVSFR | 1510.70286 | 453-465 | 0.2529 | 2.2314 | 0.0053 | 0.0000 | 0.0058 | NP | |||
| GWAFDDGNDVWMGR | 1625.68015 | 357-370 | -2.3418 | 79.7986 | 0.0000 | 0.0000 | 0.0000 | NA(N2) | * | * | |
| GILGFVFTLTVPSER | 1635.91047 | 58-72 | -0.6046 | -1.6425 | 0.9588 | 0.8947 | 0.9831 | M1 | * | ||
| NPGNAEIEDLIFLAR | 1671.87006 | 253-267 | 0.8739 | -6.7188 | 0.2606 | 0.9892 | 0.9589 | NP | * | ||
| MMEGAKPEEVSFRGR | 1723.82544 | 453-467 | 0.7002 | -1.9602 | 0.0053 | 0.0000 | 0.0000 | NP | |||
| ESRNPGNAEIEDLIFLAR | 2044.04579 | 250-267 | 1.0631 | -3.5273 | 0.2128 | 0.0538 | 0.9315 | NP | |||
Naïve Bayes probabilities: P(FluA) = 1.000, P(H3N2) = 1.000; M* indicates methionine sulfoxide
Identified peptides from a mass spectrum (Figure 3C) of nucleoprotein derived from type A (H1N1) influenza strain A/Solomon Islands/03/06
| Po | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | [M+H]+ | Start-End | Δppm | H3N2 | H5N1 | P2009 | Protein | Type signature | Subtype sig/ind | ||
| NFWR | 622.30961 | 205-208 | 0.0578 | 15.9037 | 0.9975 | 0.9885 | 1.0000 | NP | * | ||
| MIGGIGR | 703.39196 | 32-38 | 0.0668 | 9.2751 | 0.0078 | 0.0000 | 0.9877 | NP | |||
| YWAIR | 708.38276 | 385-389 | -0.0663 | -16.7649 | 0.2902 | 1.0000 | 1.0000 | NP | * | ||
| SRYWAIR | 951.51590 | 383-389 | 1.4419 | 10.4002 | 0.1878 | 0.8495 | 0.0411 | NP | |||
| SGGNTNQQR | 961.44460 | 392-400 | 0.6313 | 3.6976 | 0.9724 | 0.5161 | 0.0137 | NP | * | ||
| MVLSAFDER | 1067.51901 | 66-74 | -0.9958 | -6.9338 | 0.9974 | 1.0000 | 0.9877 | NP | * | ||
| M*VLSAFDER | 1083.51392 | 66-74 | -2.1209 | -12.6763 | 0.9974 | 1.0000 | 0.9877 | NP | |||
| MVLSAFDERR | 1223.62012 | 66-75 | 2.0251 | 5.3252 | 0.9844 | 0.0000 | 0.9877 | NP | |||
| FYIQMCTELK | 1332.63265 | 45-54 | 0.6754 | 32.3082 | 0.9948 | 0.9560 | 1.0000 | NP | * | ||
| SYEQM*ETDGER | 1360.53214 | 9-19 | 1.6854 | 8.0895 | 0.3526 | 0.0000 | 0.0260 | NP | |||
| M*CSLM*QGSTLPR | 1412.63307 | 163-174 | 2.5130 | -7.6276 | 0.9974 | 0.9765 | 1.0000 | NP | |||
Naïve Bayes probabilities: P(FluA) = 1.000, P(H1N1) = 0.9998; M* indicates methionine sulfoxide
Total protein coverage of the different antigens identified from the mass spectrum of each of samples tested
| Protein coverage | ||||
|---|---|---|---|---|
| Sample | HA | NA | NP | M1 |
| Influenza B (Victoria) | 12.0% | 10.3% | 27.9% | 18.5% |
| H3N2 (Brisbane) | 19.3% | 10.5% | 42.0% | 33.3% |
| H1N1 (Solomon Island) | n/a | n/a | 24.8% | n/a |