| Literature DB >> 23613826 |
Abstract
Proteins are under tight evolutionary constraints, so if a protein changes it can only do so in ways that do not compromise its function. In addition, the proteins in an organism evolve at different rates. Leveraging the history of patristic distance methods, a new method for analysing comparative protein evolution, called Mean Protein Evolutionary Distance (MeaPED), measures differential resistance to evolutionary pressure across viral proteomes and is thereby able to point to the proteins' roles. Different species' proteomes can also be compared because the results, consistent across virus subtypes, concisely reflect the very different lifestyles of the viruses. The MeaPED method is here applied to influenza A virus, hepatitis C virus, human immunodeficiency virus (HIV), dengue virus, rotavirus A, polyomavirus BK and measles, which span the positive and negative single-stranded, doubled-stranded and reverse transcribing RNA viruses, and double-stranded DNA viruses. From this analysis, host interaction proteins including hemagglutinin (influenza), and viroporins agnoprotein (polyomavirus), p7 (hepatitis C) and VPU (HIV) emerge as evolutionary hot-spots. By contrast, RNA-directed RNA polymerase proteins including L (measles), PB1/PB2 (influenza) and VP1 (rotavirus), and internal serine proteases such as NS3 (dengue and hepatitis C virus) emerge as evolutionary cold-spots. The hot spot influenza hemagglutinin protein is contrasted with the related cold spot H protein from measles. It is proposed that evolutionary cold-spot proteins can become significant targets for second-line anti-viral therapeutics, in cases where front-line vaccines are not available or have become ineffective due to mutations in the hot-spot, generally more antigenically exposed proteins. The MeaPED package is available from www.pam1.bcs.uwa.edu.au/~michaelw/ftp/src/meaped.tar.gz.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23613826 PMCID: PMC3626687 DOI: 10.1371/journal.pone.0061276
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phylogenetic trees based on avian influenza M1 matrix and neuraminidase proteins.
Phylogenetic trees have been created based on small sets of M1 matrix proteins (a) and corresponding neuraminidase proteins (b) taken from complete influenza proteomes. The trees were created using Phyml and the figures drawn using FigTree. Notice that the neuraminidase tree forms clades largely corresponding to influenza type. Notice also that the scale bar is much larger for the neuraminidase tree.
Protein Evolutionary Distances - Initial and Final counts of sequences following deletion of duplicates, Mean PED, Adjusted Mean PED and Adjusted Mean PED per 100 aa, and mean .
| Dengue virus type 1 | |||||||
| Protein | Med.Len. | Init N | Final N | Mean | Adj. Mean | Adj. Mean per 100 aa | dnds |
| NS2a | 654.0 | 651 | 323 | 0.0394 | 0.0196 | 0.0090 | 0.0925 |
| M | 221.3 | 651 | 186 | 0.0219 | 0.0063 | 0.0086 | 0.0622 |
| NS4a | 381.0 | 649 | 246 | 0.0254 | 0.0096 | 0.0076 | 0.0592 |
| C | 300.0 | 651 | 149 | 0.0297 | 0.0068 | 0.0068 | 0.1883 |
| NS2b | 390.0 | 651 | 258 | 0.0219 | 0.0087 | 0.0067 | 0.0415 |
| 2K | 69.0 | 622 | 60 | 0.0158 | 0.0015 | 0.0066 | 0.0336 |
| NS1 | 1056.0 | 651 | 392 | 0.0288 | 0.0174 | 0.0049 | 0.0560 |
| NS4b | 747.0 | 651 | 304 | 0.0158 | 0.0074 | 0.0030 | 0.0383 |
| E | 1489.4 | 651 | 435 | 0.0195 | 0.0130 | 0.0026 | 0.0525 |
| NS5 | 2697.0 | 633 | 472 | 0.0212 | 0.0158 | 0.0018 | 0.0512 |
| NS3 | 1857.0 | 651 | 465 | 0.0131 | 0.0093 | 0.0015 | 0.0281 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| p7 | 189.0 | 804 | 749 | 0.3524 | 0.3283 | 0.5211 | 0.1158 |
| NS2 | 651.0 | 804 | 778 | 0.5698 | 0.5514 | 0.2541 | 0.0974 |
| E1 | 576.1 | 804 | 764 | 0.4907 | 0.4663 | 0.2428 | 0.0878 |
| NS4a | 162.0 | 804 | 733 | 0.1388 | 0.1265 | 0.2343 | 0.0698 |
| E2 | 1089.3 | 804 | 780 | 0.7518 | 0.7294 | 0.2009 | 0.0761 |
| F | 485.0 | 630 | 581 | 0.3161 | 0.2915 | 0.1810 | 4.6603 |
| NS4b | 783.0 | 804 | 774 | 0.3403 | 0.3276 | 0.1255 | 0.0397 |
| NS5a | 1342.8 | 804 | 779 | 0.4860 | 0.4709 | 0.1051 | 0.0859 |
| NS5b | 1769.7 | 743 | 733 | 0.2824 | 0.2787 | 0.0471 | 0.0866 |
| NS3 | 1893.0 | 804 | 782 | 0.3045 | 0.2962 | 0.0469 | 0.0349 |
| C | 573.0 | 804 | 754 | 0.0481 | 0.0451 | 0.0236 | 0.0482 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| VPU | 247.9 | 1018 | 700 | 0.5390 | 0.3706 | 0.4575 | 0.2750 |
| VPR | 290.9 | 1001 | 692 | 0.5365 | 0.3709 | 0.3863 | 0.3525 |
| NEF | 626.6 | 795 | 624 | 0.9122 | 0.7160 | 0.3459 | 0.3616 |
| TAT | 306.1 | 1021 | 722 | 0.4233 | 0.2993 | 0.2963 | 0.9970 |
| REV | 351.0 | 1023 | 724 | 0.3848 | 0.2723 | 0.2348 | 0.8367 |
| VIF | 579.2 | 1016 | 754 | 0.3996 | 0.2966 | 0.1545 | 0.4192 |
| ENV | 2575.5 | 975 | 875 | 0.7628 | 0.6845 | 0.0799 | 0.4389 |
| GAG | 1509.0 | 996 | 810 | 0.3543 | 0.2881 | 0.0575 | 0.1367 |
| POL | 3015.8 | 984 | 896 | 0.2816 | 0.2564 | 0.0256 | 0.1336 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| HA | 1697.3 | 3368 | 2159 | 1.5015 | 0.9625 | 0.1701 | 0.1042 |
| NA | 1407.3 | 3357 | 1777 | 1.1722 | 0.6205 | 0.1323 | 0.1258 |
| PB1_F2 | 237.8 | 1932 | 344 | 0.6240 | 0.1111 | 0.1235 | 1.3072 |
| NS1 | 680.7 | 3364 | 1140 | 0.2652 | 0.0899 | 0.0391 | 0.2740 |
| M2 | 290.8 | 3359 | 539 | 0.1301 | 0.0209 | 0.0215 | 0.4055 |
| NS2 | 363.0 | 3349 | 624 | 0.0995 | 0.0185 | 0.0153 | 0.1124 |
| NP | 1494.0 | 3345 | 1656 | 0.1308 | 0.0648 | 0.0130 | 0.0827 |
| PA | 2148.0 | 3339 | 2090 | 0.0691 | 0.0432 | 0.0060 | 0.0640 |
| M1 | 756.0 | 3363 | 913 | 0.0542 | 0.0147 | 0.0058 | 0.0756 |
| PB2 | 2277.0 | 3338 | 2190 | 0.0624 | 0.0409 | 0.0054 | 0.0590 |
| PB1 | 2271.1 | 3343 | 2143 | 0.0448 | 0.0287 | 0.0038 | 0.0496 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| V | 900.4 | 23 | 14 | 0.0407 | 0.0248 | 0.0083 | 1.0998 |
| C | 561.0 | 30 | 16 | 0.0276 | 0.0147 | 0.0079 | 0.2756 |
| P | 1524.0 | 34 | 23 | 0.0387 | 0.0262 | 0.0052 | 0.6530 |
| N | 1578.0 | 34 | 22 | 0.0236 | 0.0153 | 0.0029 | 0.2288 |
| M | 1008.0 | 33 | 22 | 0.0141 | 0.0094 | 0.0028 | 0.3392 |
| H | 1854.5 | 34 | 23 | 0.0246 | 0.0166 | 0.0027 | 0.7055 |
| F | 1657.8 | 34 | 22 | 0.0126 | 0.0081 | 0.0015 | 0.4244 |
| L | 6552.0 | 34 | 29 | 0.0119 | 0.0101 | 0.0005 | 0.1804 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| AGNO | 199.3 | 528 | 38 | 0.0394 | 0.0028 | 0.0043 | 0.4549 |
| VP1 | 1089.0 | 530 | 124 | 0.0425 | 0.0099 | 0.0027 | 0.1217 |
| VP3 | 699.0 | 528 | 52 | 0.0334 | 0.0033 | 0.0014 | 0.1531 |
| VP2 | 1056.0 | 528 | 57 | 0.0238 | 0.0026 | 0.0007 | 0.1554 |
| ST | 519.0 | 530 | 48 | 0.0109 | 0.0010 | 0.0006 | 0.1269 |
| LT | 2087.6 | 526 | 138 | 0.0089 | 0.0023 | 0.0003 | 0.0328 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| VP7 | 977.2 | 135 | 98 | 0.2149 | 0.1560 | 0.0478 | 0.0609 |
| NSP1 | 1458.9 | 129 | 94 | 0.2253 | 0.1641 | 0.0338 | 0.1122 |
| NSP4 | 525.0 | 134 | 84 | 0.0534 | 0.0335 | 0.0191 | 0.1037 |
| NSP5 | 591.3 | 131 | 67 | 0.0534 | 0.0273 | 0.0139 | 0.1339 |
| NSP3 | 931.7 | 118 | 78 | 0.0466 | 0.0308 | 0.0099 | 0.0613 |
| NSP2 | 951.0 | 122 | 71 | 0.0488 | 0.0284 | 0.0090 | 0.0461 |
| VP4 | 2325.2 | 135 | 102 | 0.0907 | 0.0685 | 0.0088 | 0.0401 |
| VP3 | 2505.0 | 130 | 98 | 0.0624 | 0.0471 | 0.0056 | 0.0537 |
| VP6 | 1189.7 | 130 | 82 | 0.0172 | 0.0109 | 0.0027 | 0.0131 |
| VP2 | 2675.1 | 131 | 98 | 0.0307 | 0.0230 | 0.0026 | 0.0214 |
| VP1 | 3260.5 | 135 | 113 | 0.0246 | 0.0206 | 0.0019 | 0.0345 |
List of Viral Species Examined in this Study.
| Species | Type | Genome format | N proteins |
| Influenza virus | Negative Sense ssRNA | Segmented linear | 11 |
| Measles | Negative Sense ssRNA | Single linear | 8 |
| Hepatitis C virus | Positive Sense ssRNA | Single, linear forming polyprotein | 11 |
| Dengue virus | Positive Sense ssRNA | Single, linear forming polyprotein | 11 |
| HIV 1 | Positive Sense RNA Reverse Transcribing | Single linear | 9 |
| Rota virus | dsRNA | Segmented linear | 11 |
| Polyoma virus | dsDNA | Single Circular | 6 |
Spearman Rank Correlation of MeaPED Analyses Undertaken via Phyml and Neighbor.
| Species | N proteins | Spearman | p-value |
| DENV1 | 11 | 0.964 | 8.40e−08 |
| DENV2 | 11 | 0.964 | 8.40e−08 |
| DENV3 | 11 | 0.982 | 3.706e−09 |
| DENV4 | 11 | 0.982 | 3.76e−09 |
| HIV1b | 9 | 0.751 | 2.50e−03 |
| HIV1c | 9 | 0.871 | 2.36e−04 |
| HIV1d | 9 | 0.934 | 2.16e−05 |
| HCV1 | 11 | 0.894 | 1.12e−05 |
| HCV2 | 11 | 0.946 | 5.14e−07 |
| HCV3 | 11 | 0.946 | 5.14e−07 |
| HCV4 | 11 | 1.000 | 1.29e−13 |
| HCV6 | 11 | 0.982 | 3.76e−09 |
| Avian Influenza | 11 | 0.995 | 7.46e−12 |
| Swine Influenza | 11 | 1.000 | 1.29e−13 |
| Human Influenza | 11 | 0.964 | 8.40e−08 |
| Measles | 8 | 0.862 | 0.001 |
| Polyomavirus | 6 | 1.000 | 1.46e−05 |
| Rotavirus A | 11 | 1.000 | 1.29e−13 |
Comparing the consistency of AMM100 and values across virus subtypes.
| Dengue virus | ( | ||
| Method | mean | Stouffer MST | Fisher MST |
| AMM100 | 0.790 | 4.14e−10 | 1.93e−09 |
| dnds | 0.615 | 7.47e−07 | 5.55e−07 |
|
|
| ||
|
|
|
|
|
| AMM100 | 0.831 | 3.82e−07 | 9.85e−07 |
| dnds | 0.857 | 3.21e−07 | 6.23e−07 |
|
|
| ||
|
|
|
|
|
| AMM100 | 0.916 | 1.018e−20 | 1.43e−19 |
| dnds | 0.60 | 1.35e−07 | 1.07e−06 |
|
|
| ||
|
|
|
|
|
| AMM100 | 0.900 | 5.59e−10 | 1.65e−09 |
| dnds | 0.946 | 1.57e−12 | 4.75e−12 |