| Literature DB >> 36146764 |
Guillaume Lafforgue1, Thierry Michon1, Justine Charon1,2.
Abstract
Intrinsically disordered regions (IDRs) are abundant in the proteome of RNA viruses. The multifunctional properties of these regions are widely documented and their structural flexibility is associated with the low constraint in their amino acid positions. Therefore, from an evolutionary stand point, these regions could have a greater propensity to accumulate non-synonymous mutations (NS) than highly structured regions (ORs, or 'ordered regions'). To address this hypothesis, we compared the distribution of non-synonymous mutations (NS), which we relate here to mutational robustness, in IDRs and ORs in the genome of potyviruses, a major genus of plant viruses. For this purpose, a simulation model was built and used to distinguish a possible selection phenomenon in the biological datasets from randomly generated mutations. We analyzed several short-term experimental evolution datasets. An analysis was also performed on the natural diversity of three different species of potyviruses reflecting their long-term evolution. We observed that the mutational robustness of IDRs is significantly higher than that of ORs. Moreover, the substitutions in the ORs are very constrained by the conservation of the physico-chemical properties of the amino acids. This feature is not found in the IDRs where the substitutions tend to be more random. This reflects the weak structural constraints in these regions, wherein an amino acid polymorphism is naturally conserved. In the course of evolution, potyvirus IDRs and ORs follow different evolutive paths with respect to their mutational robustness. These results have forced the authors to consider the hypothesis that IDRs and their associated amino acid polymorphism could constitute a potential adaptive reservoir.Entities:
Keywords: diversity; mutational robustness; potyvirus; protein intrinsic disorder
Mesh:
Substances:
Year: 2022 PMID: 36146764 PMCID: PMC9504506 DOI: 10.3390/v14091959
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Figure 1The % of S and NS mutations in IDRs versus the mutations number in the TEV genome. For a given number of mutations, four independent simulations were run.
Figure 2Mutation % in the TEV proteins from the experimental evolution [13], natural diversity and simulations. For PVY and TuMV, please refer to the Supplemental Data. The proteins are sorted from the smallest to the largest, left to right: 6K1, 6K2, Nia-VPg, Nia-Pro, CP, P1, P3, Hc-Pro, Nib, and CI. % mutation = (number of NS or S mutation within each protein)/(total number of NS or S mutations within the whole genome).
Correlation coefficient (R²) between coding sequence length of the TEV proteins and the mutations (S or NS). Experimental evolution: TEV 2015 [13], PVY 2015 [11], and PVY 2017 [12]. TEVND, PVYND, and TuMVND; ND, natural diversity. Simulations: four in silico replicates.
|
|
| |
| TEV 2015 | 0.63 | 0.19 |
| TEVND | 0.94 | 0.16 |
| Simulations | 1 | 0.98 |
|
|
| |
| PVY 2015 | 0.93 | 0.12 |
| PVY 2017 | 0.78 | 0.01 |
| PVYND | 0.96 | 0.35 |
| Simulations | 0.96 | 0.97 |
|
|
| |
| TuMVND | 0.95 | 0.09 |
| Simulations | 0.92 | 0.98 |
Figure 3Ratio between the percentage of mutation (S or NS) present in IDRs and ORs for the TEV genome. For datasets from the two other studies [11,12] regarding TuMV and PVY, please refer to the Supplemental Data.
p values of Xhi2 test for percentage of NS mutations in IDRs between simulated and experimental data (TEV 2015, PVY 2015, PVY 2017) or natural diversity (TEVND, PVYND, TuMVND). There was no experimental datasets available for TuMV. Significance, p < 0.05.
| Simulations | TEV 2015 | TEVND | PVY 2015 | PVY 2017 | PVYND | TuMVND |
|---|---|---|---|---|---|---|
|
| 0.014 | 2.10 × 10−6 | 0.52 | 0.04 | 0.0005 | 3.10 × 10−8 |
|
| 0.026 | 1.10 × 10−5 | 0.46 | 0.03 | 0.0001 | 5.10 × 10−8 |
|
| 0.018 | 5.10 × 10−6 | 0.33 | 0.01 | 8.10-6 | 2.10 × 10−8 |
|
| 0.028 | 2.10 × 10−5 | 0.27 | 0.0049 | 1.10-7 | 6.10 × 10−9 |
Differences in physicochemical properties associated with amino acid substitutions were assessed using scores derived from the BLOSUM62 substitution matrix. For each type of region (IDRs or ORs), virus biological and simulated data were distributed into statistical groups (a, b, and c) by running a Dunn test (p value adjustment method: Bonferroni) for the (A) PVY genome, (B) TEV genome, and (C) TuMV genome.
| A | IDRs | ORs | B | IDRs | ORs | C | IDRs | ORs |
|---|---|---|---|---|---|---|---|---|
| PVY 2015 | abc | ab | TEV 2015 | ab | ab | TuMVND | a | a |
| PVY 2017 | abc | a | TEVND | b | b | Sim A | a | b |
| PVYND | ac | a | Sim A | a | a | Sim B | a | b |
| Sim A | bc | c | Sim B | a | a | Sim C | a | b |
| Sim B | bc | c | Sim C | a | a | Sim D | a | b |
| Sim C | b | bc | Sim D | ab | a | |||
| Sim D | bc | c |
Figure 4Mutational robustness of IDRs and codon optimization. The low rate of non-synonymous mutations in the ORs allows the optimization of the sequence towards the selection of abundant codons in the host. The high rate of non-synonymous mutations in IDRs prevents this optimization.