| Literature DB >> 26889113 |
Reza Pourrahim1, Shirin Farzadfar1.
Abstract
In this study, the full-length nucleotide sequences of four Iranian PVY isolates belonging to PVY(N) strain were determined. The genome of Iranian PVY isolates were 9,703-9,707 nucleotides long encoding all potyviral cistrons including P1, HC-Pro, P3, 6K1, CI, 6K2, VPg, NIa-Pro, NIb and CP with coding regions of 825, 1,395, 1,095, 156, 1,902, 156, 564, 732, 1,557 and 801 nucleotides in length, respectively. The length of pipo, embedded in the P3 cistron, was 231 nucleotides. Phylogenetic analysis showed that the Iranian isolates clustered with European recombinant NTN isolates in the N lineage. Recombination analysis demonstrated that Iranian PVY(N) isolates had a typical European PVY(NTN) genome having three recombinant junctions while PVY(N) and PVY(O) were identified as the parents. We used dN/dS methods to detect candidate amino acid positions for positive selection in viral proteins. The mean ω ratio differed among different genes. Using model M0, ω values were 0.267 (P1), 0.085 (HC-Pro), 0.153 (P3), 0.050 (CI), 0.078 (VPg), 0.087 (NIa-pro), 0.079 (NIb) and 0.165 (CP). The analysis showed different sites within P1, P3 and CP were under positive selection pressure, however, the sites varied among PVY populations. To the best of our knowledge, our analysis provides the first demonstration of population structure of PVY(N) strain in mid-Eurasia Iran using complete genome sequences and highlights the importance of recombination and selection pressure in the evolution of PVY.Entities:
Keywords: PVYN; phylogeny; recombination; variation
Year: 2016 PMID: 26889113 PMCID: PMC4755673 DOI: 10.5423/PPJ.OA.07.2015.0144
Source DB: PubMed Journal: Plant Pathol J ISSN: 1598-2254 Impact factor: 1.795
Fig. 3A maximum likelihood (ML) phylogenetic tree showing the relationship among Potato virus Y isolates. The tree was constructed from the full-length sequences of 58 PVY isolates, including recombinant isolates. Numbers at each node indicate the percentage of supporting puzzling steps (or bootstrap samples) in ML method. The name of each isolate, its country of origin, strain relationship, and GenBank accession number are listed. We used Pepper mottle virus (PeMoV) (accession no. NC_001517) as an outgrop.
Fig. 1Two dimensional nucleotide diversity plot constructed using complete genome sequences of four PVYN Iranian isolates (IRNH1, IRNH172, IRND41 and IRNZ33) with 54 others one obtained from GenBank.
Fig. 2Recombination pattern of Potato virus Y genome. (A) Color figure of each lineage and recombination pattern of Iranian isolates. (B) Graphs showing SISCAN Version 2 analysis of the full-length sequence of Iranian isolate (IRNH172) with that of EF026074 (Blue line) as well as X97895 (Red line). X and Y axes represent Z-value and genome nucleotides, respectively. The sequences of EF026074 and X97895 represent the likely parental sequences of IRNH172. Iranian recombinant isolate (IRNH172) was more closely related to EF026074 between approximately 190 to 2,244 nt, and 8,977 to 9,703 nt; and more closely related to X97895 between 5,581 to 8,977 nt (i.e. Z-value >3.0).
Recombination crossover sites in Potato virusY isolates detected by recombination detecting programs
| Recombinant | Event | Parental isolates | Breakpoints | Methods | SISCAN ( | ||
|---|---|---|---|---|---|---|---|
|
|
| ||||||
| Major parent (% identity) | Minor parent (% identity) | Begin | End | ||||
| Iranian isolates | 1 | EF026074 (O) (92.0) | X97895 (N) (99.0) | P1/190 | HC-Pro/2244 | RGBMCS | 12.0 |
| 2 | EF026074 (O) (92.0) | X97895 (N) (99.0) | 6K2/5581 | CP/8977 | RGBMCS | 10.0 | |
| 3 | EF026074 (O) (92.0) | X97895 (N) (99.0) | CP/8977 | CP/9703 | RGBMCS | 8.0 | |
Parental isolate means the most likely isolate among analyzed isolates.
Nucleotide sequence identity in particular genome region of each lineage is indicated in parentheses.
Numbers indicate recombination sites.
R, G, B, M, C, and S indicate detection by RDP, GENCONV, BOOTSCAN, MAXCHI, CHIMAERA and SISCAN methods, respectively, with the presented highest P-value being that determined by the method indicated in bold type.
The Z-value was calculated by original SISCAN v.2 program.
Haplotype and nucleotide diversity of each Potato virus Y population
| Lineage | Haplotype diversity | Nucleotide diversity | |
|---|---|---|---|
| N | 0.998 (0.000) | 0.074 (0.002) | |
|
| |||
| Sub-populations | N-Europe | 1.000 (0.003) | 0.015 (0.001) |
| N-Europe & North America | 0.989 (0.000) | 0.022 (0.001) | |
| W (PYVN-Wi) | 1.000 (0.002) | 0.016 (0.001) | |
|
| |||
| O | 0.964 (0.005) | 0.035 (0.001) | |
|
| |||
| C | 1.000 (0.003) | 0.120 (0.003) | |
Nucleotide diversity was estimated by the average pairwise difference among sequence in a sample, based on all sites (Tamura et al., 2007). Numbers in parentheses indicate standard deviations.
Mean nucleotide diversity between Potato virus Y populations
| Populations | C | O | N-Europe | N-Europe & North America | PYVN-Wi |
|---|---|---|---|---|---|
| C | - | 0.138 (0.003) | 0.229 (0.004) | 0.111 (0.002) | 0.159 (0.003) |
| O | - | - | 0.128 (0.003) | 0.187 (0.004) | 0.075 (0.002) |
| N-Europe | - | - | - | 0.089 (0.002) | 0.072 (0.003) |
| N-Europe & North America | - | - | - | - | 0.151 (0.004) |
| W (PYVN-Wi) | - | - | - | - | - |
Nucleotide diversity was estimated by the average pairwise difference among sequence in a sample, based on all sites (Tamura et al., 2007). Numbers in parentheses indicate standard deviations.
The dn/ds (ω) values, log-likelihood (lnL) values, likelihood ratio test (LRT) statistics and positively selected amino acid sites undergoing different models of codon substitution were used to investigate selection pressures on ten proteins encoded by the Potato virus Y genome analysed in this study
| Protein | Models | Parameter estimates | ln | LRT | Positively selected (amino acids) sites | |
|---|---|---|---|---|---|---|
|
| ||||||
| Site models | ||||||
| P1 | M0 | 0.267 | −6441.013 | None | ||
| M3 | 0.311 | −6314.060 | P < 0.01 (0.0) | 26 L, 34 S, 36 T, 52 R, 55 E, 56 F, 67 C*, 78 A, 116 L, 134 A, 137 Y, 138 H, 140 P, 184 H, 185 R*, 187 V, 203 R, 206 K, 209 V, 210 V, 211 R*, 212 L, 213 Q, 214 H*, 242 T*, 247 H | ||
| M1a | 0.366 | −6324.428 | Not allowed | |||
| M2a | 0.366 | −6324.428 | ||||
| M7 | 0.297 | −6319.041 | Not allowed | |||
| M8 | 0.310 | −6315.448 | P < 0.05 (0.0275) | 24 L, 26 L, 34 S, 36 T, 52 R, 55 E, 56 F, 67 C*, 78 A, 92 Y, 116 L, 124 V, 134 A, 137 Y, 138 H, 140 P, 184 H, 185 R*, 187 V, 189 C, 203 R, 204 C, 206 K, 209 V, 210 V, 211 R*, 212 L, 213 Q, 214 H*, 215 L, 242 T*, 247 H | ||
|
| ||||||
| HC-Pro | M0 | 0.085 | −6891.625 | None | ||
| M3 | 0.090 | −6829.734 | P < 0.01 (0.0) | Not allowed | ||
| M1a | 0.1100 | −6855.761 | Not allowed | |||
| M2a | 0.1110 | −6855.761 | 25 Q, 108 I | |||
| M7 | 0.089 | −6830.501 | Not allowed | |||
| M8 | 0.089 | −6830.501 | 4 D, 25 Q, 107 T, 108 I, 245 N | |||
|
| ||||||
| P3 | M0 | 0.153 | −5691.115 | None | ||
| M3 | 0.184 | −5559.300 | P < 0.01 (0.0) | 2 I, 51 Y, 77 R**, 131 H*, 160 P**, 163 N*, 164 T, 176 N**, 179 N**, 190 K, 191 L*, 192 S, 193 A, 194 T*, 195 W, 196 Y**, 197 S*, 198 Y**, 199 R**, 200 A, 202 R**, 203 S**, 204 I*, 206 R**, 207 Y, 212 G, 230 R, 231 G**, 232 A**, 233 Q*, 234 V**, 237 G, 243 S*, 299 Q, 361 D* | ||
| M1a | 0.181 | −5563.445 | Not allowed | |||
| M2a | 0.192 | −5562.836 | P > 0.05 (0.543) | 179 N, 196 Y, 198 Y | ||
| M7 | 0.171 | −5567.421 | Not allowed | |||
| M8 | 0.184 | −5559.396 | P < 0.001 (0.0003) | 51 Y, 77 R**, 131 H*, 160 P**, 163 N*, 164 T, 176 N**, 179 N**, 190 K, 191 L*, 193 A, 194 T*, 195 W, 196 Y**, 197 S*, 198 Y**, 199 R**, 200 A, 202 R**, 203 S**, 204 I*, 206 R**, 207 Y, 230 R, 231 G**, 232 A**, 234 V**, 243 S*, 299 Q, 361 D* | ||
|
| ||||||
| 6K1 | M0 | 0.087 | −704.608 | None | ||
| M3 | 0.090 | −703.224 | P > 0.05 (0.597) | None | ||
| M1a | 0.087 | −704.608 | Not allowed | |||
| M2a | 0.087 | −704.608 | None | |||
| M7 | 0.091 | −703.647 | Not allowed | |||
| M8 | 0.091 | −703.647 | None | |||
|
| ||||||
| CI | M0 | 0.050 | −8548.965 | None | ||
| M3 | 0.053 | −8488.735 | P < 0.01 (0.0) | 15 T**, 381 I | ||
| M1a | 0.066 | −8508.226 | Not allowed | |||
| M2a | 0.066 | −8508.226 | None | |||
| M7 | 0.053 | −8493.151 | Not allowed | |||
| M8 | 0.053 | −8489.164 | P < 0.05 (0.018) | 15 T*, 381 I | ||
|
| ||||||
| 6K2 | M0 | 0.042 | −682.505 | None | ||
| M3 | 0.047 | −679.229 | P < 0.01 (0.0) | None | ||
| M1a | 0.056 | −679.986 | Not allowed | |||
| M2a | 0.056 | −679.986 | None | |||
| M7 | 0.048 | −679.506 | Not allowed | |||
| M8 | 0.048 | −679.506 | None | |||
|
| ||||||
| VPg | M0 | 0.078 | −2565.512 | None | ||
| M3 | 0.084 | −2528.559 | P < 0.01 (0.0) | None | ||
| M1a | 0.113 | −2534.188 | Not allowed | |||
| M2a | 0.113 | −2534.188 | 123 T, 164 L | |||
| M7 | 0.087 | −2528.759 | Not allowed | |||
| M8 | 0.087 | −2527.488 | P > 0.05 (0.280) | 119 G, 123 T, 164 L | ||
|
| ||||||
| NIa | M0 | 0.087 | −3965.610 | |||
| M3 | 0.099 | −3917.779 | P < 0.01 (0.0) | |||
| M1a | 0.117 | −3929.965 | Not allowed | |||
| M2a | 0.117 | −3929.965 | ||||
| M7 | 0.100 | −3919.532 | Not allowed | |||
| M8 | 0.100 | −3919.533 | 21 V, 104 V, 173 V | |||
|
| ||||||
| NIb | M0 | 0.079 | −7900.021 | None | ||
| M3 | 0.087 | −7815.714 | P < 0.01 (0.0) | |||
| M1a | 0.106 | −7834.727 | Not allowed | |||
| M2a | 0.106 | −7834.727 | 98 I, 101 L, 157 L, 266 I, 511 C | |||
| M7 | 0.088 | −7818.351 | Not allowed | |||
| M8 | 0.088 | −7818.352 | P > 0.01 (0.999) | 98 I, 100 Y, 101 L, 157 L, 266 I, 290 I, 511 C | ||
|
| ||||||
| CP | M0 | 0.165 | −3311.127 | None | ||
| M3 | 0.181 | −3249.592 | P < 0.01 (0.0) | 24 P, 25 N, 187 V, 193 G* | ||
| M1a | 0.165 | −3251.333 | Not allowed | |||
| M2a | 0.185 | −3249.782 | P > 0.05 (0.212) | 24 P, 25 N, 187 V, 193 G | ||
| M7 | 0.171 | −3258.200 | Not allowed | |||
| M8 | 0.180 | −3250.429 | P > 0.0001 (0.0004) | 9 G, 10 S, 11 T, 24 P**, 25 N**, 26 L*, 58 K, 63 T*, 68 G, 99 L*, 128 I, 138 D**, 187 V**, 193 G**, 217 H, 230 S | ||
Model descriptions according to Yang et al. (2000) M0 (one ration); M3 (discrete); M7 (β); M8 (β plus ω), Wong et al. (2004), Yang et al. (2005) M1a (nearly neutral); M2a (positive selection).
LRTs of M3 vs. M0 are tests of heterogeneity of selection pressures among codon sites. M2a vs. M1a and M8 vs. M7 are tests of positive selection; all assess LRT statistics (2dlnL) against a chi-square distribution with the degrees of freedom equal to the difference in the number of parameters between the nested models being compared.
Amino acid (codon) sites with higher posterior probabilities of P > 95.0 (*) and P > 99.0 (**) undergoing positive selection are shown. Identification of positively selected amino acids is based on either the Naive Empirical Bayes (NEB) approach (under M3) or the Bayes Empirical Bayes (BEB) approach (with the M2a, M8, and branch-site model A).