| Literature DB >> 33785787 |
Jia-Ju Lu1, Er-Qi He1,2, Wen-Qing Bao2, Jian-Sheng Chen2, Sheng-Ren Sun2, San-Ji Gao3.
Abstract
Yellow leaf disease caused by sugarcane yellow leaf virus (SCYLV) is one of the most prevalent diseases worldwide. In this study, six near-complete genome sequences of SCYLV were determined to be 5775-5881 bp in length. Phylogenetic analysis revealed that the two SCYLV isolates from Réunion Island, France, and four from China were clustered into REU and CUB genotypes, respectively, based on 50 genomic sequences (this study = 6, GenBank = 44). Meanwhile, all 50 isolates were clustered into three phylogroups (G1-G3). Twelve significant recombinant events occurred in intra- and inter-phylogroups between geographical origins and host crops. Most recombinant hotspots were distributed in coat protein read-through protein (RTD), followed by ORF0 (P0) and ORF1 (P1). High genetic divergences of 12.4% for genomic sequences and 6.0-24.9% for individual genes were determined at nucleotide levels. The highest nucleotide diversity (π) was found in P0, followed by P1 and RdRP. In addition, purifying selection was a main factor restricting variability in SCYLV populations. Infrequent gene flow between Africa and the two subpopulations (Asia and America) were found, whereas frequent gene flow between Asia and America subpopulations was observed. Taken together, our findings facilitate understanding of genetic diversity and evolutionary dynamics of SCYLV.Entities:
Year: 2021 PMID: 33785787 PMCID: PMC8009895 DOI: 10.1038/s41598-021-86472-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Maximum Likelihood (ML) phylogenetic tree constructed using IQ-TREE 1.6.12 based on nearly complete genome sequences [open reading frames (ORFs) 0–5] from 50 sugarcane yellow leaf virus (SCYLV) isolates. ML analysis was carried out using nucleotide identity distances and 1000 bootstrap replicates (Bootstrap values > 60% are shown at nodes). The scale bar indicates the number of substitutions per nucleotide. Different geographic regions are represented with red, green, and purple lines for Asia (filled square), Africa (filled circle), and America (filled triangle), respectively. The six SCYLV isolates from sugarcane obtained in the present study are indicated by the solid pentagram (filled star). A potato leafroll virus (PLRV; NCBI accession no. NC_001747) isolate served as an outgroup (filled diamond). The acronyms of each SCYLV genotype based on their originally geographical origins: BRA (Brazil), HAW (Hawaii), PER (Peru), CHN (China), REU (Reunion), FLA (Florida), IND (India), and CUB (Cuba).
Recombination events among 50 sugarcane yellow leaf virus (SCYLV) isolates using RDP4 software.
| Putative recombinanta | Major × minor putative parents a | Recombination breakpointb | Detection methodc | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Begin | End | R | G | B | M | C | S | T | ||
| PI 157033 (G1) | IND2 (G3) × BAR-YL1 (G1) | 5001 | 5238 | ++ | ++ | ++ | ++ | ++ | − | ++ |
| Sorg1_1 (G3) | FL86 (G1) × FL84 (G3) | 624 | 728 | ++ | ++ | ++ | ++ | ++ | − | ++ |
| MU-AB193 (G2) | CHN-GD-ZJ4 (G1) × MU-AB197 (G2) | 2579 | 4506 | ++ | ++ | ++ | ++ | ++ | ++ | ++ |
| Sorg3_3 (G3) | SCYLV (G1) × ZJWL003 (G3) | 4503 | 5001 | ++ | ++ | ++ | ++ | ++ | ++ | ++ |
| MU-SC1233 (G2) | CHN-YN-KY2 (G1) × MU-AB193 (G2) | 4507* | 5214 | ++ | ++ | ++ | ++ | ++ | ++ | ++ |
| MU-AB193 (G2) | PI 157033 (G1) × REU-YL2 (G2) | 1093 | 1463 | ++ | ++ | ++ | ++ | ++ | ++ | ++ |
| MU-SC1233 (G2) | CHN-GD-ZJ15 (G1) × MU-AB197 (G2) | 2556 | 4506* | ++ | + | − | + | + | ++ | ++ |
| GZ-GZ18 (G1) | CHN-GD-WY19 (G1) × FL86 (G1) | 4437 | 5468 | ++ | ++ | ++ | ++ | ++ | ++ | + |
| Sorg2_2 (G3) | FL86 (G1) × IND2 (G3) | 4976 | 5178 | ++ | + | ++ | + | ++ | ++ | ++ |
| Sorg3_3 (G3) | SCYLV (G1) × FL180 (G3) | 387 | 521 | ++ | ++ | − | ++ | ++ | − | + |
| Sorg1_1 (G3) | PI 157033 (G1) × Sorg3_3 (G3) | 4257 | 4343* | ++ | + | + | − | + | − | + |
| IND2 (G3) | FL180 (G3) × CBLK97154 (G3) | 5026 | 5237 | ++ | ++ | ++ | + | + | + | + |
aPhylogroups are shown in brackets.
bActual breakpoint position undetermined (it was most likely overprinted by a subsequent one) shown as an asterisk (*).
cSeven algorithms implemented in RDP4 software were used, including RDP (R), GENECONV (G), BootScan (B), Maximum Chisquare (M), Chimaera (C), Sister Scan (S), and 3Seq (T). Symbols are showed for recombination signals at different statistical significance levels: ++, P ≤ 10–6; +, 10–6 < P ≤ 0.05; −, P > 0.05 (not significant).
Figure 2Maximum Likelihood phylogenetic trees (leaf panel) constructed by IQ-TREE v1.6.12 and pairwise identity matrixes (right panel) constructed by SDT v1.2 among SCYLV isolates. (a) Fifty nearly-complete genome sequences of SCYLV. (b)–(g) Six nucleotide sequences of ORF 0, ORF 1, ORF 1–2, ORF 3, ORF 4, and ORF 3–5 from 47 SCYLV isolates (Sorg1_1, Sorg2_2, and Sorg3_3 were excluded), respectively. The acronyms of each SCYLV genotype: BRA (Brazil), HAW (Hawaii), PER (Peru), CHN (China), REU (Reunion), FLA (Florida), IND (India), and CUB (Cuba).
The length, identity, nucleotide diversity, neutrality test, and selection pressure on each gene among the overall sugarcane yellow leaf virus (SCYLV) population and each phylogroup.
| ORF (protein) | Group | Length | Identity (%) | π | Tajima's D | dN/dS | ||
|---|---|---|---|---|---|---|---|---|
| aa | nt | aa | nt | |||||
| ORF0 (P0) | all (n = 47) | 255–256 | 765–768 | 88.61 | 91.72 | 0.10735 | 1.48728 (ns) | 0.4018 |
| G1 (n = 22) | 256 | 768 | 97.37 | 98.43 | 0.02458 | − 0.66815 (ns) | 0.3564 | |
| G2 (n = 10) | 256 | 768 | 98.87 | 99.25 | 0.01195 | − 0.04311 (ns) | 0.2966 | |
| G3 (n = 15) | 255–256 | 765–768 | 91.15 | 98.06 | 0.02837 | − 0.33086 (ns) | 0.1467 | |
| ORF1 (P1) | all (n = 47) | 633–650 | 1899–1950 | 91.26 | 92.96 | 0.09007 | 1.17081 (ns) | 0.2774 |
| G1 (n = 22) | 633–650 | 1899–1950 | 97.67 | 98.16 | 0.02388 | − 0.75623 (ns) | 0.2237 | |
| G2 (n = 10) | 650 | 1950 | 99.25 | 98.97 | 0.01589 | 0.63196 (ns) | 0.1048 | |
| G3 (n = 15) | 642–650 | 1926–1950 | 96.48 | 97.67 | 0.02627 | − 0.15383 (ns) | 0.1962 | |
| ORF1/2 (RdRP) | all (n = 47) | 1053–1070 | 3159–3210 | 91.45 | 94.17 | 0.07324 | 0.96401 (ns) | 0.3757 |
| G1 (n = 22) | 1053–1070 | 3159–3210 | 95.02 | 98.35 | 0.02163 | − 0.67551 (ns) | 0.3594 | |
| G2 (n = 10) | 1070 | 3210 | 97.64 | 98.69 | 0.01948 | 0.77878 (ns) | 0.0884 | |
| G3 (n = 15) | 1062–1070 | 3186–3210 | 95.02 | 98.14 | 0.02109 | − 0.29946 (ns) | 0.2980 | |
| ORF3 (CP) | all (n = 47) | 196 | 588 | 98.77 | 98.43 | 0.02242 | − 0.14750 (ns) | 0.2523 |
| G1 (n = 22) | 196 | 588 | 99.72 | 99.71 | 0.0054 | − 1.63275 (ns) | 0.2138 | |
| G2 (n = 10) | 196 | 588 | 99.85 | 99.63 | 0.00606 | − 0.34066 (ns) | 0.1165 | |
| G3 (n = 15) | 196 | 588 | 99.29 | 99.53 | 0.0078 | − 0.95978 (ns) | 0.1567 | |
| ORF4 (MP) | all (n = 47) | 150 | 450 | 98.85 | 99.03 | 0.01463 | − 0.65870 (ns) | 0.2244 |
| G1 (n = 22) | 150 | 450 | 99.61 | 99.74 | 0.00441 | − 1.71042 (ns) | 0.3124 | |
| G2 (n = 10) | 150 | 450 | 99.76 | 99.89 | 0.00209 | − 1.00300 (ns) | 1.3254 | |
| G3 (n = 15) | 150 | 450 | 99.11 | 99.58 | 0.00668 | − 0.85026 (ns) | 0.6926 | |
| ORF3/5 (RTD) | all (n = 47) | 675–676 | 2025–2028 | 94.29 | 94.30 | 0.07779 | 0.76678 (ns) | 0.2235 |
| G1 (n = 22) | 676 | 2028 | 98.05 | 98.17 | 0.02737 | − 0.48646 (ns) | 0.1782 | |
| G2 (n = 10) | 676 | 2028 | 99.20 | 98.95 | 0.0174 | − 0.31252 (ns) | 0.1381 | |
| G3 (n = 15) | 674–675 | 2022–2025 | 97.79 | 98.0 | 0.02983 | − 0.40820 (ns) | 0.1736 | |
aa, amino acid; nt, nucleotide; π, nucleotide diversity; ns, not significant (P > 0.05); nonsynonymous/synonymous ratio (dN/dS) < 1 (negative or purifying selection), dN/dS = 1 (neutral evolution), and dN/dS > 1 (positive selection).
Figure 3Schematic sketch of 12 Insertion/Deletion (InDel) events calculated by using amino acid sequences of 47 SCYLV isolates. (a) Position of InDel events in each ORF. (b) InDel events are indicated by the red pentagram (filled star), and the same InDel event occurring among SCYLV isolates is shown by the red dotted line. All isolate sequences (from ORF 0 to 5) are indicated by blue diagrams.
Gene flow and genetic differentiation among sugarcane yellow leaf virus (SCYLV) subpopulations based on geographic origins.
| Genomic region | Comparisons | Ks* ( | Z* ( | Snn ( | Fst | Nm |
|---|---|---|---|---|---|---|
| ORFs 0–5 | Asia (n = 23) versus Africa (n = 10) | 5.23017 (0.0000***) | 6.24901 (0.0000***) | 1.00000 (0.0000***) | 0.45497 | 0.60 |
| Asia (n = 23) versus America (n = 14) | 5.54500 (0.0030**) | 6.77899 (0.0000***) | 0.94595 (0.0000***) | 0.05951 | 7.90 | |
| Africa (n = 10) versus America (n = 14) | 5.03290 (0.0000***) | 5.48287 (0.0000***) | 1.00000 (0.0000***) | 0.40116 | 0.75 | |
| ORF0 (P0) | Asia (n = 23) versus Africa (n = 10) | 3.52869 (0.0000***) | 6.18396 (0.0000***) | 1.00000 (0.0000***) | 0.43323 | 0.65 |
| Asia (n = 23) versus America (n = 14) | 3.94367 (0.0130*) | 6.79418 (0.0000**) | 1.00000 (0.0000***) | 0.02270 | 21.52 | |
| Africa (n = 10) versus America (n = 14) | 3.07647 (0.0000***) | 5.48663 (0.0000***) | 1.00000 (0.0000***) | 0.34610 | 0.94 | |
| ORF1 (P1) | Asia (n = 23) versus Africa (n = 10) | 4.25633(0.0000***) | 6.18533 (0.0000***) | 1.00000 (0.0000***) | 0.47999 | 0.54 |
| Asia (n = 23) versus America (n = 14) | 4.57621 (0.0080**) | 6.79286 (0.0000***) | 1.00000 (0.0000***) | 0.05817 | 8.10 | |
| Africa (n = 10) versus America (n = 14) | 3.94750 (0.0000***) | 5.46695 (0.0000***) | 1.00000 (0.0000***) | 0.42363 | 0.68 | |
| ORF1-2 (RdRP) | Asia (n = 23) versus Africa (n = 10) | 4.55296 (0.0000***) | 5.96501 (0.0000***) | 1.00000 (0.0000***) | 0.42196 | 0.68 |
| Asia (n = 23) versus America (n = 14) | 4.83504 (0.0140*) | 6.56737 (0.0030**) | 0.93939 (0.0000***) | 0.01581 | 31.13 | |
| Africa (n = 10) versus America (n = 14) | 4.42390 (0.0000***) | 5.50327 (0.0000***) | 1.00000 (0.0000***) | 0.41083 | 0.72 | |
| ORF3 (CP) | Asia (n = 23) versus Africa (n = 10) | 1.99034 (0.0000***) | 6.42184 (0.0000***) | 0.94805 (0.0000***) | 0.38354 | 0.80 |
| Asia (n = 23) versus America (n = 14) | 2.22958 (0.0120*) | 6.83625 (0.0000***) | 0.85621 (0.0000***) | 0.06444 | 7.26 | |
| Africa (n = 10) versus America (n = 14) | 1.84867 (0.0000***) | 5.74703 (0.0000***) | 0.92024 (0.0000***) | 0.27405 | 1.32 | |
| ORF3-5 (RTD) | Asia (n = 23) versus Africa (n = 10) | 4.22064 (0.0000***) | 6.21377 (0.0000***) | 1.00000 (0.0000***) | 0.46086 | 0.58 |
| Asia (n = 23) versus America (n = 14) | 4.59517 (0.0000***) | 6.78249 (0.0000***) | 0.95045 (0.0000***) | 0.06138 | 7.65 | |
| Africa (n = 10) versus America (n = 14) | 4.06444 (0.0000***) | 5.48591 (0.0000***) | 1.00000 (0.0000***) | 0.40357 | 0.74 | |
| ORF4 (MP) | Asia (n = 23) versus Africa (n = 10) | 1.41525 (0.0000***) | 6.44432 (0.0000***) | 0.81567 (0.0000***) | 0.40385 | 0.74 |
| Asia (n = 23) versus America (n = 14) | 1.76195 (0.0130*) | 6.80554 (0.0060**) | 0.79410 (0.0000***) | 0.08389 | 5.46 | |
| Africa (n = 10) versus America (n = 14) | 1.21246 (0.0000***) | 5.86777 (0.0000***) | 0.80136 (0.0000***) | 0.20976 | 1.88 |
Symbols are showed for three permutation statistical tests of Ks*, Z*, and Snn at different statistical significance levels: *0.01 < P ≤ 0.05; **0.001 < P ≤ 0.01; ***P ≤ 0.001.