| Literature DB >> 34835036 |
Roberto Ramos-Sobrinho1, Raphael O Adegbola1, Kathy Lawrence2, Drew W Schrimsher2, Thomas Isakeit3, Olufemi J Alabi4, Judith K Brown1.
Abstract
Cotton leafroll dwarf virus (CLRDV) was first reported in the United States (US) in 2017 from cotton plants in Alabama (AL) and has become widespread in cotton-growing states of the southern US. To investigate the genomic variability among CLRDV isolates in the US, complete genomes of the virus were obtained from infected cotton plants displaying mild to severe symptoms from AL, Florida, and Texas. Eight CLRDV genomes were determined, ranging in size from 5865 to 5867 bp, and shared highest nucleotide identity with other CLRDV isolates in the US, at 95.9-98.7%. Open reading frame (ORF) 0, encoding the P0 silencing suppressor, was the most variable gene, sharing 88.5-99.6% and 81.2-89.3% amino acid similarity with CLRDV isolates reported in cotton growing states in the US and in Argentina and Brazil in South America, respectively. Based on Bayesian analysis, the complete CLRDV genomes from cotton in the US formed a monophyletic group comprising three relatively divergent sister clades, whereas CLRDV genotypes from South America clustered as closely related sister-groups, separate from US isolates, patterns reminiscent of phylogeographical structuring. The CLRDV isolates exhibited a complex pattern of recombination, with most breakpoints evident in ORFs 2 and 3, and ORF5. Despite extensive nucleotide diversity among all available CLRDV genomes, purifying selection (dN/dS < 1) was implicated as the primary selective force acting on viral protein evolution.Entities:
Keywords: Polerovirus; Solemoviridae; cotton viruses; genetic diversity; plant virus evolution
Mesh:
Substances:
Year: 2021 PMID: 34835036 PMCID: PMC8618375 DOI: 10.3390/v13112230
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Cotton leafroll dwarf virus (CLRDV) isolates recovered from symptomatic cotton plants using high throughput sequencing or Sanger sequencing.
| Isolate | Location | Symptoms | PCR-ORF0 | # of Reads | Coverage | Reference |
|---|---|---|---|---|---|---|
| CLRDV-USA-AL-MC2 1 | Alabama | Severe | Positive | 33,141 | 850 | This study |
| CLRDV-USA-AL-SC4 1 | Alabama | Mild | Positive | 16,964 | 435 | This study |
| CLRDV-USA-TX-CT2 | Texas | Severe | Positive | 13,043 | 336 | This study |
| CLRDV-USA-TX-CT3 1 | Texas | Severe | Positive | 147,639 | 3,780 | This study |
| CLRDV-USA-TXa | Texas | Severe | Positive | Sanger-derived | This study | |
| CLRDV-USA-TXb | Texas | Severe | Positive | Sanger-derived | This study | |
| CLRDV-USA-TXc | Texas | Severe | Positive | Sanger-derived | This study | |
| CLRDV-USA-TXd | Texas | Severe | Positive | Sanger-derived | This study | |
| CLRDV-USA-AL 1 | Alabama | Severe | Positive | - | - | Avelar et al., 2020 |
1 Representative CLRDV isolates for which viral 5′ and 3′ UTRs were determined.
Nucleotide and amino acid identities for the complete genome and open reading frame (ORF) sequences of cotton leafroll dwarf virus (CLRDV) isolates from the US and South America (SA).
| Genomes—Nucleotide/Amino Acid | ORF0—Nucleotide/Amino Acid | ||||
|---|---|---|---|---|---|
| CLRDV-SA | CLRDV-US | CLRDV-SA | CLRDV-US | ||
| CLRDV-SA | 95.9–99.5/- 1 | CLRDV-SA | 91.1–99.5/87.0–98.1 | ||
| CLRDV-US | 94.0–96.2/- | 94.5–100.0/- | CLRDV-US | 89.4–94.3/81.6–90.4 | 92.1–100.0/90.8–100.0 |
|
|
| ||||
| CLRDV-SA | CLRDV-US | CLRDV-SA | CLRDV-US | ||
| CLRDV-SA | 94.3–99.5/93.8–99.5 | CLRDV-SA | 95.5–99.4/93.3–99.1 | ||
| CLRDV-US | 92.4–95.5/90.3–95.0 | 93.8–100.0/91.6–100.0 | CLRDV-US | 92.9–96.2/87.8–94.7 | 92.5–100.0/87.5–100.0 |
|
|
| ||||
| CLRDV-SA | CLRDV-US | CLRDV-SA | CLRDV-US | ||
| CLRDV-SA | 97.9–100.0/98.5–100.0 | CLRDV-SA | 97.8–100.0/100.0 | ||
| CLRDV-US | 95.0–98.3/95.5–100.0 | 96.2–100.0/95.5–100.0 | CLRDV-US | 97.1–100.0/100.0 | 97.8–100.0/100.0 |
|
|
| ||||
| CLRDV-SA | CLRDV-US | CLRDV-SA | CLRDV-US | ||
| CLRDV-SA | 98.1–100.0/95.4–100.0 | CLRDV-SA | 96.7–99.7/96.8–100.0 | ||
| CLRDV-US | 94.9–98.9/89.7–97.1 | 96.0–100.0/92.5–100.0 | CLRDV-US | 94.8–96.8/94.1–98.3 | 96.5–100.0/95.7–100.0 |
1 No amino acid sequence comparisons were carried out for CLRDV complete genomes.
Figure 1Midpoint-rooted Bayesian phylogenetic trees reconstructed from complete genomes (a) and ORF0 nucleotide sequences (b) of cotton leafroll dwarf virus (CLRDV). Posterior probability values are represented by filled (0.95–1.00) and empty (0.80–0.94) circles near to the branch nodes. Sequences determined in this study are highlighted in bold font.3.4. Recombination.
Figure 2Reticulate phylogenetic network constructed using the Neighbor-Net method (a) and recombination events detected in RDP4 (b) based on analysis of aligned complete genomes of cotton leafroll dwarf virus (CLRDV). (a) Reticulation among the viral isolates is shown by parallel paths instead of a bifurcating evolutionary tree indicative of putative recombination. The branch internodes for the phylogenetic subclades are circled. (b) The regions highlighted in black correspond to the portion donated by the predicted minor parent, while the remaining portions represent the major parent of each independent recombination event. The CLRDV genome organization is shown in relation to the aligned sequences.
Genetic structure among cotton leafroll dwarf virus (CLRDV) subpopulations.
| Subpopulations | |||
|---|---|---|---|
| subclade 1-US | subclade 2-US | 0.52 | 0.51 |
| subclade 1-US | Subclade 3-US | 0.31 | 0.31 |
| subclade 1-US | US | 0.07 | 0.07 |
| subclade 1-US | typical-SA | 0.72 | 0.72 |
| subclade 1-US | atypical-SA | 0.67 | 0.67 |
| subclade 1-US | SA | 0.56 | 0.56 |
| subclade 2-US | subclade 3-US | 0.51 | 0.50 |
| subclade 2-US | US | 0.27 | 0.27 |
| subclade 2-US | typical-SA | 0.71 | 0.71 |
| subclade 2-US | atypical-SA | 0.66 | 0.66 |
| subclade 2-US | SA | 0.56 | 0.55 |
| subclade 3-US | US | 0.06 | 0.06 |
| subclade 3-US | typical-SA | 0.62 | 0.62 |
| subclade 3-US | atypical-SA | 0.56 | 0.55 |
| subclade 3-US | SA | 0.45 | 0.45 |
| US | typical-SA | 0.58 | 0.57 |
| US | atypical-SA | 0.53 | 0.52 |
| US | SA | 0.42 | 0.41 |
| typical-SA | atypical-SA | 0.60 | 0.60 |
| typical-SA | SA | 0.26 | 0.26 |
| atypical-SA | SA | − 0.07 | − 0.07 |
1 The F and N values range from 0 for no genetic differentiation, to 1 for complete differentiation between populations.
Figure 3The statistical significance of the differences amongst the mean pairwise number of nucleotide differences per site (π), as evaluated by 95% bootstrap confidence intervals (CIs) calculated using 1000 nonparametric simulations. No statistically significant differences observed for CI values including zero (grey line) for pairwise comparisons based on the complete genome of cotton leafroll dwarf virus (CLRDV) subpopulations.
Figure 4Nucleotide diversity across the complete genomes of cotton leafroll dwarf virus (CLRDV) isolates, calculated using DnaSP6 on a 100-nucleotide sliding window with a step size of 10 nucleotides. The horizontal lines (in grey) correspond to the genome-wide average nucleotide diversity for the full-length sequences within each subpopulation. The CLRDV genome organization is shown in relation to the alignment.
Nonsynonymous to synonymous substitution rates (dN/dS) and number of positively and negatively selected sites in the ORFs 0–5 of cotton leafroll dwarf virus (CLRDV) subpopulations using Single-Likelihood Ancestor Counting (SLAC) and Fixed-Effect Likelihood (FEL), respectively.
| Dataset | Gene | SLAC ( | FEL (# of Sites) | |
|---|---|---|---|---|
| Negative | Positive | |||
| CLRDV-subclade1-US | ORF0 | 0.52 | 2 | 0 |
| ORF1 | 0.36 | 3 | 0 | |
| ORF2 | 0.15 | 33 | 0 | |
| ORF3 | 0.16 | 3 | 0 | |
| ORF4 | 1.19 | 1 | 0 | |
| ORF5 | 0.19 | 30 | 0 | |
| CLRDV-subclade2-US | ORF0 | 0.36 | 8 | 0 |
| ORF1 | 0.28 | 27 | 0 | |
| ORF2 | 0.19 | 54 | 1 | |
| ORF3 | − 1 | − | − | |
| ORF4 | − | − | − | |
| ORF5 | 0.24 | 20 | 0 | |
| CLRDV-subclade3-US | ORF0 | 0.67 | 3 | 0 |
| ORF1 | 0.55 | 17 | 0 | |
| ORF2 | 0.47 | 54 | 3 | |
| ORF3 | 0.50 | 6 | 0 | |
| ORF4 | 0.98 | 1 | 1 | |
| ORF5 | 0.38 | 29 | 0 | |
| CLRDV-atypical-SA 2 | ORF0 | 1.63 | 1 | 0 |
| ORF1 | 0.33 | 13 | 0 | |
| ORF2 | 0.25 | 32 | 0 | |
| ORF3 | 0.16 | 5 | 1 | |
| ORF4 | 1.26 | 1 | 0 | |
| ORF5 | 0.15 | 16 | 1 | |
1 Analysis was not performed because the dataset was comprised by one only haplotype. 2 The selection analysis was not carried out for the CLRDV-typical-SA isolates because at least three different haplotypes are required.