| Literature DB >> 30219073 |
Angel Y S Chen1, Shizu Watanabe1, Raymond Yokomi2, James C K Ng3,4.
Abstract
BACKGROUND: The non-translated regions at the genome ends of RNA viruses serve diverse functions and can exhibit various levels of nucleotide (nt) heterogeneity. However, the extent of nt heterogeneity at the extreme termini of Citrus tristeza virus (CTV) genomes has not been comprehensively documented. This study aimed to characterize two widely prevalent CTV genotypes, T36-CA and T30-CA, from California that have not been sequenced or analyzed substantially. The information obtained will be used in our ongoing effort to construct the infectious complementary (c) DNA clones of these viruses.Entities:
Keywords: Citrus tristeza virus; Genome ends; Genotype; Heterogeneity; RACE
Mesh:
Substances:
Year: 2018 PMID: 30219073 PMCID: PMC6139129 DOI: 10.1186/s12985-018-1041-4
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Nucleotide variations at the extreme termini in the genome of CTV with the T36-CA genotype
aThe extreme 5′ end sequences of seven nt variants were identified by sequencing cloned DNA corresponding to the 5′ end of the T36-CA genome. In comparison with the T36 consensus genomic 5′ end nt sequences (black colored), the nt variants of T36-CA were found to contain extra/variant nt (s) (red colored nts). Group 1 nt variants (variants 1 to 3) (unshaded box) have consensus 5′ nt sequence “AATTTCTCAA” identical to that of T36 isolate FS674 (GenBank accession no. KC517485) and FS703 (GenBank accession no. KC517487). Group 2 nt variants (variants 4 to 7) (shaded box) have consensus 5′ nt sequence “AATTTCAAA” similar to that (“AATTTCACAA”) of the T36 sequences (GenBank accession nos. EU937521 and AY170468), except that it does not contain a “C” at position 8. The number of clones for each nt variant sequenced are as indicated
bThe 5′ end nts of the T36-CA genome was determined by sequencing the (+)-DNA generated by 5′ RLM-RACE using the (+)-strand of dsRNA as a genetic template for reverse transcription (RT) and PCR amplification
cThe 5′ end nts of the T36-CA genome was determined by 3′ RACE using the (−)-strand of dsRNA as a genetic template for reverse transcription (RT) and PCR amplification
dThe extreme 3′ end sequences of 4 nt variants were identified by sequencing cloned (+)-DNA (generated by 3′ RACE) corresponding to the 3′ end of the T36-CA genome. The T36 consensus genomic 3′ end nts (black colored) and the T36-CA nt variants (red colored) are as shown. The number of clones for each nt variant sequenced are as indicated
Nucleotide variations at the extreme termini in the genome of CTV with the T30-CA genotype
aThe extreme 5′ end sequences of three nt variants were identified by sequencing cloned DNA corresponding to the 5′ end of the T30-CA genome. The consensus (black colored) nts located at the 5′ ends of other T30 sequences (GenBank accession nos. KC517489, KC517490, KC517491 and AF260651) were identified in the population of T30-CA in addition to nt variants with an extra nt (“C”, in red). The number of clones for each nt variant sequenced are as indicated
bThe 5′ end nts of the T30-CA genome was determined by sequencing the (+)-DNA generated by 5′ RLM-RACE using the (+)-strand of dsRNA as a genetic template for reverse transcription (RT) and PCR amplification
cThe 5′ end nts of the T30-CA genome was determined by sequencing the (−)-DNA generated by 3′ RACE using the (−)-strand of dsRNA as a genetic template for reverse transcription (RT) and PCR amplification
dThe extreme 3′ end sequences of three nt variants were identified by sequencing cloned (+)-DNA (generated by 3′ RACE) corresponding to the 3′ end of the T30-CA genome. The conserved consensus nts and the variant nts are indicated in black and red, respectively. The number of clones for each nt variant sequenced are as indicated
Fig. 2Phylogenetic relationships among six major genotypes of CTV based on their full-length genome sequences. The phylogenetic tree was constructed using the maximum parsimony method and bootstrap values (expressed as percentages of 1000 replicates) are indicated next to the branches. The names of isolate sequences and their accession numbers are shown at the tips of the branches. The T36-CA and T30-CA sequences determined in this study are both marked by an asterisk. RB, T36, T30, VT, T3 and T68 are the names of the six major genotypes. The branch lengths are scaled in units representing the number of nt changes over the entire genome sequence analyzed
Fig. 1cDNA fragments corresponding to the genome of CTV with the T36-CA or the T30-CA genotype. a The genome organization of CTV. The numbered boxes (1a, 1b and 2 to 11) represent open reading frames (ORFs) and the proteins they encode (as indicated above or below the ORFs). RdRp, RNA-dependent RNA polymerase; CPm, minor coat protein; CP, major coat protein; and proteins named according to their predicted molecular weight, as indicated by the number (s) after the letter “p”. b To obtain the cDNA of the T36-CA genomic (g) RNA, the first strand cDNA (arrow with dotted line) was generated by reverse transcription (RT) using denatured T36-CA dsRNA and a reverse primer, CTV30-AC. The T36-CA 5′ fragment, corresponding to nucleotide (nt) position 1 to 8391, was PCR-amplified using primers CTV34-AC and CTV39-AC. The T36-CA 3′ fragment, corresponding to nt position 7800 to 19,292, was amplified using primers CTV38-AC and CTV35-AC. c The first strand cDNA of T30-CA (arrow with dotted line) was generated by the same method as described for T36-CA, using a reverse primer, CTV30-AC. Primers CTV32-AC and CTV37-AC were used to amplify the T30-CA 5′ fragment corresponding to nt position 1 to 8430, while primers CTV36-AC and CTV33-AC were used to amplify the T30-CA 3′ fragment (nt position 7584 to 19,259). Nucleotide sequences and other details of the primers (represented by solid arrows) used for generating the cDNA fragments are shown in Additional file 1: Table S1
T36-CA nucleotide (nt) and deduced amino acid (aa) identities compared with other CTV strains
| T36-CAa | T36 -FL | T30-CA | SY568 (CA) | RB-AT25 (CA) | VT-AT39 (CA) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Regionsb | Number of nts | Number of aa | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) |
| Full genome | 19,292 | 99.1 | 81.7 | 81.5 | 90.8 | 80.8 | ||||||
| 5′ UTR | 106 | 99.1 | 55.3 | 58.1 | 99.1 | 60.6 | ||||||
| p349 | 9369 | 3123 | 99.1 | 98.9 | 72.9 | 74.5 | 73.2 | 74.9 | 90.6 | 74.5 | 73.2 | 75.1 |
| RdRp | 1503 | 501 | 99.6 | 99.8 | 80.0 | 92.0 | 77.1 | 89.5 | 90.5 | 96.0 | 81.1 | 90.9 |
| p33 | 912 | 304 | 98.1 | 97.7 | 84.5 | 86.8 | 84.8 | 87.1 | 85.5 | 86.8 | 85.2 | 85.5 |
| p6 | 156 | 52 | 98.7 | 98.0 | 89.7 | 96.1 | 89.7 | 96.1 | 88.5 | 96.1 | 91.0 | 96.1 |
| p65 (HSP70h) | 1785 | 595 | 99.3 | 99.2 | 94.1 | 96.5 | 94.1 | 96.6 | 88.4 | 94.3 | 89.2 | 95.0 |
| p61 | 1608 | 536 | 99.0 | 98.1 | 95.4 | 96.6 | 96.7 | 96.8 | 94.2 | 95.9 | 88.6 | 91.8 |
| p27 (CPm) | 723 | 241 | 98.9 | 100.0 | 93.9 | 95.4 | 88.8 | 94.6 | 93.8 | 95.0 | 89.4 | 95.8 |
| p25 (CP) | 672 | 224 | 99.7 | 99.6 | 93.2 | 96.4 | 93.6 | 95.5 | 93.8 | 95.5 | 93.3 | 96.0 |
| p18 | 504 | 168 | 99.4 | 99.4 | 95.2 | 95.8 | 92.7 | 94.0 | 96.2 | 95.2 | 58.4 | 95.2 |
| p13 | 360 | 120 | 98.9 | 99.2 | 91.7 | 92.4 | 90.8 | 89.1 | 94.4 | 93.3 | 91.4 | 92.4 |
| p20 | 549 | 183 | 99.1 | 97.8 | 88.5 | 92.3 | 91.4 | 96.2 | 90.5 | 93.4 | 90.7 | 94.5 |
| p23 | 630 | 210 | 98.6 | 99.0 | 90.5 | 89.5 | 90.8 | 90.4 | 88.3 | 88.0 | 91.3 | 90.9 |
| 3′ UTR | 273 | 99.6 | 97.4 | 97.8 | 97.1 | 97.1 | ||||||
aA pairwise comparison was made between the genome sequence of T36-CA and that of the T36 genotype from Florida (T36-FL) or each of the sequences representing other CTV strains from California (CA), including T30-CA (obtained in this study), as indicated
bRegions in the CTV genome listed in the order of appearance from the 5′ to 3′ untranslated regions (UTR) s (as seen in Fig. 1)
T30-CA nucleotide (nt) and deduced amino acid (aa) identities compared with other CTV strains
| T30-CAa | T30-AT4 | T30-FL | SY568 (CA) | RB-AT25 (CA) | VT-AT39 (CA) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Regionsb | Number of nts | Number of aa | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) | nt identity (%) | aa identity (%) |
| Full genome | 19,259 | 99.4 | 99.4 | 93.1 | 82.4 | 90.3 | ||||||
| 5′ UTR | 108 | 99.1 | 97.2 | 73.1 | 55.8 | 75.7 | ||||||
| p349 | 9348 | 3116 | 99.4 | 99.2 | 99.5 | 99.4 | 90.8 | 92.0 | 72.8 | 99.2 | 90.9 | 92.3 |
| RdRp | 1434 | 478 | 99.7 | 99.6 | 99.4 | 99.4 | 91.6 | 96.4 | 84.0 | 94.8 | 91.3 | 97.0 |
| p33 | 912 | 304 | 99.0 | 98.4 | 99.3 | 99.0 | 99.7 | 99.7 | 94.0 | 93.1 | 85.9 | 87.1 |
| p6 | 156 | 52 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 96.8 | 100.0 | 93.6 | 96.1 |
| p65 (HSP70h) | 1785 | 595 | 99.3 | 98.8 | 99.6 | 99.0 | 99.8 | 99.5 | 88.2 | 94.1 | 88.7 | 94.6 |
| p61 | 1608 | 536 | 99.2 | 98.9 | 99.1 | 98.7 | 99.2 | 99.1 | 95.4 | 96.3 | 88.2 | 90.8 |
| p27 (CPm) | 723 | 241 | 99.2 | 99.2 | 99.6 | 99.2 | 92.7 | 97.9 | 97.7 | 97.5 | 89.5 | 96.3 |
| p25 (CP) | 672 | 224 | 99.6 | 99.6 | 99.7 | 99.6 | 93.5 | 96.4 | 93.0 | 96.4 | 93.2 | 96.4 |
| p18 | 504 | 168 | 99.4 | 99.4 | 99.2 | 98.2 | 93.1 | 95.8 | 94.8 | 94.6 | 58.0 | 97.0 |
| p13 | 360 | 120 | 99.7 | 99.2 | 99.7 | 100.0 | 90.8 | 90.8 | 91.4 | 89.9 | 91.4 | 95.0 |
| p20 | 549 | 183 | 99.8 | 100.0 | 99.5 | 100.0 | 93.3 | 95.1 | 93.3 | 95.6 | 91.6 | 93.4 |
| p23 | 630 | 210 | 99.1 | 98.6 | 98.4 | 96.7 | 90.0 | 91.4 | 93.2 | 94.3 | 91.1 | 92.3 |
| 3′ UTR | 274 | 99.6 | 99.3 | 97.8 | 97.8 | 98.5 | ||||||
aA pairwise comparison was made between the genome sequence of T30-CA and those with the T30 genotype – T30-FL (a T30 genotype originating from Florida) and T30-AT4 (the only other California-based T30 genotype besides T30-CA to be completely sequenced), or each of the sequences representing other CTV strains from California (CA) as indicated
bRegions in the CTV genome listed in the order of appearance from the 5′ to 3′ untranslated regions (UTR) s (as seen in Fig. 1)