| Literature DB >> 31323859 |
Boitumelo Motsoeneng1, Michael D Jukes2,3, Caroline M Knox1, Martin P Hill4, Sean D Moore4,5.
Abstract
The complete genome of an endemic South African Cydia pomonella granulovirus isolate was sequenced and analyzed. Several missing or truncated open reading frames (ORFs) were identified, including a 24 bp deletion in the pe38 gene which is reported to be associated with type I resistance-breaking potential. Comparison of single nucleotide polymorphisms (SNPs) with five other fully sequenced CpGV isolates identified 67 unique events, 47 of which occurred within ORFs, leading to several amino acid changes. Further analysis of single nucleotide variations (SNVs) within CpGV-SA revealed that this isolate consists of mixed genotypes. Phylogenetic analysis using complete genome sequences placed CpGV-SA basal to M, I12 and E2 and distal to S and I07 but with no distinct classification into any of the previously defined CpGV genogroups. These results suggest that CpGV-SA is a novel and genetically distinct isolate with significant potential as a biopesticide for management of codling moth (CM), not only in South Africa, but potentially in other pome fruit producing countries, particularly where CM resistance to CpGV has been reported.Entities:
Keywords: Cydia pomonella granulovirus; codling moth; single nucleotide polymorphism (SNP); single nucleotide variation (SNV)
Mesh:
Year: 2019 PMID: 31323859 PMCID: PMC6669624 DOI: 10.3390/v11070658
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Alignment of 67 novel SNPs identified between CpGV-SA and CpGV-E2, -I07, -I12, -M, and –S, grouped as CpGV. Positions for each nucleotide are given relative to the CpGV-M and CpGV-SA sequences. The corresponding ORF numbers are provided above the alignment for SNPs occurring within coding regions while any resulting amino acid substitutions are shown below. Nucleotides: A = Adenine, C = Cytosine, G = Guanine, T = Thymine; Ambiguous nucleotides: Y = C or T and M = A or C and amino acids: A = Alanine, R = Arginine, N = Asparagine, Q = Glutamine, H = Histidine, I = Isoleucine, K = Lysine, P = Proline, S = Serine, T = Threonine, and V = Valine.
| CpGV-M Position | 1355 | 4732 | 6826 | 6909 | 7448 | 8659 | 12978 | 13476 | 13879 | 14318 | 15829 | 17138 | 17511 | 20146 | 20293 | 20314 | 20442 | 24975 | 27108 | 30837 | 34313 | 34485 | 38331 | 41720 | 43772 | 43925 | 51043 | 51049 | 51177 | 51398 | 51413 | 51483 | 51534 | |
| CpGV-SA Position | 1355 | 4785 | 6879 | 6962 | 7501 | 8712 | 13048 | 13546 | 13949 | 14389 | 15891 | 17199 | 17572 | 20185 | 20316 | 20337 | 20439 | 24946 | 27091 | 30847 | 34368 | 34540 | 38386 | 41775 | 43827 | 43980 | 51160 | 51166 | 51279 | 51308 | 51323 | 51393 | 51445 | |
| ORF | 3 | 7 | 10 | 11 | 17 | 18 | 20 | 22 | 30 | 41 | 46 | 50 | 61 | |||||||||||||||||||||
| CpGV-SA | A | T | A | T | A | A | T | A | G | T | T | C | A | A | T | A | A | T | A | A | A | A | A | T | T | A | T | C | T | T | T | T | C | |
| CpGV | G | A | G | C | G | G | C | C | A | C | C | T | G | T | C | G | T | C | G | G | C | G | T | C | C | C | A | G | G | C | A | A | T | |
| Substitution SA -> Ref | K -> N | I -> T | S -> P | T -> A | T -> A | V -> A | I -> N | Q -> H | N -> I | R -> T | ||||||||||||||||||||||||
| CpGV-M Position | 51537 | 58836 | 62466 | 64748 | 66714 | 67897 | 68732 | 70388 | 72378 | 72492 | 76189 | 77867 | 78942 | 84035 | 90782 | 93156 | 93414 | 94559 | 97937 | 98947 | 105867 | 106113 | 109603 | 110904 | 111093 | 111270 | 111535 | 115974 | 116322 | 116527 | 117053 | 120008 | 120642 | 122324 |
| CpGV-SA Position | 51448 | 58937 | 62567 | 64824 | 66790 | 67973 | 68808 | 70464 | 72454 | 72568 | 76265 | 77943 | 79018 | 84111 | 90852 | 93226 | 93484 | 94626 | 98024 | 99031 | 105933 | 106179 | 109669 | 110970 | 111158 | 111335 | 111600 | 116038 | 116386 | 116591 | 117117 | 120072 | 120706 | 122389 |
| ORF | 73 | 77 | 82 | 84 | 85 | 87 | 89 | 90 | 93 | 95 | 96 | 101 | 111 | 112 | 115 | 124 | 128 | 132 | 133 | 135 | 140 | 141 | ||||||||||||
| CpGV-SA | C | T | T | T | T | T | A | A | C | T | A | A | A | A | C | G | G | A | C | A | G | A | T | T | T | A | A | T | C | A | C | A | A | C |
| CpGV | T | C | G | C | C | C a | C | G | T b | C | G | G | G | G | T | C | A | T | G c | G | A | G | A | C | C | G | G | C | T | G | T | C d | C | T |
| Substitution SA -> Ref | E -> D | I -> R | N -> S | I -> T | Y -> F | T -> A | E -> V | K -> E | L -> S | V -> A | A -> T | I -> L | K -> T | A -> T | ||||||||||||||||||||
| a CpGV-E2 = Y; b CpGV = Y; c CpGV-E2 = A; d CpGV-E2 = M | ||||||||||||||||||||||||||||||||||
Alignment of 66 genotypic SNVs identified within the CpGV-SA complete genome sequence. Variants which overlap with SNPs identified between the CpGV isolates are in light grey while those which overlap with SNPs unique to CpGV-SA (Table 1) are in dark grey. The corresponding ORF numbers are provided above the alignment for SNVs occurring within coding regions while any resulting amino acid substitutions are shown below. Nucleotides: A = Adenine, C = Cytosine, G = Guanine, T = Thymine and amino acids: A = Alanine, R = Arginine, N = Asparagine, D = Aspartic acid, E = Glutamic acid, G = Glycine, H = Histidine, I = Isoleucine, K = Lysine, M = Methionine, T = Threonine, Y = Tyrosine, and V = Valine.
| CpGV-SA Position | 6530 | 6823 | 6879 | 6962 | 7542 | 8502 | 8646 | 9003 | 9242 | 11275 | 11503 | 15891 | 17199 | 17572 | 20415 | 20439 | 20451 | 20471 | 27091 | 32302 | 34540 | 38386 | 40007 | 41403 | 43827 | 47598 | 47601 | 49694 | 51160 | 51166 | 51242 | 51259 | 51265 |
| ORFs | 10 | 11 | 12 | 15 | 20 | 22 | 37 | 41 | 46 | 48 | 50/51 | 57 | 60 | 61 | |||||||||||||||||||
| 52b | 58 | ||||||||||||||||||||||||||||||||
| CpGV-SA | T | A | A | T | G | C | G | A | G | C | T | T | C | A | A | A | A | C | A | G | A | A | A | G | T | C | C | G | T | C | G | T | A |
| Variant | C | G | G | C | A | T | A | T | A | T | A | C | T | G | T | T | T | A | G | T | G | T | G | A | C | T | T | A | A | G | T | C | C |
| Variant Coverage | 906 | 1452 | 1300 | 1210 | 642 | 1800 | 1172 | 1282 | 1338 | 584 | 474 | 1634 | 2158 | 794 | 750 | 2215 | 2201 | 1915 | 492 | 532 | 738 | 614 | 817 | 1544 | 840 | 1638 | 1388 | 840 | 2964 | 2782 | 1088 | 1426 | 1234 |
| Amino Acid Change | I -> T | A -> V | A -> T | A -> T | N -> Y | T -> A | T -> A | I -> N | D -> G | A -> T | A -> T | N -> I | R -> T | ||||||||||||||||||||
| CpGV-SA Position | 51273 | 51279 | 51284 | 51287 | 51289 | 51290* | 51290* | 51300 | 51301 | 51308 | 51323 | 51393 | 51445 | 51448 | 51605 | 62567 | 64981 | 66790 | 66971 | 70464 | 79018 | 82048 | 90574 | 90852 | 98024 | 99031 | 100326 | 111158 | 114455 | 117293 | 117676 | 119691 | 122389 |
| ORFs | 77 | 82b | 84 | 89 | 96 | 99 | 110 | 115 | 116 | 117 | 128 | 131 | 135 | 139 | 141 | ||||||||||||||||||
| CpGV-SA | C | T | A | T | C |
|
| G | T | T | T | T | C | C | T | T | T | T | C | A | A | G | G | C | C | A | A | T | T | T | C | G | C |
| Variant | T | G | T | G | A |
|
| C | A | C | A | A | T | T | A | G | C | C | T | G | G | A | A | T | G | G | G | C | C | C | T | A | T |
| Variant Coverage | 1234 | 1266 | 1240 | 2136 | 3170 | 970 | 2420 | 1406 | 1390 | 3448 | 2756 | 3328 | 3422 | 3394 | 1392 | 1628 | 1742 | 880 | 462 | 1272 | 828 | 612 | 1724 | 1992 | 1630 | 1396 | 1224 | 904 | 544 | 1250 | 466 | 1492 | 1730 |
| Amino Acid Change | E -> D | A -> V | I -> M | T -> A | K -> R | H -> R | D -> G | A -> V | A -> T | ||||||||||||||||||||||||
Figure 1CpGV phylogeny based on complete genome sequences for isolates from groups A-E and CpGV-SA. The number of unique SNPs present in each isolate is shown above the branches in bold. The number of SNPs per grouping is shown at each node alongside the bootstrap values. CpGV groups are indicated to right of the respective isolate.