| Literature DB >> 34055388 |
Gytis Dudas1, Greg Huber2, Michael Wilkinson2,3, David Yllanes2.
Abstract
Double synonyms in the genetic code can be used as a tool to test competing hypotheses regarding ambigrammatic narnavirus genomes. Applying the analysis to recent observations of Culex narnavirus 1 and Zhejiang mosquito virus 3 ambigrammatic viruses indicates that the open reading frame on the complementary strand of the segment coding for RNA-dependent RNA polymerase does not code for a functional protein. Culex narnavirus 1 has been shown to possess a second segment, also ambigrammatic, termed 'Robin'. We find a comparable segment for Zhejiang mosquito virus 3, a moderately diverged relative of Culex narnavirus 1. Our analysis of Robin polymorphisms suggests that its reverse open reading frame also does not code for a functional protein. We make a hypothesis about its role.Entities:
Keywords: ambigram; metagenome; mosquito; narnavirus
Year: 2021 PMID: 34055388 PMCID: PMC8155312 DOI: 10.1093/ve/veab038
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.(A) A maximum-likelihood tree illustrating the relationship between CxNV1 (Culex narnavirus 1) (red) and ZJMV3 (Zhejiang mosquito virus 3) (blue) and related narnavirus RdRp amino acid sequences. (B) ORF-wide dN/dS values for forward and reverse directions of RdRp and Robin segments for both viruses.
For each amino acid (AA) that can have double-synonym single-nucleotide mutations, we list all of the possible codons which do not code for stop on the complementary strand, indicating their reverse complement (Comp. AA).
| AA | Codon |
|
| Comp. AA |
|---|---|---|---|---|
| Leu | UUG* | 1 + 0 | 1 + 0 | Gln |
| CUU | 1 + 1 | 0 + 0 | Lys | |
| CUC | 1 + 1 | 0 + 0 | Glu | |
| CUG* | 1 + 2 | 1 + 0 | Gln | |
| Pro | CCU* | 1 + 2 | 0 + 1 | Arg |
| CCC | 1 + 2 | 0 + 0 | Gly | |
| CCA | 1 + 2 | 0 + 0 | Trp | |
| CCG* | 1 + 2 | 0 + 1 | Arg | |
| Gln | CAA* | 1 + 0 | 1 + 0 | Leu |
| CAG* | 1 + 0 | 1 + 0 | Leu | |
| Arg | CGU | 1 + 2 | 0 + 0 | Thr |
| CGC | 1 + 2 | 0 + 0 | Ala | |
| CGA* | 1 + 3 | 0 + 1 | Ser | |
| CGG* | 1 + 3 | 0 + 1 | Pro | |
| AGA* | 1 + 1 | 0 + 1 | Ser | |
| AGG* | 1 + 1 | 0 + 1 | Pro | |
| Ser | UCU* | 1 + 1 | 0 + 1 | Arg |
| UCC | 1 + 1 | 0 + 0 | Gly | |
| UCG* | 1 + 2 | 0 + 1 | Arg | |
| AGU | 1 + 0 | 0 + 0 | Thr | |
| AGC | 1 + 0 | 0 + 0 | Ala |
The codons that have a double synonym are marked with an asterisk. For each of these codons, we list the number of mutations which are synonymous, and the number of double synonym mutations. In each case the numbers of single (double) mutations are written (), where the superscript n denotes transitions, and superscript v transversions. Also, double synonyms are counted in the list of single synonyms.
Figure 2.(A) Number of reads (from top to bottom): in the positive sense (i.e. same as genomic) with respect to RdRp segment of Culex narnavirus 1, the ratio of positive- to negative-sense reads, and negative-sense reads. Numbers of positive-sense reads and ratio of positive- to negative-sense reads are log-scaled while number of negative reads is displayed in normal space. Ratios where no negative sense reads could be identified are highlighted with hatches and the ratio reflects positive-sense reads (i.e. by assuming there is one negative-sense read). (B) Same information as A, but showing read numbers for Robin segment of Culex narnavirus 1. The results are displayed for the forty-two different strains, with the samples ordered in descending order of the total number of reads for each segment.
Nucleotide-level statistics of mutations.
| Strand |
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| CxNV1-RdRp | 1033 | 46 | 606 | 362 | 0.0068 | 3.35 | (181, 140, 645) | (0.56:0.44:2.00) |
| ZJMV3-RdRp | 1075 | 12 | 210 | 39 | 0.0064 | 10.80 | (47, 29, 173) | (0.57:0.35:2.08) |
| CxNV1-Robin | 272 | 46 | 213 | 146 | 0.0096 | 2.92 | (107, 100, 152) | (0.89:0.84:1.27) |
| ZJMV3-Robin | 304 | 10 | 84 | 48 | 0.0145 | 3.50 | (35, 31, 66) | (0.80:0.70:1.50) |
The consensus sequence has N codons. Among the mutations observed in M polymorphs, there are transitions, transversions, with overall rate r and transition/transversion rate ratio α. The numbers total mutations at each base position is , and normalising these to ratios via Equation (6) yields .
Summary of results for codon-level mutations.
| Strand |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| CxNV1-RdRp-fwd | 623 | 189 | 123 | 0.303 | 2.37 | 0.128 | 0.12 |
| ZJMV3-RdRp-fwd | 170 | 59 | 13 | 0.347 | 2.14 | 0.162 | 0.012 |
| CxNV1-Robin-fwd | 112 | 141 | 89 | 1.26 | 2.34 | 0.538 | 0.45 |
| ZJMV3-Robin-fwd | 49 | 61 | 14 | 1.24 | 2.35 | 0.528 | 0.046 |
| CxNV1-RdRp-comp | 136 | 676 | 123 | 4.97 | 2.43 | 2.04 | 0.12 |
| ZJMV3-RdRp-comp | 50 | 179 | 13 | 3.58 | 2.14 | 1.67 | 0.012 |
| CxNV1-Robin-comp | 66 | 187 | 89 | 2.83 | 2.39 | 1.23 | 0.45 |
| ZJMV3-Robin-comp | 32 | 78 | 14 | 2.43 | 2.28 | 1.07 | 0.046 |
The numbers of single-nucleotide synonymous and non-synonymous mutations are and respectively, is the number of mutations with more than one base changed, is the null value of , and if the fraction of mutations that have multiple-nucleotide changes.
Figure 3.Distribution of synonymous (blue), non-synonymous (red) substitutions, and doubly synonymous sites (green) in CxNV1 (upper plots) and ZJMV3 (lower plots) RdRp (left) and Robin (right) segments in both directions (forward toward top, reverse toward bottom). Translated reverse ORFs are shown backwards (segment coordinate space). Double synonyms do not overlap perfectly because forward and reverse ORFs differ in length and begin and end at different positions along the segment.
Summary of results of the mutational hotspots test.
|
Sample |
|
| Gene |
|
|
|
|
| Double syns., CxNV1-RdRp | 0.954 | 0.161 | CxNV1-RdRp | 1033 | 220 | 0.986 | 1.044 |
| Other codons, CxNV1-RdRp | 0.968 | 0.155 | ZJMV3-RdRp | 1075 | 219 | 0.975 | 0.840 |
| Double syns., ZJMV3-RdRp | 1.20 | 0.042 | CxNV1-Robin | 272 | 54 | 1.19 | 1.16 |
| Other codons, ZJMV3-RdRp | 1.23 | 0.050 | ZJMV3-Robin | 304 | 81 | 0.926 | 0.978 |
| Double syns, CxNV1-Robin | 1.76 | 0.195 | |||||
| Other codons, CxNV1Robin | 1.48 | 0.169 | |||||
| Double syns, ZJMV3-Robin | 0.889 | 0.096 | |||||
| Other codons, ZJMV3-Robin | 0.960 | 0.097 |
Left panel: values of the average number of elements of the variant set, and of the average fraction of non-consensus codons, , for double-synonym sites, and for the other sites. Right panel: N is the number of loci in the alignment, is the number of double-synonym loci, and R, R are the ratios of and at double-synonym sites to their values at other sites. The differences of these ratios from unity do not appear significant.
Results for the mutational codon frequency test: N is the number of loci in the alignment, is the number of mutationally active double-synonym loci, and are, respectively, the numbers of single and double synonym mutations.
The actual ratio is compared with the null-hypothesis value R0, Equation (8).
| Sample |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| CxNV1-RdRp | 1033 | 136 | 151 | 60 | 2.51 | 3.02 | 0.83 |
| ZJMV3-RdRp | 1075 | 219 | 33 | 20 | 1.65 | 3.21 | 0.51 |
| CxNV1-Robin | 272 | 40 | 24 | 3 | 8.00 | 3.21 | 2.49 |
| ZJMV3-Robin | 304 | 59 | 20 | 4 | 4.00 | 4.04 | 0.99 |