| Literature DB >> 35208820 |
Iván Martín-Hernández1, Israel Pagán1,2.
Abstract
In RNA viruses, which have high mutation-and fast evolutionary- rates, gene overlapping (i.e., genomic regions that encode more than one protein) is a major factor controlling mutational load and therefore the virus evolvability. Although DNA viruses use host high-fidelity polymerases for their replication, and therefore should have lower mutation rates, it has been shown that some of them have evolutionary rates comparable to those of RNA viruses. Notably, these viruses have large proportions of their genes with at least one overlapping instance. Hence, gene overlapping could be a modulator of virus evolution beyond the RNA world. To test this hypothesis, we use the genus Begomovirus of plant viruses as a model. Through comparative genomic approaches, we show that terminal gene overlapping decreases the rate of virus evolution, which is associated with lower frequency of both synonymous and nonsynonymous mutations. In contrast, terminal overlapping has little effect on the pace of virus evolution. Overall, our analyses support a role for gene overlapping in the evolution of begomoviruses and provide novel information on the factors that shape their genetic diversity.Entities:
Keywords: begomoviruses; overlapping genes; rate of evolution; ssDNA viruses
Year: 2022 PMID: 35208820 PMCID: PMC8875319 DOI: 10.3390/microorganisms10020366
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1Genome organization of mono- and bipartite begomoviruses. Colored arrows denote the position and orientation of each gene. Monopartite and DNA-A of bipartite begomoviruses encode for: AC1: Replication initiator protein (Rep); AC2: Transcriptional activator protein (TrAP); AC3: Replication enhancer protein (Ren); AC4: Silencing suppressor; AV1: Coat protein (CP); and AV2: Various functions. In bipartite begomoviruses AV2 is only present in Old World (OW) begomoviruses, where AV1 is as long as in monopartite begomoviruses (dashed). DNA-B of bipartite begomoviruses encodes for: BC1: Movement protein (MP) and BC2: Nuclear shuttle protein (NSP). CR, common region. The hairpin which includes the origin of replication (ORI) is indicated in the Long Intergenic Region (LIR) (modified from [17]).
Figure 2Overlapping and nonoverlapping tree lengths (t) in overlapping genes. Blue dots denote genes in which t is significantly higher in NOV than in OV regions. Red dots denote genes showing the opposite trend. Genes with different types of overlapping (internal, 5′-, and 3′-terminal) are presented in different panels.
Figure 3Overlapping and nonoverlapping d (upper line), d (middle line) and d (lower line) in overlapping genes. Blue dots denote instances in which parameters are significantly higher in NOV than in OV regions. Red dots denote genes showing the opposite trend. Genes with different types of overlapping (internal, 5′-, and 3′-terminal) are presented in different panels.
Figure 4Correlation between tree length (t/number of sequences) and the proportion of gene overlap (length of overlap/total gene length) for all types of overlap considered together (upper left), internal overlapping (upper right), 5′-terminal overlapping (lower left) and 3′-terminal overlapping (lower right).
Statistical parameters of traits associated with the rate of evolutionary change in overlapping genes.
| Species 1 | N 2 | Overlapping 3 | Gene Length 4 |
|
| |||
|---|---|---|---|---|---|---|---|---|
| ORF | Length | NOV | OV | NOV | OV | |||
| Internal (11/17) | ||||||||
|
| 27 (4%) | AC1-AC4 | 423 | 1077 | 0.53 | 0.41 | 0.39 | 0.11 |
|
| 11 (8%) | AC1-AC4 | 291 | 1086 | 0.47 | 0.64 | 0.54 | 0.33 |
|
| 121 (0%) | AC1-AC4 | 258 | 1086 | 0.44 | 0.86 | 0.34 | 0.06 |
|
| 39 (98%) | AC1-AC4 | 294 | 1092 | 1.25 | 0.85 | 0.48 | 0.09 |
|
| 32 (99%) | AC1-AC4 | 303 | 1092 | 1.90 | 1.35 | 0.11 | 0.08 |
|
| 16 (28%) | AC1-AC4 | 300 | 1086 | 0.86 | 0.83 | 0.22 | 0.20 |
|
| 21 (9%) | AC1-AC4 | 294 | 1089 | 0.22 | 0.89 | 0.37 | 0.02 |
|
| 50 (2%) | AC1-AC4 | 303 | 1092 | 0.67 | 0.21 | 0.07 | 0.02 |
|
| 50 (0%) | AC1-AC4 | 297 | 1065 | 0.27 | 0.31 | 0.05 | 0.07 |
|
| 13 (0%) | AC1-AC4 | 234 | 1080 | 0.07 | 0.06 | 0.20 | 0.19 |
|
| 153 (2%) | AC1-AC4 | 234 | 1080 | 1.55 | 1.15 | 0.41 | 0.36 |
|
| 14 (29%) | AC1-AC4 | 258 | 1080 | 0.35 | 0.09 | 0.08 | 0.03 |
|
| 60 (92%) | AC1-AC4 | 308 | 1089 | 2.96 | 1.48 | 0.29 | 0.24 |
|
| 125 (0%) | AC1-AC4 | 297 | 1080 | 0.55 | 0.79 | 0.09 | 0.14 |
|
| 17 (41%) | AC1-AC4 | 258 | 1095 | 0.79 | 0.99 | 0.12 | 0.28 |
|
| 97 (3%) | AC1-AC4 | 303 | 1086 | 3.75 | 3.34 | 0.43 | 0.39 |
|
| 397 (68%) | AC1-AC4 | 294 | 1074 | 2.43 | 1.60 | 0.70 | 0.05 |
| 5′-terminal (42/54) | ||||||||
|
| 32 (3%) | AC1-AC2 | 93 | 408 | 0.21 | 0.37 | 0.11 | 0.20 |
| 31 (3%) | AC2-AC3 | 260 | 405 | 0.31 | 0.03 | 0.50 | 0.03 | |
| 26 (0%) | AV1-AV2 | 193 | 777 | 0.39 | 0.26 | 0.37 | 0.15 | |
|
| 11 (18%) | AC1-AC2 | 98 | 406 | 0.22 | 0.01 | 0.22 | 0.02 |
| 13 (0%) | AC2-AC3 | 260 | 405 | 0.71 | 0.27 | 0.11 | 0.03 | |
| 10 (60%) | AV1-AV2 | 189 | 771 | 0.33 | 0.76 | 1.15 | 0.21 | |
|
| 158 (0%) | AC1-AC2 | 89 | 390 | 1.30 | 0.29 | 0.40 | 0.13 |
| 158 (0%) | AC2-AC3 | 254 | 399 | 1.50 | 0.25 | 0.26 | 0.05 | |
|
| 51 (71%) | AC1-AC2 | 104 | 453 | 1.91 | 1.68 | 0.33 | 0.29 |
| 51 (20%) | AC2-AC3 | 308 | 405 | 3.44 | 0.74 | 0.28 | 0.14 | |
| 50 (40%) | AV1-AV2 | 206 | 771 | 1.76 | 0.85 | 0.63 | 0.14 | |
|
| 57 (54%) | AC1-AC2 | 104 | 453 | 2.42 | 2.82 | 1.01 | 0.58 |
| 57 (2%) | AC2-AC3 | 308 | 405 | 1.80 | 0.56 | 1.18 | 0.56 | |
| 56 (16%) | AV1-AV2 | 206 | 771 | 1.63 | 0.61 | 0.38 | 0.21 | |
|
| 18 (33%) | AC1-AC2 | 98 | 405 | 0.99 | 0.30 | 0.17 | 0.02 |
| 23 (17%) | AC2-AC3 | 260 | 405 | 3.08 | 0.79 | 0.47 | 0.25 | |
| 22 (18%) | AV1-AV2 | 197 | 771 | 1.22 | 1.33 | 0.18 | 0.24 | |
|
| 32 (6%) | AC1-AC2 | 101 | 405 | 0.01 | 0.33 | 0.02 | 0.52 |
| 32 (0%) | AC2-AC3 | 257 | 402 | 0.47 | 0.04 | 0.42 | 0.07 | |
| 31 (0%) | AV1-AV2 | 209 | 777 | 0.35 | 0.37 | 0.16 | 0.24 | |
|
| 58 (2%) | AC1-AC2 | 104 | 453 | 0.59 | 0.65 | 0.12 | 0.14 |
| 59 (2%) | AC2-AC3 | 308 | 405 | 1.08 | 0.80 | 1.11 | 0.78 | |
| 59 (15%) | AV1-AV2 | 206 | 771 | 1.04 | 0.75 | 0.44 | 0.37 | |
|
| 71 (0%) | AC1-AC2 | 77 | 408 | 0.40 | 0.29 | 0.25 | 0.08 |
| 71 (0%) | AC2-AC3 | 260 | 405 | 0.34 | 0.22 | 0.18 | 0.06 | |
| 64 (0%) | AV1-AV2 | 197 | 774 | 0.23 | 0.18 | 0.55 | 0.10 | |
|
| 9 (0%) | AC1-AC2 | 92 | 408 | 0.00 | 0.00 | - | - |
| 10 (0%) | AC2-AC3 | 260 | 405 | 0.06 | 0.03 | 0.57 | 0.14 | |
| 12 (0%) | AV1-AV2 | 191 | 777 | 0.05 | 0.08 | 0.17 | 0.22 | |
|
| 166 (2%) | AC1-AC2 | 92 | 405 | 1.79 | 0.97 | 2.47 | 0.27 |
| 162 (0%) | AC2-AC3 | 260 | 405 | 1.38 | 0.60 | 0.67 | 0.05 | |
| 105 (0%) | AV1-AV2 | 197 | 775 | 0.79 | 0.31 | 0.58 | 0.09 | |
|
| 15 (27%) | AC1-AC2 | 92 | 408 | 0.23 | 0.01 | 0.50 | 0.15 |
| 15 (0%) | AC2-AC3 | 260 | 405 | 0.11 | 0.10 | 1.76 | 0.42 | |
| 15 (0%) | AV1-AV2 | 197 | 775 | 0.20 | 0.07 | 0.63 | 0.27 | |
|
| 67 (97%) | AC1-AC2 | 104 | 453 | 3.45 | 3.18 | 0.51 | 0.23 |
| 68 (32%) | AC2-AC3 | 308 | 405 | 1.04 | 0.63 | 1.44 | 0.17 | |
| 41 (58%) | AV1-AV2 | 188 | 771 | 0.64 | 1.26 | 0.24 | 0.62 | |
|
| 54 (57%) | AC1-AC2 | 59 | 390 | 1.49 | 0.66 | 0.47 | 0.12 |
| 54 (0%) | AC2-AC3 | 254 | 399 | 0.94 | 0.89 | 0.34 | 0.94 | |
|
| 19 (5%) | AC1-AC2 | 80 | 417 | 1.01 | 0.01 | 0.38 | 0.28 |
| 45 (0%) | AC2-AC3 | 254 | 399 | 0.74 | 0.42 | 1.99 | 0.32 | |
|
| 131 (0%) | AC1-AC2 | 92 | 408 | 0.56 | 0.77 | 0.47 | 1.52 |
| 130 (0%) | AC2-AC3 | 260 | 405 | 0.67 | 0.48 | 0.40 | 0.64 | |
| 126 (4%) | AV1-AV2 | 191 | 777 | 0.73 | 0.64 | 1.17 | 0.65 | |
|
| 18 (39%) | AC1-AC2 | 92 | 452 | 0.54 | 1.68 | 0.39 | 0.33 |
| 14 (7%) | AC2-AC3 | 278 | 435 | 2.96 | 0.73 | 1.78 | 0.25 | |
| 18 (9%) | AV1-AV2 | 176 | 765 | 0.52 | 0.17 | 0.87 | 0.10 | |
|
| 88 (4%) | AC1-AC2 | 98 | 420 | 2.83 | 1.47 | 0.71 | 0.62 |
| 113 (1%) | AC2-AC3 | 281 | 411 | 5.99 | 1.56 | 0.57 | 0.46 | |
| 115 (0%) | AV1-AV2 | 179 | 771 | 2.90 | 2.12 | 1.16 | 0.77 | |
|
| 588 (9%) | AC1-AC2 | 92 | 408 | 7.68 | 5.22 | 0.96 | 0.43 |
| 521 (10%) | AC2-AC3 | 260 | 405 | 6.37 | 3.08 | 0.95 | 0.64 | |
| 593 (10%) | AV1-AV2 | 191 | 777 | 3.47 | 3.16 | 0.89 | 0.42 | |
| 3′-terminal (39/54) | ||||||||
|
| 27 (3%) | AC1-AC2 | 93 | 1077 | 0.53 | 0.36 | 0.50 | 0.29 |
| 32 (3%) | AC2-AC3 | 260 | 408 | 0.21 | 0.03 | 0.44 | 0.20 | |
| 33 (0%) | AV1-AV2 | 193 | 342 | 0.54 | 0.24 | 0.47 | 0.18 | |
|
| 11 (18%) | AC1-AC2 | 98 | 1086 | 0.47 | 0.01 | 0.48 | 0.02 |
| 11 (0%) | AC2-AC3 | 260 | 406 | 0.22 | 0.27 | 0.45 | 0.57 | |
| 12 (60%) | AV1-AV2 | 189 | 348 | 1.21 | 0.76 | 0.57 | 0.13 | |
|
| 121 (0%) | AC1-AC2 | 89 | 1086 | 0.44 | 0.10 | 0.56 | 0.34 |
| 158 (0%) | AC2-AC3 | 254 | 390 | 0.30 | 0.28 | 0.65 | 0.61 | |
|
| 39 (92%) | AC1-AC2 | 104 | 1092 | 1.25 | 0.91 | 0.58 | 0.38 |
| 51 (20%) | AC2-AC3 | 308 | 453 | 1.91 | 0.91 | 0.27 | 0.03 | |
| 50 (40%) | AV1-AV2 | 206 | 366 | 1.34 | 0.46 | 0.13 | 0.10 | |
|
| 23 (98%) | AC1-AC2 | 104 | 1092 | 1.90 | 0.88 | 0.16 | 0.70 |
| 57 (2%) | AC2-AC3 | 308 | 453 | 2.42 | 0.57 | 0.21 | 0.51 | |
| 55 (16%) | AV1-AV2 | 206 | 366 | 0.89 | 0.32 | 0.78 | 0.54 | |
|
| 16 (38%) | AC1-AC2 | 98 | 1086 | 0.86 | 0.10 | 0.30 | 0.12 |
| 18 (17%) | AC2-AC3 | 260 | 405 | 0.99 | 0.50 | 0.66 | 0.57 | |
| 16 (19%) | AV1-AV2 | 197 | 357 | 0.26 | 0.45 | 1.76 | 2.67 | |
|
| 21 (9%) | AC1-AC2 | 101 | 1089 | 0.22 | 0.33 | 0.07 | 0.33 |
| 32 (0%) | AC2-AC3 | 257 | 405 | 0.01 | 0.03 | 0.24 | 0.47 | |
| 29 (0%) | AV1-AV2 | 209 | 369 | 0.08 | 0.50 | 0.07 | 0.63 | |
|
| 50 (2%) | AC1-AC2 | 104 | 1092 | 0.67 | 0.38 | 0.27 | 0.03 |
| 58 (2%) | AC2-AC3 | 308 | 453 | 0.59 | 0.75 | 0.11 | 0.14 | |
| 57 (16%) | AV1-AV2 | 206 | 366 | 0.45 | 0.46 | 0.24 | 0.26 | |
|
| 50 (0%) | AC1-AC2 | 77 | 1065 | 0.27 | 0.01 | 0.38 | 0.25 |
| 71 (0%) | AC2-AC3 | 260 | 408 | 0.40 | 0.25 | 0.25 | 0.61 | |
| 56 (0%) | AV1-AV2 | 197 | 357 | 0.16 | 0.18 | 0.42 | 0.45 | |
|
| 13 (0%) | AC1-AC2 | 92 | 1080 | 0.07 | 0.00 | 0.16 | 0.00 |
| 9 (0%) | AC2-AC3 | 260 | 408 | 0.00 | 0.00 | - | - | |
| 12 (0%) | AV1-AV2 | 191 | 351 | 0.06 | 0.08 | 0.34 | 0.39 | |
|
| 153 (3%) | AC1-AC2 | 92 | 1080 | 1.55 | 0.62 | 0.43 | 0.25 |
| 166 (0%) | AC2-AC3 | 260 | 405 | 1.79 | 0.62 | 0.43 | 0.35 | |
| 136 (0%) | AV1-AV2 | 197 | 357 | 0.75 | 0.32 | 0.36 | 0.22 | |
|
| 14 (29%) | AC1-AC2 | 92 | 1080 | 0.35 | 0.01 | 0.32 | 0.17 |
| 15 (0%) | AC2-AC3 | 260 | 408 | 0.23 | 0.10 | 0.77 | 0.28 | |
| 13 (0%) | AV1-AV2 | 197 | 357 | 0.15 | 0.07 | 0.65 | 0.47 | |
|
| 60 (92%) | AC1-AC2 | 104 | 1089 | 2.96 | 4.66 | 0.05 | 0.25 |
| 67 (33%) | AC2-AC3 | 308 | 453 | 3.45 | 0.68 | 0.32 | 0.04 | |
| 45 (53%) | AV1-AV2 | 188 | 348 | 0.38 | 0.75 | 0.21 | 0.50 | |
|
| 54 (57%) | AC1-AC2 | 59 | 1050 | 1.47 | 0.68 | 0.25 | 0.01 |
| 54 (0%) | AC2-AC3 | 254 | 390 | 1.49 | 0.92 | 0.44 | 0.29 | |
|
| 44 (2%) | AC1-AC2 | 80 | 1050 | 0.81 | 0.12 | 0.26 | 0.11 |
| 19 (0%) | AC2-AC3 | 254 | 417 | 1.01 | 0.21 | 1.40 | 1.03 | |
|
| 125 (0%) | AC1-AC2 | 92 | 1080 | 0.55 | 0.60 | 1.04 | 1.47 |
| 131 (0%) | AC2-AC3 | 260 | 408 | 0.56 | 0.42 | 0.70 | 0.43 | |
| 128 (4%) | AV1-AV2 | 191 | 351 | 1.16 | 0.99 | 0.24 | 0.08 | |
|
| 17 (41%) | AC1-AC2 | 92 | 1095 | 0.79 | 1.61 | 0.57 | 1.50 |
| 18 (6%) | AC2-AC3 | 278 | 452 | 0.54 | 1.21 | 0.70 | 1.16 | |
| 18 (9%) | AV1-AV2 | 176 | 345 | 0.62 | 0.16 | 0.18 | 0.06 | |
|
| 97 (3%) | AC1-AC2 | 98 | 1086 | 3.75 | 1.86 | 0.16 | 0.13 |
| 88 (1%) | AC2-AC3 | 281 | 420 | 2.83 | 0.88 | 0.12 | 0.84 | |
| 105 (0%) | AV1-AV2 | 179 | 339 | 2.78 | 1.58 | 0.71 | 0.52 | |
|
| 397 (8%) | AC1-AC2 | 92 | 1074 | 2.43 | 2.23 | 0.40 | 1.52 |
| 588 (9%) | AC2-AC3 | 260 | 408 | 7.67 | 4.27 | 0.20 | 0.87 | |
| 623 (9%) | AV1-AV2 | 191 | 351 | 3.01 | 2.27 | 0.40 | 0.18 | |
1 Number of genes with significant differences in tree length between OV and NOV regions over total number. Deviation from randomness was tested by Fisher’s exact test (p < 0.05). M: Monopartite; B: Bipartite. Asterisks indicate sequences formerly classified as Bendhi yellow vein India virus; 2 Number of sequences used. Percentage of recombinant sequences is shown in parentheses; 3 Name of the genes involved in the overlapping instance, and length of the OV region; 4 Length of the largest gene (internal overlapping) of the gene with 5′-terminal overlapping (5′-terminal) and of the gene with 3′-terminal overlapping (3′-terminal); 5 Tree length of inferred phylogenies for OV and NOV regions of each overlapping instance.