| Literature DB >> 34372529 |
Ruiming Hu1,2,3, Leyi Wang4, Qingyun Liu1, Lin Hua1, Xi Huang1, Yue Zhang1, Jie Fan1, Hongjian Chen1, Wenbo Song1, Wan Liang1,5, Nengshui Ding2,6,7, Zuohua Li8, Zhen Ding2,3, Xibiao Tang1, Zhong Peng1, Bin Wu1.
Abstract
Pseudorabies virus (PRV) is an economically significant swine infectious agent. A PRV outbreak took place in China in 2011 with novel virulent variants. Although the association of viral genomic variability with pathogenicity is not fully confirmed, the knowledge concerning PRV genomic diversity and evolution is still limited. Here, we sequenced 54 genomes of novel PRV variants isolated in China from 2012 to 2017. Phylogenetic analysis revealed that China strains and US/Europe strains were classified into two separate genotypes. PRV strains isolated from 2012 to 2017 in China are highly related to each other and genetically close to classic China strains such as Ea, Fa, and SC. RDP analysis revealed 23 recombination events within novel PRV variants, indicating that recombination contributes significantly to the viral evolution. The selection pressure analysis indicated that most ORFs were under evolutionary constraint, and 19 amino acid residue sites in 15 ORFs were identified under positive selection. Additionally, 37 unique mutations were identified in 19 ORFs, which distinguish the novel variants from classic strains. Overall, our study suggested that novel PRV variants might evolve from classical PRV strains through point mutation and recombination mechanisms.Entities:
Keywords: genome; phylogeny; pseudorabies virus; recombination; selection pressure
Mesh:
Year: 2021 PMID: 34372529 PMCID: PMC8310123 DOI: 10.3390/v13071322
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Overview of the sequence diversity in the alignment of 73 PRV genomic sequences. The alignment of 73 PRV genomic sequences was visualized by Geneious Prime. The black bar at the top represents a consensus sequence drawn from the alignment of 73 PRV genomic sequences. The white lines in the second row indicate the distribution of assembly gaps. In the third row, the curve of identity level is plotted from this alignment, which is colored as follows: green, 100% identity; green-brown, 30 to <100% identity; red, <30% identity. The fourth row is the annotation of the PRV genome. The bottom row shows the region of the repetitive sequence of the PRV genome.
Figure 2Phylogenetic relationship of PRV genome sequences. (A) A maximum likelihood (ML) tree was constructed by using MEGA X, based on the alignment of 73 PRV genome sequences (Tables S1 and S2). Gapped regions, which are labeled in Figure 1 within the multisequence alignment, were removed from all sequences before phylogenetic analysis. The ML tree is a rooted tree, and all branch lengths were drawn to a scale of nucleotide substitutions per site. Bootstrap resampling (100 replication) was performed. The branches of Genetic Clade 2.1 are collapsed. (B)The mean distance between each genetic clade was calculated by MEGA X with the Jukes–Cantor model. (C) Expansion of the collapsed clade (Clade 2.1) in the panel A of this figure.
Figure 3Recombination occurring pervasively in PRV newly sequenced strains. Recombination between PRV strains was determined by the Neighbor-Net split network and BootScan. (A) The Neighbor-Net split network was drawn by SplitsTree version 4.16 of 73 genomic sequences. The reticulate connections between each clade indicate events of recombination. Each clade was covered by a color scheme. The name of each strain is not displayed. (B) RDP analysis was applied to detect the recombination events in newly sequenced PRV genomes. All the recombination events are plotted in the schematic diagram. Each strip represents one recombination event and is labeled with one color. There are 4 colors in total, each representing different combinations of parental strains, as follows: red = Bartha-K61 + novel PRV variants; blue = Bartha-K61 + China classic strains, green = China classic strains + novel variants; yellow = novel variants + novel variants.
Nucleotide and amino acid diversity of PRV strains in open reading frames.
| Nucleotide | Amino Acid | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| ORF | Common Name | Core Gene | Mean Distance | Standard Error | Variable Sites | Length * | Ratio of Variable Sites | Mean Distance | Standard Error | Ratio of Variable Sites |
| UL1 | gL | Yes | 2.66% | 0.006 | 22 | 468 | 4.70% | 3.10% | 0.011 | 5.13% |
| UL2 | UNG | Yes | 0.72% | 0.002 | 47 | 969 | 4.85% | 0.89% | 0.002 | 6.81% |
| UL3 | No | 2.00% | 0.002 | 79 | 720 | 10.97% | 3.79% | 0.007 | 17.50% | |
| UL3.5 | No | 1.64% | 0.003 | 45 | 666 | 6.76% | 2.80% | 0.006 | 13.06% | |
| UL4 | No | 0.84% | 0.002 | 18 | 435 | 4.14% | 1.27% | 0.004 | 6.21% | |
| UL5 | Yes | 0.23% | 0.000 | 43 | 2505 | 1.72% | 0.24% | 0.001 | 2.16% | |
| UL6 | Yes | 0.42% | 0.001 | 44 | 1935 | 2.27% | 0.72% | 0.002 | 3.26% | |
| UL7 | Yes | 1.13% | 0.002 | 24 | 798 | 3.01% | 0.19% | 0.005 | 5.26% | |
| UL8 | Yes | 1.60% | 0.001 | 114 | 2064 | 5.52% | 1.63% | 0.003 | 7.70% | |
| UL9 | OBP | No | 0.53% | 0.001 | 104 | 2532 | 4.11% | 0.83% | 0.001 | 6.40% |
| UL10 | gM | Yes | 0.84% | 0.002 | 34 | 1179 | 2.88% | 1.80% | 0.004 | 6.11% |
| UL11 | Yes | 1.67% | 0.006 | 10 | 189 | 5.29% | 2.94% | 0.014 | 9.52% | |
| UL12 | AN | Yes | 1.32% | 0.002 | 79 | 1449 | 5.45% | 2.15% | 0.004 | 8.90% |
| UL13 | PK | Yes | 1.28% | 0.002 | 51 | 1173 | 4.35% | 2.29% | 0.004 | 7.42% |
| UL14 | Yes | 1.30% | 0.003 | 16 | 477 | 3.35% | 2.20% | 0.008 | 5.66% | |
| UL15 | Yes | 1.38% | 0.002 | 119 | 2226 | 5.35% | 1.90% | 0.002 | 7.68% | |
| UL16 | Yes | 2.00% | 0.003 | 48 | 984 | 4.88% | 2.80% | 0.006 | 7.01% | |
| UL17 | Yes | 1.25% | 0.002 | 83 | 1800 | 4.61% | 1.82% | 0.003 | 6.67% | |
| UL18 | VP23 | Yes | 0.76% | 0.002 | 24 | 888 | 2.70% | 0.58% | 0.003 | 2.36% |
| UL19 | VP5 | Yes | 0.45% | 0.001 | 84 | 3990 | 2.11% | 0.47% | 0.001 | 2.56% |
| UL20 | No | 1.90% | 0.005 | 20 | 495 | 4.04% | 3.10% | 0.010 | 6.06% | |
| UL21 | Yes | 1.48% | 0.002 | 101 | 1599 | 6.32% | 2.20% | 0.004 | 9.57% | |
| UL22 | gH | Yes | 0.22% | 0.000 | 44 | 2064 | 2.13% | 0.44% | 0.001 | 4.22% |
| UL23 | TK | 0.49% | 0.001 | 21 | 960 | 2.19% | 0.83% | 0.003 | 3.44% | |
| UL24 | Yes | 0.83% | 0.003 | 9 | 513 | 1.75% | 0.84% | 0.005 | 1.75% | |
| UL25 | Yes | 0.72% | 0.001 | 54 | 1608 | 3.36% | 1.00% | 0.002 | 4.85% | |
| UL26 | VP24 | Yes | 0.77% | 0.001 | 62 | 1599 | 3.88% | 0.92% | 0.002 | 5.63% |
| UL26.5 | VP22 | Yes | 1.19% | 0.002 | 40 | 861 | 4.65% | 1.20% | 0.004 | 5.92% |
| UL27 | gB | Yes | 0.96% | 0.001 | 105 | 2742 | 3.83% | 1.67% | 0.003 | 6.67% |
| UL28 | ICP18.5 | Yes | 0.71% | 0.001 | 66 | 2166 | 3.05% | 0.98% | 0.002 | 4.99% |
| UL29 | ICP8 | Yes | 0.51% | 0.001 | 87 | 3537 | 2.46% | 0.56% | 0.001 | 3.14% |
| UL30 | Yes | 0.47% | 0.001 | 52 | 3144 | 1.65% | 0.48% | 0.001 | 2.10% | |
| UL31 | Yes | 0.96% | 0.002 | 16 | 813 | 1.97% | 1.44% | 0.005 | 2.95% | |
| UL32 | Yes | 0.80% | 0.002 | 33 | 1413 | 2.34% | 0.88% | 0.003 | 2.76% | |
| UL33 | Yes | 1.30% | 0.003 | 11 | 351 | 3.13% | 2.23% | 0.010 | 5.98% | |
| UL34 | Yes | 2.06% | 0.004 | 45 | 783 | 5.75% | 3.20% | 0.008 | 8.81% | |
| UL35 | VP26 | Yes | 1.45% | 0.004 | 10 | 309 | 3.24% | 3.60% | 0.013 | 7.77% |
| UL36 | VP1/2 | Yes | 1.87% | 0.001 | 993 | 9489 | 10.46% | 2.70% | 0.001 | 14.64% |
| UL37 | Yes | 0.70% | 0.001 | 100 | 2757 | 3.63% | 1.32% | 0.002 | 6.96% | |
| UL38 | VP19c | Yes | 0.93% | 0.002 | 32 | 1104 | 2.90% | 0.77% | 0.002 | 3.26% |
| UL39 | RR1 | Yes | 0.82% | 0.001 | 81 | 2364 | 3.43% | 1.18% | 0.002 | 5.58% |
| UL40 | RR2 | No | 0.82% | 0.001 | 25 | 909 | 2.75% | 0.90% | 0.003 | 3.30% |
| UL41 | VHS | No | 1.02% | 0.002 | 34 | 1095 | 3.11% | 1.08% | 0.003 | 4.11% |
| UL42 | Yes | 0.81% | 0.001 | 52 | 1155 | 4.50% | 1.69% | 0.004 | 9.09% | |
| UL43 | No | 0.93% | 0.001 | 45 | 1119 | 4.02% | 1.46% | 0.004 | 5.63% | |
| UL44 | gC | No | 2.38% | 0.003 | 108 | 1461 | 7.39% | 4.20% | 0.005 | 13.35% |
| UL46 | VP11/12 | No | 1.63% | 0.002 | 125 | 2085 | 6.00% | 2.63% | 0.003 | 9.64% |
| UL47 | VP13/14 | No | 1.89% | 0.002 | 150 | 2217 | 6.77% | 2.60% | 0.003 | 9.47% |
| UL48 | VP16 | No | 0.97% | 0.002 | 41 | 1239 | 3.31% | 1.52% | 0.004 | 5.33% |
| UL49 | VP22 | No | 1.37% | 0.003 | 40 | 729 | 5.49% | 2.06% | 0.005 | 8.23% |
| UL49.5 | gN | Yes | 4.10% | 0.008 | 25 | 297 | 8.42% | 7.56% | 0.022 | 15.15% |
| UL50 | dUTPase | Yes | 1.40% | 0.003 | 37 | 807 | 4.58% | 2.10% | 0.006 | 5.95% |
| UL51 | Yes | 2.06% | 0.004 | 40 | 729 | 5.49% | 4.38% | 0.009 | 11.11% | |
| UL52 | Yes | 1.81% | 0.001 | 329 | 2910 | 11.31% | 2.56% | 0.002 | 14.74% | |
| UL53 | gK | No | 1.60% | 0.003 | 39 | 936 | 4.17% | 2.38% | 0.006 | 6.09% |
| UL54 | ICP27 | Yes | 2.00% | 0.003 | 73 | 1083 | 6.74% | 2.80% | 0.005 | 11.08% |
| UL56 | No | 2.87% | 0.004 | 47 | 621 | 7.57% | 2.93% | 0.008 | 8.70% | |
| US1 | ICP22 | No | 2.49% | 0.003 | 235 | 1332 | 17.64% | 3.34% | 0.005 | 24.55% |
| US2 | 28K | No | 2.30% | 0.004 | 40 | 768 | 5.21% | 3.50% | 0.009 | 8.59% |
| US3 | No | 0.83% | 0.002 | 28 | 1002 | 2.79% | 1.33% | 0.004 | 4.19% | |
| US4 | gG | No | 1.29% | 0.001 | 142 | 1497 | 9.49% | 1.70% | 0.003 | 12.63% |
| US6 | gD | No | 0.83% | 0.001 | 44 | 1206 | 3.65% | 1.50% | 0.003 | 6.72% |
| US7 | gI | No | 1.98% | 0.002 | 99 | 1098 | 9.02% | 3.00% | 0.005 | 13.11% |
| US8 | gE | No | 1.78% | 0.001 | 233 | 1734 | 13.44% | 2.68% | 0.004 | 16.26% |
| US9 | 11K | No | 1.97% | 0.005 | 12 | 294 | 4.08% | 2.07% | 0.011 | 4.08% |
| EP0 | ICP0 | No | 0.79% | 0.001 | 44 | 1104 | 3.99% | 1.19% | 0.003 | 5.98% |
| IE180 | ICP4 | No | 0.64% | 0.001 | 214 | 4425 | 4.84% | 0.98% | 0.001 | 6.98% |
*: The ORF nucleotide sequence lengths of each ORF were collected from the reference genome of PRV Ea strain.
Figure 4Presentative amino acid (AA) variation in UL15, UL3.5, and UL27. Partial regions of the amino acid (AA) alignment of UL15 (A), UL3.5 (B), and UL27 (C) are displayed in the three panels. To make the alignment easy to display, the redundancy of newly sequenced ORFs is reduced, while all the reference sequences are maintained. The panel of the strain name is covered by colored boxes, which represent their genetic clades with the same color scheme used in Figure 3A. The distinguishing mutations that distinguish PRV strains from classic strains are highlighted by red boxes.
Distinguishing mutations in newly sequenced PRV strains.
| ORF Name | Mutation Position $ | Function |
|---|---|---|
| UL2 | V107A * | UNG, DNA repair; Uracil-DNA glycosylase |
| UL3.5 | Viral egress (secondary envelopment); membrane-associated protein | |
| UL5 | H250R | DNA replication; UL5 is helicase subunit of UL5/UL8/UL52 helicase/primase complex; |
| UL9 | A283T, W500R, P696T, | OBP, sequence-specific ori-binding protein, ATP-dependent helicase motif |
| UL13 |
| VP18.8, protein-serine/threonine kinase |
| UL14 | G145E | Virion tegument protein, |
| UL15 |
| Interacts with UL33, UL28 & UL6; DNA viral concatemeric DNA cleavage/encapsidation; terminase subunit of the UL15/UL28 complex; |
| UL27 | Viral entry (fusion); cell–cell spread; glycoprotein B; type I | |
| UL36 | Viral egress (capsid tegumentation); major tegument scaffold; interacts with UL37 and capsid | |
| UL37 | M782V | Viral egress (capsid tegumentation); interacts with UL36 |
| UL42 |
| DNA replication; polymerase accessory subunit of UL30/UL42 holoenzyme |
| UL44 |
| Glycoprotein C (gC), viral entry (virion attachment); type I membrane protein; binds to heparan sulfate |
| UL47 | G67E, | VP13/14, viral egress (secondary envelopment); tegument protein |
| UL49.5 |
| Glycoprotein N (gN), immune evasion (TAP inhibitor); type I membrane protein; complexed with gM |
| US1 | E211G, E260G | ICP22, acts as a regulator of gene expression |
| US4 |
| Glycoprotein G (secreted) |
| US6 | V338A, | Glycoprotein D (gD), viral entry (cellular receptor-binding protein); type I membrane protein |
| US8 | G54D, P403A | Glycoprotein E (gE), cell–cell spread; glycoprotein E; type I membrane protein; complexed with gI; C-terminus interacts with UL49; protein sorting in axons |
| IE180 | ΔΔ349-350RG, A841P, A842S, | ICP4, gene regulation (transcription activator); immediate-early protein |
*: The mutation sites listed in the table indicate that certain amino acid residues of every PRV strain isolated after 2011 except HuB1/CHN2017 are collectively different from the classic Chinese PRV strains Ea/Fa and SC, which distinguish the novel PRV variants from the classic Chinese PRV strains (Ea, Fa, and SC). &: The mutation sites with bold letters indicate that certain amino acid residues of every PRV strain isolated after 2011 except HuB1/CHN2017 are collectively different from the classic Chinese PRV strains Ea, Fa, and SC and all EU/US strains. $: Each amino acid residue number is the codon number in the corresponding ORF of reference genome Ea.
Figure 5Selection pressure analysis. Selection pressure of each ORF of PRV was analyzed. ORF sequences were extracted from each annotated genome. By using FUBAR in the Datamonkey web server, nonsynonymous substitutions per nonsynonymous site (dN), synonymous substitutions per synonymous site (dS), and the dN/dS ratio of each ORF were calculated. The result is plotted as a bar graph. The dN and dN/dS ratio read the major y-axis at the left side, and dS takes the minor y-axis at the right side.
Position of each codon under positive selection.
| ORF Name | Codon Position $ | Detection Method |
|---|---|---|
| UL5 | 574 | F*, M& |
| UL6 | 462, 464 | F, C# |
| UL9 | 283 | F, C |
| UL12 | 474 | F, M |
| UL15 | 674 | F, M |
| UL19 | 502 | F, M |
| UL23 | 293 | M, C |
| UL25 | 472 | F, C |
| UL27 | 75 | F, M, C |
| 505 | F, C | |
| UL36 | 561 | F, M |
| UL39 | 4 | F, M, C |
| UL52 | 662 | F, C |
| 663 | M, C | |
| US8 | 575 | F, C |
| 578 | M, C | |
| EP0 | 212 | F, M |
| IE180 | 1462 | F, M |
F*: FUBAR, posterior probability >90%; M&: MEME, p < 0.1; C#: CodeML, p < 0.05. $: All amino acid residue positions are the codon number in the corresponding ORF of reference genome Ea.