| Literature DB >> 35632721 |
Jianbo Huang1, Wenjie Tang2, Xvetao Wang3, Jun Zhao1, Kenan Peng1, Xiangang Sun1, Shuwei Li2,3, Shengyao Kuang2,3, Ling Zhu1,4, Yuancheng Zhou2,3, Zhiwen Xu1,4.
Abstract
We sequenced the complete genome of the pseudorabies virus (PRV) FJ epidemic strain, and we studied the characteristics and the differences compared with the classical Chinese strain and that of other countries. Third-generation sequencing and second-generation sequencing technology were used to construct, sequence, and annotate an efficient, accurate PRV library. The complete FJ genome was 143,703 bp, the G+C content was 73.67%, and it encoded a total of 70 genes. The genetic evolution of the complete genome and some key gene sequences of the FJ strain and PRV reference strains were analyzed by the maximum likelihood (ML) method of MEGA 7.0 software. According to the ML tree based on the full-length genome sequences, PRV FJ strain was assigned to the branch of genotype II, and it showed a close evolutionary relationship with PRV epidemic variants isolated in China after 2011. The gB, gC, gD, gH, gL, gM, gN, TK, gI, and PK genes of the FJ strain were assigned to the same branch with other Chinese epidemic mutants; its gG gene was assigned to the same branch with the classic Chinese Fa and Ea strains; and its gE gene was assigned to a relatively independent branch. Potential recombination events were predicted by the RDP4 software, which showed that the predicted recombination sites were between 1694 and 1936 bp, 101,113 and 102,660 bp, and 107,964 and 111,481 bp in the non-coding region. This result broke the previously reported general rule that pseudorabies virus recombination events occur in the gene coding region. The major backbone strain of the recombination event was HLJ8 and the minor backbone strain was Ea. Our results allowed us to track and to grasp the recent molecular epidemiological changes of PRV. They also provide background materials for the development of new PRV vaccines, and they lay a foundation for further study of PRV.Entities:
Keywords: complete genome sequencing; gene recombination; phylogenetic analysis; pseudorabies virus
Mesh:
Substances:
Year: 2022 PMID: 35632721 PMCID: PMC9146711 DOI: 10.3390/v14050978
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Complete genome sequence information of pseudorabies viruses in the National Center of Biotechnology Information databases.
| No. | Strain Name | Accession Number | Country | Isolation |
|---|---|---|---|---|
| 1 | Becker | JF797219.1 | USA | 1970 |
| 2 | Bartha | JF797217.1 | Hungary | 1961 |
| 3 | Ea | KU315430.1 | China | 1990 |
| 4 | Fa | KM189913.1 | China | 2012 |
| 5 | GD0304 | MH582511.1 | China | 2015 |
| 6 | HB1201 | KU057086.1 | China | 2012 |
| 7 | HeN1 | KP098534.1 | China | 2012 |
| 8 | HLJ8 | KT824771.1 | China | 2012 |
| 9 | HNB | KM189914.3 | China | 2012 |
| 10 | HNX | KM189912.1 | China | 2012 |
| 11 | Kaplan | JF797218.1 | Hungary | 1959 |
| 12 | Kolchis | KT983811.1 | Greece | 2010 |
| 13 | MY-1 | AP018925.1 | Japan | 2015 |
| 14 | ZJ-01 | KM061380.1 | China | 2012 |
| 15 | TJ | KJ789182.1 | China | 2012 |
Nucleotide sequence coordinates and lengths are given relative to the genome sequence of PRV FJ strain.
| Region | Location | Length (bp) |
|---|---|---|
| UL | 1–101,012 | 101,012 |
| IRs | 101,013–117,681 | 16,669 |
| US | 117,682–127,034 | 9353 |
| TRs | 127,035–143,703 | 16,669 |
Figure 1Sequences and distribution of the PRV FJ complete genome.
Annotation of each open reading frame of from the PRV FJ complete genome.
| Protein Name | Location of ORF (bp) | Length (aa) | Function |
|---|---|---|---|
| UL56 | 754–1377 | 207 | Possibly vesicular trafficking |
| ICP27 | 1932–3017 | 361 | Gene regulation; early protein |
| gK | 3096–4034 | 312 | Viral glycoprotein K; type III membrane protein |
| UL52 | 3989–6895 | 968 | DNA replication; primase subunit of ULS/UL8/UL52 complex |
| UL51 | 6882–7613 | 243 | Tegument protein |
| dUTPase | 7812–8621 | 269 | dUTPase |
| gN | 8542–8841 | 99 | Glycoprotein N; type I membrane protein; complexed with gM |
| VP22 | 8879–9619 | 246 | Interacts with C-terminal domains of gE and gM; tegument protein |
| VP16 | 9683–10,924 | 413 | Gene regulation (transactivator); egress (secondary envelopment); tegument protein |
| VP13/14 | 11,034–13,250 | 738 | Viral egress (secondary envelopment); |
| VP11/12 | 13,269–15,356 | 695 | Possibly gene regulation; tegument protein |
| gB | 15,905–18,649 | 914 | Viral entry (fusion); cell–cell spread; glycoprotein B; type I |
| ICP18.5 | 18,520–20,688 | 722 | DNA cleavage and encapsulations (terminase); associated with UL15, UL33 and UL6 |
| ICP8 | 20,836–24,378 | 1180 | DNA replication-recombination; binds single-stranded DNA |
| UL30 | 24,677–27,823 | 1048 | DNA replication; DNA polymerase subunit of UL30/UL42 complex |
| UL31 | 27,744–28,559 | 271 | Viral egress (nuclear egress); primary virion tegument protein; interacts with UL34 |
| UL32 | 28,552–29,967 | 471 | DNA packaging; efficient localization of capsids to replication compartments |
| UL33 | 29,966–30,322 | 118 | DNA cleavage and encapsidation; associated with UL28 and UL 15 |
| UL34 | 30,494–31,279 | 261 | Viral egress (nuclear egress); primary virion |
| VP26 | 31,334–31,645 | 103 | Capsid protein |
| VP1/2 | 32,057–41,644 | 3195 | Large tegument protein; interacts with UL37 and UL19 |
| UL37 | 41,682–44,441 | 919 | Tegument protein; interacts with UL36 |
| VP19c | 44,498–45,604 | 368 | Capsid protein; forms triplexes together with ULl8 |
| RR1 | 45,941–48,307 | 788 | Nucleotide synthesis; large subunit of ribonucleotide reductase |
| RR2 | 48,317–49,228 | 303 | Nucleotide synthesis; small subunit of ribonucleotide reductase |
| vhs | 49,843–50,940 | 365 | Gene regulation (inhibitor of gene expression); virion host cell shutoff |
| UL42 | 51,069–52,226 | 385 | DNA replication; polymerase accessory subunit of UL30/UL42 complex |
| UL43 | 52,286–53,407 | 373 | Unknown; type III membrane protein |
| gC | 53,474–54,937 | 487 | Viral entry (virion attachment); glycoprotein C; type I membrane protein; binds to heparan sulfate |
| UL26.5 | 55,233–56,093 | 286 | Scaffold protein; substrate for UL26; required for capsid formation and maturation |
| VP24 | 55,233–56,831 | 532 | Scaffold protein; proteinase; required for capsid formation and maturation |
| UL25 | 56,883–58,493 | 536 | Capsid-associated protein; required for capsid assembly |
| UL24 | 58,592–59,107 | 171 | Unknown; type III membrane protein |
| TK | 59,100–60,062 | 320 | Nucleotide synthesis; thymidine kinase |
| gH | 60,198–62,255 | 685 | Viral entry (fusion); cell–cell spread; glycoprotein H; type I membrane protein; complexed with gL |
| UL21 | 64,012–65,613 | 533 | Capsid-associated protein |
| UL20 | 65,720–66,217 | 165 | Viral egress; type III membrane protein |
| VP5 | 66,306–70,298 | 1330 | Major capsid protein; forms hexons and pentons |
| VP23 | 70,473–71,363 | 296 | Capsid protein; forms triplexes together with UL38 |
| UL17 | 72,739–74,538 | 599 | DNA cleavage and encapsidation |
| UL16 | 74,565–75,551 | 328 | Possibly virion morphogenesis |
| UL15 | 71,546–72,687 | 753 | DNA cleavage and encapsidation; terminase subunit; interacts with UL33 |
| UL14 | 76,690–77,169 | 159 | Virion morphogenesis |
| VP18.8 | 77,139–78,314 | 391 | Protein-serine/threonine kinase |
| AN | 78,280–79,731 | 483 | DNA recombination; alkaline exonuclease |
| UL11 | 79,689–79,880 | 63 | Viral egress (secondary envelopment); membrane-associated tegument protein |
| gM | 80,309–81,490 | 393 | Viral egress (secondary envelopment); glycoprotein M; type III membrane protein; C terminus interacts with UL49; inhibits membrane fusion in transient assays; complexed with gN |
| OBP | 81,489–84,023 | 844 | Sequence-specific ori-binding protein |
| UL8 | 84,020–86,086 | 688 | DNA replication; part of ULS/UL8/UL52 helicase-primase complex |
| UL7 | 86,288–87,088 | 266 | Virion morphogenesis |
| UL6 | 86,979–88,916 | 645 | DNA packaging Capsid protein; portal protein; docking site for terminase |
| UL5 | 88,915–91,470 | 851 | DNA replication; part of ULS/LJL8/UL52 helicase-primase complex; helicase motif |
| UL4 | 91,528–91,965 | 145 | Nuclear protein |
| UL3.5 | 92,141–92,809 | 222 | Possibly virion morphogenesis |
| UL3 | 92,806–93,540 | 244 | Nuclear protein |
| UNG | 93,597–94,568 | 323 | Uracil-DNA glycosylase |
| gL | 94,546–95,016 | 156 | Viral entry; cell–cell spread; glycoprotein L; membrane-anchored via complex with gH |
| ICP0 | 96,248–97,348 | 366 | Gene regulation (transactivator of viral and cellular genes); early protein |
| ICP4 | 103,130–107,544 | 1471 | Gene regulation; immediate early protein |
| ICP22 | 116,146–117,339 | 397 | Gene regulation |
| PK | 118,467–119,471 | 334 | Minor form of protein kinase (53-kDa mobility); viral egress (nuclear egress); major form of protein kinase (41-kDa mobility) |
| gG | 119,531–121,030 | 499 | Cell–cell spread; secreted; glycoprotein G |
| gD | 121,214–122,422 | 402 | Viral entry (cellular receptor binding protein); glycoprotein D |
| gI | 122,446–123,543 | 465 | Cell–cell spread; glycoprotein I; type I membrane protein; complexed with gE |
| gE | 123,647–125,386 | 579 | Cell–cell spread; glycoprotein E; type I membrane protein; complexed with gI; C terminus interacts with UL49 |
| US9(11K) | 125,444–125,740 | 98 | Protein sorting in axons; type II tail-anchored membrane protein |
| US2(28K) | 125,994–126,764 | 256 | Possibly envelope associated |
Abbreviations: aa, amino acids; ORF, open reading frame.
Complete gene sequence nucleotide homology analysis.
| Virus | Nucleotides Homology (%) | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MY-1 | FJ * | Bartha | Kaplan | Becker | TJ | ZJ01 | HNX | Fa | HNB | HeN1 | HLJ8 | Kolchis | HB1201 | Ea | GD0304 | |
| MY-1 | ||||||||||||||||
| FJ * | 99.0 | |||||||||||||||
| Bartha | 95.4 | 95.7 | ||||||||||||||
| Kaplan | 95.6 | 96.0 | 99.4 | |||||||||||||
| Becker | 95.4 | 95.7 | 98.6 | 98.9 | ||||||||||||
| TJ | 99.0 | 100 | 95.7 | 96.0 | 95.7 | |||||||||||
| ZJ01 | 99.0 | 99.9 | 95.6 | 95.9 | 95.6 | 99.9 | ||||||||||
| HNX | 99.0 | 100 | 95.7 | 96.0 | 95.7 | 100 | 99.9 | |||||||||
| Fa | 98.9 | 99.7 | 95.7 | 96.0 | 95.6 | 99.7 | 99.7 | 99.7 | ||||||||
| HNB | 99.0 | 100.0 | 95.7 | 96.0 | 95.7 | 100.0 | 99.9 | 100.0 | 99.7 | |||||||
| HeN1 | 99.0 | 100.0 | 95.7 | 96.0 | 95.7 | 100.0 | 99.9 | 100.0 | 99.7 | 100.0 | ||||||
| HLJ8 | 99.0 | 100.0 | 95.7 | 96.0 | 95.7 | 100.0 | 99.9 | 100.0 | 99.7 | 100.0 | 100.0 | |||||
| Kolchis | 95.4 | 95.7 | 99.1 | 99.5 | 99.0 | 95.7 | 95.6 | 95.7 | 95.7 | 95.7 | 95.7 | 95.7 | ||||
| HB1201 | 99.0 | 100.0 | 95.7 | 96.0 | 95.7 | 100.0 | 99.9 | 100.0 | 99.7 | 100.0 | 100.0 | 100.0 | 95.7 | |||
| Ea | 98.9 | 99.7 | 95.7 | 96.0 | 95.6 | 99.7 | 99.7 | 99.7 | 100.0 | 99.7 | 99.7 | 99.7 | 95.7 | 99.7 | ||
| GD0304 | 99.0 | 100.0 | 95.7 | 96.0 | 95.7 | 100.0 | 99.9 | 100.0 | 99.7 | 100.0 | 100.0 | 100.0 | 95.7 | 100.0 | 99.7 | |
Note. “*” indicates that this PRV strain is the target PRV strain.
Figure 2Genome organization and comparison of the PRV Bartha strain with the PRV FJ strain and the remaining 14 reference PRV strains. Comparison of the PRV genome shows conserved (blue) and variable (pink) regions. Open reading frames (ORFs) are represented by gray horizontal arrows across the top of each panel, and genome coordinates in kilobase pairs (kbp) are shown along the bottom.
Figure 3Phylogenetic analysis of the PRV complete genome sequences. The tree was constructed using the MEGA 7.0 software with the maximum likelihood method and 1000 bootstrap replicates. The bar and the number represent the genetic distance scale of these genes at this length is 0.003.
Figure 4Phylogenetic analysis based on nucleotide sequences of PRV virulence-related and immunogenicity genes: (A) gB, (B) gC, (C) gD, (D) gG, (E) gH, (F) gL, (G) gM, (H) gN, (I) TK, (J) gI, (K) gE and (L) PK. The tree was constructed with the same method described in Figure 3.
Figure 5Putative recombination events in the PRV FJ strain complete genome. Recombination sites are marked by a blue arrow. The region with a pink background represents the potential recombination region. (A): BootScan method; (B): RDP method.
Analysis of PRV FJ recombination events with different algorithms; the numbers represent the p value of each algorithm.
| Position (bp) | Method and | ||||||
|---|---|---|---|---|---|---|---|
| Bootscan | Maxchi | Chimaera | SiScan | PhylPro | LARD | 3Seq | |
| 1694–1936 | 2.31 × 10−11 * | 3.50 × 10−9 * | NS | 2.27 × 10−2 | 1.05 × 10−2 | NS | NS |
| 101,113–102,660 | 6.54 × 10−5 * | 6.26 × 10−3 | NS | 5.67 × 10−3 | 1.82 × 10−3 | NS | NS |
| 107,964–111,481 | 9.87 × 10−10 * | 9.81 × 10−9 * | NS | 1.93 × 10−5 * | 1.68 × 10−4 * | NS | NS |
Note. NS, does not support reorganization events; * p < 10−3.