| Literature DB >> 35032481 |
Elisabeth Mathijs1, Andy Haegeman2, Kris De Clercq2, Steven Van Borm2, Frank Vandenbussche2.
Abstract
The diseases caused by capripoxviruses (CaPVs) are of major economic concern in sheep, goat and cattle as they are inexorably spreading into non-endemic regions. As CaPV strains are serologically indistinguishable and genetically highly homologous, typing closely related strains can only be achieved by whole genome sequencing. Unfortunately the number of publicly available genomes remains low as most sequencing methods rely on virus isolation. Therefore, we developed a robust, cost-effective and widely applicable method that allows to generate (nearly) complete CaPV genomes directly from clinical samples or commercial vaccine batches. A set of pan-CaPVs long-range PCRs spanning the entire genome was designed to generate PCR amplicons that can be sequenced on commonly used high-throughput sequencing platforms: MiSeq (Illumina), RSII (PacBio) and MinION (Oxford Nanopore Technologies). The robustness of the LR-PCR strategy was evaluated for all 3 members of CaPV directly from a variety of samples, including clinical samples (N = 7), vaccine batches (N = 6), and virus isolates (N = 2). The sequencing method described here allows to reconstruct (nearly) complete CaPV genomes in less than a week and will aid researchers studying closely-related CaPV strains worldwide.Entities:
Keywords: Capripoxvirus; Long-range PCR; Nanopore sequencing; Next-generation sequencing (NGS); Single-Molecule real-time sequencing (SMRT); Whole-genome sequencing (WGS)
Mesh:
Year: 2022 PMID: 35032481 PMCID: PMC8872832 DOI: 10.1016/j.jviromet.2022.114464
Source DB: PubMed Journal: J Virol Methods ISSN: 0166-0934 Impact factor: 2.014
Overview of the Capripoxvirus complete coding sequences determined using the methodology developed in this study. The sequencing strategy (PCR enrichment and sequencing technology) is clarified for each strain.
| ID | Strain | Species | Sample type | Amplicon length | Sequencing platform | GenBank accession | Reference |
|---|---|---|---|---|---|---|---|
| SA-Neethling | Attenuated LSDV strain SA-Neethling | LSDV | Cell culture | 5500−7500 | MiSeq (7500) | MW435866 | This study |
| LSDV | Skin sample | 7500 | – | – | |||
| Caprivac | Gorgan | GTPV | Vaccine batch | 5500−7500 | RSII (5500) | KX576657 | ( |
| Jovivac | Yugoslavian RM65 | SPPV | Vaccine batch | 5500−7500 | – | KJ818284 | (Tuppurainen et al., 2014) |
| Kenyavac | KSGP 0240 | LSDV | Vaccine batch | 5500 | RSII | KX683219 | ( |
| Lumpyvax | Attenuated strain SIS | LSDV | Vaccine batch | 5500 | RSII | KX764643 | (Mathijs et al., 2017, 2016b) |
| Herbivac | Attenuated strains SA-Neethling | KX764644 | |||||
| LSD OBP | KX764645 | ||||||
| Evros/GRC/15 | Field strain Greece 2015 | LSDV | Cell culture | 7500 | MiSeq | KY829023 | ( |
| Kubash/KAZ/16 | Field strain Kazakhstan 2016 | LSDV | Skin sample | 7500 | MiSeq | MN642592 | ( |
| 219−249/BUL/16 | Field strain Bulgaria 2016 | LSDV | Blood sample | 7500 | MiSeq | MT643825 | ( |
| 20L42_Quyet-Thang/VNM/20 | Field strain Vietnam 2020 | LSDV | Skin sample | 7500 | MiSeq | MZ577073 | (Submitted for publication) |
| 20L43_Ly-Quoc/VNM/20 | MZ577074 | ||||||
| 20L70_Dinh-To/VNM/20 | MZ577075 | ||||||
| 20L81_Bang-Thanh/VNM/20 | MZ577076 |
LSD(V) = lumpy skin disease (virus); GTPV = Goatpox virus; SPPV = Sheeppox virus.
OBP = Onderstepoort Biological Products.
PacBio = Pacific Biosciences; ONT = Oxford Nanopore Technologies.
Primer sequences for long-range PCR amplification and expected amplicon sizes for species compromising the Capripoxvirus genus.
| F | Primer name | Primer sequence (5′-3′) | Expected size (bp) | ||
|---|---|---|---|---|---|
| GTPV | LSDV | SPPV | |||
| 1 | fp CaPV-F1 | AAACCTGTAAATGGATACTTTTTTCATTC | 7339−459 | 7566−68 | 7403−11 |
| rp CaPV-F1 | ATTTGGAAATATAATTGTGTTAACTGTTCT | ||||
| 2 | fp CaPV-F2 | TGTGAAAAATTAATCCATTCTTCTAAACAG | 7584−9 | 7652−62 | 7594−617 |
| rp CaPV-F2 | TACATACATTTCAAGTACTAAAGAGAAGGAA | ||||
| 3 | fp CaPV-F3 | ATGTCAACAACATTTTTGCTATTCAATG | 7639−53 | 7713−20 | 7575−6 |
| rp CaPV-F3 | TTTTGGCCAGATATTTACAATGCTATCA | ||||
| 4 | fp CaPV-F4 | GATGGACCTAATGGAGTTATTATTGAG | 7622 | 7621−2 | 7620−3 |
| rp CaPV-F4 | GAAAAATCAAATGTAAACAAACAGCTGT | ||||
| 5 | fp CaPV-F5 | ATTCCATCATTGTTTGGTATTATTCCA | 7617−8 | 7608−11 | 7597 |
| rp CaPV-F5 | ACATATCATGTAAATAATAATAACGGAACAAC | ||||
| 6 | fp CaPV-F6 | GCTGAAGAATATGAATACAATACGCTAT | 7618−22 | 7619−20 | 7631−2 |
| rp CaPV-F6 | AAAAAACAAAATTTGAAGAACCTAAATCAG | ||||
| 7 | fp CaPV-F7 | GTATTGTTCTCCAAGTTTTACATCCTT | 7557−61 | 7556−59 | 7570−1 |
| rp CaPV-F7 | CACATTTCTATTTTTAATAAATACGATTCCTTTC | ||||
| 8 | fp CaPV-F8 | ATCTCCGTTTGCTAAAAAAGATAAAGC | 7667−76 | 7675 | 7690−1 |
| rp CaPV-F8 | CATCTATGATAAATCGCACTATGGGTTTTA | ||||
| 9 | fp CaPV-F9 | TGGGACCCAAATTGTTCAGAATCTAA | 7682−97 | 7688−95 | 7701−2 |
| rp CaPV-F9 | TTTCTAACAATGGCCAAAAACGTTTATAA | ||||
| 10 | fp CaPV-F10 | ATCCCACTTAAGATAATAAGATTTTTTAGAAAC | 7549−52 | 7549−50 | 7555−6 |
| rp CaPV-F10 | TTCCTCAGATTATCCGCTAATTTATTTGA | ||||
| 11 | fp CaPV-F11 | CTCTCTAATTTTAGTTATGTTTTCATCTATCCA | 7621−39 | 7593−9 | 7603−9 |
| rp CaPV-F11 | AGAAAGGAATATTATATGCCCTATAGATATAGA | ||||
| 12 | fp CaPV-F12 | CCCAGATAAAGACGCAATAAGTAGAT | 7668−71 | 7668 | 7669 |
| rp CaPV-F12 | AACGAGTTGTTAGTCATTTGAGATAC | ||||
| 13 | fp CaPV-F13 | ACCATTCCAGAATTAAGTTTGATAATAAAC | 7698 | 7700 | 7703 |
| rp CaPV-F13 | GGGGGAAATTATTTCAGAGTTATTAGAT | ||||
| 14 | fp CaPV-F14 | TATTATCATCGTTTCCAATTAATGAATTAATTAC | 7547 | 7559−608 | 7550−6 |
| rp CaPV-F14 | CAACGAAAAACATTGATAAAATCTGATG | ||||
| 15 | fp CaPV-F15 | CTTTGATGTTGTTACCACCTTTCC | 7771−2 | 7792 | 7773 |
| rp CaPV-F15 | TTAAAAGAACAAAGTGGAATGGTAAGATAG | ||||
| 16 | fp CaPV-F16 | GAGTAAGATTTGATTTTTGAGATGCTTG | 7627−8 | 7638−45 | 7622 |
| rp CaPV-F16 | TTTTCAAACCCGTATTCATTTTTTACTG | ||||
| 17 | fp CaPV-F17 | ACTAAGGTTTGTTTTAGATAAATGGGAT | 7515−6 | 7510−5 | 7517−643 |
| rp CaPV-F17 | GTTTACAATTCTGCATATGATAGTTATATATGG | ||||
| 18 | fp CaPV-F18 | GCCATCTAACTCTATTGTTAAATCCA | 7553−65 | 7571−99 | 7518−20 |
| rp CaPV-F18 | CCTTTGCCTAAATCATCATTTTTCTC | ||||
| 19 | fp CaPV-F19a | CCRTCTATAGATATTAGAATTGTTAGTAAACC | 7567−81 | 7711−5 | 7626−7 |
| fp CaPV-F19b | CCGTCTATAGATATTAGAATTATTAGTAAACC | ||||
| rp CaPV-F19 | GTATACATGATATTAGTGCAACATTGTTATG | ||||
| 20 | fp CaPV-F20 | GTTTGTATGATGCCAGATTCAGATATTAC | 7610−77 | 7623−7 | 7576−604 |
| rp CaPV-F20 | TAAACATAGACTCTTCTTTCGGTAGAC | ||||
| 21 | fp CaPV-F21 | GTTCGGTTTCATATTTTTAGCATATTCAC | 7601−68 | 7650−707 | 7612−39 |
| rp CaPV-F21 | TGATATAAGTTTCATCCAAAAATCATATGTTG | ||||
| 22 | fp CaPV-F22 | GATTTACMCCACTTTTATCTTCTGTATATG | 7771−85 | 7782−852 | 7738−67 |
| rp CaPV-F22 | GACATATTAGATTTTGGAAATATAAGAGGTG | ||||
| 23 | fp CaPV-F23 | CCAAAAACGATTTCATTGTATAAAGAAC | 7375−9 | 7417−22 | 7279−87 |
| rp CaPV-F23 | AAACCTGTAAATGGATACTTTTTTCATTCAATC | ||||
F, fragment; fp, forward primer; rp, reverse primer; CaPV, capripoxvirus; GTPV, goatpox virus; LSDV, lumpy skin disease virus; SPPV, sheeppox virus.
expected amplicon sizes based on 9 full genome sequences of genus Capripoxvirus (GPV n = 3, LSDV n = 3, SPV n = 3) available at http://www.viprbrc.org/.
Fig. 1Schematic overview of the different sequencing strategies applied to the attenuated LSDV strain SA-Neethling in the study. ILL-DNA (orange) and ILL-PCR (blue) refer to Nextera XT Illumina sequencing of a library prepared from extracted DNA without any enrichment or from pools of 12 amplicons corresponding to half of the Capripox genome, respectively. PB (purple) refers to PacBio sequencing of a library prepared from a pool of 23 amplicons. ONT-R9 and ONT-FLG refer to MinION (Oxford Nanopore Technologies) sequencing of a library prepared from a pool of 23 amplicons on a R9.4 Flow Cell or a Flongle Flow Cell, respectively.
Fig. 21% agarose gel electrophoreses analysis of the 23 long range PCR amplicons for the attenuated LSDV strain SA-Neethling cell culture isolate. M = molecular weight marker range 250 – 10,000 bp.
Sequencing data obtained for the cell-cultured attenuated SA-Neethling LSDV strain with the different sequencing strategies applied in this study.
| Output Stats | Short-read data: Illumina | Long-read data: PacBio | ||||
|---|---|---|---|---|---|---|
| Lib-DNA | Lib1−12 | Lib12−23 | PacBio | ONT-R9.4 | ONT-FLG | |
| # bases | 120,756,296 | 159,153,635 | 171,532,167 | 1,002,374,879 | 4,342,829,196 | 38,787,394 |
| 73,537,623 | 101,446,577 | 113,149,263 | 956,205,996 | |||
| # reads | 566,987 | 701,789 | 749,787 | 71,593 | 579,266 | 5173 |
| 566,987 | 701,789 | 749,787 | 138,929 | |||
| Av read length | 212.98 | 226.78 | 228.77 | 14,001 | 7497 | 7498 |
| 129.7 | 144.55 | 150.91 | 6882 | |||
| N50 read length | 222 | 250 | 246 | 26,269 | 7543 | 7542 |
| 134 | 154 | 160 | 7752 | |||
Paired-end data is given R1 and R2, respectively. Read trimming parameters (Trim_Galore!): Quality = 30; Length = 80; clip_R1 and clip_R2 = 20.
Read and subread stats are given. Read filtering parameters (“RS_Subreads.1″ protocol in SMRT Portal): Minimum Polymerase Read Length and Mininum Subread Length = 3000 bp; Minimum Polymerase Read Quality = 0.8.
Read filtering parameters (NanoFilt): length = 6000 bp; maxlength = 8000 bp; quality = 12; headcrop = 50 bp.
De novo assembly statistics obtained from the data generated by the different sequencing strategies used to determine the coding genome of SA-Neethling. Statistics are based on contigs of size > = 5000 bp, unless otherwise noted (e.g., "# contigs (> = 0 bp)" include all contigs).
| Assembly stats | ILL-DNA | ILL-PCR | PB | ONT-R9.4 | ONT-FLG |
|---|---|---|---|---|---|
| # contigs (> = 0 bp) | 23,405 | 1 | 1 | 1 | 1 |
| # contigs | 14 | 1 | 1 | 1 | 1 |
| Largest contig | 35,563 | 150,242 | 150,280 | 150,382 | 150,231 |
| Reference | 150,396 | 150,396 | 150,396 | 150,396 | 150,396 |
| GC (%) | 29.76 | 25.91 | 25.91 | 25.90 | 25.92 |
| Reference | 25.91 | 25.91 | 25.91 | 25.91 | 25.91 |
| Genome fraction (%) | 95.86 | 99.91 | 99.93 | 99.93 | 99.93 |
| # mismatches | 24.97 | 21.96 | 22.62 | 27.28 | 25.28 |
| # indels | 11.79 | 11.31 | 11.31 | 110.45 | 56.56 |
ILL = Illumina, PB = PacBio, ONT = Oxford Nanopore Technologies.
Reference : Lumpy skin disease virus isolate LSD-220−2-NW-RSA-1993 (MN636842).
Fig. 3Depth of coverage of the attenuated LSDV strain SA-Neethling genome for all (A) Nextera XT Illumina sequencing of a library prepared from extracted DNA without any enrichment (ILL-DNA). (B) Nextera XT Illumina sequencing of a library prepared from 2 pools of 12 amplicons each corresponding to half of the Capripox genome (ILL-PCR). (C) PacBio sequencing of a library prepared from a pool of 23 amplicons (PB). (D) MinION sequencing of a library prepared from a pool of 23 amplicons on a R9.4 Flow Cell (ONT-R9). (E) MinION sequencing of a library prepared from a pool of 23 amplicons on a Flongle Flow Cell (ONT-FLG).