| Literature DB >> 31514438 |
Jan H Forth1, Leonie F Forth2, Jacqueline King3, Oxana Groza4, Alexandra Hübner5, Ann Sofie Olesen6, Dirk Höper7, Linda K Dixon8, Christopher L Netherton9, Thomas Bruun Rasmussen10,11, Sandra Blome12, Anne Pohlmann13, Martin Beer14.
Abstract
African swine fever (ASF) is a severe disease of suids caused by African swine fever virus (ASFV). Its dsDNA genome (170-194 kbp) is scattered with homopolymers and repeats as well as inverted-terminal-repeats (ITR), which hamper whole-genome sequencing. To date, only a few genome sequences have been published and only for some are data on sequence quality available enabling in-depth investigations. Especially in Europe and Asia, where ASFV has continuously spread since its introduction into Georgia in 2007, a very low genetic variability of the circulating ASFV-strains was reported. Therefore, only whole-genome sequences can serve as a basis for detailed virus comparisons. Here, we report an effective workflow, combining target enrichment, Illumina and Nanopore sequencing for ASFV whole-genome sequencing. Following this approach, we generated an improved high-quality ASFV Georgia 2007/1 whole-genome sequence leading to the correction of 71 sequencing errors and the addition of 956 and 231 bp at the respective ITRs. This genome, derived from the primary outbreak in 2007, can now serve as a reference for future whole-genome analyses of related ASFV strains and molecular approaches. Using both workflow and the reference genome, we generated the first ASFV-whole-genome sequence from Moldova, expanding the sequence knowledge from Eastern Europe.Entities:
Keywords: African swine fever virus (ASFV); Nanopore sequencing; next-generation sequencing (NGS); target enrichment; whole-genome sequencing
Mesh:
Substances:
Year: 2019 PMID: 31514438 PMCID: PMC6783980 DOI: 10.3390/v11090846
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
African swine fever virus (ASFV) whole-genome sequences available from public databases (status 11.03.2019).
| Number | Accession Number | ASFV Isolate | Country of Origin | Submission Date | Collection Date | Host | P72 Genotype | WGS Publication | Method | Coverage |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | NC_001659.2 | BA71V | Spain | 1995 | 1967 | Vero cells | I | [ | Sanger sequencing | N/A |
| 2 | AY261360.1 | Kenya 1950 | Kenya | 2003 | 1950 | Domestic pig | X | N/A | N/A | N/A |
| 3 | AY261362.1 | Mkuzi 1979 | South Africa | 2003 | 1979 | Tick | XII | N/A | N/A | N/A |
| 4 | AY261365 | Warmbaths | South Africa | 2003 | N/A | Tick | III / I | N/A | N/A | N/A |
| 5 | AY261363.1 | Pretorisuskop/96/4 | South Africa | 2003 | 1996 | Tick | XX / I | N/A | N/A | N/A |
| 6 | AY261361.1 | Malawi Lil-20/1 | Malawi | 2003 | 1983 | Tick | VIII | [ | N/A | N/A |
| 7 | AY261366.1 | Warthog | Namibia | 2003 | 1980 | Warthog | IV | N/A | N/A | N/A |
| 8 | AY261364.1 | Tengani 62 | Malawi | 2003 | 1962 | Domestic pig | V / I | N/A | N/A | N/A |
| 9 | AM712239.1 | Benin 97/1 | Benin | 2007 | 1997 | Domestic pig | I | [ | Sanger sequencing | N/A |
| 10 | AM712240.1 | OURT 88/3 | Portugal | 2007 | 1988 | Domestic pig | I | [ | Sanger sequencing | N/A |
| 11 | FN557520.1 | E75 | Spain | 2009 | 1975 | Domestic pig | I | [ | Roche 454 GS FLX, Sanger sequencing | N/A |
| 12 | FR682468.1 | Georgia 2007/1 | Georgia | 2010 | 2007 | Domestic pig | II | [ | Roche 454 GS FLX | N/A |
| 13 | KM102979.1 | 26544/OG10 | Italy (Sardinia) | 2014 | 2010 | Domestic pig | I | [ | Illumina HiScanSQ, Sanger sequencing | 20 |
| 14 | KJ747406.1 | Kashino 04/13 | Russia | 2014 | 2013 | Wild boar | II | N/A | Sanger sequencing | N/A |
| 15 | KM111295.1 | Ken06.Bus | Kenya | 2014 | 2006 | Domestic pig | X | [ | Illumina HiSeq | N/A |
| 16 | KM262844.1 | L60 | Portugal | 2014 | 1960 | Domestic pig | I | [ | Amplicon sequencing on Roche 454 GS FLX, Sanger sequencing | N/A |
| 17 | KP055815.1 | BA71 | Spain | 2014 | 1971 | Domestic pig | I | [ | Sanger sequencing | N/A |
| 18 | KM262845.1 | NHV | Spain | 2014 | 1968 | Domestic pig | I | [ | Amplicon sequencing on Roche 454 GS FLX, Sanger sequencing | N/A |
| 19 | KM111294.1 | Ken05/Tk1 | Kenya | 2015 | 2005 | Tick | IX | [ | Illumina HiSeq | N/A |
| 20 | KP843857.1 | Odintsovo_02/14 | Russia | 2015 | 2014 | Wild boar | II | N/A | Roche 454 GS FLX | N/A |
| 21 | LP643842.1 | Patent WO2015091322 | N/A | 2015 | N/A | N/A | N/A | N/A | N/A | N/A |
| 22 | KX354450.1 | 47/Ss/2008 | Italy (Sardinia) | 2016 | 2008 | Domestic pig | I | [ | Illumina MiSeq; PacBio | N/A |
| 23 | MG939585.1 | Pol16_20540_o10 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 24 | MG939587.1 | Pol17_03029_C201 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 25 | MG939583.1 | Pol16_20186_o7 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 26 | MG939588.1 | Pol17_04461_C210 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 27 | MG939584.1 | Pol16_20538_o9 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 28 | MG939586.1 | Pol16_29413_o23 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 29 | MG939589.1 | Pol17_05838_C220 | Poland | 2018 | 2016/2017 | Sus scrofa | II | [ | Illumina MiSeq | 20-40 |
| 30 | MH681419.1 | ASFV/POL/2015/Podlaskie | Poland | 2018 | 2015 | Wild boar | II | [ | Illumina MiSeq | 103 |
| 31 | MH766894.1 | ASFV-SY18 | China | 2018 | 2018 | Domestic pig | II | N/A | N/A | N/A |
| 32 | MH025918.1 | R25 | Uganda | 2018 | 2015 | Domestic pig | IX | [ | Illumina NextSeq 500 | 869 |
| 33 | MH025920.1 | R35 | Uganda | 2018 | 2015 | Domestic pig | IX | [ | Illumina NextSeq 500 | 1487 |
| 34 | MH025917.1 | R7 | Uganda | 2018 | 2015 | Domestic pig | IX | [ | Illumina NextSeq 500 | 439 |
| 35 | MH025916.1 | R8 | Uganda | 2018 | 2015 | Domestic pig | IX | [ | Illumina NextSeq 500 | 309 |
| 36 | MH025919.1 | N10 | Uganda | 2018 | 2015 | Domestic pig | IX | [ | Illumina NextSeq 500 | 23 |
| 37 | LS478113.1 | Estonia 2014 | Estonia | 2018 | 2014 | Domestic pig | II | [ | Illumina MiSeq | 100 |
| 38 | MH910495.1 | Georgia 2008/1 | Georgia | 2018 | 2008 | Domestic pig | II | [ | Illumina MiSeq | 8.5 |
| 39 | MH910496.1 | Georgia 2008/2 | Georgia | 2018 | 2008 | Domestic pig | II | [ | Illumina MiSeq | 118 |
| 40 | MK128995.1 | China/2018/AnhuiXCGQ | China | 2019 | 2018 | Domestic pig | II | [ | BGISEQ-500 | 271 |
| 41 | LR536725.1 | Belgium 2018/1 | Belgium | 2019 | 2018 | Wild Boar | II | [ | Illumina MiSeq | 292 |
N/A: data not available; WGS: whole-genome sequence.
Figure 1Workflow for the enrichment of ASFV-specific target sequences using MyBaits® (Arbor Bioscience) prior to Illumina MiSeq sequencing. Extracted DNA including ASFV target sequences (red) is fragmented and library is prepared by adding Illumina specific sequencing adapters (yellow) (A). After initial denaturation (95 °C), the library is cooled to hybridisation temperature (65 °C) and adapter specific blocking oligos (green) are added to prevent the re-hybridisation of adapter sequences during bait hybridisation (B). Biotinylated ASFV-specific RNA baits are added to the blocked library, and the reaction is incubated at hybridisation temperature for 16 h (B). Streptavidin-coated magnetic beads are added and bind to biotinylated RNA baits hybridised to ASFV target sequences (C). After magnetic separation and washing (C), ASFV target sequences are eluted from the baits and amplified (D). ASFV read-enriched libraries are sequenced on an Illumina MiSeq, and the resulting data are analysed in silico (E).
Comparison of shotgun and target enrichment sequencing approaches for the generation of ASFV whole-genome sequences.
| ASFV | Sample Type | Library Number | Sequencing Mode | Total Reads | Total ASFV Reads | % ASFV Reads | Mean Coverage |
|---|---|---|---|---|---|---|---|
| Georgia 2007/1 | Cell culture supernatant | lib02645 | shotgun | 1,764,078 | 8309 (8150) | 0.47 (0.46) | 12.7 (12.5) |
| lib02645 | shotgun | 7,317,744 | 36,268 (33,454) | 0.5 (0.46) | 56.5 (52.1) | ||
| lib02679 | myBaits | 67,174 | 44,862 (40,234) | 66.78 (59.89) | 57.2 (51.7) | ||
| Moldova 2017/1 | Spleen | lib02487 | myBaits | 829,408 | 690,206 (207,763) | 83.89 (25.0) | 1055 (317) |
| lib02577 | shotgun | 8,232,518 | 4042 (3986) | 0.05 (0.048) | N/A |
() Number/percentage of unique reads and coverage of the corresponding ASFV whole-genome sequence are given in brackets.
Figure 2Coverage of ASFV Georgia 2007/1 with ASFV-specific reads from the different sequencing strategies. Blue line, coverage Illumina (left axis); red line, coverage Nanopore (right axis); green line, coverage total (Illumina + Nanopore) (left axis). Indicated below the figure are regions showing low coverage due to inverted terminal repeats (ITR) or G/C homopolymer stretches (red).
Figure 3Phylogenetic tree showing all available ASFV whole-genome sequences. The maximum-likelihood (ML) tree was constructed using IQ-TREE v1.6.5 based on MAFFT v7.388 aligned ASFV whole-genome sequences. Standard model selection was used, resulting in the best-fit model K3Pu + F + R3 (three substitution types model + empirical base frequencies + FreeRate model with 3 categories). Statistical support of 10,000 ultrafast bootstraps is indicated at the nodes. Taxon names include, where available, ASFV-designation and INSDC accession number.
Figure 4Comparison of sequencing workflows used in this study. (A) Overview of the possible strategies tested here for sequencing. (1) Illumina shotgun library preparation plus target enrichment prior to Illumina sequencing; (2) pure Illumina shotgun sequencing; (3) Nanopore sequencing. (B) Decision tree derived from the experiences in this study. Depending on the known or expected virus/host ratio and genome complexity, different workflows should be chosen. In this study, ASFV Georgia 2007/1 was treated as being a low virus/host ratio and high genome complexity sample and hence was sequenced according to all strategies (1/2/3), while ASFV Moldova 2017/1 had a low virus/host ratio but a suitable reference sequence was available, and hence target enrichment was sufficient, so only workflow (1) had to be applied.