Literature DB >> 30563591

Assessment of metagenomic Nanopore and Illumina sequencing for recovering whole genome sequences of chikungunya and dengue viruses directly from clinical samples.

Liana E Kafetzopoulou1,2, Kyriakos Efthymiadis3, Kuiama Lewandowski1, Ant Crook1, Dan Carter1,2, Jane Osborne4, Emma Aarons4, Roger Hewson1,2, Julian A Hiscox2,5, Miles W Carroll1,2, Richard Vipond1,2, Steven T Pullan1,2.   

Abstract

BackgroundThe recent global emergence and re-emergence of arboviruses has caused significant human disease. Common vectors, symptoms and geographical distribution make differential diagnosis both important and challenging. AimTo investigate the feasibility of metagenomic sequencing for recovering whole genome sequences of chikungunya and dengue viruses from clinical samples.MethodsWe performed metagenomic sequencing using both the Illumina MiSeq and the portable Oxford Nanopore MinION on clinical samples which were real-time reverse transcription-PCR (qRT-PCR) positive for chikungunya (CHIKV) or dengue virus (DENV), two of the most important arboviruses. A total of 26 samples with a range of representative clinical Ct values were included in the study.ResultsDirect metagenomic sequencing of nucleic acid extracts from serum or plasma without viral enrichment allowed for virus identification, subtype determination and elucidated complete or near-complete genomes adequate for phylogenetic analysis. One PCR-positive CHIKV sample was also found to be coinfected with DENV. ConclusionsThis work demonstrates that metagenomic whole genome sequencing is feasible for the majority of CHIKV and DENV PCR-positive patient serum or plasma samples. Additionally, it explores the use of Nanopore metagenomic sequencing for DENV and CHIKV, which can likely be applied to other RNA viruses, highlighting the applicability of this approach to front-line public health and potential portable applications using the MinION.

Entities:  

Keywords:  chikungunya; chikungunya virus; dengue; dengue fever; dengue virus; metagenomic; molecular methods; nanopore; surveillance; typing; vector-borne infections; viral infections

Mesh:

Substances:

Year:  2018        PMID: 30563591      PMCID: PMC6299504          DOI: 10.2807/1560-7917.ES.2018.23.50.1800228

Source DB:  PubMed          Journal:  Euro Surveill        ISSN: 1025-496X


Introduction

Arboviruses are predominantly RNA viruses that replicate in haematophagous (blood-sucking) arthropod vectors such as ticks, mosquitoes and other biting flies to maintain their transmission cycle [1]. Human disease outbreaks caused by arboviruses have increased in prevalence since the 2000’s, led by the spread of mosquito-borne arboviruses such as chikungunya (CHIKV), dengue (DENV), West Nile (WNV), yellow fever (YFV) and Zika (ZIKV) viruses across both hemispheres [2]. CHIKV and DENV are of particular global health concern, as they have lost the need for enzootic amplification and consequently have caused extensive epidemics [3]. CHIKV is a single-stranded positive-sense RNA virus of the alphavirus genus, which causes the debilitating arthritic disease, chikungunya [4]. It has spread globally and been designated a serious emerging disease by the World Health Organization [5]. Outbreaks of CHIKV since 2005 have been associated with increased morbidity and possibly mortality [6,7]. DENV, which causes dengue, is a single-stranded positive-sense RNA virus of the flavivirus genus and the most prevalent human arboviral pathogen. Dengue occurs following infection with one of four DENV serotypes (DENV1–4). A minority of cases develop acute haemorrhagic manifestations and multi-organ failure. Despite DENV cases being under-reported, a 143.1% increased global incidence was estimated between 2005 and 2015 [8]. Approximately 500,000 DENV infected patients worldwide require hospitalisation annually [9]. Both CHIKV and DENV are predominantly transmitted to humans via Aedes species mosquitoes, particularly Ae. aegypti and Ae. albopictus [10,11], and share clinical presentations of arthralgia, headache, high fever, myalgia and rash. Circulation of CHIKV, DENV (and other arboviruses) in the same areas leads to challenges in differential diagnosis, especially in endemic regions in which diagnosis is predominantly symptom-based [12]. Additionally, reports of arboviral coinfections are increasingly common [13-16]. Metagenomic RNA sequencing allows for identification of multiple pathogens within a sample in a non-targeted and unbiased manner. It has identified causative agents in outbreaks, e.g. Lujo virus in South Africa [17], Bundibugyo ebolavirus in Uganda [18] and lead to novel virus discovery such as a rhabdovirus causing haemorrhagic fever in central Africa [19]. It also provides genomic information for typing and surveillance. Real-time genomic surveillance was facilitated on-site by the portable Oxford Nanopore MinION sequencer during the 2014–16 Ebola virus (EBOV) epidemic in West Africa and the 2015-16 ZIKV outbreak in the Americas [20-23] for epidemiological and transmission chain investigations [24]. In both examples, an amplicon sequencing approach was used, but viruses and bacteria from clinical, environmental and vector samples have been sequenced using metagenomic approaches on the MinION [25-28]. Metagenomic sequencing of CHIKV was demonstrated in principle on the MinION by Greninger et al. in 2015 reporting the detection of CHIKV from a human blood sample [28]. Additionally, Illumina-based metagenomics identified CHIKV coinfections within a ZIKV sample cohort [29], with the high proportion of CHIKV reads present making it a promising target for the approach. In this study we set out to test the feasibility of direct metagenomic sequencing of DENV and CHIKV genomes from a cohort of clinical serum and plasma samples across a representative range of viral loads. The objective was to assess the proportion of viral nucleic acid relative to patient/background present in each sample and determine the sequencing limits for whole genome retrieval using both the laboratory-based Illumina technology and the portable MinION platform.

Methods

Sample collection and nucleic acid extraction

Twenty-six routine diagnostic samples, nine plasma and 17 serum, were obtained from the Rare and Imported Pathogens Laboratory (RIPL), Public Health England (PHE), Porton Down. All had previously tested positive by real-time reverse transcription-PCR (qRT-PCR) for chikungunya or dengue virus, with a maximum cut-off value of cycle threshold (Ct) 35. These samples had been selected based on their Ct values, among a larger set of 441 samples, so as to represent a Ct clinical range. Total nucleic acid was extracted from 140 μL of each using the QIAamp viral RNA kit (Qiagen, Hilden, Germany) replacing carrier RNA with linear polyacrylamide and eluting in 60 µL elution buffer provided in the kit, followed by treatment with TURBO DNase (Thermo Fisher Scientific, Waltham, United States (US)) at 37 °C for 30 min. RNA was purified and concentrated to 8 μL using the RNA Clean and Concentrator-5 kit (Zymo Research, Irvine, US).

Molecular confirmation and quantification

Drosten et al. [30] and Edwards et al. [31] RT-PCR assays were used for confirmation of DENV and CHIKV respectively. RNA oligomers were used as standards for genome copy quantitation.

Metagenomic cDNA reparation

Complementary DNA (cDNA) was prepared using a Sequence Independent Single Primer Amplification (SISPA) approach adapted from Greninger et al. [28]. Reverse transcription and second strand cDNA synthesis were as described [28]. cDNA amplification was performed using AccuTaq LA (Sigma, Poole, United Kingdom), in which 5μL of cDNA and 1 μL (100 pmol/μL) Primer B (5′-GTTTCCCACTGGAGGATA-3′) were added to a 50 μL reaction, according to manufacturer's instructions. PCR conditions were 98 °C for 30s, followed by 30 cycles of 94 °C for 15 s, 50 °C for 20 s, and 68 °C for 5 min, and a final step of 68 °C for 10 min. Amplified cDNA was purified using a 1:1 ratio of AMPure XP beads (Beckman Coulter, Brea, California (CA)) and quantified using the Qubit High Sensitivity dsDNA kit (Thermo Fisher, Waltham, US).

MinION library preparation and sequencing

MinION sequencing libraries were prepared using total amplified cDNA of each sample to a maximum of 1 µg. Oxford Nanopore kits SQK-NSK007 or SQK-LSK208 (2D), SQK-LSK308 (1D2) and SQK-RBK001 (Rapid) were used and each sample was run individually on the appropriate flow cell (FLO-MIN105, FLO-MIN106 or FLO-MIN107) using the 48hr run script. Base calling was performed using Metrichor (ONT) for SQK-NSK007 and SQK-LSK208 or Albacore v1.2 for SQK-LSK308 and SQK-RBK001. Poretools [32] was used to extract FASTQ files from Metrichor FAST5 files.

Illumina library preparation and sequencing

Nextera XT V2 kit (Illumina) sequencing libraries were prepared using 1.5 ng of amplified cDNA as per manufacturer's instructions. Samples were multiplexed in batches of a maximum of 16 samples per run and sequenced on a 2x150 bp-paired end Illumina MiSeq run, by Genomics Services Development Unit, PHE.

Data handling

BWA MEM v0.7.15 [33] was used to align reads to the following references (GenBank ID): DENV Serotype 1 (NC_001477.1), DENV Serotype 2 (NC_001474.2), DENV Serotype 3 (NC_001475.2), DENV Serotype 4 (NC_002640.1) and CHIKV (NC_004162.2) using -x ont2d mode for Nanopore and MEM defaults for Illumina reads. Samtools v1.4 [34] was used to compute percentage reads mapped and coverage depth. Bedtools v2.26.0 [35] was used to calculate genome coverage at 10x and 20x. Mapping consensuses for Illumina were generated using in-house software QuasiBam [36] and for MinION using a simple pileup with bases called at a minimum depth of 20x and 70% support fraction. Nanopolish variants [24,37] was used in consensus mode to compute an error-corrected consensus sequence from the Rapid kit data. Taxonomic classification was performed using Kraken (0.10.4-beta) [38] and a locally built database populated with all RefSeq bacterial, viral, and archaeal genomes plus additional sequences [39]. De novo assemblies were generated using Spades 3.8.2 [40] in combination with SSPACE Standard v3.0 [41] for Illumina generated sequences and Canu v1.6 [41,42] for Nanopore sequences (settings: corOutCoverage = 1,000; genomeSize = 12,000; minReadLength = 300, minOverlapLength = 50). Consensus sequences for all samples tested are available in Genbank, raw fast5 files from 1D2 and 1D data (viral reads only) are deposited in SRA (Both under BioProject PRJNA508296).

Results

Metagenomic Illumina sequencing

A total of 73 samples tested during 2016 in RIPL diagnostic laboratories, PHE Porton Down, were positive by qRT-PCR for CHIKV, and 368 were positive for DENV. Median Ct for CHIKV was 26.1, for DENV it was 26.8. For each virus, samples representing the range of viral titres seen during 2016 were selected, based on qRT-PCR Ct value (Figure 1). CHIKV samples selected (n = 14) ranged from Ct 14.72 to Ct 32.57, corresponding to 1010 and 105 genome copies per mL of plasma or serum. DENV samples selected (n = 12) ranged from Ct 16.29 to Ct 31.29, corresponding to 109 and 105 genome copies per mL (Table 1). To measure the proportion of viral nucleic acid present relative to host/background and assess genome coverage, all samples were processed as described in methods and Illumina sequenced (Table 1). The proportion of total reads mapping to the respective viral reference was high for both viruses (Figure 2). In some low Ct samples, over 90% of reads mapped to the viral reference and proportions over 50% were still observed at mid-Ct range. The lowest proportions observed were 5.03% and 0.47% for CHIKV and DENV respectively (Table 1, Figure 2). The majority of samples returned over 95% genome coverage at 20x (21/26 samples) and over 98% genome coverage at 10x (20/26 samples). Irrespective of lower mapping percentages in high Ct value samples, genome coverage of 88.5% (20x) and 89.4% (10x) for CHIKV and 75.0% (20x) and 77.8% (10x) for DENV was observed.
Figure 1

Cycle threshold (Ct) values distribution of chikungunya (n = 73) and dengue virus (n = 368) positive samples from the Rare and Imported Pathogens Laboratory, Public Health England, United Kingdom, 2016 (n = 441 total samples)

Table 1

Description of samples positive for chikungunya and dengue virus by real-time reverse transcription-PCR with corresponding Illumina mapping data, United Kingdom, 2017a (n = 26 samples)

SampleCt valueEstimated genome copy number in the sample (/mL)Sample typeTotal reads(R1 + R2)b % reads mapping to reference viral genome% 20x coverage% 10x coverageReference virusc Referencesize (nts)
CHIKV 114.722.12E + 10Plasma1,113,56078.3299.5999.72CHIKV11,826
CHIKV 220.065.49E + 08Serum1,278,62498.4899.1499.47CHIKV11,826
CHIKV 321.412.18E + 08Plasma1,391,25895.2398.8699.37CHIKV11,826
CHIKV 422.748.76E + 07Plasma888,96819.1697.0897.32CHIKV11,826
CHIKV 523.774.33E + 07Plasma1,357,60697.1399.1699.58CHIKV11,826
CHIKV 625.41.42E + 07Serum3,236,84834.8897.8098.40CHIKV11,826
CHIKV 725.761.11E + 07Plasma3,748,07072.7799.0499.56CHIKV11,826
CHIKV 828.172.13E + 06Plasma1,499,95228.4198.6999.00CHIKV11,826
CHIKV 930.085.76E + 05Serum1,035,0266.6695.9898.22CHIKV11,826
CHIKV 1030.374.72E + 05Serum1,575,22216.8497.3998.01CHIKV11,826
CHIKV 1130.663.87E + 05Serum1,143,05413.5295.3696.96CHIKV11,826
CHIKV 1230.953.17E + 05Serum1,507,38010.9396.1196.52CHIKV11,826
CHIKV 1332.21.35E + 05Serum1,323,9205.0388.4789.38CHIKV11,826
CHIKV 1432.571.05E + 05Serum1,479,40421.7296.3296.93CHIKV11,826
DENV 116.294.21E + 09Plasma439,29293.4499.5199.58DENV 110,735
DENV 216.852.83E + 09Serum513,47292.5699.4099.58DENV 110,735
DENV 317.671.58E + 09Plasma738,81492.5399.5899.58DENV 210,723
DENV 418.201.09E + 09Serum477,36893.9798.7399.12DENV 210,723
DENV 519.733.67E + 08Serum91555489.6599.1499.40DENV 210,723
DENV 621.223.61E + 07Serum3,587,92683.8799.6899.69DENV 410,649
DENV 723.762.11E + 07Serum4,146,6782.1786.9989.13DENV 110,735
DENV 824.81.01E + 07Serum777,26469.2399.5699.58DENV 310,707
DENV 925.287.17E + 06Plasma787,72826.9798.7798.81DENV 210,723
DENV 1026.982.15E + 06Serum596,2406.5893.4793.97DENV 310,707
DENV 1128.696.39E + 05Serum1,034,6983.7394.4494.70DENV 110,735
DENV 1231.291.01E + 05Serum1,374,7660.4771.4677.76DENV 110,735

CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus.

a The Illumina mapping data presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016.

b ‘R1 + R2’ indicates paired-end sequencing.

c For DENV the serotype is also indicated.

Figure 2

Proportion of reads mapping to the appropriate viral reference sequence and proportion of reference genome sequenced at minimum 20-fold coverage in each chikungunya or dengue virus positive sample, United Kingdom, 2017a (n = 26 samples)

Cycle threshold (Ct) values distribution of chikungunya (n = 73) and dengue virus (n = 368) positive samples from the Rare and Imported Pathogens Laboratory, Public Health England, United Kingdom, 2016 (n = 441 total samples) CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus. The 14 CHIKV and 12 DENV samples selected for this work are indicated by circles. For each virus, the median Ct value of positive samples by quantitative real-time PCR is shown (horizontal line inside the box), as well as 25th and 75th percentiles (box lower and upper boundaries) and total range (whiskers). CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus. a The Illumina mapping data presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. b ‘R1 + R2’ indicates paired-end sequencing. c For DENV the serotype is also indicated. Proportion of reads mapping to the appropriate viral reference sequence and proportion of reference genome sequenced at minimum 20-fold coverage in each chikungunya or dengue virus positive sample, United Kingdom, 2017a (n = 26 samples) Ct: cycle threshold. a The Illumina data presented in the figure were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. The percentage of total reads mapping to the appropriate reference sequence is plotted in the upper panel. Lower panels display the percentage of the reference genome sequenced to a minimum depth of 20-fold in the Illumina data.

Metagenomic MinION sequencing

Four representative samples for each virus were selected for Nanopore sequencing (Table 2).
Table 2

Description of chikungunya and dengue virus positive samples by real-time reverse transcription-PCR and corresponding Nanopore sequencing data, United Kingdom, 2017a (n = 8 samples)

SampleCt valuecDNA amount used for the library (ng)Sequencing kit(2D kit version)Flow cell(FLO-)1D total bp1D totalreads1D meanread length (nt)1D maxread length (nt)
CHIKV 114.7431.5SQK-NSK007MIN1051.51E + 08267,17156492,712
CHIKV 321.4928.8SQK-LSK208MIN1061.63E + 091,891,02886299,031
CHIKV 422.7113.4SQK-NSK007MIN1051.74E + 08216,493805125,387
CHIKV 930.1212.4SQK-LSK208MIN1062.12E + 093,481,358608121,711
DENV 116.31,626.0SQK-NSK007MIN1052.42E + 08284,622851115,494
DENV 216.91,626.0SQK-NSK007MIN1051.55E + 08203,70076052,157
DENV 621.2475.0SQK-LSK208MIN1061.22E + 091,377,721886118,733
DENV 1128.765.8SQK-LSK208MIN1067.07E + 081,111,566636119,438

Ct: cycle threshold.

a The Nanopore data presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016.

Ct: cycle threshold. a The Nanopore data presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. Figure 3 shows percentages of reads mapping to viral reference, which were generally concordant with the Illumina data, although a slight decrease is observed across the range of Ct values. In the Nanopore data, the highest mapped read percentages observed were 85.12% and 72.14% for CHIKV3 and DENV 2 respectively, compared with 95.23% and 92.56% in the Illumina data from the same samples. While in high Ct samples the viral proportion drops to 4.08% for CHIKV 9 and 2.90% for DENV 11, from 6.66% and 3.73% in the Illumina data.
Figure 3

Comparison of Nanopore and Illumina results, as to proportions of reads mapping to the appropriate reference viral sequence, and proportions of reference genome sequenced at minimum 20-fold coverage, United Kingdom, 2017a (n = 8 samples)

Comparison of Nanopore and Illumina results, as to proportions of reads mapping to the appropriate reference viral sequence, and proportions of reference genome sequenced at minimum 20-fold coverage, United Kingdom, 2017a (n = 8 samples) Ct: cycle threshold; TC: template/complement. a The Nanopore and Illumina data presented in the figure were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. The percentage of total reads mapping to the appropriate reference sequence is plotted in the upper panel. Lower panels display the percentage of the reference genome sequenced to a minimum depth of 20-fold in the data generated, in dark blue or dark green for the Illumina sequence data, in light blue or light green for Nanopore data (MinION_TC). Despite the decrease in proportion of mapped viral reads, comparable genome coverage is observed at both 20x and 10x (Figure 3, Table 3) and is even increased compared with Illumina data at lower viral titres, e.g. 100% at 20x for CHIKV 9 compared with 95.98% in the Illumina data and 95.25% for the high Ct DENV 11 sample, which generated 94.44% coverage from the Illumina data. Average read lengths in Nanopore data ranged from 564 to 886 bp (Table 2).
Table 3

Summary of Nanopore mapping data on chikungunya and dengue virus positive samples by real-time reverse transcription-PCR, United Kingdom, 2017a (n = 8 samples)

SampleCt valueTotalreads% reads mapping to appropriate viral sequence% 20xcoverage20x genomelength (nt)% 10x coverageReferenceb Reference size (nt)Max de novo contig (nt)
CHIKV 114.7267,17165.198.5711,65899.2CHIKV11,8265,263
CHIKV 321.41,891,02885.199.7611,79899.9CHIKV11,82610,793
CHIKV 422.7216,49311.694.1111,13097.2CHIKV11,8264,256
CHIKV 930.083,481,3584.0810011,826100CHIKV11,8269,860
DENV 116.3284,62271.399.910,71999.9DENV 110,7358,281
DENV 216.9203,70072.199.610,69299.6DENV 110,73510,157
DENV 621.21,377,72171.199.910,63499.9DENV 410,6497,877
DENV 1128.71,111,5662.995.310,22696.3DENV 110,7354,699

CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus.

a The Nanopore data presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016.

b For DENV the serotype is also indicated.

CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus. a The Nanopore data presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. b For DENV the serotype is also indicated. Figure 4 shows coverage depth of reads mapped across the relevant genome for each sample sequenced by both Illumina and Nanopore. Read levels are not normalised thus actual depth is a function of total reads sequenced, but the pattern of coverage seen is highly similar suggesting it is more dependent upon the SISPA methodology than sequencing library preparation. From Nanopore consensus genome sequences, between 99.93% and 100% of bases called per sample agreed with the Illumina generated sequence.
Figure 4

Coverage depth across the chikungunya or dengue viral genome, United Kingdom, 2017a (n = 8 samples)

Coverage depth across the chikungunya or dengue viral genome, United Kingdom, 2017a (n = 8 samples) Ct: cycle threshold. a The Nanopore and Illumina data presented in the figure were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. Each graph corresponds to a given sample, defined by its Ct value. Read depth (y-axis) across the genome (x-axis) following reference alignment is shown. Illumina coverage is shown in darker blue and darker green for chikugunya and dengue virus positive samples respectively. Nanopore (MinION) coverage is indicated in lighter blue or lighter green for chikungunya and dengue virus positive samples respectively. Total depth has not been normalised; comparison is to show overall pattern of coverage is highly similar across the methods. Dotted horizontal line indicates depth of 20x coverage, used for consensus calling.

Metagenomic data analysis and coinfection identification

To test the applicability of a metagenomic analysis approach to the data, we assessed read taxonomic classification using Kraken (Figure 5). The distribution of reads classified as CHIKV, DENV, other viruses, bacteria, and archaea/eukaryota show a similar pattern for Illumina and Nanopore data. The proportion of unclassified reads for each sample increased with Ct value, as the proportion of human origin reads is higher and the human genome is not represented in our Kraken database. A decrease in the percentage of CHIKV and DENV classified reads is observed for MinION data compared with Illumina, but was sufficient to identify the correct predominant virus in all samples.
Figure 5

Kraken classification of reads from metagenomic sequencing in (A) chikungunya and (B) dengue real-time reverse transcription-PCR positive samples, United Kingdom, 2017a (n = 8 samples)

Kraken classification of reads from metagenomic sequencing in (A) chikungunya and (B) dengue real-time reverse transcription-PCR positive samples, United Kingdom, 2017a (n = 8 samples) Ct: cycle threshold. a Read taxonomic classification using Kraken was conducted in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. Kraken classification distribution comparison for Illumina (cross-hatched) and Nanopore data. Reads grouped as either chikungunya virus (blue in panel A), dengue virus (green in panel B), other viruses (brown), archea/eukaryota (orange), bacteria (brown) or unclassified (grey). Kraken analysis also allowed for the identification of a DENV coinfection in sample CHIKV 3, the consensus sequence of which was unique in the sample set, eliminating cross-contamination from the DENV positive samples as potential source. Kraken classified 0.08% of Illumina reads and 0.15% of MinION reads as DENV. Using reference mapping to validate the finding, 0.22% of Illumina reads and 0.43% of MinION reads mapped to a DENV-1 reference genome. Genome coverage at 20x of 99.73% and 95.99% was achieved for the primary CHIKV and secondary DENV coinfection respectively, with a single MinION flow cell.

De novo assembly

De novo assembly of the data was attempted using Canu [42] and contigs identified using Basic Local Alignment Search Tool against a Nt database (BLASTn). Table 3 lists the longest viral contig length identified in each sample, ranging from 4.2 Kb (36% of reference genome size) to 10.8 Kb (91%) for CHIKV and 4.7 Kb (44%) to 10.1 Kb (95%) for DENV. Identification of the pathogen present without prior knowledge would have therefore been possible for all samples.

Updated MinION library kits

We repeated the sequencing of the coinfected CHIKV 3 sample using the MinION 1D2 (SQK-LSK308) and Rapid (SQK-RBK001) kits, currently the most accurate and the fastest library preparation kits available, respectively. Using the 1D2 kit 74.5% of reads generated mapped to CHIKV and 0.37% to DENV, while from the Rapid kit the result was 66.26% and 0.29% respectively (both lower than observed in the 2D chemistry). Coverage at 20x for CHIKV was above 99% for both kits, and for DENV was 95.04% from the 1D2 and 81.09% from the Rapid kit (Table 4). Coverage depth pattern across the genome for both viruses (Figure 6) was similar for all library kits tested. Near-maximum coverage for both viruses was obtained within 30 min with the 2D kit, 8 min with the 1D2 kit and 85 min with the Rapid kit (Supplementary Figure 1). De novo assembly (Table 4) produced best CHIKV contigs of 10.7, 11.3 and 11.4 Kb for the 2D, 1D2 and Rapid libraries respectively and the longest contigs generated for DENV were 7.5, 2.2 and 4.2 Kb.
Table 4

Comparison of Nanopore mapping data across library kits, United Kingdom, 2017a (n = 8 samples)

PlatformKitinformationFlow cell(FLO-)VirusidentifiedTotalreads (nt)% readsmapping% 20xcoverage% 10xcoverageReferenceb Referencesize (nt)Maxde novo contig (nt)
IlluminaNextera XTNACHIKV1,391,25895.2398.8699.37CHIKV11,8267,321
IlluminaNextera XTNADENV1,391,2580.2263.6677.82DENV110,7356,613
MinION 2DSQK-LSK208MIN106CHIKV1,891,02885.1299.7399.91CHIKV11,82610,793
MinION 2DSQK-LSK208MIN106DENV1,891,0280.4395.9996.09DENV110,7357,549
MinION 1D2 SQK-LSK308MIN107CHIKV5,080,90674.5099.94100CHIKV11,82611,369
MinION 1D2 SQK-LSK308MIN107DENV5,080,9060.3795.0496.42DENV110,7352,199
MinION RapidSQK-RBK001MIN106CHIKV611,11066.2699.6699.68CHIKV11,82611,473
MinION RapidSQK-RBK001MIN106DENV611,1100.2981.0990.83DENV110,7354,227

CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus. NA: not applicable.

a Results presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016.

b For DENV the serotype is also indicated.

Figure 6

Comparison of genome coverage depth across the chikungunya virus or dengue virus genome for different sequencing library preparation methods in a sample coinfected with dengue and chikungunya viruses, United Kingdom, 2017a (n = 1 sample)

CHIKV: chikungunya virus; Ct: cycle threshold; DENV: dengue virus. NA: not applicable. a Results presented in the table were obtained in 2017 on samples that had been collected and found positive for chikungunya or dengue virus by real-time reverse transcription-PCR in 2016. b For DENV the serotype is also indicated. Comparison of genome coverage depth across the chikungunya virus or dengue virus genome for different sequencing library preparation methods in a sample coinfected with dengue and chikungunya viruses, United Kingdom, 2017a (n = 1 sample) a Results presented in the Figure were obtained in 2017 on a sample that had been collected and found positive for chikungunya virus by real-time reverse transcription-PCR in 2016. In 2017, the sample was further found to be coinfected with CHIKV and DENV by metagenomic sequencing. Read depth across both CHIKV and DENV genomes following reference alignment is shown for coinfection sample CHIKV 3, sequenced using four different sequencing library preparation/sequencing methods. Total coverage depth has not been normalised; comparison is to show overall pattern of coverage is highly similar across the methods. Dotted horizontal line indicates depth of 20x coverage, used for consensus calling. The 1D data from the Rapid kit was sufficient to call a consensus from 11,647/11,826 bases of the CHIKV reference with 179/11,826 bases called as ambiguous or too low coverage. All bases called were concordant with the Illumina consensus. A polishing step using Nanopolish [37] with a subset of the mapped reads (ca 100x coverage depth) significantly reduced ambiguous calls to 90/11,826, introducing a single disagreement with the Illumina consensus (99.99% concordance). Despite considerably greater read depth, the 1D2 kit called only 11,082/11,826 due to a higher proportion, 744/11826, of ambiguous base calls, suggesting 1D reads are most suitable for this approach.

Discussion

These results clearly show that there are considerable levels of viral nucleic acid present in a large proportion of CHIKV and DENV qRT-PCR positive clinical samples, and demonstrate that relatively modest metagenomic sequencing is capable of elucidating significant portions of viral genome even for samples with Ct values at the higher end of clinical range. A decreased Ct value coincided with an increased proportion of viral reads, with a considerable level of variation between samples, likely because of the total level of non-viral host/background nucleic acid present due to variability between patients or in sample handling during collection, storage and testing. For example, the two lowest viral titre CHIKV samples (13 and 14) have similar Ct values (32.2 and 32.57) but varied significantly in the proportion of viral reads (5.03% and 21.72%). The 5.03% viral reads in CHIKV13 is the lowest for CHIKV, yet still sufficient to generate 88.5% of the CHIKV genome at 20x depth from just ca 662,000 paired-end Illumina reads. This amount of genomic information is highly informative and further sequencing would likely increase coverage. Only seven of the 73 total CHIKV diagnostic samples tested in 2016 had a Ct greater than 32.2 (including sample CHIKV14) (Table 1), which suggests that for the majority (> 90%) of CHIKV PCR positive samples, viral load is sufficient for genome sequencing directly from patient samples without further viral enrichment beyond a simple DNAse digestion (Figure 1). The lowest viral read proportion observed in the DENV samples was 0.47% in DENV12, Ct 31.29, which generated 71.5% coverage at 20x depth (increased to 77.8 at 10x depth) from just 687,000 paired end Illumina reads and allowed for DENV serotype identification. Only 62 of 368 DENV cases in 2016 had a higher Ct, predicting that > 80% of PCR positive DENV samples have a viral load sufficient for genome sequencing (Figure 1). These estimates are based on Ct range distribution from a single year, results may vary from year to year. The high yield of viral sequences from clinical CHIKV and DENV samples make the exciting prospect of metagenomic MinION viral whole-genome-sequencing feasible, even for lower viral titre samples. Evaluating this on a representative subset of our samples demonstrates that viral read proportions are in general agreement with that seen for Illumina sequencing, predicting a similar proportion of qRT-PCR positive patient samples would be suitable for direct metagenomic sequencing on the MinION. Differences in precise proportions of viral reads seen between Illumina and MinION are likely due to inter-library variation. Differences in genome coverage achieved are due to both differences in total reads generated per sample (not normalised between platforms) as well as differences in average read length. Of the samples tested on the MinION, the lowest titre samples CHIKV 9 and DENV 11 both generated near complete genome coverage. We repeated the sequencing of the coinfected CHIKV 3 sample using the MinION 1D2 (SQK-LSK308) and Rapid (SQK-RBK001) kits. A reduction in viral proportion of total reads was observed compared with the 2D kit, which may be due partly to the extended storage time of the original samples before retesting. In the case of the 1D2 kit, the lower proportion was outweighed by a substantial increase in total data generated per flow cell (5 M vs 1.8 M reads). For the Rapid kit, the total data produced should be considered in the light of the greatly simplified sample workflow and turnaround-time. The use of metagenomics to elucidate genomic sequences of RNA viruses directly from clinical samples has several obvious benefits in public health applications. The primary benefit over targeted methods is the hypothesis-free nature of the assay, which allows identification and genomic characterisation of novel or unexpected RNA viral agents, either as primary or coinfectants (demonstrated here in the CHIKV/DENV coinfection sample), without any prior clinical knowledge. It also removes the need for laboratory optimisation of targeted methods, such as primer or bait-probe design and testing, and is not subject to escape mutations in target sites that afflict targeted sequencing and diagnostic methods. This issue particularly relevant for highly diverse RNA viruses, such as Lassa virus, which are difficult to assess using targeted methods, without regular reappraisal [43]. The principal limitation of the metagenomic approach is the limit of detection. The data generated here show that viral titres as low as 105 are sufficient for significant genome recovery by this method, but ZIKV is a recent example of a pathogen typically present at lower clinical titres, for which targeted methods are an absolute requirement [22,23]. For diagnostic purposes qRT-PCR has a lower limit of detection, provided the target site is conserved in the pathogen isolate tested. Clearly no single method is most suitable for both detection and genotyping of all pathogens and each has a role to play in differing circumstances. The ability to generate genomic data directly from patient samples is clearly of great benefit to public health (reviewed in detail [44]). It can be used in a routine surveillance capacity or early during suspected outbreaks to link related cases who may be missed by traditional epidemiology [45] and identify outbreak cases distinct from typically circulating seasonal strains, which is key in regions endemic for the pathogen in question. The use of whole genome sequences offers the greatest precision for these applications, compared with typing methods based on specific genomic regions [44]. Whole genome sequencing on a portable device allows this information to be generated rapidly and within the affected region [24], enabling timely identification of an outbreak, or allaying fears of a potential one if cases are not linked. Furthermore mutations relating to viral drug resistance or pathogenicity can be monitored [44]. Therefore the ability to generate near-complete viral genome sequences directly from clinical samples on a portable sequencing device has many potential applications.

Conclusions

We demonstrate that across the clinically relevant range of viral loads an unexpectedly high proportion of reads generated metagenomically from CHIKV and DENV clinical samples are viral in origin. Therefore metagenomic sequencing provides an effective approach for the analysis of CHIKV and DENV genomes directly from the majority of qRT-PCR positive serum and plasma samples, without the need for culture or viral nucleic acid enrichment beyond a simple DNA digestion. We demonstrate this is equally possible on the Oxford Nanopore MinION, making metagenomic whole genome sequencing potentially feasible in the field.
  43 in total

1.  Scaffolding pre-assembled contigs using SSPACE.

Authors:  Marten Boetzer; Christiaan V Henkel; Hans J Jansen; Derek Butler; Walter Pirovano
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

2.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

3.  BEDTools: a flexible suite of utilities for comparing genomic features.

Authors:  Aaron R Quinlan; Ira M Hall
Journal:  Bioinformatics       Date:  2010-01-28       Impact factor: 6.937

Review 4.  Present and future arboviral threats.

Authors:  Scott C Weaver; William K Reisen
Journal:  Antiviral Res       Date:  2009-10-24       Impact factor: 5.970

5.  Rapid detection and quantification of RNA of Ebola and Marburg viruses, Lassa virus, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus, dengue virus, and yellow fever virus by real-time reverse transcription-PCR.

Authors:  Christian Drosten; Stephan Göttig; Stefan Schilling; Marcel Asper; Marcus Panning; Herbert Schmitz; Stephan Günther
Journal:  J Clin Microbiol       Date:  2002-07       Impact factor: 5.948

6.  Molecular diagnosis and analysis of Chikungunya virus.

Authors:  Carolyn J Edwards; Stephen R Welch; John Chamberlain; Roger Hewson; Howard Tolley; Patricia A Cane; Graham Lloyd
Journal:  J Clin Virol       Date:  2007-07-12       Impact factor: 3.168

7.  Genetic detection and characterization of Lujo virus, a new hemorrhagic fever-associated arenavirus from southern Africa.

Authors:  Thomas Briese; Janusz T Paweska; Laura K McMullan; Stephen K Hutchison; Craig Street; Gustavo Palacios; Marina L Khristova; Jacqueline Weyer; Robert Swanepoel; Michael Egholm; Stuart T Nichol; W Ian Lipkin
Journal:  PLoS Pathog       Date:  2009-05-29       Impact factor: 6.823

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Increased mortality rate associated with chikungunya epidemic, Ahmedabad, India.

Authors:  Dileep Mavalankar; Priya Shastri; Tathagata Bandyopadhyay; Jeram Parmar; Karaikurichi V Ramani
Journal:  Emerg Infect Dis       Date:  2008-03       Impact factor: 6.883

10.  Newly discovered ebola virus associated with hemorrhagic fever outbreak in Uganda.

Authors:  Jonathan S Towner; Tara K Sealy; Marina L Khristova; César G Albariño; Sean Conlan; Serena A Reeder; Phenix-Lan Quan; W Ian Lipkin; Robert Downing; Jordan W Tappero; Samuel Okware; Julius Lutwama; Barnabas Bakamutumaho; John Kayiwa; James A Comer; Pierre E Rollin; Thomas G Ksiazek; Stuart T Nichol
Journal:  PLoS Pathog       Date:  2008-11-21       Impact factor: 6.823

View more
  36 in total

Review 1.  Long-read sequencing in deciphering human genetics to a greater depth.

Authors:  Mohit K Midha; Mengchu Wu; Kuo-Ping Chiu
Journal:  Hum Genet       Date:  2019-09-19       Impact factor: 4.132

2.  A Novel Series of Indole Alkaloid Derivatives Inhibit Dengue and Zika Virus Infection by Interference with the Viral Replication Complex.

Authors:  Christophe Pannecouque; Dominique Schols; Antonios Fikatas; Peter Vervaeke; Eef Meyen; Nuria Llor; Sergi Ordeix; Ine Boonen; Magda Bletsa; Liana Eleni Kafetzopoulou; Philippe Lemey; Mercedes Amat
Journal:  Antimicrob Agents Chemother       Date:  2021-07-16       Impact factor: 5.191

3.  "Kankasha" in Kassala: A prospective observational cohort study of the clinical characteristics, epidemiology, genetic origin, and chronic impact of the 2018 epidemic of Chikungunya virus infection in Kassala, Sudan.

Authors:  Hilary Bower; Mubarak El Karsany; Abd Alhadi Adam Hussein Adam; Mubarak Ibrahim Idriss; Ma'aaza Abasher Alzain; Mohamed Elamin Ahmed Alfakiyousif; Rehab Mohamed; Iman Mahmoud; Omer Albadri; Suha Abdulaziz Alnour Mahmoud; Orwa Ibrahim Abdalla; Mawahib Eldigail; Nuha Elagib; Ulrike Arnold; Bernardo Gutierrez; Oliver G Pybus; Daniel P Carter; Steven T Pullan; Shevin T Jacob; Tajeldin Mohammedein Abdallah; Benedict Gannon; Tom E Fletcher
Journal:  PLoS Negl Trop Dis       Date:  2021-04-30

4.  NanoSPC: a scalable, portable, cloud compatible viral nanopore metagenomic data processing pipeline.

Authors:  Yifei Xu; Fan Yang-Turner; Denis Volk; Derrick Crook
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

5.  How do advanced diagnostics support public health policy development?

Authors:  Jacob Moran-Gilad
Journal:  Euro Surveill       Date:  2019-01

6.  Proficiency Testing of Virus Diagnostics Based on Bioinformatics Analysis of Simulated In Silico High-Throughput Sequencing Data Sets.

Authors:  Annika Brinkmann; Andreas Andrusch; Ariane Belka; Claudia Wylezich; Dirk Höper; Anne Pohlmann; Thomas Nordahl Petersen; Pierrick Lucas; Yannick Blanchard; Anna Papa; Angeliki Melidou; Bas B Oude Munnink; Jelle Matthijnssens; Ward Deboutte; Richard J Ellis; Florian Hansmann; Wolfgang Baumgärtner; Erhard van der Vries; Albert Osterhaus; Cesare Camma; Iolanda Mangone; Alessio Lorusso; Maurilia Marcacci; Alexandra Nunes; Miguel Pinto; Vítor Borges; Annelies Kroneman; Dennis Schmitz; Victor Max Corman; Christian Drosten; Terry C Jones; Rene S Hendriksen; Frank M Aarestrup; Marion Koopmans; Martin Beer; Andreas Nitsche
Journal:  J Clin Microbiol       Date:  2019-07-26       Impact factor: 5.948

7.  Application of cell-free DNA sequencing in characterization of bloodborne microbes and the study of microbe-disease interactions.

Authors:  Kuo-Ping Chiu; Alice L Yu
Journal:  PeerJ       Date:  2019-08-06       Impact factor: 2.984

8.  Detection of Crimean-Congo Haemorrhagic Fever cases in a severe undifferentiated febrile illness outbreak in the Federal Republic of Sudan: A retrospective epidemiological and diagnostic cohort study.

Authors:  Hilary Bower; Mubarak El Karsany; Mazza Alzain; Benedict Gannon; Rehab Mohamed; Iman Mahmoud; Mawahib Eldegail; Rihab Taha; Abdalla Osman; Salim Mohamednour; Amanda Semper; Barry Atkinson; Daniel Carter; Stuart Dowall; Jenna Furneaux; Victoria Graham; Jack Mellors; Jane Osborne; Steven T Pullan; Gillian S Slack; Tim Brooks; Roger Hewson; Nicholas J Beeching; Jimmy Whitworth; Daniel G Bausch; Tom E Fletcher
Journal:  PLoS Negl Trop Dis       Date:  2019-07-10

9.  Increasing the accuracy of nanopore DNA sequencing using a time-varying cross membrane voltage.

Authors:  Matthew T Noakes; Henry Brinkerhoff; Andrew H Laszlo; Ian M Derrington; Kyle W Langford; Jonathan W Mount; Jasmine L Bowman; Katherine S Baker; Kenji M Doering; Benjamin I Tickman; Jens H Gundlach
Journal:  Nat Biotechnol       Date:  2019-04-22       Impact factor: 54.908

10.  Metagenomic sequencing at the epicenter of the Nigeria 2018 Lassa fever outbreak.

Authors:  E Ogbaini-Emovon; S Günther; S Duraffour; L E Kafetzopoulou; S T Pullan; P Lemey; M A Suchard; D U Ehichioya; M Pahlmann; A Thielebein; J Hinzmann; L Oestereich; D M Wozniak; K Efthymiadis; D Schachten; F Koenig; J Matjeschk; S Lorenzen; S Lumley; Y Ighodalo; D I Adomeh; T Olokor; E Omomoh; R Omiunu; J Agbukor; B Ebo; J Aiyepada; P Ebhodaghe; B Osiemi; S Ehikhametalor; P Akhilomen; M Airende; R Esumeh; E Muoebonam; R Giwa; A Ekanem; G Igenegbale; G Odigie; G Okonofua; R Enigbe; J Oyakhilome; E O Yerumoh; I Odia; C Aire; M Okonofua; R Atafo; E Tobin; D Asogun; N Akpede; P O Okokhere; M O Rafiu; K O Iraoyah; C O Iruolagbe; P Akhideno; C Erameh; G Akpede; E Isibor; D Naidoo; R Hewson; J A Hiscox; R Vipond; M W Carroll; C Ihekweazu; P Formenty; S Okogbenin
Journal:  Science       Date:  2019-01-04       Impact factor: 47.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.