| Literature DB >> 31932713 |
Xianding Deng1,2, Asmeeta Achari1,2, Scot Federman1,2, Guixia Yu1,2, Sneha Somasekar1,2, Inês Bártolo3, Shigeo Yagi4, Placide Mbala-Kingebeni5, Jimmy Kapetshi5, Steve Ahuka-Mundeke5, Jean-Jacques Muyembe-Tamfum5, Asim A Ahmed6,7, Vijay Ganesh8, Manasi Tamhankar9, Jean L Patterson9, Nicaise Ndembi10,11, Dora Mbanya12,13, Lazare Kaptue14, Carole McArthur15, José E Muñoz-Medina16, Cesar R Gonzalez-Bonilla16, Susana López17, Carlos F Arias17, Shaun Arevalo1, Steve Miller1, Mars Stone18, Michael Busch18, Kristina Hsieh4, Sharon Messenger4, Debra A Wadford4, Mary Rodgers19, Gavin Cloherty19, Nuno R Faria20, Julien Thézé20, Oliver G Pybus20, Zoraima Neto21, Joana Morais21, Nuno Taveira3,22, John R Hackett19, Charles Y Chiu23,24,25.
Abstract
Metagenomic next-generation sequencing (mNGS), the shotgun sequencing of RNA and DNA from clinical samples, has proved useful for broad-spectrum pathogen detection and the genomic surveillance of viral outbreaks. An additional target enrichment step is generally needed for high-sensitivity pathogen identification in low-titre infections, yet available methods using PCR or capture probes can be limited by high cost, narrow scope of detection, lengthy protocols and/or cross-contamination. Here, we developed metagenomic sequencing with spiked primer enrichment (MSSPE), a method for enriching targeted RNA viral sequences while simultaneously retaining metagenomic sensitivity for other pathogens. We evaluated MSSPE for 14 different viruses, yielding a median tenfold enrichment and mean 47% (±16%) increase in the breadth of genome coverage over mNGS alone. Virus detection using MSSPE arboviral or haemorrhagic fever viral panels was comparable in sensitivity to specific PCR, demonstrating 95% accuracy for the detection of Zika, Ebola, dengue, chikungunya and yellow fever viruses in plasma samples from infected patients. Notably, sequences from re-emerging and/or co-infecting viruses that have not been specifically targeted a priori, including Powassan and Usutu, were successfully enriched using MSSPE. MSSPE is simple, low cost, fast and deployable on either benchtop or portable nanopore sequencers, making this method directly applicable for diagnostic laboratory and field use.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31932713 PMCID: PMC7047537 DOI: 10.1038/s41564-019-0637-9
Source DB: PubMed Journal: Nat Microbiol ISSN: 2058-5276 Impact factor: 17.745
Fig. 1MSSPE viral primer design and metagenomic sequencing workflow.
a, An algorithm for the design of viral spiked primers (SP). Sets of viral reference genomes (n = 60–3,571 for each virus) were aligned using MAFFT multiple sequence alignment software[49], followed by the partitioning of each genome into 300–500-nt overlapping segments. Forward and reverse 13-nt primers were selected and filtered according to specific criteria (rounded rectangular box). Unique primer sequences are individually coloured in red, blue, orange and green. Using this algorithm, primers were designed for 15 RNA viruses. SP panels for ArboV (n = 4), HFV (n = 6) and CombV (n = 13, excluding HCV and JCV SP) were also constructed. b, The metagenomic sequencing workflow. MSSPE primers (red) were added (spiked) to a reaction mix containing random primers (blue) during the reverse transcription step of cDNA synthesis, without adding to the overall turnaround time for the subsequent transposase-based library amplification with adapter primers (brown) and sequencing analysis protocols. The MSSPE workflow is compatible with subsequent enrichment using tiling multiplex PCR and/or capture probes (dashed lines). Metagenomic sequence data were analysed for pathogen identification using SURPI (ref. [50]; also see Methods). MARV, Marburg virus; RVFV, Rift Valley fever virus; HEV, hepatitis E virus; and Tm, melting temperature.
Fig. 2Spiked primer enrichment of viral sequences using MSSPE.
a–c, Plots of the fold enrichment achieved for contrived samples containing ZIKV, DENV and/or EBOV at defined titres and using RH primers only or SP concentrations ranging from 1 μM to 40–80 μM. The enrichment of ZIKV and DENV using an ArboV SP panel (a). The asterisk denotes EBOV as an off-target virus when using the ArboV SP panel. The enrichment of EBOV using a HFV SP panel (b) and the enrichment of ZIKV, DENV and EBOV using a CombV SP panel (c). Dashed lines denote 1× or no enrichment. d–h, Box-and-whisker plots of the fold enrichment achieved using MSSPE compared to using RH only. The box outlines denote the IQR, the solid line in the box denotes median fold enrichment, the dashed line denotes mean (µ) fold enrichment and the whiskers outside of the box extend to the minimum and maximum fold enrichment points. The fold enrichment for DENV and ZIKV using virus-specific primers, ArboV panels or CombV panels (d). The fold enrichment for EBOV using virus-specific primers, HFV panels or CombV panels (e). The overall fold enrichment, including all 92 pairwise comparisons (with and without MSSPE) of contrived and clinical samples (f). The fold enrichment for 65 pairwise comparisons of contrived samples (g). The fold enrichment for 27 pairwise comparisons of clinical samples (h). The degree of fold enrichment at <100 cp ml−1 is significantly higher than at other titres (paired two-sided Student’s t test; P = 0.008 between groups <100 cp ml−1 and 100–1,000 cp ml−1; P = 0.0002 between groups <100 cp ml−1 and >1,000 cp ml−1).
Source Data
Viral enrichment in clinical blood samples from infected patients using MSSPE
| Virusa | Titre (cp ml−1) | MSSPE primerb | Total reads (RH) | Viral reads (RH) | Virus RPM (RH) | Total reads (SP) | Viral reads (SP) | Virus RPM (SP) | Fold change |
|---|---|---|---|---|---|---|---|---|---|
| CHIKV | 500 | ArboV | 2,749,920 | 16 | 5.8 | 2,437,971 | 100 | 41 | 7 |
| CHIKV | 210 | ArboV | 1,505,866 | 1 | 0.66 | 1,737,954 | 8 | 4.6 | 7 |
| CHIKV | 15 | ArboV | 1,530,012 | 0 | 0 | 1,482,002 | 38 | 25.6 | >25.6 |
| ZIKV | 156 | ArboV | 744,052 | 0 | 0 | 1,018,648 | 6 | 6 | >6 |
| ZIKV | 390 | ArboV | 1,376,445 | 2 | 1.45 | 1,367,684 | 12 | 8.8 | 6 |
| ZIKV | 64 | ArboV | 1,506,444 | 2 | 1.3 | 1,420,342 | 38 | 26.8 | 20.6 |
| DENV | 1,340 | ArboV | 1,040,773 | 3 | 3 | 1,341,554 | 24 | 17.9 | 6 |
| DENV | 5,500 | ArboV | 1,180,679 | 683 | 578.8 | 2,570,018 | 6,638 | 2,582.9 | 4.5 |
| DENV | 78 | ArboV | 2,277,733 | 64 | 28 | 968,431 | 388 | 400.7 | 14.3 |
| DENV | 326 | ArboV | 3,457,359 | 174 | 50.3 | 2,634,399 | 2,238 | 849 | 16.9 |
| EBOV | 35 | HFV | 1,160,376 | 1 | 0.9 | 1,883,786 | 40 | 21.2 | 23.5 |
| EBOV | 78 | HFV | 2,042,103 | 0 | 0 | 1,083,890 | 14 | 14 | >14 |
| EBOV | 83 | HFV | 1,775,183 | 2 | 1.1 | 1,816,865 | 33 | 18.2 | 16.5 |
| YFV | 68 | HFV | 1,823,776 | 24 | 13.2 | 2,364,155 | 508 | 214.9 | 16 |
| YFV | 2,150 | HFV | 2,475,206 | 26 | 10.5 | 1,309,068 | 123 | 94.6 | 9 |
| YFV | 43 | HFV | 1,963,780 | 31 | 15.8 | 2,517,501 | 574 | 228 | 14.4 |
| YFV | 2,370 | HFV | 2,456,777 | 26 | 10.6 | 3,689,583 | 346 | 93.8 | 8.8 |
| YFV | 79 | HFV | 1,168,865 | 19 | 16.3 | 1,467,611 | 550 | 374.8 | 23 |
| YFV | 228 | HFV | 1,303,652 | 5 | 3.8 | 2,441,729 | 63 | 25.8 | 6.8 |
aIndividual samples were barcoded and four to five samples were multiplexed and sequenced on a single nanopore flow cell. bArboV, ArboV SP (10 μM ArboV mixed with RH at a 10:1 ratio); HFV, HFV SP (20 μM mixed with RH at a 10:1 ratio).
Detection of untargeted emerging viruses using MSSPE
| Virus | Clinical sample type | Primer typea | No. of preprocessed readsb | No. of viral reads (RH primers) | Viral RPM (RH primers) | Genome coverage (RH primers) (%) | No. viral reads (SP) | Viral RPM (SP) | Genome coverage (SP) (%) | Increase in coverage (%)c | Fold change |
|---|---|---|---|---|---|---|---|---|---|---|---|
| USUV | Serum | ArboV SP | 122,517,964 | 114 | 0.9 | 5.5 | 845 | 6.8 | 23.0 | 17.5 | 7.5 |
| POWV | CSF | ArboV SP | 11,266,014 | 88 | 7.8 | 39.6 | 1,007 | 114.6 | 82.6 | 43 | 14.7 |
aArboV SP, ArboV SP panel at 10 µM concentration and mixed with RH at a 10:1 ratio. bThe same number of Illumina preprocessed reads were analysed from the RH and SP runs for comparison. cAbsolute percentage increase from using random primer only (coverage by SP (%) − coverage by RH (%)); a coverage of 40–60% is sufficient for genotypic and phylogenetic inference from partial genome assemblies[27].
Fig. 3Improvements in viral genome coverage using MSSPE.
a, Genome coverage of the ZIKV MRC766 (Uganda) strain (mapped to accession no. LC002520) at 1,000 cp ml−1 with no enrichment (top) or MSSPE enrichment using ZIKV SP (second from top), an ArboV SP panel (third from top) or a CombV SP panel (bottom). With no enrichment, there were 50 reads and 45% coverage; with ZIKV SP, there were 456 reads and 97.6% coverage; with ArboV SP, 528 reads and 100% coverage; with CombV SP, there were 254 reads and 93.9% coverage. b, Genome coverage of an HIV-1 Group M, CRF01 strain (mapped to accession no. KY580709) at 1,000 cp ml−1 with no enrichment (left) or using HIV-1 SP (right). With no enrichment, there were 35 reads and 23.2% coverage; with HIV-1 SP, there were 289 reads and 92.8% coverage. c, Genome coverage of an HCV genotype 4 strain (mapped to accession no. KM587625) at 10,000 cp ml−1 with no enrichment (left) or using HCV SP (right). With no enrichment, there were 63 reads and 31.5% coverage; with HCV SP, there were 686 reads and 80% coverage. d, Genome coverage of a POWV strain identified in CSF from an infected patient with tick-borne meningoencephalitis (mapped to accession no. NC_003687) at <1,000 cp ml−1 with no enrichment (left) or using the ArboV SP panel (right). With no enrichment, there were 48 reads and 37.1% coverage; with ArboV SP, there were 209 reads and 88.0% coverage. e, Genome coverage of a contrived sample of LASV (Josiah strain) spiked into donor plasma matrix at a titre of 10 cp ml−1 (mapped to accession nos. AY628202 and NC_004296) with no enrichment (left) or using the HFV SP panel (right). With no enrichment, there were 4 reads and 3.8% coverage; with HFV SP, there were 154 reads and 67.9% coverage. f, Genome coverage of a contrived sample of CCHFV (mapped to accession nos. AY389508, U39455 and U88410) spiked into donor plasma matrix at a titre of 2,500 cp ml−1 with no enrichment (left) or using the HFV SP panel (right). With no enrichment, there were 69 reads and 23.3% coverage; with HFV SP, there were 2,636 reads and 100% coverage. g, Genome coverage of a strain from a patient from Mexico with acute ZIKV infection during the 2013–2016 outbreak (ZIKV/Homo sapiens/MEX/2016/mex30; mapped to accession no. KX879603) at ~2,000 cp ml−1 with no enrichment (top) or enrichment using MSSPE with ZIKV SP (second from top), tiling multiplex PCR (third from top), capture probes (fourth from top, using random primers alone) or MSSPE with ZIKV SP followed by capture probes (bottom). With no enrichment, there were 33 reads and 26.5% coverage; with ZIKV SP, there were 260 reads and 87.5% coverage; with tiling multiplex PCR, there were 158,243 reads and 88.2% coverage (75.0% ≥10× coverage); with capture probes, there were 49,927 reads and 49.1% coverage (29.6% ≥10× coverage); and with ZIKV SP plus capture probes, there were 275,105 reads and 99.8% coverage (95.6% ≥10× coverage). The red bars below the coverage plots show nucleotide regions with coverage of ≥10×, at a threshold to minimize the inclusion of cross-contaminating reads[36]. For each graph in a–g, the number of reads is normalized to the total number of viral reads obtained with no enrichment. bp, base pairs; L, large segment; M, medium segment, S, small segment.
Improved viral genome coverage in clinical samples from infected patients using MSSPE
| Sequencer | Virus | Viral titrea (cp ml−1) | Primer typeb | No. of total reads analysed | No. of viral reads (RH primers)c | Genome coverage (RH primers) (%) | No. of viral reads (SP)c | Genome coverage (SP) (%) | Increase in coverage (%)d |
|---|---|---|---|---|---|---|---|---|---|
| Illumina MiSeq | HIV-1 (CRF01) | 100 | HIV SP | 1,892,148 | 11 | 12.3 | 136 | 62.7 | 50.4 |
| HIV-1 (CRF01) | 10,000 | HIV SP | 1,507,136 | 35 | 22.3 | 289 | 90.4 | 68.1 | |
| HIV-1 (CRF01) | 10,000 | HIV SP | 1,656,915 | 67 | 43.5 | 223 | 76.4 | 32.9 | |
| HIV-1 (URF-0201) | 12,589 | HIV SP | 1,622,623 | 55 | 45.1 | 151 | 75.1 | 30.0 | |
| HIV-1 (URF-0122) | 6,309 | HIV SP | 1,157,853 | 11 | 12.4 | 81 | 52.8 | 40.4 | |
| HCV (genotype 2) | 16,512 | HCV SP | 1,728,053 | 9 | 11.3 | 68 | 50.7 | 39.4 | |
| HCV (genotype 4) | 9,846 | HCV SP | 2,721,805 | 63 | 33.3 | 267 | 81.3 | 48.0 | |
| HCV (genotype 6) | 1,141 | HCV SP | 1,417,213 | 17 | 12.1 | 81 | 46.5 | 34.4 | |
| ZIKV (mex9) | 814 | ArboV SP | 846,638 | 0 | 0.0 | 65 | 40.5 | 40.5 | |
| EBOV (DRC13) | 6,440 | HFV SP | 455, 484 | 0 | 0.0 | 332 | 73.0 | 73.0 | |
| Oxford Nanopore Technology MinION | CHIKV (USA) | 500 | ArboV SP | 2,437,971 | 14 | 7.6 | 100 | 59.7 | 52.1 |
| DENV (USA) | 326 | ArboV SP | 2,634,399 | 132 | 16.1 | 2,238 | 89.8 | 73.7 | |
| DENV (USA) | 5,500 | ArboV SP | 1,180,679 | 683 | 42.5 | 3,074 | 67.6 | 25.1 | |
| YFV (Angola) | 68 | HFV SP | 1,823,776 | 24 | 8.7 | 384 | 81.6 | 72.9 | |
| YFV (Angola) | 43 | HFV SP | 1,963,780 | 31 | 29.4 | 446 | 85 | 55.6 | |
| YFV (Angola) | 79 | HFV SP | 1,168,865 | 19 | 10.2 | 437 | 70 | 59.8 | |
| Mean increase in coverage (%) | Mean (s.d.) ( | 50 (±16) |
aAbbott m2000 RT–PCR assays were used to estimate the titres of HIV and HCV; viral titres for other viruses were estimated using in-house qRT–PCR assays with standard curve analysis. bHIV SP/HCV SP, target virus-specific SP at 4 µM concentration and ArboV SP and HFV SP primer panels at 10 and 20 µM concentrations, respectively, were mixed with RH at a 10:1 ratio. cThe number of reads was normalized by equal number of preprocessed reads for comparison. dAbsolute percentage increase from using random primer only (coverage by SP (%) − coverage by RH (%)); coverage of 40–60% is sufficient for genotypic and phylogenetic inference from partial genome assemblies[27]. CRF, HIV circulating recombinant form; URF, HIV unique recombinant form; EBOV (DRC13), Ebola strain from the 2014 Boende outbreak in the DRC; USA, CHIKV or DENV strain from a traveller returning to the United States from an endemic region; YFV (Angola), YFV strain from the 2015–2016 Angola outbreak.