| Literature DB >> 33158200 |
Crystal M Gigante1, Gowri Yale2, Rene Edgar Condori1, Niceta Cunha Costa3, Nguyen Van Long4, Phan Quang Minh4, Vo Dinh Chuong4, Nguyen Dang Tho5, Nguyen Tat Thanh6, Nguyen Xuan Thin6, Nguyen Thi Hong Hanh6, Gati Wambura7, Frederick Ade7, Oscar Mito7, Veronicah Chuchu7,8, Mathew Muturi9, Athman Mwatondo9, Katie Hampson10, Samuel M Thumbi7,11,12, Byron G Thomae13, Victor Hugo de Paz14, Sergio Meneses14, Peninah Munyua15, David Moran16, Loren Cadena17, Andrew Gibson18, Ryan M Wallace1, Emily G Pieracci1, Yu Li1.
Abstract
As countries with endemic canine rabies progress towards elimination by 2030, it will become necessary to employ techniques to help plan, monitor, and confirm canine rabies elimination. Sequencing can provide critical information to inform control and vaccination strategies by identifying genetically distinct virus variants that may have different host reservoir species or geographic distributions. However, many rabies testing laboratories lack the resources or expertise for sequencing, especially in remote or rural areas where human rabies deaths are highest. We developed a low-cost, high throughput rabies virus sequencing method using the Oxford Nanopore MinION portable sequencer. A total of 259 sequences were generated from diverse rabies virus isolates in public health laboratories lacking rabies virus sequencing capacity in Guatemala, India, Kenya, and Vietnam. Phylogenetic analysis provided valuable insight into rabies virus diversity and distribution in these countries and identified a new rabies virus lineage in Kenya, the first published canine rabies virus sequence from Guatemala, evidence of rabies spread across an international border in Vietnam, and importation of a rabid dog into a state working to become rabies-free in India. Taken together, our evaluation highlights the MinION's potential for low-cost, high volume sequencing of pathogens in locations with limited resources.Entities:
Keywords: MinION; canine rabies elimination; lyssavirus; nanopore; portable sequencing; rabies
Mesh:
Year: 2020 PMID: 33158200 PMCID: PMC7694271 DOI: 10.3390/v12111255
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Primers used in this study. Sequences are in 5′ to 3′ orientation. Red colored bases correspond to the rabies virus genome binding sequence. Black nucleotides at the 5′ end were used to add barcode sequences for multiplexing. Red sequences correspond to positions 54 to 76 (NgeneFor), 1475 to 1496 (NgeneRev), 3273 to 3300 (GgeneFor), and 5264 to 5288 (GgeneRev) in rabies virus strain PV-2061 (accession number JX276550).
| Primer | Sequence |
|---|---|
| NgeneFor | TTTCTGTTGGTGCTGATATTGC |
| NgeneRev | ACTTGCCTGTCGCTCTATCTTC |
| GgeneFor | TTTCTGTTGGTGCTGATATTGC |
| GgeneRev | ACTTGCCTGTCGCTCTATCTTC |
Portable equipment used in this study.
| MinION (Oxford Nanopore Technologies, Oxford, UK) |
| Mic qPCR cycler (Bio Molecular Systems, El Cajon, CA, USA) |
| MiniPCR (Amplyus, Cambridge, MA, USA) |
| E-Gel Electrophoresis System (ThermoFisher, Waltham, MA, USA) |
| Mini Centrifuge (Southern Labware, Cumming, GA, USA) |
Price of consumable reagents for MinION sequencing of rabies virus amplicons. Prices are based on reagents purchased in the United States as advertised on the Oxford Nanopore website at the time of sequencing (2018). Bulk price of USD 500 per flow cell was used. Shipping prices are not included. Prices may vary substantially depending on location and product availability.
| Price/Sample | |||
|---|---|---|---|
| 50 Samples/Run | 20 Samples/Run | 1 Sample/Run | |
| PCR Barcoding | USD 2.11 | USD 2.11 | USD 0.00 |
| Library Preparation | USD 2.79 | USD 6.57 | USD 126.13 |
| Sequencing |
|
|
|
| Total | USD 14.90 | USD 33.68 | USD 626.13 |
Accuracy of consensus sequences generated from MinION sequencing data by three methods. Sequence accuracy of the raw consensus (Raw), consensus after polishing with raw reads using Nanopolish (Polished), and Nanopolished consensus after manual correction of indels in homopolymer regions (Manual) is shown relative to Sanger sequencing consensus. Sequence accuracy and number of sequences examined are shown for six different read depth levels. Accuracy above 99.99% is highlighted in red text.
| Read Depth | >0 | <50 | >50 | >100 | >1000 | >10,000 |
|---|---|---|---|---|---|---|
| Sequences | 70 | 11 | 58 | 48 | 13 | 5 |
| Raw | 99.842 | 99.213 | 99.958 | 99.968 | 99.965 | 99.958 |
| Polished | 99.933 | 99.778 | 99.961 | 99.965 | 99.987 |
|
| Manual | 99.988 | 99.940 |
|
|
|
|
Figure 1Errors observed in consensus sequences generated from MinION sequencing data by three methods. Number of differences in the nucleoprotein gene (1353 nt) for raw consensus (Raw Consensus), consensus after polishing with raw reads using Nanopolish (Polished Consensus), and after manual correction of indels in homopolymer regions (Manual Correction) compared to consensus generated by Sanger sequencing is shown. Insertions, deletions, and single nucleotide changes (miscalled bases or SNPs) are shown in different shades of blue. Number of each type of error is depicted on the bars. Data shown include data from all sequences (left) and those sequences produced with >50x read depth (right).
Success rate of MinION sequencing in Guatemala, India, Kenya, and Vietnam. The total number of positive samples was based on the result of diagnostic testing in the local laboratory; for Guatemala, Kenya, and Vietnam, all positive samples were confirmed by the LN34 real-time PCR assay. Only 19 samples from India were confirmed positive by LN34 PCR (see Table S1 for additional details). The number and percent of positive samples where N or G gene sequences were produced is shown for each country.
| Country | Samples | Positive | N full | G full | N Partial | G Partial | Percent Success |
|---|---|---|---|---|---|---|---|
| Guatemala | 25 | 13 | 6 | 4 | 0 | 0 | 46.15 |
| India | 104 | 103 | 80 | 97 | 0 | 0 | 93.27 |
| Kenya | 28 | 19 | 10 | 11 | 4 | 5 | 84.21 |
| Vietnam | 21 | 21 | 21 | 21 | 0 | 0 | 100.00 |
Figure 2Phylogenetic analysis of MinION sequences generated in Goa, India. Complete nucleoprotein gene sequences from Goa rabies virus isolates were compared to publicly available sequences. Reference sequences were chosen based on sequence similarity to the newly generated sequences or inclusion in rabies Arctic-like 1, Arctic, Arctic-like 2, Arctic-like 3, Indian Subcontinent, and Cosmopolitan phylogenetic clades (from [31]). Newly generated sequences are shown in bold; several sequences are collapsed for viewing ease. Sequences from Goa isolates are shown in magenta. Translocation case from Rajasthan is highlighted in green. Accession number, host, location, and collection date are included for reference, when available; additional sample details can be found in Table S1. Phylogenetic analysis was performed by maximum likelihood based on a GTR+G+I model. Differences between samples are shown by the number of changes per site along the horizontal axis. Bootstrap values near the branch points represent the percentage of trees that had the same clustering out of 1000 replicates.
Figure 3Phylogenetic analysis of MinION sequences generated in Nairobi, Kenya. Full-length glycoprotein gene sequences (1575 nt) from Kenya rabies virus isolates were compared to publicly available sequences. Reference sequences were chosen based on sequence similarity to the new Kenya sequences or inclusion in Africa 1a, Africa 1b, Africa 1c, Africa-2, Africa-3, or Africa 4 rabies phylogenetic clades based on Troupin et al. [31]. Accession number, location, and collection date are included, when available. Newly generated sequences are highlighted in color, corresponding to rough location on the map to the left. Additional sample information can be found in Table S1. Phylogenetic analysis was performed by maximum likelihood based on the GTR+G+I model. Number of changes per site is shown along the horizontal axis. Bootstrap values near the branch points represent the percentage of trees that had the same clustering out of 1000 replicates.
Figure 4Phylogenetic analysis of MinION sequences generated in Guatemala City, Guatemala. Complete nucleoprotein gene sequences from Guatemala rabies virus isolates were compared to publicly available sequences. Reference sequences were chosen based on sequence similarity to the new Guatemala sequences, isolation from vampire bat or bovine in Mexico or Guatemala, or inclusion Cosmopolitan Americas-2a (AM2a) rabies phylogenetic clade based on Troupin et al. [31]. Accession number, host animal, location, and collection year are included for reference, when available. Sequences generated in this study are highlighted in bold; additional information can be found in Table S1. Colored points on the map correspond to the location of LN34 positive samples; confirmed canine or vampire bat lineage isolates are colored dark blue or dark pink on the map and phylogenetic tree. Suspected lineage was based on host (canine or bovine). Phylogenetic analysis was performed by maximum likelihood based on the GTR+G+I model in Mega7. Number of changes per site is shown along the horizontal axis. Bootstrap values near the branch points represent the percentage of trees that had the same clustering out of 1000 replicates.
Figure 5Phylogenetic analysis of MinION sequences generated in Hanoi, Vietnam. Complete nucleoprotein gene sequences from Vietnam rabies virus isolates were compared to publicly available sequences. Reference sequences were chosen based on sequence similarity to the new Vietnam sequences or inclusion in South East Asia 1–5 (SEA1–SEA5) or Cosmopolitan rabies phylogenetic clades based on Troupin et al. [31]. Sequences generated in this study are highlighted in bold; some sequences have been collapsed for ease of viewing. Color coding of sequence names correspond to region location in Vietnam as colored on the map. Accession number, host animal, location, and collection date are included for reference, when available; additional information can be found in Table S1. Differences between samples are shown by the number of changes per site along the horizontal axis. Bootstrap values near the branch points represent the percentage of trees that had the same clustering out of 1000 replicates.