| Literature DB >> 30271393 |
Julie Yamaguchi1, Ana Olivo1, Oliver Laeyendecker2, Kenn Forberg1, Nicaise Ndembi3, Dora Mbanya4,5, Lazare Kaptue6, Thomas C Quinn2, Gavin A Cloherty1, Mary A Rodgers1, Michael G Berg1.
Abstract
Background: Global surveillance of viral sequence diversity is needed to keep pace with the constant evolution of HIV. Recent next generation sequencing (NGS) methods have realized the goal of sequencing circulating virus directly from patient specimens. Yet, a simple, universal approach that maximizes sensitivity and sequencing capacity remains elusive. Here we present a novel HIV enrichment strategy to yield near complete genomes from low viral load specimens. Methodology: A non-redundant biotin-labeled probe set (HIV-xGen; n = 652) was synthesized to tile all HIV-1 (groups M, N, O, and P) and HIV-2 (A and B) strains. Illumina Nextera barcoded libraries of either gene-specific or randomly primed cDNA derived from infected plasma were hybridized to probes in a single pool and unbound sequences were washed away. Captured viral cDNA was amplified by Illumina adaptor primers, sequenced on a MiSeq, and NGS reads were demultiplexed for alignment with CLC Bio software.Entities:
Keywords: HIV; HIV diversity; next-generation sequencing; target enrichment; virus surveillance; xGen
Year: 2018 PMID: 30271393 PMCID: PMC6146096 DOI: 10.3389/fmicb.2018.02150
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
NGS data for samples sequenced ± enrichment with HIV xGen.
| O-LA34 | Cameroon | HIV-1 O | 8.06 | 8019 | 1,336,488 | 50,865 | 3.80 | 100 | 986 | 589 | |
| – | – | – | – | 20667 | SSIII-xGen | 1,477,954 | 1,436,992 | 97.23 | 100 | 14,576 | 11,026 |
| N-LA28 | Cameroon | HIV-1 N | 7.61 | nt | 1,564,286 | 155,741 | 10.00 | 100 | 3,422 | 1,604 | |
| – | – | – | – | 7333 | SSIII-xGen | 2,033,250 | 2,020,867 | 99.39 | 100 | 22,538 | 17,020 |
| 2A-LA38 | Senegal | HIV-2A | 6.95 | 3,029,490 | 40,402 | 1.33 | 100 | 573.43 | 445.55 | ||
| – | – | – | – | 28355 | SSIII | 37,384 | 152 | 0.41 | 72 | 2 | 2 |
| – | – | – | – | – | SSIII-xGen | 342,284 | 331,623 | 96.89 | 100 | 3,400 | 3,344 |
| C-8119636 | South Africa | HIV-1 C | 5.62 | 8870 | 5,446,060 | 367,224 | 6.74 | 99 | 6,481 | 4,483 | |
| – | – | – | – | 85260 | SSIII | 586,936 | 15800 | 2.69 | 98 | 156 | 196 |
| – | – | – | – | – | SSIII-xGen | 400,458 | 393,350 | 98.23 | 100 | 4,042 | 5,648 |
| 02-421-10 | Cameroon | HIV-1 CRF02 | 4.48 | 643 | 1,418,404 | 42,153 | 2.97 | 61 | 1,201 | 1,127 | |
| – | – | – | – | 6177 | SSIII | 412,134 | 80 | 0.02 | 37 | 1 | 1 |
| – | – | – | – | – | SSIII-xGen | 9,776 | 7,131 | 72.94 | 82 | 78 | 231 |
| PBS1342 | DRC | HIV-1 URF | 4.59 | 3830 | HIV-SMART | 662,360 | 3,511 | 0.53 | 86 | 19 | 94 |
| – | – | – | – | – | HIV-SMART-xGen | 691,384 | 669,074 | 96.77 | 95 | 8,673 | 14,978 |
| – | – | – | – | – | SSIII | 3,432,778 | 1,095 | 0.03 | 99 | 11 | 8 |
| – | – | – | – | – | SSIII-xGen | 786,548 | 747,840 | 95.07 | 97 | 8,746 | 13,219 |
| PBS1191 | DRC | HIV-1 G | 3.86 | 1426 | HIV-SMART | 411,612 | 2,067 | 0.50 | 51 | 7 | 94 |
| – | – | – | – | – | HIV-SMART-xGen | 2,156,988 | 1,504,919 | 69.77 | 100 | 17,823 | 21,968 |
| – | – | – | – | – | SSIII | 1,276,656 | 306 | 0.02 | 45 | 2 | 4 |
| – | – | – | – | – | SSIII-xGen | 3,968,286 | 3,313,601 | 83.50 | 96 | 36,912 | 126,521 |
| PBS888 | DRC | HIV-1 A | 3.47 | 581 | HIV-SMART | 1,037,832 | 208 | 0.02 | 56 | 2 | 4 |
| – | – | – | – | – | HIV-SMART-xGen | 2,172,982 | 1,880,055 | 86.52 | 100 | 25,171 | 53,285 |
| – | – | – | – | – | SSIII | 4,532,554 | 257 | 0.01 | 54 | 2 | 2 |
| – | – | – | – | – | SSIII-xGen | 1,699,092 | 925,860 | 54.49 | 96 | 10,689 | 23,552 |
| 70641 | DRC | HIV-1 A1 | 5.78 | HIV-SMART | 1,361,655 | 51,646 | 3.79 | 99 | 947 | 1,385 | |
| – | – | – | – | HIV-SMART-xGen | 9,759,614 | 9,158,341 | 93.84 | 100 | 193,337 | 648,377 | |
| PBS5635 | DRC | HIV-1 D | 5.31 | HIV-SMART | 1,255,489 | 12,653 | 1.01 | 95 | 259 | 320 | |
| – | – | – | – | HIV-SMART-xGen | 574,273 | 522,043 | 90.91 | 99 | 12,321 | 30,529 | |
| P406 | DRC | HIV-1 G | 5.21 | HIV-SMART | 2,206,004 | 4,676 | 0.20 | 98 | 76 | 98 | |
| – | – | – | – | HIV-SMART-xGen | 1,238,978 | 629,912 | 50.84 | 100 | 14,903 | 40,689 | |
| PBS1189 | DRC | HIV-1 F1 | 4.98 | HIV-SMART | 1,978,086 | 2,260 | 0.11 | 82 | 41 | 76 | |
| – | – | – | – | HIV-SMART-xGen | 522,902 | 463,644 | 88.67 | 86 | 9,236 | 41,558 | |
| 50 | DRC | HIV-1 A1 basal | 4.96 | HIV-SMART | 1,662,501 | 1,731 | 0.10 | 83 | 29 | 38 | |
| – | – | – | – | HIV-SMART-xGen | 237,883 | 218,072 | 91.67 | 83 | 4,899 | 15,280 | |
| 87-2580 | DRC | HIV-1 G | 4.85 | HIV-SMART | 2,200,708 | 5,741 | 0.26 | 96 | 112 | 167 | |
| – | – | – | – | HIV-SMART-xGen | 529,218 | 221,335 | 41.82 | 96 | 5,088 | 10,026 | |
| PBS6126 | DRC | HIV-1 A1 | 4.67 | HIV-SMART | 149,709 | 115 | 0.08 | 79 | 2.6 | 2.4 | |
| – | – | – | – | HIV-SMART-xGen | 28,606 | 11,644 | 40.70 | 100 | 4,034 | 276 | |
| 2049 | DRC | HIV-1 CRF37 basal | 4.31 | HIV-SMART | 2,720,263 | 1,677 | 0.06 | 73 | 33 | 46 | |
| – | – | – | – | HIV-SMART-xGen | 10,689,440 | 9,321,568 | 87.20 | 90 | 201,793 | 340,069 | |
| PBS0724 | DRC | HIV-1A1 | 4.02 | HIV-SMART | 107,044 | 1 | 0.00 | 2 | 0.02 | 0.16 | |
| – | – | – | – | HIV-SMART-xGen | 2,010,039 | 1,061,113 | 52.79 | 100 | 21,120 | 38,865 | |
| PBS1195 | DRC | HIV-1 A2 | 3.77 | HIV-SMART | 605,867 | 51 | 0.01 | 54 | 1.2 | 1.4 | |
| – | – | – | – | HIV-SMART-xGen | 85,628 | 45,233 | 52.83 | 100 | 1,039 | 1,847 | |
| 2106 | DRC | HIV-1 A1 | 3.76 | HIV-SMART | 1,664,855 | 598 | 0.04 | 80 | 13 | 14 | |
| – | – | – | – | HIV-SMART-xGen | 1,355,794 | 1,054,751 | 77.80 | 100 | 24,313 | 57,096 | |
| P4039 | DRC | HIV-1 A1 | 3.58 | HIV-SMART | 419,163 | 132 | 0.03 | 71 | 2.8 | 3 | |
| – | – | – | – | HIV-SMART-xGen | 398,324 | 228,302 | 57.32 | 100 | 5,084 | 6,066 | |
| P3844 | DRC | HIV-1 K | 4.67 | HIV-SMART | 37,076 | 7 | 0.02 | 8 | 1.27 | 0.59 | |
| – | – | – | – | HIV-SMART-xGen | 1,883,032 | 992,447 | 52.7 | 97 | 23,300 | 57,444 |
Previously published data reported in Berg et al. (2016).
Genome completed (100%) with Sanger.
Genome coverage increased with Sanger but still incomplete.
NGS data for samples sequenced only by HIV xGen.
| 459-16 | Cameroon | HIV-1 CRF02 | 5.31 | 41,345 | SSIII-xGen | 8,303,330 | 8,043,055 | 96.87 | 100 | 108,492.77 | 214,888.09 |
| A1699 | Cameroon | HIV-1 F2 | 5.04 | 21,587 | HIV SMART-xGen | 5,380,619 | 5,336,115 | 99.17 | 100 | 70,586.38 | 117,687.56 |
| A1786 | Cameroon | HIV-1 G | 4.73 | 10,573 | HIV SMART-xGen | 108,242 | 93,856 | 86.71 | 83 | 1,247.45 | 2,496.38 |
| 8128965 | South Africa | HIV-1 C | 4.50 | 6,226 | HIV SMART-xGen | 243,571 | 232,511 | 95.46 | 95 | 3,017.97 | 8,065.69 |
| 144-26 | Cameroon | HIV-1 G | 4.03 | 2,110 | HIV SMART-xGen | 247,845 | 219,844 | 88.70 | 93 | 2,941.81 | 4,593.91 |
| A1185 | Cameroon | HIV-1 CRF01 | 3.87 | 1,459 | HIV SMART-xGen | 96,329 | 69,459 | 72.11 | 79 | 940.92 | 2,784.90 |
| 5056135 | Senegal | HIV-1 C | 3.67 | 921 | HIV SMART-xGen | 517,740 | 454,502 | 87.79 | 100 | 6,237.01 | 9,850.11 |
| 577-27 | Cameroon | HIV-1 A | 3.67 | 921 | HIV SMART-xGen | 111,404 | 94,003 | 84.38 | 79 | 1,278.21 | 3,069.71 |
| 42-877 | Uganda | HIV-1 D | 3.66 | 900 | HIV SMART-xGen | 313,662 | 279,983 | 89.26 | 100 | 3,829.33 | 5,495.33 |
| 112-11 | Cameroon | HIV-1 F2 | 3.63 | 840 | HIV SMART-xGen | 204,740 | 158,815 | 77.57 | 91 | 2,236.33 | 3,634.36 |
| P4142 | DRC | HIV-1 A1 | 3.29 | 383 | HIV SMART-xGen | 814501 | 24 | 0.29 | 34 | 0.34 | 0.70 |
| 2528 | DRC | HIV-1 A1 | 3.26 | 358 | HIV SMART-xGen | 553,492 | 0 | 0 | 0 | 0 | 0 |
| PBS70-233 | DRC | HIV-1 A1 | 3.24 | 342 | HIV SMART-xGen | 1,322 | 0 | 0 | 0 | 0 | 0 |
| PBS369-87 | DRC | HIV-1 A1 | 3.18 | 297 | HIV SMART-xGen | 947,145 | 30 | 0.31 | 14 | 0.42 | 1.13 |
| 475-17 | Cameroon | HIV-1 A | 3.00 | 262 | HIV SMART-xGen | 238,092 | 1,753 | 0.74 | 20 | 21.24 | 73.91 |
| 669-17 | Cameroon | HIV-1 CRF22 | 2.93 | 172 | SSIII-xGen | 119,116 | 38,654 | 32.45 | 51 | 509.50 | 1584.65 |
| 129-26 | Cameroon | HIV-1 D | 2.82 | 173 | HIV SMART-xGen | 441,846 | 40,680 | 9.21 | 53 | 579.49 | 2,232.28 |
| 844-55 | Cameroon | HIV-1 B/02 | 2.77 | 155 | HIV SMART-xGen | 224,916 | 2,542 | 1.13 | 18 | 28.16 | 121.64 |
| 5020-19 | Cameroon | HIV-1 C | 2.76 | 151 | HIV SMART-xGen | 602,884 | 1,444 | 0.24 | 12 | 6.80 | 60.30 |
| 4188-11 | Cameroon | HIV-1 CRF11 | 2.75 | 148 | HIV SMART-xGen | 702,804 | 516 | 0.07 | 12 | 5.58 | 22.09 |
| 10047105267 | Thailand | HIV-1 CRF01 | 2.72 | 138 | HIV SMART-xGen | 893,628 | 149,379 | 16.72 | 42 | 2,126.61 | 10,928.35 |
| 193-6 | Cameroon | HIV-1 A | 2.65 | 117 | HIV SMART-xGen | 370,450 | 496 | 0.13 | 14 | 3.37 | 14.69 |
| 961-09 | Cameroon | HIV-1 CRF37 | 2.43 | 71 | HIV SMART-xGen | 229,794 | 197 | 0.09 | 10 | 1.45 | 9.31 |
| 363-24 | Cameroon | HIV-1 CRF13 | nd | n/a | HIV SMART-xGen | 133,832 | 117,148 | 87.53 | 85 | 1,478.12 | 4,140.94 |
| 1173-31 | Cameroon | HIV-1 CRF37 | nd | n/a | HIV SMART-xGen | 51,448 | 38,288 | 74.42 | 77 | 524.27 | 1,835.32 |
| 9505343 | Senegal | HIV-1 CRF06 | nd | n/a | HIV SMART-xGen | 99,834 | 37,446 | 37.51 | 91 | 484.82 | 945.49 |
| 814-43 | Cameroon | HIV-1 URF | nd | n/a | HIV SMART-xGen | 485,500 | 438,989 | 90.42 | 93 | 5,872.10 | 11,973.02 |
| 770-8 | Cameroon | HIV-1 URF | nd | n/a | HIV SMART-xGen | 5,649,660 | 5,605,300 | 99.21 | 100 | 74,539.08 | 135,007.96 |
| AB260 | Cameroon | HIV-1 group O | nd | n/a | HIV SMART-xGen | 8,587,083 | 7,390,731 | 86.07 | 99 | 80,634.90 | 117,442.21 |
| 296 | Cameroon | HIV-1 group O | nd | n/a | SSIII-xGen | 17,777,038 | 17,243,456 | 97.00 | 100 | 220,976.08 | 354,818.32 |
| 108-08 | Cameroon | HIV-1 group O | nd | n/a | SSIII-xGen | 115,400 | 8,312 | 7.20 | 22 | 119.92 | 422.06 |
| 1095-04 | Cameroon | HIV-1 group O | nd | n/a | SSIII-xGen | 255,480 | 77,095 | 30.18 | 52 | 972.67 | 4,214.80 |
| 1225-51 | Cameroon | HIV-1 group O | nd | n/a | SSIII-xGen | 1,067,940 | 805,255 | 75.40 | 100 | 9,803.67 | 15,258.08 |
| 126-12 | Cameroon | HIV-1 group O | nd | n/a | SSIII-xGen | 284,826 | 46,947 | 16.48 | 56 | 463.94 | 1,313.88 |
| 20-02 | Cameroon | HIV-1 group O | nd | n/a | SSIII-xGen | 308,694 | 174,250 | 56.45 | 94 | 2,183.30 | 3,452.81 |
| 136-16 | Cameroon | HIV-2 group B | nd | n/a | SSIII-xGen | 6,068 | 2,922 | 48.15 | 75 | 36.03 | 47.93 |
Genome completed (100%) with Sanger.
Genome coverage increased with Sanger but still incomplete.
Figure 1HIV-xGen strategy. 651 probes were selected to tile all HIV-1 and HIV-2 strains present in the phylogenetic tree at 1X coverage. Reverse transcription and second strand synthesis were performed by random priming with Superscript/Sequenase or by the HIV-SMART method. Nextera XT was used to convert cDNA to barcoded Illumina libraries consisting of both HIV (red inserts) and background (black) reads. Pooled libraries were hybridized to xGen probes (green) with 5′-biotin tags (gold) for a single capture and selected by magnetic bead separation. Multiplexed libraries were amplified by universal KAPA primers, sequenced on a MiSeq, and reads were parsed by barcode.
Figure 2Diverse sequences are captured and enriched in HIV-xGen libraries. (A) Coverage plots of SSRTIII libraries enriched by HIV-xGen that were sequenced previously: HIV-1 group M subtype C (8119636), group O (LA34), group N (LA28), and HIV-2A (LA38). (B) Coverage plots of new HIV-1 strains libraries generated by HIV-SMART and followed by enrichment with HIV-xGen: HIV-1 group M subtype F2 (A1699), CRF06 (9505343), URF (770-8), and group O (AB260).
Figure 3A HIV-xGen dramatically increases sensitivity for low titer specimens. (A) Coverage plots are shown for HIV-SMART libraries (top panels) without (orange) and with (blue) HIV-xGen as well as Superscript libraries without (orange) and with (blue) HIV-xGen. PBS1342 is a URF, PBS1191 is subtype G and PBS888 is subtype A. (B) Coverage plots for strains with titers ranging from log 2.7–3.6 copies/ml including a subtype F2 (112-11), a subtype D (129-26), a CRF01 (10047105267) and an HIV-2 strain (136-16) of unknown viral load. (C) Plot of viral load vs. genome coverage for all new strains sequenced with a known titer. Red dashed line = log 3.4 copies/ml; orange dots = no enrichment; blue dots = xGen enrichment.
Figure 4HIV-xGen enables full genome characterization of rare and diverse strains. (A) Neighbor joining phylogenetic tree of 28 new full genomes (red) generated from a 6,252 nt gap-stripped alignment. Specimens sequenced in this study are shown in red. (B) Boot-scanning analysis of unique recombinant forms 770-80 and 814-43.