| Literature DB >> 27099321 |
Soo-Yon Rhee1, Kris Sankaran2, Vici Varghese3, Mark A Winters3,4, Christopher B Hurt5, Joseph J Eron5, Neil Parkin6, Susan P Holmes2, Mark Holodniy3,4, Robert W Shafer3.
Abstract
UNLABELLED: HIV-1 protease (PR), reverse transcriptase (RT), and integrase (IN) variability presents a challenge to laboratories performing genotypic resistance testing. This challenge will grow with increased sequencing of samples enriched for proviral DNA such as dried blood spots and increased use of next-generation sequencing (NGS) to detect low-abundance HIV-1 variants. We analyzed PR and RT sequences from >100,000 individuals and IN sequences from >10,000 individuals to characterize variation at each amino acid position, identify mutations indicating APOBEC-mediated G-to-A editing, and identify mutations resulting from selective drug pressure. Forty-seven percent of PR, 37% of RT, and 34% of IN positions had one or more amino acid variants with a prevalence of ≥1%. Seventy percent of PR, 60% of RT, and 60% of IN positions had one or more variants with a prevalence of ≥0.1%. Overall 201 PR, 636 RT, and 346 IN variants had a prevalence of ≥0.1%. The median intersubtype prevalence ratios were 2.9-, 2.1-, and 1.9-fold for these PR, RT, and IN variants, respectively. Only 5.0% of PR, 3.7% of RT, and 2.0% of IN variants had a median intersubtype prevalence ratio of ≥10-fold. Variants at lower prevalences were more likely to differ biochemically and to be part of an electrophoretic mixture compared to high-prevalence variants. There were 209 mutations indicative of APOBEC-mediated G-to-A editing and 326 mutations nonpolymorphic treatment selected. Identification of viruses with a high number of APOBEC-associated mutations will facilitate the quality control of dried blood spot sequencing. Identifying sequences with a high proportion of rare mutations will facilitate the quality control of NGS. IMPORTANCE: Most antiretroviral drugs target three HIV-1 proteins: PR, RT, and IN. These proteins are highly variable: many different amino acids can be present at the same position in viruses from different individuals. Some of the amino acid variants cause drug resistance and occur mainly in individuals receiving antiretroviral drugs. Some variants result from a human cellular defense mechanism called APOBEC-mediated hypermutation. Many variants result from naturally occurring mutation. Some variants may represent technical artifacts. We studied PR and RT sequences from >100,000 individuals and IN sequences from >10,000 individuals to quantify variation at each amino acid position in these three HIV-1 proteins. We performed analyses to determine which amino acid variants resulted from antiretroviral drug selection pressure, APOBEC-mediated editing, and naturally occurring variation. Our results provide information essential to clinical, research, and public health laboratories performing genotypic resistance testing by sequencing HIV-1 PR, RT, and IN.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27099321 PMCID: PMC4907232 DOI: 10.1128/JVI.00495-16
Source DB: PubMed Journal: J Virol ISSN: 0022-538X Impact factor: 5.103
FIG 1Distribution of the number of HIV-1 protease (PR) amino acid variants by position stratified by prevalence: ≥1% (A), 0.1% to 1% (B), 0.01% to 0.1% (C), and <0.01% (D). The total number of sequences analyzed at each position is shown on a log10 scale (E).
Amino acid variants according to frequency
| Frequency (%) | Protease | Reverse transcriptase | Integrase | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No. of amino acid variants | % of positions with variant | Median similarity score | % found in electrophoretic mixtures | No. of amino acid variants | % of positions with variant | Median similarity score | % found in electrophoretic mixtures | No. of amino acid variants | % of positions with variant | Median similarity score | % found in electrophoretic mixtures | |
| <0.01 | 655 | 100 | −2 | 60 | 2,487 | 99 | −2 | 60 | 504 | 85 | −1 | 54 |
| 0.01–0.1 | 260 | 89 | −1 | 45 | 1,091 | 91 | −1 | 49 | 460 | 81 | 0 | 45 |
| 0.1–1 | 119 | 56 | 0 | 26 | 379 | 47 | 0 | 30 | 214 | 47 | 0 | 26 |
| 1–10 | 65 | 38 | 0 | 17 | 202 | 31 | 1 | 18 | 107 | 28 | 1 | 14 |
| >10 | 17 | 17 | 2 | 9 | 55 | 12 | 1 | 9 | 25 | 8 | 1 | 7 |
Protease positions 1 to 99 were analyzed using 109,497 protease sequences, RT positions 1 to 400 were analyzed using 108,848 RT sequences, and integrase positions 1 to 288 were analyzed using 11,778 integrase sequences.
BLOSUM62 similarity score to the consensus amino acid.
FIG 2Distribution of the number of HIV-1 reverse transcriptase (RT) amino acid variants by position stratified by prevalence: ≥1% (A), 0.1% to 1% (B), 0.01% to 0.1% (C), and <0.01% (D). The total number of sequences analyzed at each position is shown on a log10 scale (E).
FIG 3Distribution of the number of HIV-1 integrase (IN) amino acid variants by position stratified by prevalence: ≥1% (A), 0.1% to 1% (B), 0.01% to 0.1% (C), and <0.01% (D). The total number of sequences analyzed at each position is shown on a log10 scale (E).
FIG 4Distribution of the number of HIV-1 protease (PR) amino acid variants present at prevalences of ≥1% (blue) and ≥0.1% (green) by subtype.
FIG 5Distribution of the number of HIV-1 reverse transcriptase (RT) amino acid variants present at prevalences of ≥1% (blue) and ≥0.1% (green) by subtype.
FIG 6Distribution of the number of HIV-1 integrase (IN) amino acid variants present at prevalences of ≥1% (blue) and ≥0.1% (green) by subtype.
PI nonpolymorphic treatment-selected mutations
| Position | Cons | TSM(s) | No. of individuals | |
|---|---|---|---|---|
| PI treated | PI naïve | |||
| 10 | L | 15,231 | 60,294 | |
| 11 | V | 15,244 | 60,351 | |
| 20 | K | 15,278 | 61,114 | |
| 22 | A | 15,292 | 61,145 | |
| 23 | L | 15,295 | 61,252 | |
| 24 | L | 15,282 | 61,263 | |
| 30 | D | 15,302 | 61,316 | |
| 32 | V | 15,302 | 61,323 | |
| 33 | L | 15,302 | 61,317 | |
| 34 | E | 15,302 | 61,315 | |
| 36 | M | A0.1 | 15,296 | 61,306 |
| 38 | L | 15,304 | 61,319 | |
| 43 | K | 15,420 | 61,587 | |
| 45 | K | Q0.3
| 15,421 | 61,587 |
| 46 | M | 15,412 | 61,594 | |
| 47 | I | 15,423 | 61,595 | |
| 48 | G | 15,423 | 61,597 | |
| 50 | I | 15,423 | 61,597 | |
| 51 | G | 15,422 | 61,592 | |
| 53 | F | 15,423 | 61,598 | |
| 54 | I | 15,422 | 61,594 | |
| 55 | K | 15,421 | 61,596 | |
| 66 | I | 15,423 | 61,593 | |
| 67 | C | 15,418 | 61,577 | |
| 71 | A | 15,415 | 61,592 | |
| 72 | I | L2.5 K0.7 | 15,417 | 61,574 |
| 73 | G | 15,423 | 61,592 | |
| 74 | T | 15,421 | 61,591 | |
| 76 | L | 15,419 | 61,585 | |
| 79 | P | 15,421 | 61,591 | |
| 82 | V | 15,414 | 61,582 | |
| 83 | N | 15,421 | 61,584 | |
| 84 | I | 15,421 | 61,584 | |
| 85 | I | 15,420 | 61,582 | |
| 88 | N | 15,418 | 61,543 | |
| 89 | L | 15,412 | 61,533 | |
| 90 | L | 15,416 | 61,537 | |
| 91 | T | 15,417 | 61,536 | |
| 92 | Q | 15,416 | 61,527 | |
| 95 | C | 15,404 | 61,251 | |
| 96 | T | S0.3 | 15,391 | 61,129 |
Cons, consensus.
Nonpolymorphic treatment-selected mutations (TSMs) in boldface were previously reported as being associated with drug resistance (18).
NRTI nonpolymorphic treatment-selected mutations
| Position | Cons | TSM(s) | No. of individuals | |
|---|---|---|---|---|
| RTI treated | RTI naive | |||
| 40 | E | 28,619 | 51,040 | |
| 41 | M | 28,761 | 51,192 | |
| 43 | K | 28,768 | 51,944 | |
| 44 | E | 28,769 | 51,957 | |
| 64 | K | 28,796 | 51,997 | |
| 65 | K | 28,803 | 52,000 | |
| 67 | D | 28,792 | 51,999 | |
| 68 | S | K0.1 | 28,804 | 52,003 |
| 69 | T | 28,789 | 52,005 | |
| 70 | K | 28,797 | 52,013 | |
| 73 | K | M0.1 | 28,804 | 52,017 |
| 74 | L | 28,799 | 52,021 | |
| 75 | V | 28,798 | 52,034 | |
| 77 | F | 28,805 | 52,035 | |
| 115 | Y | 28,806 | 52,037 | |
| 116 | F | 28,807 | 52,044 | |
| 117 | S | 28,802 | 52,037 | |
| 151 | Q | 28,792 | 52,026 | |
| 157 | P | A0.2 | 28,791 | 52,029 |
| 159 | I | L0.1 | 28,792 | 52,027 |
| 162 | S | D1.9 | 28,763 | 51,998 |
| 164 | M | L0.1 | 28,786 | 52,028 |
| 165 | T | 28,787 | 52,021 | |
| 167 | I | 28,788 | 52,020 | |
| 184 | M | 28,777 | 52,016 | |
| 203 | E | K5.4 V0.4
| 28,736 | 51,864 |
| 205 | L | F0.1 | 28,738 | 51,841 |
| 208 | H | 28,725 | 51,820 | |
| 210 | L | 28,688 | 51,798 | |
| 211 | R | 28,700 | 51,755 | |
| 212 | W | 28,705 | 51,789 | |
| 215 | T | 28,657 | 51,505 | |
| 218 | D | 28,653 | 51,454 | |
| 219 | K | 28,639 | 51,435 | |
| 304 | A | 11,563 | 19,788 | |
Cons, consensus.
Nonpolymorphic treatment-selected mutations (TSMs) in boldface were previously reported as being associated with drug resistance (18). Lowercase “i” indicates an insertion; lowercase “d” indicates a deletion.
NNRTI nonpolymorphic treatment-selected mutations
| Position | Cons | TSM(s) | No. of individuals | |
|---|---|---|---|---|
| RTI treated | RTI naive | |||
| 94 | I | 28,810 | 52,041 | |
| 98 | A | 28,802 | 52,042 | |
| 100 | L | 28,796 | 51,999 | |
| 101 | K | 28,794 | 52,039 | |
| 102 | K | 28,804 | 52,028 | |
| 103 | K | 28,805 | 52,032 | |
| 105 | S | 28,808 | 52,045 | |
| 106 | V | 28,805 | 52,045 | |
| 108 | V | I7.4 | 28,808 | 52,043 |
| 132 | I | L0.7 | 28,800 | 52,037 |
| 138 | E | 28,798 | 52,024 | |
| 139 | T | 28,798 | 52,037 | |
| 178 | I | 28,781 | 52,001 | |
| 179 | V | 28,774 | 52,010 | |
| 181 | Y | 28,780 | 52,016 | |
| 188 | Y | 28,758 | 52,014 | |
| 190 | G | 28,771 | 52,015 | |
| 221 | H | 28,565 | 50,963 | |
| 225 | P | 28,386 | 50,583 | |
| 227 | F | 28,165 | 50,128 | |
| 230 | M | 28,081 | 49,720 | |
| 232 | Y | 27,827 | 49,437 | |
| 234 | L | 27,760 | 49,216 | |
| 238 | K | 27,404 | 47,232 | |
| 240 | T | K0.1 | 23,831 | 46,204 |
| 241 | V | 23,586 | 44,549 | |
| 242 | Q | H0.9 L0.2
| 23,529 | 43,984 |
| 318 | Y | 10,809 | 15,668 | |
| 348 | N | 6,367 | 5,528 | |
| 404 | E | N1.3 | 1,207 | 3,663 |
Cons, consensus.
Nonpolymorphic treatment-selected mutations (TSMs) in boldface were previously reported as being associated with drug resistance (18).
Undifferentiated RTI nonpolymorphic treatment-selected mutations
| Position | Cons | TSM(s) | No. of individuals | |
|---|---|---|---|---|
| RTI treated | RTI naive | |||
| 3 | S | 19,241 | 42,633 | |
| 16 | M | V0.4 | 19,884 | 43,640 |
| 31 | I | 21,490 | 45,863 | |
| 33 | A | V0.2 | 21,573 | 46,050 |
| 34 | L | I0.7 | 21,582 | 46,129 |
| 54 | N | I0.1 | 28,794 | 51,991 |
| 58 | T | 28,795 | 51,994 | |
| 109 | L | 28,808 | 52,043 | |
| 202 | I | T0.1 | 28,742 | 51,873 |
| 223 | K | Q2.1 E1.7
| 28,537 | 50,880 |
| 228 | L | R5.4
| 28,148 | 50,071 |
| 302 | E | D0.3 | 12,507 | 20,464 |
| 312 | E | G0.4 | 10,935 | 17,751 |
| 341 | I | F1.4 | 6,671 | 5,802 |
| 394 | Q | S0.8 | 6,108 | 4,874 |
| 399 | E | G1.2 | 5,882 | 4,830 |
| 547 | Q | R3.6 | 473 | 2,559 |
Cons, consensus.
Nonpolymorphic treatment-selected mutations (TSMs) in boldface were previously reported as being associated with drug resistance (18).
INSTI nonpolymorphic treatment-selected mutations
| Position | Cons | TSM(s) | No. of individuals | |
|---|---|---|---|---|
| INSTI treated | INSTI naive | |||
| 51 | H | 1,019 | 6,609 | |
| 66 | T | 1,019 | 6,619 | |
| 79 | V | I2.5 | 1,020 | 6,625 |
| 92 | E | 1,020 | 6,628 | |
| 95 | Q | 1,020 | 6,627 | |
| 121 | F | 1,020 | 6,631 | |
| 138 | E | 1,020 | 6,631 | |
| 140 | G | 1,020 | 6,631 | |
| 142 | P | T0.6 | 1,020 | 6,631 |
| 143 | Y | 1,020 | 6,631 | |
| 147 | S | 1,020 | 6,631 | |
| 148 | Q | 1,020 | 6,629 | |
| 155 | N | 1,020 | 6,629 | |
| 230 | S | 1,018 | 6,608 | |
| 253 | D | Y1.0 | 1,018 | 6,588 |
Cons, consensus.
Nonpolymorphic treatment-selected mutations (TSMs) in boldface were previously reported as being associated with drug resistance (9).