| Literature DB >> 32990139 |
Leonard Kingwara1,2, Muthoni Karanja2, Catherine Ngugi2, Geoffrey Kangogo1,2, Kipkerich Bera1, Maureen Kimani2, Nancy Bowen1, Dorcus Abuya1,2, Violet Oramisi2, Irene Mukui2.
Abstract
INTRODUCTION: With the rapid scale-up of antiretroviral therapy (ART) to treat HIV infection, there are ongoing concerns regarding probable emergence and transmission of HIV drug resistance (HIVDR) mutations. This scale-up has to lead to an increased need for routine HIVDR testing to inform the clinical decision on a regimen switch. Although the majority of wet laboratory processes are standardized, slow, labor-intensive data transfer and subjective manual sequence interpretation steps are still required to finalize and release patient results. We thus set out to validate the applicability of a software package to generate HIVDR patient results from raw sequence data independently.Entities:
Keywords: drug resistance testing; efficiency and results; validation
Mesh:
Substances:
Year: 2020 PMID: 32990139 PMCID: PMC7536479 DOI: 10.1177/2325958220962687
Source DB: PubMed Journal: J Int Assoc Provid AIDS Care ISSN: 2325-9574
Configuration Variables for Nucleotide Mixture Calling and Base “Marking” for Clinical Drug Resistance Genotyping.[22,23]
| Parameter | Value | Interpretation |
|---|---|---|
| Quality censoring cutoff | <10 | Phred quality scores cut off for excluding bases during assembly. |
| Mixture area (%) | ≥20 | The area of the uncalled peak must be at least 20% of the called peak area. If 50% of the reads pass this threshold, then a mixture is called. |
| Mark area (%) | ≥15 | The area of the uncalled peak must have at least 17.5% of the called peak area. If ≥50% of the reads pass this threshold, then a mark is made. |
| Mark average quality cut-off phred score Additional marks | <20 | If the average quality of the base across all reads is below the cutoff, then a mark is made. Insertions, deletions, and single primer coverage are also marked. |
Criteria Used by RECall for Rejecting a Sequence.[20,24]
| Failure category | Description |
|---|---|
| Stop codon | Any unambiguous stop codon (TGA, TAA, or TAG) |
| Bad inserts | An insertion relative to the reference sequence that is not a multiple of 3 bases, resulting in a frameshift |
| Bad deletion | A deletion relative to the reference sequence that is not a multiple of 3 bases, resulting in a frameshift |
| Too many mixtures | >3.5% of nucleotides sequences called as mixtures |
| N count | ≥5 Ns (any base) in the sequence |
| Mark count | ≥100 positions marked as being potentially problematic |
| Single coverage | >3 consecutive bases of single-read coverage with phred scores of 40 |
| Low quality | Any section where the quality of all coverage is too low to make a call |
Distribution of the HIV-1 Subtype in the Samples Used for Validation.
| Number of Remnant Clinical Samples Used | |
|---|---|
| Subtype A | 54(43%) |
| Subtype D | 31(25%) |
| Subtype C | 14(11%) |
| Subtype G | 8(6%) |
| URF (AD, AC, AG) | 11(9%) |
| AD. | 7(6%) |
| AC | 2(2%) |
| AG | 2(2%) |
| CRF01_AE | 8(6%) |
| VL < 1000 cp/ml | 34(27%) |
| VL > 1000 cp/ml | 92(73%) |
| Therapy naïve | 9(7%) |
| Therapy experience | 103(82%) |
| Therapy unknown | 14(11%) |
Performance in Generating Consensus Pol Sequences for HIV-1 Samples by the Different Editing Approaches.
| Editing method | Results | No results | Total |
|---|---|---|---|
| Exatype | 126 (93%) | 9 (7%) | 135 |
| Standard analysis Procedure | 132 (98%) | 3 (2%) | 135 |
Figure 1.Concordant and discordant nucleotide base call in sequences analyzed by Gold Standard and by Exatype. Matrices depict the frequencies of nucleotides called by Exatype (vertical axis) and by Standard (horizontal axis). Green highlight implies a concordant base call. Partially discordant base calls (implies mixtures called by one method but not the other) is in yellow highlight. Entirely discordant base calls are in red highlight. Zero is represented by blank cells. International Union of Biochemistry and Molecular Biology ambiguity codes are as follows; R A/G, Y C/T, W A/T, M A/C, K G/T, S G/C, B C/G/T, D A/G/T, H A/C/T, and V A/C/G. Overall, 99.8% concordance was observed for 311,227 bases compared.
Differences in Gold Standard and Exatype Editing of HIV-1 pol Sequences From Clinical Samples and Impact on Drug Resistance Interpretation.
| # with | # with AA differences | # with difference in resistance interpretation | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Region | # | # with NT differences | Difference Mix | Different NTb | Gap manualc | Error Manuald | Error Exatypee | ||
| PR | 126 | 17 | 0 | 0 | 3 | 9 | 24 | 24 | 1(ANRS) |
| RT | 126 | 17 | 4 | 1 | 1 | 8 | 31 | 54 | 1(ANRS); 2(REGA) |
We considered sequences that passed both Exatype and Gold standard editing. #, number of samples; NT, nucleotide; AA, amino acid; genotypic drug resistance interpretation systems: ANRS version 27, HIVDB version 8.9-1, and REGA version 8.0.2.
a) Number of samples with mixtures scored differently by the 2 approaches.
b) Number of samples with pure nucleotides scored separately by the 2 approaches.
c) Number of samples with parts of sequences that were not analyzable as judged by the editor.
d) The number of samples containing differences between Recall and Exatype editing due to manual editing.
e) Number of samples containing differences between Exatype and Recall editing due to errors made during automatic editing in Exatype.
Differences in RECall and Exatype Editing of HIV-1 pol Sequences From All EQA Samples and Impact on Drug Resistance Interpretation.
| # With NT differences compared to the reference sequence | # With AA, differences | # With differences in resistance interpretation | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PR | RT. | PR. | RT. | ||||||||||||||
| Total | Missed mix | False mix | Different NT/mix | Total | Missed mix | False mix | Different NT/mix | PR. | RT. | ANRS | HIVDB | REGA | ANRS | HIVDB | REGA | ||
| Exatype | 22/10 | 5/4 | 2/2 | 3/3 | 1/- | 2/- | 1/- | 1/- | -/- | 3/4 | 2/- | -/- | -/- | -/- | -/- | -/- | -/- |
| Recall | 22/12 | 5/3 | 1/1 | 3/2 | 1/- | -/2 | -/- | -/2 | -/- | 2/3 | -/- | -/- | -/- | -/- | -/- | -/- | -/- |
This analyses were confined to drug resistance positions (PR: 10, 20, 24, 30, 32, 33, 36, 46, 47, 48, 50, 53, 54, 63, 71, 73, 77, 82, 84, 88, 90; RT: 41, 62, 65, 67, 69ins, 69, 70, 74, 75, 77, 100, 103, 106, 108, 115, 116, 151, 181, 184, 188, 190, 210, 215, 219, 225). #, number of samples; PR, protease; RT, reverse transcriptase; NT, nucleotide; AA, amino acid; genotypic drug resistance interpretation systems: ANRS version 27, HIVDB version 8.9-1 and REGA version 8.0.2. The number of sequences that passed Exatype and RECall editing are before the slash. Number of sequences that did not pass either of the 2 approaches are behind the slash.
I. The number of samples with mixtures present in the reference sequence, but not scored by the editing approach (pure wild-type or mutant NT).
II. The number of samples with mixtures scored by the editing approach that was not present according to the reference sequence (pure wild-type or mutant NT).
III. Number of samples with mixtures and pure nucleotides scored differently by the editing approach and the reference sequence.
Comparison of RECall, Exatype Editing of WHO Dry Sample EQA Panel With the Reference Sequence at NT and AA level.
| RECall | Exatype | |||
|---|---|---|---|---|
| PR | RT. | PR. | RT. | |
| # sequences without NT differences | 28/34(82%) | 32/34(94%) | 24/32(75%) | 30/32(94%) |
| # sequences without AA differences | 30/34(88%) | 34/34(100%) | 26/32(81%) | 30/32(94%) |
| # NT differences/total # NT | 9/2112(0.43%) | 1/2562(0.04%) | 18/2023(0.88%) | 4/2400(0.17%) |
| # AA differences/total # AA | 5/724(0.72%) | 0/912(0%) | 10/675(1.48%) | 2/800(0.25) |
| # Me ∩ Mr | 18 | 18 | 12 | 8 |
| # Mr | 21 | 19 | 16 | 11 |
| P(Me|Mr) | 0.83 | 1 | 0.7 | 0.85 |
| # Me ∩ Pr | 7 | 2 | 8 | 1 |
| # Pr | 2081 | 2679 | 1999 | 2381 |
| P(Me|Pr) | 0.002 | 0.0008 | 0.004 | 0.0004 |
To meet the CLSI guidelines of 40% reference panels being EQA standards, we included dry panels from the WHO ResNet group. #, number of; AA, amino acids; NT, nucleotides; Me, mixtures present in the results of the editing approach; Mr, mixtures present in the reference sequences; Me ∩ Mr, mixtures present in the reference sequences that scored as a mixture by the editing approach; Pr, pure nucleotides present in the reference sequences; P(Me|Mr), the probability that a mixture scored if present in the reference sequence; Me∩Pr, pure nucleotides in the reference sequences that scored as a mixture by the editing approach; P(Me|Pr), the probability that a mixture scored if no mixture was present in the reference sequence.
Number of Samples Displaying (intermediate) Resistance to Different Drug Classes, According to ANRS, HIVDB, REGA, and Geno2Pheno.
| Data set | According to | ANRS | HIVDB | REGA | |||
|---|---|---|---|---|---|---|---|
| PI | RTI | PI | RTI | PI | RTI | ||
| Clinical | Exatype FASTA file | 82/126 (65%) | 51/126 (41%) | 33/126 (26%) | 42/126 (33%) | 34/126 (27%) | 41/126 (33%) |
| RECall FASTA file | 81/126 (64%) | 52/126 (41%) | 33/126 (26%) | 42/126 (33%) | 34/126 (27%) | 43/126 (34%) | |
| WHO Dry panel | Reference | 18/40 (45%) | 20/40 (50%) | 18/40 (45%) | 20/40 (50%) | 17/40 (43%) | 20/40 (50%) |
For the HIV-1 RNA remnant dataset, we included only the sequences that passed for both RECall, and Exatype editing. In contrast, we included resistance information of all reference sequences for the WHO dry panel dataset. FPR, false-positive rate; RTI, reverse transcriptase inhibitor; PI, protease inhibitor; genotypic drug resistance interpretation systems: ANRS version 27, HIVDB version 8.9-1 for the clinical dataset and HIVDB version 8.9-1 for the EQA dataset, and REGA version 8.0.2, G2P Geno2Pheno.