| Literature DB >> 34578305 |
Hope R Lapointe1,2, Weiyan Dong2, Winnie W Y Dong2, Don Kirkby2, Conan Woods2, Art F Y Poon3, Anita Y M Howe4, P Richard Harrigan1, Chanson J Brumme2,5.
Abstract
Despite the effectiveness of direct-acting antiviral agents in treating hepatitis C virus (HCV), cases of treatment failure have been associated with the emergence of resistance-associated substitutions. To better guide clinical decision-making, we developed and validated a near-whole-genome HCV genotype-independent next-generation sequencing strategy. HCV genotype 1-6 samples from direct-acting antiviral agent treatment-naïve and -treated HCV-infected individuals were included. Viral RNA was extracted using a NucliSens easyMAG and amplified using nested reverse transcription-polymerase chain reaction. Libraries were prepared using Nextera XT and sequenced on the Illumina MiSeq sequencing platform. Data were processed by an in-house pipeline (MiCall). Nucleotide consensus sequences were aligned to reference strain sequences for resistance-associated substitution identification and compared to NS3, NS5a, and NS5b sequence data obtained from a validated in-house assay optimized for HCV genotype 1. Sequencing success rates (defined as achieving >100-fold read coverage) approaching 90% were observed for most genotypes in samples with a viral load >5 log10 IU/mL. This genotype-independent sequencing method resulted in >99.8% nucleotide concordance with the genotype 1-optimized method, and 100% agreement in genotype assignment with paired line probe assay-based genotypes. The assay demonstrated high intra-run repeatability and inter-run reproducibility at detecting substitutions above 2% prevalence. This study highlights the performance of a freely available laboratory and bioinformatic approach for reliable HCV genotyping and resistance-associated substitution detection regardless of genotype.Entities:
Keywords: HCV; direct-acting antiviral agent; genotype-independent; resistance-associated substitutions; whole-genome sequencing
Mesh:
Substances:
Year: 2021 PMID: 34578305 PMCID: PMC8473162 DOI: 10.3390/v13091721
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Specimen characteristics for assessing individual assay validation parameters.
| Test | No. of Samples | Replicates per Sample | HCV pVL log10 (IU/mL), Median (Range) | Genotypes (LiPA Assay) |
|---|---|---|---|---|
| Accuracy | 93 | 2 | 6.8 (3.0–7.6) | 1a ( |
| Repeatability | 4 | 12 | 6.5 (6.1–7.3) | 1a ( |
| Reproducibility | 12 | 5 | 6.5 (6.0–7.7) | 1a ( |
| Sensitivity (viral load) & Genotype coverage | 148 | 1 | 6.1 (3.0–7.6) | 1a ( |
| Sensitivity (minor species) | 95 | 2 | NA | Unknown ( |
| Specificity-Negatives | 5 | 3 | NA | NA |
HCV, Hepatitis C Virus; pVL, plasma viral load; IU, international units; LiPA, line probe assay; NA, not available. Dashes indicate no data were available for the associated parameter.
Figure 1HCV genome map. A schematic representation of the HCV genome depicting coverage of the genotype-independent, near-whole genome sequencing method wherein the genome is amplified in two fragments then sequenced in parallel.
Figure 2Accuracy of RAS quantification using the genotype-independent sequencing method compared to a genotype 1-optimized assay. Accuracy for NS3 and NS5a was assessed by the comparison of amino acid substitution frequencies at predefined, resistance-associated position as determined by our previously validated genotype 1-optimized and genotype-independent assays. Each point represents one possible RAS; note that positions with multiple possible amino acid residues are represented as separate data points. (A) Amino acid prevalence is compared between both sequencing methods for the same samples. (B) The difference in RAS frequencies between both sequencing methods as a function of mean RAS prevalence. The two red dotted lines represent the 95% limits of agreement.
Figure 3Repeatability and reproducibility of amino acid substitutions in NS3, NS5a and NS5b. Substitutions observed at a prevalence ≥2% in a given replicate were considered “detected”. Variant detection rate (defined as the % of replicates per sample, in which a substitution was detected) is categorized by the mean prevalence of a substitution across all replicates of a sample. Replicate testing of each sample began from the same RNA extract; all steps beginning from the RT-PCR were repeated. Numbers indicated on the top of each graph represent the total number of substitutions in each bin. (A) Repeatability of amino acid substitutions was determined using four samples (3 GT1a, 1 GT1b) in 12 replicates processed on a single MiSeq run. (B) The reproducibility of amino acid substitutions was determined using 12 samples (10 GT1a, 1 GT1b, 1 GT3) in 5 replicates processed on separate days on five separate MiSeq runs.
Sequencing success of the genotype-independent NGS assay in GT1 and GT3-6 sample sets stratified by viral load category.
| Genotype 1 ( | Genotypes 3–6 ( | |||||
|---|---|---|---|---|---|---|
| HCV pVL log10 (IU/mL) | NS3 | NS5A | NS5B | NS3 | NS5A | NS5B |
| >7 | 23/23 (100%) | 23/23 (100%) | 23/23 (100%) | 2/2 (100%) | 2/2 (100%) | 2/2 (100%) |
| 6.6–7.0 | 32/33 (97%) | 32/33 (97%) | 32/33 (97%) | 1/1 (100%) | 1/1 (100%) | 1/1 (100%) |
| 6.1–6.5 | 14/14 (100%) | 14/14 (100%) | 14/14 (100%) | 4/4 (100%) | 4/4 (100%) | 4/4 (100%) |
| 5.6–6.0 | 16/16 (100%) | 16/16 (100%) | 16/16 (100%) | 10/10 (100%) | 10/10 (100%) | 10/10 (100%) |
| 5.1–5.5 | 1/2 (50%) | 1/2 (50%) | 1/2 (50%) | 20/23 (87%) | 20/23 (87%) | 15/23 (65%) |
| <5 | 1/5 (20%) | 2/5 (40%) | 1/5 (20%) | 7/13 (7/54%) | 7/13 (54%) | 7/13 (54%) |
HCV, hepatitis C virus; pVL, plasma viral load; IU, international units. Genotypes were inferred through sequence analysis or in the absence of sequence data from assigned genotypes from LiPA assays. HCV subtype information was obtained using phylogenetic analyses based on MiSeq data. Sequencing success was defined as sequences having a minimum of 100-fold coverage at all RAS-associated positions in each respective gene. ^ Two genotype 3–6 samples from the set of 55 samples were excluded from this table, as no viral load data were available.
Figure 4Assay sensitivity in detecting minority variants between sample duplicates at different mean nucleotide frequencies (defined as the mean prevalence of all permutations of nucleotides and positions within a sequence). This parameter was assessed using 95 samples run in duplicate, of which 84 passed sequencing. (A) The total number of nucleotide substitutions (N) detected at a particular mean substitution prevalence. (B) Median coefficient of variation (CV) of minor species with interquartile range (IQR) at different mean substitution prevalence.
Sequencing success for the HCV NS3, NS5a and NS5b genes using the genotype-independent-HCV assay for genotypes 1–6.
| Genotype | Genotype Subtypes * | Samples Attempted ^ | Sequenced (%) | ||
|---|---|---|---|---|---|
| NS3 | NS5a | NS5b | |||
| 1 | 1a | 78 | 72 (92%) | 72 (92%) | 72 (92%) |
| 1b | 11 | 11 (100%) | 11 (100%) | 11 (100%) | |
| 1e | 3 | 2 (67%) | 2 (67%) | 2 (67%) | |
| 2 | 2b | 2 | 2 (100%) | 2 (100%) | 2 (100%) |
| 3 | 3a | 20 | 17 (100%) | 17 (100%) | 17 (100%) |
| 4 | 4a, 4n | 7 | 5 (71%) | 5 (71%) | 2 (29%) |
| 5 | 5a | 4 | 3 (75%) | 3 (75%) | 3 (75%) |
| 6 | 6a, 6e, 6h, 6k, 6t | 21 | 20 (95%) | 20 (95%) | 17 (85%) |
| Overall | - | 146 | 132 (90%) | 132 (90%) | 126 (86%) |
* Genotypes were inferred through sequence analysis or in the absence of sequence data from assigned genotypes from LiPA assays. HCV subtype information was obtained using phylogenetic analyses based on MiSeq data. Sequencing success was defined as sequences having a minimum of 100-fold coverage at all RAS-associated positions in each respective gene. ^ Only samples with viral loads above the estimated assay limit of detection (5 log10 IU/mL) were included.