Literature DB >> 29362671

The global origins of resistance-associated variants in the non-structural proteins 5A and 5B of the hepatitis C virus.

Bradley R Jones1, Anita Y M Howe1, P Richard Harrigan1,2, Jeffrey B Joy1,2.   

Abstract

New, costly, fast acting, therapies targeting the non-structural proteins 5A and 5B (NS5A and NS5B) regions of the hepatitis C virus (HCV) genome are curative in the majority of cases. Variants with certain mutations in the NS5A and NS5B regions of HCV have been shown to reduce susceptibility to direct-acting NS5A and NS5B therapy and are found in treatment naïve patients. Despite this, the ease with which these variants evolve is poorly known, as are their evolutionary and geographic origins. To address this crucial gap we inferred the evolutionary and geographic origins of resistance-associated variants (RAVs) in the HCV NS5A and NS5B regions of subtypes 1a, 1b, and 3a sequences available from global databases. We found that RAVs in the NS5A region of HCV, when prevalent, were widely dispersed throughout the phylogenetic tree of HCV with multiple independent origins and that these variants are globally distributed. In contrast, most of the NS5B C316N variants came from one of two clades in the phylogenetic tree of HCV subtype 1b. The presence of serine (S) at codon 218 of HCV NS5B appears to facilitate the evolution of the C316N RAV. Other NS5B RAVs did not arise very frequently in our data set, except for S556G in subtype 1b and with respect to geography NS5B RAVs were also globally distributed. The inferred distribution of RAVs in the NS5A region and frequency of their origin suggest a low fitness barrier without the need for co-evolution of compensatory mutations. A low fitness barrier may allow rapid selection of de novo resistance to NS5A inhibitors during therapy.

Entities:  

Keywords:  drug resistance; hepatitis C virus; phylogenetics; phylogeography

Year:  2018        PMID: 29362671      PMCID: PMC5769712          DOI: 10.1093/ve/vex041

Source DB:  PubMed          Journal:  Virus Evol        ISSN: 2057-1577


1. Introduction

Understanding the origin and evolution of genetic variation conferring viral resistance to therapeutic agents is crucial to the long-term durability of therapy. Despite increasing access to novel, short-duration, curative, and direct-acting antiviral (DAA) treatment for hepatitis C virus (HCV), HCV associated mortality now exceeds all other infectious conditions in some developed countries (Ly et al. 2016); HCV contains a variety of naturally occurring mutations in the non-structural proteins 5A and 5B (NS5A and NS5B) that reduce susceptibility to currently approved therapies targeting these regions of the HCV genome (Fridell et al. 2011; Lam et al. 2012; Gao 2013; Gentile, Buonomo, and Borgia 2014; Poordad et al. 2014; Liu et al. 2015; Hezode et al. 2016). These mutations occasionally rise in frequency during treatment with antiviral drugs that target the NS5A or NS5B regions to become dominant in the virus population causing treatment failure and virological rebound. In addition, NS5A and NS5B ‘resistance-associated variants’ (RAVs), variants with mutations conferring resistance, have been found in viruses from treatment-naïve individuals, making treatment for those individuals by specific antiretroviral drugs ineffective. Since NS5A and NS5B inhibitors are relatively new, research into the capacity of HCV to develop resistance to these drugs is just beginning, thus little is known about the mutability of RAVs with or without the presence of therapy. Some of these RAVs confer high EC50 fold-shift—the change in the concentration of drug needed to eradicate 50 per cent of the virus with the RAV relative to wildtype virus—such as the Y93H NS5A RAV for the NS5A inhibitor, daclatasvir (1600× in subtype 1a (Gao 2013; Wong et al. 2013, Lontok et al. 2015), 3× in subtype 1b (Lontok et al 2015) and 537× in subtype 3a (Wang et al. 2013; Lontok et al. 2015)). Many RAVs exhibit increased resistance in the presence of one or more other RAVs in the same region. For example, a combination of S556G and C316N exhibits a 38× EC50 fold-shift for the NS5B inhibitor, dasabuvir, compared to the 5× EC50 fold-shiftC316N and the 11× EC50 fold-shift of S556G in subtype 1b (Dietz et al. 2015). An NS5B RAV of note is the S282T variant. The S282T RAV is the only variant to confer resistance to the NS5B inhibitor, sofosbuvir, producing a 13× fold-shift in subtype 1a (Lam et al. 2012; Lontok et al. 2015) and a 7.3× fold-shift in subtype 1b (Lam et al. 2012). Previous analyses of HCV RAVs in other gene regions, notably the Q80K variant in non-structural protein 3 (NS3), revealed that most resistant lineages were descended from a single origin and that they had a global common origin in North America (McCloskey et al. 2015; Cuypers et al. 2017). Additionally, these RAVs were shown to be strongly coupled to compensatory mutations suggesting both a high fitness barrier and that it could be relatively difficult to select these RAVs de novo during therapy (McCloskey et al. 2015). However, the evolutionary and geographic origins of RAVs in NS5A and NS5B and their dependence on compensatory mutations in these gene regions are poorly known. Thus, the relative ease with which RAVs in NS5A/B arise and may be driven to higher frequency by broad scale selection from DAA therapy remains unclear. If NS5A and NS5B RAVs have been shaped by similar evolutionary dynamics as NS3 RAVs (e.g. Q80K), we expect the NS5A and NS5B phylogenies to contain few large clades of RAVs. Alternatively, we would expect many independent origins of NS5A and NS5B RAVs, indicating a high mutation rate and low fitness barrier for those RAVs. Finally, we may observe few or no instances of an RAV in our datasets. The lack of, or infrequent, observation of a particular RAV in our data could occur for several reasons: (1) the variant has a high fitness barrier and thus does not arise frequently in treatment naïve cases; (2) the variant is not readily transmissible; and (3) it may be geographically distributed non-randomly and thus its infrequency could occur as a result of sampling. To address these hypotheses we inferred the global phylogenetic history of HCV RAVs in subtypes 1a, 1b, and 3a from public databases. We then analysed the phylogenetic and geographic origins of RAVs in NS5A and NS5B of HCV. Finally, we investigated a possible permissive mutation for the C316N variant in NS5B.

2. Materials and methods

2.1 Data collection and curation

We collected all of the HCV sequences from GenBank using the query ‘hepatitis + C+virus[orgn]’ on 30 August 2016, receiving 200,863 sequences. We removed all records not annotated with year and country, resulting in a dataset composed of 71,590 records. Using MAFFT v7.300b (Katoh and Standley 2013), we aligned each sequence to the HCV subtype 1a reference genome H77 (accession NC 004102). BioPython v1.67 (Cock et al. 2009) was used to strip insertions relative to H77 and clip the sequences to the NS5A and NS5B regions. Finally, we removed sequences with <50 per cent coverage over the NS5A/NS5B regions of H77 and removed duplicate sequences, retaining 4,916 NS5A sequences and 11,195 NS5B sequences. The sequences were then genotyped by adding reference sequences for the HCV subtypes: 1a, 1b, 1c, 1g, 2, 3a, 3b, 3i, 3k, 4, 5, 6, and 7 from the Los Alamos National Laboratory HCV Database (LANL) to the NS5A and NS5B alignment. We inferred a distribution of 1,000 bootstrap replicates of the approximate maximum likelihood (ML) trees for each region (NS5A and NS5B) with a generalized time reversible substitution model as implemented in FastTree v2.1.7 (Price, Dehal, and Arkin 2010). To ascribe sequences to particular subtypes, we selected the largest clade in each tree with all of the reference sequences of a particular subtype and no other reference sequence. Sequences that were assigned different subtypes in different replicate trees were discarded. To validate our HCV genotype assignment, results were compared against subtypes assigned by the HCV genotype assignment tool, COMET HCV (Struck et al. 2014); we discarded each sequence whose subtype disagreed with COMET HCV. When both the NS5A and NS5B regions were available for a sequence, if either method assigned different subtypes to the NS5A and NS5B regions then the sequence was discarded. Each sequence in our dataset was then realigned to a reference sequence of the same subtype obtained from LANL and clipped to the NS5A and NS5B regions as above. Sequences with <75 per cent coverage over the NS5A/NS5B region were subsequently discarded. Supplementary Tables S1 and S2 present the number of sequences found per subtype—at this stage we retained 4,510 NS5A sequences and 1,462 NS5B sequences. At this point we removed all clonal sequences from our datasets by assessing equality in all nucleotide positions of the sequences with BioPython and identical sequences were censured from the dataset. Next, trees were pruned to remove epidemiologically linked sequences. To accomplish this, we created another 1,000 bootstrap replicates of the ML trees for each region and subtype using FastTree, as above, and pruned tips with pairwise distance <0.025 substitutions per base keeping the tip closest to the root. We kept each sequence that was retained in at least 50 per cent of the 1,000 bootstrap trees. Because there were less than 100 sequences in each of subtypes: 1c, 1g, 2, 3b, 3i, 3k, 4, 5, 6, 7, we did not consider them further and only analysed NS5A sequences from subtypes 1a, 1b, and 3a and NS5B sequences from subtypes 1a and 1b since there were fewer than 100 NS5B sequences from subtype 3a. After data curation, our final dataset was composed of 1,390 NS5A sequences and 986 NS5B sequences. A diagram illustrating our data filtering process is shown in Supplementary Fig. S1. Since the collection dates of these sequences pre-date the use of NS5A inhibitors and NS5B inhibitors, these sequences are assumed to be from individuals naïve to treatment with NS5A and NS5B inhibitors; though some individuals may have undergone other HCV treatment.

2.2 Library of RAVs

We curated a list of RAVs in the NS5A region from the literature that confer resistance to at least one the following NS5A inhibitors: daclatasvir, ledipasvir, velpatasvir, ombitasvir and elbasvir (see Table 1). The RAVs were selected on the basis of two criteria: (1) reduced drug susceptibility comparing to wild-type HCV (EC50 fold-change greater than 5- to 100-fold depending on the drug) and (2) the RAVs were observed in patients who failed DAA treatment in the clinic. We curated an analogous list of RAVs in the NS5B region for the NS5B inhibitors: dasabuvir and sofosbuvir (see Table 2).
Table 1.

The NS5A RAVs considered in this study.

SubtypeVariantDaclatasvirLedipasvirVelpatasvirOmbitasvirElbasvir
1aM28A4,9511>1,0002613–5
1aM28G>1,000271,4293,4
1aM28T2056–8612,88,9658–11153–5,12
1aM28V588,10,11
1aQ30D9254,5
1aQ30E7,5006–85,4582,7,83713564,12
1aQ30G100–1,0002844,12
1aQ30H4356–8742,6–883–5
1aQ30K24,5451, 8>1,000218310,11
1aQ30R3656–81712,6–88008–11,141253–5,12
1aL31F963–5
1aL31I5081344
1aL31M4056–81412,6–81213,15103–5,12
1aL31V1,0006, 8>1002,6613–5
1aP32L50–1002
1aH58D5001,71,1272,7,82438, 1063–5,12
1aY93C5551,6–81,6022,6–811131,6758–10114,5,12
1aY93H1,6006–81,6772,6–860913,1541,3838–10,142203–5,12
1aY93N14,1006–8>14,7062,7,8366,7408–11,149293–5,12
1aY93S>1,00021,01310
1bL28T6619,10
1bL31F581010,11154,5
1bL31M74,5
1bL31V281,6,8810,11134,5
1bY93H241,6,81,3256,8778,10,11,14
1bY93S1211
3aM28T63910
3aA30K448,16,17148,161613,15505
3aL31F6031728101435
3aL31M5378,16
3aL31V1,1818,16
3aY93H2,1548,16,17288,16246136,728104855

Only RAVs deemed likely to cause resistance to at least one NS5A inhibitor according to the literature were considered. The values are the EC50 fold-shift of the variant with respect for the drug. A blank entry indicates that the RAV was not shown to cause resistance to the particular NS5A inhibitor.

Source: 1Fridell et al. (2011); 2Sarrazin et al. (2014); 3Black et al. (2015); 4Jacobson et al. (2015); 5Liu et al. (2015); 6Gao (2013); 7Wong et al. (2013); 8Lontok et al. (2015); 9Gentile, Buonomo, and Borgia (2014); 10Krishnan et al. (2015a); 11Krishnan et al. (2015b); 12Lahser et al. (2016b); 13Cheng et al. (2013); 14Poordad et al. (2014); 15Hezode et al. (2016); 16Hernandez et al. (2013); 17Wang et al. (2013).

Table 2.

The NS5B RAVs considered in this study.

SubtypeVariantEC50 fold-shift
1aS282T13 for sofosbuvir1,2
1aC316Y1,472 for dasabuvir1,3,4
1aA395G20 for dasabuvir5
1aM414I17 for dasabuvir1,5
1aM414T32 for dasabuvir1,3,5
1aM414V18 for dasabuvir5
1aN444K23 for dasabuvir5
1aY448C400 for dasabuvir5
1aY448H975 for dasabuvir1,4,5
1aA553T152 for dasabuvir1,3
1aG554S120 for dasabuvir1,3
1aS556G30 for dasabuvir1,3,4,5
1aS556R261 for dasabuvir5
1bS282T7.8 for sofosbuvir2
1bC316N5 for dasabuvir4,6
1bC316Y5 for dasabuvir1,7,5
1bS368T1,569 for dasabuvir5
1bN411S54 for dasabuvir5
1bM414I15 for dasabuvir7,5
1bM414T26 for dasabuvir5
1bM414V18 for dasabuvir5
1bY448C160 for dasabuvir5
1bY448H37 for dasabuvir5
1bA553V58 for dasabuvir5
1bS556G11 for dasabuvir1,5
1bD559G100 for dasabuvir5

Only RAVs deemed likely to cause resistance to at least one NS5B inhibitor according to the literature were considered. All substitutions are likely to cause resistance to dasabuvir, except S282T, which is likely to cause resistance to sofosbuvir.

Source: 1Lontok et al. (2015); 2Lam et al. (2012); 3Krishnan et al. (2015b); 4Dietz et al. (2015); 5Koev et al. (2015); 6Kati et al. (2015); 7Poordad et al. (2014).

The NS5A RAVs considered in this study. Only RAVs deemed likely to cause resistance to at least one NS5A inhibitor according to the literature were considered. The values are the EC50 fold-shift of the variant with respect for the drug. A blank entry indicates that the RAV was not shown to cause resistance to the particular NS5A inhibitor. Source: 1Fridell et al. (2011); 2Sarrazin et al. (2014); 3Black et al. (2015); 4Jacobson et al. (2015); 5Liu et al. (2015); 6Gao (2013); 7Wong et al. (2013); 8Lontok et al. (2015); 9Gentile, Buonomo, and Borgia (2014); 10Krishnan et al. (2015a); 11Krishnan et al. (2015b); 12Lahser et al. (2016b); 13Cheng et al. (2013); 14Poordad et al. (2014); 15Hezode et al. (2016); 16Hernandez et al. (2013); 17Wang et al. (2013). The NS5B RAVs considered in this study. Only RAVs deemed likely to cause resistance to at least one NS5B inhibitor according to the literature were considered. All substitutions are likely to cause resistance to dasabuvir, except S282T, which is likely to cause resistance to sofosbuvir. Source: 1Lontok et al. (2015); 2Lam et al. (2012); 3Krishnan et al. (2015b); 4Dietz et al. (2015); 5Koev et al. (2015); 6Kati et al. (2015); 7Poordad et al. (2014).

2.3 Phylogenetic and phylogeographic inferences and ancestral genome reconstruction

For each subtype (1a, 1b, and 3a) and region (NS5A and NS5B), we built 1,000 bootstrap phylogenetic trees with RAxML v8.2.10 (Stamatakis 2014) and a GTR + Γ model of nucleotide substitution. Each tree was rooted with the rtt function of the R package ape (Paradis, Claude, and Strimmer 2004) and time scaled with node.dating (Jones and Poon 2017). We simultaneously reconstructed the ancestral sequences and country of origin using BEAST v1.8.3 (Drummond et al. 2012) Markov chain Monte Carlo with two parallel runs per data set each with 10 million generations. For the BEAST runs, we drew trees from our distribution of 1,000 bootstrap phylogenetic trees generated by RAxML, to reconstruct the ancestral sequences we used a relaxed lognormal molecular clock model and the GTR + Γ nucleotide substitution model, and to reconstruct the country of origin, we employed a symmetric substitution model and a strict clock model. The state trees from the chosen parallels runs were combined and down sampled to 1,000 trees with LogCombiner v.2.4.7 (Bouckaert et al. 2014). Finally, we inferred maximum clade credibility (MCC) trees with mean heights for each subtype with TreeAnnotator v2.4.7 (Bouckaert et al. 2014). For each codon position with an RAV observed in at least ten sequences, we computed the distribution of continents of origin of sequences exhibiting RAVs at that codon position. We compared this to the distribution of continents of sequences not exhibiting an RAV at that codon position by computing the concordance (Lin 1989) between the two distributions. We employed Fisher’s exact test to test for associations between the presence of RAVs in a sequence and variants at other positions in the same sequence. We performed the concentrated changes test (CCT) (Maddison 1990) using a custom R script to verify whether the A218S variant influenced the evolution of the C316N RAV. Plots were created using the R package ggtree v1.11.0 (Yu et al. 2017) and DensiTree v2.0 (Bouckaert 2010).

3. Results

3.1 Genotypic root ages

We inferred a distribution of 1,000 BEAST trees for each subtype (1a, 1b and 3a) and region (NS5A and NS5B). Figures 1 and 2 show the MCC trees for the NS5A and NS5B data sets, respectively. See Supplementary Figs S1 and S2 for composite plots of the 1,000 BEAST trees with DensiTree (Bouckaert 2010). The roots of the trees were situated in the early twentieth century. In particular, for NS5A subtype 1a, the mean root date was 1902 with a 95 per cent highest posterior density (HPD) interval of (1885, 1935); for NS5A subtype 1b, the mean root date was 1896 with an HPD of (1862, 1924); and for NS5A subtype 3a, the mean root date was 1912 with an HPD of (1888, 1948). The mean root date for NS5B subtype 1a was 1910 with an HPD of (1889, 1934) and the mean root date for NS5B subtype 1b was 1900 with an HPD of (1876, 1923).
Figure 1.

Geographic history of HCV NS5A. Each figure represents the phylogeographic history of the global HCV NS5A population for a particular subtype as an MCC tree. The trees are scaled to units of time with the dates in years common era (CE). Black bars show the 95 per cent HPD interval of the date of the root of the tree. Nodes with posterior probability >90 per cent are marked with an asterisk (*). The edges of the trees are coloured by the inferred continent of origin of the child of the edge. (A) NS5A subtype 1a, (B) NS5A subtype 1b and (C) NS5A subtype 3a.

Figure 2.

Geographic history of HCV NS5B. Each figure represents the phylogeographic history of the global HCV NS5B population for a particular subtype as a composite of the MCC tree. The trees are scaled to units of time with the dates in years CE. Black bars show the 95 per cent HPD interval of the date of the root of the tree. Nodes with posterior probability >90 per cent are marked with an asterisk (*). The edges of the trees are coloured by the inferred continent of origin of the child of the edge. (A) NS5B subtype 1a and (B) NS5B subtype 1b.

Geographic history of HCV NS5A. Each figure represents the phylogeographic history of the global HCV NS5A population for a particular subtype as an MCC tree. The trees are scaled to units of time with the dates in years common era (CE). Black bars show the 95 per cent HPD interval of the date of the root of the tree. Nodes with posterior probability >90 per cent are marked with an asterisk (*). The edges of the trees are coloured by the inferred continent of origin of the child of the edge. (A) NS5A subtype 1a, (B) NS5A subtype 1b and (C) NS5A subtype 3a. Geographic history of HCV NS5B. Each figure represents the phylogeographic history of the global HCV NS5B population for a particular subtype as a composite of the MCC tree. The trees are scaled to units of time with the dates in years CE. Black bars show the 95 per cent HPD interval of the date of the root of the tree. Nodes with posterior probability >90 per cent are marked with an asterisk (*). The edges of the trees are coloured by the inferred continent of origin of the child of the edge. (A) NS5B subtype 1a and (B) NS5B subtype 1b.

3.2 RAVs in NS5A

Ancestral reconstruction reveals that variants conferring reduced susceptibility to NS5A inhibitors (e.g. L31M, Y93H; Table 3) have many independent origins. Figure 3 and Supplementary Fig. S4 show MCC trees detailing the evolutionary history of the RAVs in NS5A; only the trees for RAVs observed in the data are shown (see Table 3 for the list of NS5A RAVs found and see Supplementary Figs S5–S7 for composite plots).
Table 3.

The distribution of NS5A RAVs.

SubtypeVariantPrevalence (%)SingletonsCladesLargest Clade
1aM28T0.31 [2]2 (0)0 (0)
1aM28V3.0 [19]19 (0.91)0.28 (0.52)2.3 (0.49)
1aQ30H0.79 [5]5.0 (0.083)0.007 (0.083)2 (0)
1aL31M1.3 [8]4.8 (0.41)1.2 (0.40)3.0 (0.84)
1aH58D0.31 [2]2 (0)0 (0)
1aY93C0.63 [4]4.0 (0.083)0.007 (0.083)2 (0)
1aY93H0.63 [4]4.0 (0.0.32)0.001 (0.032)2 (0)
1bL31M4.0 [24]20 (1.4)1.9 (0.67)2.2 (0.55)
1bY93H3.5 [21]18 (1.2)1.4 (0.61)2.1 (0.57)
3aA20K3.8 [6]1.4 (0.66)1.0 (0.055)3.6 (1.4)
3aY93H4.4 [7]6.4 (0.83)0.44 (0.57)2.1 (0.22)

Prevalence is the percentage of tips that exhibit the RAV (quantity shown in square brackets). Singletons are the average number of RAV tips that did not have an inferred RAV ancestor. Clade sizes are the average number of clades with an inferred RAV ancestor. Largest clade is the average size of the largest clade with an inferred RAV ancestor (size is calculated by the number of tips in the clade; this may include tips in the clade that do not exhibit an RAV). Largest clade is calculated over all replicates that contain at least one clade. Values in parentheses indicate standard deviations of the data. RAVs not found in the data set are excluded from this table.

Figure 3.

The phylogenetic history of select RAVs in NS5A. Each figure represents the phylogenetic history of the global HCV NS5A population for a particular subtype as an MCC tree. The trees were scaled to units of time with dates in years CE. The edges of the trees are coloured by the inferred state of the specified codon of the child of the edge. (A) M28T/V in subtype 1a, (B) L31M in subtype 1b, (C) A30K in subtype 3a, (D) Y93C/H in subtype 1a, (E) Y93H in subtype 1b and (F) Y93H in subtype 3a.

The distribution of NS5A RAVs. Prevalence is the percentage of tips that exhibit the RAV (quantity shown in square brackets). Singletons are the average number of RAV tips that did not have an inferred RAV ancestor. Clade sizes are the average number of clades with an inferred RAV ancestor. Largest clade is the average size of the largest clade with an inferred RAV ancestor (size is calculated by the number of tips in the clade; this may include tips in the clade that do not exhibit an RAV). Largest clade is calculated over all replicates that contain at least one clade. Values in parentheses indicate standard deviations of the data. RAVs not found in the data set are excluded from this table. The phylogenetic history of select RAVs in NS5A. Each figure represents the phylogenetic history of the global HCV NS5A population for a particular subtype as an MCC tree. The trees were scaled to units of time with dates in years CE. The edges of the trees are coloured by the inferred state of the specified codon of the child of the edge. (A) M28T/V in subtype 1a, (B) L31M in subtype 1b, (C) A30K in subtype 3a, (D) Y93C/H in subtype 1a, (E) Y93H in subtype 1b and (F) Y93H in subtype 3a. In subtypes 1a and 1b, tips displaying an NS5A RAV rarely coalesced. Most of the phylogenies inferred using BEAST revealed no large clades of RAVs. What clades they had were composed of only two or three tips (see Table 3). Codons 28, 31, and 93 of NS5A in subtype 1a showed significant presence of RAVs with 1.2–3.3 per cent of the sequences containing an RAV at those codons. The other NS5A RAVs found in subtype 1a (Q30H and H58D) had a lower prevalence (<1%). The only NS5A RAVs found in subtype 1b were L31M and Y93H also at a significant prevalence (see Table 3). In contrast to RAVs in other gene regions, the high frequency of independent origins of NS5A RAVs and their shallow depth in the tree suggests that RAVs in NS5A have a low fitness barrier and that NS5A RAVs could evolve rapidly in vivo. In subtype 3a, the A30K RAV displayed a different pattern. Most of 1,000 BEAST trees contained a single coalescing clade of A30K variants (see Fig. 3C and Table 3). The Y93H RAV, however, behaved the same in subtype 3a as in subtypes 1a and 1b, with many widely dispersed variants few of which coalesced (see Fig. 3D–F).

3.3 RAVs in NS5B

Most RAVs in NS5B had low prevalence or were not present in our data set at all (see Table 4 and Supplementary Figs S8–S11). Interestingly, the S282T RAV, the only known RAV conferring resistance to sofosbuvir, was not found in subtype 1a nor subtype 1b in our data set. However, S556G and C316N RAVs were found in many samples in subtype 1b and were likely to belong to large clades of variants with that RAV (see Table 4). In contrast, we found only four samples with the S556G RAV and no samples with the C316 RAV in subtype 1a.
Table 4.

The distribution of NS5B RAVs.

SubtypeVariantPrevalence (%)SingletonsCladesLargest clade
1aY448H0.19 [1]1 (0)0 (0)
1aS556G0.75 [4]4.0 (0.063)0.005 (0.071)2.2 (0.49)
1aS556R0.19 [1]0.802 (0.40)0.20 (0.40)2.6 (0.53)
1bC316N116 [71]0.23 (0.46)1.7 (0.56)58 (22)
1bS368T0.22 [1]1 (0)0 (0)
1bM414I0.22 [1]1 (0)0 (0)
1bY448H0.22 [1]1 (0)0 (0)
1bS556G9.2 [41]30 (2.4)3.3 (1.2)13 (30)

Prevalence is the percentage of tips that exhibit the RAV (quantity shown in square brackets). Singletons are the average number of RAV tips that did not have an inferred RAV ancestor. Clade sizes are the average number of clades with an inferred RAV ancestor. Largest clade is the average size of the largest clade with an inferred RAV ancestor (size is calculated by the number of tips in the clade; this may include tips in the clade that do not exhibit an RAV). Largest clade is calculated over all replicates that contain at least one clade. Values in parentheses brackets indicate standard deviations of the data. RAVs not found in the data set are excluded from this table.

The distribution of NS5B RAVs. Prevalence is the percentage of tips that exhibit the RAV (quantity shown in square brackets). Singletons are the average number of RAV tips that did not have an inferred RAV ancestor. Clade sizes are the average number of clades with an inferred RAV ancestor. Largest clade is the average size of the largest clade with an inferred RAV ancestor (size is calculated by the number of tips in the clade; this may include tips in the clade that do not exhibit an RAV). Largest clade is calculated over all replicates that contain at least one clade. Values in parentheses brackets indicate standard deviations of the data. RAVs not found in the data set are excluded from this table.

3.4 The C316N RAV in NS5B subtype 1b

The C316N RAV in NS5B subtype 1b produces a 5× EC50 fold-shift in dasabuvir (Dietz et al. 2015). Two large clades of C316N are present in most of the BEAST trees of NS5B subtype 1b inferred by our analysis (see Fig. 4 and Supplementary Fig. S12). After compiling co-occurring mutations and testing for association with C316N, we discovered very strong associations of C316N to the variants L159F, A207T and A218S (all P < 10−3). Whether or not these variants confer resistance to NS5B inhibitors is unknown.
Figure 4.

The phylogenetic history of the C316N and A218S variants in NS5B subtype 1b. An MCC tree shows the phylogenetic history of the global HCV NS5B population in subtype 1b. The tree was scaled to units of time with dates in years CE. The edges of the tree are coloured by the inferred states of codons 316 and 218 in NS5B of the child of the edge. Note the two clades of green variants with both the C316N and A218S variants—red bars mark these clades. These clades are each contained in separate clades of A218S variants.

The phylogenetic history of the C316N and A218S variants in NS5B subtype 1b. An MCC tree shows the phylogenetic history of the global HCV NS5B population in subtype 1b. The tree was scaled to units of time with dates in years CE. The edges of the tree are coloured by the inferred states of codons 316 and 218 in NS5B of the child of the edge. Note the two clades of green variants with both the C316N and A218S variants—red bars mark these clades. These clades are each contained in separate clades of A218S variants. Only one of the seventy-one sequences containing the C316N variant did not contain the A218S variant. However, this sequence appears in most of BEAST trees in a clade containing other C316N variants. Figure 4 shows the evolutionary relationships between C316N and A218S. In 80 per cent of the BEAST trees, each of the clades of C316N variants is contained within clades that exhibit A218S variants, suggesting that A218S serves as a permissive mutation for C316N. This is reinforced by the fact that there are isolated C316N variants and each of these variants has the A218S variant. Further support for this pattern is provided by the CCT (Maddison 1990), which reveals a significant effect of the state of codon 218 on the mutability of codon 316 (P < 10−2). The other highly associated variants, L159F and A207T, only appear alongside C316N in the presence of A218S; in fact, A207T was found exclusively in sequences with the A218S variant. There is also a strong association between C316N and the S556G RAV (P < 10−3). It has been shown that the combination of C316N and S556G has a 38× EC50 fold-shift for dasabuvir compared to the 5× EC50 fold-shift of C316N and the 11× EC50 fold-shift of S556G in subtype 1b (Dietz et al. 2015). The large clusters of S556G and C316N variants in NS5B subtype 1b contain sequences from multiple continents. Interestingly, in subtype 1a, cysteine (C) was found at codon 316 and serine (S) was found at codon 218 in every sequence considered in this study.

3.5 RAVs and geography

The phylogenetic trees of each subtype and region had a variety of inferred origins. For the NS5A data sets, most (99.9%) of the subtype 1a BEAST trees had a geographic origin in the Americas, most (96%) of the subtype 1b BEAST trees had a geographic origin in Europe and most (97%) of the subtype 3a BEAST trees had a geographic origin in Asia. For the NS5B data sets, most of the subtype 1a (99.9%) and subtype 1b (95%) BEAST trees had a geographic origin in the Americas. The disagreement of geographic origin between NS5A subtype 1b and NS5B subtype 1b is probably due to oversampling of European sequences in NS5A subtype 1b and American sequences in NS5B subtype 1b. The RAVs in the NS5A region are widely dispersed around the world and were evenly dispersed geographically within our dataset (Fig. 1). The concordance of the continents of origin of RAVs compared to the continents of origin of other variants at the same codon was high in the NS5A region—in subtype 1a (M28T/V: 0.96, Q30H: 0.89, L31M: 0.89, Y93C/H: 0.93), in subtype 1b (L31M: 0.89, Y93H: 0.91) and in subtype 3a (Y93H: 0.97). However, in subtype 3a, the NS5A RAV, A30K, had a concordance of 0.22. This was due to there being more American and Oceanic samples exhibiting the A30K variant. As highlighted previously, the A30K RAV behaves differently than the other RAV in NS5A in that it is highly localized in one clade. The geographic distribution of RAVs in NS5B subtype 1b is similar to those in NS5A (Fig. 2), with an even dispersion and global distribution of RAVs. Similarly, the concordance of the continents of origin of RAVs compared to the continents of origin of other variants at the same codon was high in the NS5B region (C316N: 0.84, S556G: 0.83). However, in subtype 1a, the concordance for the NS5B RAV, S556G/R, was 0.69. This was due to more sequences exhibiting the S556G/R RAVs having originated from Europe and Oceania.

4. Discussion

The RAVs in the NS5A region have a strikingly different distribution relative to that of the Q80K RAV in NS3 (see Fig. 1 in McCloskey et al. (2015)). While most of the Q80K variants in NS3 with requisite compensatory mutations are descended from a common ancestor, the RAVs in the NS5A region show multiple origins dispersed throughout the HCV phylogeny. The low prevalence of coalescing RAVs combined with the high prevalence of RAVs, suggest that RAVs mutate readily in this gene region and have a low-fitness barrier. The plasticity of the development of RAVs supports the notion that selection for these variants could occur rapidly during treatment with NS5A inhibitors (Lahser et al. 2016a, 2016b). Furthermore, these RAVs could be maintained after treatment is withdrawn and be readily transmitted between individuals (Dvory-Sobol et al. 2015). Our results suggest that in the NS5B region, the A218S variant acts as a permissive mutation for the C316N RAV and in particular, the A218S variant seeds the C316N RAV (see Fig. 4). This relationship could confer a higher probability of HCV evolving the C316N variant in patients on therapy with dasabuvir if the A218S variant is present and thus lead to resistance and ultimately treatment failure. Future work should investigate the EC50 fold-shift on dasabuvir with respect to the A218S variant in combination and apart from the C316N variant. We recommend that the effect of the A218S variant on the fitness of the C316N RAV and mutability of codon 316 in the presence of dasabuvir be explored in vivo. Future research would also profitably focus on the interaction between the C316N and S556G RAVs in NS5B. Our analyses reveal a high prevalence and strong association of these two RAVs in HCV subtype 1b. This is clinically significant as this combination exhibits a 38× EC50 fold-shift for dasabuvir compared to the 5× EC50 fold-shift of C316N and the 11× EC50 fold-shift of S556G in subtype 1b (Dietz et al. 2015). Overall, the NS5A and NS5B trees of each subtype have roots that date to the beginning of the twentieth century. Between regions the subtype 1a and 1b trees have root dates that agree within 10 years. However, there is disagreement between the geographic origin of subtype 1b in the NS5A and NS5B trees. The NS5A origin is in Europe and the NS5B origin is in the Americas. Considering that the most sampled continent was always the inferred originating continent of the tree this discrepancy is likely due to oversampling of locations. There is no discordance in the geographic origin of subtype 1a between NS5A and NS5B. To account for the oversampling we ran our pipeline through multiple trial runs subsampling the sequences to obtain a more representative distribution of continents sampled. All of our subsampling trials in subtype 1a agreed with our main results in that the roots of the majority of trees were inferred to be American. However, our NS5A subtype 1b trials produced trees whose roots were in the Americas and our NS5B subtype 1b trials produced trees whose roots were in Europe—the opposite of our main result. Due to this we conclude that the geographic origin of subtype 1b ancestor could not be resolved (see Supplementary Materials). In conclusion, we found that NS5A RAVs, when found in a subtype, were widely dispersed geographically and phylogenetically, in contrast to the behaviour of the Q80K mutation in NS3. The RAVs in NS5B subtype 1a were scarce; however, C316N NS5B RAVs in subtype 1b formed two large clades and their evolution appears to be facilitated by the A218S variant in NS5B. Click here for additional data file.
  30 in total

1.  DensiTree: making sense of sets of phylogenetic trees.

Authors:  Remco R Bouckaert
Journal:  Bioinformatics       Date:  2010-03-12       Impact factor: 6.937

2.  FastTree 2--approximately maximum-likelihood trees for large alignments.

Authors:  Morgan N Price; Paramvir S Dehal; Adam P Arkin
Journal:  PLoS One       Date:  2010-03-10       Impact factor: 3.240

3.  A concordance correlation coefficient to evaluate reproducibility.

Authors:  L I Lin
Journal:  Biometrics       Date:  1989-03       Impact factor: 2.571

4.  Global origin and transmission of hepatitis C virus nonstructural protein 3 Q80K polymorphism.

Authors:  Rosemary M McCloskey; Richard H Liang; Jeffrey B Joy; Mel Krajden; Julio S G Montaner; P Richard Harrigan; Art F Y Poon
Journal:  J Infect Dis       Date:  2014-11-10       Impact factor: 5.226

5.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

Review 6.  Antiviral activity and resistance of HCV NS5A replication complex inhibitors.

Authors:  Min Gao
Journal:  Curr Opin Virol       Date:  2013-07-27       Impact factor: 7.090

7.  In vitro activity and resistance profile of dasabuvir, a nonnucleoside hepatitis C virus polymerase inhibitor.

Authors:  Warren Kati; Gennadiy Koev; Michelle Irvin; Jill Beyer; Yaya Liu; Preethi Krishnan; Thomas Reisch; Rubina Mondal; Rolf Wagner; Akhteruzzaman Molla; Clarence Maring; Christine Collins
Journal:  Antimicrob Agents Chemother       Date:  2014-12-22       Impact factor: 5.191

8.  Natural prevalence of NS5A polymorphisms in subjects infected with hepatitis C virus genotype 3 and their effects on the antiviral activity of NS5A inhibitors.

Authors:  Dennis Hernandez; Nannan Zhou; Joseph Ueland; Aaron Monikowski; Fiona McPhee
Journal:  J Clin Virol       Date:  2013-02-04       Impact factor: 3.168

9.  In vitro activity of daclatasvir on hepatitis C virus genotype 3 NS5A.

Authors:  Chunfu Wang; Lourdes Valera; Lingling Jia; Melissa J Kirk; Min Gao; Robert A Fridell
Journal:  Antimicrob Agents Chemother       Date:  2012-10-22       Impact factor: 5.191

10.  Susceptibilities of genotype 1a, 1b, and 3 hepatitis C virus variants to the NS5A inhibitor elbasvir.

Authors:  Rong Liu; Stephanie Curry; Patricia McMonagle; Wendy W Yeh; Steven W Ludmerer; Patricia A Jumes; William L Marshall; Stephanie Kong; Paul Ingravallo; Stuart Black; Irene Pak; Mark J DiNubile; Anita Y M Howe
Journal:  Antimicrob Agents Chemother       Date:  2015-08-24       Impact factor: 5.938

View more
  3 in total

1.  Intra-host evolutionary dynamics of the hepatitis C virus among people who inject drugs.

Authors:  Vincent Montoya; Anita Y M Howe; Weiyan Y Dong; Winnie Dong; Chanson J Brumme; Andrea D Olmstead; Kanna Hayashi; P Richard Harrigan; Jeffrey B Joy
Journal:  Sci Rep       Date:  2021-05-11       Impact factor: 4.379

Review 2.  Advances in Visualization Tools for Phylogenomic and Phylodynamic Studies of Viral Diseases.

Authors:  Kristof Theys; Philippe Lemey; Anne-Mieke Vandamme; Guy Baele
Journal:  Front Public Health       Date:  2019-08-02

3.  One day at a time.

Authors:  Aila Akosua Kattner
Journal:  Biomed J       Date:  2022-01-15       Impact factor: 7.892

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.