| Literature DB >> 35890052 |
Petra Simicic1, Anamarija Slovic2, Leona Radmanic1, Adriana Vince3,4, Snjezana Zidovec Lepej1.
Abstract
Molecular epidemiology of hepatitis C virus (HCV) is exceptionally complex due to the highly diverse HCV genome. Genetic diversity, transmission dynamics, and epidemic history of the most common HCV genotypes were inferred by population sequencing of the HCV NS3, NS5A, and NS5B region followed by phylogenetic and phylodynamic analysis. The results of this research suggest high overall prevalence of baseline NS3 resistance associate substitutions (RAS) (33.0%), moderate prevalence of NS5A RAS (13.7%), and low prevalence of nucleoside inhibitor NS5B RAS (8.3%). Prevalence of RAS significantly differed according to HCV genotype, with the highest prevalence of baseline resistance to NS3 inhibitors and NS5A inhibitors observed in HCV subtype 1a (68.8%) and subtype 1b (21.3%), respectively. Phylogenetic tree reconstructions showed two distinct clades within the subtype 1a, clade I (62.4%) and clade II (37.6%). NS3 RAS were preferentially associated with clade I. Phylogenetic analysis demonstrated that 27 (9.0%) HCV sequences had a presumed epidemiological link with another sequence and classified into 13 transmission pairs or clusters which were predominantly comprised of subtype 3a viruses and commonly detected among intravenous drug users (IDU). Phylodynamic analyses highlighted an exponential increase in subtype 1a and 3a effective population size in the late 20th century, which is a period associated with an explosive increase in the number of IDU in Croatia.Entities:
Keywords: direct acting antivirals; epidemiology; genotypes; hepatitis C virus; intravenous drug users; phylodynamics; phylogenetics; resistance associated substitutions; subtypes
Year: 2022 PMID: 35890052 PMCID: PMC9323280 DOI: 10.3390/pathogens11070808
Source DB: PubMed Journal: Pathogens ISSN: 2076-0817
Demographic, epidemiological, clinical, and laboratory characteristics of the study population.
| Overall | Subtype 1a | Subtype 1b | Subtype 3a | |
|---|---|---|---|---|
| Patients | N = 300 | N = 109 | N = 80 | N = 111 |
| Gender, | ||||
| M | 187 (62.3) | 73 (67.0) | 36 (45.0) | 78 (70.3) |
| F | 113 (37.7) | 36 (33.0) | 44 (55.0) | 33 (29.7) |
| Age (years), | ||||
| 18–35 | 32 (10.7) | 11 (10.1) | 5 (6.3) | 16 (14.4) |
| 36–47 | 140 (46.7) | 66 (60.6) | 11 (13.8) | 63 (56.8) |
| 48–59 | 72 (24.0) | 27 (24.8) | 21 (26.3) | 24 (21.6) |
| >60 | 56 (18.7) | 5 (4.6) | 43 (53.8) | 8 (7.2) |
| Age (years), median (Q1–Q3) | 45 | 44 | 61 | 43 |
| (40.0–57.0) | (40.0–50.5) | (52.0–65.8) | (39.0–49.0) | |
| Fibrosis stage *, | ||||
| F0/F1 | 103 (35.2) | 48 (44.9) | 22 (27.8) | 33 (30.8) |
| F2 | 72 (24.6) | 27 (25.2) | 21 (26.6) | 24 (22.4) |
| F3 | 44 (15.0) | 15 (14.0) | 12 (15.2) | 17 (15.9) |
| F4 | 74 (25.3) | 17 (15.9) | 24 (30.4) | 33 (30.8) |
| Reported HCV infection route, | ||||
| IDU | 95 (31.7) | 48 (44.0) | 1 (1.3) | 46 (41.4) |
| Iatrogenic | 48 (16.0) | 14 (12.8) | 25 (31.3) | 9 (8.1) |
| perinatal/sexual | 11 (3.7) | 2 (1.8) | 1 (1.3) | 8 (7.2) |
| Unknown | 146 (48.7) | 45 (41.3) | 53 (66.3) | 48 (43.2) |
| Viral load, median (Q1–Q3), log IU/mL | 6.0 (5.5–6.3) | 6.0 (5.5–6.5) | 5.9 (5.5–6.3) | 6.0 (5.5–6.3) |
Q1: first quartile; Q3: third quartile; IDU: intravenous drug use; *: available for 107 subtype 1a, 79 subtype 1b and 107 subtype 3a patients.
Comparison of demographic, clinical, and laboratory characteristics of patients with and without NS3, NS5A, or NS5B RAS.
| Presence of NS3 RAS | Absence of NS3 RAS | Presence of NS5A RAS | Absence of NS5A RAS | Presence of NS5B RAS | Absence of NS5B RAS | Total, N (%) | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Patients, | 98 (32.7) | 202 (67.3) | 41 (13.7) | 259 (86.3) | 25 (8.3) | 275 (91.7) | 300 (100) | |||
| Gender, | ||||||||||
| M | 61 (32.6) | 126 (67.4) | 0.982 | 27 (14.4) | 160 (85.6) | 0.617 | 13 (7.0) | 174 (93.1) | 0.265 | 187 (62.3) |
| F | 37 (32.7) | 76 (67.3) | 14 (12.4) | 99 (87.6) | 12 (10.6) | 101 (89.4) | 113 (37.7) | |||
| Age, median years (Q1–Q3) c | 44 (40–54) | 46 (40–58) | 0.168 | 46 (42.5–60.5) | 45 (40–57) | 0.271 | 57 (39–63) | 45 (40–56) | 0.111 | 45 (40–57) |
| HCV subtype, | ||||||||||
| 1a | 75 (68.8) | 34 (31.2) |
| 17 (15.6) | 92 (84.4) |
| 0 (0.0) | 109 (100.0) |
| 109 (36.3) |
| 1b | 21 (26.3) | 59 (73.8) | 17 (21.3) | 63 (78.8) | 25 (31.3) | 55 (68.8) | 80 (26.7) | |||
| 3a | 2 (1.8) | 109 (98.2) | 7 (6.3) | 104 (93.7) | 0 (0.0) | 111 (100.0) | 111 (37.0) | |||
| Fibrosis stage, | ||||||||||
| F0/1–F2 | 61 (34.9) | 114 (65.1) | 0.278 | 23 (13.1) | 152 (86.7) | 0.609 | 14 (8.0) | 161 (92.0) | 0.920 | 175 (59.7) |
| F3–F4 | 34 (28.8) | 84 (71.2) | 18 (15.3) | 100 (84.8) | 11 (9.3) | 107 (90.7) | 118 (40.3) | |||
| Viral load, median (Q1–Q3), log IU/mL | 6.0 (5.5–6.4) | 6.0 (5.4–6.3) | 0.303 | 6.0 (5.5–6.4) | 6.0 (5.4–6.3) | 0.849 | 5.9 (5.7–6.4) | 6.0 (5.5–6.3) | 0.682 | 6.0 (5.5–6.3) |
| Subtype 1a clade, | ||||||||||
| 1 | 54 (79.4) | 14 (20.6) |
| 12 (17.6) | 56 (82.4) | 0.589 | 0 (0.0) | 68 (100.0) | 1 | 68 (62.4) |
| 2 | 21 (51.2) | 20 (48.8) | 5 (12.2) | 36 (87.8) | 0 (0.0) | 41 (100.0) | 41 (37.6) | |||
| Part of TC, | ||||||||||
| yes | 6 (22.2) | 21 (77.8) | 0.225 | 4 (14.8) | 23 (85.2) | 0.225 | 2 (7.4) | 25 (92.6) | 0.855 | 27 (9.0) |
| no | 92 (33.7) | 181 (66.3) | 37 (13.6) | 236 (86.4) | 23 (8.4) | 250 (91.6) | 273 (81.0) | |||
Q1: first quartile; Q3, third quartile; IDU: intravenous drug use; RAS: resistance-associated substitution; TC: transmission cluster or pair; a: the association between the selected variables was analysed by Pearson’s chi squared test; b: the association between the selected variables was analysed by the Fisher’s exact test; c: the association between the selected variables was analysed by the Mann–Whitney test; *: available for total number of patients N = 293; **: available for total number of subtype 1a patients N = 109.
Figure 1Prevalence of overall and resistance conferring RAS according to geno2pheno [HCV] algorithm and HCV genotype in: (a) NS3 region (b) NS5A region.
Detected baseline RAS to NS3 inhibitors.
| Codon Position | NS3 RAS | RAS † Prevalence according to HCV Subtype | ||
|---|---|---|---|---|
| 1a (N = 109) (%) | 1b (N = 80) (%) | 3a (N = 111) (%) | ||
| 54 | T54S |
|
| / |
| 55 | V55A |
|
| / |
| 56 | Y56F | / | 18 (22.5) | / |
| 80 | Q80K |
| / | / |
| 117 | R117H | 2 (1.8) | 4 (5.0) | / |
| 168 | D/Q * 168E | 1 (0.9) | / | / |
| D/Q * 168K | / | / | 1 (0.9) | |
| D/Q * 168R | / | / |
| |
| 174 | N/S/T * 174F | / |
| / |
| N/S/T * 174S | 43 (39.5) | / | / | |
| Patients with at least one NS3 RAS | any RAS | 75 (68.8) | 21(26.3) | 2 (1.8) |
| resistance-conferring RAS | 57 (52.3) | 4 (5.0) | 1 (0.9) | |
RAS: resistance associated substitutions; †: resistance conferring RAS are shown in bold; *: D168 and N174 are wild type amino acids in GT1a; D168 and S174 are wild type amino acids in GT1b; Q168 and T174 are wild type amino acids in GT3a.
Detected baseline RAS to NS5A inhibitors.
| Codon Position | NS5A RAS | RAS † Prevalence according to HCV Subtype | ||
|---|---|---|---|---|
| 1a (N = 109) (%) | 1b (N = 80) (%) | 3a (N = 111) (%) | ||
| 24 | K/Q/S * 24R | 1 (0.9) | / | / |
| 28 | M/L * 28V | 11 (10.1) | / | / |
| M/L * 28T |
| / | / | |
| 30 | Q/R/A * 30R |
| / | / |
| Q/R/A * 30Q | / |
| / | |
| Q/R/A * 30K | / | / |
| |
| 31 | L31M |
|
| / |
| L31I | / | 3 (3.8) | / | |
| 62 | E/Q/A * 62L | / | / | 4 (3.6) |
| 93 | Y93H | / |
|
|
| Patients with at least one NS5A RAS | any RAS | 17 (15.6) | 17 (21.3) | 7 (6.3) |
| resistance-conferring RAS | 5 (4.6) | 16 (20.0) | 3 (2.7) | |
RAS: resistance associated substitutions; †: resistance conferring RAS are shown in bold; *: K24, M28, Q30 and E62 are wild type amino acids in GT1a; Q24, L28, R30 and Q62 are wild type amino acids in GT1b; S24, M28, A30, and A62 are wild type amino acids in GT3a.
Figure 2Maximum likelihood phylogenetic analysis of the HCV NS5B gene sequences constructed by applying GTR+G+I model with 1000 bootstrap replicates. Scale bar represents 0.1 nucleotide substitutions per site. Bootstrap values between 70 and 100% are displayed at the branch nodes as blue triangles with circle size corresponding to magnitude of bootstrap. Branches of reference sequences are colored black, while branches of Croatian sequences are colored red. Genotypes and subtypes are indicated by different color strips.
Figure 3Maximum likelihood phylogenetic analysis of the NS3 gene sequences of HCV 1a subtype constructed with GTR+G+I model and 1000 bootstrap replicates. Bootstrap values between 70 and 100% are displayed at the branch nodes as blue triangles with size corresponding to magnitude of bootstrap. Branches of two most similar control sequences per each local sequence obtained by searching the BLAST database and removing duplicates are colored black. Branches of Croatian sequences without RAS are colored green, sequences with RAS conferring resistance to DAA are colored red, and sequences with RAS associated with reduced susceptibility to DAA are colored orange. Reference sequences are colored gray. Clade I sequences are highlighted blue, while clade II sequences are highlighted pink. All identified RAS are positioned on the phylogenetic tree along with the corresponding sequences. Resistance conferring RAS are marked with filled symbols, while RAS causing reduced susceptibility to at least one DAA are marked with open symbols. Identified transmission pairs are highlighted gray, while transmission pairs identified consistently across all genomic regions for GT1a are highlighted yellow.
Comparison of demographic, clinical, and laboratory characteristics of patients in and out of transmission clusters.
| Patients in TC | Patients out of TC | Total, N (%) | ||
|---|---|---|---|---|
| Patients | 27 (9.0) | 273 (91.0) | 300 (100) | |
| Gender, | ||||
| M | 18 (9.6) | 169 (90.4) | 0.626 | 187 (62.3) |
| F | 9 (8.0) | 104 (92.0) | 113 (37.7) | |
| Age, median years (Q1–Q3) c | 37 (33–41) | 46 (41–57.5) |
| 45 (40–57) |
| HCV subtype, | ||||
| 1a | 8 (7.3) | 101 (92.7) |
| 109 (36.3) |
| 1b | 2 (2.5) | 78 (97.5) | 80 (26.7) | |
| 3a | 17 (15.3) | 94 (84.7) | 111 (37.0) | |
| Fibrosis stage, | ||||
| F0/1–F2 | 20 (11.4) | 155 (88.6) | 0.111 | 175 (59.7) |
| F3–F4 | 7 (5.9) | 111 (94.1) | 118 (40.3) | |
| Viral load, median (Q1–Q3), log IU/mL | 6.0 (4.9–6.6) | 6.0 (5.5–6.3) | 0.944 | 6.0 (5.5–6.3) |
| Subtype 1a clade, | ||||
| 1 | 8 (11.8) | 60 (88.2) |
| 68 (62.4) |
| 2 | 0 (0.0) | 41 (100) | 41 (37.6) | |
| Risk factor a | ||||
| IDU | 14 (14.7) | 81 (85.3) |
| 95 (31.7) |
| other/unknown | 13 (6.3) | 192 (93.7) | 205 (68.3) | |
Q1: first quartile; Q3: third quartile; IDU: intravenous drug use; RAS: resistance-associated substitution; TC: transmission cluster or pair; a: the association between the selected variables was analysed by Pearson’s chi squared test; b: the association between the selected variables was analysed by the Fisher’s exact test; c: the association between the selected variables was analysed by the Mann-Whitney test; *: available for total number of patients N = 293; **: available for total number of subtype 1a patients N = 109.
Estimated time of the most recent common ancestor (tMRCA) and substitution rate (substitutions/site/year) with 95% lower and upper values from the highest posterior density (95% HPD) under various molecular clock and population growth models for HCV subtype 1a, 1b, and 3a sequences.
|
| ||||
|
|
| |||
|
|
|
| ||
|
| tMRCA (year) * | 1963 (1943–1981) | 1961 (1939–1980) | 1941 (1893–1975) |
| substitution rate (10−3) (s/s/y) * | 1.42 (0.96–1.93) | 1.31 (0.84–1.76) | 1.02 (0.50–1.52) | |
| marginal likelihood estimate (log) | −33778.19 | −33691.43 | −33669.42 | |
|
| tMRCA (year) * | 1980 (1959–1995) | 1982 (1963–1988) | 1960 (1911–1994) |
| substitution rate (10−3) (s/s/y) * | 2.38 (1.35–3.45) | 1.99 (1.12–3.00) | 1.48 (0.52–2.40) | |
| marginal likelihood estimate (log) | −33666.33 | −33581.52 | −33570.04 | |
|
| tMRCA (year) * | *** | 1996 (1984–2006) | 1966 (1892–2004) |
| substitution rate (10−3) (s/s/y) * | *** | 3.61 (1.84–5.69) | 2.46 (0.46–4.53) | |
| marginal likelihood estimate (log) | *** | −33629.85 | −33617.95 | |
|
| ||||
|
|
| |||
|
|
|
| ||
|
| tMRCA (year) * | 1994 (1985–2002) | 1990 (1980–1999) | 1984 (1965–1999) |
| substitution rate (10−3) (s/s/y) * | 4.00 (2.71–5.38) | 3.42 (2.27–4.63) | 2.86 (1.59–4.32) | |
| marginal likelihood estimate (log) | −32729.35 | −32649.40 | −32628.27 | |
|
| tMRCA (year) * | 2004 (1997–2009) | 1998 (1986–2007) | 1994 (1973–2009) |
| substitution rate (10−3) (s/s/y) * | 6.95 (4.18–9.71) | 4.52 (2.30–6.80) | 3.78 (1.30–6.59) | |
| marginal likelihood estimate (log) | −32680.06 | −32595.23 | −32572.04 | |
|
| tMRCA (year) * | *** | 2002 (1991–2010) | 2003 (1983–2013) |
| substitution rate (10−3) (s/s/y) * | *** | 7.06 (2.87–11.80) | 8.24 (1.29–15.60) | |
| marginal likelihood estimate (log) | *** | −32639.60 | −32631.35 | |
|
| ||||
|
|
| |||
|
|
|
| ||
|
| tMRCA (year) * | 1964 (1940–1985) | 1961 (1934–1981) | 1929 (1853–1978) |
| substitution rate (10−3) (s/s/y) * | 1.92 (1.15–2.70) | 1.68 (1.02–2.38) | 1.22 (0.48–2.02) | |
| marginal likelihood estimate (log) | −38484.10 | −38378.81 | −38362.98 | |
|
| tMRCA (year) * | 1993 (1978–2004) | 1990 (1975–2002) | 1978 (1933–2004) |
| substitution rate (10−3) (s/s/y) * | 3.85 (2.21–5.53) | 2.86 (1.60–4.24) | 2.04 (0.47–3.67) | |
| marginal likelihood estimate (log) | −38404.69 | −38298.16 | −38269.86 | |
|
| tMRCA (year) * | *** | 1999 (1987–2008) | *** |
| substitution rate (10−3) (s/s/y) * | *** | 4.87 (2.18–7.77) | *** | |
| marginal likelihood estimate (log) | *** | −38342.03 | *** | |
*: numbers in brackets indicate 95% lower and upper values from the highest posterior density (95% HPD); ***: model did not converge.
Figure 4Bayesian skyline plots showing the epidemic history of the HCV subtype 1a, 1b, and 3a sequences in Zagreb, Croatia. Mean (solid blue line) and upper and lower 95% HPD (solid blue area) estimates of the effective population size (Y-axis; log10 scale) through time (X-axis; calendar years) from the time of the most recent common ancestor (tMRCA) are shown.