| Literature DB >> 25079603 |
Maureen J Donlin1, Elena Lomonosova2, Alexi Kiss2, Xiaohong Cheng2, Feng Cao1, Teresa M Curto3, Adrian Di Bisceglie4, John E Tavis5.
Abstract
Hepatitis C virus (HCV) is a major cause of hepatitis and hepatocellular carcinoma (HCC) world-wide. Most HCV patients have relatively stable disease, but approximately 25% have progressive disease that often terminates in liver failure or HCC. HCV is highly variable genetically, with seven genotypes and multiple subtypes per genotype. This variation affects HCV's sensitivity to antiviral therapy and has been implicated to contribute to differences in disease. We sequenced the complete viral coding capacity for 107 HCV genotype 1 isolates to determine whether genetic variation between independent HCV isolates is associated with the rate of disease progression or development of HCC. Consensus sequences were determined by sequencing RT-PCR products from serum or plasma. Positions of amino acid conservation, amino acid diversity patterns, selection pressures, and genome-wide patterns of amino acid covariance were assessed in context of the clinical phenotypes. A few positions were found where the amino acid distributions or degree of positive selection differed between in the HCC and cirrhotic sequences. All other assessments of viral genetic variation and HCC failed to yield significant associations. Sequences from patients with slow disease progression were under a greater degree of positive selection than sequences from rapid progressors, but all other analyses comparing HCV from rapid and slow disease progressors were statistically insignificant. The failure to observe distinct sequence differences associated with disease progression or HCC employing methods that previously revealed strong associations with the outcome of interferon α-based therapy implies that variable ability of HCV to modulate interferon responses is not a dominant cause for differential pathology among HCV patients. This lack of significant associations also implies that host and/or environmental factors are the major causes of differential disease presentation in HCV patients.Entities:
Mesh:
Year: 2014 PMID: 25079603 PMCID: PMC4117537 DOI: 10.1371/journal.pone.0103748
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1HCV genome.
The HCV genome contains 5′ and 3′ untranslated regions and a single, long open reading frame that encodes 10 proteins. The mature viral proteins encoded within the open reading frame and their major functions are indicated. Reprinted from [25] under the creative commons license.
Age and gender of patients from whom the HCC and cirrhotic samples were derived.
| Cirrhotic control | HCC | P value | |
| Number of patients | 25 | 22 | – |
| Gender (M/F) | 19/6 | 16/6 | ns |
| Age (mean ± SD) | 57.4±9.6 | 61.7±8.2 | ns |
| Age (range) | 49–76 | 44–82 | ns |
ns, non-significant.
Positions where the distribution of amino acids differ in HCC and cirrhotic control sequences.
| Position | Protein | P value |
| 75 | Core | 0.038 |
| 274 | E1 | 0.015 |
| 330 | E1 | 0.033 |
| 438 | E2 | 0.020 |
| 476 | E2 | 0.025 |
| 496 | E2 | 0.028 |
| 524 | E2 | 0.026 |
| 538 | E2 | 0.046 |
| 699 | E2 | 0.013 |
| 741 | p7 | 0.044 |
| 759 | p7 | 0.019 |
| 760 | p7 | 0.018 |
| 767 | p7 | 0.028 |
| 938 | NS2 | 0.046 |
| 1087 | NS3 | 0.016 |
| 1323 | NS3 | 0.018 |
| 1329 | NS3 | 0.008 |
| 1536 | NS3 | 0.012 |
| 1694 | NS4A | 0.038 |
| 2016 | NS5A | 0.029 |
| 2278 | NS5A | 0.031 |
| 2356 | NS5A | 0.046 |
| 2385 | NS5A | 0.028 |
| 2543 | NS5B | 0.001 |
| 2650 | NS5B | 0.014 |
Codons under positive selection in the HCC and cirrhotic control sequences.
| Codon | Gene | Group | dN-dS | P value |
|
|
|
|
|
|
| 384 | E2 | Cirrhotic | 6.042 | 0.008 |
| 387 | E2 | Cirrhotic | 3.455 | 0.039 |
| 397 | E2 | Cirrhotic | 5.512 | 0.011 |
| 401 | E2 | Cirrhotic | 5.514 | 0.011 |
|
|
|
|
|
|
| 478 | E2 | Cirrhotic | 4.741 | 0.021 |
| 479 | E2 | Cirrhotic | 3.470 | 0.030 |
| 522 | E2 | Cirrhotic | 4.359 | 0.005 |
|
|
|
|
|
|
|
|
|
|
|
|
| 2968 | NS5B | Cirrhotic | 4.578 | 0.002 |
| 2983 | NS5B | Cirrhotic | 2.804 | 0.015 |
| 387 | E2 | HCC | 3.283 | 0.026 |
| 401 | E2 | HCC | 4.197 | 0.004 |
|
|
|
|
|
|
| 434 | E2 | HCC | 3.795 | 0.013 |
| 461 | E2 | HCC | 3.025 | 0.017 |
| 522 | E2 | HCC | 2.811 | 0.036 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2632 | NS5B | HCC | 3.983 | 0.009 |
| 2983 | NS5B | HCC | 4.187 | 0.001 |
Bold indicates positions unique to the HCC or cirrhotic sequence alignments compared to alignments of randomly selected HCV control sequences.
Basic amino acid covariance network parameters for the HCC and cirrhotic sequences.
| Network | Number of sequences | Number of nodes | Number of edges | Average number of neighbors | Network density |
| Experimental sequences | |||||
| HCC | 22 | 86 | 80 | 1.9 | 0.022 |
| Cirrhotic | 25 | 128 | 296 | 4.6 | 0.036 |
| Randomly sampled sets of 22 cirrhotic sequences | |||||
| Set 1 | 22 | 123 | 320 | 5.2 | 0.043 |
| Set 2 | 22 | 120 | 255 | 4.2 | 0.036 |
| Set 3 | 22 | 118 | 309 | 5.2 | 0.045 |
| Set 4 | 22 | 121 | 327 | 5.4 | 0.045 |
| Set 5 | 22 | 118 | 215 | 3.6 | 0.03 |
| Set 6 | 22 | 112 | 160 | 2.8 | 0.03 |
| External HCC sequence sets | |||||
| Pooled set 1 | 22 | 116 | 505 | 8.7 | 0.76 |
| Pooled set 2 | 22 | 138 | 663 | 9.6 | 0.07 |
| Takahashi et al. | 15 | 88 | 441 | 10 | 0.12 |
| Nagayama et al. | 13 | 99 | 195 | 3.9 | 0.04 |
Sequences obtained from [65].
Sequences obtained from [64].
Figure 2Amino acid covariance networks for the HCC and cirrhotic sequences.
Amino acid covariances within alignments of the HCV cirrhotic (left) and HCC (right) sequences were graphed with the covarying positions (nodes) represented as circles and the covariances between the positions (edges) as lines. The size of the nodes is proportional to the number of edges that they contact. Yellow nodes are within structural proteins and green nodes are in non-structural proteins. The amino acid residue position numbered relative to the HCV polyprotein is indicated in the larger nodes.
Positions in core associated with HCC.
| Residue identity | Control-type/HCC-type residues (number of sequences) | ||||
| Residue number | Control-type | HCC-type | Cirrhotic sequences | HCC sequences | External cirrhotic sequences |
|
| |||||
| 36 | A | G/C | 25/0 | 21/1 | 22/5 |
| 78 | U | C | 1/24 | 2/20 | 1/25 |
| 209 | G | A | 7/18 | 5/17 | 6/19 |
| 271 | U/C | A | 3/22 | 5/17 | 6/19 |
| 309 | U | C/A | 21/4 | 17/5 | na |
| 384 | C | U | 25/0 | 22/0 | na |
| 408 | C | U | 24/1 | 21/1 | na |
| 435 | G | A/C | 25/0 | 22/0 | na |
| 465 | C | U | 24/1 | 20/2 | na |
| 481 | G | A | 25/0 | 21/1 | na |
| 546 | G | A/C | 23/2 | 19/3 | na |
|
| |||||
| 70 | R | non-R | 7/18 | 5/17 | 5/21 |
| 91 | L | M | 3/22 | 5/17 | 6/19 |
Numbered relative to the start of the polyprotein.
Sequences obtained from [62].
Identified in [54], genotype 1b.
na, not available.
Identified in [55], [57].
Baseline characteristics of patients from whom the rapid and slow progressor samples were derived.
| Slow progressors | Rapid progressors | P value | |
| Number of patients | 30 | 30 | – |
| Age (mean ± SD) | 49.5±6.0 | 47.6±5.9 | ns |
| Female sex (% of patients) | 26.7 | 43.3 | ns |
| Duration of exposure to HCV (yr) (mean ± SD) | 26.3±7.1 | 25.7±6.3 | ns |
| Race or ethnic group (% of patients) | ns | ||
| White | 70.0 | 70.0 | |
| Black | 26.7 | 23.3 | |
| Hispanic | 3.3 | 6.7 | |
| Body-mass index (BMI) (mean ± SD) | 28.6±5.2 | 31.8±7.6 | ns |
| Diabetes (% of patients) | 13.3 | 20.0 | ns |
| Lifetime alcohol consumption (no. of drinks) (median) | 8713 | 10516 | ns |
| Lifetime alcohol consumption (no. of drinks) (interquartile range) | 2062–30286 | 1397–25913 | |
| Baseline serum HCV RNA (log10 IU/ml) (mean ± SD) | 6.5±0.4 | 6.5±0.5 | ns |
| Serum alanine aminotransferase (ALT) (U/liter) (mean ± SD) | 82.8±42.5 | 100.1±56.9 | ns |
| Ratio of the patient’s alanine aminotransferase (ALT) level to the upper limit of normal (ULN)(mean ± SD) | 1.7±1.1 | 1.8±0.9 | ns |
| Total serum bilirubin (mg/dl) (mean ± SD) | 0.7±0.3 | 0.8±0.5 | ns |
| Serum albumin (g/dl) (mean ± SD) | 4.0±0.3 | 3.9±0.4 | ns |
| Prothrombin time (INR) (mean ± SD) | 1.0±0.1 | 1.0±0.1 | ns |
| Ishak fibrosis score | 3.1±0.6 | 3.1±0.6 | ns |
| Ishak inflammation score | 7.7±2.0 | 7.5±1.8 | ns |
| Esophageal varices (% of patients) | 10.0 | 13.3 | ns |
T-test or chi-square test unless otherwise indicated.
Non-significant (p<0.05).
Race or ethnic group was self-reported.
BMI is the weight in kilograms divided by the square of the height in meters.
Wilcoxon rank-sum test.
The Ishak fibrosis score range is 0 (no fibrosis) to 6 (cirrhosis).
The Ishak inflammation score range is 0 (best) to 18 (worst).
Positions where the distribution of amino acids differ between the rapid and slow progressor sequences.
| Time point 1 | Time point 2 | ||||
| Position | Protein | P value | Position | Protein | P value |
| 308 | E1 | 0.038 | 308 | E1 | 0.032 |
| 333 | E1 | 0.040 | 391 | E2 | 0.047 |
| 453 | E2 | 0.031 | 464 | E2 | 0.009 |
| 464 | E2 | 0.028 | 853 | NS2 | 0.034 |
| 490 | E2 | 0.045 | 883 | NS2 | 0.040 |
| 766 | p7 | 0.038 | 1655 | NS3 | 0.043 |
| 827 | NS2 | 0.045 | 1723 | NS4B | 0.032 |
| 883 | NS2 | 0.043 | 1747 | NS4B | 0.002 |
| 1723 | NS4B | 0.006 | 2047 | NS5A | 0.044 |
| 1746 | NS4B | 0.046 | 2361 | NS5A | 0.043 |
| 1747 | NS4B | 0.010 | 2369 | NS5A | 0.043 |
| 2181 | NS5A | 0.040 | 2414 | NS5A | 0.043 |
| 2185 | NS5A | 0.010 | 2482 | NS5B | 0.007 |
| 2361 | NS5A | 0.040 | |||
| 2482 | NS5B | 0.012 | |||
Codons under positive selection in the rapid and slow progressor sequences.
| Rapid progressors | ||||||||
| Time point 1 | Time point 2 | |||||||
| Codon | Protein | Normalized dN-dS | P value | Codon | Protein | Normalized dN-Ds | P value | |
| 235 | E1 | 3.751 | 0.002 | 235 | E1 | 3.699 | 0.005 | |
| 384 | E2 | 5.284 | 0.007 | 397 | E2 | 4.167 | 0.041 | |
| 622 | E2 | 2.491 | 0.017 | 522 | E2 | 2.311 | 0.047 | |
| 790 | p7 | 2.054 | 0.033 | 570 | E2 | 2.558 | 0.026 | |
|
|
|
|
| 781 | E2 | 3.021 | 0.018 | |
| 2411 | NS5B | 1.854 | 0.046 |
|
|
|
| |
| 2431 | NS5B | 2.287 | 0.022 | 2431 | NS5B | 2.798 | 0.021 | |
| 2963 | NS5B | 2.806 | 0.020 | |||||
|
| ||||||||
|
|
| |||||||
|
|
|
|
|
|
|
|
| |
| 235 | E1 | 2.989 | 0.048 | 303 | E1 | 3.363 | 0.044 | |
| 384 | E2 | 5.399 | 0.005 |
|
|
|
| |
| 395 | E2 | 3.764 | 0.020 | 395 | E2 | 4.359 | 0.037 | |
| 396 | E2 | 3.123 | 0.042 |
|
|
|
| |
|
|
|
|
|
|
|
|
| |
| 522 | E2 | 2.330 | 0.014 | 522 | E2 | 3.288 | 0.008 | |
| 622 | E2 | 2.182 | 0.036 | 622 | E2 | 2.718 | 0.035 | |
| 781 | p7 | 2.175 | 0.038 |
|
|
|
| |
| 2079 | NS5A | 1.984 | 0.045 | 790 | p7 | 3.403 | 0.041 | |
|
|
|
|
| 2079 | NS5A | 2.765 | 0.032 | |
| 2411 | NS5B | 2.240 | 0.029 |
|
|
|
| |
| 2537 | NS5B | 2.815 | 0.017 |
|
|
|
| |
| 2600 | NS5B | 2.817 | 0.017 | 2411 | NS5A | 2.749 | 0.036 | |
| 2729 | NS5B | 2.674 | 0.022 | 2431 | NS5B | 2.751 | 0.032 | |
| 2537 | NS5B | 4.094 | 0.009 | |||||
| 2600 | NS5B | 3.510 | 0.017 | |||||
| 2729 | NS5B | 3.282 | 0.022 | |||||
| 2963 | NS5B | 2.784 | 0.028 | |||||
Bold indicates positions unique to HALT-C compared to random control alignments.
Basic amino acid covariance network parameters for the rapid and slow progressor sequences.
| Network | Number of sequences | Number of nodes | Number of edges | Average number of neighbors | Network density |
| TP1-rapid | 30 | 122 | 1226 | 20.1 | 0.17 |
| TP1-slow | 30 | 124 | 1163 | 18.7 | 0.15 |
| TP2-rapid | 28 | 122 | 1201 | 19.7 | 0.16 |
| TP2-slow | 29 | 123 | 1051 | 18.4 | 0.15 |