| Literature DB >> 23710455 |
Kamila Caraballo Cortés1, Osvaldo Zagordi, Tomasz Laskus, Rafał Płoski, Iwona Bukowska-Ośko, Agnieszka Pawełczyk, Hanna Berak, Marek Radkowski.
Abstract
Genetic variability of hepatitis C virus (HCV) determines pathogenesis of infection, including viral persistence and resistance to treatment. The aim of the present study was to characterize HCV genetic heterogeneity within a hypervariable region 1 (HVR1) of a chronically infected patient by ultradeep 454 sequencing strategy. Three independent sequencing error correction methods were applied. First correction method (Method I) implemented cut-off for genetic variants present in less than 1%. In the second method (Method II), a condition to call a variant was bidirectional coverage of sequencing reads. Third method (Method III) used Short Read Assembly into Haplotypes (ShoRAH) program. After the application of these three different algorithms, HVR1 population consisted of 8, 40, and 186 genetic haplotypes. The most sensitive method was ShoRAH, allowing to reconstruct haplotypes constituting as little as 0.013% of the population. The most abundant genetic variant constituted only 10.5%. Seventeen haplotypes were present in a frequency above 1%, and there was wide dispersion of the population into very sparse haplotypes. Our results indicate that HCV HVR1 heterogeneity and quasispecies population structure may be reconstructed by ultradeep sequencing. However, credible analysis requires proper reconstruction methods, which would distinguish sequencing error from real variability in vivo.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23710455 PMCID: PMC3655449 DOI: 10.1155/2013/626083
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The SSCP image of HVR1 amplified from the serum of HCV-infected patient.
HVR1 HCV characteristics obtained by pyrosequencing using GS Junior System (454/Roche).
| Number of sequenced nucleotides | 1.37 × 108 |
| Number of individual sequences that passed the quality control* | 76 332 |
| Number of individual sequences aligned to reference genome | 73 236 |
| Mean coverage per sequence | 4.6 |
| Identified haplotypes | 15 917 |
*No undetermined bases, 100% match with primer sequences.
The impact of haplotype reconstruction method on the variability parameters of HVR1.
| Correction method | I | II | III |
|---|---|---|---|
| Number of haplotypes | 8 | 40 | 186 |
| Number of nucleotide substitutions within HVR1 | 51 | 59 | 70 |
| Percentage of mutated amino acid positions within HVR1 (%) | 55.6 | 55.6 | 74.1 |
| Genetic distance | 3.874 | 0.065 | 0.110 |
| Genetic diversity | 0.923 | 0.998 | 0.984 |
Figure 2Amino acid sequences of HVR1 populations inferred after the application of three different error correction methods. (a) Cut-off method >1% (I), (b) bidirectional coverage (II), and (c) ShoRAH computation (III). Top sequence corresponds to reference sequence AJ406073 for genotype 1b HCV. Dots indicate consensus positions. Dashes indicate positions not present in the sequence. Asterisks indicate stop codons.
Figure 3Molecular phylogenetic analysis of HVR1 populations inferred after the application of three different error correction methods. (a) Cut-off method >1% (I), (b) bidirectional coverage method (II), and (c) ShoRAH algorithm (III). The evolutionary history was inferred by using the Maximum Likelihood method based on the Tamura-Nei model [23]. Evolutionary analyses were conducted using MEGA 5.0 [22].