Donald B Smith, Jens Bukh, Carla Kuiken, A Scott Muerhoff, Charles M Rice, Jack T Stapleton, Peter Simmonds.
Abstract
The 2005 consensus proposal for the classification of hepatitis C virus (HCV) presented an agreed and uniform nomenclature for HCV variants and the criteria for their assignment into genotypes and subtypes. Since its publication, the available dataset of HCV sequences has vastly expanded through advances in nucleotide sequencing technologies and an increasing focus on the role of HCV genetic variation in disease and treatment outcomes. The current study represents a major update to the previous consensus HCV classification, incorporating additional sequence information derived from over 1,300 (near-)complete genome sequences of HCV available on public databases in May 2013. Analysis resolved several nomenclature conflicts between genotype designations and, using consensus criteria, created a classification of HCV into seven confirmed genotypes and 67 subtypes. There are 21 additional complete coding region sequences of unassigned subtype. The study additionally describes the development of a Web resource hosted by the International Committee for Taxonomy of Viruses (ICTV) that maintains and regularly updates tables of reference isolates, accession numbers, and annotated alignments (http://talk.ictvonline.org/links/hcv/hcv-classification.htm). The Flaviviridae Study Group urges those who need to check or propose new genotypes or subtypes of HCV to contact the Study Group in advance of publication to avoid nomenclature conflicts appearing in the literature. While the criteria for assigning genotypes and subtypes remain unchanged from previous consensus proposals, changes are proposed in the assignment of provisional subtypes, subtype numbering beyond "w," and the nomenclature of intergenotypic recombinants.
Year: 2014 PMID: 24115039 PMCID: PMC4063340 DOI: 10.1002/hep.26744
Source DB: PubMed Journal: Hepatology ISSN: 0270-9139 Impact factor: 17.425
Fig. 1. Phylogenetic tree of 129 representative complete coding region sequences. Up to two representatives of each confirmed genotype/subtype were aligned (together with a third extreme variant of subtypes 4g and 6e) and a neighbor-joining tree was constructed from maximum composite likelihood nucleotide distances between coding regions using MEGA5 (ref. 83). Sequences were chosen to illustrate the maximum diversity within a subtype. Tips are labeled by accession number and subtype (*unassigned subtype). For genotypes 1, 2, 3, 4, and 6, the lowest common branch shared by all subtypes and supported by 100% of bootstrap replicates (n = 1,000) is indicated by ·.
Confirmed HCV Genotypes/Subtypes
| Genotype | Locus/Isolate(s) | Accession number(s) | Reference(s) |
|---|---|---|---|
| 1a | HPCPLYPRE, HPCCGAA | M62321, M67463 | |
| 1b | HPCJCG, HPCHUMR | D90208, M58335 | |
| 1c | HPCCGS, AY051292 | D14853, AY051292 | |
| 2a | HPCPOLP, JFH-1 | D00944, AB047639 | |
| 2b | HPCJ8G, JPUT971017 | D10988, AB030907 | |
| 2c | BEBE1 | D50409 | |
| 2k | VAT96 | AB031663 | |
| 3a | HPCEGS, HPCK3A | D17763, D28917 | |
| 3b | HPCFG | D49374 | |
| 3k | HPCJK049E1, | D63821, | |
| 4a | ED43 | Y11604 | |
| 5a | EUH1480, SA13 | Y13184, AF064490 | |
| 6a | EUHK2, 6a33 | Y12083, | |
| 6b | Th580 | D84262 | |
| 6d | VN235 | D84263 | |
| 6g | HPCJK046E2 | D63822 | |
| 6h | VN004 | D84265 | |
| 6k | VN405 | D84264 | |
Additions and changes from the assignments proposed in ref. 2 are shown in bold.
Consensus proposed genotype/subtype names. Where multiple sequences of a HCV genotype are available, two sequences have been listed, prioritized by (a) publication date or (b) submission date when unpublished.
Locus (or isolate name if locus is the same as the accession number).
Previously described as 4b (refs. 7, 14).
Sequence obtained from acute phase plasma of a chimpanzee experimentally infected with (human-derived) isolate SA13.
Previously described as 6u (ref. 18).
Fig. 2. Distribution of p-distances between complete coding region sequences. The frequency of p-distances was calculated within and between genotypes using SSE (ref. 12). Intra-genotype pairwise distances were calculated for all available complete coding region sequences except for subtypes 1a, 1b, and 2b, where 20 random sequences were used. For p-distances >0.15 (equivalent to a percent difference of 15%), frequencies were scaled to reduce the maximum frequency to less than 300. Distances between genotypes were calculated using one or two representatives of each confirmed and unassigned subtype, with the frequencies scaled as above.
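As an illustration of the quantity plotted in Fig. 2, a p-distance is the proportion of compared sites at which two aligned sequences differ, so a value of 0.15 corresponds to a 15% difference. The sketch below is not the SSE implementation: the function name `p_distance` is hypothetical, and skipping gaps and ambiguous bases is an assumption of this example rather than something stated in the caption.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of compared sites that differ between two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")

    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        # Assumption of this sketch: sites with a gap or an ambiguous base
        # in either sequence are skipped rather than counted.
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a != b:
            differences += 1

    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# One difference over ten compared sites gives a p-distance of 0.1 (10%).
example = p_distance("ACGTACGTAC", "ACGTACGTAA")
```

Applied to every pair of coding region sequences in an alignment, a function like this would yield the set of pairwise distances whose frequencies a histogram such as Fig. 2 summarizes.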
Unassigned Complete Coding Region Sequences
| Genotype | Locus/Isolate(s) | Accession no(s) | Reference |
|---|---|---|---|
| 1_AJ851228 | AJ851228 | AJ851228 | |
| 1_KC248195 | 160526 | KC248195 | |
| 1_HQ537007 | CYHCV025 | HQ537007 | |
| 2_JF735119 | QC331 | JF735119 | |
| 2_JF735112 | QC182 | JF735112 | |
| 2_JF735110 | QC114 | JF735110 | |
| 2_JF735117 | QC297 | JF735117 | |
| 2_JF735116 | QC289 | JF735116 | |
| 2_JF735118 | QC302 | JF735118 | |
| 3_JF735124 | QC115 | JF735124 | |
| 4_JX227964 | BID-G1253 | JX227964 | |
| 4_FJ025854 | P026 | FJ025854 | |
| 6_DQ278891 | KM45, KM41 | DQ278891, DQ278893 | |
| 6_JX183550 | QC273 | JX183550 | |
| 6_JX183552 | TV476 | JX183552 | |
| 6_JX183549 | KM35 | JX183549 | |
| 6_JX183551 | TV257 | JX183551 | |
| 6_JX183553 | TV533 | JX183553 | |
| 6_JX183554 | L349 | JX183554 | |
| 6_JX183557 | DH027 | JX183557 | |
| 6_JX183558 | QC271 | JX183558 | |
Classification of sequences into genotypes but without subtype assignments using the format “genotype_Accession number.”
Locus (or isolate name if locus is the same as the accession number).
Previously described as 4b (ref. 14).
Previously described as 6k (ref. 17).
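The "genotype_Accession number" labels used for unassigned sequences can be built and split mechanically. The following is a minimal sketch under stated assumptions: the helper names `make_label` and `parse_label` are hypothetical, and the regular expression assumes classic GenBank-style nucleotide accessions (one letter plus five digits, or two letters plus six digits), which covers the accessions listed in the table but is not a rule given in the source.

```python
import re
from typing import Tuple

# Assumed pattern: genotype number, underscore, GenBank-style accession.
_LABEL = re.compile(r"^(\d+)_([A-Z]{1,2}\d{5,6})$")


def make_label(genotype: int, accession: str) -> str:
    """Build a label such as '6_JX183550' from a genotype and an accession."""
    return f"{genotype}_{accession.strip().upper()}"


def parse_label(label: str) -> Tuple[int, str]:
    """Split a label into (genotype, accession), tolerating stray spaces."""
    match = _LABEL.match(label.replace(" ", ""))
    if match is None:
        raise ValueError(f"not a genotype_accession label: {label!r}")
    return int(match.group(1)), match.group(2)


label = make_label(6, "JX183550")      # "6_JX183550"
genotype, accession = parse_label(label)
```

Keeping the accession in the label means each unassigned sequence stays uniquely identifiable until a subtype is formally assigned.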
Remaining Provisionally Assigned HCV Subtypes
| Subtype | Isolate(s) | Core/E1 accession number(s) | NS5B accession number(s) | Reference(s) |
|---|---|---|---|---|
| 1d | HC1-N15, HC1-N16 | L39299, L39302 | L38377, L38372 | |
| 1f | FR2 | L38350 | L38371 | |
| 1i | FR16, QC77 | n.a., AY434119 | L48495, AY434120 | |
| 1j | QC2, QC89 | AY434158, AY434128 | AY434106, AY434129 | |
| 1k | QC68, QC82 | AY434112, AY434122 | AY434113, AY434123 | |
| 2f | JK081, JK139 | D49754, D49757 | D49769, D49777 | |
| 2g | MED017 | n.a. | X93323 | |
| 2h | MED007 | n.a. | X93327 | |
| 2l | FR15 | n.a. | L48494 | |
| 2n | NL50 | L39309 | L44602 | |
| 2o | FR4 | L38333 | L38373 | |
| 2p | NL33 | L39300 | L44601 | |
| 3c | NE048 | D16612 | D14198/D16613 | |
| 3d | NE274 | D16620 | D14200/D16621 | |
| 3e | NE145 | D16618 | D16619 | |
| 3f | NE125, PK64 | D16614, n.a. | D14203/D16615, L78842 | |
| 4e | CAM600, GB809 | L29589, L29629 | L29590, L29626 | |
| 4h | GB438, FrSSD35 | L29610, n.a. | L29611, AJ291249 | |
| 4i | CAR4/1205 | L36439 | L36437 | |
| 4j | CAR1/501 | n.a. | L36438 | |
Accession numbers of sequences from the core/E1 and NS5B regions. “n.a.”: not available; “/”: denotes that the core/E1 or NS5B sequences are available from two different accession numbers.
Examples of each provisionally assigned HCV subtype.
Recombinant (RF) HCV Complete Coding Region Sequences
| RF | Breakpoint | Accession | Isolates | Reference |
|---|---|---|---|---|
| RF2k/1b | 3186 | AY587845 | 33 | |
| RF2i/6p | 3405-3464 | DQ155560 | 1 | |
| RF2b/1b_1 | 3456 | DQ364460 | 1 | |
| RF2/5 | 3366-3389 | AM408911 | 1 | |
| RF2b/6w | 3429 | EU643835 | 1 | |
| RF2b/1b_2 | 3432 | AB622121 | 1 | |
| RF2b/1a | 3429-3440 | JF779679 | 1 | |
| RF2b/1b_3 | 3286-3293 | AB677530 | 1 | |
| RF2b/1b_4 | 3286-3293 | AB677527 | 1 | |
Recombinant forms (RF) for which complete genome sequences are available are named according to the subtypes from which they are derived and in the order in which these appear in the genome.
Breakpoints are numbered with reference to H77 (AF009606).
Number of individuals from whom the RF has been isolated.
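The RF names in the table follow a regular shape: the prefix "RF", the parental genotype or subtype names in genome order separated by "/", and in some rows a numeric suffix such as "_2". A minimal parsing sketch is given below. The names `RecombinantForm` and `parse_rf` are hypothetical, and reading the suffix as an index that distinguishes separate recombinants of the same parental subtypes is an inference from the table rows, not a rule stated in the footnotes.

```python
import re
from typing import NamedTuple, Optional

# "RF", first parent, "/", second parent, optional "_<index>".
# Each parent is a genotype number with an optional lowercase subtype.
_RF = re.compile(r"^RF(\d+[a-z]*)/(\d+[a-z]*)(?:_(\d+))?$")


class RecombinantForm(NamedTuple):
    first: str            # parent appearing first in the genome
    second: str           # parent appearing second in the genome
    index: Optional[int]  # assumed: distinguishes same-parent recombinants


def parse_rf(name: str) -> RecombinantForm:
    """Split an RF name such as 'RF2b/1b_3' into its components."""
    match = _RF.match(name.strip())
    if match is None:
        raise ValueError(f"not a recognised RF name: {name!r}")
    first, second, index = match.groups()
    return RecombinantForm(first, second, int(index) if index else None)


rf = parse_rf("RF2b/1b_3")  # first="2b", second="1b", index=3
```

Names without a subtype letter, such as RF2/5, parse the same way, with the parents returned as bare genotype numbers.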