| Literature DB >> 25105489 |
James C Iles1, Jayna Raghwani1, G L Abby Harrison2, Jacques Pepin3, Cyrille F Djoko4, Ubald Tamoufe5, Matthew LeBreton5, Bradley S Schneider5, Joseph N Fair5, Felix M Tshala6, Patrick K Kayembe7, Jean Jacques Muyembe8, Samuel Edidi-Basepeo9, Nathan D Wolfe10, Peter Simmonds11, Paul Klenerman12, Oliver G Pybus13.
Abstract
HCV genotype 4 is prevalent in many African countries, yet little is known about the genotype׳s epidemic history on the continent. We present a comprehensive study of the molecular epidemiology of genotype 4. To address the deficit of data from the Democratic Republic of the Congo (DRC) we PCR amplified 60 new HCV isolates from the DRC, resulting in 33 core- and 48 NS5B-region sequences. Our data, together with genotype 4 database sequences, were analysed using Bayesian phylogenetic approaches. We find three well-supported intra-genotypic lineages and estimate that the genotype 4 common ancestor existed around 1733 (1650-1805). We show that genotype 4 originated in central Africa and that multiple lineages have been exported to north Africa since ~1850, including subtype 4a which dominates the epidemic in Egypt. We speculate on the causes of the historical intra-continental spread of genotype 4, including population movements during World War 2.Entities:
Keywords: Africa; DRC; Egypt; Hepatitis C; Molecular clock; Molecular epidemiology; Phylogeny; Prevalence; Skyline plot
Mesh:
Year: 2014 PMID: 25105489 PMCID: PMC4162651 DOI: 10.1016/j.virol.2014.07.006
Source DB: PubMed Journal: Virology ISSN: 0042-6822 Impact factor: 3.616
Fig. 2The age distribution of HCV RNA positivity among 2298 blood samples from the DRC. Samples are assigned to one of four age categories by year of birth. Numbers below each category indicate the number of positive samples/total number of samples. Fifteen samples (all negative) did not have date of birth information and are not included. The y-axis shows the proportion of samples in each age category that were HCV RNA-positive. The errors bars represent 95% confidence limits of this proportion, estimated using the adjusted Wald method (Agresti and Coull, 1998).
Subtype summary.
| 11 | 46 | Egypt | |
| 1 | 10 | Democratic Republic of the Congo | |
| 35 | 80 | Gabon | |
| 17 | 45 | France | |
| 29 | 142 | Gabon | |
| 16 | 106 | Cameroon | |
| 4 | 7 | Gabon | |
| 1 | 15 | Cameroon | |
| 23 | 77 | Democratic Republic of the Congo | |
| 1 | 10 | Egypt | |
| 2 | 14 | Egypt | |
| 1 | 7 | Egypt | |
| 3 | 24 | Egypt | |
| 1 | 13 | Cameroon | |
| 2 | 6 | Rwanda | |
| 12 | 40 | Democratic Republic of the Congo | |
| 9 | 31 | Cameroon | |
| 1 | 14 | Egypt | |
| 2 | 7 | Rwanda | |
| 5 | 3 | Portugal | |
| 0 | 26 | Central African Republic | |
| 0 | 3 | Democratic Republic of the Congo | |
| 0 | 45 |
Fig. 3Maximum clade credibility molecular clock phylogeny, estimated from the whole genome alignment. Branch lengths represent time (see scale bar at the bottom of the figure). Posterior clade probabilities are shown next to each node. Sequences are labelled with their subtype and accession number. Subtypes are indicated on the right side of the diagram. The three intra-genotypic lineages discussed in the main text are labelled L1, L2, and L3.
Fig. 4Maximum clade credibility molecular clock phylogeny, estimated from the concatenated alignment. Taxa are coloured according to location of sequence origin (blue=sub-Saharan Africa; red=Middle East and North Africa; grey=rest of the world). The locations of internal branches were inferred using parsimony and are coloured similarly. Branch lengths represent time (see scale bar at the bottom of the figure). Nodes with a posterior probability >0.9 are labelled with a white circle. Sequences are labelled as follows: subtype, sampling location using two-letter country codes (ISO 3166; see Table 4), sampling date, isolate name, accession number. XX represents an unknown location. Subtypes are indicated on the right side of the diagram. The three intra-genotypic lineages discussed in the main text are labelled L1, L2, and L3. The four clusters of samples obtained in this study discussed in the main text are labelled C1, C2, C3, and C4.
Two-letter country codes used in this study.
| Argentina | |
| Belgium | |
| Bahrain | |
| Burundi | |
| Canada | |
| Democratic Republic of the Congo | |
| Central African Republic | |
| Republic of the Congo | |
| Cameroon | |
| Cyprus | |
| Germany | |
| Denmark | |
| Algeria | |
| Egypt | |
| Spain | |
| France | |
| Gabon | |
| United Kingdom | |
| India | |
| Ireland | |
| Italy | |
| Japan | |
| Martinique | |
| Netherlands | |
| Pakistan | |
| Portugal | |
| Russia | |
| Rwanda | |
| Saudi Arabia | |
| Syria | |
| Tunisia | |
| Uganda | |
| United States of America | |
| Unknown | |
| Yemen | |
| South Africa |
Fig. 5Bayesian skyline plot, showing the epidemic history of genotype 4 estimated from the concatenated alignment. The black line represents the estimated effective number of infections through time. The blue lines represent the 95% highest posterior density confidence intervals of this estimate. The earliest date in the plot is the median estimate of the TMRCA of genotype 4, while the dotted line shows the upper 95% highest posterior density confidence interval of this date.
Fig. 1Map of the world showing the country of origin and sample size of samples used in this study. Where a single isolate yielded multiple sequences, they are only counted as one sample on this figure. Countries less than three samples are not shown.
Details of primers used in this study.
| GAAAGCGTCTAGCCATGGCGTTAGT | 71–95 | ||
| CTCGCAAGCACCCTATCAGG | 311–292 | ||
| This study | CCTTGTGGTACTGCCTGATAG | 279–299 | |
| This study | AGTGCCARRAGGAACATAGA | 883–864 | |
| This study | CTGATAGGGTGCTTGCGAGTG | 293–313 | |
| This study | AGTGCCARRAGGAAGATAGARAA | 883–861 | |
| This study | TGGGGATCCCGTATGATACCCGCTGCTTTGA | 8245–8275 | |
| This study | CGGAATTCCTGGTCATAGCCTCCGTGAA | 8643–8616 | |
| This study | GACACCCGCTGCTTTGACTC | 8259–8278 | |
| This study | GAGTCTTCACGGAGGCTATGACNAGGTA | 8638–8611 |
Numbering relative to isolate H77 (Genbank accession number AF009606).
Estimated evolutionary rate parameters.
| 342–944 | 5.39×10–4 | 3.41–7.46×10–4 | |
| 8274–8612 | 9.87×10–4 | 6.74–14.4×10–4 | |
| 342–944 and 8274–8612 | 7.43×10–4 | 4.91–10.4×10–4 | |
| 342–9374 | 13.5×10–4 | 9.97–17.0×10–4 |
Numbering relative to isolate H77 (Genbank accession number AF009606).