| Literature DB >> 28322313 |
Ahmed A Al-Qahtani1,2, Guy Baele3, Nisreen Khalaf1, Marc A Suchard4,5, Mashael R Al-Anazi1, Ayman A Abdo6, Faisal M Sanai7, Hamad I Al-Ashgar8, Mohammed Q Khan8, Mohammed N Al-Ahdal1,2, Philippe Lemey3, Bram Vrancken3.
Abstract
The relatedness between viral variants sampled at different locations through time can provide information pertinent to public health that cannot readily be obtained through standard surveillance methods. Here, we use virus genetic data to identify the transmission dynamics that drive the hepatitis C virus subtypes 4a (HCV4a) and 4d (HCV4d) epidemics in Saudi Arabia. We use a comprehensive dataset of newly generated and publicly available sequence data to infer the HCV4a and HCV4d evolutionary histories in a Bayesian statistical framework. We also introduce a novel analytical method for an objective assessment of the migration intensity between locations. We find that international host mobility patterns dominate over within country spread in shaping the Saudi Arabia HCV4a epidemic, while this may be different for the HCV4d epidemic. This indicates that the subtypes 4a and 4d burden can be most effectively reduced by combining the prioritized screening and treatment of Egyptian immigrants with domestic prevention campaigns. Our results highlight that the joint investigation of evolutionary and epidemiological processes can provide valuable public health information, even in the absence of extensive metadata information.Entities:
Mesh:
Year: 2017 PMID: 28322313 PMCID: PMC5359580 DOI: 10.1038/srep44947
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Overview of the sequence dataset composition.
| HCV4a | HCV4d | |
|---|---|---|
| Canada | 14 (3) | 2 (1) |
| China | — | 1 |
| Cyprus | 6 | — |
| DRC | 2 (2) | — |
| Denmark | 2 | — |
| Egypt | 366 (3) | 3 (3) |
| France | 1 | 2 |
| India | — | 1 |
| Japan | 1 | — |
| Saudi Arabia | 135* | 110* (2) |
| Pakistan | 2 | — |
| Portugal | 24 (1) | — |
| South Africa | 2 (2) | — |
| Sri Lanka | 2 (1) | — |
| USA | 10 | 1 |
| Uganda | 1 (1) | — |
| 568 (13) | 120 (6) |
The number of sequences per location is given for both subtypes. Numbers between brackets indicate how many sequences of that location did not cluster in the diversity of interest, and were not retained in the final dataset (see main text). The multiple sequence alignments for HCV4a and HCV4d span 576 nt (corresponding to H77 nt positions 914–1490). The HCV4a isolates were collected between 2000 and 2014, and those for HCV4d between 2002 and 2014. *All Saudi Arabia sequences were newly generated for this study.
Posterior probabilities for all possible types of migration events.
| location | posterior probability (%) | |
|---|---|---|
| from | to | |
| Egypt | Saudi Arabia | 79.12 |
| Egypt | other | 15.37 |
| Saudi Arabia | Egypt | 1.95 |
| Saudi Arabia | other | 2.96 |
| other | Egypt | 0.46 |
| other | Saudi Arabia | 0.15 |
Figure 1Bayesian maximum clade credibility tree of the HCV4a dataset.
Tips and internal branches are colored according to the most probable reconstructed ancestral state (location). The correspondence between the colors and locations is as in the legend.
Figure 2Combined plot of HCV4a population size growth and migration time density.
The solid black line represents the estimated product of the effective population size and generation time (N*tau) trajectory on a log10 scale (left axis), which reflects the number of individuals that contribute offspring to the descendent generation. The grey-shaded area marks the associated 95% HPD interval. The striped line shows the migration density (right axis). After a steep increase in the number of migrations in the period between 1920 and 1950, the migration intensity remained fairly steady.
Figure 3Phylogeographic rarefaction curve.
The solid line represents the expected number of introduction events in Saudi Arabia for n randomly chosen Saudi Arabian taxa. The grey-shaded area marks the 95% credible interval.
Figure 4Left panel: Bayesian maximum clade credibility tree of the HCV4d dataset.
Tips and internal branches are colored according to the most probable reconstructed ancestral state (location). The correspondence between the colors and locations is as in the legend. Right panel: Plot of the demographic history for HCV4d. The black line represents the change in mean effective number of infections through time on a log10 scale. The surrounding grey shaded area indicates the associated 95% HPD interval. The mean estimate is largest in 1994 and the timings of the maximum values for the upper and lower boundaries of the 95% HPD interval are 1992 and 1997.