| Literature DB >> 28270091 |
Lize Cuypers1, Bram Vrancken2, Lavinia Fabeni3, Nadia Marascio2,4, Valeria Cento5, Velia Chiara Di Maio5, Marianna Aragri5, Andrea Clemencia Pineda-Peña6,7, Yoeri Schrooten2, Kristel Van Laethem2, Daniel Balog8, Alfredo Focà4, Carlo Torti9, Frederik Nevens10,11, Carlo Federico Perno5, Anne-Mieke Vandamme2,6, Francesca Ceccherini-Silberstein5.
Abstract
BACKGROUND: In-depth phylogeographic analysis can reveal migration patterns relevant for public health planning. Here, as a model, we focused on the provenance, in the current Italian HCV subtype 1a epidemic, of the NS3 resistance-associated variant (RAV) Q80K, known to interfere with the action of NS3/4A protease inhibitor simeprevir. HCV1a migration patterns were analysed using Bayesian phylodynamic tools, capitalising on newly generated and publicly available time and geo-referenced NS3 encoding virus genetic sequence data.Entities:
Keywords: Europe; HCV; HCV1a; Italy; Phylogeography; Public health policy; Q80K; US
Mesh:
Substances:
Year: 2017 PMID: 28270091 PMCID: PMC5341469 DOI: 10.1186/s12862-017-0913-3
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Overview of the complete dataset used in this study
| Geographic origin | HCV1a dataset (n, % dataset) | Q80K (n, % country) | Sample time span |
|---|---|---|---|
| Italya | 280 (45.9) | 52 (18.6) | 2004-2015 |
| USb | 145 (23.8) | 58 (24.6)c | 1989-2008 |
| Germany | 67 (11.0) | 19 (28.4) | 2003-2013 |
| Switzerland | 46 (4.5) | 6 (13.0) | 2002-2006 |
| Belgium | 25 (4.1) | 8 (32.0) | 2007-2013 |
| Thailand | 19 (3.1) | 3 (15.8) | 2007-2010 |
| France | 13 (2.1) | / | 2007-2011 |
| Spain | 5 (0.8) | / | 2001-2002 |
| UK | 4 (0.7) | / | 2008-2010 |
| Brazil | 2 (0.3) | / | 2001-2003 |
| Australia | 2 (0.3) | / | 2007 |
| China | 1 (0.2) | / | 2009 |
| Japan | 1 (0.2) | / | 2009 |
| Total | 610 | 146 (23.9) | 1989-2015 |
The absolute and relative contribution of each country, the number of Q80K variants and their share by country, and the time span of the samples included per country, are detailed. a183 (63%) of the Italian sequences have not been published before. bTo assess the proportion of Q80K strains, the 91 taxa from the removed cluster were taken into account (see Methods). cSince the Q80K prevalence in the US was lower than expected, it was recalculated taking strains without sampling time information into account, resulting into a prevalence of 57.5%
Fig. 1The United States as the immediate origin of HCV1a spread to Europe and Italy. The branches in the HCV1a maximum clade credibility (MCC) tree constructed from the complete dataset are colored according to sampling country with strains grouped into four locations (see legend). All but five Q80K strains are found in the same clade, which is highlighted in grey. We refer to Fig. 3 for further details on the Q80K history. Mixing of strains among the four locations can be observed in the entire tree. HCV entered Italy via the US and continental migration in the two major clades of HCV1a variability (Clade I and II) as well as in the Q80K clade. Posterior root node support is visualized in a selection of deeper nodes. a: entire dataset. b: MCC tree estimated from subsample 1 (see Methods). c: MCC tree estimated from subsample 2. d: MCC tree estimated from subsample 3
Posterior probabilities for all possible types of migrations
| Migration from | Migration to | Complete dataset | subsample 1 | subsample 2 | subsample 3 |
|---|---|---|---|---|---|
| Europe | Italy |
| 2.88 |
|
|
| US | 0.01 | 0.08 | 0.09 | 0.08 | |
| Other |
| 0.81 |
|
| |
| Italy | Europe |
|
|
| 0.52 |
| US | 0 | 0.06 | 0.07 | 0.05 | |
| Other | 0.07 |
| 0.47 | 1.15 | |
| US | Europe |
|
|
|
|
| Italy |
|
|
|
| |
| Other |
|
|
|
| |
| Other | Europe | 0.03 | 1.3 | 0.37 |
|
| Italy | 0.03 | 0.33 | 1.43 | 0.23 | |
| US | 0 | 0.11 | 0.11 | 0.08 |
Posterior probabilities for migration rates supported by a BF ≥3 are marked in bold, indicated for the complete dataset as well as for the three subsamples created as described in methods
Fig. 3a: A founder effect dominates the history of the Q80K clade. The branches in the HCV1a MCC tree are colored according to the inferred amino acid at NS3 position 80 (see legend). All except five Q80K variants descend from a common ancestor that is inferred to have existed around 1957 (95% HPD: 1947–1959). Posterior root node support is visualized in a selection of deeper nodes. Amino acids other than Q or K were found in only 13 lineages. b: Histogram of the size of the (second) largest clade with an independent origin of Q80K among the plausible trees. The size of the largest clade remains more or less constant, and the limited size of the second largest clade shows that most contemporaneous Q80K lineages reside in the major Q80K clade. c: Histogram of the number of independent origins of a Q80K lineage in the plausible trees. This shows that in the most plausible trees three to four de novo appearances of the Q80K variant appear
Fig. 2Immigration and emigration rates of HCV1a strains from and to Italy, over time period 1940–2015. Left panel: Over 99% of exportation events from Italy are directed towards other European countries (in blue). Note that the rate decline in the most recent years reflects the loss of branches in the sampling time period, rather than the start of an actual tendency. Right panel: The origin of virtually all strains entering Italy can be traced back to the United States (US) and other European countries (see Table 2). At the onset of the Italian HCV1a epidemic, immigration was most intense from the US (in blue). From 1970 onwards, introductions from the European continent started to dominate (in red). The introduction rate (y-axis) was defined as the median posterior estimate of the expected number of migration events from that particular location per time unit
Highly supported Italian clusters identified in the MCC tree obtained from the complete dataset
| Sample size | Tree root support | Presence of Q80K (N, %) | Time span | Regional information | |
|---|---|---|---|---|---|
| Cluster 1 | 2 | 1 | 0 (0) | 2.23 | Unknown: 2 |
| Cluster 2 | 2 | 1 | 0 (0) | 3.4 | North: 1 – South: 1 |
| Cluster 3 | 2 | 1 | 2 (100) | 8.4 | Unknown: 2 |
| Cluster 4 | 2 | 1 | 2 (100) | 15.0 | Central: 2 |
| Cluster 5 | 2 | 1 | 0 (0) | 17.6 | Unknown: 2 |
| Cluster 6 | 2 | 0.98 | 2 (100) | 22.8 | Unknown: 2 |
| Cluster 7 | 2 | 0.99 | 0 (0) | 32.3 | Unknown: 2 |
| Cluster 8 | 2 | 0.97 | 0 (0) | 32.8 | Central: 1 – Unknown: 1 |
| Cluster 9 | 2 | 0.92 | 0 (0) | 33.1 | Unknown: 2 |
| Cluster 10 | 5 | 0.99 | 0 (0) | 13.9 | Unknown: 5 |
| Cluster 11 | 6 | 1 | 0 (0) | 12.6 | Central: 1 – Unknown: 5 |
| Cluster 12 | 7 | 1 | 0 (0) | 7.8 | Central: 6 – Unknown: 1 |
| Cluster 13 | 34 | 0.98 | 3 (8.8) | 35.4 | North: 2 - Central: 21 - South: 7 – Unknown: 4 |
In total, 13 highly supported clusters were identified (posterior root node support ≥0.90), with 2 to 34 Italian member taxa. Sampling region information was unavailable for many taxa. For each cluster, the sample size, posterior root node support, presence of Q80K and the time spanned by the cluster, is listed. Clusters are ordered according to sample size and time span