| Literature DB >> 31712570 |
Kimia Kamelian1,2, Vincent Montoya2, Andrea Olmstead2, Winnie Dong2, Richard Harrigan1, Muhammad Morshed3,4, Jeffrey B Joy5,6.
Abstract
In 2018, the World Health Organization identified the Zika virus (ZIKV) as a pathogen that should be prioritized for public health research due to its epidemic potential. In this study, whole-genome sequencing (WGS) of travel-acquired ZIKV infections was used to examine the limitations of phylogenetic analysis. WGS and phylogenetic analysis were performed to investigate geographic clustering of samples from five Canadians with travel-acquired ZIKV infections and to assess the limitations of phylogenetic analysis of ZIKV sequences using a phylogenetic cluster approach. Genomic variability of ZIKV samples was assessed and for context, compared with hepatitis C virus (HCV) samples. Phylogenetic analysis confirmed the suspected region of ZIKV infection for one of five samples and one sample failed to cluster with sequences from its suspected country of infection. Travel-acquired ZIKV samples depicted low genomic variability relative to HCV samples. A floating patristic distance threshold classified all pre-2000 ZIKV sequences into separate clusters, while only Cambodian, Peruvian, Malaysian, and South Korean sequences were similarly classifiable. While phylogenetic analysis of ZIKV data can identify the broad geographical region of ZIKV infection, ZIKV's low genomic variability is likely to limit precise interpretations of phylogenetic analysis of the origins of travel-related cases.Entities:
Mesh:
Year: 2019 PMID: 31712570 PMCID: PMC6848190 DOI: 10.1038/s41598-019-52613-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Sequencing results of five samples with confirmed travel-acquired ZIKV infection.
| Samplea | Location of Infection | Ct Valueb | No. of Readsc | No. of Human Reads | No. of ZIKV Reads | % ZIKV Virus | Median Depth of Coveraged |
|---|---|---|---|---|---|---|---|
| Sample 1 | Belize | 33 | 3,342,514 | 737,467 | 1,442,159 | 43 | 673 |
| Sample 2 | Mexico | 28 | 3,052,866 | 7,863 | 2,532,761 | 83 | 25143 |
| Sample 3 | Caribbean | 30 | 3,088,142 | 73,386 | 2,427,755 | 79 | 23884 |
| Sample 4 | Barbados | 26 | 2,046,216 | 1,774 | 1,672,980 | 82 | 16832 |
| Sample 5 | Panama | 21 | 3,282,070 | 4,388 | 2,629,877 | 80 | 32939 |
aSamples were provided by the BCCDC and were anonymized again at the BC-CfE.
bCyclic threshold (Ct) was obtained through quantitative RT-PCR.
cTotal number of human, virus, and random reads obtained per sample.
dMedian depth of coverage was calculated using the median number of reads across the genome for each sample.
Figure 1Phylogenetic analysis of five samples with confirmed ZIKV infection collected by the BCCDC. Phylogenetic analysis of Canadian travel-related clinical cases of ZIKV infection with geographically annotated reference sequences from GenBank confirmed the suspected region of ZIKV infection for one (Sample 2 – Mexico) of the five samples. Two samples, Sample 1 – Belize and Sample 4 – Barbados, were missing whole-genome GenBank sequences from suspected areas of infection. However, they clustered within close geographical proximity to neighboring regions. The majority of GenBank reference sequences used in this study were from 2016 (n = 212) followed by 2015 (n = 77). Values for bootstraps greater than 70 are shown at the nodes. Scalebar shows the genetic difference through number of substitutions per site. Branches corresponding to taxa from the same geographical region were collapsed. An imbalance of taxa among different clades is observed within the tree topology.
Figure 2Nucleotide variation of samples. Nucleotide variation was measured for each sample as the number of consensus nucleotide to minor variants. Shown in Fig. 2, is the nucleotide variation averaged per gene over the samples sequenced in the study. In general, an overall relatively low nucleotide variation was observed despite the samples allegedly having five different origins of infections.
Figure 3Shannon diversity index of ZIKV and HCV samples. The Shannon diversity index for ZIKV samples (n = 5) and HCV samples (n = 3) shows a relatively low diversity index of ZIKV samples compared to HCV samples. A sliding window approach was used to highlight diversification across all genomic motifs where diversities at each position were summed over each 100 base-pair window.
Figure 4Nucleotide variability of ZIKV sequences after the year 2000 in dataset. Analysis of the genomic diversity of circulating ZIKV strains after the year 2000 revealed reduced genetic variability as the geographical expansion of ZIKV occurred from Asia to the Americas. A consensus sequence of the protein-coding regions was obtained for each country that had >1 whole-genome sequence (n = 24) and countries with a single representative sequence were also included in the analysis (n = 12). Countries with reference sequences acquired prior to the year 2000 were excluded (n = 5). Whole-genome sequences were aligned and rooted to the WHO reference from French Polynesia (GenBank accession number: KX369547). Variable positions were identified if they were present in >50% of all of the sequences corresponding to a specific country, but not present in the consensus sequence corresponding to any alternative country.
Figure 5Country-specific nucleotide variants. (A) The quantity of variable positions for each country in the dataset are shown if they were present in >50% of all sequences corresponding to a specific country, but not present in the consensus sequence corresponding to any alternative country. (B) The number of nucleotide positions unique to a country is shown. Unique variants were identified relative to any individual sequences from alternative countries by comparing each variant to all nucleotides at the respective position. With geographical expansion into the Western Hemisphere and north- and southward spread of ZIKV, the number of unique variants identified in whole-genome sequences decreased.