| Literature DB >> 23825605 |
Jiufeng Sun1, Yan Huang, Huaiqiu Huang, Pei Liang, Xiaoyun Wang, Qiang Mao, Jingtao Men, Wenjun Chen, Chuanhuan Deng, Chenhui Zhou, Xiaoli Lv, Juanjuan Zhou, Fan Zhang, Ran Li, Yanli Tian, Huali Lei, Chi Liang, Xuchu Hu, Jin Xu, Xuerong Li.
Abstract
Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. Thus, the present study improves our understanding of the global epidemiology and evolution of C. sinensis.Entities:
Mesh:
Year: 2013 PMID: 23825605 PMCID: PMC3688995 DOI: 10.1371/journal.pone.0067006
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Locations where wild cats, dogs and fishes were captured.
Field sites are shown in relation to the ranges of the 17 proposed provinces (marked with red and purple) in mainland China. International borders and major rivers are also shown in the figure.
Polymorphism summary and tests for neutral evolution in each locus.
| Parameters | Phylogenetic Marker | |||||||
| ITS1 |
|
|
|
|
|
|
| |
| No. of sequences | 256 | 256 | 256 | 256 | 256 | 256 | 256 | 256 |
| No. of characters | 572 | 368 | 345 | 731 | 651 | 782 | 760 | 777 |
| No. of cordons | n.a | 80 | 82 | 208 | 216 | 258 | 221 | 243 |
|
| ||||||||
| No. of mutations (η) | 12 | 8 | 14 | 47 | 32 | 12 | 16 | 7 |
| Nucleotide Variability | 1.22% | 2.17% | 4.57% | 6.42% | 4.91% | 1.53% | 2.1% | 0.9% |
| Amino acid Variability | n.a | 2.5% | 3.66% | 5.29% | 4.62% | 0 | 0 | 1.2% |
| No. of haplotypes | 33 | 14 | 16 | 101 | 119 | n.a | n.a | n.a |
| Haplotype diversity | 0.587 | 0.631 | 0.35 | 0.9684 | 0.9707 | n.a | n.a | n.a |
| Nucleotide diversity (Pi) | 0.00128 | 0.00248 | 0.00156 | 0.00479 | 0.00531 | 0.00303 | 0.00439 | 0.00154 |
|
| ||||||||
| Tajima's D | −1.68611 (0.05<p<0.10) | −0.64840(p>0.10) | −1.87004(p<0.05) | −1.57585(0.05<p<0.1) | −0.94367(p>0.10) | 0.49353(p>0.10) | 0.85054(p>0.10) | 0.08993(p>0.10) |
n.a: not applicable.
Figure 2Neighbor-joining tree showing the phylogenetic relationships of the C. sinensis isolates included in this study (n = 256).
All isolates were delineated into three major groups: the cluster A (n = 225) complex, cluster B (n = 19) and cluster C (n = 12). Geographically unique clusters were detected within the dendrogram tree: Henan (purple), Guangxi (green), Guangdong (blue) and Shandong (red). Isolates collected from fish and dogs are marked with symbols for fish and dog. The percentages of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replications) are indicated. The evolutionary distances were computed using the Maximum Composite Likelihood method and are presented in units of the number of base substitutions per site.
cox3
(p<0.05) and nad4 (0.05
Divergence and differentiation among sub-populations.
| Cluster A | Cluster B | Cluster C | |
| No. Sequences | 225 | 19 | 12 |
| No. Segregating sites (S) | 37 | 29 | 44 |
| Average number of differences (K) | 7.76278 | 10.18713 | 19.01515 |
| Nucleotide diversity (Pi) | 0.00280 | 0.00368 | 0.00686 |
| Mutation rates(θ) | 0.00281 | 0.00369 | 0.00690 |
|
| |||
| Tajima's D | 0.46958(p>0.1) | 0.90132 (p>0.1) | 1.26275 (p>0.1) |
| Differentiation | Cluster A | Cluster B | Cluster C |
| Cluster A | 0.05623*** | 0.09727*** | |
| Cluster B |
| 0.13318*** | |
| Cluster C |
|
| |
| Cluster A | |||
| Cluster B | 0.00443 | ||
| Cluster C | 0.00819 | 0.00681 | |
KST values are displayed above the diagonal and represent the weighted measure of the ratio of the average pair-wise differences within groups to the total average pair-wise differences.
Snn values are displayed below the diagonal in bold and represent the proportion of nearest neighbors in sequence space that are found in the same group. Significance levels for KST and Snn were assessed using permutation tests, with 1000 permutations: ns = non-significant, *0.01
Dxy values are displayed at the bottom of the diagonal and represent the minimum estimate of the number of nucleotide differences per site between groups.
Divergence and differentiation among geographic distributions.
| South | North | East | Central | |
| No. Sequences | 32 | 33 | 23 | 38 |
| No. Segregating sites (S) | 75 | 82 | 74 | 104 |
| Average number of differences (K) | 16.92540 | 15.05303 | 17.47036 | 20.12233 |
| Nucleotide diversity (Pi) | 0.00353 | 0.00314 | 0.00364 | 0.00419 |
| Mutation rates(θ) | 0.00388 | 0.00421 | 0.00424 | 0.00516 |
|
| ||||
| Tajima's D | −0.34186(p>0.1) | −0.95434(p>0.1) | −0.55673(p>0.1) | −0.69051(p>0.1) |
| Differentiation | South | North | East | Central |
| South | 0.01862** | 0.0270* | 0.02528 *** | |
| North |
| 0.02441*** | 0.03626 *** | |
| East |
|
| 0.01375* | |
| Central |
|
|
| |
| South | ||||
| North | 0.00346 | |||
| East | 0.00366 | 0.00356 | ||
| Central | 0.00406 | 0.00394 | 0.00403 | |
KST values are displayed above the diagonal and represent the weighted measure of the ratio of the average pair-wise differences within groups to the total average pair-wise differences.
Snn values are displayed below the diagonal in bold and represent the proportion of nearest neighbors in sequence space that are found in the same group. Significance levels for KST and Snn were assessed using permutation tests, with 1000 permutations:
ns = non-significant, *0.01
Dxy values are displayed at the bottom of the diagonal and represent the minimum estimate of the number of nucleotide differences per site between groups.
Divergence and differentiation among different hosts.
| Cat | Fish | Dog | |
| No. Sequences | 191 | 32 | 33 |
| No. Segregating sites (S) | 157 | 75 | 82 |
| Average number of differences (K) | 18.57085 | 16.92540 | 15.05303 |
| Nucleotide diversity (Pi) | 0.00387 | 0.00353 | 0.00314 |
| Mutation rates(θ) | 0.00388 | 0.00354 | 0.00314 |
|
| |||
| Tajima's D | −0.34186(p>0.1) | −0.95434(p>0.1) | −1.03550 (p>0.1) |
| Differentiation | Cat | Fish | Dog |
| Cat | 0.00556*** | 0.00721*** | |
| Fish |
| 0.01862** | |
| Dog |
|
| |
| Cat | |||
| Fish | 0.00379 | ||
| Dog | 0.00361 | 0.00346 | |
KST values are displayed above the diagonal and represent the weighted measure of the ratio of the average pair-wise differences within groups to the total average pair-wise differences.
Snn values are displayed below the diagonal in bold and represent the proportion of nearest neighbors in sequence space that are found in the same group. Significance levels for KST and Snn were assessed using permutation tests, with 1000 permutations: ns = non-significant, *0.01
Dxy values are displayed at the bottom of the diagonal and represent the minimum estimate of the number of nucleotide differences per site between groups.
Divergence and differentiation among water bodies.
| Songhua River | Yellow River | Yangzi River | Pearl River | |
| No. Sequences | 33 | 70 | 87 | 32 |
| No. Segregating sites (S) | 82 | 130 | 117 | 75 |
| Average number of differences (K) | 15.05303 | 21.17723 | 16.55654 | 16.92540 |
| Nucleotide diversity (Pi) | 0.00314 | 0.00441 | 0.00345 | 0.00353 |
| Mutation rates(θ) | 0.00421 | 0.00571 | 0.00488 | 0.00388 |
|
| ||||
| Tajima's D | −0.95434(p>0.1) | −0.78167 (p>0.1) | −0.98406 (p>0.1) | −0.34186 (p>0.1) |
| Differentiation | Songhua River | Yellow River | Yangzi River | Pearl River |
| Songhua River | 0.01383*** | 0.01708*** | 0.01862*** | |
| Yellow River |
| 0.00973*** | 0.01366** | |
| Yangzi River |
|
| 0.01243** | |
| Pearl River |
|
|
| |
| Songhua River | ||||
| Yellow River | 0.00390 | |||
| Yangzi River | 0.00344 | 0.00401 | ||
| Pearl River | 0.00346 | 0.00410 | 0.00360 | |
KST values are displayed above the diagonal and represent the weighted measure of the ratio of the average pair-wise differences within groups to the total average pair-wise differences.
Snn values are displayed below the diagonal in bold and represent the proportion of nearest neighbors in sequence space that are found in the same group. Significance levels for KST and Snn were assessed using permutation tests, with 1000 permutations: ns = non-significant, *0.01
Dxy values are displayed at the bottom of the diagonal and represent the minimum estimate of the number of nucleotide differences per site between groups.
Figure 3Phylogram constructed from UPGMA analysis of p-distances based on concatenated sequences of the ITS1 locus using global penal isolates (n = 288, C. sinensis: 271, O. viverrini: 7, O. felineus: 10).
The fixed cluster I (red) is composed of 234 isolates from 17 provinces in China and 14 isolates from Korea and Japan. Cluster II (light blue) is composed of 16 isolates from Henan, Shannxi, Zhejiang, Guangxi, Anhui and Hubei province. Cluster III (light red) is composed of 6 isolates from Henan province and an ancient corpse from Hubei province in 176 BC. Seventeen O. viverrini (purple) and O. felineus (green) were used as outgroups. The percentage replicates are 1000 replications. The evolutionary distances were computed using the Maximum Composite Likelihood method.
Figure 4Haplotype networks of 288 isolates at ITS1 locus.
The sampled haplotypes are indicated by circles; the geographical regions from which the sample was collected and the size of the circles are proportional to the observed haplotype frequency. The super linkage populations depict the haplotype with the highest ancestral probability (light red), and each branch indicates mutational separation (light blue and purple). Internal nodes (yellow) are representative of ancestral haplotypes. Media Vector (mv, red) indicate the vector between two ancestral haplotypes.
Summarized archaeological evidence of C. sinensis eggs in desiccated fecal remains in mummies according to archived archaeology studies.
| Number | Gender | Age | Origin of isolation | Time of isolation | Buried time |
| 1 | Female | 80 | Guangzhou, Guangdong | 1956 AD | Ming dynasty, 1505 AD |
| 2 | Male | 50 | Hengyang, Hunan | 1973 AD | Song dynasty, 960–1127 AD |
| 3 | Male | 60 | Jiangling, Hubei | 1975 AD | Han dynasty, 167 BC |
| 4 | Male | 50 | Fuqing, Fujian | 1980 AD | Ming dynasty, 1558 AD |
| 5 | Female | 50–60 | Fuzhou, Fujian | 1980 AD | Ming dynasty, ∼1558 AD |
| 6 | Female | 40–45 | Jiangling, Hubei | 1982 AD | The Warring States, 475–221 BC |
| 7 | Female | 70–75 | Jingmen, Hubei | 1994 AD | The Warring States, 475–221 BC |