| Literature DB >> 33355530 |
Swapnil Tichkule1,2,3, Aaron R Jex4,2, Cock van Oosterhout5, Anna Rosa Sannella6, Ralf Krumkamp7,8, Cassandra Aldrich7,9,8, Oumou Maiga-Ascofare10,7,8, Denise Dekker7,8, Maike Lamshöft7,8, Joyce Mbwana11, Njari Rakotozandrindrainy12, Steffen Borrmann13,14, Thorsten Thye7,8, Kathrin Schuldt7,8, Doris Winter7,8, Peter G Kremsner13,14, Kwabena Oppong10, Prince Manouana14, Mirabeau Mbong14, Samwel Gesase11, Daniel T R Minja11, Ivo Mueller1,2, Melanie Bahlo1,2, Johanna Nader15, Jürgen May7,8, Raphael Rakotozandrindrain12, Ayola Akim Adegnika13,14, John P A Lusingu11, John Amuasi10, Daniel Eibach7,8, Simone Mario Caccio6.
Abstract
Cryptosporidiosis is a major cause of diarrhoeal illness among African children, and is associated with childhood mortality, malnutrition, cognitive development and growth retardation. Cryptosporidium hominis is the dominant pathogen in Africa, and genotyping at the glycoprotein 60 (gp60) gene has revealed a complex distribution of different subtypes across this continent. However, a comprehensive exploration of the metapopulation structure and evolution based on whole-genome data has yet to be performed. Here, we sequenced and analysed the genomes of 26 C. hominis isolates, representing different gp60 subtypes, collected at rural sites in Gabon, Ghana, Madagascar and Tanzania. Phylogenetic and cluster analyses based on single-nucleotide polymorphisms showed that isolates predominantly clustered by their country of origin, irrespective of their gp60 subtype. We found a significant isolation-by-distance signature that shows the importance of local transmission, but we also detected evidence of hybridization between isolates of different geographical regions. We identified 37 outlier genes with exceptionally high nucleotide diversity, and this group is significantly enriched for genes encoding extracellular proteins and signal peptides. Furthermore, these genes are found more often than expected in recombinant regions, and they show a distinct signature of positive or balancing selection. We conclude that: (1) the metapopulation structure of C. hominis can only be accurately captured by whole-genome analyses; (2) local anthroponotic transmission underpins the spread of this pathogen in Africa; (3) hybridization occurs between distinct geographical lineages; and (4) genetic introgression provides novel substrate for positive or balancing selection in genes involved in host-parasite coevolution.Entities:
Keywords: Africa; Cryptosporidium hominis; genetic introgression; population structure; recombination; whole-genome sequencing
Year: 2020 PMID: 33355530 PMCID: PMC8115899 DOI: 10.1099/mgen.0.000493
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
List of the C. hominis isolates included in the study and sequencing statistics
|
Sample code |
Country |
Sampling date (yyyy-mm-dd) |
GC (%) |
% of reference genome covered at >5× |
Coverage (mean) |
No. of SNPs |
SRA accession |
|---|---|---|---|---|---|---|---|
|
Afr1 |
Ghana |
2016-05-18 |
33 |
99.4 |
142X |
2810 |
SRR11843813 |
|
Afr2 |
Ghana |
2016-05-16 |
31 |
99.9 |
570X |
2797 |
SRR11843812 |
|
Afr3 |
Ghana |
2016-06-02 |
32 |
99.8 |
352X |
2782 |
SRR11843801 |
|
Afr5 |
Ghana |
2016-07-25 |
32 |
99.8 |
354X |
2788 |
SRR11843792 |
|
Afr6 |
Ghana |
2016-08-12 |
31 |
99.9 |
446X |
2797 |
SRR11843791 |
|
Afr7 |
Ghana |
2016-09-20 |
31 |
99.8 |
135X |
2953 |
SRR11843790 |
|
Afr9 |
Ghana |
2016-11-22 |
31 |
99.8 |
135X |
2953 |
SRR11843789 |
|
Afr10 |
Madagascar |
2017-02-24 |
31 |
99.8 |
156X |
2252 |
SRR11843788 |
|
Afr11 |
Madagascar |
2017-03-20 |
32 |
93.6 |
63X |
3012 |
SRR11843787 |
|
Afr12 |
Madagascar |
2017-03-22 |
32 |
99.8 |
180X |
3055 |
SRR11843786 |
|
Afr13 |
Madagascar |
2017-03-24 |
31 |
99.8 |
182X |
3058 |
SRR11843811 |
|
Afr14 |
Tanzania |
2016-12-08 |
31 |
99.8 |
160X |
3122 |
SRR11843810 |
|
Afr15 |
Tanzania |
2017-01-02 |
31 |
99.8 |
166X |
3010 |
SRR11843809 |
|
Afr16 |
Tanzania |
2017-01-03 |
31 |
99.8 |
154X |
2976 |
SRR11843808 |
|
Afr17 |
Tanzania |
2017-01-06 |
31 |
99.8 |
168X |
2839 |
SRR11843807 |
|
Afr18 |
Tanzania |
2017-01-30 |
31 |
99.8 |
195X |
3015 |
SRR11843806 |
|
Afr20 |
Tanzania |
2017-03-06 |
32 |
99.7 |
143X |
3102 |
SRR11843805 |
|
Afr21 |
Tanzania |
2017-03-08 |
31 |
99.9 |
202X |
3018 |
SRR11843804 |
|
Afr22 |
Tanzania |
2017-03-13 |
31 |
99.8 |
132X |
2831 |
SRR11843803 |
|
Afr23 |
Tanzania |
2017-03-27 |
31 |
99.8 |
173X |
2912 |
SRR11843802 |
|
Afr24 |
Tanzania |
2017-03-29 |
31 |
99.8 |
168X |
3024 |
SRR11843800 |
|
Afr25 |
Tanzania |
2017-03-29 |
31 |
99.8 |
195X |
2826 |
SRR11843799 |
|
Afr26 |
Tanzania |
2017-01-18 |
31 |
99.8 |
156X |
2902 |
SRR11843798 |
|
Afr29 |
Gabon |
2017-08-23 |
32 |
95.4 |
18X |
2987 |
SRR11843797 |
|
Afr30 |
Gabon |
2017-09-28 |
32 |
99.7 |
162X |
3035 |
SRR11843796 |
|
Afr33 |
Gabon |
2017-07-28 |
32 |
98.2 |
40X |
3145 |
SRR11843795 |
|
Afr34 |
Gabon |
2018-02-22 |
31 |
99.8 |
202X |
3177 |
SRR11843794 |
|
Afr35 |
Gabon |
2017-08-24 |
31 |
99.8 |
131X |
3187 |
SRR11843793 |
Fig. 1.Phylogenetic relationships among C. hominis African isolates. Isolates are coloured according to their geographical origin in panels (a–f). (a) Principal component analysis (PCA), where PC1 and PC2 account for variability among C. hominis African isolates, differentiating them by physical boundaries. (b) Phylogenetic analysis inferred by Bayesian inference (MrBayes) using a concatenated set of 3128 genomic SNPs. (c) A cloudogram of 2130000 trees obtained by DensiTree. The consensus tree is represented in dark blue. Note the transparent green branches (indicated by the arrow), which reveals that a small part of the Madagascar genomes has a different phylogenetic origin. (d) A phylogenetic network based on all genomic SNPs. The network shows geographical clustering of African isolates by the country of origin. The loops in the network are suggestive of recombination events both within and between isolates from different countries. (e) Network of the gp60 gene reveals admixture of samples from Madagascar and Tanzania. (f) Network of the CHUDEA6_5260 gene shows no evidence of admixture.
Fig. 2.Population structure and recombination analyses. (a) structure plot representing the percentage of shared ancestry among the four African C. hominis metapopulation (for K=6). (b) HybridCheck plot illustrating the sequence similarity among the three isolates (Madagascar Afr12, Ghana Afr9 and Tanzania Afr14) involved in recombination at the gp60 locus.
Fig. 3.Polymorphism and linkage analyses. Pa(a) Nucleotide variation at chromosome 6 is significantly higher than that of all other chromosomes (GLM: F 1,32=25.16, P=5.3e-7), and coding regions show a higher diversity than con-coding regions (GLM: F 1,72=56.65, P=5.4e-14). (b) LD decay versus the mean distance between SNPs calculated across the genome for the African metapopulation and the Bangladeshi population. Also shown is the LD decay for chromosome 6 and all other chromosomes in the African metapopulation. The LD decays more rapidly in the African sample compared to the Bangladeshi, and the LD decay is particularly rapid for SNPs in chromosome 6. (c) Nucleotide diversity (π) of each polymorphic gene. The 37 genes above the threshold (dashed line) are considered to be outliers, and genes with π>0.002 are labelled. (d) Tajima’s D values for each polymorphic gene. Genes with >3 Tajima’s D values are labelled. In (c) and (d) each chromosome is represented with a different colour.
Fig. 4.Distribution and density plots. (a) Density of K a/K s ratio of polymorphic genes. K a/K s ratios of the CHUDEA2_430 (red arrow), CHUDEA2_440 (blue arrow) and CHUDEA2_450 (green arrow) genes are labelled along with their ranks. The CHUDEA2_430 gene has the highest K a/K s ratio (K a/K s >1) and hence acts as an outlier. Panels (b) and (c) represent density plots of polymorphic genes in recombinant and recombinant-free regions. (b) Nucleotide diversity. (c) Haplotype diversity. Green and red arrows indicate values for the CHUDEA6_1070 and CHUDEA6_1080 (gp60) genes, respectively, in each panel.