Literature DB >> 24050210

Genetic recombination and Cryptosporidium hominis virulent subtype IbA10G2.

Na Li, Lihua Xiao, Vitaliano A Cama, Ynes Ortega, Robert H Gilman, Meijin Guo, Yaoyu Feng.   

Abstract

Little is known about the emergence and spread of virulent subtypes of Cryptosporidium hominis, the predominant species responsible for human cryptosporidiosis. We conducted sequence analyses of 32 genetic loci of 53 C. hominis specimens isolated from a longitudinally followed cohort of children living in a small community. We identified by linkage disequilibrium and recombination analyses only limited genetic recombination, which occurred exclusively within the 60-kDa glycoprotein gene subtype IbA10G2, a predominant subtype for outbreaks in industrialized nations and a virulent subtype in the study community. Intensive transmission of virulent subtype IbA10G2 in the study area might have resulted in genetic recombination with other subtypes. Moreover, we identified selection for IbA10G2 at a 129-kb region around the 60-kDa glycoprotein gene in chromosome 6. These findings improve our understanding of the origin and evolution of C. hominis subtypes and the spread of virulent subtypes.

Entities:  

Keywords:  Cryptosporidium hominis; cryptosporidiosis; genetic recombination; molecular epidemiology; parasites; population genetics; subtype; subtype IbA10G2; virulence

Mesh:

Substances:

Year:  2013        PMID: 24050210      PMCID: PMC3810731          DOI: 10.3201/eid1910.121361

Source DB:  PubMed          Journal:  Emerg Infect Dis        ISSN: 1080-6040            Impact factor:   6.883


Cryptosporidium spp. are emerging pathogens of humans and a variety of vertebrates, and cause severe diarrhea in immunocompetent and immunocompromised persons (). Cryptosporidium hominis is responsible for >70% of human infections in most areas, especially North America and developing countries (). C. hominis is primarily transmitted anthroponotically, has several transmission routes, and causes numerous waterborne outbreaks of diarrheal illness each year in the United States and other industrialized nations. Among several C. hominis subtype groups (e.g., Ia, Ib, Id, Ie, If, Ig) identified by sequence analysis of the 60-kDa glycoprotein (gp60) gene, the Ib subtype is the major subtype responsible for waterborne and foodborne outbreaks of cryptosporidiosis in many countries. Subtype IbA10G2 has been found in ≈50% of C. hominis–associated outbreaks in the United States, including the massive outbreak in Milwaukee, Wisconsin, USA, in 1993 (,). It is the only subtype identified in cryptosporidiosis outbreaks by C. hominis in countries in Europe and in Australia (–). In a longitudinal birth-cohort study of cryptosporidiosis in a periubran shantytown in Lima, Peru, IbA10G2 was more virulent than other C. hominis subtypes (). Genetic determinants for virulence of Cryptosporidium spp. and reasons for emergence of virulent subtypes are poorly understood because of availability of only limited genomic sequence data and lack of robust cultivation systems and genetic manipulation tools (). We conducted a comparative population genetic analysis of virulent C. hominis subtype IbA10G2 in children living in a periurban community in Lima, Peru, by multilocus sequence typing (MLST) of 32 genetic markers. Data obtained should be useful in understanding emergence and spread of virulent C. hominis subtypes.

Materials and Methods

Specimens, Species, and Subtype Determination

Fecal specimens microscopically positive for Cryptosporidium spp. were collected during a longitudinal cohort study of enteric diseases in children living in a periurban shantytown in Lima, Peru, during 2004–2006. The field study was conducted in the same community used in a previous longitudinal cohort study of enteric pathogens and had a similar study design (), except that children enrolled in this study were older (mean age 3.43 years vs. 14 days). The study was approved by the institutional review boards of Johns Hopkins University and the Centers for Diseases Control and Prevention. DNA was extracted from 200 μL of microscopy-positive specimens by using the FastDNA SPIN Kit for Soil (MP Biomedicals, Irvine, CA, USA). Cryptosporidium spp. in specimens were genetically characterized at the species level by PCR–restriction fragment length polymorphism analysis of the small subunit rRNA gene () and at the subtype level by DNA sequence analysis of the gp60 gene (). A total of 53 C. hominis–positive specimens (1 specimen/child) belonging to 4 gp60 subtype groups were selected for MLST analysis: Ia (9 specimens of IaA13R8 and 1 specimen of IaA13R7), Ib (26 specimens of IbA10G2), Id (6 specimens of IdA10 and 5 specimens of IdA20), and Ie (6 specimens of IeA11G3T3). No specimens used in the study had mixed C. hominis subtypes.

MLST Markers

Among the 8 chromosomes of Cryptosporidium spp., chromosome 6 was fully sequenced for 2 C. parvum specimens and 1 C. hominis specimen, which facilitated search for additional microsatellite and minisatellite markers for population genetics analysis. The 8 polymorphic markers on chromosome 6 (gp60, CP47, CP56, MSC6–5, MSC6–7, TSP8, Mucin1, and DZ-HRGP) used in several MLST and population genetic analyses of C. hominis and C. parvum (–) were included in this study. We searched for additional microsatellite and minisatellite markers in the chromosome 6 genome by using Tandem Repeats Finder software (http://tandem.bu.edu/trf/trf.html). Of 325 short tandem repeat sequences identified, 46 loci were initially selected at spaced intervals that covered the entire chromosome 6. A nested PCR approach was used for amplification of all potential targets. After an initial evaluation of primers by PCR and DNA sequencing of IaA15R3, IaA20R3, IbA10G2, IdA10, and IeA11G3T3 subtypes of C. hominis from the United States and Peru, 24 polymorphic loci (Table 1, Appendix), and the 8 markers previously identified, were used in this study.
Table 1

Primer sequences of 24 additional microsatellite and minisatellite loci used in the multilocus typing analysis of Cryptosporidium hominis*

No.LocusPolymorphismPrimerSequence, 5′→3′PCR product , bp
1C6–60MicrosatelliteF1TGGACAAGAGTGTCGCTTCTA≈866
CACR1GTATTGGGTCCTTTCAGCTCA
F2TCCAAAGTACACTGCGTAAATAT
R2TCATGTCTCAACATGCTGAGTA
2C6–160MicrosatelliteF1GTACTCAAGTTCCAGTTGTGGA≈582
TAR1CCAATACATTCAGAAGACAGTCT
F2CATCCTCAGAAGAGTACTGTA
R2GTGCTGAATGGCTTCCTATTA
3C6–190Microsatellite-F1CTATCTCTCAAGCAATGCAAAC≈534
AATR1CAAGCTCCATCAGATTATCAATA
F2CAAGTACTGGAATAGTTACTCA
R2CTGGCAATACGTAATGCCTTT
4C6–230MicrosatelliteF1GGCCACTTCATGGTAATACTC≈627
TAAR1GTCAACGAATCATCCAGAGATT
F2GGTACAATTGGAGGTATCAGT
R2GAATCTCGAACCTGAAGACTT
5C6–280MicrosatelliteF1GATTAAAATCAGGCGGGCCAA≈764
TCAR1GAGCAATTTCTTGAATAATATTGAG
F2GAGGACTGTGGGGGAATATAA
R2GACAGGAAATCCAATCAATGAATTT
6C6–350MicrosatelliteF1GTTCTTTCAATTGTGTACCATGA≈715
ATTR1GCTATTGGTTCAAATAATGGTATGA
F2GCATCGCAAATCATCATCTATTT
R2CTACATGGGATTGTTTATATGATA
7C6–500′MicrosatelliteF1GACAAAGAGTATTAGCATCGACA≈484
ATAR1CTTCATGCTCTAGTTCTTCTAAT
F2GAGTCATATGAAACAGAATCTTT
R2CAAATTGATCGAGATTTGTTGAA
8C6–580MicrosatelliteF1CTGGATGACCAAGGTGTTAAT≈632
TCTR1CTGTATTGTTACTGTCAATGCTT
F2GAGATGAATATTGAGAATGAATCCA
R2CTCGAGATGATCGATGAATCAT
9C6–740MicrosatelliteF1GAAACAGTTAAAGATAGCTTGGA≈703
ATR1GAAGATGATGGAATTCTTCTCAT
F2GTATCTTATGTGTATTCAAGCAT
R2CTAATTCTTTGTATGACAAGATCAT
10C6–830′MicrosatelliteF1GAGGCCTTAAAGCTTCTTTTA≈679
ATTR1GCACTACAACAAGTGGTACTA
F2GAGATTGTATTCTTCTATACCACT
R2CAAGTTCCAATGATAGGATCAGT
11C6–830MicrosatelliteF1GATGTATTACTACCAGAATTGAGA≈692
TAGR1CTACTCAATCAATGGAAGTTACT
F2CCAACTGATGATATTCCAGTATT
R2GGTATTATTACATCTAGTGGTGA
12C6–870MicrosatelliteF1CAGTGACCGAATTTACTCTCT≈801
ATR1GTCTCTTTAGTCAAGTTTTCCT
F2CGCTCTTCCACGTACAATTT
R2CGAGAATAAAGCTGACATTTCTT
13C6–1000′MinisatelliteF1GGAGCCTAATTGTGCTCATAT≈583
GATAAAAAGR1GGTTATCATGCACTGACTGTA
AAGAGGGAF2CTCAATTGCTAAAGATGCAGAT
R2GAATCCTGTTGTTCTCCATCT
14C6–1420MinisatelliteF1GCATTAGAGCATCCATGGTTA≈534
GAAAGAGAR1CCTGTGGTCTCGATATTCATT
GCGTF2CATCTTCAGCCGTTGATGAAA
R2GTAAACTCCTTGGAGGTGAAT
15C6–2600MicrosatelliteF1GAGGATGAACTTGTACTGCAAT≈806
GCAGCTR1CCTAGTATTGGTGGTACTTGTA
F2GGATCGAATTCTGGGTTGAAT
R2GTATCAACTTGCCCTGGAATA
16C6–2970MicrosatelliteF1GACTGTATGCCTTTGTTCCTT≈723
GAR1GTTGAAGATACTCCTGAAGCT
F2GAGCGTCTTGTCTACTGACTT
R2CTGTCCTCCAGGCATTATTGA
17C6–3110′MinisatelliteF1GTGGATAAGAGAACCCGTCAT≈725
TTCCTCCTAR1CACAAATTCTGGTGGGAATAGT
F2CCAGTGATAGTTAATCCAGCT
R2GAGCCAGAAGTCAGTATTTCT
18C6–3520MicrosatelliteF1GTTGGAGTATGACAATTACCTAA≈748
TTAR1CATCATGGAAATCTTACAGGATT
F2GCATTGTACAACATCCATGTCT
R2GGTGAACATTTTGGATTAGTATCT
19C6–3520′MicrosatelliteF1CTCTGTACAGCTGGTGTAATT≈577
TATR1CCTGGTGTGAATCAATGTTGT
F2CTTCTGGAAGCTCACCAAGAA
R2GCTCCTTATAGATTAGCTGAGT
20C6–3690MicrosatelliteF1CTTCTGGAGATTCCATATCATTGA≈640
TCTR1CCTTAAGGCTGGTTGATATGATT
F2GTGCAAGTGATATTTCCACTATCT
R2CTGTCGAATCACTAGGCAGAA
21C6–4110MicrosatelliteF1CGGAAGATAATGCTCAACTGT≈709
ACAR1CTGCCTTACTATCATTCGCAT
F2CAAGTGGAAACATTGAGCCAA
R2GGAACTGACTGGTTTGATGAT
22C6–5110′MicrosatelliteF1CACTCTTAATTCCTTCATGGCT≈533
CTCR1GGTCAATAGGACGTATTTCTTA
F2GCATCCTCAGCTTTAGAATTCT
R2GGTACAGAAGATCAGCTTTCT
23C6–5120MicrosatelliteF1GGATTGATCAGTGACAGTGAA≈569
ATR1CATAATTGGAGGACTATGTCAAAT
F2GCTGTAGTTCTAAGAGTCCAT
R2GTCTCAAATATCATTAGAACAAGGT
24C6–5410MicrosatelliteF1GACTCAGTTCGAGAGAAGTCA≈646
GTAGTGR1GGTGAATCATCTTCATGCAGT
F2CATCGAGCGTGCACTTAATCA
R2CTCAGACTCAAGGACCTGAAT

*All annealing temperatures for primers were 55°C. F, forward; R, reverse.

*All annealing temperatures for primers were 55°C. F, forward; R, reverse.

MLST PCR and Sequencing

Conditions for nested PCR analysis of each marker were similar to those described (–), except for differences in annealing temperatures specified in Table 1. One negative control and 1 positive control (C. parvum IIaA16G2R1 subtype) were used in each PCR. PCR products of the expected size were sequenced in both directions by using an ABI 3130 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). Sequences obtained from each locus were edited and aligned by using BioEdit 7.04 version (www.mbio.ncsu.edu/BioEdit/bioedit.html) and ClustalX version 1.81 (www.clustal.org/), respectively. Nucleotide sequences of all haplotypes of polymorphic loci were deposited in GenBank under accession nos. JX088398–JX088417 and JX088427–JX088501.

Analyses of Sequence Polymorphism

For each marker, sequences of 53 C. hominis specimens were used for calculation of haplotype diversity (Hd), number of haplotypes, number of polymorphic and segregating sites, intragenic linkage disequilibrium (LD), and intragenic recombination rates by using DnaSP version 5.10.00 (www.ub.es/dnasp/). The ratio of nonsynonymous substitutions per nonsynonymous site to synonymous substitutions per synonymous site was calculated for gp60 and several nearby loci (C6–830′, C6–1000′, C6–1420, CP47, and C6–2600) by using DnaSP. Thereafter, sequences of polymorphic loci were concatenated as a single multilocus contig to calculate gene diversity, interlocus LD, and recombination rates by using DnaSP. Pairwise intergenic LD was also evaluated among polymorphic loci by using the Fisher exact test and Markov chain parameters in Arlequin version 3.1 (http://cmpg.unibe.ch/software/arlequin3/).

Substructure Analysis

Population substructures were analyzed by using the Bayesian analysis tool STRUCTURE version 2.2 (http://pritch.bsd.uchicago.edu/structure.html). Conversion of microsatellite and minisatellite sequences to allelic data was performed by using the Excel Microsatellite Tool Kit 3.1.1 (http://animalgenomics.ucd.ie/sdepark/ms-toolkit/) and Genepop version 4.0 (http://genepop.curtin.edu.au/). Several analyses of allelic data were performed by using K (likely populations) ranging from 2 to 8 and 50,000 iterations after a burn-in of 50,000 iterations. Output at K = 2–5 provided the best fit to MLST data and were used in further analyses. To provide an alternative view of substructuring, a median-joining algorithm analysis () was conducted by using Network version 4.6.1.0 (www.fluxus-ngineering.com/sharenet.htm). Each gp60 subtype was further analyzed for LD and recombination by using DnaSP and Arlequin. Population differentiation (FST) between IbA10G2 and non-IbA10G2 subtypes was also assessed at each genetic locus by using Arlequin.

Results

Multilocus Gene Diversity

Sequence polymorphism among the 53 C. hominis specimens was detected at 25 loci (Table 2), including variations in copy numbers of microsatellite or minisatellite repeat and single nucleotide substitutions (SNPs) and insertion and deletion (indels) in the nonrepeat regions. The remaining 7 loci (C6–190, C6–500′, TSP8, C6–830, C6–870, MSC6–5, and Mucin1) were monomorphic in the study population. The Hd of individual polymorphic locus ranged from 0.0377 to 0.9231, and most loci with higher Hd were situated in the first 25% of chromosome 6 (Table 2). Number of haplotypes per locus ranged from 2 to 21, and more haplotypes were at loci C6–60 (21), CP47 (6), gp60 (6), C6–230 (6), and C6–740 (5). MLST analysis confirmed the absence of concurrence of mixed C. hominis subtypes in the 53 specimens.
Table 2

Intragenic analyses of 53 Cryptosporidium hominis specimens at 25 polymorphic loci in chromosome 6*

No.LocusPosition in chromosome 6Haplotype diversityNo. haplotypesPolymorphic sites, bp†Segregating sites, bp‡Intragenic LDIntragenic recombination events
1CP565451–61290.4819321NANA
2MSC6–712415–129450.48193401NANA
3C6–6022017–228070.923121983Y = 1.0000 + 0.0000X0
4C6–16044235–447830.6727370NANA
5C6–23058658–592340.49066791NANA
6C6–28070850–714540.55303423Y = 1.0000 + 0.0000X0
7C6–35084688–852770.413622112Y = 1.0000 + 0.0000X0
8C6–580138267–1388660.2409322NANA
9C6–740188916–1895410.5791571NANA
10C6–830′210232–2108840.1742211NANA
11C6–1000′251594–2520870.383233520Y = 1.0000 + 0.0000X0
12gp60266540–2673500.70906431297Y = 0.9729 + 0.0130X26
13C6–1420338725–3392420.20833131NANA
14CP47372535–3729070.808461137Y = 1.0000 + 0.0000X0
15C6–2600606026–6067980.3570211NANA
16C6–2970685852–6865410.3570262NANA
17C6–3110′721467–7221690.0377290NANA
18C6–3520820500–8212150.4136260NANA
19C6–3520′822524–8230760.0377211NANA
20C6–3690864809–8654160.4688211NANA
21DZ-HRGP917600–9181390.1742230NANA
22C6–4110956461–9570830.41154275Y = 1.0000 + 0.0000X0
23C6–5110′1206378–12067740.2663382NANA
24C6–51201213134–12136770.2612260NANA
25C6–54101285377–12860030.5218390NANA

*LD, linkage disequilibrium, where Y is the LD value, and X is the nucleotide distance in kilobases; NA, analysis is not applicable for the locus, which has <3 segregating sites.
†Polymorphic sites including insertions/deletions (indels).
‡Segregating sites excluding indels.

*LD, linkage disequilibrium, where Y is the LD value, and X is the nucleotide distance in kilobases; NA, analysis is not applicable for the locus, which has <3 segregating sites.
†Polymorphic sites including insertions/deletions (indels).
‡Segregating sites excluding indels. Alignment of combined multilocus sequences covered 15,717 bp and 968 polymorphic sites, including 362 segregating sites and 606 indels. Multilocus sequences had 43 multilocus genotypes (MLGs), an Hd of 0.9898, and a nucleotide diversity of 0.339. The frequency of MLGs ranged from 7.5% (1 MLG with 4 specimens), 5.7% (1 MLG with 3 specimens), and 3.8% (5 MLGs each with 2 specimens) to 1.9% (36 MLGs each with 1 specimen). Because 431/968 polymorphic sites and 297/362 segregating sites occurred within the gp60 locus, a second analysis was performed by using concatenated sequences excluding gp60 (14,768 bp) with 537 polymorphic sites, including 65 segregating sites and 472 indel sites. A lower nucleotide diversity of 0.289 was observed. However, the number of MLGs (43) and haplotype diversity (0.9898) remained the same.

LD and Recombination

Intragenic LD between pairs of segregating sites was assessed for each polymorphic locus. The analysis was possible only for 7 loci (C6–60, C6–280, C6–350, C6–1000′, gp60, CP47, and C6–4110) that had at ≥3 segregating sites. Incomplete intragenic LD (|D′| Y = 0.9729 + 0.0130X), where Y is the LD value and X is the nucleotide distance in kb, was observed at the gp60 locus, and complete intragenic LD (|D′| Y = 1.0000 + 0.0000X) was found at each of the 6 remaining loci (Table 2). An intragenic recombination test identified 26 potential recombination events (Rms) at the gp60 locus and no recombination at the remaining loci (Table 2). On the basis of the concatenated multilocus sequence data, interlocus LD was assessed over all segregating sites by using pairwise comparisons (). Analysis of combined multilocus sequences of all loci resulted in an overall interlocus genetic association (Zns) value of 0.2711 (95% CI 0.0695–0.3918; the probability p for expected Zns ≤0.2711 was 0.883). Of 47,895 pairwise comparisons, 25,904 were significant by Fisher exact test, and 8,714 were significant after Bonferroni correction (Table 3). In an additional LD analysis per site, strong but incomplete LD (|D′| Y = 0.9289 – 0.0299X) was detected with a negative slope, indicating a decrease in linkage with increased nucleotide distance (Table 3). When analysis was performed after excluding gp60, the test produced a Zns value of 0.2184 (95% CI 0.0654–0.4469, p = 0.759). Of 2,080 pairwise comparisons, 921 were significant by Fisher exact test, and 333 were significant after Bonferroni correction. Incomplete LD (|D′| Y = 0.7886 + 0.0051X) was also observed in LD analysis per site (Table 3). Thus, recombination might occur because of incomplete LD. An overall recombination test showed a minimum of 33 potential recombination events and an estimated 5.0 R/gene (Table 3). When analysis was performed after exclusion of gp60, only 6 recombination events and 1.8 R/gene were observed (Table 3). Thus, recombination was occurring mostly at the gp60 locus.
Table 3

Pairwise interlocus linkage disequilibrium and recombination analysis of concatenated multilocus sequences from various subtypes of Cryptosporidium hominis*

PopulationNo. samplesNo. segregating sites analyzedNo. pairwise comparisonsNo. significant pairwise comparisons†Zns|D'|LDEstimate of R/geneMinimum no. recombination events
Including gp60‡
All5336247,89525,904 (8,714)0.2711Y = 0.9289 – 0.0299XIncomplete5.033
IaA13R8(7)102019000.7581Y = 1.0000 + 0.0000XComplete0.0010
IbA10G22631465160 (140)0.3660Y = 0.9342 + 0.0001XIncomplete0.34
IdA106149101.0000Y = 1.0000 + 0.0000XComplete0.0010
IdA2053300.3750Y = 1.0000 + 0.0000XComplete0.0010
IeA11G3T361917101.0000Y = 1.0000 + 0.0000XComplete0.0010
Excluding gp60§
All53652080921 (333)0.2184Y = 0.7886 + 0.0051XIncomplete1.86
IaA13R8(7)102019000.7581Y = 1.0000 + 0.0000XComplete0.0010
IbA10G22631465160 (140)0.3660Y = 0.9302 + 0.0010XIncomplete0.34
IdA106149101.0000Y = 1.0000 + 0.0000XComplete0.0010
IdA2053300.3750Y = 1.0000 + 0.0000XComplete0.0010
IeA11G3T361917101.0000Y = 1.0000 + 0.0000XComplete0.0010

*Zns, interlocus genetic association; |D'|, linkage disequilibrium (LD) value, where Y is LD value and X is nucleotide distance in kilobases; gp 60, 60-kDa glycoprotein gene.
†By Fisher exact test (after Bonferroni correction).
‡Based on concatenated multilocus gene sequence of all loci (15,717 bp). 
§Based on concatenated multilocus gene sequence of all loci excluding gp60 (14,768 bp).

*Zns, interlocus genetic association; |D'|, linkage disequilibrium (LD) value, where Y is LD value and X is nucleotide distance in kilobases; gp 60, 60-kDa glycoprotein gene.
†By Fisher exact test (after Bonferroni correction).
‡Based on concatenated multilocus gene sequence of all loci (15,717 bp). 
§Based on concatenated multilocus gene sequence of all loci excluding gp60 (14,768 bp).

Substructure in C. hominis

The evolutional relationship among gp60 subtypes of C. hominis was inferred by using STUCTURE and predicted population numbers K = 2–5. The ancestral population size K = 5 was considered the best estimate of current population substructure (Figure 1). The most dominant feature in the output was heterogeneity of specimens belonging to subtype IbA10G2 compared with other subtypes. Within 26 specimens of IbA10G2, the pattern of combinations in STRUCTURE suggested a mixture of ancestral types, reflecting likely genetic recombination in the subtype. In contrast, the 10 specimens of the Ia subtype (including 9 specimens of IaA13R8 and 1 specimen of IaA13R7) had mostly a single pattern, thus providing strong support for it being a separate population with a unique ancestry. Likewise, intrasubtype homogeneity was also observed in 6 specimens of IdA10, five specimens of IdA20, and 6 specimens of IeA11G3T3. Within the Id subtype, specimens of IdA10 and IdA20 showed different patterns regardless of the K value, indicating that these 2 Id subtypes had distinct ancestries with little mixture and genetic recombination. On the basis of patterns at K = 5 (Figure 1), some IbA10G2 specimens had a mixture of patterns, which were the dominant patterns in subtype IaA13R8 and subtype IeA11G3T3, respectively, suggesting that these IbA10G2 specimens might have resulted from genetic exchange between the ancestor of subtype IbA10G2 and the ancestor of subtype IaA13R8 or IeA11G3T3.
Figure 1

Relationship among various 60-kDa glycoprotein gene subtypes of Cryptosporidium hominis by substructure analysis. Predicted population numbers K = 2–5 were applied in STRUCTURE version 2.2 (http://pritch.bsd.uchicago.edu/structure.html) analysis of the data. Colored regions indicate major ancestral contributions. Mixed genotypes are indicated by the pattern of color combinations. Values along the baseline indicate C. hominis–positive fecal specimens.

Relationship among various 60-kDa glycoprotein gene subtypes of Cryptosporidium hominis by substructure analysis. Predicted population numbers K = 2–5 were applied in STRUCTURE version 2.2 (http://pritch.bsd.uchicago.edu/structure.html) analysis of the data. Colored regions indicate major ancestral contributions. Mixed genotypes are indicated by the pattern of color combinations. Values along the baseline indicate C. hominis–positive fecal specimens. We also conducted a median-joining network analysis of the MLST data for the 53 specimens. Among 21 MLGs generated on the basis of segregating sites of concatenated sequences, excluding gp60, we did not find any shared MLGs between gp60 subtypes of C. hominis (Figure 2). Compared with STRUCTURE analyses, the results of network analysis showed similarity to patterns at K = 2 (Figure 1), and supported the conclusion on the heterogeneity of subtype IbA10G2. In network analysis, central types are usually possible ancestors, and peripheral types are descendants (). Because we did not find any MLG as the central type, it was impossible to define a single ancestral line that gave rise to other lines. This finding could be caused by the small sample size and few MLGs in this study.
Figure 2

Median-joining network for various subtypes of Cryptosporidium hominis. Circles are proportional to the frequency of each multilocus genotype (MLG) (21 MLGs on the basis of segregating sites of concatenated sequences excluding the 60-kDa glycoprotein gene). The color of each circle represents the 60-kDa glycoprotein gene subtypes of the C. hominis specimens: IaA13R8 (green), IbA10G2 (blue), IdA10 (purple), IdA20 (yellow), and IeA11G3T3 (red). Length of lines connecting MLGs is proportional to the number of single-nucleotide polymorphisms.

Median-joining network for various subtypes of Cryptosporidium hominis. Circles are proportional to the frequency of each multilocus genotype (MLG) (21 MLGs on the basis of segregating sites of concatenated sequences excluding the 60-kDa glycoprotein gene). The color of each circle represents the 60-kDa glycoprotein gene subtypes of the C. hominis specimens: IaA13R8 (green), IbA10G2 (blue), IdA10 (purple), IdA20 (yellow), and IeA11G3T3 (red). Length of lines connecting MLGs is proportional to the number of single-nucleotide polymorphisms.

Comparative Population Genetics of IbA10G2

The population genetics of various gp60 subtypes was assessed by analyses of LD and recombination rates. In pairwise interlocus LD analysis of multilocus sequences including or excluding gp60, strong but incomplete LD (|D′| Y = 0.9342 + 0.0001X or |D′| Y = 0.9302 + 0.0010X) was observed in subtype IbA10G2, suggesting recombination within this subtype. In contrast, all other subtypes showed complete LD (|D′| Y = 1.0000 + 0.0000X) among all sites, indicative of no genetic recombination within these subtypes (Table 3). In addition, pairwise intergenic LD was also evaluated between 25 loci, resulting in 87 instances of significant pairwise LD (p<0.05) observed in subtype IbA10G2 compared with 227 significant LD in the remaining subtypes (Figure 3). In IbA10G2 specimens, loci around gp60 (locus 12) had no LD with any other loci in chromosome 6. This finding indicated genetic recombination in subtype IbA10G2 at loci near gp60.
Figure 3

Pairwise intergenic linkage disequilibrium (LD) matrix among 25 polymorphic loci of Cryptosporidium hominis. The 25 polymorphic loci are indicated in gray (see Table 2 for identification of loci). Significant LD between loci is indicated in red. Subtype IbA10G2 (A), which has 87 instances of pairwise LD, has fewer LD in comparison with the remaining subtypes (B), which have 227 instances of pairwise LD. In IbA10G2, loci around the 60-kDa glycoprotein gene (locus 12) have no LD with any other loci in chromosome 6.

Pairwise intergenic linkage disequilibrium (LD) matrix among 25 polymorphic loci of Cryptosporidium hominis. The 25 polymorphic loci are indicated in gray (see Table 2 for identification of loci). Significant LD between loci is indicated in red. Subtype IbA10G2 (A), which has 87 instances of pairwise LD, has fewer LD in comparison with the remaining subtypes (B), which have 227 instances of pairwise LD. In IbA10G2, loci around the 60-kDa glycoprotein gene (locus 12) have no LD with any other loci in chromosome 6. To validate these observations, recombination tests were conducted for all gp60 subtypes. Using full multilocus sequences, we found that among 33 potential Rms observed in overall recombination analysis, 4 Rms were detected in subtype IbA10G2 but no Rm was found within each of the remaining subtypes (Table 3). When analysis was performed that excluded gp60, among 6 potential Rms in the overall recombination analysis, 4 Rms were still detected in subtype IbA10G2 but no Rm was seen in the other subtypes (Table 3). Thus, genetic recombination occurred only within the subtype IbA10G2, a conclusion in agreement with results of the STRUCTURE and LD analyses. The genetic determinant for differences between IbA10G2 and non-IbA10G2 subtypes was assessed by comparison of haplotype diversity of the 2 groups at all 25 polymorphic loci in chromosome 6. At most of the genetic loci, the haplotype diversity of the 2 groups was similar, and there was no clear population differentiation at these loci (Figure 4, Table 4). However, at 4 loci (C6–830′, C6–1000′, gp60, C6–1420) around gp60, an absence of genetic diversity was observed in subtype IbA10G2 compared with non-IbA10G2 subtypes (Figure 4). The region (129 kb) of homogeneity in subtype IbA10G2 was located in a 210–339-kb region in chromosome 6. Pairwise FST analysis showed highly significant differentiation between the IbA10G2 subtype and non-IbA10G2 subtypes at 4 loci around gp60, C6–1000′ (p<0.00001), gp60 (p<0.00001), CP47 (p<0.00001), and C6–2600 (p<0.001). The only other locus that showed such a level of differentiation between the 2 groups was C6–3520 (Table 4).
Figure 4

Genetic variation in haplotype diversity at 25 polymorphic loci in chromosome 6 of Cryptosporidium hominis (see Table 2 for identification of loci). Red squares indicate subtype IbA10G2 and black circles indicate non-IbA10G2 subtypes. A homogeneity (reduced haplotype diversity) of subtype IbA10G2 was seen in 4 loci around the 60-kDa glycoprotein gene.

Table 4

Estimates of haplotype diversity in IbA10G2subtype and non-IbA10G2 subtypes of Cryptosporidium hominis and comparison of 2 populations at each locus by pairwise FST analysis*

No.LocusHaplotype diversity
Population pairwise FSTp value ± SEM
IbA10G2 subtypeNon-IbA10G2 subtypes
1CP560.52000.4615−0.034540.88867 ± 0.0104
2MSC6–70.36920.55270.074910.10156 ± 0.0074
3C6–600.92310.91450.009220.21387 ± 0.0105
4C6–1600.67380.66670.007350.31738 ± 0.0131
5C6–2300.38460.56700.050700.07422 ± 0.0069
6C6–2800.40920.65530.063000.06836 ± 0.0076
7C6–3500.40920.4330−0.037360.99902 ± 0.0002
8C6–5800.00000.42740.172050.00781 ± 0.0028†
9C6–7400.46460.66950.033430.12891 ± 0.0126
10C6–830′0.00000.31340.150310.04883 ± 0.0067‡
11C6–1000′0.00000.61540.302880.00000 ± 0.0000§
12gp600.00000.78350.603400.00000 ± 0.0000§
13C6–14200.00000.37320.156650.02051 ± 0.0038‡
14CP470.68000.77210.180330.00000 ± 0.0000§
15C6–26000.07690.50140.298880.00098 ± 0.0010§
16C6–29700.36920.3590−0.039010.99902 ± 0.0002
17C6–3110’0.00000.0741−0.001430.99902 ± 0.0002
18C6–35200.07690.51850.420360.00000 ± 0.0000§
19C6–3520’0.07690.00000.001480.48535 ± 0.0148
20C6–36900.44310.5014−0.017000.55859 ± 0.0145
21DZ-HRGP0.00000.31340.150310.03809 ± 0.0056‡
22C6–41100.46460.31340.108170.01855 ± 0.0048‡
23C6–5110’0.07690.39890.170720.00781 ± 0.0028†
24C6–51200.07690.39890.141260.05664 ± 0.0066
25C6–54100.55690.5014−0.026140.67383 ± 0.0182

*FST, population differentiation.
†p<0.01.
‡p<0.05.
§p<0.001.

Genetic variation in haplotype diversity at 25 polymorphic loci in chromosome 6 of Cryptosporidium hominis (see Table 2 for identification of loci). Red squares indicate subtype IbA10G2 and black circles indicate non-IbA10G2 subtypes. A homogeneity (reduced haplotype diversity) of subtype IbA10G2 was seen in 4 loci around the 60-kDa glycoprotein gene. *FST, population differentiation.
†p<0.01.
‡p<0.05.
§p<0.001.

Discussion

Like other apicomplexan parasites, the life cycle of Cryptosporidium spp. has a sexual phase, during which sexual recombination can occur between genetically distinct strains (). C. parvum, the species that infects humans and some animals, undergoes meiotic recombination between different lineages in genetic-crossing experiments (,). LD analyses of natural C. parvum populations have also shown genetic recombination in most study areas (,–). In contrast, C. hominis is primarily infectious to humans, and previous MLST studies showed a clonal population structure. Genetic recombination was believed to be rare or nonexistent in C. hominis (,,–). However, the small number of markers used in previous studies might have resulted in relatively low resolution in population structure analysis, which could have led to failure in detecting genetic recombination in C. hominis. In the present study, we examined population substructure of C. hominis in a cohort of children living in a small study area by using 32 genetic markers. On the basis of multilocus sequence data and allelic profiles for 53 specimens, analysis showed strong LD among 25 polymorphic loci, suggesting an overall nonpanmictic population structure of C. hominis. A recombination test showed only limited genetic recombination at the gp60 locus. Thus, the high level of LD and limited recombination found in the overall population could be explained by an essential clonal population structure of C. hominis in the field site in Peru, which is consistent with information on C. hominis population genetics in other countries (,,,). Results of LD and recombination analyses suggest that limited recombination in the study population of C. hominis occurred mostly at gp60 or loci around gp60. Thus far, the gp60 gene is the most polymorphic marker identified in the Cryptosporidium genome. Because of its high sequence heterogeneity, gp60 has become the most widely used gene in Cryptosporidium spp. subtyping, which categorizes C. hominis and C. parvum into several subtype groups and various subtypes within each subtype group (). The gp60 gene encodes glycoproteins gp15 and gp45, which are implicated in attachment to and invasion of host cells (,). Because attachment of sporozoites to epithelial cells and invasion of the host cell membrane are crucial steps in the pathogenesis of cryptosporidiosis, these 2 glycoproteins are presumed to be surface-associated virulence determinants that may be under host immune selection, which might explain the extensive polymorphism in the gp60 gene (). Genetic recombination appears to be associated with high sequence polymorphism in the gp60 gene (). However, a less speculative understanding of the role the gp60 gene in pathogenesis of cryptosporidiosis requires further investigations. Recent studies have suggested that telomeric/subtelomeric regions are highly polymorphic and might encode putative virulence factors (,). However, these studies did not compare phenotypic differences among isolates, and data for the present study do not relate directly to sequence variations at telomeres. In the present study, because C. hominis subtype IbA10G2 was shown to be more virulent than other subtypes in the study community (), we compared population genetics of IbA10G2 and other gp60 subtypes. Heterogeneity within subtype IbA10G2 was observed in STRUCTURE and Network analyses compared with homogeneity within each of the remaining subtypes. This finding suggested a mixture of ancestral genetic elements and genetic recombination in virulent subtype IbA10G2. This finding was confirmed by incomplete LD and several recombination events (4 Rms) found in IbA10G2. In addition, the pattern of shared ancestral types in this study suggests that genetic exchange might have occurred between the ancestor of subtype IbA10G2 and the ancestor of subtype IaA13R8 or IeA11G3T3. Pairwise FST analysis of 25 polymorphic loci in chromosome 6 between the virulent IbA10G2 subtype and non-IbA10G2 subtypes showed population differentiation at 4 loci around gp60. When a locus shows extraordinary levels of genetic population differentiation compared with other loci, this finding might be interpreted as evidence for positive selection (). Thus, the region around gp60 in subtype IbA10G2 might be under selection pressure. This finding was further confirmed by comparison of haplotype diversity of the 2 groups at the 25 polymorphic loci. Although a similar Hd was found between the 2 groups in most regions of chromosome 6, a region of Hd reduction was observed in subtype IbA10G2 in a 129-kb region flanking gp60, compared with the non-IbA10G2 subtypes. One explanation for this pattern of genetic diversity is that the region surrounding gp60 was probably affected by selective sweep or genetic hitchhiking caused by selection of the virulence subtype. A hallmark of a selective sweep is a chromosomal region with reduced diversity associated with a specific phenotype. In previous studies of other apicomplexan parasites, similar patterns of reduced genetic variations were observed in chromosomal regions surrounding sites under selection pressure. Wootton et al. () found a dramatic reduction in genetic variation in chloroquine-resistant parasites within a region spanning >200 kb around the Plasmodium falciparum chloroquine-resistance transporter gene in chromosome 7 as the result of selection for chloroquine resistance. Nair et al. () observed decreased variation in an ≈100-kb region flanking the dihydrofolate reductase gene in chromosome 4 of P. falciparum in association with pyrimethamine resistance. Consistent with these findings, we detected reduced sequence variation around gp60 in virulent subtype IbA10G2, suggesting that the 129-kb region surrounding gp60 in chromosome 6, perhaps gp60 itself, might be involved in selection for virulent gp60 subtype IbA10G2 in C. hominis. In a comparison of the number of nonsynonymous substitutions per nonsynonymous site to the number of synonymous substitutions per synonymous site analysis of loci around gp60, we confirmed the presence of positive section at least at the CP47 locus. The reason for exclusive occurrence of genetic recombination in IbA10G2 is not clear. Because IbA10G2 is the major subtype of C. hominis responsible for numerous waterborne and foodborne outbreaks of cryptosporidiosis in many countries (), it is transmitted frequently among humans, resulting in increased probability of mixed infections with other genetically heterogeneous subtypes, especially in countries in Europe in which autochthonous C. hominis infections are caused mostly by IbA10G2 and imported cases by other subtypes (). In a long evolutionary process, the common occurrence and biologic fitness of the gp60 IbA10G2 subtype may facilitate genetic recombination with other C. hominis subtypes and subsequent spread of the recombinant parasite with the fitness gene. Recently, genetic recombination was shown to be a key strategy for selection of virulent clones of Toxoplasma gondii, an apicomplexan parasite with a largely clonal population structure in North America and Europe (). In conclusion, we have shown complex substructures in a natural C. hominis population in a cohort of children living in a small community in Peru. Although C. hominis from the community has an overall clonal population structure, genetic recombination occurs within subtype IbA10G2 around the gp60 locus, which might be involved in pathogenicity. Common occurrence of its parental subtypes and biologic fitness of the recombinant subtype with the IbA10G2 trait have probably facilitated genetic exchange and spread of the virulent subtype. In addition, we were able to localize selection for the virulent subtype IbA10G2 to a 129-kb region surrounding gp60 in chromosome 6. These observations could improve our understanding of emergence and spread of virulent C. hominis subtypes.
  39 in total

1.  Development of a multilocus sequence typing tool for Cryptosporidium hominis.

Authors:  Wangeci Gatei; C Anthony Hart; Robert H Gilman; Pradeep Das; Vitaliano Cama; Lihua Xiao
Journal:  J Eukaryot Microbiol       Date:  2006       Impact factor: 3.346

2.  Molecular characterization of the Cryptosporidium parvum IOWA isolate kept in different laboratories.

Authors:  Vitaliano A Cama; Michael J Arrowood; Ynes R Ortega; Lihua Xiao
Journal:  J Eukaryot Microbiol       Date:  2006       Impact factor: 3.346

3.  Tripping over emerging pathogens around the world: a phylogeographical approach for determining the epidemiology of Porcine circovirus-2 (PCV-2), considering global trading.

Authors:  Pedro M P Vidigal; Claudio L Mafra; Fernanda M F Silva; Juliana L R Fietto; Abelardo Silva Júnior; Márcia R Almeida
Journal:  Virus Res       Date:  2011-10-26       Impact factor: 3.303

4.  Molecular characterization of cryptosporidium oocysts in samples of raw surface water and wastewater.

Authors:  L Xiao; A Singh; J Limor; T K Graczyk; S Gradus; A Lal
Journal:  Appl Environ Microbiol       Date:  2001-03       Impact factor: 4.792

5.  Molecular epidemiology and spatial distribution of a waterborne cryptosporidiosis outbreak in Australia.

Authors:  Liette S Waldron; Belinda C Ferrari; Cristel Cheung-Kwok-Sang; Paul J Beggs; Nicola Stephens; Michelle L Power
Journal:  Appl Environ Microbiol       Date:  2011-09-09       Impact factor: 4.792

6.  Identification of Cpgp40/15 Type Ib as the predominant allele in isolates of Cryptosporidium spp. from a waterborne outbreak of gastroenteritis in South Burgundy, France.

Authors:  Seth Cohen; Frédéric Dalle; Anne Gallay; Marc Di Palma; Alain Bonnin; Honorine D Ward
Journal:  J Clin Microbiol       Date:  2006-02       Impact factor: 5.948

7.  Simultaneous increase of Cryptosporidium infections in the Netherlands, the United Kingdom and Germany in late summer season, 2012.

Authors:  N Fournet; M P Deege; A T Urbanus; G Nichols; B M Rosner; R M Chalmers; R Gorton; K G Pollock; J W van der Giessen; P C Wever; J W Dorigo-Zetsma; B Mulder; T G Mank; I Overdevest; J G Kusters; W van Pelt; L M Kortbeek
Journal:  Euro Surveill       Date:  2013-01-10

8.  Molecular epidemiology of cryptosporidiosis in children in Malawi.

Authors:  Michael M Peng; Steve R Meshnick; Nigel A Cunliffe; Benson D M Thindwa; C Anthony Hart; Robin L Broadhead; Lihua Xiao
Journal:  J Eukaryot Microbiol       Date:  2003       Impact factor: 3.346

9.  Cryptosporidium species and subtypes and clinical manifestations in children, Peru.

Authors:  Vitaliano A Cama; Caryn Bern; Jacqueline Roberts; Lilia Cabrera; Charles R Sterling; Ynes Ortega; Robert H Gilman; Lihua Xiao
Journal:  Emerg Infect Dis       Date:  2008-10       Impact factor: 6.883

10.  Geographic linkage and variation in Cryptosporidium hominis.

Authors:  Rachel M Chalmers; Stephen J Hadfield; Colin J Jackson; Kristin Elwin; Lihua Xiao; Paul Hunter
Journal:  Emerg Infect Dis       Date:  2008-03       Impact factor: 6.883

View more
  34 in total

1.  Development of a multilocus sequence typing tool for high-resolution subtyping and genetic structure characterization of Cryptosporidium ubiquitum.

Authors:  Ying Tang; Na Li; Mingxin Song; Dawn M Roellig; Yaoyu Feng; Lihua Xiao
Journal:  Infect Genet Evol       Date:  2016-09-12       Impact factor: 3.342

2.  Evolutionary processes in populations of Cryptosporidium inferred from gp60 sequence data.

Authors:  Juan C Garcia-R; David T S Hayman
Journal:  Parasitol Res       Date:  2017-05-13       Impact factor: 2.289

3.  Genomic Variation in IbA10G2 and Other Patient-Derived Cryptosporidium hominis Subtypes.

Authors:  Per Sikora; Sofia Andersson; Jadwiga Winiecka-Krusnell; Björn Hallström; Cecilia Alsmark; Karin Troell; Jessica Beser; Romanico B G Arrighi
Journal:  J Clin Microbiol       Date:  2016-12-21       Impact factor: 5.948

Review 4.  Comparative genomics: how has it advanced our knowledge of cryptosporidiosis epidemiology?

Authors:  Yingying Fan; Yaoyu Feng; Lihua Xiao
Journal:  Parasitol Res       Date:  2019-11-14       Impact factor: 2.289

5.  Isolation and enrichment of Cryptosporidium DNA and verification of DNA purity for whole-genome sequencing.

Authors:  Yaqiong Guo; Na Li; Colleen Lysén; Michael Frace; Kevin Tang; Scott Sammons; Dawn M Roellig; Yaoyu Feng; Lihua Xiao
Journal:  J Clin Microbiol       Date:  2014-12-17       Impact factor: 5.948

6.  Cyclospora cayetanensis infections among diarrheal outpatients in Shanghai: a retrospective case study.

Authors:  Yanyan Jiang; Zhongying Yuan; Guoqing Zang; Dan Li; Ying Wang; Yi Zhang; Hua Liu; Jianping Cao; Yujuan Shen
Journal:  Front Med       Date:  2018-01-26       Impact factor: 4.592

7.  Multilocus sequence typing of an emerging Cryptosporidium hominis subtype in the United States.

Authors:  Yaoyu Feng; Narry Tiao; Na Li; Michele Hlavsa; Lihua Xiao
Journal:  J Clin Microbiol       Date:  2013-12-04       Impact factor: 5.948

8.  Cryptosporidiosis Outbreak in Immunocompetent Children from a Remote Area of French Guiana.

Authors:  Emilie Mosnier; Noémie Martin; Romy Razakandrainibe; Frédéric Dalle; Guillaume Roux; Arnaud Buteux; Loïc Favennec; Paul Brousse; Basma Guarmit; Denis Blanchet; Loïc Epelboin; Camille Girouin; Elise Martin; Félix Djossou; Mathieu Nacher; Magalie Demar
Journal:  Am J Trop Med Hyg       Date:  2018-04-19       Impact factor: 2.345

9.  Evolutionary genomics of anthroponosis in Cryptosporidium.

Authors:  Cock van Oosterhout; Kevin M Tyler; Johanna L Nader; Thomas C Mathers; Ben J Ward; Justin A Pachebat; Martin T Swain; Guy Robinson; Rachel M Chalmers; Paul R Hunter
Journal:  Nat Microbiol       Date:  2019-03-04       Impact factor: 17.745

Review 10.  Molecular Epidemiology of Human Cryptosporidiosis in Low- and Middle-Income Countries.

Authors:  Xin Yang; Yaqiong Guo; Lihua Xiao; Yaoyu Feng
Journal:  Clin Microbiol Rev       Date:  2021-02-24       Impact factor: 26.132

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.