| Literature DB >> 31117941 |
Zhixiao Xu1, Yaqiong Guo2, Dawn M Roellig3, Yaoyu Feng4,5, Lihua Xiao6.
Abstract
BACKGROUND: Cryptosporidiosis is a major cause of gastrointestinal diseases in humans and other vertebrates. Previous analyses of invasion-related proteins revealed that Cryptosporidium parvum, Cryptosporidium hominis, and Cryptosporidium ubiquitum mainly differed in copy numbers of secreted MEDLE proteins and insulinase-like proteases and sequences of mucin-type glycoproteins. Recently, Cryptosporidium chipmunk genotype I was identified as a novel zoonotic pathogen in humans. In this study, we sequenced its genome and conducted a comparative genomic analysis.Entities:
Keywords: ABC transporters; Comparative genomics; Cryptosporidium chipmunk genotype I; Insulinase-like proteases; MEDLE family proteins
Mesh:
Substances:
Year: 2019 PMID: 31117941 PMCID: PMC6532270 DOI: 10.1186/s12864-019-5788-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genomic features of Cryptosporidium chipmunk genotype I in comparison with some other Cryptosporidium spp
|
|
|
|
|
|
| |||
|---|---|---|---|---|---|---|---|---|
| Total length (Mb) | 9.05 | 9.1 | 9.06 | 8.97 | 8.97 | 8.5 | 9.09 | 9.21 |
| No. of super contigs | 50 | 8 | 97 | 57 | 27 | 153 | 135 | 45 |
| GC content (%) | 32 | 30.3 | 30.1 | 31 | 30.8 | 24.3 | 28.5 | 28.4 |
| Nucleotide sequence identity (%) | – | 82.25 | 82.48 | 81.22 | 78.65 | 46.08 | 26.44 | 26.89 |
| Number of genes | 3783 | 3805 | 3819 | 3782 | 3767 | 3728 | 3905 | 3937 |
| Total length of CDS (Mb) | 6.94 | 6.83 | 6.81 | 6.91 | 6.94 | 6.69 | 6.86 | 6.93 |
| GC content in CDS (%) | 33.6 | 31.9 | 31.8 | 32.4 | 33 | 25.6 | 30.1 | 30 |
| Amino acid sequence identity (%) | – | 83.49 | 83.99 | 81.68 | 79.04 | 58.89 | 47.03 | 47.22 |
| GC content at 3rd position in codons (%) | 26.9 | 22.5 | 23.5 | 24.1 | 24.5 | 12.6 | 18.1 | 17.8 |
| Gene density (gene/Mb) | 418 | 418.1 | 421.5 | 421.6 | 420 | 438.6 | 429.6 | 427.5 |
| Percent coding (%) | 76.7 | 75 | 75.2 | 77 | 77.4 | 78.7 | 75.5 | 75.2 |
| No. of genes with intron | 515 | 163 | 417 | 506 | 758 | 763 | 832 | 798 |
| Genes with intron (%) | 13.6 | 4.2 | 10.9 | 13.4 | 20.1 | 20.5 | 21.3 | 20.3 |
| No. of tRNA | 45 | 45 | 45 | 45 | 45 | 46 | 44 | 45 |
| No. of tRNAmet | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
| Proteins with signal peptide | 396 | 397 | 391 | 397 | 399 | 344 | 309 | 323 |
| Proteins with transmembrane domain | 793 | 832 | 817 | 805 | 772 | 813 | 839 | 836 |
| Proteins with GPI anchor | 57 | 63 | 54 | 55 | 50 | 57 | 47 | 52 |
Fig. 1Syntenic relationship and shared orthologous genes among Cryptosporidium spp. a Syntenic relationship in gene organization among genomes of Cryptosporidium chipmunk genotype I, Cryptosporidium parvum, C. hominis, C. ubiquitum, C. baileyi, and C. andersoni. Syntenic blocks (regions with orthologous genes) are connected with lines, with the colors representing 8 chromosomes of C. parvum. b Venn diagram of orthologous genes shared by five Cryptosporidium spp. Abbreviations of taxa: Cryptosporidium parvum IOWA (Cpa); C. hominis Ude (Cho); C. meleagridis (Cme); Cryptosporidium chipmunk genotype I (Cch); C. ubiquitum (Cub)
Fig. 2Phylogenetic relationship of Cryptosporidium spp. a Phylogenetic relationship of Cryptosporidium spp. based on maximum likelihood analysis of sequences of 100 shared proteins. b Phylogenetic relationship of Cryptosporidium spp. based on maximum likelihood analysis of TRAP sequences. c Phylogenetic relationship of Cryptosporidium spp. based on maximum likelihood analysis of mucin-type glycoproteins. d Phylogenetic relationship of Cryptosporidium spp. based on maximum likelihood analysis of insulinase-like proteases
Fig. 3Protein architecture network based on sequence similarity of all proteins in proteomes of Cryptosporidium chipmunk genotype I, Cryptosporidium parvum, and Cryptosporidium meleagridis. a Proteins of Cryptosporidium chipmunk genotype I, C. parvum, and C. meleagridis, represented by the colors red, green, and blue, respectively. b Identity of major clusters in the Cryptosporidium proteome
Fig. 4Mitochondrial metabolism of Cryptosporidium chipmunk genotype I. Abbreviation of enzymes: AOX: alternative oxidase; PNO: pyruvate:NADP(+) oxidoreductase; MAT: methionine adenosyl transferase; MQO: malate:quinone oxidoreductase. Abbreviation of metabolites: Q: ubiquinone (coenzyme Q); CoA: coenzyme A; IPP: isopentenyl diphosphate; DMAPP: dimethylallyl diphosphate; FPP: farnesyl diphosphate; PPP: polyprenyl diphosphate. Abbreviation of transporter proteins: AATP: amino acids transporter protein; ACTP: acetyl-CoA transporter protein
Putative transporters in Cryptosporidium spp. and some other common apicomplexan parasitesa
| Substrates | Cellular location | Cchi | Cpar | ChomUde | Cmel | Cubi | Cbai | Cand | Cmur | Pfal | Tgon |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Hexose | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 3 | 2 | 5 | |
| Triose phosphate | Plasma/Apicoplast membrane | 7 | 8 | 8 | 8 | 8 | 7 | 8 | 8 | 4 | 4 |
| Amino acids | Plasma membrane | 10 | 10 | 10 | 10 | 10 | 10 | 12 | 12 | 1 | 6 |
| Nucleobase/nucleoside | Plasma membrane | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 4 | 4 |
| Nucleotide-sugar | Plasma membrane | 3 | 3 | 3 | 3 | 3 | 2 | 2 | 2 | 1 | 4 |
| Folate/pterine | Plasma membrane | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 7 |
| Formate/nitrite | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 3 | |
| GABA (aminobutanoate) | Plasma/Mitochondrial membrane | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 5 |
| Acetyl-CoA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Chloride | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | |
| Inorganic phosphate | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | |
| Sulfate | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 4 | |
| Sodium/potassium/calcium | 2 | 2 | 2 | 2 | 2 | 2 | 3 | 3 | 0 | 9 | |
| Zinc | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 4 | |
| Copper | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 3 | |
| Choline | Plasma membrane | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| Cadmium/zinc/cobalt (efflux) | Plasma membrane | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Glycerol/water | Plasma membrane | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 |
| ABC transporter | Plasma membrane | 21 | 21 | 21 | 21 | 21 | 22 | 21 | 21 | 16 | 24 |
| Mitochondrial carrier | Mitochondrial membrane | 8 | 9 | 9 | 8 | 6 | 6 | 13 | 12 | 14 | 21 |
Cchi Cryptosporidium chipmunk genotype I, Cpar Cryptosporidium parvum, ChomUde C. hominis UdeA01, Cmel C. meleagridis, Cubi C. ubiquitum, Cbai C. baileyi, Cand C. andersoni, Cmur C. muris, Pfal Plasmodium falciparum, Tgon Toxoplasma gondii
aThe detection of these transporter proteins was based on the Pfam search results
Comparison of essential metabolic pathways among Cryptosporidium spp. and some other common apicomplexan parasites
| Category | Metabolic pathway | Cchi | Cpar | Chom | Cmel | Cubi | Cbai | Cand | Pfal | Tgon |
|---|---|---|---|---|---|---|---|---|---|---|
| Carbohydrate and energy metabolism | Glycolysis | + | + | + | + | + | + | + | + | + |
| Methylcitrate cycle | – | – | – | – | – | – | – | – | + | |
| TCA cycle | – | – | – | – | – | – | + | + | + | |
| Pentose phosphate pathway | – | – | – | – | – | – | – | + | + | |
| Shikimate biosynthesis | – | – | – | – | – | – | – | + | + | |
| Folate biosynthesis | – | – | – | – | – | – | – | + | + | |
| Synthesis of pterin | – | – | – | – | – | – | – | – | + | |
| Galactose metabolism | – | – | – | – | – | – | – | – | + | |
| Synthesis of starch | + | + | + | + | + | + | + | – | + | |
| Synthesis of trehalose | + | + | + | + | + | + | + | – | + | |
| Synthesis of 1,3-beta-glucan | – | – | – | – | – | – | – | – | + | |
| Conversion between UDP-Glc and UDP-Gal | + | + | + | + | + | + | + | – | + | |
| Conversion between GDP-Man and GDP-Fuc | – | – | – | – | – | – | – | + | + | |
| Conversion from UDP-Glc to UDP-GlcA to UDP-Xyl | + | + | + | + | + | + | + | – | – | |
| Synthesis of mannitol from fructose | + | + | + | + | + | + | + | – | – | |
| Fatty acid biosynthesis in cytosol (FAS I) | + | + | + | + | + | + | + | – | + | |
| Fatty acid biosynthesis in apicoplast (FAS II) | – | – | – | – | – | – | – | + | + | |
| Fatty acid degradation | – | – | – | – | – | – | – | – | + | |
| Oxidative phosphorylation (NADH dehydrogenase) | + | + | + | + | + | + | + | + | + | |
| Oxidative phosphorylation (Complex II) | – | – | – | – | – | – | + | + | + | |
| Oxidative phosphorylation (Complex III) | – | – | – | – | – | – | 1 sub | + | + | |
| Oxidative phosphorylation (Complex IV) | – | – | – | – | – | – | – | + | + | |
| F-ATPase | 2 sub | 2 sub | 2 sub | 2 sub | 2 sub | 2 sub | + | + | + | |
| Alternative oxidase (AOX) | + | + | + | + | – | – | + | – | – | |
| Glyoxalase metabolism producing D-lactate | – | – | – | – | – | – | – | + | + | |
| Synthesis of isoprene (MEP/DOXP) | – | – | – | – | – | – | – | + | + | |
| Synthesis of farnesyl/polyprenyl diphosphate | + | + | + | + | – | – | + | + | + | |
| Nucleotide metabolism | Synthesis of purine rings de novo | – | – | – | – | – | – | – | – | – |
| Conversion from IMP to XMP | + | + | + | + | + | – | – | + | + | |
| Conversion from XMP to GMP | – | + | + | – | – | – | – | + | + | |
| Synthesis of pyrimidine de novo | – | – | – | – | – | – | – | + | + | |
| Amino acid metabolism | Synthesis of alanine from pyruvate | – | – | – | – | – | – | – | – | + |
| Synthesis of glutamate from nitrite/nitrate | – | – | – | – | – | – | – | + | + | |
| Conversion from glutamate to glutamine | + | + | + | + | + | + | + | + | + | |
| Synthesis of aspartate from oxaloacetate and glutamate | – | – | – | – | – | – | – | + | + | |
| Conversion from aspartate to asparagine | + | + | + | + | + | – | – | + | + | |
| Conversion from glutamate to proline | + | + | + | + | + | + | + | – | + | |
| Synthesis of serine from glycerate/glycerol phosphate | – | – | – | – | – | – | – | – | + | |
| Conversion from serine to cysteine | – | – | – | – | – | – | – | – | + | |
| Conversion from serine to glycine | + | + | + | + | + | + | + | + | + | |
| Recycle homocysteine into methionine | – | – | – | – | – | – | – | + | + | |
| Synthesis of lysine from aspartate | – | – | – | – | – | – | – | – | + | |
| Synthesis of threonine from aspartate | – | – | – | – | – | – | – | – | + | |
| Synthesis of ornithine from arginine | – | – | – | – | – | – | – | + | – | |
| Synthesis of ornithine from proline | – | – | – | – | – | – | – | + | + | |
| Synthesis of polyamine from ornithine | – | – | – | – | – | – | – | + | – | |
| Polyamine pathway backward | + | + | + | + | + | + | + | – | + | |
| Degradation of branch-chain amino acids | – | – | – | – | – | – | – | – | + | |
| Synthesis of tryptophan | + | + | + | – | + | – | – | – | – | |
| Aromatic amino acid hydroxylases (AAAH) | – | – | – | – | – | – | – | – | + | |
| Vitamin and others | Synthesis of ubiquinone (Coenzyme Q) | + | + | + | + | – | – | + | + | + |
| Synthesis of Fe-S cluster | + | + | + | + | + | + | + | + | + | |
| Synthesis of heme | – | – | – | – | – | – | – | + | + | |
| Synthesis of thiamine (Vitamin B1) | – | – | – | – | – | – | – | + | – | |
| Conversion from thiamine to thiamine pyrophosphate (TPP) | – | – | – | – | – | – | – | + | + | |
| Synthesis of FMN/FAD from riboflavin | – | – | – | – | – | – | – | + | + | |
| Synthesis of pyridoxal phosphate (Vitamin B6) de novo | – | – | – | – | – | – | – | + | + | |
| Synthesis of NAD(P) + de novo from nicotinate/nicotinamide | – | – | – | – | – | – | – | + | + | |
| Synthesis of pantothenate from valine | – | – | – | – | – | – | – | – | + | |
| Synthesis of CoA from pantothenate | + | + | + | + | + | + | + | + | + | |
| Synthesis of lipoic acid de novo in apicoplast | – | – | – | – | – | – | – | + | + | |
| Salvage of lipoic acid in mitochondria | – | – | – | – | – | – | + | + | + | |
| Synthesis of porphyrin/cytochrome proteins | – | – | – | – | – | – | – | + | + |
Plus symbols denote that these metabolic pathways were identified in this apicomplexan parasite, whereas minus symbols denote that these metabolic pathways are absent from this apicomplexan parasite. Abbreviation: Cchi Cryptosporidium chipmunk genotype I, Cpar Cryptosporidium parvum, Chom C. hominis, Cmel C. meleagridis, Cubi C. ubiquitum, Cbai C. baileyi, Cand C. andersoni, Pfal Plasmodium falciparum, Tgon Toxoplasma gondii
Fig. 5Comparison of major invasion-related protein families among Cryptosporidium species. The number of each protein family was identified based on Pfam domain search. The dark yellow cell represents the largest number of members in the protein families while the green cell represents the smallest number of members in the protein families in Cryptosporidium spp. Because of highly-fragmented draft genome of C. hominis TU502, some Pfam domains were not detected in C. hominis but observed in C. hominis UKH1. Abbreviations of taxa: Cryptosporidium chipmunk genotype I (Cchi); Cryptosporidium parvum (Cpar); C. hominis TU502 (ChomT); C. hominis UKH1 (ChomU); C. meleagridis (Cmel); C. ubiquitum (Cubi); C. baileyi (Cbai); C. andersoni (Cand); C. muris (Cmur)
Fig. 6Selective pressure in genes encoding major groups of proteins as indicated by the dN/dS ratios between Cryptosporidium chipmunk genotype I and C. parvum. Red categories represent groups of proteins with mean dN/dS ratios higher than all proteins in the proteome, while blue categories represent groups of proteins with reduced dN/dS ratios. Triangles: mean dN/dS; horizontal black line: median dN/dS
Twenty orthologous genes with the highest dN/dS ratios between Cryptosporidium chipmunk genotype I and Cryptosporidium parvum
| Gene in | Gene in | dN/dS ratio | TMHMM | Signal peptide | Subtelomeric location | Annotation |
|---|---|---|---|---|---|---|
| C_ch_8.3686 | cgd2_90 | 1.25 | YES | NO | YES | ABC transporter with 9 transmembrane domains and 2 AAA domains |
| C_ch_8.3664 | cgd2_70 | 1.24 | YES | NO | YES | ABC transporter, with 12 transmembrane domains and 2 AAA domains |
| C_ch_11.460 | cgd3_60 | 1.15 | NO | NO | YES | Putative hydrolase |
| C_ch_105.389 | cgd5_4570 | 1.11 | NO | NO | YES | Hypothetical protein with disordered regions |
| C_ch_10.167 | cgd7_640 | 1.04 | NO | NO | YES | Prp16p pre-mRNA splicing factor, HrpA family SFII helicase |
| C_ch_22.2069 | cgd6_3780 | 1.04 | YES | YES | NO | Hypothetical membrane protein with signal peptide and transmembrane domain |
| C_ch_10.307 | cgd8_5370 | 0.99 | YES | NO | YES | Conserved secreted protein |
| C_ch_37.2969 | cgd7_5510 | 0.99 | YES | NO | YES | Secreted protein |
| C_ch_105.390 | cgd6_5490 | 0.98 | NO | YES | YES | Conserved hypothetical protein with low sequence complexity regions |
| C_ch_1.56 | cgd6_50 | 0.97 | YES | NO | YES | Predicted secreted protein |
| C_ch_50.3279 | cgd1_120 | 0.95 | NO | YES | YES | Predicted secreted protein with a cysteine cluster at the C-terminus |
| C_ch_18.1418 | cgd4_2900 | 0.93 | NO | NO | NO | Polyketide synthase |
| C_ch_19.1673 | cgd4_2510 | 0.92 | NO | YES | NO | Predicted secreted protein |
| C_ch_19.1715 | cgd3_2180 | 0.92 | NO | NO | NO | Type I fatty acid synthase |
| C_ch_35.2958 | cgd5_4610 | 0.89 | YES | YES | YES | Conserved secreted protein |
| C_ch_23.2117 | cgd4_1380 | 0.86 | YES | NO | NO | ABC transporter with 2 AAA domains and 14 transmembrane regions |
| C_ch_50.3280 | cgd1_130 | 0.82 | YES | YES | YES | Predicted secreted protein with a cysteine cluster at the C-terminus |
| C_ch_17.1234 | cgd7_3440 | 0.78 | YES | YES | NO | Predicted secreted protein |
| C_ch_50.3278 | cgd1_110 | 0.77 | NO | YES | YES | Predicted secreted protein |
| C_ch_21.2011 | cgd8_40 | 0.76 | YES | YES | YES | Predicted secreted protein of |
| Subtotal | – | – | 11/20 (55.0%) | 9/20 (45.0%) | 14/20 (70.0%) | – |