| Literature DB >> 27129308 |
Shiyou Liu1,2, Lin Wang1, Huajun Zheng3, Zhixiao Xu1, Dawn M Roellig2, Na Li1,2, Michael A Frace4, Kevin Tang4, Michael J Arrowood2, Delynn M Moss2, Longxian Zhang5, Yaoyu Feng6, Lihua Xiao7.
Abstract
BACKGROUND: Cyclospora cayetanensis is an apicomplexan that causes diarrhea in humans. The investigation of foodborne outbreaks of cyclosporiasis has been hampered by a lack of genetic data and poor understanding of pathogen biology. In this study we sequenced the genome of C. cayetanensis and inferred its metabolism and invasion components based on comparative genomic analysis.Entities:
Keywords: Apicomplexan; Cyclospora; Evolution; Genetics; Genome; Genomics
Mesh:
Substances:
Year: 2016 PMID: 27129308 PMCID: PMC4851813 DOI: 10.1186/s12864-016-2632-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparison of genomic features of Cyclospora cayetanensis (Ccay) and other apicomplexan parasitesa
| Category | Cpar | Pfal | Bbov | Tgon | Eten | Ccay |
|---|---|---|---|---|---|---|
| No. of chromosomes | 8 | 14 | 4 | 14 | 14 | - |
| Total length of assembly (Mb) | 9.10 | 22.85 | 8.18 | 65.67 | 51.86 | 44.03 |
| No. of super contigs | 8 | 16 | 14 | 2,263 | 4,664 | 2,297 |
| GC content (%) | 30.3 | 20.0 | 41.5 | 48.5 | 52.5 | 51.8 |
| No. of genes | 3,805 | 5,542 | 3,706 | 8,322 | 8,597 | 7,457 |
| Total length of CDS (Mb) | 6.83 | 12.58 | 5.58 | 20.03 | 13.05 | 11.92 |
| GC content in CDS (%) | 31.9 | 25.0 | 43.7 | 56.0 | 58.1 | 55.8 |
| Mean length of genes (bp) | 1,720 | 2,271 | 1,506 | 2,407 | 1,518 | 1,599 |
| Gene density (genes/Mb) | 418.1 | 242.5 | 453.1 | 126.7 | 165.8 | 169.4 |
| Percent coding (%) | 75.0 | 55.1 | 68.2 | 30.5 | 25.2 | 27.1 |
| No. of genes with intron | 163 | 3,055 | 2,241 | 6,729 | 6,563 | 6,358 |
| % genes with introns | 4.2 | 55.1 | 60.5 | 80.9 | 76.3 | 85.3 |
| No. of tRNA | 45 | 72 | 70 | 174 | - | 144 |
| No. of tRNAMet | 2 | 2 | 4 | 8 | - | 7 |
| No. of rRNAb | 15 | 28 | - | 420 | 4 | 11 |
| No. of proteins with signal peptide | 397 | 638 | 350 | 759 | 775 | 538 |
| No. of proteins with apicoplast targeting signal | (22) | 189 | 99 | 148 | 182 | 105 |
| No. of proteins with transmembrane domain | 832 | 1,754 | 677 | 1,103 | 1,378 | 1,247 |
| No. of proteins with GPI-anchor | 63 | 62 | 51 | 255 | 371 | 225 |
| Apicoplast genome size (bp) | - | 34,682 | 33,351 | 34,996 | 34,750 | 34,155 |
| Mitochondrial genome size (bp) | - | 5,967 | 6,005 | ~6,000c | 6,213 | 6,229 |
aSources of data: Cryptosporidium parvum (Cpar): CryptoDB release-6.0; Plasmodium falciparum (Pfal): PlasmoDB release-11.1; Babesia bovis (Bbov): PiroplasmaDB release-5.1; Toxoplasma gondii (Tgon): ToxoDB release-11.0; Eimeria tenella (Eten): ToxoDB release-11.0. Data on proteins with signal peptides, apicoplast targeting signal peptides and GPI-anchors were based on calculations using software specified in Methods. Dashes indicate the lack of data (for E. tenella) or the absence of organelles (for C. parvum)
bBased on annotation; actual numbers are greater due to the repetitive nature of the rRNA unit
cBased on Seeber et al. (2014) [72]
Fig. 1Alternating regions of repeat-rich and repeat-poor sequences and common short tandem repeats (STRs) in the Cyclospora cayetanensis genome. A total of 1,700 STRs were detected in the C. cayetanensis genome a Alternating pattern of repeat-rich and repeat-poor within several large contigs of the C. cayetanensis genome. The black bands represent STRs. b Distribution of STRs. The most common STRs are the “CAG” motif and its variations, like seen in Eimeria spp. c Jitter plot showing length distribution of common STRs
Fig. 2Orthologs in the predicated proteome of Cyclospora cayetanensis. a In addition to alveolates, a few of the orthologs of C. cayetanensis are from other organisms, probably resulted from lateral gene transfers. b Functional protein domains shared by apicomplexan parasites Cryptosporidium parvum, Plasmodium falciparum, Toxoplasma gondii, Eimeria tenella and C. cayetanensis. c Phylogenetic relationship of C. cayetanensis and other common apicomplexan parasites based on a neighbor-joining analysis of concatenated protein sequences from 100 orthologs; a concatenated sequence from the free-living photosynthetic chromerid, Chromera velia was used to root the tree. The maximum composite likelihood method was used in the calculation of genetic distances. Numbers on branches are percent bootstrap values >50 from 1,000 replications. d Comparison of major protein families potentially involved in host cell invasion among common apicomplexan parasites. Taxa name abbreviations: Cryptosporidium parvum (Cpar); Plasmodium falciparum (Pfal); Babesia bovis (Bbov); Toxoplasma gondii (Tgon); Eimeria tenella (Eten); Cyclospora cayetanensis (Ccay)
Comparison of some essential metabolic pathways among common apicomplexan parasitesa
| Category | Metabolic pathway | Cpar | Pfal | Bbov | Tgon | Eten | Ccay |
|---|---|---|---|---|---|---|---|
| Carbohydrate and energy metabolism | Glycolysis | + | + | + | + | + | + |
| Degradation of propionyl-CoA into pyruvate and succinate | - | - | - | + | - | + | |
| TCA cycle | - | + | + | + | + | + | |
| Pentose phosphate pathway | - | + | + | + | + | + | |
| Shikimate biosynthesis | - | + | - | + | + | + | |
| Folate biosynthesis | - | + | - | + | + | + | |
| Synthesis of tetrahydrobiopterin/dihydrobiopterin/molybdopterin | - | - | - | + | - | - | |
| Galactose metabolism | - | - | - | + | + | + | |
| Synthesis of starch | + | - | - | + | + | + | |
| Synthesis of trehalose | + | - | + | + | + | + | |
| Synthesis of 1,3-beta-glucan | - | - | - | + | + | + | |
| Conversion between UDP-Glc and UDP-Gal | + | - | - | + | + | + | |
| Conversion between GDP-Man and GDP-Fuc | - | + | - | + | + | + | |
| Conversion of UDP-Glc to UDP-GlcA then to UDP-Xyl | + | - | - | - | - | - | |
| Synthesis of mannitol from mannose or fructose | + | - | - | - | + | + | |
| Fatty acid biosynthesis in cytosol (FAS I) | + | - | - | + | + | + | |
| Fatty acid biosynthesis in apicoplast (FAS II) | - | + | - | + | + | + | |
| Fatty acid degradation | - | - | - | + | + | + | |
| Oxidative phosphorylation (NADH dehydrogenase) | + | + | + | + | + | + | |
| Oxidative phosphorylation (Complex II) | - | + | + | + | + | + | |
| Oxidative phosphorylation (Complex III) | - | + | + | + | + | + | |
| Oxidative phosphorylation (Complex IV) | - | + | + | + | + | + | |
| F-ATPase | 2 subunits | + | + | + | + | + | |
| V-ATPase | + | + | + | + | + | + | |
| Glyoxalase metabolism producing D-lactate | - | + | + | + | + | + | |
| Synthesis of isoprene (MEP/DOXP) | - | + | + | + | + | + | |
| Nucleotide metabolism | Synthesis of purine rings | - | - | - | - | - | - |
| Synthesis of pyrimidine | - | + | + | + | + | + | |
| Amino acid metabolism | Synthesis of alanine from pyruvate | - | - | - | + | + | + |
| Synthesis of glutamate from nitrite/nitrate | - | + | + | + | + | + | |
| Conversion from glutamate to glutamine | + | + | - | + | + | - | |
| Synthesis of aspartate from oxaloacetate and glutamate | - | + | + | + | + | + | |
| Conversion from aspartate to asparagine | + | + | - | + | + | + | |
| Conversion from glutamate to proline | + | - | - | + | + | + | |
| Synthesis of serine from glycerate/glycerol phosphate | - | - | - | + | + | + | |
| Conversion from serine to cysteine | - | - | - | + | + | + | |
| Conversion from serine to glycine | + | + | + | + | + | + | |
| Recycle homocysteine into methionine | - | + | - | + | - | - | |
| Synthesis of lysine from aspartate | - | - | - | + | - | - | |
| Synthesis of threonine from aspartate | - | - | - | + | - | - | |
| Synthesis of ornithine from arginine | - | + | - | - | - | - | |
| Synthesis of ornithine from proline | - | + | - | + | + | + | |
| Synthesis of polyamine from ornithine | - | + | - | - | - | - | |
| Polyamine pathway backward | + | - | - | + | + | + | |
| Degradation of leucine to acetyl-CoA | - | - | - | + | - | - | |
| Degradation of isoleucine/valine | - | - | - | + | + | + | |
| Aromatic amino acid hydroxylases (AAAH) | - | - | - | + | - | - | |
| Vitamin and others | Synthesis of thiamine (vitamin B1) | - | + | - | - | - | - |
| Conversion from thiamine to thiamine pyrophosphate (TPP) | - | + | - | + | - | + | |
| Synthesis of FMN/FAD from riboflavin | - | + | + | + | + | + | |
| Synthesis of pyridoxal phosphate (vitamin B6) | - | + | - | + | - | - | |
| Synthesis of NAD(P) + | - | + | - | + | + | + | |
| Synthesis of pantothenate from valine | - | - | - | + | + | + | |
| Synthesis of CoA from pantothenate | + | + | + | + | + | + | |
| Synthesis of lipoic acid | - | + | - | + | + | + | |
| Salvage lipoic acid in mitochondria | - | + | + | + | - | + | |
| Synthesis of porphyrin/cytochrome proteins | - | + | - | + | + | + |
aPlus symbol denotes that the essential enzymes for pathways were identified, whereas minus symbol denotes that the essential enzymes for pathways were absent. Only 2 subunits of the F-type ATPase are present in Cryptosporidium parvum. Abbreviation: Cryptosporidium parvum (Cpar); Plasmodium falciparum (Pfal); Babesia bovis (Bbov); Toxoplasma gondii (Tgon); Eimeria tenella (Eten); Cyclospora cayetanensis (Ccay)
Putative transporters in common apicomplexan parasites*
*The detection of putative transporter proteins was based on Pfam search. Abbreviation: Cryptosporidium parvum (Cpar); Plasmodium falciparum (Pfal); Babesia bovis (Bbov); Toxoplasma gondii (Tgon); Eimeria tenella (Eten); Cyclospora cayetanensis (Ccay)
**ABC transporter and mitochondrial carrier have a broad range of substrates
Fig. 3Post-translational modifications of proteins in Cyclospora cayetanensis and other apicomplexans. a Core structure of GPI-anchor precursor and critical enzymes involved in its biosynthesis. Genes encoding PIG-V and PIG-B (shown in red) are absent in the C. cayetanensis genome. b Structure of hypothetical N-glycan precursors in different apicomplexans. Due to the secondary loss of ALG-encoding genes, the precursors of N-glycan are divergent in apicomplexans from 10 sugars in Toxoplasma gondii to 2 sugars in Plasmodium falciparum and Eimeria tenella. C. cayetanensis possesses a 7 sugar precursor which is just enough for glycosylation but not sufficient for N-glycan-dependent quality control of protein folding. c Phylogenetic relationship of ppGalNAc-Ts, the critical enzyme in mucin-type O-glycan biosynthesis, from Cryptosporidium parvum (Cp), T. gondii (Tg), E. tenella (ETH), and C. cayetanensis (cyc) based on a neighbor-joining analysis using genetic distances calculated with the maximum composite likelihood method. Numbers on branches are percent bootstrap values >50 from 1,000 replications. C. cayetanensis has ppGalNAc-Ts similar to other coccidia, especially E. tenella
Predicted rhoptry protein kinases (ROPKs) in Cyclospora cayetanensis using HMM profiles search and their orthologs in other coccidia
| Gene ID | Best hit HMM family | E-value | Score | General PK score |
|
|
|
|---|---|---|---|---|---|---|---|
| cyc_02428 | ROP21/27 | 1.4E-104 | 348.6 | 94.4 | ETH_00014495 | EfaB_PLUS_7742.g778 | TGME49_263220 |
| cyc_03750 | ROP21/27 | 3.1E-100 | 334.4 | 82.9 | EfaB_PLUS_47595.g2679 | TGME49_313330 | |
| cyc_04230 | ROP35 | 1.6E-39 | 134.9 | 40.3 | ETH_00005905 | EfaB_MINUS_42996.g2710 | |
| cyc_03158 | ROP35 | 4.3E-83 | 277.5 | 89.3 | ETH_00026495 | EfaB_PLUS_8664.g829 | TGME49_304740 |
| cyc_00988 | ROPK-Eten1 | 3.0E-108 | 361.3 | 75.0 | ETH_00027705 | EfaB_PLUS_15899.g1411 | |
| cyc_00989 | ROPK-Eten1 | 2.6E-77 | 259.6 | 79.2 | ETH_00027695 | ||
| cyc_03944 | ROPK-Eten1 | 4.0E-29 | 100.8 | 57.2 | ETH_00027700 | ||
| cyc_05579 | ROPK-Eten2a | 3.9E-60 | 202.5 | 78.4 | ETH_00028765 | ||
| ROPK-Eten2b | ETH_00028855 | ||||||
| cyc_08168 | ROPK-Eten3 | 1.1E-35 | 122.2 | 40.4 | ETH_00020585 | ||
| ROPK-Eten3 | ETH_00020615 ETH_00020590 | ||||||
| ETH_00020610 ETH_00005840 | |||||||
| ETH_00021185 ETH_00020620 | |||||||
| ROPK-Eten4 | ETH_00000075 ETH_00000080 | ||||||
| ROPK-Eten5 | ETH_00005415 ETH_00005400 | ||||||
| ETH_00005405 ETH_00005410 | |||||||
| cyc_02713 | ROPK-Eten6 | 1.6E-66 | 223.3 | 64.0 | ETH_00002510 | EfaB_MINUS_32658.g2475 | |
| cyc_05580 | ROPK-Unique | 1.3E-71 | 240.3 | 78.2 | ETH_00028835 | EfaB_MINUS_17096.g1521 | |
| cyc_04110 | ROPK-Unique | 4.1E-56 | 189.4 | 28.0 | ETH_00013325 | EfaB_PLUS_24117.g1969 | |
| cyc_07646 | ROPK-Unique | 1.1E-48 | 165.0 | 41.4 | ETH_00005170 | EfaB_PLUS_33184.g2393 | |
| ROPK-Unique | ETH_00005335 |
Fig. 4Phylogeny of putative rhoptry protein kinase (ROPK) families and conserved catalytic domains in Eimeria tenella, E. falciformis and Cyclospora cayetanensis. a Phylogenetic relationship among ROPKs. The neighbor-joining phylogenetic tree was constructed based on genetic distances calculated using the maximum composite likelihood method. Numbers on branches are percent bootstrap values >50 from 1,000 replications. The ROPKs from C. cayetanensis are shown in bold. b Catalytic kinase subdomains and key residues of ROPK subfamilies. The catalytic lysine in subdomain II and aspartic acids in subdomain VIb and VII are considered the key residues in active protein kinases. Their presence along with five other residues in different subdomains suggests that sub-families ROP21/27, ROP35, and ROPK-Eten1 are active protein kinases; the remaining sub-families are either inactive or potentially non-canonical rhoptry kinases. In C. cayetanensis, at least four ROPKs, including cyc_02428 from the ROP21/27 subfamily, cyc_03158 from the ROP35 subfamily, cyc_00989 and cyc_00988 from the ROPK-Eten1 subfamily, are active protein kinases