| Literature DB >> 17212834 |
Nandita Mullapudi1, Cheryl A Lancto, Mitchell S Abrahamsen, Jessica C Kissinger.
Abstract
BACKGROUND: Cryptosporidium parvum is a unicellular eukaryote in the phylum Apicomplexa. It is an obligate intracellular parasite that causes diarrhea and is a significant AIDS-related pathogen. Cryptosporidium parvum is not amenable to long-term laboratory cultivation or classical molecular genetic analysis. The parasite exhibits a complex life cycle, a broad host range, and fundamental mechanisms of gene regulation remain unknown. We have used data from the recently sequenced genome of this organism to uncover clues about gene regulation in C. parvum. We have applied two pattern finding algorithms MEME and AlignACE to identify conserved, over-represented motifs in the 5' upstream regions of genes in C. parvum. To support our findings, we have established comparative real-time -PCR expression profiles for the groups of genes examined computationally.Entities:
Mesh:
Year: 2007 PMID: 17212834 PMCID: PMC1779779 DOI: 10.1186/1471-2164-8-13
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Flow-chart illustrating methodology. Pattern-finding was carried out in two iterations, first de novo and a second time using information obtained from comparative real-time PCR expression profiles.
Genes used in this study
| Hexokinase | HK | 852 | cgd6_3800 |
| Phosphoglycerokinase | PGK | 986 | cgd7_910 |
| Phosphofructokinase 1 | PFK1 | 455 | cgd3_1400 |
| Phospphofructokinase 2 | PFK2 | 575 | cgd2_2130 |
| Enolase | Eno | 364 | cgd5_1960 |
| Glyceraldehyde-3-phosphate dehydrogenase | GAPDH | 837 | cgd6_3790 |
| Fructose-bis-phosphate Aldolase | Ald | 482 | cgd1_3020 |
| Glucose-6-phosphate isomerase | G6PI | 225 | cgd2_3200 |
| Lactate Dehydrogenase | LDH | 891 | cgd7_480 |
| Phosphoglucomutase | PGM | 606 | cgd7_4270* |
| Pyruvate kinase | PyK | 718 | cgd1_2040 |
| Triose Phosphate Isomerase | TPI | 500 | cgd1_3040 |
| Inosine monophosphate dehydrogenase | IMPDH | 1065 | cgd6_20* |
| Adenosine Transporter | AT | 879 | cgd2_310 |
| Cytidine Triphosphate Synthase | CTPS | 856 | cgd5_1710 |
| Guanidine Monophosphate Synthase | GMPS | 423 | cgd5_4520* |
| RibonucleosideDiphosphate Reductase | RDPR | 627 | cgd6_1950 |
| deoxycytidine Monophosphate Deaminase | dCMPD | 512 | cgd2_2780 |
| Dihydrofolate reductase-Thymidyl Synthase | DHFR-TS | 481 | cgd4_4460 |
| Adenosine Kinase | AK | 170 | cgd8_2370* |
| Uridine Kinase | UK | 404 | cgd8_2810 |
| Thymidine Kinase | TK | 362 | cgd5_4440 |
| DNA polymerase -α-subunit | DNA pol α | 879 | cgd8_870 |
| DNA polymerase -δ-catalytic subunit | DNA pol δ | 234 | cgd6_4410* |
| MCM 10p-like | MCM 10p | 92 | cgd6_1710 |
| MCM 2-like | MCM2 | 485 | cgd2_1100 |
| MCM3-associated | MCM3-a | 882 | cgd3_3570 |
| MCM3-like | MCM3 | 370 | cgd2_1600 |
| MCM4-like | MCM4 | 638 | cgd2_1250 |
| MCM5-like | MCM5 | 155 | cgd7_2920 |
| MCM6-like | MCM6 | 604 | cgd6_240* |
| MCM7-like | MCM7 | 374 | cgd4_970 |
| ORC/CDC6-like | CDC6 | 1301 | cgd4_4320 |
| RP-A ssb protein | RP-A | 730 | cgd2_4080 |
| RAD24/RF-C activator | RAD24 | 571 | cgd7_2660 |
| MCM7 domain containing | MCM7-c | 760 | cgd8_3360 |
| OOCYST WALL PROTEINS | |||
| COWP1 | COWP1 | 366 | cgd6_2090 |
| COWP3 | COWP3 | 530 | cgd4_670 |
| COWP4 | COWP4 | 197 | cgd8_3350 |
| COWP5 | COWP5 | 170 | cgd7_5150 |
| COWP6 | COWP6 | 688 | cgd4_3090 |
| COWP8 | COWP8 | 604 | cgd6_200 |
| COWP9 | COWP9 | 170 | cgd6_210* |
| LSP1 | LSP1 | 370 | cgd7_3800* |
| LSP2 | LSP2 | 370 | cgd7_3810 |
| LSP3 | LSP3 | 225 | cgd7_3820* |
| LSP4 | LSP4 | 454 | cgd7_3830 |
| LSP5 | LSP5 | 142 | cgd7_3840 |
| LSP6 | LSP6 | 257 | cgd7_3860 |
| LSP7 | LSP7 | 257 | cgd7_3870 |
Descriptions of the genes used in this study, organized by functional group. The lengths of the respective upstream sequences (distance until the previous CDS) and annotated gene ids for each C. parvum gene are shown. Gene ids marked by a * are those for which corresponding ortholog information in C. hominis could not be obtained.
Figure 2Motifs identified upstream of oocyst wall and large secretory proteins. (a) Upstream regions of genes encoding cryptosporidial oocyst wall proteins, and the occurrences of the most significant upstream motif shared by all of these upstream regions. The positions of the motifs are drawn to scale. All positions are shown with respect to the translational start. Solid black symbols denote a motif located on the reverse strand. Sequence-logo displaying the information content for the over-represented motif. (b) Upstream regions of genes encoding cryptosporidial large secretory proteins, and the occurrences of the most significant upstream motif shared by all of these upstream regions. Sequence-logo displaying the information content for the over-represented motif. Expression profiles for both families of genes were published elsewhere (Abrahamsen et al. 2004; Templeton et al. 2004).
Figure 3Results of motif and expression analyses. (a) Motifs and expression profiles associated with genes involved in nucleotide metabolism. Schematic representation of the upstream regions are shown for each gene. The location of 4 different candidate motifs are indicated by the use of four different shapes. The single motif found in each gene of the group is indicated by a circle. The locations of three additional candidate motifs, each associated with a sub-set of sequences are indicated by the remaining shapes drawn on the upstream regions and as indicated to the left of each sequence logo. Solid black shapes indicate motifs found on the reverse strand. Comparative real-time PCR profiles for sub-sets of each group of genes organized by expression profile over a 72 h period are shown as sub-sets 1–3 Each sub-set is associated with a single candidate motif. (b) Motifs and expression profiles associated with genes involved in DNA replication. (c) Motifs and expression profiles associated with genes involved in glycolyis.