| Literature DB >> 20003517 |
Eric Peyretaillade1, Olivier Gonçalves, Sébastien Terrat, Eric Dugat-Bony, Patrick Wincker, Robert S Cornman, Jay D Evans, Frédéric Delbac, Pierre Peyret.
Abstract
BACKGROUND: Microsporidia are obligate intracellular eukaryotic parasites with genomes ranging in size from 2.3 Mbp to more than 20 Mbp. The extremely small (2.9 Mbp) and highly compact (approximately 1 gene/kb) genome of the human parasite Encephalitozoon cuniculi has been fully sequenced. The aim of this study was to characterize noncoding motifs that could be involved in regulation of gene expression in E. cuniculi and to show whether these motifs are conserved among the phylum Microsporidia.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20003517 PMCID: PMC2803860 DOI: 10.1186/1471-2164-10-607
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Experimental evidences to explain transcriptional process in . A: schematic representation of the genomic DNA region from the E. cuniculi chromosome 2 containing the genes unk, htr, h2a, pap, ubi and rib. White rectangles represent the position of the genes in genomic DNA (with gene names inside) and the arrowhead represents their transcriptional directions. The length in base pairs (bp) of the CDSs and the intergenic regions is also indicated. Black arrows represent the orientation of the specific primers used for RT, PCR and RACE-PCR experiments. B: results of RT-PCR experiments, indicating that both DNA strands are transcribed. Discontinuous lines indicate that respective transcription start site and transcription terminator one could not be defined. C: Schematic representation of RACE-PCR experiments. Black lines represent the length and position of the 5'-RACE (1) and the 3' RACE (2) fragments. PolyA tails are represented by AAAA and 5' end by a dot.
Figure 2Identification of 5' and 3' maturation sites for the six monocistronic genes. 5' and 3'RACE-PCR products were generated using specific primers designed from the hypothetical protein (unk), the ubiquitin (ubi), the polyA polymerase (pap), the histone H2A (h2a) the histone transcription regulator (htr) and the 60S ribosomal protein L27A (rib). Sequences of 5' and 3'UTR are given. Translation initiation codon and stop codon are highlighted in light-grey for all genes. 3'UTR lengths are indicated in square brackets. Putative polyadenylation signals are underlined. Distances between putative polyadenylation signal and polyadenylation site are indicated in parenthesis. DNA sequences upstream transcription start sites that may represent core promoters are highlighted in dark grey. Complete CDS sequences are not given and are represent by gene names.
Identification of conserved DNA signals shared by different microsporidian species potentially implicated in ribosomal protein gene expression.
| Product name | |||||
|---|---|---|---|---|---|
| 60S RIBOSOMAL PROTEIN L11 | |||||
| 60S RIBOSOMAL PROTEIN L7A | NS | ||||
| 40S RIBOSOMAL PROTEIN S17 | |||||
| 60S RIBOSOMAL PROTEIN L9 | |||||
| 60S RIBOSOMAL PROTEIN L24 | NS | NS | NS | ||
| 40S RIBOSOMAL PROTEIN S8 | NS | ||||
| 60S RIBOSOMAL PROTEIN L35A | NS | NS | NS | ||
This table represents DNA sequences upstream the ATG codon for 7 ribosomal protein genes. isolated on the E. cuniculi chromosome 2 and their orthologs in A. locustae, E. bieneusi, A. algerae, and N. ceranae. DNA signals are highlighted as bold characters for AAATTT-like sequence and underlined characters for CCC- and GGG-like sequences. (NS indicates a Not already Sequenced gene and the star character stands for mispredicted start codon).
Study of Kozak sequences for correctly and mispredicted start codons.
| POSITION | NUCLEOTIDE | correctly predicted AUG with CCC-like motif (1474) | mispredicted AUG (364) | corrected AUG and BLASTP validated (187) | corrected AUG but not BLASTP validated (177) | correctly predicted (117) and corrected (18) AUG with GGG-like motif |
|---|---|---|---|---|---|---|
| A | ||||||
| T | ||||||
| G | ||||||
| +4 | A | |||||
| T | 10.92 | 22.53 | 10.16 | 11.86 | 7.41 | |
| G | ||||||
| C | 10.85 | 25.82 | 6.42 | 9.04 | 9.63 | |
| +5 | A | |||||
| T | 19.54 | 28.57 | 14.44 | 15.82 | 27.41 | |
| G | 19.74 | 18.68 | 20.32 | 19.77 | 22.96 | |
| C | 15.4 | 27.47 | 18.18 | 18.64 | 12.59 | |
| +6 | A | 26.87 | 24.73 | 28.88 | 27.12 | 32.59 |
| T | 23.47 | 32.14 | 20.86 | 23.16 | 23.7 | |
| G | 28.7 | 22.53 | 26.2 | 30.51 | 28.15 | |
| C | 20.96 | 20.6 | 24.06 | 19.21 | 15.56 | |
Nucleotide frequencies of occurrence downstream (+4 to +6) the ATG codons Significant nucleotide biases are indicated in bold
List of oligonucleotide primers used for RT, PCR and RACE-PCR experiments.
| GENE | ORF NUMBER | PRIMER NAME | SEQUENCE (5'-3') | POSITION ON CHROMOSOME SEQUENCE |
|---|---|---|---|---|
| ECU02_0700 | 5'-unk | ATGCCATCTCGGCACTTAGAAGGCTC | complement (86499..86524) | |
| 3'-unk | GCGCTCCAGCTCATCAGCGGCGAG *# | 86443..86466 | ||
| ECU02_0710 | 5'-htr | ACCTGTCCGTCCATACCACGTTGC | 88604..88627 | |
| 3'-htr | GATACCGACCTGGCGCTTGAACACG # | complement (87230..87254) | ||
| ECU02_0720 | 5'-h2a | TGATCTCGCTGATTAGGTACATTAC | 89217..89241 | |
| 3'-h2a | GGACCAGGATGAGGATATCGAAGG | complement (89268..89291) | ||
| ECU02_0730 | 5'-pap | TCAAAGAACCCACGCTCCTGTAGG | 90874..90897 | |
| 3'-pap | GTCAAAGTTCGAGGCGGTTGACGAC | complement (89812..89836) | ||
| ECU02_0740i | 5'ubi | CTCGATACTGTCCGAAGGCTCGACT # | 91300..91324 | |
| 3'-ubi | AGTCGAGCCTTCGGACAGTATCGAG | complement (91300..91324) | ||
| ECU02_0750i | 5'-rib | TGTCATGCTCTGACAGAACCGTCC *# | complement (91758..91781) | |
| 3'-rib | AGAAGAAGGTATCCAAGCTGGCC | 91698..91720 | ||
For each gene two specific primers were designed. Each of them was used for RACE-PCR. * and # characters indicate the primers used for RT and PCR, respectively. ORF number are also indicated.