| Literature DB >> 15707503 |
Abstract
BACKGROUND: Mammalian ribosomes contain 79 different proteins encoded by widely scattered single copy genes. Coordinate expression of these genes at transcriptional and post-transcriptional levels is required to ensure a roughly equimolar accumulation of ribosomal proteins. To date, detailed studies of only a very few ribosomal protein (rp) promoters have been made. To elucidate the general features of rp promoter architecture, I made a detailed sequence comparison of the promoter regions of the entire set of orthologous human and mouse rp genes.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15707503 PMCID: PMC554972 DOI: 10.1186/1471-2148-5-15
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1The location of insertion elements in the 5'-flanking regions of rp genes. Insertion (ins) elements were identified by the RepeatMasker program in scans of up to one kilobase of sequence 5' of the tsp, and the location of the element nearest to the tsp recorded for 79 human and mouse rp genes [see Additional file 2]. The distance from the tsp was divided into 200 bp intervals, the percentage of rp genes with an element within each interval determined, and the values plotted cumulatively against the distance from the tsp.
Figure 2Criteria for rp promoter annotation. (a). Quality of the TATA box for TBP binding based on a structural analysis of TBP-DNA complexes. Rules adopted for + and +/- ranking and some examples in rp promoters are shown at the right. (b). Criteria used for identification of potential transcription factor-binding sites by the rVISTA program for motifs aligned in human-mouse promoter sequence comparisons and with the FindPatterns program for unaligned motifs.
Figure 3Examples of comparisons of human and mouse rp promoter sequences from -200 to +100. Exon I is in uppercase letters, highlighted in yellow with the ATG translation initiation codon, if present, highlighted in gray. The sequences evaluated for TATA box quality are highlighted in green. Putative transcription factor-binding sites in aligned sequences are shown above the sequences, highlighted in fuchsia; sites in unaligned sequences are shown above the human or below the mouse sequences and enclosed in carets. An upstream sequence identified as an insertion element is in red letters. Gaps inserted by the alignment program to maximize overall sequence identity are indicated by dashes. (a) rpS18 , (b) rpL30 , (c) rpS4 , (d) rpL13a . In rpL30 the elements with known functional relevance are underlined with asterisks.
Ribosomal protein promoters Quality of the TATA Box Motifs, the Number of Transcription Factor-Binding Sites and the Location of AUG Codons.
| SA | + | + | 58 | 1 | 2 | E2 | |||||
| S2 | + | + | 54 | 1 | E2 | ||||||
| S3 | + | + | 60 | E1 | |||||||
| S3a | – | – | 75 | E1 | |||||||
| S4 | +/– | +/– | 70 | 1 | E1e | ||||||
| S5 | +/– | + | 62 | E2 | |||||||
| S6 | – | – | 56 | 1 | 1 | E1 | |||||
| S7 | +/– | + | 56 | 4 | 2 | E2 | |||||
| S8 | +/– | +/– | 60 | E1e | |||||||
| S9 | + | + | 50 | 1 | E2 | ||||||
| S10 | – | – | 64 | 1 | E2 | ||||||
| S11 | +/– | +/– | 53 | 1 | E1 | ||||||
| S12 | + | + | 62 | E2 | |||||||
| S13 | – | +/– | 61 | E1 | |||||||
| S14 | – | – | 62 | E2 | |||||||
| S15 | +/– | +/– | 52 | E1e | |||||||
| S15a | +/– | +/– | 59 | 1 | E2 | ||||||
| S16 | – | +/– | 74 | 2 | E1 | ||||||
| S17 | +/– | + | 60 | 1 | 2 | 1 | E1e | ||||
| S18 | + | + | 64 | E1e | |||||||
| S19 | +/– | +/– | 72 | 1 | E2 | ||||||
| S20 | + | + | 63 | E1e | |||||||
| S21 | + | +/– | 52 | 1 | E2 | ||||||
| S23 | – | – | 65 | E1e | |||||||
| S24 | + | + | 59 | 1 | 1 | E1e | |||||
| S25 | – | – | 56 | 1 | E1e | ||||||
| S26 | + | + | 53 | E1e | |||||||
| S27 | + | + | 71 | E1 | |||||||
| S27a | – | – | 57 | 1 | E2 | ||||||
| S28 | + | + | 67 | 1 | 1 | E1 | |||||
| S29 | – | – | inc | E1 | |||||||
| S30 | – | – | 68 | E2 | |||||||
| L3 | + | + | 56 | 2 | 1 | E1e | |||||
| L4 | + | + | 68 | 2 | E1e | ||||||
| L5 | + | + | 62 | 1 | E1e | ||||||
| L6 | +/– | +/– | 62 | 1 | E2 | ||||||
| L7 | +/– | +/– | 69 | E1 | |||||||
| L7a | + | + | 51 | 1 | 1 | E1e | |||||
| L8 | +/– | +/– | 50 | 1 | 1 | E2 | |||||
| L9 | – | – | 57 | 1 | E2 | ||||||
| L10 | + | + | 60 | 1 | E2 | ||||||
| L10a | + | + | 55 | E1 | |||||||
| L11 | +/– | +/– | 64 | E2 | |||||||
| L12 | + | + | 73 | 1 | E1 | ||||||
| L13 | – | +/– | 55 | 1 | E2 | ||||||
| L13a | +/– | +/– | 67 | 2 | E1 | ||||||
| L14 | +/– | +/– | 58 | 1 | E1e | ||||||
| L15 | – | – | 66 | 1 | E2 | ||||||
| L17 | – | – | 69 | 1 | 1 | E2 | |||||
| L18 | – | – | 51 | E1e | |||||||
| L18a | – | – | 67 | E1 | |||||||
| L19 | +/– | +/– | 65 | 1 | E1 | ||||||
| L21 | – | – | 62 | E2 | |||||||
| L22 | – | – | 61 | E1 | |||||||
| L23 | + | + | 64 | 1 | 1 | E1 | |||||
| L23a | +/– | +/– | 63 | E1 | |||||||
| L24 | +/– | +/– | 56 | 1 | E1 | ||||||
| L26 | +/– | +/– | 64 | 1 | E2 | ||||||
| L27 | – | – | 66 | E2 | |||||||
| L27a | – | – | 56 | 1 | E1e | ||||||
| L28 | + | +/– | 56 | 1 | 1 | E2 | |||||
| L29 | +/– | +/– | 68 | 1 | 1 | E1e//2 | |||||
| L30 | – | – | 60 | 1 | E2 | ||||||
| L31 | – | – | 62 | E2 | |||||||
| L32 | – | – | 68 | 1 * | E2 | ||||||
| L34 | – | – | 55 | 2 | E2 | ||||||
| L35 | – | – | 59 | 4 | E1e | ||||||
| L35a | – | – | 64 | 1 | E2 | ||||||
| L36 | +/– | + | 51 | 1 | 1 | E2 | |||||
| L36a | + | + | 62 | 2 | E1e | ||||||
| L37 | + | – | 60 | 2 | 1 | E1e | |||||
| L37a | +/– | +/– | 52 | E1e | |||||||
| L38 | +/– | – | 62 | E2 | |||||||
| L39 | – | +/– | 61 | 1 | 1 | E1e | |||||
| L40 | – | – | 62 | 3 | E2 | ||||||
| L41 | + | + | 54 | 1 | E2 | ||||||
| LP0 | + | + | 59 | 1 | E2 | ||||||
| LP1 | +/– | +/– | 62 | E1 | |||||||
| LP2 | +/– | – | 55 | E2 | |||||||
Bolded numbers indicate that the sites are aligned in orthologous human and mouse genes. An asterisk indicates that there is experimental evidence for the functionality of one or more of the sites.
‡ From –200 to +100 relative to the tsp.
† E1, within Exon 1; E1e, at the extreme 3' end of Exon 1; E2, within Exon 2. The locations are conserved in all rp genes except rpL29 .
Figure 4The consensus initiator sequence of mammalian rp genes. Seventy-nine pairs of orthologous human and mouse rp gene sequences were compared at positions -8 to +10 and the occurrence of each nucleotide or pair of nucleotides depicted by the height of the letters: A, G, C. T, Y = C/T, R = A/G, W = A/T, K = G/T, S = C/G, M = C/A. The tsp is the C at position +1.
Figure 5Comparison of . Alignment by the ClustalW program with 100% and 75% identities enclosed in boxes. The first exon is highlighted in yellow with the ATG translation initiation codon in gray. Putative binding sites for TBP, GABP, Sp1 and YY1 are highlighted in green, red, blue and pink, respectively.
Comparison of promoter features in mammals and lower vertebrates
| Hs/Mm | E1 | AB061838, NM_012052 | |||||||
| Xl | E1 | Z34530 | |||||||
| Fr | E1 | X97794 | |||||||
| Hs/Mm | E1 | X67309, D28348, Z54209 | |||||||
| Xl | E1 | AF020551 | |||||||
| Hs/Mm | E2 | Z25749, AB055774, NM_011300 | |||||||
| Xl | E2 | X71081 | |||||||
| Fr | E2 | X94942 | |||||||
| Hs/Mm | E1e | M32405, AB055776, NM_001018 | |||||||
| Gg | E1e | D10167 | |||||||
| Hs/Mm | E1e | AB062069, U12202, NM_011297 | |||||||
| Xl | E1e | M33517 | |||||||
| Fr | E1e | AJ001398 | |||||||
| Hs/Mm | E1e | AB055762, NM_00969, mCG13589 | |||||||
| Gg | E1e | D10737 | |||||||
| Hs/Mm | E1e | X61923, X54067 | |||||||
| Gg | E1e | X62641 | |||||||
| Fr | E1e | Y15171 | |||||||
| Hs/Mm | E1e | AB061822, mCG22708 | |||||||
| Xl | E1e | X06552 | |||||||
| Hs/Mm | E1e | AB061825, mCG132477 | |||||||
| Xl | E1e | X05025 | |||||||
| Om | E1e | AF240376 | |||||||
| Hs/Mm | E2 | AB070559, K02928 | |||||||
| Gg | E2 | D14521 | |||||||
| Hs/Mm | E1e | NM_000998, NM_009084 | |||||||
| Gg | E1e | D14167 |
Hs/Mm: Homo sapiens/Mus musculus; Gg: Gallus gallus; Xl: Xenopus laevis; Fr: Fugu rubripes; Om: Oreochromis mossambicus (tilapia). For the initiator sequences, represents an alleged tsp based on various types of experimental evidence. For the transcription factor binding sites, only those sites that are conserved (aligned) between human and mouse are listed. A † symbol indicates that the promoter contains an unaligned site for the factor. Designation of AUG locations as in Table 1.
Non-rp top genes
| + | + | 44 | 2 | E2 | |||||||
| + | + | 68 | E1 | ||||||||
| + | + | 55 | E1e | ||||||||
| + | + | 66 | 1 | E1 | |||||||
| – | – | 71 | E1 | ||||||||
| + | + | 73 | 1 | E1 | |||||||
| + | + | 67 | 1 | 1 | E1 | ||||||
| + | + | 55 | 2 | -- | |||||||
| + | + | 43 | -- | ||||||||
* From -300 to +100 relative to the tsp.
† The bolded C is the tsp.
Other symbols are as in Table 1.
Comparison ofthe promoters ofrp genes, non-rp top genes, and other housekeeping genes
| GABP (5' of tsp) | 54 % | 68 % | 22 % * | 33 % | 35% |
| SP1 (5' of tsp) | 48 % | 70 % | 89 % | 89 % | 90 % |
| GABP or SP1 | 76 % | 92 % | 100 % | 100 % | 95 % |
| YY1 (3' of tsp) | 52 % | 59 % | 11 % † | 11 % † | 10 % † |
| + | 35 % | 89 % ‡ | 30 % | ||
| +/– | 25 % | – | 20 % | ||
| – | 39 % | 11 % ‡ | 50 % | ||
Based on analyses of 79 pairs of orthologous human and mouse rp genes, 9 pairs of orthologous non-rp TOP genes and 20 randomly selected human or mouse housekeeping genes. The symbols *, †, and ‡ indicate the percentages considered to be significantly different from the corresponding values for rp genes according to a Fisher's two sided exact test of the three sets of data. *: p = 0.086; †: p = 0.031 and 0.00075; ‡: p = 0.0029.
For the rp genes, the TATA box quality was scored as:
+ when both orthologues are + or when one is + and the other +/-,
+/– when both orthologues are +/– or when one is + and the other is –, and – when both orthologues are – or when one is - and the other is +/–.
For the non-rp TOP genes, both orthologues were either + or –.
Figure 6Classification of rp gene promoters. The data in Table 1 were used to classify the rp promoters into eight groups according to whether they contain aligned sites for GABP, Sp1, YY1, or various combinations of these factors. The promoters highlighted in green are those in which the TATA box quality of both orthologues was ranked as + or in which one ranked as + and the other as +/-. The promoters highlighted in yellow are those in which both orthologues ranked as +/- or in which one ranked as + and the other as -. Non highlighted promoters are those in which both orthologues ranked as – or in which one ranked as +/- and the other as -.