| Literature DB >> 12537549 |
Jose L Jiménez1, Michael P Mitchell, John G Sgouros.
Abstract
BACKGROUND: Genome projects have provided a vast amount of sequence information. Sequence comparison between species helps to establish functional catalogues within organisms and to study how they are maintained and modified across phylogenetic groups during evolution. Microarray studies allow us to determine groups of genes with similar temporal regulation and perhaps also common regulatory upstream regions for binding of transcription factors. The integration of sequence and expression data is expected to refine our current annotations and provide some insight into the evolution of gene regulation across organisms.Entities:
Mesh:
Year: 2002 PMID: 12537549 PMCID: PMC151285 DOI: 10.1186/gb-2002-4-1-r4
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
COG codes for general function and pathway/systems
| Code | Genes | Description |
| 416 | Information storage and processing | |
| J | 242 | Translation, ribosomal structure and biogenesis |
| K | 80 | Transcription |
| L | 94 | DNA replication, recombination and repair |
| 429 | Metabolism | |
| C | 62 | Energy production and conversion |
| G | 71 | Carbohydrate transport and metabolism |
| E | 153 | Amino acid transport and metabolism |
| F | 50 | Nucleotide transport and metabolism |
| H | 54 | Coenzyme metabolism |
| I | 39 | Lipid metabolism |
| 226 | Cellular processes | |
| D | 4 | Cell division and chromosome partitioning |
| O | 98 | Posttranslational modification, protein turnover, chaperones |
| M | 9 | Cell envelope biogenesis, outer membrane |
| N | 6 | Cell motility and secretion |
| P | 71 | Inorganic ion transport and metabolism |
| T | 38 | Signal transduction mechanisms |
| 127 | Poorly characterized | |
| R | 119 | General function prediction only |
| S | 8 | Function unknown |
| - | 613 | No pathway/system assigned |
| C1 | 2 | Pyruvate decarboxylation |
| C2 | 18 | TCA cycle |
| C3 | 3 | Glyoxylate bypass |
| E1 | 6 | Arginine biosynthesis |
| E2 | 3 | Phenylalanine/tyrosine biosynthesis |
| E3 | 3 | Tryptophan biosynthesis |
| E4 | 2 | Threonine biosynthesis |
| E5 | 1 | Isoleucine biosynthesis |
| E7 | 19 | Leucine biosynthesis |
| E8 | 7 | Methionine biosynthesis |
| E9 | 4 | Proline biosynthesis |
| E10 | 4 | Histidine biosynthesis |
| F1 | 15 | Purine biosynthesis |
| F2 | 3 | Purine salvage |
| F3 | 12 | Pyrimidine biosynthesis |
| F4 | 2 | Pyrimidine salvage |
| F5 | 7 | Thymidylate biosynthesis |
| G1 | 4 | Glycolysis |
| G2 | 14 | Gluconeogenesis |
| G3 | 6 | Pentose phosphate pathway |
| G4 | 4 | Entner-doudoroff pathway |
| H1 | 8 | Heme biosynthesis |
| H3 | 2 | FAD biosynthesis |
| H4 | 7 | Biotin biosynthesis |
| H5 | 1 | NAD biosynthesis |
| H6 | 7 | Ubiquinone biosynthesis |
| H7 | 3 | Menaquinone biosynthesis |
| H8 | 2 | Thiamine biosynthesis |
| H9 | 1 | Pyridoxal phosphate biosynthesis |
| I1 | 10 | Fatty acid biosynthesis |
| J1 | 26 | Translation factors and enzymes involved in translation |
| J2 | 30 | Aminoacyl-tRNA synthetases, amino acid activation |
| J3 | 47 | Ribosomal proteins - small subunit |
| J4 | 77 | Ribosomal proteins - large subunit |
| K1 | 14 | DNA-dependent RNA polymerase subunits |
| K2 | 7 | Basal transcription factors |
| K3 | 2 | Transcriptional regulators |
| L1 | 20 | Basal replication machinery |
The COG letter-codes for general function and pathway/system categories are given the first column (Code). The Genes column holds the number of genes in the microarray data studied that were found for each functional class. The last column (Description) gives the description for each category.
Figure 1Correlation inconsistency in the expression of some gene pairs between different experiments. For each gene pair, CCF values obtained from combining expression profiles of all experiments (CCFTOTAL, x-axis) are plotted versus those of individual experiments (CCFEXPERIMENT, y-axis). Only gene pairs with a CCFTOTAL value above 0.5 or below -0.5 are shown. x- and y-axis values go from -1 to 1 in steps of 0.5 as depicted in (a). (b-d) Genes involved in the cell cycle: (b) alpha-factor, (c) elutriation and (d) cdc-15 strain. (e) Sporulation. (f, g) Response to stress: (f) heat and (g) cold shocks. (h) Diauxic shift. CCFEXPERIMENT values can spread over a considerable range regardless of their CCFTOTAL (for example, heat shock), even to the point of being significant and with opposite sign to CCFTOTAL (for example, cold stress and diauxic shift). This inconsistency is less pronounced in the sporulation data, perhaps because it contains very high intensity values that may bias the CCFTOTAL to be more similar to CCFspo. Note that only genes whose expression was increased or decreased by 2.3-fold were considered. Those that did not pass the filtering were given a value equal to zero.
Figure 2Proteins localized in the same compartment tend to be expressed at the same time. The plots show the trend for the percentage of genes from the same compartment (%LOC, y-axis) with respect to the total number of genes to which they correlate at a given threshold (CCF, x-axis). The solid lines correspond to the values obtained from the experimental data, whereas the dashed lines are the values expected by chance. The expected trend matches with that of a set of proteins with unknown localization (a), in contrast to sets of proteins with identified compartment: (b) mitochondrion, (c) endoplasmic reticulum, (d) plasma membrane, (e) cytoplasm and (f) nucleus.
Figure 3Agreement of functional annotation and expression data of genes. (a) Two groups of genes obtained after an a priori classification by, for example, functional annotation. The red lines connecting the elements within each group represent significant similarity between their expression profiles. The number of red lines with respect to all possible ones gives and indication of the consistency of the group. Thus, group a is very consistent, as all the possible connections but one, a3-a4, are made. On the other hand, the consistency of group b is poorer, as its elements form two subgroups (b1-b2 and b3-b4). The blue lines connecting the elements between each group are relations lost upon the a priori classification used. The higher the number of lost connections the less comprehensive the classification will be. In this case, the subgroup b1-b2 significantly correlates with group a. (b) Consistency of the functional groups established by gene annotation. A decreasing trend implies loss of interactions between members. F--, -P-, FP-, F-L, -PL and FPL indicate functional classes as defined in the text. For example, class F-- contains all gene pairs in which both members have the same general function regardless of their pathway/system (P) or location (L), class FP- contains all gene pairs with the same general function and the same pathway/system but not necessarily the same location, and so on. (c) Comprehensiveness of significant gene-expression pairs in the functional groups established by gene annotation. The increasing trend suggests that genes correlating at high CCF values tend to belong to the same functional class. This is especially obvious when a broad functional classification is used in which nearly all the possible pairs in the experiment are represented at high thresholds. The percentage of the gene pairs in each group with respect to the total number of pairs was: ALL (100%), F-- (8.8%), -P- (1.2%), FP- (4%), F-L (6.7%), -PL (1%) and FPL (3.1%). The group --- was not included in the -P- and -PL classes.
Figure 4Examples of gene subgrouping within broad functional classes. Expression profiles for several sets of genes after combining experiments under different conditions. The experiments are color-coded (upper bar) as follows: yellow (cell cycle), orange (sporulation), red (stress) and purple (diauxic shift). (a) Subgroups obtained from the 'L - -' class ('DNA replication, recombination and repair'). (b) Subgroups obtained from the 'O - -' class ('post-translational modification, protein turnover and chaperones'). (c) Subgroups obtained from the 'GEPR - -' class, which contains permeases of the major facilitator superfamily. GEPR comprises several functional groups reflecting that their actual function is not clear (R) although they may be involved in the transport of sugars (G), aminoacids (E) and inorganic ions (P). The vertical axis represents the differential expression of genes as the log ratio of the mRNA abundance in experimental versus control samples. At zero values, the mRNA levels are identical. The list of genes included in every subgroup can be found in the additional data files.
Figure 5'DNA replication-related' genes. The experiments are color-coded as in Figure 4. (a) The four subgroups with very similar periodic profiles are shown. As mentioned in the text, they correspond to DNA repair and replication, thymidylate biosynthesis, and chromosome partitioning. (b) Profile obtained by averaging those shown in (a).
Common upstream motifs in the genes involved in cytoplasmic protein synthesis
| Presence of regulatory motif | COG general functional category | COG pathway/system | Subcellular localization | GOC number | ORF name | Gene description | ||
| rap1 | - | - | EH | E7 | - | COG0028 | YLR044C | PDC1 pyruvate decarboxylase, isozyme 1 |
| rap1 | - | - | G | G1 | - | COG0469 | YAL038W | CDC19 pyruvate kinase |
| RAP1 | - | - | G | G2 | - | COG0126 | YCR012W | PGK1 phosphoglycerate kinase |
| RAP1 | - | - | G | G2 | - | COG0057 | YGR192C | TDH3 glyceraldehyde-3-phosphate dehydrogenase 3 |
| RAP1 | - | - | G | G2 | - | COG0588 | YKL152C | GPM1 phosphoglycerate mutase |
| rap1 | - | - | G | G2 | - | COG0148 | YHR174W | ENO2 enolase II (2-phosphoglycerate dehydratase) |
| rap1 | - | - | G | G2 | - | COG0057 | YJR009C | TDH2 glyceraldehyde-3-phosphate dehydrogenase 2 |
| RAP1 | RRPE | - | J | - | - | COG2238 | YOL121C | RPS19A 40S small subunit ribosomal protein S19.E |
| RAP1 | - | - | J | - | - | COG2238 | YNL302C | RPS19B 40S small subunit ribosomal protein S19.E |
| rap1 | - | PAC | J | - | - | COG2451 | YPL143W | RPL33A ribosomal protein L35A.E.C16 |
| rap1 | - | - | J | - | - | COG2451 | YOR234C | RPL33B ribosomal protein L35A.E.C15 |
| RAP1 | - | - | J | J2 | - | COG1190 | YDR037W | KRS1 lysyl-tRNA synthetase, cytosolic |
| rap1 | RRPE | - | J | J2 | - | COG0008 | YOR168W | GLN4 glutaminyl-tRNA synthetase |
| RAP1 | RRPE | PAC | J | J3 | - | COG0199 | YDL061C | RPS29B ribosomal protein S29.E.B |
| RAP1 | RRPE | PAC | J | J3 | - | COG0092 | YNL178W | RPS3 ribosomal protein S3.E |
| RAP1 | RRPE | PAC | J | J3 | - | COG0522 | YPL081W | RPS9A ribosomal protein S9.E.A |
| RAP1 | RRPE | - | J | J3 | - | COG0103 | YDL083C | RPS16B ribosomal protein S16.E |
| RAP1 | RRPE | - | J | J3 | - | COG2004 | YIL069C | RPS24B 40S small subunit ribosomal protein S24.E |
| RAP1 | RRPE | - | J | J3 | - | COG0052 | YLR048W | RPS0B 40S ribosomal protein P40 HOMOLOG B |
| RAP1 | RRPE | - | J | J3 | - | COG0185 | YOL040C | RPS15 40S small subunit ribosomal protein |
| RAP1 | - | PAC | J | J3 | - | COG0052 | YGR214W | RPS0A 40S ribosomal protein P40 HOMOLOG A |
| RAP1 | - | - | J | J3 | - | COG0186 | YBR048W | RPS11B ribosomal protein S11.E.B |
| RAP1 | - | - | J | J3 | - | COG2125 | YBR181C | RPS6B ribosomal protein S6.E |
| RAP1 | - | - | J | J3 | - | COG0522 | YBR189W | RPS9B ribosomal protein S9.E.B |
| RAP1 | - | - | J | J3 | - | COG0100 | YCR031C | RPS14A 40S ribosomal protein S14.E |
| RAP1 | - | - | J | J3 | - | COG0184 | YDR064W | RPS13 ribosomal protein |
| RAP1 | - | - | J | J3 | - | COG1383 | YDR447C | RPS17B ribosomal protein S17.E.B |
| RAP1 | - | - | J | J3 | - | COG0099 | YDR450W | RPS18A ribosomal protein S18.E.C4 |
| RAP1 | - | - | J | J3 | - | COG2004 | YER074W | RPS24A 40S small subunit ribosomal protein S24.E |
| RAP1 | - | - | J | J3 | - | COG0098 | YGL123W | RPS2 40S small subunit ribosomal protein |
| RAP1 | - | - | J | J3 | - | COG0048 | YGR118W | RPS23A 40S small subunit ribosomal protein S23.E |
| RAP1 | - | - | J | J3 | - | COG0051 | YHL015W | RPS20 ribosomal protein |
| RAP1 | - | - | J | J3 | - | COG1471 | YHR203C | RPS4B ribosomal protein S4.E.C8 |
| RAP1 | - | - | J | J3 | - | COG0096 | YJL190C | RPS22A ribosomal protein S15A.E.C10 |
| RAP1 | - | - | J | J3 | - | COG0049 | YJR123W | RPS5 ribosomal protein S5.E |
| RAP1 | - | - | J | J3 | - | COG1471 | YJR145C | RPS4A ribosomal protein S4.E.C10 |
| RAP1 | - | - | J | J3 | - | COG2051 | YKL156W | RPS27A ribosomal protein S27.E |
| RAP1 | - | - | J | J3 | - | COG1890 | YLR441C | RPS1A ribosomal protein S3A.E |
| RAP1 | - | - | J | J3 | - | COG1890 | YML063W | RPS1B ribosomal protein S3A.E |
| RAP1 | - | - | J | J3 | - | COG2125 | YPL090C | RPS6A ribosomal protein S6.E |
| rap1 | RRPE | - | J | J3 | - | COG0100 | YJL191W | RPS14B 40S small subunit ribosomal protein S14.E.B |
| rap1 | RRPE | - | J | J3 | - | COG2053 | YOR167C | RPS28A 40S small subunit ribosomal protein S28.E.C15 |
| rap1 | - | - | J | J3 | - | COG2007 | YBL072C | RPS8A ribosomal protein S8.E |
| rap1 | - | - | J | J3 | - | COG0186 | YDR025W | RPS11A ribosomal protein S11.E |
| rap1 | - | - | J | J3 | - | COG2007 | YER102W | RPS8B ribosomal protein S8.E |
| rap1 | - | - | J | J3 | - | COG0048 | YPR132W | RPS23B 40S small subunit ribosomal protein S23.E |
| RAP1 | RRPE | - | J | J4 | - | COG2058 | YDL130W | RPP1B 60S large subunit acidic ribosomal protein L44PRIME |
| RAP1 | RRPE | - | J | J4 | - | COG0093 | YER117W | RPL23B ribosomal protein L23.E |
| RAP1 | RRPE | - | J | J4 | - | COG0081 | YGL135W | RPL1B 60S large subunit ribosomal protein |
| RAP1 | RRPE | - | J | J4 | - | COG0097 | YGL147C | RPL9A ribosomal protein L9.E |
| RAP1 | RRPE | - | J | J4 | - | COG0198 | YGR034W | RPL26B 60S large subunit ribosomal protein |
| RAP1 | RRPE | - | J | J4 | - | COG1631 | YHR141C | RPL42B ribosomal protein L36A.E |
| RAP1 | RRPE | - | J | J4 | - | COG1552 | YIL148W | RPL40A ubiquitin |
| RAP1 | RRPE | - | J | J4 | - | COG0091 | YJL177W | RPL17B 60S large subunit ribosomal protein L17.E |
| RAP1 | RRPE | - | J | J4 | - | COG1358 | YLL045C | RPL8B 60S large subunit ribosomal protein L7A.E.B |
| RAP1 | RRPE | - | J | J4 | - | COG1632 | YLR029C | RPL15A 60S large subunit ribosomal protein L15.E.C12 |
| RAP1 | RRPE | - | J | J4 | - | COG2058 | YOL039W | RPP2A acidic ribosomal protein P2.BETA |
| RAP1 | RRPE | - | J | J4 | - | COG1727 | YOL120C | RPL18A 60S large subunit ribosomal protein S18.E |
| RAP1 | RRPE | - | J | J4 | - | COG1841 | YPL198W | RPL7B 60S large subunit ribosomal protein |
| RAP1 | - | PAC | J | J4 | - | COG2147 | YBL027W | RPL19B 60S large subunit ribosomal protein L19.E |
| RAP1 | - | PAC | J | J4 | - | COG0197 | YLR075W | RPL10 60S large subunit ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG0093 | YBL087C | RPL23A 60S large subunit ribosomal protein L23.E |
| RAP1 | - | - | J | J4 | - | COG2147 | YBR084C-A | RPL19A 60S large subunit ribosomal protein L19.E |
| RAP1 | - | - | J | J4 | - | COG2139 | YBR191W | RPL21A ribosomal protein L21.E |
| RAP1 | - | - | J | J4 | - | COG0255 | YDL191W | RPL35A 60S large subunit ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG0080 | YDR418W | RPL12B 60S large subunit ribosomal protein L12.E |
| RAP1 | - | - | J | J4 | - | COG2126 | YDR500C | RPL37B ribosomal protein L.37.E |
| RAP1 | - | - | J | J4 | - | COG1911 | YGL030W | RPL30 60S large subunit ribosomal protein L30.E |
| RAP1 | - | - | J | J4 | - | COG1841 | YGL076C | RPL7A 60S large subunit ribosomal protein L7.E.A |
| RAP1 | - | - | J | J4 | - | COG0200 | YGL103W | RPL28 60S large subunit ribosomal protein L27A.E |
| RAP1 | - | - | J | J4 | - | COG0094 | YGR085C | RPL11B ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG1358 | YHL033C | RPL8A 60S large subunit ribosomal protein L7A.E.A |
| RAP1 | - | - | J | J4 | - | COG0090 | YIL018W | RPL2B 60S large subunit ribosomal protein L8.E |
| RAP1 | - | - | J | J4 | - | COG2174 | YIL052C | RPL34B ribosomal protein L34.E |
| RAP1 | - | - | J | J4 | - | COG2167 | YJL189W | RPL39 60S large subunit ribosomal protein L39.E |
| RAP1 | - | - | J | J4 | - | COG2163 | YKL006W | RPL14A ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG0244 | YLR340W | RPP0 acidic ribosomal protein L10.E |
| RAP1 | - | - | J | J4 | - | COG0198 | YLR344W | RPL26A 60S large subunit ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG0102 | YNL069C | RPL16B 60S large subunit ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG1631 | YNL162W | RPL42A ribosomal protein L36A.E |
| RAP1 | - | - | J | J4 | - | COG0089 | YOL127W | RPL25 ribosomal protein L23A.E |
| RAP1 | - | - | J | J4 | - | COG2157 | YOR312C | RPL20B 60S large subunit ribosomal protein |
| RAP1 | - | - | J | J4 | - | COG1358 | YOR369C | RPS12 40S small subunit acidic ribosomal protein S12 |
| RAP1 | - | - | J | J4 | - | COG2139 | YPL079W | RPL21B ribosomal protein L21 |
| RAP1 | - | - | J | J4 | - | COG0256 | YPL131W | RPL5 60S large subunit ribosomal protein L5.E |
| RAP1 | - | - | J | J4 | - | COG1997 | YPR043W | RPL43A ribosomal protein L37A.E |
| RAP1 | - | - | J | J4 | - | COG0094 | YPR102C | RPL11A ribosomal protein L11.E |
| rap1 | RRPE | - | J | J4 | - | COG2058 | YDR382W | RPP2B 60S large subunit acidic ribosomal protein |
| rap1 | RRPE | - | J | J4 | - | COG1727 | YNL301C | RPL18B 60S large subunit ribosomal protein L18.E |
| rap1 | RRPE | - | J | J4 | - | COG0087 | YOR063W | RPL3 60S large subunit ribosomal protein L3.E |
| rap1 | - | - | J | J4 | - | COG1717 | YBL092W | RPL32 60S large subunit ribosomal protein L32.E |
| rap1 | - | - | J | J4 | - | COG2097 | YDL075W | RPL31A 60S large subunit ribosomal protein L31.E |
| rap1 | - | - | J | J4 | - | COG0080 | YEL054C | RPL12A 60S large subunit ribosomal protein L12.E |
| rap1 | - | - | J | J4 | - | COG2075 | YGL031C | RPL24A 60S large subunit ribosomal protein L24.E.A |
| rap1 | - | - | J | J4 | - | COG2075 | YGR148C | RPL24B 60S large subunit ribosomal protein L24.E.B |
| rap1 | - | - | J | J4 | - | COG2163 | YHL001W | RPL14B ribosomal protein |
| rap1 | - | - | J | J4 | - | COG0102 | YIL133C | RPL16A 60S large subunit ribosomal protein |
| rap1 | - | - | J | J4 | - | COG1552 | YKR094C | RPL40B ubiquitin |
| rap1 | - | - | J | J4 | - | COG2126 | YLR185W | RPL37A ribosomal protein L37.E |
| RAP1 | RRPE | PAC | K | - | - | COG0724 | YGR159C | NSR1 nuclear localization sequence binding protein |
| RAP1 | RRPE | PAC | K | K1 | - | COG0202 | YPR110C | RPC40 DNA-directed RNA polymerase I, III 40 kD |
| subunit | ||||||||
| RAP1 | RRPE | - | E | - | - | COG0421 | YLR146C | SPE4 spermine synthase |
| - | RRPE | - | E | - | - | COG0031 | YGR155W | CYS4 cystathionine beta-synthase |
| - | - | PAC | E | - | - | COG0833 | YGR191W | HIP1 histidine permease |
| - | - | - | E | - | - | COG0531 | YGL077C | HNM1 choline permease |
| - | - | - | E | - | - | COG0367 | YGR124W | ASN2 asparagine synthetase |
| - | - | - | E | - | - | COG0367 | YPR145W | ASN1 asparagine synthetase |
| - | RRPE | - | E | E7 | MIT | COG0059 | YLR355C | ILV5 ketol-acid reducto-isomerase |
| - | - | - | E | E7 | MIT | COG0473 | YIL094C | LYS12 homoisocitrate dehydrogenase |
| - | - | - | E | H7 | - | COG0722 | YBR249C | ARO4 2-dehydro-3-deoxyphosphoheptonate aldolase, tyrosine-inhibited |
| - | - | - | E | H7 | - | COG0082 | YGL148W | ARO2 chorismate synthase |
| - | - | - | EH | E7 | - | COG0028 | YGR087C | PDC6 pyruvate decarboxylase 3 |
| - | - | - | EH | E7 | - | COG0028 | YLR134W | PDC5 pyruvate decarboxylase, isozyme 2 |
| - | RRPE | - | F | F2 | - | COG0503 | YML022W | APT1 adenine phosphoribosyltransferase |
| - | - | PAC | F | F2 | - | COG0005 | YLR017W | MEU1 multiple enhancer of UAS2 |
| - | RRPE | PAC | F | F3 | - | COG0504 | YBL039C | URA7 CTP synthase 1 |
| - | RRPE | - | F | F3 | - | COG0167 | YKL216W | URA1 dihydroorotate dehydrogenase |
| - | RRPE | - | F | F3 | - | COG0461 | YML106W | URA5 orotate phosphoribosyltransferase |
| - | - | - | F | F3 | - | COG0563 | YDR226W | ADK1 adenylate kinase, cytosolic |
| - | - | - | F | F3 | - | COG0284 | YEL021W | URA3 orotidine-5'-phosphate decarboxylase |
| - | - | - | F | F3 | - | COG0563 | YKL024C | URA6 uridine-monophosphate kinase |
| - | - | - | F | F3 | - | COG0418 | YLR420W | URA4 dihydroorotase |
| - | RRPE | - | FE | F1 | - | COG0462 | YHL011C | PRS3 ribose-phosphate pyrophosphokinase |
| - | RRPE | - | FE | F1 | - | COG0462 | YKL181W | PRS1 ribose-phosphate pyrophosphokinase |
| - | RRPE | - | G | G1 | - | COG0205 | YGR240C | PFK1 6-phosphofructokinase, alpha subunit |
| - | - | - | G | G1 | - | COG0205 | YMR205C | PFK2 6-phosphofructokinase, beta subunit |
| - | - | - | G | G2 | - | COG0166 | YBR196C | PGI1 glucose-6-phosphate isomerase |
| - | - | - | G | G2 | - | COG0149 | YDR050C | TPI1 triose-phosphate isomerase |
| - | - | - | G | G2 | - | COG0057 | YJL052W | TDH1 glyceraldehyde-3-phosphate dehydrogenase 1 |
| - | - | - | G | G2 | - | COG0191 | YKL060C | FBA1 fructose-bisphosphate aldolase |
| - | - | - | G | G2 | - | COG0158 | YLR377C | FBP1 fructose-1,6-bisphosphatase |
| - | RRPE | PAC | G | G3 | - | COG0120 | YOR095C | RKI1 D-ribose-5-phosphate ketol-isomerase |
| - | RRPE | PAC | G | G3 | - | COG0021 | YPR074C | TKL1 transketolase 1 |
| - | RRPE | - | H | - | - | COG0499 | YER043C | SAH1 S-adenosyl-l-homocysteine hydrolase |
| - | - | - | H | - | - | COG0192 | YDR502C | SAM2 S-adenosylmethionine synthetase 2 |
| - | - | - | H | - | - | COG0192 | YLR180W | SAM1 S-adenosylmethionine synthetase 1 |
| - | RRPE | PAC | J | - | - | COG1889 | YDL014W | NOP1 fibrillarin |
| - | RRPE | PAC | J | - | - | COG1499 | YHR170W | NMD3 nonsense-mediated mRNA decay protein |
| - | RRPE | PAC | J | - | - | COG1498 | YLR197W | SIK1 involved in pre-rRNA processing |
| - | RRPE | PAC | J | - | - | COG0144 | YNL061W | NOP2 nucleolar protein |
| - | RRPE | PAC | J | - | - | COG1498 | YOR310C | NOP58 required for pre-18S rRNA processing |
| - | RRPE | PAC | J | - | - | COG1374 | YPL211W | NIP7 required for efficient 60S ribosome subunit biogenesis |
| - | RRPE | PAC | J | - | - | COG0030 | YPL266W | DIM1 rRNA (adenine-N6, N6-)-dimethyltransferase |
| - | RRPE | - | J | - | - | COG0293 | YCL054W | SPB1 required for ribosome synthesis, putative methylase |
| - | RRPE | - | J | - | - | COG0689 | YGR095C | RRP46 involved in rRNA processing |
| - | RRPE | pac | J | - | - | COG3277 | YHR089C | GAR1 nucleolar rrna processing protein |
| - | RRPE | - | J | - | - | COG2519 | YJL125C | GCD14 translational repressor of GCN4 |
| - | RRPE | - | J | - | - | COG0349 | YOR001W | RRP6 similarity to human nucleolar 100K polymyositis-scleroderma protein |
| - | - | PAC | J | - | - | COG0009 | YGL169W | SUA5 translation initiation protein |
| - | rrpe | PAC | J | - | - | COG1097 | YHR069C | RRP4 3' → 5' exoribonuclease required for 3' end formation of 5.8S rRNA |
| - | - | PAC | J | - | - | COG1736 | YKL191W | DPH2 diphtheria toxin resistance protein |
| - | - | PAC | J | - | - | COG1798 | YLR172C | DPH5 diphthamide methyltransferase |
| - | - | - | J | - | - | COG2123 | YDR280W | RRP45 rRNA processing protein |
| RAP1 | RRPE | - | J | J1 | - | COG2092 | YAL003W | EFB1 translation elongation factor eEF1beta |
| - | RRPE | - | J | J1 | - | COG1503 | YBR143C | SUP45 translational release factor |
| - | RRPE | - | J | J1 | - | COG0480 | YOR133W | EFT1 translation elongation factor EEF2 |
| - | RRPE | - | J | J1 | - | COG1601 | YPL237W | SUI3 translation initiation factor EIF2 beta subunit |
| - | - | - | J | J1 | - | COG0480 | YDR385W | EFT2 translation elongation factor EEF2 |
| - | - | - | J | J1 | - | COG0361 | YMR260C | TIF11 translation initiation factor EIF1A |
| - | RRPE | PAC | J | J2 | - | COG0215 | YNL247W | Cysteinyl-tRNA synthetase |
| - | RRPE | - | J | J2 | - | COG0423 | YBR121C | GRS1 glycine-tRNA ligase |
| - | RRPE | - | J | J2 | - | COG0008 | YGL245W | Strong similarity to glutamine-tRNA ligase |
| - | RRPE | - | J | J2 | - | COG0017 | YHR019C | DED81 asparaginyl-tRNA-Synthetase |
| - | RRPE | - | J | J2 | - | COG0008 | YOR168W | GLN4 glutaminyl-tRNA synthetase |
| - | - | - | J | J2 | - | COG0172 | YDR023W | SES1 seryl-tRNA synthetase, cytosolic |
| RAP1 | - | - | J | J3 | - | COG2051 | YHR021C | RPS27B ribosomal protein S27.E |
| - | RRPE | - | J | J3 | - | COG0199 | YLR388W | RPS29A ribosomal protein S29.E.A |
| - | RRPE | - | J | J3 | - | COG2053 | YOR167C | RPS28A 40S small subunit ribosomal protein S28.E.C15 |
| - | - | - | J | J3 | - | COG1998 | YLR167W | RPS31 ubiquitin/40S small subunit ribosomal protein |
| - | - | - | J | J3 | - | COG2053 | YLR264W | RPS28B 40S small subunit ribosomal protein S28.E.C12 |
| - | - | - | J | J3 | - | COG0096 | YLR367W | RPS22B ribosomal protein S15A.E.C12 |
| - | RRPE | PAC | J | J4 | - | COG0244 | YKL009W | MRT4 mRNA turnover 4 |
| - | RRPE | PAC | J | J4 | - | COG2075 | YLR009W | Similarity to ribosomal protein L24.E.B |
| - | RRPE | - | J | J4 | - | COG0088 | YBR031W | RPL4A ribosomal protein |
| - | RRPE | - | J | J4 | - | COG0088 | YDR012W | RPL4B ribosomal protein L4.E.B |
| - | - | - | J | J4 | - | COG2058 | YDL081C | RPP1A 60S large subunit acidic ribosomal protein A1 |
| - | - | - | J | J4 | - | COG1632 | YMR121C | RPL15B 60S large subunit ribosomal protein L15.E.C13 |
| - | - | - | J | J4 | - | COG2157 | YMR242C | RPL20A 60S large subunit ribosomal protein |
| - | - | - | J | J4 | - | COG0097 | YNL067W | RPL9B ribosomal protein L9.E.C14 |
| - | - | - | J | J4 | - | COG0081 | YPL220W | RPL1A ribosomal protein |
| - | RRPE | - | JE | J1 | - | COG0050 | YDR172W | SUP35 eukaryotic peptide chain release factor GTP-binding subunit |
| - | RRPE | - | JE | J1 | - | COG0050 | YER025W | GCD11 translation initiation factor EIF2 gamma chain |
| - | RRPE | PAC | K | - | - | COG0571 | YMR239C | RNT1 double-stranded ribonuclease |
| - | RRPE | PAC | K | - | - | COG0724 | YPL043W | NOP4 nucleolar protein |
| - | RRPE | - | K | - | - | COG0724 | YER165W | PAB1 mRNA polyadenylate-binding protein |
| - | - | - | K | - | - | COG0724 | YDR429C | TIF35 translation initiation factor EIF3 (P33 subunit) |
| - | - | - | K | - | - | COG0724 | YOR361C | PRT1 translation initiation factor EIF3 subunit |
| - | RRPE | PAC | K | K1 | - | COG2012 | YBR154C | RPB5 DNA-directed RNA polymerase I, II, III 25 kD subunit |
| - | - | PAC | K | K1 | - | COG1761 | YNL113W | RPC19 DNA-directed RNA polymerase I, III 16 kD subunit |
| - | - | - | K | K1 | - | COG0202 | YIL021W | RPB3 DNA-directed RNA-polymerase II, 45 kDA |
| - | - | - | K | K1 | - | COG1644 | YOR210W | RPB10 DNA-directed polymerase I, II, III 8.3 subunit |
| - | - | - | K | K1 | - | COG1758 | YPR187W | RPO26 DNA-directed RNA polymerase I, II, III 18 kD subunit |
| - | RRPE | PAC | L | - | - | COG1643 | YGL120C | PRP43 involved in spliceosome disassembly |
| - | RRPE | PAC | L | - | - | COG1643 | YMR128W | ECM16 similarity to helicases |
| - | RRPE | PAC | LKJ | - | - | COG0513 | YGL078C | DBP3 putative RNA helicase required for pre-rRNA processing |
| - | RRPE | PAC | LKJ | - | - | COG0513 | YGL171W | ROK1 ATP-dependent RNA helicase |
| - | RRPE | PAC | LKJ | - | - | COG0513 | YJL033W | HCA4 can suppress the U14 snoRNA rRNA processing function |
| - | RRPE | PAC | LKJ | - | - | COG0513 | YKR024C | DBP7 RNA helicase required for 60S ribosomal subunit assembly |
| - | RRPE | PAC | LKJ | - | - | COG0513 | YLL008W | DRS1 RNA helicase of the DEAD box family |
| - | RRPE | - | LKJ | - | - | COG0513 | YKR059W | TIF1 translation initiation factor 4A |
| - | - | PAC | LKJ | - | - | COG0513 | YJL138C | TIF2 translation initiation factor EIF4A |
| - | RRPE | - | O | - | - | COG0545 | YLR449W | FPR4 strong similarity to peptidylprolyl isomerase FPR3P |
| - | RRPE | - | O | - | - | COG0443 | YNL209W | SSB2 heat shock protein of HSP70 family, cytosolic |
| - | - | - | O | - | - | COG0443 | YHR064C | PDR13 regulator protein involved in pleiotropic drug resistance |
The first list comprises those genes known to be targeted by Rap1 [18] and the second one those that are not. In the presence of motif columns, '-' means that no motif was identified. Regulatory motifs in uppercase correspond to those found by our computational analysis, those in lowercase correspond to motifs found by others [10,18]. The genes are ordered by their functional category.
Figure 6Motifs found in upstream regions of genes involved in protein synthesis. (a-c) The motif logo and the distribution of scores from the matches of these motifs to the upstream regions of all yeast genes (dark blue) or just to those from which the motif was built (red). (a) The motif rap1 presents a periodicity that roughly corresponds to the pitch of a DNA helix and is similar to the sequence repeat found in telomeres, which is also targeted by the protein Rap1 [64]. (b) The motif rrpe contains an A-rich patch followed by a T-rich patch. The lengths of these two patches vary between genes. This motif may be palindromic. (c) The motif pac is made of highly conserved residues (around 100%) at several neighboring positions. (d) Ratio of genes in the cytoplasmic protein translation set and all S. cerevisiae genes matching at a given score to rap1 (purple), rrpe (red) and pac (orange) motifs.
Figure 7Cytoplasmic versus mitochondrial translation machinery. (a, b) The distribution of CCF values for gene pairs in which both members belong to the COG functional category J, protein translation. (a) The distribution of CCF values for gene pairs in which both members are mitochondrial proteins (dark blue) or one member is mitochondrial and the another one is not (red). (b) The distribution of CCF values for gene pairs in which both members are cytoplasmic (orange), both nuclear (red), or one member is cytoplasmic and the another one is nuclear (dark blue). (c) The averaged expression profiles of the mitochondrial translation machinery (dark blue) and the set of genes involved in protein translation in the cytoplasm, including those involved in the metabolism of sugar, amino acids and nucleotides, as well as RNA processing proteins and polymerases. The color-coding for the experiments is the same as in Figure 4. (d) Distribution of CCF values for pairs of genes that belong to the same COG. The cytoplasmic pairs (dark blue) correspond to paralog-paralog relationships whereas the ones involving cytoplasmic and mitochondrial proteins (red) correspond to orthologs. The expression of duplicated genes can be distinguished from each other because even though two genes can encode two proteins with identical amino-acid sequences, the degree of identity at the DNA level can be low enough to make a selective DNA hybridization onto the chip feasible. This seems to be the case for the genes analyzed in (d).
List of proteins in the E. coli protein translation operons
| Operon name | Eco ORF | Yeast | COG ID | Function | ORF description |
| tRNA synthetase and peptidase | b0026 | Yes* | COG0060 | J2 | Isoleucine tRNA synthetase |
| tRNA synthetase and peptidase | b0027 | No | COG0597 | N | Prolipoprotein signal peptidase |
| tRNA synthetase and peptidase | b0028 | No | COG1047 | O | Probable FKBX-type 16kD peptidyl-prolyl |
| rRNA modification and chaperone | b0049 | Yes* | COG0639 | T | Diadenosine tetraphosphatase |
| rRNA modification and chaperone | b0050 | No | COG2967 | P | Uncharacterized protein affecting Mg2+/Co2+ transport |
| rRNA modification and chaperone | b0051 | Yes** | COG0030 | J | 6-m-2-A methyltransferase; put. 16S rRNA methyltransferase |
| rRNA modification and chaperone | b0052 | No | COG1995 | H9 | Pyridoxine and pyridoxal phosphate biosynthesis |
| rRNA modification and chaperone | b0053 | Yes | COG0760 | O | Peptidyl prolyl isomerase |
| Ribosomal protein 1 | b0169 | Yes** | COG0052 | J3 | Ribosomal protein S2 |
| Ribosomal protein 1 | b0170 | No | COG0264 | J1 | Translation elongation factor EF-Ts |
| Ribosomal protein 1 | b0171 | No | COG0528 | F3 | Uridylate kinase |
| Ribosomal protein 1 | b0172 | Yes* | COG0233 | J1 | Ribosome releasing factor operon? |
| tRNA modification and protein export | b0405 | No | COG0809 | J | |
| tRNA modification and protein export | b0406 | No | COG0343 | J | Queuine tRNA-ribosyltransferase |
| tRNA modification and protein export | b0407 | No | COG1862 | N | ORF, hypothetical protein |
| tRNA modification and protein export | b0408 | No | COG0342 | N | Protein-export membrane protein SecD |
| tRNA modification and protein export | b0409 | No | COG0341 | N | Protein-export membrane protein SecF |
| Ribosomal protein 2 | b0910 | No | COG0283 | F3 | Cytidine monophosphate kinase |
| Ribosomal protein 2 | b0911 | Yes* | COG0539 | J3 | Ribosomal protein S1 |
| Ribosomal protein 3 | b1088 | No | COG1399 | R | Predicted metal-binding, possibly nucleic acid-binding protein |
| Ribosomal protein 3 | b1089 | Yes | COG0333 | J4 | Ribosomal protein L32 |
| tRNA synthetase and oxidase | b1637 | Yes* | COG0162 | J2 | Transfer RNA-Tyr synthetase |
| tRNA synthetase and oxidase | b1638 | Yes | COG0259 | H9 | Pyridoxamine 5'-phosphate oxidase |
| Phenylalanine tRNA synthetase | b1713 | Yes | COG0073 | R | Phenylalanine tRNA synthetase, beta subunit |
| Phenylalanine tRNA synthetase | b1714 | Yes* | COG0016 | J2 | Phenylalanine tRNA synthetase, alpha subunit |
| Ribosomal protein 4 | b2606 | Yes* | COG0335 | J4 | Ribosomal protein L19 |
| Ribosomal protein 4 | b2607 | No | COG0336 | J | tRNA (m1G) methyltransferae |
| Ribosomal protein 4 | b2608 | No | COG0806 | J | ORF, hypothetical protein |
| Ribosomal protein 4 | b2609 | Yes | COG0228 | J3 | Ribosomal protein S16 |
| Ribosomal protein 5 | b3065 | No | COG0828 | J3 | Ribosomal protein S21 |
| Ribosomal protein 5 | b3066 | No | COG0358 | L1 | DNA primase |
| Ribosomal protein 5 | b3067 | No | COG0568 | K1 | RNA polymerase sigma-subunit |
| Ribosomal protein 6 | b3164 | No | COG1185 | J | Polynucleotide phosphorilase |
| Ribosomal protein 6 | b3165 | Yes** | COG0184 | J3 | Ribosomal protein S15 |
| RNA modification | b3166 | Yes* | COG0130 | J | tRNA pseudouridine 55 synthase; P35 |
| RNA modification | b3167 | No | COG0858 | J | Ribosome-binding factor A; P15B |
| Transcription and translation | b3168 | Yes | COG0532 | J1 | Initiation factor IF2-alpha (infB) |
| Transcription and translation | b3169 | No | COG0195 | K2 | NusA protein |
| Transcription and translation | b3170 | No | COG0779 | S | Hypothetical 16.8 kD protein |
| Ribosomal protein 7 | b3185 | Yes* | COG0211 | J4 | Ribosomal protein L27 |
| Ribosomal protein 7 | b3186 | No | COG0261 | J4 | Ribosomal protein L21 |
| Ribosomal protein 8 | b3230 | Yes** | COG0103 | J3 | 30S ribosomal protein S9 |
| Ribosomal protein 8 | b3231 | Yes** | COG0102 | J4 | 50S ribosomal protein L13 |
| Ribosomal protein 9 | b3258 | Yes* | COG0591 | EHR | Pantothenate permease |
| Ribosomal protein 9 | b3259 | No | COG2264 | J | Ribosomal protein L11 methyltransferase |
| tRNA modification | b3287 | No | COG0242 | J1 | |
| tRNA modification | b3288 | Yes | COG0223 | J1 | Methionyl-tRNA formyltransferase |
| tRNA modification | b3289 | Yes** | COG0144 | J | Sun protein (Fmu protein) |
| Ribosomal protein 10 | b3294 | Yes* | COG0203 | J4 | 50S ribosomal protein L17 |
| Ribosomal protein 10 | b3295 | Yes** | COG0202 | K1 | RNA polymerase, alpha subunit |
| Ribosomal protein 10 | b3296 | Yes** | COG0522 | J3 | 30S ribosomal protein S4 |
| Ribosomal protein 10 | b3297 | Yes** | COG0100 | J3 | 30S ribosomal protein Sll |
| Ribosomal protein 10 | b3298 | Yes** | COG0099 | J3 | 30S ribosomal protein S13 |
| Ribosomal protein 11 | b3299 | Yes | COG0257 | J4 | 50S ribosomal protein X |
| Ribosomal protein 11 | b3300 | No | - | - | (secY) membrane protein, protein secretion |
| Ribosomal protein 11 | b3301 | Yes** | COG0200 | J4 | 50S ribosomal protein L15 |
| Ribosomal protein 11 | b3302 | Yes** | COG1841 | J4 | 50S ribosomal protein L30 |
| Ribosomal protein 11 | b3303 | Yes** | COG0098 | J3 | 30S ribosomal protein S5 |
| Ribosomal protein 11 | b3304 | Yes** | COG0256 | J4 | 50S ribosomal protein L18 |
| Ribosomal protein 11 | b3305 | Yes** | COG0097 | J4 | 50S ribosomal protein L6 |
| Ribosomal protein 11 | b3306 | Yes** | COG0096 | J3 | 30S ribosomal protein S8 |
| Ribosomal protein 11 | b3307 | Yes** | COG0199 | J3 | 30S ribosomal protein S14 |
| Ribosomal protein 11 | b3308 | Yes** | COG0094 | J4 | 50S ribosomal protein L5 |
| Ribosomal protein 11 | b3309 | Yes** | COG0198 | J4 | 50S ribosomal protein L24 |
| Ribosomal protein 11 | b3310 | Yes** | COG0093 | J4 | 50S ribosomal protein L14 |
| Ribosomal protein 12 | b3311 | Yes** | COG0186 | J3 | 30S ribosomal protein S17 |
| Ribosomal protein 12 | b3312 | Yes** | COG0255 | J4 | 50S ribosomal protein L29 |
| Ribosomal protein 12 | b3313 | Yes** | COG0197 | J4 | 50S ribosomal protein L16 |
| Ribosomal protein 12 | b3314 | Yes** | COG0092 | J3 | 30S ribosomal protein S3 |
| Ribosomal protein 12 | b3315 | Yes** | COG0091 | J4 | 50S ribosomal protein L22 |
| Ribosomal protein 12 | b3316 | Yes** | COG0185 | J3 | 50S ribosomal protein S19 |
| Ribosomal protein 12 | b3317 | Yes** | COG0090 | J4 | 50S ribosomal protein L2 |
| Ribosomal protein 12 | b3318 | Yes** | COG0089 | J4 | 50S ribosomal protein L23 |
| Ribosomal protein 12 | b3319 | Yes** | COG0088 | J4 | 50S ribosomal protein L4 |
| Ribosomal protein 12 | b3320 | Yes** | COG0087 | J4 | 50S ribosomal protein L3 |
| Ribosomal protein 12 | b3321 | Yes** | COG0051 | J3 | 30S ribosomal protein S10 |
| Ribosomal protein 13 | b3339 | Yes** | COG0050 | J1 | Protein chain elongation factor EF-Tu |
| Ribosomal protein 13 | b3340 | Yes** | COG0480 | J1 | Protein chain elongation factor EF-G |
| Ribosomal protein 13 | b3341 | Yes** | COG0049 | J3 | 30S ribosomal protein S7 |
| Ribosomal protein 13 | b3342 | Yes** | COG0048 | J3 | 30S ribosomal protein S12 |
| Dam superoperon | b3384 | Yes* | COG0180 | J2 | Tryptophanyl-tRNA synthetase |
| Dam superoperon | b3385 | Yes | COG0546 | R | 2-phosphoglycolate phosphatase |
| Dam superoperon | b3386 | Yes | COG0036 | G3 | D-ribulose-5-phosphate epimerase |
| Dam superoperon | b3387 | No | COG0338 | L | Adenine methylase |
| Dam superoperon | b3388 | No | COG3266 | S | Putative membrane protein; interferes with cell division |
| Dam superoperon | b3389 | Yes | COG0337 | H7 | 3-dehydroquinate synthase |
| Dam superoperon | b3390m | Yes | COG0703 | H7 | Shikimic acid kinase I |
| Glycine trna synthetase | b3559 | No | COG0751 | J2 | Glycine tRNA synthetase, beta chain |
| Glycine trna synthetase | b3560 | No | COG0752 | J2 | Glycine tRNA synthetase, alpha chain |
| Ribonuclease and pyrimidine biosynthesis | b3642 | Yes** | COG0461 | F3 | Orotate phosphoribosyltransferase |
| Ribonuclease and pyrimidine biosynthesis | b3643 | Yes** | COG0689 | J | Ribonuclease PH |
| Ribosomal protein 14 | b3703 | Yes | COG0230 | J4 | Ribosomal protein L34 |
| Ribosomal protein 14 | b3704 | No | COG0594 | J | Ribonuclease P protein component |
| Protein export and transcription | b3981 | No | COG0690 | N | Preprotein translocase secE subunit |
| Protein export and transcription | b3982 | Yes* | COG0250 | K2 | Transcription antitermination |
| Ribosomal protein 15 | b3983 | Yes** | COG0080 | J4 | 50S ribosomal protein Lll |
| Ribosomal protein 15 | b3984 | Yes** | COG0081 | J4 | 50S ribosomal protein Ll |
| Ribosomal protein 16 | b3985 | Yes** | COG0244 | J4 | 50S ribosomal protein L10 |
| Ribosomal protein 16 | b3986 | Yes | COG0222 | J4 | 50S ribosomal protein L7/L12 |
| Ribosomal protein 16 | b3987 | Yes* | COG0085 | K1 | DNA-directed RNA polymerase beta chain |
| Ribosomal protein 16 | b3988 | Yes* | COG0086 | K1 | DNA-directed RNA polymerase beta' chain |
| Ribosomal protein 17 | b4200 | Yes* | COG0360 | J3 | 30S ribosomal protein S6 |
| Ribosomal protein 17 | b4201 | No | COG2965 | L | Primosomal replication protein N |
| Ribosomal protein 17 | b4202 | Yes | COG0238 | J3 | 30S ribosomal protein S18 |
| Ribosomal protein 17 | b4203 | Yes | COG0359 | J4 | SOS ribosomal protein L9 |
The Yeast column indicates whether or not there is a yeast homolog of the bacterial gene (Yeast). The notation is as follows: No (genes lacking a yeast homolog), Yes (genes with a yeast homolog that was not present in the microarray data), Yes* (genes with a yeast homolog that, although present in the microarray data, were not part of the 'cytoplasmic translation' cluster) and Yes** (genes with a yeast homolog that are found in the cluster)
Distribution of functional classes in bacterial and archaeal 'translation operons'
| Function | Total | High frequency | Low frequency | |
| No | F3 | 1.57 | 0 | 2.94 |
| H | 0.79 | 0 | 1.47 | |
| I | 0.79 | 0 | 1.47 | |
| I2 | 1.57 | 0 | 2.94 | |
| J3 | 0.79 | 0 | 1.47 | |
| K2 | 1.57 | 0 | 2.94 | |
| L | 0.79 | 0 | 1.47 | |
| L1 | 0.79 | 0 | 1.47 | |
| M | 1.57 | 0 | 2.94 | |
| N | 0.79 | 0 | 1.47 | |
| P | 1.57 | 0 | 2.94 | |
| R | 2.36 | 0 | 4.41 | |
| S | 2.36 | 0 | 4.41 | |
| D | 0.79 | 0 | 1.47 | |
| Yes | E | 0.79 | 0 | 1.47 |
| H3 | 0.79 | 0 | 1.47 | |
| H5 | 0.79 | 0 | 1.47 | |
| I | 1.57 | 0 | 2.94 | |
| J | 0.79 | 0 | 1.47 | |
| L1 | 0.79 | 0 | 1.47 | |
| N | 1.57 | 0 | 2.94 | |
| R | 2.36 | 0 | 4.41 | |
| S | 0.79 | 0 | 1.47 | |
| Yes* | E9 | 0.79 | 0 | 1.47 |
| F5 | 0.79 | 0 | 1.47 | |
| I1 | 0.79 | 0 | 1.47 | |
| J | 1.57 | 0 | 2.94 | |
| J1 | 2.36 | 0 | 4.41 | |
| J2 | 2.36 | 0 | 4.41 | |
| K1 | 1.57 | 0 | 2.94 | |
| L1 | 0.79 | 0 | 1.47 | |
| O | 0.79 | 0 | 1.47 | |
| R | 1.57 | 0 | 2.94 | |
| Yes** | F1 | 0.79 | 0 | 1.47 |
| F3 | 0.79 | 0 | 1.47 | |
| J | 0.79 | 0 | 1.47 | |
| No | E | 0.88 | 0 | 2.78 |
| EM | 0.88 | 0 | 2.78 | |
| G2 | 0.88 | 0 | 2.78 | |
| J | 3.51 | 0 | 11.11 | |
| L | 0.88 | 0 | 2.78 | |
| R | 0.88 | 0 | 2.78 | |
| T | 0.88 | 0 | 2.78 | |
| Yes | J | 0.88 | 0 | 2.78 |
| J1 | 0.88 | 0 | 2.78 | |
| O | 0.88 | 0 | 2.78 | |
| R | 0.88 | 0 | 2.78 | |
| Yes* | C | 0.88 | 0 | 2.78 |
| E2 | 0.88 | 0 | 2.78 | |
| F | 0.88 | 0 | 2.78 | |
| F5 | 0.88 | 0 | 2.78 | |
| L | 0.88 | 0 | 2.78 | |
| O | 0.88 | 0 | 2.78 | |
| Yes** | ||||
| J | 4.39 | 0 | 13.89 | |
The list of functions is divided into blocks according to the presence or absence of a yeast homolog in the microarray data with the same coding as in Figure3. The last three columns represent the percentage of genes in each functional category with respect to all the genes in the 'operon' (Total), those genes found only in more than (High frequency) or less than 50% (Low frequency) of all the species studied. Functions found with high frequency in different organisms are highlighted in bold. It can be seen that the COG functional categories of the yeast, bacterial and archaeal genes are very similar, suggesting also that proteins with the same function, but not necessarily evolutionarily related, could replace each other in different organisms. More comprehensive tables of the data summarized in Tables 1,2,3,4 are provided as additional data files.
Figure 8Procedure for splitting genes of the same functional class into finer subgroups. See text for details.
Figure 9The range of CCF values for the selected classes and their genes. The red dots are the CCFs of the expression profiles of all the genes versus the averaged profile of the class they belong to. The blue points correspond to the means after averaging the CCFs of all the genes within a class.