| Literature DB >> 23593152 |
Robert Ashley Cox1, Maria Jesus Garcia.
Abstract
BACKGROUND: Microarray analysis is a powerful technique for investigating changes in gene expression. Currently, results (r-values) are interpreted empirically as either unchanged or up- or down-regulated. We now present a mathematical framework, which relates r-values to the macromolecular properties of population-average cells. The theory is illustrated by the analysis of published data for two species; namely, Mycobacterium bovis BCG Pasteur and Mycobacterium smegmatis mc(2) 155. Each species was grown in a chemostat at two different growth rates. Application of the theory reveals the growth rate dependent changes in the mycobacterial proteomes. PRINCIPALEntities:
Mesh:
Substances:
Year: 2013 PMID: 23593152 PMCID: PMC3625197 DOI: 10.1371/journal.pone.0059883
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Definitions of Variables.
| Symbol | Definition of variable (units in parentheses) |
|
| Dry cell mass (femtograms) per population-average cell. |
|
| Mass (femtograms) of RNA per population-average cell. |
|
| Number of amino acid residues of the protein fraction of population-average cells. |
|
| Numbers of population-average cells needed for synthesis of 1 pg of cDNA probe. |
|
| Gross number of copies per population-average cell of proteins |
|
| Apparent number, per population-average cell, of a reference culture of copies of, respectively, proteins |
|
| Apparent number, per population-average cell, of an experimental culture of copies of, respectively, proteins |
|
| The number of ribosomes per population-average cell. |
|
| Numbers, per population-average cell, of transcripts of ORF(i), ORF(j) and ORF(k) respectively. |
| ORF(i) | Open reading frame encoding protein |
| ORF(j) | ORF(j) encoding protein |
| ORF(k) | ORF(k) encoding protein |
|
| Fluorescence of cDNA samples prepared from reference and experimental cells respectively |
|
| The |
|
| Expression ratio for ORF(i), ORF(j) and ORF(k) respectively. |
|
| Fraction of ribosomes, per population-average cell, that is actively synthesizing protein. |
|
| The polypeptide chain elongation rate (amino acids h−1) of the protein fraction of population-average cells. |
|
| The polypeptide chain elongation rates of proteins |
| µ | Specific growth rate (h−1). |
|
| Number of transcripts of ORF(i) per picogram of RNA substrate for cDNA synthesis. |
|
| Specific protein synthesis rates (amino acid residues h−1) of population-average cells of reference and experimental cultures. |
Theoretical values are shown by a prime and double prime which, respectively, denote values for reference and experimental cell cultures. Empirical values are shown by hashes and asterisks which, respectively, denote values for reference and experimental cell cultures.
Equations used in the analysis of microarray data.
| Equation Number | Equation | Comment |
| 6 |
| General equation for transcripts of ORF(i): µ′≥ µ′′ |
| 9 |
| The gross abundance |
| 11b | ε′aa(k)/ε′′aa(k) = [(µ′/µ′′) +0.69]/1.69 | The equation is linear when µ′≥ µ′′. |
| 13c |
| Derived from equation (6) by substitution of < |
| 21 |
|
|
Genomic properties and growth conditions of the bacterial species studied.
| Property | BCG-Pasteur | Msmeg | |
| Genome | Size (base-pairs) | 4,374,522 | 6,988,209 |
| Pseudo genes | 32 | 168 | |
| ORFs | Total number | 4,033 | 6,938 |
| Protein coding | 3,949 | 6,716 | |
| Average size (base-pairs) | 976 | 907 | |
| Growth conditions | Chemostat | carbon limited | carbon limited# |
| Faster growth rate | 0.03 h−1 | 0.15 h−1 | |
| Slower growth rate | 0.01 h−1 | 0.01 h−1 | |
| Temperature | 37 °C | 37 °C | |
, [11]; #, [12].
Figure 1Histograms of expression ratios reported for M.tuberculosis, BCG-Pasteur and Msmeg.
(a) Profile of expression ratios reported for M.tuberculosis grown in rolling bottles. The wild type (the reference culture) was compared with a dosR minus mutant (the experimental culture). Both cultures were found to grow at the same rate (a doubling time of about 17 h). The profile represents 3850 ORFs. The red crosses mark the Gaussian distribution calculated for 3850 expression ratios of r = 1.0±0.09. (b) Profile of expression ratios reported for cultures of BCG-Pasteur grown in a chemostat at growth rates of µ = 0.01 h−1 (the experimental culture) and µ = 0.03 h−1 (the reference culture). A total of 3475 expression ratios were reported [11]; the average value was found to be r = 1.05±0.33. (c) Profile of expression ratios reported for cultures of Msmeg grown in a chemostat at growth rates of µ = 0.01 h−1 (the experimental culture) and µ = 0.15 h−1 (the reference culture). The complete profile comprised 6864 ORFs [12]. In all (a), (b) and (c) the hatched regions refer to 50 ORFs encoding for Zur-independent ribosomal proteins.
Macromolecular compositions of population-average cells of M.bovis BCG Pasteur grown in a chemostat§.
| Property | µ′′ = 0.01 h−1 | µ′ = 0.03 h−1 |
| Genomes per cell | 1.31 | 1.34 |
| DNA (fg) per cell, | 6.30 | 6.44 |
| Protein per cell | ||
| mass (fg), | 44.15 | 69.00 |
| amino acid residues ( | 2.47×108 | 3.86×108 |
| RNA (fg) per cell, | 2.62 | 9.53 |
| Ribosomes per cell ( | 1020 | 3730 |
| Fraction of ribosomes synthesizing proteins | 0.80 | 0.80 |
| Specific protein synthesis rate (amino acid residues per cell h−1) | 2.47×106 | 11.55×106 |
| Peptide chain elongation rate (amino acid residues per ribosome h−1) | 3.03×103 | 3.90×103 |
| Dry mass (fg) per cell, | 177.6 | 256.9 |
|
| 0.25 | 0.26 |
, The table is based on the ratios DNA: RNA: protein : dry cell mass reported for the two cultures of M. bovis BCG Pasteur [21] which were later investigated by microarray analysis [11].
, The numbers of genome equivalents per population-average cell were obtained by the methods described in Supporting data for Table 4 (Material S1).
, n aa(av) was calculated from m p(av) on the basis that 1 fg protein is equal to 5.6×106 amino acid residues [19].
, At any instant during exponential growth about 80% of ribosomes are engaged in peptide bond formation [19].
Figure 2Comparisons of observed and simulated profiles of expression ratios reported for BCG-Pasteur and Msmeg.
The experimental results are shown in black and the simulated profiles in red with crosses marking calculated values. (a) BCG-Pasteur. The calculations for the simulated profile were based on the assumption that the expression ratios of each of the proposed subgroups of ORFs conform to a normal Gaussian distribution with a standard deviation of ±0.15. The dominant subgroup (ORF(j) r (j) = 1.0±0.15) comprised 2350 constitutively expressed genes. The second subgroup (ORF(k)) comprised 100 ORFs (r (k) = 0.70±0.15). The third subgroup (ORF(l)) comprised 500 ORFs (r (l) = 0.80±0.15). The fourth subgroup (ORF(m)) comprised 500 ORFs (r (m) = 1.45±0.15). Thus 3450 genes were considered. The average expression ratio was found to be r = 1.02 per ORF. The distribution of ORFs according to the relative number (n*c–p(i)/n # c–p(i)) of copies of protein p (i) in experimental and reference cultures was shown. (b) Msmeg. The simulated profile was calculated for 6180 ORFs in the range r = 0.0–2.0 comprising five subgroups conforming to a normal Gaussian distribution, each with standard deviation of ±0.20. The compositions of the subgroups were as follows: 2750 constitutively expressed ORFs (r (j) = 1.0±0.20); 1750 ORFs (r (a) = 0.60±0.20), which includes ORFs of the subgroup ORF(k) (r (k) = 0.60±0.20); 1000 ORFs (r (b) = 1.35±0.20); 600 ORFs (r (c) = 1.70±0.20) and 350 ORFs (r (d) = 2.30±0.20). In all, 6450 ORFs were considered. (c) Refined simulated expression profiles for BCG-Pasteur. (d) Refined simulated expression profiles for Msmeg. The simulated profiles shown in (a) and (b) were recalculated in (c) and (d) by changing the standard deviations to ±0.1.
Expression ratios of ORF(k) encoding protein, p (k), whose gross abundance is directly proportional to the RNA content of population-average cells.
| Gene family | Average Expression ratios (<r | |
| BCG-Pasteur | M.smeg | |
| Ribosomal proteins | 0.71±0.10 | 0.62±0.25 |
| ATP synthase (subunits) | 0.75±0.10 | 0.53±0.11 |
| Aminoacyl tRNA synthases | 0.88±0.13 | 0.59±0.22 |
Comparison of properties of BCG-Pasteur derived from microarray studies with properties derived from the ratios DNA: RNA: protein: dry cell mass.
| Values calculated | ||
| Property | Microarray analysis | DNA : RNA : protein |
| (µ′′/µ′) | 0.33 | 0.33 |
| ( | 2.18 | 1.29 (2.18) |
| < | 0.71 | 0.43 (0.71) |
| ( | 2.20 | 3.64 (2.20) |
| < | 1.53 | 1.55 |
| ( | 1.56 | 1.56 |
| ( | na | 1.45 |
, See Table 4.
, Calculated using the equation in the shown comment to equation (21) (see Table 2) on the basis of the assumption that m′p(av)/m′′p(av) = 1.56
, Calculated using the equation in the shown comment to equation (21) (see Table 2) on the basis of the assumption that n′R(av)/n′′R(av) = 2.20.
na, not accessible from microarray data. Preferred values are enclosed in brackets.
Figure 3The relation of the ratios of the specific growth rates (µ′/µ′′) to the ratios of the peptide chain elongation rates (ε′aa(k)/ε′′aa(k)) of ribosomal proteins.
Values of (ε′aa(k)/ε′′aa(k)) were evaluated using equation (9), Table 2. The plot is linear when µ′>µ′′ (see equation (11b), Table 2). The reference point (µ′/µ′′) = (ε′aa(k)/ε ′′aa(k)) = 1.0 is shown as a square; the open circles refer to BCG-Pasteur and Msmeg; the filled circle refers to E. coli B/r (see Table 3, [19]). The broken line applies when µ′< µ′′.
Table 7. Examples of genes (ORF(j)) regulated by cell size (constitutive expression) ¶.
| BCG-Pasteur (H37Rv) | Msmeg | ||||
| gene | locus tag§ | r(j) value | locus tag | r(f) value | |
| ORFs involved in DNA replication | |||||
|
| BCG_0023c (Rv3917c) | 0.90 | MSMEG_6938 | 1.25 | |
|
| BCG_0024c (Rv3918c) | 0.96 | MSMEG_6939 | 0.74 | |
|
| BCG_0031 (Rv0001) | nr | MSMEG_6947 | 0.98 | |
|
| BCG_0032 (Rv0002) | 0.75 | MSMEG_0010 | 0.89 | |
|
| BCG_0390 (Rv0351) | nr | MSMEG_4504 | 0.89 | |
|
| BCG_1589 (Rv1537) | 1.06 | MSMEG_3172 | 0.87 | |
|
| BCG_1600 (Rv1547) | 0.99 | MSMEG_3178 | 0.91 | |
| BCG_2429c (Rv2413c) | 1.18 | MSMEG_4572 | 0.82 | ||
|
| BCG_2498 (Rv2478c) | 1.00 | MSMEG_4701 | 1.05 | |
|
| BCG_3442c (Rv3370c) | 1.71 | MSMEG_1633 | 0.95 | |
|
| BCG_3771c (Rv3711c) | 1.06 | MSMEG_6275 | 1.25 | |
| BCG_3781c (Rv3721c) | 1.46 | MSMEG_6285 | 0.47 | ||
| Not in BCG Pasteur (Rv1985c) | MSMEG_0548 | 0.96 | |||
| ORFs involved in cell wall formation | |||||
|
| BCG_1376 (Rv1315) | 0.89 | MSMEC_4932 | 0.95 | |
|
| BCG_1400 (Rv1338) | 1.06 | MSMEC_4903 | 1.01 | |
|
| BCG_2167c (Rv2150c) | 1.00 | MSMEG_4222 | 1.09 | |
|
| BCG_2498 (Rv2478c) | 0.88 | MSMEG_2690 | 0.93 | |
| ORFs involved in DNA repair | |||||
|
| BCG_0033 (Rv0003) | 0.89 | MSMEG_0003 | 0.86 | |
|
| BCG_0675c (Rv0629c) | 1.38 | MSMEG_1325 | 0.83 | |
|
| BCG_0677c (Rv0630c) | nr | MSMEG_1327 | 1.06 | |
|
| BCG_0678c (Rv0631c) | 1.43 | MSMEG_1328 | 0.81 | |
| BCG_1217 (Rv1156) | 0.70 | MSMEG_5156 | 1.05 | ||
|
| BCG_1734 (Rv1696) | 1.38 | MSMEG_3749 | 0.91 | |
| BCG_2136 (Rv2119) | 1.00 | MSMEG_3907 | 0.93 | ||
| BCG_2744 (Rv2731) | 0.89 | MSMEG_2731 | 1.13 | ||
|
| BCG_2749c (Rv2736c) | 1.15 | MSMEG_2724 | 0.71 | |
|
| BCG_2750c (Rv2737c) | 0.87 | MSMEG_2723 | 1.18 | |
|
| BCG_3081 (Rv3056) | 1.34 | MSMEG_2294 | 1.07 | |
|
| BCG_3650 (Rv3585) | 1.27 | MSMEG_6079 | 0.80 | |
|
| Not applicable | MSMEG_5397 | 0.85 | ||
| Other ORFs regulated by cell size | |||||
|
| BCG_0716 (Rv0667) | 1.11 | MSMEG_1367 | 1.33 | |
|
| BCG_0717 (Rv0668) | 0.90 | MSMEG_1368 | 0.96 | |
| BCG_2228 (Rv2212) | 1.22 | MSMEG_4279 | 0.94 | ||
|
| BCG_2555c (Rv2533c) | 1.14 | MSMEG_3036 | 0.86 | |
|
| BCG_2861c (Rv2841c) | 0.93 | MSMEG_2625 | 1.10 | |
, see equation (11) for definition; §, the loci numbers corresponding to M. tuberculosis H37Rv are also indicated between brackets, as it appears in [11]; nr, no result.
Figure 4Effects of growth rate on the expressions of ORFs encoding proteins involved in the bacterial outer permeability barrier.
(A) Two Component systems proteins of the BCG-Pasteur (a) and Msmeg (b). (B) ATP binding casette proteins of BCG-Pasteur (a) and Msmeg (b).
Figure 5A schematic view of a population-average cell of BCG-Pasteur (t d = 23 h) illustrating features important on growth including transcription/translation activities.
The cell is represented as a cylinder with hemispherical ends (axial ratio of 1:2). Transcription/translation activities are based on µ = 0.03 h−1, nR(av) = 3730 ribosomes and mp(av) = 69 fg protein per population-average cell (see Table 4). Porins are shown as Y-shaped channels traversing the outer membrane permeability barrier. (i). Summary of genomic properties per population-average cell (see Tables 3 and 4). The properties of the genome; namely the number of rrn operons (solid bars) and the number of ‘average’ ORFs are (stippled bars) presented within the square brackets. The number of genome equivalents is indicated outside the brackets on the lower right hand side. (ii)Transcription/translation activity of an ‘average’ ORF is represented schematically within the square brackets in the form of a fibril diagram. The size (bp) of an ‘average’ ORF is shown and also the locations of the 5′-ends of nascent mRNA transcripts. Nascent polypeptide chains are not shown. The proportion of non-programmed ribosomes per ORF is indicated by the relative number of free ribosomes. The number of ‘average’ ORFs being transcribed/ translated at any instant is shown by the number outside the square brackets on the lower right hand side; vertical. Black bars along the ORF represent RNAP holoenzymes; filled circles represent ribosomes; and the lines joining these circles represent nascent mRNA. (iii)Rate of synthesis of ‘average’ proteins. The rate of synthesis per fibril is indicated within the square brackets. The number of fibrils synthesizing protein is given outside the square brackets on the lower right-hand side. The product of the two numbers provides the specific protein synthesis rate (amino acid residues h−1). Sections (i), (ii) and (iii) are based on Cox [16].
Concentrations of components of population-average cells of BCG-Pasteur; comparison with E.coli B/r.
| BCG-Pasteur | ||||
| Property | Unit | Experimental culture | Reference culture |
|
| Growth rate | h−1 | 0.01 | 0.03 | 0.42 |
|
| fg | 0.30 | 6.44 | 8.09 |
|
| Ribosomes/cell | 1735 | 3730 | 6800 |
|
| fg | 7.80 | 16.80 | 30.60 |
|
| fg | 44 | 69 | 100 |
|
| fg | 178 | 257 | 198 |
|
| fg | 0.25 | 0.27 | 0.51 |
|
| fl, µm3 | 0.54 | 0.79 | 0.61 |
|
| fg/fl, mg/ml | 11.7 | 8.2 | 13.3 |
|
| fg/fl, mg/ml | 14.4 | 21.3 | 50.2 |
|
| fg/fl, mg/ml | 81.5 | 87.3 | 163.9 |
[19].
m R(av), the mass of n R(av) ribosomes was calculated on the basis of the assumption that Mr = 2.7×106 Da so that the mass of a ribosome is 4.5×10−3 fg.
ν (av), the volume of a population-average cell was calculated from the equation ν (av) fl = (m dc(av)/1,000)/(0.3ρc) where ρ c = 1.09 fg/fl is the density of a population-average cell [45]: m dc(av) is thought to comprise 30% of the cell mass.
c DNA (av) etc. refer to concentrations (m DNA(av)/ (av)) of DNA etc.