| Literature DB >> 30558585 |
Gabriela I Guzmán1, Connor A Olson1, Ying Hefner1, Patrick V Phaneuf2, Edward Catoiu1, Lais B Crepaldi1,3, Lucas Goldschmidt Micas1,4, Bernhard O Palsson1,5,6, Adam M Feist7,8.
Abstract
BACKGROUND: Essentiality assays are important tools commonly utilized for the discovery of gene functions. Growth/no growth screens of single gene knockout strain collections are also often utilized to test the predictive power of genome-scale models. False positive predictions occur when computational analysis predicts a gene to be non-essential, however experimental screens deem the gene to be essential. One explanation for this inconsistency is that the model contains the wrong information, possibly an incorrectly annotated alternative pathway or isozyme reaction. Inconsistencies could also be attributed to experimental limitations, such as growth tests with arbitrary time cut-offs. The focus of this study was to resolve such inconsistencies to better understand isozyme activities and gene essentiality.Entities:
Keywords: Adaptive evolution; Essentiality; Genome-scale model
Mesh:
Substances:
Year: 2018 PMID: 30558585 PMCID: PMC6296033 DOI: 10.1186/s12918-018-0653-z
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1Project workflow and growth characterizations of false positive strains. a A workflow summarizing the sequence of analyses followed and results from this study. A Keio gene KO collection strain was not available for approximately half of the corresponding false positive genes listed in [13], most likely due to essentiality on rich growth media [22]. Of those strains that were viable in rich media and thus available for a longer growth test, approximately half showed growth in minimal defined media. Those strains that grew were analyzed for mutations. Five false positive strains showed mutations in all replicate experiments sequenced. Four strains showed mixed results, meaning that only some populations accrued mutations. Two strains showed no mutations in any replicate samples. The nine strains that showed mutations in at least some populations were further analyzed in the context of model-predicted alternate pathways and historical data. Six of these cases showed agreement with model-predictions, two showed agreement with previous reports of multi-copy suppression [81], and mutation analysis for one case was not clearly linked to either. b Growth curves of eleven Keio collection strains associated with false positive predictions in defined minimal media. Growth data in terms of cellular density in grams of dry weight per Liter (gDW/L) is reported for the FP gene KO strains. Those strains that accrued mutations in all replicate populations during this growth test are noted with small dashed lines. Those strains that showed mixed results, showing mutations in only some populations, are noted with larger dashed lines. All Keio strains were grown in M9 minimal medium with glucose as the carbon source with the exception of ΔcysK and ΔcysP which utilized a glycerol carbon source. Growth of the wild type strain in glucose and glycerol is also provided as a point of reference (black and grey growth curves)
Strain details from growth characterizations
| Keio strain | Possible alternate | Final cell density (gDW/L) | Time to > half | Mutations |
|---|---|---|---|---|
| genes/pathways | *mean, Std. Dev., %RSD | final density (Hrs) | flask1 | |
| *mean, Std. Dev., %RSD | population? | |||
| WT glucose | - | 1.0, <0.01, 0.2% | 10, <1, <0.01% | No |
| WT glycerol | - | 1.1, 0.04, 3% | 12, <1, 0.04% | No |
|
| Alternate growth using | 0.43, 0.08, 20% | 14, 1, 9% | No |
| demethylmenaquinone1 | ||||
|
|
| 1.0, 0.1, 10% | 21, <1, <0.01% | Variable |
|
| ( | 1.0, 0.1, 9% | 28, 2, 7% | Variable |
|
| ( | 0.28, 0.1, 50% | 36, 9, 20% | Variable |
| ( | ||||
|
| ( | 1.1, 0.1, 8% | 43, <1, 0.04% | Yes |
|
| ( | 0.50, 0.2, 40% | 58, 17, 30% | Variable |
| ( | ||||
|
| ( | 1.1, 0.1, 10% | 65, <1, 0.2% | No |
|
|
| 1.1, 0.3, 20% | 190, 26, 10% | Yes |
|
|
| 0.66, 0.3, 50% | 83, 10, 10% | Yes |
|
| 1.1, 0.1, 10% | 100, 5, 5% | Yes | |
|
| 0.95, 0.2, 20% | 110, 8, 7% | Yes |
1Evidence for this described in [64]. 2In silico prediction, iJO1366 genome-scale model of metabolism [13]. 3Experimental multicopy suppression evidence [19]. *The data used from triplicate experiments represented in Fig. 1b was used to calculate means, standard deviations (St. Dev.), and percent relative standard deviations (%RSD)
Flask 1 population mutations
| Keio | Exp. # | Fraction | Gene | Protein change | Perceived impact |
|---|---|---|---|---|---|
| strain | population | ||||
|
| 4 | 0.58 |
| N226S | - |
| 2 | 0.46 |
| G235A | - | |
| 2 | 0.23 |
| V46E | *Reduce MetJ repression1 | |
| 1 | 0.79 | Intergenic (-211/-66) | *Reduce MetJ repression1 | ||
| 3 | - | 117 genes | 130 kbp, 1.2X GDA | Increase | |
| [ | |||||
|
| 4, 5 | 1.0, 0.82 |
| G65S | - |
| (1, 4, | (1.0, 1.0, | Intergenic (+328/-96) | *Reduce GalR repression2 | ||
| 5, 6) | 0.36, 0.39) | ||||
| 5 | 0.25 | Intergenic (+333/-91) | Reduce GalR repression2 | ||
| 3 | 1.0 | Intergenic (+334/-90) | Reduce GalR repression2 | ||
| 6 | 0.57 | Intergenic (+339/-85) | Reduce GalR repression2 | ||
| 3 | 1.0 |
| T141P | Global regulatory effects3 | |
| 5 | 0.70 |
| G142S | Global regulatory effects3 | |
| 6 | 0.59 |
| G142D | Global regulatory effects3 | |
| 4 | 1.0 |
| R143H | Global regulatory effects3 | |
| 1 | 1.0 |
| A145V | Global regulatory effects3 | |
| 6 | 0.43 |
| I187T | Global regulatory effects3 | |
| 7 | 1.0 | 96 genes | 99 kbp, 2X GDA | Affect | |
| [ | |||||
|
| 1 | 0.67 | Intergenic (+41/-105) | *Increase | |
| 5 | 0.43 | Intergenic (+653/+370) | - | ||
| 1 | 0.27 | Intergenic (+883/+140) | - | ||
| 6 | 0.88 |
| His tRNA (5/77 bp) | Increase | |
| 2 | 0.82 |
| His tRNA (48/77 bp) | Increase | |
| 4 | 0.92 |
| His tRNA (67/77 bp) | *Increase | |
| 7 | 0.71 |
| His tRNA (72/77 bp) | Increase | |
|
| 2 | 1.0 |
| 1 bp Del | - |
| 1 | 0.77 |
| G282D | Reduce ArgD activity | |
| 2 | 0.56 |
| Del (772-774/1221 bp) | Reduce ArgD activity | |
| 4 | 0.43 |
| Q154* | Reduce ArgD activity | |
| 3 | 0.34 |
| G49R | Reduce ArgD activity | |
|
| 6, 7 | 0.83, 0.72 |
| F463L | Reduce GlnA activity |
| 5 | 0.86 |
| D187E | Reduce GlnA activity | |
| 4 | 0.45 |
| G179C | Reduce GlnA activity | |
| 4 | 0.23 |
| H172R | Reduce GlnA activity | |
| 1 | 1.0 |
| G171S | Reduce GlnA activity | |
| 2 | 0.86 |
| E156D | Reduce GlnA activity | |
| 3 | 1.0 |
| S148F | Reduce GlnA activity | |
|
| 7 | 0.27 | Intergenic (+2/-16) | Increase | |
| 5 | 0.86 |
| L11L | - | |
| 3, 7 | 0.31, 0.81 |
| E96* | - | |
| 3, 7 | 1.0 | 508 genes | 520 kbp, 2X GDA | Increase | |
| [ | |||||
|
| 4 | 0.55 |
| 10bp Dup (227/1029 bp) | Reduce MalI activity5 |
| 2 | 0.32 |
| Q55* | Reduce MalI activity5 | |
| 3 | 0.55 |
| Q529Q | - | |
|
| 2, 4 | 0.72, 0.38 |
| L317R | - |
| 4 | 1.0 |
| 1 bp Del (1144/1161 bp) | - | |
|
| 1 | 1.0 | 2,062 genes | 2.1 Mbp, 2X GDA | Increase |
| [ | |||||
| 3 | 0.21 | Intergenic (-10/+385) | - |
GDA abbreviation stands for genome duplication amplification event. 1Binding site recognition sites predicted in [26]. 2Evidence to support binding site in [31]. 3Information regarding CRP global regulatory effects in [32–34]. 4Attenuator-model of regulation for histidine operon described in [44–47]. 5Information related to the MalI transcriptional repressor in [38, 39]. *Perceived impact was further confirmed by RNA sequencing results (Fig. 3, Table 3)
Fig. 3RNA sequencing data for ΔthrA and ΔserB experiments support isozyme predictions and mutation analysis. Volcano plots display log2(fold change) on the x-axis and -log10(p-adjusted value) on the y-axis for RNA sequencing data acquired for ΔthrA and ΔserB experiments. The log2(fold change) and p-adjusted value data was acquired comparing the experimental condition expression data (biological duplicate replicates) to wild type BW25113 expression data (biological duplicate replicates). Each dot in the plots corresponds to the differential expression data of a gene. Panels A. and B. correspond to the data for ΔthrA experiments and panels c and d correspond to the data for ΔserB experiments. The vertical dashed lines in each plot corresponds to the 2X fold change line in either direction (log2(fold change) = 1, -1) and the horizontal dashed line corresponds to the 0.05 p-adjusted value line (-log10(p-adjusted value) = 1.3). Highlighted in red in panels a and b are the metL and metB genes, which are expressed in the same operon. Of interest for the thrA experiments was the up-regulation of the predicted isozyme, metL. Highlighted in red in panels c and d are his operon genes that were up-regulated in both ΔserB experiments. Of particular interest was hisB expression, the predicted isozyme for serB
Differential expression of alternate pathways/genes in knockout strains compared to wild type (RNAseq Data)
| Keio strain | Experiment | Predicted | Log2(Fold Change) | P-adjusted value |
|---|---|---|---|---|
| clone/population | alternate gene | experiment vs. wild type | ||
|
| Exp.1, Flask 5, Clone |
| 2.18 | 2.06E-7 |
|
| Exp.2, Flask 1, Population |
| 1.13 | 3.48E-2 |
|
| Exp.1, Flask 5, Clone |
| 1.73 | 2.50E-2 |
|
| Exp.3, Flask 1, Population |
| 0.908 | 5.46E-1 |
|
| Exp.1, Flask 5, Clone |
| 3.75 | 3.60E-11 |
|
| Exp.4, Flask 1, Population |
| 3.28 | 2.36E-9 |
Fig. 2Pathway maps related to ΔthrA, ΔptsI, and ΔserB false positive cases. Whole genome sequencing analysis revealed that for two out of three of these cases the model prediction was in agreement with the observed utilized pathway as inferred from mutation analysis. The associated gene-protein-reaction information for each case is highlighted. In a, the mutation results for ΔthrA imply that MetL (highlighted in orange) is the enzyme responsible for the isozyme activity as predicted. b Results for ΔptsI suggest that the predicted alternate pathway related to GalP is utilized in the absence of PtsI. c Results for ΔserB suggest that contrary to the predicted GlyA associated alternate pathway, HisB is responsible for rescuing growth in the absence of SerB
Fig. 4Structural mutations observed in ΔproA and ΔproB experiments analyzed in relation to ArgE underground activity. a Metabolic pathway maps related to ΔproA and ΔproB false positive cases. Both are involved in L-proline synthesis. Model simulations predict using an alternate pathway related to arginine and ornithine synthesis to rescue a proA/proB deficient E. coli strain. Mutations were observed in the coding regions of the metabolic genes argD and glnA. It is suggested that reduced flux through these enzymes increases flux through the ArgE associated underground activity, thus increasing production of L-proline and allowing for cell growth. b Mutation analysis in relation to the glutamine synthetase (GlnA) protein structure. An I-Tasser-predicted protein structure is provided [55] and the amino acid residue associated with observed glnA mutations in the ΔproB populations are highlighted in red. Those residues associated with ligand binding based on the crystal structure of the Salmonella typhimurium GlnA enzyme [56] are highlighted in blue. The mutations appear to be in buried regions of the homo-dodecameric enzyme at the interface of chain-chain interactions