| Literature DB >> 19690570 |
Patrick F Suthers1, Alireza Zomorrodi, Costas D Maranas.
Abstract
Synthetic lethals are to pairs of non-essential genes whose simultaneous deletion prohibits growth. One can extend the concept of synthetic lethality by considering gene groups of increasing size where only the simultaneous elimination of all genes is lethal, whereas individual gene deletions are not. We developed optimization-based procedures for the exhaustive and targeted enumeration of multi-gene (and by extension multi-reaction) lethals for genome-scale metabolic models. Specifically, these approaches are applied to iAF1260, the latest model of Escherichia coli, leading to the complete identification of all double and triple gene and reaction synthetic lethals as well as the targeted identification of quadruples and some higher-order ones. Graph representations of these synthetic lethals reveal a variety of motifs ranging from hub-like to highly connected subgraphs providing a birds-eye view of the avenues available for redirecting metabolism and uncovering complex patterns of gene utilization and interdependence. The procedure also enables the use of falsely predicted synthetic lethals for metabolic model curation. By analyzing the functional classifications of the genes involved in synthetic lethals, we reveal surprising connections within and across clusters of orthologous group functional classifications.Entities:
Mesh:
Year: 2009 PMID: 19690570 PMCID: PMC2736653 DOI: 10.1038/msb.2009.56
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1Topological and functional classification of clusters of SL gene pairs. Three types of network motifs are present: disjoint pairs (left); stars, or 1-connected motifs (center); and highly connected subgraphs, or k-connected motifs (right). Genes are color-coded in accordance to the COG (Tatusov ) functional categorization. Names of genes are set in italics and the names of non-gene associated reactions are set in roman. Note that all the reaction abbreviations follow those in iAF1260 (Feist ).
Figure 2(A) Reaction centric view of cluster J in Figure 1. The reaction abbreviations follow those in iAF1260 (Feist ). GLYCL, glycine cleavage system; PGCD, phosphoglycerate dehydrogenase; PSP_L, phosphoserine phosphotase (PSP_L); GHMT2r, glycine hydroxymethyl). (B) Arranged gene/reaction associations of cluster J.
Number of essential genes and genes involved in SL gene pairs for different COG (Tatusov et al, 2003) functional classes
| COG functional class | COG Abbreviation | # of essential genes | # of genes involved in SL pairs |
|---|---|---|---|
| Amino acid transport and metabolism | E | 55 | 28 |
| Nucleotide transport and metabolism | F | 22 | 13 |
| Inorganic ion transport and metabolism | P | 7 | 12 |
| Energy production and conversion | C | 7 | 8 |
| Cell wall/membrane/envelope biogenesis | M | 20 | 6 |
| Carbohydrate transport and metabolism | G | 3 | 6 |
| Coenzyme transport and metabolism | H | 59 | 6 |
| Replication, recombination and repair | L | — | 2 |
| Lipid transport and metabolism | I | 18 | 1 |
| Post-translational modification, protein turnover, chaperons | O | — | 1 |
| Secondary metabolites biosynthesis, transport and catabolism | Q | 3 | — |
| General function prediction only | R | 2 | — |
| Translation, ribosomal structure and biogenesis | J | 1 | — |
| Signal transduction mechanism | T | 1 | — |
| Defense mechanisms | V | 1 | — |
Direct and indirect experimental evidence for predicted SL gene pairs and their auxotrophic characteristics
| SL gene pair | Topology | Experimental evidence | Growth supplementation of mutant strain |
|---|---|---|---|
| | Disjoint pair | D-alanyl-D-alanine (D-Ala-D-Ala) dipeptide | |
| | Disjoint pair | D-Ala | |
| | Disjoint pair | D-arabinose 5-phosphate | |
| | Disjoint pair | Glutamate | |
| | Disjoint pair | Aromatic amino acids and Vitamin B6 | |
| | Disjoint pair | High CO2 concentration | |
| | Disjoint pair | Methionine | |
| | Disjoint pair | Cysteine | |
| | Disjoint pair | Arginine | |
| | Disjoint pair | Aromatic amino acids | |
| | Cluster A | Purine | |
| SLs with indirect evidence | |||
| | Disjoint pair | — | |
| | Disjoint pair | — | |
| | Disjoint pair | — | |
| | Cluster E | — | |
| | Cluster H | — |
Mismatches between the predicted SL pairs and experimental data for single gene knockouts
| Gene (Blattner no.)a | Topology | Experimental condition for which is essentialb |
|---|---|---|
| aAll listed genes are reported as essential based on experimental data on glucose MOPS medium ( | ||
| b | ||
| cAt least one of the genes forming a pair with these genes is not expressed under aerobic glucose conditions based on data from ( | ||
| dClassified as non-essential based on the analysis of the glucose minimal medium data of ( | ||
| Disjoint pair | Always | |
| Disjoint pair | Always | |
| Disjoint pair | Shared | |
| Disjoint pair | Always | |
| Disjoint pair | Always | |
| Disjoint pair | Always | |
| Disjoint pair | Glucose | |
| Disjoint pair | Shared | |
| Disjoint pair | Shared | |
| Disjoint pair | Shared | |
| Disjoint pair | Shared | |
| Disjoint pair | Shared | |
| Cluster C | Always | |
| Cluster C | Always | |
| Cluster C | Always | |
| Cluster D | Shared | |
| Cluster E | Always | |
| Cluster E | Always | |
| Cluster F | Always | |
| Cluster H | Always | |
| Cluster H | Always | |
| g | Cluster J | Shared |
| Cluster J | Glucose | |
| Cluster J | Shared | |
| Cluster J | Shared | |
Model refinements for iAF1260 suggested by SL gene pair analysis
| Modificationa | Comments |
|---|---|
| aAll modifications are for aerobic glucose conditions unless specified otherwise. GPR, gene–protein–reaction association. | |
| bNot expressed under aerobic glucose conditions based on data from ( | |
| cThe expression level of | |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress PDX5PO2 | Cannot complement |
| Suppress OPHHX3 under aerobic conditions | Cannot complement |
| Suppress R1PK under aerobic conditions | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Suppress | Cannot complement |
| Change HSDy GPRa relationship from OR to AND | |
| Suppress | Hypothesis |
| Suppress | Cannot complement |
| Suppress | |
Figure 3Topological classification of motifs in SL gene triples. Both disjoint triples (left) and k-connected triples (right) are seen. Names of genes are set in italics and the names of non-gene associated reactions are set in roman type. Note that all the reaction abbreviations follow those in iAF1260 (Feist ).
Comparison of the approximate CPU time (single 3 GHz) for finding each essential reaction, SL reaction pair, triples and quadruples using the exhaustive and targeted enumeration approaches, respectively
| Order of SLs | Exhaustive enumeration | Targeted enumeration | ||
|---|---|---|---|---|
| Possible combinations | SL (%) | CPU time/SL | CPU time/SL | |
| ND, not determined. | ||||
| ∼2050 | 13.5 | ∼1 s | ∼5 s | |
| ∼313,000 | 0.03 | ∼28 min | ∼12 min | |
| ∼8.3 × 107 | 2.9 × 10−4 | ∼2 days | ∼40 min | |
| ∼1.6 × 1010 | ND | ND | ∼5 h/SL | |
Figure 4Topological classification of motifs in SL reaction triples. Similarly with SL gene triples in Figure 3, SL reaction triples occur as both disjoint triples (left) and k-connected triples (right). Note that all the reaction abbreviations follow those in iAF1260 (Feist ).
Frequency of participation of reactions in multiple SL triples
| # of SL triples | # of reactions (Rxn abb.) |
|---|---|
| All abbreviations follow those in | |
| 1 | 42 |
| 2 | 67 |
| 3 | 8 |
| 4 | 12 |
| 5 | 3 |
| 6 | 5 |
| 7 | 1 |
| 8 | 4 |
| 9 | 4 |
| 10 | 5 |
| 11 | 1 (RPI) |
| 12 | 1 (PPS) |
| 17 | 1 (PGI) |
| 20 | 1 (PGM) |
| 27 | 3 (GAPD, PGK, TALA) |
| 33 | 1 (RPE) |
| 34 | 3 (ATPS4rpp, FTHFD, GARFT) |
| 35 | 1 (TPI) |
Figure 5Venn diagram of the number of genes and reactions participating in SLs of order one, two and three.
Figure 6Pictorial view of all SLs for reaction GLYCL (glycine cleavage system). Double, solid and dashed lines depict SL pairs, triples and quadruples, respectively. The reaction abbreviations follow those in iAF1260 (Feist ). Note that for ease of presentation reactions that occur in both SL triples and SL quadruples are repeated (marked with a star) and we show neither the SL quadruples containing the coupled fluxes nor any of the SL quintuples. The complete list of all SL quadruples and quintuples for GLYCL are given in Supplementary information.
Distribution of the degree of essentiality (DOE) for all genes/reactions in the network. Table entries represent how many gene/reactions with a specific degree of essentiality belong to each COG functional class (see Table I for COG abbreviations). Note that some gene/reactions may belong to more than one COG functional class
| Cog Class | Genes | Reactions | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DOE | Blocked | Total | DOE | Blocked | Total | |||||||
| 1 | 2 | 3 | 4+ | 1 | 2 | 3 | 4+ | |||||
| J | 1 | 0 | 1 | 0 | 24 | 26 | 1 | 0 | 1 | 3 | 25 | 30 |
| K | 0 | 0 | 0 | 2 | 2 | 4 | 0 | 1 | 0 | 6 | 1 | 8 |
| L | 0 | 2 | 0 | 3 | 0 | 5 | 1 | 0 | 1 | 12 | 0 | 14 |
| V | 1 | 0 | 1 | 4 | 0 | 6 | 4 | 1 | 1 | 38 | 0 | 44 |
| T | 1 | 0 | 0 | 2 | 1 | 4 | 1 | 0 | 0 | 9 | 8 | 18 |
| M | 19 | 6 | 20 | 37 | 13 | 95 | 42 | 7 | 25 | 173 | 210 | 457 |
| U | 0 | 0 | 0 | 2 | 0 | 2 | 0 | 1 | 0 | 4 | 7 | 12 |
| O | 0 | 1 | 3 | 11 | 8 | 23 | 1 | 10 | 2 | 8 | 16 | 37 |
| C | 7 | 8 | 28 | 62 | 67 | 172 | 7 | 19 | 11 | 55 | 99 | 191 |
| G | 2 | 6 | 9 | 62 | 103 | 182 | 3 | 12 | 24 | 74 | 123 | 236 |
| E | 52 | 28 | 4 | 116 | 30 | 230 | 72 | 20 | 12 | 140 | 70 | 314 |
| F | 22 | 13 | 2 | 25 | 5 | 67 | 28 | 18 | 13 | 49 | 43 | 151 |
| H | 49 | 6 | 3 | 11 | 28 | 97 | 60 | 12 | 4 | 37 | 47 | 160 |
| I | 17 | 1 | 4 | 19 | 13 | 54 | 48 | 25 | 3 | 118 | 70 | 264 |
| P | 7 | 9 | 10 | 34 | 33 | 93 | 11 | 19 | 6 | 112 | 58 | 206 |
| Q | 0 | 0 | 0 | 6 | 4 | 10 | 22 | 4 | 2 | 39 | 22 | 89 |
| R | 0 | 0 | 2 | 16 | 14 | 32 | 15 | 6 | 9 | 97 | 78 | 205 |
| S | 0 | 0 | 0 | 3 | 1 | 4 | 1 | 0 | 2 | 9 | 3 | 15 |
| None | 10 | 3 | 13 | 73 | 55 | 154 | 30 | 8 | 17 | 123 | 125 | 303 |
Figure 7Percentage of degree of essentiality in different parts of the network arranged by the COG (Tatusov ) functional classifications. (A) Genes. (B) Reactions. The numbers on the right indicate the total number of items in each COG classification. Note that some genes/reactions belong to more than one COG classification.
Figure 8Color-coded representation of the reactions in central metabolism according to their degree of essentiality.