| Literature DB >> 25315994 |
Mathieu Larocque1, Thierry Chénard2, Rafael Najmanovich3.
Abstract
BACKGROUND: Clostridium difficile is the leading cause of hospital-borne infections occurring when the natural intestinal flora is depleted following antibiotic treatment. Current treatments for Clostridium difficile infections present high relapse rates and new hyper-virulent and multi-resistant strains are emerging, making the study of this nosocomial pathogen necessary to find novel therapeutic targets.Entities:
Mesh:
Year: 2014 PMID: 25315994 PMCID: PMC4207893 DOI: 10.1186/s12918-014-0117-z
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Definition of the different media used in this study
|
|
|
|---|---|
| Minimala | Tryptophan, cysteine, isoleucine, leucine, methionine, proline, valine, R-pantothenate, Pyridoxine, biotin, glucose |
| BDMb | Minimal plus thiamine, nicotinamide, Riboflavin, 4-aminobenzoate, folic acid, vitamin B12 |
| CADMc | BDM plus the 13 standard amino acids not listed in the minimal medium |
| Complexd | All the components of the minimal medium, the remaining 13 standard amino acids, riboflavin, folic acid, inosine, vitamin B12, chorismate, glucose, glycerol-3-phosphate, ribose, hypoxanthine, deoxycytidine, uracil, uridine, dexoyadenosine, adenosine, thymidine |
a[36].
b[37].
c[38,39].
d[23].
eThe following metabolites or proteins are added in SA analysis to permit the production of biomass: ATP, nicotinate, oxidized ferredoxin, oxidized thioredoxin, acyl and sulfur carrier proteins and are present in unchanging equilibrium concentrations in FBA.
Comparison between the automatic , MLTC806cdf and networks
|
|
|
|
| ||
|---|---|---|---|---|---|
| Reactions | 3211 | 914 | 740 | ||
| With genes | 1311 (40.8%) | 706 (77.9%) | 490 (66.2%) | ||
| With an EC number | 1435 (44.7%) | 750 (82.1%) | 649 (87.7%) | ||
| With KEGG ID | 1038 (32.3%) | 703 (76.9%) | 652 (88.4%) | ||
| Identified as unclear reaction | 4 (0.4%) | 0 | 2 (0.2%) | ||
| Associated with multiple reactions | 84 (8.1.6%) | 6 (0.7%) | 0 | ||
| Involving polymers | 26 (0.8%) | 0 | 0 | ||
| Involving generic metabolites | 548 (29.8%) | 31 (3.4%) | 44 (5.9%) | ||
| Non-metabolic reactions | 138 (4.3%) | 2 (0.2%) | 15 (2.0%) | ||
| Involving oxygen | 29 (0.9%) | 0 | -b | ||
| Transport reactions | 1576 (49.1%) | 145 (15.9%) | 66 (8.9%) | ||
| Export | 1562 (99.1%) | 16 (11.0%) | 3 (4.5%) | ||
| Passive | 14 (0.9%) | 15 (10.3%) | 11 (16.7%) | ||
| with genes | 0 | 95 (65.5%) | 43 (65.15%) | ||
| Unique reactions in common | 530 | 416 | |||
| Metabolites | 3133 | 705 | 709 | ||
| Unique | 1551 | 592 | 654 | ||
| With KEGG ID | 1092 (70.4%) | 563 (94.9%) | 613 (93.7%) | ||
| Generic | 485 (31.3%) | 23 (3.9%) | 50 (7.6%) | ||
| Duplicated | 15 (1.0%) | 0 | 0 | ||
| Dead-end | 648 (41.8%) | 8 (1.2%) | 186 (28.4%) | ||
| Unique metabolites in common | 533 | 389 | |||
| Genes | 1336 | 806 | 490 | ||
| Unique loci | 788 | 806 | 490 | ||
| With a homologa | 788 (100%) | 658 (81.8%) | 417 (85.1%) | ||
| Non-metabolic loci | 66 (8.4%) | 3 (0.4%) | 11 (2.2%) | ||
| Loci linked to multiple entries | 382 (48.5%) | 0 | 0 | ||
| Unique loci in common | 534 | 350 | |||
aThe values in the first and last columns represent the number of homologs with respect to iMLTC806cdf. The values in the middle column represent the number of homologs with respect to C. acetobutylicum.
bReactions involving oxygen are allowed in C. acetobutylicum thus were not quantified.
List of 76 predicted essential gene and essentiality of their homologs
|
|
|
|---|---|
| Essential (46) | accA3, accB3, accC3, accD3, acpS2↓, adk2↓, |
| Non-essential (22) | coaBC3, coaD3, crt13↓, |
| No |
|
Number in parenthesis represent the loss of biomass according to FBA. Those without number prevent the production of biomass or were identified only by SA. Genes in bold (25 in total) do not have a human functional homolog. The 4 genes in italics have a human homolog shown to be essential when their associated reactions are removed from RECON2.
1Genes predicted as essential exclusively by FBA analysis (16 in total).
2Genes predicted as essentials by the SA analysis (10 in total).
3Genes predicted as essentials by both FBA and SA (50 in total).
↓Genes shown to be down-regulated in vivo (9 in total) [57].
↑Genes shown to be up-regulated in vivo (6 in total) [57].
*Genes not present in the Database of Essential Genes (DEG) (15 in total) [56].
Figure 1Example of biding site similarities between the modelled asd gene product and the photosynthetic a2b2-glyceraldehyde-phosphate dehydrogenase bound to NADP. The two binding-sites share 39 atoms of equivalent atom types in corresponding positions in space (Z-score 3.92, p-value 0.012). This protein from spinach (PDB ID 2PKQ) belongs to Pfam family PF00044 that contain human homologs. The inset shows the superimposition of the bound NADP molecules found among 5 of the top 7 most similar binding-sites belonging to different protein families.
Figure 2Effect of the deletion of essential genes and deletion of essential pairs of genes in term of biomass lost. Essential genes removal identified by SA were considered to give a null biomass if one of the component of the biomass was impossible to produce otherwise they were arbitrary attributed an effect of 5-10% since the number of reaction required to produce biomass augmented by less than 10% in every cases. The number of cases in the unlabelled sections of the pie chart is in clockwise order 1, 1, 3 and 1.
Figure 3Distribution of genes (A), deleterious genes (B) and genes involved in deleterious pair (C) in the different metabolic pathways. If one gene was present in more than one reaction that did not share the same metabolic function the most relevant function was manually chosen based on gene description.
List of 29 potential targets associated with 125 potential binders based on sequence identity
|
|
|
| |
|---|---|---|---|
|
|
| ||
| accC | P24182 [53%] | Q96RQ3 [48%] | LZJ, LZK, LZL, OA1, OA2, OA3, OA4, OA5, L21, L22, L23 (P24182 [53%]); BTI (Q99UY8 [48%]); |
| adk | P69441 [52%] | P54819-2 [45%] | IPE (P30085 [32%]) |
| crt1 | M9BEQ1 [37%] | P30084 [44%] | QUE (Q6NVY1 [30%]) |
| fabF | P0A6Q6 [49%] | Q9NWU1 [45%] |
|
| fabH | P0ACC7 [42%] | - |
|
| fabZ | P0ADG7 [36%] | P20839 [35%] | QUE, 2BC, 2BE, 2RB, 3BE, 4BB, 4BE, AGI, EMO, SAK (Q5G940 [41%]); BDE (O25928 [41%]) |
| glmU | P62617 [55%] | - | P21 (P43889 [42%]) |
| guaB | P14900 [32%] | P20839 [35%] |
|
| ispA | B1J028 [42%] | Q5T2R2 [31%] | IPR (Q83M58 [41%]) |
| ispH | M8X9N5 [30%] | - | DMA, IPE (P62624 [30%]) |
| ispF | P0AD61 [51%] | P14618-2 [44%] | FPP (Q8EBR3 [62%]); GPP (P62617 [55%]); IPP (A0R559 [34%]) |
| murD | P0A858 [44%] | P60174-1 [44%] | LK1, LK2, LK3, LK4, LKM (P14900 [32%]); |
| ntpA | - | P38606 [51%] |
|
| ntpB | - | P21281 [53%] |
|
| pykF | A0A029KL96 [52%] | P14618-2 [44%] | D8G, DYY, DZG, PGA (P14618 [44%]) |
| deoD (deoB, deoC) | L3ICZ2 [46%] | P00491 [48%] |
|
| fabG (CD2577) | P33898 [61%] | P04406 [44%] | EMO (P16544 [37%]); TAQ, CB3, DVP, FE1 (P0CG22 [34%]); MAX (Q6PKH6 [33%]); TDB (Q16698 [31%]) |
| CD2577 (fabG) | N3Z8L0 [36%] | Q92506 [33%] |
|
| gapA (gapB) | P0A9B2 [52%] | O14556 [47%] | BRZ, NMD, TND (O14556 [44%]); AES (P04406 [49%]) |
| gapB (gapA) | P25526 [35%] | P00352 [35%] | APR, SND, AES (P04406 [49%]); BRZ, NMD, TND (O14556 [47%]) |
| uppS (CD2762) | P0A825 [56%] | P34897-3 [45%] | B08, B28, B29, B76, FPP, FPS, IPE (P60472 [44%]); GPP (P60379 [43%]) |
| glyA (CD2834) | P0A825 [48%] | P34897-3 [41%] |
|
| CD2834 (glyA) | - | B7ZLW7 [31%] |
|
| CD0727 (folD | fhs | fchA)d | P24182 [53%] | Q96RQ3 [48%] |
|
| scoB (folD | fhs | fchA)d | E9YQ86 [56%] | B7Z609 [51%] |
|
| scoA (folD | fhs | fchA)d | G1YD51 [53%] | Q6IAV5 [42%] |
|
| CD3231 (hpt) | W1H7G6 [47%] | Q9NRG1 [33%] |
|
| hpt (CD3231) | W1H7G6 [55%] | P00492 [34%] |
|
| aldh (CD2733) | - | P51648 [43%] |
|
aGenes that are essential as part of a double mutant pair have the name of the partner gene identified in parenthesis.
bUniprot ID of the closest sequence homolog.
c3-letter PDB code of ligand that is a known inhibitor or binder of a sequence homolog with the Uniprot ID and the level of sequence identity given in parenthesis. Approved drugs are in italics.
dThe target gene is part of more than one essential pairs, one with each of the proteins in parenthesis.
Figure 4Flow chart representing the main steps of reconstruction of a metabolic network. The creation of the network begins by the building of a draft by extracting data from various databases. The draft will then enter an iterative cycle between curation and validation, each steps of validation bringing a new step of curation if it fails. The final version of the network can then be used to perform various analyses (growth tests, in silico gene deletion, etc.).