| Literature DB >> 22087229 |
Rembert Pieper1, Quanshun Zhang, David J Clark, Shih-Ting Huang, Moo-Jin Suh, John C Braisted, Samuel H Payne, Robert D Fleischmann, Scott N Peterson, Saul Tzipori.
Abstract
BACKGROUND: The recent outbreak of severe infections with Shiga toxin (Stx) producing Escherichia coli (STEC) serotype O104:H4 highlights the need to understand horizontal gene transfer among E. coli strains, identify novel virulence factors and elucidate their pathogenesis. Quantitative shotgun proteomics can contribute to such objectives, allowing insights into the part of the genome translated into proteins and the connectivity of biochemical pathways and higher order assemblies of proteins at the subcellular level. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2011 PMID: 22087229 PMCID: PMC3210124 DOI: 10.1371/journal.pone.0026554
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Experimental design combining two-dimensional LC-MS/MS with upfront fractionation of proteins derived from insoluble and soluble, SEC-separated cell lysate fractions.
Abbreviations: EHEC, enterohemorrhagic E. coli; SEC, size exclusion chromatography; FASP, filter-aided sample preparation for digestion of complex protein mixtures; SCX, strong cation exchange chromatography; BR, biological replicate; T1R and T4R, technical replicate stages. The total number of SCX fractions subjected to LC-MS/MS sequentially was 52 in a given experiment.
Figure 2Size exclusion chromatography fractionating E. coli O157:H7 cell lysate proteins prior to 2D-LC-MS/MS.
The fractions (F1-s, F2-s and F3-s) were eluted from the G3000-SWXL column in PBS and 0.01% Triton X-100 and pooled as shown in the chromatogram. The A280 profiles are shown for an EHEC lysate sample and a mixture of five Mr protein standards, the latter of which pertains to the line with a low void volume peak at 667 kDa. Protein separations occurred in the 500-10 kDa range. On the right, protein bands visualized by Coomassie Blue staining in a 4–12%T SDS-PAGE gel are shown in the same order as the SEC fractions (with F3-s before pooling in two consecutive SEC fractions).
Putative EHEC lipoproteins markedly enriched in size exclusion chromatography fraction F1-sSEC.
| Ca. A | Protein description B | Locus tag B | Gene name | Mr (Da) | Sub. Localiz. MPC C | PSort-B D | APEX F1–13 E | APEX F14–26 E | Sp II site F | Notes G |
| 1 | OM lipoprotein SlyB | Z2655 | slyB | 15601 | L-OM | OM | 31547 | 2607 | LVGC18V | |
| 3 | uncharacterized protein YjeI | Z5749 | yjeI | 13064 | L (hydr LS) | unkn | 20511 | 3999 | MAGC30S | |
| 1* | entericidin B membrane lipoprotein | Z5754 | ecnB | 4809 | L-IM, MPC-2 | IM | 11329 | 1749 | LTAC22N | |
| 2* | regulator in colanic acid synthesis | Z0208 | rcsF | 14177 | L-OM | OM | 8780 | 1375 | LSGC16S | |
| 2* | predicted lipoprotein YajG | Z0537 | yajG | 20977 | L (IM or OM) | unkn | 8499 | 520 | LAGC18A | Δseq, 34 |
| 1 | acridine efflux pump lipoprotein AcrA | Z0578 | acrA | 42195 | L-IM, MPC-3 | IM | 8012 | 709 | LTGC25D | |
| 3 | putative Bor protein of prophage CP-933X | Z1878 | (borD) | 10416 | L (hydr LS) | unkn | 7222 | 0 | ITGC17A | s. Z1474 |
| 2* | protein DcrB involved in bacteriophage adsorption | Z4846 | (dcrB) | 19787 | L (IM or OM) | PP | 7205 | 505 | LAAC20D | Δseq, 18 |
| 2* | putative lipoprotein YbjP | Z1095 | ybjP | 19150 | L (IM or OM) | unkn | 7180 | 37 | LSAC21T | |
| 2 | putative lipoprotein YgdI | Z4126 | (ygdI) | 8342 | L (IM or OM) | unkn | 5912 | 1112 | LSAC21S | |
| 3 | putative Bor protein of bacteriophage BP-933W | Z1474 | (borW) | 10477 | L (hydr LS) | unkn | 5666 | 0 | ITGC17A | s. Z1878 |
| 2 | OM starvation lipoprotein Slp | Z4908 | slp | 22210 | L-OM | OM | 5566 | 1211 | LAAC19S | Δseq, 11 |
| 3 | uncharacterized protein YijI | Z5503 | yijI | 16572 | L (no hydr LS) | unkn | 5532 | 4210 | CASC49S | |
| 2 | uncharacterized lipoprotein YifI | Z5325 | (yifI) | 7177 | L (IM or OM) | unkn | 5265 | 569 | LTGC20G | |
| 2 | put. collagen-binding surface adhesin protein YcfM | Z1744 | ycfM | 22515 | L-OM | unkn | 5166 | 0 | LAGC20V | |
| 2 | predicted lipoprotein YedD | Z3018 | yedD | 15001 | L (IM or OM) | unkn | 4965 | 346 | LAGC16A | |
| 2* | predicted lipoprotein YraP | Z4509 | yraP | 20027 | L (IM or OM) | PP | 4549 | 392 | LQGC19V | |
| 2* | glycoprotein/polysaccharide metabolism protein | Z0565 | ybaY | 19440 | L (IM or OM) | PP | 3878 | 91 | LAAC19A | |
| 2 | uncharacterized lipoprotein YgdR | Z4151 | (ygdR) | 7877 | L (IM or OM) | unkn | 3742 | 376 | VSGC19S | |
| 2 | putative OM lipoprotein YiaD | Z4977 | yiaD | 22196 | L (IM or OM) | OM | 3322 | 45* | VSGC21T | |
| 2* | conserved putative lipoprotein YiaF | Z4979 | yiaF | 30157 | L (IM or OM) | unkn | 3297 | 157 | LSGC26F | |
| 2 | predicted metallopeptidase YggG | Z4280 | yggG | 31826 | L (IM or OM) | unkn | 2523 | 326 | LTGC19Q | Δseq, 42 |
| 2 | putative glycosylase/lipoprotein YraM | Z4506 | yraM | 72808 | L (IM or OM) | IM | 2431 | 145 | FVGC27G | |
| 2 | put. c-type lysozyme inhibitor lipoprotein YdhA | Z2653 | ydhA | 9553 | L (IM or OM) | unkn | 2013 | 472 | LTGC18S | Δseq, 27 |
| 2 | OM lipoprotein/putative pectinesterase | Z0943 | ybhC | 45979 | L-OM | OM | 1830 | 91 | LTAC22S | |
| 2 | predicted outer membrane lipoprotein YfeY | Z3697 | (yfeY) | 20814 | L (IM or OM) | unkn | 1775 | 0 | LTGC18S | |
| 3 | unknown protein encoded by prophage CP-933K | Z0955 | - | 34904 | L (hydr LS) | unkn | 1564 | 0 | FTTC18S | |
| 2 | predicted exopolysaccharide export protein GfcE | Z1400 | gfcE | 41739 | L (IM or OM) | OM | 1386 | 43 | LTAC21T | |
| 2* | putative OM lipid asymmetry protein MlaA (VacJ) | Z3610 | mlaA | 28007 | L (IM or OM) | OM | 1300 | 0 | LVGC18A | |
| 2* | putative OM-associated lipoprotein | Z2849 | (yeaY) | 20866 | L (IM or OM) | OM | 1040 | 0 | LSGC23V | |
| 2 | uncharacterized lipoprotein YdcL | Z2287 | ydcL | 24426 | L (IM or OM) | unkn | 901 | 0 | LSGC21A | |
| 2 | type II secretion protein | L7044 | etpO | 15051 | L (IM or OM) | mscl | 901 | 36 | ISGC24H | |
| 3 | unknown protein encoded by prophage CP-933V | Z3360 | - | 19772 | L (no hydr LS) | unkn | 858 | 0 | MKAC44S | |
| 2 | conserved membrane lipoprotein YoaF | Z2835 | yoaF | 8899 | L (IM or OM) | unkn | 853 | 0 | LAGC17S | |
| 2 | predicted lipoprotein YceB | Z1700 | yceB | 20469 | L (IM or OM) | unkn | 833 | 67 | LVGC19N | |
| 3* | unknown protein encoded by prophage CP-933K | Z0984 | - | 11416 | L (no hydr LS) | unkn | 563 | 0 | LCGC27G | s. Z2148 |
| 3 | heat-inducible protein HslJ | Z2330 | hslJ | 15181 | L (IM or OM) | unkn | 350 | 29 | MAGC17V | |
| 2 | uncharacterized protein L7092 | L7092 | - | 16108 | L (IM or OM) | unkn | 321 | 0 | VAGC31A | |
| 3 | uncharacterized protein YegR | Z3251 | (yegR) | 13836 | L (IM or OM) | unkn | 276 | 37 | MSGC38A | |
| 2 | putative α-macroglobulin | Z3787 | (yfhM) | 181409 | L (IM or OM) | mscl | 261 | 4* | LAGC18D | |
| 3 | uncharacterized protein YdjY | Z2783 | ydjY | 30511 | L (hydr LS) | PP | 207 | 0 | LSGC81D | Δseq, 54 |
| 2 | uncharacterized lipoprotein YajI | Z0513 | yajI | 19602 | L (IM or OM) | unkn | 169 | 0 | LSAC40V | Δseq, 20 |
| 2* | putative tetratricopeptide repeat lipoprotein | Z4203 | (ygeR) | 23480 | L (IM or OM) | unkn | 130 | 24 | LAGC13S | Δseq, 13 |
| 2* | put. G4C polysaccharide synthesis lipoprotein YmcA | Z1401 | ymcA | 78668 | L (IM or OM) | OM | 120 | 9 | SSAC19H | |
| 2 | hypothetical protein Z4394 | Z4394 | ygiB | 23478 | L (IM or OM) | PP | 110 | 46 | LAGC37E | |
| 2 | putative lipoprotein YnbE | Z2327 | ynbE | 6815 | L (IM or OM) | unkn | 109 | 0 | LVGC17T | |
| 2 | conserved lipoprotein YmbA | Z1302 | ymbA | 19949 | L (IM or OM) | unkn | 107 | 0 | LAGC11S | |
| 2 | predicted outer membrane lipoprotein YfgH | Z3769 | yfgH | 17704 | L (IM or OM) | OM | 97 | 0 | LAGC22Q | |
| 3 | put. cell division inhibitor (prophage CP-933U) | Z3128 | - | 6991 | L (no hydr LS) | unkn | 94 | 0 | SEGC14F | |
| 2 | conserved putative lipoprotein YfhG | Z3831 | yfhG | 27347 | L (IM or OM) | unkn | 78 | 0 | LLGC26V | |
| 2 | put. G4C polysaccharide synthesis lipoprotein YmcC | Z1403 | ymcC | 24248 | L (IM or OM) | IM | 61 | 0 | LAGC16T | |
| 2 | predicted thiamine biosynthesis lipoprotein ApbE | Z3472 | yojL | 38547 | L (IM or OM) | unkn | 52 | 0 | FVGC25D | |
| 2 | uncharacterized protein YaiW | Z0474 | yaiW | 40416 | L (IM or OM) | unkn | 52 | 0 | LAGC21S |
: Categories. 1, verified lipoprotein (three listed out of >20); 2, conserved Sp II signal peptidase motif with a hydrophobic leader sequence; 3, tentative Sp II signal peptidase motif; *protein was also identified in a membrane phospholipid-enriched cell lysate fraction derived from iodixanol density gradient centrifugation experiments.
: Protein annotations from the databases EcoCyc or E. coli O157:H7 strain EDL933; gene name (in parentheses): listed only in the UniProt.
: Prediction of subcellular localization and other annotation features such as multi-protein complex (MPC) with 2 or more subunits based database entries; L: lipoprotein; IM, inner membrane; OM, outer membrane; no hydr/hydr LS (presence/absence of hydrophobic leader sequence N-terminal to Sp II motif).
: Subcellular localization prediction based on PSORTb assignment; unkn (unknown); PP (periplasm); CY (cytoplasm); mscl (multiple localizations).
: Average APEXi protein abundance score (n = 12) for protein in size exclusion chromatography fractions F1-sSEC (F1–13, Mr >280 kDa) where membrane proteins were enriched and F2-sSEC (F14–26, Mr 280-80 kDa); *score from fraction F3-sSEC if score was ‘0’ in F2-sSEC.
: Signal peptidase II motif surrounding the lipid-anchored cysteine residue; tentative motif assignment for category 3 proteins. The number denotes the Cys amino acid position in the protein sequence provided in the E. coli O157:H7 EDL933 database.
: Δseq: likely incorrect prediction of translational start site in E. coli O157:H7 EDL933 database (number of amino acids represents the respective shift based on our corrections); s. Z2148, Z1474 etc.: high sequence similarity among genes encoded by prophage regions.
Figure 3Correlation of native and sequence-based Mr values of proteins with their quantitative elution profiles in SEC fractions associated with three approximate Mr ranges (F1-sSEC, >280 kDa, F2-sSEC 280-80 kDa, F3-sSEC 80-10 kDa).
Each heat map column under the broad blue arrow indicating a Mr range represents 11 individual, replicate APEXi datasets. Each row corresponds to one of 760 EHEC proteins, quantified with the APEXi tool, that were altered with statistical significance using the Kruskal-Wallis test (p-value<0.02) comparing protein abundances in the three SEC fractions (F1-sSEC, F2-sSEC and F3-sSEC). The range of colors in the heat map corresponds to APEXi values from 0 to 5000. Left heap map: proteins are ordered based on calculated, amino acid sequence-based Mr values; right heat map: proteins are ordered based on native Mr values including homooligomers and multi-protein complexes (for calculations, see ). The maximum Mr value was 999 kDa, including larger complexes such as the ribosome. Protein positions are indicated for the monomer/complex as follows: Ftn/Ftn24, Udp/Udp6 and Hfq/Hfq6. The correlation of protein elution patterns by SEC with calculated native Mrs is improved.
Selection of proteins predicted to participate in di- or oligomeric complexes based on HCL analysis of SEC fraction abundance patterns.
| Cl. | Protein descriptions | Locu tag | Gene name | M.W. | Sub. Localiz. MPC | KWS | APEXAv
| APEXAV
| APEXAV
| Functional roles |
| C1 | RNA-binding protein Hfq | Z5779 | hfq | 11166 | CY, Ho-6 | 1.95E-02 | 25095 | 9957 | 2079 | mRNA degradosome component |
| C1 | putative acetoin dehydrogenase UcpA | Z3691 | ucpA | 27831 | CY | 2.73E-03 | 4843 | 5038 | 1953 | (oxidoreductive) sulfate metabolism |
| C1 | short chain dehydrogenase YbbO | Z0646 | ybbO | 29428 | CY | 1.95E-02 | 377 | 253 | 0 | oxidoreductive processes |
| C2 | conserved protein YicC | Z5069 | yicC | 33158 | CY | 1.39E-04 | 2382 | 635 | 0 | unknown function |
| C2 | ferritin | Z2960 | ftn | 19423 | CY, Ho-24 | 3.11E-05 | 56748 | 14396 | 7016 | iron transport and binding protein |
| C2 | putative ATP-bdg. protein in pho regulon | Z0809 | ybeZ | 40653 | CY, RIB-ppi | 1.95E-05 | 5273 | 749 | 356 | unknown function |
| C2 | phosphoglucosamine mutase | Z4538 | glmM | 47545 | CY | 2.07E-06 | 4799 | 2061 | 117 | UDP-NAc-glucosamine biosynthesis |
| C3 | molybdenum cofactor biosynth. protein | Z0009 | mogA | 21222 | CY, Ho-3 | 9.39E-07 | 0 | 1939 | 81 | cofactor biosynthesis |
| C3 | D-lactate dehydrogenase | Z2329 | ldhA | 36519 | CY | 1.93E-05 | 92 | 2477 | 109 | mixed acid fermentation |
| C3 | ureidoglycolate dehydrogenase | Z0672 | ylbC | 37889 | CY | 7.88E-04 | 0 | 666 | 43 | allantoin assimilation |
| C3 | D-glucuronate/galacturonate isomerase | Z4445 | uxaC | 53927 | CY | 4.60E-05 | 0 | 466 | 0 | carbohydrate metabolic process |
| C3 | oxidative stress response protein | Z4259 | yggE | 26634 | PP, RIB-ppi | 9.62E-06 | 1541 | 5964 | 31 | increased by AI-2 quorum signaling |
| C3 | agmatinase | Z4281 | speB | 33556 | CY, Ho-2 | 4.24E-06 | 852 | 5326 | 0 | 2nd step in putrescine biosynthesis |
| C3 | succinylglutamic semialdehyde dehydrogenase | Z2778 | astD | 52983 | CY | 1.35E-06 | 0 | 2849 | 86 | arginine catabolic pathway |
| C3 | predicted transferase/phosphorylase | Z1021 | ybiB | 35183 | CY | 2.01E-06 | 0 | 2998 | 251 | unknown function |
| C3 | uridine phosphorylase | Z5353 | udp | 27149 | CY, Ho-6 | 1.58E-05 | 2893 | 23958 | 3090 | purine and pyrimidine biosynthesis |
| C3 | conserved protein YfbU | Z3555 | yfbU | 19536 | CY, RIB-ppi | 3.01E-06 | 43 | 13088 | 512 | unknown function |
| C3 | tartronate semialdehyde reductase | Z4477 | garR | 30658 | CY | 1.80E-06 | 102 | 4430 | 37 | carbohydrate acid metabolism |
| C3 | putative chromate reductase | Z5208 | yieF | 20375 | CY, Ho-2 | 6.99E-07 | 0 | 13599 | 5096 | flavoprotein containing FMN cofactor |
| C3 | ribulose-phosphate 3-epimerase | Z4739 | rpe | 24553 | CY | 2.13E-06 | 233 | 4390 | 68 | carbohydrate metabolism |
| C3 | nucleoid-associated protein YejK | Z3445 | yejK | 37822 | CY | 1.63E-05 | 338 | 1890 | 21 | unknown function |
| C3 | glyoxylate/hydroxypyruvate reductase | Z1666 | ghrA | 36853 | CY | 1.91E-04 | 91 | 2819 | 541 | response to cellular stress |
| C3 | predicted dehydrogenase YbiC | Z1022 | ybiC | 38930 | CY | 1.33E-04 | 319 | 3683 | 477 | unknown function |
| C3 | ribonuclease activity regulator protein RraA | Z5476 | rraA | 17360 | CY, ybeZ-ppi | 5.29E-05 | 2331 | 7463 | 1899 | mRNA degradosome component |
| C3 | S-ureidoglycine aminohydrolase | Z0670 | ylbA | 28731 | CY | 5.34E-03 | 1930 | 4266 | 118 | allantoin assimilation |
| C3 | peptidyl-prolyl cis-trans isomerase | Z5818 | fklB | 22237 | CY/PP, Ho-2 | 3.82E-07 | 0 | 11764 | 3295 | protein folding in periplasm |
| C3 | Xaa-Pro dipeptidase | Z5369 | pepQ | 50194 | CY | 6.19E-03 | 1411 | 4303 | 1069 | peptide catabolic process |
: Clusters derived from Hierarchical clustering (HCL) using the Pearson correlation metric in the software MeV featuring groups of proteins with similar SEC fractionation patterns.
In each cluster, more than 70% of the proteins were subunits of known homo- or hetero-oligomeric protein complexes with M.W. values in better agreement with a complex than monomer. Examples of such characterized proteins are ferritin and uridine phosphorylase. Clusters are also visualized in Figure S2 (Suppl. Information).
: Gene name, locus tag and protein description are derived from annotations in the E. coli O157:H7 EDL933 genome (UniProt database) or the EcoCyc database.
: Subcellular localization derived from EcoCyc data or annotations in UniProt for the EDL933 genome and multi-protein complexes (MPC). CY: cytoplasm; PP, periplasm; Ho-‘2’, Ho-6’ etc., homo-oligomeric protein with the number of subunits; ppi, proven protein interactions with another protein or a large assembly such as the ribosome (RIB).
: KWS: the Kruskal-Wallis statistic was applied to the entire EHEC proteome dataset (2521 proteins) to select proteins with statistically significant abundance differences comparing. the size exclusion chromatography (SEC) fractions (>280 kDa, 280-80 kDa and 80-10 kDa). After elimination of integral membrane proteins and setting the KWS p-value at <0.02, 760 proteins were selected for the HCL analysis.
: Average protein abundances (APEXi scores) in each of the three SEC fractions. n = 11; normalization factor: 2.5×106; ‘0’: no peptide spectra were detected for a given protein.
: Functional roles of proteins predicted for the EHEC protein or the respective E. coli ortholog (data from EcoCyc or UniProt annotations).
Proteins predicted to participate in multi-protein structures such as cell division and O-antigen biosynthesis, and prone to aggregation based on relative abundance in insoluble E. coli O157:H7 fraction.
| Cl. | Protein Description | Locus tag | Gene name | M.W. (kDa) | Sub. Localiz. | APEXAv F1–39 | APEXAV F40–52 | W-test p- value |
| C3 | aldose 1-epimerase | Z0926 | galM | 38176 | CY | 964 | 4334 | 4.14E-05 |
| C1 | galactokinase | Z0927 | galK | 41504 | CY | 15 | 3871 | 3.60E-10 |
| C3 | UTP-glucose-1-phosphate uridylyltransferase | Z2012 | galU | 32941 | CY | 1431 | 4089 | 1.37E-05 |
| n.d. | regulator of LPS O-antigen chain | Z3189 | wzzB | 37810 | IM | 868 | 144 | n.d. |
| C2 | UDP-glucose 6-dehydrogenase | Z3190 | ugd | 43711 | CY | 0 | 1133 | 1.11E-09 |
| n.d. | putative 6-phosphogluconate dehydrogenase | Z3191 | gnd | 51475 | CY | 6111 | 503 | n.d. |
| n.d. | putative N-acetyl transferase | Z3192 | wbdR | 23742 | CY | 53 | 261 | n.d. |
| C1 | phosphomannomutase | Z3194 | manB | 50338 | CY | 91 | 4107 | 5.23E-08 |
| C2 | mannose-1-P guanosyltransferase | Z3195 | manC | 54268 | CY | 44 | 6027 | 1.06E-09 |
| C3 | GDP-mannose mannosylhydrolase (gmm) | Z3196 | wbdQ | 19567 | CY | 23 | 4400 | 1.06E-09 |
| C3 | fucose synthetase | Z3197 | fcI | 36267 | CY | 170 | 4328 | 4.75E-08 |
| C3 | GDP-mannose dehydratase | Z3198 | gmd | 41673 | CY | 557 | 8901 | 4.62E-07 |
| C2 | putative glycosyl transferase | Z3199 | wbdP | 46609 | CY | 94 | 2026 | 2.43E-08 |
| C3 | perosamine synthetase | Z3200 | per | 41553 | CY | 774 | 7924 | 6.71E-07 |
| n.d. | putative glycosyl transferase | Z3202 | wbdO | 28358 | CY | 0 | 153 | n.d. |
| o.C | putative glycosyl transferase | Z3204 | wbdN | 30345 | CY | 0 | 485 | 4.96E-05 |
| C3 | UTP-glucose-1-P uridylyltransferase subunit | Z3205 | galF | 32828 | CY | 199 | 6319 | 1.08E-07 |
| C3 | putative UDP-galactose 4-epimerase | Z3206 | gne | 37141 | CY | 209 | 5271 | 5.23E-08 |
| o.C | predicted acetyltransferase | Z5427 | yiid | 37065 | CY | 0 | 851 | 1.04E-07 |
| C2 | cell division protein FtsA | Z0104 | ftsA | 45328 | Z-ring, pIM | 0 | 1498 | 1.11E-09 |
| C3 | cell division protein FtsZ | Z0105 | ftsZ | 40322 | Z-ring, pIM | 1562 | 5064 | 1.72E-05 |
| C2 | UDP-3-O-[3-hydroxymyristoyl] N-AcGln deacetylase | Z0106 | lpxC | 33955 | CY | 0 | 2432 | 9.87E-11 |
| C2 | cell division protein FtsE | Z4837 | ftsE | 24438 | pIM | 0 | 1995 | 1.13E-08 |
| o.C | cell division factor ZapB | Z5473 | zapB | 9635 | Z-ring, CY | 9067 | 1087 | 2.72E-05 |
| C3 | cell division topological specificity factor | Z1936 | minE | 10235 | CY, pIM | 1781 | 4716 | 2.83E-03 |
| C3 | cell division inhibitor MinD | Z1937 | minD | 29613 | pIM | 180 | 10936 | 7.70E-08 |
| C2 | septum formation inhibitor MinC | Z1938 | minC | 24745 | CY, pIM | 110 | 1190 | 6.93E-07 |
| C3 | penicillin binding protein 3, MreB | Z4610 | mreB | 39019 | CY, pIM | 0 | 7334 | 9.87E-11 |
| C1 | plasmid-partitioning protein SopA | L7068 | sopA | 43658 | CY | 0 | 4258 | 9.87E-11 |
| C1 | plasmid-partitioning protein SopB | L7069 | sopB | 35371 | CY | 0 | 3232 | 9.87E-11 |
| o.C | elongation factor Tu | Z4697 | tuf | 43282 | CY, pIM | 22954 | 88106 | 9.97E-07 |
: Hierarchical clustering analysis using the Euclidian distance metric in MeV featuring groups of proteins with similar abundance patterns in the three SEC fractions compared to insoluble E. coli strain 86-24 lysate fraction. Clusters are visualized in Figure S2 (Suppl. Information). C1, C2 and C3 clusters contained many proteins part of large subcellular assemblies; o.C: other cluster; n.d. not significant in W-test.
: Gene name, locus tag and protein description are derived from annotations in the E. coli O157:H7 EDL933 genome (UniProt database) or the EcoCyc database. The first 19 genes/proteins have been associated with biosynthesis of the ‘O157’ O-antigen of EHEC serotype O157:H7. The next 12 genes/proteins have been associated with the cytoskeleton and cell (or plasmid) division. The flippase Wzx and polymerase Wzy, also part of the O-antigen biosynthesis apparatus and rich in transmembrane domains were not observed.
: Subcellular localization of proteins derived from EcoCyc data or annotations in UniProt for the EDL933 genome. CY: cytoplasm; IM: inner membrane; Z-ring: Z-ring formation during cell division septation; pIM: peripheral IM protein.
: Average protein abundances (APEXi scores) in the combined size exclusion chromatography (SEC) fractions (F1–39) pertaining to the soluble lysate fraction and the insoluble lysate fraction F4-p (F40–52). n = 11; APEXi normalization factor: 2.5×106; ‘0’: no peptide spectra were detected for a given protein.
: W-test: the Wilcoxon Rank Sum test was applied to the entire EHEC proteome dataset of 2521 proteins to select proteins with statistically significant abundance differences comparing the combined SEC fractions (F1–39; >280 kDa, 280-80 kDa and 80-10 kDa) with the insoluble lysate fraction F4-p. The proteins with a p-value of <0.02 (802 entries) were used for Hierarchical Clustering using the Euclidian distance metric.