Literature DB >> 17567905

A semi-quantitative GeLC-MS analysis of temporal proteome expression in the emerging nosocomial pathogen Ochrobactrum anthropi.

Robert Leslie James Graham1, Mohit K Sharma, Nigel G Ternan, D Brent Weatherly, Rick L Tarleton, Geoff McMullan.   

Abstract

BACKGROUND: The alpha-Proteobacteria are capable of interaction with eukaryotic cells, with some members, such as Ochrobactrum anthropi, capable of acting as human pathogens. O. anthropi has been the cause of a growing number of hospital-acquired infections; however, little is known about its growth, physiology and metabolism. We used proteomics to investigate how protein expression of this organism changes with time during growth.
RESULTS: This first gel-based liquid chromatography-mass spectrometry (GeLC-MS) temporal proteomic analysis of O. anthropi led to the positive identification of 131 proteins. These were functionally classified and physiochemically characterized. Utilizing the emPAI protocol to estimate protein abundance, we assigned molar concentrations to all proteins, and thus were able to identify 19 with significant changes in their expression. Pathway reconstruction led to the identification of a variety of central metabolic pathways, including nucleotide biosynthesis, fatty acid anabolism, glycolysis, TCA cycle and amino acid metabolism. In late phase growth we identified a number of gene products under the control of the oxyR regulon, which is induced in response to oxidative stress and whose protein products have been linked with pathogen survival in response to host immunity reactions.
CONCLUSION: This study identified distinct proteomic profiles associated with specific growth points for O. anthropi, while the use of emPAI allowed semi-quantitative analyses of protein expression. It was possible to reconstruct central metabolic pathways and infer unique functional and adaptive processes associated with specific growth phases, thereby resulting in a deeper understanding of the physiology and metabolism of this emerging pathogenic bacterium.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17567905      PMCID: PMC2394761          DOI: 10.1186/gb-2007-8-6-r110

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Background

The α-Proteobacteria are a biologically diverse group with many members capable of interaction with eukaryotic cells and able to function as intracellular symbionts or as pathogens of plants and animals. Some members are important human pathogens, some can establish asymptomatic chronic animal infections, and others are agriculturally important, assisting plants with nitrogen fixation [1]. The α-2 subgroup of the Proteobacteria contain the well-known genera Rhizobacteria, Agrobacterium, Rickettsia, Bartonella and Brucella, which include species of widespread medical and agricultural importance [2]. A less well known member of this group is the genus Ochrobactrum, which is genetically most closely related to the genus Brucella [3]. Until 1998, Ochrobactrum anthropi was considered to be both the sole and type species of the genus Ochrobactrum, despite the genetic and phenotypic heterogeneity visible within isolates of the species [4]. Subsequent analysis by Velasco et al. [5] resulted in the description of O. intermedium as a second species. Two new species, O. grignonense and O. tritici, were isolated from soil and wheat rhizoplane systems by Lebuhn et al. [6], and most recently, O. gallinifaecis was isolated from a chicken fecal sample, O. cystisi from nodules of Cystisus scoparius and O. pseudintermedium from clinical isolates [7,8]. Ochrobactrum species have been described as being environmentally abundant free-living α-Proteobacteria. A number of reports exist in the literature describing the use of Ochrobactrum species as either a source of biotechnologically useful enzymes [9-11] or in the detoxification of xenobiotic compounds such as halobenzoates [12-16]. The ability of Ochrobactrum species to act as legume endosymbionts in temperate genera such as Lupinus, Musa and Acacia has also recently been demonstrated [17-19]. O. anthropi has been identified in clinical samples [20] and has been the cause of a growing number of hospital-acquired infections usually, but not always, in immunocompromised hosts [21-25]. The organism has been found to adhere, possibly as a result of biofilm formation, to the surface of catheters, pacemakers, intraocular lenses and silicon tubing, thus representing potential sources of infection in the clinical environment [26,27]. Upon infection, O. anthropi has been shown to cause pancreatic abscess, catheter-related bacteremia, endophthalmitis, urinary tract infection and endocarditis [21]. O. anthropi strains usually are resistant to all β-lactams, with the exception of the antibiotic imipenem. Nadjar and co-workers [20] demonstrated that in at least one isolate, such resistance was due to an extended spectrum β-lactamase. Other than imipenem, the most effective antimicrobial agents for treating human infection that have thus far been reported are trimethoprim-sulfamethoxazole and ciprofloxacin [23,24]. As with its closest genetically related genus, Brucella, the genomes of O. intermedium and O. anthropi are composed of two independent circular chromosomes [28]. Recent work by Teyssier et al. [29] revealed an exceptionally high level of genomic diversity within Ochrobactrum species, possibly reflecting their adaptability to various ecological niches. Whilst there is currently no publicly available genome sequence data for any Ochrobactrum species, genome information does exist for 20 α-Proteobacteria species, including four species of Brucella. The availability of such information not only offers an excellent model system to study the forces, mechanisms and rates by which bacterial genomes evolve [30] but also to carry out functional genomic and proteomic investigations of these and closely related organisms. Beynon [31] identified a number of phases in the proteomic study of an organism or disease process. In the initial 'identification' phase, scientists are predominantly concerned with gaining insight into the identities of proteins present within the system with which they are working. Recently, we reported such a study of the soluble sub-proteome of O. anthropi [32]. This allowed the identification of 249 proteins involved in a variety of essential cellular pathways, including nucleic acid, amino and fatty acid anabolism and catabolism, glycolysis, TCA cycle, pyruvate and selenoamino acid metabolism. In addition, we identified a number of potential virulence factors of relevance to both plant and human disease. This previous study is a valuable reference point for the proteome of this emerging pathogen. These types of 'identification' studies, whilst useful, tell us very little about the functional role of these proteins within cellular networks. Further developmental phases were described by Beynon [31], including 'characterization' proteomics, and finally 'quantitative' proteomics in which the emphasis is on the pair-wise comparison of two proteomes and the quantifying of specific proteins present. To develop further our understanding of O. anthropi we have performed a comparative and semiquantitative proteomic analysis to identify the temporal changes in expression and abundance of proteins during growth of this organism. The soluble sub-proteome of O. anthropi grown aerobically in nutrient broth was compared at early phase and late phase growth, with 19 proteins having significant changes in their observed expression. Pathway reconstruction analysis was carried out and led to the identification of a variety of core metabolic processes, thus giving insights into the underlying physiology and biochemistry of this organism. During the late phase of growth of O.anthropi a number of gene products normally induced in response to oxidative stress were identified. These expressed gene products, part of the OxyR regulon, have been linked with pathogen survival in the host environment.

Results and discussion

Comprehensive analysis of the O. anthropi soluble sub-proteome

In this study we report the first gel based comparative proteomic analysis of the α-Proteobacterium O. anthropi at two distinct phases of growth. This multidimensional analysis involved the soluble sub-proteome being first separated by one-dimensional PAGE. The resultant gel was then cut into nine fractions based on the SeeBlue™ Plus 2 molecular mass markers. Each gel fraction was then trypsinized and the extracted peptides separated on a reversed phase C18 column over a 60 minute time period prior to being introduced onto the mass spectrometer. This methodology allowed the identification of a total of 131 proteins from the soluble sub-proteome under the two growth phases. This expressed gene product subset represents an estimated 3% of the total O. anthropi proteome, employing data based upon the typical predicted genome size [29]. No data are currently available in the literature on the expected distribution of proteins within sub-proteomic fractions of O. anthropi. As a benchmark, however, a study concentrating mainly on the analysis of the cytosolic proteins of Brucella melitensis 16M, a phylogenetically closely related organism, identified 187 proteins equating to 6% of its predicted proteome [33,34]. As previously reported, [35] due to the complex nature of the peptide mixtures to be analyzed, the separation capabilities of the liquid chromatography (LC)-mass spectrometry (MS) systems are often exceeded. In this study all peptide fractions were analyzed three separate times in order to increase overall peptide identifications. In the current study, automated curation of our initial dataset by the heuristic bioinformatic tool PROVALT [36], along with manual curation, led to the positive identification of 89 proteins at early phase and 95 proteins at late phase growth.

Characterisation of the O. anthropi soluble sub-proteome at early and late phase growth

Within the protein subset identified from the soluble sub-proteome, 34 proteins were uniquely identified in the early phase of growth, 55 proteins were found under both growth conditions and 40 were found to be unique to the later growth phase. The identified proteins had a wide range of physio-chemical properties in respect to pI and molecular mass (Mr) (Figure 1). This two-dimensional visualization showed that the smallest protein identified in early growth was the 30S ribosomal protein S17 (Mr = 9,123 Da) whilst at the late growth condition it was the cold shock protein CSPA (Mr = 8,963 Da). The largest protein identified under both conditions was DNA directed RNA polymerase beta chain (Mr = 153,688 Da). The most acidic protein identified under both conditions was the 30S ribosomal protein S1 (pI = 4.28) while the most basic in the early growth condition was the 30S ribosomal protein S5 (pI = 10.49) and in the late growth condition was the 30S ribosomal protein S20 (pI = 11.63).
Figure 1

Theoretical two-dimensional map of the soluble sub-proteome of O. anthropi. Diamonds, early growth phase; squares, both growth conditions; triangles, late growth phase.

Theoretical two-dimensional map of the soluble sub-proteome of O. anthropi. Diamonds, early growth phase; squares, both growth conditions; triangles, late growth phase. Proteins identified within the two growth conditions were quantified using the Exponentially Modified Protein Abundance Index (emPAI) and can be seen in Table 1 (for those proteins unique to early phase growth), Table 2 (for those proteins common to both growth conditions) and Table 3 (for those proteins unique to late phase growth) [37]. This method allows the quantification of individual identified proteins by utilizing database and Mascot output information, in order to give an emPAI value. The emPAI value can then be used to estimate the protein content within the sample mixture in molar fraction percentages. In addition, the fold change in expression level of proteins identified under both growth conditions can be estimated, thus giving further insights into cellular processes. The most abundant protein as calculated by molar fraction percentages under both conditions was the 30S ribosomal protein S1 (Table 2). The least abundant protein under early growth conditions was 30S ribosomal protein S17 (Table 1) and under late phase growth conditions was Valyl-tRNA synthetase (Table 3).
Table 1

Proteins identified in early growth phase with their bioinformatic analysis and emPAI calculation

Accession no. (NCBI)ProteinMowsePSortBSignalP SPSecPemPAIProtein (M%)Species

LSP
17984580GTP-binding tyrosine phosphorylated protein189CNoNoNo0.1120.442Bm
1798276730S ribosomal protein S2158CNoNoNo0.1990.785Bm
17983035Glutamyl-tRNA amidotransferase, beta subunit145CNoNoNo0.1170.461Bm
17984058Phenylalanyl-tRNA synthetase beta subunit141CNoNoNo0.0790.311Bm
17982501UDP-N-acetylmurate - alanine ligase (cytoplasmic peptidoglycan synthetase128CNoNoNo0.1040.410Bm
179840073-Oxoacyl-(acyl-carrier-protein) synthase 1110CNoN0No0.1860.733Bm
17982216Hypothetical cytosolic protein109CNoNoY 0.690.1380.544Bm
17982947Methionyl-tRNA synthetase101CNoYHA-LL14,15No0.0500.197Bm
17982718Adenylate kinase99CNoNoNo0.1780.702Bm
17984859Glutamyl-tRNA amidotransferase, alpha subunit87UNoNoNo0.1780.702Bm
17984546Piperideine-6-carboxylate dehydrogenase85CNoNoNo0.0760.300Bm
17982155Branched chain amino acid ABC transporter, periplasmic AA binding protein83PNoNoNo0.2741.080Bm
17982770Ribosome recycling factor82CNoNoNo0.1300.513Bm
17983887Dihydroxy-acid dehydratase80CNoAGA-AG20,21No0.0740.292Bm
17982681Transcription antitermination protein nusG77UNoNoNo0.1860.733Bm
17983656Glucose-6-phosphate isomerase74UNoNoNo0.0840.331Bm
17984871Glucosamine-fructose-6-phosphate aminotransferase (isomerizing)74CNoNoNo0.1510.595Bm
17982453Hypothetical protein (immunoreactive 28 kDa omp)69PNoAFA-QE28,29Y 0.90.1380.544Bm
1774038430S ribosomal protein S866CNoNoNo0.0960.379At
17983241Nucleoside diphosphate kinase64CNoNoNo0.1560.615Bm
17983005ABC transporter ATP-binding protein63UNoNoNo0.0640.252Bm
17982925NAD-dependent malic enzyme, malic oxidoreductase62UNoNoNo0.0670.262Bm
179839493-Deoxy-manno-oculosonate cytidylyltransferase62CNoANG-YI28,29No0.0520.205Bm
1798314630S ribosomal protein S960UNoNoY 0.700.1460.576Bm
17982830Single-stranded DNA binding protein59UNoNoY 0.820.1720.678Bm
17982823ATP-dependent Clp protease proteolytic subunit58CNoNoNo0.2330.919Bm
17984491Lipoprotein (ABC transporter substrate binding protein)57UYesSHA-ED37,38No0.0760.300Bm
17982653Methionine aminopeptidase56CNoNoNo0.1170.461Bm
17984405GTP-binding protein LepA51CNoNoNo0.0570.225Bm
492381702-Dehydro-3-deoxyphosphooctonate aldolase51CNoNoNo0.1380.544Bh
1798269530S ribosomal protein S1050CNoNoNo0.1940.765Bm
27353255Transriptional regulatory protein47UNoSHS-DR12,13No0.0960.379Bj
86284664ABC transporter ATP-binding42CMNoNoNo0.1020.402Re
17984791Branched chain amino acid ABC aminotransferase40CNoNoNo0.2100.828Bm

Cellular localizations: C, cytoplasmic; CM, cytoplasmic membrane; E, extracellular; P, periplasmic; U, unknown. SecP, SecretomeP; SP, signal peptide. Species: At, Agrobacterium tumefaciens; Ba, Brucella abortus; Bh, Bartonella henselae; Bj, Bradyrhizobium japonicum; Bm, Brucella melitensis; Bs, Brucella suis; Re, Rhizobium etli.

Table 2

Proteins identified in both growth phases with their bioinformatic analysis and emPAI calculation

Accession no. (NCBI)ProteinMowsePSortBSignalPSecPemPAIProtein (M%)Fold changeSpecies


0.31.2LSPSP0.31.20.31.2
1798526760 kDa chaperonin GroEl13341734CNoNoNo0.7780.8843.0682.9851.0Bm
17982679Protein translation elongation factor Tu8281133CNoAMA-KS17,18No0.8971.1533.5373.8930.9Bm
17982693Protein translation elongation factor G547884CNoNoNo0.4590.5031.8101.6981.1Bm
17982686DNA directed RNA polymerase beta chain601686CNoNoNo0.2110.1830.8320.6181.3Bm
17982688DNA directed RNA polymerase beta' chain461675CNoNoNo0.1320.1720.5200.5810.9Bm
17984056DNAK protein (HSP 70)404613CNoNoY 0.690.2250.2880.8870.9720.9Bm
1798296130S ribosomal protein S1541611UNoNoY 0.854.6233.64518.22812.3081.5Bm
17983895Aconitate hydratase288563CNoNoNo0.180.2970.7101.0330.7Bm
17981970Electron transfer flavoprotein beta subunit396342UNoNoY 0.630.4690.4691.8491.5841.2Bm
17982110Membrane-bound lytic murien transglycosylase B238103CMNoNoNo0.4690.2021.8490.6822.7Bm
17984018N utilization protein NusA75135CNoNoNo0.0960.1670.3790.5640.7Bm
17982394Ribose-phosphate pyrophosphokinase95192UNoNoNo0.1460.2500.5760.8440.7Bm
17982015Malate dehydrogenase174409CNoTLA-HL25,26No0.2910.8161.1472.7550.4Bm
17982340Periplasmic dipeptide transport protein pre323371PNoASA-KT37,38Y 0.930.390.511.5381.7220.9Bm
17982978Fumarate hydratase class I aerobic301309CNoNoNo0.4060.2911.6010.9831.6Bm
17982732Isocitrate dehydrogenase (NADP)275396UNoNoNo0.2410.3330.9501.1240.8Bm
17982121Phosphoribosylaminoimidazolecarboxamide formyltransferase261365CNoNoNo0.2160.3150.8521.0640.8Bm
17983182Aspartyl-tRNA synthetase262334CNoNoNo0.1560.1970.6150.6650.9Bm
17982205Transketolase252213CNoKAA-DG16,17No0.2220.1430.8750.4831.8Bm
17982204Glyceraldehyde 3-phosphate dehydrogenase230288CNoNoNo0.2910.3771.1471.2730.9Bm
17983520Enoyl-(acyl carrier protein) reductase (NADH)232216CNoNoNo0.6480.4932.5551.6651.5Bm
17984008Enoyl-(acyl carrier protein) reductase (NADH)202197CNoNoNo0.4220.5561.6641.8770.9Bm
17982437Carbamoyl-phosphate synthase large chain125286UYesNoNo0.0380.1610.1500.5440.3Bm
1798310730S ribosomal protein S420662UNoNoY 0.540.3580.1071.4120.3613.9Bm
1798269230S ribosomal protein S781225UNoNoNo0.1670.3580.6581.2090.5Bm
1798526610 kDa chaperonin GroES192168CNoNoNo0.3680.3681.4511.2431.2Bm
23463995Conserved hypothetical protein94225CNoNoNo0.1460.4030.5761.3610.4Bs
86279873Polyribonucleotide nucleotidyltransferase protein190146CNoNoNo0.1140.1140.4500.3851.2Re
17983037Trigger factor, peptidylprolyl isomerase131224CNoNoNo0.0890.1190.3510.4080.9Bm
17982138ATP synthase F1, alpha chain158112UNoNoNo2.632.6310.3708.8801.2Bm
17982141ATP synthase F1, beta chain180173UNoAEA-KP15,16No0.2330.1690.9190.5711.6Bm
17984734Glycine dehydrogenase (decarboxylating)99218CNoNoNo0.0640.1270.2520.4290.6Bm
17982133Transaldolase179218UNoNoNo0.3740.3741.4751.2631.2Bm
1798271330S ribosomal protein S5164133CNoNoNo0.1380.0890.5440.3011.8Bm
1798270530S ribosomal protein S1796205UNoNoY 0.730.0250.3740.0991.2630.1Bm
17983483Malonyl coa-acyl carrier protein transacylase160135CNoNoNo0.2590.2591.0210.8751.2Bm
17982471ABC transporter ATP-binding protein YjjK84185CMNoNoNo0.0840.1140.3310.3850.9Bm
17984086Adenosylhomocysteinase158159CNoNoNo0.1610.1610.6350.5441.2Bm
17983095Phosphoribosylaminoimidazole-succinocarboxamide synthase157166CNoNoNo0.3210.231.2660.7771.6Bm
17982017Succinyl-CoA synthetase alpha chain155176CNoNoNo0.2910.2911.1470.9831.2Bm
17982721DNA directed RNA polymerase alpha chain13465CNoNoNo0.2110.1380.8320.4661.8Bm
17982016Succinyl-CoA synthetase beta chain76157CNoNoNo0.0940.1970.3710.6650.6Bm
17983100Phosphoribosylformylglycinamidine synthase II119150UNoNoNo0.130.130.5130.4391.2Bm
1798270030S ribosomal protein S1992150UNoNoY 0.90.2910.4691.1471.5840.7Bm
1798348630S ribosomal protein S1811963UNoNoY 0.530.1970.0910.7770.3072.5Bm
17982938Glutamine synthetase I11563CNoNoY 0.630.1040.1040.4100.3511.2Bm
23463708GMP synthase (glutamine-hydrolyzing)11387CNoNoNo0.2270.1070.8950.3612.5Bs
1798270230S ribosomal protein S357136CNoNoNo0.0450.1430.1770.4830.4Bm
17982781Citrate synthase112114CNoNoNo0.1460.1460.5760.4931.2Bm
17983059Arginyl-tRNA synthetase111113CNoNoNo0.0570.0570.2250.1921.2Bm
17982196Hypothetical cytosolic protein10273UNoNoY 0.690.140.0690.5520.2332.4Bm
17982768Protein translation elongation factor Ts99132CNoNoNo0.0670.1380.2640.4660.6Bm
17983768Aldehyde dehydrogenase41111CNoNoNo0.0890.1380.3510.4660.8Bm
17982113Chorismate mutase7682UNoNoNo0.390.391.5381.3171.2Bm
17983157Integration host factor alpha subunit57102UNoNoY 0.510.0890.1860.3510.6280.6Bm

Cellular localizations: C, cytoplasmic; CM, cytoplasmic membrane; E, extracellular; P, periplasmic; U, unknown. SecP, SecretomeP; SP, signal peptide. Species: Bm, Brucella melitensis; Bs, Brucella suis; Re, Rhizobium etli.

Table 3

Proteins identified in late growth phase with their bioinformatic analysis and emPAI calculation

Accession no. (NCBI)ProteinMowsePSortBSignalPSecPemPAIProtein (M%)Species

LSPSP
17984094Phosphoenol pyruvate carboxylase (ATP)468UNoNoNo0.3001.013Bm
17983911Arginosuccinate synthase313CNoNoNo0.2760.932Bm
1798269850S ribosomal protein L23273UNoNoY 0.50.0970.327Bm
17983035Glutamyl-tRNA(GLN) amidotransferase subunit B263CNoNoNo0.1670.557Bm
17982203Phosphoglycerate kinase178CNoNoNo0.2390.807Bm
17984924Periplasmic oligopeptide-binding protein precursor176PNoNoY 0.890.1830.618Bm
17982826DNA-binding protein HU alpha170UNoLVA-AV10,11Y 0.950.4691.584Bm
1798269130S ribosomal protein S12169UNoNoY 0.830.2910.983Bm
17982154Leucine, isoleucine, valine, threonine and alanine binding protein157PYesAWA-DV28,29Y 0.950.1940.655Bm
179840063-Hydroxydecanoyl-(acyl-carrier-protein) dehydratase149CNoNoNo0.6262.114Bm
17983192General L-amino acid-binding periplasmic protein AAPJ precursor132PYesASA-DT24,25Y 0.650.2250.760Bm
17984058Phenylalanyl-tRNA synthetase beta chain131CNoNoNo0.0540.182Bm
17984780N-methylhydantoinase (ATP-hydrolising) 5-oxoprolinase(EC3.5.2.9)125CNoNoNo0.0810.274Bm
17983089Adenylosuccinate lyase120CNoNoNo0.0860.290Bm
1798399330S ribosomal protein S20120UNoNoY 0.580.1610.544Bm
17983794Hypothetical protein119CNoNoNo0.4691.584Bm
17983437Pyruvate, phosphate dikinase112CNoNo0.0470.159Bm
17982937Nitrogen regulatory protein P-II108CNoNoNo0.2330.789Bm
17984078Thioredoxin C-1108CNoNoY 0.840.2910.983Bm
17983171Serine hydroxymethyltransferase103CNoNoNo0.1670.564Bm
179844162,3,4,5-Tetrahdropyridine-2-carboxylate N-succinyltransferase101CNoNoNo0.1220.412Bm
1798401230S ribosomal protein S1595UNoNoY 0.540.0690.233Bm
17983482Short-chain dehydrogenase92CNoNoNo0.1940.655Bm
492381353-Oxoacyl-(acyl carrierprotein) reductase92CNoNoNo0.0760.257Bh
1798268250S ribosomal protein L1191UNoAGA-AN17,18Y 0.950.1940.655Bm
17984753Alkyl hyroperoxide reductase C22 protein85CNoNoNo0.2740.925Bm
17982411Cold shock protein CSPA82CNoNoY 0.810.5841.972Bm
17983290Dihydrodipicolinate synthase82CNoITA-LV22,23No0.1220.412Bm
17982131Leucyl-tRNA synthetase77CNoNoNo0.0230.078Bm
86283673Dipeptide ABC transporter, substrate binding75PYesAFA-ET31,32Y 0.910.0720.243Re
1798271250s ribosomal protein L1874UNoNoNo0.0720.243Bm
23347767Valyl-tRNA synthetase73CNoNoNo0.0190.064Bs
1798271930S ribosomal protein S1372CNoNoNo0.0720.243Bm
17983459Thiol peroxidase69UNoNoY 0.890.1220.412Bm
17982531Hypothtical cytosolic protein68CNoNoNo0.1860.628Bm
17984569Osmotically inducible protein C68UNoNoY 0.820.0690.233Bm
17981953Histidinol-phosphate aminotransferase66CNoNoNo0.0670.226Bm
15073728Probable isoleucyl-tRNA synthetase protein64CNoNoNo0.0380.128Sm
17984859Glutamyl-tRNA(GLN) amidotransferase subunit A63UNoNoNo0.1090.368Bm
17984521Urocanate hydratase57UNoNoNo0.0690.233Bm

Cellular localizations: C, cytoplasmic; CM, cytoplasmic membrane; E, extracellular; P, periplasmic; U, unknown. SecP, SecretomeP; SP, signal peptide. Species: Bm, Brucella melitensis; Bs, Brucella suis; Re, Rhizobium etli; Sm, Sinorhizobium meliloti.

Proteins identified in early growth phase with their bioinformatic analysis and emPAI calculation Cellular localizations: C, cytoplasmic; CM, cytoplasmic membrane; E, extracellular; P, periplasmic; U, unknown. SecP, SecretomeP; SP, signal peptide. Species: At, Agrobacterium tumefaciens; Ba, Brucella abortus; Bh, Bartonella henselae; Bj, Bradyrhizobium japonicum; Bm, Brucella melitensis; Bs, Brucella suis; Re, Rhizobium etli. Proteins identified in both growth phases with their bioinformatic analysis and emPAI calculation Cellular localizations: C, cytoplasmic; CM, cytoplasmic membrane; E, extracellular; P, periplasmic; U, unknown. SecP, SecretomeP; SP, signal peptide. Species: Bm, Brucella melitensis; Bs, Brucella suis; Re, Rhizobium etli. Proteins identified in late growth phase with their bioinformatic analysis and emPAI calculation Cellular localizations: C, cytoplasmic; CM, cytoplasmic membrane; E, extracellular; P, periplasmic; U, unknown. SecP, SecretomeP; SP, signal peptide. Species: Bm, Brucella melitensis; Bs, Brucella suis; Re, Rhizobium etli; Sm, Sinorhizobium meliloti. Proteomic analysis of the origin of the identified proteins in this study supports previous genomic studies showing that, phylogentically, the genus Ochrobactrum is most closely related to Brucella, with 93.9% of the proteins identified having closest match to this genus. The remaining proteins were matched to other members of the α-2 subgroup of the Proteobacteria (Rhizobacteria (3.8%), Bartonella (1.5%) and Agrobacterium (0.8%)). Of the 131 proteins detected in this study, functional roles for 125 proteins (95.4%) were known or could be predicted from database analysis. Proteins within this soluble sub-proteome were assigned to functional categories utilizing methodologies as previously described by Takami et al. [38] and Wasinger et al. [39]. Figure 2 shows that proteins of the largest category of identified proteins under both growth conditions were involved in protein synthesis (ribosomal proteins), followed by those involved in metabolism of nucleotides and nucleic acids, then those involved in metabolism of amino acids and related molecules. The remaining proteins were distributed amongst the other functional categories. The functional categories of Metabolism of nucleotides, DNA replication, RNA synthesis (elongation), Protein modification and Protein folding are found to be present at higher levels in early growth phase compared to late phase growth. In the late phase of growth, Transport proteins, Specific pathways, Metabolism of amino acids, Protein synthesis (ribosomal proteins) and Protein synthesis (tRNA synthetases) are better represented. Furthermore, the late growth phase was the only one to have proteins present from the Adaptation to atypical conditions (2.1%) and Detoxification (4.2%) functional categories. It is worth noting that assignment of proteins to functional categories is complicated, as exemplified in the case of the Metabolism of nucleotides category, by the anaplerotic nature of bacterial enzymes with a number of proteins that could also have been classified within the Metabolism of amino acids category.
Figure 2

Functional categorisation of identified proteins from the soluble sub-proteome of O. anthropi. Gray bars, early growth phase; black bars, late growth phase.

Functional categorisation of identified proteins from the soluble sub-proteome of O. anthropi. Gray bars, early growth phase; black bars, late growth phase. The rapid increase in genomic data over the past decade has revealed many important aspects of microbial cellular processes; however, there are still a significant number of potential gene products for which we know nothing, save that they are classified as 'hypothetical proteins'. Indeed, within the genome sequence of B. melitensis strain 16M, the closest relative phylogenetically of O. anthropi for which genomic data are available, some 716 predicted gene products, equivalent to 22% of the total genome, are predicted to be either hypothetical proteins or proteins of unknown function. In previous work we have underlined the necessity to assign, where possible, an element of biological functionality to such gene products in order to develop both systems biology and our understanding of cellular processes within these organisms. Within the current study we have established the presence of six proteins that had previously been annotated as hypothetical conserved proteins. The identification of such proteins within the cell-extract of O. anthropi establishes the biological functionality of these 'hypothetical' predicted protein coding sequences, and once more elegantly demonstrates the potential of proteomics to validate bioinformatics predictions. Having established the presence of such proteins and wishing to understand how they contribute to functional processes, we further examined them using NCBI BLASTp [40]. Such an approach allows conserved domains within protein sequences to be identified and thereby enables a degree of inferred functionality. Using this methodology, however, allowed us to assign putative function to only one of these proteins, NCBI:23463995. The search identified two conserved domains, pfam 01480, GFO_IDH_MocA; Oxidoreductase family involved in utilization of NADP or NAD and COG 1748; Saccharopine dehydrogenase and related proteins involved in amino acid transport and metabolism.

Sub-cellular protein localization

Sub-cellular localization prediction tools have been used for many years to identify those proteins that are retained by and exported from cells. They may also have uses in identifying possible diagnostic and therapeutic targets as well as providing information on the functionality of a protein [41]. In the current study a number of bioinformatics tools, including PSortB [41,42], SignalP [43,44] and SecretomeP [45,46] were utilized. These bioinformatics tools endeavor to assign a sub-cellular location for each protein. These tools use a set of descriptor rules and a variety of computational algorithms and networks to analyze a protein's amino acid composition in an attempt to identify known motifs or cleavage sites. The proteins identified in this study were separated into three groups and analyzed using the above bioinformatics tools. The groups were: those proteins only identified in early growth (bioinformatics search results can be seen in Table 1); those proteins found to be common to both growth conditions (bioinformatics search results can be seen in Table 2); and those proteins identified only at late growth phase (bioinformatics search results can be seen in Table 3). Overviews of the bioinformatic analysis on the proteins from the soluble sub-proteome of O. anthropi are shown for early growth (Figure 3), for both growth conditions (Figure 4) and for late growth (Figure 5).
Figure 3

Overview of identified proteins from the soluble sub-proteome of O. anthropi at the early growth phase. Cellular localization was predicted based upon the use of PSortB v2.0.4 [41,42], SignalP v3.0 [43,44], and SecretomeP v2.0 [45,46].

Figure 4

Overview of identified proteins from the soluble sub-proteome of O. anthropi present in both growth conditions. Cellular localization was predicted based upon the use of PSortB v2.0.4 [41,42], SignalP v3.0 [43,44], and SecretomeP v2.0 [45,46].

Figure 5

Overview of identified proteins from the soluble sub-proteome of O. anthropi present in late growth conditions. Cellular localization was predicted based upon the use of PSortB v2.0.4 [41,42], SignalP v3.0 [43,44], and SecretomeP v2.0 [45,46].

Overview of identified proteins from the soluble sub-proteome of O. anthropi at the early growth phase. Cellular localization was predicted based upon the use of PSortB v2.0.4 [41,42], SignalP v3.0 [43,44], and SecretomeP v2.0 [45,46]. Overview of identified proteins from the soluble sub-proteome of O. anthropi present in both growth conditions. Cellular localization was predicted based upon the use of PSortB v2.0.4 [41,42], SignalP v3.0 [43,44], and SecretomeP v2.0 [45,46]. Overview of identified proteins from the soluble sub-proteome of O. anthropi present in late growth conditions. Cellular localization was predicted based upon the use of PSortB v2.0.4 [41,42], SignalP v3.0 [43,44], and SecretomeP v2.0 [45,46]. Within the protein subset identified only in early growth, nine proteins were predicted to be secreted (26.5%), with six of those identified as possessing an amino-terminal signal peptide (Table 1); of those proteins common to both growth conditions, 15 were predicted to be secreted (27.3%), with five of those identified as possessing an amino-terminal signal peptide (Table 2); and of those identified only in late growth, 15 were predicted to be secreted (37.5%), with six of those identified as possessing an amino-terminal signal peptide (Table 3). The subset of 17 proteins identified as possessing an amino-terminal signal peptide were further analyzed for the presence of lipobox, RR-motif, and signal peptide cleavage sites to allow assignment, where possible, to a particular secretion pathway [31,32] (Table 4). Of these 17 proteins, only seven had the required architecture that would allow them to be assigned to the Sec pathway (NCBI:17982453, 17984491, 17982015, 17982340, 17982154, 17983192 and 86283673). The remainder of the proteins, whilst containing the correct cleavage site for a signal peptide, did not, in fact, have the full amino-terminal architecture that would be required to allow us to classify them as secreted proteins [47,48]. This once again highlights the limitations of some of the present generation of bioinformatic tools, which presently are concentrated largely on motif-based predictors. This aptly demonstrates the absolute necessity of manual interpretation of results in order to gain any level of biological significance.
Table 4

Proteins identified within the soluble sub-proteome of O. anthropi containing predicted export signal peptides

Accesion no. (NCBI)FunctionSignal peptide
17982947Methionyl-tRNA synthetaseMSPLTNFFSRAYHA
17983887Dihydroxy-acid dehydrataseMKMPPYRSRTTTHGRNMAGA
17982453*Hypothetical protein (immunoreactive 28 kDa omp)MNTRASNFLAASFSTIMLVGAFSLPAFA
179839493-Deoxy-manno-oculosonate cytidylyltransferaseMVLLPPRKTARVGTRRKPVFLSQTCANG
17984491*Lipoprotein (ABC transporter substrate binding protein)MSSVLSRYALTRRAGLKALLFTAAALTVGFASAPSHA
27353255Transriptional regulatory proteinMRAFTRFSYSHS
17982679Protein translation elongation factor TuMCWRLSGSRTKRTTAMA
17982015*Malate dehydrogenaseMRKETIMARNKIALIGSGMIGGTLA
17982340*Periplasmic dipeptide transport protein precursorMGCARQAFPWRRTIMKFYQKLLAATALVALMSGAASA
17982205TransketolaseMLCVPLPSGASSRKAA
17982141ATP synthase F1, beta chainMAKAATPKTTAAAEA
17982826DNA-binding protein HU alphaMPMNKNELVA
17982154*Leucine, isoleucine, valine, threonine and alanine binding proteinMGVPTMRKTLFSGVALAAVIAFGGSAWA
17983192*General L-amino acid-binding periplasmic protein AAPJ precursorMKKTLMTGVLGAAALFGIASGASA
1798268250S ribosomal protein L11MAKKVAGQLKLQVPAGA
17983290Dihydrodipicolinate synthaseMLGVPSICFRSSRMLKGSITA
86283673*Dipeptide ABC transporter, substrate bindingMMITRLSRKFRLLSAGAALSLLMMAAPSAFA

Putative signal peptides were predicted as described by Tjalsma et al. [47] and Pugsley [48]. *Proteins likely to be secreted via the Sec pathway. Signal peptide: the hydrophobic H-domain is in italics; positively charged amino acids are underlined; the signal peptide cleavage sites comprise the last three amino acid residues and are in bold.

Proteins identified within the soluble sub-proteome of O. anthropi containing predicted export signal peptides Putative signal peptides were predicted as described by Tjalsma et al. [47] and Pugsley [48]. *Proteins likely to be secreted via the Sec pathway. Signal peptide: the hydrophobic H-domain is in italics; positively charged amino acids are underlined; the signal peptide cleavage sites comprise the last three amino acid residues and are in bold.

Protein expression changes and pathway reconstruction

Utilizing the emPAI calculation for measuring protein abundance within our proteomic investigation allowed us to use the molar fraction percentage values for proteins common to both growth conditions; this enabled us to compare the fold change in protein expression that occurs under the two different conditions [37,49,50]. Two ranges are generally used in comparative proteomics to ascertain if the fold change in expression is significant. In the isobaric labeling technology iTRAQ™, a ≥20% change is considered significant and sufficient to take account of systematic errors; therefore, fold changes of ≥1.2 or ≤0.8 are significant, with a fold change value of 1 representing no difference in protein levels between the two states [51,52]. In the comparative two-dimensional PAGE technologies, a ≥50% change is considered significant and sufficient to take account of systematic errors; therefore, fold changes of ≥1.5 or ≤0.5 are significant, again with a fold change value of 1 representing no difference in protein levels between the two states [53,54]. The fold change in protein expression between proteins from the two growth conditions can be seen in Table 2. Taking the ≥20% cut-off value, 44 proteins significantly changed in expression; at the ≥50% cut-off value this is reduced to 19 proteins that significantly changed in expression between the two growth conditions. Utilizing the more stringent ≥50% value as a measure of differential protein expression, it can be seen that 11 proteins have much higher expression levels in the early growth condition and 6 have higher expression levels in the later growth condition (Figure 6).
Figure 6

Differential expression profile of proteins common to both growth phases of O. anthropi. Fold changes of ≥1.5 or ≤0.5 are significant.

Differential expression profile of proteins common to both growth phases of O. anthropi. Fold changes of ≥1.5 or ≤0.5 are significant. Using the available genome sequence of B. melitensis 16M, the closest relative of O. anthropi, and assuming a high degree of synteny between the genomes of these organisms, we investigated the genomic context of each gene found to be differentially expressed in this study. In this manner, we hoped that predicted transcriptional units for these proteins would be identified, thus elevating our functional understanding of the processes occurring within the organism. Of the 30S ribosomal proteins identified as differentially expressed, all were predicted to be transcribed independently [55], and four (30S ribosomal proteins S3, S5, S7 and S17) were found within the same region of the B. melitensis 16M genome. The reported differential expression of ribosomal proteins is not unusual in proteomics investigations; however, little information is available as to why certain component proteins of the 30S ribosome should be present at different levels. Of the remaining 12 proteins that were differentially expressed, the available in silico evidence suggests they are independently transcribed within the B. melitensis 16M genome [55].

Pathway identification

Previously, Djordjevic et al. [56] reported the necessity to identify within a proteomic study three enzymes present in a particular biochemical pathway in order to definitively state that such a pathway is present and active within the system under study. Ergo, in conjunction with the pathway reconstruction tool BioCyc [57], we have been able to identify the following pathways: superpathway of glycolysis, pyruvate dehydrogenase and TCA cycle (10 proteins) (Additional data file 1); superpathway of glyoxylate cycle (3 proteins) (Additional data file 1); fatty acid elongation (4 proteins) (Additional data file 2); de novo purine nucleotide biosynthesis (9 proteins) (Additional data file 3); arginine biosynthesis (3 proteins). Lying outside of our stringent rules for pathway identification but nonetheless worthy of note are two additional short pathways for which two out of four proteins (non-oxidative branch of the pentose phosphate pathway) and two out of three proteins (phenylalanine biosynthesis II) were identified in the current study. At both time points it is clear, as would be expected, that central metabolic pathways such as the TCA cycle and fatty acid biosynthesis are active in O. anthropi. In addition, two key enzymes involved in the oxidative pentose phosphate pathway, transketolase and transaldolase, are also found. These enzymes are essential in the recycling of excess pentose phosphate, formed when there is high demand on NADPH2-dependent biosynthetic pathways. In addition, this pathway also provides intermediates for nucleotide biosynthesis, and indeed nucleotide biosynthetic pathways are also apparently active, as might be expected, at both sampling points [58]. In early phase growth, enzymes necessary for peptidoglycan and lipid A biosynthesis were specifically detected, presumably due to the high demand for new cell wall and outer membrane layer components at this growth point [59]. Whilst the enzymes essential for ribonucleotide biogenesis were present at both conditions, only in early phase growth was nucleoside diphosphate kinase, the key component for deoxyribonucleotide synthesis, detectable, indicative of a demand for components involved in DNA replication. In late phase growth, evidence for the activation of gluconeogenesis was found as a likely result of nutrient depletion, with expression of malate dehydrogenase upregulated by 2.4-fold, and the two enzymes phosphoenol pyruvate (PEP) carboxykinase and pyruvate phosphate dikinase, essential for the production of PEP from oxaloacetate and pyruvate, respectively, detectable for the first time. It is of note that this pathway is considered to be important for the establishment of host infection by certain bacteria, such as Mycobacterium bovis and Xanthomonas campestris [60,61]. Similarly, the presence of serine hydroxymethyltransferase, which converts glycine to L-serine prior to its potential conversion to pyruvate, was found only in late phase growth. Intriguingly, the enzymes involved in biosynthesis of the amino acids arginine, lysine and phenylalanine were also unique to late phase growth. Whilst this may reflect a cellular demand for these amino acids, it is of note that the co-product of arginine synthesis is fumarate, which has the dual role of being an intermediate in both the TCA cycle as well as in gluconeogensis. One additional feature of late phase growth is the presence of a number of stress response proteins, particularly those of importance in oxidative stress resistance. The proteins thioredoxin (TrxC), alkyl hydroperoxide reductase (AhpC) and thiol peroxidase have been reported to be important for the survival of pathogenic bacteria within a host organism [62,63]. Indeed, these proteins have also been detected in both transcriptomic and proteomic studies investigating the role of oxidative stress in a number of important pathogenic organisms that include Escherichia coli, Candida albicans and Porphyromonas gingivalis [64-68]. The TrxC and AhpC genes are subject to control by the oxyR regulon, which is induced in response to oxidative stress resulting from hydrogen peroxide and other free oxygen radicals. During this process, the regulatory protein oxyR becomes oxidized by these reactive oxygen species to form an intramolecular disulphide bond, thus allowing the activation of expression from trxC, grxA, gorA and hence other genes of the OxyR regulon (Figure 7). Glutathione is an essential element in the regulatory cycle of oxyR, which may explain the presence of N-methylhydantoinase in late phase growth, as it is an integral part of the γ-Glutamyl pathway, which is responsible for the generation of glutathione from 5-oxoproline. During oxidative stress thioredoxin is produced and in its reduced form it acts as an acceptor of oxygen radicals generated as a result of the catalytic activity of thiol peroxidase on H2O2, thus scavenging and detoxifying the damaging oxygen radicals and concomitantly forming H2O and thioredoxin disulphide [69]. Thioredoxins may also be involved in a cascade that triggers the transcription of other detoxifying genes, as they have also been shown to interact with DNA gyrase and thus influence a multitude of transcriptional responses in the cell by increasing or decreasing DNA supercoiling. This strongly suggests that the gyrase-mediated effect of thioredoxins on gene expression is a common redox-dependent signaling pathway in bacterial adaptation [70]. Intriguingly, AhpC has also been found to be an important conserved bacterial allergen that interacts with mammalian Toll-like receptors, specifically the MyD88 protein, thus activating innate and adaptive immune responses within the host [71].
Figure 7

Proposed model for induction of the OxyR regulon in O. anthropi. Oxidized oxyR regulates expression of the OxyR regulon in response to oxidative and nitrosative stress, inducing trxC, grxA, gorA and other OxyR regulon genes. ahpC, alkyl hydroperodide reductase I; fhuF, ferric reductase; fur, ferric uptake repressor; GSSH/GSH, oxidized/reduced glutathione; GorA, glutathione reductase; GrxA, glutaredoxin A; katG, catalase (hydroperoxidase I); OMP agn43, outer membrane protein; oxyS, regulatory RNA; RNO, reactive nitrogen species; ROS, reactive oxygen species; TP, thiol peroxidase; TR, thioredoxin 2. (Adapted (with kind permission of Springer Science and Business Media) from Figure 2a [70].)

Proposed model for induction of the OxyR regulon in O. anthropi. Oxidized oxyR regulates expression of the OxyR regulon in response to oxidative and nitrosative stress, inducing trxC, grxA, gorA and other OxyR regulon genes. ahpC, alkyl hydroperodide reductase I; fhuF, ferric reductase; fur, ferric uptake repressor; GSSH/GSH, oxidized/reduced glutathione; GorA, glutathione reductase; GrxA, glutaredoxin A; katG, catalase (hydroperoxidase I); OMP agn43, outer membrane protein; oxyS, regulatory RNA; RNO, reactive nitrogen species; ROS, reactive oxygen species; TP, thiol peroxidase; TR, thioredoxin 2. (Adapted (with kind permission of Springer Science and Business Media) from Figure 2a [70].)

Conclusion

The popularity of 'identification' proteomics is evident from the abundant studies reported in the literature, and their contribution to our understanding of the diversity of protein expression in cells and organisms has been immense; however, these studies have clear limitations with regard to the amount of insight they can give on the function of a system. The trend within the proteomics community has, therefore, moved from this cataloguing approach towards the development of comparative and quantitative analyses that, as outlined in the present study, give greater insights into the functional processes occurring. Combining both of these techniques with rigorous data curation and interpretation, coupled with the vast array of bioinformatics tools available to the life scientist is the only way to ensure that these processes are accurately described such that meaningful data can be provided for the wider scientific community. In this study we were able to identify distinct proteomic profiles associated with specific growth points for the emerging nosocomial pathogen O. anthropi. For those proteins common to both growth phases, the use of emPAI allowed a semi-quantitative analysis of protein expression to be made and it was possible to reconstruct core metabolic pathways functioning within this organism. It was also possible to infer unique functional and adaptive processes associated with specific growth phases and, therefore, gain a much deeper understanding of the physiology and metabolism of this pathogenic bacterium. Of particular interest was the identification of a number of protein products involved in oxidative stress response that are known to be regulated as part of the oxyR regulon and have previously been shown to be key in pathogen survival within host environments.

Materials and methods

Reagents

All reagents were purchased from Sigma-Aldrich (Poole, UK) with the exception of mass spectrometry grade water and acetonitrile, which were purchased from Romil (Cambridge, UK) and trypsin, which was purchased from Promega (Southampton, UK).

Cell culture and growth conditions

O. anthropi UU551 was routinely maintained at 37°C on nutrient agar. Routine growth of the organism involved the inoculation of nutrient broth (50 ml in 250 ml Erlenmeyer flasks) with a loop of fresh, actively growing (16 h) culture from agar plates. Flasks were incubated aerobically at 37°C with orbital shaking at 200 rpm in an Innova™ 4230 refrigerated incubator shaker (New Brunswick Scientific, Edison, NJ, USA). Growth was monitored by the increase in culture attenuance at 600 nm.

Protein extraction and quantification

O. anthropi cultures were harvested under two separate growth conditions, at early phase (D600 = 0.3) and at late phase (D600 = 1.2) growth, by centrifugation at 9000 × g for 10 minutes at 3-5°C. The cell pellet was weighed and resuspended in 10 mM phosphate-buffered saline (pH 7.8) at a ratio of 1 g cells to 2 ml buffer. The cells were then broken using sonication as described previously by Graham et al. [32]. The soluble proteome fraction was isolated by centrifugation of the homogenate at 25,000 × g for 30 minutes at 3-5°C (Beckman J2-HS, Beckman Instruments, Fullerton, CA, USA) followed by ultracentrifugation at 150,000 × g for 2 hours at 3-5°C (Beckman L8-M, Beckman Instruments) to sediment the insoluble fraction. The supernatant was decanted and stored frozen in 1 ml aliquots at -70°C until required. The total soluble protein content was measured using the Bradford assay [72].

One-dimensional gel electrophoresis

An aliquot of the supernatant was diluted ten-fold with deionised water; 10 μl of this diluted sample was added to 10 μl Tris-Glycine SDS sample loading buffer (Invitrogen, Paisley, Renfrewshire, UK) and boiled for 5 minutes. The samples (20 μl; 100 μg total protein) were loaded onto a 1 mm thick Nu-Page 4-12% Bis-Tris gel (Invitrogen). SeeBlue™ Plus 2 (Invitrogen) was used as a protein molecular mass marker. The gel was electrophoresed, using MES SDS running buffer, in an X-Cell II mini gel system (Invitrogen) at 200 V, 120 mA, 25 W per gel for 35 minutes. Proteins were visualized using SimplyBlue™ Safestain (Invitrogen). The entire lane was excised from the gel and cut into nine fractions based on molecular mass as previously described by Graham et al. [35].

In-gel tryptic digestion

Excised gel fractions were washed for 30 minutes in 200 mM NH4HCO3, pH 7.8 at 37°C. These fractions were then dehydrated by incubation for 30 minutes in 200 mM NH4HCO3 pH 7.8/MeCN (4:6 v/v) at 37°C, followed by rehydration for 30 minutes in 50 mM NH4HCO3, pH 7.8 at 37°C. Following incubation in 100% acetonitrile for 2 minutes, 0.1 μg trypsin in 50 mM NH4HCO3, pH 7.8 was added to each sample, which was then incubated overnight at 37°C. The supernatant was subsequently recovered into microcentrifuge tubes and a second peptide extraction from these gel pieces was carried out (0.1% trifluoroacetic acid (TFA) in 60% acetonitrile for 5 minutes). Peptide-containing liquid fractions were pooled, dried under vacuum and re-suspended in 20 μl 0.1% formic acid in 2% acetonitrile prior to storage at -70°C until required.

Liquid chromatography-mass spectrometric analysis

Mass spectrometry was performed using a 3200 Q-TRAP Hybrid ESI Quadropole linear ion trap mass spectrometer, ESI-Q-q-Qlinear ion trap-MS/MS (Applied Biosystems/MDS SCIEX, Toronto, Canada) with a nanospray interface, coupled with an online Ultimate 3000 nanoflow liquid chromatography system (Dionex/LC Packings, Amsterdam, The Netherlands). A μ-Precolumn™ Cartridge (300 μm × 5 mm, 5 μm particle size) was placed prior to the C18 capillary column (75 μm × 150 mm, 3 μm particle size) to enable desalting and filtering. Both columns contained the reversed phase material PepMAP™ 100 (C18 silica-based) with a 100Å pore size (Dionex/LC Packings). The elution buffers used in the gradient were Buffer A (0.1% formic acid in 2% acetonitrile) and Buffer B (0.1% formic acid in 80% acetonitrile). The nanoLC gradient used was 60 minutes in length: 0-55% B in 45 minutes, 10 minutes at 90% B followed by 5 minutes at 100% A. The flow rate of the gradient was 300 nlmin-1. The detector mass range was set at 400-1,800 m/z. MS data acquisition was performed in positive ion mode. During MS acquisition, peptides with 2+ and 3+ charge states were selected for fragmentation.

Database searching, protein identification and PROVALT analysis

Protein identification was carried out using an internal MASCOT server (version 1.9; Matrix Science, London, UK) searching against the bacteria sub-set of the MSDB database (latest version at the time of processing). Peptide tolerance was set at ± 1.2 Da with MS/MS tolerance set at ± 0.6 Da and the search set to allow for one missed cleavage. In order to expedite the curation of the identified protein list from MASCOT, the result files were re-analyzed against an extracted database comprising eleven α-proteobacterial genome databases downloaded from NCBI using the heuristic method known as the protein validation tool PROVALT [36]. This automated program takes large proteomic MS datasets and reorganizes them by taking multiple MASCOT results and identifying those peptides that match. Redundant peptides are removed and related peptides are grouped together associated with their predicted matching protein; thus, the program dramatically reduces this portion of the curation process. For identification purposes the minimum peptide length was set at 6 amino acids, the minimum peptide MOWSE score was set at 25 and the minimum high quality peptide MOWSE score was set at 40. PROVALT also uses peptide matches from a random database (in this case the extracted α-proteobacterial database was randomized) to calculate false-discovery rates (FDRs) for protein identifications as previously described by Weatherley et al. [36]. Briefly, identifications from searching the normal and random databases are used to calculate the FDRs and set score thresholds and thus identify as many 'actual' proteins as possible while encountering a minimal number of false-positive protein identifications. Rather than calculate error rates at the peptide level, the FDR calculations employed by PROVALT provide a reasonable balance between the number of correct and incorrect protein assignments. In this study the FDR was set at 1%, meaning that 99% of the reported proteins identified should be correct.

Protein quantification and abundance measurements

Proteins within the two growth conditions were quantified utilizing emPAI [37,49,50]. This method allows the quantification of individual identified proteins by utilizing database and Mascot output information, in order to give an emPAI value. The emPAI value can then be used to estimate the protein content within the sample mixture in molar fraction percentages. Also, the fold change in expression levels of proteins identified under both growth conditions can be estimated, allowing further insights into cellular processes.

Pathway reconstruction

Pathways were reconstructed utilizing the BioCyc database [57], a collection of 160 pathway/genome databases for most eukaryotic and prokaryotic species whose genomes have been completely sequenced. The BioCyc collection provides a unique resource for computational systems biology by enabling global and comparative analyses of genomes and metabolic networks. Identified proteins can be entered into the database and searched against specific species, thus allowing scientists to visualize combinations of gene expression maps of these organisms and thus reconstruct pathways that are present.

Bioinformatics

PSORTb version 2.0.4 [41,42] was used for the prediction of bacterial protein subcellular localization. SignalP 3.0 [43,44] was used to predict the presence and location of signal peptide cleavage sites in amino acid sequences, for classically secreted proteins. SecretomeP 2.0 [45,46] was used for the prediction of non-classical protein secretion (that is, protein secretion that is not triggered by signal peptides).

Additional data files

The following additional data are available with the online version of this paper. Additional data file 1 is a figure illustrating the superpathway of glycolysis, pyruvate dehydrogenase, TCA and the superpathway of the glyoxylate cycle. Additional data file 2 is a figure illustrating fatty acid elongation. Additional data file 3 is a figure illustrating de novo purine nucleotide biosynthesis. Within these pathways proteins unique to the early growth phase are boxed in green, those identified in both growth conditions are boxed in blue and those unique to the late growth phase are boxed in yellow.

Additional data file 1

Proteins unique to the early growth phase are boxed in green, those identified in both growth conditions are boxed in blue and those unique to the late growth phase are boxed in yellow. Click here for file

Additional data file 2

Proteins unique to the early growth phase are boxed in green, those identified in both growth conditions are boxed in blue and those unique to the late growth phase are boxed in yellow. Click here for file

Additional data file 3

Proteins unique to the early growth phase are boxed in green, those identified in both growth conditions are boxed in blue and those unique to the late growth phase are boxed in yellow. Click here for file
  63 in total

1.  In vivo transcription of the Escherichia coli oxyR regulon as a function of growth phase and in response to oxidative stress.

Authors:  C Michán; M Manchado; G Dorado; C Pueyo
Journal:  J Bacteriol       Date:  1999-05       Impact factor: 3.490

2.  A Heuristic method for assigning a false-discovery rate for protein identifications from Mascot database search results.

Authors:  D Brent Weatherly; James A Atwood; Todd A Minning; Cameron Cavola; Rick L Tarleton; Ron Orlando
Journal:  Mol Cell Proteomics       Date:  2005-02-09       Impact factor: 5.911

Review 3.  Genome reduction in the alpha-Proteobacteria.

Authors:  Björn Sällström; Siv G E Andersson
Journal:  Curr Opin Microbiol       Date:  2005-10       Impact factor: 7.934

4.  Pulsed-field gel electrophoresis to study the diversity of whole-genome organization in the genus Ochrobactrum.

Authors:  Corinne Teyssier; Hélène Marchandin; Agnès Masnou; Jean-Luc Jeannot; Michèle Siméon de Buochberg; Estelle Jumas-Bilak
Journal:  Electrophoresis       Date:  2005-08       Impact factor: 3.535

5.  Unconventional genomic organization in the alpha subgroup of the Proteobacteria.

Authors:  E Jumas-Bilak; S Michaux-Charachon; G Bourg; M Ramuz; A Allardet-Servent
Journal:  J Bacteriol       Date:  1998-05       Impact factor: 3.490

6.  Molecular characterization of chromosomal class C beta-lactamase and its regulatory gene in Ochrobactrum anthropi.

Authors:  D Nadjar; R Labia; C Cerceau; C Bizet; A Philippon; G Arlet
Journal:  Antimicrob Agents Chemother       Date:  2001-08       Impact factor: 5.191

7.  Thermoadaptation trait revealed by the genome sequence of thermophilic Geobacillus kaustophilus.

Authors:  Hideto Takami; Yoshihiro Takaki; Gab-Joo Chee; Shinro Nishi; Shigeru Shimamura; Hiroko Suzuki; Satomi Matsui; Ikuo Uchiyama
Journal:  Nucleic Acids Res       Date:  2004-12-01       Impact factor: 16.971

8.  Xanthomonas campestris pv. campestris possesses a single gluconeogenic pathway that is required for virulence.

Authors:  Dong-Jie Tang; Yong-Qiang He; Jia-Xun Feng; Bao-Ren He; Bo-Le Jiang; Guang-Tao Lu; Baoshan Chen; Ji-Liang Tang
Journal:  J Bacteriol       Date:  2005-09       Impact factor: 3.490

Review 9.  Escherichia coli--a model system that benefits from and contributes to the evolution of proteomics.

Authors:  Pat S Lee; Kelvin H Lee
Journal:  Biotechnol Bioeng       Date:  2003-12-30       Impact factor: 4.530

Review 10.  Thioredoxins in bacteria: functions in oxidative stress response and regulation of thioredoxin genes.

Authors:  Tanja Zeller; Gabriele Klug
Journal:  Naturwissenschaften       Date:  2006-06
View more
  12 in total

1.  Mass spectrometry-based proteomics and peptidomics for biomarker discovery in neurodegenerative diseases.

Authors:  Xin Wei; Lingjun Li
Journal:  Int J Clin Exp Pathol       Date:  2008-06-20

2.  Genomic library screening for viruses from the human dental plaque revealed pathogen-specific lytic phage sequences.

Authors:  Ahmed Nasser Al-Jarbou
Journal:  Curr Microbiol       Date:  2011-10-04       Impact factor: 2.188

3.  Exoproteome of Staphylococcus aureus reveals putative determinants of nasal carriage.

Authors:  Gowrishankar Muthukrishnan; Gerry A Quinn; Ryan P Lamers; Carolyn Diaz; Amy L Cole; Sixue Chen; Alexander M Cole
Journal:  J Proteome Res       Date:  2011-03-07       Impact factor: 4.466

4.  Comparative proteomic analysis of pathogenic and non-pathogenic strains from the swine pathogen Mycoplasma hyopneumoniae.

Authors:  Paulo M Pinto; Cátia S Klein; Arnaldo Zaha; Henrique B Ferreira
Journal:  Proteome Sci       Date:  2009-12-21       Impact factor: 2.480

5.  Comparative genomics and proteomics of Helicobacter mustelae, an ulcerogenic and carcinogenic gastric pathogen.

Authors:  Paul W O'Toole; William J Snelling; Carlos Canchaya; Brian M Forde; Kim R Hardie; Christine Josenhans; Robert Lj Graham; Geoff McMullan; Julian Parkhill; Eugenio Belda; Stephen D Bentley
Journal:  BMC Genomics       Date:  2010-03-10       Impact factor: 3.969

6.  Comparative proteome analysis of Milnesium tardigradum in early embryonic state versus adults in active and anhydrobiotic state.

Authors:  Elham Schokraie; Uwe Warnken; Agnes Hotz-Wagenblatt; Markus A Grohme; Steffen Hengherr; Frank Förster; Ralph O Schill; Marcus Frohme; Thomas Dandekar; Martina Schnölzer
Journal:  PLoS One       Date:  2012-09-27       Impact factor: 3.240

7.  Comparative Proteomic Analysis of saccharopolyspora spinosa SP06081 and PR2 strains reveals the differentially expressed proteins correlated with the increase of spinosad yield.

Authors:  Yushuang Luo; Xuezhi Ding; Liqiu Xia; Fan Huang; Wenping Li; Shaoya Huang; Ying Tang; Yunjun Sun
Journal:  Proteome Sci       Date:  2011-07-16       Impact factor: 2.480

8.  An economical high-throughput protocol for multidimensional fractionation of proteins.

Authors:  David John Tooth; Varun Gopala Krishna; Robert Layfield
Journal:  Int J Proteomics       Date:  2012-09-12

9.  Comparative transcriptional analysis of clinically relevant heat stress response in Clostridium difficile strain 630.

Authors:  Nigel G Ternan; Shailesh Jain; Malay Srivastava; Geoff McMullan
Journal:  PLoS One       Date:  2012-07-30       Impact factor: 3.240

10.  iTRAQ analysis of complex proteome alterations in 3xTgAD Alzheimer's mice: understanding the interface between physiology and disease.

Authors:  Bronwen Martin; Randall Brenneman; Kevin G Becker; Marjan Gucek; Robert N Cole; Stuart Maudsley
Journal:  PLoS One       Date:  2008-07-23       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.