| Literature DB >> 14653859 |
Wei Wang1, Jibin Sun2, Manfred Nimtz3, Wolf-Dieter Deckwer1, An-Ping Zeng2.
Abstract
Separation of proteins by two-dimensional gel electrophoresis (2-DE) coupled with identification of proteins through peptide mass fingerprinting (PMF) by matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is the widely used technique for proteomic analysis. This approach relies, however, on the presence of the proteins studied in public-accessible protein databases or the availability of annotated genome sequences of an organism. In this work, we investigated the reliability of using raw genome sequences for identifying proteins by PMF without the need of additional information such as amino acid sequences. The method is demonstrated for proteomic analysis of Klebsiella pneumoniae grown anaerobically on glycerol. For 197 spots excised from 2-DE gels and submitted for mass spectrometric analysis 164 spots were clearly identified as 122 individual proteins. 95% of the 164 spots can be successfully identified merely by using peptide mass fingerprints and a strain-specific protein database (ProtKpn) constructed from the raw genome sequences of K. pneumoniae. Cross-species protein searching in the public databases mainly resulted in the identification of 57% of the 66 high expressed protein spots in comparison to 97% by using the ProtKpn database. 10 dha regulon related proteins that are essential for the initial enzymatic steps of anaerobic glycerol metabolism were successfully identified using the ProtKpn database, whereas none of them could be identified by cross-species searching. In conclusion, the use of strain-specific protein database constructed from raw genome sequences makes it possible to reliably identify most of the proteins from 2-DE analysis simply through peptide mass fingerprinting.Entities:
Year: 2003 PMID: 14653859 PMCID: PMC317362 DOI: 10.1186/1477-5956-1-6
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
Figure 1Typical image of 2-DE gel electrophoresis analysis of intracellular proteins of K. pneumoniae grown on glycerol. Spots numbers refer to those given in Tables 1 and 2. Red marked are proteins related to the dha regulon that is essential for the anaerobic glycerol metabolism in this organism.
Compare cross-species identification using public protein databases with identification using the specific protein database ProtKpn for the identification of protein of dha regulon of Klebsiella pneumoniae
| 5 | Glycerol dehydrogenase (EC 1.1.1.6) (GLDH) P45511_GLDA_CITFR | 862 | n.i. | n.i. | - | n.i. | - | 121 | 39 |
| 39 | n.i. | n.i. | - | n.i. | - | 136 | 34 | ||
| 97 | Dihydroxyacetone kinase (EC 2.7.1.29) sp| P45510| DAK_CITFR | 332 | n.i. | 0.19 | 9.4 | n.i. | - | 131 | 31 |
| 42 | Dehydroxyacetone kinase II, subunit 1 (EC 2.7.1.121) P76015_YCGT_ECOLI | 863 | n.i. | n.i. | - | n.i. | - | 102 | 42 |
| 114 | n.i. | n.i. | - | n.i. | - | 89 | 29 | ||
| 18 | Dihydroxyacetone kinase II, subunit 2 (EC 2.7.1.121) P76014_YCGS_ECOLI | 864 | n.i. | n.i. | - | n.i. | - | 152 | 77 |
| 153 | n.i. | n.i. | - | n.i. | - | 103 | 60 | ||
| 33 | Glycerol dehydratase beta subunit (EC 4.2.1.30) tr| O08505 [Klebsiella pneumoniae] | 855 | 91 | n.i. | - | n.i. | - | 71 | 46 |
| 62 | glycerol dehydratase small subunit (EC 4.2.1.30) tr| Q59475 | 854 | 98 | n.i. | - | 94 | 52 | 96 | 52 |
| 12 | 1,3-propanediol dehydrogenase (EC 1.1.1.202) Q59477_DHAT_KLEPN | 858 | n.i | 0.31 | 34.1 | 94 | 34 | 105 | 36 |
| 35 | 99 | 0.35 | 54.0 | 182 | 54 | 175 | 48 | ||
| 36 | 91 | 0.18 | 37.0 | 75 | 36 | 63 | 26 | ||
| 103 | n.i. | 0.25 | 31.8 | 114 | 38 | 114 | 38 | ||
| 107 | 99 | 0.42 | 35.9 | 158 | 41 | 184 | 41 | ||
| 6 | Hypothetical oxidoreductase yqhD (EC 1.1.-.-) Q46856_YQHD_ECOLI | 3405 | n.i. | 0.1916 | 15.5 | n.i. | - | 199 | 40 |
| 40 | n.i. | n.i. | - | n.i. | - | 130 | 29 | ||
| 117 | n.i. | n.i. | - | n.i. | - | 119 | 37 | ||
| 129 | Putative glycerol dehydrogenasetr| Q8ZR27 | 2195 | 74 | 42 | |||||
| 151 | orfY, unknown function gi| 940439| (U30903) [Klebsiella pneumoniae] | 857 | 84 | n.i | - | 81 | 48 |
aRefers to the proteins labelled in Figure 1b Protein access numbers in the ProtKpn database cKnexus uses ProFound as search program. Profound calculates the probability that a candidate in a database search is the protein being analysed., A Z score is estimated as an indicator of the quality of the search result, when the search result is compared against an estimated random match population. Z score is the distance to the population mean in unit of standard deviation. It also corresponds to the percentile of the search in the random match population. d Using Peptident score is the number of peptides that match the theoretical peptides from a database entry divided by the total number of peptide masses specified for the search. Using Mascot score is -10*Log(P), where P is the probability that the observed match is a random event. If there is also a superscript number beside the score, it represents the position of this protein in the protein candidate list. Otherwise, it is the top one. eSC: Sequence coverage, defined as the ratio of the portion of protein sequence covered by matched peptides to the whole length of protein sequence.