| Literature DB >> 25888384 |
Sa Rang Kim1, Tuong Vi Nguyen2, Na Ri Seo3, Seunghup Jung4, Hyun Joo An5, David A Mills6, Jae Han Kim7.
Abstract
BACKGROUND: Comparative proteomics in bacteria are often hampered by the differential nature of dataset quality and/or inherent biological deviations. Although common practice compensates by reproducing and normalizing datasets from a single sample, the degree of certainty is limited in comparison of multiple dataset. To surmount these limitations, we introduce a two-step assessment criterion using: (1) the relative number of total spectra (R TS ) to determine if two LC-MS/MS datasets are comparable and (2) nine glycolytic enzymes as internal standards for a more accurate calculation of relative amount of proteins. Lactococcus lactis HR279 and JHK24 strains expressing high or low levels (respectively) of green fluorescent protein (GFP) were used for the model system. GFP abundance was determined by spectral counting and direct fluorescence measurements. Statistical analysis determined relative GFP quantity obtained from our approach matched values obtained from fluorescence measurements.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25888384 PMCID: PMC4704264 DOI: 10.1186/s12859-015-0561-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Cell growth curve of L. lactis HR279 (triangle) and JHK24 (circle). The fluorescence from the HR279 and JHK24 are depicted as open triangles and open circles. The arrows indicated the induction of GFP expression by adding a nisin and the sample collection points.
The summary of the LC-MS/MS results
|
|
|
|
|
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
|
| |||||||||||||||
|
| exp | 2406 | 2878 | 4150 | 1.20 | 1.72 | 1.44 | 221 | 7 | 11 | 20 | 4 | 4 | 17 | 284 |
|
| stat | 4492 | 4362 | 4347 | 1.03 | 1.03 | 1.00 | 262 | 18 | 9 | 6 | 2 | 4 | 2 | 303 |
|
| |||||||||||||||
|
| exp | 3226 | 2522 | 4514 | 1.28 | 1.40 | 1.79 | 233 | 7 | 26 | 21 | 1 | 3 | 17 | 308 |
|
| stat | 3810 | 4339 | 4259 | 1.14 | 1.12 | 1.02 | 242 | 6 | 6 | 43 | 4 | 9 | 5 | 315 |
SpC is a total number of MS/MS spectra used to identify proteins in sample. False discovery rates (FDR) of peptides were calculated by searching the MS/MS spectra against the forward and the reversed entry database independently. The searching was performed with the 1% of FDR level.
R is the relative number of total spectra.
The A, B and C represent the replicates of each sample., A∩B∩C and A∩B represent the number of proteins appeared in all triplicates and two (A and B) out of three replicates, respectively.
Total indicated the number of proteins identified from biological replicates. The guideline of protein identification was described in Result section. The number of unique peptide, X! Tandem value and the number presence among replicates were used as a parameter to decide the presence of proteins.
e,Bacterial cell was taken at the early exponential and early stationary phase of cell growth stage, respectively.
Figure 2Correlation between R and variability of biological replicates. Three way correlations on 3D space was projected on each xyz-plane describing the linear correlation between (A) R and standard deviation of total proteins (SD_SRA[rep, k]), (B) R and standard deviation of internal standards (SD_SRA[rep, INj]), and (C) standard deviation of total protein (SD_SRA[rep, k]) and internal standards (SD_SRA[rep, INj]). The X-axis of graph (A) and Y-axis of graph (C) shard the same value and range of SD_SRA[rep, k]. The circles, squares, diamonds, and triangles represent comparisons between biological replicates of samples High-1, High-2, Low-1 and Low-2 in Table 2, respectively.
Relative amount of GFP between high and low expression system at different stage of cell growth
|
|
|
|
|
| |
|---|---|---|---|---|---|
|
|
| ||||
| High-1 | 1A/3A | 1.34 | −1.43 ± 0.44 | 2.49 ± 0.57 | 2.86 ± 0.42 |
| 1B/3A | 1.12 | −1.03 ± 0.32 | 2.66 ± 0.71 | ||
| 1C/3A | 1.29 | −1.07 ± 0.38 | 2.51 ± 0.75 | ||
| 1A/3B | 1.05 | −1.23 ± 0.66 | 3.88 ± 1.69 | ||
| 1B/3B | 1.14 | 1.23 ± 0.53 | 2.12 ± 0.66 | ||
| 1C/3B | 1.65 | 1.22 ± 0.77 | 2.06 ± 0.94 | ||
| 1A/3C | 1.88 | −1.44 ± 0.87 | 7.09 ± 3.17 | ||
| 1B/3C | 1.57 | −1.01 ± 0.51 | 3.89 ± 1.23 | ||
| 1C/3C | 1.09 | 1.35 ± 0.44 | 3.48 ± 0.59 | ||
| High-2 | 2A/4A | 1.18 | −1.09 ± 0.27 | 3.27 ± 0.70 | 4.00 ± 0.62 |
| 2B/4A | 1.14 | −1.12 ± 0.37 | 3.66 ± 1.08 | ||
| 2C/4A | 1.14 | −1.14 ± 0.34 | 4.00 ± 1.09 | ||
| 2A/4B | 1.04 | −1.04 ± 0.19 | 3.87 ± 0.61 | ||
| 2B/4B | 1.01 | −1.06 ± 0.20 | 4.28 ± 0.76 | ||
| 2C/4B | 1.00 | −1.09 ± 0.27 | 4.73 ± 1.10 | ||
| 2A/4C | 1.05 | −1.06 ± 0.23 | 3.80 ± 0.79 | ||
| 2B/4C | 1.02 | −1.08 ± 0.30 | 4.24 ± 1.06 | ||
| 2C/4C | 1.02 | −1.05 ± 0.19 | 4.58 ± 0.89 | ||
aNumber indicates the group of sample and A, B, and C indicates the biological replicates as described in Table 1.
bInternal standard is the average relative amount of internal standards between two biological replicates.
cThe relative amount of GFP between two biological replicates of samples calculated by the spectral counting method with the use of internal standards.
dFluorescence was measured in triplicate from separate biological replicates.
Figure 3Correlation of R and the standard deviation of a relative amount of internal standards. Correlation of R and the standard deviation of a relative amount of internal standards between replicates (SD_SRA[rep, INj]) and between two samples (SD_SRA[comp, INj]). The standard deviations obtained from the comparison of High-1vs Low-1, High-2 vs. Low-2 and replicates in each group are shown as squares, triangles and circles, respectively. The diamonds depict the outliers from the comparisons between the sample High-1 and Low-1.
Figure 4The impact of comparability assessment and the use of internal standards. The relative increase of GFP expression calculated by the spectral counting method (black bar) was compared to that obtained by the external measurement using fluorescent (white bar). IS and CA represented the use of internal standards and comparability assessment, and the sign of plus (+) and minus (−) indicates the “the use” and “without the use” of method, respectively. For example, IS(+)/CA(+) meant the relative amount of GFP calculated with the use of internal standard and comparability assessment of replicates. The p-values were obtained by the Student t-test (see the Experimental procedures). (A) and (B) is the comparison of sample obtained at early and late exponential phase, respectively.
Protein scoring system for the protein determination in replicates
|
|
|
| |
|---|---|---|---|
| ≥2 |
| ≥10 | 3 |
| ≥2 |
| ≥10 | 2 |
| =1 |
| 2 ≤ −Log(e) < 10 | 1 |
aPepuniq is the number of unique peptide used to the protein identification.
b-Log (e) is the expectation value of protein identification by X!Tandem.
Identification score of protein k (ID_S(P )) was calculated as a sum of each scores obtained from each replicates. When n = 3 (triplicate), protein with ID_S(P ) ≥ 3 was considered present in a sample.
Internal standards used in this study
|
|
|
|
|---|---|---|
| 15674150 |
| glucose-6-phosphate isomerase |
| 15673315 |
| 6-phosphofructokinase |
| 15673891 |
| fructose-bisphosphate aldolase |
| 15673116 |
| triosephosphate isomerase |
| 15674228 |
| glyceraldehyde 3-phosphate dehydrogenase |
| 15672227 |
| phosphoglycerate kinase |
| 15672318 |
| phosphoglyceromutase |
| 15672626 |
| phosphopyruvate hydratase |
| 15673314 |
| pyruvate kinase |