| Literature DB >> 18312684 |
Mark K Titulaer1, Dana A N Mustafa, Ivar Siccama, Marco Konijnenburg, Peter C Burgers, Arno C Andeweg, Peter A E Sillevis Smitt, Johan M Kros, Theo M Luider.
Abstract
BACKGROUND: A Java application is presented, which compares large numbers (n > 100) of raw FTICR mass spectra from patients and controls. Two peptide profile matrices can be produced simultaneously, one with occurrences of peptide masses in samples and another with the intensity of common peak masses in all the measured samples, using the peak- and background intensities of the raw data. In latter way, more significantly differentially expressed peptides are found between groups than just using the presence or absence in samples of common peak masses. The software application is tested by searching angiogenesis related proteins in glioma by comparing laser capture micro dissected- and enzymatic by trypsin digested tissue sections.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18312684 PMCID: PMC2323386 DOI: 10.1186/1471-2105-9-133
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Overview of all the instruments and MS file types that can be handled by the database application.
| Equipment | Type | File names | File content |
| Ultraflex ™ MALDI-TOF (Bruker Daltonics) | fid | fid, acqus, acqu | raw binary data, calibration constants |
| text | *.txt | mass and intensity peak list, space or tab separated | |
| APEX-Q 9.4T MALDI-FTICR (Bruker Daltonics) | fid | fid, acqus, acqu | raw binary data, calibration constants |
| text | *.txt | mass and intensity peak list, space or tab separated | |
| xml | exportedSpectrum.xml | <pk mz="mass1" i="intensity1"/> | |
| Off-line LC-MALDI (TOF/TOF) (file-export WARP-LC™ software, Bruker Daltonics) | text | *.txt | mass, intensity and Retention time peak list, space or tab separated data |
| xml | *.xml | <Compound mass="mass1" | |
The database application is written in Java and R and uses a MySQL™ database. The files types may consist of 1) raw binary MS spectra (the fid files), 2) exported peak lists containing tab or space separated masses and corresponding intensities, and 3) XML files.
A fragment of mass and intensity peaks lists of 2 samples, illustrating the "missing data" problem.
| A) | ||||
| Sample 1 | Sample 2 | |||
| Mass (Da) | Intensity (A.U.) | Mass (Da) | Intensity (A.U.) | |
| ? | ← | 1193.6166 | 756021 | |
| ? | ← | 1194.5728 | 191048 | |
| 1194.6477 | 175777 | 1194.6477 | 271465 | |
| 1197.1376 | 126487 | → | ? | |
| ? | ← | 1197.6611 | 124472 | |
| ? | ← | 1198.5193 | 162275 | |
| 1198.7047 | 3154774 | 1198.7047 | 2435350 | |
| B) | ||||
| Sample 1 | Sample 2 | |||
| Mass (Da) | Intensity (A.U.) | Mass (Da) | Intensity (A.U.) | |
| 0 | ← | 1193.6166 | 756021 | |
| 114322 | ← | 1194.5728 | 191048 | |
| 1194.6477 | 175777 | 1194.6477 | 271465 | |
| 1197.1376 | 126487 | → | 80604 | |
| 0 | ← | 1197.6611 | 124472 | |
| 0 | ← | 1198.5193 | 162275 | |
| 1198.7047 | 3154774 | 1198.7047 | 2435350 | |
A) Missing data problem. It appears that the signal intensity from of a peptide mass of 1193.6166 Da in sample 2 is not matched with an intensity of the peptide mass from another sample 1 and vice versa the intensity of peptide mass of 1197.1376 Da of sample 2 is not observed in sample 1.
B) More reliable comparison including background noise signal by recording all the peaks with intensities between the baseline and the S/N threshold in a "noise peaks" file.
A.U. = arbitrary units
Figure 1A fragment of a MALDI-FTICR MS spectrum as generated from the raw binary fid file. The baseline is the bottom horizontal line. The signal to noise threshold is the upper horizontal line. The peaks at masses 1808.9034 Da and 1809.9062 Da have intensities above the S/N threshold. The masses at 1808.9034 and 1809.9062 Da are used to create the peptide profile matrix. The intensities of all four peaks, including the background signal at approximately masses at 1810.9 and 1811.9 Da are used in the intensity peptide profile matrix.
Figure 2Distribution of the mean isotopic distances . The dashed lines represent the acceptance window, between μs – σs and μs + σs of mean isotopic distances of real isotopic clusters. The larger mass accuracy with larger peptide masses is reflected by the larger scatter of isotopic distances μj
Mass accuracy of peptide masses when measured with FTICR MS.
| Average mass accuracy in ppm from calculated masses | Standard deviation in ppm | |
| Theoretical with zero-filling factor 4 | 1.05 | 0.27 |
| Data analysis 3.4 (Build 169) peak-list of sample H7 masses with intensities higher than a threshold of signal to noise 1.7 | 0.81 | 0.59 |
| Peak list of sample H7 exported from this application with masses with intensities higher than a threshold of signal to noise 4 | 1.10 | 0.91 |
| Masses in the peptide profile matrix which are present in 5 or more samples of total 38 samples | 1.03 | 0.72 |
| Masses in the peptide profile matrix, which are present in 18 samples or more of total 38 samples | 0.67 | 0.48 |
| 20 peak masses with the highest mean peak intensities based on 586 peak and background signals | 0.91 | 0.58 |
| 20 peak masses with middle mean peak intensities based on 383 peak and background signals | 0.66 | 0.39 |
| 20 peak masses with the lowest mean peak intensities based on 254 peak and background signals | 1.54 | 0.82 |
| Measured on Glial fibrillary acidic protein (GFAP) with SwissProt™ accession code : P14136, masses MH+: 856.4523, 906.46795, 911.45811, 986.52653, 988.5058, 1032.52076, 1039.55307, 1042.60036, 1050.52143, 1087.5742, 1098.62654, 1103.53273, 1108.47613, 1117.57353, 1177.62113, 1208.59055, 1215.63277, 1244.65934, 1245.66846, 1263.63274,1277.70995, 1357.75461, 1405.80492, 1409.74227, 1499.78523, 1505.76281, 1629.93224, 1634.76897, 1697.79852, 1746.91329, 1841.97153, 1853.89963, 1878.96679, 2075.14949, 2332.9597, 2348.14766 Da | ||
| Measured on Keratin, type I cytoskeletal 10 with SwissProt™ accession code: P13645, masses MH+ : 847.45197, 995.51965, 1031.59829, 1109.48979, 1118.50858, 1165.58475, 1201.61713, 1234.67896, 1237.58476, 1262.59711, 1324.61679, 1365.6393, 1381.6481, 1390.68082, 1434.76995, 1493.73429, 1707.77202, 1797.01158, 1996.97095, 2025.94334, 2096.04671, 2224.14169, 2295.12852, 2367.26265, 2872.39273, 3052.62729 Da. | ||
The mass accuracy is determined on tryptic fragments. The experimental mass accuracy was determined from the calculated value of tryptic peptide fragments of 2 proteins; 1) Glial fibrillary acidic protein with entry name GFAP_HUMAN and SwissProt™ accession number P14136, and 2) Keratin, type I cytoskeletal 10 with entry name K1C10_HUMAN and SwissProt™ accession number P13645.
Figure 3Heat-map of the hierarchical clustering on the presence-absence peptide mass profile matrix in two dimensions of the 38 samples and 2375 peptide masses. The data of the clustering table is added as an additional file 7. A number of 10 spectra of glioma blood vessels with codes H1 to H10, 10 spectra of tissue surrounding the glioma vessels with codes TH1 to TH10, 10 spectra of normal vessels with codes S1 to S10, and 10 spectra of tissue surrounding the normal vessels with codes TS1 to TS10 are included. Two "normal vessels" samples, S5 and TS5, were excluded because they could not be calibrated. The highlighted box in Figure 3 represents the hierarchical clustering order 490 with mass 1037.5355 Da to 789 with mass 1665.7891 Da as presented (see Additional file 7).
Proteins significantly up-regulated in the "glioma vessels" group with respect to the "normal vessels" group.
| Name | SwissProt™ acc. code | Nr. hits p-value< 0.01 | Nr. hits p-value< 0.1 | MS-MS glioma vessels | MS-MS glioma tissue and vessels | MS-MS normal vessels | MS-MS normal tissue and vessels |
| Vimentin | P08670 | 12 | 18 | yes | yes | no | no |
| Glial fibrillary acidic protein (GFAP) | P14136 | 3 | 12 | yes | yes | yes | yes |
| Serum albumin precursor | P02768 | 4 | 10 | yes | yes | no | no |
| Annexin A5 | P08758 | 2 | 5 | no | yes | no | no |
| Actin, cytoplasmic 1 (Beta-actin) | P60709 | 0 | 12 | yes | yes | yes | yes |
| Keratin, type I cytoskeletal 10 | P13645 | 0 | 8 | yes | no | yes | no |
| Actin, alpha cardiac (Alpha-cardiac actin) | P68032 | 0 | 7 | no | yes | no | yes |
| Desmin | P17661 | 0 | 6 | no | yes | no | no |
| Calcium-binding protein p22 (Calcium-binding protein CHP) | Q99653 | 0 | 3 | no | no | no | no |
A number of 95 (4%) of the measured peptide masses with Wilcoxon-Mann-Whitney p-values < 0.01 and raised intensities in the glioma vessel group are compared with calculated peptide masses from proteins in the SwissProt™ database using MASCOT™. A mass tolerance of 2 ppm is used. The MASCOT™ search is repeated with another number of 442 (19%) of peptide masses that have a p-value < 0.1 and with up-regulated intensities in the "glioma vessels" group. Only proteins are listed that have at least three hits of experimental measured peptide masses corresponding within 2 ppm with the calculated peptide masses of these proteins.
Figure 4Increased peak intensity at the mass MH. The MALDI-FTICR Calcium-binding protein p22 peptide peaks of glioma samples are represented by the dark lines (by contrast the peptide mass of 2',3'-cyclic-nucleotide 3'-phosphodiesterase at 1508.8739 Da represented by the grey lines is as expected not present in glioma vessels).
Proteins associated with glioma vessel formation established from enzymatic digested by trypsin peptide masses that are exclusively present in the "glioma vessels" group.
| Measured mass MH+ (Da) | Calculated mass MH+ (Da) | ppm | p-value Intensity H against S | Intensity H > S | Glioma vessels (H) | Normal vessels (S) | Glioma tissue(TH) | Normal tissue (TS) |
| 1229.6644 | 1229.66365 | 0.61 | 0.00411 | yes | 6 | 0 | 0 | 0 |
| 1656.8022 | 1656.80295 | 0.45 | 0.13120 | yes | 5 | 1& | 0 | 0 |
| 1620.7609 | 1620.76073 | 0.10 | 0.25509 | yes | 4 | 0 | 0 | 0 |
| 1772.8870 | 1772.88871 | 0.96 | 0.80459 | no | 3 | 3 | 1 | 1 |
| 1807.9063 | 1807.90469 | 0.89 | 0.00089 | yes | 8 | 0 | 0 | 0 |
| 1355.6954 | 1355.69536 | 0.03 | 0.00125 | yes | 7 | 0 | 0 | 0 |
| 1625.8539 | 1625.85196 | 1.19 | 0.00232 | yes | 5 | 0 | 0 | 0 |
| 1349.6848* | 1349.68481 | 0.01 | 0.00345 | yes | 7 | 0 | 0 | 0 |
| 1926.0516 | 1926.04832 | 1.70 | 0.00503 | yes | 8 | 1& | 0 | 0 |
| 1629.8728* | 1629.87067 | 1.31 | 0.00873 | yes | 5 | 0 | 0 | 0 |
| 2470.3200 | 2470.31874 | 0.51 | 0.01233 | yes | 7 | 1 | 1 | 0 |
| 1078.5537 | 1078.55275 | 0.88 | 0.01835 | no | 0 | 3 | 1 | 1 |
| 1818.9739 | 1818.97078 | 1.72 | 0.02049 | yes | 5 | 0 | 0 | 0 |
| 2524.3649 | 2524.36568 | 0.31 | 0.02049 | yes | 5 | 0 | 0 | 0 |
| 1401.6649* | 1401.66580 | 0.64 | 0.02170 | yes | 6 | 0 | 0 | 0 |
| 1341.6739 | 1341.67570 | 1.34 | 0.02663 | yes | 5 | 1 | 3 | 0 |
| 1110.5413 | 1110.54257 | 1.14 | 0.06243 | yes | 4 | 0 | 2 | 0 |
| 1593.8126 | 1593.81186 | 0.46 | 0.06243 | yes | 6 | 1 | 1 | 0 |
| 1292.6740 | 1292.67324 | 0.59 | 0.06373 | yes | 5 | 1 | 0 | 1 |
| 1297.7117 | 1297.71102 | 0.52 | 0.09473 | yes | 3 | 0 | 1 | 1 |
| 1591.8091 | 1591.80744 | 1.04 | 0.12668 | yes | 4 | 0 | 2 | 0 |
| 1235.5266 | 1235.52620 | 0.32 | 0.13120 | yes | 3 | 1 | 0 | 1 |
| 1431.7507 | 1431.74915 | 1.08 | 0.18754 | yes | 5 | 2 | 0 | 0 |
| 2139.9619 | 2139.96514 | 1.51 | 0.45543 | yes | 5 | 1 | 6 | 0 |
| 858.4301 | 858.43157 | 1.71 | 0.95436 | no | 2 | 1 | 2 | 0 |
| 1291.7265 | 1291.72560 | 0.70 | 1.00000 | yes | 3 | 3 | 1 | 0 |
| 960.5588 | 960.55850 | 0.31 | 0.02082 | yes | 6 | 0 | 4 | 0 |
| 1138.5646 | 1138.56262 | 1.74 | 0.00959 | yes | 5 | 0 | 0 | 1 |
| 1508.7103 | 1508.70876 | 1.02 | 0.05363 | yes | 5 | 0 | 0 | 0 |
| 1869.9677* | 1869.96645 | 0.67 | 0.00170 | yes | 6 | 0 | 0 | 0 |
| 1155.6643 | 1155.66325 | 0.91 | 0.00310 | yes | 5 | 0 | 0 | 0 |
| 1949.9955 | 1949.99268 | 1.45 | 0.01285 | yes | 8 | 0 | 0 | 0 |
| 2493.1740* | 2493.17392 | 0.03 | 0.04085 | yes | 6 | 0 | 0 | 0 |
| 2472.1699 | 2472.17356 | 1.48 | 0.05363 | yes | 5 | 0 | 0 | 0 |
| 1393.7099# | 1393.71102 | 0.80 | 0.07335 | no | 1 | 4 | 3 | 7 |
| 1946.0059 | 1946.00765 | 0.90 | 0.33559 | yes | 7 | 4 | 6 | 0 |
| 924.4946 | 924.49377 | 0.90 | 0.60387 | yes | 3 | 2 | 3 | 0 |
| 1309.6023† | 1309.60188 | 0.32 | 0.61715 | yes | 4 | 3 | 1 | 0 |
| 1193.6166* | 1193.61606 | 0.45 | 0.75200 | yes | 3 | 2 | 4 | 2 |
| 1310.7038$ | 1310.70626 | 1.88 | 1.00000 | yes | 1 | 0 | 7 | 0 |
The listed proteins are found by a MASCOT™ search with 2 or more hits between measured and calculated tryptic peptide masses with an error tolerance of 2 ppm. Only proteins are listed that had at least one peptide mass with a Wilcoxon-Mann-Whitney p-value < 0.01 comparing the peak intensities of glioma vessels with normal vessels.
* WARP-LC MS-MS MALDI-TOF glioma tissue and vessels (see Additional file 1).
# SwissProtTM accession code: P40925, name: Malate dehydrogenase, cytoplasmic, identified by WARP-LC MS-MS on glioma tissue and vessels and normal tissue and vessels (see Additional file 1).
† SwissProtTM accession code: P08670, name: Vimentin, sequence: NLQEAEEWYK, MALDI TOF MS-MS by hand on the “glioma tissue and vessels” LC-MALDI plate using the BiotoolsTM 3.0 (build 1.68) software.
$ SwissProtTM accession code: P62805, name: Histone H4, identified by WARP-LC MS-MS on glioma tissue and vessels and normal tissue and vessels (see Additional file 1).
& outlier sample S10.
Distribution of tryptic peptide fragments of colligin-2 among the tissue sections of different groups Peptide masses obtained by MALDI-FTICR MS are listed that match within 2 ppm with the calculated values of enzymatic by trypsin digested fragments.
| SwissProt™ accession code P50454, name: Collagen-binding protein 2 precursor (Colligin 2) | ||||||||
| Measured mass MH+ (Da) | Calculated mass MH+ (Da) | ppm | p-value Intensity H against S | Intensity H > S | Glioma vessels (H) | Normal vessels (S) | Glioma tissue (TH) | Normal tissue (TS) |
| 1021.5304 # | 1021.53213 | 1.69 | 0.77475 | no | 1 | 2 | 0 | 3 |
| 1293.6805 | 1293.67973 | 0.60 | 0.01244 | yes | 5 | 1 | 1 | 3 |
| 1659.8011 * | 1659.80126 | 0.10 | 0.02170 | yes | 5 | 1& | 0 | 0 |
| 1675.7747 ? | 1675.77577 | 0.64 | 0.01088 | yes | 1 | 0 | 2 | 4 |
* WARP-LC MS-MS MALDI-TOF glioma tissue and vessels (see additional file 1)
# SwissProt™ accession code: P07196, name: Neurofilament triplet L protein, identified by WARP-LC MS-MS MALDI-TOF on normal tissue and vessels (see additional file 1)
? SwissProt™ accession code: P17677, name: Neuromodulin (Axonal membrane protein GAP-43) ?
&outlier sample S10
Figure 5Increased peak intensity at the mass MH. The peaks of glioma vessel samples are represented by the dark lines in MALDI-FTICR mass spectra of all samples. The first isotopic mass of Neurofilament triplet L protein at 1154.7128 Da, represented by the grey lines, is as expected not present in Glioma vessels.
Figure 6An equal distribution of peak intensities among all samples of a Myosin-9 peptide mass at 1193.6166 Da. The dark lines represent the peak intensities of the glioma vessel samples.
Figure 7Strong up-regulated peptides of Myosin-9 in the "glioma vessels group" versus the "normal vessels group" located at the C-terminus of the protein. The strong up-regulated peptides are approximately located from amino-acid position 1301 to 1959.
Wilcoxon-Mann-Whitney p-values of Myosin-9 peptide peak mass intensities between the "glioma vessels" (series H) and "normal vessels" (series S) The p-value gradually drops from 0.60387 to 0.00310 from start position 711 to 1923 of the amino acid in the protein.
| MS-MS | Position start | Position end | Observed mass MH+ | Calculated mass MH+ | ppm | Missed cleavages | Sequence | Wilcoxon-Mann-Whitney p-value | Mean intensity glioma vessels (H) *103 (A.U.) | Mean intensity normal vessels (S) *103 (A.U.) | Intensity H/S |
| 711 | 717 | 924.4946 | 924.49373 | 0.94 | 0 | R.VVFQEFR.Q | 0.60387 | 117 | 30 | 4 | |
| * | 745 | 754 | 1193.6166 | 1193.61604 | 0.47 | 0 | K.ALELDSNLYR.I | 0.75200 | 100 | 44 | 2 |
| 1277 | 1294 | 1946.0059 | 1946.00763 | 0.89 | 0 | K.LQVELDNVTGLLSQSDSK.S | 0.33559 | 287 | 179 | 2 | |
| * | 1301 | 1321 | 2493.174 | 2493.17392 | 0.03 | 0 | K.DFSALESQLQDTQELLQEENR.Q | 0.04085 | 211 | 45 | 5 |
| 1417 | 1432 | 1949.9955 | 1949.99268 | 1.45 | 0 | R.LQQELDDLLVDLDHQR.Q | 0.01285 | 264 | 95 | 3 | |
| 1730 | 1750 | 2472.1699 | 2472.17356 | 1.48 | 0 | R.IAQLEEELEEEQGNTELINDR.L | 0.05363 | 154 | 11 | 14 | |
| * | 1754 | 1769 | 1869.9677 | 1869.96645 | 0.67 | 0 | K.ANLQIDQINTDLNLER.S | 0.00170 | 263 | 39 | 7 |
| 1923 | 1932 | 1155.6643 | 1155.66325 | 0.91 | 1 | R.GDLPFVVPRR.M | 0.00310 | 82 | 7 | 11 |
* WARP-LC MS-MS MALDI-TOF glioma tissue and vessels (see additional file 1)