| Literature DB >> 26510657 |
Gelio Alves1, Guanghui Wang2, Aleksey Y Ogurtsov1, Steven K Drake3, Marjan Gucek2, Anthony F Suffredini3, David B Sacks4, Yi-Kuo Yu5.
Abstract
Correct and rapid identification of microorganisms is the key to the success of many important applications in health and safety, including, but not limited to, infection treatment, food safety, and biodefense. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is challenging correct microbial identification because of the large number of choices present. To properly disentangle candidate microbes, one needs to go beyond apparent morphology or simple 'fingerprinting'; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptidome profiles of microbes to better separate them and by designing an analysis method that yields accurate statistical significance. Here, we present an analysis pipeline that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using MS/MS data of 81 samples, each composed of a single known microorganism, that the proposed pipeline can correctly identify microorganisms at least at the genus and species levels. We have also shown that the proposed pipeline computes accurate statistical significances, i.e., E-values for identified peptides and unified E-values for identified microorganisms. The proposed analysis pipeline has been implemented in MiCId, a freely available software for Microorganism Classification and Identification. MiCId is available for download at http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html . Graphical Abstract ᅟ.Entities:
Keywords: Mass spectrometry; Microorganism classification; Pathogen identification; Proteomics; Statistical significance
Mesh:
Substances:
Year: 2015 PMID: 26510657 PMCID: PMC4723618 DOI: 10.1007/s13361-015-1271-2
Source DB: PubMed Journal: J Am Soc Mass Spectrom ISSN: 1044-0305 Impact factor: 3.109
List of Commonly Used Acronyms
|
|
|
| CIP | confidently identified peptide |
| CI | cluster index |
|
|
|
|
| unified |
| E[X] | expected value of variable X |
|
| the |
| FP | false positives |
| IF | identification fraction |
| NIP | number of identified peptides |
| NUP | number of unique peptides |
| MWET | molecular weight error tolerance |
|
| number of MS/MS spectra from a given sample |
|
| number of qualified peptides in the database |
| MCS | missed cleavage sites |
| OD | optical density |
| P |
|
|
| unified |
| ppm | parts per million |
| PNNL | Pacific Northwest National Laboratory |
| R | rank |
| SN | sample number |
| SSE | statistically significant |
| WPC | weighted peptide count |
Monitor Culture Growth
|
| |||||
| Time (h) | Sample label | Number vials | ODa
| OD | |
| 0:00 | 0.03 | 0.03 | |||
| 2:00 | Low | 4 | 0.30 | 0.38 | |
| 3:30 | Medium | 2 | 0.75 | 0.65 | |
| 5:00 | High | 1 | 1.07 | ||
| 6:00 | High | 1 | 0.90 | ||
| 14:00 | Saturated | 1 | 1.34 | 1.50 | |
|
| |||||
| Time (h) | Sample label | Number vials | ODa
| OD | OD |
| 0:00 | 0.03 | 0.03 | 0.03 | ||
| 2:10 | Low | 4 | 0.34 | 0.43 | 0.42 |
| 3:00 | Medium | 2 | 0.66 | 0.68 | |
| 3:40 | Medium | 2 | 0.64 | ||
| 5:40 | High | 1 | 1.01 | ||
| 7:10 | High | 1 | 1.12 | 0.96 | |
| 14:00 | Saturated | 1 | 1.34 | 1.50 | 1.34 |
|
| |||||
| time (h) | Sample label | Number vials | ODa
| OD | |
| 3:00–4:00 | Medium | 4 | 0.6–0.7 | 0.6–0.7 | |
a Optical density (OD) 0.39 ≈ 8 × 108 cells. Roughly linear for OD between 0 and 0.40
Figure 1Status of bacterial database used in this investigation. The curve in panel (a) shows the number of proteins used for each strain included in the database. Panel (b) ((c)) displays the number of strains (species) that belongs to a given species (genus). With 0.01 Da as the bin size, panel (d) displays the histogram for the number of peptides present in the database as a function of the molecular weight. The vertical dash-line (at 6000 Da) in panel (d) indicates the upper bound of molecular weight of peptides included in the current database
Figure 2Accuracy assessment of the spectrum-specific E-value. The agreement between the expected error per query and the computed E-value is examined using the MS/MS spectra from sample numbers 1–4, panels (a)–(c), and from sample numbers 5–8, panels (d)–(e). The molecular weight (MW) range considered while searching the database is [MW - 3⋅ ⋅MW, MW + 3⋅ ⋅MW]. In each panel the dashed line, = , corresponds to the theoretical line and is used to provide a visual guide regarding how close/off the computed E-value curves are from the theoretical line
Figure 3Accuracy assessment of the unified E-value (E ). The accuracy of the computed E s is evaluated by plotting the E[E ] versus rank. The E[E ]s at a given rank were obtained by averaging over all the computed E s from sample numbers 1–81 having the same rank. The curve made of red circles displayed in panels (a), (b), (c) are the curves of E[E ]s for microbial identification performed at the genus, species, and strain level. In each panel, the = line, corresponding to the theoretical line, together with the two dashed lines, = 3 and = /3, provide a visual guide regarding the accuracy of the computed E[E ] curves
The Number of CIPs at the 1% False Discovery Rate for the In-House Dataset
| Sample from batch one | Samples from batch two | |||||||
|---|---|---|---|---|---|---|---|---|
|
|
| |||||||
| SN | 1 | 2 | 3 | 4 | 9 | 10 | 11 | 12 |
| OD | 0.30 | 0.75 | 1.07 | 1.34 | 0.34 | 0.66 | 1.01 | 1.34 |
| NMCS | Number of CIPs | Number of CIPs | ||||||
| 2 | 67 | 129 | 15 | 47 | 480 | 197 | 214 | 486 |
| 3 | 101 | 186 | 23 | 72 | 731 | 303 | 322 | 725 |
| 4 | 132 | 227 | 30 | 95 | 941 | 385 | 415 | 906 |
| 5 | 155 | 264 | 39 | 116 | 1077 | 456 | 484 | 1036 |
|
|
| |||||||
| SN | 5 | 6 | 7 | 8 | 13 | 14 | 15 | 16 |
| OD | 0.38 | 0.65 | 0.90 | 1.50 | 0.43 | 0.64 | 1.12 | 1.50 |
| NMCS | Number of CIPs | Number of CIPs | ||||||
| 2 | 115 | 147 | 70 | 147 | 187 | 175 | 166 | 177 |
| 3 | 196 | 383 | 129 | 244 | 320 | 294 | 281 | 296 |
| 4 | 266 | 514 | 180 | 317 | 420 | 377 | 372 | 389 |
| 5 | 311 | 610 | 228 | 376 | 491 | 448 | 436 | 456 |
|
| ||||||||
| SN | 17 | 18 | 19 | 20 | ||||
| DO | 0.42 | 0.68 | 0.96 | 1.34 | ||||
| NMCS | Number of CIPs | Number of CIPs | ||||||
| 2 | 186 | 186 | 156 | 179 | ||||
| 3 | 283 | 284 | 233 | 263 | ||||
| 4 | 367 | 365 | 291 | 334 | ||||
| 5 | 427 | 429 | 337 | 378 | ||||
Bacterial Identification at the Genus Level for the PNNL Dataseta
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 11/11 | 1.0 | –6498.6 ± 349.0 | 597.7 | 1594 | 15 | 1 |
|
| 2/11 | 2.0 | –14.4 ± 3.0 | 1.7 | 12 | 1 | 6 |
|
| 1/11 | 2.0 | –6.4 ± 0.0 | 1.0 | 3 | 1 | 38 |
|
| 1/11 | 3.0 | –4.4 ± 0.0 | 1.0 | 2 | 1 | 5 |
|
| 1/11 | 2.0 | –3.9 ± 0.0 | 1.0 | 2 | 0 | 24 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –6784.8 ± 729.6 | 725.6 | 937 | 433 | 1 |
|
| 1/9 | 2.0 | –6.3 ± 0.0 | 1.0 | 1 | 1 | 65 |
|
| 3/9 | 2.3 | –5.6 ± 0.3 | 1.0 | 1 | 1 | 15 |
|
| 1/9 | 2.0 | –4.5 ± 0.0 | 1.0 | 1 | 0 | 316 |
|
| 2/9 | 2.5 | –2.9 ± 0.1 | 0.5 | 3 | 0 | 3 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 8/8 | 1.0 | –5596.0 ± 670.5 | 546.8 | 1050 | 204 | 1 |
|
| 1/8 | 2.0 | –14.4 ± 0.0 | 2.0 | 6 | 1 | 10 |
|
| 1/8 | 2.0 | –6.7 ± 0.0 | 1.0 | 2 | 1 | 14 |
|
| 1/8 | 3.0 | –2.1 ± 0.0 | 1.0 | 5 | 0 | 2 |
|
| 3/8 | 5.3 | –0.8 ± 2.4 | 0.7 | 3 | 0 | 7 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | -9201.2 ± 897.8 | 847.0 | 1336 | 466 | 1 |
|
| 1/9 | 2.0 | –6.8 ± 0.0 | 1.2 | 6 | 0 | 2 |
|
| 1/9 | 2.0 | –5.1 ± 0.0 | 1.0 | 1 | 1 | 9 |
|
| 1/9 | 3.0 | –4.1 ± 0.0 | 1.0 | 3 | 0 | 16 |
|
| 1/9 | 3.0 | –4.1 ± 0.0 | 1.2 | 7 | 0 | 10 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –8043.2 ± 803.2 | 749.7 | 1187 | 400 | 1 |
|
| 1/9 | 2.0 | –21.3 ± 0.0 | 3.0 | 9 | 2 | 7 |
|
| 4/9 | 3.5 | –1.5 ± 4.3 | 0.9 | 5 | 0 | 11 |
|
| 4/9 | 2.8 | 0.1 ± 0.8 | 0.2 | 2 | 0 | 63 |
|
| 2/9 | 4.5 | 0.4 ± 0.4 | 0.5 | 4 | 0 | 5 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 7/7 | 1.0 | –14534.8 ± 8196.6 | 1369.1 | 1841 | 1022 | 1 |
|
| 1/7 | 2.0 | –15.5 ± 0.0 | 3.2 | 15 | 1 | 7 |
|
| 1/7 | 2.0 | –7.7 ± 0.0 | 1.0 | 1 | 1 | 49 |
|
| 1/7 | 3.0 | –5.6 ± 0.0 | 1.8 | 19 | 0 | 6 |
|
| 1/7 | 3.0 | –4.1 ± 0.0 | 0.7 | 6 | 0 | 12 |
The numerical entries in the table are the expected values E[X]. The E[ln (E )] is followed by its standard deviation ± σ
Bacterial Identification at the Species Level for the PNNL Dataset
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 11/11 | 1.0 | –6473.8 ± 341.9 | 595.4 | 1582 | 14 | 1 |
|
| 2/11 | 2.0 | –14.0 ± 3.3 | 1.7 | 12 | 1 | 2 |
|
| 1/11 | 2.0 | –6.2 ± 0.0 | 1.0 | 3 | 1 | 50 |
|
| 1/11 | 3.0 | –5.7 ± 0.0 | 1.0 | 2 | 1 | 27 |
|
| 1/11 | 3.0 | –3.7 ± 0.0 | 1.0 | 2 | 1 | 31 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –3697.0 ± 394.8 | 391.5 | 887 | 5 | 1 |
|
| 1/9 | 2.0 | –5.7 ± 0.0 | 1.0 | 1 | 1 | 93 |
|
| 1/9 | 2.0 | –3.7 ± 0.0 | 1.0 | 1 | 0 | 550 |
|
| 1/9 | 2.0 | –3.6 ± 0.0 | 1.5 | 2 | 0 | 779 |
|
| 1/9 | 3.0 | –2.7 ± 0.0 | 1.0 | 1 | 0 | 47 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 8/8 | 1.0 | –5099.6 ± 604.7 | 498.5 | 1043 | 133 | 1 |
|
| 1/8 | 2.0 | –14.3 ± 0.0 | 2.0 | 6 | 1 | 4 |
|
| 1/8 | 2.0 | –6.9 ± 0.0 | 1.0 | 2 | 1 | 24 |
|
| 1/8 | 2.0 | –5.7 ± 0.0 | 1.2 | 3 | 1 | 13 |
|
| 1/8 | 3.0 | –5.6 ± 0.0 | 1.0 | 1 | 0 | 220 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –5888.8 ± 566.1 | 549.4 | 1317 | 24 | 1 |
|
| 1/9 | 4.0 | –7.0 ± 0.0 | 1.5 | 7 | 0 | 4 |
|
| 1/9 | 2.0 | –4.5 ± 0.0 | 1.0 | 1 | 1 | 3 |
|
| 1/9 | 3.0 | –3.8 ± 0.0 | 1.0 | 3 | 0 | 11 |
|
| 1/9 | 3.0 | –3.7 ± 0.0 | 1.0 | 2 | 0 | 43 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –5198.7 ± 516.8 | 490.1 | 1173 | 23 | 1 |
|
| 1/9 | 2.0 | –15.9 ± 0.0 | 2.5 | 6 | 2 | 4 |
|
| 4/9 | 4.2 | –0.3 ± 4.5 | 0.8 | 5 | 0 | 4 |
|
| 4/9 | 2.2 | 0.6 ± 0.9 | 0.2 | 2 | 0 | 116 |
|
| 1/9 | 2.0 | 0.8 ± 0.0 | 0.5 | 3 | 0 | 21 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 7/7 | 1.0 | –10280.9 ± 5954.4 | 949.7 | 1660 | 402 | 1 |
|
| 1/7 | 2.0 | –7.6 ± 0.0 | 1.0 | 1 | 1 | 35 |
|
| 1/7 | 2.0 | –5.2 ± 0.0 | 1.7 | 11 | 0 | 7 |
|
| 1/7 | 5.0 | –3.8 ± 0.0 | 1.8 | 11 | 0 | 6 |
|
| 1/7 | 2.0 | –3.7 ± 0.0 | 1.0 | 3 | 1 | 20 |
The numerical entries in the table are the expected values E[X]. The E[ln (E )] is followed by its standard deviation ± σ
Bacterial Identification at the Strain Level for the PNNL Dataseta
| Strain | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|---|---|---|---|---|---|---|---|
|
| 11/11 | 1.1 | –6363.9 ± 333.9 | 583.0 | 1566 | 0 | 1 |
|
| 11/11 | 2.0 | –6353.7 ± 331.3 | 582.0 | 1564 | 0 | 1 |
|
| 11/11 | 2.9 | –6339.0 ± 332.1 | 580.9 | 1561 | 0 | 1 |
|
| 2/11 | 4.0 | –13.9 ± 3.4 | 1.7 | 12 | 1 | 2 |
|
| 1/11 | 4.0 | –6.1 ± 0.0 | 1.0 | 3 | 1 | 66 |
|
| |||||||
| Strain | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.3 | –3656.0 ± 386.2 | 387.3 | 884 | 0 | 1 |
|
| 9/9 | 1.7 | –3652.6 ± 386.4 | 386.9 | 883 | 0 | 1 |
|
| 9/9 | 3.9 | –3640.2 ± 384.6 | 385.9 | 881 | 0 | 1 |
|
| 9/9 | 4.2 | –3639.3 ± 383.3 | 385.8 | 880 | 0 | 1 |
|
| 9/9 | 4.6 | –3637.0 ± 381.9 | 385.5 | 880 | 0 | 1 |
|
| |||||||
| Strain | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 8/8 | 1.9 | –4093.9 ± 498.5 | 403.9 | 1027 | 0 | 1 |
|
| 8/8 | 2.5 | –4089.1 ± 493.6 | 403.3 | 1025 | 0 | 1 |
|
| 8/8 | 3.0 | –4087.1 ± 495.2 | 403.0 | 1025 | 0 | 1 |
|
| 8/8 | 4.1 | –4083.3 ± 497.5 | 402.5 | 1024 | 0 | 1 |
|
| 8/8 | 4.1 | –4080.9 ± 494.9 | 402.6 | 1023 | 0 | 1 |
|
| |||||||
| Strain | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –5699.2 ± 545.9 | 531.8 | 1317 | 0 | 1 |
|
| 6/9 | 2.2 | –5452.2 ± 346.2 | 512.0 | 1266 | 0 | 1 |
|
| 3/9 | 3.3 | –5272.1 ± 273.3 | 497.7 | 1231 | 0 | 1 |
|
| 1/9 | 2.0 | –5071.8 ± 0.0 | 486.0 | 1212 | 0 | 1 |
|
| 1/9 | 4.0 | –6.9 ± 0.0 | 1.5 | 7 | 0 | 4 |
|
| |||||||
| Strain | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 9/9 | 1.0 | –5048.1 ± 502.3 | 475.7 | 1172 | 3 | 1 |
|
| 7/9 | 2.0 | –4868.1 ± 499.6 | 463.8 | 1145 | 0 | 1 |
|
| 1/9 | 3.0 | –15.7 ± 0.0 | 2.5 | 6 | 2 | 4 |
|
| 4/9 | 7.8 | –0.1 ± 4.5 | 0.8 | 5 | 0 | 4 |
|
| 4/9 | 2.8 | 0.7 ± 1.0 | 0.2 | 2 | 0 | 184 |
|
| |||||||
| Strain | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 7/7 | 1.0 | –10280.4 ± 5954.4 | 949.5 | 1660 | 402 | 1 |
|
| 1/7 | 2.0 | –7.6 ± 0.0 | 1.0 | 1 | 1 | 54 |
|
| 1/7 | 3.0 | –6.4 ± 0.0 | 1.1 | 8 | 0 | 2 |
|
| 1/7 | 2.0 | –4.2 ± 0.0 | 1.7 | 11 | 0 | 4 |
|
| 1/7 | 2.0 | –3.7 ± 0.0 | 1.0 | 3 | 1 | 13 |
The numerical entries in the table are the expected values E[X]. The E[ln (E )] is followed by its standard deviation ± σ
Bacterial Identification at the Genus Level for the In-House Dataseta
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 14/14 | 1.4 | –365.2 ± 290.4 | 42.1 | 115 | 1 | 1 |
|
| 14/14 | 1.6 | –349.6 ± 271.8 | 40.1 | 114 | 0 | 1 |
|
| 2/14 | 5.0 | –36.8 ± 24.4 | 5.1 | 18 | 0 | 1 |
|
| 1/14 | 3.0 | –13.9 ± 0.0 | 1.8 | 8 | 0 | 1 |
|
| 1/14 | 4.0 | –13.2 ± 0.0 | 1.5 | 6 | 0 | 1 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 10/10 | 1.0 | –598.7 ± 647.8 | 71.3 | 102 | 34 | 1 |
|
| 1/10 | 2.0 | –23.0 ± 0.0 | 2.0 | 4 | 2 | 3 |
|
| 5/10 | 4.2 | –5.6 ± 8.3 | 1.2 | 2 | 0 | 3 |
|
| 1/10 | 2.0 | –5.6 ± 0.0 | 1.0 | 2 | 1 | 5 |
|
| 1/10 | 3.0 | –4.8 ± 0.0 | 1.0 | 1 | 0 | 7 |
|
| |||||||
| Genus | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 4/4 | 1.0 | –232.1 ± 21.1 | 27.1 | 61 | 7 | 1 |
|
| 2/4 | 2.5 | –7.2 ± 0.5 | 1.0 | 1 | 1 | 10 |
|
| 1/4 | 2.0 | –5.5 ± 0.0 | 1.0 | 1 | 0 | 11 |
|
| 4/4 | 2.8 | –5.3 ± 2.5 | 1.1 | 2 | 1 | 6 |
|
| 3/4 | 4.0 | –1.7 ± 2.3 | 0.8 | 2 | 0 | 4 |
a The numerical entries in the table are the expected values E[X]. The E[ln (E )] it is followed by its standard deviation ± σ
Bacterial Identification at the Species Level for the In-House Dataset
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 14/14 | 1.4 | –364.2 ± 288.0 | 41.9 | 115 | 1 | 1 |
|
| 10/14 | 2.1 | –188.3 ± 80.8 | 22.8 | 65 | 0 | 1 |
|
| 10/14 | 4.1 | –178.0 ± 76.3 | 22.1 | 63 | 0 | 1 |
|
| 9/14 | 3.4 | –168.0 ± 73.6 | 20.4 | 57 | 0 | 1 |
|
| 8/14 | 4.2 | –154.9 ± 69.2 | 19.1 | 54 | 0 | 1 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 10/10 | 1.0 | –526.3 ± 557.5 | 59.9 | 94 | 23 | 1 |
|
| 1/10 | 2.0 | –26.0 ± 0.0 | 2.0 | 4 | 2 | 4 |
|
| 1/10 | 2.0 | –16.1 ± 0.0 | 1.8 | 7 | 0 | 2 |
|
| 1/10 | 3.0 | –7.5 ± 0.0 | 1.2 | 5 | 0 | 2 |
|
| 1/10 | 5.0 | –7.1 ± 0.0 | 1.2 | 6 | 0 | 2 |
|
| |||||||
| Species | IF | E[R] | E[ln ( | E[WPC] | E[NIP] | E[NUP] | E[CI] |
|
| 4/4 | 1.0 | –206.7 ± 15.2 | 24.7 | 61 | 4 | 1 |
|
| 1/4 | 2.0 | –157.3 ± 0.0 | 19.6 | 56 | 0 | 1 |
|
| 2/4 | 3.0 | –7.6 ± 0.8 | 1.0 | 1 | 1 | 11 |
|
| 4/4 | 2.5 | –6.7 ± 2.9 | 1.1 | 2 | 1 | 8 |
|
| 1/4 | 4.0 | –5.5 ± 0.0 | 1.0 | 1 | 0 | 10 |
a The numerical entries in the table are the expected values E[X]. The E[ln (E )] is followed by its standard deviation ± σ
Figure 4Molecular weight and charge distribution. The curves in panels (a)–(f) display the molecular weight and charge distributions obtained for the in-house dataset under protocols one and two. Also displayed in panels (a) and (b) are the molecular weight and charge distributions for SN 31 from the PNNL dataset. Similar molecular weight and charge distributions like the one observed for SN 31 are obtained for the other samples from the PNNL dataset