| Literature DB >> 30598095 |
Jongsu Jun1, Jungsoo Gim2, Yongkang Kim1, Hyunsoo Kim3,4, Su Jong Yu5, Injun Yeo3, Jiyoung Park3, Jeong-Ju Yoo5, Young Youn Cho5, Dong Hyeon Lee5, Eun Ju Cho5, Jeong-Hoon Lee5, Yoon Jun Kim5, Seungyeoun Lee6, Jung-Hwan Yoon5, Youngsoo Kim3,4, Taesung Park7,8.
Abstract
BACKGROUND: Discovering reliable protein biomarkers is one of the most important issues in biomedical research. The ELISA is a traditional technique for accurate quantitation of well-known proteins. Recently, the multiple reaction-monitoring (MRM) mass spectrometry has been proposed for quantifying newly discovered protein and has become a popular alternative to ELISA. For the MRM data analysis, linear mixed modeling (LMM) has been used to analyze MRM data. MSstats is one of the most widely used tools for MRM data analysis that is based on the LMMs. However, LMMs often provide various significance results, depending on model specification. Sometimes it would be difficult to specify a correct LMM method for the analysis of MRM data. Here, we propose a new logistic regression-based method for Significance Analysis of Multiple Reaction Monitoring (LR-SAM).Entities:
Keywords: Hepatocellular carcinoma (HCC); Logistic regression-based method for significance analysis of multiple reaction monitoring (LR-SAM); Multiple reaction-monitoring (MRM); Protein; Sorafenib response
Mesh:
Substances:
Year: 2018 PMID: 30598095 PMCID: PMC6311902 DOI: 10.1186/s12918-018-0656-9
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
MRM dataset structure
| Sample ID | Group | Run | Protein | Peptide | Area Ratio |
|---|---|---|---|---|---|
| 1 | 1 | 1 | X1 | 1 |
|
| 1 | 1 | 1 | X1 | 2 |
|
| 2 | 2 | 1 | X1 | 1 |
|
| 2 | 2 | 1 | X1 | 2 |
|
The number of proteins of each number of peptides
| The number of peptides | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
| The number of proteins | 48 | 57 | 12 | 5 | 0 | 1 | 1 |
Estimated type I error of LMMs and LR-SAM methods
| True subject type | True run type | Sample size | Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LMM | LR-SAM | |||||||||||
| Best | (FF) | (FR) | (RF) | (RR) |
|
|
|
|
| |||
| Fixed | Fixed | 20 | 0.053 | 0.054 | 0.037 | 0.006 | 0.002 | 0.093 | 0 | 0.002 | 0.002 | 0.019 |
| 50 | 0.038 | 0.038 | 0.033 | 0.006 | 0.001 | 0.044 | 0.002 | 0 | 0.004 | 0.012 | ||
| 100 | 0.048 | 0.048 | 0.03 | 0.003 | 0 | 0.035 | 0.014 | 0.001 | 0.001 | 0.012 | ||
| Random | 20 | 0.052 | 0.052 | 0.055 | 0.004 | 0.002 | 0.082 | 0 | 0.002 | 0.001 | 0.014 | |
| 50 | 0.054 | 0.054 | 0.049 | 0.01 | 0.005 | 0.055 | 0.007 | 0.008 | 0.006 | 0.017 | ||
| 100 | 0.032 | 0.032 | 0.033 | 0.003 | 0.001 | 0.043 | 0.025 | 0.003 | 0.003 | 0.024 | ||
| Random | Fixed | 20 | 0.152 | 0.153 | 0.211 | 0.029 | 0.031 | 0.107 | 0 | 0.008 | 0.015 | 0.039 |
| 50 | 0.178 | 0.179 | 0.26 | 0.06 | 0.055 | 0.065 | 0.014 | 0.042 | 0.053 | 0.05 | ||
| 100 | 0.148 | 0.148 | 0.204 | 0.041 | 0.029 | 0.06 | 0.033 | 0.033 | 0.039 | 0.04 | ||
| Random | 20 | 0.157 | 0.159 | 0.184 | 0.046 | 0.053 | 0.114 | 0 | 0.008 | 0.027 | 0.053 | |
| 50 | 0.168 | 0.168 | 0.195 | 0.055 | 0.06 | 0.068 | 0.019 | 0.039 | 0.047 | 0.049 | ||
| 100 | 0.16 | 0.16 | 0.17 | 0.045 | 0.04 | 0.06 | 0.031 | 0.034 | 0.039 | 0.054 | ||
Estimated power of LMMs and LR-SAM methods for GS 0. Bolded number indicates power of methods were more than 0.8
| Interaction scenario | True subject type | True run type | Sample size | Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LMM | LR-SAM | ||||||||||||
| Best | (FF) | (FR) | (RF) | (RR) |
|
|
|
|
| ||||
| IS1 | Fixed | Fixed | 20 | 0.049 | 0.05 | 0.03 | 0.002 | 0.001 | 0.344 | 0 | 0 | 0.001 | 0.135 |
| 50 | 0.048 | 0.048 | 0.029 | 0.002 | 0 | 0.692 | 0.385 | 0.008 | 0.002 | 0.506 | |||
| 100 | 0.053 | 0.053 | 0.041 | 0.007 | 0.001 |
|
| 0.007 | 0.007 |
| |||
| Random | 20 | 0.051 | 0.052 | 0.066 | 0.004 | 0.002 | 0.339 | 0.001 | 0.002 | 0.002 | 0.14 | ||
| 50 | 0.047 | 0.047 | 0.042 | 0.002 | 0.001 | 0.691 | 0.397 | 0.007 | 0.002 | 0.538 | |||
| 100 | 0.045 | 0.045 | 0.043 | 0.003 | 0.003 |
|
| 0.008 | 0.003 |
| |||
| Random | Fixed | 20 | 0.147 | 0.149 | 0.198 | 0.051 | 0.046 | 0.402 | 0.001 | 0.009 | 0.027 | 0.213 | |
| 50 | 0.177 | 0.177 | 0.234 | 0.048 | 0.042 | 0.745 | 0.415 | 0.025 | 0.04 | 0.593 | |||
| 100 | 0.157 | 0.157 | 0.201 | 0.054 | 0.049 |
|
| 0.037 | 0.051 |
| |||
| Random | 20 | 0.158 | 0.159 | 0.19 | 0.052 | 0.06 | 0.415 | 0 | 0.003 | 0.026 | 0.216 | ||
| 50 | 0.148 | 0.15 | 0.168 | 0.04 | 0.046 | 0.726 | 0.455 | 0.027 | 0.034 | 0.602 | |||
| 100 | 0.164 | 0.165 | 0.171 | 0.04 | 0.043 |
|
| 0.026 | 0.037 |
| |||
| IS2 | Fixed | Fixed | 20 | 0.054 | 0.055 | 0.04 | 0.005 | 0 | 0.611 | 0.001 | 0.001 | 0.001 | 0.353 |
| 50 | 0.051 | 0.051 | 0.034 | 0.009 | 0.002 |
| 0.787 | 0.018 | 0.004 |
| |||
| 100 | 0.045 | 0.045 | 0.036 | 0.007 | 0.001 |
|
| 0.016 | 0.006 |
| |||
| Random | 20 | 0.044 | 0.046 | 0.058 | 0.004 | 0.006 | 0.606 | 0 | 0.002 | 0.001 | 0.354 | ||
| 50 | 0.045 | 0.046 | 0.046 | 0.001 | 0.001 |
| 0.8 | 0.014 | 0.001 |
| |||
| 100 | 0.056 | 0.056 | 0.062 | 0.01 | 0.009 |
|
| 0.012 | 0.009 |
| |||
| Random | Fixed | 20 | 0.145 | 0.147 | 0.2 | 0.055 | 0.046 | 0.592 | 0 | 0.005 | 0.033 | 0.44 | |
| 50 | 0.143 | 0.145 | 0.2 | 0.034 | 0.028 |
|
| 0.029 | 0.026 |
| |||
| 100 | 0.166 | 0.166 | 0.223 | 0.049 | 0.04 |
|
| 0.043 | 0.045 |
| |||
| Random | 20 | 0.162 | 0.164 | 0.188 | 0.048 | 0.057 | 0.639 | 0 | 0.001 | 0.021 | 0.443 | ||
| 50 | 0.144 | 0.145 | 0.177 | 0.04 | 0.044 |
|
| 0.02 | 0.031 |
| |||
| 100 | 0.17 | 0.17 | 0.206 | 0.053 | 0.06 |
|
| 0.032 | 0.048 |
| |||
Estimated power of LMMs and LR-SAM methods for GS 1. Bolded number indicates power of methods were more than 0.8
| Interaction scenario | True subject type | True run type | Sample size | Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LMM | LR-SAM | ||||||||||||
| Best | (FF) | (FR) | (RF) | (RR) |
|
|
|
|
| ||||
| IS1 | Fixed | Fixed | 20 | 0.296 | 0.301 | 0.459 | 0.077 | 0.044 | 0.417 | 0 | 0.01 | 0.042 | 0.258 |
| 50 | 0.627 | 0.63 |
| 0.333 | 0.216 | 0.798 | 0.548 | 0.14 | 0.291 | 0.756 | |||
| 100 |
|
|
| 0.709 | 0.632 |
|
| 0.424 | 0.694 |
| |||
| Random | 20 | 0.302 | 0.303 | 0.39 | 0.085 | 0.093 | 0.42 | 0 | 0.007 | 0.034 | 0.253 | ||
| 50 | 0.642 | 0.646 | 0.715 | 0.321 | 0.336 |
| 0.534 | 0.154 | 0.288 | 0.775 | |||
| 100 |
|
|
| 0.681 | 0.753 |
|
| 0.405 | 0.668 |
| |||
| Random | Fixed | 20 | 0.351 | 0.35 | 0.472 | 0.152 | 0.159 | 0.472 | 0.001 | 0.016 | 0.095 | 0.304 | |
| 50 | 0.626 | 0.628 | 0.782 | 0.384 | 0.403 |
| 0.574 | 0.218 | 0.359 | 0.782 | |||
| 100 |
|
|
| 0.659 | 0.705 |
|
| 0.457 | 0.652 |
| |||
| Random | 20 | 0.379 | 0.381 | 0.43 | 0.176 | 0.186 | 0.488 | 0 | 0.024 | 0.106 | 0.348 | ||
| 50 | 0.613 | 0.615 | 0.674 | 0.366 | 0.389 |
| 0.552 | 0.19 | 0.342 | 0.775 | |||
| 100 |
|
|
| 0.655 | 0.679 |
|
| 0.447 | 0.639 |
| |||
| IS2 | Fixed | Fixed | 20 | 0.328 | 0.331 | 0.482 | 0.083 | 0.039 | 0.662 | 0 | 0.005 | 0.042 | 0.503 |
| 50 | 0.619 | 0.62 |
| 0.298 | 0.199 |
|
| 0.095 | 0.268 |
| |||
| 100 |
|
|
| 0.696 | 0.621 |
|
| 0.279 | 0.676 |
| |||
| Random | 20 | 0.288 | 0.29 | 0.382 | 0.067 | 0.074 | 0.649 | 0 | 0.006 | 0.023 | 0.466 | ||
| 50 | 0.648 | 0.652 | 0.735 | 0.339 | 0.363 |
|
| 0.112 | 0.305 |
| |||
| 100 |
|
|
| 0.703 | 0.748 |
|
| 0.288 | 0.685 |
| |||
| Random | Fixed | 20 | 0.36 | 0.363 | 0.493 | 0.148 | 0.152 | 0.683 | 0.001 | 0.009 | 0.093 | 0.524 | |
| 50 | 0.599 | 0.602 | 0.76 | 0.363 | 0.377 |
|
| 0.137 | 0.334 |
| |||
| 100 |
|
|
| 0.674 | 0.715 |
|
| 0.321 | 0.66 |
| |||
| Random | 20 | 0.384 | 0.386 | 0.462 | 0.175 | 0.191 | 0.682 | 0 | 0.007 | 0.113 | 0.523 | ||
| 50 | 0.583 | 0.586 | 0.637 | 0.357 | 0.384 |
|
| 0.123 | 0.323 |
| |||
| 100 |
|
|
| 0.655 | 0.698 |
|
| 0.33 | 0.644 |
| |||
Fig. 1Pairwise scatter plot of –log10-transformed p-values from the LMM(RF), LMM(RR), W, W, WS and SVC models based on multiple reaction monitoring (MRM) data. Vertical and horizontal dashed red lines represent Bonferroni-corrected significance levels, −log10(0.05/124). Diagonally dashed gray line represents one to one slope
List of proteins and their p-values simultaneously identified by LMM or LR-SAM methods
| Protein | Models | # of peptide | Simultaneously Identified model | |||||
|---|---|---|---|---|---|---|---|---|
| LMM(RF) | LMM(RR) |
|
|
|
| |||
| IGJ | 2.E-18 | 3.E-21 | 1.E-05 | 1.E-04 | 2.E-07 | 3.E-14 | 3 | All |
| IGHG3 | 2.E-10 | 3.E-18 | 2.E-07 | 2.E-07 | 2.E-07 | 9.E-11 | 1 | All |
| IGHG1 | 3.E-09 | 8.E-17 | 3.E-07 | 3.E-07 | 3.E-07 | 3.E-10 | 1 | All |
| GPX3 | 5.E-06 | 5.E-09 | 9.E-05 | 9.E-05 | 9.E-05 | 1.E-05 | 1 | All |
| FBLN1 | 1.E-01 | 7.E-11 | 3.E-05 | 1.E-05 | 5.E-06 | 2.E-07 | 2 | LR-SAM |
| C163A | 3.E-03 | 2.E-08 | 7.E-07 | 7.E-07 | 7.E-07 | 4.E-10 | 1 | LR-SAM |
| ISLR | 2.E-01 | 6.E-04 | 4.E-07 | 4.E-07 | 4.E-07 | 1.E-09 | 1 | LR-SAM |
| FCG3A | 8.E-02 | 8.E-04 | 6.E-07 | 6.E-07 | 6.E-07 | 3.E-09 | 1 | LR-SAM |
| QSOX1 | 2.E-01 | 2.E-03 | 8.E-07 | 8.E-07 | 8.E-07 | 3.E-09 | 1 | LR-SAM |
| FBLN3 | 2.E-02 | 2.E-06 | 4.E-07 | 4.E-07 | 4.E-07 | 3.E-09 | 1 | LR-SAM |
| SHBG | 4.E-03 | 1.E-04 | 1.E-05 | 5.E-05 | 2.E-06 | 9.E-08 | 2 | LR-SAM |
| LG3BP | 1.E-02 | 4.E-06 | 3.E-05 | 5.E-06 | 5.E-06 | 2.E-07 | 2 | LR-SAM |
| CATB | 2.E-01 | 2.E-03 | 7.E-06 | 7.E-06 | 7.E-06 | 3.E-07 | 1 | LR-SAM |
| HPT | 6.E-06 | 3.E-03 | 1.E-04 | 2.E-05 | 3.E-05 | 7.E-06 | 2 | LR-SAM |
| POSTN | 9.E-02 | 3.E-04 | 1.E-04 | 1.E-04 | 1.E-04 | 1.E-05 | 1 | LR-SAM |
| SODE | 4.E-02 | 7.E-05 | 2.E-04 | 2.E-04 | 2.E-04 | 2.E-05 | 1 | LR-SAM |
| 1433S | 1.E-01 | 8.E-04 | 1.E-04 | 1.E-04 | 1.E-04 | 2.E-05 | 1 | LR-SAM |
| FSTL1 | 2.E-01 | 2.E-03 | 5.E-05 | 5.E-05 | 5.E-05 | 7.E-05 | 1 | LR-SAM |
| CD5L | 3.E-15 | 1.E-30 | 8.E-05 | 4.E-01 | 2.E-06 | 9.E-13 | 3 | LMM |
| APOA4 | 3.E-04 | 1.E-07 | 1.E-03 | 1.E-01 | 7.E-04 | 2.E-04 | 2 | LMM |
Fig. 2Venn diagram of proteins identified from LMM (RF, RR) and LR-SAM (W, W1, WS and SVC), under a Bonferroni correction significance level of −log10(0.05/124)