| Literature DB >> 31784980 |
Malgorzata A Komor1,2, Linda Jw Bosch1, Veerle Mh Coupé3, Christian Rausch1, Thang V Pham2, Sander R Piersma2, Sandra Mongera4, Chris Jj Mulder5, Evelien Dekker6, Ernst J Kuipers7, Mark A van de Wiel3, Beatriz Carvalho1, Remond Ja Fijneman1, Connie R Jimenez2, Gerrit A Meijer1, Meike de Wit1.
Abstract
Screening to detect colorectal cancer (CRC) in an early or premalignant state is an effective method to reduce CRC mortality rates. Current stool-based screening tests, e.g. fecal immunochemical test (FIT), have a suboptimal sensitivity for colorectal adenomas and difficulty distinguishing adenomas at high risk of progressing to cancer from those at lower risk. We aimed to identify stool protein biomarker panels that can be used for the early detection of high-risk adenomas and CRC. Proteomics data (LC-MS/MS) were collected on stool samples from adenoma (n = 71) and CRC patients (n = 81) as well as controls (n = 129). Colorectal adenoma tissue samples were characterized by low-coverage whole-genome sequencing to determine their risk of progression based on specific DNA copy number changes. Proteomics data were used for logistic regression modeling to establish protein biomarker panels. In total, 15 of the adenomas (15.8%) were defined as high risk of progressing to cancer. A protein panel, consisting of haptoglobin (Hp), LAMP1, SYNE2, and ANXA6, was identified for the detection of high-risk adenomas (sensitivity of 53% at specificity of 95%). Two panels, one consisting of Hp and LRG1 and one of Hp, LRG1, RBP4, and FN1, were identified for high-risk adenomas and CRCs detection (sensitivity of 66% and 62%, respectively, at specificity of 95%). Validation of Hp as a biomarker for high-risk adenomas and CRCs was performed using an antibody-based assay in FIT samples from a subset of individuals from the discovery series (n = 158) and an independent validation series (n = 795). Hp protein was significantly more abundant in high-risk adenoma FIT samples compared to controls in the discovery (p = 0.036) and the validation series (p = 9e-5). We conclude that Hp, LAMP1, SYNE2, LRG1, RBP4, FN1, and ANXA6 may be of value as stool biomarkers for early detection of high-risk adenomas and CRCs.Entities:
Keywords: biomarkers; colorectal cancer; early detection; high-risk adenomas
Mesh:
Substances:
Year: 2020 PMID: 31784980 PMCID: PMC7065084 DOI: 10.1002/path.5369
Source DB: PubMed Journal: J Pathol ISSN: 0022-3417 Impact factor: 7.996
Figure 1Overview of the design of this study. The discovery series consisted of control, colorectal adenoma, and colorectal cancer (CRC) samples. FFPE tissue blocks were obtained from 71 adenoma patients and low‐coverage whole‐genome sequencing was performed to identify DNA copy number aberrations. Fifteen high‐risk adenomas were identified according to their DNA copy number profiles. Whole stool samples of individuals from the discovery series were used for mass spectrometry proteomics analysis. Proteins identified were used for biomarker panel identification for high‐risk adenomas and high‐risk adenomas together with CRCs. An immunoassay was applied on 158 FIT samples from the discovery series and 795 FIT samples from the validation series for biomarker validation, to evaluate quantitative difference of Hp between controls, low‐risk adenomas, high‐risk adenomas, and CRCs.
Figure 2Proteomics profiling of human stool samples. (A) Multidimensional scaling of protein expression profiles of stool samples derived from controls (n = 129), individuals with low‐risk adenomas (n = 56), high‐risk adenomas (n = 15), and cancers (n = 79). (B) Hierarchical clustering of protein profiles of stool samples derived from high‐risk adenomas and controls based on 31 proteins expressed more highly in high‐risk adenomas compared with controls. (C) Hierarchical clustering of protein profiles of stool samples derived from CRCs, high‐risk adenomas, and controls based on 61 proteins expressed more highly in CRCs and high‐risk adenomas compared with controls.
Figure 3Biomarker panels from logistic regression analysis to identify high‐risk adenomas and CRCs. (A) ROC curve of the regression model using the four‐biomarker panel (Hp, LAMP1, SYNE2, and ANXA6) to distinguish between stool samples from individuals with high‐risk adenomas (n = 15) and controls (n = 129). ROC curve was obtained from logistic regression predictions from the leave‐one‐out cross‐validation analysis. Partial area under the curve (pAUC) was calculated for specificity of 95–100% and compared with pAUC of hemoglobin to obtain the P value. (B) Frequency plot of biomarkers occurring in the regression models built during the cross‐validation analysis to distinguish between the high‐risk adenomas and controls. Four proteins were clearly selected more frequently by the Lasso regularization in the cross‐validation analysis.
Confusion matrix for the cross‐validated performance of the models of biomarker panels. Performance of the biomarker panel regression models was evaluated at 95% specificity and compared with hemoglobin. (A) High‐risk adenomas versus controls and (B) high‐risk adenomas and CRCs versus controls
| A | |||
|---|---|---|---|
| Protein(s) | Control | High‐risk adenoma | Sensitivity at 95% specificity [95% confidence intervals] |
| Hp, LAMP1, SYNE2, ANXA6 | |||
| Predicted control | 123 | 7 | 53% [27–79%] |
| Predicted high‐risk adenoma | 6 | 8 | |
| HBA1 | |||
| Predicted control | 123 | 13 | 13% [2–40%] |
| Predicted high‐risk adenoma | 6 | 2 | |
Figure 4Biomarker panels from logistic regression analysis to identify high‐risk adenomas and CRCs. (A) ROC curve of the model based on the panel of four biomarkers (Hp, LRG1, RBP4, and FN1) for high‐risk adenomas and CRCs (n = 94) compared with controls (n = 129). ROC curve was obtained from logistic regression predictions from the leave‐one‐out cross‐validation analysis. (B) Frequency plot of biomarkers occurring in the regression models built during the cross‐validation analysis to discriminate high‐risk adenomas and CRCs from controls based on four proteins. Four proteins were clearly selected more frequently by the Lasso regularization in the cross‐validation analysis. (C) ROC curve of the model based on the panel of two biomarkers (Hp and LRG1) for high‐risk adenomas and CRCs (n = 94) compared with controls (n = 129). ROC curve was obtained from logistic regression predictions from the leave‐one‐out cross‐validation analysis. (D) Frequency plot of biomarkers occurring in the regression models built during the cross‐validation analysis to discriminate high‐risk adenomas and CRCs from controls based on two proteins. The same two proteins were consistently selected in the cross‐validation analysis.
Figure 5Validation of Hp protein expression with the use of an immunoassay. (A) The discovery series. (B) The validation series.