| Literature DB >> 34527671 |
Maowei Ni1,2,3, Jie Zhou4,5, Zhihui Zhu1, Jingtao Yuan1, Wangang Gong2,3, Jianqing Zhu2,3, Zhiguo Zheng2,3, Huajun Zhao1.
Abstract
BACKGROUND: Preoperative differentiation of benign and malignant tumor types is critical for providing individualized treatment interventions to improve prognosis of patients with ovarian cancer. High-throughput proteomics analysis of urine samples was performed to identify reliable and non-invasive biomarkers that could effectively discriminate between the two ovarian tumor types.Entities:
Keywords: machine learning; mass spectrometry; non-invasive biomarkers; ovarian cancer; urinary proteomics
Year: 2021 PMID: 34527671 PMCID: PMC8437375 DOI: 10.3389/fcell.2021.712196
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
Clinical information of patients in this study.
| C1 Cohort | C2 Cohort | ||
| Training dataset | Test dataset | Validation dataset | |
|
| |||
| Total | 70 | 20 | 42 |
| Benign | 30 | 10 | 19 |
| Malignant | 40 | 10 | 23 |
|
| |||
| Mean ± SD | 54.0 ± 14.9 | 53.8 ± 13.4 | 53.8 ± 11.7 |
| Median | 55.5 | 54 | 54.5 |
| Range | 18–91 | 21–84 | 24–77 |
|
| |||
| Mean ± SD | 22.5 ± 3.5 | 21.0 ± 2.1 | 22.2 ± 3.2 |
| Median | 22.4 | 21.3 | 22.4 |
| Range | 15.0–31.8 | 17.2–24.2 | 15.0–28.2 |
|
| |||
| Menopause | 62.9% (44/70) | 60.0% (12/20) | 59.5% (25/42) |
| Non-menopause | 37.1% (26/70) | 40.0% (8/20) | 40.5% (17/42) |
FIGURE 1Workflow of MS analysis from urinary proteomics based on machine learning for distinguishing between benign and malignant ovarian tumors.
FIGURE 2Data quality evaluation. (A) Pearson correlation of 8 HeLa cell lysate samples in all batches for evaluation of the reproducibility of mass spectrometry. (B) Pearson correlation of 8 repeat-tested urine samples in all batches for evaluation of the reproducibility of methodology. (C) Violin plot of protein identification numbers in benign and malignant groups. (D) Violin plot of precursor identification numbers in benign and malignant groups. (E) Box plot of protein abundance in each sample. *B, benign group; M, malignant group.
FIGURE 3Analysis of differentially expressed proteins in C1 dataset. (A) Differentially expressed proteins in benign and malignant groups with q-value < 0.01 and absolute log2 fold change > 1. (B) Average protein identification numbers in each sample, benign and malignant groups of C1 dataset. (C) Volcano plot of down-regulation and up-regulation in malignant group. (D) Pathway analysis of the differentially expressed proteins using Metascape web-based platform. *B, benign group; M, malignant group.
FIGURE 4Separation of benign and malignant patients by machine learning of proteomic features. (A) Top 5 proteins prioritized by random forest analysis ranked by the mean decrease in accuracy > 5. (B) Receiver operating characteristic (ROC) analysis of the classifier and each feature in the training dataset. (C) Expression levels of the five proteins; p-value was calculated in t-test medtod. *B, benign group; M, malignant group.
FIGURE 5Performance of the classifier in diagnosing malignant from benign in different datasets. (A,C,E) ROC analysis of the classifier in training, test and validation datasets. (B,D,F) t-SNE analysis of the classifier in training, test and validation datasets. *B, benign group; M, malignant group.
FIGURE 6Performance of the classifier, serum CA125 and HE4 in all patients. (A,B) The expression of serum CA125 and HE4 in all patients. (C) ROC analysis of the classifier, serum CA125 and HE4 in all patients. (D) Performance of the classifier in early stage ovarian cancer diagnoses. **p-value < 0.01.