| Literature DB >> 30222738 |
Po-Hsiung Lin1, Hui-Ju Yang2, Wei-Chung Hsieh3, Che Lin4, Ya-Chi Chan5, Yu-Fen Wang5, Yuan-Ting Yang6, Kuo-Juei Lin7, Li-Sheng Lin8, Dar-Ren Chen4,5,9.
Abstract
Cumulative estrogen concentration is an important determinant of the risk of developing breast cancer. Estrogen carcinogenesis is attributed to the combination of receptor-driven mitogenesis and DNA damage induced by quinonoid metabolites of estrogen. The present study was focused on developing an improved breast cancer prediction model using estrogen quinone-protein adduct concentrations. Blood samples from 152 breast cancer patients and 71 healthy women were collected, and albumin (Alb) and hemoglobin (Hb) adducts of estrogen-3,4-quinone and estrogen-2,3-quinone were extracted and evaluated as potential biomarkers of breast cancer. A multilayer perceptron (MLP) was used as the predictor model and the resultant prediction of breast cancer was more accurate than other existing detection methods. A MLP using the logarithm of the concentrations of the estrogen quinone-derived adducts (four input nodes, 10 hidden nodes, and one output node) was used to predict breast cancer risk with accuracy close to 100% and area under curve (AUC) close to one. The AUC value of one showed that both data sets were separable. We conclude that Alb and Hb adducts of estrogen quinones are promising biomarkers for the early detection of breast cancer.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30222738 PMCID: PMC6141067 DOI: 10.1371/journal.pone.0201241
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Scatter plots of the natural logarithm values of E2-3,4-Q and E2-2,3-Q adduct concentrations.
(A) Hemoglobin adducts. (B) Albumin adducts. The asterisks (*) represent the cancer patients while the triangles (Δ) represent the healthy controls.
Fig 2Typical training results.
AUC analysis for five combinations of adducts.
All 223 samples (152 BCP samples and 71 HC samples) were used to obtain the MLP for prediction.
| Case | Hemoglobin Adducts | Albumin Adducts | AUC | ||
|---|---|---|---|---|---|
| 1 | E2-3,4-Q | E2-2,3-Q | E2-3,4-Q | E2-2,3-Q | 1.000 |
| 2 | – | E2-2,3-Q | E2-3,4-Q | E2-2,3-Q | 1.000 |
| 3 | E2-3,4-Q | – | E2-3,4-Q | E2-2,3-Q | 0.999 |
| 4 | E2-3,4-Q | E2-2,3-Q | – | E2-2,3-Q | 0.999 |
| 5 | E2-3,4-Q | E2-2,3-Q | E2-3,4-Q | – | 1.000 |
Validation results.
| Case | Train Error (SD) | Test Error (SD) | Train AUC (SD) | Test AUC (SD) |
|---|---|---|---|---|
| 1 | 0 (0) | 0.0011 (0.0049) | 1 (0) | 1 (0) |
| 2 | 0 (0) | 0.0011 (0.0049) | 1 (0) | 1 (0) |
| 3 | 0 (0) | 0.0033 (0.0146) | 1 (0) | 0.9998 (0.0010) |
| 4 | 0.0059 (0.0130) | 0.0120 (0.0131) | 0.9992 (0.0010) | 0.9995 (0.0011) |
| 5 | 0.0028 (0.0045) | 0.0120 (0.0131) | 0.9999 (0.0001) | 0.9997 (0.0011) |
SD, Standard error.
Comparisons with previously reported results.
| References | Biomarkers (No.) | Model | Test Error | AUC |
|---|---|---|---|---|
| This study | Adducts of estrogen quinone (4) | MLP | 0.0011 | 1 |
| [ | SNP (1) | NB | 0.33 | – |
| [ | SNP (2) | DT | 0.32 | – |
| [ | SNP (3) | SVM | 0.31 | – |
| [ | Serum proteins, Age, Race (100) | DF | – | 0.84 |
| [ | Serum proteins (3) | BMA | 0.15 | 0.82 |
| [ | Gene expression | PLSR | 0.205 | 0.88 |
| [ | SNP from GWAS | KNN | 0.3975 | – |
| [ | Gene expression (42) | SVM | – | 0.7879 |
| [ | SNP (200) | SVM | 0.0395 | 0.94 |
| [ | Mammogram | RF | 0.0838 | 0.938 |
| NC | 0.0859 | 0.962 | ||
| KNN | 0.0644 | 0.967 |
BMA, Bayesian modeling averaging; DF, Data fusion; DT, Decision tress
GWAS, Genome-wide association study; KNN, K-nearest neighbors
MLP, Multilayer perceptron; NB, Niäve Bayes; NC, Nearest centroid
PLSR, Partial least square regression; RF, Random forest; SNP, Single nucleotide polymorphism
SVM, Support vector machine.