| Literature DB >> 31780739 |
Eun Kyung Park1, Kwang-Sig Lee2, Bo Kyoung Seo3, Kyu Ran Cho4, Ok Hee Woo5, Gil Soo Son6, Hye Yoon Lee6, Young Woo Chang6.
Abstract
Radiogenomics investigates the relationship between imaging phenotypes and genetic expression. Breast cancer is a heterogeneous disease that manifests complex genetic changes and various prognosis and treatment response. We investigate the value of machine learning approaches to radiogenomics using low-dose perfusion computed tomography (CT) to predict prognostic biomarkers and molecular subtypes of invasive breast cancer. This prospective study enrolled a total of 723 cases involving 241 patients with invasive breast cancer. The 18 CT parameters of cancers were analyzed using 5 machine learning models to predict lymph node status, tumor grade, tumor size, hormone receptors, HER2, Ki67, and the molecular subtypes. The random forest model was the best model in terms of accuracy and the area under the receiver-operating characteristic curve (AUC). On average, the random forest model had 13% higher accuracy and 0.17 higher AUC than the logistic regression. The most important CT parameters in the random forest model for prediction were peak enhancement intensity (Hounsfield units), time to peak (seconds), blood volume permeability (mL/100 g), and perfusion of tumor (mL/min per 100 mL). Machine learning approaches to radiogenomics using low-dose perfusion breast CT is a useful noninvasive tool for predicting prognostic biomarkers and molecular subtypes of invasive breast cancer.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31780739 PMCID: PMC6882909 DOI: 10.1038/s41598-019-54371-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
List of evaluated perfusion CT parameters.
| CT perfusion parameters | Features |
|---|---|
| PEI (Hounsfield Units)* | Peak enhancement intensity by APPA: The peak enhancement due to injected contrast agent |
| PD (mL/min per 100 mL)* | Perfusion on deconvolution model by APPA: Blood flow through the vasculature in a defined tissue or mass volume on deconvolution model |
| PM (mL/min per100 mL)* | Perfusion on maximum slope model by APPA: Blood flow through the vasculature in a defined tissue or mass volume on maximum slope model |
| BV (mL/100 g)* | Blood volume by APPA: The total blood volume over the region during the period of the scan and it is determined by the area under the time-attenuation curve |
| MTT (seconds)* | Mean transit time by APPA: Average transit time of contrast agent in a given tissue |
| TTP (seconds)* | Time to peak by APPA: The time it takes for the peak enhancement to be reached |
| Permeability (mL/min per 100 g)* | Permeability by APPA: The flow of molecules through the capillary membranes in a certain volume of tissue |
| BV permeability (mL/100 g)* | Blood volume permeability by APPA: The blood volume passed through the contrast agent from the intravascular space into the extravascular space |
| Standard PD (mL/min per 100 mL)* | Standardization of perfusion on deconvolution model by APPA: Calculated perfusion value based on four inputs including cardiac output, body weight of the patient, volume and density of the injected contrast agent, and conversion factor (contrast agent density unit to HU [HU/(mg/mL)] on deconvolution model |
| Standard PM (mL/min per 100 mL)* | Standardization of perfusion on maximum slope model by APPA: Calculated perfusion value based on four inputs including cardiac output, body weight of the patient, volume and density of the injected contrast agent, and conversion factor (contrast agent density unit to HU [HU/(mg/mL)] on maximum slope model |
| Perfusion-Function (mL/min per 100 mL)* | Perfusion by FUNCTION |
| PEI-Function (Hounsfield Units)* | Peak enhancement intensity by FUNCTION |
| TTP-Function (seconds)* | Time to peak by FUNCTION |
| BV-Function (mL/100 g)* | Blood volume by FUNCTION |
| Perfusion-Function-Whole (mL/min per 100 mL) | Perfusion of whole tumor by FUNCTION |
| PEI-Function-Whole (Hounsfield Units) | Peak enhancement intensity of whole tumor by FUNCTION |
| TTP-Function-Whole (seconds) | Time to peak of whole tumor by FUNCTION |
| BV-Function-Whole (mL/100 g) | Blood volume of whole tumor by FUNCTION |
*Perfusion parameters measured by ROI placed for breast cancer covering the hot spot of the tumor.
CT computed tomography, APPA advanced perfusion and permeabiliy application software, FUNCTION Functional CT software.
Figure 1Evaluated 18 hemodynamic parameters on low-dose breast perfusion CT. (A) Ten parameters were obtained at the hot spot of the tumor using the Advanced Perfusion and Permeability Application software; peak enhancement intensity (PEI), perfusion on deconvolution model (PD), perfusion on maximum slope model (PM), blood volume (BV), mean transit time (MTT), time to peak (TTP), permeability (Permeability), blood volume permeability (BV permeability), standardization of perfusion on deconvolution model (Standard PD), and standardization of perfusion on maximum slope model (Standard PM). (B) Four parameters were obtained at the hot spot of the tumor using the Functional CT software; perfusion (Perfusion-Function), PEI (PEI-Function), TTP (TTP-Function), and BV (BV-Function). (C) Four parameters were obtained at the whole tumor range using the Functional CT software; perfusion of whole breast cancer (Perfusion-Function-Whole), PEI of whole breast cancer (PEI-Function-Whole), TTP of whole breast cancer (TTP-Function-Whole), and BV of whole breast cancer (BV-Function-Whole).
Descriptive statistics for prognostic biomarkers and molecular subtypes.
| Dependent variable | Count | Percentage (%) |
|---|---|---|
| Lymph node | ||
| Negative | 414 | 57 |
| Positive | 309 | 43 |
| Tumor grade | ||
| Low | 459 | 63 |
| High | 264 | 37 |
| Tumor size | ||
| ≤20 | 360 | 50 |
| >20 | 363 | 50 |
| ER | ||
| Negative | 237 | 33 |
| Positive | 486 | 67 |
| PR | ||
| Negative | 270 | 37 |
| Positive | 453 | 63 |
| HER2 | ||
| Negative | 570 | 79 |
| Positive | 153 | 21 |
| Ki67 | ||
| Negative | 339 | 47 |
| Positive | 384 | 53 |
| Molecular subtype | ||
| Luminal A | 294 | 41 |
| Luminal B | 207 | 29 |
| HER2 overexpression | 126 | 17 |
| Triple negative | 96 | 13 |
ER estrogen receptor, PR progesterone receptor, HER2 human epidermal growth factor receptor 2.
Descriptive statistics for CT perfusion parameters.
| Independent variable* | Mean | SD | Min | 25% | 50% | 75% | Max |
|---|---|---|---|---|---|---|---|
| PEI (Hounsfield Units) | 112.15 | 34.03 | 20.00 | 94.10 | 112.60 | 130.80 | 251.60 |
| PD (mL/min per 100 mL) | 108.14 | 36.11 | 31.20 | 81.85 | 102.00 | 128.60 | 356.80 |
| PM (mL/min per100 mL) | 219.05 | 89.99 | 79.30 | 159.15 | 200.90 | 257.40 | 950.10 |
| BV (mL/100 g) | 20.57 | 6.81 | 4.80 | 15.90 | 19.80 | 23.75 | 51.30 |
| MTT (seconds) | 14.90 | 5.97 | 7.80 | 12.60 | 14.70 | 16.60 | 156.20 |
| TTP (seconds) | 48.10 | 8.42 | 26.40 | 41.60 | 47.60 | 54.35 | 67.10 |
| Permeability (mL/min per 100 g) | 14.80 | 8.83 | 0.10 | 10.50 | 13.80 | 17.30 | 118.40 |
| BV permeability (mL/100 g) | 13.29 | 17.44 | 0.00 | 5.40 | 10.50 | 17.25 | 371.50 |
| Standard PD (mL/min per 100 mL) | 10.19 | 3.21 | 3.12 | 7.80 | 9.90 | 12.08 | 24.54 |
| Standard PM (mL/min per 100 mL) | 20.67 | 8.00 | 6.51 | 14.68 | 19.55 | 24.85 | 63.61 |
| Perfusion-Function (mL/min per 100 mL)* | 31.81 | 31.30 | 2.22 | 9.23 | 22.15 | 45.36 | 255.45 |
| PEI-Function (Hounsfield Units) | 71.87 | 28.08 | 12.08 | 53.74 | 68.91 | 84.58 | 192.24 |
| TTP-Function (seconds) | 55.97 | 49.32 | 3.05 | 18.34 | 30.56 | 92.75 | 366.69 |
| BV-Function (mL/100 g) | 42.83 | 36.64 | 0.58 | 24.28 | 33.39 | 48.62 | 268.54 |
| Perfusion-Function-Whole (mL/min per 100 mL) | 12.97 | 17.01 | 0.76 | 3.26 | 5.59 | 15.15 | 134.82 |
| PEI-Function-Whole (Hounsfield Units) | 44.40 | 19.46 | 4.25 | 31.67 | 42.82 | 53.60 | 193.63 |
| TTP-Function-Whole (seconds) | 89.29 | 54.22 | 6.11 | 27.51 | 95.81 | 135.56 | 168.19 |
| BV-Function-Whole (mL/100 g) | 28.75 | 26.71 | 0.16 | 15.63 | 22.81 | 32.59 | 256.66 |
*The meanings of independent variables are described in Table 1.
CT computed tomography, SD standard deviation, Min minimum value, 25% 25 percentile value of the distribution, 50% 50 percentile value of the distribution, 75% 75 percentile value of the distribution, Max maximum value.
Average performance of logistic regression and machine learning models for predicting prognostic biomarkers and molecular subtypes.
| Lymph node | Tumor grade | Tumor size | ER | PR | HER2 | Ki67 | Molecular subtype | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | Accuracy | AUC | |
| Logistic regression | 62% | 0.66 | 67% | 0.71 | 64% | 0.69 | 70% | 0.73 | 66% | 0.68 | 78% | 0.69 | 65% | 0.70 | 48% | 0.69 |
| Decision tree | 66% | 0.65 | 71% | 0.69 | 65% | 0.65 | 72% | 0.69 | 69% | 0.67 | 77% | 0.67 | 66% | 0.66 | 50% | 0.63 |
| Naïve Bayes | 53% | 0.58 | 63% | 0.70 | 58% | 0.63 | 68% | 0.71 | 65% | 0.65 | 72% | 0.69 | 59% | 0.69 | 49% | 0.70 |
| Random forest | ||||||||||||||||
| SVM | 57% | 0.34 | 63% | 0.35 | 48% | 0.42 | 67% | 0.44 | 64% | 0.39 | 79% | 0.47 | 53% | 0.35 | 41% | 0.65 |
| ANN | 64% | 0.68 | 68% | 0.73 | 65% | 0.71 | 75% | 0.77 | 69% | 0.72 | 76% | 0.73 | 66% | 0.71 | 35% | 0.72 |
ER estrogen receptor, PR progesterone receptor, HER2 human epidermal growth factor receptor 2, AUC the area under the receiver-operating-characteristic curve, SVM support vector machine, ANN artificial neural network.
Variable importance of CT parameters from the random forest in predicting prognostic biomarkers and molecular subtypes.
| Independent variable* | Lymph node | Tumor grade | Tumor size | ER | PR | HER2 | Ki67 | Molecular subtype |
|---|---|---|---|---|---|---|---|---|
| PEI (Hounsfield Units) | 0.0589† | 0.0685† | 0.0860† | 0.0498 | 0.0520 | 0.0681† | 0.0664† | 0.0614† |
| PD (mL/min per 100 mL) | 0.0564 | 0.0505 | 0.0508 | 0.0433 | 0.0430 | 0.0408 | 0.0552 | 0.0478 |
| PM (mL/min per100 mL) | 0.0583† | 0.0444 | 0.0507 | 0.0445 | 0.0497 | 0.0460 | 0.0433 | 0.0484 |
| BV (mL/100 g) | 0.0433 | 0.0479 | 0.0455 | 0.0391 | 0.0448 | 0.0436 | 0.0478 | 0.0479 |
| MTT (seconds) | 0.0413 | 0.0458 | 0.0487 | 0.0413 | 0.0488 | 0.0480 | 0.0399 | 0.0430 |
| TTP (seconds) | 0.0420 | 0.0771† | 0.0566 | 0.0689 | 0.0743† | 0.0743† | 0.0686† | 0.0665† |
| Permeability (mL/min per 100 g) | 0.0491 | 0.0462 | 0.0482 | 0.0731† | 0.0835† | 0.0526 | 0.0505 | 0.0651† |
| BV permeability (mL/100 g) | 0.0428 | 0.0668† | 0.0462 | 0.0830† | 0.0675† | 0.0718† | 0.1054† | 0.0823† |
| Standard PD (mL/min per 100 mL) | 0.0530 | 0.0547 | 0.0450 | 0.0463 | 0.0483 | 0.0460 | 0.0457 | 0.0454 |
| Standard PM (mL/min per 100 mL) | 0.0474 | 0.0483 | 0.0459 | 0.0413 | 0.0532 | 0.0439 | 0.0406 | 0.0464 |
| Perfusion-Function (mL/min per 100 mL) | 0.0500 | 0.0576 | 0.0752† | 0.0727† | 0.0666† | 0.0493 | 0.0625† | 0.0657† |
| PEI-Function (Hounsfield Units) | 0.0607† | 0.0612 | 0.0556 | 0.0550 | 0.0485 | 0.0619 | 0.0631† | 0.0575 |
| TTP-Function (seconds) | 0.0435 | 0.0377 | 0.0491 | 0.0642 | 0.0653 | 0.0460 | 0.0506 | 0.0555 |
| BV-Function (mL/100 g) | 0.0556 | 0.0537 | 0.0745† | 0.0391 | 0.0393 | 0.0418 | 0.0598 | 0.0526 |
| Perfusion-Function-Whole (mL/min per 100 mL) | 0.0757† | 0.0728† | 0.0586† | 0.0686† | 0.0657† | 0.0848† | 0.0487 | 0.0574 |
| PEI-Function-Whole (Hounsfield Units) | 0.0638† | 0.0451 | 0.0652† | 0.0436 | 0.0519 | 0.0721† | 0.0581 | 0.0471 |
| TTP-Function-Whole (seconds) | 0.0556 | 0.0645† | 0.0457 | 0.0808† | 0.0561 | 0.0564 | 0.0381 | 0.0550 |
| BV-Function-Whole (mL/100 g) | 0.0469 | 0.0571 | 0.0525 | 0.0455 | 0.0416 | 0.0527 | 0.0556 | 0.0550 |
*The meanings of independent variables are described in Table 1.
†Top 5 important variables with the highest variable importance scores in predicting a prognostic biomarker or molecular subtype.
CT computed tomography, ER estrogen receptor, PR progesterone receptor, HER2 human epidermal growth factor receptor 2.