| Literature DB >> 34836507 |
Liu Mingzhu1, Ge Yaqiong2, Li Mengru3, Wei Wei4.
Abstract
BACKGROUND: The objective of this study is to explore the value of two-dimensional (2D) and three-dimensional (3D) radiomics models based on enhanced computed tomography (CT) images in predicting BRCA gene mutations in patients with epithelial ovarian cancer.Entities:
Keywords: BRCA gene; Mutation; Ovarian cancer; Radiomics
Mesh:
Substances:
Year: 2021 PMID: 34836507 PMCID: PMC8626978 DOI: 10.1186/s12880-021-00711-3
Source DB: PubMed Journal: BMC Med Imaging ISSN: 1471-2342 Impact factor: 1.930
Fig. 1Flow chart of screening and grouping the enrolled cases
Fig. 2a–-d Images of a 35-year-old patient with high-grade (stage IIIC) serous carcinoma of the ovary, which was genetically detected with a non-BRCA gene mutation. ITK-SNAP software was used to segment and label the lesion layer-by-layer to generate a 3D image of the lesion
General characteristics of the three groups
| Variable | Sample | Class meaning unknown | Class mutation | Class wild | Statistics | |
|---|---|---|---|---|---|---|
| Age | 106 | 55.00 (49.00, 58.05) | 52.00 (47.70, 57.20) | 54.00 (48.00, 59.30) | 0.781 | 0.677 |
| Maximum_Diameter | 106 | 8.75 (5.95, 11.00) | 7.00 (4.57, 9.24) | 7.00 (4.49, 10.00) | 2.929 | 0.231 |
| Tumor marker CA125 | 86 | 15 (83.33%) | 42 (93.33%) | 63 (81.82%) | 3.601 | 0.463 |
| Tumor marker CA125、CA199 | 19 | 3 (16.67%) | 3 (6.67%) | 13 (16.88%) | ||
| Tumor marker CA199 | 1 | 0 (0.00%) | 0 (0.00%) | 1 (1.30%) |
Fig. 3Application of LASSO (Least absolute shrinkage and selection operator)-logistic regression to imaging feature screening in the 2D + 3D model shows that the LASSO-logistic regression model selects tuning parameters (λ) through tenfold cross-validation and obtains the relationship between binomial variance and logarithm (λ) (a). The relationship is retained with the parameters that yield the smallest binomial deviation, and the 12 best features with non-zero coefficients (b) are retained in the final model. The relationships between the features and gene mutation status (correlation coefficient × 100) are shown in the heat map (c)
Radscores of each patient in the training and validation sets calculated by different models, and their distribution and difference statistics in patients with wild-type and mutant type BRCA genes
| Variable | Sample | Wild | Mutation | Statistics | ||
|---|---|---|---|---|---|---|
| Train | Radscore _2D3D | 76 | − 1.22 ± 1.06 | 0.52 ± 0.94 | − 7.329 | < 0.001 |
| Radscore _3D | 76 | − 0.71 (− 1.06, − 0.28) | 0.00 (− 0.36, 0.28) | − 4.413 | < 0.001 | |
| Radscore _2D | 76 | − 0.51 ± 0.23 | − 0.19 ± 0.30 | − 5.303 | < 0.001 | |
| Test | Radscore_2D3D | 30 | − 1.03 (− 2.32, − 0.01) | 0.66 (− 0.23, 4.92) | − 2.963 | 0.003 |
| Radscore_3D | 30 | − 0.61 (− 0.95, − 0.25) | − 0.04 (− 0.48, 0.49) | − 2.159 | 0.031 | |
| Radscore_2D | 30 | − 0.41 ± 0.22 | − 0.17 ± 0.23 | − 2.9 | 0.007 |
Cutoff value prediction ability of the 2D, 3D, and 2D + 3D joint radiomics models
| AUC | Accuracy | Sensitivity | Specificity | Positive value | Negative value | Cut-off | |
|---|---|---|---|---|---|---|---|
| 2D Training set | 0.81 (0.71–0.91) | 0.75 (0.63–0.84) | 0.68 | 0.83 | 0.86 | 0.65 | − 0.43 |
| 2D validation set | 0.78 (0.61–0.96) | 0.73 (0.54–0.87) | 0.61 | 0.91 | 0.91 | 0.61 | |
| 3D training set | 0.80 (0.70–0.90) | 0.75 (0.63–0.84) | 0.71 | 0.81 | 0.84 | 0.65 | − 0.61 |
| 3D validation set | 0.75 (0.55–0.92) | 0.74 (0.55–0.88) | 0.84 | 0.58 | 0.76 | 0.70 | |
| 2D + 3D Training set | 0.91 (0.84–0.97) | 0.86 (0.77–0.93) | 0.95 | 0.74 | 0.84 | 0.92 | 0.14 |
| 2D + 3D validation set | 0.82 (0.67–0.98) | 0.73 (0.54–0.87) | 0.611 | 0.91 | 0.91 | 0.61 |
Fig. 4ROC curves of 2D model, 3D model, and 2D + 3D model in the training group (a) and validation group (b)
Fig. 5The yellow line, black dotted line, and blue dotted line represent the data obtained from the 2D, 3D, and 2D + 3D images, respectively. The x-axis represents the patient’s personal threshold probability (e.g., x = 0.6 means that the high-risk threshold of ovarian cancer and BRCA gene mutation is 60%). The y-axis represents net income. The line labeled “All” represents the hypothesis that all ovarian cancer cases are caused by BRCA gene mutations. The thin line labeled “None” represents the assumption that there are no BRCA gene mutations in patients with ovarian cancer
The detailed information of the features
| Image type | Features | Features explanation |
|---|---|---|
| X3D-venous-wavelet.LHH | Glcm–cluster shade | Cluster Shade is a measure of the skewness and uniformity of the GLCM. A higher cluster shade implies greater asymmetry about the mean |
| X3D-venous-wavelet.HHH | Firstorder –mean absolute deviation (MAD) | Mean Absolute Deviation is the mean distance of all intensity values from the Mean Value of the image array |
| X2D-venous-wavelet.HLH X2D-delay-wavelet.LHHX2D-Artery-wavelet.HHH | Firstorder—skewness | Skewness measures the asymmetry of the distribution of values about the Mean value. Depending on where the tail is elongated and the mass of the distribution is concentrated, this value can be positive or negative |
| X3D-delay-wavelet.LHH | Glcm—IDMN | IDMN (inverse difference moment normalized) is a measure of the local homogeneity of an image. IDMN weights are the inverse of the Contrast weights (decreasing exponentially from the diagonal i = ji = j in the GLCM). Unlike Homogeneity2, IDMN normalizes the square of the difference between neighboring intensity values by dividing over the square of the total number of discrete intensity values |
| X2D-venous | Original shape elongnation | Elongation shows the relationship between the two largest principal components in the ROI shape. For computational reasons, this feature is defined as the inverse of true elongation |
| X2D-venous-wavelet.HHH | Glszm–Zone entropy (ZE) | ZE measures the uncertainty/randomness in the distribution of zone sizes and gray levels. A higher value indicates more heterogeneneity in the texture patterns |
| X2D-venous-wavelet.HHH | Glszm–gray level non-uniformity (GLN) | GLN measures the variability of gray-level intensity values in the image, with a lower value indicating more homogeneity in intensity values |
| X3D-artery-wavelet.HLL | Firstorder—Kurtosis | Kurtosis is a measure of the ‘peakedness’ of the distribution of values in the image ROI. A higher kurtosis implies that the mass of the distribution is concentrated towards the tail(s) rather than towards the mean. A lower kurtosis implies the reverse: that the mass of the distribution is concentrated towards a spike near the Mean value |
| X2D-delay-wavelet.HHH | Glszm–small area high gray level emphasis (SAHGLE) | SAHGLE measures the proportion in the image of the joint distribution of smaller size zones with higher gray-level values |
| X2D-venous-wavelet.LLL | Firstorder–range max(X)-min(X) | The range of gray values in the ROI |