| Literature DB >> 33972663 |
Ji Eun Park1, Dain Eun2,3, Ho Sung Kim4, Da Hyun Lee1, Ryoung Woo Jang2, Namkug Kim1,2.
Abstract
Generative adversarial network (GAN) creates synthetic images to increase data quantity, but whether GAN ensures meaningful morphologic variations is still unknown. We investigated whether GAN-based synthetic images provide sufficient morphologic variations to improve molecular-based prediction, as a rare disease of isocitrate dehydrogenase (IDH)-mutant glioblastomas. GAN was initially trained on 500 normal brains and 110 IDH-mutant high-grade astocytomas, and paired contrast-enhanced T1-weighted and FLAIR MRI data were generated. Diagnostic models were developed from real IDH-wild type (n = 80) with real IDH-mutant glioblastomas (n = 38), or with synthetic IDH-mutant glioblastomas, or augmented by adding both real and synthetic IDH-mutant glioblastomas. Turing tests showed synthetic data showed reality (classification rate of 55%). Both the real and synthetic data showed that a more frontal or insular location (odds ratio [OR] 1.34 vs. 1.52; P = 0.04) and distinct non-enhancing tumor margins (OR 2.68 vs. 3.88; P < 0.001), which become significant predictors of IDH-mutation. In an independent validation set, diagnostic accuracy was higher for the augmented model (90.9% [40/44] and 93.2% [41/44] for each reader, respectively) than for the real model (84.1% [37/44] and 86.4% [38/44] for each reader, respectively). The GAN-based synthetic images yield morphologically variable, realistic-seeming IDH-mutant glioblastomas. GAN will be useful to create a realistic training set in terms of morphologic variations and quality, thereby improving diagnostic performance in a clinical model.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33972663 PMCID: PMC8110557 DOI: 10.1038/s41598-021-89477-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Process for inclusion of the study population and the training dataset for the generative adversarial network.
Figure 2Morphologic characteristics of real IDH-mutant glioblastomas (left) and synthetic IDH-mutant glioblastomas generated by a generative adversarial network (right) based on contrast-enhanced T1-weighted (CE-T1w) and paired FLAIR images. (A) CE-T1w images showing different contrast patterns of rim enhancement, thick nodular enhancement, and patch enhancement. (B) FLAIR images showing types of surrounding high signal intensity (tumor dominant and edema dominant) and margins of non-enhancing lesions (clear and indistinct). Although the appearances of synthetic images are similar to those of real images, there were no exact matches.
Clinical and Imaging characteristics of the study patients.
| Variables | Training set | Validation set | |||||||
|---|---|---|---|---|---|---|---|---|---|
| IDH-wild | IDH-mutant | IDH-mutant (GAN) | IDH-wild | IDH-mutant | |||||
| No. of patients | 80 | 38 | 38 | 33 | 11 | ||||
| Median age (years) | 60 | 47 | – | – | < 0.001 | < 0.001 | 58 | 42 | = 0.003 |
| 0.01 | < 0.001 | 0.002 | 0.01 | ||||||
| Rim enhancing | 47 | 19 | 9 | 24 | 3 | ||||
| Thick nodular | 28 | 8 | 5 | 6 | 3 | ||||
| Patch enhancing | 5 | 11 | 24 | 3 | 5 | ||||
| 0.55 | 0.01 | 0.008 | 0.36 | ||||||
| Frontal or insula | 29 | 25 | 25 | 16 | 8 | ||||
| Other | 42 | 10 | 12 | 13 | 2 | ||||
| Thalamus or brainstem | 9 | 3 | 1 | 4 | 1 | ||||
| 0.35 | 0.001 | < 0.001 | 0.007 | ||||||
| Yes | 72 | 23 | 19 | 30 | 6 | ||||
| No | 8 | 15 | 19 | 3 | 5 | ||||
| 0.39 | 0.001 | < 0.001 | < 0.001 | ||||||
| Tumor dominant | 35 | 29 | 32 | 8 | 11 | ||||
| Edema dominant | 45 | 9 | 6 | 25 | 0 | ||||
| 0.10 | < 0.001 | < 0.001 | < 0.001 | ||||||
| Clear | 4 | 20 | 27 | 0 | 6 | ||||
| Indistinct | 76 | 18 | 11 | 33 | 5 | ||||
Data are expressed as the counts and median. P + indicates differences between real and synthetic IDH-mutant data. P* and P** indicate differences between real IDH-wild type and real IDH-mutant data and between real IDH-wild type and synthetic IDH-mutant data, respectively.
IDH isocitrate dehydrogenase.
Figure 3Representative synthetic images correctly determined to be synthetic by neuroradiologists. (A) Contrast-enhanced T1-weighted (CE-T1w) image similar to a real image, coupled with a FLAIR image showing an open rim of hypointensity, suggesting that the image was not real. (B) CE-T1w images showing nodular enhancement with a mesh-like artifact, suggesting that these images were not real. (C) CE-T1w images showing bizarre-shaped linear enhancement, suggesting that these images were not real.
Univariable and multivariable binomial logistic regression analysis of factors predicting IDH mutation in the training dataset.
| Variables | With real data (model 1) (n = 118) | With IDH-mutant synthetic data (model 2) (n = 118) | Augmented with real and synthetic data (model 3) (n = 156) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Univariate analysis | Multivariate analysis | Univariate analysis | Multivariate analysis | Univariate analysis | Multivariate analysis | |||||||
| Beta coefficient | Beta coefficient | Beta coefficient | Beta coefficient | Beta coefficient | Beta coefficient | |||||||
| Other | Ref. | Ref. | Ref. | Ref. | Ref. | Ref. | ||||||
| Frontal or insula | 1.29 | 1.34 (1.24, 1.45) | 1.10 | 0.009 | 1.52 (1.04, 2.35) | 1.19 | 1.32 (1.24, 1.4) | |||||
| Thalamus or brainstem | 0.95 | 0.18 | 0.25 (0.09, 0.41) | 0.77 | 2.05 | 0.06 | 3.33 (2.89, 3.77) | 0.77 | 1.35 | 0.78 (0.64, 0.91) | 0.36 | |
| Absence of necrosis | 1.77 | 1.91 (1.77, 2.05) | 2.19 | < 0.001 | − 0.06 (− 0.24, 0.12) | 0.95 | 1.98 | 1.13 (− 0.52, 0.78) | 0.08 | |||
| Rim enhancing | Ref. | Ref. | Ref. | Ref. | ||||||||
| Thick nodular | − 0.35 | 0.47 | − 1.50 (− 1.64, − 1.36) | 0.05 | − 0.07 | 0.90 | 0.25 (0.086,0.414) | 0.78 | − 0.25 | 0.54 | − 0.58 (0.66,− 0.49) | 0.27 |
| Patch enhancing | 1.69 | 0.035 (− 0.135, 0.205) | 0.97 | 3.22 | 3.46 (3.26, 3.66) | 2.46 | 1.97 (1.87, 2.07) | |||||
| Edema− dominant surrounding high signal intensity | − 0.91 | − 0.54 (− 0.646, − 0.434) | 0.35 | − 3.56 | − 1.29 (− 1.5, − 1.08) | 0.26 | − 1.62 | − 0.83 (− 0.91, − 0.75) | 0.11 | |||
| Indistinct | Ref. | Ref. | Ref. | Ref. | Ref. | Ref. | ||||||
| Distinct | 2.94 | 2.68 (2.56, 2.8) | 3.84 | < 0.001 | 3.88 (3.04, 4.72) | 3.47 | < 0.001 | 2.96 (2.86, 3.06) | ||||
Data in parentheses are 95% confidence intervals.
Diagnostic performance of the models for prediction of IDH mutation.
| Model from real data (model 1) | Model from IDH-mutant synthetic data (model 2) | Combined with real and synthetic data (model 3) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | 95% CI | Sensitivity | Specificity | AUC | 95% CI | Sensitivity | Specificity | AUC | 95% CI | Sensitivity | Specificity | |
| Training set | 0.864 | 0.789, 0.920 | 57.9% | 95.0% | 0.958 | 0.904, 0.986 | 84.2% | 92.5% | 0.899 | 0.841, 0.942 | 72.37% | 91.25% |
| Reader 1 | 0.713 | 0.535, 0.892 | 36.4% | 100% | 0.747 | 0.517, 0.978 | 63.6% | 100% | 0.747 | 0.517, 0.978 | 63.6% | 100% |
| Reader 2 | 0.773 | 0.565, 0.981 | 62.7% | 63.6% | 0.773 | 0.565, 0.981 | 62.7% | 63.6% | 0.821 | 0.653, 0.989 | 63.6% | 93.9% |
| Reader 1 | 0.871 | 0.722, 1.00 | 63.6% | 100% | 0.826 | 0.662, 0.991 | 63.6% | 100% | 0.826 | 0.662, 0.991 | 63.6% | 100% |
| Reader 2 | 0.855 | 0.710, 1.00 | 78.8% | 81.8% | 0.861 | 0.720, 1.00 | 69.7% | 90.9% | 0.861 | 0.720, 1.00 | 69.7% | 90.9% |
AUC area under the receiver operating characteristics curve.