| Literature DB >> 31001929 |
Xin Liao1, Bo Cai2, Bin Tian1, Yilin Luo1, Wen Song1, Yinglong Li3.
Abstract
BACKGROUND: This study aimed to examine multi-dimensional MRI features' predictability on survival outcome and associations with differentially expressed Genes (RNA Sequencing) in groups of glioblastoma multiforme (GBM) patients.Entities:
Keywords: zzm321990EREGzzm321990; ROS1; TIMP1; death day to diagnosis; glioblastoma multiforme; machine learning; radiogenomics
Mesh:
Substances:
Year: 2019 PMID: 31001929 PMCID: PMC6533509 DOI: 10.1111/jcmm.14328
Source DB: PubMed Journal: J Cell Mol Med ISSN: 1582-1838 Impact factor: 5.310
Clinical characteristics of the cohort. This table shows the clinical information of the data analysed in this study. Gene∩MRI means that the dataset has both genetic data and MRI data
| Gender | Death days to diagnosis | Number | Age | ||||
|---|---|---|---|---|---|---|---|
| Men | Female | Long (>1 year) | Short (<1 year) | Total | Mean | SD | |
| Gene | 85 | 44 | 68 | 61 | 129 | 62.05 (25‐89) | 12.55 |
| MRI | 85 | 52 | 71 | 66 | 137 | 61.24 (16‐86) | 13.53 |
| Gene+MRI | 27 | 19 | 25 | 21 | 46 | 61.86 (32‐86) | 12.04 |
Figure 1The workflow of this study. The radiomics workflow. Lesions were segmented from untreated MR images. Feature extraction was performed from lesions by pyradiomics. The radiomics features were selected for classifier model constructing. And the classifier model was evaluated by confusion matrix and ROC curves
Detailed names and abbreviations of 72 features
| Full name | Short name |
|---|---|
| log‐sigma‐3‐0‐mm‐3D_gldm_SmallDependenceEmphasis | gldm‐SDE |
| wavelet‐HHL_gldm_DependenceNonUniformityNormalized | gldm‐DNUN |
| log‐sigma‐3‐0‐mm‐3D_firstorder_Uniformity | firstorder‐Uniformity |
| wavelet‐HHH_glszm_GrayLevelNonUniformityNormalized | glszm‐GLNUN |
| wavelet‐LLL_glcm_InverseVariance | glcm‐IV |
| wavelet‐LLH_glszm_ZonePercentage | glszm‐ZP |
| wavelet‐LLH_glcm_Idm | glcm‐LLH‐Idm |
| wavelet‐HLH_glcm_InverseVariance | glcm‐HLH‐IV |
| log‐sigma‐4‐0‐mm‐3D_glcm_Idm | glcm‐Idm |
| wavelet‐HLH_glcm_SumSquares | glcm‐HLH‐SS |
| wavelet‐HLH_gldm_GrayLevelVariance | gldm‐HLH‐GLV |
| wavelet‐LLL_glszm_ZoneVariance | glszm‐LLL‐ZV |
| log‐sigma‐4‐0‐mm‐3D_glcm_Id | glcm‐Id |
| wavelet‐HHL_glrlm_RunLengthNonUniformityNormalized | glrlm‐HHL‐RLNUN |
| wavelet‐HLL_glrlm_RunLengthNonUniformityNormalized | glrlm‐HLL‐RLNUN |
| wavelet‐HLH_glszm_SmallAreaEmphasis | glszm‐HLH‐SAE |
| wavelet‐LLL_glcm_Correlation | glcm‐LLL‐Correlation |
| wavelet‐HHH_glszm_SmallAreaEmphasis | glszm‐SAE |
| wavelet‐HHL_glcm_DifferenceAverage | glcm‐DA |
| log‐sigma‐5‐0‐mm‐3D_glcm_Correlation | glcm‐Correlation |
| log‐sigma‐4‐0‐mm‐3D_glrlm_ShortRunEmphasis | glrlm‐SRE |
| original_glrlm_RunLengthNonUniformityNormalized | glrlm‐RLNUN |
| wavelet‐LHL_glcm_Idn | glcm‐LHL‐Idn |
| wavelet‐HLH_glcm_Idn | glcm‐HLH‐Idn |
| wavelet‐HHL_glcm_Idn | glcm‐HHL‐Idn |
| wavelet‐LLL_glrlm_RunLengthNonUniformityNormalized | glrlm‐LLL‐RLNUN |
| wavelet‐LLL_glcm_Imc2 | glcm‐LLL‐Imc2 |
| log‐sigma‐5‐0‐mm‐3D_glcm_Idn | glcm‐Idn |
| log‐sigma‐4‐0‐mm‐3D_glcm_Idmn | glcm‐Idmn |
| wavelet‐HLH_glcm_ClusterProminence | glcm‐HLH‐CP |
| wavelet‐HHL_glcm_DifferenceEntropy | glcm‐HHL‐DE |
| wavelet‐HHH_firstorder_InterquartileRange | firstorder‐HHH‐IR |
| wavelet‐HHL_firstorder_InterquartileRange | firstorder‐HHL‐IR |
| log‐sigma‐3‐0‐mm‐3D_firstorder_Entropy | firstorder‐Entropy |
| wavelet‐LLH_gldm_LargeDependenceEmphasis | gldm‐LLH‐LDE |
| wavelet‐LLH_glcm_DifferenceEntropy | glcm‐LLH‐DE |
| wavelet‐HLH_firstorder_InterquartileRange | firstorder‐HLH‐IR |
| wavelet‐LHL_gldm_LargeDependenceEmphasis | gldm‐LHL‐LDE |
| original_gldm_LargeDependenceEmphasis | gldm‐LDE |
| wavelet‐LHH_glcm_SumEntropy | glcm‐LHH‐SE |
| wavelet‐LHH_glszm_LargeAreaEmphasis | glszm‐LHH‐LAE |
| log‐sigma‐5‐0‐mm‐3D_glcm_SumSquares | glcm‐SS |
| log‐sigma‐2‐0‐mm‐3D_glcm_Contrast | glcm‐Contrast |
| wavelet‐LHH_gldm_LargeDependenceEmphasis | gldm‐LHH‐LDE |
| log‐sigma‐5‐0‐mm‐3D_glrlm_RunEntropy | glrlm‐RE |
| log‐sigma‐5‐0‐mm‐3D_glszm_ZoneVariance | glszm‐ZV |
| wavelet‐HHL_glcm_JointEntropy | glcm‐JointEntropy |
| log‐sigma‐3‐0‐mm‐3D_glszm_LargeAreaEmphasis | glszm‐LAE |
| wavelet‐LLH_gldm_GrayLevelNonUniformity | gldm‐GLNU |
| wavelet‐HLL_glszm_GrayLevelNonUniformity | glszm‐GLNU |
| log‐sigma‐5‐0‐mm‐3D_glszm_ZoneEntropy | glszm‐ZE |
| wavelet‐HLL_gldm_GrayLevelNonUniformity | gldm‐HLL‐GLNU |
| wavelet‐LLL_glcm_SumEntropy | glcm‐LLL‐SE |
| wavelet‐HHH_glrlm_HighGrayLevelRunEmphasis | glrlm‐HHH‐HGLRE |
| wavelet‐HHH_firstorder_Maximum | firstorder‐Max |
| wavelet‐LLH_gldm_GrayLevelVariance | gldm‐LLH‐GLV |
| wavelet‐LLH_glcm_SumSquares | glcm‐LLH‐SS |
| original_firstorder_MeanAbsoluteDeviation | firstorder‐MAD |
| wavelet‐LLH_glcm_JointAverage | glcm‐JointAverage |
| wavelet‐LLH_glrlm_GrayLevelVariance | glrlm‐GLV |
| log‐sigma‐2‐0‐mm‐3D_gldm_DependenceNonUniformity | gldm‐DNU |
| log‐sigma‐5‐0‐mm‐3D_glrlm_HighGrayLevelRunEmphasis | glrlm‐HGLRE |
| wavelet‐HLL_firstorder_Variance | firstorder‐Variance |
| wavelet‐LHL_firstorder_RootMeanSquared | firstorder‐RMS |
| original_glrlm_ShortRunHighGrayLevelEmphasis | glrlm‐SRHGLE |
| original_glszm_SmallAreaHighGrayLevelEmphasis | glszm‐SAHGLE |
| log‐sigma‐2‐0‐mm‐3D_glcm_ClusterProminence | glcm‐CP |
| original_glszm_HighGrayLevelZoneEmphasis | glszm‐HGLZE |
| wavelet‐LLL_gldm_SmallDependenceHighGrayLevelEmphasis | gldm‐SDHGLE |
| original_glszm_LargeAreaHighGrayLevelEmphasis | glszm‐LAHGLE |
| log‐sigma‐2‐0‐mm‐3D_gldm_LargeDependenceHighGrayLevelEmphasis | gldm‐LDHGLE |
| wavelet‐LLL_gldm_LargeDependenceHighGrayLevelEmphasis | gldm‐LLL‐LDHGLE |
Figure 2The performance of the GBDT classifier. A, Confusion matrix (The horizontal line means the number of predicted in each group; the vertical line means the actual number of each group. The leading diagonal represents correct prediction; the minor diagonal represents incorrect prediction). B, Receiver operating characteristic (ROC) curve. (X axis represents false positive rate and Y axis is true positive rate.)
Figure 3Gene expressions of six gene. The distribution of six Gene expressions among patients with short vs. long survival time. The expression levels of six genes were significantly different in two classes of survival patients
Intersection of difference analysis between group long and short. Threshold of difference analysis adjusted P < 0.05 & |log2FoldChange|>1
| mRNA | Base mean | log2FC |
|
| Base mean | log2FC |
|
|
|---|---|---|---|---|---|---|---|---|
| WDR72 | 22.54202 | −1.53057 | 0.00001 | 0.00327 | 22.10411 | −2.66113 | <0.00001 | 0.00678 |
| C14orf39 | 9.74465 | −1.03545 | 0.00051 | 0.04247 | 13.31277 | −2.21750 | 0.00002 | 0.02407 |
| TIMP1 | 25087.60318 | 1.06274 | <0.00001 | 0.00109 | 28495.57870 | 1.53657 | 0.00004 | 0.03445 |
| CHIT1 | 345.04844 | 1.40483 | <0.00001 | 0.00109 | 329.45162 | 2.05879 | 0.00001 | 0.02339 |
| ROS1 | 16.00196 | 1.42552 | <0.00001 | 0.00178 | 21.31838 | 2.24119 | 0.00003 | 0.03445 |
| EREG | 57.47073 | 2.63671 | <0.00001 | <0.00001 | 69.45137 | 2.75592 | 0.00002 | 0.00678 |
FC: fold change; p.adj: adjusted p value.
Figure 4Correlation between genes and image features. The matrix correlation between top image features and genes. A, The matrix showing the correlations between top image features and genes. B, The correlations between top image features and genes after the threshold of 0.4 was applied to filter out features that had weak correlations with corresponding genes
Figure 5Correlation between three genes and nine image features. The correlations of nine image features and three genes. The solid line represents a positive correlation, and the dotted line represents a negative correlation
Figure 6The Kaplan‐Meier survival curve of six genes. KM survival curves show significant overall survival differences between higher‐expression levels and lower‐expression levels of survival rates of patients. For all the subplots, the ‘group 1’, coloured by yellow, stands for higher‐expression group at the optimal cut point identified by maximally selected rank statistics
Associations between image features and metagenes. This table shows the associations between nine image features and three metagenes, and the last column is the values of Pearson correlation coefficient
| Efficient DEGs | Important image features | PCC |
|---|---|---|
| EREG | wavelet‐HHL_gldm_DependenceNonUniformityNormalized | 0.41 |
| EREG | log‐sigma‐4‐0‐mm‐3D_glcm_Id | −0.46 |
| EREG | wavelet‐HHL_glcm_DifferenceAverage | 0.42 |
| EREG | log‐sigma‐2‐0‐mm‐3D_glcm_Contrast | 0.49 |
| EREG | log‐sigma‐5‐0‐mm‐3D_glszm_ZoneVariance | −0.56 |
| EREG | log‐sigma‐3‐0‐mm‐3D_glszm_LargeAreaEmphasis | −0.51 |
| EREG | wavelet‐LHL_firstorder_RootMeanSquared | −0.41 |
| EREG | log‐sigma‐2‐0‐mm‐3D_glcm_ClusterProminence | 0.46 |
| TIMP1 | log‐sigma‐4‐0‐mm‐3D_glcm_Id | −0.43 |
| TIMP1 | log‐sigma‐2‐0‐mm‐3D_glcm_Contrast | 0.42 |
| TIMP1 | log‐sigma‐5‐0‐mm‐3D_glszm_ZoneVariance | −0.47 |
| TIMP1 | log‐sigma‐3‐0‐mm‐3D_glszm_LargeAreaEmphasis | −0.49 |
| TIMP1 | log‐sigma‐2‐0‐mm‐3D_glcm_ClusterProminence | 0.43 |
| ROS1 | wavelet.LLH_glcm_Idm | −0.40 |
DEG: differentially expressed genes; PCC: Pearson correlation coefficient.