Indre Drulyte1, Tomas Ruzgas2, Renaldas Raisutis1,3, Skaidra Valiukeviciene4, Gintare Linkeviciute4. 1. a Prof. K. Baršauskas Ultrasound Research Institute , Kaunas University of Technology , Kaunas , Lithuania. 2. b Department of Applied Mathematics, Faculty of Mathematics and Natural Sciences , Kaunas University of Technology , Kaunas , Lithuania. 3. c Department of Electrical Power systems, Faculty of Electrical and Electronics Engineering , Kaunas University of Technology , Kaunas , Lithuania. 4. d Department of Skin and Venereal Diseases , Lithuanian University of Health Sciences , Kaunas , Lithuania.
Abstract
Ultrasonic and digital dermatoscopy diagnostic methods are used in order to estimate the changes of structure, as well as to non-invasively measure the changes of parameters of lesions of human tissue. These days, it is very actual to perform the quantitative analysis of medical data, which allows to achieve the reliable early-stage diagnosis of lesions and help to save more lives. The proposed automatic statistical post-processing method based on integration of ultrasonic and digital dermatoscopy measurements is intended to estimate the parameters of malignant tumours, measure spatial dimensions (e.g. thickness) and shape, and perform faster diagnostics by increasing the accuracy of tumours differentiation. It leads to optimization of time-consuming analysis procedures of medical images and could be used as a reliable decision support tool in the field of dermatology.
Ultrasonic and digital dermatoscopy diagnostic methods are used in order to estimate the changes of structure, as well as to non-invasively measure the changes of parameters of lesions of human tissue. These days, it is very actual to perform the quantitative analysis of medical data, which allows to achieve the reliable early-stage diagnosis of lesions and help to save more lives. The proposed automatic statistical post-processing method based on integration of ultrasonic and digital dermatoscopy measurements is intended to estimate the parameters of malignant tumours, measure spatial dimensions (e.g. thickness) and shape, and perform faster diagnostics by increasing the accuracy of tumours differentiation. It leads to optimization of time-consuming analysis procedures of medical images and could be used as a reliable decision support tool in the field of dermatology.
The number of incidences of skin melanoma had grown up faster over the last three decades, avoiding exclusion to different age groups. Looking at the statistics of melanoma cases, the rate per period 2005–2014 has increased by 3% per year in the group of men and women who are 50 years old and older but had stabilized among those younger than age 50. Even though melanoma is diagnosed for only about 1% of skin cancers, it leads to the majority of skin cancer deaths [1]. This trend is mainly caused because of genetics, environmental factors, and other addictions [2-4].The purposes of exploration of the lesion parameters of human tissue are to have a faster diagnosis, informative prediction of the illness, also to reduce the cost of possible treatment, and to save as much lives as it is feasible. The main task is to find the way which leads to accurate diagnostics. First signs of the dermatoscopy appeared in 1948 and a Spitz nevus was introduced as ‘melanoma of childhood’ due to the potentiality of the technologies and histopathologic features at that time [5]. In 1953, Arthur C. Allen, Helwig in 1954, and other researches continued their investigations from the classification of benign and malignant nevus perspective [6]. Later, in 1987, Pehamberger et al. presented a new diagnostic approach named as a ‘pattern analysis’ [7]. Pattern analysis was designated to detect and provide diagnostic accuracy for pigmented skin lesions diseases, such as melanoma and other skin damages [7].Another one approach which is leading to the faster diagnostic was proposed in 1990 by Bahmer et al. It is named as the seven-point checklist. This approach is based on a simplified pattern analysis and it uses seven standard criteria presented in the guidelines of the terminology consensus on dermoscopy [8,9]. In 1994, Stolz et al. have presented a paper introducing a modification of the ABCD rule in order to influent early-stages diagnosis of malignant skin melanoma [10]. Two years later, Menzies et al. have presented a new diagnostic approach, based on the recognition of two negative dermoscopic features (not favouring melanoma diagnosis) and nine positive features (favouring melanoma diagnosis). This method has shown sensitivity and specificity of 92% and 71%, respectively [11]. Choosing of method or an optimal algorithm depends on aims of task analysis and data characteristics.Over the last decade, enough methods of data mining application in medicine are found. In diagnosis, there are widely applied neural networks, decision trees, decision rules [12], methods for search of associative rules (for costs analysis) [13], prediction of patient health, and treatment probability, as well as, very popularly use, combinations of prediction algorithms [14]. In 2014, N. Esfandiari et al. [15] carried out a literature review; there are described applications of data mining in medicine based on analysis of the structured data. They stated that classification (neural networks, decision trees, decision rules, support vector model), clustering (k-means, hierarchical clustering), and associative search (a priori associative rules search) models are the most popular in medicine. Lalayants et al. [16] have said that the solution of successful medical data mining is to identify the right activity of healthcare institution or to find the clinical problem. Data mining methods are usually used in biomedical data analysis and visualization tasks in order to facilitate decision-making [17]. If the data mining process would be simple enough, the management of information problems would be already solved long time ago (R. Bellazzi, B. Zupan [14]). Practical data mining application in medicine has some obvious barriers as technological problems, trans-disciplinary communication, ethics, and patient data security [13,17,18].Medical research leads to a lot of data characterizing the condition of patient. All these data are dynamically changing and depend on patient illness, patient biological condition, environment, the quality of life, related diseases, and other actual reasons that can be described as a random factor. The change of medical statistics observations is described by primary statistics analysis. Jose et al. [19] have presented a new approach in order to improve early diagnostic of skin melanoma from the dermatoscopic images. They have used a general ABCD features and involved other personalized features as skin type, age, and gender. The accuracy of this approach is equal to 86%. In the meantime, a sensitivity of this method is equal to 94% and specificity is equal to 68%.Other approach for decision support tool was proposed by Daniel Ruiz et al. in 2011 [20]. This independently working method includes artificial neural network (ANN) classifiers, a Bayesian classifier, and the algorithm of the K-nearest neighbours. In 2012, a group of researches has presented an automatic detection of melanoma method based on ANN model [21].The dermatoscopy-based analysis methods allow to analyse the surface of the skin. The analysis of deeper skin layer can also be informative in order to recognize malignant skin tumour at the early stage. Unfortunately, most of the studies have been related with the thickness measurements of melanocytic skin tumours [22,23]. There are only few studies, related with the melanocytic skin tumour diagnosis based on the analysis of the acquired ultrasonic data.The objective of the presented research was the development and application of statistical post-processing method in order to achieve faster, more accurate, and independent experience of the investigator in early-stage diagnosis of the melanocytic skin tumours.In this study, the automatic statistical post-processing method based on analysis of ultrasonic and digital dermatoscopy images of melanocytic skin tumours is presented. The method is intended to estimate the parameters of malignant and benign skin tumours and to increase the accuracy of the early-stage diagnosis. It leads to faster diagnostics and optimization of time-consuming analysis procedures of medical images and could be used in the field of dermatology.
Method
An automatic segmentation of boundaries of skin tumours
Distribution of melanoma thickness comparing to Breslow’s depth stage and 5 years survival rate.
Distribution of melanoma thickness comparing to Breslow’s depth stage and 5 years survival rate.The application of contours selection of ultrasonic and digital dermatoscopy skin melanoma images is presented in Figures 2–5. Ultrasonic raw B-scan images are presented in Figures 2(a) and 4(a). The results of transforming raw ultrasound B-scan and digital dermatoscopy images are shown in Figures 2–5. Meanwhile, the results of application of Gaussian smoothing and thresholding procedure as detected informative regions are shown in Figures 2(c)–5(c) images.
Figure 2.
Ultrasonic B-scan images (raw and processed) of skin melanoma, axes are in millimetres: a – ultrasonic raw B-scan image, b – binary B-scan image, c – detected informative region.
Figure 5.
Digital dermatoscopy images (raw and processed) of benign nevus, axes are in millimetres: a – raw optical image, b – binary optical image, c – detected informative region.
Figure 4.
Ultrasonic B-scan images (raw and processed) of benign nevus, axes are in millimetres: a – ultrasonic raw B-scan image, b – binary B-scan image, c – detected informative region.
Ultrasonic B-scan images (raw and processed) of skin melanoma, axes are in millimetres: a – ultrasonic raw B-scan image, b – binary B-scan image, c – detected informative region.Digital Dermatoscopy Images (raw and processed) of skin melanoma, axes are in millimetres: a – raw optical image, b – binary optical image, c – detected informative region.Ultrasonic B-scan images (raw and processed) of benign nevus, axes are in millimetres: a – ultrasonic raw B-scan image, b – binary B-scan image, c – detected informative region.Digital dermatoscopy images (raw and processed) of benign nevus, axes are in millimetres: a – raw optical image, b – binary optical image, c – detected informative region.
Quantitative parameters evaluation and selection
The purpose of this research part was to separate significant and not relevant parameters in order to increase the classification accuracy of ultrasonic and digital dermatoscopy images. The selection of parameters does not include peculiarities of vascularity structure. The thickness of analysed skin tumours in this research varies between 0.36 mm and 1.72 mm. For all B-scan ultrasonic and digital dermatoscopy images, 46 parameters of tumour structure were evaluated, such as form features and spatial region criteria, i.e. 19 parameters of ultrasonic images and 27 parameters of digital dermatoscopy images.For ultrasonic B-scan images, it was made a direct calculation of length, thickness, and spatial region, as well as other form features and relative parameters, such as maximum length, area, perimeter, average skewness (from 1000 directions), maximum skewness, the skewness of length projection, average kurtosis (from 1000 directions), maximum kurtosis, minimum kurtosis, and the kurtosis of length projection. Also the relative estimations, such as the ratio of average skewness (from 1000 directions) and maximum skewness, the ratio of average skewness (from 1000 directions) and the skewness of length projection, the ratio of maximum skewness and the skewness of length projection, the ratio of average kurtosis (from 1000 directions) and maximum kurtosis, the ratio of average kurtosis (from 1000 directions) and minimum kurtosis, the ratio of average kurtosis (from 1000 directions) and the kurtosis of length projection, the ratio of maximum kurtosis and the kurtosis of length projection, the ratio of minimum kurtosis and the kurtosis of length projection, and the ratio of perimeter and area, were made.For digital dermatoscopy images, there were estimated different shape parameters as maximum diameter, minimum diameter, perimeter, area, average skewness (from 1000 directions), maximum skewness, average kurtosis (from 1000 directions), maximum kurtosis, minimum kurtosis, and 10 deciles and the relative parameters, such as the ratio of maximum diameter and average diameter, the ratio of minimum diameter and average diameter, the ratio of maximum and minimum diameters, the ratio of average skewness (from 1000 directions) and maximum skewness, the ratio of average kurtosis (from 1000 directions) and maximum kurtosis, the ratio of average kurtosis (from 1000 directions) and minimum kurtosis, and the ratio of perimeter and area.In this study, F test was used in order to select informative parameters for discriminant analysis of malignant and benign skin tumours. The F test is closely related to the analysis of variance, which is known as a parametric statistical way to estimate the significances between two or more groups [59]. In 1987, Parasurama has shown that univariate F test could be used to assess the significant differences between two variances [60]:The results of F test depend on the degrees of freedom, which correspond to the numerator and the denominator and also depend on the level of significance. If the value of F test is less than the critical value, then there is no significant difference between variances, and the null hypothesis should be returned or otherwise, we should reject it. In such research, the univariate F-ratio test is used in order to estimate the significance of the discriminating power of all of the common variables, taken separately, excluding among and between the various sets of groups [61].Under the application of Fisher’s test (F test) used in the case of discriminant analysis and the Chi-squared test used in logistic regression model, all the estimated parameters of the 31 lesions (19 – melanoma, 12 – benign nevus) of the human tissue were clustered to significant and not relevant groups of quantitative parameters.The logistic regression model is parametric because it has a finite set of parameters. Specifically, the parameters are the regression coefficients. These correspond to one for each predictor plus a constant. The Chi-squared test (also called the Wald Chi-squared test) is a way to find out if parameters in this model are significant. The test can be used for a multitude of different models including those with binary variables or continuous variables [62].The null hypothesis for the test is: some parameter equals to some value. If the null hypothesis is rejected, it suggests that the variables in question can be removed without significant impact to the model fit [62].The Wald test statistic [62] is evaluated according to the following equation:where is maximum likelihood estimator (MLE), is …., is expected Fisher information (evaluated at the MLE).To satisfy the condition of null hypothesis, W is distributed by asymptotic Chi-square distribution together with the number of r degrees of freedom, where r indicates the rank of the parameter.For discriminant analysis model under the F test, 5 significant (informative) parameters of ultrasonic images and 17 of digital dermatoscopy images were selected. By applying Chi-squared test for logistic regression model, two significant parameters of ultrasonic images and two of digital dermatoscopy images were identified. The significant parameters of two different classification models are shown in Table 1.
Table 1.
Significant parameters evaluated from detected skin lesion region of ultrasonic and dermatoscopic images.
Discriminant analysis aodel (Fisher’s test)
Significant parameters of ultrasonic images: (n = 5/19)
Significant parameters of digital dermatoscopy images: (n = 17/27)
Maximum length, area,perimeter, the ratio of average kurtosis (1000 directions) and maximum excess,the ratio of perimeter and area.
Average diameter, minimum diameter, area, average excess (1000 directions), maximum kurtosis, the ratio of maximum diameter and average diameter, the ratio of minimum diameter and average diameter, the ratio of maximum diameter and minimum diameter, 9 of 10 deciles.
Logistic regression model (Chi-squared test)
Significant parameters of ultrasonic images: (n = 2/19)
Significant parameters of digital dermatoscopy images: (n = 2/27)
The ratio of average kurtosis (1000 directions) and maximum excess,the ratio of perimeter and area.
The ratio of minimum diameter and average diameter,2nd decile.
Significant parameters evaluated from detected skin lesion region of ultrasonic and dermatoscopic images.
The classification of tumours by analysis of quantitative parameters extracted from ultrasonic and digital dermatoscopy images
As the purpose of this research was to increase the accuracy of medical measurements, an automatic classification between benign and malignant tumours is done by using discriminant and logistic regression models. These models were chosen due to the small amount of the data, but as a result they gave significant results. Discriminant analysis model was approximated by Normal distribution with the cross-validation. In the meantime, as the other classification model, a stepwise logistic regression with cross-validation was used [63,64].For the discriminant analysis method, it is assumed that the prior probabilities of the set of observations are noted and that the group-specific densities at are evaluated, so that the probability of belonging to group , in the way of discriminant analysis, can be computed by using the statement of the theorem of Bayes [65]:Here, ԛ is a prior probability of group-specific observations, and f(x) is a group-specific density.For the purpose of classification, discriminant analysis method allows to divide p-dim vector area into separate regions, named, and the particular region is the subarea including the p-dim vectors so that is the highest across the sets. If observation is in the region so that it can be classified from group [65].Nonparametric discriminant methods depend on nonparametric evaluation of group-specific probability densities [65]. For the calculation of density to group t for all observations vector x, a defined radius r and an elaborate kernel are used.Assume that is a -dimensional vector. Here, a size of a ρ-dimensional piece of sphere bordered by ′ can be defined as [65]:Gamma function is represented by .In a set of variables t, the size of a -dimensional ellipsoid bordered by can be expressed as:In this case, a Gaussian kernel density function with the mean zero and variance is used in order to establish a nonparametric density in each set and to build up a classification parameter [65]. Gaussian kernel is calculated by using this expression:Here, .The classification of the observations is based on the set of unique densities which are calculated from the training group. After the evaluation of the group densities, the ulterior probabilities of group dependence at are also estimated. Then, the sample is assigned into group if assigned produces the highest value of [65].Another one of the prognostic models is the logistic regression prognostic model. All the observations for binary response (Y) models of an experimental or an individual observation can get one of the two possible values, for instance, if it is true and Y = 0 if it is false. Let be a vector of expository variables and is the probability of the refusal which needs to be estimated. Linear logistic model could be described as follows and is used by many researches [66-71]:Here, represents the intercept criterion and is the vector of s grade parameters.If the nominal response logistic models are being used, with the k+1 maximum number of plausible responses which do not have a natural grading, then the logistic model can be expanded to a multinomial form, as it is shown below [65]:Here, are intercept parameters, and the are vectors of slope parameters. More about the discrete choice or conditional logit models could be found in [72].
Results
Diagnostic performance rate can be introduced in terms of the ability to classify objects into clinically relevant groups, as well as it also represents the accuracy of diagnostic tools. The obtained accuracy of the estimation depends on the quality of the outcomes provided by the classification results. A receiver-operating characteristic, known as a ROC graph, is a technique for visualizing, organizing, and selecting classifiers based on their performance [73]. ROC curve informs about the degree of accuracy by showing the limits of an ability to discriminate between alternative states of health over the disease possibility. ROC methodology is based on statistical decision theory and was developed in the context of electronic signal detection and problems with radar in the early 1950s [74]. The first signs of possibilities to use ROC curve analysis in medical decision-making tools were first suggested by Lusted [75-78]. After this, researches started to use this method in medical diagnostic [79-92] and medical imaging [93-103] aspects seeking to ensure faster diagnostic and save more lives of patients. In the last two decades, the ROC curve analysis is widely used in the field of dermatology [87-92]. As the ROC curve represents the ratio of sensitivity and 1-specificity and is one of the powerful tools used to check the accuracy of the model, it was agreed to it in this experimental study also. ROC curve of the classification of melanoma and benign melanocytic nevi analysing from ultrasonic B-scan images only by using discriminant analysis is presented in Figure 6. ROC curve of indirectly combined digital dermatoscopy and ultrasonic medical images classification by using discriminant analysis is presented in Figure 7. ROC curve of the classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images only by using logistic regression is presented in Figure 8. ROC curve of indirectly combined digital dermatoscopy and ultrasonic medical images classification by using logistic regression is presented in Figure 9.
Figure 6.
ROC curve of the classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images only by using discriminant analysis.
Figure 7.
ROC curve of indirectly combined digital dermatoscopy and ultrasonic B-scan images classification by using discriminant analysis.
Figure 8.
ROC curve of the classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images only by using logistic regression.
Figure 9.
ROC curve of indirectly combined digital dermatoscopy and ultrasonic medical images classification by using logistic regression.
ROC curve of the classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images only by using discriminant analysis.ROC curve of indirectly combined digital dermatoscopy and ultrasonic B-scan images classification by using discriminant analysis.ROC curve of the classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images only by using logistic regression.ROC curve of indirectly combined digital dermatoscopy and ultrasonic medical images classification by using logistic regression.After the classification of malignant and benign tumour by analysis of quantitative parameters extracted from ultrasonic and digital dermatoscopy images, the results are presented in the Figure 10.
Figure 10.
The results of classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images and in combination with analysis of digital dermatoscopy images.
The results of classification of melanoma and benign melanocytic nevi analysing ultrasonic B-scan images and in combination with analysis of digital dermatoscopy images.In the case of discriminant analysis, the probability of correct prediction during the classification by using significant parameters of ultrasonic images is equal to 62%. Meanwhile, the probability of correct prediction during the classification by using significant parameters of indirectly combined ultrasonic and digital dermatoscopy images is also equal to 62% and the estimated area under the ROC curve is only 0.671. It means that there was no improvement made comparing the results of the discriminant analysis classification models. Looking at the outcomes of an application of stepwise logistic regression, it is obvious that the classification was improved by 12% (up to 82%) comparing with case of ultrasonic B-scan analysis only. The estimated area under the ROC curve is 0.908.
Conclusions
Within this study, an automatic statistical post-processing method for analysis of ultrasonic B-scan and digital dermatoscopy images was proposed. Method is able to estimate the set of quantitative parameters for differentiation of malignant (melanomas) and benignant (melanocytic nevus) tumours. For the analysis of region selection, an automatic segmentation of boundaries of skin tumours within ultrasonic B-scan and digital dermatoscopy images was performed. The significance of classification parameters was estimated for discriminant analysis model (Fisher’s test) and logistic regression model (Chi-squared test). For the discriminant analysis model, the sets of significant parameters of ultrasonic images (n = 5/19) and digital dermatoscopy images (n = 17/27) were estimated. In addition, for the logistic regression model, the sets of significant parameters of ultrasonic images (n = 2/19) and digital dermatoscopy images (n = 2/27) were estimated. By indirect combination of quantitative parameters estimated from ultrasonic B-scan images and digital dermatoscopy images, the probability of correct prediction by classification using the logistic regression model was improved by 12% (up to 82%) comparing with the case of ultrasonic B-scan analysis only. The estimated area under the ROC curve is 0.908. In the case of application of discriminant analysis classification models, the improvement was not obtained and the estimated area under the ROC curve is 0.671. Even though the logistic regression model has main advantages over discriminant analysis, i.e. it is more robust, it does not assume a linear relationship between the independent and dependent variables, but unfortunately, the advantages of logistic regression come at a cost: it requires much more data to achieve stable and meaningful results.One more important thing is the quality of images of skin damages which can be affected by many different factors, such as technical and software issues of ultrasonic imaging system used for examination, the improper gain adjustment of the reived ultrasonic signals (which are reflected from the superficial tissue), and issues of ultrasonic transducer positioning over the damaged region of the skin.Factors such as human factor during the examination (e.g. unpredictable lateral shift of transducer by operator) and patient inability to be in stable position during the ultrasonic examination have a big impact to the results of medical examination. The influence of biological variability also gives the effect to image quality; due to presence of similar acoustic properties (densities and ultrasound velocities) in healthy tissue and damaged tissue, the reflections of ultrasonic waves are sufficiently low amplitude giving the lower quality ultrasonic image.As a result, the proposed automatic statistical post-processing method leads to faster diagnostics by increasing the accuracy of differentiation of malignant tumours and optimization of time-consuming analysis procedures of medical images. It could be used as a reliable decision support tool in the field of dermatology.