Literature DB >> 24325128

GyneScan: an improved online paradigm for screening of ovarian cancer via tissue characterization.

U Rajendra Acharya¹, S Vinitha Sree, Sanjeev Kulshreshtha, Filippo Molinari, Joel En Wei Koh, Luca Saba, Jasjit S Suri.

Abstract

Ovarian cancer is the fifth highest cause of cancer in women and the leading cause of death from gynecological cancers. Accurate diagnosis of ovarian cancer from acquired images is dependent on the expertise and experience of ultrasonographers or physicians, and is therefore, associated with inter observer variabilities. Computer Aided Diagnostic (CAD) techniques use a number of different data mining techniques to automatically predict the presence or absence of cancer, and therefore, are more reliable and accurate. A review of published literature in the field of CAD based ovarian cancer detection indicates that many studies use ultrasound images as the base for analysis. The key objective of this work is to propose an effective adjunct CAD technique called GyneScan for ovarian tumor detection in ultrasound images. In our proposed data mining framework, we extract several texture features based on first order statistics, Gray Level Co-occurrence Matrix and run length matrix. The significant features selected using t-test are then used to train and test several supervised learning based classifiers such as Probabilistic Neural Networks (PNN), Support Vector Machine (SVM), Decision Tree (DT), k-Nearest Neighbor (KNN), and Naive Bayes (NB). We evaluated the developed framework using 1300 benign and 1300 malignant images. Using 11 significant features in KNN/PNN classifiers, we were able to achieve 100% classification accuracy, sensitivity, specificity, and positive predictive value in detecting ovarian tumor. Even though more validation using larger databases would better establish the robustness of our technique, the preliminary results are promising. This technique could be used as a reliable adjunct method to existing imaging modalities to provide a more confident second opinion on the presence/absence of ovarian tumor.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2013 PMID： 24325128 PMCID： PMC4527478 DOI： 10.7785/tcrtexpress.2013.600273

Source DB: PubMed Journal: Technol Cancer Res Treat ISSN： 1533-0338

Introduction

Nowadays, ovarian neoplasm cancers represent a significant health problem in industrialized nations with the female population that has a 2.5% lifetime chance of developing ovarian cancer (1, 2). Between the age group 55 to 74 years, more than 50% of ovarian cancer deaths happen, and around 25% of deaths occur between 35 and 54 years (3, 4). It is the fifth highest reason for cancer in women (affecting about 1 out of 70 women) and the leading cause of death (1% of all women die of it) from gynecological cancers (5). The incidence of this cancer is higher in developed countries owing to lifestyle and heredity factors (6). Many heredity factors are associated with ovarian cancer occurrence risk, in particular age (7) and the presence of harmful mutations in tumor suppressor BRCA1 or BRCA2 genes (7, 8). The rapid and precise diagnosis of cancer pathology is extremely important to offer a better survival rate to the affected women, and, in this setting, imaging analysis represents a key technique. Currently, three main techniques are used to image adnexa: Ultrasound (US), Computed Tomography (CT) and Magnetic Resonance (MR) (1, 9-12). CT, MR, and radioimmunoscintigraphy have one or more of the following limitations: cost, device availability, radiation exposure. The appearances of both the normal and cancerous ovaries on ultrasound images have been studied since the use of pelvis ultrasound (13-15). Barua et al. (15) have recently studied the feasibility of a preclinical animal model in determining the effectiveness of contrast enhanced ultrasonography in detecting early stage ovarian cancer. Transvaginal Ultrasonography (TVUS) is the first-choice technique ovarian neoplasm characterization because of the excellent temporal and spatial resolution and the absence of risk related to radiation and the administration of contrast material (16). With the introduction of TVUS and 3D-ultrasonography, the sensitivity and specificity of ultrasonography have been shown to have improved significantly (17, 18). However, the effectiveness of ultrasonography is mainly related to the level of expertise of the reader (19), and in one study, it was observed that the most experienced sonographer obtained an accuracy of 92%, and the less experienced observers had only an accuracy in the range of 82% and 87% (20). Furthermore, studies (21) have shown that the nature of benign and malignant ovarian tumors may sometimes overlap in the acquired images, and thereby, make it difficult for the ultrasonographers or physicians to detect the exact type of tumor. Such ambiguous appearances result in unnecessary biopsies, which increase cost, time, and patient anxiety. Therefore, there is a need for an adjunct modality that could provide more objective information on the nature of the tumor. Over the past few years, techniques for the Computer Aided Diagnosis (CAD) of specific pathologies have been proposed for more objective determination of the presence/ absence of disease and for the improvement of differential diagnosis of lesions (22-24). These techniques generally select features that quantify the grayscale intensity variations in the images and use them to develop classifiers that automatically detect the presence of disease. Due to the minimal involvement of human interpretation in the entire protocol, such CAD based techniques can provide objective and reproducible results. Most of CAD studies for ovarian cancer detection use features based on (a) blood test results (25) (b) Mass Spectrometry (MS) data (26-28) and (c) ultrasound images (29-31). The curse of dimensionality issue affected the MS based studies (32) as they have to study a huge amount of features extracted from a relatively small dataset. Ultrasound is currently a very commonly offered affordable technique. A literature review of ultrasound-based techniques describes that there is still room for improvement in the detection accuracy. Therefore, in this work, we have proposed a CAD technique for ovarian tumor classification in ultrasound images. Comprehensive morphological characteristics of malignant and benign tumors can be evaluated by 3D ultrasonography compared to 2D ultrasonography (33). Even though some studies have concluded that 3D ultrasound did not have a better diagnostic performance than its 2D counterpart (34, 35), few other studies have indicated that power Doppler ultrasound and the selective use of 3D ultrasonography can improve the accuracy of ovarian tumor diagnosis (36, 37). Hence, in this work, we have developed our protocol using images acquired using 3D transvaginal ultrasound.

Methods

Data

In the present work, twenty non-consecutive women with previous diagnosis of ovarian mass (10 malignant, 10 benign; nine post-menopausal, eleven pre-menopausal; age: 29 to 74 years) were evaluated. The study was approved by the Institutional Review Board and the procedure was explained to each woman before obtaining informed consent. One of the authors of this paper consecutively selected these women during presurgical evaluation. Women with no anatomopathological evaluation were excluded from the study. First these subjects were scanned by B-mode ultrasonography to study the adnexal masses. Subsequently, the imaged masses were subdivided into unilocular, multilocular, unilocular-solid, multilocularsolid or solid. The tumor vascularization was evaluated by 2D power Doppler. To minimize noise, the power Doppler setting was specifically tuned for each subject in order to obtain maximum sensitivity while avoiding artifacts. Prior to surgery, all the patients underwent 3D-transvaginal ultrasonography evaluation, and 3D volumes of the suspicious areas were acquired. Depending on the size of the volume box, the acquisition time varied between two and six seconds. In the case where more than one volume was recorded for an adnexal mass, only the first volume was used for further analysis. We wanted our database to contain 1300 benign and 1300 malignant images to build and evaluate the classifiers. Therefore, we selected the middle 130 images from each 3D volume acquired from each of the 10 benign and 10 malignant subjects, thus making our database to have 1300 malignant and 1300 benign images. To obtain the Region of Interest (ROI), the image was cropped automatically using the horizontal and vertical gradients to detect the boundaries of the black frame border around the image. Subsequently, we captured images of the size of 256 × 256 and a gynecologist and radiologist marked out the squared ROI from individual cropped images. Figure 1 depicts a few examples of ultrasound images of benign and malignant ovarian tumors.

Figure 1:

Sample ultrasound images of (A) benign ovarian tumor (upper panels) (B) malignant ovarian tumor (bottom panels).

Overall GyneScan Architecture

Our proposed system for ovarian tumor classification GyneScan is presented in Figure 2. The on-line classification system part of the figure indicates the steps in processing a test/new patient image. This system determines the class of the test image (benign/malignant) by using the features extracted from the test image in the classifiers that have already been trained by using the training parameters assessed by the off-line learning system. The off-line classification system evaluates the training parameters of the classifiers by using the combination of the features extracted from the training dataset and the corresponding ground truth training class labels. In this work, we developed and evaluated the following classifiers: Probabilistic Neural Networks (PNN), Support Vector Machine (SVM), Decision Tree (DT), ^-Nearest Neighbor (KNN), and Naive Bayes (NB) using stratified ten-fold cross-validation. By comparing the predicted class labels of the test images and the corresponding ground truth labels, various performance measures (accuracy, sensitivity, specificity, and Positive Predictive Value (PPV)) were calculated for each classifier.

Figure 2:

Block diagram of the proposed system GyneScan™ for ovarian tumor detection.

Texture based Features Extraction

We used Gray Level Co-occurrence Matrix (GLCM) (38) and the run length matrix (39) texture methods for feature extraction. Let the image be represented by a M × N gray-scale matrix I(i, j), where each element of the matrix indicates the intensity of a single pixel in the image. The co-occurrence matrix C(i, j | Δx, Δy) is the second-order probability function estimation. This matrix denotes the rate of occurrence of a pixel pair with gray levels i and j, given the distances between the pixels are Δ and Δ in the and directions, respectively. The co-occurrence matrix C(i, j | Δ, Δ) is defined as where (p, q), (p + Δx, q + Δy) ∈ M × N, d = (Δx, Δy) and | ʘ | denotes the cardinality of a set. The probability that a gray level pixel i is at a distance (Δx, Δy) away from the gray level pixel j is given by The following features were computed from the co-occurrence matrix: First order statistical features: Based on the first order statistics, five features were extracted from the pre-processed fundus image f(x, y). They are mean, variance, skewness, kurtosis and energy. Table I presents the description of these features.

Table I

Definition of first order statistical features.

S. No.	Features	Description
1	Mean (m)	m=∑x=1M∑y=1Nf(x,y)M×N
2	Variance (σ2)	σ2=∑x=1M∑y=1N{f(x,y)−m2}M×N
3	Skewness (S_k)	Sk=1M×N∑x=1M∑y=1N{f(x,y)−m}3σ3
4	Kurtosis (K_t)	Kt=1M×N∑x=1M∑y=1N{f(x,y)−m}4σ4
5	Energy (E)	E=∑x=1M∑y=1Nf(x,y)2

Definition of first order statistical features. GLCM based textural features: Let I(i, j) denote the original fundus image (normal or abnormal) and let the image have distinct gray level intensities. Firstly, we calculated the GLCM of order N × N, where N refers the number of gray levels. An element of the GLCM matrix (i, j, d, θ) is defined as the joint probability of the gray levels I and j separated by distance d and along direction θ. To reduce the computation, we have used θ as 0°, 45°, 90°, and 135°, and d is defined as the Manhattan or city block distance based on this GLCM. These features are mathematically defined as shown in Table II.

Table II

Description of GLCM based textural features.

S. No.	Haralick feature	Description
1	Contrast	Icon=∑n=0N-1n2{∑i=0N∑j=0NP(i,j)}
2	Autocorrelation	Iautocor=∑i=0N-1∑j=0N-1(ij)P(i,j)
3	Maximum probability	Imprb=∑i=0N-1∑j=0N-1max P(i,j)
4	Dissimilarity	Idsmlrt=∑i=0N-1∑j=0N-1\|i−j\| P(i,j)
5	Homogeneity	Ihmg=∑i=0N-1∑j=0N-111+(i−j)2 P(i,j)
6	Entropy	IEntr=−∑i=0N-1∑j=0N-1P(i,j)log(P(i,j))
7	Energy	IEnrg=∑i=0N-1∑j=0N-1P(i,j)2
8	Correlation	Icor=∑i=0N−1∑j=0N−1(i,j)P(i,j)−μxμyσxσywhere σ_x,σ_y, μ_x, μ_y are the standard deviations and means of P_x, P_y · P_x, P_y are the partial probability density functions. p_x(i) = i^th entry in the marginal-probability matrix obtained by summing the rows of P(i, j)
9	Cluster shade	Iclsh=∑i=0N−1∑j=0N−1{i+j−μx−μy}3×P(i,j)
10	Variance	Ivarinance=∑i=0N−1∑j=0N−1(i−μ)2log (P(i,j))
11	Sum average	Isave=∑i=22NiPx+y(i)
12	Sum entropy	Isentr=−∑i=22NPx+y(i)log{Px+y(i)}
13	Sum variance	Isvar=∑i=22N(i−Isentr)2Px+y(i)
14	Difference variance	Idvar=∑i=22N(i−Isavg)2P(x+y)(i)
15	Difference entropy	Identr=−∑i=2N-1Px−y(i)log{Px−y(i)}
16	Information correlation measure 1	IIMC1=HXY-HXY1max(HX-HY)
17	Information correlation measure 2	IIMC2=1−exp[−2(HXY2−HXY)]where HX and HY are the entropies for P_x and P_y HX=−∑i=0N−1Px(i)(log(Px(i))) HY=−∑j=0N−1Py(i)(log(Py(i))) HXY=−∑i,j=0N−1P(i,j)(log(P(i,j))) HXY1=−∑i,j=0N−1P(i,j)log(Px(i)Py(j)) HXY2=−∑i,j=0N−1=0Px(i)Py(j)log(Px(i)Py(j))

Description of GLCM based textural features. Run length matrix based texture features: Galloway (39) observed that in coarse texture, long gray level runs may be exist more frequently as compared to fine texture which generally contains short runs. Galloway (39) studied the application of run length matrix for texture feature extraction. Run length matrix, R(i, j), records the frequency that j points with a gray level i continue in the direction θ. Here, we consider the run lengths matrices for angles θ = 0°, 45°, 90°, 135°. The following features, shown in Table III, were calculated from the run length matrix.

Table III

Description of run length matrix based textural features.

S.No	Feature	Description
1	Short Run Emphasis (SRE)	SRE=∑i=1Ng∑j=1NrR(i,j)j2∑i=1Ng∑j=1NrR(i,j)
2	Long Run Emphasis (LRE)	LRE=∑i=1Ng∑j=1Nrj2R(i,j)∑i=1Ng∑j=1NrR(i,j)
3	Gray-level Non-uniformity (GLNU)	GLNU=∑i=1Ng(∑j=1NrR(i,j))2∑i=1Ng∑j=1NrR(i,j)
4	Run length Non-uniformity (RLNU)	RLNU=∑j=1Nr(∑i=1NgR(i,j))2∑i=1Ng∑j=1NrR(i,j)
5	Run Percentage (RP)	RP=∑i=1Ng∑j=1NrR(i,j)P
6	Low Gray-level Run Emphasis (LGRE)	LGRE=∑i=1Ng∑j=1NrR(i,j)i2∑i=1Ng∑j=1NrR(i,j)
7	High Gray-level Run Emphasis (HGRE)	HGRE=∑i=1Ng∑j=1NrR(i,j)⋅i2∑i=1Ng∑j=1NrR(i,j)
8	Short Run Low Gray-level Run Emphasis (SRLGE)	SRLGE=∑i=1Ng∑j=1NrR(i,j)i2⋅j2∑i=1Ng∑j=1NrR(i,j)
9	Short Run High Gray-level Run Emphasis (SRHGE)	SRHGE=∑i=1Ng∑j=1NrR(i,j)⋅i2j2∑i=1Ng∑j=1NrR(i,j)
10	Long Run Low Gray-level Run Emphasis (LRLGE)	LRLGE=∑i=1Ng∑j=1NrR(i,j)⋅j2i2∑i=1Ng∑j=1NrR(i,j)
11	Long Run High Gray-level Run Emphasis (LRHGE)	LRHGE=∑i=1Ng∑j=1NrR(i,j)⋅j2⋅i2∑i=1Ng∑j=1NrR(i,j)

Description of run length matrix based textural features.

Classifiers

Support Vector Machine (SVM): It is an efficient classifier especially for the data distributed in higher dimensions. It works by linearly separating two data points belonging to two different classes by a hyperplane (40). Non-linear classification can be performed using kernel functions. It can directly solve two class problems but multi-class solution can also be obtained by breaking them into several two class problems. We have used the linear kernel, quadratic kernel, polynomial kernel of order 1, 2, and 3 and the Radial Basis Function (RBF) kernels in this work. Decision Tree (DT): Computationally cheap and user friendly decision trees have been used. These are one of the easiest supervised learners which follow tree structure for depicting decisions (41). Every parent node in the tree is an objective node which branches into child nodes as either a decision of belongingness of data or another objective node or both. A statistical property called information gain is calculated which is a measure of separation between training examples and target classification. Nearest neighbor is a non-parametric algorithm. In this algorithm it is assumed that a test observation closer to a trained labeled data should have same belongingness (42). The closeness is calculated by distance metrics. In KNN, ‘k’ implies the number of observations near to the test point. The value k should be tactically selected and it should be small enough to contain only relevant data points and large enough to not miss any data points which would decide its belongingness to a class. This classifier can perform well even with lesser training data. Naive Bayes (NB): Bayes’ rule says that posterior probability is proportional to prior probability times likelihood (43). Nai've Bayes algorithm is based on the Bayes’ rule but here it is assumed that features are independent of each other i.e. presence of one feature is totally independent of presence of another feature. Even maximum likelihood is also used for parameter estimation in several applications. As they are based on probabilistic model they are very good supervised learners and can be trained even with lesser data. Probabilistic Neural Network (PNN): It is a feed-forward network of multiple layers where the input layer, pattern layer, summation/category layer and output layer are arranged sequentially to receive inputs from previous layer and forward the output to the input of next layer (44). Every input in the input layer is fed to every node in the pattern layer; here, unlike common back-propagation algorithm where sigmoid function is used for activation, a non-linear function is used.

Feature Selection, Classification, Probabilistic Neural Network (PNN) Parameter Tuning and Genetic Algorithm

We used the Maximum Relevance Minimum Redundancy (mRMR) - Mutual Information Quotient (MIQ) method as the feature selection method. This technique relates the highest relevance of a feature to its class (45). It does that by determining mutual information (a statistical measure) between (a) target feature and its class, which should be maximized for class determination and (b) between two features, which should be minimized to remove information redundancy. They together are known as mRMR. A difference operator called Mutual Information Quotient (MIQ) is introduced to optimize both the relevance and redundancy values. The extracted features were evaluated and further selected using student’s t-test, which was used to assess whether the means of a feature in two groups are statistically different from each other by comparing with p-values at less than 0.05 which were considered clinically significant. The classifier robustness was evaluated using ten-fold cross validation technique.

Results

Selected Features

In our work, 40 out of 42 extracted texture features were clinically significant (p < 0.0001). Table IV also shows the rank of each feature (mean and standard deviation) using the mRMR-MIQ feature selection method.

Table IV

Results of (Mean ± SD) for various features extracted.

		Benign	Malignant
	Rank of feature using mRMR-MIQ	Mean ± SD	Mean ± SD	p-value
Autocorrelation	1	18.962 ± 4.132	18.030 ± 3.516	<0.0001
Homogeneity 90	27	0.705 ± 0.054	0.726 ± 0.066	<0.0001
Dissimilarity	3	0.799 ± 0.180	0.720 ± 0.209	<0.0001
Max probability	2	0.151 ± 0.122	0.179 ± 0.150	<0.0001
Contrast 0	13	0.930 ± 0.313	0.813 ± 0.301	<0.0001
Information correlation measure 2	12	0.801 ± 0.065	0.829 ± 0.068	<0.0001
Sum variance	8	43.727 ± 9.655	41.503 ± 7.589	<0.0001
Cluster shade	5	12.815 ± 19.309	20.148 ± 29.866	<0.0001
Correlation 90	19	0.799 ± 0.080	0.831 ± 0.087	<0.0001
Energy 0	21	0.076 ± 0.051	0.092 ± 0.090	<0.0001
Energy 135	24	0.061 ± 0.050	0.079 ± 0.091	<0.0001
Energy 90	23	0.067 ± 0.050	0.084 ± 0.091	<0.0001
Skewness	29	0.264 ± 0.326	0.333 ± 0.362	<0.0001
Homogeneity 45	26	0.661 ± 0.063	0.687 ± 0.076	<0.0001
Energy 45	22	0.061 ± 0.050	0.079 ± 0.091	<0.0001
Run length non-uniformity	35	3538.049 ± 981.493	3056.297 ± 1039.805	<0.0001
Short run low gray-level run emphasis	38	0.103 ± 0.117	0.111 ± 0.102	0.047
Variance	31	4600.694 ± 613.812	4817.160 ± 714.229	<0.0001
Kurtosis	30	2.240 ± 0.413	2.292 ± 0.457	0.002
Long run high gray-level run emphasis	40	6240082.394 ± 10506429.767	12769818.857 ± 12309279.552	<0.0001
Gray-level non-uniformity	33	14966.417 ± 10250.377	24261.805 ± 15036.827	<0.0001
Run percentage	34	0.629 ± 0.148	0.567 ± 0.148	<0.0001
High gray-level run emphasis	37	3369.684 ± 2831.230	5109.719 ± 2934.456	<0.0001
Low gray-level run emphasis	36	3.714 ± 7.851	5.168 ± 6.819	<0.0001
Short run emphasis	32	0.768 ± 0.050	0.747 ± 0.065	<0.0001
Long run low gray-level run emphasis	39	9904.270 ± 27160.338	14878.838 ± 23762.561	<0.0001
Entropy	4	3.320 ± 0.323	3.212 ± 0.432	<0.0001
Sum average	6	7.877 ± 1.231	7.580 ± 1.145	<0.0001
Sum entropy	7	2.520 ± 0.162	2.487 ± 0.253	<0.0001
Difference variance	9	1.550 ± 0.487	1.330 ± 0.535	<0.0001
Difference entropy	10	1.158 ± 0.135	1.088 ± 0.181	<0.0001
Information correlation measure 1	11	-0.292 ± 0.092	-0.336 ± 0.111	<0.0001
Contrast 45	14	1.886 ± 0.600	1.615 ± 0.659	<0.0001
Contrast 90	15	1.485 ± 0.488	1.273 ± 0.546	<0.0001
Contrast 135	16	1.899 ± 0.593	1.619 ± 0.655	<0.0001
Correlation 0	17	0.875 ± 0.049	0.893 ± 0.050	<0.0001
Correlation 45	18	0.745 ± 0.098	0.786 ± 0.106	<0.0001
Correlation 135	20	0.744 ± 0.097	0.785 ± 0.106	<0.0001
Homogeneity 0	25	0.753 ± 0.049	0.772 ± 0.057	<0.0001
Homogeneity 135	28	0.661 ± 0.062	0.687 ± 0.075	<0.0001

Results of (Mean ± SD) for various features extracted.

Classification Results

To evaluate the classifiers, we used ten-fold stratified cross validation technique. The entire dataset (1300 benign and 1300 malignant) was divided into ten equal groups, with each group containing the equal number of images from each class. During the first trial, nine groups were used to train the classifier and the remaining one part was used to test the classifiers and to obtain the performance measures. This procedure was repeated nine more times by using a different test set each time. The averages of the performance metrics (sensitivity, specificity, diagnostic accuracy, and PPV) obtained in all the iterations are reported as the overall performance metrics (Table V). It is evident from Table V that among all the classifiers, the PNN and KNN classifiers presented 100% average accuracy, sensitivity, specificity, and PPV using only 11 significant features.

Table V

Results of average accuracy, sensitivity, specificity and PPV for various classifiers.

Classifiers	No. of features	Accuracy (%)	PPV (%)	Sensitivity (%)	Specificity (%)
SVM, RBF	31	100.00	100.00	100.00	100.00
SVM, linear	40	84.73	87.59	81.00	88.46
SVM, quadratic	38	100.00	100.00	100.00	100.00
SVM, poly3	15	100.00	100.00	100.00	100.00
Decision tree	22	98.54	98.92	98.15	98.92
KNN	11	100.00	100.00	100.00	100.00
Naïve bayes	3	67.35	69.93	60.62	74.08
PNN	11	100.00	100.00	100.00	100.00

Results of average accuracy, sensitivity, specificity and PPV for various classifiers.

Discussion

Besides ultrasonography, another most commonly used technique for detecting ovarian cancer is to determine the levels of a tumor marker called Cancer-Antigen 125 (CA125). However, CA125 marker has been found to be elevated only in 50% of stage 1 cancers (46), and also CA125 can be increased in pancreatic and uterine malignancies, and frequently in benign conditions also (47). There is limited literature on CAD based studies for ovarian tumor classification. In Table VI, we present a summary of the findings of these published studies. It can be seen that the MS based studies (26-28, 48, 49) have resulted in high accuracies. However, they are limited by the cost and availability of the data analysis equipment. Menon (50) examined women with elevated CA125 levels and concluded that sensitivity of ultrasound reading can be increased by the usage of ovarian morphology and PPV can be increased by the use of complex ovarian morphology. Tailor et al. (51) and Biagiotti et al. (29) used operator suggested features (Table VI), and hence, are subjective in nature (features). The techniques developed by Zimmer et al. (31) and Lucidarme et al. (30) presented accuracies of only 70% and 91.73%, respectively.

Table VI

Summary of results of CAD based studies for ovarian tumor classification.

Literature	No. of samples	Features	Classifier	Performance
Renz et al. (25)	Benign, early stage and late stage cancers (55 cases)	Blood test data and age	Multilayer perceptron	Accuracy: 92.9%
Assareh and Moradi (48)	Dataset 1:91 normal, 162 cancersDataset 2:100 normal, 16 benign and 100 cancers	Three significant biomarkers from protein mass spectra	Two fuzzy linguistic rules	Dataset 1: Accuracy: 100%Dataset 2: Accuracy: 86.36%
Tan et al. (26)	24 normal, 30 cancers	DNA micro-array, blood test, and proteomics data	Complementary Learning Fuzzy Neural Network	Accuracy: 84.72%
Tang et al. (27)	95 normal, 121 cancers	Four statistical moments (mean, variance, skewness and kurtosis) obtained from SELDI-TOF mass spectroscopy data	Kernel partial least square classifier	Accuracy: 99.35%Sensitivity: 99.5%Specificity: 99.16%
Petricoin (28)	66 benign, 50 cancers	Proteomic spectra	Genetic algorithm with self organizing cluster analysis	Sensitivity: 100%Specificity: 95%
Tailor et al. (51)	52 benign, 15 cancers	Clinical and ultrasound based variables from TVUS images	Back propagation neural network	Sensitivity: 100%Specificity: 98.1%
Biagiotti et al. (29)	175 benign, 51 cancers	Age and parameters from TVUS images	Three layer back propagation network	Sensitivity: 96%
Zimmer et al. (31)	-	B-scan ultrasound images	Morphological Analysis	Accuracy: 70%
Lucidarme et al. (30)	234 benign, 141 cancers	Quantification of tissue disorganization in backscattered ultrasound (3D TVUS)	Ovarian HistoScanning (OHS) system	Sensitivity: 98%Specificity: 88%Accuracy: 91.73%
Acharya et al. (52)	1000 benign, 1000 cancers	Local Binary Pattern 1 Law’s Mask Energy	SVM classifier	Sensitivity: 100%Specificity: 99.8%Accuracy: 99.9%
Acharya et al. (53)	1300 benign, 1300 cancers	Hu’s invariant moments 1 Gabor wavelet features 1 Entropies	PNN classifier, tuned with genetic algorithm	Sensitivity: 99.2%Specificity: 99.6%Accuracy: 99.8%
Acharya et al. (54)	1000 benign, 1000 cancers	Texture and higher-order spectra based features	DT classifier	Sensitivity: 94.3%Specificity: 99.7%Accuracy: 97.0%
Proposed method	1300 benign, 1300 cancers	Features based on first order statistics, GLCM and run length matrix	KNN/PNN classifiers	Sensitivity: 100%Specificity: 100%Accuracy: 100%

Summary of results of CAD based studies for ovarian tumor classification. Recently, our group (52) presented a classification model to automatically discriminate the malignant and benign ovarian tumors in ultrasound images. We used texture features based on Laws Texture Energy and Local Binary Patterns extracted from 1000 benign and 1000 malignant images in a SVM classifier, and obtained an accuracy of 99.9%, sensitivity of 100% and specificity of 99.8% using 2000 ultrasound images. In another study (53), we extracted Hu’s invariant moments, Gabor transform parameters and entropies from 1300 benign and 1300 malignant ovarian tumors. Significant features were fed to the PNN classifier fine-tuned by genetic algorithm (GA) achieved an average classification accuracy of 99.8%, sensitivity of 99.2% and specificity of 99.6% at σ = 0.264. In our last study (54), we extracted features based on the textural changes and higher-order spectra from 1000 images in each category (benign and malignant), and used them in a DT classifier. An accuracy of 97%, sensitivity of 94.3%, and specificity of 99.7% was obtained. After evaluating a variety of features that quantify the gray-level intensity variations in the ultrasound images, we concluded that there is still room for improvement in the accuracy. Therefore, we studied texture features based on first order statistics, GLCM and run length matrices in this work, and using 11 significant features in KNN/PNN classifiers, we were able to achieve 100% classification accuracy in detecting ovarian tumor. The following are some key features of our proposed technique: Since the proposed GyneScan algorithm is automated, the final diagnosis result is objective and does not require specific training or expertise to understand the end-results. Due to the use of a large sample size (2600 images) for the training and evaluation of classifiers, and also because of the use of stratified cross validation technique for data resampling, the classifiers are generalized to effectively handle new images. The accuracy was obtained using only 11 features, and hence there is no problem of curse of dimensionality that is an issue for MS data. The GyneScan system can be easily deployed on any computer and does not require expensive software. Since the algorithm works on ultrasound images which are now commonly acquired and affordable, the over-all set-up and use of the proposed system is cost-effective. Besides the afore-mentioned advantages, the key finding in this preliminary study is the algorithm’s capability to detect ovarian tumor with a high accuracy of 100%. On the limitations side, we understand the need for more validation using larger databases to establish the accuracy of the proposed CAD algorithm. Moreover, we propose to continue this study to 3D, where we use the spatial information of the 3D slices taken from a single patient for further analysis.

Conclusion

In our earlier studies in the area of CAD based ovarian tumor classification, we found that the classification accuracy could be further improved. Therefore, in this work, we have proposed another CAD technique GyneScan that successfully captures the subtle variations in the gray-level intensity variations in the ultrasound images of benign and malignant ovarian tumors using several texture features based on first order statistics, Gray Level Co-occurrence Matrix and run length matrix. On using 11 significant features extracted from 1300 benign and 1300 malignant images to train/test KNN/ PNN classifiers, we were able to achieve 100% classification accuracy, sensitivity, specificity, and positive predictive value. Thus, the proposed technique could be a more objective adjunct method to detect the presence/absence of ovarian tumor.

42 in total

1. Three-dimensional ultrasonographic evaluation of ovarian tumours: a preliminary study.

Authors: T Hata; T Yanagihara; K Hayashi; C Yamashiro; Y Ohnishi; M Akiyama; A Manabe; K Miyazaki
Journal: Hum Reprod Date: 1999-03 Impact factor: 6.918

2. Ovarian cancer: relevant therapy, not timing, is paramount.

Authors: Robert T Morris; Bradley J Monk
Journal: Lancet Date: 2010-10-02 Impact factor: 79.321

3. Ovarian tumor characterization using 3D ultrasound.

Authors: U Rajendra Acharya; S Vinitha Sree; M Muthu Rama Krishnan; Luca Saba; Filippo Molinari; Stefano Guerriero; Jasjit S Suri
Journal: Technol Cancer Res Treat Date: 2012-07-10

4. Comparison of 2-dimensional and 3-dimensional power-Doppler imaging in complex adnexal masses for the prediction of ovarian cancer.

Authors: Juan Luis Alcázar; Gerardo Castillo
Journal: Am J Obstet Gynecol Date: 2005-03 Impact factor: 8.661

5. Ovarian tumor characterization and classification using ultrasound-a new online paradigm.

Authors: U Rajendra Acharya; S Vinitha Sree; Luca Saba; Filippo Molinari; Stefano Guerriero; Jasjit S Suri
Journal: J Digit Imaging Date: 2013-06 Impact factor: 4.056

6. The diagnosis of ovarian cancer: is color Doppler imaging reproducible and accurate in examiners with different degrees of experience?

Authors: Stefano Guerriero; Juan Luis Alcazar; Maria Angela Pascual; Silvia Ajossa; Betlem Graupera; Lourdes Hereter; Gian Benedetto Melis
Journal: J Womens Health (Larchmt) Date: 2011-01-25 Impact factor: 2.681

7. Use of proteomic patterns in serum to identify ovarian cancer.

Authors: Emanuel F Petricoin; Ali M Ardekani; Ben A Hitt; Peter J Levine; Vincent A Fusaro; Seth M Steinberg; Gordon B Mills; Charles Simone; David A Fishman; Elise C Kohn; Lance A Liotta
Journal: Lancet Date: 2002-02-16 Impact factor: 79.321

8. Characterization of single thyroid nodules by contrast-enhanced 3-D ultrasound.

Authors: Filippo Molinari; Alice Mantovani; Maurilio Deandrea; Paolo Limone; Roberto Garberoglio; Jasjit S Suri
Journal: Ultrasound Med Biol Date: 2010-10 Impact factor: 2.998

9. Intraobserver and interobserver agreement of grayscale typical ultrasonographic patterns for the diagnosis of ovarian cancer.

Authors: Stefano Guerriero; Juan Luis Alcazar; Maria Angela Pascual; Silvia Ajossa; Marta Gerada; Roberta Bargellini; Bruna Virgilio; Gian Benedetto Melis
Journal: Ultrasound Med Biol Date: 2008-06-04 Impact factor: 2.998

Review 10. Screening for ovarian cancer.

Authors: Cleola Anderiesz; Michael A Quinn
Journal: Med J Aust Date: 2003-06-16 Impact factor: 7.738

13 in total

1. Wilson disease tissue classification and characterization using seven artificial intelligence models embedded with 3D optimization paradigm on a weak training brain magnetic resonance imaging datasets: a supercomputer application.

Authors: Mohit Agarwal; Luca Saba; Suneet K Gupta; Amer M Johri; Narendra N Khanna; Sophie Mavrogeni; John R Laird; Gyan Pareek; Martin Miner; Petros P Sfikakis; Athanasios Protogerou; Aditya M Sharma; Vijay Viswanathan; George D Kitas; Andrew Nicolaides; Jasjit S Suri
Journal: Med Biol Eng Comput Date: 2021-02-05 Impact factor: 2.602

Review 2. Multimodality carotid plaque tissue characterization and classification in the artificial intelligence paradigm: a narrative review for stroke application.

Authors: Luca Saba; Skandha S Sanagala; Suneet K Gupta; Vijaya K Koppula; Amer M Johri; Narendra N Khanna; Sophie Mavrogeni; John R Laird; Gyan Pareek; Martin Miner; Petros P Sfikakis; Athanasios Protogerou; Durga P Misra; Vikas Agarwal; Aditya M Sharma; Vijay Viswanathan; Vijay S Rathore; Monika Turk; Raghu Kolluri; Klaudija Viskovic; Elisa Cuadrado-Godia; George D Kitas; Neeraj Sharma; Andrew Nicolaides; Jasjit S Suri
Journal: Ann Transl Med Date: 2021-07

Review 3. Cardiovascular/Stroke Risk Assessment in Patients with Erectile Dysfunction-A Role of Carotid Wall Arterial Imaging and Plaque Tissue Characterization Using Artificial Intelligence Paradigm: A Narrative Review.

Authors: Narendra N Khanna; Mahesh Maindarkar; Ajit Saxena; Puneet Ahluwalia; Sudip Paul; Saurabh K Srivastava; Elisa Cuadrado-Godia; Aditya Sharma; Tomaz Omerzu; Luca Saba; Sophie Mavrogeni; Monika Turk; John R Laird; George D Kitas; Mostafa Fatemi; Al Baha Barqawi; Martin Miner; Inder M Singh; Amer Johri; Mannudeep M Kalra; Vikas Agarwal; Kosmas I Paraskevas; Jagjit S Teji; Mostafa M Fouda; Gyan Pareek; Jasjit S Suri
Journal: Diagnostics (Basel) Date: 2022-05-17

4. Ultrasonography in the Diagnosis of Adnexal Lesions: The Role of Texture Analysis.

Authors: Paul-Andrei Ștefan; Roxana-Adelina Lupean; Carmen Mihaela Mihu; Andrei Lebovici; Mihaela Daniela Oancea; Liviu Hîțu; Daniel Duma; Csaba Csutak
Journal: Diagnostics (Basel) Date: 2021-04-29

5. A Novel Block Imaging Technique Using Nine Artificial Intelligence Models for COVID-19 Disease Classification, Characterization and Severity Measurement in Lung Computed Tomography Scans on an Italian Cohort.

Authors: Mohit Agarwal; Luca Saba; Suneet K Gupta; Alessandro Carriero; Zeno Falaschi; Alessio Paschè; Pietro Danna; Ayman El-Baz; Subbaram Naidu; Jasjit S Suri
Journal: J Med Syst Date: 2021-01-26 Impact factor: 4.460

Review 6. Cardiovascular/Stroke Risk Stratification in Parkinson's Disease Patients Using Atherosclerosis Pathway and Artificial Intelligence Paradigm: A Systematic Review.

Authors: Jasjit S Suri; Sudip Paul; Maheshrao A Maindarkar; Anudeep Puvvula; Sanjay Saxena; Luca Saba; Monika Turk; John R Laird; Narendra N Khanna; Klaudija Viskovic; Inder M Singh; Mannudeep Kalra; Padukode R Krishnan; Amer Johri; Kosmas I Paraskevas
Journal: Metabolites Date: 2022-03-31

Review 7. COVID-19 pathways for brain and heart injury in comorbidity patients: A role of medical imaging and artificial intelligence-based COVID severity classification: A review.

Authors: Jasjit S Suri; Anudeep Puvvula; Mainak Biswas; Misha Majhail; Luca Saba; Gavino Faa; Inder M Singh; Ronald Oberleitner; Monika Turk; Paramjit S Chadha; Amer M Johri; J Miguel Sanches; Narendra N Khanna; Klaudija Viskovic; Sophie Mavrogeni; John R Laird; Gyan Pareek; Martin Miner; David W Sobel; Antonella Balestrieri; Petros P Sfikakis; George Tsoulfas; Athanasios Protogerou; Durga Prasanna Misra; Vikas Agarwal; George D Kitas; Puneet Ahluwalia; Raghu Kolluri; Jagjit Teji; Mustafa Al Maini; Ann Agbakoba; Surinder K Dhanjil; Meyypan Sockalingam; Ajit Saxena; Andrew Nicolaides; Aditya Sharma; Vijay Rathore; Janet N A Ajuluchukwu; Mostafa Fatemi; Azra Alizad; Vijay Viswanathan; Pudukode R Krishnan; Subbaram Naidu
Journal: Comput Biol Med Date: 2020-08-14 Impact factor: 4.589

8. Ultrasonography in the Differentiation of Endometriomas from Hemorrhagic Ovarian Cysts: The Role of Texture Analysis.

Authors: Roxana-Adelina Ștefan; Paul-Andrei Ștefan; Carmen Mihaela Mihu; Csaba Csutak; Carmen Stanca Melincovici; Carmen Bianca Crivii; Andrei Mihai Maluțan; Liviu Hîțu; Andrei Lebovici
Journal: J Pers Med Date: 2021-06-28

Review 9. Bias Investigation in Artificial Intelligence Systems for Early Detection of Parkinson's Disease: A Narrative Review.

Authors: Sudip Paul; Maheshrao Maindarkar; Sanjay Saxena; Luca Saba; Monika Turk; Manudeep Kalra; Padukode R Krishnan; Jasjit S Suri
Journal: Diagnostics (Basel) Date: 2022-01-11

10. COVLIAS 1.0 vs. MedSeg: Artificial Intelligence-Based Comparative Study for Automated COVID-19 Computed Tomography Lung Segmentation in Italian and Croatian Cohorts.

Authors: Jasjit S Suri; Sushant Agarwal; Alessandro Carriero; Alessio Paschè; Pietro S C Danna; Marta Columbu; Luca Saba; Klaudija Viskovic; Armin Mehmedović; Samriddhi Agarwal; Lakshya Gupta; Gavino Faa; Inder M Singh; Monika Turk; Paramjit S Chadha; Amer M Johri; Narendra N Khanna; Sophie Mavrogeni; John R Laird; Gyan Pareek; Martin Miner; David W Sobel; Antonella Balestrieri; Petros P Sfikakis; George Tsoulfas; Athanasios Protogerou; Durga Prasanna Misra; Vikas Agarwal; George D Kitas; Jagjit S Teji; Mustafa Al-Maini; Surinder K Dhanjil; Andrew Nicolaides; Aditya Sharma; Vijay Rathore; Mostafa Fatemi; Azra Alizad; Pudukode R Krishnan; Ferenc Nagy; Zoltan Ruzsa; Archna Gupta; Subbaram Naidu; Kosmas I Paraskevas; Mannudeep K Kalra
Journal: Diagnostics (Basel) Date: 2021-12-15