| Literature DB >> 26153161 |
Saeedeh Pourahmad1, Mohsen Azad, Shahram Paydar.
Abstract
To diagnose the malignancy in thyroid tumor, neural network approach is applied and the performances of thirteen batch learning algorithms are investigated on accuracy of the prediction. Therefore, a back propagation feed forward neural networks (BP FNNs) is designed and three different numbers of neuron in hidden layer are compared (5, 10 and 20 neurons). The pathology result after the surgery and clinical findings before surgery of the patients are used as the target outputs and the inputs, respectively. The best algorithm(s) is/are chosen based on mean or maximum accuracy values in the prediction and also area under Receiver Operating Characteristic Curve (ROC curve). The results show superiority of the network with 5 neurons in the hidden layer. In addition, the better performances are occurred for Polak-Ribiere conjugate gradient, BFGS quasi-newton and one step secant algorithms according to their accuracy percentage in prediction (83%) and for Scaled Conjugate Gradient and BFGS quasi-Newton based on their area under the ROC curve (0.905).Entities:
Mesh:
Year: 2015 PMID: 26153161 PMCID: PMC4803901 DOI: 10.5539/gjhs.v7n6p46
Source DB: PubMed Journal: Glob J Health Sci ISSN: 1916-9736
General characteristics of FNNs in present study
| Hidden layers number | 1 |
| Neurons in hidden | 5, 10, 20 |
| layer Neurons in output layer (decision classes) | 1 |
| Inputs number (quantitative and indicator variables) | 10 |
| Learning algorithm | 13 mentioned batch learning method |
| Learning Rate | 0.1 |
| (difference between two adjacent error components) | 0.05 |
| Number of tours | 20 |
| Max Iterations | 5000 |
| Validation method | 10-fold |
| The objective function | MSE (Mean squared error) |
| Size of training set | 276(80%) |
| Size of validation set | 69(20%) |
Description of patients’ attributes
| Attributes | Statistical description |
|---|---|
| No. (%) | |
| Gender | |
| Man | 66(19.1) |
| Woman | 279(80.9) |
| Having multiple nodules | |
| Yes | 182(52.8) |
| No | 163(47.2) |
| Having fast growth of thyroid gland | |
| Yes | 251(72.8) |
| No | 94(27.2) |
| Family history of thyroid disease | |
| Yes | 60(17.4) |
| No | 285(82.6) |
| Family history of cancer in general | |
| Yes | 61(17.7) |
| No | 284(82.3) |
| FNA test result | |
| Benign | 173(50.1) |
| Malignant | 172(49.9) |
| Mean (SD) | |
| Age (year) | 40.9 (13.4) |
| Maximum size of the right thyroid gland (cm) | 5.2(2.7) |
| Maximum size of the left thyroid gland (cm) | 4.7(2.7) |
| Maximum size of nodules in the right thyroid gland (cm) | 1.1(1.5) |
| Maximum size of nodules in the left thyroid gland (cm) | 1(1.6) |
| Duration of disease (year) | 4.2(3.3) |
| No. (%) | |
| Tumor type after surgery | |
| Benign | 189(54.8) |
| Malignant | 156(45.2) |
Prediction accuracy of thirteen learning algorithms on validation data
| Learning algorithm | Number of tours | Percentage of accuracy in prediction | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 5 neurons | 10 neurons | 20 neurons | ||||||||
| Min | Max | Mean(SD) | Min | Max | Mean(SD) | Min | Max | Mean(SD) | ||
| traingd | 20 | 61.0 | 80.0 | 71.0±6.0 | 58.0 | 80.0 | 69.0±4.9 | 58.0 | 75.0 | 66.0± 4.6 |
| traingdm | 20 | 60.0 | 78.0 | 69.0±5.2 | 61.0 | 75.0 | 68±5.0 | 55.0 | 72.0 | 65.0± 4.5 |
| traingda | 20 | 54.0 | 77.0 | 67.0±5.6 | 58.0 | 75.0 | 65.0± 4.5 | 46.0 | 78.0 | 64.0± 7.2 |
| trainrp | 20 | 58.0 | 80.0 | 67.0±5.0 | 52.0 | 78.0 | 68.0± 5.9 | 51.0 | 73.0 | 65.0± 5.9 |
| traincgb | 20 | 57.0 | 75.0 | 68.0±4.7 | 59.0 | 72.0 | 67.0± 3.0 | 51.0 | 77.0 | 66.0± 6.4 |
| traincgp | 20 | 55.0 | 83.0 | 67.0±7.1 | 58.0 | 74.0 | 65.0±4.8 | 46.0 | 77.0 | 65.0± 7.2 |
| traincgf | 20 | 46.0 | 73.0 | 64.0±6.1 | 46.0 | 74.0 | 66.0±6.4 | 58.0 | 74.0 | 65.0±5.1 |
| trainscg | 20 | 61.0 | 78.0 | 68.0± 4.5 | 59.0 | 75.0 | 65.0±4.3 | 54.0 | 75.0 | 64.0±5.3 |
| trainlm | 20 | 55.0 | 72.0 | 64.0± 5.4 | 54.0 | 74.0 | 65.0± 6.0 | 45.0 | 72.0 | 62.0±7.3 |
| trainbfg | 20 | 55.0 | 83.0 | 67.0± 7.0 | 57.0 | 74.0 | 66.0±4.7 | 45.0 | 72.0 | 63.0± 7.2 |
| trainoss | 20 | 64.0 | 83.0 | 69.0± 5.0 | 51.0 | 77.0 | 66.0±4.2 | 46.0 | 72.0 | 63.0±6.2 |
| traingdx | 20 | 64.0 | 78.0 | 69.0±4.0 | 58.0 | 74.0 | 68.0± 4.5 | 49.0 | 74.0 | 65.0±6.3 |
| trainbr | 20 | 60.0 | 77.0 | 67.0±4.6 | 59.0 | 75.0 | 68.0±4.0 | 52.0 | 77.0 | 68.0±6.2 |
Comparison of the thirteen batch learning algorithms based on the area under the ROC curve
| Learning algorithm | Area Under the ROC curve | ||
|---|---|---|---|
| 5 neurons | 10 neurons | 20 neurons | |
| Traingd | 0.837 | 0.823 | 0.810 |
| traingdm | 0.832 | 0.768 | 0.863 |
| traingda | 0.848 | 0.819 | 0.814 |
| trainrp | 0.827 | 0.824 | 0.753 |
| traincgb | 0.745 | 0.797 | 0.837 |
| traincgp | 0.865 | 0.785 | 0.809 |
| traincgf | 0.814 | 0.788 | 0.811 |
| trainscg | 0.905 | 0.768 | 0.824 |
| trainlm | 0.768 | 0.734 | 0.800 |
| trainbfg | 0.905 | 0.814 | 0.779 |
| trainoss | 0.859 | 0.784 | 0.765 |
| traingdx | 0.875 | 0.745 | 0.825 |
| trainbr | 0.811 | 0.835 | 0.817 |