Manana Khachidze, Magda Tsintsadze, Maia Archuadze.
Abstract
According to the Ministry of Labor, Health and Social Affairs of Georgia, a new health management system is to be introduced in the near future. This raises the problem of structuring and classifying the documents that contain the full history of medical services provided. The present work introduces an instrument for classifying medical records written in the Georgian language. It is the first attempt at such classification of Georgian-language medical records. In total, 24.855 examination records were studied. The documents were classified into three main groups (ultrasonography, endoscopy, and X-ray) and 13 subgroups using two well-known methods: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Both machine learning methods performed successfully, with SVM slightly ahead. During classification, a "shrink" method based on feature selection was introduced and applied. At the first stage of classification the "shrink" case gave better results; at the second stage, classification into subclasses, 23% of all documents could not be assigned to a single subclass (liver or biliary system) because these subclasses share common features. The overall results of the study were successful.
Year: 2016 PMID: 27668260 PMCID: PMC5030470 DOI: 10.1155/2016/8313454
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The data.
| Data | Total | Training data | Testing data |
|---|---|---|---|
| Ultrasonography | 12.864 | 7.720 | 5.144 |
| Liver | | 1.360 | 784 |
| Biliary system | | 1.227 | 957 |
| Kidney and urinary system | | 896 | 536 |
| Gynecologic | | 1.372 | 942 |
| Thyroid | | 1.101 | 851 |
| Breast | | 771 | 531 |
| Vascular Doppler | | 993 | 543 |
| X-ray | 10.524 | 5.262 | 5.262 |
| Chest | | 849 | 849 |
| Abdomen | | 1.057 | 1.057 |
| Spine | | 946 | 946 |
| Limbs | | 612 | 612 |
| Esophagus and stomach | | 1.224 | 1.224 |
| Large and small bowels | | 574 | 574 |
| Endoscopy | 1.468 | 734 | 734 |
| Document number (ultrasonography, X-ray, endoscopy) | 24.856 | 13.716 | 11.140 |
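
The group rows of this table can be cross-checked against the subclass rows with a few lines of arithmetic. The sketch below assumes that the seven subclasses from Liver to Vascular Doppler belong to ultrasonography, that the six from Chest to Large and small bowels belong to X-ray, and that the two numbers given for each subclass are its training and testing counts. The variable and function names are illustrative only.

```python
# Per-subclass (training, testing) document counts as read from the table.
# ASSUMPTION: the grouping of subclasses under each class is inferred
# from the group totals, not stated explicitly in the table.
ultrasonography = {
    "Liver": (1360, 784),
    "Biliary system": (1227, 957),
    "Kidney and urinary system": (896, 536),
    "Gynecologic": (1372, 942),
    "Thyroid": (1101, 851),
    "Breast": (771, 531),
    "Vascular Doppler": (993, 543),
}
xray = {
    "Chest": (849, 849),
    "Abdomen": (1057, 1057),
    "Spine": (946, 946),
    "Limbs": (612, 612),
    "Esophagus and stomach": (1224, 1224),
    "Large and small bowels": (574, 574),
}
endoscopy = (734, 734)  # no subclasses are listed for endoscopy


def column_sums(group):
    """Return (training, testing) totals over all subclasses of a group."""
    training = sum(train for train, _ in group.values())
    testing = sum(test for _, test in group.values())
    return training, testing


us_train, us_test = column_sums(ultrasonography)
xr_train, xr_test = column_sums(xray)
total_train = us_train + xr_train + endoscopy[0]
total_test = us_test + xr_test + endoscopy[1]

print("ultrasonography:", us_train, us_test)
print("x-ray:", xr_train, xr_test)
print("all documents:", total_train, total_test, total_train + total_test)
```

Under these assumptions the subclass counts add up to the group rows (7.720 and 5.144 for ultrasonography, 5.262 and 5.262 for X-ray) and to the final row of the table.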
Figure 1: Example of medical record form.
Retrieval analysis (shrink case/SVM).
| Medical tests performed | Total testing documents | Retrieved | True positive (tp) | False positive (fp) | False negative (fn) | True negative (tn) | Recall (R) | Precision (P) | F-measure (F) | Acc | ERR |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Calculation formulas | | | | | | | tp/(tp + fn) | tp/(tp + fp) | 2PR/(P + R) | (tp + tn)/(tp + fp + fn + tn) | (fp + fn)/(tp + fp + fn + tn) |
| Ultrasonography | 11.140 | 5.802 | 5.004 | 798 | 140 | 5.198 | 0,973 | 0,862 | 0,914 | 0,916 | 0,084 |
| X-ray | | 5.478 | 5.010 | 468 | 252 | 5.410 | 0,952 | 0,915 | 0,933 | 0,935 | 0,065 |
| Endoscopy | | 872 | 705 | 167 | 29 | 9.701 | 0,960 | 0,808 | 0,878 | 0,982 | 0,018 |
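
The calculation formulas in the table can be applied directly to the confusion counts of a single row. The following is a minimal sketch, assuming each class is scored one-vs-rest and that the F-measure is the usual harmonic mean 2PR/(P + R) of precision and recall. The names `Counts` and `retrieval_metrics` are hypothetical and not taken from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Counts:
    """Confusion counts for one class, treated one-vs-rest."""
    tp: int  # true positives
    fp: int  # false positives
    fn: int  # false negatives
    tn: int  # true negatives


def retrieval_metrics(c: Counts) -> dict:
    """Compute recall, precision, F-measure, accuracy and error rate."""
    total = c.tp + c.fp + c.fn + c.tn
    recall = c.tp / (c.tp + c.fn)
    precision = c.tp / (c.tp + c.fp)
    # ASSUMPTION: harmonic mean of precision and recall.
    f_measure = 2 * precision * recall / (precision + recall)
    return {
        "recall": recall,
        "precision": precision,
        "f_measure": f_measure,
        "accuracy": (c.tp + c.tn) / total,
        "error": (c.fp + c.fn) / total,
    }


# Ultrasonography row of the shrink case/SVM table.
ultrasonography = Counts(tp=5004, fp=798, fn=140, tn=5198)
metrics = retrieval_metrics(ultrasonography)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

For the ultrasonography row, the computed values agree with the tabulated recall, precision, F-measure, Acc and ERR to three decimal places.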
Retrieval analysis (classic case/SVM).
| Medical tests performed | Total testing documents | Retrieved | True positive (tp) | False positive (fp) | False negative (fn) | True negative (tn) | Recall (R) | Precision (P) | F-measure (F) | Acc | ERR |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Calculation formulas | | | | | | | tp/(tp + fn) | tp/(tp + fp) | 2PR/(P + R) | (tp + tn)/(tp + fp + fn + tn) | (fp + fn)/(tp + fp + fn + tn) |
| Ultrasonography | 11.140 | 5.798 | 4.657 | 1.141 | 487 | 4.855 | 0,905 | 0,803 | 0,851 | 0,854 | 0,146 |
| X-ray | | 5.784 | 4.703 | 1.081 | 559 | 4.797 | 0,894 | 0,813 | 0,852 | 0,853 | 0,147 |
| Endoscopy | | 881 | 708 | 173 | 26 | 9.698 | 0,965 | 0,804 | 0,877 | 0,981 | 0,019 |
Results of SVM versus KNN (feature selection classic method).
| Class name | Subclass name | SVM recall | SVM precision | SVM F-measure | SVM Acc | SVM ERR | KNN recall | KNN precision | KNN F-measure | KNN Acc | KNN ERR |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ultrasonography | Liver | 0,75 | 0,53 | 0,62 | 0,86 | 0,14 | 0,72 | 0,51 | 0,60 | 0,85 | 0,15 |
| | Biliary system | 0,57 | 0,44 | 0,50 | 0,78 | 0,22 | 0,56 | 0,40 | 0,46 | 0,76 | 0,24 |
| | Kidney and urinary system | 0,94 | 0,81 | 0,87 | 0,97 | 0,03 | 0,78 | 0,59 | 0,67 | 0,92 | 0,08 |
| | Gynecologic | 0,91 | 0,87 | 0,89 | 0,96 | 0,04 | 0,70 | 0,63 | 0,66 | 0,87 | 0,13 |
| | Thyroid | 0,83 | 0,79 | 0,81 | 0,94 | 0,06 | 0,81 | 0,71 | 0,76 | 0,91 | 0,09 |
| | Breast | 0,90 | 0,77 | 0,83 | 0,96 | 0,04 | 0,87 | 0,59 | 0,70 | 0,92 | 0,08 |
| | Vascular Doppler | 0,87 | 0,72 | 0,79 | 0,95 | 0,05 | 0,87 | 0,70 | 0,77 | 0,95 | 0,05 |
| X-ray | Chest | 0,89 | 0,85 | 0,87 | 0,96 | 0,04 | 0,78 | 0,84 | 0,81 | 0,94 | 0,06 |
| | Abdomen | 0,86 | 0,76 | 0,81 | 0,92 | 0,08 | 0,76 | 0,64 | 0,70 | 0,87 | 0,13 |
| | Spine | 0,89 | 0,77 | 0,83 | 0,93 | 0,07 | 0,78 | 0,63 | 0,70 | 0,88 | 0,12 |
| | Limbs | 0,99 | 0,77 | 0,87 | 0,97 | 0,04 | 0,83 | 0,57 | 0,68 | 0,91 | 0,09 |
| | Esophagus and stomach | 0,91 | 0,88 | 0,89 | 0,95 | 0,05 | 0,69 | 0,64 | 0,66 | 0,84 | 0,16 |
| | Large and small bowels | 0,91 | 0,94 | 0,93 | 0,98 | 0,02 | 0,84 | 0,66 | 0,74 | 0,94 | 0,07 |
Result evaluation for Level II (shrink and classic case/SVM).
| Subclass/level 2 | Shrink recall | Shrink precision | Shrink F-measure | Shrink Acc | Shrink Err | Classic recall | Classic precision | Classic F-measure | Classic Acc | Classic Err |
|---|---|---|---|---|---|---|---|---|---|---|
| Ultrasonography | | | | | | | | | | |
| Liver (784) | 0,610 | 0,398 | 0,481 | 0,800 | 0,200 | 0,749 | 0,528 | 0,619 | 0,860 | 0,140 |
| Biliary system (957) | 0,416 | 0,360 | 0,386 | 0,753 | 0,247 | 0,572 | 0,436 | 0,495 | 0,783 | 0,217 |
| Kidney and urinary system (536) | 0,806 | 0,635 | 0,711 | 0,932 | 0,068 | 0,942 | 0,813 | 0,873 | 0,971 | 0,029 |
| Gynecologic (942) | 0,781 | 0,738 | 0,759 | 0,909 | 0,091 | 0,908 | 0,871 | 0,889 | 0,958 | 0,042 |
| Thyroid (851) | 0,805 | 0,752 | 0,778 | 0,924 | 0,076 | 0,825 | 0,794 | 0,809 | 0,936 | 0,064 |
| Breast (531) | 0,793 | 0,594 | 0,679 | 0,923 | 0,077 | 0,904 | 0,769 | 0,831 | 0,962 | 0,038 |
| Vascular Doppler (543) | 0,888 | 0,666 | 0,761 | 0,941 | 0,059 | 0,866 | 0,721 | 0,787 | 0,950 | 0,050 |
| X-ray | | | | | | | | | | |
| Chest (849) | 0,802 | 0,629 | 0,705 | 0,892 | 0,108 | 0,888 | 0,852 | 0,870 | 0,957 | 0,043 |
| Abdomen (1057) | 0,729 | 0,634 | 0,678 | 0,861 | 0,139 | 0,855 | 0,762 | 0,806 | 0,917 | 0,083 |
| Spine (946) | 0,785 | 0,521 | 0,626 | 0,831 | 0,169 | 0,889 | 0,774 | 0,827 | 0,933 | 0,067 |
| Limbs (612) | 0,822 | 0,403 | 0,541 | 0,838 | 0,162 | 0,990 | 0,774 | 0,869 | 0,965 | 0,035 |
| Esophagus and stomach (1224) | 0,714 | 0,679 | 0,696 | 0,855 | 0,145 | 0,905 | 0,881 | 0,893 | 0,949 | 0,051 |
| Large and small bowels (574) | 0,796 | 0,603 | 0,686 | 0,921 | 0,079 | 0,911 | 0,944 | 0,927 | 0,984 | 0,016 |
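
The Level II scores can be related back to the document counts in parentheses. The sketch below takes the first two score columns of each case as recall and precision (the same order as in the Level I tables) and assumes that each ultrasonography subclass is scored against the 5.144 ultrasonography test documents. The function name is hypothetical, and because the inputs are rounded the result is only approximate.

```python
def error_rate_from_scores(recall, precision, positives, population):
    """Approximate the error rate implied by recall and precision.

    positives  -- number of test documents that truly belong to the subclass
    population -- number of test documents the subclass is scored against
    """
    tp = recall * positives        # recall = tp / (tp + fn)
    fn = positives - tp
    retrieved = tp / precision     # precision = tp / (tp + fp)
    fp = retrieved - tp
    return (fp + fn) / population  # error = (fp + fn) / all documents


# ASSUMPTION: the population is the ultrasonography test set (5144 documents).
ULTRASONOGRAPHY_TEST_DOCS = 5144

# Classic case rows: recall and precision as tabulated, with the
# parenthesized test counts used as the number of positives.
liver_err = error_rate_from_scores(0.749, 0.528, 784, ULTRASONOGRAPHY_TEST_DOCS)
kidney_err = error_rate_from_scores(0.942, 0.813, 536, ULTRASONOGRAPHY_TEST_DOCS)

print(f"Liver: {liver_err:.3f}")
print(f"Kidney and urinary system: {kidney_err:.3f}")
```

Under these assumptions the implied error rates come out close to the tabulated classic-case Err values of 0,140 for Liver and 0,029 for Kidney and urinary system.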