| Literature DB >> 33265857 |
Donghua Chen1, Runtong Zhang1, Xiaomin Zhu2.
Abstract
This study aimed to propose a mapping framework with entropy-based metrics for validating the effectiveness of the transition between International Classification of Diseases 10th revision (ICD-10)-coded datasets and a new context of ICD-11. Firstly, we used tabular lists and mapping tables of ICD-11 to establish the framework. Then, we leveraged Shannon entropy to propose validation methods to evaluate information changes during the transition from the perspectives of single-code, single-disease, and multiple-disease datasets. Novel metrics, namely, standardizing rate (SR), uncertainty rate (UR), and information gain (IG), were proposed for the validation. Finally, validation results from an ICD-10-coded dataset with 377,589 records indicated that the proposed metrics reduced the complexity of transition evaluation. The results with the SR in the transition indicated that approximately 60% of the ICD-10 codes in the dataset were unable to map the codes to standard ICD-10 codes released by WHO. The validation results with the UR provided 86.21% of the precise mapping. Validation results of the IG in the dataset, before and after the transition, indicated that approximately 57% of the records tended to increase uncertainty when mapped from ICD-10 to ICD-11. The new features of ICD-11 involved in the transition can promote a reliable and effective mapping between two coding systems.Entities:
Keywords: ICD-10; ICD-11; Shannon entropy; transition; validation
Year: 2018 PMID: 33265857 PMCID: PMC7512330 DOI: 10.3390/e20100769
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Overview of an International Classification of Diseases (ICD) mapping framework.
Figure 2Flowchart of the Single-Code Validation (SV) in a mapping framework from ICD-10 to ICD-11.
Figure 3Empirical cumulative distribution of frequency of standardizing rate (SR) in an ICD-10-coded dataset.
Figure 4Pareto chart of the entropy interval of the three cases in SV.
Frequency of different ranges of information gain (IG) during mapping.
| Interval of Information Gain | Frequency | Percentage (%) | Number of ICD-10 Codes |
|---|---|---|---|
| [−12, −10) | 86 | 0.00 | 2 |
| [−10, −8) | 0 | 0.00 | / |
| [−8, −6) | 0 | 0.00 | / |
| [−6, −4) | 63,256 | 0.17 | 107 |
| [−4, −2) | 101,378 | 0.27 | 523 |
| [−2, 0) | 50,695 | 0.13 | 264 |
| [0, 2) | 156,884 | 0.42 | 1405 |
| [2, 4) | 5277 | 0.01 | 92 |
| [4, 6) | 12 | 0.00 | 4 |
| [6, 8) | 1 | 0.00 | 1 |
Figure 5Changes of the number of ICD-10 codes in different chapters of ICD-10, over varying information-gain intervals.