| Literature DB >> 35404262 |
Louis Falissard1, Claire Morgand1, Walid Ghosn1, Claire Imbaud1, Karim Bounebache1, Grégoire Rey1.
Abstract
BACKGROUND: The recognition of medical entities from natural language is a ubiquitous problem in the medical field, with applications ranging from medical coding to the analysis of electronic health data for public health. It is, however, a complex task usually requiring human expert intervention, thus making it expansive and time-consuming. Recent advances in artificial intelligence, specifically the rise of deep learning methods, have enabled computers to make efficient decisions on a number of complex problems, with the notable example of neural sequence models and their powerful applications in natural language processing. However, they require a considerable amount of data to learn from, which is typically their main limiting factor. The Centre for Epidemiology on Medical Causes of Death (CépiDc) stores an exhaustive database of death certificates at the French national scale, amounting to several millions of natural language examples provided with their associated human-coded medical entities available to the machine learning practitioner.Entities:
Keywords: ICD-10 coding; automated medical entity recognition; deep learning; machine learning; machine translation; mortality statistics
Year: 2022 PMID: 35404262 PMCID: PMC9039820 DOI: 10.2196/26353
Source DB: PubMed Journal: JMIR Med Inform
Example of a causal chain of events leading to death as written in natural language and as ICD-10 codes.
| Part of form | Natural language | ICD-10a,b encoding | |
|
| |||
|
| Line 1 | Stroke in September left hemiparesis | I64 G819 |
|
| Line 2 | Fall scalp laceration fracture humerus | S010 W19 S423 |
|
| Line 3 | Coronary artery disease | I251 |
|
| Line 4 | Acute intracranial hemorrhage | I629 |
| Part 2 | Dementia depression hypertension | F03 F329 I10 | |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
bSome natural language lines correspond to several ICD-10 codes, whose orders matter in the overall coding process.
Death certificate from showcasing the misalignment phenomenon.
| Part of form | Natural language | ICD-10a encoding | |
|
| |||
|
| Line 1 | Stroke in September left hemiparesis | I64 G819 |
|
| Line 2 | Fall scalp laceration fracture humerus | S010 W19 S423 |
|
| Line 3 | Coronary artery disease | I629b I251 |
|
| Line 4 | Acute intracranial hemorrhageb | N/Ac |
| Part 2 | Dementia depression hypertension | F03 F329 I10 | |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
bThe ICD-10 code related to line 4 has been moved to line 3 by a human coder. Concatenating lines in a backward fashion restores alignment while preserving ordering.
cN/A: not applicable; the code that was previously here was moved to line 3, leaving this line blank.
Figure 1The original modeling problem and the modified investigated problem. In the original modeling problem (left), each certificate line is taken as an input variable to predict its corresponding ICD-10 code line. In the modified investigated problem (right), all certificate lines are concatenated and taken as an input variable to predict the corresponding concatenated ICD-10 code line. Lines 1-5 are from part 1 of the death certificate, and line 6 is part 2 of the certificate. ICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
Examples of how the selected performance metrics behave for different predictions. The input text was “stroke in September left hemiparesis” and the true ICD-10 encoding was I64 and G819.
| Prediction example | ICD-10a codes | Precision | Recall | F-measure |
| 1 | B189 H155 | 0.0 | 0.0 | 0.0 |
| 2 | I64 G81 | 0.5 | 0.5 | 0.5 |
| 3 | I64 | 1.0 | 0.5 | 0.66 |
| 4 | I64 G819 A338 B87 | 0.5 | 1.0 | 0.66 |
| 5 | I64 G819 | 1.0 | 1.0 | 1.0 |
| 6 | G819 I64 | 1.0 | 1.0 | 1.0 |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
Assessments of the current state-of-the-art model and the proposed approach.
| Approach | F-measure (95% CI)a | Precision (95% CI) | Recall (95% CI) |
| Current state of the art: LIMSIb | 0.825c | 0.872c | 0.784c |
| Proposed approach: electronic certificates | 0.952 (0.946-0.957) | 0.955 (0.95-0.96) | 0.948 (0.943-0.954) |
| Proposed approach: paper-based certificates | 0.942 (0.941-0.944) | 0.949 (0.947-0.95) | 0.936 (0.934-0.937) |
| Proposed approach: all certificates | 0.943 (0.941-0.944) | 0.949 (0.948-0.951) | 0.937 (0.935-0.938) |
a95% CIs were derived by bootstrapping.
bLIMSI: Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur.
c95% CIs were not provided in the LIMSI’s publication and are, therefore, not displayed.
False positive, false negative, and prevalence rates for each ICD-10 chapter, sorted in descending order by prevalence.
| ICD-10a chapter | False positives, % | False negatives, % | Prevalence, % |
| Diseases of the circulatory system | 3.75 | 4.98 | 22.4 |
| Symptoms, signs and abnormal clinical and laboratory findings, not elsewhere classified | 3.87 | 4.12 | 21.8 |
| Neoplasms | 4.07 | 5.07 | 15.9 |
| Diseases of the respiratory system | 3.02 | 4.00 | 8.76 |
| Endocrine, nutritional and metabolic diseases | 2.17 | 3.44 | 4.83 |
| Diseases of the nervous system | 2.70 | 4.12 | 3.89 |
| Mental, behavioral and neurodevelopmental disorders | 2.88 | 4.14 | 3.58 |
| Diseases of the digestive system | 5.72 | 8.10 | 3.53 |
| Factors influencing health status and contact with health services | 19.2 | 19.6 | 3.08 |
| Diseases of the genitourinary system | 5.45 | 7.59 | 2.71 |
| External causes of morbidity and mortality | 16.6 | 23.5 | 2.57 |
| Certain infectious and parasitic diseases | 7.98 | 9.23 | 2.55 |
| Injury, poisoning and certain other consequences of external causes | 14.0 | 19.8 | 2.07 |
| Diseases of the blood and blood-forming organs and certain disorders involving the immune mechanism | 6.72 | 12.2 | 0.77 |
| Diseases of the musculoskeletal system and connective tissue | 12.2 | 17.3 | 0.62 |
| Diseases of the skin and subcutaneous tissue | 8.72 | 8.16 | 0.51 |
| Certain conditions originating in the perinatal period | 14.5 | 20.5 | 0.16 |
| Congenital malformations, deformations and chromosomal abnormalities | 22.4 | 25.6 | 0.15 |
| Diseases of the eye and adnexa | 4.93 | 13.6 | 0.076 |
| Codes for special purposes | 24.0 | 34.0 | 0.047 |
| Diseases of the ear and mastoid process | 5.60 | 33.3 | 0.017 |
| Pregnancy, childbirth and the puerperium | 50.0 | 33.3 | 0.0056 |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
Figure 2Percentage of rejected predictions versus F-measure for accepted ones. The score threshold values defining the accepted predictions are displayed as colored points.
Example of preprocessing used for the experiment on a real error example. The predicted and database ICD-10 sequences only differ by one code, while they share five codes. All shared codes were deleted from all ICD-10 sequences prior to estimation of performance metrics.
| Source of ICD-10a codes | ICD-10 codes before preprocessing | ICD-10 codes after preprocessing |
| Predicted by the model | I259 Z951 I719 C679 I10 R092 | Z951 |
| Present in the database | I259 I251 I719 C679 I10 R092 | I251 |
| Predicted by medical expert | I259 I251 I719 C679 I10 R092 | I251 |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
F-measure, precision, and recall of both the database ICD-10 codes and the model’s prediction of codes compared to that of the medical expert for sampled certificates without missing data.
| Source of ICD-10a codes | F-measure (95% CI) | Precision (95% CI) | Recall (95% CI) |
| Presence in database against medical expert prediction | 0.483 (0.383-0.589) | 0.443 (0.341-0.555) | 0.531 (0.425-0.636) |
| Model prediction against medical expert prediction | 0.431 (0.316-0.542) | 0.458 (0.338-0.580) | 0.407 (0.295-0.519) |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
F-measure, precision, and recall of both the database ICD-10 codes and the model’s prediction of codes compared to that of the medical expert for all sampled certificates.
| Source of ICD-10a codes | F-measure (95% CI) | Precision (95% CI) | Recall (95% CI) |
| Presence in database against medical expert prediction | 0.613 (0.486-0.733) | 0.630 (0.492-0.761) | 0.596 (0.471-0.721) |
| Model prediction against medical expert prediction | 0.370 (0.237-0.504) | 0.392 (0.250-0.540) | 0.351 (0.222-0.482) |
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.
Example of death certificate format given to the medical expert for the second experiment. The medical expert was asked, based on the information available in the line, to guess which of propositions 1 or 2 was produced by a human coder, with the other being the proposed model’s output.
| Item | Sexa of deceased | Year of death | Age of deceased (years) | Certificate textb | Proposition 1 (ICD-10c codes) | Proposition 2 (ICD-10 codes) |
| Death certificate | 2 | 2013 | 90 | 90 ans, péritonite, perforation grêle, occlusion, chirurgie digestive, infection pulmonaire, arrêt respiratoire | R54 K566 K659, K631 Y839 J958 R092 | R54 K659 K631 K566 Y839 J189 R092 |
aSex is a two-state categorical variable: 1 (female) or 2 (male).
bThe certificate text was taken from a death certificate in France and is, therefore, written in French.
aICD-10: International Statistical Classification of Diseases and Related Health Problems, Tenth Revision.