| Literature DB >> 28423783 |
Stefan Berndorfer1, Aron Henriksson2.
Abstract
Automated diagnosis coding can be provided efficiently by learning predictive models from historical data; however, discriminating between thousands of codes while allowing a variable number of codes to be assigned is extremely difficult. Here, we explore various text representations and classification models for assigning ICD-9 codes to discharge summaries in MIMIC-III. It is shown that the relative effectiveness of the investigated representations depends on the frequency of the diagnosis code under consideration and that the best performance is obtained by combining models built using different representations.Keywords: Electronic health records; diagnosis coding; predictive modeling
Mesh:
Year: 2017 PMID: 28423783
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630