Literature DB >> 34996364

Simple Excel and ICD-10 based dataset calculator for the Charlson and Elixhauser comorbidity indices.

Pärt Prommik1,2,3, Kaspar Tootsi4,5, Helgi Kolk4,5, Aare Märtson4,5, Toomas Saluse5, Eiki Strauss5.   

Abstract

BACKGROUND: The Charlson and Elixhauser Comorbidity Indices are the most widely used comorbidity assessment methods in medical research. Both methods are adapted for use with the International Classification of Diseases, which 10th revision (ICD-10) is used by over a hundred countries in the world. Available Charlson and Elixhauser Comorbidity Index calculating methods are limited to a few applications with command-line user interfaces, all requiring specific programming language skills. This study aims to use Microsoft Excel to develop a non-programming and ICD-10 based dataset calculator for Charlson and Elixhauser Comorbidity Index and to validate its results with R- and SAS-based methods.
METHODS: The Excel-based dataset calculator was developed using the program's formulae, ICD-10 coding algorithms, and different weights of the Charlson and Elixhauser Comorbidity Index. Real, population-wide, nine-year spanning, index hip fracture data from the Estonian Health Insurance Fund was used for validating the calculator. The Excel-based calculator's output values and processing speed were compared to R- and SAS-based methods.
RESULTS: A total of 11,491 hip fracture patients' comorbidities were used for validating the Excel-based calculator. The Excel-based calculator's results were consistent, revealing no discrepancies, with R- and SAS-based methods while comparing 192,690 and 353,265 output values of Charlson and Elixhauser Comorbidity Index, respectively. The Excel-based calculator's processing speed was slower but differing only from a few seconds up to four minutes with datasets including 6250-200,000 patients.
CONCLUSIONS: This study proposes a novel, validated, and non-programming-based method for calculating Charlson and Elixhauser Comorbidity Index scores. As the comorbidity calculations can be conducted in Microsoft Excel's simple graphical point-and-click interface, the new method lowers the threshold for calculating these two widely used indices. TRIAL REGISTRATION: retrospectively registered.
© 2021. The Author(s).

Entities:  

Keywords:  Charlson comorbidity index; Comorbidity calculator; Elixhauser comorbidity index; ICD-10; Research methodology

Mesh:

Year:  2022        PMID: 34996364      PMCID: PMC8742382          DOI: 10.1186/s12874-021-01492-7

Source DB:  PubMed          Journal:  BMC Med Res Methodol        ISSN: 1471-2288            Impact factor:   4.615


Background

Identification of preexisting clinical conditions or comorbidities is of interest in all types of medical research. The Charlson and Elixhauser Comorbidity Indices are of the most widely used comorbidity assessment methods [1-10] validated on several patient populations like cancer [11], chronic renal failure [12], coronary artery bypass grafting [12], diabetes [12], hip fracture [10, 13–15], and stroke [16]. The Charlson Comorbidity Index (CCI) is a weighted score that accounts for the presence of 19 comorbid diseases [1]. CCI was later adapted for use with administrative data based on the International Classification of Diseases [4]. Later, Quan and his colleagues (2011) provided updated weights for CCI, as treatment of some diseases has improved in time [5]. The Elixhauser comorbidity system was initially developed for administrative data when measuring the presence of 30 comorbidities [17]. Van Walraven and his colleagues (2009) later modified the initial classification system into a single weighted score – Elixhauser Comorbidity Index (ECI) [9]. Later studies have provided different weighting schemes for ECI: Thompson and AHRQ (the Agency for Healthcare Research and Quality, Canada) weights [18, 19]. Currently, available CCI and ECI calculation methods are limited to a few software applications, which can only be operated through a command-line user interface, requiring R, SAS or SQL programming language skills [20-27]. Other available calculators allow measuring only a single patient’s comorbidities at a time [28]. As all CCI and ECI dataset calculators require specific programming-based software, the indices’ accessibility is limited for those who use other software for statistical analyses or have no prior programming experience. Thus, the accessibility of CCI and ECI can be increased by developing new methods using more user-friendly interfaces. Microsoft Excel is a widely used spreadsheet programme, and its graphical point-and-click interface allows use without programming. Thus, this study aims to use Microsoft Excel to develop a non-programming and ICD-10 based dataset calculator for CCI and ECI and to validate its results with R- and SAS-based methods.

Methods

Patients

We used real retrospective hip fracture population-wide data from the Estonian Health Insurance Fund. The Estonian Health Insurance Fund organises a national, solidarity-based mandatory health insurance system in Estonia, covering 94% of the population [29]. The hip fracture population was chosen as ICD-10 codes have been found suitable for fracture identification [30], and CCI and ECI have been validated among these patients [10, 13–15, 31]. The inclusion and exclusion criteria were chosen in concordance to multiple other studies [30, 32–35]: (1) age 50 or over; (2) ICD-10 codes S72.0–2 identifying index hip fracture between 1 January 2009–30 September 2017; (3) data validation confirming hip fracture diagnosis and excluding isolated acetabular, pelvic, periprosthetic, isolated greater and lesser trochanter fractures.

Data validation

The data validation was based on a logic check or the reviewal of patients’ medical information (Fig. 1). Firstly, patients’ Nordic Medico-Statistical Committee’s Classification of Surgical Procedures (NOMESCO) surgical management was reviewed to confirm their hip fracture diagnosis. Following codes confirmed the diagnosis: NFB20, NFB30, NFB40, NFB99, NFB00–9; NFB10–9, NFJ70–3, NFJ60–3, NFJ80–3, NFJ50–3 [36]. If these codes were not available, a patient’s digital images and medical records were reviewed. Two national databases were used to review digital images and medical records: the Foundation of Estonian PACS (an image archiving and communication system database) and the Estonian National Health Information System (https://ap.digilugu.ee/arstiportaal). Uploading medical data to both databases is mandatory by law, particularly since 2010 for medical records and since 2014 for digital images. Digital images were reviewed from January to July 2017 and medical records from January to March 2019. An orthopaedic surgeon and a radiologist reviewed the digital images, and a geriatrician reviewed the medical records. Hip fracture diagnosis was confirmed if one or both of the data sources approved its presence.
Fig. 1

Flowchart showing the validation of hip fracture diagnoses. Abbreviations: NCSP: Nordic-Medico-Statistical Committee’s Classification of Surgical Procedures

Flowchart showing the validation of hip fracture diagnoses. Abbreviations: NCSP: Nordic-Medico-Statistical Committee’s Classification of Surgical Procedures

Patients’ comorbidities

Comorbidities were defined as diagnoses coded as ICD-10 at any hospital or outpatient health care claims during a four-year period: at the time of the index HF and during the preceding 4 years. The 4 year preceding period was chosen to avoid under-ascertainment of comorbidities [37]. Finally, a restriction was applied to increase the validity of comorbidity assessment: only ICD-10 codes that appeared at least two times, and at least 7 days apart were included [13, 38].

Development of excel-based calculator

The Microsoft Excel-based dataset calculator was developed using ICD-10 coding algorithms [4], and different weighting schemes of CCI and ECI, the program’s basic formulae and wide format (Additional file 1). The 10th revision of the International Classification of Diseases was chosen as it is used by more than a hundred countries, including Estonia, and cited in more than 20,000 scientific articles world [4, 39]. The weighting schemes included the original [1] and the updated [5] CCI weights, and van Walraven [9] and AHRQ weights [18]. The calculator also takes into account the hierarchy of comorbidities: milder disease forms are excluded if a more severe one is present. Excel’s basic formulae were used for making the calculator as this makes it simple and flexible for users. It calculates comorbidity scores in two steps. If cell A2 is a patient’s ID and B2 contains her/his diseases as ICD-10 codes, the first step identifies the patient’s comorbidity categories [=IF (SUM (IF((LEN(B2)-LEN (SUBSTITUTE (UPPER(B2),{“CODE-1”;"CODE-2”; …; “CODE-N”},”“))),1,0)) > 0,1,0)] and the second step uses the output of the previous step and calculates total score using necessary weights (multiplications) and hierarchical conditions (IF functions) [=(C2*1) + IF(C2 = 0,D2*1,0) + … + IF(M2 = 0,L2*2,0)]. These basic formulae also allow users to edit or adapt the calculator for other weights or versions of the International Classification of Diseases codes. Wide-format, showing one subject per row, was preferred as this is the most used final data structure in statistical analysis. As ICD-10 data is occasionally in long format - one morbidity per row, simple data transformation solutions are included in the calculator’s instructions and in its one-minute instructional video (Additional file 2). Data transformations were done with an Excel’s add-in named Ablebits (www.ablebits.com). The add-in’s functions ‘Merge Duplicates’ (transforms long format to wide format), ‘Merge Cells’ (combines codes from multiple columns into one) and ‘Split Text (splits codes from one cell to multiple columns or rows; transforms wide format to long format) are useful for such purposes. The calculator allows ICD-10 codes to be inserted in any format: lowercase, uppercase, with or without punctuation, and any separators can be used between diagnoses. Finally, the calculator’s ability to identify all ICD-10 codes used in CCI and ECI was tested since the used hip fracture population may not cover all of the diseases used in the indicies. The calculator identified all ICD-10 codes used in the two indices.

Statistical analysis

Continuous variables were presented as “median (25th-75th percentile)” and categorical as proportions. The patients’ Charlson weight comorbidity score and the presence of different diseases were calculated using the Excel-based calculator and the R package “comorbidity” [21] and two SAS macros [40, 41]. The Excel-based calculator was validated by comparing the three methods’ results. The calculators’ processing speeds were compared using the study’s data (multiplicated for larger sample sizes). Excel-based calculator’s processing speed was assessed by running formulae in all columns at once. The analyses were run on a Lenovo T480 laptop released at the beginning of 2018 (i5-8250U 1.6 GHz CPU, 16GB RAM, Windows 10 Enterprise 20H2). Data analyses were done in Microsoft™ Excel™ 365 MSO 16.0.13528.203018 64bit (Microsoft Corporation, Redmond, Washington, USA), R 4.0.4 (R Core Team, 2017) and SAS OnDemand for Academics, release 3.8 (Enterprise edition) (SAS Institute Inc., Cary, NC, USA). Adobe Illustrator and Adobe InDesign (versions CC, Adobe Systems, San Jose, CA, USA) and GraphPad Prism (version 7.0, GraphPad Software, Incorporation, San Diego, CA, USA) were used for creating or finalising figures. Wondershare Filmora (version 10.1.20.16(6.0.0..54.8), Wondershare Technology Corporation, South Shenzhen, China) was used for video editing.

Results

Patients and their comorbidities

A total of 11,491 patients were included in the study (Fig. 1). Their median age was 81 years (73–87), 72% (8246) were female, and 51% (5883) had an intracapsular fracture. The Excel-based calculator’s results are presented in two tables: the original and the updated weight CCI scores in Table 1; and AHRQ and van Walraven weight ECI scores in Table 2.
Table 1

Original and updated Charlson comorbidity scores and disease categories

VariableValue
Original Charlson weight score
 Median (25th–75th percentile)1 (0–3)
Binned estimates
 03521 (30.6)
 1–24841 (42.1)
 3–42289 (19.9)
  ≥ 5840 (7.3)
Updated Charlson weight score
 Median (25th–75th percentile)2 (0–2)
Binned estimates
 04495 (39.1)
 1–24127 (35.9)
 3–42258 (19.7)
  ≥ 5611 (5.3)
Categories
 Myocardial infarction796 (6.9)
 Congestive heart failure5025 (43.7)
 Peripheral vascular disease1197 (10.4)
 Cerebrovascular disease2477 (21.6)
 Dementia1106 (9.6)
 Chronic pulmonary disease1243 (10.8)
 Rheumatic disease383 (3.3)
 Peptic ulcer disease542 (4.7)
 Mild liver disease174 (1.5)
 Diabetes without complications1242 (10.8)
 Diabetes with complications678 (5.9)
 Hemi- or paraplegia530 (4.6)
 Moderate/severe renal disease465 (4.0)
 Any malignancy1179 (10.3)
 Moderate/severe liver disease36 (0.3)
 Metastatic solid tumor42 (0.4)
 AIDS/HIV1 (< 0.0)

Values are n (%) unless otherwise specified

Table 2

AHRQ and van Walraven Elixhauser comorbidity scores and disease categories

VariableValue
AHRQ Elixhauser score
 Median (25th–75th percentile)3.0 (0–8)
Binned estimates
 04790 (41.7)
 1–41257 (10.9)
 5–93181 (27.7)
  ≥ 102263 (19.7)
van Walraven Elixhauser score
 Median (25th–75th percentile)5.0 (0–10)
Binned estimates
 04295 (37.4)
 1–41187 (10.3)
 5–92844 (24.7)
  ≥ 103165 (27.5)
Categories
 Congestive heart failure5025 (43.7)
 Cardiac arrhythmias2373 (20.7)
 Valvular disease357 (3.1)
 Pulmonary circulation disorders152 (1.3)
 Peripheral vascular disorders1197 (10.4)
 Hypertension, uncomplicated2972 (25.9)
 Hypertension, complicated6230 (54.2)
 Paralysis530 (4.6)
 Other neurological disorders1060 (9.2)
 Chronic pulmonary disease1243 (10.8)
 Diabetes, uncomplicated968 (8.4)
 Diabetes, complicated1063 (9.3)
 Hypothyroidism587 (5.1)
 Renal failure465 (4.0)
 Liver disease183 (1.6)
 Peptic ulcer disease excluding bleeding263 (2.3)
 AIDS/HIV1 (< 0.0)
 Lymphoma69 (0.6)
 Metastatic cancer42 (0.4)
 Solid tumour without metastasis1085 (9.4)
 Rheumatoid arthritis collagen vascular diseases414 (3.6)
 Coagulopathy34 (0.3)
 Obesity184 (1.6)
 Weight loss21 (0.2)
 Fluid and electrolyte disorders46 (0.4)
 Blood loss anaemia124 (1.1)
 Deficiency anaemia771 (6.7)
 Alcohol abuse367 (3.2)
 Drug abuse16 (0.1)
 Psychoses254 (2.2)
 Depression1125 (9.8)

Values are n (%) unless otherwise specified. Abbreviations: AHRQ the Agency for Healthcare Research and Quality

Original and updated Charlson comorbidity scores and disease categories Values are n (%) unless otherwise specified AHRQ and van Walraven Elixhauser comorbidity scores and disease categories Values are n (%) unless otherwise specified. Abbreviations: AHRQ the Agency for Healthcare Research and Quality

Comparison of the two methods

A total of 192,690 Charlson’s and 353,265 Elixhauser’s output values were compared. The Excel-based calculator’s results were consistent, revealing no discrepancies, with the R- and SAS-based methods.

Processing speed

The Excel-based calculator performed well with sample sizes of up to 200,000 patients, showing a processing time from 2 s up to 4 min and 10 s (Fig. 2). However, calculating comorbidities for 400,000 patients took 21 min and 32 s for CCI and 36 min and 34 s for ECI with the used hard- and software. In contrast, the R- and SAS-based calculators performed all calculations in less than 22 s.
Fig. 2

Processing speeds of Excel-based calculator and R- and SAS-based methods with different sample sizes. Abbreviations: CCI: Charlson Comorbidity Index; ECI: Elixhauser Comorbidity Index; SAS – Statistical Analysis Software

Processing speeds of Excel-based calculator and R- and SAS-based methods with different sample sizes. Abbreviations: CCI: Charlson Comorbidity Index; ECI: Elixhauser Comorbidity Index; SAS – Statistical Analysis Software

Discussion

This study proposes a novel, simple and validated tool for the most used comorbidity indices in medical research – CCI and ECI. The compared methods performed similarly in terms of accuracy, although the new tool has advantages and disadvantages that should be considered. The main advantage of the Excel-based method is its ease of use: comorbidity scores can be calculated by just copying and pasting patients’ identification numbers and ICD-10 codes from one spreadsheet to another. This can be done in a simple graphical point-and-click interface, requiring no coding skills from its user. However, the Excel-based calculator has limitations. Other programming-based methods allow calculating CCI and ECI with earlier versions or adaptions of the International Classification of Diseases: ICD-9, ICD-9-CM, Enhanced ICD-9-CM codes, or ICD-10-CM [5, 20–27]. The new calculator’s processing speed is reasonable with datasets of up to 200,000 patients and relatively capable hardware, taking up to few minutes in total. Still, it may take a considerable amount of time with larger data. This is explained by the calculator’s formulae-based nature, as they are duplicated in millions of spreadsheet cells, requiring a considerable amount of computing power. Computers with better hardware specifications (especially central processing unit’s [CPU] speed, random-access memory [RAM]) and 64bit version Microsoft Excel are therefore recommended for analysing large data. On the other hand, most medical research studies examine smaller sample sizes, large data splitting is always an option, and an hour-long calculation still takes significantly less time than learning to code. Another limitation is that Excel’s spreadsheets are limited to slightly over a million rows. Therefore, large long-format data may require splitting and should be prepared using multiple sheets. Ultimately, the final choice between using the Excel-based and other methods depends on a user’s skills, preference, available software and needs. All these factors vary among researchers. The new calculator may be useful for users preferring Microsoft Excel to prepare or analyse data, or those who have no programming skills, or whose used statistical software does not have a module for calculating CCI or ECI. Thus, the new simple Excel-based method lowers the threshold for calculating CCI and ECI, making these indices accessible to a broader audience.

Conclusions

This study proposes a novel, validated, non-programming based method for calculating two of the most used comorbidity indices in medical research - CCI and ECI. The Excel-based calculator allows calculating these comorbidity indices by simply copying and pasting data in a graphical point-and-click interface, thereby lowering the threshold for calculating CCI and ECI. The method may be useful for users preferring Microsoft Excel to prepare or analyse data, or those who have no programming skills, or whose statistical software does not have a corresponding module. The calculator’s slower processing speed is a downside that should be taken into account with very large datasets or less capable hardware or 32bit version of Microsoft Excel. Additional file 1. Additional file 2.
  29 in total

1.  Nonoperative treatment of hip fractures.

Authors:  Rina Jain; Antoni Basinski; Hans J Kreder
Journal:  Int Orthop       Date:  2002-11-12       Impact factor: 3.075

Review 2.  How to measure comorbidity. a critical review of available methods.

Authors:  Vincent de Groot; Heleen Beckerman; Gustaaf J Lankhorst; Lex M Bouter
Journal:  J Clin Epidemiol       Date:  2003-03       Impact factor: 6.437

3.  Updating and validating the Charlson comorbidity index and score for risk adjustment in hospital discharge abstracts using data from 6 countries.

Authors:  Hude Quan; Bing Li; Chantal M Couris; Kiyohide Fushimi; Patrick Graham; Phil Hider; Jean-Marie Januel; Vijaya Sundararajan
Journal:  Am J Epidemiol       Date:  2011-02-17       Impact factor: 4.897

4.  The ICD-10 Charlson Comorbidity Index predicted mortality but not resource utilization following hip fracture.

Authors:  Barbara Toson; Lara A Harvey; Jacqueline C T Close
Journal:  J Clin Epidemiol       Date:  2014-10-28       Impact factor: 6.437

5.  Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data.

Authors:  Hude Quan; Vijaya Sundararajan; Patricia Halfon; Andrew Fong; Bernard Burnand; Jean-Christophe Luthi; L Duncan Saunders; Cynthia A Beck; Thomas E Feasby; William A Ghali
Journal:  Med Care       Date:  2005-11       Impact factor: 2.983

6.  Toolkit to Compute Time-Based Elixhauser Comorbidity Indices and Extension to Common Data Models.

Authors:  Shorabuddin Syed; Ahmad Baghal; Fred Prior; Meredith Zozus; Shaymaa Al-Shukri; Hafsa Bareen Syeda; Maryam Garza; Salma Begum; Kim Gates; Mahanazuddin Syed; Kevin W Sexton
Journal:  Healthc Inform Res       Date:  2020-07-31

7.  Comorbidity risk-adjustment strategies are comparable among persons with hip fracture.

Authors:  David C Radley; Daniel J Gottlieb; Elliot S Fisher; Anna N A Tosteson
Journal:  J Clin Epidemiol       Date:  2008-02-14       Impact factor: 6.437

8.  Charlson Index comorbidity adjustment for ischemic stroke outcome studies.

Authors:  Larry B Goldstein; Gregory P Samsa; David B Matchar; Ronnie D Horner
Journal:  Stroke       Date:  2004-07-01       Impact factor: 7.914

9.  Osteoporosis-related fracture case definitions for population-based administrative data.

Authors:  Lisa M Lix; Mahmoud Azimaee; Beliz Acan Osman; Patricia Caetano; Suzanne Morin; Colleen Metge; David Goltzman; Nancy Kreiger; Jerilynn Prior; William D Leslie
Journal:  BMC Public Health       Date:  2012-05-18       Impact factor: 3.295

10.  Coding algorithms for defining Charlson and Elixhauser co-morbidities in Read-coded databases.

Authors:  David Metcalfe; James Masters; Antonella Delmestri; Andrew Judge; Daniel Perry; Cheryl Zogg; Belinda Gabbe; Matthew Costa
Journal:  BMC Med Res Methodol       Date:  2019-06-06       Impact factor: 4.615

View more
  1 in total

1.  Isolated greater trochanter fracture may impose a comparable risk on older patients' survival as a conventional hip fracture: a population-wide cohort study.

Authors:  Pärt Prommik; Kaspar Tootsi; Helgi Kolk; Aare Märtson; Karin Veske; Eiki Strauss; Toomas Saluse
Journal:  BMC Musculoskelet Disord       Date:  2022-04-27       Impact factor: 2.562

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.