Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 The usefulness of administrative databases for identifying disease cohorts is increased with a multivariate model.

Literature DB >> 20457509

The usefulness of administrative databases for identifying disease cohorts is increased with a multivariate model.

Carl van Walraven¹, Peter C Austin, Douglas Manuel, Greg Knoll, Allison Jennings, Alan J Forster.

Abstract

BACKGROUND: Administrative databases commonly use codes to indicate diagnoses. These codes alone are often inadequate to accurately identify patients with particular conditions. In this study, we determined whether we could quantify the probability that a person has a particular disease-in this case renal failure-using other routinely collected information available in an administrative data set. This would allow the accurate identification of a disease cohort in an administrative database.
METHODS: We determined whether patients in a randomly selected 100,000 hospitalizations had kidney disease (defined as two or more sequential serum creatinines or the single admission creatinine indicating a calculated glomerular filtration rate less than 60 mL/min/1.73 m²). The independent association of patient- and hospitalization-level variables with renal failure was measured using a multivariate logistic regression model in a random 50% sample of the patients. The model was validated in the remaining patients.
RESULTS: Twenty thousand seven hundred thirteen patients had kidney disease (20.7%). A diagnostic code of kidney disease was strongly associated with kidney disease (relative risk: 34.4), but the accuracy of the code was poor (sensitivity: 37.9%; specificity: 98.9%). Twenty-nine patient- and hospitalization-level variables entered the kidney disease model. This model had excellent discrimination (c-statistic: 90.1%) and accurately predicted the probability of true renal failure. The probability threshold that maximized sensitivity and specificity for the identification of true kidney disease was 21.3% (sensitivity: 80.0%; specificity: 82.2%).
CONCLUSION: Multiple variables available in administrative databases can be combined to quantify the probability that a person has a particular disease. This process permits accurate identification of a disease cohort in an administrative database. These methods may be extended to other diagnoses or procedures and could both facilitate and clarify the use of administrative databases for research and quality improvement.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2010 PMID： 20457509 DOI： 10.1016/j.jclinepi.2010.01.016

Source DB: PubMed Journal: J Clin Epidemiol ISSN： 0895-4356 Impact factor: 6.437

Keyword Cloud
Cited

10 in total

1. Consideration of ICD-9 code-derived disease-specific safety indicators in CKD.

Authors: Iris R Hartley; Jennifer S Ginsberg; Clarissa J Diamantidis; Min Zhan; Loreen Walker; Gail B Rattinger; Jeffrey C Fink
Journal: Clin J Am Soc Nephrol Date: 2013-09-19 Impact factor: 8.237

2. A population-based study to develop juvenile arthritis case definitions for administrative health data using model-based dynamic classification.

Authors: Allison Feely; Lily Sh Lim; Depeng Jiang; Lisa M Lix
Journal: BMC Med Res Methodol Date: 2021-05-16 Impact factor: 4.615

3. Combining structured and unstructured data to identify a cohort of ICU patients who received dialysis.

Authors: Swapna Abhyankar; Dina Demner-Fushman; Fiona M Callaghan; Clement J McDonald
Journal: J Am Med Inform Assoc Date: 2014-01-02 Impact factor: 4.497

4. Tradeoffs between accuracy measures for electronic health care data algorithms.

Authors: Jessica Chubak; Gaia Pocobelli; Noel S Weiss
Journal: J Clin Epidemiol Date: 2011-12-23 Impact factor: 6.437

5. Patient, physician, encounter, and billing characteristics predict the accuracy of syndromic surveillance case definitions.

Authors: Geneviève Cadieux; David L Buckeridge; André Jacques; Michael Libman; Nandini Dendukuri; Robyn Tamblyn
Journal: BMC Public Health Date: 2012-03-08 Impact factor: 3.295

6. Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy.

Authors: Geneviève Cadieux; Robyn Tamblyn; David L Buckeridge; Nandini Dendukuri
Journal: Med Care Date: 2017-08 Impact factor: 2.983

7. A systematic review of database validation studies among fertility populations.

Authors: V Bacal; M Russo; D B Fell; H Shapiro; M Walker; L M Gaudet
Journal: Hum Reprod Open Date: 2019-06-06

8. A data mining approach for grouping and analyzing trajectories of care using claim data: the example of breast cancer.

Authors: Nicolas Jay; Gilles Nuemi; Maryse Gadreau; Catherine Quantin
Journal: BMC Med Inform Decis Mak Date: 2013-11-30 Impact factor: 2.796

9. Subarachnoid hemorrhage admissions retrospectively identified using a prediction model.

Authors: Shane W English; Lauralyn McIntyre; Dean Fergusson; Alexis Turgeon; Marlise P Dos Santos; Cheemun Lum; Michaël Chassé; John Sinclair; Alan Forster; Carl van Walraven
Journal: Neurology Date: 2016-09-14 Impact factor: 9.910

10. Routine primary care data for scientific research, quality of care programs and educational purposes: the Julius General Practitioners' Network (JGPN).

Authors: Hugo M Smeets; Marlous F Kortekaas; Frans H Rutten; Michiel L Bots; Willem van der Kraan; Gerard Daggelders; Hanneke Smits-Pelser; Charles W Helsper; Arno W Hoes; Niek J de Wit
Journal: BMC Health Serv Res Date: 2018-09-25 Impact factor: 2.655

10 in total