Literature DB >> 22249968

Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus.

Wei-Qi Wei1, Cynthia L Leibson, Jeanine E Ransom, Abel N Kho, Pedro J Caraballo, High Seng Chai, Barbara P Yawn, Jennifer A Pacheco, Christopher G Chute.   

Abstract

OBJECTIVE: To evaluate data fragmentation across healthcare centers with regard to the accuracy of a high-throughput clinical phenotyping (HTCP) algorithm developed to differentiate (1) patients with type 2 diabetes mellitus (T2DM) and (2) patients with no diabetes.
MATERIALS AND METHODS: This population-based study identified all Olmsted County, Minnesota residents in 2007. We used provider-linked electronic medical record data from the two healthcare centers that provide >95% of all care to County residents (ie, Olmsted Medical Center and Mayo Clinic in Rochester, Minnesota, USA). Subjects were limited to residents with one or more encounter January 1, 2006 through December 31, 2007 at both healthcare centers. DM-relevant data on diagnoses, laboratory results, and medication from both centers were obtained during this period. The algorithm was first executed using data from both centers (ie, the gold standard) and then from Mayo Clinic alone. Positive predictive values and false-negative rates were calculated, and the McNemar test was used to compare categorization when data from the Mayo Clinic alone were used with the gold standard. Age and sex were compared between true-positive and false-negative subjects with T2DM. Statistical significance was accepted as p<0.05.
RESULTS: With data from both medical centers, 765 subjects with T2DM (4256 non-DM subjects) were identified. When single-center data were used, 252 T2DM subjects (1573 non-DM subjects) were missed; an additional false-positive 27 T2DM subjects (215 non-DM subjects) were identified. The positive predictive values and false-negative rates were 95.0% (513/540) and 32.9% (252/765), respectively, for T2DM subjects and 92.6% (2683/2898) and 37.0% (1573/4256), respectively, for non-DM subjects. Age and sex distribution differed between true-positive (mean age 62.1; 45% female) and false-negative (mean age 65.0; 56.0% female) T2DM subjects.
CONCLUSION: The findings show that application of an HTCP algorithm using data from a single medical center contributes to misclassification. These findings should be considered carefully by researchers when developing and executing HTCP algorithms.

Entities:  

Mesh:

Year:  2012        PMID: 22249968      PMCID: PMC3277630          DOI: 10.1136/amiajnl-2011-000597

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  28 in total

1.  Bias resulting from missing information: some epidemiological findings.

Authors:  A Cox; M Rutter; B Yule; D Quinton
Journal:  Br J Prev Soc Med       Date:  1977-06

2.  Performance of linkage analysis under misclassification error when the genetic model is unknown.

Authors:  M Martinez; M Khlat; M Leboyer; F Clerget-Darpoux
Journal:  Genet Epidemiol       Date:  1989       Impact factor: 2.135

3.  Measuring diagnoses: ICD code accuracy.

Authors:  Kimberly J O'Malley; Karon F Cook; Matt D Price; Kimberly Raiford Wildes; John F Hurdle; Carol M Ashton
Journal:  Health Serv Res       Date:  2005-10       Impact factor: 3.402

4.  History of the Rochester Epidemiology Project.

Authors:  L J Melton
Journal:  Mayo Clin Proc       Date:  1996-03       Impact factor: 7.616

5.  Missing clinical information during primary care visits.

Authors:  Peter C Smith; Rodrigo Araya-Guerra; Caroline Bublitz; Bennett Parnes; L Miriam Dickinson; Rebecca Van Vorst; John M Westfall; Wilson D Pace
Journal:  JAMA       Date:  2005-02-02       Impact factor: 56.272

6.  Use of a medical records linkage system to enumerate a dynamic population over time: the Rochester epidemiology project.

Authors:  Jennifer L St Sauver; Brandon R Grossardt; Barbara P Yawn; L Joseph Melton; Walter A Rocca
Journal:  Am J Epidemiol       Date:  2011-03-23       Impact factor: 4.897

7.  Prevalence and impact of information gaps in the emergency department.

Authors:  Joseph Kim; David Chuun; Akshay Shah; Dominik Aronsky
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

8.  Combining free text and structured electronic medical record entries to detect acute respiratory infections.

Authors:  Sylvain DeLisle; Brett South; Jill A Anthony; Ericka Kalp; Adi Gundlapallli; Frank C Curriero; Greg E Glass; Matthew Samore; Trish M Perl
Journal:  PLoS One       Date:  2010-10-14       Impact factor: 3.240

9.  Use of an electronic medical record for the identification of research subjects with diabetes mellitus.

Authors:  Russell A Wilke; Richard L Berg; Peggy Peissig; Terrie Kitchner; Bozana Sijercic; Catherine A McCarty; Daniel J McCarty
Journal:  Clin Med Res       Date:  2007-03

10.  Prevalence of information gaps for seniors transferred from nursing homes to the emergency department.

Authors:  Matthew A Cwinn; Alan J Forster; A Adam Cwinn; Guy Hebert; Lisa Calder; Ian G Stiell
Journal:  CJEM       Date:  2009-09       Impact factor: 2.410

View more
  63 in total

1.  Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance.

Authors:  Wei-Qi Wei; Pedro L Teixeira; Huan Mo; Robert M Cronin; Jeremy L Warner; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2015-09-02       Impact factor: 4.497

2.  A comparison of phenotype definitions for diabetes mellitus.

Authors:  Rachel L Richesson; Shelley A Rusincovitch; Douglas Wixted; Bryan C Batch; Mark N Feinglos; Marie Lynn Miranda; W Ed Hammond; Robert M Califf; Susan E Spratt
Journal:  J Am Med Inform Assoc       Date:  2013-09-11       Impact factor: 4.497

Review 3.  Intelligent use and clinical benefits of electronic health records in rheumatoid arthritis.

Authors:  Robert J Carroll; Anne E Eyler; Joshua C Denny
Journal:  Expert Rev Clin Immunol       Date:  2015-02-08       Impact factor: 4.473

4.  Assessing the Quality of Electronic Data for 'Fit-for-Purpose' by Utilizing Data Profiling Techniques Prior to Conducting a Survival Analysis for Adults with Acute Lymphoblastic Leukemia.

Authors:  Victoria Ngo; Theresa H Keegan; Brian A Jonas; Michael Hogarth; Katherine K Kim
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

5.  A distribution-based method for assessing the differences between clinical trial target populations and patient populations in electronic health records.

Authors:  C Weng; Y Li; P Ryan; Y Zhang; F Liu; J Gao; J T Bigger; G Hripcsak
Journal:  Appl Clin Inform       Date:  2014-05-07       Impact factor: 2.342

6.  Identification of type 2 diabetes subgroups through topological analysis of patient similarity.

Authors:  Li Li; Wei-Yi Cheng; Benjamin S Glicksberg; Omri Gottesman; Ronald Tamler; Rong Chen; Erwin P Bottinger; Joel T Dudley
Journal:  Sci Transl Med       Date:  2015-10-28       Impact factor: 17.956

Review 7.  Surveying Recent Themes in Translational Bioinformatics: Big Data in EHRs, Omics for Drugs, and Personal Genomics.

Authors:  J C Denny
Journal:  Yearb Med Inform       Date:  2014-08-15

8.  Design and analytic considerations for using patient-reported health data in pragmatic clinical trials: report from an NIH Collaboratory roundtable.

Authors:  Frank W Rockhold; Jessica D Tenenbaum; Rachel Richesson; Keith A Marsolo; Emily C O'Brien
Journal:  J Am Med Inform Assoc       Date:  2020-04-01       Impact factor: 4.497

9.  Learning statistical models of phenotypes using noisy labeled training data.

Authors:  Vibhu Agarwal; Tanya Podchiyska; Juan M Banda; Veena Goel; Tiffany I Leung; Evan P Minty; Timothy E Sweeney; Elsie Gyang; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2016-05-12       Impact factor: 4.497

10.  Characterization of statin dose response in electronic medical records.

Authors:  W-Q Wei; Q Feng; L Jiang; M S Waitara; O F Iwuchukwu; D M Roden; M Jiang; H Xu; R M Krauss; J I Rotter; D A Nickerson; R L Davis; R L Berg; P L Peissig; C A McCarty; R A Wilke; J C Denny
Journal:  Clin Pharmacol Ther       Date:  2013-10-04       Impact factor: 6.875

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.