| Literature DB >> 32695611 |
Jennifer G Nooney1, M Sue Kirkman2, Kai McKeever Bullard3, Zachary White1, Kristi Meadows1, Joanne R Campione1, Russ Mardon1, Gonzalo Rivero1, Stephen R Benoit3, Emily Pfaff4, Deborah Rolka3, Sharon Saydah3.
Abstract
OBJECTIVES: Surveys for U.S. diabetes surveillance do not reliably distinguish between type 1 and type 2 diabetes, potentially obscuring trends in type 1 among adults. To validate survey-based algorithms for distinguishing diabetes type, we linked survey data collected from adult patients with diabetes to a gold standard diabetes type. RESEARCH DESIGN AND METHODS: We collected data through a telephone survey of 771 adults with diabetes receiving care in a large healthcare system in North Carolina. We tested 34 survey classification algorithms utilizing information on respondents' report of physician-diagnosed diabetes type, age at onset, diabetes drug use, and body mass index. Algorithms were evaluated by calculating type 1 and type 2 sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) relative to a gold standard diagnosis of diabetes type determined through analysis of EHR data and endocrinologist review of selected cases.Entities:
Keywords: Algorithms; Diabetes surveillance; Surveillance methodology; Type 1 diabetes
Year: 2020 PMID: 32695611 PMCID: PMC7365930 DOI: 10.1016/j.jcte.2020.100231
Source DB: PubMed Journal: J Clin Transl Endocrinol ISSN: 2214-6237
Fig. 1Study design.
Characteristics of survey respondents reporting diabetes (N = 698).
| Characteristics | Unweighted Sample % | Weighted Sample % |
|---|---|---|
| Type 1 | 32.2 | 5.3 |
| Type 2 | 65.5 | 94.3 |
| Other/indeterminate | 2.3 | 0.4 |
| % using insulin | 66.5 | 42.1 |
| % using non-insulin | 54.4 | 80.2 |
| % on no diabetes drugs | 4.6 | 6.7 |
| Mean onset age, years | 33.3 | 46.8 |
| Mean count of type 1 Dx codes | 4.6 | 0.6 |
| Mean count of type 2 Dx codes | 9.0 | 9.6 |
| 18–44 | 29.1 | 7.6 |
| 45–64 | 42.6 | 44.8 |
| 65+ | 28.4 | 47.6 |
| Female | 51.7 | 53.9 |
| Male | 48.3 | 46.1 |
| Non-Hispanic white | 32.7 | 65.6 |
| Non-Hispanic black | 28.2 | 26.1 |
| Hispanic | 23.1 | 3.3 |
| Non-Hispanic other | 16.0 | 5.0 |
| Less than high school | 17.2 | 8.6 |
| High school graduate | 24.9 | 28.0 |
| Some college or more | 58.0 | 63.4 |
Notes: Gold Standard Diabetes Type refers to the “true” diabetes type determined through the use of structured and unstructured electronic health records data, as opposed to the type reported by respondents on the survey.
EHR = electronic health record; Dx = diagnosis.
Prevalence and Performance of Self Report-Based Algorithms (N = 698).
| Conditional Inference Tree: Self-report of type 1 = type 1; all others are type 2 | Self-report of type 1 + current insulin use = type 1; self-report of Other is Other; all others are type 2 | Self-report of type 1, current insulin use, started ins in 1 year of Dx = type 1; Self-report of Other is Other; all others are type 2 | Self-report of type 1, current insulin use, started ins in 1 year of Dx without stopping = type 1; self-report of Other is Other; all others are type 2 | Self-report of type 1, current insulin use, started ins in 1 year of Dx without stopping except in 1st year = type 1; self-report of Other is Other; all others are type 2 | |
|---|---|---|---|---|---|
| Type 1 Sensitivity (%95 CI) | 92.7% (88.6%–96.9%) | 91.6% (87.4%–95.8%) | 81.6% (75.2%–88.0%) | 79.0% (72.4%–85.6%) | 79.0% (72.4%–85.6%) |
| Type 1 PPV (%95 CI) | 72.9% (56.2%–89.5%) | 82.5% (70.8%–94.3%) | 83.4% (70.7%–96.0%) | 82.9% (70.0%–95.9%) | 82.9% (70.0%–95.9%) |
| Type 1 Specificity (%95 CI) | 98.1% (96.5%–99.7%) | 98.9% (98.0%–99.8%) | 99.1% (98.3%–99.9%) | 99.1% (98.3%–99.9%) | 99.1% (98.3%–99.9%) |
| Type 1 NPV (%95 CI) | 99.6% (99.3%–99.8%) | 99.5% (99.3%–99.8%) | 99.0% (98.6%–99.4%) | 98.8% (98.4%–99.3%) | 98.8% (98.4%–99.3%) |
| Type 1 Prevalence: Algorithm | 6.7% | 5.9% | 5.2% | 5.0% | 5.0% |
| Type 1 Prevalence: Gold Standard | 5.3% | 5.3% | 5.3% | 5.3% | 5.3% |
| Type 2 Sensitivity (%95 CI) | 98.2% (96.6%–99.8%) | 99.0% (98.2%–99.9%) | 99.2% (98.4%–100.0%) | 99.2% (98.4%–100.0%) | 99.2% (98.4%–100.0%) |
| Type 2 PPV (%95 CI) | 99.3% (99.0%–99.6%) | 99.4% (99.1%–99.7%) | 98.9% (98.4%–99.3%) | 98.7% (98.3%–99.1%) | 98.7% (98.3%–99.1%) |
| Type 2 Specificity (%95 CI) | 88.6% (83.8%–93.4%) | 90.1% (85.9%–94.4%) | 80.8% (74.6%–87.0%) | 78.4% (72.0%–84.8%) | 78.4% (72.0%–84.8%) |
| Type 2 NPV (%95 CI) | 74.6% (57.7%–91.6%) | 84.9% (73.3%–96.6%) | 86.1% (73.6%–98.5%) | 85.7% (72.9%–98.4%) | 85.7% (72.9%–98.4%) |
| % Correctly classified | 97.5% | 98.3% | 97.9% | 97.8% | 97.8% |
Notes: Type 1 Prevalence: Algorithm prevalence refers to the prevalence of type 1 generated by applying the algorithm to the sample. Type 1 Prevalence: Gold Standard refers to the prevalence of type 1 according to the “true” diabetes type determined through the use of structured and unstructured electronic health records data. The first column describes the algorithm that resulted from a conditional inference tree modeling technique; the tree optimized with the use of a single variable, self-reported diabetes type. All estimates are weighted to the full diabetes population of UNC Healthcare. Bold numbers signify the primary metric used to evaluate algorithms.
PPV = positive predictive value; NPV = negative predictive value, Dx = diagnosis; CI = confidence interval.
Prevalence and Performance of Drug-Based Algorithms in 2019, UNC Health Care System (N = 698).
| Current insulin use within 1 yr of Dx = type 1; all others are type 2 | Current insulin use within 1 yr of Dx without stopping = type 1; all others are type 2 | Current insulin use within 1 yr of Dx without stopping except in first year = type 1; all others are type 2 | Current insulin use, started insulin within 1 year of Dx, Dx age less than 40 = type 1; all others are type 2 | |
|---|---|---|---|---|
| Type 1 Sensitivity (%95 CI) | 84.8% (78.7%–91.0%) | 82.2% (75.9%–88.6%) | 83.8% (77.6%–90.0%) | 75.8% (69.0%–82.5%) |
| Type 1 PPV (%95 CI) | 32.9% (22.0%–43.8%) | 42.2% (29.6%–54.8%) | 42.0% (29.7%–54.2%) | 61.9% (46.1%–77.6%) |
| Type 1 Specificity (%95 CI) | 90.4% (85.7%–95.0%) | 93.7% (90.5%–96.9%) | 93.5% (90.4%–96.7%) | 97.4% (95.7%–99.1%) |
| Type 1 NPV (%95 CI) | 99.1% (98.7%–99.5%) | 99.0% (98.5%–99.4%) | 99.0% (98.6%–99.5%) | 98.6% (98.2%–99.1%) |
| Type 1 Prevalence Algorithm | 13.6% | 10.3% | 10.6% | 6.5% |
| Type 1 Prevalence Gold Standard | 5.3% | 5.3% | 5.3% | 5.3% |
| Type 2 Sensitivity (%95 CI) | 90.6% (85.9%–95.2%) | 93.9% (90.7%–97.1%) | 93.7% (90.5%–96.9%) | 97.6% (95.9%–99.3%) |
| Type 2 PPV (%95 CI) | 98.9% (98.5%–99.4%) | 98.7% (98.3%–99.2%) | 98.8% (98.4%–99.3%) | 98.5% (98.0%–98.9%) |
| Type 2 Specificity (%95 CI) | 83.3% (77.2%–89.4%) | 79.9% (73.5%–86.4%) | 81.7% (75.4%–87.9%) | 74.7% (68.0%–81.3%) |
| Type 2 NPV (%95 CI) | 34.7% (23.2%–46.1%) | 44.0% (30.9%–57.0%) | 43.8% (31.1%–56.6%) | 65.4% (48.8%–81.9%) |
| % Correctly classified | 89.9% | 92.9% | 92.8% | 96.1% |
Notes: Type 1 Prevalence: Algorithm prevalence refers to the prevalence of type 1 generated by applying the algorithm to the sample. Type 1 Prevalence: Gold Standard refers to the prevalence of type 1 according to the “true” diabetes type determined through the use of structured and unstructured electronic health records data. All estimates are weighted to the full diabetes population of UNC Healthcare. Bold numbers signify the primary metric used to evaluate algorithms.
PPV = positive predictive value; NPV = negative predictive value; Dx = diagnosis; CI = confidence interval.
Type 1 Prevalence and Performance of Top-Performing Survey Algorithm in Subpopulations (N = 698).
| Type 1 Sensitivity (95% CI) | Type 1 PPV (95% CI) | Type 1 Specificity (95% CI) | Type 1 NPV (95% CI) | Average of Type 1 Sensitivity and PPV | Type 1 Prevalence – Algorithm | Type 1 Prevalence – Gold Standard | |
|---|---|---|---|---|---|---|---|
| 92.9% (87.4%–98.4%) | 95.0% (89.9%–100.0%) | 99.7% (99.4%–100.0%) | 99.6% (99.3%–99.9%) | 5.0% | 5.1% | ||
| 90.2% (83.7%–96.8%) | 72.1% (53.2%–90.9%) | 98.0% (96.1%–99.8%) | 99.4% (99.0%–99.8%) | 6.9% | 5.5% | ||
| 88.6% (80.5%–96.7%) | 90.2% (80.8%–99.5%) | 96.9% (93.6%–100.0%) | 96.4% (93.3%–99.4%) | 23.9% | 24.3% | ||
| 94.9% (89.2%–100.0%) | 82.8% (62.1%–100.0%) | 99.0% (97.5%–100.0%) | 99.7% (99.4%–100.0%) | 5.8% | 5.0% | ||
| 89.9% (81.2%–98.6%) | 72.6% (47.6%–97.6%) | 99.1% (98.1%–100.0% | 99.7% (99.5%–99.9%) | 3.1% | 2.5% | ||
| 92.0% (86.8%–97.2%) | 93.6% (86.0%–97.2%) | 99.6% (99.0%–100.0%) | 99.5% (99.1%–99.8%) | 6.1% | 6.2% | ||
| 84.2% (74.2%–94.2%) | 39.1% (13.6%–64.6%) | 97.2% (94.2%–100.0%) | 99.7% (99.4%–99.9%) | 4.6% | 2.1% | ||
| 82.7% (60.5%–100.0%) | 57.4% (16.4%–98.5%) | 98.8% (96.9%–100.0%) | 99.7% (99.2%–100.0%) | 2.8% | 2.0% | ||
| 72.5% (38.3%–100.0%) | 12.7% (0.0%–33.9%) | 94.1% (86.4%–100.0%) | 99.7% (99.3%–100.0%) | 6.7% | 1.2% | ||
| 87.7% (78.3%–97.2%) | 68.5% (41.8%–95.2%) | 98.4% (96.6%–100.0%) | 99.5% (99.1%–99.9%) | 4.8% | 3.7% | ||
| 93.0% (88.2%–97.9%) | 97.4% (94.1%–100.0%) | 99.8% (99.6%–100.0%) | 99.5% (99.2%–99.9%) | 6.3% | 6.6% |
Notes: Type 1 Prevalence: Algorithm prevalence refers to the prevalence of type 1 generated by applying the algorithm to the sample. Type 1 Prevalence: Gold Standard refers to the prevalence of type 1 according to the “true” diabetes type determined through the use of structured and unstructured electronic health records data. All estimates are weighted to the full diabetes population of UNC Healthcare. Bold numbers signify the primary metric used to evaluate algorithms.
PPV = positive predictive value; NPV = negative predictive value; CI = confidence interval.