Literature DB >> 31612200

Validation of an Optimized Algorithm for Identifying Persons Living With Diagnosed HIV From New York State Medicaid Data, 2006-2014.

Sarah E Macinski, Jayleen K L Gunn, Mona Goyal, Charles Neighbors, Rajeev Yerneni, Bridget J Anderson.   

Abstract

Algorithms are regularly used to identify persons living with diagnosed human immunodeficiency virus (HIV) (PLWDH) in Medicaid data. To our knowledge, there are no published reports of an HIV algorithm from Medicaid claims codes that have been compared with an HIV surveillance system to assess its sensitivity, specificity, positive predictive value, and negative predictive value in identifying PLWDH. Therefore, our aims in this study were to 1) develop an algorithm that could identify PLWDH in New York State Medicaid data from 2006-2014 and 2) validate this algorithm using the New York State HIV surveillance system. Classification and regression tree analysis identified 16 nodes that we combined to create a case-finding algorithm with 5 criteria. This algorithm identified 86,930 presumed PLWDH, 88.0% of which were verified by matching to the surveillance system. The algorithm yielded a sensitivity of 94.5%, a specificity of 94.4%, a positive predictive value of 88.0%, and a negative predictive value of 97.6%. This validated algorithm has the potential to improve the utility of Medicaid data for assessing health outcomes and programmatic interventions.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health.

Entities:  

Keywords:  HIV; Medicaid; acquired immunodeficiency syndrome; algorithms; classification and regression trees; validation studies

Year:  2020        PMID: 31612200      PMCID: PMC7306686          DOI: 10.1093/aje/kwz225

Source DB:  PubMed          Journal:  Am J Epidemiol        ISSN: 0002-9262            Impact factor:   4.897


Abbreviations

acquired immunodeficiency syndrome All Patient Refined Diagnosis Related Groups classification and regression tree Centers for Disease Control and Prevention case-finding algorithm Health Home human immunodeficiency virus International Classification of Diseases, Ninth Revision, Clinical Modification negative predictive value New York State New York State Department of Health persons living with diagnosed HIV positive predictive value The federal Medicaid program is the largest health insurer of people who have been diagnosed with human immunodeficiency virus (HIV) infection (40%) (1, 2). Consequently, Medicaid data are a robust source of health-care information and are often used to study multiple facets of HIV disease in the United States, including health-care expenses (3–5), health disparities (6), trends in hospitalization (7, 8), and quality of HIV care (9–11). The size of the Medicaid data set allows for high statistical power and subgroup analysis, as well as the study of low-prevalence events, such as HIV diagnosis, with high statistical power (12–14). While there are many advantages of using Medicaid data to study HIV care outcomes, data are collected for billing rather than clinical or research purposes (14–17). Medicaid researchers have indicated problems with accuracy and reliability when using Medicaid data (16, 18–25), as claims codes often vary by procedure, type of medical encounter, and clinical condition (14, 22–24, 26). A variety of algorithms have been developed to identify likely persons living with diagnosed HIV (PLWDH) within the Medicaid population (4, 7, 9–11, 13, 22, 23, 27–29). The accuracy and reliability of using diagnosis information in Medicaid data are higher for inpatient claims than for outpatient claims (13), but the number of hospitalizations for PLWDH are decreasing (8, 30). Because most Medicaid claims for PLWDH are for outpatient services (13) and not for diagnostic purposes, new algorithms need be sensitive to a variety of procedure and service codes, in addition to diagnosis codes. Optimal algorithms should share attributes of the Centers for Disease Control and Prevention (CDC)-defined “optimal surveillance system” (31), such as simplicity, flexibility, representativeness, sensitivity, and a high positive predictive value (PPV) (32). Despite the regular use of algorithms to identify PLWDH from Medicaid data (7, 9–11, 13, 22, 27), to our knowledge there are no published reports of an HIV algorithm derived from Medicaid data that has been validated by matching the results to a population-based HIV surveillance system (the gold standard) to test the algorithm’s sensitivity, specificity, PPV, and negative predictive value (NPV). The New York State (NYS) Medicaid program utilizes its own internally developed algorithm to identify HIV-positive persons. That algorithm has not been validated against the NYS HIV surveillance system and has not been published. NYS has approximately 113,000 PLWDH (33) and more PLWDH on Medicaid than any other state (34). A validated algorithm that accurately identifies PLWDH in NYS Medicaid data would enable researchers and policy-makers to better understand the impact of expanded Medicaid programs for PLWDH (34). Thus, our aims in this study were 2-fold: 1) to develop an algorithm for identifying PLWDH using NYS Medicaid data and 2) to validate the algorithm using the population-based NYS HIV surveillance system.

METHODS

Case identification and validation

Name-based reports for all HIV-related laboratory test results (e.g., CD4 cell counts and percentages, viral load tests, genotype resistance testing) are received by the New York State Department of Health’s (NYSDOH) HIV surveillance system for persons who reside or receive HIV-related care in NYS. This information is supplemented with data from the medical provider and partner report forms. In accordance with NYS public health law, clinicians are required to submit a report for all new or previously unreported diagnoses of HIV infection/acquired immunodeficiency syndrome (AIDS) within 14 days of diagnosis. The surveillance system routinely undergoes quality control and verification processes to ensure quality and completeness of the data. For this analysis, PLWDH were defined as persons with confirmed cases of HIV infection per the revised 2014 CDC definition (35). Only confirmed HIV diagnoses (regardless of disease stage) that met the CDC case definition were included in these analyses (Figure 1).
Figure 1

Identification and evaluation of presumptive persons living with diagnosed HIV (PLWDH) from New York State (NYS) Medicaid data, 2006–2014. CART, classification and regression tree; HIV, human immunodeficiency virus; NPV, negative predictive value; PPV, positive predictive value.

Identification and evaluation of presumptive persons living with diagnosed HIV (PLWDH) from New York State (NYS) Medicaid data, 2006–2014. CART, classification and regression tree; HIV, human immunodeficiency virus; NPV, negative predictive value; PPV, positive predictive value. Medicaid encounter and claims data for 2006–2014 were obtained from the NYS Medicaid Data Mart for persons aged 18–64 years who met eligibility criteria for the NYS Medicaid Health Home (HH) Program. To qualify for the HH Program in NYS, individuals must have 2 or more medically documented chronic conditions included in the “major” categories of the 3M (3M Company, St. Paul, Minnesota) Clinical Risk Groups (e.g., alcohol and substance use disorder, mental health, cardiovascular disease, metabolic disease, or respiratory disease) or have a single qualifying diagnosis such as HIV, AIDS, and/or a serious mental illness (36) (Figure 1). Because of the resource-heavy nature of matching large data sets, it was infeasible to compare all New Yorkers enrolled in Medicaid against the NYS HIV surveillance system; rather, for this study, an inclusive cohort of individuals—designated “potential PLWDH”—were identified from Medicaid-enrolled HH-eligible persons (n = 269,257). Published literature (3–5, 7, 9–11, 13, 22, 27, 28, 37–41) and an internal NYSDOHAIDS Institute algorithm (42) were used to establish criteria for identification of potential PLWDH in Medicaid data. Criteria for identification of PLWDH in the HH-eligible Medicaid data included: 1) a diagnosis code for HIV (3–5, 7, 9–11, 13, 22, 27, 28, 37–40, 42); 2) rate codes for AIDS hospice care (4, 38, 42), HIV-related outpatient services (4, 37), or an HIV Special Needs Plan (4, 38, 42); 3) a National Drug Code for an HIV antiretroviral medication (4, 13, 37, 38, 40, 43, 44); 4) a procedure code for HIV viral quantification, phenotyping, or genotyping (4, 9, 38, 40, 43, 44); or 5) 3 claims with a 3M Clinical Risk Group between 102 and 105 (45). Individuals were also identified as potential PLWDH if they had a combination of claims with codes for HIV-related outpatient services (4, 5, 13, 37, 38, 42, 44), provider specialties (4, 37, 42), or a diagnosis of an opportunistic infection (4, 42, 44). (See Web Table 1, available at https://academic.oup.com/aje, for a detailed description of the inclusion criteria.) Demographic Characteristics of Potential and Presumptive Persons Living With Diagnosed HIV in Medicaid Data, New York State, 2006–2014 Abbreviations: HIV, human immunodeficiency virus; PLWDH, persons living with diagnosed HIV. a Individuals were often identified in multiple years, so the sum for all years may be greater than the total number of individuals. b Sex was unknown for 3 persons. A potential PLWDH was categorized as a true case if the record could be matched to an existing record in the NYS HIV surveillance system, which was considered the gold standard for this analysis. Potential PLWDH were probabilistically matched to the surveillance system using combinations of name, sex, race/ethnicity, address, date of birth, health-care provider, and identification numbers from Medicaid and the NYS Department of Corrections and Community Supervision. Matching was conducted using IBM InfoSphere QualityStage software (IBM, Armonk, New York). Through probabilistic matching and manual review, 80,899 persons from NYS Medicaid data were confirmed as true PLWDH (Table 1); the software identified 63,047 of these cases as exact matches, and 17,852 were identified through manual review of uncertain matches by trained surveillance staff.
Table 1

Demographic Characteristics of Potential and Presumptive Persons Living With Diagnosed HIV in Medicaid Data, New York State, 2006–2014

Characteristic No. of Potential PLWDH No. of Presumptive PLWDH
Total269,25786,930
Yeara
 2006 63,65041,136
 2007 59,14540,750
 2008 57,47541,645
 2009 61,42142,857
 2010 64,91144,443
 2011 68,34445,144
 2012 72,01745,446
 2013100,89247,067
 2014160,72549,177
Sexb
 Female134,75932,556
 Male134,49554,374
Age group, years
 Missing data 5,920 4,628
 ≤19 4,513 1,732
 20–24 19,139 4,311
 25–29 23,466 5,636
 30–39 50,51816,293
 40–49 71,45230,012
 50–59 62,99419,777
 ≥60 31,255 4,541
Race/ethnicity
 Non-Hispanic white 52,88411,705
 Non-Hispanic black 78,44338,493
 Non-Hispanic Asian/ Pacific Islander 17,037 981
 Non-Hispanic American Indian/Alaska Native 409 75
 Hispanic 76,58527,078
 Other 13,840 4,376
 Unknown 30,059 4,222

Abbreviations: HIV, human immunodeficiency virus; PLWDH, persons living with diagnosed HIV.

a Individuals were often identified in multiple years, so the sum for all years may be greater than the total number of individuals.

b Sex was unknown for 3 persons.

Demographic information was obtained from the Medicaid eligibility data. Age was defined as an individual’s age at which he/she was first identified for inclusion in this study. Race/ethnicity was defined as non-Hispanic white, non-Hispanic black, non-Hispanic Asian or Pacific Islander, non-Hispanic American Indian or Alaska Native, Hispanic, or other. Persons without a race or ethnicity listed were recoded as having an unknown race/ethnicity. Records were obtained for each month in which an individual was eligible for Medicaid. Because a person’s eligibility status can fluctuate, the Medicaid enrollees may have had data in some years and not in others. Therefore, data were evaluated by year. Components of a Case-Finding Algorithm Used to Identify 86,930 Presumptive Persons Living With Diagnosed HIV in Medicaid Data, New York State, 2006–2014 Abbreviations: APR-DRG, All Patient Refined Diagnosis Related Groups; HIV, human immunodeficiency virus; ICD-9-CM, International Classification of Diseases, Ninth Revision, Clinical Modification; PLWDH, persons living with diagnosed HIV. a 46.5% of presumptive PLWDH were captured by more than 1 component of the case-finding algorithm, so the column percentages may not add to 100. b 53.6% of presumptive PLWDH were captured by only 1 component of the case-finding algorithm, so the column percentages may not add to 100.

Statistical evaluation of algorithm performance

Classification and regression tree (CART) analysis was used to predict which of the potential PLWDH had diagnosed HIV (“presumptive PLWDH”). CART statistically determines which factors in a model have the greatest explanatory power (46). The data from 2013 were chosen to develop the case-finding algorithm (CFA) through CART analysis because the 2013 data allowed for both retrospective (2006–2012) and prospective (2014) cross-validation through unique CART analyses. Because of a natural break in the data, nodes that had a match rate of 69% or higher were retained for the CFA (Web Figure 1). The sensitivity, specificity, PPV, and NPV of the CFA were evaluated among the potential PLWDH by comparing the presumptive PLWDH with the gold standard NYS HIV surveillance system. The flexibility and representativeness of the CFA were assessed by stratifying the population by claim year, sex, race/ethnicity, and age.

Ethical considerations

This study was designated as “Exemption 5—Public Benefit or Service Programs” by the NYSDOH Institutional Review Board. Claims data for 2006–2014 were used pursuant to Data Exchange and Application and Data Use Agreements between the NYSDOH Bureau of HIV/AIDS Epidemiology, the NYSDOH Office of Health Insurance Programs, and the National Center on Addiction and Substance Abuse.

RESULTS

Demographic characteristics of potential PLWDH

The potential PLWDH included 269,257 unduplicated individuals meeting the inclusion criteria (Table 1). There were equal numbers of males and females (134,495 and 134,759, respectively; sex was unknown for 3 persons). More than half of the potential PLWDH were non-Hispanic black (29.1%) or Hispanic (28.4%). Persons aged 40–49 years constituted the largest category of the study population (26.5%). The number of people identified for inclusion increased over the study years. Between 2006 and 2012, the annual number of persons identified for inclusion ranged between 57,475 (2008) and 72,017 (2012). Coinciding with the implementation of the NYS HH Program in 2013, the number of potential PLWDH that met inclusion criteria increased to 100,892 in 2013 and 160,725 in 2014.

Use of CART analysis to identify presumptive PLWDH

The 2013 CART model (Web Figure 1) identified 21 unique nodes, each representing a combination of selection criteria. Sixteen of these nodes indicated presumptive PLWDH and were combined to create the CFA (Table 2). The CFA identified “presumptive PLWDH” if a person met the criteria in one of the following categories: 1) 2 outpatient claims with an International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) (38) diagnosis code or an All Patients Refined Diagnosis Related Groups (APR-DRG) (42) code indicating an HIV diagnosis; 2) 1 inpatient claim with an ICD-9-CM diagnosis code or an APR-DRG code indicating an HIV diagnosis; 3) 1 claim for an HIV Special Needs Plan rate code; 4) 1 claim with a procedure code for viral quantification, phenotyping, or genotyping combined with 1 National Drug Code for an antiretroviral medication and an ICD-9-CM diagnosis code for an opportunistic infection; or 5) 1 claim for an HIV antiretroviral medication plus 1 claim with a rate code for outpatient services, such as an HIV clinic, an HIV medical evaluation visit, or HIV HH case management. By applying these 5 criteria to the whole sample, we identified 86,930 presumptive PLWDH in the Medicaid claims data. As Table 2 demonstrates, 82.6% of these presumptive PLWDH had at least 2 outpatient claims with a code indicating an HIV diagnosis.
Table 2

Components of a Case-Finding Algorithm Used to Identify 86,930 Presumptive Persons Living With Diagnosed HIV in Medicaid Data, New York State, 2006–2014

Component Total a Component Contribution Unique b Component Contribution
No. of Persons % No. of Persons %
2 outpatient claims with an ICD-9-CM diagnosis code or an APR-DRG code indicating an HIV diagnosis71,77682.633,37038.4
1 inpatient claim with an ICD-9-CM diagnosis code or an APR-DRG code indicating an HIV diagnosis27,50231.6 9,77911.3
1 claim for an HIV Special Needs Plan rate code19,75522.7 1,772 2.0
1 claim with a procedure code for viral quantification, phenotyping, or genotyping combined with 1 National Drug Code for an antiretroviral medication and an ICD-9-CM diagnosis code for an opportunistic infection14,08616.2 416 0.5
1 claim for an HIV antiretroviral medication plus 1 claim with a rate code for outpatient services16,85719.4 1,210 1.4

Abbreviations: APR-DRG, All Patient Refined Diagnosis Related Groups; HIV, human immunodeficiency virus; ICD-9-CM, International Classification of Diseases, Ninth Revision, Clinical Modification; PLWDH, persons living with diagnosed HIV.

a 46.5% of presumptive PLWDH were captured by more than 1 component of the case-finding algorithm, so the column percentages may not add to 100.

b 53.6% of presumptive PLWDH were captured by only 1 component of the case-finding algorithm, so the column percentages may not add to 100.

Demographic characteristics of presumptive PLWDH identified by the CFA

The CFA identified more males than females (62.6% and 37.5%, respectively) (Table 1). Presumed PLWDH varied by race/ethnicity (non-Hispanic black, 44.3%; Hispanic, 31.2%; non-Hispanic white, 13.5%; non-Hispanic Asian/Pacific Islander, 1.1%; non-Hispanic American Indian/Alaska Native, 0.1%; other, 5.0%). Persons aged 40–49 years were the largest age group identified, constituting 34.5% of the presumptive PLWDH. On average, the CFA identified 44,184 presumptive PLWDH each year between 2006 and 2014, with a slight increase in the yearly number of presumptive PLWDH identified between 2011 and 2014.

Comparison of potential PLWDH and presumptive PLWDH

Demographic characteristics varied between the potential PLWDH in the Medicaid data and the presumptive PLWDH identified from the CART analysis (Table 1). The proportion of females dropped from 50.0% to 37.5%. The proportion increased for persons aged 40–49 years (from 26.5% to 34.5%) and decreased for those aged 20–24 years (from 7.1% to 5.0%), 25–29 years (from 8.7% to 6.5%), and ≥60 years (from 11.6% to 5.2%). The proportions of non-Hispanic black (from 29.1% to 44.3%) and Hispanic (from 28.4% to 31.1%) presumptive PLWDH increased. The proportions of non-Hispanic whites (from 19.6% to 13.5%), non-Hispanic Asians/Pacific Islanders (from 6.3% (from 11.2% to 4.9%) decreased. Roughly one-third of potential PLWDH and 88.0% of presumptive PLWDH were true cases (Table 3). A higher percentage of true cases were identified among the presumptive PLWDH for each of the demographic categories assessed.
Table 3

Numbers of Potential and Presumptive Persons Living With Diagnosed HIV in New York State Medicaid Data Matched to New York State HIV Surveillance Data, 2006–2014

Characteristic Potential PLWDH Presumptive PLWDH
Total No. of Persons True Cases Total No. of Persons True Cases
No. % No. %
Total269,25780,89930.186,93076,46788.0
Yeara
 2006 63,65048,95176.941,13639,75196.6
 2007 59,14544,79375.740,75039,73997.5
 2008 57,47545,30178.841,64540,55297.4
 2009 61,42146,36175.542,85741,62897.1
 2010 64,91147,57773.344,44343,04996.9
 2011 68,34448,60171.145,14443,96897.4
 2012 72,01749,19268.345,44644,03696.9
 2013100,89249,73949.347,06744,19993.9
 2014160,72555,02934.249,17744,80091.1
Sexb
 Female134,75929,44521.932,55628,11586.4
 Male134,49551,45438.354,37448,35288.9
Age group, years
 Missing data 5,920 4,54176.7 4,628 4,37494.5
 ≤19 4,513 1,67237.1 1,732 1,59792.2
 20–24 19,139 4,04521.1 4,311 3,77387.5
 25–29 23,466 5,16722.0 5,636 4,85986.2
 30–39 50,51815,32130.316,29314,48388.9
 40–49 71,45228,38439.730,01227,00590.0
 50–59 62,99418,12428.819,77717,03586.1
 ≥60 31,255 3,64511.7 4,541 3,34173.6
Race/ethnicity
 Non-Hispanic white 52,88410,64920.111,70510,13086.5
 Non-Hispanic black 78,44334,75244.338,49334,69390.1
 Non-Hispanic Asian/Pacific Islander 17,037 779 4.6 981 75176.6
 Non-Hispanic American Indian/Alaska Native 409 7518.3 75 7194.7
 Hispanic 76,58523,63030.927,07823,66787.4
 Other 13,840 3,68726.6 4,376 3,63183.0
 Unknown 30,059 7,32724.4 4,222 3,51583.3

Abbreviations: HIV, human immunodeficiency virus; PLWDH, persons living with diagnosed HIV.

a Individuals were often identified in multiple years, so the column percentages may not add to 100.

b Sex was unknown for 3 persons.

Numbers of Potential and Presumptive Persons Living With Diagnosed HIV in New York State Medicaid Data Matched to New York State HIV Surveillance Data, 2006–2014 Abbreviations: HIV, human immunodeficiency virus; PLWDH, persons living with diagnosed HIV. a Individuals were often identified in multiple years, so the column percentages may not add to 100. b Sex was unknown for 3 persons.

Measures of validity for the CFA

The CFA had a sensitivity of 94.5%, a specificity of 94.4%, a PPV of 88.0%, and an NPV of 97.6% (Table 4). The measures of validity were similar between male and female presumptive PLWDH. Sensitivity was 94.0% for males and 95.5% for females, and specificity was 92.8% for males and 95.8% for females. The PPV was 88.9% for males and 86.4% for females, while the NPV was 96.1% for males and 98.7% for females.
Table 4

Validity Statistics for a Case-Finding Algorithm Created to Identify Presumptive Persons Living With Diagnosed HIV in Medicaid Data, New York State, 2006–2014

Characteristic Total No. of PLWDH Match to Surveillance Data Measure of Validity
No. of Persons % 95% CI Sensitivity, % Specificity, % PPV, % NPV, %
Total86,93076,46788.087.8, 88.294.594.488.097.6
Yeara
 200641,13639,75196.696.5, 96.881.290.696.659.1
 200740,75039,73997.597.4, 97.788.793.097.572.5
 200841,64540,55297.491.2, 97.589.591.097.470.0
 200942,85741,62897.197.0, 97.389.892.097.174.9
 201044,44343,04996.996.7, 97.090.592.096.977.9
 201145,14443,96897.497.3, 97.590.594.097.480.0
 201245,44644,03696.996.7, 97.189.593.896.980.6
 201347,06744,19993.993.7, 94.188.994.493.989.7
 201449,17744,80091.190.9, 91.481.495.991.190.8
Sexb
 Female32,55628,11586.486.0, 86.795.595.886.498.7
 Male54,37448,35288.988.7, 89.294.092.888.996.1
Age group, years
 Missing data 4,628 4,37494.593.9, 95.296.381.694.587.1
 ≤19 1,732 1,59792.290.9, 93.595.595.292.297.3
 20–24 4,311 3,77387.586.5, 88.593.396.487.598.2
 25–29 5,636 4,85986.285.3, 87.194.095.886.298.3
 30–3916,29314,48388.988.4, 89.494.594.988.997.6
 40–4930,01227,00590.089.6, 90.395.193.090.096.7
 50–5919,77717,03586.185.7, 86.694.093.986.197.5
 ≥60 4,541 3,34173.672.3, 74.991.795.773.698.9
Race/ethnicity
 Non-Hispanic white11,70510,13086.585.9, 87.292.396.486.598.0
 Non-Hispanic black38,49334,69390.189.8, 90.495.391.790.196.1
 Non-Hispanic Asian/Pacific Islander 981 75176.673.9, 79.292.598.776.699.6
 Non-Hispanic American Indian/Alaska Native 75 7194.789.6, 99.893.498.994.798.6
 Hispanic27,07823,66787.487.0, 87.896.093.987.498.2
 Other 4,376 3,63183.081.9, 84.193.793.183.097.6
 Unknown 4,212 3,51583.582.3, 84.685.695.383.595.9

Abbreviations: CI, confidence interval; HIV, human immunodeficiency virus; NPV, negative predictive value; PLWDH, persons living with diagnosed HIV; PPV, positive predictive value.

a Individuals were often identified in multiple years, so the column percentages may not add to 100.

b Sex was unknown for 3 persons.

Validity Statistics for a Case-Finding Algorithm Created to Identify Presumptive Persons Living With Diagnosed HIV in Medicaid Data, New York State, 2006–2014 Abbreviations: CI, confidence interval; HIV, human immunodeficiency virus; NPV, negative predictive value; PLWDH, persons living with diagnosed HIV; PPV, positive predictive value. a Individuals were often identified in multiple years, so the column percentages may not add to 100. b Sex was unknown for 3 persons. The measures of validity for presumptive PLWDH were similar when they were compared across racial/ethnic categories (Table 4). Sensitivity ranged from 92.3% among non-Hispanic white presumptive PLWDH to 96.0% among Hispanic presumptive PLWDH. Specificity ranged from 91.7% among non-Hispanic black presumptive PLWDH to 98.9% among non-Hispanic American Indian/Alaska Native presumptive PLWDH. The PPV ranged from 76.6% among non-Hispanic Asian/Pacific Islander presumptive PLWDH to 94.7% among non-Hispanic American Indian/Alaska Native presumptive PLWDH. The NPV ranged from 96.1% among non-Hispanic black presumptive PLWDH to 99.6% among non-Hispanic Asian/Pacific Islander presumptive PLWDH. The measures of validity were similar across age groups (Table 4). Sensitivity ranged from 91.7% among presumptive PLWDH aged ≥60 years to 95.5% for presumptive PLWDH aged 18–19 years. Specificity ranged from 93.0% among presumptive PLWDH aged 40–49 years to 96.4% among presumptive PLWDH aged 20–24 years. The PPV ranged from 73.6% among presumptive PLWDH aged ≥60 years to 92.2% among presumptive PLWDH aged 18–19 years. The NPV varied from 96.7% among presumptive PLWDH aged 40–49 years to 98.9% among presumptive PLWDH aged ≥60 years. The measures of validity improved over the study years (Table 4). Sensitivity ranged from 81.2% in 2006 to 90.5% in 2010 and 2011. Specificity increased, from 90.6% in 2006 to 95.9% in 2014. The PPV ranged from 91.1% in 2014 to 97.5% in 2007. The NPV increased from 59.1% in 2006 to 90.8% in 2014.

DISCUSSION

The CFA developed through CART analysis successfully identified PLWDH from the NYS Medicaid data. The CFA was designed with the CDC’s 2014 updated guidelines for evaluating public health surveillance systems and validated using multiple measures of accuracy (sensitivity, specificity, PPV, and NPV). All measures of validity demonstrated that the CFA had a high likelihood of accurately capturing HIV status among persons listed in the Medicaid data. German et al. (32) defined a flexible system as one that can adapt easily to changing information and stated that flexibility is best evaluated through retrospective analysis. The CFA was created using data from 2013 and retrospectively applied to data from 2006–2012, and then prospectively applied to 2014 data. This time period included the implementation of the Affordable Care Act (47), expanded Medicaid service, and the initiation of the HH Program in NYS (48). The CFA adapted well to these changes; each year, the sensitivity was above 80% and the PPV was above 90%. While the CFA presented in this paper is not as simple as the CFAs used by other researchers (9–11, 13, 22), the CFA presented here was easy to use, easy to understand, and replicable by year. Researchers should be able to use this algorithm to identify PLWDH in their states’ Medicaid data. The NPV indicated that 98% of PLWDH identified by the CFA as HIV-negative were truly negative (not diagnosed with HIV) based on comparison with the gold standard NYS HIV surveillance system. Similarly, the PPV indicated that 88% of the PLWDH identified by the CFA were truly positive. Additionally, the high specificity (94%) observed with the CFA is important in an algorithm, because specificity and PPV are closely related (49) and a decrease in specificity leads to both a decrease in PPV and an increase in the number of people falsely identified by the algorithm. The overall sensitivity was 95%, indicating that the CFA identified nearly all of the true cases from the potential PLWDH. These measures of validity varied but remained high for each of the racial/ethnic, sex, and age groups examined. This demonstrated that the CFA works well for different groups and is representative for all of the subgroups tested (Table 4). A representative system better characterizes the epidemiologic features of events and interventions (32).

Strengths and weaknesses

The CFA performed well with regard to important characteristics of a public health surveillance system, including specificity, sensitivity, PPV, and NPV, demonstrating patterns of flexibility and representativeness across several years. However, limitations of this study should be noted. In 2017, Ngugi et al. (50) used content analysis methods to systematically evaluate the data quality of the CDC’s HIV surveillance data reports and identified problems in several areas, including potential incomplete and incorrect gaps in the HIV surveillance data. Because of NYS’s stringent name-based reporting laws, which require all HIV-related laboratory test results (e.g., CD4 cell counts and percentages, viral load, genotypic resistance) for persons who reside or receive HIV-related care in the state to be reported to the NYSDOH, it is thought that the effect of underreporting to the NYS HIV surveillance system is minimal. However, there is potential that PLWDH may not be reported in NYS if 1) they received HIV testing or care under a research protocol approved by an institutional review board, 2) they received anonymous testing, 3) testing and diagnosis were performed in the federal health-care system (i.e., the Veterans Affairs system, the military, or federal prisons), or 4) they received testing from an out-of-state laboratory that failed to report this information to the NYSDOH (51). Even with these limitations, utilizing the HIV surveillance system as the gold standard for the development and evaluation of the CFA is thought to have provided the most accurate picture of PLWDH in NYS. A second limitation of the CFA is that the analysis did not include all Medicaid data. Therefore, some potential PLWDH who had Medicaid coverage may not have been included in the analysis. However, it is believed that the number of missing cases was minimal, as the initial selection of potential PLWDH was designed to be as inclusive as possible. In NYS, there were over 6 million people on Medicaid in 2013 (1), of whom fewer than 2% were diagnosed with HIV (34). Increasing the number of true negatives by millions through screening of the entire Medicaid database would increase the NPV and specificity to a level where the estimates would be close to 100%. This would be not only costly but unfeasible in a short period of time. In addition, the annual number of potential PLWDH in this algorithm almost tripled between 2006 and 2014, with a sharp increase in potential PLWDH occurring in 2013 and 2014. This correlates with the increasing number of enrollees in the NYS HH Program and was validated by examining the numbers of HH outreach rate codes by year. This change increased the numbers of true negatives in the analysis but had minimal impact on the number of presumptive PLWDH, adding confidence to the assumption that running the CFA using the entire Medicaid data set would have a limited impact. Thirdly, the NYS HIV surveillance system and the Medicaid data system collect demographic information differently, making it difficult to compare demographic distributions of the CFA with those of the HIV surveillance system. An example is that the HIV surveillance system collects information on both sex at birth and current sex; however, Medicaid data do not differentiate between these 2 variables. With regard to demographic characteristics that provided a direct comparison, the CFA and the HIV surveillance system yielded similar proportions (data not shown (33)). This adds to the confidence that the CFA was not biased on the basis of individual demographic information. Fourthly, a limit to the generalizability of the CFA is that it uses NYS-specific Medicaid rate codes. However, the impact of these codes is minimal, as 46.5% of the presumptive PLWDH identified by the CFA had claims in 2 or more of the categories, and 91.8% of the presumptive PLWDH identified using rate codes had claims in 2 or more of the categories. It is also important to note that algorithms developed for internal programmatic purposes may need to consider the inclusion of program-specific billing codes. Lastly, the CFA used ICD-9-CM diagnosis codes from the Medicaid billing data. As of October 1, 2015, the ICD-9-CM was replaced by the International Classification of Diseases, Tenth Revision, Clinical Modification (52). To properly apply this CFA to Medicaid billing data after September 2015, the ICD-9-CM diagnosis codes would need a cross-walk comparison with International Classification of Diseases, Tenth Revision, Clinical Modification codes.

Implications

Utilizing billing data is an unobtrusive method of analysis; therefore, the privacy risk posed to PLWDH whose information is being used is minimal. The successful derivation and application of the CFA have promising implications for future HIV analyses. Utilizing the CFA could help improve both clinical research and program evaluation, leading to better care for PLWDH. Applied to the whole Medicaid data set, the CFA may have the ability to look for PLWDH who have not been reported to the NYSDOH HIV surveillance system. Identifying these potential gaps in reporting would allow PLWDH to be better surveilled, further bolstering the HIV surveillance system.

Conclusion

Medicaid is a robust data system, containing treatment information not captured in the NYS HIV surveillance system, such as claims for health-care provider visits, tests, and antiretroviral medications. Researchers have employed other algorithms to answer questions using Medicaid data, but those algorithms have not been evaluated with respect to their sensitivity, specificity, PPV, and NPV by comparison with the gold standard, an HIV surveillance system. The CFA described in this paper was evaluated against this gold standard and performed well. It was deemed simple, flexible, and representative and had high sensitivity and PPV. It successfully identified PLWDH in the Medicaid data, improving the utility of those data for assessing health outcomes and programmatic interventions such as HHs. Findings from this paper provide future researchers with a tool for identifying PLWDH through Medicaid data, potentially helping to increase linkage with HIV care and retention in care among PLWDH and decreasing future transmission. Click here for additional data file.
  34 in total

1.  Use of Medicaid data to explore community characteristics associated with HIV prevalence among beneficiaries with schizophrenia.

Authors:  James Walkup; Ayse Akincigil; Donald R Hoover; Michele J Siegel; Shahla Amin; Stephen Crystal
Journal:  Public Health Rep       Date:  2011 Sep-Oct       Impact factor: 2.792

2.  Specificity and sensitivity of claims-based algorithms for identifying members of Medicare+Choice health plans that have chronic medical conditions.

Authors:  Thomas S Rector; Steven L Wickstrom; Mona Shah; N Thomas Greeenlee; Paula Rheault; Jeannette Rogowski; Vicki Freedman; John Adams; José J Escarce
Journal:  Health Serv Res       Date:  2004-12       Impact factor: 3.402

3.  Thirty-day hospital readmissions for adults with and without HIV infection.

Authors:  S A Berry; J A Fleishman; R D Moore; K A Gebo
Journal:  HIV Med       Date:  2015-07-14       Impact factor: 3.180

4.  Trends in reasons for hospitalization in a multisite United States cohort of persons living with HIV, 2001-2008.

Authors:  Stephen A Berry; John A Fleishman; Richard D Moore; Kelly A Gebo
Journal:  J Acquir Immune Defic Syndr       Date:  2012-04-01       Impact factor: 3.731

5.  Variation in outcomes of quality measurement by data source.

Authors:  Heather Angier; Rachel Gold; Charles Gallia; Allison Casciato; Carrie J Tillotson; Miguel Marino; Rita Mangione-Smith; Jennifer E DeVoe
Journal:  Pediatrics       Date:  2014-06       Impact factor: 7.124

6.  Are claims data accurate enough to identify patients for performance measures or quality improvement? The case of diabetes, heart disease, and depression.

Authors:  Leif I Solberg; Karen I Engebretson; Joann M Sperl-Hillen; Mary C Hroscikoski; Patrick J O'Connor
Journal:  Am J Med Qual       Date:  2006 Jul-Aug       Impact factor: 1.852

7.  Measuring Preventive Care Delivery: Comparing Rates Across Three Data Sources.

Authors:  Steffani R Bailey; John D Heintzman; Miguel Marino; Megan J Hoopes; Brigit A Hatch; Rachel Gold; Stuart C Cowburn; Christine A Nelson; Heather E Angier; Jennifer E DeVoe
Journal:  Am J Prev Med       Date:  2016-08-10       Impact factor: 5.043

8.  Rates of hospitalizations and associated diagnoses in a large multisite cohort of HIV patients in the United States, 1994-2005.

Authors:  Kate Buchacz; Rose K Baker; Anne C Moorman; James T Richardson; Kathleen C Wood; Scott D Holmberg; John T Brooks
Journal:  AIDS       Date:  2008-07-11       Impact factor: 4.177

9.  Medicaid data as a resource for epidemiologic studies: strengths and limitations.

Authors:  R A Bright; J Avorn; D E Everitt
Journal:  J Clin Epidemiol       Date:  1989       Impact factor: 6.437

10.  Validation of case-finding algorithms derived from administrative data for identifying adults living with human immunodeficiency virus infection.

Authors:  Tony Antoniou; Brandon Zagorski; Mona R Loutfy; Carol Strike; Richard H Glazier
Journal:  PLoS One       Date:  2011-06-30       Impact factor: 3.240

View more
  3 in total

1.  PrEP Use and Adherence Among Transgender Patients.

Authors:  Jae Downing; Kimberly Yee; Jae M Sevelius
Journal:  AIDS Behav       Date:  2021-10-13

2.  Predictors of Medication Utilization for Opioid Use Disorder Among Medicaid-Insured HIV Patients in New York.

Authors:  Sugy Choi; Rajeev Yerneni; Shannon Healy; Mona Goyal; Charles J Neighbors
Journal:  Am J Addict       Date:  2020-01-17

3.  A Phenotyping Algorithm to Identify People With HIV in Electronic Health Record Data (HIV-Phen): Development and Evaluation Study.

Authors:  Sarah B May; Thomas P Giordano; Assaf Gottlieb
Journal:  JMIR Form Res       Date:  2021-11-25
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.