Literature DB >> 11481769

Using diagnoses to describe populations and predict costs.

A S Ash1, R P Ellis, G C Pope, J Z Ayanian, D W Bates, H Burstin, L I Iezzoni, E MacKay, W Yu.   

Abstract

The Diagnostic Cost Group Hierarchical Condition Category (DCG/HCC) payment models summarize the health care problems and predict the future health care costs of populations. These models use the diagnoses generated during patient encounters with the medical delivery system to infer which medical problems are present. Patient demographics and diagnostic profiles are, in turn, used to predict costs. We describe the logic, structure, coefficients and performance of DCG/HCC models, as developed and validated on three important data bases (privately insured, Medicaid, and Medicare) with more than 1 million people each.

Entities:  

Mesh:

Year:  2000        PMID: 11481769      PMCID: PMC4194673     

Source DB:  PubMed          Journal:  Health Care Financ Rev        ISSN: 0195-8631


Introduction

Role of Health-Based Payment Models

Since 1985, HCFA has made capitated payments to managed care organizations that enroll Medicare beneficiaries. HCFA, using a demographic risk adjuster to calculate payments equal to 95 percent of what health maintenance organization (HMO) enrollees “would have cost” had they remained in the traditional fee-for-service Medicare program, paid less-than-average dollars for the group who originally transferred into these programs. However, HCFA still appears (on average) to have overpaid, because the early switchers into Medicare managed care were healthier than comparably aged non-switchers (Brown et al., 1993). Anticipating and responding to this problem, HCFA has sponsored much research, including development of the Diagnostic Cost Group (DCG) models, with the goal of being able to better match HMO payments to the health care needs of enrollees. Since 1984, when researchers at Boston University and Brandeis initiated this work for HCFA, DCGs have evolved into a family of methods for using administrative data collected during patient encounters to calculate health-based “expected costs” for populations (Ash et al., 1986, 1989, 1998; Ellis and Ash, 1995; Ellis et al., 1996a, 1996b; Pope et al., 1998, 1999, 2000). DCG models use age, sex, and diagnoses generated from patient encounters with the medical delivery system to infer which medical problems are present for each individual and their likely effect on health care costs for a population. Some versions of the DCG models focus on diagnoses that form the principal reason for an inpatient admission, now called “PIP diagnoses” (Ash et al., 1989; Ellis and Ash, 1995; Pope et al., 2000). Other versions, such as the DCG/HCC models of this article, utilize the full range of diagnoses generated during all face-to-face encounters with clinicians (Ellis et al., 1996a, 1996b; Ash et al., 1998; Pope et al., 1998). Whereas previous publications using DCGs have calibrated models solely for Medicare samples, in this study, we contrast the ability of DCG/HCC models to predict resources in three different samples: privately insured, Medicaid, and Medicare. Payment methods establish incentives. For example, when payments follow a “piecework” model, as in traditional fee-for-service medicine, providers are rewarded for doing more—whether the additional utilization is valuable or not. Conversely, capitated payments encourage doing less—whether through efficiency or stinting. Further, flat-rate capitated payments introduce a new perverse incentive: to enroll healthy people and to do the very little required to keep them enrolled. Models that pay each person's expected cost eliminate the incentive to “select on risk” and make efficiency the main way for a plan to achieve a competitive advantage (Van de Ven and Ellis, 2000). Although risk-adjusted payment solves the problem of perverse patient-selection incentives, linking payments to a risk-adjustment model may lead plans to invest unproductive effort in making their enrollees “look needier” according to that model. For example, models that pay more for health care “users” encourage both appropriate and unnecessary utilization; those that identify illness only through hospitalizations encourage admissions, and those that pay more for people with more coded illnesses encourage “diagnostic discovery.” This last incentive can be good to the extent that it rewards plans that keep better track of their members' chronic illnesses (Greenwald et al., 1998). The degree of imperfection in incentive-setting is one criterion in choosing among payment models. Furthermore, how much imperfection is acceptable depends upon the nature and level of problems associated with available alternatives.

Predicting Costs in a Range of Populations

The original DCG models are prospective, that is, they use baseline, or year 1, data to infer the level of need for health care in year 2 and were developed to predict costs for Medicare beneficiaries. Medical conditions (diagnoses) detected in year 1 are used to organize people into groups with similar levels of future health care need. The distribution of all members by levels of future need characterizes an enrolled group and is used to determine a health-based payment. More recently, we have developed DCG models to calculate expected concurrent expenses, that is, expenses that occur in the same year as the diagnoses used to characterize the population (Pope et al., 1998, 1999, 2000). We have also adapted both prospective and concurrent modeling frameworks for use in Medicaid and commercially insured (private) populations under the age of 65 (Ash et al., 1998). Concurrent models may be particularly useful for provider profiling and monitoring, because knowing all the medical problems being treated during a period of time is particularly relevant for estimating the level of resources used to treat them. However, prospective models, which predict future costs, are more appropriate for creating payments to managed care organizations that assume financial risk, because they focus on the presence of illnesses, such as cancer and heart disease, that predictably make people more expensive to treat. In this article, we describe prospective models only, as they apply to three separate populations: a national sample of commercially insured enrollees under age 65, enrollees in Michigan's Medicaid program, and a national sample of Medicare beneficiaries. We refer to these three populations and the models that pertain to them as private, Medicaid, and Medicare. Continuing the tradition in which DCG models were originally developed, these models reflect concern for appropriate incentives in payments to health care plans and providers. All DCG/HCC models (regardless of the population or whether they are concurrent or prospective) rely on a common classification structure, which we describe later. Diversity across populations is handled by using different coefficients, different exclusions of potential predictors from payment models, and different constraints on coefficients across age or eligibility groups.

Model Criteria: Accuracy, Feasibility, and Incentives

The DCG models strive for accurate predictions in the face of limitations on the available data and concerns about incentives. The goal is to effectively predict costs from data that should be present in any health care delivery system, while limiting the rewards for undesirable behavior with respect to either treatment or reporting. Although our descriptive system does classify all recorded diagnoses in order to create a comprehensive picture of problems seen, concerns about incentives cause us to not model some information. For example, we do not use the number of hospitalizations to predict cost, so as to avoid disadvantaging medical care organizations that are good at treating sick people with fewer hospitalizations. Nor do we count how often a diagnosis appears. Conceptually, DCG models are designed to predict higher costs when they detect additional conditions associated with elevated costs. Based on clinical judgment and concerns about incentives, we exclude some condition categories (CCs) from contributing to predictions entirely. For example, the presence of chemotherapy is noted in the diagnostic codes, and, therefore we classify it into a CC (number 115); however, our prospective models do not pay more for it. Higher payments are based on the presence of a particular type of cancer, rather than a choice of therapy.

Methods

Populations and Data

We describe payment models for three populations whose types of health coverage span the major ways in which health care is provided in the United States today. Specifically, we use: A nationally dispersed, privately insured (indemnity-covered) population of 1.4 million people in 1992 and 1993 (the private data). One million individuals covered by Michigan's Medicaid program in 1991-1992 (Medicaid). Medicare's 5-percent research sample from 1991 and 1992. The outcome variable, total program costs in year 2, is defined as total covered expenses—an amount that includes copayments, deductibles, and third-party payments—in each data set. Costs for people with less than a full year of entitlement in year 2 are annualized, based on their observed cost per month; in analyses, we treat their data as “fractional observations” (Ellis and Ash, 1995). The three populations differ substantially with respect to age and sex distributions, health care costs, and hospital experience (Table 1). In each population, most of the data are used (in a development sample) to establish the model structure and to fit coefficients, while the rest of the data (the validation sample) are used for measuring model performance. Finally, regressions based upon all the data are used to produce the model coefficients in this article.
Table 1

Age, Sex, Hospital Experience, and Total Health Care Costs in Three Populations

Characteristic or StatisticPrivateMedicaidMedicare
Number1,379,9701,103,3671,360,626
Prediction Year199319921992
Percent by Age
0-17 Years26.751.40.0
18-44 Years44.940.03.2
45-64 Years28.48.75.8
65 Years or Over0.00.091.0
Percent Female by Age
0-17 Years51.350.90.0
18-44 Years44.929.436.1
45-64 Years46.640.039.4
65 Years or Over60.7
Total Prediction-Year Costs
Mean$1,592$1,430$3,778
Standard Deviation8,2365,40710,523
Coefficient of Variation517378279
Median85121516
99th Percentile25,47223,20857,423
Maximum2,412,7071,253,8801,533,060
Percent with Zero Prediction-Year Costs42.932.316.1
Percent Hospitalized in the Prediction Year4.88.421.2

For people with at least 1 month of eligibility in each of the baseline and prediction years.

SOURCE: (Ash et al., 1998; Pope et al., 1998.)

A fourth data set, consisting of 191,877 people under age 65 in a State employee benefit program (State data), is used to further validate the private model's ability to discriminate costs within important subsets of a new population, as described later.

DCG/HCC Models

The letters DCG/HCC are used to distinguish the multicondition Hierarchical Condition Category (HCC) models from the single-condition PIP-DCG model that HCFA is using to calculate payments to Medicare HMOs in the year 2000 (Ingber, 1998; Iezzoni et al., 1998; Health Care Financing Administration, 1999). Each DCG model is designed to use the diagnostic codes from the International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) (Public Health Service and Health Care Financing Administration, 1980) on the claims that hospitals and physicians submit to payers. (For a discussion of diagnostic coding issues, refer to Iezzoni, 1997.) Each DCG/HCC model uses the same CCs for prediction, all of which are based on diagnostic codes, rather than procedures. DCG models summarize a person's health from his or her CCs and estimate expected costs based on these profiles. Although DCG models reward medical-problem identification, not all CCs are or should be used to modify payments to plans. In designing DCG models, we have anticipated “DCG creep” (changes in diagnostic coding for the purpose of increasing DCG-based payments) by making the models less sensitive to expected changes. In particular, our models exclude some CCs and impose hierarchies to reduce the sensitivity of predicted costs to three things: (1) variations in coding practice; (2) intentional coding proliferation with the aim of improving provider reimbursement (gaming); and (3) inconsistent coding of less serious or vague conditions.

Diagnostic Groups (DxGROUPs)

With more than 15,000 codes, the distinctions created by ICD-9-CM are too fine to be used directly as a payment classification system. Therefore, we group ICD-9-CM codes into 543 categories, called “DxGROUPs,” which are the building blocks of DCG/HCC models. Each DxGROUP has a two-level numerical label and a short, clinically informative text name. All DxGROUPs with the same “whole number” stem are clinically related. For example, the “4.xy” series refers to infectious diseases, with 4.01 being bacterial enteritis, 4.02 viral enteritis, 4.03 other intestinal infections, 4.04 tuberculosis, and so on. Each recognized ICD-9-CM code maps to a unique DxGROUP; each DxGROUP encompasses diagnostic codes that describe very similar medical problems. We place in the same DxGROUP alternative codes that can be used for the medical conditions that clinicians generally think of together (such as congestive heart failure and cardiomyopathy or deep vein thrombosis and deep vein thrombosis in pregnancy) or codes for medical conditions that are not easily distinguished (such as chronic bronchitis and emphysema).

Condition Categories

DxGROUPs are clustered in a CC when they contain medically related problems with similar expected costs. We created the 118 diagnosis-based CCs used for modeling in each population using a mix of clinical judgment and empirical cost data. The core physician panel making these judgments consisted of four internists experienced in health services research. Specialist consultants assisted in several areas including pediatrics, HIV/AIDS (human immunodeficiency virus/acquired immunodeficiency syndrome), pediatric surgery, obstetrics, and neonatology. Although we sought to create CCs with at least 500 cases in our private sample of around 1 million people, that goal is subordinated to the objective of clinical homogeneity. For a few conditions (such as mental retardation, quadriplegia, and underweight neonates), we accept significantly smaller numbers. We eliminate logical inconsistencies in diagnostic coding that can be identified by comparing with age and sex. For example, we drop a diagnosis of uterine disorder in a male. However, we do not drop neonatal codes found in the records of non-infant females. When an infant dies shortly after birth, insurance companies sometimes do not create a separate eligibility record but rather assign the neonatal codes to the mother. Currently, two CCs are used to classify neonatal codes assigned to mothers. The CCs are organized in broad system groups (such as four CCs for infections, eight for neoplasms, and three each for diabetes and metabolic disorders). Short names (such as Infection1, Diabetes3) denote such CC groups; numbering within a short-name series generally indicates decreasing expected costs (e.g., Neoplasm1 contains metastatic cancers, Neoplasm2 contains high-cost site-specific cancers, Neoplasm3 has moderate-cost cancers, on down to Neoplasm8, benign neoplasms). Table A in the Technical Note shows for each CC, its number, long name, short name and the CCs that it donimates in the model hierarchy explained in the following section. A complete list of the individual DxGROUPs indicating their organization into CCs is available at www.dxcg.com under the heading “DCG Clinical Classification System in Detail.” (This information can also be obtained by contacting the lead author.)
Table A

Condition Category (CC) Numbers, Long Names, Short Names, and Hierarchies

CC NumberCC Long NameCC Short NameDominated CCs
1HIV/AIDSInfection1None
2Septicemia (Blood Poisoning)/ShockInfection2None
3Central Nervous System InfectionsInfection3None
4Other Infectious DiseaseInfection4None
5Metastatic CancerNeoplasm16, 7, 8, 9, 10, 11, 12
6High-Cost CancerNeoplasm27, 8, 9, 10, 11, 12
7Moderate-Cost CancerNeoplasm38, 9, 10, 11, 12
8Lower Cost Cancers/TumorsNeoplasm49, 10, 11, 12
9Carcinoma in SituNeoplasm510, 11, 12
10Uncertain NeoplasmNeoplasm611, 12
11Skin Cancer, Except MelanomaNeoplasm712
12Benign NeoplasmNeoplasm8None
13Diabetes with Chronic ComplicationsDiabetes114, 15
14Diabetes with Acute Complications/Non-Proliferative RetinopathyDiabetes215
15Diabetes with No or Unspecified ComplicationsDiabetes3None
16Protein-Calorie MalnutritionMetabolic1None
17Moderate-Cost Endocrine/Metabolic/Fluid-Electrolyte DisordersMetabolic2None
18Other Endocrine, Metabolic, Nutritional DisordersMetabolic3None
19Liver DiseaseLiverNone
20High-Cost Chronic Gastrointestinal DisordersGI122, 23
21High-Cost Acute Gastrointestinal DisordersGI222, 23
22Moderate-Cost Gastrointestinal DisordersGI323
23Lower Cost Gastrointestinal DisordersGI4None
24Bone/Joint Infections/NecrosisMSK1None
25Rheumatoid Arthritis and Connective Tissue DiseaseMSK226
26Other Musculoskeletal and Connective Tissue DisordersMSK3None
27Aplastic and Acquired Hemolytic AnemiasBlood128, 29
28Blood/Immune DisordersBlood229
29Iron Deficiency and Other/Unspecified AnemiasBlood3None
30DementiaDementiaNone
31Drug/Alcohol Dependence/PsychosesMental132, 33, 34, 35
32Psychosis and Other Higher Cost Mental DisordersMental233, 34, 35
33Depression and Other Moderate-Cost Mental DisordersMental334, 35
34Anxiety DisordersMental435
35Lower Cost Mental Disorders/Substance MisuseMental5None
36Profound Mental RetardationMR137, 38, 39
37Severe Mental RetardationMR238, 39
38Moderate Mental RetardationMR339
39Mild/Unspecified Mental RetardationMR4None
40QuadriplegiaNeuro141, 42, 43, 44
41ParaplegiaNeuro242, 43, 44
42Higher Cost Neurological DisordersNeuro343, 44
43Moderate-Cost Neurological DisordersNeuro444
44Lower Cost Neurological DisordersNeuro5None
45Respirator Dependence/Tracheostomy StatusArrest146, 47
46Respiratory ArrestArrest247
47Cardio-Respiratory Failure and ShockArrest3None
48Congestive Heart FailureHrt_CHF55, 56, 57
49Heart ArrhythmiaHrt_ARR55, 56, 57
50Acute Myocardial InfarctionHrt_AMI51, 52, 55, 56, 57
51Other Acute Ischemic Heart DiseaseHrt_CAD152, 55, 56, 57
52Chronic Ischemic Heart DiseaseHrt_CAD255, 56, 57
53Valvular and Rheumatic Heart DiseaseHrt_VHD55, 56, 57
54Hypertensive Heart DiseaseHrt_HTN55, 56, 57
55Other Heart DiagnosesHrt_Misc56
56Heart Rhythm and Conduction DisordersHrt_RhythmNone
57Hypertension (High Blood Pressure)HTNNone
58Higher Cost Cerebrovascular DiseaseStroke159
59Lower Cost Cerebrovascular DiseaseStroke2None
60High-Cost Vascular DiseaseVascular162, 63
61Thromboembolic Vascular DiseaseVascular262, 63
62Atherosclerosis/UnspecifiedVascular3None
63Other Circulatory DiseaseVascular4None
64Chronic Obstructive Pulmonary DiseaseLung170, 71
65Higher Cost PneumoniaLung266, 67, 69, 71
66Moderate-Cost PneumoniaLung367, 69, 71
67Lower Cost PneumoniaLung471
68Pulmonary Fibrosis and Other Chronic Lung DisordersLung570, 71
69Pleural Effusion/PneumothoraxLung671
70AsthmaLung771
71Other Lung DiseaseLung8None
72Higher Cost Eye DisordersEye173
73Lower Cost Eye DisordersEye2None
74Higher Cost Ear, Nose, and Throat DisordersENT175
75Lower Cost Ear, Nose, and Throat DisordersENT2None
76Dialysis StatusUrinary177, 78, 79, 80
77Kidney Transplant StatusUrinary278, 79, 80
78Renal FailureUrinary379, 80
79NephritisUrinary480
80Other Urinary System DisordersUrinary5None
81Female InfertilityGenital182, 83
82Moderate-Cost Genital DisordersGenital283
83Low-Cost Genital DisordersGenital3None
84Ectopic PregnancyPreg185, 89, 90
85Miscarriage/AbortionPreg289, 90
86Completed Pregnancy with Major ComplicationsPreg387, 88, 89, 90
87Completed Pregnancy with ComplicationsPreg488, 89, 90
88Completed Pregnancy Without Complications (Normal Delivery)Preg589, 90
89Uncompleted Pregnancy with ComplicationsPreg690
90Uncompleted Pregnancy with No or Minor ComplicationsPreg7None
91Chronic Ulcer of SkinSkin192
92Other Dermatological DisordersSkin2None
93Vertebral Fractures and Spinal Cord InjuriesInjury197
94Hip Fracture/DislocationInjury297
95Head InjuriesInjury397
96Drug Poisonings, Internal Injuries, Traumatic Amputations, BurnsInjury497
97Other Injuries and PoisoningsInjury5None
98Complications of CareComplicNone
99Major SymptomsSymptom1None
100Minor Symptoms, Signs, FindingsSymptom2None
101Very-High-Cost Pediatric DisordersPeds20, 22, 23, 28, 29, 43, 44, 68, 70, 71
102Higher Cost Congenital/Pediatric DisordersCong1104
103Moderate-Cost Congenital DisorderCong2104
104Lower Cost Congenital DisorderCong3None
105Extremely-Low-Birthweight NeonatesBaby1106, 107, 108, 109
106Very-Low-Birthweight NeonatesBaby2107, 108, 109
107Serious Perinatal Problem Affecting NewbornBaby3109
108Other Perinatal Problems Affecting NewbornBaby4109
109Normal, Single BirthBaby5None
110Heart, Lung, Liver Transplant StatusTransplant1None
111Other Organ Transplant/ReplacementTransplant2None
112Artificial Opening Status/AttentionOpeningsNone
113Elective/AftercareSurgeryNone
114Radiation TherapyRadiationNone
115ChemotherapyChemoNone
116RehabilitationRehabNone
117Screening/Observation/Special ExamsScreeningNone
118History of DiseaseHistoryNone

NOTES: HIV is human immunodeficiency virus. AIDS is acquired immunodeficiency syndrome.

SOURCE: (Ash et al., 1998.)

CC Hierarchies

A payment model should not be sensitive to every diagnostic code recorded because this will result in poorly specified coefficients and unstable estimates of the relative risk of populations. For example, a female who has metastatic cancer (CC 5) could also be coded with cancer in two or more specific body sites, such as the liver (CC 6) or connective and soft tissue (CC 7). She may also have been tested for other “uncertain” (CC 10) or “benign” cellular changes (CC 12). A regression model that separately assigns credit for each of these diagnoses will have confounded parameter estimates, because the costs of people with only the simpler problems get averaged in, or confounded, with costs for people with both simple and more consequential conditions. Also, such models reward most the plans that capture as many codes as can be legitimately defended in an audit—a behavior with little social value. To dampen these incentives, we use hierarchies to constrain CC assignment as follows: a person classified into a CC is not also classified into a lower ranked CC in the same hierarchy. An important feature of an HCC model is that the hierarchies are not imposed across unrelated medical problems. For example, for a female with both cancer and diabetes, hierarchies are used to retain only the “worst” evidence of each disease, but both cancer and diabetes CCs are used in predicting her costs next year. Hierarchies are identified for each CC in the rightmost column of Table A by indicating which CCs are dominated; dominated CCs are zeroed out for a person when a dominating CC is present. The CC hierarchies capture both chronic and serious acute manifestations of particular disease processes, as well as their seriousness in terms of expected costs. Some hierarchies, such as neoplasm, are simple; CC 5 dominates CC 6, which dominates CC 7, all the way down to CC 12. Other hierarchies, such as gastrointestinal, are more complex, as illustrated in Figure 1. A person may be classified with either, or both, acute and chronic high-cost gastrointestinal problems; however, if either of these is coded, information about moderate or lower cost GI disorders is ignored.
Figure 1

Sample of a Condition Category Hierarchy: Gastrointestinal (GI) Disorders

Clinically, hierarchies reduce the sensitivity of predicted payments to the coding of less serious manifestations of the same condition; statistically, they make explanatory variables more nearly orthogonal, increasing statistical precision. Imposing hierarchies typically increases the estimated coefficients and t-ratios of serious condition categories.

Excluded Condition Categories

We also exclude some CCs from the models entirely, by constraining their coefficients to be zero; the result is that the presence of that condition for an individual will not increase his or her predicted cost. Money that “disappears” from the prediction when a positive coefficient is constrained to zero is redistributed—generally reappearing as slight increments to demographic variables. Each model still accounts for the costs of treating all conditions. The most common reason for exclusion is the a priori medical judgment that a current problem triggering this CC this year should have little effect on next-year costs. Examples are (non-melanoma) skin cancers; benign cancers; lower cost ear, nose, and throat disorders; minor injuries; and screening (for example, presence of a routine checkup). A second reason for exclusion is that a CC does not add to expected costs (either its coefficient in our modeling sample is actually negative or it is not statistically significantly positive). Reassuringly, these are generally the same CCs clinically thought to have little effect on future costs. Excluding CCs that would subtract from the payment preserves the monotonic character of the model. To ensure that adding a code does not reduce predicted costs, each CC with a non-positive coefficient is excluded or constrained, even if it might seem that the CC should be in the model. A final reason for exclusion is concern over “gaming,” that is, a perverse health plan response to the incentives created by the model. Thus, the models do not pay for the often vague or discretionary conditions included in CCs such as moderate and other endocrine disorders (CCs 17 and 18), and lower cost mental disorders (CC 35). Such exclusions improve the models' attractiveness for setting payments, at the cost of some loss in accuracy.

Coefficient Constraints

Especially for conditions that are rare (such as mental retardation, ranging from mild to profound, in an employed population), unconstrained models can lead to higher payments for less serious conditions. Thus, in a few cases, we impose restrictions across sets of CCs, forcing predictions for conditions that are higher in a hierarchy to be at least as large as predictions for conditions that they dominate (as “profound mental retardation” dominates “mild mental retardation”). These restrictions avoid plans receiving higher payments for “downcoding.” We also do not modify some surprisingly low-cost coefficients that appear to be real artifacts of the coverage or delivery systems to which they apply, in the sense that they capture all costs covered by the program that collected the data but do not reflect expenditures from other sources. An example of this is the relatively low cost for people with renal failure in Medicaid because Medicare is likely to be the primary payer for most of the very high treatment costs for these people.

Clinical Refinements, Including Interactions with Age

The DCG classification system, originally focused on chronic conditions of the elderly, now handles distinctions for a full age and population spectrum. There are 21 DxGROUPs organized into 5 CCs for neonates (ages 0 to 1). Additional new CCs include four for the mentally retarded (common only in Medicaid), five for mental health and substance abuse, five for accidents and injuries, seven for pregnancy, and four for congenital and/or distinctly pediatric problems. Ultimately, a single comprehensive classification system, with 543 DxGROUPs organized into 118 CCs and a common set of imposed hierarchies, is used to profile the medical problems present for any person, regardless of age, sex, or type of insurance. However, the cost consequences of a given diagnostic profile can be affected by demographics. For example, some CCs are separately priced for pediatric populations (age under 18), in the private or Medicaid populations, when clinical judgment and empirical evidence find substantial differences in utilization by age (e.g., CC 70, asthma, adds $1,513 for adults and only $825 for children in the private data.) The Medicare model also recognizes age/medical interactions for a few conditions (such as HIV and aplastic anemia). For people with such conditions, certain costs are associated with it in elderly persons (those age 65 or over), but additional dollars are associated with costs of care among the disabled (younger persons whose Medicare entitlement derives from disability).

Demographic Variables

In a given year, healthy people, whether they are age 8 or 80, incur few medical expenses. However, average health care costs differ dramatically by age and somewhat by sex. Much of this is driven by differences in disease prevalence because, for example, most children 8 years of age are fully healthy, while most persons age 80 have one or more chronic conditions requiring medical attention. Some of the cost difference is attributable to differences in the nature of certain diseases (or how they are treated) in children, young adults, or seniors. Additionally, however, even among those with no medical problems this year, demographically defined subgroups, such as females of childbearing age, or the oldest old, have different average costs next year. In a prospective model, even after accounting for the medical problems present, the additional effects of age and sex on expected costs remain important. The three models (private, Medicaid, and Medicare) recognize three key age groups: 0-17 years, 18-64 years, 65 years or over, either by allowing some distinct CC coefficients for those under as opposed to over age 18 in the younger populations or by using distinct Medicare coefficients for those 65 or over. The private and Medicaid models contain 16 indicators that place people within same-sex, similar-age groups (ages 0-5, 6-12, 13-17, 18-24, 25-34, 35-44, 45-54, and 55-64). The Medicare model constrains coefficients among its disabled enrollees under age 65 to distinguish only ages 0-34 and 35-64; it then makes 5-year breaks between 65 and 94 years of age; the highest age category is 95 or over.

Eligibility Categories

In addition to age and sex categories, the Medicaid model incorporates nine additional variables that distinguish among five distinct groups of enrollees: (1) the blind and disabled (11 percent); (2) those eligible because of other medical problems (8 percent); (3) pregnant women (2 percent); (4) those with poverty-related entitlement (71 percent); and (5) others (9 percent). We assign each person to one of the categories based on reason for entitlement during his or her earliest month of enrollment in year 1. Observed annual expenditures per person in year 2 averaged $1,430 and differed substantially by category. The blind and disabled are by far the most expensive, at $5,585 annually in year 2. Pregnant women cost about twice the average ($2,708); the “other medical” and “other” groups are about average ($1,281 and $1,500, respectively) and the non-medical, poverty-related group, consisting mainly of children enrolled under Aid to Families with Dependent Children, cost about one-half the average ($731). Four of the new variables are indicators (yes-no variables) that distinguish Medicaid's other subpopulations from the least expensive, poverty-related subgroup. The remaining five additional demographic variables in the model are interactions of eligibility category and duration of (year 1) Medicaid enrollment. These variables allow the model to reflect the fact that recent entrants to the Medicaid program cost more than longer term “stayers,” and that the “premium” for recent entry varies not only by duration of enrollment but by eligibility type. These five variables each have the form: How all this works is best illustrated with examples, as shown in the following section.

Sample Calculations of Expected Costs

Each HCC model prediction is the sum of a demographic part and a health-status part. We illustrate this in Figure 2 for two privately insured females 58 years of age. The numbers here are private-model coefficients, shown in the first column of Table 2. Both patients' estimated costs begin with a demographic component of $1,730, which is the final prediction for any fully healthy, privately insured female between the ages of 55 and 64. Each of these patients, however, also has medical conditions with expected consequences for future health care costs.
Figure 2

Sample Information Used to Predict Next Year's Expenses for Privately Insured Patients

Table 2

Statistics for Private, Medicaid, and Medicare Prospective Payment Models

Statistic or VariablePrivateMedicaidMedicare
Number of Observations1,379,0231,103,3671,360,626
Prediction Year Mean Total Costs1,5931,4303,778
Number of Model Parameters10213696
R2 × 1009.421.18.8
Validated R2 × 1009.123.18.5
Standard Error7,8434,8029,963
Age/Sex GroupsModel Coefficients
Female
 0-5 Years295-6 1,324
 6-12 Years24101,324
 13-17 Years4792701,324
 18-24 Years6135601,324
 25-34 Years1,1873371,324
 35-44 Years1,1203451,155
 45-54 Years1,4014461,202
 55-64 Years1,7305371,698
 65-69 Years1,042
 70-74 Years1,318
 75-79 Years1,675
 80-84 Years1,962
 85-89 Years2,161
 90-94 Years2,258
 95 Years or Over1,897
Male
 0-5 Years31287 955
 6-12 Years271113955
 13-17 Years473334955
 18-24 Years37086955
 25-34 Years574132955
 35-44 Years778392904
 45-54 Years1,218571887
 55-64 Years2,1265261,403
 65-69 Years1,428
 70-74 Years1,743
 75-79 Years2,215
 80-84 Years2,426
 85-89 Years2,725
 90-94 Years3,027
 95 Years or Over2,980
Medicaid Eligibility Categories
Blind/Disabled1,449
Other Medical429
Poverty-Related476
Pregnant Women96
Other-263
Medicaid Amount Added per Missing Base Year Month for
Blind/Disabled179
Other Medical71
Poverty-Related56
Pregnant Women296
Other100
Condition Categories2
1Infection1HIV/AIDS22,5805,2841,076
2Infection2Septicemia (Blood Poisoning)/Shock8,6773,6633,253
3Infection3Central Nervous System Infections4,658760
5Neoplasm1Metastatic Cancer21,8846,3316,185
6Neoplasm2High-Cost Cancer11,9673,2783,905
7Neoplasm3Moderate-Cost Cancer5,8631,2882,128
8Neoplasm4Lower Cost Cancers/Tumors2,372550873
13Diabetes1Diabetes with Chronic Complications7,7263,6863,582
14Diabetes2Diabetes with Acute Complications/Non-Proliferative Retinopathy3,8062,3922,396
15Diabetes3Diabetes with No or Unspecified Complications1,9613691,147
16Metabolic1Protein-Calorie Malnutrition13,6395,0123,594
19LiverLiver Disease5,7004,0073,028
20GI1High-Cost Chronic Gastrointestinal Disorders4,3122,9441,336
21GI2High-Cost Acute Gastrointestinal Disorders2,0871,2131,329
22GI3Moderate-Cost Gastrointestinal Disorders1,432748730
24MSK1Bone/Joint Infections/Necrosis3,6533,5632,070
25MSK2Rheumatoid Arthritis and Connective Tissue Disease2,3808701,218
27Blood1Aplastic and Acquired Hemolytic Anemias9,8016,5624,035
28Blood2Blood/Immune Disorders4,2483,637709
30DementiaDementia4,8221,324438
31Mental1Drug/Alcohol Dependence/Psychoses3,5682,2231,122
32Mental2Psychosis and Other Higher Cost Mental Disorders3,0923,5991,288
33Mental3Depression and Other Moderate-Cost Mental Disorders2,171834540
34Mental4Anxiety Disorders1,788771511
36MR1Profound Mental Retardation 2,54422,370
37MR2Severe Mental Retardation2,54416,064
38MR3Moderate Mental Retardation2,54411,677
39MR4Mild/Unspecified Mental Retardation2,5445,508
40Neuro1Quadriplegia 12,5065,6325,686
41Neuro2Paraplegia12,5063,4675,788
42Neuro3Higher Cost Neurological Disorders3,9391,4521,851
43Neuro4Moderate-Cost Neurological Disorders1,9361,0371,261
45Arrest1Respirator Dependence/Tracheostomy Status41,46524,2479,117
46Arrest2Respiratory Arrest13,3963,5388,087
47Arrest3Cardio-Respiratory Failure and Shock3,4162,6732,809
48Hrt_CHFCongestive Heart Failure5,1142,7142,069
49Hrt_ARRHeart Arrhythmia1,872928670
50Hrt_AMIAcute Myocardial Infarction4,7233,7921,778
51Hrt_CAD1Other Acute Ischemic Heart Disease3,4421,6391,807
52Hrt_CAD2Chronic Ischemic Heart Disease2,871511883
53Hrt_VHDValvular and Rheumatic Heart Disease1,128741938
54Hrt_HTNHypertensive Heart Disease1,346436347
57HTNHypertension (High Blood Pressure)915312216
58Stroke1Higher Cost Cerebrovascular Disease3,9021,5231,919
59Stroke2Lower Cost Cerebrovascular Disease1,795645835
60Vascular1High-Cost Vascular Disease2,4861,4201,268
61Vascular2Thromboembolic Vascular Disease2,5052,3161,429
64Lung1Chronic Obstructive Pulmonary Disease2,6331,0341,669
65Lung2Higher Cost Pneumonia8,0923,4554,037
66Lung3Moderate-Cost Pneumonia3,4114921,229
68Lung5Pulmonary Fibrosis and Other Chronic Lung Disorders3,254936829
69Lung6Pleural Effusion/Pneumothorax2,2392,5061,456
70Lung7Asthma1,513409624
72Eye1Higher Cost Eye Disorders7831,110242
74ENT1Higher Cost Ear, Nose, and Throat Disorders685620147
76Urinary1Dialysis Status37,2873,6936,821
77Urinary2Kidney Transplant Status10,3332156,468
78Urinary3Renal Failure17,8345,7423,107
79Urinary4Nephritis1,0501,0261,627
81Genital1Female Infertility2,242455
82Genital2Moderate-Cost Genital Disorders88934589
84Preg1Ectopic Pregnancy1,957951
85Preg2Miscarriage/Abortion1,8921,064
86Preg3High-Cost Completed Pregnancy 572 262
87Preg4Moderate-Cost Completed Pregnancy572262
88Preg5Normal Delivery572262
89Preg6Higher Cost Pregnancy without Completion 4,060 1,674 1,634
90Preg7Lower Cost Pregnancy without Completion4,0601,6741,634
91Skin1Chronic Ulcer of Skin3,7562,4682,473
93Injury1Vertebral Fractures and Spinal Cord Injuries2,9925461,289
94Injury2Hip Fracture/Dislocation1,280463993
95Injury3Head Injuries76395428
96Injury4Drug Poisoning, Internal Injury, Traumatic Amputation, Burn1,5889321,256
98ComplicComplications of Care2,3691,380798
101PedsVery-High-Cost Pediatric Disorders5,9012,067
102Cong1Higher Cost Congenital/Pediatric Disorders4,9487102,081
103Cong2Moderate-Cost Congenital Disorder1,603355532
104Cong3Lower Cost Congenital Disorder829334348
105Baby1Extremely-Low-Birthweight Neonates 13,2381,852
106Baby2Very-Low-Birthweight Neonates13,2381,163
107Baby3Serious Perinatal Problem Affecting Newborn1,010323
108Baby4Other Perinatal Problem Affecting Newborn145 78
109Baby5Normal, Single Birth33278
110Transplant1Heart, Lung, Liver Transplant Status26,5765,3123,552
112OpeningsArtificial Opening Status/Attention5,5884,3172,696
Age-Interacted Condition Category3
AI-1Infection1HIV/AIDS8,735
AI-2Infection2Septicemia (Blood Poisoning)/Shock-2,615
AI-15Diabetes3Diabetes with No or Unspecified Complications-157
AI-20GI1High-Cost Chronic Gastrointestional Disorders-9244,241
AI-21GI2High-Cost Acute Gastrointestional Disorders1,406-313
AI-22GI3Moderate-Cost Gastrointestinal Disorders-1,044-460
AI-24MSK1Bone/Joint Infections/Necrosis-3,047
AI-25MSK2Rheumatoid Arthritis and Connective Tissue Disease-812
AI-27Blood1Aplastic and Acquired Hemolytic Anemias-4,8723,365
AI-28Blood2Blood/Immune Disorders-2,1082,019
AI-30DementiaDementia373
AI-31Mental1Drug/Alcohol Dependence/Psychoses-1,1353,315
AI-32Mental2Psychosis and Other Higher Cost Mental Disorders3463,8421,204
AI-33Mental3Depression and Other Moderate Cost Mental Disorders1,876
AI-36MR1Profound Mental Retardation-4,752
AI-37MR2Severe Mental Retardation-6,924
AI-38MR3Moderate Mental Retardation-5,056
AI-39MR4Mild/Unspecified Mental Retardation-1,717
AI-42Neuro3Higher Cost Neurological Disorders1,377
AI-43Neuro4Moderate-Cost Neurological Disorders-929-224
AI-58Stroke1Higher Cost Cerebrovascular Disease-1,450
AI-59Stroke2Lower Cost Cerebrovascular Disease1,417
AI-64Lung1Chronic Obstructive Pulmony Disease-1,904-734
AI-65Lung2Higher Cost Pneumonia365
AI-70Lung7Asthma-688
AI-82Genital2Moderate-Cost Genital Disorders348364
AI-88Preg5Normal Delivery395
AI-90Preg7Lower Cost Pregnancy without Completion472
AI-94Injury2Hip Fracture/Dislocation245
AI-96Injury4Drug Poisoning, Internal Injury, Traumatic Amputation, Burn-1,336-554
AI-98ComplicComplications of Care-710
AI-102Cong1Higher Cost Congenital/Pediatric Disorders2,757
AI-103Cong2Moderate-Cost Congenital Disorder1,383911

Indicates a coefficient constrained to zero.

Indicates a variable that is not relevant for a particular model.

The Medicare model combines age/sex categories 0-34 years for each of females and males.

Lines for CCs that are zeroed out in all three prospective models are not listed in this table.

Values are increments or decrements for younger persons in this CC (under 18 for private and Medicaid; under 65 for Medicare) after receiving the basic CC payment coefficient listed in this table.

NOTES: Coefficients joined by a brace are constrained to be the same. CC is condition category. HIV is human immunodeficiency virus. AIDS is acquired immunodeficiency syndrome.

SOURCE: (Ash et al., 1998; Pope et al., 1998.)

Figure 2 shows how the model organizes each patient's ICD-9-CM data into a clinical profile that leads to the health-status part of her prediction. For patient 1, her breast cancer diagnosis adds $2,372; her hypertension, a distinct medical problem, adds another $915, for a total of $5,017. Patient 2 has breast cancer, too, but her cancer has metastasized and is coded at multiple sites (lung, liver, and bone). Note the different ways that additional information about cancer is reflected in the classification: in one, distinct but related diagnoses are classified into the same DxGROUP; in another, related DxGROUPs are classified into the same CC; in a third, one CC is ranked higher than another. In the end, only a single payment amount ($21,884) is calculated for metastatic cancer; any additional codes pertaining to benign or malignant neoplasms are ignored. Another example clarifies how the Medicaid demographic/eligibility variables work. This time the numbers are drawn from the Medicaid column of Table 2. We compute the predicted cost for a female age 20 with no medical problems and a full year 1 of poverty-related Medicaid by adding $560 (the “female, age 18-24” base amount) to $476 (poverty-related eligibility) for a total of $1,036. If the female had been present for only 10 months in year 1, we add another $112, that is, $56 for each of the two missing year-1 months, for a total of $1,148. If she were present for only 2 months in year 1, we would add 10×56 to $1,036, for a total of $1,596. In contrast, consider a female of the same age and present for 10 months in year 1 but who is eligible for Medicaid because of disability rather than poverty. We add three numbers to arrive at the demographic part of this female's prediction: $560 for age and sex, $1,449 for disability entitlement, and $179×2 for her two missing year-1 months as a disability-entitled person. The demographic part of this female's expected cost next year is then $2,367; in computing her total expected costs, dollars for the future cost implications of her year-1 medical conditions are added to $2,367. We include one final example to illustrate how health-status information can interact with age. Consider the payment for a Medicare-entitled male 66 years of age, under treatment for drug dependence (CC 31) but with no other recorded illness. His predicted cost is $2,540, computed as the sum of $1,428 for the demographic part (the same for all males between ages 65 and 69) and a $1,122 contribution for CC 31. Consider, however, a second male, also drug-dependent, but only 30 years of age and entitled to Medicare through disability. Here, there is a $5,392 total prediction, the sum of a $955 demographic part (the same for any male under age 35) and $4,437 for drug dependence. The latter number is computed by adding a $3,315 age-interaction for a Medicare enrollee under age 65 in CC 31 to the $1,122 basic payment for any Medicare enrollee in CC 31. The number $3,315 is in the last column of Table 2 in the row labeled AI-31; drug problems cost, on average, $3,315 more to treat in younger (disabled) Medicare enrollees than in the elderly.

Models

Table 2 shows the complete detail (summary statistics and all coefficients for all variables) for the three DCG/HCC models. The models are distinguished in several ways by: (1) which CCs are excluded, (2) which coefficients are constrained, (3) which demographic variables and demographic-medical interactions are included, and (4) what the model coefficients are. We discuss each of these in turn. Exclusions, which result in coefficients being set to zero, were made for reasons previously described. The lines for the 33 CCs that are excluded from all three models are omitted from Table 2. Exclusions used in specific models appear in Table 2 as omitted coefficients (†). The private model has no model-specific exclusions, Medicaid has one (CC3 central nervous system infections) and Medicare has 16, most of them related to maternity, neonatal and pediatric conditions that are extremely rare in Medicare's predominantly elderly population. We indicate coefficients that are constrained to be equal by connecting them with a brace. For example, because only 165 people were classified in the 4 mental retardation categories (CCs 36 through 39) in the private model, these 4 coefficients are constrained to a common value of $2,544. The three models differ in the number of constraints imposed across sets of CC coefficients, with the private model employing the most (five) and the Medicare model, the least (one). A third difference is in the variables included in addition to the age/sex and CC predictors that characterize prospective DCG/HCC models. The Medicaid model has the most: including eligibility categories, missing-months variables, and 31 coefficients for selected age-medical interactions (labeled as AI-2, AI-15, and so on, where the AI number indicates an associated condition category). The private model includes 9 AI variables and the Medicare model, 6. The AI coefficients shown at the end of Table 2 are the increments (decrements, for negative numbers) to the basic CC payments for a younger person with those particular medical problems. “Younger” means under age 65 in Medicare and under age 18 in the other two populations. Finally, the models differ in the values of their coefficients. A striking feature of Table 2 is the similarity between the CC coefficients in Medicare and Medicaid, estimated to within 20-30 percent for about one-half of the categories; also, for any particular CC, the larger coefficient is about equally likely to be found in either model. Thus, even though average costs are much higher in Medicare than in Medicaid, the incremental costs of treating particular conditions do not differ systematically. Although one source of higher expected costs next year in Medicare is larger age/sex coefficients, the more important explanation is greater disease prevalence. For example, 1.3 percent of the Medicare population has metastatic cancer (CC 5) but only 0.2 percent of the Medicaid population; for chronic complications of diabetes (CC 13), the rates are 1.6 percent versus 0.2 percent; for congestive heart failure (CC 48), 9.8 percent versus 1.1 percent; for acute myocardial infarction (CC 50), 4.2 percent versus only 8 in 10,000. Medicaid and Medicare coefficients, although similar to each other, are almost always much smaller than coefficients in the private model. Typically, they are not even one-half as large as the private model coefficients. For only a handful of CCs, the Medicaid coefficient exceeds the private model coefficient: CC 32—depression and other moderate-cost mental illness; the four mental retardation CCs—36 through 39; CC 69—pleural effusion/pneumothorax; and CC 72—high-cost eye disorders. In only one instance, CC 79—nephritis, is the Medicare coefficient greater than the private one. We have no explanation for this unusual finding. It is encouraging that the private and Medicaid models are similar in terms of the age-interacted coefficients estimated for the pediatric conditions. Of the eight AI parameters present in both models, seven are of the same sign. Most of the pediatric coefficients, which were identified in the development samples, remain highly significant in these full-data re-estimated models. In considering the plausibility of particular model coefficients, we note that each coefficient for a CC reflects the increment to expected costs that is independently associated with having the condition. An HIV-positive male's prediction, for example, is the sum of the CC 1 coefficient, all coefficients associated with his other medical problems, and any relevant demographic coefficients. If this male has multiple medical problems, his predicted total costs will be much larger than the coefficient for CC 1 alone. This feature is an important strength of the DCG/HCC multiple-condition model structure (in contrast to single-condition models, such as PIP-DCG), because, in fact, people who are HIV-positive differ widely in the range of medical problems they experience and how expensive they are to treat. This model does not simply pay more for HIV but rather establishes appropriately different payment amounts within the community of people living with HIV by recognizing comorbid conditions.

Measuring Model Performance

Because implementing a risk-adjustment model has serious consequences, we must understand how well the models work. The one universally reported, single-number summary performance measure for risk-adjustment payment models is the R2, or the proportion of variance in costs that the model explains. For reference, demographic payment models in private and Medicare populations have R2 values of less than 2 percent, and the R2 for a demographic/eligibility model in our Medicaid data is 7 percent (Greenwald et al., 1998; Ash et al., 1998; Pope et al., 1998). Our Medicaid model has the highest explanatory power, with a validated R2 of more than 20 percent, compared with 8 to 9 percent in the other two populations (refer to the fifth row of Table 2). The better fit in Medicaid is attributable to several factors. For one, the distribution of the outcome variable, cost, has a less extreme upper tail (virtually no million-dollar cases) in Medicaid. Additionally, many people with Medicaid coverage are eligible for medical reasons (such as pregnancy or disability), and expenditures within medically defined groups are more predictable than among populations with many non-users (Kronick et al., 1996). Medicaid eligibility categories also distinguish groups (such as children in poor families) that have predictably lower medical costs because they are basically healthy. Finally, the “months out” variables capture the higher expected costs of recent entrants, an important factor in a system with sporadic entitlement. All three prospective DCG/HCC models rely upon age and sex in addition to diagnostic information, and costs in these populations do differ substantially by age. For example, in the Medicaid and private samples, annual costs are each about $3,500 more for males age 60 than for females age 5; in Medicare, there is a similar difference in annual costs for males age 90 versus females age 65. However, after accounting for differences in the prevalence of medical problems, the demographic coefficients in our models differentiate less. (The disease-adjusted differences are about $1,400 for males age 60 than for females age 5 among the privately insured, about $500 for a similar demographic difference in Medicaid, and $2,000 for males age 90 versus females age 65 in Medicare.) Although age and sex coefficients remain highly statistically significant in each model, information about the presence of serious, chronic disease groups, such as diabetes and renal insufficiency, is far more useful for predicting costs.

Average Costs for Important Subgroups

Although R2 values are always reported, other ways of examining model performance may be more useful in assessing the value of a payment model (Ash and Byrne-Logan, 1998). We use some of these to examine the private DCG/HCC model's performance in a fourth, entirely new data set (a State employee health insurance plan). The methodology is to compare predicted versus actual year-2 average costs within significant subgroups. A predictive ratio (PR) for a model applied to a subgroup of people is formed by dividing the model-predicted costs for the group by their actual costs. Thus, for example, when an age/sex model is used to predict costs for a group of sick people, the PR is likely to be much less than 1.00. Alternatively, when people are identified retrospectively as a group whose costs turned out to be very low, PRs for any prospective model will be much larger than 1.00. Prospective models should never predict zero costs, because no one has zero expected future health care costs. Figure 3 shows PRs for several clinically defined groups of people in the State data, as predicted by the private DCG/HCC model and by an age/sex model. The medical condition groups were defined by an outside panel convened by HCFA, and membership in each group is contingent upon the presence (during year 1) of at least one panel-specified ICD-9-CM code. Although the age/sex prediction is never more than one-half the actual costs for any of these groups (all PRs are 0.50 or less), the DCG prediction is commonly between 0.95 and 1.05. The DCG model underpredicts most seriously in arthritis, where nearly 4,000 people predicted to cost around $4,300 actually cost nearly $5,800 (PR = 0.74). This is because the panel-identified arthritis subgroup includes anyone with any arthritis code regardless of its specificity, but the DCG model identifies only a smaller, sicker subgroup. The model does pay $2,357 for the presence of a well-defined, systemic rheumatoid disease, such as rheumatoid arthritis (ICD-9-CM 714); however, it does not add dollars for vague codes, such as ICD-9-CM 713 (other arthropathy, joint disorders, derangements, joint pain/stiffness). When a model excludes payment for vague codes associated with real costs, it becomes less accurate; in particular, this model underpays for people with low-level or non-specific joint disorders, even though these disorders can result in significant disability.
Figure 3

Predictive Ratios for the Private Validation Sample, by Presence of Medical Condition

In another illustration of the predictive value of DCG/HCC models, we divide the private validation sample into 18 groups based on predicted cost levels specified by the DCG/HCC model. The healthiest group, with predicted costs between $250 and $500, contains 21,650 people, or 11.3 percent of the population. (The model does not predict costs of less than $250 for anyone.) The next group, with predicted costs of at least $500 but less than $750, contains another 21.6 percent of people. At the other end of the spectrum, the model predicts costs of $5,000 or more for 5.6 percent of people; among these, just 74 (4/100 of 1 percent) fall into our highest cost prediction group ($40,000 and over). Within each group, we calculate mean actual costs, as well as the means for DCG/HCC-predicted costs and age/sex predicted costs. At the high end, for those with predicted costs over $5,000, the DCG/HCC-predicted amounts track actual costs quite well (meaning that PRs within these groups are not far from 1.00), while the age/sex predicted costs plateau at about $3,300. Figure 4, in which average actual costs, age-sex predicted costs, and DCG-predicted costs are plotted for people in each of these 18 prediction groups, illustrates these points. The data for Figure 4 are in Table 3.
Figure 4

Means of Actual and Predicted Costs for the Private Validation Sample, by DCG-Prediction Group

Table 3

Means of Actual and Predicted Cost for the Private Validation Sample, by DCG-Prediction Group

Predicted Cost Group1Actual CostsDCG-Predicted CostsAge/Sex Predicted CostsCounts
Less than $2500
250$510$417$57021,650
50067262085541,384
7509318671,26222,649
1,0001,3911,3471,91528,782
1,5001,7071,7142,33530,786
2,0002,2952,2423,14016,100
2,5002,5102,7793,1955,828
3,0003,3733,4063,3329,086
4,0003,9934,4512,6954,944
5,0004,7345,4852,8593,474
6,0006,4786,6242,9512,747
7,5008,0258,5573,0121,980
10,00011,41511,9393,0061,314
15,00015,74117,0423,217441
20,00020,42622,3772,853257
25,00031,80427,1812,929227
30,00040,55934,0873,010154
40,00061,38052,0262,92674

Each predicted cost group contains all people whose DCG-predicted dollar cost are at least this great but less than the next higher number.

NOTES: DCG is Diagnostic Cost Group. n = 191,877.

SOURCE: (Ash and Byrne-Logan, 1998.)

In summary, the private model, which was built on a large national data set, predicts costs well within a new population of State employees. It not only distinguishes groups of high- and low-cost individuals but also even identifies a high-cost tail, with small numbers of very expensive people. The Medicaid and Medicare DCG/HCC models work similarly well (and demographic-only models, similarly poorly) in analogous comparisons of actual and predicted costs in out-of-sample validation data sets (Ash et al., 1998; Pope et al., 1998).

Conclusion

We have extracted disease profiles of individual patients and groups of patients from the kinds of administrative records that many providers have been supplying to health care payers for years. Until now, few plans have used these data to construct a solid “information backbone” for managing care. The unified, multiple-condition DCG modeling framework characterizes individual health status and the disease burden of populations, as well as predicting future levels of resource need. When comparing physicians' practices, patient profiles can be aggregated to describe the various mixes of medical problems that providers handle, at the same time that the model's predictions can help establish fair (risk-adjusted) resource allocations. Although the original purpose of these models was to enable health care purchasers, such as HCFA, to identify an efficient capitation price, the models actually provide detailed information on the prevalence of disease. Such information helps explain why some providers and plans use more-than-average resources. The DCG/HCC health profiles and the model predictions can be used together to routinely identify patients who are likely to be very costly and to find the particular medical problems that contribute to this expectation. Such information is invaluable for identifying opportunities for selecting, implementing, and evaluating the effectiveness of disease management programs.
  9 in total

1.  The current state of risk adjustment technology for capitation.

Authors:  M J Ingber
Journal:  J Ambul Care Manage       Date:  1998-10

2.  Risk Adjustment for the Medicare program: lessons learned from research and demonstrations.

Authors:  L M Greenwald; A Esposito; M J Ingber; J M Levy
Journal:  Inquiry       Date:  1998       Impact factor: 1.730

3.  Paying more fairly for Medicare capitated care.

Authors:  L I Iezzoni; J Z Ayanian; D W Bates; H R Burstin
Journal:  N Engl J Med       Date:  1998-12-24       Impact factor: 91.245

4.  Do health maintenance organizations work for Medicare?

Authors:  R S Brown; D G Clement; J W Hill; S M Retchin; J W Bergeron
Journal:  Health Care Financ Rev       Date:  1993

5.  Principal inpatient diagnostic cost group model for Medicare risk adjustment.

Authors:  G C Pope; R P Ellis; A S Ash; C F Liu; J Z Ayanian; D W Bates; H Burstin; L I Iezzoni; M J Ingber
Journal:  Health Care Financ Rev       Date:  2000

6.  Diagnostic risk adjustment for Medicaid: the disability payment system.

Authors:  R Kronick; T Dreyfus; L Lee; Z Zhou
Journal:  Health Care Financ Rev       Date:  1996

7.  Diagnosis-based risk adjustment for Medicare capitation payments.

Authors:  R P Ellis; G C Pope; L Iezzoni; J Z Ayanian; D W Bates; H Burstin; A S Ash
Journal:  Health Care Financ Rev       Date:  1996

8.  Adjusting Medicare capitation payments using prior hospitalization data.

Authors:  A Ash; F Porell; L Gruenberg; E Sawitz; A Beiser
Journal:  Health Care Financ Rev       Date:  1989
  9 in total
  104 in total

1.  Risk selection in the Massachusetts State employee health insurance program.

Authors:  W Yu; R P Ellis; A Ash
Journal:  Health Care Manag Sci       Date:  2001-12

2.  Longitudinal patterns in survival, comorbidity, healthcare utilization and quality of care among older women following breast cancer diagnosis.

Authors:  Amresh D Hanchate; Kerri M Clough-Gorr; Arlene S Ash; Soe Soe Thwin; Rebecca A Silliman
Journal:  J Gen Intern Med       Date:  2010-06-08       Impact factor: 5.128

3.  Disease burden profiles: an emerging tool for managing managed care.

Authors:  Yang Zhao; Arlene S Ash; Randall P Ellis; James P Slaughter
Journal:  Health Care Manag Sci       Date:  2002-08

4.  The risk-adjusted vision beyond casemix (DRG) funding in Australia. International lessons in high complexity and capitation.

Authors:  Kathryn M Antioch; Michael K Walsh
Journal:  Eur J Health Econ       Date:  2004-06

5.  Diagnostic cost groups (DCGs) and concurrent utilization among patients with substance abuse disorders.

Authors:  Amy K Rosen; Susan A Loveland; Jennifer J Anderson; Cheryl S Hankin; James N Breckenridge; Dan R Berlowitz
Journal:  Health Serv Res       Date:  2002-08       Impact factor: 3.402

6.  The impact of exclusion criteria on a physician's adenoma detection rate.

Authors:  Felippe O Marcondes; Katie M Dean; Robert E Schoen; Daniel A Leffler; Sherri Rose; Michele Morris; Ateev Mehrotra
Journal:  Gastrointest Endosc       Date:  2015-10       Impact factor: 9.427

7.  Case-mix adjusting performance measures in a veteran population: pharmacy- and diagnosis-based approaches.

Authors:  Chuan-Fen Liu; Anne E Sales; Nancy D Sharp; Paul Fishman; Kevin L Sloan; Jeff Todd-Stenberg; W Paul Nichol; Amy K Rosen; Susan Loveland
Journal:  Health Serv Res       Date:  2003-10       Impact factor: 3.402

8.  Using claims data to examine mortality trends following hospitalization for heart attack in Medicare.

Authors:  Arlene S Ash; Michael A Posner; Jeanne Speckman; Shakira Franco; Andrew C Yacht; Lindsey Bramwell
Journal:  Health Serv Res       Date:  2003-10       Impact factor: 3.402

9.  Colon Cancer Staging in Vulnerable Older Adults: Adherence to National Guidelines and Impact on Survival.

Authors:  T B Leal; T Holden; L Cavalcante; G O Allen; J R Schumacher; M A Smith; J M Weiss; H B Neuman; N K LoConte
Journal:  Ann Hematol Oncol       Date:  2014

10.  Validation of an algorithm for categorizing the severity of hospital emergency department visits.

Authors:  Dustin W Ballard; Mary Price; Vicki Fung; Richard Brand; Mary E Reed; Bruce Fireman; Joseph P Newhouse; Joseph V Selby; John Hsu
Journal:  Med Care       Date:  2010-01       Impact factor: 2.983

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.