Literature DB >> 29888093

Trend Analysis of Aggregate Outcomes in Complex Health Survey Data.

Wesley M Durand¹, Paul C Stey¹, Elizabeth S Chen¹, Indra Neil Sarkar¹.

Abstract

Public health and clinical practice pattern trends are often analyzed using complex survey data. Use of statistical approaches that do not account for survey design predisposes to error, potentially leading to resource misdirection and inefficiency. This study examined two techniques for analyzing trends in complex survey data: (1) design-corrected logistic regression and (2) jackknife re-weighted linear regression. These approaches were compared toweighted least squares regression, as well as non-design corrected techniques. Data were obtained from NEISS, a complex survey of emergency departments that can be weighted to produce national estimates of injury occurrence. Trends were analyzed in rug-related injuries among male versus female patients ≥65 years of age. All design-corrected techniques performed comparably in assessment of trend within sex-based subgroups. In almost all cases, design-corrected approaches contrasted profoundly with standard statistical techniques. Future analyses may employ these design-corrected approaches to appropriately account for estimate variance in complex survey data.

Entities: Chemical Disease Gene Species

Year: 2018 PMID： 29888093 PMCID： PMC5961819

Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc

Introduction

Analysis of complex survey data often requires specialized statistical software or packages[1, 2]. Such procedures utilize survey design parameters to calculate variance associated with estimates from these data (i.e., design-corrected analysis). Use of standard statistical procedures assuming simple random sampling to analyze survey data predisposes to error[3]. Studies of public health trends and clinical practice patterns often utilize complex survey data. Accordingly, analyses utilizing non-design-corrected statistical techniques risk misguided conclusions, potentially leading to resource misdirection and inefficiency. The National Health and Nutrition Examination Survey (NHANES; CDC), National Inpatient Sample (NIS; HCUP, AHRQ), and National Electronic Injury Surveillance System (NEISS; CPSC, CDC) are complex surveys commonly employed in biomedical and public health research[4,5]. The longitudinal nature of these datasets is particularly usefulfor evaluating trends in event occurrence over time. In these analyses, the outcome is often an aggregatewithin levels of the variable over which trend analysis is desired (e.g., number of liver transplants per year, or aggregate cost of coronary artery bypass graft procedures per year). The requirement to utilize survey design-corrected procedures, however, limits the array of tools available to researchers. This issue is compounded by significant inter-software variability in available procedures and functionality. Accordingly, when examining aggregate outcome trends, many studies in the biomedical literature have either utilized non-design-corrected statistical approaches (e.g., standard linear regression or Cochran Armitage trend tests), refrained from statistical trend analysis, or circumvented direct trend analysis through a variety of methods[6-25]. Weighted least-squares regression is a technique for regression analysis of survey data, as published by Gillum, Graves, and Jean in 1996. While widened availability of survey analysis software has reduced use of this technique for continuous, categorical, and count outcomes, select researchers have applied it to trend assessment of aggregate outcomes in recent years[26-31]. This approach, however, has several disadvantages. It assumes that standard errors are derived from a sufficiently large sample such that they are without sampling error;[26] this may not always be the case. The magnitude of the regression estimate under weighted least squares may differ from that observed without weighting. Finally, the approach may require ad hoc calculation, complicating “out of the box” usage, particularly when analyzing variable interactions. This study aimed to develop and evaluate approaches for trend assessment of aggregate outcomes in complex survey data that can be easily applied across a broad array of statistical software. Here, the use of two methods is described: (1) design-corrected logistic regression and (2) jackknife re-weighted linear regression. These methods are compared to an existing design-corrected method (weighted least squares regression) as well as non-design-corrected methods (standard linear regression and the Cochran Armitage trend test). The impact of using these techniques is illustrated with data from NEISS, analyzing trends in rug-related injuries among patients ≥65 years of age.

Methods

Data Source: Full-year NEISS data (1999-2015) were abstracted from the Consumer Product Safety Commission query builder (https://www.cpsc.gov/cgibin/NEISSQuery/home.aspx). The NEISS is a stratified sample of ~100 hospital emergency departments (EDs) that can be weighted to produce national estimates of product-related injuries. EDs are divided into strata based on hospital size. Year was treated as an additional stratification variable, such that size-based strata were temporally separated; this replicates the NEISS’s annual sampling approach. Patient Selection: Records were included based on presence of: a product code for “rugs” (codes 0612, 0676, 0613); patient age 㩥 and diagnosis code for fracture (code = 57). Variables: The primary dependent variable in this analysis was rug-related fracture occurrence among elderly patients. Independent variables were sex and year.

Statistical Analysis:

Design-Corrected: Logistic Regression: The design-corrected logistic regression approach utilized a scheme for dataset preparation to treat an aggregated outcome as binary. The overall analysis approach is detailed in Figure 1.

Figure 1.

Dataset Creation Strategy for Design-Corrected Logistic Regression.

First, a dataset containing dummy records for each primary sampling unit (PSU) by year was created and merged with isolated patient records, ensuring that all PSUs were represented in the analysis. Patient cases were assigned “cohort=1” and dummy PSU records were assigned “cohort=0” for domain analysis. Note that this step would be required in any analysis seeking to correctly calculate estimate variance, as not all PSUs may have contributed patient records to the analysis. Subsequently, a denominator dataset was created containing observations for each combination of variables over which trend analysis would occur; in our case, all year*sex variable combinations. These observations were assigned an arbitrary but consistent stratum (stratum = P) and PSU (psu = 999) values. By placing these observations in “lonely PSUs”, estimates from these observations do not add to the overall variance of the sample[1]. Observations were weighted as the difference between an arbitrary but consistent large number (in our case, K = 1,000,000) and the actual observed number of injuries for that year*sex combination. Importantly, patient cases were assigned “event=1” and denominator dataset observations were assigned “event=0”. Denominator dataset observations were also assigned “cohort=1”. Design-corrected logistic regression was accomplished with the survey R package, utilizing “event” as the binary dependent variable and sex (reference = male), year, and an interaction term as independent variables, as illustrated in Equation 1[1]. A domain analysis was performed for observations with “cohort=1”. A custom linear hypothesis test was requested to determine the effect of year among females. where annual change among males is represented as βyear,males = β2, the difference inannual change among females versus males as βyear,Δ = β3, and the annual change among females as the linear hypothesis test of βyear,females = β2 + β3 Accordingly, the modeled absolute number of events can be expressed as a function of the probability of event occurrence, calculated according to Equation 2: where logit(sex,year) is calculated according to Equation 1, and K is the large number used in weighting denominator dataset observations Design-Corrected: Jackknife Re-weighted Linear Regression: In contrast to the design-corrected logistic regression approach, the jackknife re-weighted linear regression method involved primarily computational steps. The unique hospital dataset and rug-related injury datasets were combined for jackknife re-weighted analysis (the denominator weight dataset was excluded). The jackknife approach to complex survey analysis is available as an alternative to Taylor Series linearization in many statistical packages, though, to our knowledge, there is no literature illustrating use in analyzing aggregate outcomes. In this approach, the analysis is repeated for R replicates, where R is equal to the number of PSUs in the sample. For each replicate r, a sequential PSU is selected for deletion from the sample, and observations from the remaining PSUs within the donor stratum (i.e., the stratum of the deleted PSU) are re-weighted according to Equation 3a. A standard linear regression for number of events by year, sex, and their interaction was conducted for each replicate, defined in Equation 4. Estimate magnitude was calculated as the arithmetic mean of replicate estimates, and variance was computed according to Equation 3b. All calculations for the jackknife re-weighted linear regression approach were conducted in Julia 0.6.0, using the RCall.jl package for regression and linear hypothesis testing. where w is the weight for the j member of the i PSU, and n is the number of PSUs in the donor stratum h for replicate r = 1, 2, 3… R, where β are the estimated regression coefficients from the full sample for β, and β, is the regression coefficient from the r replicate where annual change among males is represented as βyear,males = β2, the difference in annual change among females versus males as βyear,Δ = β3, and the annual change among females as the linear hypothesis test of βyear,females = β2 + β3 Design-Corrected: Weighted Least Squares Regression: Estimates and standard errors were computed using the survey package for all year*sex combinations. As described by Gillum, Graves, and Jean, regression coefficients and standard errors were calculated according to Equations 5a and 5b. Weighted least squares regression was implemented in Julia 0.6.0. where, For inter-group analysis of trend, the difference between the two groups (i.e., number of injuries among females minus number of injuries among males) was modeled as a function of year. Regression estimate and standard error for this value were calculated as in Equation 5a and 5b, with pooled standard deviation determined according to Equation 6: where, s are the standard deviations to be pooled, and n are the sample sizes Standard Analysis: Linear Regression: For comparison, standard linear regression was performed (regression equation identical to Equation 4) for number of events as the dependent variable, with sex, year, and their interaction term as independent variables. Linear regression was implemented in Julia 0.6.0, with use of the RCall.jl package for linear hypothesis testing. Standard Analysis: Cochran Armitage Trend Test: The Cochran Armitage trend test was implemented, comparing linear trend among male versus female injuries without correction for survey design. This approach was implemented in Julia 0.6.0 using the RCall.jl package and the DescTools R package. Software: Dataset cleaning and preparation were accomplished using SAS 9.4 (SAS Institute, Cary, NC) and Julia 0.6.0. Analysis was accomplished as detailed above. Plots of predicted versus observed injury counts were produced in Julia with the PlotlyJS.jl package.

Results

Descriptive Statistics: In total, 8,005 unweighted records were analyzed, corresponding to a national estimate of 349,605 patients (SE 10,676) (all future statistics are derived from weighted estimates). The cohort was 82.2% female (SE 0.50%) and 17.8% male (SE 0.50%), corresponding to 287,244 (SE 8,923) and 62,361 (SE 2,603) discharges, respectively. Design-Corrected Logistic Regression: The design-corrected logistic regression approach indicated a robustincreasing trend in the annual number of rug-related injuries among both males (βyear,males = 5.120e-02, SE 9.153e-03) and females (βyear,females = 2.035e-02, SE 6.901e-03). The difference between the two was similarly marked (βyear,Δ = -3.085e-02, SE 7.150e-03). A plot of modeled annual injury count was slightly non-linear. (Table 1) (Figure 2) (Figure 3)

Table 1.

Comparison of Select Coefficients for Trend Analysis.

Figure 2.

Comparison of t Values from Regression Trend Analyses. DC: Design-Corrected.

Figure 3.

Observed and Predicted Values from Design-Corrected Logistic and Linear Regression. DCLR, Design-Corrected Logistic Regression.

Jackknife Re-weighted Linear Regression: The jackknife re-weighted linear regression approach also exhibited robust positive trends among both males (βyear,males = 184.8, SE 33.06) and females (βyear,females = 337.2, SE 115.8). Notably, the t-values for these coefficients were very similar to those observed with design-corrected logistic regression. The difference between the two trends, however, was less significant (βyear,Δ = 152.5, SE 97.09), and was not similar to that produced above. (Table 1)(Figure 2) (Figure 3) Weighted Least Squares Regression: This approach produced standard errors that were relatively similar to those produced in jackknife re-weighted linear regression, though with coefficients differing to a greater degree (βyear,males = 153.1, SE 30.37) (βyear,females = 219.3, SE 101.2). The difference observed between the two trends exhibited acomparable standard error to that produced using jackknife re-weighted linear regression (βyear,Δ = 63.08, SE 92.80). (Table 1) (Figure 2) Standard Linear Regression: Use of standard linear regression produced coefficients with identical magnitude but differing standard errors as compared to those from jackknife re-weighted linear regression (βyear,males = 184.8, SE 68.05) (βyear,females = 337.2, SE 68.05). The results for the difference between the trends was near identical (βyear,Δ = 152.5, SE 96.24). (Table 1) (Figure 2) Cochran Armitage Trend Test: The Cochran Armitage test indicated a highly robust difference in trend between the two sexes (Z = 32.92). (Table 1)

Discussion

Drivers of U.S. healthcare costs will experience increased scrutiny as health-related expenditures continue to grow. Trend analysis of public health and clinical practice patterns will play a significant role in characterizing these dynamics, and these analyses will often be conducted using complex survey data. Use of statistical techniques that account for estimate variance is critical, particularly when resource-allocation decisions may be made based on the presence or absence of a significant trend. This investigation illustrated two methods for assessing trends in aggregate outcomes from complex survey data: design-corrected logistic and jackknife re-weighted linear regression. While this study utilized the NEISS dataset, these results are readily generalizable to analysis of other complex survey data (e.g., NIS and NHANES). These approaches were compared to weighted least squares regression—a previously-employed technique for complex survey trend analysis—and standard, non-design-corrected analytic approaches. Both design-corrected and non-design-corrected techniques incorporated survey weights and thus produced identical point estimates for annual event occurrence. Non-design-corrected techniques, however, did not account for survey design parameters and therefore produced skewed values for the variability of these estimates. Ultimately, there was generally greater similarity between design-corrected approaches than when compared to techniques that did not account for survey design.

Comparison of Trends Among Subgroups:

Comparison between logistic and linear regression techniques is more apt with t-values than standard errors. Considering this, all design-corrected techniques produced comparable results for trends among both male and female subgroups, and the logistic and jackknife re-weighted linear techniques yielded profoundly similar values. The design-corrected approaches were highly distinct computationally; such mutual corroboration increases confidence in their validity. All design-corrected techniques exhibited markedly greater t-values for subgroup trends than did standard linear regression. Accordingly, public health and biomedical research that utilizes standard linear regression to analyze trends from complex survey data may be predisposed to misguided conclusions; researchers should instead utilize only design-corrected approaches. Among techniques employing linear regression, jackknife re-weighted and standard linear regression yielded near-identical point estimates for regression parameters. Weighted least squares regression exhibited generally lower values. This likely occurred due to weighting, with greater importance given to observations with lower standard errors. Higher y-values may therefore be disproportionately penalized, yielding attenuated trend magnitude. This finding has implications for researchers interested in preserving equal-weighting in trend analysis, particularly when their data exhibits a wide range of outcome values.

Comparison of Inter-Group Trends Between Subgroups:

The difference in trend among males versus females varied greatly between design-corrected approaches. For the design-corrected logistic regression approach, this is attributable to the modeling of relative versus absolute annual change (i.e., log-odds of event occurrence versus number of events). As expected, the trend modeled with logistic regression is non-linear. Accordingly, while linear regression coefficients indicated that the annual absolute change in event number among females is greater than among males, logistic regression coefficients showed that the annual relative change was greater among males than among females. Researchers conducting trend analysis should therefore consider whether their hypothesis is best assessed in relative or absolute terms and choose their technique accordingly. While standard errors were very similar between jackknife re-weighted and weighted least squares regression, regression parameter magnitude was markedly different. As above, this is likely attributable to disproportionate penalizing of larger observations under the weighted least squares approach. Indeed, the difference between jackknife re-weighted and weighted least squares regression coefficients for βyear,Δ is approximately equal to the difference in differences for the two approaches between βyear,males and β year,females. Notably, jackknife re-weighted linear regression produced comparable results for the difference in trend between males and females as compared to those from standard linear regression. It is possible that this is observation is related to features of our dataset, rather than of the method employed; we will test this hypothesis in planned simulation studies. In contrast, all regression-based approaches (including standard linear regression) yielded profoundly different outcomes as compared to a non-design-corrected Cochran Armitage trend test, which showed a highly significant difference between trends among males versus females. This finding has significant implications for future trend analyses of complex survey data. Use of non-design-corrected techniques may not only predispose to error, but the direction of such error (i.e., overestimation versus underestimation) may be dependent on the technique employed.

Holistic Method Comparison:

Performance aside, it is important to consider ease of implementation for design-corrected techniques. We observed design-corrected logistic regression to be the most straightforward of those tested, followed by weighted least squares and jackknife re-weighted linear regression. Design-corrected logistic regression required only a single incremental data cleaning step, leveraging existing survey data analysis techniques for further computation. Implementation was straightforward in R, SAS, and Julia, and required very little de novo programming. In contrast, the other approaches required a fair degree of customization. Jackknife re-weighted linear regression required not only replicate creation, but also custom variance, standard error, and t-value calculations. Additionally, the considerable number of replicates required for jackknife re-weighted linear regression (1717 replicates in total) made this approach more computationally intensive. Weighted least squares regression required custom regression parameter and variance calculations, as well as pooling of standard errors, techniques with which many researchers may not be familiar. Nevertheless, design-corrected logistic regression had a notable disadvantage: it modeled the linear annual change in log-odds and therefore assessed non-linear trends in annual event occurrence. As we observed, there may be significant differences between trends modeled in absolute versus relative annual change. While multivariate models are commonplace in epidemiology, researchers desirous of modeling strictly linear absolute annual change may opt for jackknife re-weighted regression. The choice of technique may ultimately be influenced by a priori assumptions regarding relationships between dependent and independent variables, as well as patterns observed in exploratory analyses.

Future Directions

Further characterization of these techniques will involve simulated survey datasets to assess reproducibility of trend assessment with replicate sampling. Reproducibility will be measured as the distribution of trend t-values as assessed by each technique across replicate samples. If one believes that a method accurately accounts for sampling approach (i.e., is “design-corrected”), then one should believe that the method will produce relatively more consistent results with replicate sampling than would a method that is not design-corrected. We hypothesize that design-corrected techniques will offer superior reproducibility over non-design-corrected techniques.

Conclusion

This investigation showed that, with proper dataset cleaning, design-corrected logistic regression is an accurate and easily implemented method to account for variance in trend analysis of aggregate outcomes from complex survey data. In the context of rug-related fractures among the elderly from the NEISS dataset, the results produced with design-corrected logistic regression were highly similar to those from jackknife re-weighted linear regression; the latter may also be employed when researchers are desirous of modeling strictly linear change in absolute quantities. Generally, all design-corrected techniques produced markedly different results as compared to those from standard linear regression and Cochran Armitage trend test. Thus, the results from non-design-corrected analysis may not only lead to misguided conclusions regarding trend robustness, but also manifest in either overestimation or underestimation, depending on the non-design-corrected technique employed. Future analyses of trends in aggregate outcomes from complex survey data should be conducted using only statistical techniques that account for estimate variance. Use of standard statistical procedures predisposes to error and should be avoided.

25 in total

1. Nationwide trends in inpatient admissions of pulmonary hypertension in the United States from 2000 to 2013.

Authors: Rutuja R Sikachi; Sonu Sahni; Dhruv Mehta; Abhishek Agarwal; Abhinav Agrawal
Journal: Adv Respir Med Date: 2017

2. National trends of incidence, treatment, and hospital charges of isolated C-2 fractures in three different age groups.

Authors: Sunil Kukreja; Piyush Kalakoti; Richard Murray; Menarvia Nixon; Symeon Missios; Bharat Guthikonda; Anil Nanda
Journal: Neurosurg Focus Date: 2015-04 Impact factor: 4.047

3. Spinal fusion for pediatric neuromuscular scoliosis: national trends, complications, and in-hospital outcomes.

Authors: Kavelin Rumalla; Chester K Yarbrough; Andrew J Pugely; Linda Koester; Ian G Dorward
Journal: J Neurosurg Spine Date: 2016-05-20

4. Trends in Hospitalizations Related to Invasive Aspergillosis and Mucormycosis in the United States, 2000-2013.

Authors: Snigdha Vallabhaneni; Kaitlin Benedict; Gordana Derado; Rajal K Mody
Journal: Open Forum Infect Dis Date: 2017-01-13 Impact factor: 3.835

5. 16-year trends in the infection burden for pacemakers and implantable cardioverter-defibrillators in the United States 1993 to 2008.

Authors: Arnold J Greenspon; Jasmine D Patel; Edmund Lau; Jorge A Ochoa; Daniel R Frisch; Reginald T Ho; Behzad B Pavri; Steven M Kurtz
Journal: J Am Coll Cardiol Date: 2011-08-30 Impact factor: 24.094

6. National Differences in Trends for Heart Failure Hospitalizations by Sex and Race/Ethnicity.

Authors: Boback Ziaeian; Gerald F Kominski; Michael K Ong; Vickie M Mays; Robert H Brook; Gregg C Fonarow
Journal: Circ Cardiovasc Qual Outcomes Date: 2017-07

7. Hospital discharges, readmissions, and ED visits for COPD or bronchiectasis among US adults: findings from the nationwide inpatient sample 2001-2012 and Nationwide Emergency Department Sample 2006-2011.

Authors: Earl S Ford
Journal: Chest Date: 2015-04 Impact factor: 9.410

8. Plateaued national utilization of adolescent bariatric surgery despite increasing prevalence of obesity-associated co-morbidities.

Authors: Tammy L Kindel; Daniel Lomelin; Corrigan McBride; Vishal Kothari; Dmitry Oleynikov
Journal: Surg Obes Relat Dis Date: 2015-09-21 Impact factor: 4.734

9. Rapid increase in hospitalization and mortality rates for severe sepsis in the United States: a trend analysis from 1993 to 2003.

Authors: Viktor Y Dombrovskiy; Andrew A Martin; Jagadeeshan Sunderram; Harold L Paz
Journal: Crit Care Med Date: 2007-05 Impact factor: 7.598

10. Nationwide trends of hospital admissions for acute cholecystitis in the United States.

Authors: Vaibhav Wadhwa; Yash Jobanputra; Sushil K Garg; Soumil Patwardhan; Dhruv Mehta; Madhusudhan R Sanaka
Journal: Gastroenterol Rep (Oxf) Date: 2016-05-11

1 in total

1. Trends in Regionalization of Emergency Care for Common Pediatric Conditions.

Authors: Anna M Cushing; Emily Bucholz; Kenneth A Michelson
Journal: Pediatrics Date: 2020-03-13 Impact factor: 7.124

1 in total