Literature DB >> 28693579

Hospital daily outpatient visits forecasting using a combinatorial model based on ARIMA and SES models.

Li Luo1, Le Luo1, Xinli Zhang2, Xiaoli He3.   

Abstract

BACKGROUND: Accurate forecasting of hospital outpatient visits is beneficial for the reasonable planning and allocation of healthcare resource to meet the medical demands. In terms of the multiple attributes of daily outpatient visits, such as randomness, cyclicity and trend, time series methods, ARIMA, can be a good choice for outpatient visits forecasting. On the other hand, the hospital outpatient visits are also affected by the doctors' scheduling and the effects are not pure random. Thinking about the impure specialty, this paper presents a new forecasting model that takes cyclicity and the day of the week effect into consideration.
METHODS: We formulate a seasonal ARIMA (SARIMA) model on a daily time series and then a single exponential smoothing (SES) model on the day of the week time series, and finally establish a combinatorial model by modifying them. The models are applied to 1 year of daily visits data of urban outpatients in two internal medicine departments of a large hospital in Chengdu, for forecasting the daily outpatient visits about 1 week ahead.
RESULTS: The proposed model is applied to forecast the cross-sectional data for 7 consecutive days of daily outpatient visits over an 8-weeks period based on 43 weeks of observation data during 1 year. The results show that the two single traditional models and the combinatorial model are simplicity of implementation and low computational intensiveness, whilst being appropriate for short-term forecast horizons. Furthermore, the combinatorial model can capture the comprehensive features of the time series data better.
CONCLUSIONS: Combinatorial model can achieve better prediction performance than the single model, with lower residuals variance and small mean of residual errors which needs to be optimized deeply on the next research step.

Entities:  

Keywords:  ARIMA; Combinatorial forecasting model; Daily outpatient visits; SES

Mesh:

Year:  2017        PMID: 28693579      PMCID: PMC5504658          DOI: 10.1186/s12913-017-2407-9

Source DB:  PubMed          Journal:  BMC Health Serv Res        ISSN: 1472-6963            Impact factor:   2.655


Background

Due to a rapid growing aging population, medical resources in China are comparatively scarce, with disproportional distribution and generally insufficient medical investment. It causes a great gap between supply and demand, as well as the increasing attention on optimal management of healthcare resources, which makes accurate forecasts of future healthcare demand and resource availability more significant and critical. Outpatient department (OD) is the window of the hospital external service from the hospital actual operation, and is experiencing increasing stress from year to year because of the increasing patient volumes, the growing complexity of health conditions and so on. The ability to predict outpatient visits is crucial for resource planning and allocation as well as efficient appointment scheduling in OD aimed to avoiding overcrowding and providing high quality patient care service [1]. Moreover, as the key and major element to implement the hospital management, daily outpatient visits are directly related to the workload of medical examinations and hospitalization services. Precise and reliable forecasts of the different types of outpatients amount can contribute to allocate the key healthcare resources scientifically (such as medical equipment, surgical equipment and hospital beds). So that it is significant to make an accurate forecast for the daily outpatient visits in advance, in order to help hospital managers to make the right decisions to meet the expected healthcare demand effectively and timely. In the field of health care system, increasing attention has been paid to the prediction of patient arrivals in hospital establishments, especially the prediction of Emergency department(ED)visits or admission which is substantially differ from the outpatient visits prediction [2-5]. According to ED data,patients arrive in an unplanned fashion beyond the control of the hospital, and the number of ED visits shows strong periodical, seasonal and stochastic fluctuations driven by factors such as climate, epidemic disease, the type of the day and socio demographic effects [6-8]. Based on these features, researchers used different methods to predict the number of ED visits hourly, daily and monthly [5, 6, 9]. In contrast, mainly affected by the current healthcare system, China’s outpatient visit is more of a planned elective nature. Patients can self-refer to any provider they could afford when they feel sick, and they should be booked days or weeks in advance to seek outpatient service in most of large-scale hospitals. However, in light of the fixed team arrangement and the limited outpatient service resources, the hospital can only accept a certain amount of outpatient visits every day, and the number of OD visits in large-scale hospitals fluctuates much less violently on a weekly, monthly or yearly basis. Instead, the daily number of OD visits shows a periodic fluctuation on a weekly basis, with a strong day-in-week variation mostly attributed to the doctors’ influence, which can be transferred to the downstream service nodes and have much influence on preparing the required reserve capacity in all sectors of hospital. Therefore, accurate prediction of the daily outpatient visits in 1 week can contribute to reasonable formulation of the weekly rotating shift schedule, which is of high importance to achieve an efficient routine management of outpatient services in hospitals. Meanwhile, it is also crucial for administrators in other sectors to optimize resource allocation and strategic planning. Motivated by overcrowding and resource scheduling problems arising in healthcare systems in China, we choose a typical hospital-West China Hospital (WCH) as cooperated hospital, and consider a time-series forecasting problem of daily outpatient visits. WCH is a public tertiary teaching hospital in Chengdu, China, which has large-scale outpatients. It served about 4.9 million outpatients and emergency attendances in 2014, and the number of outpatient visits accounted for about a quarter of the total amount. Research shows the amount of outpatient attendance and medical service needs are significantly affected by patient’s disease type, distance to healthcare providers and other factors, which present high level varying characteristics [10]. It’s worth noting that Chengdu patient accounts for the largest proportion of outpatients by each specialty of WCH. According to its internal statistical analysis on outpatient amount in 9 common and main specialty care units, the proportion of outpatients living in Chengdu city reached 46.31% in 2014.Compared with patients from other cities of Sichuan province and those from other provinces, the arrival pattern of outpatients from Chengdu showed higher randomness, moreover, the daily outpatient volume fluctuated relatively frequently. Therefore, this paper chose two common disease types (Endocrine and Respiratory disease) as illustrations to study the daily volume prediction of Chengdu outpatients.

Literature

For the past few years, increasing attention has been accorded to time series models to predict demands for medical services, such as patient arrivals, hospital discharges and length of stay [2, 3]. Compared to another classical method of regression, time series model [4, 5] is important for healthcare managers to capture features of short-term fluctuations better [3]. Being a general class of linear model, the ARIMA model can perfectly capture linear patterns in a time series with minimum computational efforts, so most studies adopt them to describe the relationship between the variables or use them as the benchmark to test the effectiveness of combined models with mixed results [9]. Sun et al. [11] and Kadri et al. [1] developed ARIMA models for forecasting daily attendances at ED of hospitals to prove time-series analysis to be a useful, readily available tool for predicting ED workload. Li et al. adopted ARIMA model to forecast monthly outpatient visits in a general hospital in China [9]. However, relationship between target variable and factors in many time series is nonlinear and complex, many recent studies have focused on the use of machine learning techniques as alternatives to the traditional time series methods, such as Fuzzy logic, Artificial Neural Network(ANN), Genetic Algorithms, Support Vector Machine(SVM), Taylor expansion and non-parametric smoothing etc. [1, 12–14]. Cheng et al. [15] and Garg et al. [12] proposed new fuzzy time series methods to forecast the number of outpatient visits. Hadavandi et al. [1] presented a hybrid artificial intelligence model based on Genetic Fuzzy Systems to forecast monthly outpatient visits of the department of internal medicine with high accuracy. Wang et al. [4] developed a novel forecasting model based on hybridization the firefly algorithm and support vector regression to forecast the daily diarrhoeal outpatient visits in Shanghai. Moreover, because a real-world time series is complex in nature, and a single linear or nonlinear model may not be totally sufficient to identify all the characteristics of the time series datasets, so many hybrid or combined models have been proposed to complement each other to improve forecasting accuracy while the literature on this topic has expanded dramatically [1, 16, 17]. Although machine learning techniques based on data-driven and non-parametric models could outperform many traditional techniques, they also have some limitations, such as unknown or hard to describe the underlying relationships among datasets [15]. As we known, high prediction accuracy depends not only on modeling but also on understanding of the data features, which means that advanced, sophisticated, and simpler extrapolation methods must be associated with specific features of data [18]. Hence, this paper differs from previous studies, in which we attempt to simplify the outpatient visits prediction problem and improve the forecasting accuracy by capturing the intrinsic fluctuating characteristics of the hospital’s daily outpatient visits. The analysis and forecasting of hospital daily outpatient visits belongs to time series prediction problem, in which the features of randomness and cyclic fluctuations have the biggest effect on forecasting accuracy [18]. Due to the ability of effectively extracting the linear features of randomness and trend, ARIMA model has found extensive applications in forecasting hospital daily visits of ED and OD [6, 19, 20]. Furthermore, aiming to better capture cyclicity over a period of week in daily time series data, ARIMA models have been extended and modified among which seasonal ARIMA (SARIMA) and multiplicative seasonal ARIMA (MSARIMA) models are the most widely used [3, 7, 21]. In addition to randomness and periodicity, the day of the week effect is also present in daily volume of patients in ED and OD as well as daily number of discharged patients [3, 8], which refers to the situation where the time series at the same time point within 7 days a week during different weeks show non-linear trend variation and some change patterns can be found in the data from Monday to Sunday. Several researchers proposed seasonal regression model to capture the day of the week effect of time series data, but it may be problematic and lead to poor forecasts due to the nonlinear patterns. Because of the good performance in capturing the nonlinear pattern and description of both trend and seasonality in the data, exponential smoothing(ES) methods have since been among the most widely used forecasting procedures in industry and commerce as well as healthcare [7, 22, 23]. Automated ES approach [5] and simple seasonal ES model [7] were applied to forecast the demand of medical care in terms of monthly and daily visits in ED of hospitals. Previous study indicated that the ES model incorporated the time correlation knowledge, made full use of the historical information [16] and consequently yielded superior performance in modeling ED demand due to its simplicity, robustness and accuracy [7]. Moreover, it was also demonstrated that although model selection should be done individually (per series) when we deal with heterogeneous data as to capture the different features met in each series [18], forecasting models alone are not sufficient for decision making [24], and combining models could improve predictive accuracy [25]. While ES model which is based on a description of nonlinear feature of trend and seasonality and SARIMA model which aims to capture the linear feature of autocorrelations and periodicity in the data provide complementary approaches to the prediction problem, we try to develop a new model to forecast the daily number of OD visits by combing them. This paper examines the issue of forecasting the daily outpatient visits in China’s large hospitals which are often overloaded with various types of patients. The objective is to develop a methodology, present data and modeling details and provide results of modeling the daily outpatient visits. In order to achieve good forecasting performance, several time series models are built for different features in each series respectively, and a combinatorial model is obtained for short-term forecasting. First we construct a SARIMA model to capture the cyclicity and autocorrelation in daily time series data. In terms of non-linearity in weekly time series data of the same day over different weeks, single exponential smoothing (SES) model is adopted to treat the day of the week effect. After that, we establish a linear combinatorial model based on the above models through residual modification approach. Finally the combinatorial forecasting model is verified by empirical studies involving two time series data of Chengdu outpatients in departments of endocrinology and respiratory in WCH over the entire year of 2014. The daily outpatient visits are predicted about 1 week ahead using the proposed model, and the results indicate that the combinatorial model outperforms any of the above two single traditional models.

Methods

In this section, we describe several models for forecasting the daily outpatient visits. Two models on time series are built for different traits of the outpatient visit data, and the combinatorial forecasting model is established based on the residual modification idea: Where represents the estimate value of daily outpatient visit for daily time series data, which is estimated by SARIMA model in “Seasonal ARIMA model” section; represents the estimate value of daily outpatient visit for the day of the week time series (weekly time series) data, which is estimated by SES model in “Single exponential smoothing model” section; li is the weighting coefficient of the single model i, calculated as describe in “Combinatorial model based on SARIMA and SES” section.

Seasonal ARIMA model

ARIMA model is an extension of ARMA model, combining differential operation and ARMA model. Seasonal ARIMA model is to model on the time series presenting with obvious seasonal effect (cyclic effect), by smoothing over the time series through trend and seasonal differencing along with the fitting of ARMA model. Its theoretical model has the following form:where and ε are the observed value of hospital daily outpatient visits and random error at time period t, D is step length for single cycle, d is order of differential equation for extracting trend and B is backshift operator. and are the polynomials in B for AR and MA components, respectively. Four steps are involved for modeling and forecasting using SARIMA model: smoothing over time series (differential operation), model recognition, parameter estimation and model diagnosis, and model forecasting. In the smoothing step, data transformation is often needed to make the time-series stationary. Results or plots of autocorrelation function (ACF) and partial autocorrelation function (PACF) of the sample data are the basic tools for analyzing stationary and order of model. When trend and seasonality are observed in the time series, differencing and log transformation are applied to eliminate the fluctuation to stabilize the variance. Once the initial model is determined, the key difficulty is the estimation of the other model parameters specifically, which can be estimated by an optimization procedure according to the Bayesian information criterion (BIC). To recognize whether the chosen model can describe the fluctuation pattern of time series data well, goodness-of-fit test is needed mainly through white noise test. If the residual series is not a white noise sequence, it is necessary to repeat above steps to create a new model.

Single exponential smoothing model

To deal with non-linearity in moving average interval, larger weights are assigned to short-term observations, while smaller weights to long-term observation, so as to achieve better forecasting result. Time series of daily outpatient visits of the same day in different weeks are modeled using SES model, and its general form is as follows:where α is smoothing constant, ranging from 0 to 1; is the observed value of daily outpatient visits of day τ in week t′; is the smoothed value of daily outpatient visits of day τ in week t′ and is the smoothed value of daily outpatient visits of day τ in week t′ + 1, which is selected to be the predicted value. The optimal value of α is found using an optimization method according to the criteria of minimum values of mean square error (MSE) and mean absolute percentage error (MAPE) between the predicted and observed values. The means of the observed values in the first 3 weeks in the same time series is taken as the initial value for SES forecasting [25].

Combinatorial model based on SARIMA and SES

Combinatorial model

In order to better capture the characteristics of autocorrelation, cyclicity and the day of the week effect of daily outpatient visits, a combinatorial model based on the two above models is built through residual modification idea in the form of weighted averaging. The formulations can be written as: where and are the estimated values of daily outpatient visits from SARIMA and SES, respectively; is the predicted value of the weighted combinatorial model; and li is weighting coefficient of single model i which is formulated as Where l1 + l2 = 1, and . The weighing coefficient is determined by the error indicators of the single model, and E is the error variance sum of the single model which is the indicator of forecasting accuracy. The lower the Ei, the higher the forecasting accuracy of model i and hence the higher the importance in combinatorial forecasting model. And, the framework of proposed combinatorial model based on SARIMA and SES models is shown in Fig. 1.
Fig. 1

The framework of proposed combinatorial model based on SARIMA and SES models

The framework of proposed combinatorial model based on SARIMA and SES models

Testing indicator

MAPE is used as the indicator of forecasting accuracy, calculated as

Results

Daily visits data of Chengdu outpatients in departments of endocrinology and respiratory in WCH over the year of 2014 are used for forecasting the daily outpatient visits 1 week ahead. In 2014, the total number of endocrinology outpatient visits (EOV) in WCH was 120,107, 57,763 of which came from Chengdu, accounting for 48.01%, while the total number of respiratory outpatient visits (ROV) was 108,045, 45,629 of which came from Chengdu, accounting for 42.23%.

Preliminary data analysis

Daily visits data of Chengdu outpatients in departments of endocrinology and respiratory in WCH over the year of 2014 are used respectively, each totaling 365. The data are inputted into Excel 2010 and SAS 9.4. Time series charts are generated for data from January 1st to December 31st, 2014. The average number of EOV and ROV are about 158/day and 125/day respectively, and the standard deviations are 77.87 and 76.11 respectively, which might mainly result from large fluctuations in seasonal effects and special day effects, such as holidays and weekends. Figures 2 and 3 provide a first view of the data. It can be seen that the daily outpatient visits fluctuate greatly, especially on holidays and in summer and winter vacation. Due to summer and winter vacation and the Spring Festival holiday, the outpatient visits in February and August are the lowest throughout the year. Moreover, a cyclic variation pattern is found over the period of a week, while great differences exist among different points in a same cycle, such as fewer outpatient visits over the weekends.
Fig. 2

Daily time-series data of EOV in 2014

Fig. 3

Daily time-series data of ROV in 2014

Daily time-series data of EOV in 2014 Daily time-series data of ROV in 2014 To account for the cyclicity, daily outpatient visits data spanning over incomplete periods are removed. Therefore, the time series from week 2 to week 52 (totally 51 weeks) are used for model building. The models are fitted for the first 301 observations from week 2 to week 44 (N = 301, 84% of all data as training set, from January 1, 2014 to November 2, 2014). The remaining 56 observations from week 45 to week 52 are utilized for examining the forecasting ability of fitted models (T = 56, 16% as testing set, from November 3, 2014 to December 28, 2014). That is, time series of 43 weeks for model fitting and time series of 8 weeks for verification while predicting each week of last 2 months based on all observations 1 week before. We explain the process of how to predict the daily outpatient visits in the 45th week of time series in detail, and so on, for each of the rest 7 weeks.

Pre-processing of singularities on holidays

The daily outpatient visits present obvious fluctuations in weekly pattern and special day effect, and the decrease of visits during national holidays such as the Spring Festival, the May Day, the Mid-Autumn Festival, etc. are significant, mainly resulting from the patients’ traditional customs and behavior habits. They can’t real reflect the daily routine fluctuations of outpatient visits. Therefore, singularity is identified which is distributed outside of the scope of 2 times the standard deviation from the mean value in each the day of the week time series. Finally, ten singularities in time series data of EOV as well as twelve in time series data of ROV are picked out and pre-processed, by replacing them with the mean of the same day in the previous and the following weeks.

Building the seasonal ARIMA model

Data analysis and differential operation

The stationarity of the time series is assessed through autocorrelation chart (Figs. 4, 5, 6, 7, 8, 9). The time series of EOV and ROV are non-stationary sequences and display strong cyclicity feature over the period of 7 days. Deterministic information is extracted from the original time series in these two departments both using first-order 7-step differencing. And the differential time series are verified as non-random series through white noise test. Then ARMA models are fitted and used for forecasting.
Fig. 4

Time series data of EOV

Fig. 5

Autocorrelation coefficients of the original time series of EOV

Fig. 6

Partial autocorrelation coefficients of the original time series of EOV

Fig. 7

Time series data of ROV

Fig. 8

Autocorrelation coefficients of the original time series of ROV

Fig. 9

Partial autocorrelation coefficients of the original time series of ROV

Time series data of EOV Autocorrelation coefficients of the original time series of EOV Partial autocorrelation coefficients of the original time series of EOV Time series data of ROV Autocorrelation coefficients of the original time series of ROV Partial autocorrelation coefficients of the original time series of ROV

Model recognition and parameter estimation

According to BIC, the optimal model is the one with the minimum value of BIC function, which is ARIMA (1,1,7) model for EOV and ARIMA(2,1,7) model for ROV. Model parameters are estimated with least-squares estimation and the parameters failing to pass the significance test are further adjusted. The estimated parameters are shown in Tables 1 and 2. According to residual analysis, P value of Ljung-Box test statistics at all lags is more than 0.05, indicating the reliability of the model design.
Table 1

Parameter estimation and testing of EOV

ParameterEstimationStandard errort valueApproximate Pr > |t|Lag
MA1,10.453930.051658.79<.00012
MA1,20.259160.044575.81<.00013
MA1,3−0.141160.04477−3.150.00186
MA1,40.428070.048788.78<.00017
AR1,1−0.513640.05550−9.25<.00011
Table 2

Parameter estimation and testing of ROV

ParameterEstimationStandard errort valueApproximate Pr > |t|Lag
MA1,10.301180.045246.66<.00012
MA1,20.365050.043458.40<.00013
MA1,3−0.146690.04209−3.490.00065
MA1,4−0.215790.03792−5.69<.00016
MA1,50.653100.0379717.20<.00017
AR1,1−0.686020.05706−12.02<.00011
AR1,2−0.317080.06558−4.83<.00012
Parameter estimation and testing of EOV Parameter estimation and testing of ROV The model forecasting the EOV is fitted asand the model forecasting the ROV is fitted as

Forecasting

The fitted models are used for short-term forecasting on the time series and the 7-day predicted values of the following week are obtained (Figs. 10, 11). The MAPE of the fitted and predicted values is 16.90 and 19.31% respectively in endocrinology department, while 25.32 and 25.44% in respiratory department.
Fig. 10

Fitted and predicted results using ARIMA model in endocrinology department

Fig. 11

Fitted and predicted results using ARIMA model in respiratory department

Fitted and predicted results using ARIMA model in endocrinology department Fitted and predicted results using ARIMA model in respiratory department

Building SES model

Aiming to capture the feature of the day of week effect, SES model is implemented for the time series of the day of the week (the same day in different weeks). The initial value for the smoothing and the smoothing constant α are determined using the method described in the above section. The predicted values are calculated using formula (3), and the MAPE of the fitted and the predicted values of EOV is 16.44 and 19.55% respectively, while the corresponding values of ROV is 16.56 and 21.90%.The predicted results from the two models in two departments are shown in Table 3.
Table 3

Forecasting performance comparison of two models during the 45th week

Time series dataWithin a single weekMondayTuesdayWednesdayThursdayFridaySaturdaySundayForecasting performance MAPE
EOVobserved values2061931821711508634
ARIMA model194.40189.47194.27248.46197.9066.1141.0019.31%
SES model196.76195.68190.00256.56193.7676.3346.2819.55%
ROVobserved values1731781451611631811
ARIMA model197.32197.88160.23174.97121.8324.8518.0325.44%
SES model186.12196.28177.78185.36126.7627.3213.6121.90%
Forecasting performance comparison of two models during the 45th week

Building the combinatorial model

The combinatorial model by combining SARIMA and SES model is established. Through the residual modification approach, the weighting coefficients for different days within a week are calculated with formula (4-6). The predicted and fitted values are presented in Tables 4 and 5, while residuals comparison between ARIMA, SES and combinatorial models are presented in Table 6. In turn, the prediction of the remaining 7 weeks can be computed using the same approach and process, through which the prediction results are finally acquired (Tables 7, 8, 9, 10).
Table 4

Fitting and forecasting performances of combinatorial model in endocrinology department during the 45th week

Within a single weekMondayTuesdayWednesdayThursdayFridaySaturdaySundayOverallWorkdaysWeekends
Combinatorial model fitted15.70%10.39%8.86%17.55%11.91%16.21%27.63%15.46%12.87%21.92%
Combinatorial model predicted5.09%0.46%5.75%47.33%30.45%15.94%29.04%19.15%17.82%22.49%
Weighting coefficient l10.530.570.580.570.460.390.46
Weighting coefficient l20.470.430.420.430.540.610.54
Table 5

Fitting and forecasting performances of combinatorial model in respiratory department during the 45th week

Within a single weekMondayTuesdayWednesdayThursdayFridaySaturdaySundayOverallWorkdaysWeekends
Combinatorial model fitted9.85%8.86%10.60%7.73%14.20%28.69%32.15%17.14%11.82%30.37%
Combinatorial model predicted10.96%10.77%16.37%11.93%23.69%47.87%38.05%22.81%14.74%42.99%
Weighting coefficient l10.520.550.510.500.480.280.36
Weighting coefficient l20.480.450.490.500.520.720.64
Table 6

Residuals comparison between ARIMA, SES and combinatorial model in the 45th week

Time series dataModelARIMASESCombinatorial
EOVMean of residual0.2358−0.6678−0.1994
Standard deviation of residual27.298030.960027.1130
ROVMean of residual−1.8806−0.7896−1.2008
Standard deviation of residual20.481020.964019.4588
Table 7

Fitting and forecasting performances comparison of three models in two departments during 8 weeks (I)

Time series dataFitted performance(MAPE)Predicted performance(MAPE)
ARIMASESCombinatorialARIMASESCombinatorial
EOV15.97%16.72%14.47%11.77%13.25%10.61%
ROV23.48%16.55%16.95%15.26%13.60%13.49%
Table 8

Fitting and forecasting performances comparison of three models in two departments during 8 weeks (II)

Time series dataPrediction performanceOverallWorkdaysWeekends
EOVARIMA11.77%11.23%13.13%
SES13.25%12.57%14.95%
combinatorial10.61%10.19%11.68%
ROVARIMA15.26%9.59%28.49%
SES13.60%9.78%23.15%
combinatorial13.49%9.51%23.44%
Table 9

Residuals comparison between three models during 8 weeks

Time series dataResiduals comparisonARIMASESCombinatorial
EOVMean of residual0.66250.48010.0504
Standard deviation of residual26.955730.186426.6696
ROVMean of residual−1.57360.4905−0.8760
Standard deviation of residual24.744424.508222.5943
Table 10

Weighting coefficient comparison between three models during 8 weeks

Time series dataweighting coefficientMondayTuesdayWednesdayThursdayFridaySaturdaySundayOverallWorkdaysWeekends
EOVl10.520.570.570.570.470.380.450.510.540.42
l20.480.430.430.430.530.620.550.490.460.58
ROVl10.510.560.510.480.470.330.360.460.510.35
l20.490.440.490.520.530.670.640.540.490.65
Fitting and forecasting performances of combinatorial model in endocrinology department during the 45th week Fitting and forecasting performances of combinatorial model in respiratory department during the 45th week Residuals comparison between ARIMA, SES and combinatorial model in the 45th week Fitting and forecasting performances comparison of three models in two departments during 8 weeks (I) Fitting and forecasting performances comparison of three models in two departments during 8 weeks (II) Residuals comparison between three models during 8 weeks Weighting coefficient comparison between three models during 8 weeks As seen above, in the two empirical studies, the result of analysis show the combinatorial model has superior prediction performance, with small mean of residual errors and residuals variance.

Discussion

This paper examines the issue of forecasting the daily outpatient visits in China’s large hospital. Daily visits data of Chengdu outpatients in departments of endocrinology and respiratory in WCH over 2014 are used for forecasting. The proposed models, SARIMA model and SES model, can forecast daily outpatient visits in the short term well alone and there are considerable improvements in forecasting performance by combining the two complementary approaches together. One important criterions of model selection is to select the appropriate method that has the best trade-off between the goodness-of-fit and the complexity of modeling (number of parameters). In this paper, we choose SARIMA and SES models to capture linear and nonlinear patterns, describe the features of autocorrelation, trend and cyclicity as well as the day of the week effect from different sub-time-series data respectively, and then establish a linear combinatorial model based on the above models through residual modification approach. All of the three models have good performance in forecasting accuracy. SARIMA model has predicted the daily number of EOV with a MAPE value of 0.1177 whereas it is 0.1325 for SES model, and 0.1061 for combinatorial model. And for the time series of ROV, the MAPE value of SARIMA model is 0.1526 whereas it is 0.1360 for SES model, and 0.1349 for combinatorial model. Moreover, the daily visits fluctuation of weekends series is relatively complex, which cannot be effectively extracted by single time series model. In our study, the combinatorial model is superior in modifying the extreme value of deviation in daily outpatient volume than SARIMA model,especially for the value of weekend’s data. For example, the results of weekend’s volume prediction show that MAPE reduces from 13.13 and 28.49% to 11.68 and 23.44% for EOV and ROV respectively. In general, the two single traditional models and the combinatorial model are simplicity of implementation and low computational intensiveness, whilst being appropriate for short-term forecast horizons over a large number of items. As indicated both by the result of preliminary data analysis and the weighting coefficients, the fluctuations of daily outpatient visits within 1 week show different characteristics. Linear fluctuation is predominant from Monday to Thursday (l1 > 0.5) which can be better extracted by SARIAM model, while non-linear fluctuation is predominant from Friday to Sunday (l2 > 0.5) which can be better captured by SES model. That is to say, the fluctuation of daily outpatient visits from Monday to Thursday is more dominated by objective features of time series, while that from Friday to Sunday is dominated by subjective features (e.g., work shift schedule of doctors and healthcare seeking behaviors of patients). Moreover, the analysis result also show that most weighting coefficients of the combinatorial model range from 0.4 to 0.6, indicating that the linearity fluctuation (randomness and cyclicity) and non-linearity trends (the day of the week effect) captured by SARIMA model and SES model have a balanced impact on the final predictions of daily outpatient visits. In other words, for the forecasting of daily outpatient visits, both two features should be taken into account. Furthermore, different kinds of diseases have different volatility features. The results show the average value of weighting coefficient l 2 of ROV (0.54) is bigger than that of EOV (0.49), which means that the non-linear fluctuation of ROV is more significant. In addition to the autocorrelation in daily time series data, the time series of ROV are more easily affected by uncertain factors. So, the prediction models of different diseases can be further optimized based on their own particular features and influence factors to increase forecasting accuracy. The literatures show that there is no universal model applicable to any environment due to the inherent complexity of the time series data in the real world. The proposed model in this paper is designed to improve the forecasting accuracy from the data driven by incorporating the intrinsic characteristics of the historical time series data of hospital outpatient visits. It includes the randomness and cyclicity in the daily time series and the day of the week effect in weekly time series. According to this consideration, we adopt classical time series models and parametric estimation methods to ensure stability and convergence property. And based on this, we describe the combinatorial model to forecast the cross-sectional data for 7 consecutive days of daily outpatient visits over an 8-weeks period based on 43 weeks of observation data in two outpatient departments. As shown in the previous forecast results, the prediction performance of the combinatorial model is superior to the two single traditional models, as well as has the small mean of residual errors and better residuals variance. Thus it can be seen whether for the volume of ED visits or other time series data of healthcare system, the prediction model which is associated with effectively extracting meaningful features embedded implicitly in data is of great theoretical and practical significance. What’s more, by analyzing the research of outpatient visits forecasting, it shows that the combinatorial model can better observe and extract the general regularity of daily time series data in two different departments from a finite training sample size, while the ARIMA model has significant difference in feature identification among the two different diseases. And data collection and prediction of more kinds of disease should be discussed in the further research so as to validate the stable capability of the proposed model to discover the intrinsic laws of time series data. Moreover, the forecasting horizon of 7 days for outpatient visits prediction in our paper is determined based on the current circumstances of healthcare system and hospital management requirement in China. It’s not hard to see China’s outpatient visit is more of a planned elective nature, and in light of the total outpatient amount controlling policy for large hospitals, the limited outpatient service resources and the weekly fixed team arrangement in most large hospitals, daily fluctuation within 1 week changes significantly compared to no evident difference on monthly or yearly fluctuation. Therefore, the prediction of daily outpatient visits about 1 week ahead has great realistic significance. Meanwhile, under such stable macro-environment (healthcare system) and micro-environment (hospital management requirement), the model can be applied to other large or medium scale hospitals where the daily time series of patient flow has the feature of randomness, cyclicity as well as the weekly pattern (the day of the week effect). Furthermore, medium and long term forecasting can be carried out based on the proposed combined model by adding enough new data sample until it can provide the relative complete information and features of the new time series. The daily outpatient visits forecasting about 1 week ahead have practical implications in hospital operation management, not only contributing to guiding the long-term resource planning and scheduling in OD or other related sectors, but also helping to make tactical resolutions for short-term adjustment in special day. For an example, to deal with the problems of relative large prediction deviations for some certain day within a week, some operation measures such as appointment overbooking or adding capacities on the spot can be adopted in outpatient service management. The impact of non-linear variation of outpatient visits for Friday can be counteracted by reasonable management strategies, such as optimizing the work shift schedule of doctors who have strong effect of attracting patients and guiding the healthcare seeking behaviors of patients. There are several limitations in this study. Firstly, the combinatorial model used in this paper is only for short-term forecasting from Monday to Sunday with 7 days in advance. More studies are required for moderate- and long-term forecasting of daily outpatient visits. Secondly, in addition to the inherent features of time series, other influence factors also come into play, such as the planned supply of outpatient resource and patients’ preference for certain doctors. The forecasting model can be optimized based on these features so as to achieve better prediction accuracy, especially for the daily visits prediction over the weekend. Thirdly, it is the lack of more discussions and analyses on alternative forecasting models, and reasons why they are not applicable. Fourthly, this paper only used outpatients’ visit data of two specific departments in one large hospital for forecasting, and data collection and prediction of more kinds of disease should be discussed in the further research, especially the chronic diseases.

Conclusions

In conclusion, this paper contributes to exploration of prediction method to forecast outpatient visits in China’s large hospitals. We compared the forecast accuracy of a SARIMA model, a SES model, and that of a combinatorial model by combining SARIMA model with SES model for short-term daily outpatient visits forecasting. All of the selection models are relatively simple in terms of implementation and of low computational intensiveness, whilst being appropriate for short-term prediction. And compared simple models, the combinatorial model can more effectively extract depth information from a finite training sample size, and achieve better performance for predicting daily outpatient visits 1 week ahead, with lower residuals variance and relative small mean of residual errors which needs to be optimized deeply on the next research step. Also, the results can be used to support the decisions of outpatient resource planning and scheduling. It helps hospital managers to implement periodical scheduling of available resources on the basis of periodic features, as well as the proactive scheduling of additional resources based on the increasing regularity caused by the day of the week effect.
  10 in total

1.  Forecasting daily patient volumes in the emergency department.

Authors:  Spencer S Jones; Alun Thomas; R Scott Evans; Shari J Welch; Peter J Haug; Gregory L Snow
Journal:  Acad Emerg Med       Date:  2008-02       Impact factor: 3.451

Review 2.  A systematic review of models for forecasting the number of emergency department visits.

Authors:  M Wargon; B Guidet; T D Hoang; G Hejblum
Journal:  Emerg Med J       Date:  2009-06       Impact factor: 2.740

3.  Short-term forecasting of emergency inpatient flow.

Authors:  Gad Abraham; Graham B Byrnes; Christopher A Bain
Journal:  IEEE Trans Inf Technol Biomed       Date:  2009-02-24

4.  Knowing what to expect, forecasting monthly emergency department visits: A time-series analysis.

Authors:  Jochen Bergs; Philipe Heerinckx; Sandra Verelst
Journal:  Int Emerg Nurs       Date:  2013-08-24       Impact factor: 2.142

5.  Forecasting emergency department visits using internet data.

Authors:  Andreas Ekström; Lisa Kurland; Nasim Farrokhnia; Maaret Castrén; Martin Nordberg
Journal:  Ann Emerg Med       Date:  2014-12-05       Impact factor: 5.721

6.  Time-Series Approaches for Forecasting the Number of Hospital Daily Discharged Inpatients.

Authors: 
Journal:  IEEE J Biomed Health Inform       Date:  2015-12-23       Impact factor: 5.772

7.  Time series modelling and forecasting of emergency department overcrowding.

Authors:  Farid Kadri; Fouzi Harrou; Sondès Chaabane; Christian Tahon
Journal:  J Med Syst       Date:  2014-07-23       Impact factor: 4.460

8.  Forecasting Daily Volume and Acuity of Patients in the Emergency Department.

Authors:  Rafael Calegari; Flavio S Fogliatto; Filipe R Lucini; Jeruza Neyeloff; Ricardo S Kuchenbecker; Beatriz D Schaan
Journal:  Comput Math Methods Med       Date:  2016-09-20       Impact factor: 2.238

9.  Forecasting daily attendances at an emergency department to aid resource planning.

Authors:  Yan Sun; Bee Hoon Heng; Yian Tay Seow; Eillyne Seow
Journal:  BMC Emerg Med       Date:  2009-01-29

10.  A comparison of multivariate and univariate time series approaches to modelling and forecasting emergency department demand in Western Australia.

Authors:  Patrick Aboagye-Sarfo; Qun Mai; Frank M Sanfilippo; David B Preen; Louise M Stewart; Daniel M Fatovich
Journal:  J Biomed Inform       Date:  2015-07-04       Impact factor: 6.317

  10 in total
  14 in total

1.  A Machine Learning Based Discharge Prediction of Cardiovascular Diseases Patients in Intensive Care Units.

Authors:  Kaouter Karboub; Mohamed Tabaa
Journal:  Healthcare (Basel)       Date:  2022-05-24

2.  Prediction model for COVID-19 patient visits in the ambulatory setting.

Authors:  Riza C Li; Cecelia K Harrison; Claudine T Jurkovitz; Mia A Papas; Kevin Ndura; Roger Kerzner; Cydney Teal; Tze Chiam
Journal:  Res Sq       Date:  2021-03-02

3.  Exploring the growth patterns of medical demand for eye care: a longitudinal hospital-level study over 10 years in China.

Authors:  Meng Yuan; Wenben Chen; Ting Wang; Yiyan Song; Yi Zhu; Chuan Chen; Yahan Yang; Yizhi Liu; Yanzhi Li; Haotian Lin
Journal:  Ann Transl Med       Date:  2020-11

4.  Comparison of ARIMA and LSTM in Forecasting the Incidence of HFMD Combined and Uncombined with Exogenous Meteorological Variables in Ningbo, China.

Authors:  Rui Zhang; Zhen Guo; Yujie Meng; Songwang Wang; Shaoqiong Li; Ran Niu; Yu Wang; Qing Guo; Yonghong Li
Journal:  Int J Environ Res Public Health       Date:  2021-06-07       Impact factor: 3.390

5.  Spatiotemporal analysis and forecasting model of hemorrhagic fever with renal syndrome in mainland China.

Authors:  Ling Sun; Lu-Xi Zou
Journal:  Epidemiol Infect       Date:  2018-08-06       Impact factor: 4.434

6.  Comparison of ARIMA and GM(1,1) models for prediction of hepatitis B in China.

Authors:  Ya-Wen Wang; Zhong-Zhou Shen; Yu Jiang
Journal:  PLoS One       Date:  2018-09-04       Impact factor: 3.240

7.  A discrete event simulation approach for reserving capacity for emergency patients in the radiology department.

Authors:  Li Luo; Yumeng Zhang; Fang Qing; Hongwei Ding; Yingkang Shi; Huili Guo
Journal:  BMC Health Serv Res       Date:  2018-06-15       Impact factor: 2.655

8.  Prediction of Daily Blood Sampling Room Visits Based on ARIMA and SES Model.

Authors:  Xinli Zhang; Yu Yu; Fei Xiong; Le Luo
Journal:  Comput Math Methods Med       Date:  2020-09-03       Impact factor: 2.238

9.  Excess Patient Visits for Cough and Pulmonary Disease at a Large US Health System in the Months Prior to the COVID-19 Pandemic: Time-Series Analysis.

Authors:  Joann G Elmore; Pin-Chieh Wang; Kathleen F Kerr; David L Schriger; Douglas E Morrison; Ron Brookmeyer; Michael A Pfeffer; Thomas H Payne; Judith S Currier
Journal:  J Med Internet Res       Date:  2020-09-10       Impact factor: 5.428

10.  Medical service demand forecasting using a hybrid model based on ARIMA and self-adaptive filtering method.

Authors:  Yihuai Huang; Chao Xu; Mengzhong Ji; Wei Xiang; Da He
Journal:  BMC Med Inform Decis Mak       Date:  2020-09-19       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.