Literature DB >> 30142765

The analysis of GM (1, 1) grey model to predict the incidence trend of typhoid and paratyphoid fevers in Wuhan City, China.

Xiaobing Yang1, Jiaojiao Zou, Deguang Kong, Gaofeng Jiang.   

Abstract

Typhoid and paratyphoid fevers (TPF), systemic emerging infectious diseases, is a serious health problem for society. If the incidence trend of TPF can be predicted, prevention and control measures can be taken in advance to reduce the harm to the people's health.Grey Model First Order One Variable [GM (1, 1)] was applied to predict the incidence trend of TPF with the incidence data of TPF in Wuhan City of China from 2004 to 2015. The original data were acquired from the national surveillance system.The GM (1, 1) model was established as ŷ (t + 1) = 0.88 e + 0.15. The goodness-of-fit test indicated that the precision (degree 2) was qualified (C = 0.40, P = .91). We further compared actual values with predicted values in 2016 and found that GM (1, 1) model we built has excellent performance in incidence trend prediction.Our prediction shows that the TPF incidences in Wuhan City will be slowly decreasing in the next 3 years. It is, however, still necessary to strengthen the comprehensive prevention and control to reduce the incidence level of TPF.

Entities:  

Mesh:

Year:  2018        PMID: 30142765      PMCID: PMC6112867          DOI: 10.1097/MD.0000000000011787

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.817


Introduction

Typhoid and paratyphoid fevers (TPF), systemic emerging infectious diseases, are caused by Salmonella enterica serotype Typhi or serotype Paratyphi (A, B, or C), respectively.[ With improvements in municipal drinking water treatment, sanitation, hygiene, and food production and preparation, illness and death caused by TPF became rare in industrialized nations in Europe and North America, but it remains a serious public health problem in developing countries.[ The incidence of TPF in China has been gradually decreased and remained at a comparatively low level since 2004. However, TPF is still one of the important sporadic intestinal infectious diseases in Wuhan City.[ Ingestion of contaminated water and food is the most common route of TPF transmission.[ In addition, many social factors, especially economic development level, health facilities, environmental factors, and living conditions can influence the incidence of TPF.[ Environmental factors, such as climate, have also been investigated to assess their influence on water-/food-borne infections.[ As a result, effective preventive strategies are required to control TPF. In 1982, Deng[ firstly established the grey system theory, which shows great capability for studying uncertainty problems with small sample, poor information, uncertain system, and lack of data. Grey system theory is developing from information theory, cybernetic theory and mathematical method to solve incomplete and uncertain problem, and it also accord with current system science and uncertain system theory. Grey system theory focuses on the “poor information” systems with “partial information known, partial information unknown.”[ The theory studies and forecasts the unknown area to master the whole system, through extracting valuable information from known information.[ During the last 3 decades, the grey system theory has been developed rapidly and caught the attention of many researchers. It has been widely and successfully applied in many fields such as social, scientific and technological, geological, and medical systems.[ Although various types of grey models can be mentioned, most of the previous researchers have focused their attention on Grey Model First Order One Variable [GM (1, 1)] models in their predictions because of its computational efficiency. GM (1, 1) type of grey model is the most widely used in the literature, pronounced as “Grey Model First Order One Variable.”[ GM (1, 1) model is a time series forecasting model, which is able to make accurate predictions for forecasting of the monotonous type of processes. The differential equations of the GM (1, 1) model have time-varying coefficients. In other words, the model is renewed as the new data become available to the prediction model. In this study, a GM (1, 1) model was proposed to make prediction of the TPF incidence trend based on the epidemiologic data of TPF in Wuhan City, and provide reference for the government in policy making.

Materials and methods

Data collection

The monthly case number of TPF and the average monthly incidence rates from 2004 to 2015 in Wuhan City were collected from the national surveillance system. TPF are notifiable diseases in China. All clinical and hospital doctors are required to report TPF cases to the local Center for Diseases Control and Prevention. In China Information System for Diseases Control and Prevention (CISDCP), both clinically and laboratory diagnosed TPF cases were collected without further distinction between typhoid and paratyphoid. Therefore, the clinically diagnosed and laboratory confirmed cases of typhoid fever and paratyphoid fever were all combined as TPF in CISDCP and in this study.

Ethics statement

The ethics committee of Wuhan Center for Disease Prevention and Control approved this study.

GM model principles

GMs predict the future values of a time series based only on a set of the most recent data depending on the window size of the predictor. It is assumed that all data used in grey models are positive, and the sampling frequency of the time series is fixed. In grey systems theory, GM (n, m) denotes a grey model, where n is the order of the difference equation and m is the number of variables. In grey system theory, a grey prediction model is one of the most important parts, and the GM (1, 1) model is the core of grey prediction. The purpose of GM (1, 1) model is to work on system forecasting with poor, incomplete, or uncertain messages. The GM (1, 1) model has more advantages over those traditional prediction ways, because it does not need to know whether the prediction variables obey normal distribution, and also does not require too much statistic sample. In order to smooth the randomness, the primitive data obtained from the system to form the GM (1, 1) is subjected to an operator, named Accumulating Generation Operator. The differential equation [ie, GM (1, 1)] is solved to obtain the n-step ahead predicted value of the system. Finally, using the predicted value, the Inverse Accumulating Generation Operator is applied to find the predicted values of original data.

Construction of grey prediction model GM (1, 1)

Let original series x(0)(i) = x(0)(1),x(0)(2),…,x(0)(n). By defining , we get a new series x(1)(k): x(1)(k) = x(1)(1),x(1)(2),…,x(1)(n). To some processes, x(1)(k) is the solution of the following white-formed ordinary differential equation. where a and u are gray number, that is, pendent parameters, which are estimated by least square method. The equation (1) is called GM (1, 1). The solution of (1) is: The equation (2) is called time response function. For k ≥ 2, is called predicting formula.

Accuracy testing of GM (1, 1)

Model with preferable fitting accuracy can be used to extrapolate predicted value. Otherwise, residual correction has to be carried out first. Usually, posterior error detection method is used to test the accuracy of GM (1, 1). The indexes of fitting testing includes posterior error ration (C) and small error probability (P). Posterior error ration (C) is the ratio of residual standard deviation (S) and data standard deviation (S). Obviously, if the residual standard deviation is smaller, the prediction accuracy is more excellent. The specific formula is as followed: In the formula (3), Small error probability (P) is: The prediction level is graded to 4 according to the 2 indexes above (Table 1).
Table 1

Judgment standard of prediction accuracy for GM (1, 1).

Judgment standard of prediction accuracy for GM (1, 1).

Extrapolated prediction

If the result of fitting test is satisfactory, the model can be considered as credible. Then predicted value can be extrapolated by the followed formula:

Statistical analysis

Excel 2013 was used to set up the database and establish predicting model for TPF incidence.

Results

TPF incidences from 2004 to 2015 in Wuhan City

The incidences of TPF in Wuhan City from 2004 to 2015 are shown in Figure 1.
Figure 1

Typhoid and paratyphoid fevers incidences from 2004 to 2015 in Wuhan City of China.

Typhoid and paratyphoid fevers incidences from 2004 to 2015 in Wuhan City of China.

Establishment of GM (1, 1) model for TPF incidence prediction

Actual reported incidences of TPF from 2004 to 2015 in Wuhan were taken as time series to build GM (1, 1) predicting model. Its functional equation is as followed:

Accuracy examination of grey prediction model GM (1, 1)

Goodness of Fit Test showed that prediction accuracy grade (C) is 0.40; small error probability (P) is .91; the grade of prediction accuracy was 2 (qualified). So the established model can be used in extrapolated prediction. Incidences of TPF from 2004 to 2015 in Wuhan city were fitted and the results showed that estimated values coincided with the actual values. Actual values in 2016 was 0.22/100,000, estimated values in 2016 was 0.08/100,000. The average of absolute residual value was 0.10/100,000 (Table 2).
Table 2

Comparison of actual value and estimate value of typhoid and paratyphoid fevers incidence.

Comparison of actual value and estimate value of typhoid and paratyphoid fevers incidence.

Prediction of TPF incidence

Based on the established functional equation, a short-term extrapolated prediction was carried out to estimate incidences of TPF from 2017 to 2019. The results were listed in Table 2. Extrapolated prediction showed that the incidences of TPF from 2017 to 2019 were 0.08/100,000, 0.07/100,000, and 0.05/100,000, respectively. The incidences of TPF were obviously slowly decreasing.

Discussion

Time series prediction refers to the process by which the future values of a system is forecasted based on the information obtained from the past and current data points.[ Generally, a predefined mathematical model is used to make accurate predictions. Statistical and artificial intelligence–based approaches are the 2 main techniques for time series prediction seen in the literature. However, these techniques are not accurate for nonlinear problems. More importantly, they need large number of samples and are too complex to be used in predicting future values.[ In contrast, grey system theory is a better alternative method for time series prediction, which is designed to work with system in which the available information is insufficient to characterize the system.[ In systems theory, a system can be defined with a color that represents the amount of clear information about that system. For instance, if the information is known entirely, the system is called a white system. If the information is unknown, it is called a black system. If the information is being incomplete, it can be named as a grey system. Strictly, every system can be considered as a grey system because there are always some uncertainties. Because of the noise from both inside and outside of the system, the information we can reach about that system is always uncertain and limited in scope.[ As superiority to conventional statistical approaches, grey system theory requires only a limited number of data to estimate the behavior of unknown systems, which is different from the previous methods. Because of its simple calculation process and higher forecasting accuracy, grey system theory has been widely used in the prediction of a lot of fields. In recent years, grey system theory has become more and more popular in biomedical information and technology. In grey system theory, GM (1, 1) model is an effective approach, which can make use of relatively small data sets and does not require to comply with certain statistical laws strictly, simple or linear relationships among the observable variables. Thus, it can overcome the disadvantages of statistical method.[ Therefore, this study adopted GM (1, 1) model to predict the incidence of TPF in Wuhan City and provided the results as reference for future studies and policy makers. In the present study, based on the raw data of TPF incidence from 2004 to 2015 in Wuhan City, GM (1, 1) model was built to forecast the incidence in the next 3 years. The model accuracy examination results show that GM (1, 1) model is able to make accurate predictions for forecasting incidence of TPF. We compared actual value with predicted value in 2016. The result showed that predicted values are consistent with actual values, indicating that GM (1, 1) model we built is credible and effective in practice. Traditional TPF incidences estimation methods use statistics analysis, so that large data samples are required. With restricted conditions, it usually causes the results lack of authenticity and unsuitable to apply in practical use. By using the grey system modeling, the more reliable prediction can be obtained for future policy making in TPF prevention. In addition, incidences of infectious diseases are deeply influenced by social and natural factors, so database of grey model should be updated in time for long-term analysis. According to the prediction of the TPF incidences, we found that the TPF incidences in Wuhan City will be slowly decreasing in the future 3 years. However, we should know that TPF still causes approximately 200,000 deaths annually and >90% of TPF cases are estimated to occur in Asia, owing to the consumption of unsafe drinking water, inadequate sewage disposal, and flooding.[ The recent increase in fluoroquinolone resistance of S enterica serotype Typhi has raised concerns due to the limited treatment options available in TPF endemic countries.[ How to prevent TPF effectively is still a big challenge in less-industrialized countries, for preventive measures are vital to reduce the occurrence of typhoid fever and avoid new outbreaks and effective prevention will result in large cost savings to the national health care system. Contaminated water and food are important vehicles for transmission of TPF and preventive public health measures based on sanitation and hygiene have proved to be essential to the reduction of TPF. Therefore, careful food preparation and washing of hands are crucial in preventing TPF. Adequate water treatment, waste disposal, and protection of the food supply from contamination are also important public health measures. In addition, carriers of TPF must not be allowed to work as food handlers. And finally, the use of TPF vaccines will be helpful to reduce the susceptibility of hosts to infection. It was reported that the incidence of TPF was high among the residents of the densely populated urban community.[ As low-income workers are increasingly attracted to urban centers and rural-urban fringe zone with available jobs, the population residing in informal settlements will not have access to the available water and sanitary infrastructure. The incidence of TPF demonstrates the need for longer-term investment in improvement of water and sanitation services to reduce the burden of multiple fecal-oral transmitted pathogens in these communities. Targeted vaccination against TPF would be a valuable immediate step to reduce disease burden, especially in densely populated urban community. However, more emphasis should be placed upon sanitary improvements and health education, rather than focusing solely on improving the health delivery system. It is also likely that an important proportion of cases are due to travels. Pretravel screening and vaccination strategies are essential measures for travelers to endemic areas. Our present study must be interpreted in light of some limitations. Firstly, TPF cases reporting guideline were “GB 16001-1995” (2004–2007) and “WS 280-2008” (2008–2015), respectively. There are 3 slight differences between these 2 versions [(a) field epidemiological survey criterion in “WS 280-2008” not only contains factors such as epidemic area in “GB 16001-1995,” but also consists of specific time, close contact history, and hygienic habits. (b) unexplained persistent high fever was defined as clinical symptom in “WS 280-2008,” which was more specific than persistent high fever in “GB 16001-1995”. (c) in “WS 280-2008,” laboratory test results could present the count of eosinophilic granulocytes were decreased or disappeared, and the total number of white blood cells were normal or lower, whereas in “GB 16001-1995,” it described as the count of eosinophilic granulocytes disappeared, and the total number of white blood cells were lower], and it seemed the criterion of “WS 280-2008” is more extensive and more precise, which may slightly influence the efficacy of prediction. Secondly, we have not construct train set, because the data are actually not big enough for training a GM model if being divided into train set, validation set, and test set. Therefore, expanding the data of TPF should be considered for future research. In conclusion, the purpose of the present study was to adopt grey system theory to predict the incidence trend of TPF in Wuhan City, and provide reference for the government in policy making. Our study shows that grey forecasting model GM (1, 1) we built is an effective forecast method and can be used to predict the incidence of TPF. This finding may serve as a reference to future studies and policy making.

Acknowledgments

This work was supported by the Young and Middle-aged Medical Scholar Training Project of Wuhan from Health and Family Planning Commission of Wuhan Municipality.

Author contributions

Conceived and designed the experiments: GFJ. Analyzed the data: JJZ. Contributed reagents/materials/analysis tools: DGK. Wrote the article: XBY. Conceptualization: Gaofeng Jiang. Formal analysis: Jiaojiao Zou. Resources: Deguang Kong. Writing – original draft: Xiaobing Yang. Writing – review and editing: Xiaobing Yang. Author name: ORCID: 0000-0002-2472-5730
  31 in total

Review 1.  Climate change and human health: present and future risks.

Authors:  Anthony J McMichael; Rosalie E Woodruff; Simon Hales
Journal:  Lancet       Date:  2006-03-11       Impact factor: 79.321

2.  Analysis of aerobic granular sludge formation based on grey system theory.

Authors:  Cuiya Zhang; Hanmin Zhang
Journal:  J Environ Sci (China)       Date:  2013-04-01       Impact factor: 5.565

3.  iCataly-PseAAC: Identification of Enzymes Catalytic Sites Using Sequence Evolution Information with Grey Model GM (2,1).

Authors:  Xuan Xiao; Meng-Juan Hui; Zi Liu; Wang-Ren Qiu
Journal:  J Membr Biol       Date:  2015-06-16       Impact factor: 1.843

4.  iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory.

Authors:  Wang-Ren Qiu; Bi-Qian Sun; Xuan Xiao; Dong Xu; Kuo-Chen Chou
Journal:  Mol Inform       Date:  2016-05-12       Impact factor: 3.353

5.  Burden of typhoid and paratyphoid fever in a densely populated urban community, Dhaka, Bangladesh.

Authors:  Aliya Naheed; Pavani K Ram; W Abdullah Brooks; M Anowar Hossain; Michele B Parsons; Kaisar Ali Talukder; Eric Mintz; Stephen Luby; Robert F Breiman
Journal:  Int J Infect Dis       Date:  2010-03-16       Impact factor: 3.623

6.  Association between the incidence of typhoid and paratyphoid fever and meteorological variables in Guizhou, China.

Authors:  Lu-Xi Wang; Xiu-Jun Li; Li-Qun Fang; Duo-Chun Wang; Wu-Chun Cao; Biao Kan
Journal:  Chin Med J (Engl)       Date:  2012-02       Impact factor: 2.628

7.  Antimicrobial drug resistance of Salmonella enterica serovar typhi in asia and molecular mechanism of reduced susceptibility to the fluoroquinolones.

Authors:  Tran Thuy Chau; James Ian Campbell; Claudia M Galindo; Nguyen Van Minh Hoang; To Song Diep; Tran Thu Thi Nga; Nguyen Van Vinh Chau; Phung Quoc Tuan; Anne Laure Page; R Leon Ochiai; Constance Schultsz; John Wain; Zulfiqar A Bhutta; Christopher M Parry; Sujit K Bhattacharya; Shanta Dutta; Magdarina Agtini; Baiqing Dong; Yang Honghui; Dang Duc Anh; Do Gia Canh; Aliya Naheed; M John Albert; Rattanaphone Phetsouvanh; Paul N Newton; Buddha Basnyat; Amit Arjyal; Tran Thi Phi La; Nguyen Ngoc Rang; Le Thi Phuong; Phan Van Be Bay; Lorenz von Seidlein; Gordon Dougan; John D Clemens; Ha Vinh; Tran Tinh Hien; Nguyen Tran Chinh; Camilo J Acosta; Jeremy Farrar; Christiane Dolecek
Journal:  Antimicrob Agents Chemother       Date:  2007-10-01       Impact factor: 5.191

8.  The role of public health improvements in health advances: the twentieth-century United States.

Authors:  David Cutler; Grant Miller
Journal:  Demography       Date:  2005-02

9.  Forecasting and Analyzing the Disease Burden of Aged Population in China, Based on the 2010 Global Burden of Disease Study.

Authors:  Chengzhen Bao; Mamat Mayila; Zhenhua Ye; Jianbing Wang; Mingjuan Jin; Wenjiong He; Kun Chen
Journal:  Int J Environ Res Public Health       Date:  2015-06-25       Impact factor: 3.390

Review 10.  Overview of the impact of Typhoid and Paratyphoid fever. Utility of Ty21a vaccine (Vivotif®).

Authors:  D Amicizia; L Arata; F Zangrillo; D Panatto; R Gasparini
Journal:  J Prev Med Hyg       Date:  2017-03
View more
  7 in total

1.  Prediction of Incidence Trend of Influenza-Like Illness in Wuhan Based on ARIMA Model.

Authors:  Pai Meng; Juan Huang; Deguang Kong
Journal:  Comput Math Methods Med       Date:  2022-07-12       Impact factor: 2.809

2.  Impact of COVID-19 pandemic on the epidemiology of STDs in China: based on the GM (1,1) model.

Authors:  Jingmin Yan; Yanbo Li; Pingyu Zhou
Journal:  BMC Infect Dis       Date:  2022-06-04       Impact factor: 3.667

3.  Research on the predictive effect of a combined model of ARIMA and neural networks on human brucellosis in Shanxi Province, China: a time series predictive analysis.

Authors:  Mengmeng Zhai; Wenhan Li; Ping Tie; Xuchun Wang; Tao Xie; Hao Ren; Zhuang Zhang; Weimei Song; Dichen Quan; Meichen Li; Limin Chen; Lixia Qiu
Journal:  BMC Infect Dis       Date:  2021-03-19       Impact factor: 3.090

4.  The research of ARIMA, GM(1,1), and LSTM models for prediction of TB cases in China.

Authors:  Daren Zhao; Huiwu Zhang; Qing Cao; Zhiyi Wang; Sizhang He; Minghua Zhou; Ruihua Zhang
Journal:  PLoS One       Date:  2022-02-23       Impact factor: 3.240

5.  The research of SARIMA model for prediction of hepatitis B in mainland China.

Authors:  Daren Zhao; Huiwu Zhang; Qing Cao; Zhiyi Wang; Ruihua Zhang
Journal:  Medicine (Baltimore)       Date:  2022-06-10       Impact factor: 1.817

6.  Predictive analysis of the number of human brucellosis cases in Xinjiang, China.

Authors:  Yanling Zheng; Liping Zhang; Chunxia Wang; Kai Wang; Gang Guo; Xueliang Zhang; Jing Wang
Journal:  Sci Rep       Date:  2021-06-01       Impact factor: 4.379

7.  Time series analysis of cumulative incidences of typhoid and paratyphoid fevers in China using both Grey and SARIMA models.

Authors:  Jiaqi Gao; Jiayuan Li; Mengqiao Wang
Journal:  PLoS One       Date:  2020-10-28       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.