| Literature DB >> 26901682 |
Xingyu Zhang1,2, Tao Zhang2, Jiao Pei3, Yuanyuan Liu2, Xiaosong Li2, Pau Medrano-Gracia1.
Abstract
BACKGROUND: The infection rate of syphilis in China has increased dramatically in recent decades, becoming a serious public health concern. Early prediction of syphilis is therefore of great importance for heath planning and management.Entities:
Mesh:
Year: 2016 PMID: 26901682 PMCID: PMC4763154 DOI: 10.1371/journal.pone.0149401
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Trend of syphilis incidence in China from 2005–2012 by year
Syphilis incidence seasonal indices.
| Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Primary syphilis | 0.88 | 0.81 | 1.06 | 1.03 | 1.10 | 1.08 | 1.09 | 1.10 | 1.01 | 1.00 | 0.95 | 0.86 |
| Secondary syphilis | 0.86 | 0.77 | 0.98 | 0.99 | 1.19 | 1.25 | 1.27 | 1.24 | 1.04 | 0.96 | 0.80 | 0.65 |
| Tertiary syphilis | 0.96 | 0.82 | 1.05 | 1.02 | 1.14 | 1.07 | 1.05 | 1.06 | 0.97 | 1.00 | 0.97 | 0.87 |
| Latent syphilis | 0.82 | 0.78 | 1.06 | 1.01 | 1.08 | 1.06 | 1.07 | 1.08 | 1.03 | 1.04 | 1.02 | 0.96 |
| Congenital syphilis | 0.90 | 0.81 | 0.97 | 0.91 | 0.97 | 0.96 | 1.08 | 1.14 | 1.08 | 1.12 | 1.07 | 0.98 |
Fig 2Seasonal indices of each type of syphilis
Linear regression model for each syphilis time series removed seasonality.
| Constant | Coefficient | Coefficient 95% confidence Interval | R2 | P | ||
|---|---|---|---|---|---|---|
| Primary syphilis | 0.28232 | 0.00473 | 0.00442 | 0.00504 | 0.908 | 0.002 |
| Secondary syphilis | 0.21493 | 0.00237 | 0.00219 | 0.00255 | 0.872 | <0.001 |
| Tertiary syphilis | 0.00550 | 0.00015 | 0.00014 | 0.00016 | 0.903 | <0.001 |
| Latent syphilis | 0.16333 | 0.01390 | 0.01329 | 0.01451 | 0.955 | 0.007 |
| Congenital syphilis | 0.02824 | 0.00055 | 0.00050 | 0.00060 | 0.848 | <0.001 |
Note: The linear relationship between the deseasonalized syphilis incidence (dependent variable) and time t (independent variable) is estimated in the model.
Available ARIMA models fitted for each syphilis series.
| Syphilis | Model | AIC | AICC | SBC |
|---|---|---|---|---|
| Primary Syphilis | ARIMA(0,0,0)×(0,1,0) | -220.21 | -220.21 | -220.21 |
| ARIMA(1,0,0)×(0,1,0) | -240.1 | -240.04 | -237.83 | |
| ARIMA(0,0,1)×(0,1,0) | -264.49 | -264.43 | -262.23 | |
| ARIMA(1,0,1)×(0,1,0) | -240.1 | -240.04 | -237.83 | |
| ARIMA(1,0,0)×(0,1,1) | -262.03 | -261.86 | -257.5 | |
| ARIMA(0,0,1)×(1,1,0) | -273.28 | -273.11 | -268.76 | |
| ARIMA(1,0,0)×(1,1,0) | -251.45 | -251.28 | -246.93 | |
| ARIMA(2,0,0)×(0,1,0) | -248.59 | -248.24 | -244.06 | |
| ARIMA(3,0,0)×(0,1,0) | -257.4 | -257.05 | -250.61 | |
| Secondary Syphilis | ARIMA(0,0,0)×(0,1,0) | -299.15 | -299.15 | -299.15 |
| ARIMA(1,0,0)×(0,1,0) | -317.33 | -317.27 | -315.07 | |
| ARIMA(0,0,1)×(0,1,0) | -335.32 | -335.26 | -333.05 | |
| ARIMA(1,0,0)×(1,1,0) | -337.25 | -337.08 | -332.72 | |
| ARIMA(1,0,0)×(0,1,1) | -340.42 | -340.25 | -335.9 | |
| ARIMA(0,0,1)×(1,1,0) | -350.34 | -350.17 | -345.81 | |
| ARIMA(2,0,0)×(0,1,0) | -323.53 | -323.36 | -319 | |
| ARIMA(3,0,0)×(0,1,0) | -331.74 | -331.39 | -324.95 | |
| Tertiary Syphilis | ARIMA(0,0,0)×(0,1,0) | -656.31 | -656.31 | -656.31 |
| ARIMA(1,0,0)×(0,1,0) | -680.65 | -680.59 | -678.38 | |
| ARIMA(0,0,1)×(0,1,0) | -686.42 | -686.36 | -684.16 | |
| ARIMA(1,0,0)×(1,1,0) | -689.92 | -689.75 | -685.4 | |
| ARIMA(1,0,0)×(0,1,1) | -702.65 | -702.48 | -698.13 | |
| ARIMA(1,0,0)×(0,1,1) | -649.36 | -649.19 | -689.84 | |
| ARIMA(2,0,0)×(0,1,0) | -682.91 | -682.74 | -678.38 | |
| Latent Syphilis | ARIMA(0,0,0)×(0,1,0) | -129.69 | -129.69 | -129.69 |
| ARIMA(1,0,0)×(0,1,0) | -162.44 | -162.38 | -160.18 | |
| ARIMA(0,0,1)×(0,1,0) | -177.83 | -177.77 | -175.57 | |
| ARIMA(1,0,0)×(1,1,0) | -182.03 | -181.86 | -177.5 | |
| ARIMA(1,0,0)×(0,1,1) | -184.44 | -184.27 | -179.92 | |
| ARIMA(1,0,0)×(0,1,1) | -193.77 | -193.6 | -189.25 | |
| ARIMA(2,0,0)×(0,1,0) | -166.47 | -166.3 | -161.95 | |
| ARIMA(3,0,0)×(0,1,0) | -177.47 | -177.12 | -170.68 | |
| Congenital Syphilis | ARIMA(0,0,0)×(0,1,0) | -519.28 | -519.28 | -519.28 |
| ARIMA(1,0,0)×(0,1,0) | -537.62 | -537.56 | -535.36 | |
| ARIMA(0,0,1)×(0,1,0) | -558.88 | -558.82 | -556.62 | |
| ARIMA(1,0,0)×(1,1,0) | -546.76 | -546.59 | -542.23 | |
| ARIMA(1,0,0)×(0,1,1) | -559.86 | -559.69 | -555.33 | |
| ARIMA(1,0,0)×(0,1,1) | -565.61 | -565.44 | -561.09 | |
| ARIMA(2,0,0)×(0,1,0) | -545.65 | -545.3 | -541.13 | |
| ARIMA(3,0,0)×(0,1,0) | -552.22 | -551.87 | -545.43 |
Note: the first model for each syphilis ARIMA(0,0,0)×(0,1,0) only included the seasonal differencing term. The second model ARIMA(1,0,0)×(0,1,0), i.e., SAR(1) included the seasonal differencing and an autoregressive term. The third model ARIMA(0,0,1)×(0,1,0), i.e., SMA(1), included the differencing and moving average terms. These three models can be treated as the baseline. The final ARIMA model selected is highlighted in bold.
The highest cross-correlation between syphilis incidence time series (the values in the brackets are the lags, the lags are 0 except between secondary and congenital syphilis).
| Primary | Secondary | Tertiary | Congenital | Latent | |
|---|---|---|---|---|---|
| Primary | 1 | 0.87(0) | 0.95(0) | 0.93(0) | 0.97(0) |
| Secondary | 1 | 0.79(0) | 0.77(-2) | 0.76 (0) | |
| Tertiary | 1 | 0.90(0) | 0.95(0) | ||
| Congenital | 1 | 0.96(0) | |||
| Latent | 1 |
Fig 3Cross correlation analysis between primary syphilis and other syphilis time series (highest values appear when the lag is 0)
Fig 4Cross correlation analysis between secondary syphilis and other syphilis time series (highest values appear when the lag is 0 except between secondary and congenital syphilis).
Estimation of available ARIMAX models for each series.
| Syphilis | Model | Covariates | AIC | AICC | SBC |
|---|---|---|---|---|---|
| Primary | ARIMAX(0,0,0)×(0,0,0) | Secondary, Tertiary, Latent, Secondary | -451.83 | -451.32 | -442.1 |
| ARIMAX(1,0,0)×(0,0,0) | Secondary, Tertiary, Latent, Secondary | -469.62 | -468.85 | -457.47 | |
| ARIMAX(0,0,1)×(0,0,0) | Secondary, Tertiary, Latent, Secondary | -461.87 | -461.1 | -449.72 | |
| ARIMAX(1,0,0)×(0,0,1) | Secondary, Tertiary, Latent, Secondary | -470.54 | -469.45 | -455.95 | |
| ARIMAX(0,0,0)×(1,0,1) | Secondary, Tertiary, Latent, Secondary | -470.54 | -469.45 | -455.95 | |
| ARIMAX(1,0,0)×(1,0,1) | Secondary, Tertiary, Latent, Secondary | -471.67 | -470.2 | -454.65 | |
| Secondary | ARIMAX(0,0,0)×(0,0,0) | Primary, Latent, Primary | -346.31 | -346.01 | -339.02 |
| ARIMAX(1,0,0)×(0,0,0) | Primary, Latent, Primary | -376.06 | -375.55 | -366.34 | |
| ARIMAX(0,0,1)×(0,0,0) | Primary, Latent, Primary | -376.23 | -375.73 | -366.51 | |
| ARIMAX(1,0,1)×(0,0,0) | Primary, Latent, Primary | -381.75 | -380.98 | -396.6 | |
| ARIMAX(,0,1)×(0,0,1) | Primary, Latent, Primary | -401.33 | -400.56 | -389.17 | |
| ARIMAX(1,0,0)×(0,0,1) | Primary, Latent, Primary | -402.89 | -402.12 | -390.74 | |
| ARIMAX(0,0,1)×(1,0,0) | Primary, Latent, Primary | -441.6 | -440.83 | -429.44 | |
| Tertiary | ARIMAX(0,0,0)×(0,0,0) | Primary, Latent | -892.12 | -891.97 | -887.26 |
| ARIMAX(1,0,(3))×(0,0,0) | Primary, Latent | -902.55 | -902.04 | -892.83 | |
| ARIMAX(0,0,(3))×(0,0,0) | Primary, Latent | -897.93 | -897.63 | -890.64 | |
| ARIMAX(0,0,1)×(0,0,0) | Primary, Latent | -897.57 | -897.27 | -890.27 | |
| Latent | ARIMAX(0,0,0)×(0,0,0) | Primary, Secondary, Tertiary, Congenital, Primary | -284.93 | -283.46 | -267.91 |
| ARIMAX(0,0,1)×(0,0,0) | Primary, Secondary, Tertiary, Congenital, Primary | -283.52 | -281.6 | -264.08 | |
| ARIMAX(0,0,0)×(1,0,0) | Primary, Secondary, Tertiary, Congenital, Primary | -253.77 | -251.85 | -234.32 | |
| ARIMAX(0,0,0)×(0,0,1) | Primary, Secondary, Tertiary, Congenital, Primary | -304.83 | -302.91 | -285.39 | |
| Congenital | ARIMAX(0,0,0)×(0,0,0) | Primary,Secondary, Latent, Secondary | -672.88 | -672.37 | -663.25 |
| ARIMAX(0,0,1)×(0,0,0) | Primary,Secondary, Latent, Secondary | -682.67 | -681.9 | -670.63 | |
| ARIMAX(0,0,1)×(0,0,1) | Primary,Secondary, Latent, Secondary | -691.42 | -690.33 | -676.98 | |
| ARIMAX(0,0,1)×(1,0,0) | Primary,Secondary, Latent, Secondary | -692.54 | -691.45 | -678.1 |
Note:
* means the interaction term. The final ARIMAX model selected is highlighted in bold.
Fig 5Primary syphilis incidence fitting and testing performance by ARIMA and ARIMAX (U95 and L95 refer to the upper and lower 95% confidential interval respectively.
The vertical gray line separates modelling from estimates.)
Fig 9Congenital syphilis incidence fitting and testing performance by ARIMA and ARIMAX (U95 and L95refers to the upper and lower 95% confidential interval respectively.
The data were divided into modeling and forecasting groups with a vertical line; the left is the modeling part, and the right is the forecasting part.)
Comparison of the performances of the ARIMA model and ARIMAX model.
| MAE | MAPE | RMSE | U95-L95 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ARIMA | ARIMAX | ARIMA | ARIMAX | ARIMA | ARIMAX | ARIMA | ARIMAX | ||
| Modelling Performance | Primary | 0.0248 | 0.0109 | 0.05 | 0.022 | 0.0331 | 0.0134 | 0.1317 | 0.0552 |
| Secondary | 0.0146 | 0.0114 | 0.0464 | 0.0356 | 0.0199 | 0.0143 | 0.0791 | 0.0644 | |
| Tertiary | 0.0013 | 0.0009 | 0.1115 | 0.0771 | 0.0016 | 0.0011 | 0.0063 | 0.0046 | |
| Latent | 0.0469 | 0.0227 | 0.0621 | 0.0294 | 0.0601 | 0.0288 | 0.2389 | 0.113 | |
| Congenital | 0.0032 | 0.0027 | 0.0592 | 0.0492 | 0.0041 | 0.0033 | 0.0163 | 0.0132 | |
| Testing Performance | Primary | 0.0754 | 0.0393 | 0.1172 | 0.0598 | 0.0775 | 0.0419 | 0.1476 | 0.0579 |
| Secondary | 0.0183 | 0.0201 | 0.0465 | 0.0504 | 0.0195 | 0.0216 | 0.0947 | 0.0782 | |
| Tertiary | 0.001 | 0.0015 | 0.0534 | 0.0802 | 0.0012 | 0.039 | 0.0076 | 0.0046 | |
| Latent | 0.1237 | 0.0795 | 0.0958 | 0.0561 | 0.1463 | 0.0861 | 0.2627 | 0.1815 | |
| Congenital | 0.0137 | 0.0089 | 0.2054 | 0.1335 | 0.0149 | 0.0097 | 0.0181 | 0.0147 | |