| Literature DB >> 36042373 |
Meifang Li1, Xun Shi2, Jiang Gui3, Chao Song4, Angeline S Andrew5, Erik P Pioro6, Elijah W Stommel5, Maeve Tischbein5, Walter G Bradley7.
Abstract
We developed a disease registry to collect all incident amyotrophic lateral sclerosis (ALS) cases diagnosed during 2016-2018 in Ohio. Due to incomplete case ascertainment and limitations of the traditional capture-recapture method, we proposed a new method to estimate the number of cases not recruited by the Registry and their spatial distribution. Specifically, we employed three statistical methods to identify reference counties with normal case-population relationships to build a Poisson regression model for estimating case counts in target counties that potentially have unrecruited cases. Then, we conducted spatial smoothing to adjust outliers locally. We validated the estimates with ALS mortality data. We estimated that 119 total cases (95% CI [109, 130]) were not recruited, including 36 females (95% CI [31, 41]) and 83 males (95% CI [74, 99]), and were distributed unevenly across the state. For target counties, including estimated unrecruited cases increased the correlation between the case count and mortality count from r = 0.8494 to 0.9585 for the total, from 0.7573 to 0.8270 for females, and from 0.6862 to 0.9292 for males. The advantage of this method in the spatial perspective makes it an alternative to capture-recapture for estimating cases missed by disease registries.Entities:
Mesh:
Year: 2022 PMID: 36042373 PMCID: PMC9428141 DOI: 10.1038/s41598-022-18944-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Statistically estimated number of unrecruited ALS cases in Ohio during 2016–2018.
| Sex | Number of recruited cases | Number of expected cases | Estimated number of unrecruited cases | 95% CI of the estimated number of unrecruited cases |
|---|---|---|---|---|
| Female | 128 | 163.08 | 35.08 | [30.34, 39.82] |
| Male | 155 | 236.20 | 81.20 | [71.88, 96.58] |
| Total | 283 | 399.28 | 116.28 | [105.85, 126.81] |
The output from the Poisson regression model contains decimal digits.
Final estimated cases unrecruited by the Ohio ALS Registry during 2016–2018.
| Sex | Number of recruited cases in the | Number of recruited cases in all 88 counties | Estimated number of unrecruited cases | Rounded integer | % in the total number of cases in the | % in the total number of cases in all 88 counites |
|---|---|---|---|---|---|---|
| Female | 10 | 128 | 35.54 | 36 | 78.26 | 21.95 |
| Male | 36 | 155 | 82.81 | 83 | 69.75 | 34.87 |
| Total | 46 | 283 | 118.35 | 119 | 72.12 | 29.60 |
“% in the total number of cases in the target counties” and “% in the total number of cases in all 88 counties” were calculated with the rounded integer number of unrecruited cases divided by the number of expected cases, where the number of expected cases equals to the sum of the number of recruited cases and unrecruited cases.
Figure 1Estimated numbers of unrecruited ALS cases in individual Ohio counties.
Figure 2The incidence rate of ALS in Ohio. (a) Before including the estimated unrecruited cases; (b) after including the estimated unrecruited cases.
Pearson’s correlation coefficients (r) between county-level ALS case count and ALS mortality count in Ohio.
| Sex | Non-target counties | All 88 counties without estimated unrecruited cases | All 88 counties with statistically estimated unrecruited cases | All 88 counties with spatially adjusted unrecruited cases | Target counties without estimated unrecruited cases | Target counties with statistically estimated unrecruited cases | Target counties with spatially adjusted unrecruited cases |
|---|---|---|---|---|---|---|---|
| Female | 0.9114 | 0.8644 | 0.9022 | 0.9025 | 0.7573 | 0.8256 | 0.8270 |
| Male | 0.9395 | 0.8202 | 0.9341 | 0.9325 | 0.6862 | 0.9320 | 0.9292 |
| Total | 0.9666 | 0.9067 | 0.9668 | 0.9648 | 0.8494 | 0.9621 | 0.9585 |
Figure 3Sites are invited to participate in the Ohio ALS Registry. Participating sites are institutions and neurologists that have contributed cases to the Registry; Declining sites are those that have not.
ALS incidence rate ranges for determining reference counties in Ohio, as defined by the z-score.
| Sex | Mean | Std. dev | Incidence rate range | ||
|---|---|---|---|---|---|
| |z|< 0.25 | |z|< 0.5 | |z|< 0.75 | |||
| Female | 3.1345 | 4.3187 | 2.7928–4.1869 | 1.5490–5.0802 | 0–6.3371 |
| Male | 5.2285 | 6.3039 | 3.6893–6.4969 | 2.3656–8.3850 | 0.5062–9.9020 |
Figure 4Plots of the ranked ALS incidence and the difference and ratio between two consecutive values. All plots are for the counties with non-zero incidence rates in Ohio.
Sections of the ranked ALS incidence rates in Ohio defined by Jenk’s natural breaks.
| Sex | Section 1 | Section 2 | Section 3 | Section 4 | Section 5 |
|---|---|---|---|---|---|
| Female | 0–1.7920 | 2.7928–5.0802 | 5.8331–10.3416 | 12.0337–22.4039 | N/A |
| Male | 0–0.5062 | 2.3656–5.0181 | 5.8634–8.3850 | 8.9791–14.7384 | 18.4417–26.5113 |
For females, the specified number of sections = 4, and for males, the number = 5. N/A = not applicable.
The final determining ranges, in terms of the incidence rate, for identifying reference counties.
| Sex | Range of incidence rate for identifying | Number of identified |
|---|---|---|
| Female | 2.7928–4.5895 | 15 |
| Male | 5.8634–8.3850 | 15 |
Poisson regression models of county-level relationships between case count and population, built for reference counties.
| Female | Male | ||
|---|---|---|---|
| ln(female population) | Coefficient | 1.0525 | 0.9524 |
| Std. error | 0.1526 | 0.1208 | |
| Z value | 6.8950 | 7.8850 | |
| Pr( >|z|) | 0.0000 | 0.0000 | |
| Evaluation | Null deviance | 42.2857 on 14 df | 70.3104 on 14 df |
| Residual deviance | 0.7952 on 13 df | 0.4607 on 13 df | |
| AIC | 42.233 | 46.901 |
df degrees of freedom.
Figure 5Spatial smoothing for adjusting the statistically estimated case counts in hotspot and cold spot areas.