| Literature DB >> 34855847 |
Abstract
An unbiased, widely accepted estimate of the rate of occurrence of new cases of autism over time would facilitate progress in understanding the causes of autism. The same may also apply to other disorders. While incidence is a widely used measure of occurrence, birth prevalence-the proportion of each birth year cohort with the disorder-is the appropriate measure for disorders and diseases of early childhood. Studies of autism epidemiology commonly speculate that estimates showing strong increases in rate of autism cases result from an increase in diagnosis rates rather than a true increase in cases. Unfortunately, current methods are not sufficient to provide a definitive resolution to this controversy. Prominent experts have written that it is virtually impossible to solve. This paper presents a novel method, time-to-event birth prevalence estimation (TTEPE), to provide accurate estimates of birth prevalence properly adjusted for changing diagnostic factors. It addresses the shortcomings of prior methods. TTEPE is based on well-known time-to-event (survival) analysis techniques. A discrete survival process models the rates of incident diagnoses by birth year and age. Diagnostic factors drive the probability of diagnosis as a function of the year of diagnosis. TTEPE models changes in diagnostic criteria, which can modify the effective birth prevalence when new criteria take effect. TTEPE incorporates the development of diagnosable symptoms with age. General-purpose optimization software estimates all parameters, forming a non-linear regression. The paper specifies all assumptions underlying the analysis and explores potential deviations from assumptions and optional additional analyses. A simulation study shows that TTEPE produces accurate parameter estimates, including trends in both birth prevalence and the probability of diagnosis in the presence of sampling effects from finite populations. TTEPE provides high power to resolve small differences in parameter values by utilizing all available data points.Entities:
Mesh:
Year: 2021 PMID: 34855847 PMCID: PMC8638887 DOI: 10.1371/journal.pone.0260738
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Example of a survival process for two values of probability of diagnosis PD.
The green lines S denote survival, the blue lines D denote the rate of diagnoses, and the red lines CI denote cumulative incidence. The solid lines represent PD = 0.1, and the dashed lines represent PD = 0.25.
Fig 2Directed Acyclic Graph (DAG) showing the effects of birth year, diagnostic year and age on rates of diagnoses.
Fig 3Example where observed diagnosis rates represent two unidentified subgroups with different values of probability of diagnosis PD.
The red and green lines represent rates of diagnosis of the two subgroups. Group 1 (red line) has greater PD than group 2 (green line). The solid black line shows the aggregate diagnosis rates. The dotted line shows the exponential fit to the aggregate diagnosis rates. The age of complete eligibility AE = 3 in this example.
Simulation results of parameter optimization using real-valued proportions with no sampling.
| True Parameters | Bias | Bias | Bias | Bias | |
|---|---|---|---|---|---|
|
|
| ||||
| 0.1 | 0 | 5.9E-12 | 8.9E-11 | -5.5E-10 | -1.8E-10 |
| 0.08 | 0.02 | 0 | 0 | -2.8E-17 | -1.4E-17 |
| 0.06 | 0.04 | -1.7E-18 | 0 | 1.1E-16 | 1.4E-17 |
| 0.04 | 0.06 | 3.5E-18 | 1.4E-17 | -5.6E-17 | -6.9E-18 |
| 0.02 | 0.08 | 0 | 3.5E-18 | 5.6E-17 | -1.4E-17 |
| 0 | 0.1 | -4.7E-11 | -6.6E-10 | 2.2E-9 | 7.7E-10 |
BP = 0.01 at the final BY, PD = 0.25 at the final DY, AE* = AE = 3, M = 10, and there are 20 successive cohorts.
β, β are exponential coefficients for birth prevalence and probability of diagnosis, respectively.
BP, birth prevalence; BY, birth year; DY, diagnostic year.
Simulation results of parameter optimization using Monte Carlo binomial sampling, 1000 iterations.
| True Parameters |
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
| Bias | SE | Bias | SE | Bias | SE | Bias | SE |
| 0.1 | 0 | -2.0E-6 | 1.0E-4 | -2.0E-5 | 0.0013 | 3.3E-5 | 0.0070 | -5.4E-6 | 0.0019 |
| 0.08 | 0.02 | -2.6E-6 | 1.1E-4 | -3.2E-5 | 0.0012 | 1.5E-4 | 0.0072 | 4.4E-5 | 0.0019 |
| 0.06 | 0.04 | 7.8E-6 | 1.2E-4 | 2.7E-5 | 0.0013 | -4.4E-4 | 0.0079 | -1.2E-4 | 0.0021 |
| 0.04 | 0.06 | 1.3E-5 | 1.5E-4 | 6.5E-5 | 0.0015 | -5.8E-4 | 0.0085 | -1.6E-4 | 0.0022 |
| 0.02 | 0.08 | -2.0E-6 | 1.6E-4 | -9.8E-6 | 0.0016 | 4.5E-4 | 0.0086 | 9.4E-5 | 0.0023 |
| 0 | 0.1 | 4.5E-6 | 1.8E-4 | 7.1E-6 | 0.0017 | 2.0E-4 | 0.0094 | 2.7E-5 | 0.0023 |
BP = 0.01 at the final BY, PD = 0.25 at the final DY, AE* = AE = 3, M = 10, and there are 20 successive cohorts. Population of each cohort = 500,000.
β, β are exponential coefficients for birth prevalence and probability of diagnosis, respectively.
BP, birth prevalence; BY, birth year; DY, diagnostic year.
Comparison of the effect of the choice of assumed AE* vs. true value of AE = 3, with one homogeneous group of cases.
| Bias | Bias | Bias | Bias | |
|---|---|---|---|---|
| 2 | 0.002 | -0.019 | -0.0096 | 0.036 |
| 3 | 5.9E-12 | 8.9E-11 | -5.5E-10 | -1.8E-10 |
| 4 | -4.4E-12 | -6.6E-11 | 4.8E-10 | 1.64E-10 |
| 5 | 1.5E-11 | 2.2E-10 | -1.85E-9 | -7.18E-10 |
AE, age of complete eligibility. True values: βBP = 0.1, βPD = 0, P = 0.01 at the final BY, PD = 0.25 at the final DY, AE = 3. Maximum age M = 10. Twenty successive cohorts. Probability of diagnosis PD is consistent across cases at each DY. Simulation uses real values, no random sampling.
Comparison of the effect of the choice of assumed AE* vs. true value of AE = 3, with two unidentified subgroups with different values of probability of diagnosis, mismatched to analysis.
| Bias | Bias | Bias | Bias | |
|---|---|---|---|---|
| 3 | -0.00043 | 0.001 | 0.0018 | -0.002 |
| 4 | -0.00038 | 0.00061 | -0.004 | -0.0016 |
| 5 | -0.00033 | 0.00035 | -0.0097 | -0.0011 |
AE, age of complete eligibility. True values: βBP = 0.1, βPD = 0, P = 0.01 at the final BY, PD = 0.25 at the final DY, AE = 3. Two equal-sized groups of cases where one group’s probability of diagnosis PD is twice that of the other, while the estimation assumes one homogeneous group. Maximum age M = 10. Twenty successive cohorts. Simulation uses real values, no sampling.