| Literature DB >> 25171152 |
Eliana Jimenez-Soto1, Andrew Hodge1, Kim-Huong Nguyen2, Zoe Dettrick1, Alan D Lopez3.
Abstract
BACKGROUND: Over recent years there has been a strong movement towards the improvement of vital statistics and other types of health data that inform evidence-based policies. Collecting such data is not cost free. To date there is no systematic framework to guide investment decisions on methods of data collection for vital statistics or health information in general. We developed a framework to systematically assess the comparative costs and outcomes/benefits of the various data methods for collecting vital statistics.Entities:
Mesh:
Year: 2014 PMID: 25171152 PMCID: PMC4149535 DOI: 10.1371/journal.pone.0106234
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Systematic assessment of data collection methods (DCMs) – the economics approach.
Data collection methods for vital statistics.
| System Type | DCM | Description |
| Complete or partial registration systems | Civil registration | Births and deaths in the population are continuously recorded. |
| Deaths are recorded with a medically certified COD. | ||
| Yearly statistics are generated based on this data. | ||
| Sample registration | Births and deaths in a representative sample of the population are continuously recorded. | |
| In some systems, deaths in the sample are recorded with MCOD. | ||
| In other cases, deaths in the sample are recorded with a COD assigned using VA. Depending on the method, the resulting data may be grouped either by broad or specific causes of death. | ||
| Yearly statistics are generated based on this data. | ||
| Demographic surveillance sites | Births and deaths in a non-representative sample of the population are continuously recorded. | |
| Deaths in the sample are recorded with a COD assigned using VA. Depending on the method, the resulting data may be grouped either by broad or specific causes of death. | ||
| Yearly statistics are generated based on this data. | ||
| Census and surveys | Population census | All households are queried regarding current occupants, as well as details of recent births and deaths. |
| For a system with full VA; that is, a COD exists for each recorded death, a VA questionnaire is used to assign a COD. | ||
| For a system with partial VA COD, for a representative sample of recorded deaths, a VA questionnaire is used to assign a COD. | ||
| Alternatively COD distribution may be generated through modelling based on age–sex patterns, prevalence of risk factors and intervention coverage. | ||
| Statistics are usually generated every 10 years. | ||
| National-level household survey: direct estimates | A representative sample of households is queried regarding current occupants, as well as details of recent births and deaths. | |
| For a survey using VA COD, for each recorded death a VA questionnaire is used to assign a COD. | ||
| Alternatively, COD distribution may also be generated through modelling based on age–sex patterns, prevalence of risk factors and intervention coverage. Statistics are usually generated every three to five years. | ||
| National-level household survey: indirect estimates | A representative sample of households is queried regarding current occupants, as well as survival status of siblings and/or children. | |
| COD distribution is generated through modelling based on age–sex patterns, prevalence of risk factors and intervention coverage. | ||
| Statistics are usually generated every three to five years. | ||
| Sub-national-level household survey | A sample of households is queried regarding current occupants, as well as details of recent births and deaths. | |
| For each recorded death, a VA questionnaire is used to assign a COD. | ||
| Statistics are usually generated every three to five years. | ||
| Facility-based collection | Facility-based reporting: wide scale | Births and deaths that occur within medical facilities are continuously recorded. |
| Deaths are recorded with MCOD. | ||
| Yearly statistics are generated based on this data. | ||
| Facility-based reporting: sentinel sites | Births and deaths that occur within a representative subset of medical facilities are continuously recorded. | |
| Deaths are recorded with MCOD. | ||
| Yearly statistics are generated based on this data. |
Notes: DMC, data collection method; MCOD, medical certification of death; VA, verbal autopsy; COD, cause-of-death.
Assessment framework for vital statistics.
| Criteria | General vital statistics | Cause-of-death statistics |
|
| ||
|
|
| |
| Coverage | % of population living in areas where vital event recording occurs | % of population living in areas where COD recording occurs |
| Completeness | % of events contributing to fertility/mortality statistics | % of deaths with appropriately certified COD |
| Missing data | % of key variables with response not stated | % of COD reports for which age–sex data are missing |
| Use of ill-defined categories | N/A | % of deaths classified under various miscellaneous and ill-defined categories |
| Improbable classifications | N/A | Number of deaths assigned to improbable age or sex categories per 100 000 coded deaths |
| Consistency between cause of death and general mortality | N/A | % of COD data points deviating more than 2 (or 3) SDs from general mortality-based predictions |
|
| ||
|
|
| |
| Routine tabulations | By sex and five-year age groups, based on place of usual residence. Deaths in children under five years tabulated by 0 and 1–4 year age group. | By sex, and at least by eight broad age groups; namely, 0, 1–4, 5–14, 15–29, 30–44, 45–59, 60–69 and 70+ years |
| Small-area statistics | Number of vital event tabulation areas per million population | Number of COD tabulation areas per million population |
|
| ||
|
|
| |
| Over time | Stability of key definitions over time | Consistency in the proportions of cause-specific mortality over consecutive years |
| Across space | Uniformity of definitions across areas | ICD to certify and code deaths, revision used and code level to which tabulations are published |
|
| ||
|
|
| |
| Production time | Mean time from end of reference period to publication | Mean time from end of reference period to publication |
| Regularity | SD of production time | SD of production time |
|
| ||
|
|
| |
| Media | Number of formats in which data are released | Number of formats in which data are released |
| Metadata | Availability and quality of documentation | Availability and quality of documentation |
| User service | Availability and responsiveness of user service | Availability and responsiveness of user service |
Notes: Adapted from Table 1, in Mahapatra et al. [2] (p. 1654). SD, standard deviation; ICD, International Classification of Diseases; COD, cause-of-death; N/A, not applicable.
Alternative methods for aggregating the quality scores into a single index.
| Methods | Description | Disadvantages | Advantages |
| Unweighted average | Sum of all scores divided by the total number of criteria | Assumes all quality attributes are equally important in all circumstances and all settings | Calculation simplicity |
| Does not require a priori knowledge about the relative importance of different quality criteria | |||
| Aggregation index for individual | |||
| DCM is independent from the rest | |||
| Weighted average – by expert opinion | Weights assigned to each criterion are identified a priori by experts. | The weights are subjective and expert consensus might be difficult to achieve. | Calculation simplicity |
| Reflects the relative importance of different quality criteria, which might vary by setting | |||
| DEA-based aggregation | Weights are assigned via an optimisation problem that includes all DCMs to be evaluated. Used in several applications of multi-criteria decision analysis (MCDA) in a wide range of policy areas, including health. | Requires a large number of observations. If only a handful of DCMs are evaluated, it would require several assessments (i.e. by various experts) of each DCM. | Does not require a priori knowledge about the relative importance of the quality criteria. |
| Data driven | |||
| No systematic bias toward any quality criteria. Might be useful when there are conflicting opinions on the relative importance of each quality criteria |
Notes: DEA, Data Envelopment Analysis; DMC, data collection method.
Stylised Example: Health information systems and their characteristics.
| No. | Information system | Data collection methods for vital statistics used | Collection objectives | Area of coverage | Number of participants |
| 1. | National housing and population census (census) | Population-based census: Long form survey | Demographic, poverty, housing, labour for participation and health indicators | Nationwide | 33 000 000 |
| 2. | National Household Survey (NHS) | Household survey | Income and poverty focus, with some demographic and health indicators | Nationwide | 23 000 |
| 3. | Demographic and Health Survey (DHS) | Household, community and facility based surveys | Mortality, fertility and use of maternal and child health service | Nationwide | 15 000 |
| 4. | Vital registration (CRVS) | Population-based forms | Compulsory birth and death registration | >60% districts | 3 000 000 |
| 5. | Health management system (HMS) | Facility-based forms | Continuous collection of morbidity, mortality, and service coverage | Health facility nationwide | 17 000 000 |
| 6. | Integrated disease surveillance (IDS) | Facility-based forms | Continuous data collection on disease and mortality | Health facility nationwide | 3 000 000 |
| 7. | Demographic surveillance system: Region X (X-DSS) | Population-based census: Mortality surveillance using verbal autopsy | Regular documentation of births, deaths, migrations and socioeconomic information | Some districts in Region X | 66 000 |
| 8. | Demographic surveillance system: Region Y (Y-DSS) | Population-based census: Mortality surveillance using verbal autopsy | Regular documentation of births, deaths, and health service utilisation | Some districts in Region Y | 83 000 |
| 9. | Adult morbidity and mortality project (MM) | Population-based census: Mortality surveillance using verbal autopsy | Regular collection of information on burden of disease and mortality | Some districts across the country | 500 000 |
| 10. | Demographic surveillance system for AIDS (A-DSS) | Population-based census: Mortality surveillance using verbal autopsy; population-based HIV surveillance | HIV surveillance and some mortality data collection within the surveillance site | Villages within a specific region | 23 000 |
Notes: No., number.
Stylised Example: Description of alternative scenarios.
| Assumption | Scenario A | Scenario B |
| Long-term policy of establishing a complete CRVS | Yes | No |
| (a) CRVS included in the list of DCMs to be evaluated | No | Yes |
| (b) Measure of ‘quantity of output’ | Unit records* | Target population |
| (c) ‘Improvement toward CRVS’ included in the list of quality criteria | Yes | No |
| (d) Apportioning rule for costs of CRVS | N/A | 30% |
Notes: * Unit records are approximated using sample size. CRVS, civil registration for vital statistics; DMC, data collection method; N/A, not applicable.
Simulated data for Scenario A.
| Data collection methods | Quality performance using consensus scoring matrix | Composite quality index | Costs | ||||||||||
| Accuracy | Relevance | Consistency | Timeliness | Accessibility | Improvement | Unweighted | DEA-based | Total cost | Apportion rule (%) | Cost for vital statistics | |||
| Consensus matrix | Individual matrix | Consensus matrix | Individual matrix | ||||||||||
| Census | 2.33 | 8.00 | 5.50 | 3.00 | 5.67 | 4.00 | 0.7600 | 0.7786 | 1.0000 | 1.0000 | $8 500 000 | 25 | $2 125 000 |
| NHS | 0.83 | 2.50 | 4.50 | 3.00 | 6.67 | 2.00 | 0.5200 | 0.5401 | 0.8860 | 0.8843 | $840 000 | 5 | $42 000 |
| DHS | 1.00 | 4.00 | 6.00 | 5.00 | 7.67 | 2.00 | 0.6844 | 0.7195 | 1.0000 | 1.0000 | $870 000 | 10 | $87 000 |
| HMS | 4.83 | 2.50 | 5.00 | 7.00 | 3.67 | 8.00 | 0.8267 | 0.8760 | 1.0000 | 1.0000 | $2 200 000 | 20 | $440 000 |
| IDS | 3.17 | 1.00 | 5.50 | 9.00 | 4.00 | 6.00 | 0.7644 | 0.7137 | 1.0000 | 0.9773 | $4 500 000 | 15 | $675 000 |
| X-DSS | 5.00 | 6.50 | 4.50 | 7.50 | 4.67 | 7.00 | 0.9378 | 0.9122 | 1.0000 | 0.9958 | $200 000 | 40 | $80 000 |
| Y-DSS | 6.00 | 6.50 | 4.50 | 8.50 | 5.00 | 7.00 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | $220 000 | 40 | $88 000 |
| MM | 4.50 | 5.00 | 5.50 | 7.50 | 4.00 | 6.00 | 0.8667 | 0.8760 | 1.0000 | 0.9932 | $100 000 | 60 | $60 000 |
| A-DSS | 5.00 | 4.50 | 5.50 | 8.50 | 5.00 | 5.00 | 0.8933 | 0.9561 | 1.0000 | 1.0000 | $15 000 | 30 | $4 500 |
Notes: See Table 4 for definitions and further details on the DCMs. DMC, data collection method; DEA, Data Envelopment Analysis.
Simulated data for Scenario B.
| Data collection methods | Quality performance using consensus scoring matrix | Composite quality index | Costs | ||||||||||
| Target population | Accuracy | Relevance | Consistency | Timeliness | Accessibility | Unweighted | DEA-based | Total cost | Apportion rule (%) | Cost for vital statistics | |||
| Consensus matrix | Individual matrix | Consensus matrix | Individual matrix | ||||||||||
| Census | 33 000 000 | 2.33 | 8.00 | 5.50 | 3.00 | 5.67 | 0.8033 | 0.7776 | 1.0000 | 1.0000 | $8 500 000 | 25 | $2 125 000 |
| NHS | 33 000 000 | 0.83 | 2.50 | 4.50 | 3.00 | 6.67 | 0.5738 | 0.5471 | 0.8700 | 0.8682 | $840 000 | 5 | $42 000 |
| DHS | 33 000 000 | 1.00 | 4.00 | 6.00 | 5.00 | 7.67 | 0.7760 | 0.7335 | 1.0000 | 1.0000 | $870 000 | 10 | $87 000 |
| CRVS | 20 500 000 | 0.33 | 0.50 | 4.50 | 5.00 | 0.67 | 0.3607 | 0.3407 | 0.7740 | 0.8252 | $750 000 | 30 | $225 000 |
| HMS | 33 000 000 | 4.83 | 2.50 | 5.00 | 7.00 | 3.67 | 0.7541 | 0.8677 | 0.9360 | 0.9843 | $2 200 000 | 20 | $440 000 |
| IDS | 6 000 000 | 3.17 | 1.00 | 5.50 | 9.00 | 4.00 | 0.7432 | 0.7114 | 1.0000 | 0.9750 | $4 500 000 | 15 | $675 000 |
| X-DSS | 510 000 | 5.00 | 6.50 | 4.50 | 7.50 | 4.67 | 0.9235 | 0.9178 | 0.9700 | 0.9888 | $200 000 | 40 | $80 000 |
| Y-DSS | 180 000 | 6.00 | 6.50 | 4.50 | 8.50 | 5.00 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | $220 000 | 40 | $88 000 |
| MM | 2 600 000 | 4.50 | 5.00 | 5.50 | 7.50 | 4.00 | 0.8689 | 0.8778 | 0.9980 | 0.9863 | $100 000 | 60 | $60 000 |
| A-DSS | 23 000 | 5.00 | 4.50 | 5.50 | 8.50 | 5.00 | 0.9344 | 0.9639 | 1.0000 | 1.0000 | $15 000 | 30 | $4 500 |
Notes: See Table 4 for definitions and further details on the DCMs. DMC, data collection method; DEA, Data Envelopment Analysis.
Results for Scenario A.
| DCM | Cost Effectiveness Analysis | Efficiency analysis | |||||||
| Cost per QADI: | Cost per QADI: | Cost-efficiency index | |||||||
| Unweighted quality index | DEA-based quality index | ||||||||
| Consensus matrix | Individual matrix | Rank | Consensus matrix | Individual matrix | Rank | Consensus matrix | Individual matrix | Rank | |
| Census | 0.0847 | 0.0827 | 2 | 0.0644 | 0.0644 | 2 | 1.0000 | 0.9667 | 2 |
| NHS | 3.5117 | 3.3812 | 8 | 2.0610 | 2.0649 | 8 | 0.0280 | 0.0248 | 8 |
| DHS | 8.4740 | 8.0615 | 9 | 5.8000 | 5.8000 | 9 | 0.0100 | 0.0102 | 9 |
| HMS | 0.0313 | 0.0295 | 1 | 0.0259 | 0.0259 | 1 | 1.0000 | 1.0000 | 1 |
| IDS | 0.2943 | 0.3152 | 5 | 0.2250 | 0.2302 | 5 | 0.1630 | 0.1585 | 5 |
| X-DSS | 1.2925 | 1.3288 | 7 | 1.2121 | 1.2172 | 7 | 0.0490 | 0.0485 | 7 |
| Y-DSS | 1.0602 | 1.0602 | 6 | 1.0602 | 1.0602 | 6 | 0.0570 | 0.0563 | 6 |
| MM | 0.1385 | 0.1370 | 3 | 0.1200 | 0.1208 | 3 | 0.4040 | 0.3817 | 3 |
| A-DSS | 0.2190 | 0.2046 | 4 | 0.1957 | 0.1957 | 4 | 0.2360 | 0.2410 | 4 |
Notes: See Table 4 for definitions and further details on the DCMs. DMC, data collection method; QADI, quality-adjusted data index; DEA, Data Envelopment Analysis.
Results for Scenario B.
| DCMs | Cost Effectiveness Analysis | Efficiency analysis | |||||||
| Cost per QADI: | Cost per QADI: | Cost efficiency index | |||||||
| unweighted quality index | DEA-based quality index | ||||||||
| Consensus matrix | Individual matrix | Rank | Consensus matrix | Individual matrix | Rank | Consensus matrix | Individual matrix | Rank | |
| Census | 0.0802 | 0.0828 | 6 | 0.0644 | 0.0644 | 6 | 0.0630 | 0.0580 | 5 |
| NHS | 0.0022 | 0.0023 | 1 | 0.0015 | 0.0015 | 1 | 1.0000 | 1.0000 | 1 |
| DHS | 0.0034 | 0.0036 | 2 | 0.0026 | 0.0026 | 2 | 0.8050 | 0.9113 | 2 |
| CRVS | 0.0312 | 0.0323 | 5 | 0.0145 | 0.0136 | 4 | 0.1930 | 0.1738 | 6 |
| HMIS | 0.0177 | 0.0154 | 3 | 0.0142 | 0.0135 | 3 | 0.5540 | 0.4513 | 3 |
| IDS | 0.1514 | 0.1574 | 7 | 0.1125 | 0.1154 | 7 | 0.0430 | 0.0320 | 7 |
| X-DSS | 0.1682 | 0.1690 | 8 | 0.1601 | 0.1571 | 8 | 0.0490 | 0.0353 | 8 |
| Y-DSS | 0.4835 | 0.4835 | 10 | 0.4835 | 0.4835 | 10 | 0.0190 | 0.0128 | 10 |
| MM | 0.0266 | 0.0266 | 4 | 0.0231 | 0.0234 | 5 | 0.2940 | 0.2213 | 4 |
| A-DSS | 0.2094 | 0.2030 | 9 | 0.1957 | 0.1957 | 9 | 0.0390 | 0.0307 | 9 |
Notes: See Table 4 for definitions and further details on the DCMs. DMC, data collection method; QADI, quality-adjusted data index; DEA, Data Envelopment Analysis.