Literature DB >> 34123367

Using DHS and MICS data to complement or replace NGO baseline health data: an exploratory study.

Peter R Berti1, Milena Nardocci2, Minh Hung Tran3, Malek Batal2, Rebecca Brodmann1, Nicolas Greliche4, Naomi M Saville5.   

Abstract

Background: Non-government organizations (NGOs) spend substantial time and resources collecting baseline data in order to plan and implement health interventions with marginalized populations. Typically interviews with households, often mothers, take over an hour, placing a burden on the respondents. Meanwhile, estimates of numerous health and social indicators in many countries already exist in publicly available datasets, such as the Demographic and Health Surveys (DHS) and the Multiple Indicator Cluster Surveys (MICS), and it is worth considering whether these could serve as estimates of baseline conditions. The objective of this study was to compare indicator estimates from non-governmental organizations (NGO) health projects' baseline reports with estimates calculated using the Demographic and Health Surveys (DHS) or the Multiple Indicator Cluster Surveys (MICS), matching for location, year, and season of data collection.
Methods: We extracted estimates of 129 indicators from 46 NGO baseline reports, 25 DHS datasets and three MICS datasets, generating 1,996 pairs of matched DHS/MICS and NGO indicators. We subtracted NGO from DHS/MICS estimates to yield difference and absolute difference, exploring differences by indicator. We partitioned variance of the differences by geographical level, year, and season using ANOVA.
Results: Differences between NGO and DHS/MICS estimates were large for many indicators but 33% fell within 5% of one another. Differences were smaller for indicators with prevalence <15% or >85%. Difference between estimates increased with increasing year and geographical level differences. However, <1% of the variance of the differences was explained by year, geographical level, and season. Conclusions: There are situations where publicly available data could complement NGO baseline survey data, most importantly when the NGO has tolerance for estimates of low or unknown accuracy. Copyright:
© 2021 Berti PR et al.

Entities:  

Keywords:  DHS; MICS; maternal and child health; surveys

Mesh:

Year:  2021        PMID: 34123367      PMCID: PMC8145218          DOI: 10.12688/f1000research.47618.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


Introduction

Non-government and civil society organizations spend substantial time and resources collecting baseline data in order to plan and implement health interventions with marginalized populations, and to measure the impact of those interventions ( Data for Impact, 2019). Typical methods involve baseline and endline household surveys, where the household residents are interviewed and asked a hundred or more questions about asset ownership, mother and child health, diet, health system access, and other topics of interest. The costs of these surveys vary depending on design, methods, sample size, survey length, and local context ( Data for Impact, 2019), but in the authors’ experience tens of thousands of dollars is typical, and in some cases, much more. Depending on the number and nature of questions, interviews can be over an hour long, placing a burden on the respondents. In addition, the accuracy of the indicator estimates in NGO-led surveys may be insufficient for project design and monitoring purposes, due to relatively small sample sizes and the inherent high variability of the indicators of interest. Meanwhile, estimates of numerous health and social indicators in many countries already exist in publicly available datasets, such as the Demographic and Health Surveys (DHS), supported by USAID ( U.S. Agency for International Development, 2018), and the Multiple Indicator Cluster Surveys (MICS), supported by UNICEF ( UNICEF, 2020), and it is worth considering whether these could serve as estimates of baseline conditions. DHS/MICS provide standardized data collected using rigorous methods and large sample sizes, and datasets are available on request for free. They are designed to be representative at the national, regional and provincial level (but rarely at lower levels, such as district and village, where NGOs are working), and probably exclude homeless, institutionalized and nomadic populations ( Carr-Hill, 2013). DHS/MICS are collected every three to ten years so there may up to ten-years gap between DHS/MICS data collection and the baseline conditions that the NGO wants characterized. Although some indicators’ descriptions have been modified and improved over time, caution is taken to ensure that data are directly comparable across countries, regions and years ( Hancioglu & Arnold, 2013; UNICEF, 2020; U.S. Agency for International Development, 2018). DHS/MICS surveys are adapted to specific country needs and are conducted by well-trained interviewers who have access to tools and guidelines for quality assurance throughout ( UNICEF, 2020; U.S. Agency for International Development, 2018). Using publicly available data to complement or replace NGOs’ primary data collection for project baseline measures and project monitoring would save valuable resources, reducing the burden on data collectors and respondents alike. A few studies have compared estimates between DHS/MICS and NGO surveys. One found that they provided very different estimates of electricity and water access in Kenya, Tanzania, and Uganda ( Carr-Hill, 2017), and a second found that DHS and a NGO-led survey provided similar estimates of several maternal and child health estimates in Rwanda ( Langston ). Other studies found that estimates of the market share of faith-based health care providers by DHS and NGO surveys in sub-Saharan Africa were within 5 to 50% of each other ( Wodon ), and the confidence intervals for the difference between Lot Quality Assurance Sampling (LQAS) and DHS district-level estimates were within +/-10% for 15 of 37 health indicators ( Anoke ). Therefore, no consensus exists on the potential for DHS/MICS to substitute NGO surveys. We hypothesized that publicly available data can provide estimates of baseline conditions similar to those reported in NGO baseline reports when matched as closely as possible for location, year, and season of data collection. We tested this hypothesis by comparing indicator estimates from NGO reports with estimates calculated using DHS/MICS.

Methods

Data from NGO baseline reports

We collected and retained a sample of 46 NGO baseline reports through a combination of internet search and personal contacts with Canadian and Vietnamese NGOs using the following selection criteria: household survey (n>100) which used valid methods and representative sampling to generate point estimates of maternal, newborn and child health indicators; conduced between 2005 and 2019; in a low- or middle-income country. The baseline reports from NGOs working on maternal, newborn and child health covered 23 countries spanning South Asia (Bangladesh, India, Pakistan), Africa (Burkina Faso, Ethiopia, Ghana, Kenya, Liberia, Malawi, Mali, Mozambique, Nigeria, Senegal, South Sudan, Tanzania, Zambia), South/Central America (Bolivia, Honduras), the Caribbean (Haiti), and SE Asia (Laos, Myanmar, Philippines, Vietnam) ( Table 1) ( Berti, 2021). From the reports, we extracted: country name, NGO name, dates of data collection, population of study, inclusion/exclusion criteria, indicator name and definition, sample size (total and n for each indicator), and the indicator estimate (percentage and standard deviation (SD) if available).
Table 1.

NGOs’ baseline report and matched data from DHS/MICS.

NGODHS/MICS
CountrySourceYearSample sizeGeographical locationLevel * SourceYearSample sizeGeographical locationLevel *
BangladeshA&T20104,400Divisions of Dhaka, Chittagong, Rajshahi, Khulna, Barisal, and Sylhet3rdDHS20074,923Divisions of Dhaka, Chittagong, Rajshahi, Khulna, Barisal, and Sylhet3rd
BangladeshPLAN (BORN)2016900Upazilas of Pirgachha, Pirganj, Mithapukur, Kaunia and Gangachara (rural area only) and district of Rangpur1st, 2ndDHS2014265Division of Rangpur (rural area only)3rd
BangladeshNIMS2017963Divisions of Dhaka, Chittagong, Khulna, Rajshahi, Sylhet, Barisal3rdDHS2014409Divisions of Dhaka, Chittagong, Khulna, Rajshahi, Sylhet, Barisal3rd
BangladeshPLAN (SHOW)2016864Districts of Barisal, Chittagong and Rangpur2ndDHS20141,314Divisions of Barisal, Chittagong and Rangpur3rd
BangladeshWV201833,600National and by districts (Barisal, Pirojpur, Bandarban, Chittagong, Comilla, Dhaka, Gazipur, Gopalganj, Tangail, Bagerhat, Satkhira, Mymensingh, Netrakona, Sherpur, Naogaon, Rajshahi, Dinajpur, Nilphamari, Rangpur, Thakurgaon, Sunamganj, Sylhet)2nd, 5thDHS20144,494National and by divisions (Barisal, Chittagong, Dhaka, Khulna, Rajshahi, Rangpur, Sylhet)3rd, 5th
BangladeshWV (ENRICH)20161,323Districts of Thakurgaon and Panchagarh2ndDHS2014550Division of Rangpur3rd
BoliviaPLAN2019214Regions of Chuquisaca, La Paz, Cochabamba, and Potosí4thDHS2008867Regions of Chuquisaca, La Paz, Cochabamba, and Potosí4th
Burkina FasoWUSC20161,005Regions North, Central-West and East4thDHS20102,709Regions North, Central-West and East4th
EthiopiaA&T20103,000Regions of Tigray and SNNP4thDHS20051,800Regions of Tigray and SNNP4th
EthiopiaPLAN (BORN)2017905Zones of North Gondar, South Gondar and West Gojjam and region of Amhara3rd, 4thDHS2016369Region of Amhara4th
EthiopiaCARE20161,261Zones of East and West Hararghe and region of Afar3rd, 4thDHS20161,630Regions of Oromia and Afar4th
EthiopiaNIMS2017440Regions of Amhara, Tigray, Oromia, Benishangul-Gumuz, and SNNP4thDHS2016508Regions of Amhara, Tigray, Oromia, Benishangul-Gumuz, and SNNP4th
EthiopiaPLAN2018537Regions of Amhara and SNNP4thDHS20161,651Regions of Amhara and SNNP4th
GhanaPLAN (SHOW)2014831Intervention/control districts in the regions of Eastern, Northern, and Volta2ndDHS2014775Regions of Eastern, Northern, and Volta4th
HaitiPLAN (SHOW)2016860Communes of Fort-Liberté, Ouanaminte, and Trou-du-Nord2ndDHS2012237Department of North-east3rd
HondurasRed Cross2007300Departments of Copán and Santa Bárbara3rdDHS2005/06524Departments of Copán and Santa Bárbara3rd
IndiaEficor2012300District of Pakur2ndDHS2005/06620State of Jharkand3rd
IndiaIntraHealth201014,090District of Pakur and Uttar Pradesh2ndDHS2005/061,649States of Jharkand and Uttar Pradesh3rd
KenyaNIMS20173,941Provinces of Rift Valley, Western, Nyanza, Eastern, Coast3rdDHS201412,011Provinces of Rift Valley, Western, Nyanza, Eastern, Coast3rd
KenyaRed Cross2012154Districts of East Pokot, Central Pokot, and East Marakwet2ndDHS2008/09694Province of Rift Valley3rd
KenyaWV (ENRICH)20161,274Counties of Elgeyo Marakwet and Baringo (subdivision of the before called Rift Valley province)2ndDHS20144,760Province of Rift Valley3rd
LaosNIOPH2018115Province of Vientiane3rdMICS2016/173,560Region North4th
LaosThe World Bank20167,355Provinces of Phongsaly, Oudomxay, Houaphan, Xaiyabouly, Borlikhamxay3rdMICS2016.57,131Region North4th
LiberiaRed Cross2012783Counties of Bomi, Gbarpolu, and Grand Gedeh3rdDHS2013848Counties of Bomi, Gbarpolu, and Grand Gedeh3rd
MalawiCARE2017708Traditional authorities of Kasakula, Kalumo, Dzoole, Kayembe and districts of Ntchisi and Dowa1st, 3rdDHS2015/16925Districts of Ntchisi and Dowa3rd
MaliPLAN (BORN)2017907Region of Sikasso4thDHS2012/13714Region of Sikasso4th
MozambiqueCARE20171,262Districts of Funhalouro and Homoine and province of Inhambane2nd, 3rdDHS2011570Province of Inhambane3rd
MozambiquePLAN20195,921Districts of Moma, Mogovolas, Nampula, Eráti, Memba, and Nacala Porto2ndDHS2011358Province of Nampula3rd
MyanmarWV2016831Village of Thabaung1stDHS2015/16275Region of Ayeyarwaddy4th
NigeriaPLAN (BORN)20161,658Local Government Areas of Bauchi, Dass, Katagum, Misau, Ningi, Alkaleri, Bogoro, Ganjuwa, Giade, Shira and state of Bauchi2nd, 3rdDHS2013577State of Bauchi3rd
NigeriaNIMS2018/19510States of Kebbi and Sokoto3rdDHS20181,525States of Kebbi and Sokoto3rd
NigeriaPLAN (SHOW)20161,770Intervention and control districts in the states of Sokoto and Zamfara2ndDHS20131,096States of Sokoto and Zamfara3rd
PakistanNIMS20171,620Cities of Lodhran, Rajanpur, Jamshoro and Swabi2nd, 3rdDHS2012.52,636Provinces of Punjab, Sindh, and Khyber Pakhtunkhwa3rd
PakistanRed Cross20121,166Districts of Battagram and Swat and province of Khyber Pakhtunkhwa2nd, 3rdDHS2012/131,532Province of Khyber Pakhtunkhwa3rd
PakistanWV2017942District of Sukkur2ndDHS2012/131,591Province of Sukkur3rd
PhilippinesNIMS20181,418Provinces of Camarines Norte, Masbate, Antique, Iloilo, Cebu, Bohol, and Zamboanga del Norte3rdDHS2017352Provinces of Camarines Norte, Masbate, Antique, Iloilo, Cebu, Bohol, and Zamboanga del Norte3rd
SenegalPLAN (SHOW)2016828Intervention/control districts in the regions of Dakar, Ziguinchor, Tambacounda, Kaolack, Louga, Kedougou and Sedhiou2ndDHS2010/112,307Regions of Dakar, Ziguinchor, Tambacounda, Kaolack, Louga, Kedougou and Sedhiou4th
South SudanCMMB2015500County of Nzarai1stMICS2010770State of Western Equatoria3rd
TanzaniaNIMS2017215Regions of Mwanza and Simiyu4thDHS2015/16408Regions of Mwanza and Simiyu4th
TanzaniaPLAN20173,207Region of Mbeya, and districts of Sumbawanga DC, Sumbawanga MC, Nkasi DC, and Kalambo DC (in the region of Rukwa)2nd, 4thDHS2015/16282Regions of Mbeya and Rukwa4th
TanzaniaWV20171,476Region of Kigoma4thDHS2015/16245Region of Kigoma4th
TanzaniaWV (ENRICH)20161,399Districts of Itigi, Manyoni, Ikungi, Kahma, Shinyanga, Kishapu and Ushetu2ndDHS2015/16556Regions of Shinyanga and Singida4th
VietnamA&T20114,029Regions of North Central and Central Coastal area, Northern Midlands - Mountainous area, Central Highlands, Mekong River Delta4thMICS2010/117,140Regions of North Central and Central Coastal area, Northern Midlands - Mountainous area, Central Highlands, and Mekong River Delta4th
VietnamCARE2015594Districts of Bao Lac, Tu Mo Rong, Que Phong and provinces of Nghe An, Cao Bang, and Kon Tum2nd, 3rdMICS2013/144,095Regions of North Central and Central Coastal area, Northern Midlands - Mountainous area, and Central Highlands4th
VietnamOxfam20141,982Districts of Da Bac, Hoa Binh, Binh Gia, Lang Son, Phu Cu, and Hung Yen, and provinces of Hoa Binh, Hung Yen, and Lang Son2nd, 3rdMICS2013/14573Regions of Northern Midlands - Mountainous area, and Red River Delta4th
ZambiaCARE2016735Towns of Mpika and Shiwang'andu2ndDHS2013/14854Province of Muchinga3rd

1st level represents village, town, locality or traditional authority; 2nd level: district or equivalent; 3rd level: province, state or equivalent; 4th level: region; 5th level: country.

DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

1st level represents village, town, locality or traditional authority; 2nd level: district or equivalent; 3rd level: province, state or equivalent; 4th level: region; 5th level: country. DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization. We also retained the location of data collection (e.g. country, region, province, district, or/and village) and geographical level. These geographical levels of data aggregation were defined as: (1) the smallest geographical subdivision in a country (village, town, locality, traditional authority); (2) district or district council (larger than a village but smaller than the third level); (3) province, state, department, county or district (if it refers to a division equivalent to province or state); (4) region (combining several units of level 3); (5) country level.

Data from DHS and MICS surveys

We matched 25 DHS and 3 MICS surveys (from Vietnam, Laos, and South Sudan) with 46 NGO baseline reports ( Table 1). We used the most recent DHS/MICS survey carried out prior to the NGO baseline survey, with some surveys matching more than one NGO survey. Indicators from DHS/MICS were calculated following the methods recommended by DHS/MICS accounting for weighting and sample selection ( Croft ). Wherever possible, we used the methods employed by the NGO to create the matching DHS/MICS indicator. For instance, if the NGO baseline survey included women of reproductive age and their children aged 0-24 months living in the district of Homoine in Mozambique, we extracted the same sample from the DHS/MICS. In the absence of representative data from the same geographical level, we used DHS/MICS data from the next level up in the geopolitical hierarchy to match the lower level from the NGO. For instance, if data from the district of Homoine were not available in the DHS, we used data from the province of Inhambane (one level up).

Indicators retained for analysis

We matched similar indicators from NGO baseline reports with DHS/MICS wherever available and excluded those that had no match in the DHS/MICS datasets. Table 2 provides an example of how the data were matched for the indicator “Woman received at least three antenatal care visits (ANC) during last pregnancy”.
Table 2.

Example of how the estimates from NGO and DHS/MICS were matched for the indicator “Woman had at least three ANC visits during last pregnancy (%)”.

NGODHS/MICS
CountryRegionProvinceDistrictSourceYearLevelnEstimateSourceYearLevelnEstimate
EthiopiaAmhara+Tigray+ Oromia+Benishangul-Gumuz+SNNP--NIMS2017.54th40977.0DHS20164th401750.0
India-JharkhandPakurEficor20122nd30029.3DHS2005.53rd61835.9
India-JharkhandJharkhandIntraHealth20102nd520347.0DHS2005.53rd32036.6
India-Uttar PradeshUttar PradeshIntraHealth20102nd886050.0DHS2005.53rd130725.6
Pakistan-Khyber PakhtunkhwaBattagramRed Cross20122nd58322.4DHS2012.53rd152937.3
Pakistan-Khyber PakhtunkhwaSwatRed Cross20122nd58336.3DHS2012.53rd152937.3
TanzaniaKigoma--World Vision20174th48567.7DHS2015.54th27869.6
VietnamNorth Central and Central Coastal areaNghe AnQue PhongCARE20152nd19677.6MICS2013.54th30092.8
VietnamNorthern Midlands - Mountainous areaCao BangBao LacCARE20152nd19872.2MICS2013.54th23072.2
VietnamCentral HighlandsKon TumTu Mo RongCARE20152nd20071.0MICS2013.54th10968.5
VietnamNorth Central and Central Coastal area, Northern Midlands - Mountainous area, Central HighlandsNghe An+Cao Bang+Kon Tum-CARE20153rd59473.6MICS2013.54th64081.2
VietnamNorthern Midlands - Mountainous areaHoa BinhDa Bac+Hoa BinhOxfam20142nd47294.7MICS2013.54th23072.2
VietnamRed River DeltaHung YenPhu Cu+Hung YenOxfam20142nd74398.6MICS2013.54th34392.6
VietnamNorthern Midlands - Mountainous areaLang SonBinh Gia+Lang SonOxfam20142nd76793.9MICS2013.54th23072.2
VietnamNorthern Midlands - Mountainous area, Red River DeltaHoa Binh+Hung Yen+Lang Son-Oxfam20143rd198295.9MICS2013.54th57384.4

1st level represents village, town, locality or traditional authority; 2nd level: district or equivalent; 3rd level: province, state or equivalent; 4th level: region; 5th level: country.

ANC: antenatal care; DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

1st level represents village, town, locality or traditional authority; 2nd level: district or equivalent; 3rd level: province, state or equivalent; 4th level: region; 5th level: country. ANC: antenatal care; DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization. In total there were 129 indicators ( Table 3) from eight main groups including child anthropometry, child diet, child health, household characteristics, household wealth, maternal characteristics, maternal health, and WASH. We excluded estimates based on fewer than ten observations (n=64), in either the DHS/MICS or NGO data, retaining a total of 1,996 pairs of NGO-DHS/MICS indicators for analyses.
Table 3.

List of indicators collected by group and subgroup *

GroupSubgroupN indicators in subgroupDetails
Child anthropometryStunting19There are separate indicators by age groups, and for boys and girls (separated and combined)
Child anthropometryUnderweight22There are separate indicators by age groups, and for boys and girls (separated and combined)
Child dietAte 4+ food groups5By age group and by breastfeeding status, and combined
Child dietBottle fed yesterday3By age group, and combined
Child dietConsumed iron-rich foods4By age group, and combined
Child dietConsumed vitamin A-rich foods1
Child dietContinued breastfeeding4By age group
Child dietExclusive breastfeeding: 0-6 m3Boys and girls separately and combined
Child dietInitiation of breastfeeding within 1 hour of birth3Boys and girls separately and combined
Child dietReceiving solid, semi-solid or soft foods: 6-8 m1
Child healthChild took supplement/vaccine4Child received iron or vitamin A supplements, child received DPT and measles by 12 months of age, newborn protected by tetanus vaccine
Child healthDiarrhea in last two weeks6By age group (diarrhea in 0-5m is separate subgroup)
Child healthDiarrhea in the last two weeks: 0-5 m1
Child healthReceived diarrhea treatment4Those with diarrhea received ORS, ORT, homemade fluids, ORS+ zinc
Child healthFor those with diarrhea in last 2 weeks, given more to drink1
Child healthFor those with diarrhea in last 2 weeks, given more to eat1
HH characteristicsIndividuals who have ever been married1
HH characteristicsHead of household is male1
HH characteristicsHousehold has electricity1
HH characteristicsUrban residence1
HH wealthHousehold has a car
HH wealthHousehold has agricultural land/bike/phone3Household has land, bike, phone
HH wealthHousehold has animals6Household has cattle, chickens, goats, horses, livestock, poultry, sheep
Maternal characteristicsWoman able to read1
Maternal characteristicsWoman never attended school1
Maternal healthBirth at a health facility/assisted by a skilled birth attendant (SBA)3Last birth at health facility, attended by SBA, assisted by SBA
Maternal healthWoman consumed/received iron supplements5Woman received iron supplements, woman consumed iron supplements on 1+, 90+, 100+, 150+ days
Maternal healthWoman received antenatal care (ANC)4In last pregnancy, woman had ANC in first trimester, woman had 1+, 3+, 4+ ANC visits
Maternal healthWoman received postnatal care (PNC)3Woman received PNC, Woman received PNC with 2 days/3 days of birth
Maternal healthWoman's antenatal care (ANC) content5During ANC woman had blood/urine test, blood pressure taken, received 2+ TT vaccines, was weighed.
WASHHandwash station has ash/sand/soap/water3Household handwash station has ash/sand, water, soap
WASHHousehold dispose child stool in toilet/latrine1
WASHHousehold has improved drinking water1
WASHHousehold has improved sanitation1
WASHHousehold shares toilet1
WASHHousehold treats drinking water2Household bleaches/boils drinking water
WASH30+ min for household to obtain drinking water1

for a complete list of all the indicators see Table 2 in HealthBridge (2020).

HH: household; WASH: Water, Sanitation, and Hygiene; DPT: diphtheria, pertussis and tetanus; ORS: oral rehydration salts; ORT: oral rehydration therapy; SBA: skilled birth attendant; ANC: antenatal care; PNC: postnatal care; TT: tetanus toxoid.

for a complete list of all the indicators see Table 2 in HealthBridge (2020). HH: household; WASH: Water, Sanitation, and Hygiene; DPT: diphtheria, pertussis and tetanus; ORS: oral rehydration salts; ORT: oral rehydration therapy; SBA: skilled birth attendant; ANC: antenatal care; PNC: postnatal care; TT: tetanus toxoid. After collating the data, we grouped similar indicators into 37 subgroups ( Table 3) on the basis of whether they had similar definitions/concepts (e.g. stunting prevalence in different age groups). We refined the grouping by using scatterplots of the difference of estimates by year difference and geographical level difference to check if any indicators differed widely from others in the grouping. After assessing the indicators graphically, we separated “Diarrhea in the last two weeks: 0-5m” from the same indicator for other age groups since the differences of estimates were closer to zero for this age group than the others. We also separated “Household has a car” from the subgroup “Household has agricultural land/bike/phone” since car ownership was much lower than ownership of other assets.

Analysis

NGO versus DHS/MICS We subtracted NGO from DHS/MICS estimates to calculate difference and absolute difference between estimates. To compare data from NGO and DHS/MICS we used: same or different season of data collection; number of years difference between data collection (DHS/MICS year - NGO year); and number of geographical levels difference (DHS/MICS level - NGO level). If data collection spanned two years, for instance data collection started in 2013 and was completed in 2014, the year of data collection was coded as “2013.5”. Geographical level difference was calculated by subtracting the NGO level from DHS/MICS level. For example, we subtracted district level data available from the Mozambique NGO survey (level=2) from province level data collected in the DHS (level=3), making the geographical level difference one. We grouped geographical level differences as: no difference; one level difference; 2-3 levels difference. We plotted how difference and absolute difference between DHS/MICS and NGO estimates varied with the indicator and indicator grouping. We used Analysis of Variance (ANOVA) to partition the variance of difference or absolute difference between estimates (DHS/MICS estimate - NGO estimate) by indicator, geographical level difference (as 0,1,2+), year difference (continuous), and season (same season, different season, season unknown). DHS versus DHS In order to better understand the contribution of difference in methods employed in the different sources of survey data (DHS/MICS and NGO) to the resulting difference in estimates, we repeated the analyses used to compare DHS/MICS and NGO estimates but this time comparing DHS data from one country, year and geographical level to a different year and/or geographical level from the same country. The assumption is that the DHS methods are similar between years and geographical levels, whereas DHS/MICS and NGOs may use somewhat different methods. There is a level of discordance between DHS/MICS and NGO estimates, and there would also be discordance between two DHS estimates. The difference between DHS/MICS-NGO discordance and DHS-DHS discordance will not be due to difference in years, or geographical levels, but rather due to difference in methods. For the DHS-DHS comparisons, we compiled DHS data from the seven countries that contributed the most pairs in the DHS/MICS-NGO dataset: Bangladesh, Ethiopia, Kenya, Malawi, Pakistan, Tanzania, and Zambia. Retaining the same indicators as in the DHS/MICS - NGO comparisons, we calculated estimates for different geographical levels, i.e. at the country level, and for each region, province and district available. For this analysis, we included district data to mimic the NGO data, even though these estimates are not always representative at this level in the DHS. We excluded indicators based on a sample size smaller than ten observations (n=26,539). We matched DHS indicators from different cycles and geographical levels using different combinations mimicking the actual DHS/MICS-NGO scenarios: indicators from the same level but different years (Scenario 1), indicators from the same year but different levels (Scenario 2), and indicators from different years and levels (Scenario 3). To mimic the NGO data, we used data from the most recent cycle and the lower geographical levels, whereas to represent the comparative DHS data we used older DHS cycle and higher geographical level data. Using DHS data only, we were not able to simulate a scenario where DHS/MICS and NGO data were from the same year and geographical level. Table 4 provides an example of how we compared the estimates for an ANC indicator in Zambia using 31 pairs from DHS in the three scenarios for this one country. Repeating across all indicators and all countries yielded 109,251 pairs of DHS-DHS indicators.
Table 4.

Example of how the estimates for the indicator “Woman had at least three ANC visits during last pregnancy (%)” in Zambia were compared using DHS data from different/same years and geographical levels.

Data from DHS earlier year or higher levelData from DHS later year or lower level
ProvinceLevelYearnEstimateLevelYearnEstimate
DHS data from different years but same geographical level (Scenario 1) Central3rd2013.578989.23rd201874689.0
Copperbelt3rd2013.585391.33rd201873091.4
Eastern3rd2013.51,13689.13rd201887593.6
Luapula3rd2013.598888.23rd201881791.9
Lusaka3rd2013.590488.53rd201881889.3
Muchinga3rd2013.585086.73rd201866191.2
North Western3rd2013.592786.13rd201860892.6
Northern3rd2013.598185.43rd201869290.6
Southern3rd2013.51,03689.93rd201874694.5
Western3rd2013.579385.23rd201861290.4
All5th2013.59,25788.55th20187,30591.5
DHS data from the same year but different geographical levels (Scenario 2) Central5th2013.59,25788.53rd2013.578989.2
Copperbelt5th2013.59,25788.53rd2013.585391.3
Eastern5th2013.59,25788.53rd2013.51,13689.1
Luapula5th2013.59,25788.53rd2013.598888.2
Lusaka5th2013.59,25788.53rd2013.590488.5
Muchinga5th2013.59,25788.53rd2013.585086.7
North Western5th2013.59,25788.53rd2013.592786.1
Northern5th2013.59,25788.53rd2013.598185.4
Southern5th2013.59,25788.53rd2013.51,03689.9
Western5th2013.59,25788.53rd2013.579385.2
DHS data from different years and different geographical levels (Scenario 3) Central5th2013.59,25788.53rd201874689.0
Copperbelt5th2013.59,25788.53rd201873091.4
Eastern5th2013.59,25788.53rd201887593.6
Luapula5th2013.59,25788.53rd201881791.9
Lusaka5th2013.59,25788.53rd201881889.3
Muchinga5th2013.59,25788.53rd201866191.2
North Western5th2013.59,25788.53rd201860892.6
Northern5th2013.59,25788.53rd201869290.6
Southern5th2013.59,25788.53rd201874694.5
Western5th2013.59,25788.53rd201861290.4

3rd level represents province level data and 5th level represents country-level data.

ANC: antenatal care; DHS: Demographic and Health Surveys.

3rd level represents province level data and 5th level represents country-level data. ANC: antenatal care; DHS: Demographic and Health Surveys. We calculated the difference and absolute difference between these pairs of estimates, mimicking the scenarios from the DHS/MICS-NGO data. Table 5 summarises the DHS cycles included as well as the geographical level comparison for each scenario in each of the seven countries.
Table 5.

Demographic and Health Survey (DHS) cycles and geographical level comparison included in the DHS vs DHS analysis.

Scenario 1 (N=9,024)Scenario 2 (N=56,185)Scenario 3 (N=44,042)
CountryDHS cycleGeographical level comparisonDHS cycleGeographical level comparisonDHS cycleGeographical level comparison
Bangladesh2011 20143rd-3rd 5th-5th20143rd-2nd 5th-2nd 5th-3rd2011 20143rd-2nd 5th-2nd 5th-3rd
Ethiopia2011 20163rd-3rd 5th-5th20165th-3rd2011 20165th-3rd
Kenya2008.5 20143rd-3rd 5th-5th20145th-3rd2008.5 20145th-3rd
Malawi2010 2015.53rd-3rd 4th-4th 5th-5th2015.54th-3rd 5th-3rd 5th-4th2010 2015.54th-3rd 5th-3rd 5th-4th
Pakistan2006.5 2012.53rd-3rd 5th-5th2012.53rd-2nd 5th-2nd 5th-3rd2006.5 2012.53rd-2nd 5th-2nd 5th-3rd
Tanzania2010 2015.54th-4th 5th-5th2015.54th-2nd 5th-2nd 5th-4th2010 2015.54th-2nd 5th-2nd 5th-4th
Zambia2013.5 20183rd-3rd 5th-5th2013.55th-3rd2013.5 20185th-3rd

Geographical level comparison: geographical level from older cycle vs geographical level from most recent cycle included.

2nd level represents district or equivalent; 3rd level: province, state or equivalent; 4th level: region; 5th level: country.

Scenario 1: DHS data from different years compared using the same geographical levels.

Scenario 2: DHS data from the same year compared using different geographical levels.

Scenario 3: DHS data from different years compared using different geographical levels.

Geographical level comparison: geographical level from older cycle vs geographical level from most recent cycle included. 2nd level represents district or equivalent; 3rd level: province, state or equivalent; 4th level: region; 5th level: country. Scenario 1: DHS data from different years compared using the same geographical levels. Scenario 2: DHS data from the same year compared using different geographical levels. Scenario 3: DHS data from different years compared using different geographical levels. Finally, as with DHS/MICS vs NGO estimates, we used ANOVA to partition the variance of difference or absolute difference between DHS estimates by indicator, geographical level difference, and year difference. We did not include season in this analysis since most DHS data are collected during the same season within a country.

Simulations

We simulated a situation where the only source of imprecision of the indicator’s measures would be from sampling error, in order to separate this known and estimable source of error from other sources of error that lead to differences in indicator estimates. The simulation samples from a "true" prevalence (p) of 1%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, and 99%. We assumed an n of 500, which was a typical sample size of both DHS and NGO samples in our data set. We then generated a “Baseline Estimate 1” (to mimic the DHS/MICS estimates) by drawing randomly from a binomial distribution with mean n*p and variance np(1-p). A “Baseline estimate 2” (to mimic the NGO estimate) was generated in the same way, and the difference between the first and second estimate was calculated. We ran 1,000 iterations to estimate the distribution of the differences. In order to investigate how absolute differences vary by the nature of the point prevalence estimates we used box plots to compare simulated, DHS-DHS and DHS/MICS-NGO absolute differences. All data were compiled in Microsoft Excel 15 and analyzed with SAS 9.4. This study respects current research ethics standards and it was approved by the Health Research Ethics Board of the Université de Montréal (CERSES-19-030-D).

Results

The NGO reports often presented over 100 indicators in their baseline reports. On average, 18 of their indicators were also available in the DHS/MICS datasets. The estimate sample size for the NGO surveys ranged from 12 to 16,530 and from 10 to 98,446 for the DHS/MICS. Table 6 presents, by indicator subgroup, mean DHS/MICS and NGO percentage prevalence estimates, mean difference between pairs (DHS/MICS minus NGO) and percentage of differences falling within 5 and 20 percentage points. Some subgroups have mean difference close to zero, but almost all have at least some pairs that are widely different (not within 20%). Fifteen subgroups had positive (DHSNGO) mean differences, but we identified no meaningful pattern in which indicators were negative and which were positive, and all the differences (except for consumption of vitamin A-rich foods) were within 1 standard deviation of 0.
Table 6.

DHS/MICS and NGO estimates, difference between estimates (DHS/MICS minus NGO) and proportion of estimates within 5% and 20% difference by subgroup of indicators.

DHS/MICS estimateNGO estimateDifference between estimatesPercentage of indicator pairs with difference within:
SubgroupNMeanSDMeanSDMeanSDMinMax5%20%
Child anthropometry
Stunting (%)13130.610.436.411.9-5.79.8-42.120.638.293.1
Underweight (%)13126.912.818.58.78.511.1-15.232.330.580.9
Child diet
Ate 4+ food groups (%)6721.39.222.612.3-1.312.3-23.228.925.494.0
Bottle fed yesterday (%)338.86.46.99.41.96.8-20.813.163.697.0
Consumption of iron-rich foods (%)3028.012.218.615.59.419.0-39.252.310.070.0
Consumption of vit A-rich foods (%)430.825.119.124.511.73.67.716.10.0100.0
Continued breastfeeding (%)3282.616.879.022.23.610.3-10.832.453.190.6
Exclusive breastfeeding: 0-6 m (%)6042.017.462.120.0-20.120.2-60.122.216.746.7
Initiation of breastfeeding within 1 hour of birth (%)6467.617.059.018.48.618.7-33.555.235.975.0
Receiving foods: 6-8 m (%)1869.818.266.123.23.730.3-53.450.616.744.4
Child health
Child received supplement/vaccine (%)1057.621.765.725.8-8.115.7-37.914.620.080.0
Diarrhea in the last two weeks (%)8619.19.430.720.2-11.614.5-46.815.833.770.9
Diarrhea in the last two weeks: 0-5 m (%)119.87.014.97.8-5.13.7-10.42.854.5100.0
Diarrhea treatment (%)3136.024.341.717.8-5.622.9-50.555.519.461.3
Diarrhea, given more to drink (%)2219.310.727.016.8-7.721.4-52.330.113.663.6
Diarrhea, given more to eat (%)148.84.46.84.92.03.9-6.07.964.3100.0
Household characteristics
Ever married (%)5796.59.685.814.510.810.3-5.031.742.177.2
Household has electricity (%)2043.840.644.638.8-0.89.2-21.915.460.095.0
Head of household is male (%)7885.611.687.98.4-2.38.8-25.325.856.492.3
Urban residence (%)1223.011.931.811.8-8.815.5-36.312.233.375.0
Household wealth
Household has a car (%)462.02.41.83.90.22.7-12.45.391.3100.0
Household has agricultural land/bike/phone (%)15056.128.450.529.55.714.4-52.941.734.782.7
Household has animals (%)7341.925.337.923.24.09.5-24.029.541.193.2
Maternal characteristics/health
Woman able to read (%)833.222.927.314.65.911.4-6.421.012.587.5
Woman never attended school (%)5840.329.936.528.53.812.5-43.753.137.994.8
Birth at a health facility/assisted by skilled birth attendant (%)12746.522.759.124.0-12.615.6-45.049.017.369.3
Woman received/consumed iron supplements (%)6349.728.449.832.8-0.217.8-38.055.533.376.2
Woman received antenatal care (%)16258.224.563.223.7-5.016.1-35.754.624.175.3
Woman received postnatal care (%)5141.617.144.023.6-2.429.2-65.678.05.951.0
Woman’s ANC content (%)5655.321.457.226.2-1.927.6-60.946.414.348.2
WASH
Household dispose child stool in toilet/latrine (%)1455.927.565.426.8-9.514.8-38.720.328.678.6
Household has improved drinking water (%)8764.526.870.324.2-5.717.2-54.651.934.581.6
Household has improved sanitation (%)8233.521.440.827.8-7.326.8-62.477.512.253.7
Household shares toilet (%)1131.713.928.513.73.218.0-21.247.727.381.8
Household treats drinking water (%)528.48.715.214.1-6.815.4-50.819.153.886.5
Handwash station has ash/sand/soap/water (%)2520.812.530.521.5-9.816.8-57.826.624.076.0
Time to obtain drinking water 30+ min (%)2033.412.136.120.2-2.722.8-46.258.325.075.0

DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization; SD: standard deviation; ANC: antenatal care; WASH: Water, Sanitation and Hygiene.

DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization; SD: standard deviation; ANC: antenatal care; WASH: Water, Sanitation and Hygiene. Figure 1 presents the scatterplots of NGO against DHS/MICS estimates by subgroup of indicators. For all subgroups, there was some correlation between the DHS/MICS and NGO estimates. Figure 2 shows the boxplot distribution of the mean difference between estimates by subgroup. The only subgroups that had all the pairs of indicators within ±20% were “Consumption of vitamin A-rich foods”, “Bottle fed yesterday”, “Diarrhea in the last two weeks: 0-5m”, “Diarrhea in the past two weeks: given more to eat”, and “Household has a car”. Other indicators that had most of their pairs within ±20% were “Household treats drinking water” and “Ever married”. All the indicators with the smallest differences between estimates had very low or very high prevalence ( Table 6), except for “Consumption of vitamin A-rich foods” (that was based on only four pairs of estimates).
Figure 1.

DHS/MICS estimate by NGO estimate by subgroup of indicators.

Abbreviations: BF: breastfeeding; HH: household; HF: health facility; SBA: skilled birth attendant; ANC: antenatal care; PNC: postnatal care; DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

Figure 2.

Difference between estimates (DHS/MICS minus NGO) by subgroup of indicators.

Abbreviations: Anthros: anthropometry indicators; HH: household; WASH: Water, Sanitation, and Hygiene; BF: breastfeeding; HF: health facility; SBA: skilled birth attendant; ANC: antenatal care; PNC: postnatal care; DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

DHS/MICS estimate by NGO estimate by subgroup of indicators.

Abbreviations: BF: breastfeeding; HH: household; HF: health facility; SBA: skilled birth attendant; ANC: antenatal care; PNC: postnatal care; DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

Difference between estimates (DHS/MICS minus NGO) by subgroup of indicators.

Abbreviations: Anthros: anthropometry indicators; HH: household; WASH: Water, Sanitation, and Hygiene; BF: breastfeeding; HF: health facility; SBA: skilled birth attendant; ANC: antenatal care; PNC: postnatal care; DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization. Table 7 summarizes the absolute differences between DHS/MICS and NGO, and between DHS and DHS. They are summarized according to the similarity of data collection timing (year and season), geographical level, and sample size. Using the absolute difference enabled us to see the size of the difference without taking the direction into account. The absolute difference between DHS/MICS and NGO estimates increases as year difference increases, as geographical levels difference increase, and as sample sizes decrease. The differences between DHS and DHS show similar patterns in terms of broad geographical level, sample size, and ≥3.5 years versus 0 to 3 years’ time differences.
Table 7.

Absolute difference of estimates by year difference, season, geographical level, and sample size.

DHS/MICS vs NGODHS vs DHS
VariableNMeanSDMedianIQRNMeanSDMedianIQR
Year difference
≤1 year49511.610.49.212.65618510.110.26.911.7
1.5-3 years86012.812.88.415.980249.39.16.610.7
≥3.5 years64113.813.210.115.14504213.613.99.215.3
Season
Same season115313.112.89.014.4-----
Different season60311.811.28.514.6-----
Season unknown24014.213.010.316.2-----
Geographical level difference
067712.512.68.314.2902410.111.56.211.4
189713.112.39.615.43027510.510.97.111.9
2+42212.812.29.014.96995212.112.48.213.8
Geographical level 1 [ a ] , [ b ]
Country147.77.73.96.16123011.912.28.113.7
Region125913.112.99.015.92524810.912.27.012.0
Province72312.411.49.313.42277310.810.87.512.6
Geographical level 2 [ c ] , [ d ]
Country147.77.73.96.18967.08.73.97.2
Region36912.612.28.714.2882611.313.17.112.7
Province42212.512.49.013.7308759.19.75.910.1
District96313.012.39.315.26865412.612.68.814.5
Village22813.513.38.617.6-----
Sample size 1 [ a ] , [ b ]
Tertile 1 (n [ a ]=335, n [ b ]=709)66314.113.19.816.63641811.512.17.812.6
Tertile 265612.912.49.315.03669511.211.77.512.9
Tertile 3 (n [ a ]=772, n [ b ]=5282)67711.611.58.213.33613811.712.17.813.8
Sample size 2 [ c ] , [ d ]
Tertile 1 (n [ c ]=236, n [ d ]=37)66414.813.710.417.53648013.713.010.014.9
Tertile 266812.012.08.114.03640711.711.98.013.1
Tertile 3 (n [ c ]=757, n [ d ]=104)66411.711.18.713.4363649.010.45.410.3

For the DHS/MICS - NGO comparison, refers to the DHS/MICS data.

For the DHS - DHS comparison, refers to the DHS data from the higher geographical level and earlier survey year.

For the DHS/MICS - NGO comparison, refers to the NGO data.

For the DHS - DHS comparison, refers to the data mimicking the NGO (from the lower geographical level and more recent survey year).

DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

For the DHS/MICS - NGO comparison, refers to the DHS/MICS data. For the DHS - DHS comparison, refers to the DHS data from the higher geographical level and earlier survey year. For the DHS/MICS - NGO comparison, refers to the NGO data. For the DHS - DHS comparison, refers to the data mimicking the NGO (from the lower geographical level and more recent survey year). DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization. Table 8 shows the partition of variation results from DHS/MICS vs NGO and DHS vs DHS comparison. For DHS/MICS vs NGO about 15% of the variance was attributed to the indicator and less than 1% attributed to geographical level, year and season difference. For DHS vs DHS, geographical level and year account for more variation in absolute difference (1.25 and 4.5% respectively). However, in all cases, most (>82%) of the variance was unattributed, that is, it remained unexplained by the model.
Table 8.

Partition of variance of difference and absolute difference between estimates by indicator, geographical level difference, year difference, and season.

DHS/MICS vs NGODHS vs DHS
Percent variance due to (%):Percent variance due to (%):
Dependent variablenIndicatorGeo. level differenceYear differenceSeasonOthernIndicatorGeo. level differenceYear differenceOther
Difference199616.690.000.610.0282.671092516.480.040.0093.48
Absolute difference199616.760.000.230.1582.8710925112.611.254.5081.63

Results from the ANOVA models.

Model: difference or absolute difference between estimates by indicator, geographical level difference (0,1,2+), year difference (continuous), and season (same season, different season, season unknown - in NGO vs DHS/MICS comparison only).

DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization; ANOVA: Analysis of Variance.

Results from the ANOVA models. Model: difference or absolute difference between estimates by indicator, geographical level difference (0,1,2+), year difference (continuous), and season (same season, different season, season unknown - in NGO vs DHS/MICS comparison only). DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization; ANOVA: Analysis of Variance. Results from all three comparisons, DHS/MICS - NGO, DHS - DHS, and Simulations, are shown in Figure 3 as boxplots of the absolute difference between estimates by the indicator reference value (the DHS estimate or the estimate simulating DHS). The distribution of absolute differences is similar between DHS/MICS - NGO and DHS - DHS, with DHS/MICS - NGO showing only a slightly larger spread. For all three types of comparisons, the distribution of the absolute difference between estimates is narrower in the extremes and larger when the reference value is between 35% and 65%. Since the simulated sampling error differences are small (range <10%), only a small proportion of the differences can be attributed to sampling error.
Figure 3.

Box plot of absolute difference between NGO and DHS/MICS estimates by the reference value.

Absolute difference between estimates calculated as:

Simulation: Simulated estimate 1 - Simulated estimate 2

DHS vs DHS: DHS estimate - DHS mimicking the NGO estimate (lower geographical level, more recent year of data collection)

DHS/MICS vs NGO: DHS/MICS estimate - NGO estimate

Reference value: DHS or the estimate mimicking DHS (higher geographical level, earlier year of data collection)

Abbreviations: DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

Box plot of absolute difference between NGO and DHS/MICS estimates by the reference value.

Absolute difference between estimates calculated as: Simulation: Simulated estimate 1 - Simulated estimate 2 DHS vs DHS: DHS estimate - DHS mimicking the NGO estimate (lower geographical level, more recent year of data collection) DHS/MICS vs NGO: DHS/MICS estimate - NGO estimate Reference value: DHS or the estimate mimicking DHS (higher geographical level, earlier year of data collection) Abbreviations: DHS: Demographic and Health Surveys; MICS: Multiple Indicator Cluster Surveys; NGO: non-governmental organization.

Discussion

Our study showed that many indicators presented large differences between NGO and DHS/MICS estimates. Almost all indicators had at least some pairs that were widely different. Only about 33% of the pairs of indicators were within 5%, and about 80% of the pairs of indicators were within 20%. Agreement between indicators was higher when comparing indicators that had low or high prevalence (e.g. <15% or >85%), which is consistent with sampling theory, but throughout the prevalence range, the distribution of differences in the DHS/MICS-NGO and DHS-DHS comparisons is larger than that found from sampling error alone (reflected in the simulation distribution). An NGO could obtain an accurate estimate using DHS/MICS data for indicators with expected values close to 0% or 100%. We had hoped that if DHS/MICS and NGO estimates were similar, then NGOs could forego baseline data collection and use as a substitute DHS/MICS estimates, or estimates from some other publicly available dataset instead, saving NGO time and money, and reducing respondent burden. While we cannot give a blanket recommendation that DHS and MICS could always replace NGO baseline surveys, there are at least some situations where DHS/MICS could be used to the NGO’s advantage: when the estimate is expected to be less than 15% or above 85%; when the indicator of interest is one of the few with consistent similarity between DHS/MICS and NGO estimates; and when the NGO has tolerance for estimates of low or unknown accuracy. We had hypothesized that publicly available data can provide estimates of baseline conditions similar to those reported in NGO baseline reports when matched as closely as possible for location, year, and season of data collection. From the descriptive analyses, we found that as year difference increased, the mean difference between estimates slightly increased, and estimates derived from lower geographical levels (such as village or district from NGO and province for DHS/MICS) contributed to a higher mean absolute difference between estimates. In general, larger sample sizes were obtained at higher geographical levels and the larger the sample size (with their smaller sampling error) from DHS/MICS or NGO, the smaller the mean absolute difference between estimates. This meant that the advantage of geographical proximity is offset by the larger sampling error associated with small sample sizes. Whether the seasons of data collection were matched or different did not make a measurable difference to the similarity between estimates. However, the partition of variance analyses showed that DHS/MICS and NGO estimates differed, for the most part, in unpredictable ways, and geographical levels, years difference and seasons explained only a small part of the variation. We hypothesize that large differences between estimates from NGO baseline reports and DHS/MICS data are due to three main reasons: It is possible that NGOs’ estimates are collected from different populations with different underlying true values. NGOs often try to target lower wealth villages, and so baseline estimates may be worse off than the nationally representative DHS/MICS estimates. Note, however, that differences in household wealth indicators were small (e.g. “Household has electricity” 0.8% difference; “Household has a car” 0.2% difference). Additionally, the differences between DHS/MICS and NGO estimates might reflect actual changes over the years or across different geographical locations. Results from the analyses comparing data from the same source (DHS) but from different years and geographical levels also resulted in large differences between estimates. Different methods employed while sampling, collecting, processing and analyzing data might also have contributed to the differences between DHS/MICS and NGO estimates. Several indicators related to maternal and child health included in this study have not been validated and some have been shown to have low validity, such as maternal report of skilled birth attendance ( Blanc ). Inappropriate conflation of answer options and inconsistent coding and analysis of DHS surveys has also been documented ( Footman ). High measurement error can result in bias in unpredictable direction and dimension, resulting in large differences between estimates. Whatever the cause of the large differences between estimates was, it was not possible to know which of the data sources (DHS/MICS or NGO) provided the most accurate estimation of the true prevalence in the NGOs target populations. Furthermore, while we have been comparing DHS/MICS and NGO point estimates, these indicators are measured with error. The standard error (SE) for the DHS indicators is greater than 5% in eleven percent of the estimates. An estimate with a standard error of 5% will have a 95% confidence interval of ± 9.8%. Our analyses document and try to understand the large differences between NGO and DHS/MICS estimates. However, a study comparing DHS data to a small population-based survey from Rwanda showed that nine out of fifteen indicators related to maternal, newborn and child health were within a 10% difference ( Langston ). Similarly, in case studies from Nepal and Vietnam ( HealthBridge, 2020) there were many indicators where the DHS/MICS and NGO estimates were similar. In Nepal 70% of indicators were within 20% of one another. Estimates for ANC, iron-folic acid uptake, vitamin A supplementation at 18-23 months and mobile ownership were similar while breastfeeding, child dietary diversity and tetanus vaccination in pregnancy differed widely. In contrast, in Vietnam NGO estimates for exclusive/continued breastfeeding and dietary diversity at 6-8 months were close to DHS, while others differed by >30%. Using secondary data may be useful, especially in situations of budget or mobility restraint, such as during the COVID-19 pandemic with limited data collection opportunities. However, use of DHS surveys may risk underestimating the scale of problems for poor and marginalised groups such as nomads or slum dwellers ( Carr-Hill, 2017). When using DHS/MICS data, the user must keep in mind the potential differences between DHS/MICS and NGO estimates. This study had some limitations. Most NGO data we used came from unpublished, not peer-reviewed reports created for internal use only. Indicators extracted from NGO reports were not necessarily consistent across all reports and often SDs or SEs were missing. Although, we matched the methods employed by the NGO as closely as possible in order to obtain the same indicators from DHS/MICS, some reports provided limited information concerning methods of data collection and analysis. Dates of and season of data collection were impossible to assess for eight reports. Assigning the geographical level of data from the NGO report was also challenging for some settings due to lack of contextual information. However, we were able to communicate with several NGOs in order to obtain supplementary information about the reports’ methods.

Conclusion

Our hypothesis was that publicly available data can provide estimates of baseline conditions similar to those reported in NGO baseline reports when matched as closely as possible for location, year, and season of data collection. Our answer to this, in brief, is that publicly available data can be used, if the NGO is tolerant of imprecise estimates. While an NGO may use the evidence presented here to justify forgoing their own baseline survey, they should keep in mind that DHS and MICS provide estimates for only some of the indicators of interest to the NGO. On average, we estimated 18 of the NGO’s indicators using DHS/MICS, but NGOs were often reporting 100+ estimates. Furthermore, collecting data in the NGO working area can provide valuable insights for project design and implementation.

Data availability

This study used data owned by the DHS, the MICS and the NGOs that shared their baseline report. The DHS data can be downloaded at: https://www.dhsprogram.com, and the MICS data can be obtained at: https://mics.unicef.org. The DHS and MICS require registration and data access are only granted for legitimate research purposes. The NGO reports were either available online on each NGO website or obtained by personal contact by email. The full list of NGO reports used in this study including report title, year of publication, organization name and how to access each report can be found at: Harvard Dataverse: Details on reports used in the Maxdata project. https://doi.org/10.7910/DVN/32FUQV ( Berti, 2021). Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication). This paper addresses a very interesting question but I must admit I remain unconvinced that the analyses presented here actually answer the question raised. From my perspective and that of the authors, the answer to this question is primarily related to how representative the data from the big surveys are of the program population. So for me, at one level the answer to this question is rather straightforward. If an NGO is working at provincial level and there has been a recent (in the last year) DHS or MICS survey that was sampled to be representative at the provincial level, then I would always use the DHS and MICS data as the baseline data forgoing the data collection by the NGO. Not only does this save time and energy, but my bias is that I believe the methods (e.g., sample size, mapping and household listing for sampling) used by major surveys like DHS and MICS are almost better than what an NGO would use. Of course, the answer to this question becomes less clear as the DHS or MICS survey data become less representative of the ideal program baseline, either in time or population. This is in large part what the analyses presented in this paper were all about. On the time issue, would I still use the DHS data if it was two years old? In large part the answer to that question depends on how rapidly the indicators you are measuring change. If, for example, one was interested in measuring baseline values such as total fertility rate, household composition, wealth which change very slowly two or even five year period between the survey and the start of the NGO program would not be a big concern. However, for indicators that may change quickly, say coverage of some interventions like bednet ownership, vitamin A supplementation coverage which can quickly be scaled up with campaigns, data from a survey that is two or more years old would probably not provide a very accurate measure. While the analyses presented here did some work to look at this issue I am not really sure that it was captured in their analyses. On population representation, using data from a very recent DHS survey that is sampled to be representative at a provincial level as baseline data for a program that covers 80% of the province may be okay. The data are not perfectly representative but unless there is extreme heterogeneity in the indicators of interest using the indicator values from the province probably provide a reasonable estimate of baseline coverage and therefore could replace a separate NGO-run survey. As with time, the key is how much variability is there within the population that was sampled for the DHS or MICS survey. This is hard to know, but clearly urban rural differences, ethnic mixes, topology all link to this. Again, I would think this would be a major issue that again, am not sure is captured in the current analyses. I think these are exactly the issues that the authors were seeking to address in their analyses so why do I feel like these results do not really help us much in finding the answer? In their analyses they matched DHS/MICS data to data from NGO’s and then looked at how well these points matched when they varies in time or population. One major issue for me revolves around the NGO survey data. The analytical approach basically assumes that NGO surveys produced the right answer (as they are for the correct population and for the right time) ignoring that methodological or procedural weakness in the NGO surveys (e.g., lack of household listing, mapping during sampling, small sample size, poor training and supervision of interviewers) may make their results far from a gold standard comparison. I think I would be happier with analyses that restricted the comparisons to NGO surveys where a review of the methods and procedures and the sample size of the survey make me more confident about the quality of the NGO estimates. The second issue I have is the inclusion of data from DHS and MICS surveys that is broken down into smaller geographic region that was part of the sampling frame. Even if there are sufficient households at the district level, I am left wondering why we would expect the data to be very representative if that was not part of the sampling frame. I suppose that was part of the authors point of the analyses, but then we were not surprised that we found very weak correspondence between estimates of variables at the smaller geographic areas. I am a bit surprised the authors did not build a bit more on previous work on small area estimation that has tried to address many of the same issues focusing not just on differences due to population differences but also on techniques to adjust for these. Is the work clearly and accurately presented and does it cite the current literature? Partly If applicable, is the statistical analysis and its interpretation appropriate? Yes Are all the source data underlying the results available to ensure full reproducibility? Yes Is the study design appropriate and is the work technically sound? Partly Are the conclusions drawn adequately supported by the results? Yes Are sufficient details of methods and analysis provided to allow replication by others? Yes Reviewer Expertise: Modeling, program evaluation, estimation procedures I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. The paper addresses an important issue tackled frequently by NGOs that struggle to decide whether to use existing data or collect their own at baseline. It follows a rigorous methodology and uses a good number of studies to draw conclusions from. Thank you for the excellent work. Below are some reflections, questions and suggestions. Introduction: Statement in p3 “In addition, the accuracy of the indicator estimates in NGO-led surveys may be insufficient for project design and monitoring purposes, due to relatively small sample sizes and the inherent high variability of the indicators of interest.” Table 4: Since the statement is in the introduction section, and since the paper shows later that sampling errors are not a factor for the studies that met the criteria of inclusion, how do we reconcile between both? A sample size of NGOs might be adequate for indicators such as antenatal care, postnatal care, delivery by skilled birth attendant, but not for illnesses among children under 5 years when we ask about children with symptoms in the past 2 weeks or when measuring the prevalence of early child marriage among adolescents since the age group is between 14 and 17, especially if the focus is on girls. Have such indicators been part of the simulations to see the potential errors by NGOs in sampling? Can the paper mention whether sample size by NGOs is a factor in such cases? Are such indicators the one that are in the small error of sampling in Figure 3? If so, would it be better for NGOs to use DHS/MICS data even when the surveys are a few years old or for a higher level of geography compared to their areas? In addition to the challenges in sample size, sometimes the questionnaire in NGOs’ baseline surveys are not technically sound to measure a standard indicator. Has this been observed from the cases reviewed? If so, the paper could refer to that and add suggest using the questionnaire of DHS/MICS if the NGO is going to collect its own baseline data, especially since they are adjusted to local context. Scenario 1 shows improvement for a geography over time but there is not much difference within the provinces or geographies in the same year. Scenario 2 shows there is not much difference between 3rd level and national level in the same survey. Scenario 3 shows estimates of a later study that is not much different from earlier (slight improvement) because the earlier performance was quite high at 88.5%. Question: The table illustrate an important point; to what level is it representative of all examined studies? It would be good to mention that. If it is not representative, could the paper cite another example where there are significant differences? Or, at least mention that there is, if this is the case. See related comment on Table 4 later. Table 6: Is there a need to explain in the methodology the rationale behind selecting 5% and 20% as the thresholds for comparison of differences? What would the picture be if the thresholds were 10% and 20%? Discussion: P19 statement “In general, larger sample sizes were obtained at higher geographical levels and the larger the sample size (with their smaller sampling error) from DHS/MICS or NGO, the smaller the mean absolute difference between estimates. This meant that the advantage of geographical proximity is offset by the larger sampling error associated with small sample sizes.” One would expect that the NGO would calculate the adequate sample size required using a sample calculator (like the one of RADAR project). If the resources available would result in a sample size that is significantly less than the adequate one, should there be a recommendation that the NGO uses DHS/MICS data? P19 statement “It is possible that NGOs’ estimates are collected from different populations with different underlying true values. NGOs often try to target lower wealth villages, and so baseline estimates may be worse off than the nationally representative DHS/MICS estimates. Note, however, that differences in household wealth indicators were small (e.g. “Household has electricity” 0.8% difference; “Household has a car” 0.2% difference).” Since the DHS presents some findings by wealth quintiles, one would expect that findings from the lowest quintile could represent the areas NGOs work in and be close to those from NGOs data. Was comparison between indicators values from NGOs and the lowest wealth quintile from DHS made? If so, could you add a statement to reflect that? P 19 statement “Additionally, the differences between DHS/MICS and NGO estimates might reflect actual changes over the years or across different geographical locations. Results from the analyses comparing data from the same source (DHS) but from different years and geographical levels also resulted in large differences between estimates.” Table 4 for Zambia does not show large differences. Are there other studies that have that? Conclusions: P 22 “Our hypothesis was that publicly available data can provide estimates of baseline conditions similar to those reported in NGO baseline reports when matched as closely as possible for location, year, and season of data collection. Our answer to this, in brief, is that publicly available data can be used, if the NGO is tolerant of imprecise estimates.” The paper also shows that NGO can use DHS/MICS when the values of the indicators are very high or very low. P 22 statement “While an NGO may use the evidence presented here to justify forgoing their own baseline survey, they should keep in mind that DHS and MICS provide estimates for only some of the indicators of interest to the NGO. On average, we estimated 18 of the NGO’s indicators using DHS/MICS, but NGOs were often reporting 100+ estimates. Furthermore, collecting data in the NGO working area can provide valuable insights for project design and implementation.” It would be good to expand on this in the discussion section. NGOs’ need to measure different outcome levels on knowledge, attitudes and practice; the first two guide project design, implementation and setting targets. NGOs also need to report on the different outcome levels to their donors. DHS/MICS focus more on practice/utilization of services and less on attitude and knowledge. Additional point In a webinar presenting the paper in March 2, 2021, it was mentioned if an NGO wants to have a baseline so it can compare with the end-line, it can have a properly randomized and controlled end-line that can give good findings on the project’s impact. Could that be added to the paper? Is the work clearly and accurately presented and does it cite the current literature? Yes If applicable, is the statistical analysis and its interpretation appropriate? I cannot comment. A qualified statistician is required. Are all the source data underlying the results available to ensure full reproducibility? Yes Is the study design appropriate and is the work technically sound? Yes Are the conclusions drawn adequately supported by the results? Yes Are sufficient details of methods and analysis provided to allow replication by others? Yes Reviewer Expertise: I have been involved in several baseline and end-line surveys for projects in pubic health conducted in developing countries and have witnessed the pros and cons of conducting them. My engagement has been in the planning, design, data analysis and reporting phases. I have also used findings from the DHS/MICS to guide project design, implementation and evaluation. I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
  5 in total

1.  Comparing two survey methods of measuring health-related indicators: Lot Quality Assurance Sampling and Demographic Health Surveys.

Authors:  Sarah C Anoke; Paul Mwai; Caroline Jeffery; Joseph J Valadez; Marcello Pagano
Journal:  Trop Med Int Health       Date:  2015-10-27       Impact factor: 2.622

2.  Neglected value of small population-based surveys: a comparison with demographic and health survey data.

Authors:  Anne C Langston; Debra M Prosnitz; Eric G Sarriot
Journal:  J Health Popul Nutr       Date:  2015-03       Impact factor: 2.000

Review 3.  Using multi-country household surveys to understand who provides reproductive and maternal health services in low- and middle-income countries: a critical appraisal of the Demographic and Health Surveys.

Authors:  K Footman; L Benova; C Goodman; D Macleod; C A Lynch; L Penn-Kekana; O M R Campbell
Journal:  Trop Med Int Health       Date:  2015-03-05       Impact factor: 2.622

4.  Assessing the validity of indicators of the quality of maternal and newborn health care in Kenya.

Authors:  Ann K Blanc; Charlotte Warren; Katharine J McCarthy; James Kimani; Charity Ndwiga; Saumya RamaRao
Journal:  J Glob Health       Date:  2016-06       Impact factor: 4.413

Review 5.  Measuring coverage in MNCH: tracking progress in health for women and children using DHS and MICS household surveys.

Authors:  Attila Hancioglu; Fred Arnold
Journal:  PLoS Med       Date:  2013-05-07       Impact factor: 11.069

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.