Literature DB >> 29786689

Gridded birth and pregnancy datasets for Africa, Latin America and the Caribbean.

W H M James1, N Tejedor-Garavito1,2,3, S E Hanspal1, A Campbell-Sutton3, G M Hornby1,3, C Pezzulo1,2, K Nilsen1, A Sorichetta1,2, C W Ruktanonchai1,2, A Carioli1, D Kerr1, Z Matthews4, A J Tatem1,2.   

Abstract

Understanding the fine scale spatial distribution of births and pregnancies is crucial for informing planning decisions related to public health. This is especially important in lower income countries where infectious disease is a major concern for pregnant women and new-borns, as highlighted by the recent Zika virus epidemic. Despite this, the spatial detail of basic data on the numbers and distribution of births and pregnancies is often of a coarse resolution and difficult to obtain, with no co-ordination between countries and organisations to create one consistent set of subnational estimates. To begin to address this issue, under the framework of the WorldPop program, an open access archive of high resolution gridded birth and pregnancy distribution datasets for all African, Latin America and Caribbean countries has been created. Datasets were produced using the most recent and finest level census and official population estimate data available and are at a resolution of 30 arc seconds (approximately 1 km at the equator). All products are available through WorldPop.

Entities:  

Mesh:

Year:  2018        PMID: 29786689      PMCID: PMC5963337          DOI: 10.1038/sdata.2018.90

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


Background & Summary

Accurate and detailed information on the spatial distribution and numbers of births and pregnancies is crucial for informing planning decisions related to public health[1]. The survival and health of women and their new-born babies in low income countries is a key priority, with the reduction of maternal and neonatal mortality central for meeting a number of the United Nations Sustainable Development Goals (specifically goals 3.1 and 3.2)[2]. Whilst progress has been made, there were still 303,000 maternal deaths in 2015 (ref. 3) and children in lower income countries are 14 times more likely to die during their first 28 days of life compared to their higher income counterparts. Despite this, the spatial detail of basic data on the numbers and distribution of births and pregnancies is often of a coarse resolution and difficult to obtain[4], with no co-ordination between countries and organisations to create one consistent set of subnational estimates for planning. Whilst there are clear inequalities of maternal and neonatal healthcare between nations[5], there are also large disparities within individual countries, with growing recognition that national levels and trends could be masking important sub-national variations[6]. For example, a study in Indonesia found that under-5 mortality was nearly four times higher in the poorest fifth of the population than in the richest fifth[7], and gaps like these are more likely to occur at the sub-national level[7-9]. Although progress has been made in reducing such inequalities, there is still substantial work to be done. As such, understanding sub-national variation and inequity in health status, wealth and access to resources is increasingly being recognised as central to meeting developmental goals[10]. To understand and tackle inequalities related to maternal and neonatal health, the first step is to have a detailed knowledge of the distribution of births and pregnancies, which is known to vary substantially due to population age and sex distribution and age specific fertility rates (ASFR)[4]. These are also valuable data for subnational planning and estimation, and calculation of subnational indicators that rely on births or pregnancies as a denominator. When considering maternal and neonatal health in lower income countries, infectious disease is a major concern as pregnant women and new-borns are particularly at risk from many diseases, such as malaria[11] and HIV[12]. This issue has recently been highlighted by the Zika virus outbreak in Latin America, further intensifying the need for detailed information on the number and distribution of births and pregnancies. Currently there is a clear lack of data for such analysis, with complete and continuous datasets of numbers of births only available at the national level (e.g., United Nations Population Division[13]). Whilst sub-national datasets are readily available for some countries, their spatial detail is often coarse with differences in the recorded metrics, sampling framework and data formats meaning that it is extremely difficult to assess burden within and across multiple nations. This study aims to overcome the data gap identified above by producing continental scale, gridded datasets of numbers of births and pregnancies with a spatial resolution of 30 arc seconds (approximately 1 km at the equator). Advances in computational power and spatial econometric techniques, as well as the increasing availability of geo-located data, have increased the ability to produce these fine spatial resolution datasets. As such, in the framework of the WorldPop project (www.worldpop.org), and extending the approaches described by Tatem et al.[4], an open access archive of gridded birth and pregnancy distribution datasets for all African, Latin America and Caribbean (LAC) countries has been created. This process used the most recent and finest level census, census microdata, household survey data and official population estimate data available to the authors at the time of writing, alongside a range of geospatial datasets.

Methods

Gridded estimates of live births were produced for 50 Latin American and Caribbean and 58 African countries at a spatial resolution of 30 arc seconds. This was achieved by combining the latest datasets on population distribution, population age and sex structure and fertility rates in a GIS environment. Estimates of pregnancies were additionally generated using national-level estimates for stillbirths, miscarriages and abortions from the Guttmacher Institute[14]. The workflow develops the methods presented by Tatem et al.[4], using a variety of data sources to construct continent wide datasets. The process is fully automated by a Python Script, allowing the rapid processing of multiple countries and alignment to a standard grid for the production of seamless continental scale datasets. The workflow is shown in Fig. 1 and described in detail below. Maps of the data sources and date for each country and whether urban and rural ASFR estimates were available can be found in the Supplementary Figures 1 and 2 respectively.
Figure 1

Schematic overview of the workflow adopted to generate gridded subnational births and pregnancy datasets.

ASFR=Age Specific Fertility Rate, DHS=Demographic and Health Survey, MICS=Multiple Indicator Cluster Survey, UNPD=United Nations Population Division.

The basis for estimation: population distributions

The population distribution forms a major component of the births and pregnancy estimation process. The WorldPop project has recently completed construction of gridded population distribution datasets for all low- and middle-income countries at a resolution of 30 arc seconds. Full details are provided on the WorldPop website (www.worldpop.org.uk) along with links describing the methods in detail[15-17]. This study uses the relevant regions of Africa (Data Citation 1) and Latin America and the Caribbean (Data Citation 2), whose total population is adjusted to match the most recent United Nations Population Division (UNPD)[18] 2015 estimates available when the population distribution datasets were produced. Figure 2a shows the gridded population distribution dataset for Bolivia as an example. To ensure data consistency, a WorldPop standard grid was used in processing; this is a gridded dataset providing ISO country codes at a resolution of 30 arc second (Data citation 3).
Figure 2

Examples of input, intermediate and output datasets for Bolivia.

(a) Gridded population input (WorldPop), (b) Age and sex structure disaggregated by administrative unit, (c) Derived gridded age and sex structure, (d) ASFRs disaggregated by region and urban vs rural, (e) Gridded births output, (f) Gridded pregnancies output.

Calculating the proportion of women of reproductive age

Sub-national information on age and sex structure was collected, specifically women of childbearing age grouped in seven 5-years age groups (i.e., 15–19, 20–24, 25–29, 30–34, 35–39, 40–44, 45–49), as defined by the UNPD[18]. Datasets for the majority of Africa were provided by Pezzulo et al.[19] whilst datasets for the remaining African, Latin America and the Caribbean countries were assembled from a variety of sources, following the protocols defined by Pezzulo et al.[19]. Table 1 (available online only) shows the source, spatial detail (i.e., administrative unit level) and reference year used for the countries processed in this study.
Table 1

Data sources for African, Latin American and Caribbean (LAC) countries from which age and sex proportions were derived

CountryISO CodeContinentData Type UsedYearAdministrative Unit Level (number of units)Data Source
Note: Data marked with (*) are from Pezzulo et al.[19]      
AnguillaAIALACCensus20010 (1)Statistics Department, Government of Anguilla
Antigua and BarbudaATGLACCensus20110 (1)Statistics Division, Government of Antigua and Barbuda
ArgentinaARGLACCensus20102 (528)Instituto Nacional de Estadística y Censos, Argentina
ArubaABWLACCensus20100 (1)Central Bureau of Statistics, Aruba
BahamasBHSLACCensus20101 (32)Department of Statistics, The Commonwealth of The Bahamas
BarbadosBRBLACCensus20101 (11)Barbados Statistical Service
BelizeBLZLACCensus20101 (6)Statistical Institute of Belize
BoliviaBOLLACCensus20122 (95)Instituto Nacional de Estadística, Bolivia
Bonaire, Saint Eustatius and SabaBESLACAnnual Stats20161 (3)Central Bureau of Statistics: Netherlands Antilles and Island Registries
BrazilBRALACCensus20102 (5570)Instituto Brasileiro de Geografia e Estatística
British Virgin IslandsVGBLACCensus20100 (1)Central Statistics Office, Government of the Virgin Islands
Cayman IslandsCYMLACCensus20100 (1)Economics and Statistics Office, Cayman Islands Government
ChileCHLLACCensus20141 (16)Instituto Nacional de Estadísticas, Chile
ColombiaCOLLACCensus20051 (33)National Administrative Department of Statistics, Colombia
Costa RicaCRILACCensus20111 (7)Instituto Nacional de Estadística y Censos, Costa Rica
CubaCUBLACCensus20121 (16)La Oficina Nacional de Estadísticas de Cuba
CuracaoCUWLACCensus20110 (1)Central Bureau of Statistics, Curaçao
DominicaDMALACCensus20111 (10)Central Statistics Office, Dominica
Dominican RepublicDOMLACCensus20102 (155)Oficina Nacional de Estadística, Dominican Republic
EcuadorECULACCensus20103 (1024)Instituto Nacional de Estadísticas, Ecuador
El SalvadorSLVLACAnnual Stats20091 (14)Directorate-General for Statistics and Census, El Salvador
Falkland IslandsFLKLACCensus20120 (1)Falkland Islands Government
French GuianaGUFLACAnnual Stats20160 (1)Institut National de la Statistique et des Etudes Economiques, France
GrenadaGRDLACCensus20011 (7)Regional Statistics Sub-Programme, Caribbean Community
GuadeloupeGLPLACAnnual Stats20160 (1)Institut National de la Statistique et des Etudes Economiques, France
GuatemalaGTMLACCensus20022 (352)Instituto Nacional de Estadística, Guatemala
GuyanaGUYLACCensus20121 (10)Bureau of Statistics, Guyana
HaitiHTILACCensus20011 (10)Institut Haïtien de Statistique et d'Informatique, Haïti
HondurasHNDLACCensus20131 (18)Instituto Nacional de Estadística, Honduras
JamaicaJAMLACCensus20111 (14)Statistical Institute of Jamaica
MartiniqueMTQLACAnnual Stats20160 (1)Institut National de la Statistique et des Etudes Economiques, France
MexicoMEXLACCensus20101 (32)Instituto Nacional de Estadística y Geografía, México
MontserratMSRLACCensus20110 (1)Statistics Department, Montserrat
NicaraguaNICLACCensus20052 (137)Instituto Nacional de Información de Desarrollo, Nicaragua
PanamaPANLACCensus20101 (13)Instituto Nacional de Estadística y Censos, Panamá
ParaguayPRYLACCensus microdata20021 (18)Integrated Public Use Microdata Series, International (IPUMSI)
PeruPERLACCensus20072 (194)Instituto Nacional de Estadística e Informática, Peru
Puerto RicoPRILACCensus20101 (78)Instituto de Estadísticas de Puerto Rico
Saint BarthelemyBLMLACAnnual Stats20130 (1)Institut National de la Statistique et des Etudes Economiques, France
Saint Kitts and NevisKNALACCensus20011 (14)Regional Statistics Sub-Programme, Caribbean Community
Saint LuciaLCALACCensus20100 (1)Central Statistics Office, Saint Lucia
Saint MartinMAFLACCensus20130 (1)Institut National de la Statistique et des Etudes Economiques, France
Saint Vincent and The GrenadinesVCTLACCensus20121 (6)Statistics Office, St. Vincent
Sint MaartenSXMLACCensus20110 (1)Department of Statistics, Sint Maarten
SurinameSURLACCensus20122 (62)General Bureau of Statistics, Suriname
Trinidad and TobagoTTOLACCensus20001 (15)Central Statistical Office, Trinidad and Tobago
Turks and Caicos IslandsTCALACCensus20120 (1)Statistical Office, Turks and Caicos Islands
UruguayURYLACCensus20112 (231)Instituto Nacional de Estadística, Uruguay
VenezuelaVENLACCensus20112 (337)Instituto Nacional de Estadística, Venezuela
Virgin Islands, U.S.VIRLACCensus20102 (20)U.S. Census Data and Statistics, USA
Algeria*DZAAfricaCensus20081 (48)Office National des Statistique, Algeria
Angola*AGOAfricaDemographic and Health Surveys- Malaria Indicators Surveys20111 (18)MEASURE Demographic and Health Surveys, USAID
Benin*BENAfricaDemographic and Health Surveys20111 (12)MEASURE Demographic and Health Surveys, USAID
Botswana*BWAAfricaCensus20062 (21)Central Statistics Office, Botswana
Burkina Faso*BFAAfricaCensus20061 (13)Institut National de la Statistique et de la Demographie (INSD), Burkina Faso
Burundi*BDIAfricaDemographic and Health Surveys20101 (17)MEASURE Demographic and Health Surveys, USAID
Cameroon*CMRAfricaDemographic and Health Surveys20111 (12)MEASURE Demographic and Health Surveys, USAID
Cape VerdeCPVAfricaCensus20101 (22)Instituto Nacional de Estatística, Cape Verde
Central African Republic*CAFAfricaMultiple Indicator Cluster Surveys20061 (17)UNICEF
Chad*TCDAfricaDemographic and Health Surveys20041 (8)MEASURE Demographic and Health Surveys, USAID
ComorosCOMAfricaCensus20010 (1)Department of Statistics, Comoros
Congo*COGAfricaDemographic and Health Surveys-Aids Indicator Surveys20091 (4)MEASURE Demographic and Health Surveys, USAID
Congo, The Democratic Republic Of The*CODAfricaDemographic and Health Surveys20131(11)MEASURE Demographic and Health Surveys, USAID
Cote D'ivoire*CIVAfricaDemographic and Health Surveys-Aids Indicator Surveys20051 (11)MEASURE Demographic and Health Surveys, USAID
Djibouti*DJIAfricaMultiple Indicator Cluster Surveys20061 (2)UNICEF
Egypt*EGYAfricaCensus microdata20061 (26)Integrated Public Use Microdata Series, International (IPUMSI)
Equatorial Guinea*GNQAfricaUnited Nations20100(1)World Population Prospects, United Nations Population Division
Eritrea*ERIAfricaUnited Nations20100 (1)World Population Prospects, United Nations Population Division
Ethiopia*ETHAfricaCensus20071 (11)Central Statistical Agency of Ethiopia
Gabon*GABAfricaDemographic and Health Surveys20121 (10)MEASURE Demographic and Health Surveys, USAID
Gambia*GMBAfricaMultiple Indicator Cluster Surveys20061 (13)MEASURE Demographic and Health Surveys, USAID
Ghana*GHAAfricaCensus—USCB20102 (110)United States Census Bureau, USAID
Guinea*GINAfricaDemographic and Health Surveys20121 (8)MEASURE Demographic and Health Surveys, USAID
Guinea-Bissau*GNBAfricaMultiple Indicator Cluster Surveys20061 (9)UNICEF
Kenya*KENAfricaCensus—USCB20101 (47)United States Census Bureau, USAID
Lesotho*LSOAfricaCensus20041 (10)Lesotho Bureau of Statistics
Liberia*LBRAfricaCensus20081 (15)Liberia Institute of Statistics and Geo-Information Service
Libyan Arab Jamahiriya*LBYAfricaUnited Nations20100 (1)World Population Prospects, United Nations Population Division
Madagascar*MDGAfricaDemographic and Health Surveys20091 (22)MEASURE Demographic and Health Surveys, USAID
Malawi*MWIAfricaCensus20082 (350)National Statistical Office, Malawi
Mali*MLIAfricaDemographic and Health Surveys20121 (9)MEASURE Demographic and Health Surveys, USAID
Mauritania*MRTAfricaMultiple Indicator Cluster Surveys20071 (13)UNICEF
MauritiusMUSAfricaCensus20111 (12)National Statistics Office, Mauritius
MayotteMYTAfricaCensus20160 (1)Institut National de la Statistique et des Etudes Economiques, France
Morocco*MARAfricaCensus20042 (15)Haut Commissariat au Plan, Morocco
Mozambique*MOZAfricaCensus20072 (129)Instituto Nacional de Estatística, Mozambique
Namibia*NAMAfricaCensus—USCB20102 (102)United States Census Bureau, USAID
Niger*NERAfricaDemographic and Health Surveys20121 (8)MEASURE Demographic and Health Surveys, USAID
Nigeria*NGAAfricaCensus20061 (37)National Bureau of Statistics, Nigeria
ReunionREUAfricaCensus20160 (1)Institut National de la Statistique et des Etudes Economiques, France
Rwanda*RWAAfricaCensus—USCB20102 (30)United States Census Bureau, USAID
Saint HelenaSHNAfricaCensus20082 (10)National Statistics Office, Saint Helena
Sao Tome and PrincipeSTPAfricaCensus20162 (7)Instituto Nacional de Estatística (INE), São Tomé e Príncipe
Senegal*SENAfricaCensus microdata20022 (30)Integrated Public Use Microdata Series, International (IPUMSI)
SeychellesSYCAfricaCensus20160 (1)National Bureau of Statistics, Seychelles
Sierra Leone*SLEAfricaCensus20042 (14)Statistics Sierra Leone
Somalia*SOMAfricaMultiple Indicator Cluster Surveys20061 (18)UNICEF
South Africa*ZAFAfricaCensus—USCB20103 (259)United States Census Bureau, USAID
South Sudan*SSDAfricaCensus microdata20081 (10)Integrated Public Use Microdata Series, International (IPUMSI)
Sudan*SDNAfricaCensus microdata20081 (15)Integrated Public Use Microdata Series, International (IPUMSI)
Swaziland*SWZAfricaDemographic and Health Surveys20071 (4)MEASURE Demographic and Health Surveys, USAID
Tanzania, United Republic Of*TZAAfricaCensus—USCB20102(117)United States Census Bureau, USAID
Togo*TGOAfricaDemographic and Health Surveys20131 (6)National Statistics Institute, Tunisia
Tunisia*TUNAfricaCensus20041 (24)MEASURE Demographic and Health Surveys, USAID
Uganda*UGAAfricaDemographic and Health Surveys20111 (10)United States Census Bureau, USAID
Western Sahara*ESHAfricaUnited Nations20100 (1)World Population Prospects, United Nations Population Division
Zambia*ZMBAfricaCensus—USCB20103 (150)United States Census Bureau, USAID
Zimbabwe*ZWEAfricaDemographic and Health Surveys20111 (10)MEASURE Demographic and Health Surveys, USAID
With the raw data recorded and documented according to different protocols determined by national governments, the project was presented with a wide range of table data formats and schemas. Data restructuring was achieved using scripting (R 3.3.1, Python 2.7) and table processing software (Microsoft Excel 2013). The resultant standardised tables contained fields corresponding to the proportionate values of people (both sexes) in each 5-year age group, and the overall proportion of males and females in each region. Table 2 shows an example of the standardised tables, for regions of Peru.
Table 2

Example of standardised table containing proportionate values of people by age group and sex for Peru.

Regiont_0_4t_5_9t_10_14t_15_19……t_60_64t_65_plusprop_m_tprop_f_t
(Note table is for illustrative purposes only and therefore does not display all age groups).         
PER_Callao_Callao0.090.080.090.09 0.030.060.490.51
PER_Cusco_Acomayo0.120.140.140.08 0.030.080.490.51
PER_Cusco_Anta0.100.120.140.10 0.030.080.500.50
PER_Cusco_Calca0.110.120.140.10 0.020.060.500.50
PER_Cusco_Canas0.120.140.140.09 0.030.080.500.50
PER_Cusco_Canchis0.100.120.130.10 0.030.070.490.51
Age and sex structure information was matched to vector geographical boundaries from the Global Administrative Areas (GADM) database[20], with the exceptions of Chile and Colombia where boundaries from the National Statistics Office were used. The extent of these boundaries was standardised to those defined by the WorldPop gridded ISO country code dataset using the Clip and Nibble tools in ArcGIS 10.3, executed as part of the Python[21] script. Figure 2b shows the distribution of females between the ages of 20 and 24 for Bolivia. Similar distributions were created for all other 5-year age groupings in the 15–49-year range.

Estimating fertility rates

Data on fertility was collected on a country-by-country basis to provide the most up to date and spatially detailed information. Data sources were chosen using a hierarchical approach as shown in Fig. 1, prioritising sources which included information on age specific fertility and those of the highest spatial detail. The type of fertility data used for each country is shown in Table 3 (available online only) and described in detail below. As with the age and sex structure datasets, restructuring and table processing was carried out using a variety of scripting and software packages (Python 2.7, R 3.3.1, Microsoft Excel 2013) to produce a common format and schema for each data type, as described in detail below.
Table 3

Fertility data sources for all African, Latin American and Caribbean countries

CountryISOTypeMeasureSourceYearNumber of Units
ASFR= Age Specific Fertility Rate, DHS=Demographic and Health Survey, MICS=Multiple Indicator Cluster Survey, UNPD=United Nations Population Division.      
ArubaABWCensusASFRNational institution[22]20001
AngolaAGOSurvey sampleASFRDHS[23]20118
AnguillaAIAVital registrationASFRUNPD 2008[24]20061
ArgentinaARGVital registrationBirth countNational institution[25]201224
Antigua and BarbudaATGNational estimateASFRUNPD 2015[18]2010–151
BurundiBDISurvey sampleASFRDHS[23]20129
BeninBENSurvey sampleASFRDHS[23]201223
Bonaire, Saint Eustatius and SabaBESVital registrationCrude birth rateNational institution[26]20153
Burkina FasoBFASurvey sampleASFRDHS[23]201426
BahamasBHSVital registrationBirth countNational institution[27]201318
Saint BarthelemyBLMVital registrationCrude birth rateNational institution[28]20131
BelizeBLZNational estimateASFRUNPD 2015[18]2010–151
BoliviaBOLSurvey sampleASFRDHS[23]200818
BrazilBRAVital registrationBirth countNational institution[29]20155570
BarbadosBRBNational estimateASFRUNPD 2015[18]2010–151
BotswanaBWAVital registrationBirth countNational institution[30]201415
Central African RepublicCAFSurvey sampleASFRDHS[23]199411
ChileCHLVital registrationASFRNational institution[31]201315
Cote D'ivoireCIVSurvey sampleASFRDHS[23]201221
CameroonCMRSurvey sampleASFRDHS[23]201122
Congo, The Democratic Republic Of TheCODSurvey sampleASFRDHS[23]201351
CongoCOGSurvey sampleASFRDHS[23]201115
ColombiaCOLSurvey sampleASFRDHS[23]201064
ComorosCOMSurvey sampleASFRDHS[23]20126
Cape VerdeCPVNational estimateASFRUNPD 2015[18]2010–151
Costa RicaCRIVital registrationBirth countNational institution[32]20157
CubaCUBSample surveyASFRMICS[33]201416
CuracaoCUWVital registrationBirth countNational institution[34]20131
Cayman IslandsCYMVital registrationBirth countNational institution[35]20151
DjiboutiDJINational estimateASFRUNPD 2015[18]2010–151
DominicaDMAVital registrationASFRUNPD 2008[24]20031
Dominican RepublicDOMSurvey sampleASFRDHS[23]201318
AlgeriaDZASurvey sampleASFRMICS[33]2012–1314
EcuadorECUVital registrationASFRNational institution[36]2012224
EgyptEGYSurvey sampleASFRDHS[23]201451
EritreaERISurvey sampleASFRNational institution[37]20106
Western SaharaESHSurvey sampleASFRDHS[23]2003–044
EthiopiaETHSurvey sampleASFRDHS[23]201121
Falkland IslandsFLKVital registrationCrude birth rateNational institution[38]20081
GabonGABSurvey sampleASFRDHS[23]201219
GhanaGHASurvey sampleASFRDHS[23]201420
GuineaGINSurvey sampleASFRDHS[23]201215
GuadeloupeGLPNational estimateASFRUNPD 2015[18]2010–151
GambiaGMBSurvey sampleASFRDHS[23]201314
Guinea-BissauGNBSurvey sampleASFRMICS[33]201417
Equatorial GuineaGNQNational estimateASFRUNPD 2015[18]2010–151
GrenadaGRDVital registrationASFRUNPD 2015[18]2010–151
GuatemalaGTMSurvey sampleASFRDHS[23]2014–1545
French GuianaGUFNational estimateASFRUNPD 2015[18]2010–151
GuyanaGUYSurvey sampleASFRDHS[23]200914
HondurasHNDSurvey sampleASFRDHS[23]201138
HaitiHTISurvey sampleASFRDHS[23]201221
JamaicaJAMVital registrationBirth countNational institution[39]201314
KenyaKENSurvey sampleASFRDHS[23]201510
Saint Kitts and NevisKNACensusASFRUNPD 2008[24]20011
LiberiaLBRSurvey sampleASFRDHS[23]201330
Libyan Arab JamahiriyaLBYNational estimateASFRUNPD 2015[18]2010–151
Saint LuciaLCANational estimateASFRUNPD 2015[18]2010–151
LesothoLSOSurvey sampleASFRDHS[23]201420
Saint MartinMAFNational estimateCrude birth rateWorld Bank[40]20141
MoroccoMARSurvey sampleASFRDHS[23]2003–0417
MadagascarMDGSurvey sampleASFRDHS[23]200842
MexicoMEXVital registrationBirth countNational institution[41]201532
MaliMLISurvey sampleASFRDHS[23]201517
MozambiqueMOZSurvey sampleASFRDHS[23]201121
MauritaniaMRTSurvey sampleASFRMICS[33]201123
MontserratMSRVital registrationASFRUNPD 2008[24]20041
MartiniqueMTQNational estimateASFRUNPD 2015[18]2010–151
MauritiusMUSNational estimateASFRUNPD 2015[18]2010–151
MalawiMWISurvey sampleASFRDHS[23]2015–1654
MayotteMYTNational estimateASFRUNPD 2015[18]2010–151
NamibiaNAMSurvey sampleASFRDHS[23]201326
NigerNERSurvey sampleASFRDHS[23]201215
NigeriaNGASurvey sampleASFRDHS[23]201572
NicaraguaNICSurvey sampleASFRDHS[23]200134
PanamaPANVital registrationBirth countNational institution[42]201212
PeruPERSurvey sampleASFRDHS[23]201248
Puerto RicoPRIVital registrationBirth countNational institution[43]2009–1078
ParaguayPRYVital registrationBirth countNational institution[44]201418
ReunionREUNational estimateASFRUNPD 2015[18]2010–151
RwandaRWASurvey sampleASFRDHS[23]2014–1560
SudanSDNSurvey sampleASFRMICS[33]201436
SenegalSENSurvey sampleASFRDHS[23]201428
Saint HelenaSHNVital registrationCrude birth rateNational institution[45]20131
Sierra LeoneSLESurvey sampleASFRDHS[23]201327
El SalvadorSLVSurvey sampleASFRMICS[33]201428
SomaliaSOMNational estimateASFRUNPD 2015[18]2010–151
South SudanSSDSurvey sampleASFRMICS[33]201020
Sao Tome and PrincipeSTPSurvey sampleASFRDHS[23]20088
SurinameSURNational estimateASFRUNPD 2015[18]2010–151
SwazilandSWZSurvey sampleASFRMICS[33]201416
Sint MaartenSXMNational estimateCrude birth rateWorld Bank[40]20131
SeychellesSYCNational estimateASFRUNPD 2015[18]2010–151
Turks and Caicos IslandsTCAVital registrationASFRUNPD 2008[24]20051
ChadTCDSurvey sampleASFRDHS[23]201441
TogoTGOSurvey sampleASFRDHS[23]201311
Trinidad and TobagoTTONational estimateASFRUNPD 2015[18]2010–151
TunisiaTUNSurvey sampleASFRMICS[33]2011–1218
Tanzania, United Republic OfTZASurvey sampleASFRDHS[23]201259
UgandaUGASurvey sampleASFRDHS[23]201419
UruguayURYVital registrationBirth countNational institution[46]201519
Saint Vincent and The GrenadinesVCTVital registrationBirth countNational institution[47]20135
VenezuelaVENVital registrationBirth countNational institution[48]2012335
British Virgin IslandsVGBVital registrationASFRUNPD 2008[24]20041
Virgin Islands, U.S.VIRNational estimateASFRUNPD 2015[18]2010–151
South AfricaZAFVital registrationBirth countNational institution[49]20159
ZambiaZMBSurvey sampleASFRDHS[23]201320
ZimbabweZWESurvey sampleASFRDHS[23]201019
To provide the finest spatial detail of the distribution of fertility across each country, ASFRs were estimated by 5-year age groups, disaggregated sub-nationally according to the relevant Demographic and Health Surveys (DHS), Multiple Indicator Cluster Surveys (MICS) or National Statistics Agency survey regions and by urban vs rural if available. Table 3 (available online only) indicates which countries had the required data for estimation. ASFRs for each 5-year age group were commonly derived from DHS or MICS, with the exceptions of Aruba, Chile and Eritrea where the relevant datasets were available from the corresponding National Statistics Office. For DHS and MICS surveys, ASFRs were estimated using a Stata program developed by Pullum[50], as discussed in Tatem et al.[4]. The program calculated the basic demographic indicator by deriving ASFRs for each of the seven 5-year age groups covering the reproductive life span from 15–49 years based on dividing the number of births to women in each age group, during a retrospective 3-years reference period, by the number of women-years during the same period. Data restructuring and table processing was carried out using R 3.3.1 and Microsoft Excel 2013 to produce a common format, as that shown in Table 4.
Table 4

Example of standardised country table containing the ASFR values for regions in Haiti with separate entries for urban and rural areas.

ISORegionRural/UrbanASFR 15_19ASFR 20_24ASFR 25_29ASFR 30_34ASFR 35_39ASFR 40_44ASFR 45_49Year
HTIWESTURBAN0.030.150.170.130.080.100.002012
HTINORTHURBAN0.050.100.120.140.070.030.012012
HTICENTRALURBAN0.050.100.220.200.090.060.002012
HTISOUTHURBAN0.030.100.140.170.070.070.022012
HTINIPPESURBAN0.010.090.180.130.090.040.002012
HTIWESTRURAL0.070.180.200.190.180.070.012012
HTINORTHRURAL0.080.170.210.160.130.070.022012
Datasets representing the boundaries of subnational regions (Table 3) (available online only) were assembled and the relevant ASFRs matched to them. If the ASFR data was available for urban and rural areas within the subnational regions, the MODIS 500 m Global Urban Extent dataset[51] was used to distinguish urban and rural areas and allocate the constant value within them. Figure 2d shows an example ASFR dataset for Bolivia, showing the ASFRs for one reproductive age range: the 20–24 age group by sub-region. Similar datasets were constructed for all other 5-year age groups within the 15 to 49 range. For countries where ASFRs disaggregated sub-nationally and by urban/rural were not available, information on the spatial variation of age structured fertility was sought from vital registration systems, census records and other national sources. This was routinely in the form of births registered per administrative unit per 5-year age grouping. Table 3 (available online only) shows for which countries this type of data (registered births per age group) was used, with Table 5 showing an example of the standardised table format for Venezuela.
Table 5

Example of standardised country table containing the number of registered births per age group for regions in Venezuela.

RegionISOb_15_19b_20_24b_25_29b_30_34b_35_39b_40_44b_45_49year
1VEN11110110658411242012
2VEN11313510980552472012
3VEN879102678842219458132012
4VEN11113111087532292012
5VEN150152129106623062012
6VEN151615127412012
7VEN203321179642012
8VEN3475024442991413402012
9VEN1971931377830602012
Sub-national ASFRs were calculated by dividing the number of births in each age group (e.g., Table 5) by the number of females in the corresponding group. The latter was derived from the WorldPop population distribution[15] and age-sex distributions produced following the methodology of Pezzulo et al.[19] and outlined in Table 1 (available online only). The UNPD provides national estimates of AFSRs by 5-year age grouping for the majority of countries[18,24]. These datasets were used where subnational information was not available. As with all other datasets, the country boundaries defined by WorldPop[15] were used to define the geographical extent. For 9 countries, there was no information available on age specific fertility, either sub-nationally or nationally. In these cases, crude birth rates were obtained from a variety of official sources (Table 3 (available online only)) which were subsequently matched to the appropriate GIS country boundaries supplied by WorldPop[15].

Estimating the number of births from fertility, population and age structure

For countries where measures of age specific fertility were available, the distribution of live births was estimated by multiplying the number of females in each age group (e.g., Fig. 2c) by the corresponding ASFR gridded dataset (e.g., Fig. 2d) or value (in the case of national ASFR estimates). The resultant seven age specific gridded datasets were summed to generate an estimate of total births. For countries where fertility was expressed simply as a crude birth rate (Table 3 (available online only)), the births distribution was calculated by multiplying the crude birth rate by the initial 30 arc second UNPD adjusted WorldPop population grid for 2015 (ref. 15). Finally, for each country, the distribution was scaled to match the UNPD estimate of the total number of births[18]. For countries where the UNPD does not provide an estimate of the total number of births, the initial total was used. An example of the final distributed births gridded dataset for Bolivia is shown in Fig. 2e, with results for Africa, Latin America and the Caribbean shown in Fig. 3 and Supplementary Table 1.
Figure 3

Estimated births and pregnancies per grid cell for Africa, Latin America and the Caribbean in 2015.

The grid cell resolution is 30 arc seconds (approximately 1 km at the equator) and co-ordinates refer to GCS WGS 1984.

Estimating the distribution of pregnancies

The Guttmacher institute has published country specific estimates of the number of stillbirths, miscarriages and abortions at the national level[14]. These estimates for 2014 were integrated with UNPD national estimates on numbers of live births[18] to construct a ratio between numbers of births and pregnancies. This ratio was applied to the live births distribution to generate an estimate of the distribution of pregnancies. For countries not covered by the Guttmacher dataset, the nearest suitable geographical country value was used. An example of the final distributed pregnancies gridded dataset for Bolivia is shown in Fig. 2f whilst results for the entire Africa, Latin America and the Caribbean are shown in Fig. 3.

Code availability

The Python code developed for production of the births and pregnancies datasets is publicly and freely available through Figshare[52]. The code consists of a Python programming language script (version 2.7; www.python.org) and relies on the ArcGIS 10.4.1 ArcPy site package for performing GIS specific spatial operations. The script is internally documented to both explain its purpose (including a description of the GIS-specific spatial operations it performs) and, when required, guiding the user through its customisation.

Data Records

The high-resolution births and pregnancies datasets described in this article referring to the 108 countries listed in Table 3 (available online only) are publicly and freely available through the WorldPop Repository (http://www.worldpop.org.uk/data/). A collection of these datasets has been compiled for the births for LAC (Data Citation 4) and Africa (Data Citation 5) and pregnancies for LAC (Data Citation 6) and Africa (Data Citation 7), as described in Table 6.
Table 6

Name and description of datasets available for Africa and Latin America and Caribbean countries.

NameDescriptionResolutionFiles FormatUniversity of Southampton DOI
Births in Latin America and the CaribbeanEstimated live births per grid cell for 2015 for LAC for 50 countries3 arc secondsGeoTIFF10.5258/SOTON/WP00529
Births in AfricaEstimated live births per grid cell for 2015 for Africa for 58 countries3 arc secondsGeoTIFF10.5258/SOTON/WP00528
Pregnancies in Latin America and the CaribbeanEstimated pregnancies per grid cell for 2015 for LAC for 50 countries3 arc secondsGeoTIFF10.5258/SOTON/WP00527
Pregnancies in AfricaEstimated pregnancies per grid cell for 2015 for Africa for 58 countries3 arc secondsGeoTIFF10.5258/SOTON/WP00526

Technical Validation

All data collected, assembled and used were (i) already validated by the corresponding data collector, owner and/or distributor, and (ii) further checked, in the framework of this project. The gridded 5-year age and sex count datasets constructed for Latin America and the Caribbean (e.g., Fig. 2c) were verified following the protocol outlined in Pezzulo et al.[19], who compiled and assessed similar datasets for Africa and Asia. Briefly, this comprised of summing all the layers into a single dataset (representing the total numbers of people for all age and sex groups at the grid cell level) and then subtracting it from the corresponding WorldPop continental gridded population count dataset to make sure that the country totals matched the UNPD estimates for the year in question. All fertility rates used in this study were checked, on a county-by-country basis, to make sure they were within reasonable ranges. Additionally, for countries where additional sources of fertility data were available, estimates were produced using all available sources to compare the adjusted total births. These results showed that, although differences may be observed at the grid cell level, the totals at the administrative unit level are very similar. Endeavours were made to assemble the most recent, reliable and spatially detailed data at the time of writing. However, additional input from readers who may have knowledge or access to more recent and/or better datasets are welcome for improving future iterations of the outputs. The accuracy and quality of fertility estimates from survey data such as those provided by the DHS, have been assessed in several reports, by testing the quality of the birth history data in a large number of countries. These checks were mainly aimed to identify potential omission and displacement of births, potential displacement of births, or misreporting of date of birth[53,54]. Overall, although a number of issues were identified for some countries, these studies found that most estimates were either good or of acceptable quality. Furthermore, outcomes from Pullum and Becker[53] show that in general the latest DHS surveys are less prone to issues like incomplete birthdates, omissions and displacement of births and deaths. Similarly, a more recent report from Pullum and Staveteig[55], exploring the quality and consistency of age and date reports in DHS surveys, demonstrates that DHS data is constantly evaluated to improve its quality. Modelled estimates of total number of births per country prior to adjustments (to match UNPD estimate) were also plotted against the UNPD estimates[18] to assess the size of differences obtained through using subnational data sources. Figure 4 shows the correlation for the 95 countries for which UNPD provides an estimate, with a corresponding R2 value of 0.982. Analysis was not possible for the 9 countries for which the UNPD does not provide an estimate, although these make up a very small proportion (0.01%) of the total births across the whole study area.
Figure 4

Total number of births per country estimated by this study plotted against the corresponding UNPD estimate.

Usage Notes

The datasets presented here can be used both to (i) support applications measuring sub-national metrics of maternal and new-born health and (ii) to inform planning decisions. However, considering that they represent modelling outputs generated using ancillary covariates for producing the underlying WorldPop population distribution datasets, to avoid circularity, they should not be used to make predictions or explore relationships about any of those ancillary datasets[56]. Thus, before using the births and pregnancies datasets in correlation analyses against factors which are included in the construction of the population distribution datasets (e.g., correlating birth distribution with land-cover), ideally the population modelling process should be re-run using the WorldPop-RF code[57] with the applicable covariates removed.

Additional information

How to cite this article: James, W. H. M. et al. Gridded birth and pregnancy datasets for Africa, Latin America and the Caribbean. Sci. Data 5:180090 doi: 10.1038/sdata.2018.90 (2018). Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  12 in total

Review 1.  Determining global population distribution: methods, applications and data.

Authors:  D L Balk; U Deichmann; G Yetman; F Pozzi; S I Hay; A Nelson
Journal:  Adv Parasitol       Date:  2006       Impact factor: 3.870

2.  Quantifying the number of pregnancies at risk of malaria in 2007: a demographic study.

Authors:  Stephanie Dellicour; Andrew J Tatem; Carlos A Guerra; Robert W Snow; Feiko O ter Kuile
Journal:  PLoS Med       Date:  2010-01-26       Impact factor: 11.069

3.  Applying an equity lens to child health and mortality: more of the same is not enough.

Authors:  Cesar G Victora; Adam Wagstaff; Joanna Armstrong Schellenberg; Davidson Gwatkin; Mariam Claeson; Jean-Pierre Habicht
Journal:  Lancet       Date:  2003-07-19       Impact factor: 79.321

4.  Intended and unintended pregnancies worldwide in 2012 and recent trends.

Authors:  Gilda Sedgh; Susheela Singh; Rubina Hussain
Journal:  Stud Fam Plann       Date:  2014-09

5.  Mapping for maternal and newborn health: the distributions of women of childbearing age, pregnancies and births.

Authors:  Andrew J Tatem; James Campbell; Maria Guerra-Arias; Luc de Bernis; Allisyn Moran; Zoë Matthews
Journal:  Int J Health Geogr       Date:  2014-01-04       Impact factor: 3.918

6.  Disaggregating census data for population mapping using random forests with remotely-sensed and ancillary data.

Authors:  Forrest R Stevens; Andrea E Gaughan; Catherine Linard; Andrew J Tatem
Journal:  PLoS One       Date:  2015-02-17       Impact factor: 3.240

7.  Subnational variation for care at birth in Tanzania: is this explained by place, people, money or drugs?

Authors:  Corinne E Armstrong; Melisa Martínez-Álvarez; Neha S Singh; Theopista John; Hoviyeh Afnan-Holmes; Chris Grundy; Corrine W Ruktanochai; Josephine Borghi; Moke Magoma; Georgina Msemo; Zoe Matthews; Gemini Mtei; Joy E Lawn
Journal:  BMC Public Health       Date:  2016-09-12       Impact factor: 3.295

8.  High resolution global gridded data for use in population studies.

Authors:  Christopher T Lloyd; Alessandro Sorichetta; Andrew J Tatem
Journal:  Sci Data       Date:  2017-01-31       Impact factor: 6.444

9.  WorldPop, open data for spatial demography.

Authors:  Andrew J Tatem
Journal:  Sci Data       Date:  2017-01-31       Impact factor: 6.444

10.  The contribution of poor and rural populations to national trends in reproductive, maternal, newborn, and child health coverage: analyses of cross-sectional surveys from 64 countries.

Authors:  Cesar G Victora; Aluisio J D Barros; Giovanny V A França; Inácio C M da Silva; Liliana Carvajal-Velez; Agbessi Amouzou
Journal:  Lancet Glob Health       Date:  2017-02-24       Impact factor: 26.763

View more
  8 in total

1.  Geographical distribution of fertility rates in 70 low-income, lower-middle-income, and upper-middle-income countries, 2010-16: a subnational analysis of cross-sectional surveys.

Authors:  Carla Pezzulo; Kristine Nilsen; Alessandra Carioli; Natalia Tejedor-Garavito; Sophie E Hanspal; Theodor Hilber; William H M James; Corrine W Ruktanonchai; Victor Alegana; Alessandro Sorichetta; Adelle S Wigley; Graeme M Hornby; Zoe Matthews; Andrew J Tatem
Journal:  Lancet Glob Health       Date:  2021-06       Impact factor: 38.927

2.  Local level estimates of food, drink and tobacco expenditure for Great Britain.

Authors:  William H M James; Nik Lomax; Mark Birkin
Journal:  Sci Data       Date:  2019-05-13       Impact factor: 6.444

3.  How accurate are modelled birth and pregnancy estimates? Comparison of four models using high resolution maternal health census data in southern Mozambique.

Authors:  Yolisa Prudence Dube; Corrine Warren Ruktanonchai; Charfudin Sacoor; Andrew J Tatem; Khatia Munguambe; Helena Boene; Faustino Carlos Vilanculo; Esperanca Sevene; Zoe Matthews; Peter von Dadelszen; Prestige Tatenda Makanga
Journal:  BMJ Glob Health       Date:  2019-07-01

4.  Targeted policy intervention for reducing red meat consumption: conflicts and trade-offs.

Authors:  William H M James; Nik Lomax; Mark Birkin; Lisa M Collins
Journal:  BMC Nutr       Date:  2022-08-16

5.  Measuring the availability and geographical accessibility of maternal health services across sub-Saharan Africa.

Authors:  A S Wigley; N Tejedor-Garavito; V Alegana; A Carioli; C W Ruktanonchai; C Pezzulo; Z Matthews; A J Tatem; K Nilsen
Journal:  BMC Med       Date:  2020-09-08       Impact factor: 8.775

6.  The influence of distance and quality on utilisation of birthing services at health facilities in Eastern Region, Ghana.

Authors:  Winfred Dotse-Gborgbortsi; Duah Dwomoh; Victor Alegana; Allan Hill; Andrew J Tatem; Jim Wright
Journal:  BMJ Glob Health       Date:  2020-02-10

7.  Spatial inequalities in skilled attendance at birth in Ghana: a multilevel analysis integrating health facility databases with household survey data.

Authors:  Winfred Dotse-Gborgbortsi; Andrew J Tatem; Victor Alegana; C Edson Utazi; Corrine Warren Ruktanonchai; Jim Wright
Journal:  Trop Med Int Health       Date:  2020-07-18       Impact factor: 3.918

8.  Global holiday datasets for understanding seasonal human mobility and population dynamics.

Authors:  Shengjie Lai; Alessandro Sorichetta; Jessica Steele; Corrine W Ruktanonchai; Alexander D Cunningham; Grant Rogers; Patrycja Koper; Dorothea Woods; Maksym Bondarenko; Nick W Ruktanonchai; Weifeng Shi; Andrew J Tatem
Journal:  Sci Data       Date:  2022-01-20       Impact factor: 6.444

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.