| Literature DB >> 35592864 |
Francesco Checchi1, Emilie Sabine Koum Besson1.
Abstract
Introduction: Yemen has experienced widespread insecurity since 2014, resulting in large-scale internal displacement. In the absence of reliable vital events registration, we tried to reconstruct the evolution of Yemen's population between June 2014 and September 2021, at subdistrict (administrative level 3) resolution, while accounting for growth and internal migration.Entities:
Keywords: Armed conflict; Forced displacement; Humanitarian; Internally displaced; Population; Yemen
Year: 2022 PMID: 35592864 PMCID: PMC9111980 DOI: 10.1016/j.jmh.2022.100105
Source DB: PubMed Journal: J Migr Health ISSN: 2666-6235
Input values for parameters, by analysis.
| Parameter (symbol) | Main analysis | Reasonable-low sensitivity scenario | Reasonable-high sensitivity scenario |
|---|---|---|---|
| Monthly flow of IDP households from/to subdistricts ( | Prevalent-data instances in which the reported number of IDP households increased from the previous assessment point were adjusted as follows: | ||
| Average of reasonable-low and reasonable-high adjusted values (see columns to the right). | All values higher than the previous value were changed to the previous value. | All values lower than the next value were changed to the next value. | |
| Monthly prevalence predictions were based on the additive growth model (see text). | |||
| Mean number of IDPs per household | 6.70 (countrywide estimate) ( | 5.34 (assume −20%) | 8.04 (assume +20%) |
| Population of each subdistrict at the base time point, June 2014 ( | WorldPop estimates, corrected for prevalent displacement at that time point. | As for main analysis, but using | As for main analysis, but using |
| Relative change per month due to migration ( | First, we adjusted annual WorldPop estimates for 2014–2020 by eliminating UN-projected natural growth (2.8% per annum). Then, we inter- and extrapolated annual estimates using a natural cubic spline to obtain monthly values | As for main analysis | As for main analysis |
| Proportion of overlap between data on migration changes and displacement data ( | 0.25 | 0.50 | 0 (no overlap) |
| Crude birth rate ( | First, performed smooth interpolation of secular trends according to UN World Population Prospects to obtain an assumed monthly value in the absence of a crisis. Second, came up with value for rural and urban subdistricts based on the ratio of crude birth rate observed in the last DHS survey ( | See | See |
| Crude death rate ( | Same approach as for birth rate (see | See | See |
Fig. 1Reductions in subdistrict missingness achieved after each successive data management step. Only eligible IDP records are included in the denominator.
Confusion matrices summarising the performance of model 1 to correctly guess the number of subdistricts that IDPs came from or went to, out of all subdistricts within the parent district of origin/arrival. Cell percentages are column-wise. For any observed category, a random guess based only on the category frequencies is shown.
| Model to predict the number of different subdistricts of | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Predicted category | Observed category | Random guess based on category frequency: | |||||||||
| 1 | 2 | 3 | 4 | ≥ 5 | |||||||
| 1 | 93.9% | 44.7% | 22.5% | 25.0% | 9.5% | 97.3% | |||||
| 2 | 5.2% | 41.2% | 43.4% | 40.6% | 14.3% | 2.1% | |||||
| 3 | 0.6% | 10.2% | 22.5% | 12.5% | 33.3% | 0.4% | |||||
| 4 | 0.2% | 2.8% | 7.0% | 9.4% | 4.8% | 0.1% | |||||
| ≥ 5 | 0.1% | 1.1% | 4.7% | 12.5% | 38.1% | 0.1% | |||||
| Model to predict the number of different subdistricts of | |||||||||||
| Predicted category | Observed category | Random guess based on category frequency: | |||||||||
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ≥ 10 | ||
| 1 | 81.4% | 19.6% | 2.7% | 0.6% | 0.6% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 82.7% |
| 2 | 14.9% | 52.8% | 32.9% | 15.9% | 5.3% | 2.1% | 1.5% | 0.0% | 0.0% | 0.0% | 10.4% |
| 3 | 2.8% | 17.7% | 36.7% | 30.3% | 14.2% | 7.3% | 5.9% | 0.0% | 0.0% | 0.0% | 3.6% |
| 4 | 0.6% | 6.5% | 16.5% | 27.4% | 22.5% | 15.6% | 1.5% | 13.6% | 9.1% | 0.0% | 1.5% |
| 5 | 0.2% | 1.9% | 6.7% | 12.7% | 27.2% | 21.9% | 8.8% | 9.1% | 13.6% | 1.7% | 0.7% |
| 6 | 0.1% | 0.6% | 1.4% | 6.9% | 13.6% | 14.6% | 22.1% | 6.8% | 22.7% | 3.3% | 0.4% |
| 7 | 0.0% | 0.1% | 0.7% | 0.9% | 4.7% | 14.6% | 26.5% | 22.7% | 13.6% | 5.0% | 0.3% |
| 8 | 0.0% | 0.4% | 0.5% | 2.9% | 4.7% | 8.3% | 14.7% | 18.2% | 22.7% | 11.7% | 0.2% |
| 9 | 0.0% | 0.2% | 1.4% | 2.0% | 5.3% | 9.4% | 8.8% | 13.6% | 0.0% | 13.3% | 0.1% |
| ≥ 10 | 0.0% | 0.0% | 0.5% | 0.6% | 1.8% | 6.2% | 10.3% | 15.9% | 18.2% | 65.0% | 0.3% |
Confusion matrices summarising the performance of a random forest model to correctly guess the percent share of IDPs coming from or to a given subdistrict (expressed as a categorical variable), out of all IDPs from or to the parent district. Cell percentages are column-wise. For any observed category, a random guess based only on the category frequencies is shown.
| Model to impute percent share of IDPs by subdistrict of | ||||||
|---|---|---|---|---|---|---|
| Predicted category | Observed category | Random guess based on category frequency: | ||||
| 1 to 19% | 20 to 39% | 40 to 59% | 60 to 79% | 80 to 99% | ||
| 1 to 19% | 43.4% | 27.5% | 22.5% | 21.9% | 23.1% | 21.6% |
| 20 to 39% | 21.3% | 22.9% | 19.2% | 23.8% | 16.0% | 27.5% |
| 40 to 59% | 12.0% | 17.7% | 28.5% | 21.6% | 14.1% | 25.4% |
| 60 to 79% | 10.0% | 20.2% | 19.0% | 16.7% | 19.2% | 17.2% |
| 80 to 99% | 13.2% | 11.7% | 10.8% | 16.0% | 27.6% | 8.3% |
| Model to impute percent share of IDPs by subdistrict of | ||||||
| Predicted category | Observed category | Random guess based on category frequency: | ||||
| 1 to 19% | 20 to 39% | 40 to 59% | 60 to 79% | 80 to 99% | ||
| 1 to 19% | 61.30% | 22.70% | 15.30% | 12.30% | 21.00% | 60.2% |
| 20 to 39% | 16.30% | 32.10% | 25.70% | 32.90% | 21.40% | 24.3% |
| 40 to 59% | 8.30% | 15.30% | 31.60% | 8.90% | 15.20% | 10.6% |
| 60 to 79% | 9.90% | 25.50% | 22.10% | 38.90% | 30.80% | 4.1% |
| 80 to 99% | 4.10% | 4.30% | 5.30% | 6.90% | 11.60% | 0.8% |
Fig. 2Illustration of four hypothetical scenarios (A to D) in which a group of IDPs arrives to a subdistrict from another subdistrict. In scenario A, nearly all IDPs remain in the subdistrict of refuge throughout the period of interest. In scenario B, only a fraction are left by the second assessment round. In scenario C, all IDPs have left the subdistrict (either returned to their subdistrict of origin, or moved elsewhere) by the second assessment, and in scenario D IDPs have left even before the first assessment, thereby potentially being missed altogether by the displacement tracking system.
Fig. 3Predictions of a generalised additive mixed growth model of IDP population as a function of time. Panel A shows the model's predicted percent change in IDP group size by month since displacement: the thick line is the main analysis prediction, and the shaded area indicates the 95%CI. Predictions using reasonable-high and -low adjustments for non-monotonic series are also shown. Panel B shows the model's predictions (blue dots and 95% confidence bands) and observed total prevalent number of IDPs assessed (orange squares) during any given month after displacement, for the main analysis only.
Fig. 4Countrywide average values of crude birth rate and crude death rate assumed, by scenario. The grey dotted line shows secular trends based on UN projections; the latter are represented by squares and centred at the mid-point of their period of reference. The shaded area indicates the crisis period.
Fig. 5Estimated population of Yemen over time, by scenario.
Estimated population by governorate as of September 2021, and percent change from the June 2014 baseline. Figures include the main analysis and, in parentheses, and reasonable-low and -high scenarios. Estimates (Yemen, 2022) published by UN OCHA as part of the Humanitarian Needs Overview are shown by comparison.
| Governorate | Estimated population (Sep 2021) | Percent change from Jun 2014 | Estimated population (OCHA, Dec 2021) |
|---|---|---|---|
| Abyan | 703,000 (665,000 to 729,000) | 22.4 (15.6 to 26.9) | 618,892 |
| Ad Dali' | 759,000 (721,000 to 781,000) | 20.8 (14.7 to 24.1) | 818,507 |
| Aden | 943,000 (911,000 to 956,000) | 17.8 (13.8 to 19.5) | 1053,455 |
| Al Bayda | 907,000 (866,000 to 926,000) | 19.1 (13.7 to 21.5) | 795,107 |
| Al Hodeidah | 3270,000 (3208,000 to 3221,000) | 13.7 (11.6 to 12.0) | 2996,334 |
| Al Jawf | 680,000 (653,000 to 688,000) | 17.5 (12.8 to 18.7) | 609,953 |
| Al Maharah | 148,000 (139,000 to 156,000) | 24.9 (17.0 to 30.7) | 175,606 |
| Al Mahwit | 807,000 (763,000 to 841,000) | 19.9 (13.4 to 25.0) | 770,920 |
| Amran | 1501,000 (1408,000 to 1595,000) | 20.5 (13.1 to 27.9) | 1221,908 |
| Dhamar | 2188,000 (2050,000 to 2304,000) | 22.1 (14.5 to 28.6) | 2194,159 |
| Hadramawt | 1568,000 (1493,000 to 1607,000) | 21.0 (15.3 to 24.1) | 1551,347 |
| Hajjah | 2273,000 (2203,000 to 2284,000) | 14.7 (11.2 to 15.3) | 2630,678 |
| Ibb | 3412,000 (3253,000 to 3534,000) | 17.0 (11.5 to 21.1) | 3143,818 |
| Lahj | 1148,000 (1085,000 to 1190,000) | 21.9 (15.2 to 26.4) | 1076,296 |
| Ma'rib | 668,000 (483,000 to 925,000) | 108.3 (50.7 to 188.2) | 1067,450 |
| Raymah | 629,000 (603,000 to 643,000) | 17.4 (12.4 to 19.9) | 562,930 |
| Sa'dah | 964,000 (996,000 to 846,000) | 5.3 (8.8 to −7.5) | 934,201 |
| Sana'a | 1331,000 (1279,000 to 1347,000) | 20.4 (15.8 to 21.9) | 1370,798 |
| Sana'a City | 2723,000 (2648,000 to 2729,000) | 15.2 (12.1 to 15.5) | 3296,342 |
| Shabwah | 762,000 (725,000 to 782,000) | 20.2 (14.4 to 23.2) | 676,408 |
| Socotra | 70,000 (67,000 to 72,000) | 21.5 (15.5 to 24.5) | 69,004 |
| Ta'iz | 3699,000 (3615,000 to 3666,000) | 13.5 (10.9 to 12.5) | 3104,579 |
Fig. 6Estimated number of IDPs in Yemen over time, by scenario. Squares with labels (in millions or M) indicate year-end estimates from the Internal Displacement Monitoring Centre (Internal Displacement Monitoring Centre, 2021).
Fig. 7Estimated percentage of IDPs amongst the entire population, by district, as of September 2021. Thick boundaries and text labels denote governorates; light boundaries denote districts.