| Literature DB >> 35994637 |
Nathan G Welch1, Adrian E Raftery1,2.
Abstract
We propose a method for forecasting global human migration flows. A Bayesian hierarchical model is used to make probabilistic projections of the 39,800 bilateral migration flows among the 200 most populous countries. We generate out-of-sample forecasts for all bilateral flows for the 2015 to 2020 period, using models fitted to bilateral migration flows for five 5-y periods from 1990 to 1995 through 2010 to 2015. We find that the model produces well-calibrated out-of-sample forecasts of bilateral flows, as well as total country-level inflows, outflows, and net flows. The mean absolute error decreased by 61% using our method, compared to a leading model of international migration. Out-of-sample analysis indicated that simple methods for forecasting migration flows offered accurate projections of bilateral migration flows in the near term. Our method matched or improved on the out-of-sample performance using these simple deterministic alternatives, while also accurately assessing uncertainty. We integrate the migration flow forecasting model into a fully probabilistic population projection model to generate bilateral migration flow forecasts by age and sex for all flows from 2020 to 2025 through 2040 to 2045.Entities:
Keywords: Bayesian hierarchical model; bilateral migration flows; international migration; probabilistic forecasting
Mesh:
Year: 2022 PMID: 35994637 PMCID: PMC9436307 DOI: 10.1073/pnas.2203822119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Out-of-sample MAE in thousands of migrants per period, MAPE, and 95% prediction interval (PI) coverage for models fitted to all 1990 to 1995 through 2010 to 2015 migration flows and tested on all 39,800 2015 to 2020 flows
| 95% PI | ||||||
|---|---|---|---|---|---|---|
| Method | MAE | MAPE | Flow | In | Out | Net |
| Historic mean flow | 1.2 | 139 | — | — | — | — |
| Persistence |
| 79 | — | — | — | — |
| Gravity model | 3.0 | 1,565 | 86 | 77 | 80 | 99 |
| Poisson hurdle model | 10.0 | 25,649 | 90 | 66 | 65 | 48 |
| Bayesian flow model | 1.2 |
|
|
|
|
|
A bold-face entry indicates the most accurate number for that metric.
Fig. 1.(A–D) Observed 2015 to 2020 (A) flow, (B) total country inflows, (C) total country outflows, and (D) total country net flows compared to Bayesian hierarchical model median forecasts colored by United Nations Area and sized according to the absolute error in millions of people.
Fig. 2.(A and B) Global migration flows in millions of migrants (A) and in percentage of global population migrating (B) during 5-y periods observed from 1990 through 2020 with median forecast (solid line) and 90% prediction interval for 5-y periods from 2020 through 2045.
Total global migration in millions of migrants per 5-y period and percentage of the population migrating
| 2040 to 2045 forecast | ||||
|---|---|---|---|---|
| 2015 to 2020 | 5% | 50% | 95% | |
| Sum of global flows, millions | 96 | 119 | 142 | 176 |
| Percentage of population migrating | 1.3 | 1.3 | 1.5 | 1.9 |
The columns correspond to the 5th, 50th (median), and 95th percentiles of the predictive distribution.
Fig. 3.(A–H) Observations and 90% prediction interval forecasts in millions of people per 5-y period for (A) total net flow, (B) population, (C) total inflow, (D) total outflow, (E) population by age and sex (black denotes 2015 to 2020 period), and (F–H) bilateral flows with Germany as origin in descending order by historic magnitude.
Bayesian flow model notation
| Parameter | Definition |
|---|---|
|
| Integer-valued flow (data) from origin |
|
| Number of migrants departing origin |
|
| Number of person years in origin |
|
| Out-migration rate in migrants per 1,000 person years for origin |
|
| Conditional probability of moving to |
|
| Conditional probability vector of migrants from origin |
|
| Global weight on mean departure rate function |
|
| Long-term mean out-migration rate for origin |
|
| Temporal variation around departure rate from origin |
|
| Destination weight at time |
|
| Mean destination weight for |
|
| Variance of destination weights |
|
| Grand mean of long-term departure rates |
|
| User-specified prior for long-term departure rate grand mean |
|
| User-specified variance about the long-term departure rate grand mean |
| User-specified parameters for temporal variation around departure rates | |
| User-specified parameters for destination probability variation for origin | |
|
| Number of countries |
Gravity model covariates
| Parameter | Definition |
|---|---|
|
| Population of country |
|
| Distance between capitals of country |
|
| Potential support ratio for country |
|
| Infant mortality ratio for country |
|
| Percentage of the population living in urban settings for country |
|
| Land area of country |
|
| Landlocked indicator for country |
|
| Indicator of shared land border between countries |
|
| Indicator of shared official language in countries |
|
| Indicator of colonial relationship between countries |
|
| First year of period |
|
| Variation not explained by the model |