| Literature DB >> 35664134 |
Mingming Li1, Xinxin Xu2.
Abstract
Although the Chinese government has shifted from a one-child policy to a two-child policy (allowing a couple to have up to two children) since 2016 in response to the aging population, the policy results have been unsatisfactory. This is the first paper to systematically investigate the factors influencing residents' intentions to have a second child. The research focuses on the perspective of individual, family, and social characteristics based on the Chinese General Social Survey (CGSS) from 2017 to 2018. Three machine learning methods are used in conjunction with logistic regression to reveal that the intention of having a second child increases heavily with age, more siblings in the family of origin, and better health. The family income, which is currently the focus of the literature and is statistically significant, is only sixth most important. This study further reveals differences between genders: Women with a lower level of education and religious beliefs prefer to have a second child, whereas for men, non-agricultural hukou and marriage are the position factors. The results of this study also illustrate the importance of future research focusing on the relationship of individuals to their family of origin and districts.Entities:
Keywords: XG-boost; artificial neural network; fertility intentions; logistic regression; machine learning; random forest; two-child policy
Year: 2022 PMID: 35664134 PMCID: PMC9161298 DOI: 10.3389/fpsyg.2022.883317
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Common two-layer and three-layer ANN models.
Confusion matrix.
| Predicted value | ||||
| 0 | 1 | Sum | ||
| Actual value | 0 | TN | FP | FP + TN |
| 1 | FN | TP | TP + FN | |
| Sum | FN + TN | TP + FP | TP + FN + FP + TN | |
Independent variable statement.
| Variable | Question | Value assignment | |
| Individual characteristics | Gender | What is your gender? | Male: 1, Female: 0 |
| Age | What is your date of birth? | For CGSS (2017): 2017–year of birth +1; for CGSS (2018): 2018–year of birth +1 | |
| Ethnicity | What is your ethnicity? | Han: 1, Other ethnicities: 0 | |
| Health status | What do you feel is your current physical condition? | Very unhealthy: 1, relatively unhealthy: 2, average: 3, relatively healthy: 4, very healthy: 5 | |
| Education level | What is your current level of education? | Illiterate: 0, Elementary: 1, Middle: 2, High: 3, Bachelor: 4, Graduate and above: 5 | |
| Family characteristics | Marital status | What is your current marital status? | No spouse: 0; first marriage with spouse: 1; remarriage with spouse: 2 |
| Family income level | Where does your family’s economic status fall in terms of location? | Far below average: 1, below average: 2, average taken: 3, above average taken 4, far above average taken: 5 | |
| Sibling | How many siblings you have? | Take values by the answered number | |
| Social characteristics | District | Which area do you live in? | West: 0, Central: 1, East: 2, Northeast: 3 |
| Religion | Do you have religious beliefs? | No: 0, Yes: 1 | |
| Health insurance | Do you have health insurance? | No: 0, Yes: 1 | |
| Hukou | What is your hukou status? | Agricultural hukou: 0, Non-agricultural hukou: 1 |
Confusion matrix of artificial neural network.
| The actual value in the test set | |||
| 1212 | 945 | ||
| Prediction | 0 | 731 | 348 |
| 1 | 481 | 597 | |
FIGURE 2Relationship between the out-of-bag error rate and the number of decision trees.
Confusion matrix of random forest (RF).
| The actual value in the test set | |||
| 1080 | 1077 | ||
| Prediction | 0 | 659 | 430 |
| 1 | 421 | 647 | |
Importance of feature variables.
| Variable | Importance |
| Age | 1 |
| Sibling | 2 |
| Health | 3 |
| Education | 4 |
| District | 5 |
| Income | 6 |
| Marriage | 7 |
| Hukou | 8 |
| Gender | 9 |
| Health_insurance | 10 |
FIGURE 3ROC curve.
Confusion matrix of XG-boost.
| The actual value in the test set | |||
| 1145 | 1058 | ||
| Prediction | 0 | 697 | 392 |
| 1 | 448 | 620 | |
FIGURE 4Importance of variables in XG-boost model.
Comparison of machine learning model results.
| Model | B-P ANN | RF | XG-boost |
| Precision rate | 64.10% | 61.00% | 63.12% |
| Recall rate | 63.64% | 59.88% | 61.56% |
Logistic full-sample regression results.
| Variable | Coefficient | Exp(β) | Significance | |
| Individual characteristics | Gender (Female) | |||
| Male | 0.029 | 1.029 | ||
| Age | 0.005 | 1.005 |
| |
| Ethnicity (Other ethnicities) | ||||
| Han | 0.039 | 1.040 |
| |
| Health | –0.009 | 0.991 | ||
| Education (Illiterate) | ||||
| Primary | 0.022 | 1.022 | ||
| Junior_high | –0.002 | 0.998 | ||
| High | –0.036 | 0.965 |
| |
| Bachelor | –0.009 | 0.991 |
| |
| Master and above | 0.021 | 1.021 | ||
| Family characteristics | Marital status (No spouse) | |||
| First_married | 0.052 | 1.053 |
| |
| Remarried | –0.013 | 0.987 | ||
| Family income | 0.024 | 1.024 |
| |
| Number of siblings | 0.041 | 1.042 |
| |
| Social characteristics | District (West) | |||
| East | –0.038 | 0.963 |
| |
| Central | 0.136 | 1.146 |
| |
| Northeast | –0.137 | 0.872 |
| |
| Religion (No religion) | ||||
| Has religion | 0.018 | 1.018 |
| |
| Insurance (No insurance) | ||||
| Has insurance | 0.007 | 1.007 | ||
| Hukou (Agricultural hukou) | ||||
| Non-agricultural hukou | –0.033 | 0.968 |
| |
| Observations | 6732 |
*Significance is indicated by different numbers of asterisk: *p < 0.1, **p < 0.05, ***p < 0.01.
Logistic regression results by gender.
| Variable | Coefficient | Exp(β) | Significance | Coefficient | Exp(β) | Significance | |
|
|
| ||||||
| Female | Male | ||||||
| Individual characteristics | Age | 0.004 | 1.004 |
| 0.005 | 1.005 |
|
| Ethnicity (Other ethnicities) | |||||||
| Han | –0.014 | 0.986 | 0.053 | 1.054 |
| ||
| Health | –0.003 | 0.997 | 0.004 | 1.004 | |||
| Education (Illiterate) | |||||||
| Primary | 0.021 | 1.021 | 0.023 | 1.023 | |||
| Junior_high | 0.004 | 1.004 | 0.000 | 1.000 | |||
| High | –0.064 | 0.938 |
| –0.005 | 0.995 | ||
| Bachelor | –0.034 | 0.967 | 0.004 | 1.004 | |||
| Master and above | 0.019 | 1.019 | 0.015 | 1.015 | |||
| Family characteristics | Marital status (No spouse) | ||||||
| First_married | 0.033 | 1.034 | 0.065 | 1.067 |
| ||
| Remarried | 0.021 | 1.021 | –0.025 | 0.975 | |||
| Income | 0.028 | 1.028 |
| 0.023 | 1.023 |
| |
| Number of siblings | 0.048 | 1.049 |
| 0.031 | 1.031 |
| |
| Social characteristics | District (West) | ||||||
| East | –0.027 | 0.973 | –0.043 | 0.958 |
| ||
| Central | 0.130 | 1.139 |
| 0.143 | 1.154 |
| |
| Northeast | –0.164 | 0.849 |
| –0.121 | 0.886 |
| |
| Religion (No religion) | |||||||
| Has religion | 0.035 | 1.036 |
| 0.005 | 1.005 | ||
| Insurance (No insurance) | |||||||
| Has insurance | 0.054 | 1.055 | –0.023 | 0.977 | |||
| Hukou (Agricultural hukou) | |||||||
| Non-agricultural hukou | –0.023 | 0.977 | 0.094 | 1.099 |
| ||
| Observations | 2295 | 4437 | |||||
*Significance is indicated by different numbers of asterisk: *p < 0.1, **p < 0.05, ***p < 0.01.