| Literature DB >> 32275720 |
Jun Wang1,2, Qihui Chen3, Gang Chen4, Yingxiang Li4, Guoshu Kong1,2, Chen Zhu3.
Abstract
This study uses a Mendelian randomization approach to resolve the difficulties of identifying the causal relationship between height and earnings by using a unique sample of 3,427 respondents from mainland China with sociodemographic information linked to individual genotyping data. Exploiting genetic variations to create instrumental variables for observed height, we find that while OLS regressions yield that an additional centimeter in height is associated with a 10-13% increase in one's annual earnings, IV estimates reveal only an insubstantial causal effect of height. Further analyses suggest that the observed height premium is likely to pick up the impacts of several cognitive/noncognitive skills on earnings confounded in previous studies, such as mental health, risk preference, and personality factors. Our study is the first empirical study that employs genetic IVs in developing countries, and our results contribute to the recent debate on the mechanism of height premium.Entities:
Year: 2020 PMID: 32275720 PMCID: PMC7147798 DOI: 10.1371/journal.pone.0230555
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Channels identified using genetic instruments.
Summary statistics of the analytical sample (N = 3,427).
| Variable | Mean/Percentage | Std.Dev. |
|---|---|---|
| Annual Earnings (in CNY) | 138,927 | 179,554 |
| Natural Logarithm of Annual Earnings | 11.570 | 0.893 |
| Age | 30.245 | 6.667 |
| Years of Schooling | 15.604 | 2.237 |
| Self-reported Height (in centimeter) | 169.088 | 8.331 |
| Male | 56.1% | - |
| Polygenic Score of Height [ | -0.598 | 0.446 |
| Risk Loving | 5.613 | 2.136 |
| Altruism | 4.862 | 2.490 |
| Trust | 5.084 | 2.719 |
| Polygenic Score of Cognitive Ability [ | -0.010 | 0.171 |
| Polygenic Score of Depression [ | 32.628 | 4.233 |
| Polygenic Score of Delay Discounting [ | 0.074 | 0.087 |
| Polygenic Score of Reproduction Preference [ | 0.029 | 0.166 |
Source: Data are drawn from the consumer information base of WeGene. Summary statistics are calculated by the author.
Fig 2Spatial distributions of observations.
Fig 3Distribution of the genetic instrumental variable.
2SLS results.
| Outcome variables | Model 1: Pooled | Model 2: Male | Model 3: Female | Model 4: Age 30–50 | ||||
|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | |
| 2SLS | First Stage | 2SLS | First Stage | 2SLS | First Stage | 2SLS | First Stage | |
| Ln(income) | Height | Ln(income) | Height | Ln(income) | Height | Ln(income) | Height | |
| Height | 0.0056 | - | 0.0063 | - | 0.0029 | - | 0.0075 | - |
| (0.0107) | - | (0.0172) | - | (0.0137) | - | (0.0152) | - | |
| Male | 0.0217 | 10.1720 | - | - | - | - | 0.0031 | 10.5547 |
| (0.0377) | (0.3530) | - | - | - | - | (0.0554) | (0.5591) | |
| Age | 0.2106 | -0.1478 | 0.2482 | -0.2206 | 0.1771 | -0.1977 | 0.2033 | 0.0973 |
| (0.0195) | (0.1608) | (0.0264) | (0.2207) | (0.0290) | (0.2440) | (0.0765) | (0.7687) | |
| Age^2 | -0.0024 | 0.0004 | -0.0030 | 0.0013 | -0.0019 | 0.0014 | -0.0023 | -0.0022 |
| (0.0003) | (0.0024) | (0.0004) | (0.0032) | (0.0004) | (0.0036) | (0.0010) | (0.0101) | |
| Years of schooling | 0.0495 | 0.0419 | 0.0212* | 0.0464 | 0.0755 | -0.0428 | 0.0509 | 0.0165 |
| (0.0092) | (0.0848) | (0.0121) | (0.1146) | (0.0142) | (0.1318) | (0.0132) | (0.1326) | |
| Risk loving | 0.0537 | 0.4047 | 0.0434 | 0.3397 | 0.0693 | 0.5015 | 0.0340 | 0.4152 |
| (0.0100) | (0.0833) | (0.0135) | (0.1128) | (0.0153) | (0.1290) | (0.0145) | (0.1331) | |
| Altruism | 0.0012 | 0.1350 | 0.0036 | 0.0677 | 0.0017 | 0.1835 | 0.0093 | 0.1253 |
| (0.0080) | (0.0737) | (0.0107) | (0.1019) | (0.0123) | (0.1116) | (0.0115) | (0.1161) | |
| Trust | 0.0010 | -0.2059 | -0.0100 | -0.1447 | 0.0115 | -0.2547 | 0.0013 | -0.0918 |
| (0.0076) | (0.0669) | (0.0096) | (0.0902) | (0.0120) | (0.1044) | (0.0109) | (0.1089) | |
| Cognitive ability | -0.1092 | -0.1979 | -0.1027 | -0.1497 | -0.0250 | -0.1688 | -0.0316 | -1.8792 |
| (0.1104) | (1.0313) | (0.1470) | (1.4084) | (0.1673) | (1.5711) | (0.1634) | (1.6424) | |
| Depression | -0.0052 | 0.0323 | -0.0070 | 0.0091 | -0.0002 | 0.0410 | -0.0145 | 0.0043 |
| (0.0042) | (0.0407) | (0.0055) | (0.0547) | (0.0065) | (0.0631) | (0.0063) | (0.0647) | |
| Delay discounting | -0.5255 | -2.3509 | -0.7054 | -0.8149 | -0.1790 | -3.1801 | -0.3644 | -1.8665 |
| (0.2182) | (2.0091) | (0.2869) | (2.7791) | (0.3279) | (2.9905) | (0.3129) | (3.1492) | |
| Reproduction preference | 0.0148 | -1.3113 | -0.0257 | 0.8948 | -0.0201 | -4.3051 | 0.0949 | -2.2624 |
| (0.1125) | (1.0587) | (0.1519) | (1.4567) | (0.1768) | (1.5927) | (0.1636) | (1.6250) | |
| Constant | 109.9583 | 96.1696 | 12.0476 | 86.0966 | -0.4013 | 39.8194 | 3.8526 | 15.5517 |
| (80.6906) | (200.3519) | (146.3101) | (204.1028) | (12.8593) | (210.7858) | (40.5838) | (174.1510) | |
| Province FE | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Ancestral controls | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| - | 3.9284 | - | 3.2287 | - | 4.6578 | - | 3.6183 | |
| - | (0.3947) | - | (0.5302) | - | (0.6137) | - | (0.6200) | |
| First-stage F statistic | 85.1053 | 29.9235 | 52.1446 | 34.3092 | ||||
| p-value | 0.0000 | 0.0000 | 0.0000 | 0.0000 | ||||
| Observations | 3,427 | 1,922 | 1,505 | 1,843 | ||||
***
**, and
* indicate statistical significance at the 1%, 5%, and 10% levels, respectively. In all models, we control for province fixed effects and 42 individual ancestry composition variables.
Comparison of observable characteristics by PGS_Height (n = 3,427).
| Observable characteristics | Higher | Lower | Difference | t-statistics | p-value |
|---|---|---|---|---|---|
| Natural Logarithm of Annual Earnings | 11.585 | 11.553 | 0.032 | 0.835 | 0.404 |
| (0.026) | (0.029) | ||||
| Height (in centimeter) | 171.342 | 168.663 | 2.679 | 8.109*** | 0.000 |
| (0.225) | (0.242) | ||||
| Age | 30.150 | 30.353 | -0.202 | -0.756 | 0.450 |
| (0.186) | (0.191) | ||||
| Years of Schooling | 16.429 | 16.445 | -0.016 | -0.186 | 0.852 |
| (0.060) | (0.062) | ||||
| Risk loving | 5.604 | 5.623 | -0.019 | -0.221 | 0.825 |
| (0.060) | (0.062) | ||||
| Altruism | 4.814 | 4.916 | -0.102 | -1.017 | 0.309 |
| (0.067) | (0.075) | ||||
| Trust | 5.096 | 5.070 | 0.026 | 0.234 | 0.815 |
| (0.075) | (0.080) | ||||
| PGS of Cognitive Ability | -0.009 | -0.011 | 0.002 | 0.302 | 0.763 |
| (0.005) | (0.005) | ||||
| PGS of Depression | 32.591 | 32.671 | -0.080 | 0.466 | 0.641 |
| (0.131) | (0.112) | ||||
| PGS of Delay Discounting | 0.074 | 0.074 | 0.000 | -0.010 | 0.992 |
| (0.002) | (0.003) | ||||
| PGS of Reproduction Preference | 0.028 | 0.031 | -0.004 | -0.520 | 0.604 |
| (0.005) | (0.005) |
Descriptive statistics of top 10 ancestries.
| Ancestry/Population | Mean | Std.Dev. | Min | Max |
|---|---|---|---|---|
| Northern Han | 0.5530 | 0.2963 | 0.0000 | 0.9996 |
| Southern Han | 0.2579 | 0.2645 | 0.0000 | 0.9996 |
| Mongolian | 0.0604 | 0.1124 | 0.0000 | 0.7774 |
| Naxi/Yi | 0.0292 | 0.0610 | 0.0000 | 0.9996 |
| Japanese | 0.0202 | 0.0369 | 0.0000 | 0.2173 |
| Gaoshan | 0.0084 | 0.0171 | 0.0000 | 0.1163 |
| Korean | 0.0075 | 0.0105 | 0.0000 | 0.0575 |
| Dai | 0.0070 | 0.0203 | 0.0000 | 0.1696 |
| She | 0.0056 | 0.0096 | 0.0000 | 0.0540 |
| Kinh | 0.0054 | 0.0147 | 0.0000 | 0.1260 |
Source: author’s calculation.
Additional tests of the exclusion restriction assumption.
| (1) | (2) | (3) | |
|---|---|---|---|
| ln(income) | ln(income) | ln(income) | |
| 0.0047 | 0.0057 | 0.0046 | |
| (0.0436) | (0.0403) | (0.0435) | |
| Male | 0.027 | 0.0277 | |
| (0.0364) | (0.0392) | ||
| Age | 0.2042 | 0.2031 | |
| (0.0188) | (0.0202) | ||
| Age^2 | -0.0023 | -0.0023 | |
| (0.0003) | (0.0003) | ||
| Years of Schooling | 0.0498 | 0.0525 | |
| (0.0090) | (0.0096) | ||
| Constant | 11.5921 | 6.8570 | 6,236.7441 |
| (0.0328) | (0.3264) | (2,836.0718) | |
| Additional Controls of Personality and Other Polygenic Scores | Yes | Yes | Yes |
| Province FE | No | No | Yes |
| Ancestral Controls | No | No | Yes |
| Observations | 3,427 | 3,427 | 3,427 |
| R-squared | 0.0006 | 0.1468 | 0.1845 |
***
**, and
* indicate statistical significance at the 1%, 5%, and 10% levels, respectively.
OLS results.
| Variables | (1) | (2) | (3) | (4) | (5) |
|---|---|---|---|---|---|
| Pooled | Pooled | Male | Female | Age 30–50 | |
| ln(income) | ln(income) | ln(income) | ln(income) | ln(income) | |
| Height | 0.0130 | 0.0103 | 0.0138 | 0.0084 | 0.0128 |
| (0.0021) | (0.0023) | (0.0030) | (0.0037) | (0.0035) | |
| Male | -0.0105 | 0.0167 | - | - | 0.0036 |
| (0.0355) | (0.0382) | - | - | (0.0566) | |
| Age | 0.1994 | 0.1995 | 0.2347 | 0.1732 | 0.1888 |
| (0.0181) | (0.0194) | (0.0269) | (0.0290) | (0.0767) | |
| Age^2 | -0.0023 | -0.0023 | -0.0028 | -0.0018 | -0.0021 |
| (0.0003) | (0.0003) | (0.0004) | (0.0004) | (0.0010) | |
| Years of schooling | 0.0525 | 0.0555 | 0.0249 | 0.0880 | 0.0522 |
| (0.0088) | (0.0093) | (0.0124) | (0.0148) | (0.0134) | |
| Risk Loving | - | 0.0542 | 0.0415 | 0.0734 | 0.0374 |
| - | (0.0092) | (0.0122) | (0.0147) | (0.0139) | |
| Altruism | - | 0.0011 | 0.0033 | 0.0025 | 0.0098 |
| - | (0.0081) | (0.0111) | (0.0124) | (0.0120) | |
| Trust | - | 0.0006 | -0.0094 | 0.0090 | -0.0001 |
| - | (0.0073) | (0.0097) | (0.0116) | (0.0113) | |
| Cognitive ability | - | -0.1138 | -0.1152 | -0.0079 | -0.0325 |
| - | (0.1128) | (0.1519) | (0.1751) | (0.1699) | |
| Depression | - | -0.0050 | -0.0071 | 0.0002 | -0.0144 |
| - | (0.0043) | (0.0057) | (0.0068) | (0.0066) | |
| Delay discounting | - | -0.5264 | -0.6943 | -0.1819 | -0.3947 |
| - | (0.2218) | (0.2980) | (0.3415) | (0.3257) | |
| Reproduction preference | - | 0.0092 | -0.0318 | -0.0686 | 0.0583 |
| - | (0.1144) | (0.1562) | (0.1745) | (0.1678) | |
| Constant | 4.3258 | 4,431.2922 | 3,905.0754 | 4,307.0775 | 6,813.2636 |
| (0.4841) | (2,756.9803) | (3,648.9270) | (4,355.0797) | (4,118.1286) | |
| Province FE | Yes | Yes | Yes | Yes | Yes |
| Ancestral controls | Yes | Yes | Yes | Yes | Yes |
| Observations | 3,427 | 3,427 | 1,922 | 1,505 | 1,843 |
| R-squared | 0.1631 | 0.2187 | 0.2344 | 0.2829 | 0.1835 |
***
**, and
* indicate statistical significance at the 1%, 5%, and 10% levels, respectively. In all models, we control for province fixed effects and 42 individual ancestry composition variables.
Relationship between height and additional control variables.
| -1 | -2 | -3 | -4 | -5 | -6 | -7 | -8 | |
| Years of Schooling | Years of Schooling | Risk Loving | Risk Loving | Altruism | Altruism | Trust | Trust | |
| Height | -0.0018 | -0.0008 | 0.0294 | 0.0298 | 0.0247 | 0.0241 | -0.0124 | -0.0115 |
| -0.0062 | -0.0065 | -0.0065 | -0.0067 | -0.0076 | -0.0079 | -0.0083 | -0.0086 | |
| - | -0.1119 | - | -0.0986 | - | -0.0694 | - | 0.0508 | |
| - | -0.1162 | - | -0.1208 | - | -0.1416 | - | -0.1547 | |
| Constant | 16.7378 | 16.4893 | 0.6234 | 0.5003 | 0.7132 | 0.7736 | 7.2942 | 7.1546 |
| -1.0582 | -1.114 | -1.1026 | -1.157 | -1.2967 | -1.3559 | -1.4131 | -1.4808 | |
| Observations | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 |
| R-squared | 0.0001 | 0.0007 | 0.013 | 0.0128 | 0.0067 | 0.0062 | 0.0014 | 0.0012 |
| -1 | -2 | -3 | -4 | -5 | -6 | -7 | -8 | |
| Variables | PGS: Cognitive Ability | PGS: Cognitive Ability | PGS: Depression | PGS: Depression | PGS: | PGS: | PGS: Reproduction Preference | PGS: Reproduction Preference |
| Delay Discounting | Delay Discounting | |||||||
| Height | 0.0003 | 0 | -0.0164 | -0.0183 | -0.0002 | -0.0002 | -0.0004 | -0.0005 |
| -0.0005 | -0.0005 | -0.0182 | -0.0121 | -0.0003 | -0.0003 | -0.0005 | -0.0005 | |
| - | 0.0206 | - | -0.2373 | - | 0.0028 | - | 0.0059 | |
| - | -0.0096 | - | -0.1118 | - | -0.005 | - | -0.0096 | |
| Constant | -0.0578 | -0.0026 | 34.7815 | 35.6082 | 0.1066 | 0.1135 | 0.0965 | 0.1112 |
| -0.0863 | -0.0915 | -3.0966 | -2.0798 | -0.0462 | -0.0478 | -0.0887 | -0.0918 | |
| Observations | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 | 3,427 |
| R-squared | 0.0002 | 0.0032 | 0.0005 | 0.0028 | 0.0003 | 0.0005 | 0.0004 | 0.0007 |
***
**, and
* indicate statistical significance at the 1%, 5%, and 10% levels, respectively.