| Literature DB >> 23593239 |
Matthijs J H M van der Loos1, Cornelius A Rietveld, Niina Eklund, Philipp D Koellinger, Fernando Rivadeneira, Gonçalo R Abecasis, Georgina A Ankra-Badu, Sebastian E Baumeister, Daniel J Benjamin, Reiner Biffar, Stefan Blankenberg, Dorret I Boomsma, David Cesarini, Francesco Cucca, Eco J C de Geus, George Dedoussis, Panos Deloukas, Maria Dimitriou, Guðny Eiriksdottir, Johan Eriksson, Christian Gieger, Vilmundur Gudnason, Birgit Höhne, Rolf Holle, Jouke-Jan Hottenga, Aaron Isaacs, Marjo-Riitta Järvelin, Magnus Johannesson, Marika Kaakinen, Mika Kähönen, Stavroula Kanoni, Maarit A Laaksonen, Jari Lahti, Lenore J Launer, Terho Lehtimäki, Marisa Loitfelder, Patrik K E Magnusson, Silvia Naitza, Ben A Oostra, Markus Perola, Katja Petrovic, Lydia Quaye, Olli Raitakari, Samuli Ripatti, Paul Scheet, David Schlessinger, Carsten O Schmidt, Helena Schmidt, Reinhold Schmidt, Andrea Senft, Albert V Smith, Timothy D Spector, Ida Surakka, Rauli Svento, Antonio Terracciano, Emmi Tikkanen, Cornelia M van Duijn, Jorma Viikari, Henry Völzke, H-Erich Wichmann, Philipp S Wild, Sara M Willems, Gonneke Willemsen, Frank J A van Rooij, Patrick J F Groenen, André G Uitterlinden, Albert Hofman, A Roy Thurik.
Abstract
Economic variables such as income, education, and occupation are known to affect mortality and morbidity, such as cardiovascular disease, and have also been shown to be partly heritable. However, very little is known about which genes influence economic variables, although these genes may have both a direct and an indirect effect on health. We report results from the first large-scale collaboration that studies the molecular genetic architecture of an economic variable-entrepreneurship-that was operationalized using self-employment, a widely-available proxy. Our results suggest that common SNPs when considered jointly explain about half of the narrow-sense heritability of self-employment estimated in twin data (σ(g)(2)/σ(P)(2) = 25%, h(2) = 55%). However, a meta-analysis of genome-wide association studies across sixteen studies comprising 50,627 participants did not identify genome-wide significant SNPs. 58 SNPs with p<10(-5) were tested in a replication sample (n = 3,271), but none replicated. Furthermore, a gene-based test shows that none of the genes that were previously suggested in the literature to influence entrepreneurship reveal significant associations. Finally, SNP-based genetic scores that use results from the meta-analysis capture less than 0.2% of the variance in self-employment in an independent sample (p≥0.039). Our results are consistent with a highly polygenic molecular genetic architecture of self-employment, with many genetic variants of small effect. Although self-employment is a multi-faceted, heavily environmentally influenced, and biologically distal trait, our results are similar to those for other genetically complex and biologically more proximate outcomes, such as height, intelligence, personality, and several diseases.Entities:
Mesh:
Year: 2013 PMID: 23593239 PMCID: PMC3617140 DOI: 10.1371/journal.pone.0060542
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Tetrachoric correlations in the tendency to engage in self-employment for MZ and DZ twin pairs in STR for pooled males and females, males only, and females only.
| Pooled | Males | Females | ||||
| MZ | DZ | MZ | DZ | MZ | DZ | |
|
| 1,062 | 1,170 | 419 | 469 | 643 | 701 |
| Concordant pairs | 839 | 868 | 320 | 307 | 519 | 561 |
| Discordant pairs | 223 | 302 | 99 | 162 | 124 | 140 |
| Pairwise concordance (%) | 79.0 | 74.2 | 76.4 | 65.5 | 80.7 | 80.0 |
| Tetrachoric | 0.560 | 0.363 | 0.677 | 0.332 | 0.401 | 0.230 |
| s.e. | 0.042 | 0.052 | 0.053 | 0.072 | 0.078 | 0.090 |
n refers to the number of twin pairs; s.e.: standard error.
Results of fitting ACE, AE, and CE models to the tendency to engage in self-employment in STR for pooled males and females, males only, and females only.
| Sample | Model |
| (95% CI) |
| (95% CI) |
| (95% CI) |
|
| AIC |
| Pooled |
| 0.54 | (0.25–0.63) | 0.01 | (0.00–0.25) | 0.45 | (0.37–0.55) | – | – | −4,707.96 |
|
| 0.55 | (0.46–0.63) | – | – | 0.45 | (0.37–0.54) | 0.01 | 0.929 | −4,709.95 | |
|
| – | – | 0.42 | (0.35–0.49) | 0.58 | (0.51–0.65) | 13.60 | <0.001 | −4,696.36 | |
| Males |
| 0.67 | (0.33–0.76) | 0.00 | (0.00–0.28) | 0.33 | (0.24–0.44) | – | – | −1,417.15 |
|
| 0.67 | (0.56–0.76) | – | – | 0.33 | (0.24–0.44) | 0.00 | 1.000 | −1,419.15 | |
|
| – | – | 0.50 | (0.41–0.59) | 0.50 | (0.41–0.59) | 14.27 | <0.001 | −1,404.88 | |
| Females |
| 0.38 | (0.00–0.53) | 0.02 | (0.00–0.38) | 0.60 | (0.47–0.76) | – | – | −3,276.62 |
|
| 0.40 | (0.26–0.53) | – | – | 0.60 | (0.47–0.75) | 0.01 | 0.919 | −3,278.61 | |
|
| – | – | 0.31 | (0.19–0.42) | 0.69 | (0.58–0.81) | 2.50 | 0.114 | −3,276.12 |
For pooled males and females the analyses are based on 2,232 twin pairs (1,062 MZ and 1,170 DZ), for males only on 888 twin pairs (419 MZ and 469 DZ), and for females only on 1,344 twin pairs (643 MZ and 701 DZ). The share of self-employed was 21% for the pooled, 32% for the male, and 13% for the female sample. In all samples we controlled for age and in the pooled sample for sex; A: additive genetic component; C: shared common environment component; E: individual-specific environment component; 95% CI: 95% confidence interval; χ 2: χ 2 test for goodness-of-fit, the baseline model is the ACE model; AIC: Akaike information criterion.
Variance in the tendency to engage in self-employment explained by all autosomal SNPs in a combined sample of RS-I and STR for pooled males and females, males only, and females only.
| Sample |
| s.e. |
|
| Cases | (%) | Controls | (%) |
| Pooled | 0.25 | 0.14 | 0.032 | 6,223 | 905 | (14.5) | 5,318 | (85.5) |
| Males | 0.25 | 0.24 | 0.152 | 2,986 | 618 | (20.7) | 2,368 | (79.3) |
| Females | 0.00 | 0.28 | 0.499 | 3,835 | 353 | (9.2) | 3,482 | (90.8) |
The genetic relationships were estimated from 301,115 directly genotyped autosomal SNPs that were available in both studies. All analyses controlled for age, study, and the first 10 principal components of the genetic similarity matrix of the combined sample of RS-I and STR. In the pooled sample we also controlled for sex. The results did not change markedly when 4 or 20 principal components were included; σ 2/σ 2: proportion of phenotypic variance explained by the variance of the total additive genetic effects of the 301,115 autosomal SNPs; s.e.: standard error; p-value: p-value from a likelihood ratio (LR) test assuming that the LR is distributed as a 50∶50 mixture of zero and χ 1 2.
Descriptive statistics of the sixteen discovery studies and the replication study.
| Pooled | Males | Females | Demographics | |||||
| Study | Cases | Controls | Cases | Controls | Cases | Controls | Mean age | SD age |
| AGES | 529 | 2,690 | 439 | 913 | 90 | 1,777 | 51.2 | 6.5 |
| ASPS | 46 | 788 | 26 | 336 | 20 | 452 | 65.2 | 8.1 |
| ERF | 214 | 857 | 113 | 366 | 101 | 491 | 47.2 | 13.4 |
| GHS | 424 | 2,706 | 282 | 1,332 | 142 | 1,374 | 55.9 | 10.9 |
| H2000 | 228 | 1,895 | 145 | 890 | 83 | 1,005 | 50.7 | 11.1 |
| HBCS | 265 | 1,459 | 141 | 595 | 124 | 864 | 61.5 | 2.9 |
| HRS | 1947 | 4273 | 1048 | 1780 | 899 | 2493 | 63.6 | 7.9 |
| KORA S4 | 177 | 1,588 | 121 | 760 | 56 | 828 | 53.8 | 8.8 |
| NFBC1966 | 462 | 3,772 | 322 | 1,718 | 140 | 2,054 | 31.0 | 0.0 |
| NTR1 | 201 | 1,354 | 94 | 494 | 107 | 860 | 46.4 | 13.3 |
| NTR2 | 166 | 818 | 77 | 355 | 89 | 463 | 51.0 | 13.8 |
| RS-I | 531 | 4,843 | 319 | 1,994 | 212 | 2,849 | 68.8 | 8.8 |
| RS-II | 197 | 1,869 | 113 | 848 | 84 | 1,021 | 64.8 | 8.0 |
| RS-III | 209 | 1,716 | 138 | 746 | 71 | 970 | 56.1 | 5.8 |
| SardiNIA | 740 | 3,402 | 515 | 1,207 | 225 | 2,195 | 46.3 | 17.1 |
| SHIP | 157 | 3,906 | 107 | 1,891 | 50 | 2,015 | 49.7 | 16.3 |
| THISEAS | 204 | 481 | 176 | 243 | 28 | 238 | 51.1 | 11.2 |
| TwinsUK | 822 | 2,333 | – | – | 730 | 2,165 | 54.5 | 12.4 |
| YFS | 215 | 2,143 | 89 | 1,194 | 126 | 949 | 37.6 | 5.0 |
| Total discovery | 7,734 | 42,893 | 4,265 | 17,662 | 3,377 | 25,063 | 53.4 | 9.4 |
| STR | 737 | 2,534 | 484 | 925 | 253 | 1,609 | 60.6 | 4.3 |
| Total combined | 8,471 | 45,427 | 4,749 | 18,587 | 3,630 | 26,672 | 53.8 | 9.1 |
AGES: Age, Gene/Environment Susceptibility–Reykjavik Study; ASPS: Austrian Stroke Prevention Study; ERF: Erasmus Rucphen Family study; GHS: Gutenberg Health Study; H2000: Health 2000; HBCS: Helsinki Birth Cohort Study; HRS: Health and Retirement Study; KORA S4: Cooperative Health Research in the Region of Augsburg; NFBC1966: Northern Finland Birth Cohort 1966; NTR1: Netherlands Twin Register Cohort 1; NTR2: Netherlands Twin Register Cohort 2; RS-I: Rotterdam Study Baseline; RS-II: Rotterdam Study Extension of Baseline; RS-III: Rotterdam Study Young; SardiNIA: SardiNIA Study of Aging; SHIP: Study of Health in Pomerania; THISEAS: The Hellenic study of Interactions between SNPs & Eating in Atherosclerosis Susceptibility; TwinsUK: the UK Adult Twin Registry; YFS: the Cardiovascular Risk in Young Finns Study; STR: Swedish Twin Registry; Cases: number of participants that were at least once self-employed; Controls: number of participants that were not, and ideally never, self-employed; SD: standard deviation.
The number of male participants was insufficient for a male stratified analysis.
Figure 1Q–Q plots of the self-employment discovery meta-analyses.
Q–Q plot of the self-employment discovery meta-analysis for (A) pooled males and females, (B) males only, and (C) females only. The grey shaded areas in the Q–Q plots represent the 95% confidence bands around the p-values.
Figure 2Manhattan plots of the self-employment discovery meta-analyses.
Manhattan plot of the self-employment discovery meta-analysis for (A) pooled males and females, (B) males only, and (C) females only. SNPs are plotted on the x-axis according to their position on each chromosome against association with self-employment on the y-axis (shown as −log10 p-value). The solid line indicates the threshold for genome-wide significance (p<5×10−8) and the dashed line the threshold for suggestive SNPs (p<1×10−5).
Top SNPs (p<1×10−5) from the self-employment discovery meta-analyses for pooled males and females, males only, and females only.
| SNP | Chr. | Pos. | Effect /non-effect allele | EAF |
| Direction | Nearest gene | Number of SNPs in region |
| Pooled | ||||||||
| rs6906622 | 6 | 18,596,287 | T/C | 0.21 | 4.10×10−6 | ++−++++++++−++++?++ |
| 12 |
| rs2358531 | 5 | 75,515,542 | A/G | 0.71 | 4.79×10−6 | −−−?−−−−−−+−−+−−?−− |
| 2 |
| rs10776614 | 10 | 49,433,172 | T/C | 0.16 | 4.79×10−6 | −+−−−−−+−−−−−−−−?−− |
| 2 |
| rs17166082 | 7 | 131,363,900 | A/G | 0.06 | 5.82×10−6 | −?−?−−?−−+−−−−−−?−− |
| 1 |
| rs994208 | 14 | 33,531,622 | C/G | 0.66 | 6.11×10−6 | −+−−−−−−−−−−−−−−?−− |
| 1 |
| rs3847697 | 12 | 57,282,257 | T/C | 0.44 | 6.79×10−6 | −−+−−−−−?−+−+−?−?−− |
| 1 |
| rs3742467 | 14 | 49,709,284 | T/C | 0.88 | 9.11×10−6 | +++++−?+−+++−+++?++ |
| 1 |
| Males | ||||||||
| rs6738407 | 2 | 196,851,876 | A/G | 0.20 | 1.52×10−7 | −−−−−−−−−+−−−−−−?− |
| 18 |
| rs6825440 | 4 | 183,636,063 | A/T | 0.24 | 4.25×10−6 | −+−−−−−−−−+−−−+−?− |
| 1 |
| rs7904494 | 10 | 72,056,694 | A/T | 0.78 | 6.74×10−6 | +−+−−−?−−++−−−−−?− |
| 1 |
| rs4867424 | 5 | 32,331,331 | T/C | 0.49 | 8.39×10−6 | −−+−−−−−−−−−−−−−?− |
| 1 |
| rs2712008 | 4 | 38,752,396 | T/G | 0.14 | 9.94×10−6 | +−++++?+++++−+++?+ |
| 1 |
| Females | ||||||||
| rs2331548 | 4 | 170,199,179 | A/G | 0.96 | 1.93×10−6 | ??+?++++++++++++?++ |
| 1 |
| rs521326 | 6 | 52,927,336 | A/G | 0.61 | 2.92×10−6 | −−−−−−−−−−−−+−−−?−− |
| 5 |
| rs1022335 | 2 | 145,813,253 | A/T | 0.37 | 3.02×10−6 | −−−−−−?−−−−−+−−−?−− |
| 1 |
| rs10753804 | 1 | 168,583,032 | T/C | 0.49 | 3.92×10−6 | −−−−−−?−−−−+−−−−?−− |
| 2 |
| rs562487 | 5 | 78,442,190 | A/G | 0.48 | 4.49×10−6 | +++++−++−+−++−++?++ |
| 2 |
| rs9557259 | 13 | 99,031,403 | T/C | 0.06 | 5.16×10−6 | ??−?++?++++++?????+ |
| 1 |
| rs1383043 | 4 | 123,562,066 | A/G | 0.38 | 6.05×10−6 | −−+−−−−+−−−−−−−−??+ |
| 2 |
| rs9578700 | 13 | 23,775,308 | A/G | 0.67 | 6.53×10−6 | −+++−−−−−−−−−−−−?−+ |
| 2 |
Chr.: chromosome; Pos.: position; EAF: average effect allele frequency; In the column “direction”, the studies are in the following order: 1. AGES, 2. ASPS, 3. ERF, 4. GHS, 5. H2000, 6. HBCS, 7. HRS, 8. KORA, 9. NFBC1966, 10. NTR1, 11. NTR2, 12. RS-I, 13. RS-II, 14. RS-III, 15. SardINIA, 16. SHIP, 17. THISEAS, 18. TwinsUK (pooled and female sample)/YFS (male sample), 19. YFS (pooled and female sample); A question mark indicates that the SNP was not tested in that specific study; For SNPs that were located close together in the same region, only the most significant SNP is included in the table. The last column shows the number of neighboring SNPs that exceed the threshold for suggestive SNPs.
Figure 3Prediction results.
Variance explained (Nagelkerke pseudo-R 2 from logistic regression) vs. p-value threshold p T for including SNPs in the score calculation.