Literature DB >> 27456657

Changing Polygenic Penetrance on Phenotypes in the 20(th) Century Among Adults in the US Population.

Dalton Conley1, Thomas M Laidley2, Jason D Boardman3, Benjamin W Domingue4.   

Abstract

This study evaluates changes in genetic penetrance-defined as the association between an additive polygenic score and its associated phenotype-across birth cohorts. Situating our analysis within recent historical trends in the U.S., we show that, while height and BMI show increasing genotypic penetrance over the course of 20(th) Century, education and heart disease show declining genotypic effects. Meanwhile, we find genotypic penetrance to be historically stable with respect to depression. Our findings help inform our understanding of how the genetic and environmental landscape of American society has changed over the past century, and have implications for research which models gene-environment (GxE) interactions, as well as polygenic score calculations in consortia studies that include multiple birth cohorts.

Entities:  

Mesh:

Year:  2016        PMID: 27456657      PMCID: PMC4960614          DOI: 10.1038/srep30348

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


This study evaluates changes in polygenic penetrance—defined as the association between a polygenic score (PGS) and its associated phenotype—across recent birth cohorts in the United States. The answer to this question informs our understanding of how the genetic and environmental landscape of American society has changed over the past century, and offers suggestive evidence for the selective influence of environment on genetic expression. Our findings also have important implications for PGS calculations in consortia studies that include multiple birth cohorts. This inquiry would not have been possible even a decade ago, before the development of PGS techniques to predict complex phenotypes1. The approach is not without its limitations; however, the scalar variables provided by PGS construction are unique in that they allow researchers to ask a number of questions that were not possible with latent heritability models. This allows for fresh opportunities to explore a range of issues, from polygenicity of traits to gene-environment (GxE) interactions. In the present paper, we exploit this opportunity by asking whether the associations between PGS and several phenotypes have changed over the course of the 20th century in the U.S. Because the economic, social, and physical environments underwent dramatic changes during this period, it is likely that the association between a PGS and its related phenotype has also evolved as a consequence2. We examine five important phenotypes—height, body mass index (BMI), education, depression, and heart disease—chosen due to their key associations with health and mortality, the different age ranges at which they are salient345, and the fact that GWAS results (for all SNPs and not just top hits) are available for all five678910. We find that while height and BMI show increasing PGS penetrance over the course of the 20th century birth cohorts, education and heart disease exhibit the opposite trend. In contrast, the association between depression and its underlying genetic architecture remained stable over the same period. Additive heritability (for which PGS penetrance is a proxy), independently of how it is measured, is contingent on the social structure. Indeed, heritability is not a fixed parameter across time and place but is always a ‘local perturbation analysis'11. Supposing a phenotype to be the product of a complex process involving both genetics, environment, and perhaps their interactions (that is, y = f(G, E) + ε), a complete analysis would require that we first know the partial derivatives of the unknown function f(G, E). Absent a specified model of f(G, E), many scholars, particularly in the social sciences, have attempted to inductively model gene-environment correlations (rGE) and interactions (GxE). Starting with the seminal paper in this area of scholarship12, most of these studies rely on endogenous measures of environment and/or fail to adequately control for population structure, thereby producing under-identified results that may reflect rGE, GxE, ExE or GxG13. A few exceptions to this trend include studies that deploy nationally-representative, genome-wide data with controls for principle components in order to address population stratification on the genetic side while econometrically exploiting natural experiments on the environmental side to assure exogeneity of environment14. A promising avenue in this regard has been scholarship that takes advantage of data spanning a wide range of birth cohorts to assess how heritability may be changing over the shifting (if unmeasured) environment across decades. For instance, recent research has shown that a PGS for physiological predisposition to tobacco use has exhibited more robust correlations over time with phenotypic measures of smoking in the U.S. population15. Studies which employ sibling and twin comparisons and candidate gene studies show the same pattern of increasing genetic penetrance with respect to tobacco use among recent cohorts1617. These results suggest that as the dangers of tobacco use were publicized in the latter half of the 20th century, the underlying genotype signifying a greater propensity to smoke exerted a more pronounced influence on behavior. Other research shows a similar historical shift in genomic influence on physical characteristics, with increasing associations between genetic architecture and BMI in recent decades for US adults1819. Likewise, twin-based models of the heritability of education appear to show an increasing effect of genotype over a similar time period20. We expand on this literature by focusing on a wider breadth of phenotypes and employ polygenic scores based on millions of SNPs rather than individual markers in identifying historical shifts in genetic expression. Some have argued that these changes reflect the relative increase of genetic over social factors as determinants of complex behavioral traits like smoking, rather than a true increase in the causal association between genetic polymorphisms and phenotypes. This distinction is important because it emphasizes genetic penetrance rather than expression, per se. That is, the social and historical context can, at times, mask small genetic associations because the environment may be ‘pushing’ the phenotype, which limits our ability to observe penetrance16. The social environment can also serve as a trigger (or, alternatively, as a controlling influence) in which differential rates of expression (or methylation) in response to specific environmental signals denotes a biological mechanism, through which the environment causes genes to function in a particular manner21.

Results

We used data from the Health and Retirement Study (HRS). Details about inclusion in the sample and selective attrition can be found in the Supplementary Information notes. Our data are from the 2012 wave of the HRS, and allowed us to observe the consistency of PGS-phenotype correlations across birth cohorts in the mid-20th century among U.S. adults. Respondents were born between 1919 and 1955 and, on average, went on to complete over 13 years of education. Nearly 40% of the respondents self-reported heart disease. Baseline associations between the five traits and their respective polygenic scores (Supplemental Table S2) are significant at conventional alpha levels. The polygenic score for BMI is the best predictor of its associated outcome, followed by education and height. We interacted the PGS for each trait with birth year to predict the corresponding phenotypes in Fig. 1 (model also included main effects for both birth year and phenotype; see Equation 2 in Methods). We find that, while there is tendency for those in later birth cohorts to accrue more education, the predictive power of genotype for education is declining over time. This finding is contrary to some twin-based evidence that the genetic penetrance for education has risen20; this could be due to a number of dynamics including the inherent differences between twin methods and the PGS approach, differences in the birth cohorts studied or changing gender dynamics. (We discuss potential difference sand explanations in depth in the SI on pages 10–11). Similarly, declines in heart disease are matched by declines in the predictive natures of the heart disease PGS. Meanwhile, the predictive power of height and BMI polygenic scores have increased significantly, while depression appears flat. Our results showing an increased PGS penetrance of BMI in particular among more recent cohorts of Americans are broadly consistent with recent researchbased on a more limited polygenic score and other forms of genetic analysis1819.
Figure 1

Predicted standardized values of selected phenotypes by polygenic score (+1 or −1 standard deviations), across birth cohorts among genotyped respondents in the Health and Retirement Study (N = 8,865).

Height (p < 0.05) and BMI (p < 0.001) polygenic scores become more predictive in later birth cohorts while education (p < 0.05) and heart disease (p < 0.05) PGSs become less predictive. Depression does not show a significant trend. The lines show fitted values for those at 1 SD above (gray) and below (black) the mean. Points are based on binned means for two groups of respondents (standardized value below 0, black; standardized value above 0, dark gray). For each group, the distribution of birth years is divided into 20 subgroups with approximately equal numbers. Plotted points are the mean birth year and response for these subgroups.

One potential explanation for these trends in PGS penetrance could be due to changes in the genetic variation in the population that could result from differential fertility and/or genetic assortative mating222324. To assess this latter possibility, we calculated the variance for each of the five PGSs across birth cohorts. These are reported in the Supplemental Information Fig. S5, Panel B. For all the scores, variances are unchanged across birth cohorts, supporting the understanding that changes in PGS predictive power reflect GxE effects that result from a shifting environmental landscape. Namely, if the variance component for G is unchanged, any change in additive heritability or SNP-based PGS prediction is likely due to a shift in the variance component for the environmental portion. We also perform other sensitivity checks related to mortality and sample ascertainment (presented in detail in the Supplementary Information), and find that our results broadly reflect a changing influence in environmental conditions, and do not appear to be driven by biases introduced by the data (see SI, Page 5–10). Likewise, our results become stronger when measurement error for each PGS is taken into consideration through SIMEX analysis (Table S3) and are robust to Huber-White adjustments for clustering by household (Table S4). That said, our power to detect the interaction term is limited for some phenotypes (particularly depression—see Table S2 and SI notes for discussion), so replication of our results will be important.

Discussion

The twentieth century witnessed massive shifts in the social and nutritional environment of the United States. The change from an agrarian society to an industrial and post-industrial one has well documented effects on population health25 and is also associated with the expansion of schooling26, medical improvements27, increased longevity28, and caloric abundance29. Any or all of these changes may influence not only relationships between important phenotypes but between those phenotypes and their underlying genotypes as well. Under this multi-dimensionally shifting environmental regime, the genotypic effects of height and BMI PGSs evince trends of increasing predictive power, while education PGS shows a declining association with years of schooling, perhaps due to policy and structural changes in society that has reduced variation in the phenotype (see Panel B of SI Fig. S4). As nutritional deprivation receded as a restraining force on genetic expression, height and weight could more “accurately” reflect underlying genetic potential as measured by common SNPs. Meanwhile, educational “abundance” had the opposite effect: with the steady expansion of schooling we find that rather than constraints on the full extent of ability being lifted to reveal increasing genetic penetrance, we observed declining genetic prediction among more recent cohorts. During this time, secondary schooling became nearly universal and post-secondary education more common, yet the genetic signal was weakened. Thus, in some cases—like height and BMI— environmental barriers can act to suppress genetic effects, while in others (such as education) such obstacles can act to accentuate genetic associations. This may be a useful dichotomous classification scheme to apply to cohort analysis of genetic influence on other phenotypes going forward.

Materials and Methods

Phenotypes were computed based on RAND Fat Files, version N (which covers data collection up until 2012). We examined: Education: Total years of educational attainment. BMI: Mean BMI over all available waves. Height: Max height over all available waves. Heart Disease: Whether a respondent ever reports heart problems (rXheart). Depression: Mean CESD score over all available waves. This variable had a skewed distribution, so it was transformed via the logarithm (after adding one to everyone’s mean). Sample descriptives are shown in Table S1.

Methods

Polygenic Scores (PGSs) were first suggested in 2007 as flexible tools for quantifying the genetic contribution to a phenotype30. Polygenic scores have several attractive features. First, unlike candidate genes, they are “hypothesis-free” measures—i.e. ex ante knowledge about the biological processes involved is not needed to estimate a score for a particular phenotype. Rather, a polygenic scores casts a wide net across an individual’s entire genome to yield a single quantitative measure of genetic risk, or genetic risk score (GRS)31323334, allowing researchers to explore how genes operate within environments where the biological mechanisms are not yet fully understood35. PGSs were constructed based on publicly available data from recent GWAS (additional details on the genetic data and the construction of polygenic scores are available in the SI)678910. The same approach was conducted with each set of GWAS results. Briefly, SNPs in the HRS genetic database were matched to SNPs with reported results in a GWAS. Since the risk allele is not always readily identifiable, we removed all ambiguous SNPs. For each of these SNPs, a loading was calculated as the number of phenotypically associated reference alleles multiplied by the effect-size estimated in the original GWAS as shown in Equation 1, below. Thus, a polygenic score (PS) for individual is a weighted average across the number of SNPs (n) of the number of reference alleles x (0, 1 or 2) at that SNP multiplied by the score for that SNP (): where SNPs with relatively large p-values will have small effects (and thus be down weighted in creating the composite), so we do not impose a p-value threshold. Loadings were summed across the SNP set to calculate the polygenic score. The score was then standardized to have a mean of 0 and SD of 1 for ease of interpretation (though analysis of raw scores does not change results). Genetic analyses were done using the second-generation PLINK software36. Finally, scores were residualized on the top 10 principal components computed from the non-Hispanic whites in HRS to ensure that none of the reported results are due to changes in population stratification (though results without residualization on PCs do not change, see Fig. S3 of SI). To examine changes in PGS penetrance, we estimated Equation 2: Huber-White correction for the non-independence of spousal pairs does not change results (see Supplementary Information Table S4).

Additional Information

How to cite this article: Conley, D. et al. Changing Polygenic Penetrance on Phenotypes in the 20th Century Among Adults in the US Population. Sci. Rep. 6, 30348; doi: 10.1038/srep30348 (2016).
  29 in total

1.  The Long-Term Consequences of Vietnam-Era Conscription and Genotype on Smoking Behavior and Health.

Authors:  Lauren Schmitz; Dalton Conley
Journal:  Behav Genet       Date:  2015-09-04       Impact factor: 2.805

2.  Genetic and socioeconomic study of mate choice in Latinos reveals novel assortment patterns.

Authors:  James Y Zou; Danny S Park; Esteban G Burchard; Dara G Torgerson; Maria Pino-Yanes; Yun S Song; Sriram Sankararaman; Eran Halperin; Noah Zaitlen
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-19       Impact factor: 11.205

3.  The heritability hang-up.

Authors:  M W Feldman; R C Lewontin
Journal:  Science       Date:  1975-12-19       Impact factor: 47.728

4.  The challenge of causal inference in gene-environment interaction research: leveraging research designs from the social sciences.

Authors:  Jason M Fletcher; Dalton Conley
Journal:  Am J Public Health       Date:  2013-08-08       Impact factor: 9.308

5.  Prediction of individual genetic risk to disease from genome-wide association studies.

Authors:  Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal:  Genome Res       Date:  2007-09-04       Impact factor: 9.043

6.  Lifetime Socioeconomic Status, Historical Context, and Genetic Inheritance in Shaping Body Mass in Middle and Late Adulthood.

Authors:  Hexuan Liu; Guang Guo
Journal:  Am Sociol Rev       Date:  2015-08

Review 7.  Epigenome-wide association studies for common human diseases.

Authors:  Vardhman K Rakyan; Thomas A Down; David J Balding; Stephan Beck
Journal:  Nat Rev Genet       Date:  2011-07-12       Impact factor: 53.242

8.  Influence of life stress on depression: moderation by a polymorphism in the 5-HTT gene.

Authors:  Avshalom Caspi; Karen Sugden; Terrie E Moffitt; Alan Taylor; Ian W Craig; HonaLee Harrington; Joseph McClay; Jonathan Mill; Judy Martin; Antony Braithwaite; Richie Poulton
Journal:  Science       Date:  2003-07-18       Impact factor: 47.728

9.  Polygenic risk, rapid childhood growth, and the development of obesity: evidence from a 4-decade longitudinal study.

Authors:  Daniel W Belsky; Terrie E Moffitt; Renate Houts; Gary G Bennett; Andrea K Biddle; James A Blumenthal; James P Evans; Honalee Harrington; Karen Sugden; Benjamin Williams; Richie Poulton; Avshalom Caspi
Journal:  Arch Pediatr Adolesc Med       Date:  2012-06-01

10.  Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease.

Authors:  Heribert Schunkert; Inke R König; Sekar Kathiresan; Muredach P Reilly; Themistocles L Assimes; Hilma Holm; Michael Preuss; Alexandre F R Stewart; Maja Barbalic; Christian Gieger; Devin Absher; Zouhair Aherrahrou; Hooman Allayee; David Altshuler; Sonia S Anand; Karl Andersen; Jeffrey L Anderson; Diego Ardissino; Stephen G Ball; Anthony J Balmforth; Timothy A Barnes; Diane M Becker; Lewis C Becker; Klaus Berger; Joshua C Bis; S Matthijs Boekholdt; Eric Boerwinkle; Peter S Braund; Morris J Brown; Mary Susan Burnett; Ian Buysschaert; John F Carlquist; Li Chen; Sven Cichon; Veryan Codd; Robert W Davies; George Dedoussis; Abbas Dehghan; Serkalem Demissie; Joseph M Devaney; Patrick Diemert; Ron Do; Angela Doering; Sandra Eifert; Nour Eddine El Mokhtari; Stephen G Ellis; Roberto Elosua; James C Engert; Stephen E Epstein; Ulf de Faire; Marcus Fischer; Aaron R Folsom; Jennifer Freyer; Bruna Gigante; Domenico Girelli; Solveig Gretarsdottir; Vilmundur Gudnason; Jeffrey R Gulcher; Eran Halperin; Naomi Hammond; Stanley L Hazen; Albert Hofman; Benjamin D Horne; Thomas Illig; Carlos Iribarren; Gregory T Jones; J Wouter Jukema; Michael A Kaiser; Lee M Kaplan; John J P Kastelein; Kay-Tee Khaw; Joshua W Knowles; Genovefa Kolovou; Augustine Kong; Reijo Laaksonen; Diether Lambrechts; Karin Leander; Guillaume Lettre; Mingyao Li; Wolfgang Lieb; Christina Loley; Andrew J Lotery; Pier M Mannucci; Seraya Maouche; Nicola Martinelli; Pascal P McKeown; Christa Meisinger; Thomas Meitinger; Olle Melander; Pier Angelica Merlini; Vincent Mooser; Thomas Morgan; Thomas W Mühleisen; Joseph B Muhlestein; Thomas Münzel; Kiran Musunuru; Janja Nahrstaedt; Christopher P Nelson; Markus M Nöthen; Oliviero Olivieri; Riyaz S Patel; Chris C Patterson; Annette Peters; Flora Peyvandi; Liming Qu; Arshed A Quyyumi; Daniel J Rader; Loukianos S Rallidis; Catherine Rice; Frits R Rosendaal; Diana Rubin; Veikko Salomaa; M Lourdes Sampietro; Manj S Sandhu; Eric Schadt; Arne Schäfer; Arne Schillert; Stefan Schreiber; Jürgen Schrezenmeir; Stephen M Schwartz; David S Siscovick; Mohan Sivananthan; Suthesh Sivapalaratnam; Albert Smith; Tamara B Smith; Jaapjan D Snoep; Nicole Soranzo; John A Spertus; Klaus Stark; Kathy Stirrups; Monika Stoll; W H Wilson Tang; Stephanie Tennstedt; Gudmundur Thorgeirsson; Gudmar Thorleifsson; Maciej Tomaszewski; Andre G Uitterlinden; Andre M van Rij; Benjamin F Voight; Nick J Wareham; George A Wells; H-Erich Wichmann; Philipp S Wild; Christina Willenborg; Jaqueline C M Witteman; Benjamin J Wright; Shu Ye; Tanja Zeller; Andreas Ziegler; Francois Cambien; Alison H Goodall; L Adrienne Cupples; Thomas Quertermous; Winfried März; Christian Hengstenberg; Stefan Blankenberg; Willem H Ouwehand; Alistair S Hall; Panos Deloukas; John R Thompson; Kari Stefansson; Robert Roberts; Unnur Thorsteinsdottir; Christopher J O'Donnell; Ruth McPherson; Jeanette Erdmann; Nilesh J Samani
Journal:  Nat Genet       Date:  2011-03-06       Impact factor: 38.330

View more
  14 in total

1.  Mortality selection in a genetic sample and implications for association studies.

Authors:  Benjamin W Domingue; Daniel W Belsky; Amal Harrati; Dalton Conley; David R Weir; Jason D Boardman
Journal:  Int J Epidemiol       Date:  2017-08-01       Impact factor: 7.196

2.  Translating Polygenic Analysis for Prevention: From Who to How.

Authors:  Daniel W Belsky
Journal:  Circ Cardiovasc Genet       Date:  2017-06

3.  Father Absence and Accelerated Reproductive Development in Non-Hispanic White Women in the United States.

Authors:  Lauren Gaydosh; Daniel W Belsky; Benjamin W Domingue; Jason D Boardman; Kathleen Mullan Harris
Journal:  Demography       Date:  2018-08

4.  Modeling Interaction and Dispersion Effects in the Analysis of Gene-by-Environment Interaction.

Authors:  Benjamin W Domingue; Klint Kanopka; Travis T Mallard; Sam Trejo; Elliot M Tucker-Drob
Journal:  Behav Genet       Date:  2021-12-02       Impact factor: 2.805

5.  The Effect of Vietnam-Era Conscription and Genetic Potential for Educational Attainment on Schooling Outcomes.

Authors:  Lauren L Schmitz; Dalton Conley
Journal:  Econ Educ Rev       Date:  2017-10-07

6.  Genotyping the dead: Using offspring as proxy to estimate the genetic correlation of education and longevity.

Authors:  Dalton Conley; Ramina Sotoudeh
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-11       Impact factor: 11.205

7.  The social genome of friends and schoolmates in the National Longitudinal Study of Adolescent to Adult Health.

Authors:  Benjamin W Domingue; Daniel W Belsky; Jason M Fletcher; Dalton Conley; Jason D Boardman; Kathleen Mullan Harris
Journal:  Proc Natl Acad Sci U S A       Date:  2018-01-09       Impact factor: 11.205

Review 8.  How are social determinants of health integrated into epigenetic research? A systematic review.

Authors:  Linnea Evans; Michal Engelman; Alex Mikulas; Kristen Malecki
Journal:  Soc Sci Med       Date:  2021-02-03       Impact factor: 4.634

9.  Hidden heritability due to heterogeneity across seven populations.

Authors:  Felix C Tropf; S Hong Lee; Renske M Verweij; Gert Stulp; Peter J van der Most; Ronald de Vlaming; Andrew Bakshi; Daniel A Briley; Charles Rahal; Robert Hellpap; Anastasia N Iliadou; Tõnu Esko; Andres Metspalu; Sarah E Medland; Nicholas G Martin; Nicola Barban; Harold Snieder; Matthew R Robinson; Melinda C Mills
Journal:  Nat Hum Behav       Date:  2017-09-11

10.  Father absence, age at menarche, and genetic confounding: A replication and extension using a polygenic score.

Authors:  Gabriel L Schlomer; Kristine Marceau
Journal:  Dev Psychopathol       Date:  2020-10-27
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.