Determining the stability of genome-wide factors in BMI between ages 40 to 69 years.

Nathan A Gillespie¹,², Amanda Elswick Gentry¹, Robert M Kirkpatrick¹, Chandra A Reynolds³, Ravi Mathur⁴, Kenneth S Kendler¹, Hermine H Maes⁵, Bradley T Webb¹,⁴, Roseann E Peterson¹.

Abstract

Genome-wide association studies (GWAS) have successfully identified common variants associated with BMI. However, the stability of aggregate genetic variation influencing BMI from midlife and beyond is unknown. By analysing 165,717 men and 193,073 women from the UK Biobank, we performed BMI GWAS on six independent five-year age intervals between 40 and 72 years. We then applied genomic structural equation modeling to test competing hypotheses regarding the stability of genetic effects for BMI. LDSR genetic correlations between BMI assessed at ages 40 to 73 were all very high and ranged from 0.89 to 1.00. Genomic structural equation modeling revealed that molecular genetic variance in BMI at each age interval could not be explained by the accumulation of any age-specific genetic influences or autoregressive processes. Instead, a common set of stable genetic influences appears to underpin genome-wide variation in BMI from middle to early old age in men and women alike.

Year: 2022 PMID: 35951648 PMCID: PMC9398001 DOI: 10.1371/journal.pgen.1010303

Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 6.020

Introduction

The recent decade has witnessed significant advances in the detection of multiple loci underpinning variation in complex traits [1]. Among the most successful endeavors has been genome-wide association scan (GWAS) analyses of adult BMI [2-4]. Notwithstanding the predictive validity of common BMI variants [5], GWAS BMI loci are based on large, aggregated meta-analytic samples drawn from varying geographic and economic regions, different birth cohorts and different age distributions. Here, we examine this last caveat, for it remains to be empirically determined whether the genome-wide variation associated with adult BMI is age-invariant or age-specific. This has public-health consequences. For example, given the positive association of increased age and BMI with COVID-19-related hospitalization and mortality [6,7], determining whether genetic variants in BMI at older ages are qualitatively distinct is important.

BMI heritability & longitudinal genetic correlations

Despite moderate stability across time [8-12], average adult BMI increases from age 20 to age 65, at which time it levels off until age 80 [12], before beginning to decline. Such changes might be attributed to variable contributions of genetic and environmental risks across the lifespan. Apart from birth cohort differences, Dahl et al. [12] found that factors such as an obesity genetic risk score, type-2 diabetes mellitus, cardiovascular disease, substance use, and educational attainment were all differentially predictive of both average BMI and changes in BMI before age 65. In contrast, many of these risks were no longer predictive after age 65. In terms of genetics, whereas the overall lifetime BMI heritability is 0.75 [13], heritability actually increases throughout infancy and adolescence [14] before decreasing during adulthood [13]. In terms of cross-temporal associations, genetic influences in BMI are correlated across time [8-11,14-16], sometimes very highly [17], which indicates continuous expression of the same genetic influences [18]. However, longitudinal genetic correlations for BMI never reach unity. Indeed, there is considerable variability in longitudinal genetic correlations [13,15,16]. This variability is also consistent with age-specific genetic influences, which could be obscured in GWAS meta-analytic results based on data aggregated across age or GWAS analyses with age as a covariate.

Linkage Disequilibrium Score (LDSC) regression

Until recently, estimates of heritability (h2) and genetic correlations (rG) have typically relied on twins reared together or family studies within a structural equation modeling (SEM) framework. The development of LDSC [19] has circumvented the need for twin studies by making it possible to estimate h2 and rG using unrelated and independent samples that have GWAS summary test statistics. Briefly, LDSC regression works by leveraging external linkage disequilibrium (LD) reference panels, which summarize correlations between genetic markers across the human genome, to produce genetic covariance matrices from GWAS summary statistics. In addition to estimating rG, these matrices can then be used within a traditional SEM framework to test hypotheses about comorbidity, or the nature of change. In terms of BMI, this approach could be used to address the question of whether or not genetic risks in BMI are correlated across time. Currently, however, there is a paucity of molecular-based reports examining h2 and rG between BMI assessed at different ages, and they have relied on different approaches to produce mixed results. For instance, Trzaskowski et al. [20] used LDSR to report a genetic correlation (rg = 0.86) between BMI assessed at age 11 and 65. Winkler et al. [21] estimated Spearman rank genetic correlations between BMI assessed in populations above and below age 50, which revealed much smaller correlations (rg = 0.05 to 0.12). Notwithstanding the need for greater precision regarding longitudinal genetic correlations, such correlations are descriptive and provide no insight regarding competing theories underlying developmental processes in BMI. We argue that at least two theoretical mechanisms [22] can be invoked to explain observed continuity in genetic correlations. The first is a common factor process whereby common genetic or environmental factors determine the levels and rates of change in BMI over time. 
In this model, variances and covariances between longitudinal measures of BMI depend on individual genetic or environmental differences in growth patterns unfolding with age, i.e., random growth curves [23-27]. We are aware of three twin studies that have applied genetically informative growth models to longitudinal BMI data [28-30]. Unfortunately, random growth curves do not determine the extent to which stability or changes in BMI are governed by time-invariant versus age-specific genetic influences. To address this question, the second mechanism predicts that variances and covariances are determined by random, time-specific genetic and environmental effects, which are more or less persistent over time, i.e., ‘autoregressive effects’ [31-33]. Illustrated in Fig 1, this approach predicts a causal process of inertial effects, whereby BMI genetics at one time causally affect BMI at the next. We have applied this validated approach to personality [34], anxiety and depression [35,36], substance use [37] and brain aging [38]. We are aware of two reports that have tested autoregression models with respect to BMI data [9,18]. Cornes et al. [9] found evidence of distinct, age-specific genetic influences on BMI at ages 12, 14 and 16. To our knowledge, autoregressive effects have never been tested in adult BMI, especially across a wide window comprising narrow age intervals in adulthood. Fortunately, the recent, innovative application of structural equation modelling (SEM) [39,40] to LDSC regression genetic correlations [39] based on available GWAS results can now address the aforementioned gaps.
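The LD score regression idea described in this section can be illustrated with a minimal sketch. It assumes the standard univariate relation E[χ²_j] = 1 + N·a + (N·h2/M)·ℓ_j, where ℓ_j is the LD score of SNP j, N is the GWAS sample size and M is the number of SNPs. All numbers below are hypothetical and noise-free, and the fit is unweighted ordinary least squares; actual LDSC software uses weighted regression with block-jackknife standard errors.

```python
# Minimal, unweighted LD score regression sketch on synthetic data.
# Assumption: E[chi2_j] = 1 + N*a + (N * h2 / M) * l_j (see lead-in).

def ols(x, y):
    """Return (intercept, slope) of an ordinary least-squares line."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def h2_from_ldsc(ld_scores, chi2, n_samples, n_snps):
    """Convert the regression slope into a SNP-heritability estimate."""
    _, slope = ols(ld_scores, chi2)
    return slope * n_snps / n_samples


# Hypothetical inputs: chi-square statistics generated with h2 = 0.24.
N, M, TRUE_H2 = 50_000, 1_000_000, 0.24
ld_scores = [float(l) for l in range(1, 201)]
chi2 = [1.0 + N * TRUE_H2 / M * l for l in ld_scores]

estimated_h2 = h2_from_ldsc(ld_scores, chi2, N, M)
print(round(estimated_h2, 3))
```

Because the synthetic statistics contain no noise, the slope recovers the generating heritability; with real summary statistics the estimate carries sampling error.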

Fig 1

Autoregression model depicting genome-wide variation in BMI at each age interval.

Autoregression model depicting genome-wide variation in BMI at each age interval.

This development model predicts that genetic variation at each time interval can be decomposed into time-specific variation or ‘innovations’ & the causal contribution of genome-wide genetic variation from previous age intervals. Innovations refer to novel or age-specific genetic influences that are uncorrelated with previous genetic influences. This model also includes residual genetic variation not otherwise explained by the autoregression process. Double-headed arrows denote variation associated with innovations & residuals at each age interval. Beta (β) denotes the causal contribution of genetic variance from one age interval to the next.

By applying genomic structural equation modeling or “genomicSEM” to BMI GWAS summary statistics from the UK Biobank [41], our aim was to determine if genetic influences across middle to early old age were best explained by age-dependent versus age-invariant processes. We also tested if alternative, more parsimonious theoretical explanations i.e., common factor models, whereby covariance between genetic influences across time could be better captured by a single latent factor [42], provided a better fit to the data. Given that standardized estimates of BMI heritability for men and women are statistically equal [13] and that there appear to be no sex differences in terms of the observed adulthood decline in heritability [43], we hypothesized that developmental processes governing changes in heritability over time likewise ought to be comparable across sex.

Methods

All BMI and GWAS summary statistics came from the UK Biobank, a large-scale biomedical database and research resource containing genetic, lifestyle and health information from half a million UK participants. The database, which includes blood samples, heart and brain scans and genetic data from the 500,000 volunteer participants, is globally accessible to approved researchers undertaking health-related research in the public interest [41].

BMI data

Described in detail elsewhere [44], weight was collected from subjects using a Tanita BC418MA body composition analyzer. Standing and sitting height measurements were collected from subjects using a Seca 240 cm height measure. Body mass index (BMI) was calculated as weight divided by height squared (kg/m²). We divided the BMI data into six age intervals: 40–45; 46–50; 51–55; 56–60; 61–65; and 66–73 years. The range was based on available data, whereas the number of age tranches was selected to maximize our power to choose between competing longitudinal and multivariate models without unduly reducing the statistical power of the GWAS analyses at each interval. The numbers of subjects with complete BMI and GWAS summary statistics are shown in S1 Table.
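As a small illustration of the phenotype definition above, the following sketch computes BMI and assigns an age to one of the six intervals exactly as they are listed in this section. The function names are hypothetical, and treating age as whole years with inclusive bounds is an assumption rather than a detail given in the text.

```python
# Age tranches as listed in the text (inclusive bounds, whole years assumed).
AGE_TRANCHES = [(40, 45), (46, 50), (51, 55), (56, 60), (61, 65), (66, 73)]


def bmi(weight_kg, height_m):
    """Body mass index: weight divided by height squared (kg/m^2)."""
    return weight_kg / height_m ** 2


def age_tranche(age_years):
    """Return the index (0-5) of the tranche containing the age, else None."""
    for index, (low, high) in enumerate(AGE_TRANCHES):
        if low <= age_years <= high:
            return index
    return None


# Hypothetical participant: 81 kg, 1.80 m, aged 52.
print(bmi(81.0, 1.80), age_tranche(52))
```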

Genotypic data

Genotype data were filtered according to the Neale Lab pipeline, using filtration parameters and scripts publicly available from the lab GitHub [45]. Samples were filtered to retain only unrelated subjects of British ancestry (n = 359,980). Imputed variants [46] were filtered for INFO scores > 0.8, MAF > 0.001, and HWE p-value > 1e-10.
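The variant-level thresholds above amount to a simple conjunction of three conditions. The sketch below applies them to a few made-up records; the `Variant` class, its field names and the example identifiers are hypothetical and do not reflect the Neale Lab scripts or any real file format.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    rsid: str      # hypothetical identifier
    info: float    # imputation INFO score
    maf: float     # minor allele frequency
    hwe_p: float   # Hardy-Weinberg equilibrium p-value


def passes_qc(v, min_info=0.8, min_maf=0.001, min_hwe_p=1e-10):
    """Thresholds as stated in the text: INFO > 0.8, MAF > 0.001, HWE p > 1e-10."""
    return v.info > min_info and v.maf > min_maf and v.hwe_p > min_hwe_p


variants = [
    Variant("rs_example_1", info=0.95, maf=0.20, hwe_p=0.5),    # passes all filters
    Variant("rs_example_2", info=0.75, maf=0.20, hwe_p=0.5),    # fails INFO
    Variant("rs_example_3", info=0.95, maf=0.0005, hwe_p=0.5),  # fails MAF
    Variant("rs_example_4", info=0.95, maf=0.20, hwe_p=1e-12),  # fails HWE
]
kept = [v.rsid for v in variants if passes_qc(v)]
print(kept)
```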

GWAS analyses

This is a proof-of-principle illustration of the application of structural equation modelling (SEM) to GWAS summary data to test competing longitudinal hypotheses. Since the number of UKB subjects with repeated BMI measures and GWAS summary statistics was insufficient to be divided into the minimum of three age tranches needed to test the autoregression hypothesis, we treated the GWAS summary statistics at each 5-year age interval as pseudo-longitudinal. This resulted in 6 separate and independent age tranches for GWAS. Three separate GWAS analyses were conducted for each age tranche (men, women and combined) using BGENIE (version 1.3) [46]. The first 20 ancestry principal components were included as covariates in all models. Sex was included as a covariate in the combined (men + women) model.

Genomic structural equation modelling

We then applied the GenomicSEM software package [39] in R (version 4.0.3) [47] to the BMI GWAS results to estimate separate genetic variance-covariance (S) and asymptotic sampling covariance ‘weight’ (V) matrices for the male, female and then the combined GWAS results. Estimation of the S and V matrices is a 3-step process. In step 1, the raw GWAS summary statistics were manipulated using the GenomicSEM munge option to remove all SNPs with MAF < 1%, information scores < 0.9, and SNPs in the MHC region. In step 2, we used the GenomicSEM ldsc option to run multivariate LD score regression [39] to estimate the S and V matrices between the GWAS summary statistics. This method has been successfully applied to detect genetic correlations between bio-medical, psychiatric and behavioural phenotypes [48-68], which are commensurate with previous biometrical genetic correlations [69-76] while revealing extensive pleiotropy across a wide variety of phenotypes. In step 3, the S and V matrices were then read into the lavaan (version 0.6–7) [77] SEM software package in R (version 4.0.3) [47] to fit and compare competing longitudinal and multivariate models. All GenomicSEM and lavaan scripts used here are publicly available at https://github.com/ToddWebb/UKBiobank_VIPBG/tree/master/LongitudinalGenomicSEM. The autoregression model predicts that time-specific random genetic or environmental effects are more or less persistent over time (autoregressive effects) [31]. As described by Eaves and others [31-33], autoregression models assume that the covariance structure arises because of random, age-specific genetic or environmental effects, which are, at least partially, carried forward. As illustrated in Fig 1, innovations at each assessment reflect novel, time- or age-specific genetic or environmental influences, which are uncorrelated with previous genetic influences. 
Genetic differences at each occasion are therefore a function of new random effects on the phenotype that arise as well as a (linear) function of individual genetic differences expressed at the preceding time. Here, we assume that cross-temporal correlations arise because the innovations have a more or less persistent effect over time and may, under some circumstances accumulate, potentially giving rise to developmental increases in genetic variance and increased correlations between adjacent measures. One consequence of the autoregressive model is the tendency of cross-temporal correlations to decay as a function of increasing lag-time. Depending on the magnitude of an innovation and its relative persistence, the observed variances and cross-temporal covariances may increase towards a stable asymptotic value. We began by fitting innovations at all six time-intervals, which were then successively dropped. We also specified an autoregression model that included a single innovation at BMI 40–44 accounting for all subsequent genetic variances. Technically, this first innovation includes all genetic variance accumulated up to this first age interval. Finally, we fitted a factor analysis comprising a single factor.
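To make the contrast between the two competing structures concrete, the sketch below computes model-implied correlations for a first-order autoregressive chain and for a single common factor. It is an illustration only: the path coefficients, innovation variances, loadings and residual variances are hypothetical values chosen for readability, not estimates from this study, and the code is not the GenomicSEM or lavaan implementation.

```python
import math


def autoregressive_cov(betas, innovation_vars):
    """Implied covariance of a first-order autoregressive (simplex) chain.

    betas[t] is the path from occasion t to t + 1; innovation_vars[t] is the
    variance of the new genetic influence entering at occasion t.
    """
    n = len(innovation_vars)
    cov = [[0.0] * n for _ in range(n)]
    cov[0][0] = innovation_vars[0]
    for t in range(1, n):
        cov[t][t] = betas[t - 1] ** 2 * cov[t - 1][t - 1] + innovation_vars[t]
        for s in range(t):
            cov[s][t] = cov[t][s] = betas[t - 1] * cov[s][t - 1]
    return cov


def one_factor_cov(loadings, factor_var, residual_vars):
    """Implied covariance of a single common factor plus residual variances."""
    n = len(loadings)
    return [[loadings[i] * loadings[j] * factor_var
             + (residual_vars[i] if i == j else 0.0)
             for j in range(n)] for i in range(n)]


def to_corr(cov):
    """Standardize a covariance matrix into a correlation matrix."""
    n = len(cov)
    return [[cov[i][j] / math.sqrt(cov[i][i] * cov[j][j]) for j in range(n)]
            for i in range(n)]


# Hypothetical parameter values for six occasions.
ar_corr = to_corr(autoregressive_cov([0.9] * 5, [1.0] + [0.19] * 5))
cf_corr = to_corr(one_factor_cov([1.0] * 6, 0.24, [0.01] * 6))

print([round(ar_corr[0][lag], 3) for lag in range(1, 6)])  # shrinks with lag
print([round(cf_corr[0][lag], 3) for lag in range(1, 6)])  # constant across lags
```

Under these assumed values the autoregressive correlations fall geometrically with increasing lag, whereas the common factor correlations are identical at every lag, which is the qualitative signature used to distinguish the two models.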

Model fit indices & comparisons

In GenomicSEM analyses there is no one sample size to speak of. This is because GWAS studies from which the summary statistics are derived can vary in size and subject overlap. Thus, potentially, a different (effective) sample size may apply to each element of S. We were therefore limited to fit indices that do not explicitly depend upon sample size: the pseudo Akaike Information Criterion (pseudoAIC); Comparative Fit Index (CFI); Tucker Lewis Index (TLI); and the Standardized Root Mean Square Residual (SRMR) to judge the best-fitting model. Both the CFI and TLI are incremental fit indices that penalize models with increasing complexity. The SRMR is an absolute measure of fit based on the difference between the observed and predicted correlations under each model, such that a value of zero indicates a perfect fit. The pseudoAIC is a comparative fit index, whereby the model with the lowest AIC value is interpreted as the best-fitting.
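A brief sketch of how two of these indices can be computed may help. It assumes that the pseudoAIC equals the model chi-square plus twice the number of free parameters, and it uses one common textbook definition of the SRMR (root mean square of residual correlations over the unique matrix elements). Both are stated here as assumptions rather than as the exact GenomicSEM or lavaan computations. For the single-factor model with six indicators and the first loading fixed to one, the free parameters are five loadings, one factor variance and six residual variances; with the chi-square of 13.005 reported in Table 2, this assumed formula reproduces the tabulated degrees of freedom (9) and pseudoAIC (37.005).

```python
import math


def n_unique_elements(n_variables):
    """Number of unique variances and covariances in a symmetric matrix."""
    return n_variables * (n_variables + 1) // 2


def pseudo_aic(chi_square, n_free_parameters):
    """Assumed form: chi-square + 2 * number of free parameters."""
    return chi_square + 2 * n_free_parameters


def srmr(observed_corr, implied_corr):
    """Root mean square residual correlation over the unique elements."""
    n = len(observed_corr)
    squared = [(observed_corr[i][j] - implied_corr[i][j]) ** 2
               for i in range(n) for j in range(i + 1)]
    return math.sqrt(sum(squared) / len(squared))


# Single-factor model: 5 free loadings + 1 factor variance + 6 residuals.
free_parameters = 5 + 1 + 6
degrees_of_freedom = n_unique_elements(6) - free_parameters
print(degrees_of_freedom, round(pseudo_aic(13.005, free_parameters), 3))
```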

Results

Combined male & female analyses

The LDSR-based genome-wide genetic correlations between the six GWAS summary statistics, including GWAS sample sizes and the SNP-based heritability for each age interval, are shown in Table 1. The correlations do not decline with increasing time intervals; such a decline would be indicative of a simplex structure best explained by autoregression models. For example, the LDSR genetic correlation (rg) between BMI at ages 40–45 and 66–73 years was higher than the rg between BMI at ages 40–45 and 56–60 years (rg = 0.97 vs 0.93). Overall, the genetic correlations were very high and ranged from 0.93 to 1.00.

Table 1

Sample sizes, estimates of SNP-based heritability (including standard errors along diagonal) & linkage disequilibrium score regression genetic correlations between the six age intervals based on the combined male & female GWAS summary statistics.

	Sample size	1.	2.	3.	4.	5.	6.
1. BMI GWAS 40–44 yrs	34,001	0.23 (0.02)
2. BMI GWAS 45–49 yrs	45,294	1.00	0.26 (0.02)
3. BMI GWAS 50–54 yrs	53,602	0.99	1.00	0.26 (0.02)
4. BMI GWAS 55–59 yrs	64,891	0.93	0.93	0.95	0.29 (0.01)
5. BMI GWAS 60–64 yrs	89,824	0.95	0.94	0.93	0.90	0.24 (0.01)
6. BMI GWAS 65–73 yrs	71,178	0.97	0.96	0.95	0.93	1.00	0.22 (0.01)
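One informal way to check for the lag-dependent decay expected under a simplex structure is to average the tabulated correlations by the number of intervals separating each pair. The sketch below does this with the values copied from Table 1; it is a descriptive aid only and not part of the formal model comparison, and the function name is hypothetical.

```python
# Lower-triangular LDSR genetic correlations copied from Table 1
# (row = later age interval, column = earlier age interval).
RG = [
    [],
    [1.00],
    [0.99, 1.00],
    [0.93, 0.93, 0.95],
    [0.95, 0.94, 0.93, 0.90],
    [0.97, 0.96, 0.95, 0.93, 1.00],
]


def mean_rg_by_lag(lower):
    """Average the correlations by lag (number of intervals apart)."""
    sums, counts = {}, {}
    for row, values in enumerate(lower):
        for col, rg in enumerate(values):
            lag = row - col
            sums[lag] = sums.get(lag, 0.0) + rg
            counts[lag] = counts.get(lag, 0) + 1
    return {lag: sums[lag] / counts[lag] for lag in sorted(sums)}


by_lag = mean_rg_by_lag(RG)
for lag, mean_rg in by_lag.items():
    print(f"lag {lag}: mean rg = {mean_rg:.3f}")
```

If the correlations followed a simplex pattern, these means would fall steadily as the lag grows; the largest lag here (the first versus the last interval) instead has a correlation of 0.97.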

Formal model fitting comparisons are shown in Table 2. We began with a fully saturated autoregression model comprising unique genetic influences or innovations at each age interval (Fig 1). This provided a reasonable fit to the data as judged by the non-significant chi-square, very high CFI and TLI values and very low SRMR. Autoregression sub-models in which the genetic innovations at ages 66 to 73, 61 to 65, and 51 to 55 years were each successively removed provided only marginal improvements in terms of their pseudoAIC values. In contrast, the factor analysis with a single factor provided the overall best fit in terms of the smallest chi-square, lowest pseudoAIC and lowest SRMR. In this model (see Fig 2), genetic variance at each five-year age interval was best explained by a single factor with a genome-wide SNP heritability of 24%.

Table 2

Multivariate model fitting comparisons based on the combined male & female GWAS summary statistics.

Models	Chi-square_(df)	p	pseudoAIC	CFI	TLI	SRMR
Full auto-regression (AutoReg)	21.113₍₁₃₎	0.071	55.113	0.999	0.999	0.039
AutoReg: genetic innovation at 65–73 yrs dropped	22.419₍₁₄₎	0.070	54.419	1.000	0.999	0.039
AutoReg: genetic innovation at 60–64 yrs dropped	20.872₍₁₄₎	0.105	52.872	1.000	0.999	0.039
AutoReg: genetic innovation at 55–59 yrs dropped	25.403₍₁₄₎	0.031	57.403	0.999	0.999	0.041
AutoReg: genetic innovation at 50–54 yrs dropped	21.768₍₁₄₎	0.084	53.768	0.999	0.999	0.040
AutoReg: genetic innovation at 45–49 yrs dropped	34.073₍₁₄₎	0.002	66.073	0.998	0.998	0.051
AutoReg: genetic innovations at 45–73 yrs dropped	46.133₍₁₄₎	0.000	70.133	0.998	0.998	0.056
Factor analysis—1 factor	13.005₍₉₎	0.162	37.005	1.000	1.000	0.016

Note: AIC = Akaike Information Criterion, CFI = Comparative Fit Index, TLI = Tucker Lewis Index, SRMR = (Standardized) Root Mean Square Residual. Innovations refer to novel or age-specific genetic influences that are uncorrelated with previous genetic influences.

Fig 2

Best fitting factor analytic model with a single common factor (CF) based on the combined male and female data.

The CF explains covariation between the six GWAS summary statistics each based on five-year intervals between ages 40–73 years. To identify this model, the first factor loading from the CF to BMI GWAS at 40–45 years was constrained to one. The double-headed arrow on the CF denotes the standardized variance, or SNP-based heritability, for BMI. Double-headed arrows on the residuals denote genetic variation at each age interval not otherwise explained by the CF.

Sex specific analyses

An identical pattern emerged when the model fitting was repeated by sex. Male and female sample sizes at each age interval are shown in S1 Table. Table 3 shows the LDSR genetic correlations for men and women. Varying only slightly, the separate male and female correlations were again high and ranged from rg = 0.88 to rg = 1.00. S1 Table also shows the SNP-based heritability estimates by sex, which were very similar at each age interval.

Table 3

Linkage disequilibrium score regression genetic correlations based on the male (below diagonal) & female (above diagonal italics) GWAS summary statistics at six age intervals.

	1.	2.	3.	4.	5.	6.
1. BMI GWAS 40–44 yrs	1	0.99	0.99	0.91	0.95	0.92
2. BMI GWAS 45–49 yrs	0.98	1	1.00	0.96	0.93	0.93
3. BMI GWAS 50–54 yrs	1.00	0.99	1	0.95	0.91	0.90
4. BMI GWAS 55–59 yrs	0.93	0.89	0.93	1	0.88	0.93
5. BMI GWAS 60–64 yrs	0.97	0.90	0.95	0.88	1	0.98
6. BMI GWAS 65–73 yrs	0.97	0.90	0.96	0.93	0.99	1

As shown in Table 4, the genetic innovations at ages 46 to 66+ years for men and women could each be dropped from the full autoregression model as judged by the non-significant chi-square value. Overall, for both sexes, the factor analysis with a single common factor again provided the best fit to the data in terms of the lowest chi-square, pseudoAIC and SRMR values (see Fig 3). This suggests that there is no evidence of age-specific genome-wide variation in BMI for either men or women.

Table 4

Multivariate model fitting comparisons based on the female & male GWAS summary statistics at six age intervals.

Models	Chi-square_(df)	p	pseudoAIC	CFI	TLI	SRMR
Women
Full auto-regression (AutoReg)	15.019₍₁₃₎	0.306	49.019	1.000	1.000	0.043
AutoReg: genetic innovation at 65–73 yrs dropped	14.866₍₁₄₎	0.387	46.866	1.000	1.000	0.043
AutoReg: genetic innovation at 60–64 yrs dropped	14.883₍₁₄₎	0.386	46.883	1.000	1.000	0.043
AutoReg: genetic innovation at 55–59 yrs dropped	16.813₍₁₄₎	0.266	48.813	0.999	0.999	0.046
AutoReg: genetic innovation at 50–54 yrs dropped	14.213₍₁₄₎	0.434	46.213	1.000	1.000	0.043
AutoReg: genetic innovation at 45–49 yrs dropped	21.482₍₁₄₎	0.090	53.482	0.998	0.998	0.057
AutoReg: genetic innovations at 45–73 yrs dropped	25.617₍₁₄₎	0.109	49.617	0.998	0.999	0.059
Factor analysis—1 factor	8.832₍₉₎	0.453	32.832	1.000	1.000	0.023
Men
Full auto-regression (AutoReg)	11.858₍₁₃₎	0.539	45.858	1.000	1.000	0.054
AutoReg: genetic innovation at 65–73 yrs dropped	12.814₍₁₄₎	0.541	44.814	1.000	1.000	0.054
AutoReg: genetic innovation at 60–64 yrs dropped	12.085₍₁₄₎	0.599	44.085	1.000	1.000	0.053
AutoReg: genetic innovation at 55–59 yrs dropped	11.889₍₁₄₎	0.615	43.889	1.000	1.000	0.053
AutoReg: genetic innovation at 50–54 yrs dropped	21.826₍₁₄₎	0.082	53.826	0.998	0.998	0.064
AutoReg: genetic innovation at 45–49 yrs dropped	12.718₍₁₄₎	0.549	44.718	1.000	1.000	0.059
AutoReg: genetic innovations at 45–73 yrs dropped	1628.983₍₁₄₎	0.000	1652.983	0.168	0.001	0.803
Factor analysis—1 factor	4.398₍₉₎	0.883	28.398	1.001	1.000	0.018

Note: AIC = Akaike Information Criterion, CFI = Comparative Fit Index, TLI = Tucker Lewis Index, SRMR = (Standardized) Root Mean Square Residual

Fig 3

Best fitting factor analytic model with a single common factor (CF) for men (A) & women (B). To identify this model, the first factor loading from the CF to BMI GWAS at 40–45 years was constrained to one. The double-headed arrow on the CF denotes the standardized variance, or SNP-based heritability, for BMI. Double-headed arrows on the residuals denote genetic variation at each age interval not otherwise explained by the CF.

Discussion

This is the first study to test a developmental theory regarding BMI heritability using molecular data and structural equation modeling. Between ages 40 and 73, changes in BMI heritability could not be explained by detectable age-specific genome-wide variation or an accumulation of genetic variants over time. Instead, individual differences in the molecular genetics of BMI across this time span were best explained by a single or common set of stable, genetic influences that are observable in early midlife. This pattern was observed in men and women. Our results are consistent with Silventoinen et al.’s meta-analysis of twin data that revealed only minor differences in BMI heritability estimates across cultural-geographic regions and measurement time [78,79]. Dahl et al.’s [12] analysis of Swedish twin data revealed that for men and women, BMI increases across midlife, before leveling off at 65 years and declining at approximately age 80. The extent to which Dahl et al.’s observed inflexion at age 65 years is indicative of age-specific, distinct genetic influences or variance components was inconsistent with our results. Instead, we found that the genetic correlations between BMI at ages 61–65 years and the remaining four age tranches were all equally high. Thus, the molecular genetic variance at age 65 does not appear to be linked to age-specific or distinct genetic processes occurring around this time. An outstanding question is whether or not our results generalize to earlier stages in life. Here, a number of reports, relying on different methods, suggest that genetic risks spanning childhood, adolescence and early adulthood likely comprise a combination of age-specific and age-invariant influences. For example, several studies have shown that the PRS for childhood BMI predicts adult BMI, metabolic outcomes and other complex traits [80-84]. 
Other studies relying on twin data have reported genetic correlations between BMI assessed at shorter age intervals spanning infancy, adolescence and teenage years that are much higher compared to genetic correlations based on wider age intervals [8,14,85,86]. This pattern is consistent with autoregressive features. Cornes et al.’s [9] application of autoregressive modelling found evidence of largely age-invariant genetic influences on BMI at ages 12, 14 and 16. The authors also reported smaller but significant age-specific genetic influences on BMI, depending on sex, at 14 and 16 years that were uncorrelated with BMI at age 12 [9]. The pattern of age-invariant influences is consistent with the study by Couto Alves et al. [87], which examined BMI spanning ages 2 to 18 years and reported a robust overlap between the genetics of child and adult BMI. The same study also identified a completely distinct genetic architecture in infancy [87]. The reports by Warrington et al. [88] and Felix et al. [89] have shown how numerous replicated adult BMI loci also reach genome-wide significance in childhood GWAS studies of BMI. In terms of other LDSR studies, Trzaskowski et al. [20] reported a genetic correlation between BMI at 11 and 65 years of 0.86. The same study also found that the adult PRS for BMI explained at most 10% of the phenotypic variance in childhood BMI. Combined, the findings based on twin and molecular studies suggest that variability in heritability estimates spanning childhood, adolescence and early adulthood is likely explained by a combination of mostly age-invariant plus age-specific genetic influences, which could potentially be better captured by autoregressive modeling.

Limitations

Our findings should be interpreted in the context of four limitations. First, the BMI data used here were not repeated measures, but pseudo longitudinal. This approach assumes no year of birth or cohort-related genetic heterogeneity. Until now, GenomicSEM reports have typically leveraged LDSR-derived genetic covariances in the context of cross-sectional hypotheses. Our pseudo longitudinal modeling is not unlike standard cross-sectional GenomicSEM analyses. Both approaches depend upon the GWAS summary statistics being derived from a homogeneous ancestral group. There is also no requirement for summary statistics to be based on the same subjects. It remains important to reduce the likelihood that our age-specific GWAS results comprised subjects from heterogeneous populations. This is important because cohorts can have different BMI heritability, different environmental influences on BMI, or differences in the genetic control of sensitivity to the environment, which can bias the covariance estimates. Danish and Swedish twin studies have illustrated differential heritability by showing how increases in mean BMI in successively younger cohorts have been accompanied by increasing genetic variance [90,91]. Therefore, to determine if cohort effects existed, we inspected the LDSR genetic correlations between the youngest and oldest age tranches i.e., two maximally age-discrepant samples of unrelated individuals. Here, the rg was 0.97 (see Table 1), which suggests that the likelihood of any cohort-related genetic heterogeneity was minimal. Note, we also performed GWAS on a subset of men (N = 8,337) and women (N = 7,681) with repeated BMI measures at any time from 48 to 61 years and then from 62 to 72 years. Here, the test-retest LDSR correlations were rg = 0.99 (p = 1.04e-03) and rg = 0.95 (p = 1.68e-04) for men and women respectively. Thus, genetic correlations were very high between and within subjects. 
In another attempt to reduce the possibility of systematic differences between putative subpopulations in terms of allele frequencies, we re-ran the GWAS analyses using 40 PCs as covariates. As shown in S1 Table, there were only very minor differences in genomic inflation and SNP heritability between the 20 versus 40 PC results. Therefore, not only did genomic inflation remain the same regardless of the number of PCs, it did not change across age intervals. These results further reduce the likelihood of birth or cohort-related genetic heterogeneity. Second, the UKB recruitment process did not represent a random sample of the UK population [92]. Subjects were predominately European, more likely to be older, female, to live in less socioeconomically deprived areas than nonparticipants, and when compared with the general population, were also less likely to be obese, to smoke, and to drink alcohol daily while reporting fewer self-reported health conditions [93,94]. Although Silventoinen et al.’s meta-analysis of twin data reported only minor differences in BMI heritability across divergent cultural-geographic regions [78,79], the extent to which the molecular-based genetic covariance structure observed here generalizes to non-European populations remains to be determined. Third, while our results illustrate the flexibility of SEM in terms of its application to GWAS summary statistics to test a theory of longitudinal change, our modeling was not exhaustive. For instance, we did not test the hypothesis that changes in heritability could be better explained by latent growth or latent growth mixture models [95,96]. We note that the current method is limited to the analysis of summary variance-covariance matrices derived from the analysis of common variants. GenomicSEM does not model observed phenotypic information. Consequently, there was no mean information to model latent growth or mixture distributions. 
We also did not test hypotheses regarding sex differences other than to report results by sex. Dubois’ meta-analysis of 23 twin birth-cohorts found evidence of sex-limitation in terms of greater genetic variance in boys in early infancy through to 19 years [97]. In contrast, Elks et al.’s meta-regression of 88 twin-based estimates of BMI heritability found no evidence of sex effects [13]. It remains to be determined if the observed minor differences in the genetic covariances and the ultimate, best fitting single-factor structure are empirically equivalent across sex. Finally, our genomic modelling was based on aggregated GWAS summary data and so was entirely independent of environmental risks, which are known to be significant in the etiology of psychiatric and behavioral traits [98]. Consequently, our current approach precludes modeling the contribution of environmental influences with increasing age [99] or making allowances for any genetic control of sensitivity to the environment, i.e., G × E interaction [78]. In this regard, methods that can simultaneously model the joint effect of genes and environment are likely to prove more informative. For instance, innovative approaches capable of applying genomic-relatedness based restricted maximum-likelihood [100] to structural equation modeling in the OpenMx [40] software package have the potential to analyze individual GWAS and phenotypic data and hold promise.

Conclusion

Structural equation modeling of GWAS summary statistics for BMI between the ages of 40 and 73 revealed that molecular genetic variance in BMI at successive 5-year age intervals could not be explained by the accumulation of age-specific genetic influences or autoregressive processes. Instead, a common set of stable genetic influences appears to underpin genome-wide variation in BMI from middle to early old age in men and women.
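To make the contrast between the two competing hypotheses concrete, the sketch below computes the genetic correlations implied by (a) a single common factor and (b) a first-order autoregressive process in which each age interval inherits the previous interval's genetic factor plus a new age-specific "innovation". All parameter values and function names are hypothetical and chosen only for illustration; this is not the authors' GenomicSEM specification.

```python
import math


def common_factor_cov(loadings, residual_vars):
    """Implied covariance when one factor (variance 1) drives every interval."""
    n = len(loadings)
    return [[loadings[i] * loadings[j] + (residual_vars[i] if i == j else 0.0)
             for j in range(n)] for i in range(n)]


def autoregressive_cov(beta, first_var, innovation_var, n):
    """Implied covariance for A_t = beta * A_(t-1) + innovation_t."""
    variances = [first_var]
    for _ in range(1, n):
        # Variance carried over from the previous interval plus new innovation.
        variances.append(beta ** 2 * variances[-1] + innovation_var)
    cov = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            lo, hi = min(i, j), max(i, j)
            # Covariance is transmitted forward through (hi - lo) steps.
            cov[i][j] = beta ** (hi - lo) * variances[lo]
    return cov


def to_corr(cov):
    """Standardize a covariance matrix to a correlation matrix."""
    n = len(cov)
    return [[cov[i][j] / math.sqrt(cov[i][i] * cov[j][j]) for j in range(n)]
            for i in range(n)]


N_INTERVALS = 6  # six five-year age intervals

# Hypothetical values: no age-specific genetic variance in the factor model.
cf_corr = to_corr(common_factor_cov([0.5] * N_INTERVALS, [0.0] * N_INTERVALS))
# Hypothetical values: full transmission plus a small innovation at each step.
ar_corr = to_corr(autoregressive_cov(1.0, 0.25, 0.025, N_INTERVALS))

print("common factor, first vs last interval:", round(cf_corr[0][5], 3))
print("autoregressive, adjacent intervals:   ", round(ar_corr[0][1], 3))
print("autoregressive, first vs last:        ", round(ar_corr[0][5], 3))
```

Under these assumptions the common-factor model implies uniform correlations across all pairs of intervals, whereas the autoregressive model implies correlations that fall as the gap between intervals widens. It is this difference in the implied covariance pattern that allows the models to be compared against the observed genetic covariance matrix.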

Number of men & women with complete BMI & GWAS summary statistics at each age interval, as well as genomic inflation (λ) & SNP-based heritability (h²) for the GWAS analyses comprising 20 versus 40 principal components (PCs).

(DOCX)

9 Nov 2021

Dear Dr Gillespie,

Thank you very much for submitting your Research Article entitled 'Determining the stability of genome-wide factors in BMI between ages 40 to 69 years.' to PLOS Genetics. The manuscript was fully evaluated at the editorial level and by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the current manuscript. Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We cannot, of course, promise publication at that time.

Because many readers may not be familiar with gSEMs, we would like to see a rewritten method with more elucidation. Additional analyses as suggested by reviewer 2 should be included.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.
Yours sincerely,

Xiaofeng Zhu
Associate Editor
PLOS Genetics

Scott Williams
Section Editor: Natural Variation
PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Reviewer #1: Gillespie et al. perform a valuable and insightful structural analysis of age-stratified GWAS of BMI, effectively fitting a pseudo-longitudinal model to cross-sectional data. It is exciting to see GenomicSEM being used in a creative and informative manner. The results reflect very few novel genetic effects on BMI in older adults in the UKB. This is not unexpected, as it is generally the age period in which the rank order of BMI in a population across time is more stable than during development or in old age.

I'd like to begin the review by mentioning some essential papers that do related things, without undermining the novelty of the current work, but that should be discussed or tied to the current effort. Here is work using PRS and/or multivariable MR and "BMI" at age 10 and in adulthood (I am not an author on any of these; they are all very relevant literature):

Richardson, T. G., Sanderson, E., Elsworth, B., Tilling, K., & Smith, G. D. (2019). Can the impact of childhood adiposity on disease risk be reversed? A Mendelian randomization study. medRxiv, 19008011.

Brandkvist, M., Bjørngaard, J. H., Ødegård, R. A., Åsvold, B. O., Smith, G. D., Brumpton, B., ... & Vie, G. Å. (2020). Separating the genetics of childhood and adult obesity: a validation study of genetic scores for body mass index in adolescence and adulthood in the HUNT Study. Human molecular genetics, 29(24), 3966-3973.

Richardson, T. G., Crouch, D. J., Power, G. M., Berstein, F. M., Hazelwood, E., Fang, S., ... & Smith, G. D. (2021). Disentangling the direct and indirect effects of childhood adiposity on type 1 diabetes and immune-associated diseases: a multivariable Mendelian randomization study. medRxiv.

There is more work on repeatedly measured BMI in MoBa, which also is very relevant.

Hone, L., Jacobs, B. M., Marshall, C. R., Giovannoni, G. R., Noyce, A., & Dobson, R. (2021). Age-specific effects of childhood BMI on multiple sclerosis risk: a Mendelian Randomisation study. medRxiv.

Helgeland, Ø., Vaudel, M., Juliusson, P. B., Holmen, O. L., Juodakis, J., Bacelis, J., ... & Njølstad, P. R. (2019). Genome-wide association study reveals dynamic role of genetic variation in infant and early childhood growth. Nature communications, 10(1), 1-10.

There are more; I feel it exceeds my responsibility as a reviewer to look those up. With those prerequisites out of the way, let's get to the paper itself. Major concerns:

You conclude that: "differences in BMI across age could not be explained by the accumulation of age-specific genetic influences or autoregressive processes". This is true, but it would be good to reflect on what, if anything, the genetic model selected based on parsimony would mean for the phenotypic model. I for one suspect most people would conceptualise causal auto-regressive effects of BMI at the phenotypic level. Would your findings challenge such a model, and under what conditions would they or wouldn't they?

There are actual repeated measures within UKB: 20k people went for a second clinic visit and 46k people went in for an imaging visit. Your model makes predictions about the rg we expect to see there, and it would be good to test those. (I admit I expect very little based on your high rg, which makes for an interesting test: what if these data do not conform?)

There is a further possible source of corroboration you could pursue. Your model implies a certain rg between BMI and change in weight. UKB includes a question on recent weight change (https://biobank.ndph.ox.ac.uk/showcase/field.cgi?id=2306), and for those repeatedly measured you can compute actual weight change.
The final GWAS I listed previously among the literature suggestions has sumstats available. We have been running simple models with those, without a clear goal in mind other than to develop some instructional examples, but you could consider integrating them into this paper in a single model (then age 6 weeks to age 70...), though this might be beyond the scope of this manuscript. This is actually specifically a point where GenomicSEM would complement full-data SEM/gSEM: GenomicSEM can go beyond a single cohort, while raw-data SEM can allow for additional types of models.

The genome-wide model need not hold for individual GWAS hits. Can you validate that well-known BMI loci (FTO etc.) affect the age-stratified outcomes in a manner consistent with the global model? (You could use the Q-statistic or inspect the effect sizes for various top hits.)

The discussion could use a clearer treatment of the limitations of pseudo-longitudinal modeling in general: when do we expect it to fail, etc.

Reviewer #2: The authors applied SEMs to evaluate the stability of genome-wide determinants of BMI in adult populations. The scientific question is good and clearly explained. While this is a worthwhile investigation, there are a few limitations that may make this manuscript less relevant, as it is, for the wide audience of readers interested in genetics in general.

1. The age range is limited: only adults. It is not clear that the result is surprising given that. Previous results (Winkler et al) found lower genetic correlation between age groups of less and more than 50. It would be useful to look at younger individuals.

2. The modeling approach using gSEMs. Many researchers are not familiar with that, and the specialized nomenclature used makes it difficult to assess the methodology. It would be very useful, and required, to spell out the statistical model to explain things, e.g., what "innovations" are.

3. Also, it is not clear how the GWAS were used. Only for computing genetic correlations and heritability? (Maybe it was clear, I just want to make sure.) Then the lavaan software used these to construct an overall covariance matrix?

4. I was expecting to see some results estimating genetic associations of specific variants known to be highly associated with BMI, meaning, to look at their effects over time.

5. I also expected to see PRS associations evaluated over time. The authors can also construct PRS based on each of the GWAS and study associations with different age groups.

Have all data underlying the figures and results presented in the manuscript been provided?

Reviewer #1: No: Please also upload GWAS sumstats to a public repository.
Reviewer #2: None

Do you want your identity to be public for this peer review?

Reviewer #1: No
Reviewer #2: Yes: Tamar Sofer

28 Apr 2022

Submitted filename: Response_to_reviewers.docx

21 Jun 2022

Dear Dr Gillespie,

We are pleased to inform you that your manuscript entitled "Determining the stability of genome-wide factors in BMI between ages 40 to 69 years." has been editorially accepted for publication in PLOS Genetics. Congratulations!
Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Xiaofeng Zhu
Section Editor: Methods
PLOS Genetics

Scott Williams
Section Editor: Human Variation
PLOS Genetics

Comments from the reviewers (if applicable):

Reviewer's Responses to Questions

Comments to the Authors:

Reviewer #1: My comments were addressed, pseudo longitudinal modeling is an exciting potential application of genomicSEM.

Reviewer #2: Thank you for responding to the earlier comments, I have no further comments. Nice work!

Have all data underlying the figures and results presented in the manuscript been provided?

Reviewer #1: No: The underlying data is restricted access for good reasons (genotypes privacy etc etc). Qualified researchers in academia and industry can obtain access to the underlying raw data.
Reviewer #2: None

Do you want your identity to be public for this peer review?

Reviewer #1: No
Reviewer #2: No
26 Jul 2022

PGENETICS-D-21-01158R1

Determining the stability of genome-wide factors in BMI between ages 40 to 69 years.

Dear Dr Gillespie,

We are pleased to inform you that your manuscript entitled "Determining the stability of genome-wide factors in BMI between ages 40 to 69 years." has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Zsofi Zombor
PLOS Genetics

On behalf of: The PLOS Genetics Team

91 in total

1. UK Biobank, big data, and the consequences of non-representativeness.

Authors: Katherine M Keyes; Daniel Westreich
Journal: Lancet Date: 2019-03-30 Impact factor: 79.321

2. Genetics of body mass stability and risk for chronic disease: a 28-year longitudinal study.

Authors: Carol E Franz; Michael D Grant; Kristen C Jacobson; William S Kremen; Seth A Eisen; Hong Xian; James Romeis; Heather Thompson-Brenner; Michael J Lyons
Journal: Twin Res Hum Genet Date: 2007-08 Impact factor: 1.587

3. Estimating missing heritability for disease from genome-wide association studies.

Authors: Sang Hong Lee; Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal: Am J Hum Genet Date: 2011-03-03 Impact factor: 11.025

4. Latent growth curves within developmental structural equation models.

Authors: J J McArdle; D Epstein
Journal: Child Dev Date: 1987-02

5. The genetic analysis of repeated measures. I. Simplex models.

Authors: D I Boomsma; P C Molenaar
Journal: Behav Genet Date: 1987-03 Impact factor: 2.805

6. Twin studies of psychiatric illness. Current status and future directions. (Review)

Authors: K S Kendler
Journal: Arch Gen Psychiatry Date: 1993-11

7. The genetics of schizophrenia: a current, genetic-epidemiologic perspective. (Review)

Authors: K S Kendler; S R Diehl
Journal: Schizophr Bull Date: 1993 Impact factor: 9.306

8. Molecular genetic overlap between migraine and major depressive disorder.

Authors: Yuanhao Yang; Huiying Zhao; Dorret I Boomsma; Lannie Ligthart; Andrea C Belin; George Davey Smith; Tonu Esko; Tobias M Freilinger; Thomas Folkmann Hansen; M Arfan Ikram; Mikko Kallela; Christian Kubisch; Christofidou Paraskevi; David P Strachan; Maija Wessman; Arn M J M van den Maagdenberg; Gisela M Terwindt; Dale R Nyholt
Journal: Eur J Hum Genet Date: 2018-07-11 Impact factor: 4.246

9. The CODATwins Project: The Current Status and Recent Findings of COllaborative Project of Development of Anthropometrical Measures in Twins.

Authors: K Silventoinen; A Jelenkovic; Y Yokoyama; R Sund; M Sugawara; M Tanaka; S Matsumoto; L H Bogl; D L Freitas; J A Maia; J V B Hjelmborg; S Aaltonen; M Piirtola; A Latvala; L Calais-Ferreira; V C Oliveira; P H Ferreira; F Ji; F Ning; Z Pang; J R Ordoñana; J F Sánchez-Romera; L Colodro-Conde; S A Burt; K L Klump; N G Martin; S E Medland; G W Montgomery; C Kandler; T A McAdams; T C Eley; A M Gregory; K J Saudino; L Dubois; M Boivin; M Brendgen; G Dionne; F Vitaro; A D Tarnoki; D L Tarnoki; C M A Haworth; R Plomin; S Y Öncel; F Aliev; E Medda; L Nisticò; V Toccaceli; J M Craig; R Saffery; S H Siribaddana; M Hotopf; A Sumathipala; F Rijsdijk; H-U Jeong; T Spector; M Mangino; G Lachance; M Gatz; D A Butler; W Gao; C Yu; L Li; G Bayasgalan; D Narandalai; K P Harden; E M Tucker-Drob; K Christensen; A Skytthe; K O Kyvik; C A Derom; R F Vlietinck; R J F Loos; W Cozen; A E Hwang; T M Mack; M He; X Ding; J L Silberg; H H Maes; T L Cutler; J L Hopper; P K E Magnusson; N L Pedersen; A K Dahl Aslan; L A Baker; C Tuvblad; M Bjerregaard-Andersen; H Beck-Nielsen; M Sodemann; V Ullemar; C Almqvist; Q Tan; D Zhang; G E Swan; R Krasnow; K L Jang; A Knafo-Noam; D Mankuta; L Abramson; P Lichtenstein; R F Krueger; M McGue; S Pahlen; P Tynelius; F Rasmussen; G E Duncan; D Buchwald; R P Corley; B M Huibregtse; T L Nelson; K E Whitfield; C E Franz; W S Kremen; M J Lyons; S Ooki; I Brandt; T S Nilsen; J R Harris; J Sung; H A Park; J Lee; S J Lee; G Willemsen; M Bartels; C E M van Beijsterveldt; C H Llewellyn; A Fisher; E Rebato; A Busjahn; R Tomizawa; F Inui; M Watanabe; C Honda; N Sakai; Y-M Hur; T I A Sørensen; D I Boomsma; J Kaprio
Journal: Twin Res Hum Genet Date: 2019-07-31 Impact factor: 1.587

10. Genetic contributions to self-reported tiredness.

Authors: V Deary; S P Hagenaars; S E Harris; W D Hill; G Davies; D C M Liewald; A M McIntosh; C R Gale; I J Deary
Journal: Mol Psychiatry Date: 2017-02-14 Impact factor: 15.992