| Literature DB >> 27478587 |
Lars Rönnegård1, S Eryn McFarlane2, Arild Husby3, Takeshi Kawakami4, Hans Ellegren4, Anna Qvarnström2.
Abstract
Genomewide association studies (GWAS) enable detailed dissections of the genetic basis for organisms' ability to adapt to a changing environment. In long-term studies of natural populations, individuals are often marked at one point in their life and then repeatedly recaptured. It is therefore essential that a method for GWAS includes the process of repeated sampling. In a GWAS, the effects of thousands of single-nucleotide polymorphisms (SNPs) need to be fitted and any model development is constrained by the computational requirements. A method is therefore required that can fit a highly hierarchical model and at the same time is computationally fast enough to be useful.Our method fits fixed SNP effects in a linear mixed model that can include both random polygenic effects and permanent environmental effects. In this way, the model can correct for population structure and model repeated measures. The covariance structure of the linear mixed model is first estimated and subsequently used in a generalized least squares setting to fit the SNP effects. The method was evaluated in a simulation study based on observed genotypes from a long-term study of collared flycatchers in Sweden.The method we present here was successful in estimating permanent environmental effects from simulated repeated measures data. Additionally, we found that especially for variable phenotypes having large variation between years, the repeated measurements model has a substantial increase in power compared to a model using average phenotypes as a response.The method is available in the r package RepeatABEL. It increases the power in GWAS having repeated measures, especially for long-term studies of natural populations, and the R implementation is expected to facilitate modelling of longitudinal data for studies of both animal and human populations.Entities:
Keywords: Ficedula albicollis; genomic relationship; hierarchical generalized linear model; single‐nucleotide polymorphisms
Year: 2016 PMID: 27478587 PMCID: PMC4950150 DOI: 10.1111/2041-210X.12535
Source DB: PubMed Journal: Methods Ecol Evol Impact factor: 7.781
Average (SD) of estimates from repeated measurement model (rGLS) compared to a model fitting average phenotypes as response (using the mmscore function in GenABEL)a
| Balanced | Year effects | Method |
|
|
|
|
|---|---|---|---|---|---|---|
| Yes | No | mmscore | 0·484 (0·14) | 0·392 (0·06) | NA | 1·01 (0·01) |
| Yes | No | rGLS | 0·484 (0·14) | 0·340 (0·04) | 0·334 (0·03) | 1·01 (0·01) |
| Yes | Yes | mmscore | 0·476 (0·07) | 0·424 (0·06) | NA | 1·01 (0·01) |
| Yes | Yes | rGLS | 0·475 (0·07) | 0·362 (0·03) | 0·321 (0·03) | 1·00 (0·01) |
| No | No | mmscore | 0·475 (0·08) | 0·366 (0·08) | NA | 1·00 (0·01) |
| No | No | rGLS | 0·474 (0·08) | 0·339 (0·05) | 0·329 (0·05) | 1·00 (0·01) |
| No | Yes | mmscore | 0·477 (0·17) | 0·233 (0·07) | NA | 1·00 (0·00) |
| No | Yes | rGLS | 0·466 (0·10) | 0·357 (0·04) | 0·313 (0·04) | 1·01 (0·01) |
Simulated , heritability , coefficient of permanent env. effects .
Simulated year effect explaining 57% of the total phenotypic variance. Year effects fitted as fixed effects in the repeated measurement model.
Comparison of P‐values between methods: repeated measurement model (using rGLS) vs. a model fitting average phenotypes as response (using mmscore)a
| Balanced | Year effects | Increase in | Correlation of |
|---|---|---|---|
| Yes | No | 1·0% | 0·9999 |
| Yes | Yes | 0·8% | 0·9999 |
| No | No |
| 0·9948 |
| No | Yes |
| 0·8319 |
Simulated , additional polygenic heritability , coefficient of permanent env. effects .
Simulated year effect explaining 57% of the total phenotypic variance. Year effects fitted as fixed effects in the repeated measurement model.
At the simulated QTL. Significant differences between methods shown as bold text.
Figure 1Percentage increase in ‐values depending on the size of the year effects. Flycatcher data were used to simulate a population with unbalanced number of observations per individual.
Performance of repeated measurement model for different number of SNP effects and effect sizes
| No. QTL |
|
| Average | Prop. | Estimated variance components |
| ||
|---|---|---|---|---|---|---|---|---|
| Polygenic | Perm. env. | Residual | ||||||
| 0 | 0 | 0·00 (0·11) | 0·50 ( 0·29) | 0 | 1·03 (0·10) | 0·98 (0·08) | 1·00 (0·03) | 1·00 (0·01) |
| 1 | 1·0 | 1·00 (0·10) | <0·01 (<0·01) | 0·97 | 1·30 (0·16) | 1·06 (0·10) | 1·00 (0·03) | 1·02 (0·02) |
| 1 | 1·5 | 1·50 (0·10) | <0·01 (<0·01) | 0·98 | 1·66 (0·27) | 1·14 (0·15) | 1·00 (0·03) | 1·01 (0·02) |
| 20 | 1·0 | 1·00 (0·21) | 0·01 (0·05) | 0·75 | 6·00 (0·54) | 2·58 (0·26) | 0·99 (0·03) | 1·21 (0·02) |
Average over the 20 estimated effects.
Performance of estimated variance components in the repeated measurement model. Two scenarios simulated including permanent environmental effects in the simulations (with a variance of 1), and without permanent environmental effects simulated
| Perm. env. effect simulated | Estimated variance components |
| ||
|---|---|---|---|---|
| Polygenic | Perm. env. | Residual | ||
| Yes | 1·03 (0·10) | 0·98 (0·08) | 1·00 (0·03) | 1·00 (0·01) |
| No | 0·90 (0·08) | 0·08 (0·03) | 0·99 (0·03) | 1·00 (0·01) |
Performance of repeated measurement model for binomial data. Using an underlying Gaussian distribution, an equal binary proportion of zeros and ones was simulated for individuals having the common allele
| No. QTL |
|
| Average | Prop. | Estimated variance components |
| ||
|---|---|---|---|---|---|---|---|---|
| Polygenic | Perm. env. | Residual | ||||||
| 0 | 0 | 0·00 (0·03) | 0·50 ( 0·29) | 0 | 0·06 (0·01) | 0·06 (0·01) | 0·13 (0·01) | 1·00 (0·01) |
| 1 | 1·0 | 0·22 (0·02) | <0·01 (<0·01) | 0·96 | 0·06 (0·01) | 0·06 (0·01) | 0·13 (0·01) | 1·02 (0·02) |
| 1 | 1·5 | 0·31 (0·02) | <0·01 (<0·01) | 0·98 | 0·07 (0·01) | 0·06 (0·07) | 0·12 (0·01) | 1·02 (0·02) |
| 20 | 1·0 | 0·13 (0·03) | 0·01 ( 0·07) | 0·52 | 0·11 (0·01) | 0·06 (0·01) | 0·07 (0·01) | 1·12 (0·02) |
Performance of repeated measurement model for different binary proportions. An additive effect size of 1·0 for one QTL was simulated on the underlying Gaussian scale
| Binary cut off | Proportion 1's | Expected estimate | Estimated effect size | Prop. |
|
|---|---|---|---|---|---|
| 0 | 0·63 | 0·201 | 0·202 (0·028) | 0·95 | 1·02 (0·02) |
| −0·5 | 0·73 | 0·172 | 0·172 (0·026) | 0·95 | 1·01 (0·02) |
| −0·75 | 0·77 | 0·155 | 0·155 (0·020) | 0·93 | 1·01 (0·03) |
| −1·0 | 0·80 | 0·136 | 0·137 (0·022) | 0·91 | 1·01 (0·02) |
| −1·25 | 0·84 | 0·118 | 0·118 (0·018) | 0·92 | 1·00 (0·02) |
Expected value on the observed binary scale derived in Appendix S1.
Not estimated for one of the replicates due to non‐convergence using the GenABEL estlambda function.