| Literature DB >> 29098496 |
Charles Laurin1, Gabriel Cuellar-Partida2, Gibran Hemani1, George Davey Smith2, Jian Yang3, David M Evans4,5.
Abstract
We propose a new method, G-REMLadp, to estimate the phenotypic variance explained by parent-of-origin effects (POEs) across the genome. Our method uses restricted maximum likelihood analysis of genome-wide genetic relatedness matrices based on individuals' phased genotypes. Genome-wide SNP data from parent child duos or trios is required to obtain relatedness matrices indexing the parental origin of offspring alleles, as well as offspring phenotype data to partition the trait variation into variance components. To calibrate the power of G-REMLadp to detect non-null POEs when they are present, we provide an analytic approximation derived from Haseman-Elston regression. We also used simulated data to quantify the power and Type I Error rates of G-REMLadp, as well as the sensitivity of its variance component estimates to violations of underlying assumptions. We subsequently applied G-REMLadp to 36 phenotypes in a sample of individuals from the Avon Longitudinal Study of Parents and Children (ALSPAC). We found that the method does not seem to be inherently biased in estimating variance due to POEs, and that substantial correlation between parental genotypes is necessary to generate biased estimates. Our empirical results, power calculations and simulations indicate that sample sizes over 10000 unrelated parent-offspring duos will be necessary to detect POEs explaining < 10% of the variance with moderate power. We conclude that POEs tagged by our genetic relationship matrices are unlikely to explain large proportions of the phenotypic variance (i.e. > 15%) for the 36 traits that we have examined.Entities:
Keywords: ALSPAC; G-REML; GCTA; Imprinting; Parent-of-origin effects
Mesh:
Year: 2017 PMID: 29098496 PMCID: PMC5752821 DOI: 10.1007/s10519-017-9880-0
Source DB: PubMed Journal: Behav Genet ISSN: 0001-8244 Impact factor: 2.805
Recoding a phased genotype using three orthogonal terms
| Phased genotype | Freq | Add Code | Std add | Dom code | Std dom | POE code | Std POE |
|---|---|---|---|---|---|---|---|
|
|
| 0 |
| 0 |
| 0 | 0 |
|
|
| 1 |
|
| 1 | − 1 |
|
|
|
| 1 |
|
| 1 | 1 |
|
|
|
| 2 |
|
|
| 0 | 0 |
The minor allele is , with frequency , major allele with frequency . : maternally inherited allele; : paternally inherited allele. “Freq” expected frequency of the genotype under Hardy–Weinberg equilibrium. “Add Code” additive coding of the phased genotype. “Std Add” standardized additive coding based on mean and variance . “Dom Code” dominance coding of the phased genotype. “Std Dom” standardized dominance coding based on mean and variance . “POE Code” parent-of-origin effect coding of the phased genotype. “Std POE” standardized parent-of-origin effect coding based on mean and variance
Fig. 1Phased genotype represented by 3 standardised codings: This figure shows the values of each of three effect-based codings for a given phased genotype, where a represents the major allele and A the minor (effect allele). The subscript mo indicates that the allele was transmitted maternally, while the subscript fa indicates paternal transmission. For a single locus with minor allele frequency , the standardised additive coding (0, 1, 2 standardised to ) of the phased genotype is given by the medium-weight line. The standardised dominance coding is the dashed line ( standardised to). The parent-of-origin effect coding is the thick line (0,− 1,1,0 standardised to )
Fig. 2Empirical power curves in simulated data: Curves generated using proportions of significant Wald tests in simulated data with all assumptions specified in the text satisfied. %PoE: proportion of phenotypic variance attributable to parent-of-origin effects in the model used to generate simulated data; Sample size: number of simulated individuals with parent-of-origin determined at all loci; Power: proportion of replicates with Wald tests having -values . Note that the number of effective loci in this figure is an order of magnitude lower than in Supplementary Figure S2
Absolute bias and variance for G-REMLadp variance components when HWE was met, for non-null simulated effects
| N |
|
|
| #Reps |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| 1000 | 0.017 | 0.017 | 0.017 | 280 | 1.0e−03 | 1.1e−03 | 1.5e−03 | 8.9e−04 | 1.2e−03 | 1.1e−03 |
| 1000 | 0.033 | 0.033 | 0.033 | 290 | − 1.1e−04 | 1.9e−03 | − 1.7e−03 | 1.8e−03 | − 2.0e−03 | 1.8e−03 |
| 1000 | 0.100 | 0.100 | 0.100 | 290 | 3.0e−03 | 7.1e−03 | 9.0e−04 | 5.7e−03 | 2.6e−03 | 6.3e−03 |
| 2000 | 0.017 | 0.017 | 0.017 | 300 | − 2.2e−03 | 2.9e−04 | − 6.5e−05 | 2.6e−04 | 4.2e−04 | 2.6e−04 |
| 2000 | 0.033 | 0.033 | 0.033 | 300 | − 2.7e−04 | 5.7e−04 | − 1.9e−03 | 4.9e−04 | 1.2e−03 | 6.3e−04 |
| 2000 | 0.100 | 0.100 | 0.100 | 300 | 3.7e−03 | 1.6e−03 | 1.6e−03 | 1.7e−03 | −1.1e− 03 | 1.6e−03 |
| 4000 | 0.017 | 0.017 | 0.017 | 300 | 5.0e−04 | 6.5e−05 | 1.2e−05 | 6.8e−05 | −8.0e−04 | 7.5e−05 |
| 4000 | 0.033 | 0.033 | 0.033 | 300 | 2.9e−05 | 1.7e−04 | −6.2e−04 | 1.5e−04 | − 8.0e−04 | 1.4e−04 |
| 4000 | 0.100 | 0.100 | 0.100 | 300 | − 2.6e−03 | 4.8e−04 | − 1.8e−03 | 4.4e−04 | 1.5e−03 | 4.5e−04 |
Proportion of phenotypic variance explained: by additive effects given in the column, by dominance effects given in the column, and by parent-of-origin effects given in the column. N: number of simulated phased genotypes, #Reps: number of simulated replications, “Bias” is the absolute bias of estimates across simulated replications, “Var” is variance of estimates across simulated replications
Absolute bias and variance for G-REMLadp variance components when HWE was violated, for non-null simulated effects
| Parent Corr | MAF Diff | N |
|
|
| #Reps |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.00 | 0.05 | 1000 | 0.017 | 0.017 | 0.017 | 300 | 1.7e−03 | 1.0e−03 | − 4.8e−04 | 9.7e−04 | 2.9e−03 | 1.0e−03 |
| 0.00 | 0.05 | 1000 | 0.033 | 0.033 | 0.033 | 300 | 1.1e−03 | 2.4e−03 | 1.1e−03 | 2.4e−03 | 6.6e−04 | 2.0e−03 |
| 0.00 | 0.05 | 1000 | 0.100 | 0.100 | 0.100 | 300 | − 1.0e−03 | 6.6e−03 | 2.9e−04 | 6.4e−03 | 3.1e−04 | 5.7e−03 |
| 0.00 | 0.05 | 2000 | 0.017 | 0.017 | 0.017 | 300 | − 1.4e−03 | 2.5e−04 | 3.5e−04 | 3.1e−04 | − 1.8e−05 | 2.8e−04 |
| 0.00 | 0.05 | 2000 | 0.033 | 0.033 | 0.033 | 300 | 9.4e−04 | 5.4e−04 | 1.6e−03 | 5.2e−04 | 3.0e−04 | 4.8e−04 |
| 0.00 | 0.05 | 2000 | 0.100 | 0.100 | 0.100 | 300 | 2.2e−03 | 1.6e−03 | − 5.2e−04 | 1.6e−03 | 6.9e−04 | 1.6e−03 |
| 0.00 | 0.05 | 4000 | 0.017 | 0.017 | 0.017 | 300 | 1.9e−04 | 7.2e−05 | − 6.7e−04 | 6.9e−05 | − 9.4e−04 | 8.1e−05 |
| 0.00 | 0.05 | 4000 | 0.033 | 0.033 | 0.033 | 300 | − 6.3e−04 | 1.5e−04 | 8.3e−04 | 1.8e−04 | − 8.2e−04 | 1.5e−04 |
| 0.00 | 0.05 | 4000 | 0.100 | 0.100 | 0.100 | 290 | − 2.1e−03 | 4.9e−04 | 1.7e−03 | 5.6e−04 | − 2.0e−04 | 4.3e−04 |
| 0.25 | 0.00 | 1000 | 0.017 | 0.017 | 0.017 | 300 | 6.1e−04 | 8.2e−04 | 1.5e−03 | 1.6e−04 | − 1.4e−03 | 1.1e−03 |
| 0.25 | 0.00 | 1000 | 0.033 | 0.033 | 0.033 | 290 | 1.9e−03 | 1.6e−03 | 1.9e−03 | 2.5e−04 | − 6.8e−03 | 2.0e−03 |
| 0.25 | 0.00 | 1000 | 0.100 | 0.100 | 0.100 | 290 | 1.7e−03 | 5.6e−03 | 2.1e−03 | 9.8e−04 | − 2.5e−02 | 8.3e−03 |
| 0.25 | 0.00 | 2000 | 0.017 | 0.017 | 0.017 | 300 | − 1.3e−04 | 2.0e−04 | − 3.5e−04 | 5.6e−05 | − 5.4e−03 | 2.8e−04 |
| 0.25 | 0.00 | 2000 | 0.033 | 0.033 | 0.033 | 300 | 2.0e−03 | 3.9e−04 | − 9.0e−05 | 1.0e−04 | − 6.9e−03 | 6.5e−04 |
| 0.25 | 0.00 | 2000 | 0.100 | 0.100 | 0.100 | 300 | 3.6e−03 | 1.5e−03 | 4.5e−03 | 3.8e−04 | − 2.5e−02 | 2.2e−03 |
| 0.25 | 0.00 | 4000 | 0.033 | 0.033 | 0.033 | 300 | 2.3e−04 | 1.2e−04 | 4.0e−04 | 4.6e−05 | − 7.1e−03 | 1.3e−04 |
| 0.25 | 0.00 | 4000 | 0.100 | 0.100 | 0.100 | 300 | 1.3e−03 | 3.6e−04 | 1.7e−03 | 1.4e−04 | − 2.3e−02 | 4.7e−04 |
| 0.25 | 0.05 | 1000 | 0.017 | 0.017 | 0.017 | 130 | − 4.1e−03 | 8.4e−04 | 3.5e−03 | 3.6e−04 | − 3.4e−03 | 1.3e−03 |
| 0.25 | 0.05 | 1000 | 0.033 | 0.033 | 0.033 | 160 | − 1.2e−03 | 1.7e−03 | 2.1e−04 | 5.6e−04 | − 1.5e−02 | 2.5e−03 |
| 0.25 | 0.05 | 1000 | 0.100 | 0.100 | 0.100 | 220 | − 9.1e−04 | 5.2e−03 | 7.1e−03 | 2.6e−03 | − 3.1e−02 | 8.2e−03 |
| 0.25 | 0.05 | 2000 | 0.017 | 0.017 | 0.017 | 280 | − 9.5e−04 | 1.9e−04 | 4.4e−05 | 1.1e−04 | − 4.7e−03 | 3.2e−04 |
| 0.25 | 0.05 | 2000 | 0.033 | 0.033 | 0.033 | 300 | 8.4e−05 | 3.8e−04 | 2.0e−03 | 2.8e−04 | − 8.6e−03 | 5.4e−04 |
| 0.25 | 0.05 | 2000 | 0.100 | 0.100 | 0.100 | 300 | 5.2e−03 | 1.4e−03 | 1.2e−03 | 7.4e−04 | − 2.1e−02 | 1.8e−03 |
| 0.25 | 0.05 | 4000 | 0.033 | 0.033 | 0.033 | 300 | − 2.9e−04 | 1.3e−04 | 1.5e−03 | 7.8e−05 | − 7.8e−03 | 1.6e−04 |
| 0.25 | 0.05 | 4000 | 0.100 | 0.100 | 0.100 | 300 | 2.2e−03 | 3.1e−04 | 3.5e−03 | 2.5e−04 | − 2.3e−02 | 4.8e−04 |
Parent Corr average correlation of parental genotypes, MAF Diff average difference in parental MAFs. Proportion of phenotypic variance explained by additive effects is given in the column, by dominance effects in the column, and by parent-of-origin effects in the column. N number of simulated phased genotypes, #Reps number of simulated replications, “Bias” is absolute bias of estimates across simulated replications, “Var” is variance of estimates across simulated replications
Large G-REMLadp parent of origin variance component estimates for five ALSPAC phenotypes
| Phenotype | VarA | SEvarA | VarD | SEvarD | VarP | SEvarP |
|---|---|---|---|---|---|---|
| Age at first tooth | 0.425 | 0.083 | 0.114 | 0.121 | 0.156 | 0.082 |
| Age at Menarche | 0.337 | 0.162 | 0.195 | 0.246 | 0.179 | 0.162 |
| DBP | 0.193 | 0.088 | − 0.076 | 0.132 | 0.114 | 0.088 |
| FVC | 0.370 | 0.097 | 0.017 | 0.136 | 0.159 | 0.097 |
| SBP | 0.313 | 0.088 | − 0.147 | 0.130 | 0.145 | 0.088 |
Variance component estimates of five standardized phenotypes in ALSPAC, estimates and standard errors generated with multiple components GCTA-GREML. VarA proportion of phenotypic variance attributable to additive genetic effects, SEvarA standard error of proportion of phenotypic variance attributable to additive genetic effects. VarD proportion of phenotypic variance attributable to dominance effects, SEvarD standard error of proportion of phenotypic variance attributable to dominance effects. VarP proportion of phenotypic variance attributable to parent-of-origin effects, SEvarP standard error of proportion of phenotypic variance attributable to parent-of-origin effects. DBP diastolic blood pressure. FVC forced vital capacity. SBP systolic blood pressure. See Supplementary Tables SIII for results on all phenotypes and SVII for unstandardized variance components
Fig. 3Histograms of the estimates of additive, dominance, and parent-of-origin variance components for 36 ALSPAC traits. Add Estimated proportions of phenotypic variance attributable to additive effects, Dom Estimated proportions of phenotypic variance attributable to dominance effects, PoE Estimated proportions of phenotypic variance attributable to parent-of-origin effects. The bin width is 0.025. To avoid biased variance estimates, they were not constrained to be positive; hence, for POEs, which the study was underpowered to detect, there are many negative estimates. The figure is intended to illustrate patterns in the variance component estimates for the phenotypes which happened to be included in this study; it would be inappropriate to consider these histograms as representative of the densities of different variance components for complex traits in general