| Literature DB >> 30940094 |
Miriam Gjerdevik1,2, Astanand Jugessur3,4,5, Øystein A Haaland3, Julia Romanowska3,6, Rolv T Lie3,5, Heather J Cordell7, Håkon K Gjessing3,5.
Abstract
BACKGROUND: Log-linear and multinomial modeling offer a flexible framework for genetic association analyses of offspring (child), parent-of-origin and maternal effects, based on genotype data from a variety of child-parent configurations. Although the calculation of statistical power or sample size is an important first step in the planning of any scientific study, there is currently a lack of software for genetic power calculations in family-based study designs. Here, we address this shortcoming through new implementations of power calculations in the R package Haplin, which is a flexible and robust software for genetic epidemiological analyses. Power calculations in Haplin can be performed analytically using the asymptotic variance-covariance structure of the parameter estimator, or else by a straightforward simulation approach. Haplin performs power calculations for child, parent-of-origin and maternal effects, as well as for gene-environment interactions. The power can be calculated for both single SNPs and haplotypes, either autosomal or X-linked. Moreover, Haplin enables power calculations for different child-parent configurations, including (but not limited to) case-parent triads, case-mother dyads, and case-parent triads in combination with unrelated control-parent triads.Entities:
Keywords: EMIM; Genome-wide association studies (GWAS); Haplin; Log-linear and multinomial models; Sample size estimation; Statistical power estimation
Mesh:
Year: 2019 PMID: 30940094 PMCID: PMC6444579 DOI: 10.1186/s12859-019-2727-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Genetic effects
| Effects | Description |
|---|---|
| Child | A variant allele may increase the risk of a disease only when carried by an individual himself/herself. We refer to this as a “child effect” since it is frequently estimated from the offspring in a case-parent triad. However, the individual referred to as a child might be of any age, depending on the phenotype of interest, and the same effect can also be estimated in case-control studies. |
| Parent-of-origin (PoO) | A PoO effect occurs if the effect of a variant allele in the child depends on whether it is inherited from the mother or the father. In statistical terms, we define a PoO effect as the interaction effect RRR=RR |
| Maternal | A mother’s genotype may influence fetal development directly, for example through maternal metabolic factors operating in utero [ |
Parameterization of penetrances
| Effects | Parameterization of penetrances | |
|---|---|---|
| Child |
| (1) |
| Parent-of-origin (PoO) |
| (2) |
| Child and maternal |
| (3) |
| PoO and maternal |
| (4) |
B is the baseline risk level, typically associated with the (more common) reference allele; RR is the risk increase associated with allele A, relative to B; RR and RR are the relative risks associated with allele A, depending on whether the allele is transmitted from the mother or the father; the double-dose parameter measures the deviation from what would be expected in a multiplicative dose-response relationship, i.e., when j=l and when j≠l; is the relative risk associated with allele A carried by the mother, and is the maternal double-dose parameter, interpreted analogously to . To ensure that the model is not overparameterized, we set RR=1 for the reference allele
Fig. 1A selection of designs for genetic association studies: a Case-parent triad (mfc); b Case-parent triad with independent control-parent triad (mfc-mfc); c Case-mother dyad (mc); d Case-mother dyad with independent control-mother dyad (mc-mc)
Examples of asymptotic power calculations in Haplin
| Effects | Haplin commands | Output |
|---|---|---|
| a) Child | hapPowerAsymp(cases = c(mfc=500), | $haplo.power |
| haplo.freq = c(0.8,0.2), | Haplotype RR.power | |
| RR = c(1,1.4)) | 1 ref | |
| 2 0.88 | ||
| b) PoO | hapPowerAsymp(cases = c(mfc=500), | $haplo.power |
| haplo.freq = c(0.8,0.2), | Haplotype RRcm.power RRcf.power RRcm_cf.power | |
| RRcm = c(1,2), RRcf = c(1,1.5)) | 1 ref ref ref | |
| 2 1 0.87 0.48 | ||
| c) Child and maternal | hapPowerAsymp(cases = c(mfc=500), | $haplo.power |
| haplo.freq = c(0.8,0.2), | Haplotype RR.power RRm.power | |
| RR = c(1,1.4), RR.mat = c(1,1.2)) | 1 ref ref | |
| 2 0.9 0.42 | ||
| d) PoO and maternal | hapPowerAsymp(cases = c(mfc=500), | $haplo.power |
| haplo.freq = c(0.8,0.2), | Haplotype RRcm.power RRcf.power RRcm_cf.power RRm.power | |
| RRcm = c(1,2), RRcf = c(1,1.5), | 1 ref ref ref ref | |
| RR.mat = c(1,1.2)) | 2 0.99 0.65 0.2 0.17 |
The power is calculated for a diallelic SNP, using 500 case-parent triads (cases = c(mfc=500)), and a MAF of 0.2 (haplo.freq = c(0.8,0.2)). The argument RR specifies the relative risk associated with the child effect, whereas the power to detect a PoO effect is calculated by replacing RR by the two relative risk arguments RRcm and RRcf, which refer to the parental origin of the allele carried by the child. Maternal effects can be included by adding the maternal relative risk parameter RR.mat to the original child or PoO command. Note that the order of alleles to which the relative risk parameters refer corresponds to the order used for the haplotype frequencies. Here, the less frequent allele is set as the risk allele and the more frequent allele is used as reference. The nominal significance level defaults to 0.05, but other values can be specified by the argument alpha
Fig. 2Power analysis using the Haplin function hapPowerAsymp. a Child effects for varying numbers of case-parent triads, using a MAF of 0.2; b Child effects for varying values of MAFs, using a total of 500 case-parent triads; c PoO effects for varying numbers of case-parent triads, using a MAF of 0.2; d PoO effects for varying values of MAFs, using a total of 500 case-parent triads. For the PoO effects, RR=1, so that the value of RR/RR is equal to RR. A nominal significance level of 0.05 was used throughout. The power was calculated at relative risks/relative risk ratios of 1,1.05,1.10,…,2. Intermediate values correspond to line segments joining two adjacent points
Fig. 3Comparison of the asymptotic power calculations with the power attained by Haplin and EMIM in data simulations. The power was calculated for different child-parent configurations, assuming a MAF of 0.2 and a nominal significance level of 0.05. The results were based on 500 case families and, when applicable, 500 unrelated control families. All simulations were based on 10,000 replicates of data for a single SNP. Asymp: Power calculations in Haplin, based on asymptotic approximations (Haplin function hapPowerAsymp); Haplin: Power calculations in Haplin, based on data simulations. The power is the proportion of tests rejected by Haplin (Haplin functions hapRun and hapPower); EMIM: Power calculations based on data simulations in Haplin (Haplin function hapSim). The power is the proportion of tests rejected by EMIM. a Child effects (RR >1); b Child effects, adjusting for maternal effects (RR >1 and RR(=1); c PoO effects (RR/RR>1 and RR=1); d PoO effects, adjusting for maternal effects (RR/RR>1 and RR=RR(=1); e Maternal effects, adjusting for child effects (RR(>1 and RR = 1); f Maternal effects, adjusting for PoO effects (RR(>1 and RR=RR=1). The power was calculated at relative risks/relative risk ratios of 1,1.1,1.2,…,2. Intermediate values correspond to line segments joining two adjacent points. Note that for all study designs, the power was calculated based on asymptotic approximations in Haplin, as well as simulations where both Haplin and EMIM were used to analyze the genetic data. The lines for Asymp, Haplin and EMIM are nearly overlapping, demonstrating consistent results