| Literature DB >> 31974095 |
Xinzhu Zhou1, Celine L St Pierre2, Natalia M Gonzales3, Jennifer Zou4, Riyan Cheng5, Apurva S Chitre5, Greta Sokoloff6, Abraham A Palmer7,8.
Abstract
There has been extensive discussion of the "Replication Crisis" in many fields, including genome-wide association studies (GWAS). We explored replication in a mouse model using an advanced intercross line (AIL), which is a multigenerational intercross between two inbred strains. We re-genotyped a previously published cohort of LG/J x SM/J AIL mice (F34; n = 428) using a denser marker set and genotyped a new cohort of AIL mice (F39-43; n = 600) for the first time. We identified 36 novel genome-wide significant loci in the F34 and 25 novel loci in the F39-43 cohort. The subset of traits that were measured in both cohorts (locomotor activity, body weight, and coat color) showed high genetic correlations, although the SNP heritabilities were slightly lower in the F39-43 cohort. For this subset of traits, we attempted to replicate loci identified in either F34 or F39-43 in the other cohort. Coat color was robustly replicated; locomotor activity and body weight were only partially replicated, which was inconsistent with our power simulations. We used a random effects model to show that the partial replications could not be explained by Winner's Curse but could be explained by study-specific heterogeneity. Despite this heterogeneity, we performed a mega-analysis by combining F34 and F39-43 cohorts (n = 1,028), which identified four novel loci associated with locomotor activity and body weight. These results illustrate that even with the high degree of genetic and environmental control possible in our experimental system, replication was hindered by study-specific heterogeneity, which has broad implications for ongoing concerns about reproducibility.Entities:
Keywords: GWAS; MPP; Mouse Genetics; Multiparent Advanced Generation Inter-Cross (MAGIC); Replication; multiparental; populations
Mesh:
Substances:
Year: 2020 PMID: 31974095 PMCID: PMC7056977 DOI: 10.1534/g3.119.400763
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Minor allele frequency (MAF) distributions for F34 array, F34 GBS, F39-F43 GBS, and F34 and F39-F43 GBS SNP sets. The average MAF of those 4,593 array SNPs was 0.388; the average MAF of the 60,392 GBS-derived SNPs in 428 F34 AIL mice was 0.382; the average MAF of the 59,790 GBS-derived SNPs in 600 F39-43 AIL mice was 0.358; the average MAF of the 58,461 GBS-derived SNPs that existed in both F34 and F39-43 AIL mice was 0.370 (Table S2). MAF distributions are highly comparable between AIL generations.
Figure 2Significant loci on chromosome 17 for open field, distance traveled in periphery in F34 AIL. As exemplified in this pair of LocusZoom plots, GBS SNPs defined the boundaries of the loci much more precisely than array SNPs. GBS SNPs that are in high LD (r2 > 0.8, red dots) with lead SNP chr17:27130383 resides between 27 ∼28.3 Mb. In contrast, too few SNPs are present in the array plot to draw any definitive conclusion about the boundaries or LD pattern in this region. Purple track shows the credible set interval. LocusZoom plots for all loci identified in this paper are in Figure S8.
Replication of significant SNPs between F34 and F39-43 AIL association analyses. “Discovery set” indicates the AIL generation that significant SNPs were identified. “Replication set” shows the association p-value, β estimates, etc. of the “discovery set” significant SNPs in the replication AIL generation. SNPs that replicated (p < 0.05, same sign for the beta) between F34 and F39-43 are in bold italics, SNPs that replicated at the genome-wide threshold (see Table S5) are bold, italic and underlined. Genetic correlations (rG) for phenotypes measured in both F34 and F39-43 are listed (see also Table S11)
| Discovery set | Replication set | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phenotype | rG(s.e.) | SNP | p | -log10(p) | af | beta | se | p | -log10(p) | af | beta | se |
| F34 GBS | F3943 GBS replicate | |||||||||||
| Body weight | 0.711(0.25)* | 0.419 | −0.25 | 0.05 | 0.406 | −0.13 | 0.04 | |||||
| 0.497 | 0.21 | 0.05 | 0.518 | 0.09 | 0.04 | |||||||
| 0.514 | −0.20 | 0.04 | 0.566 | −0.10 | 0.04 | |||||||
| Coat color, albino | 0.967(0.04)* | 0.432 | −0.58 | 0.02 | 0.388 | −0.61 | 0.02 | |||||
| Coat color, agouti | 0.971(0.04)* | 0.129 | 0.94 | 0.01 | 0.207 | 0.72 | 0.03 | |||||
| Locomotor test day 1, total distance traveled in 30min | 0.968(0.24)* | chr19.21812298 | 3.98×10−7 | 6.40 | 0.461 | −0.36 | 0.07 | 4.55×10−1 | 0.342 | 0.502 | −0.05 | 0.06 |
| Locomotor test day2, total distance traveled in 30min | 0.988(0.19)* | chr8.17410225 | 5.65×10−6 | 5.248 | 0.171 | 0.42 | 0.09 | 8.34×10−1 | 0.079 | 0.202 | 0.02 | 0.08 |
| Body weight | 0.711(0.25)* | chr1.89192209 | 6.42×10−6 | 5.19 | 0.22 | 0.22 | 0.05 | 5.16×10−2 | 1.29 | 0.276 | 0.10 | 0.05 |
| 0.658 | −0.22 | 0.04 | 0.575 | −0.19 | 0.05 | |||||||
| Coat color, albino | 0.967(0.04)* | 0.389 | −0.62 | 0.02 | 0.444 | −0.57 | 0.02 | |||||
| Coat color, agouti | 0.971(0.04)* | 0.218 | 0.74 | 0.02 | 0.135 | 0.90 | 0.01 | |||||
| Locomotor test day 2, total distance traveled in 30min | 0.988(0.19)* | chr15.67627183 | 3.33×10−6 | 5.478 | 0.461 | 0.30 | 0.06 | 2.07×10−1 | 0.683 | 0.522 | −0.08 | 0.07 |
Predicted replication rates. We applied the replication analysis to phenotypes with at least two genome-wide significant variants in the discovery study. These phenotypes include body weight, albino coat color, agouti coat color, locomotor test day 1, and locomotor test day 2. We computed the true replication rate as the fraction of variants that were genome-wide significant in the discovery study that also passed the Bonferroni significance threshold in the replication study (“Empirical replication rate”). The model accounting for Winner’s Curse and confounding (“Predicted replication rate WC+C”) explains the true replication rate more accurately than the model accounting for only Winner’s Curse (“Predicted replication rate WC”)
| Discovery set | Replication set | Phenotype | Empirical replication rate | Predicted replication rate (WC) | Predicted replication rate (WC+C) |
|---|---|---|---|---|---|
| Body weight | 0.009 | 1.000 | 0.044 | ||
| Coat color, albino | 1.000 | 1.000 | 0.997 | ||
| Coat color, agouti | 0.932 | 1.000 | 0.577 | ||
| Locomotor test day 1 | 0.000 | 1.000 | 0.028 | ||
| Locomotor test day 2 | 0.000 | 1.000 | 0.140 | ||
| Body weight | 0.297 | 1.000 | 0.071 | ||
| Coat color, albino | 0.911 | 1.000 | 0.932 | ||
| Coat color, agouti | 0.815 | 1.000 | 0.925 | ||
| Locomotor test day 2 | 0.000 | 1.000 | 0.053 |
Figure 3SNP-heritability estimates in F34 and F39-43 AILs. Square dots represent the SNP heritability estimated by the GCTA-GREML analysis (Yang ). The whiskers flanking the square dots show the ± 1 × standard error of the heritability estimate. All heritability estimates are highly significant (p < 1.0×10−05; see Table S12).
Figure 4Manhattan plots comparing F34 GBS, F39-43 GBS, and mega-analysis on locomotor day 1 test using 57,170 shared SNPs in all AIL generations. We performed mega-analysis of F34 and F39-43 animals (n = 1,028) for body weight, coat color, and locomotor activity, the set of traits that were measured in the same way in both cohorts.