| Literature DB >> 35069690 |
Richard F Oppong1,2, Thibaud Boutin3, Archie Campbell4, Andrew M McIntosh5, David Porteous4, Caroline Hayward3, Chris S Haley3,6, Pau Navarro3, Sara Knott2.
Abstract
We describe a genome-wide analytical approach, SNP and Haplotype Regional Heritability Mapping (SNHap-RHM), that provides regional estimates of the heritability across locally defined regions in the genome. This approach utilises relationship matrices that are based on sharing of SNP and haplotype alleles at local haplotype blocks delimited by recombination boundaries in the genome. We implemented the approach on simulated data and show that the haplotype-based regional GRMs capture variation that is complementary to that captured by SNP-based regional GRMs, and thus justifying the fitting of the two GRMs jointly in a single analysis (SNHap-RHM). SNHap-RHM captures regions in the genome contributing to the phenotypic variation that existing genome-wide analysis methods may fail to capture. We further demonstrate that there are real benefits to be gained from this approach by applying it to real data from about 20,000 individuals from the Generation Scotland: Scottish Family Health Study. We analysed height and major depressive disorder (MDD). We identified seven genomic regions that are genome-wide significant for height, and three regions significant at a suggestive threshold (p-value < 1 × 10-5) for MDD. These significant regions have genes mapped to within 400 kb of them. The genes mapped for height have been reported to be associated with height in humans. Similarly, those mapped for MDD have been reported to be associated with major depressive disorder and other psychiatry phenotypes. The results show that SNHap-RHM presents an exciting new opportunity to analyse complex traits by allowing the joint mapping of novel genomic regions tagged by either SNPs or haplotypes, potentially leading to the recovery of some of the "missing" heritability.Entities:
Keywords: MDD; genome-wide analysis; haplotypes; height; missing heritability; rare variation; regional heritability mapping
Year: 2022 PMID: 35069690 PMCID: PMC8770330 DOI: 10.3389/fgene.2021.791712
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1A Schema outlying SNHap-RHM.
FIGURE 2Plots of Likelihood ratio test (LRT) statistics at each QTL locus and five regions either side averaged for the 20 simulations of each of the five QTL phenotypes. Plot (A) is SNP QTL phenotypes analysed using the SNP-RHM and plot (B) is the haplotype QTL phenotypes analysed using the Hap-RHM. Both models can capture the simulated QTL effects for their respective SNP and haplotype phenotypes.
FIGURE 3Plots of average LRT statistics over replicates of QTL loci across the chromosomes for the 20 simulations of each of the two SNP QTL phenotypes. The red dashed lines are genome-wide significance threshold (for 48,772 regions) and the black dashed lines are Bonferroni significance threshold (for 220 regions). The upper plot (A) is the 1-SNP QTL phenotype, and the lower plot (B) is the multiple SNP QTL phenotype. The two phenotypes are analysed using both the SNP based model (SNP-RHM) (blue points) and the Haplotype based model (Hap-RHM) (red points). The Hap-RHM fails to capture the simulated effects for the SNP QTLs.
FIGURE 4Joint analysis of the SNP and haplotype phenotypes using SNHap-RHM. The plot is an analysis of one replicate of each of the simulated phenotypes. The LRT statistics are plotted over QTL loci across the chromosomes. The red dashed lines are genome-wide significance threshold (for 48,772 regions) and the black dashed lines are Bonferroni significance threshold (for 220 regions).
FIGURE 5The genome-wide evidence of haplotype block association for height. Analysis done with SNHap-RHM, SNP-RHM and Hap-RHM. The points are plots of −log10 of the p-values of regions tested with the LRT for the regional GREML analyses. The green lines are the Bonferroni-corrected genome-wide significance threshold and the red lines are the suggestive significance threshold calculated to be p-value < 1 × 10−5. The top association hits at p-value < 5 × 10−5 with genes located within the region are highlighted in blue for SNP-RHM and red for the Hap-RHM.
FIGURE 6The genome-wide evidence of haplotype block association for Major Depressive Disorder. Analysis done with SNHap-RHM, SNP-RHM and Hap-RHM. The points are plots of −log10 of the p-values of regions tested with the LRT for the regional GREML analyses. The green lines are the Bonferroni-corrected genome-wide significance threshold and the red lines are the suggestive significance threshold calculated to be p-value < 1 × 10−5. The top association hits at p-value < 5 × 10−5 with genes located within the region are highlighted in blue for SNP-RHM and red for the Hap-RHM.
SNP-based association test of MDD in the MYRIP gene region.
| SNP information | Major depressive disorder association | ||||||
|---|---|---|---|---|---|---|---|
| SNP ID | Chr | Pos | MAF | OR | Log (OR) | SE (logOR) |
|
| rs9842160 | 3 | 39844703 | 0.14 | 0.97 | −0.030 | 0.013 | 0.02 |
| rs9858242 | 3 | 39847606 | 0.19 | 1.02 | 0.025 | 0.011 | 0.03 |
| rs1599902 | 3 | 39954674 | 0.41 | 1.02 | 0.019 | 0.009 | 0.04 |
| rs7618607 | 3 | 39947936 | 0.41 | 1.02 | 0.019 | 0.009 | 0.04 |
| rs9860916 | 3 | 39944942 | 0.41 | 1.02 | 0.019 | 0.009 | 0.04 |
The columns are the SNP ID, chromosome, genome position of SNP, minor allele frequency, odds ratio, log of odds ratio, standard error of log odds ratio and association p-value.
Comparison of SNPs within significant regions identified by both models and published GWAS results for height and MDD.
| Trait | Number of SNPS | Number of overlapping SNPS | ||||
|---|---|---|---|---|---|---|
| SNP-RHM | Hap-RHM | pubGWAS | SNP-RHM & Hap-RHM | SNP-RHM & pubGWAS | Hap-RHM & pubGWAS | |
| Height | 1,380 | 45 | 4,960 | 0 | 57 | 0 |
| MDD | 78 | 495 | 1,815 | 0 | 0 | 0 |
The columns are the name of trait, number of SNPS in regions identified by SNP-RHM and HAP-RHM with p-value < 5 × 10−5 and SNPS in published GWAS (pubGWAS) for the traits, and the number of SNPS overlapping between the three.