Literature DB >> 25519517

Regional Heritability Mapping to identify loci underlying genetic variation of complex traits.

Valentina Riggio1, Ricardo Pong-Wong1.   

Abstract

BACKGROUND: Genome-wide association studies can have limited power to identify QTL, partly due to the stringent correction for multiple testing and low linkage-disequilibrium between SNPs and QTL. Regional Heritability Mapping (RHM) has been advanced as an alternative approach to capture underlying genetic effects. In this study, RHM was used to identify loci underlying variation in the 16(th) QTLMAS workshop simulated traits.
METHODS: The method was implemented by fitting a mixed model where a genomic region and the overall genetic background were added as random effects. Heritabilities for the genetic regional effects were estimated, and the presence of a QTL in the region was tested using a likelihood ratio test (LRT). Several region sizes were considered (100, 50 and 20 adjacent SNPs). Bonferroni correction was used to calculate the LRT thresholds for genome-wide (p < 0.05) and suggestive (i.e., one false positive per genome scan) significance.
RESULTS: Genomic heritabilities (0.31, 0.32 and 0.48, respectively) and genetic correlations (0.80, -0.42 and 0.19, between trait-pairs 1&amp;2, 1&amp;3 and 2&amp;3) were similar to the simulated ones. RHM identified 7 QTL (4 at genome-wide and 3 at suggestive level) for Trait1; 4 (2 genome-wide and 2 suggestive) for Trait2; and 7 (6 genome-wide and 1 suggestive) for Trait3. Only one of the identified suggestive QTL was a false-positive. The position of these QTL tended to coincide with the position where the largest QTL (or several of them) were simulated. Several signals were detected for the simulated QTL with smaller effect. A combined analysis including all significant regions showed that they explain more than half of the total genetic variance of the traits. However, this might be overestimated, due to Beavis effect. All QTL affecting traits 1&amp;2 and 2&amp;3 had positive correlations, following the trend of the overall correlation of both trait-pairs. All but one QTL affecting traits 1&amp;3 were negatively correlated, in agreement with the simulated situation. Moreover, RHM identified extra loci that were not found by association and linkage analysis, highlighting the improved power of this approach.
CONCLUSIONS: RHM identified the largest QTL among the simulated ones, with some signals for the ones with small effect. Moreover, RHM performed better than association and linkage analysis, in terms of both power and resolution.

Entities:  

Year:  2014        PMID: 25519517      PMCID: PMC4195407          DOI: 10.1186/1753-6561-8-S5-S3

Source DB:  PubMed          Journal:  BMC Proc        ISSN: 1753-6561


Background

Genome-wide association studies (GWAS) have generally failed to explain most of the known genetic variation influencing complex diseases [1]. This is partly due to the stringent correction for multiple testing and low linkage-disequilibrium (LD) between SNPs and QTL. Attempts to increase the power of GWAS have focused on increasing either the number of markers or the number of observations per trait. An alternative approach exploiting dense SNP chip data, known as Regional Heritability Mapping (RHM) [2], has been advanced as a better approach to capture more of the underlying genetic effects. This method provides heritability estimates attributable to small genomic regions, and it has the power to detect regions containing multiple alleles that individually contribute too little variance to be detected by GWAS. The aim of this study was to identify QTL affecting the three traits simulated in the 16th QTL-MAS workshop dataset and recover their possible pleiotropic actions, using RHM.

Methods

1. Dataset

The dataset, provided by the 16th QTLMAS workshop organisers, consisted of 3,000 individuals, all females, from three generations (G1-G3); all were genotyped for about 10,000 SNPs on five chromosomes of equal length (99.95 Mb each). The phenotypes (Trait1, Trait2, and Trait3) resembled three milk production traits, given as individual yield deviations, and generated in order to mimic two yields and the corresponding content.

2. QTL mapping analysis

The implementation of RHM is described in [2]. RHM is related to interval mapping methodology, using variance component approach [3]. Basically, RHM is a mixed model where the effect of a genomic region (attributable to the QTL within the region in question) plus the overall genetic background were added as random, with covariance structure proportional to the genetic relationship matrix calculated using genotype information. The relationship matrix modelling the overall genetic background was estimated using all SNPs, whereas the one for the region was estimated using the SNPs falling within that region. Heritabilities for the genetic regional effects were estimated [4], and the presence of a QTL in the region was tested using a likelihood ratio test (LRT). Several region sizes were considered (i.e. 100, 50 and 20 adjacent SNPs), and the regions shifted every 10 SNPs. After Bonferroni correction, the LRT thresholds for genome-wide (p < 0.05) and suggestive (i.e., one false positive per genome scan) significance levels were 10.83 and 6.64 (corresponding to -log10(p) of 3.30 and 2.00), 12.12 and 7.88 (-log10(p) of 3.60 and 2.30), and 13.83 and 9.55 (-log10(p) of 4.00 and 2.70) for the three region sizes, respectively. RHM results were compared with association and linkage analysis results, in order to assess its potential use as a tool for QTL mapping. The linkage analysis was implemented in GridQTL [5], studying the segregation of the paternal allele; the association analysis, using the GRAMMAR approach [6], which comprises two steps: first, phenotypes were adjusted for the polygenic effects and second, residuals were fitted against each SNP using additive model as implemented in GenABEL [7]. When a QTL was found significant for more than one trait, correlations between regional EBVs were estimated to evaluate possible pleiotropic effects among traits.

Results and discussion

Heritabilities obtained using the genomic relationship matrix were 0.31, 0.32 and 0.48 for Trait1, 2 and 3, respectively and similar to those estimated with a pedigree-based relationship matrix (0.38, 0.39 and 0.49, respectively) and those simulated (0.36, 0.35 and 0.52, respectively). Genetic correlations estimated with the genomic relationship matrix were 0.80 (Trait 1&2), -0.43 (Trait 1&3) and 0.19 (Trait 2&3), and similar to those estimated with a pedigree-based relationship matrix (0.83, -0.42 and 0.14, respectively), and those simulated (0.80, -0.43 and 0.17, respectively). The results from RHM were in general consistent across the three region sizes tested, hence, we will concentrate on the results with 20 SNPs per region. Figure 1 shows the Manhattan plot for the analyses of Trait1, 2 and 3 (a, b, and c, respectively). RHM identified 7 QTL (4 at genome-wide level and 3 at suggestive level) for Trait1; 4 (2 genome-wide and 2 suggestive) for Trait2; and 7 (6 genome-wide and 1 suggestive) for Trait3. Only one of the identified QTL, significant at the suggestive level, was a false-positive. The position of these QTL tended to coincide with the position where the largest QTL (or several of them) were simulated. RHM did not identify regions harbouring QTL with small effect, which is attributable to the size of the data. Nevertheless, several signals were detected, although they did not reach significance.
Figure 1

RHM results for Trait1 (a), Trait2 (b) and Trait3 (c) using 20 SNP region size. Genome-wide p < 0.05 and suggestive thresholds are shown (solid lines). Simulated QTL are also shown, with arrows whose size is proportional to the effect of the QTL.

RHM results for Trait1 (a), Trait2 (b) and Trait3 (c) using 20 SNP region size. Genome-wide p < 0.05 and suggestive thresholds are shown (solid lines). Simulated QTL are also shown, with arrows whose size is proportional to the effect of the QTL. When comparing RHM results with association and linkage analysis results, all three methods were successful in identifying the larger QTL and with some extend the other ones (Additional Files 1 to 3). However, for the QTL with smaller effect RHM performed better than both association and linkage analysis, i.e. more power than association and more resolution than linkage. Table 1 shows the heritability (h2reg) for all significant regions and the proportion of genetic variance explained when all significant QTL were simultaneously fitted into the model together with a genomic effect (to capture genetic variance not explained by the fitted QTL). The sum of all genetic variances from the joint analysis showed some discrepancies from the one observed in the analysis including one single genomic effect (especially with Trait3). Our results show that these regions explain more than half of the total genetic variance of the traits in question, suggesting that the undetected QTL explain a small proportion of genetic variance. However, the estimated variance might be inflated, due to Beavis effect [8], arising from small sample size.
Table 1

Regional heritability (h2reg) for regions significant both at the genomic level (p < 0.05) and at the suggestive level and proportion of genetic variance explained for the three traits with 20 SNP region size.

Heritability (h2reg)
ChromosomeRegionPosition (Kb)Trait1Trait2Trait3

13014500-154500.020.030.03
11758000-589500.030.05
16984000-849500.020.05

215979000-799500.01

352000-29500.030.03
7436500-374500.05

42411500-124500.01f.p.
4924000-249500.070.08
17185000-859500.010.01

513165000-659500.010.01

Sum of h2reg0.170.150.23
Remaining heritability0.140.140.16
Proportion of genetic variance explained by mapped QTL55%52%59%

f.p. False-positive

Regional heritability (h2reg) for regions significant both at the genomic level (p < 0.05) and at the suggestive level and proportion of genetic variance explained for the three traits with 20 SNP region size. f.p. False-positive Examination of the regional EBVs showed that some QTL have pleiotropic effects among traits. Genetic correlations (i.e., correlations between regional EBVs) between the regions in common across the three traits are in Table 2. All QTL affecting traits 1&2 and 2&3 had positive correlations, following the trend of the overall correlation of both trait-pairs. All but one QTL affecting traits 1&3 were negatively correlated. Our results were however in agreement with the simulated ones.
Table 2

Genetic correlations for regions which were found to affect more than one trait and overall genetic correlations.

Genetic correlations

ChromosomeRegionPosition (Kb)Trait 1&2Trait 1&3Trait 2&3
13014500-154500.880.290.70
11758000-58950-0.87
16984000-84950-0.89
352000-29500.63
44924000-249500.97
17185000-85950-0.85
513165000-659500.93

Overall correlation0.80-0.430.19
Genetic correlations for regions which were found to affect more than one trait and overall genetic correlations.

Conclusions

RHM identified the largest QTL among the simulated ones. Moreover, for the ones with smaller effect, several signals were detected, although they did not reach significance. In general RHM identified extra loci that were not found by association and linkage analysis, highlighting the improved power of this approach.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

VR carried out the analyses and drafted the manuscript; RPW implemented the method; both authors contributed to the design of the study and the final version of the manuscript.

Additional file 1

Comparison among RHM, association and linkage analysis results for Trait1. Click here for file

Additional file 2

Comparison among RHM, association and linkage analysis results for Trait2. Click here for file

Additional file 3

Comparison among RHM, association and linkage analysis results for Trait3. Click here for file
  5 in total

1.  Mapping quantitative trait loci in complex pedigrees: a two-step variance component approach.

Authors:  A W George; P M Visscher; C S Haley
Journal:  Genetics       Date:  2000-12       Impact factor: 4.562

2.  GenABEL: an R library for genome-wide association analysis.

Authors:  Yurii S Aulchenko; Stephan Ripke; Aaron Isaacs; Cornelia M van Duijn
Journal:  Bioinformatics       Date:  2007-03-23       Impact factor: 6.937

3.  Genomewide rapid association using mixed model and regression: a fast and simple method for genomewide pedigree-based quantitative trait loci association analysis.

Authors:  Yurii S Aulchenko; Dirk-Jan de Koning; Chris Haley
Journal:  Genetics       Date:  2007-07-29       Impact factor: 4.562

Review 4.  Finding the missing heritability of complex diseases.

Authors:  Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher
Journal:  Nature       Date:  2009-10-08       Impact factor: 49.962

5.  Localising loci underlying complex trait variation using Regional Genomic Relationship Mapping.

Authors:  Yoshitaka Nagamine; Ricardo Pong-Wong; Pau Navarro; Veronique Vitart; Caroline Hayward; Igor Rudan; Harry Campbell; James Wilson; Sarah Wild; Andrew A Hicks; Peter P Pramstaller; Nicholas Hastie; Alan F Wright; Chris S Haley
Journal:  PLoS One       Date:  2012-10-15       Impact factor: 3.240

  5 in total
  5 in total

1.  An efficient unified model for genome-wide association studies and genomic selection.

Authors:  Hengde Li; Guosheng Su; Li Jiang; Zhenmin Bao
Journal:  Genet Sel Evol       Date:  2017-08-24       Impact factor: 4.297

2.  Genome-Wide Association and Genomic Selection for Resistance to Amoebic Gill Disease in Atlantic Salmon.

Authors:  Diego Robledo; Oswald Matika; Alastair Hamilton; Ross D Houston
Journal:  G3 (Bethesda)       Date:  2018-03-28       Impact factor: 3.154

3.  Regional Heritability Mapping Provides Insights into Dry Matter Content in African White and Yellow Cassava Populations.

Authors:  Uche Godfrey Okeke; Deniz Akdemir; Ismail Rabbi; Peter Kulakow; Jean-Luc Jannink
Journal:  Plant Genome       Date:  2018-03       Impact factor: 4.089

4.  The potential of shifting recombination hotspots to increase genetic gain in livestock breeding.

Authors:  Serap Gonen; Mara Battagin; Susan E Johnston; Gregor Gorjanc; John M Hickey
Journal:  Genet Sel Evol       Date:  2017-07-04       Impact factor: 4.297

5.  A Low Resolution Epistasis Mapping Approach To Identify Chromosome Arm Interactions in Allohexaploid Wheat.

Authors:  Nicholas Santantonio; Jean-Luc Jannink; Mark Sorrells
Journal:  G3 (Bethesda)       Date:  2019-03-07       Impact factor: 3.154

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.