| Literature DB >> 36140810 |
Xiaodian Cai1, Jinyan Teng1, Duanyang Ren1, Hao Zhang1, Jiaqi Li1, Zhe Zhang1.
Abstract
Heritability enrichment analysis is an important means of exploring the genetic architecture of complex traits in human genetics. Heritability enrichment is typically defined as the proportion of an SNP subset explained heritability, divided by the proportion of SNPs. Heritability enrichment enables better study of underlying complex traits, such as functional variant/gene subsets, biological networks and metabolic pathways detected through integrating explosively increased omics data. This would be beneficial for genomic prediction of disease risk in humans and genetic values estimation of important economical traits in livestock and plant species. However, in livestock, factors affecting the heritability enrichment estimation of complex traits have not been examined. Previous studies on humans reported that the frequencies, effect sizes, and levels of linkage disequilibrium (LD) of underlying causal variants (CVs) would affect the heritability enrichment estimation. Therefore, the distribution of heritability across the genome should be fully considered to obtain the unbiased estimation of heritability enrichment. To explore the performance of different heritability enrichment models in livestock populations, we used the VanRaden, GCTA and α models, assuming different α values, and the LDAK model, considering LD weight. We simulated three types of phenotypes, with CVs from various minor allele frequency (MAF) ranges: genome-wide (0.005 ≤ MAF ≤ 0.5), common (0.05 ≤ MAF ≤ 0.5), and uncommon (0.01 ≤ MAF < 0.05). The performances of the models with two different subsets (one of which contained known CVs and the other consisting of randomly selected markers) were compared to verify the accuracy of heritability enrichment estimation of functional variant sets. Our results showed that models with known CV subsets provided more robust enrichment estimation. Models with different α values tended to provide stable and accurate estimates for common and genome-wide CVs (relative deviation 0.5-2.2%), while tending to underestimate the enrichment of uncommon CVs. As the α value increased, enrichments from 15.73% higher than true value (i.e., 3.00) to 48.93% lower than true value for uncommon CVs were observed. In addition, the long-range LD windows (e.g., 5000 kb) led to large bias of the enrichment estimations for both common and uncommon CVs. Overall, heritability enrichment estimations were sensitive for the α value assumption and LD weight consideration of different models. Accuracy would be greatly improved by using a suitable model. This study would be helpful in understanding the genetic architecture of complex traits and provides a reference for genetic analysis in the livestock population.Entities:
Keywords: LD; complex trait; genetic architecture; heritability enrichment; livestock
Mesh:
Year: 2022 PMID: 36140810 PMCID: PMC9498849 DOI: 10.3390/genes13091644
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
The attributes for each type of simulated phenotype.
| Type of Phenotype | MAF of Causal Variants a | Number of Causal Variants b | True Enrichment Fold c | |
|---|---|---|---|---|
| Functional Subset | Non-Functional Subset | |||
| T1 (genome-wide) | (0.005, 0.5) | 2000 | 3.000 | 1.000 |
| T2 (common) | (0.05, 0.5) | 3.000 | 1.000 | |
| T3 (uncommon) | (0.01, 0.05) | 3.000 | 1.000 | |
a The minor allele frequency range of causal variants; b The number of markers that control the phenotype; c True enrichment values for different subsets.
Figure 1Comparison of heritability enrichment estimation models. Showing the estimated enrichment for different models integrated with (A) “Functional subset” or (B) “Non-functional subset”. Colors represent the type of phenotype. The dotted lines represent the true enrichments (i.e., 3.000 and 1.000, respectively).
Figure 2Comparison of performance of different α values for α-model. Showing the estimated enrichments of different α values for α-models integrated with (A) “Functional subset” or (B) “Non-functional subset”. The different color and dotted lines are explained as in Figure 1.
Figure 3Comparison of different LD window settings for the LDAK model. The estimated enrichment of different LD window settings for the LDAK model integrated with “Functional subset”. The different colors and dotted line are explained as in Figure 1.
Figure 4Performance of enrichment model with (LDAK) and without (α-model) LD weights. Comparing the difference between the real enrichment and the estimated enrichment of the model with and without LD weights. Colors represent the existence of LD weight for models. The dotted line represents the true enrichment.