| Literature DB >> 33990588 |
Ting Li1, Zheng Ning1,2, Zhijian Yang1, Ranran Zhai1, Chenqing Zheng1, Wenzheng Xu1, Yipeng Wang1, Kejun Ying1,3,4, Yiwen Chen1,5, Xia Shen6,7,8.
Abstract
Quantifying the overall magnitude of every single locus' genetic effect on the widely measured human phenome is of great challenge. We introduce a unified modelling technique that can consistently provide a total genetic contribution assessment (TGCA) of a gene or genetic variant without thresholding genetic association signals. Genome-wide TGCA in five UK Biobank phenotype domains highlights loci such as the HLA locus for medical conditions, the bone mineral density locus WNT16 for physical measures, and the skin tanning locus MC1R and smoking behaviour locus CHRNA3 for lifestyle. Tissue-specificity investigation reveals several tissues associated with total genetic contributions, including the brain tissues for mental health. Such associations are driven by tissue-specific gene expressions, which share genetic basis with the total genetic contributions. TGCA can provide a genome-wide atlas for the overall genetic contributions in each particular domain of human complex traits.Entities:
Year: 2021 PMID: 33990588 PMCID: PMC8121943 DOI: 10.1038/s41467-021-23124-w
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Representative simulation results of total genetic contribution assessment (TGCA) under different true models.
200 independent Z-scores for a single genetic variant were drawn from a mixture distribution with (1) two non-null Gaussian components , (2) one non-null Gaussian component 0.5N(0, 1) + 0.5N(μ+, σ2), (3) three non-null Gaussian components , and (4) two non-null heavy-tailed t-distributed components 0.25[t(1) + μ−] + 0.5N(0, 1) + 0.25[t(1) + μ+], respectively. The simulation was repeated for 999 times. In each simulation, the negative effect size μ− was randomly drawn from −∣N(1, 1)∣, the positive effect size(s) μ+ and μ++ were drawn from ∣N(1, 1)∣, and the σ2 parameters from χ2(1). The y-axis compares the estimated and the relevant parameters with the true values. Source data are provided as a Source Data file.
Fig. 2Genome-wide total genetic contribution assessment (TGCA) and the p-values testing against Θ = 0 in three UK Biobank trait domains.
Nearest gene(s) to the lead variant at each top locus are labeled. Source data are provided as genome-wide summary statistics in Data Availability.
Fig. 3Gene-expression-induced association analysis between 48 human tissues and total genetic contribution assessment (TGCA) of five trait domains.
a Association between TGCA Θ and each tissue was tested using stratified LD score regression (LDSC) for the enrichment of Θ in tissue-specifically expressed genes. b The Θ-tissue association was analysed using a linear regression of on the annotations of top 10% tissue-specifically expressed genes, corrected for the LD scores of the SNPs. The median regression coefficients of the annotation variables across 100 sets of LD-pruned SNPs are plotted. c Comparison of the resulted association scores by the two methods in a and b via rank-based correlations. The regression lines with the 95% prediction interval bands are shown. Source data are provided as a Source Data file.
Fig. 4Distribution of total genetic contribution assessment (TGCA) χ2 statistic at cis-eQTL of specifically expressed genes in three different tissues.
The quantile-quantile plots compare the observed TGCA χ2 statistics for at the cis-eQTL of top 100 specifically expressed genes in each tissue. The expected null values were drawn from a χ2(1) distribution. Source data are provided as a Source Data file.