| Literature DB >> 28921393 |
Kaiyin Zhong1, Gu Zhu2, Xiaoxi Jing3,4, A Emile J Hendriks5,6, Sten L S Drop5, M Arfan Ikram7, Scott Gordon2, Changqing Zeng3, Andre G Uitterlinden7,8, Nicholas G Martin2, Fan Liu9,10,11, Manfred Kayser12.
Abstract
Adult height is the most widely genetically studied common trait in humans; however, the trait variance explainable by currently known height-associated single nucleotide polymorphisms (SNPs) identified from the previous genome-wide association studies (GWAS) is yet far from complete given the high heritability of this complex trait. To exam if compound heterozygotes (CH) may explain extra height variance, we conducted a genome-wide analysis to screen for CH in association with adult height in 10,631 Dutch Europeans enriched with extremely tall people, using our recently developed method implemented in the software package CollapsABEL. The analysis identified six regions (3q23, 5q35.1, 6p21.31, 6p21.33, 7q21.2, and 9p24.3), where multiple pairs of SNPs as CH showed genome-wide significant association with height (P < 1.67 × 10-10). Of those, 9p24.3 represents a novel region influencing adult height, whereas the others have been highlighted in the previous GWAS on height based on analysis of individual SNPs. A replication analysis in 4080 Australians of European ancestry confirmed the significant CH-like association at 9p24.3 (P < 0.05). Together, the collapsed genotypes at these six loci explained 2.51% of the height variance (after adjusting for sex and age), compared with 3.23% explained by the 14 top-associated SNPs at 14 loci identified by traditional GWAS in the same data set (P < 5 × 10-8). Overall, our study empirically demonstrates that CH plays an important role in adult height and may explain a proportion of its "missing heritability". Moreover, our findings raise promising expectations for other highly polygenic complex traits to explain missing heritability identifiable through CH-like associations.Entities:
Mesh:
Year: 2017 PMID: 28921393 PMCID: PMC5702380 DOI: 10.1007/s00439-017-1842-3
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132
Fig. 1Manhattan plots from RS. Minus log of P value is represented by blue and red dots and grouped by chromosome. a Manhattan plot for single-SNP GWAS. The horizontal line (−log 5 × 10−8) indicates threshold of genome-wide significance. b Manhattan plot for GCDH scan, the horizontal line (−log 1.67 × 10−10) indicates threshold of GCDH genome-wide significance. Loci are annotated in green if they have not been reported before with genome-wide height association by the GIANT consortium (Wood et al. 2014)
Fig. 2QQ plots. a QQ plot of single-SNP P values. b QQ plot of GCDH P values. In both cases, P values from five permutation analyses were also plotted to estimate the null distribution
Discovery and replication of six CH SNP pairs
| Chr | Genes | SNP1 | EA1 | fEA1 | SNP2 | EA2 | fEA2 |
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| 3q23 | ZBTB38 | rs9821337 | G | 0.38 | rs6762826 | G | 0.09 | 1.69 × 10−7 | 2.77 × 10−2 | 9.59 × 10−11 |
|
| 5q35.1 | SLIT3 | rs1466947 | G | 0.11 | rs17070997 | T | 0.07 | 1.21 × 10−7 | 1.13 × 10−4 | 8.21 × 10−11 | 8.62 × 10−1 |
| 6p21.33 | PSORS1C3/HLA-C | rs6904669 | G | 0.41 | rs1960278 | C | 0.43 | 3.62 × 10−1 | 3.09 × 10−6 | 9.32 × 10−14 | 1.75 × 10−1 |
| 6p21.31 | GRM4/HMGA1/C6orf106 | rs1776897 | G | 0.09 | rs2744977 | G | 0.16 | 5.32 × 10−8 | 8.17 × 10−8 | 1.61 × 10−13 |
|
| 7q21.2 | CDK6/SAMD9 | rs7793983 | G | 0.09 | rs17164894 | A | 0.07 | 2.30 × 10−6 | 1.65 × 10−5 | 3.56 × 10−11 | 5.27 × 10−1 |
| 9p24.3 | DMRT2/SMARCA2 | rs514779 | C | 0.03 | rs10962274 | C | 0.08 | 1.51 × 10−5 | 1.98 × 10−4 | 9.34 × 10−11 |
|
EA , EA effect alleles for the first and second SNPs in RS data set, fEA , fEA minor allele frequency for the first and second SNPs in RS data set, P , P single-SNP P value for the first and second SNPs in RS data set, P GCDH P value in RS data set, and P P value from replication analysis in QIMR data set, where P values < 0.05 are highlighted with bold font
Fig. 3Regional Manhattan plot of GCDH P values from 9p24.3
Multivariable linear model including 14 single SNPs, 6 CH SNP pairs and the weighted allele sum of 658 SNPs previously reported to be associated with height as independent variables
| Variable | Cytoband | Gene | −log10
|
|
|---|---|---|---|---|
| PRS658 | 945.12 | 19.13 | ||
| rs10439884 | 21p11.1 | TPTE | 19.22 | 0.32 |
| rs7159961 | 14q11.2 | OR4Q3 | 17.25 | 0.33 |
| rs1366870 | 15q11.2 | LOC727924 | 16.33 | 0.31 |
| rs2279007 | 19p13.11 | MYO9B | 15.53 | 0.44 |
| rs4931222 | 12p11.22 | TMTC1 | 13.10 | 0.29 |
| rs12485899 | 3p22.1 | ZNF621 | 11.35 | 0.30 |
| rs4272 | 7q21.2 | CDK6 | 4.99 | 0.28 |
| rs6984782 | 8q12.1 | CHCHD7 | 3.33 | 0.26 |
| rs7741741 | 6q24.1 | GPR126 | 3.25 | 0.30 |
| rs1038196 | 12q14.3 | HMGA2 | 2.10 | 0.23 |
| rs9838625 | 3q23 | ZBTB38 | 1.93 | 0.12 |
| rs1265097 | 6p21.33 | PSORS1C1 | 1.89 | 0.04 |
| rs6060355 | 20q11.22 | UQCC1 | 1.38 | 0.01 |
| rs2780226 | 6p21.31 | GRM4 | 0.90 | 0.01 |
| rs514779_rs10962274 | 9p24.3 | DMRT2/SMARCA2 | 19.20 | 0.39 |
| rs1466947_rs17070997 | 5q35.1 | SLIT3 | 13.17 | 0.39 |
| rs7793983_rs17164894 | 7q21.2 | LOC101927497 | 11.65 | 0.50 |
| rs6904669_rs1960278 | 6p21.33 | PSORS1C3/HLA-C | 10.39 | 0.48 |
| rs1776897_rs2744977 | 6p21.31 | GRM4/HMGA1 | 4.02 | 0.39 |
| rs9821337_rs6762826 | 3q23 | ZBTB38 | 0.76 | 0.36 |
| Total 1a | 2.51 | |||
| Total 2b | 3.23 | |||
| Total 3c | 24.87 |
aTotal percentage of variation in height explained by 6 CH SNPs
bTotal percentage of variation in height explained by 14 single SNPs
cTotal percentage of variation in height explained by all independent variables in the model