| Literature DB >> 31562340 |
Masato Akiyama1,2,3, Kazuyoshi Ishigaki1, Saori Sakaue1,4, Yukihide Momozawa5, Momoko Horikoshi6, Makoto Hirata7, Koichi Matsuda8, Shiro Ikegawa9, Atsushi Takahashi1,10, Masahiro Kanai1,11, Sadao Suzuki12, Daisuke Matsui13, Mariko Naito14,15, Taiki Yamaji16, Motoki Iwasaki16, Norie Sawada16, Kozo Tanno17,18, Makoto Sasaki17, Atsushi Hozawa19,20, Naoko Minegishi19,20, Kenji Wakai14, Shoichiro Tsugane21, Atsushi Shimizu17, Masayuki Yamamoto19,20, Yukinori Okada1,22,23, Yoshinori Murakami24, Michiaki Kubo25, Yoichiro Kamatani26,27,28.
Abstract
Human height is a representative phenotype to elucidate genetic architecture. However, the majority of large studies have been performed in European population. To investigate the rare and low-frequency variants associated with height, we construct a reference panel (N = 3,541) for genotype imputation by integrating the whole-genome sequence data from 1,037 Japanese with that of the 1000 Genomes Project, and perform a genome-wide association study in 191,787 Japanese. We report 573 height-associated variants, including 22 rare and 42 low-frequency variants. These 64 variants explain 1.7% of the phenotypic variance. Furthermore, a gene-based analysis identifies two genes with multiple height-increasing rare and low-frequency nonsynonymous variants (SLC27A3 and CYP26B1; PSKAT-O < 2.5 × 10-6). Our analysis shows a general tendency of the effect sizes of rare variants towards increasing height, which is contrary to findings among Europeans, suggesting that height-associated rare variants are under different selection pressure in Japanese and European populations.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31562340 PMCID: PMC6764965 DOI: 10.1038/s41467-019-12276-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Plot of imputation qualities by different reference panels. The accuracy of imputation was evaluated for four different reference panels at “masked” HumanExome SNPs. BBJ1K + 1KGp3v5 panel (red) showed the best performance at every allele frequency bins. Even the BBJ1K panel showed much better accuracy than the 1KGp3v5 ALL panel regardless of the difference in sample size, indicating the contribution of deep haplotype information of the specific population to imputation accuracy. Median r2 values are shown on the y-axis
Fig. 2Scatter plot of MAF and effect size. We plotted the allele frequency (x-axis) and effect size (y-axis) of the minor alleles of the 609 variants that reached genome-wide significance in the discovery GWAS. Variants that remained genome-wide significant after the meta-analysis are indicated in blue. The orange line denotes 80% power to achieve genome-wide significance. To convert the standardized effect size to a centimeter scale, we regarded 1 standard error as 5.63 cm, which was calculated in the population-based cohorts (replication set)
Height-associated rare variants in the Japanese population
| Chr:Pos_REF/ALT | Candidate genes (annotation) | ALT freq. (%) |
|
| VE (%) |
| |||
|---|---|---|---|---|---|---|---|---|---|
| 1:17,2107,982_G/A | 0.25 | −0.195 (0.033) | 1.4 × 10−10 | −0.167 (0.089) | 8.9 × 10−2 | 0.010 | −0.192 (0.031) | 4.5 × 10−10 | |
| 2:1,638,793_C/T |
| 0.61 | −0.163 (0.021) | 6.5 × 10−17 | −0.129 (0.045) | 1.9 × 10−3 | 0.022 | −0.157 (0.019) | 5.5 × 10−17 |
| 2:219,729,259_G/A | 0.15 | −0.278 (0.051) | 8.7 × 10−9 | −0.241 (0.110) | 4.4 × 10−2 | 0.019 | −0.271 (0.046) | 4.0 × 10−9 | |
| 2:219,919,943_C/T | 0.30 | −0.177 (0.029) | 8.4 × 10−11 | −0.127 (0.070) | 4.9 × 10−2 | 0.009 | −0.170 (0.027) | 1.9 × 10−10 | |
| 3:52,781,335_G/A |
| 0.90 | −0.118 (0.018) | 6.9 × 10−11 | −0.152 (0.043) | 2.4 × 10−4 | 0.039 | −0.123 (0.016) | 8.4 × 10−14 |
| 3:145,794,588_T/C | 0.69 | 0.131 (0.021) | 3.8 × 10−11 | 0.164 (0.050) | 8.3 × 10−4 | 0.035 | 0.136 (0.019) | 1.4 × 10−12 | |
| 3:185,513,482_A/G |
| 0.26 | 0.384 (0.053) | 3.7 × 10−13 | 0.120 (0.099) | 2.1 × 10−1 | 0.009 | 0.325 (0.047) | 3.4 × 10−12 |
| 4:17,880,391_T/A | 0.88 | −0.135 (0.021) | 9.9 × 10−12 | −0.068 (0.047) | 9.8 × 10−2 | 0.008 | −0.124 (0.019) | 6.9 × 10−11 | |
| 6:33,096,707_T/C | 0.93 | −0.106 (0.018) | 1.8 × 10−8 | −0.050 (0.040) | 4.1 × 10−1 | 0.005 | −0.097 (0.017) | 7.4 × 10−9 | |
| 6:34,257,615_C/A | 0.42 | −0.245 (0.027) | 3.3 × 10−20 | −0.283 (0.061) | 1.3 × 10−6 | 0.070 | −0.251 (0.025) | 8.5 × 10−24 | |
| 7:45,954,540_C/T | 0.70 | −0.153 (0.021) | 3.5 × 10−14 | −0.142 (0.053) | 5.2 × 10−3 | 0.026 | −0.151 (0.019) | 7.4 × 10−15 | |
| 8:57,283,748_G/A | 0.46 | −0.161 (0.026) | 9.4 × 10−11 | −0.194 (0.062) | 3.7 × 10−4 | 0.032 | −0.166 (0.024) | 6.8 × 10−12 | |
| 9:5,089,770_G/A | 0.08 | −0.344 (0.060) | 1.1 × 10−8 | −0.190 (0.146) | 2.0 × 10−1 | 0.005 | −0.321 (0.056) | 8.5 × 10−9 | |
| 9:95,123,569_C/T |
| 0.54 | −0.177 (0.024) | 2.9 × 10−15 | −0.150 (0.055) | 8.1 × 10−3 | 0.025 | −0.172 (0.022) | 3.7 × 10−15 |
| 12:2,973,887_C/A | 0.78 | −0.115 (0.017) | 7.2 × 10−11 | −0.005 (0.042) | 9.2 × 10−1 | 0.000 | −0.099 (0.016) | 8.5 × 10−10 | |
| 15:85574,143_T/A | 0.50 | −0.149 (0.023) | 2.3 × 10−11 | −0.224 (0.055) | 5.8 × 10−5 | 0.045 | −0.160 (0.021) | 2.1 × 10−14 | |
| 15:89,438,963_ACG/A |
| 0.88 | −0.127 (0.019) | 1.2 × 10−10 | −0.086 (0.040) | 3.0 × 10−2 | 0.015 | −0.119 (0.017) | 4.2 × 10−12 |
| 15:89,459,676_G/A | 0.71 | −0.128 (0.021) | 2.5 × 10−9 | −0.165 (0.045) | 1.1 × 10−3 | 0.043 | −0.135 (0.019) | 1.6 × 10−12 | |
| 15:100,533,312_C/T | 0.25 | −0.275 (0.036) | 1.3 × 10−14 | −0.398 (0.081) | 2.8 × 10−7 | 0.078 | −0.295 (0.033) | 2.5 × 10−19 | |
| 17:6,925,490_C/T |
| 0.42 | 0.203 (0.026) | 4.3 × 10−15 | 0.219 (0.057) | 3.6 × 10−4 | 0.045 | 0.206 (0.024) | 9.5 × 10−18 |
| 17:48,545,926_C/A | 0.82 | 0.169 (0.018) | 9.8 × 10−22 | 0.171 (0.041) | 7.9 × 10−6 | 0.045 | 0.169 (0.016) | 2.2 × 10−25 | |
| 17:61,976,011_G/T | 0.83 | −0.154 (0.018) | 5.4 × 10−18 | −0.202 (0.043) | 2.4 × 10−6 | 0.065 | −0.162 (0.017) | 5.0 × 10−22 |
Results of the association analysis in GWAS and replication (Rep). Variants with minor allele frequency <1% satisfying genome-wide significance after meta-analysis were shown. Annotation: EAS, East Asian specific variant determined by allele frequency in 1000 Genomes Project; NS, variant is in LD with nonsynonymous variant(s); NL, variant was found in new loci identified by this study; EP, gene is associated with extremely phenotype related to height. Chromosomes and positions are based on Build37 (hg19)
REF reference allele, ALT alternative allele, SE standard error, VE variance explained
Height-associated low-frequency variants in the Japanese population
| Chr:Pos_REF/ALT | Candidate genes (annotation) | ALT freq. (%) |
|
| VE (%) |
| |||
|---|---|---|---|---|---|---|---|---|---|
| 1:119,482,365_T/TA | 4.02 | −0.051 (0.008) | 3.4 × 10−10 | −0.035 (0.019) | 8.7 × 10−2 | 0.010 | −0.049 (0.008) | 1.5 × 10−10 | |
| 1:151,035,628_A/C |
| 3.19 | 0.051 (0.009) | 1.1 × 10− 9 | 0.052 (0.021) | 1.2 × 10−2 | 0.016 | 0.051 (0.008) | 2.2 × 10−10 |
| 1:153,750,682_G/A | 1.42 | 0.092 (0.013) | 1.5 × 10−12 | 0.039 (0.031) | 3.7 × 10−1 | 0.004 | 0.084 (0.012) | 1.9 × 10−12 | |
| 1:218,698,632_CG/C |
| 3.85 | 0.051 (0.008) | 1.4 × 10−10 | 0.053 (0.019) | 6.6 × 10−3 | 0.021 | 0.052 (0.007) | 3.2 × 10−12 |
| 2:240,77,979_G/A | 3.54 | −0.048 (0.009) | 8.2 × 10−10 | −0.076 (0.019) | 5.5 × 10−5 | 0.044 | −0.053 (0.008) | 9.0 × 10−12 | |
| 2:62,191,772_A/G |
| 4.50 | 0.044 (0.008) | 7.9 × 10−9 | 0.065 (0.018) | 7.0 × 10−4 | 0.035 | 0.047 (0.007) | 1.7 × 10−11 |
| 2:72,360,160_C/T | 3.30 | 0.059 (0.009) | 6.9 × 10−13 | 0.073 (0.021) | 4.3 × 10−4 | 0.030 | 0.061 (0.008) | 3.5 × 10−14 | |
| 2:72,572,735_T/C | 2.97 | 0.122 (0.010) | 8.9 × 10−32 | 0.131 (0.025) | 1.2 × 10−6 | 0.086 | 0.123 (0.010) | 7.4 × 10−38 | |
| 2:74,781,375_C/G |
| 1.25 | −0.114 (0.015) | 3.6 × 10−14 | −0.053 (0.034) | 1.1 × 10−1 | 0.007 | −0.104 (0.014) | 6.2 × 10−14 |
| 2:95,336,090_G/T | 2.33 | −0.107 (0.017) | 2.2 × 10−10 | −0.066 (0.034) | 2.6 × 10−2 | 0.022 | −0.099 (0.015) | 8.2 × 10−11 | |
| 2:200,141,875_G/A |
| 2.71 | −0.076 (0.010) | 1.2 × 10−13 | −0.100 (0.022) | 1.8 × 10−5 | 0.058 | −0.080 (0.009) | 6.7 × 10−18 |
| 2:21,9894,893_T/A | 1.93 | −0.080 (0.012) | 1.6 × 10−12 | −0.081 (0.026) | 2.1 × 10−3 | 0.027 | −0.080 (0.011) | 4.4 × 10−14 | |
| 2:220,210,184_G/A | 2.03 | 0.062 (0.012) | 3.8 × 10−8 | 0.076 (0.026) | 6.2 × 10−3 | 0.024 | 0.064 (0.011) | 1.6 × 10−9 | |
| 2:241,783,039_C/A | 4.67 | 0.045 (0.008) | 7.9 × 10−10 | 0.031 (0.018) | 1.0 × 10−1 | 0.008 | 0.043 (0.007) | 4.6 × 10−10 | |
| 4:17,880,472_T/A |
| 3.60 | −0.067 (0.010) | 1.4 × 10−11 | −0.096 (0.023) | 8.5 × 10−5 | 0.063 | −0.072 (0.009) | 1.3 × 10−14 |
| 6:34,2409,03_C/T | 1.35 | 0.086 (0.015) | 1.5 × 10−9 | 0.172 (0.033) | 1.0 × 10−7 | 0.084 | 0.101 (0.013) | 7.2 × 10−14 | |
| 6:35,243,836_C/T |
| 4.95 | −0.045 (0.007) | 2.2 × 10−10 | −0.041 (0.017) | 1.2 × 10−2 | 0.016 | −0.044 (0.007) | 7.4 × 10−11 |
| 6:130,349,119_C/T |
| 96.46 | −0.050 (0.008) | 2.6 × 10−9 | −0.038 (0.020) | 3.3 × 10−2 | 0.010 | −0.048 (0.008) | 7.8 × 10−10 |
| 7:12,610,578_C/T | 3.43 | 0.055 (0.009) | 5.2 × 10−12 | 0.040 (0.019) | 7.0 × 10−2 | 0.011 | 0.052 (0.008) | 1.9 × 10−11 | |
| 7:19,529,484_G/C | 95.01 | 0.043 (0.007) | 2.5 × 10−9 | 0.029 (0.017) | 1.2 × 10−1 | 0.008 | 0.041 (0.007) | 6.2 × 10−10 | |
| 7:30,635,521_G/C | 4.27 | 0.045 (0.008) | 6.0 × 10−9 | 0.053 (0.019) | 7.1 × 10−3 | 0.023 | 0.047 (0.008) | 1.2 × 10−9 | |
| 8:19,363,306_G/A | 1.93 | −0.085 (0.011) | 5.5 × 10−14 | −0.081 (0.026) | 1.4 × 10−3 | 0.024 | −0.084 (0.010) | 3.7 × 10−16 | |
| 8:57,170,647_A/G | 4.60 | −0.063 (0.010) | 5.0 × 10−9 | −0.097 (0.024) | 9.0 × 10−5 | 0.082 | −0.068 (0.009) | 2.5 × 10−13 | |
| 9:34,766,707_A/G | 1.09 | −0.108 (0.016) | 4.8 × 10−12 | −0.154 (0.036) | 2.9 × 10−5 | 0.057 | −0.116 (0.015) | 4.0 × 10−15 | |
| 9:99,259,568_AGGCGGGGAGGCAGG/A | 4.50 | 0.055 (0.008) | 1.0 × 10−13 | 0.042 (0.017) | 1.9 × 10−2 | 0.016 | 0.053 (0.007) | 3.6 × 10−14 | |
| 9:109,463,806_C/T | 2.14 | 0.073 (0.012) | 2.9 × 10−9 | 0.084 (0.027) | 5.5 × 10−3 | 0.033 | 0.075 (0.011) | 2.6 × 10−11 | |
| 11:2,813,322_C/T | 1.56 | −0.115 (0.015) | 7.8 × 10−17 | −0.112 (0.034) | 3.5 × 10−3 | 0.036 | −0.114 (0.013) | 1.2 × 10−17 | |
| 11:27,016,411_A/G | 3.80 | −0.076 (0.009) | 1.4 × 10−19 | −0.049 (0.021) | 1.6 × 10−2 | 0.017 | −0.072 (0.008) | 4.1 × 10−19 | |
| 11:69,253,486_T/C | 1.57 | −0.068 (0.013) | 4.3 × 10−8 | −0.063 (0.031) | 6.3 × 10−2 | 0.012 | −0.067 (0.012) | 1.9 × 10−8 | |
| 12:14,365,109_T/C | 1.66 | −0.068 (0.012) | 1.7 × 10−8 | −0.060 (0.028) | 6.4 × 10−2 | 0.012 | −0.066 (0.011) | 2.9 × 10−9 | |
| 12:20,668,796_A/G |
| 4.75 | −0.043 (0.008) | 5.8 × 10−10 | −0.024 (0.017) | 9.0 × 10−2 | 0.005 | −0.040 (0.007) | 5.1 × 10−9 |
| 12:53,722,361_G/A | 1.15 | −0.164 (0.014) | 1.5 × 10−30 | −0.074 (0.035) | 2.3 × 10−2 | 0.011 | −0.151 (0.013) | 1.3 × 10−29 | |
| 12:66,349,812_A/G |
| 4.78 | −0.083 (0.007) | 2.1 × 10−29 | −0.065 (0.017) | 3.8 × 10−4 | 0.038 | −0.080 (0.007) | 1.0 × 10−33 |
| 13:114,301,338_G/A | 3.78 | 0.053 (0.008) | 1.8 × 10−9 | 0.035 (0.020) | 9.1 × 10−2 | 0.008 | 0.050 (0.008) | 1.4 × 10−10 | |
| 15:23,8106,52_C/T |
| 4.07 | −0.068 (0.009) | 1.1 × 10−15 | −0.053 (0.021) | 1.6 × 10−2 | 0.022 | −0.066 (0.008) | 6.2 × 10−16 |
| 17:42,272,909_C/T |
| 4.86 | −0.047 (0.007) | 4.9 × 10−10 | −0.028 (0.016) | 8.9 × 10−2 | 0.008 | −0.044 (0.007) | 6.1 × 10−11 |
| 17:54,605,677_G/A | 4.28 | 0.046 (0.009) | 1.4 × 10−8 | 0.040 (0.020) | 2.3 × 10−2 | 0.012 | 0.045 (0.008) | 1.3 × 10−8 | |
| 19:10,392,379_A/AC |
| 3.28 | −0.060 (0.010) | 1.0 × 10−11 | −0.046 (0.022) | 1.8 × 10−2 | 0.013 | −0.058 (0.009) | 5.1 × 10−11 |
| 19:13,264,398_C/T | 2.89 | −0.054 (0.009) | 5.5 × 10−9 | −0.012 (0.022) | 3.3 × 10−1 | 0.001 | −0.048 (0.009) | 2.3 × 10−8 | |
| 21:47,549,363_G/C | 1.34 | −0.080 (0.016) | 4.9 × 10−8 | −0.123 (0.037) | 7.2 × 10−4 | 0.037 | −0.087 (0.015) | 2.1 × 10−9 | |
| 22:24,087,185_T/C | 2.92 | −0.059 (0.009) | 3.2 × 10−10 | 0.007 (0.022) | 7.5 × 10−1 | 0.000 | −0.049 (0.009) | 8.3 × 10−9 | |
| X:105,477,754_T/C | 2.70 | 0.068 (0.010) | 4.0 × 10−11 | 0.013 (0.026) | 6.2 × 10−1 | 0.001 | 0.060 (0.010) | 2.4 × 10−10 |
Results of the association analysis in GWAS and replication (Rep). Variants with minor allele frequency (MAF) ≥1% and MAF <5% satisfying genome-wide significance after meta-analysis were shown. Annotation: EAS, East Asian specific variant determined by allele frequency in 1000 Genomes Project; NS, variant is in LD with nonsynonymous variant(s); LOF, variant is in LD with loss-of-function variant(s); NL, variant was found in new loci identified by this study; EP, gene is associated with extremely phenotype related to height; R, previously reported variant. Chromosomes and positions are based on Build37 (hg19)
REF reference allele, ALT alternative allele, SE standard error, VE variance explained
Fig. 3Summary of gene-based analysis. a Manhattan plot of the gene-based test. The yellow line represents significance threshold (P < 2.5 × 10−6). b Comparison of single-variant test P values and those of the gene-based test. Two genes (SLC27A3 and CYP26B1) showed much stronger associations in the gene-based test. c, d Positions and associations of nonsynonymous variants in SLC27A3 and CYP26B1. The colors of the plots denote the impact of the variants. The symbols (circle, diamond, and square) indicate the function of variants. The coding regions are colored according to their structural domains based on Pfam
Gene-level associations of SLC27A3 and CYP26B1
| Gene | Chr:Pos |
|
| ||
|---|---|---|---|---|---|
|
| 1:153,748,103–153,752,478 | 27 | 6.94 × 10−38 | 27 | 7.03 × 10−5 |
|
| 2:72,359,400–72,374,900 | 17 | 1.33 × 10−45 | 18 | 3.91 × 10−7 |
Chromosomes and positions are based on Build37 (hg19)
aA number of variants were used in gene-based analysis. Variants with an imputation quality score of Rsq ≥0.3 were employed in the analysis
bP values were calculated by SKAT-O
Fig. 4Relationship between the minor allele effect for height and MAF in the GWAS. A scatter plot of the mean minor allele effect according to each MAF bin (N = 100) is shown. The mean effects of the minor alleles were clearly greater in rarer variants