| Literature DB >> 30718733 |
Sanghoon Moon1, Young Jin Kim1, Sohee Han1, Mi Yeong Hwang1, Dong Mun Shin1, Min Young Park2, Yontao Lu3, Kyungheon Yoon1, Hye-Mi Jang1, Yun Kyoung Kim1, Tae-Joon Park1, Dae Sub Song4, Jae Kyung Park4, Jong-Eun Lee2, Bong-Jo Kim5.
Abstract
We introduce the design and implementation of a new array, the Korea Biobank Array (referred to as KoreanChip), optimized for the Korean population and demonstrate findings from GWAS of blood biochemical traits. KoreanChip comprised >833,000 markers including >247,000 rare-frequency or functional variants estimated from >2,500 sequencing data in Koreans. Of the 833 K markers, 208 K functional markers were directly genotyped. Particularly, >89 K markers were presented in East Asians. KoreanChip achieved higher imputation performance owing to the excellent genomic coverage of 95.38% for common and 73.65% for low-frequency variants. From GWAS (Genome-wide association study) using 6,949 individuals, 28 associations were successfully recapitulated. Moreover, 9 missense variants were newly identified, of which we identified new associations between a common population-specific missense variant, rs671 (p.Glu457Lys) of ALDH2, and two traits including aspartate aminotransferase (P = 5.20 × 10-13) and alanine aminotransferase (P = 4.98 × 10-8). Furthermore, two novel missense variants of GPT with rare frequency in East Asians but extreme rarity in other populations were associated with alanine aminotransferase (rs200088103; p.Arg133Trp, P = 2.02 × 10-9 and rs748547625; p.Arg143Cys, P = 1.41 × 10-6). These variants were successfully replicated in 6,000 individuals (P = 5.30 × 10-8 and P = 1.24 × 10-6). GWAS results suggest the promising utility of KoreanChip with a substantial number of damaging variants to identify new population-specific disease-associated rare/functional variants.Entities:
Mesh:
Year: 2019 PMID: 30718733 PMCID: PMC6361960 DOI: 10.1038/s41598-018-37832-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Contents summary of KoreanChip.
| Category | Number of SNPs* | Contents (%) |
|---|---|---|
| Tag SNPs for genome-wide coverage | 600,294 | 72.02 |
| Functional loci (nonsynonymous SNPs and Indels) | 208,039 | 24.96 |
| eQTL | 16,690 | 2.00 |
| HLA | 6,659 | 0.80 |
| Fingerprint | 255 | 0.03 |
| NHGRI GWAS catalog | 7,811 | 0.94 |
| KIR | 1,544 | 0.19 |
| Pharmacogenetics/ADME | 1,881 | 0.23 |
| Common mitochondrial DNA variants | 178 | 0.02 |
| Y chromosome markers | 806 | 0.10 |
| Total | 833,535 | — |
*Some SNPs are overlapped among categories.
eQTL, expression Quantitative Trait Loci; HLA, Human leukocyte antigen; KIR, Killer cell immunoglobulin like receptors; ADME, Absorption, Distribution, Metabolism, and Excretion.
Comparison of contents between KoreanChip and other genotyping platforms.
| Platform | Total marker | Annotated markera | Nonsyn markerb | ASN markerc |
|---|---|---|---|---|
| N | N | N (%) | N (%) | |
| Affymetrix 5.0 | 500,568 | 489,457 | 2,179 (0.4) | 769 (0.2) |
| Affymetrix 6.0 | 934,969 | 892,584 | 4,889 (0.5) | 1,750 (0.2) |
| Illumina Omni 1 M | 1,099,726 | 1,066,324 | 45,832 (4.3) | 12,516 (1.2) |
| Illumina Exome array | 242,901 | 241,923 | 217,775 (90.0) | 39,480 (16.3) |
| Illumina GSA | 700,078 | 688,062 | 87,759 (12.8) | 21,371 (3.1) |
| Axiom Biobank | 718,212 | 645,060 | 251,080 (38.9) | 46,416 (7.2) |
| Axiom UK Biobank | 845,487 | 823,336 | 104,058 (12.6) | 19,487 (2.4) |
| Axiom PMRA | 920,744 | 856,797 | 44,819 (5.2) | 6,088 (0.7) |
|
|
aAnnotated by snpEff v4.1d based on the database of dbNSFP2.7 (functional prediction and annotation of nonsynonymous marker).
bProportion of nonsynonymous markers among annotated markers.
cProportion of nonsynonymous makers, if damaging effect is predicted by more than one functional annotation software, and allele frequency >0 observed in East Asian ancestry among annotated markers.
ASN, Asian; Affymetrix 5.0, Affymetrix Genome-wide Human SNP array 5.0; AFFY6.0, Affymetrix Genome-wide Human SNP array 6.0; ILLU Omni 1 M, Illumina HumanOmni1-Quad BeadChip; Illumina Exome array, Illumina HumanExome BeadChip; Illumina GSA, Illumina Global Screening Array; Axiom PMRA, Axiom Precision Medicine Research Array.
Comparison of genomic coverage.
| Platform | Allele frequency | |||
|---|---|---|---|---|
| # of samples | Total (MAF ≥ 0.01) | Common (MAF ≥ 0.05) | Less common. (0.01 ≤ MAF < 0.05) | |
|
| ||||
| Affymetrix 5.0 | 6,949 | 76.25 | 84.78 | 51.23 |
| Affymetrix 6.0 | 3,695 | 83.93 | 91.67 | 61.23 |
| Illumina Omni 1 M | 3,666 | 86.97 | 94.10 | 66.01 |
|
|
| |||
| Axiom Biobank | 96 | 81.94 | 91.56 | 53.74 |
| UK Biobank | 96 | 85.21 | 94.05 | 59.30 |
| Axiom PMRA | 96 | 87.09 | 94.48 | 65.42 |
| Illumina GSA | 96 | 84.38 | 92.27 | 61.24 |
All genomic coverage information was calculated using imputed data.
Genomic coverages of KoreanChip are shown in bold-face.
Affymetrix 5.0, Affymetrix Genome-wide Human SNP array 5.0; AFFY6.0, Affymetrix Genome-wide Human SNP array 6.0; ILLU Omni 1 M, Illumina HumanOmni1-Quad BeadChip; Illumina Exome array, Illumina HumanExome BeadChip; Illumina GSA, Illumina Global Screening Array; Axiom PMRA, Axiom Precision Medicine Research Array.
Nine Newly identified missense variants in known or novel loci.
| Gene | SNPs | Position | RA/AA | EA/OA | Trait(s) | EAF(%) | Discovery | Replication | Availability in commercial chips | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| KOR | gnomAD | (~6,949 samples) | (~6,000 samples) | |||||||||||
| EAS | EUR | AFR | Beta(SE) | P-value | Beta(SE) | P-value | ||||||||
|
| ||||||||||||||
| AC109829.1 | rs965813 | chr2:27789861 | T/A | A/T | TG | 33.31 | 37.00 | 20.86 | 21.53 | −0.0415(0.0089) | 3.27E-06 | −0.0483(0.0105) | 4.26E-06 | ILMN 1 M |
| C2orf16 | rs1919128 | chr2:27801759 | A/G | G/A | TG | 52.87 | 47.81 | 27.16 | 6.61 | 0.0379(0.0084) | 7.20E-06 | 0.0560(0.0100) | 2.36E-08 | ABA, PMRA, UKB, ILMN 1 M, ILMN exome |
| BUD13 | rs10488698 | chr11:116633947 | G/A | A/G | HDL | 6.61 | 7.22 | 6.06 | 1.16 | 0.0330(0.0073) | 7.04E-06 | 0.0229(0.0081) | 4.66E-03 | ABA, UK Biobank, ILMN 1 M, ILMN exome |
| C19orf80, DOCK6 | rs2278426 | chr19:11350488 | C/T | T/C | LDL | 27.31 | 25.93 | 4.42 | 18.05 | −0.0203(0.0056) | 3.16E-04 | −0.0281(0.0058) | 1.57E-06 | ABA, PMRA, UKB, ILMN 1 M |
| TCHL | 27.02 | −3.8231(0.6689) | 1.14E-08 | −3.6170(0.7294) | 7.29E-07 | |||||||||
| APOE | rs440446 | chr19:45409167 | C/G | G/C | LDL | 37.47 | 39.62 | 63.57 | 85.81 | −0.2010(0.0052) | 1.23E-04 | −0.0210(0.0055) | 1.31E-04 | UKB |
| TG | 37.60 | 0.0314(0.0087) | 2.84E-04 | 0.0263(0.0104) | 1.17E-02 | |||||||||
|
| ||||||||||||||
| APOB | rs13306206 | chr2:21242731 | G/A | A/G | LDL | 0.97 | 0.26 | 0 | 0 | 0.1509(0.0259) | 5.87E-09 | 0.1117(0.0256) | 1.27E-05 | ABA, PMRA, ILMN exome |
| TCHL | 0.94 | 15.9680(3.1140) | 3.01E-07 | 13.2300(3.2040) | 3.69E-05 | |||||||||
| ALDH2 | rs671 | chr12:112241766 | G/A | A/G | ALT | 15.67 | 25.65 | 0.002 | 0.02 | −0.0586(0.0107) | 4.98E-08 | −0.0481(0.0114) | 2.86E-05 | ABA, PMRA, UKB, ILMN 1 M, ILMN exome, ILMN GSA |
| AST | 15.67 | −0.0541(0.0075) | 5.20E-13 | −0.0372(0.0075) | 8.14E-07 | |||||||||
|
| ||||||||||||||
| GPT | rs200088103 | chr8:145730416 | C/T | T/C | ALT | 0.12 | 0.10 | 0.004 | 0 | −0.6843(0.1140) | 2.02E-09 | −0.5574(0.1023) | 5.30E-08 | ILMN GSA |
| GPT | rs748547625 | chr8:145730446 | C/T | T/C | ALT | 0.14 | 0.11 | 0 | 0 | −0.5058(0.1048) | 1.41E-06 | −0.4972(0.1024) | 1.24E-06 | — |
Chromosomal position is based on hg19. Traits were transformed to follow a normal distribution by natural log except TCHL.
CHR, chromosome; POS, position; RA, Reference allele; AA, Alternative allele; EA, Effective allele; OA, Other Allele; EAF, Estimated Allele Frequency; GnomAD, the Genome Aggregation Database; KOR, Korean; EAS, East Asian; EUR, European (non-finnish); ABA, Axiom Biobank array; PMRA, Axiom Precision medicine research array; UKB, Axiom UK Biobank array; ILMN 1 M, Illumina HumanOmni1-Quad BeadChip; ILMN Exome, Illumina HumanExome BeadChip; ILMN GSA, Illumina Global Screening Array.
Figure 1Overall scheme of KoreanChip evaluation process. Evaluation of KoreanChip consisted of 5 steps: reproducibility, accuracy, content comparison, genomic coverage, and application to GWAS. All subjects were selected from three independent cohorts: Ansan and Ansung study, Health EXAminee (HEXA), and CArdioVascular disease Association Study (CAVAS). Datasets for the test are shown in colored squares with number. The number in parentheses indicates the number of subjects genotyped with the corresponding platform.