| Literature DB >> 36009466 |
Chih-Chien Hsu1,2, Hao-Kai Chuang1,3, Yu-Jer Hsiao1,3, Yuan-Chi Teng3, Pin-Hsuan Chiang4, Yu-Jun Wang4, Ting-Yi Lin3, Ping-Hsing Tsai3, Chang-Chi Weng2, Tai-Chi Lin1,2, De-Kuang Hwang1,2, Ai-Ru Hsieh4.
Abstract
Cataracts, characterized by crystalline lens opacities in human eyes, is the leading cause of blindness globally. Due to its multifactorial complexity, the molecular mechanisms remain poorly understood. Larger cohorts of genome-wide association studies (GWAS) are needed to investigate cataracts' genetic basis. In this study, a GWAS was performed on the largest Han population to date, analyzing a total of 7079 patients and 13,256 controls from the Taiwan Biobank (TWB) 2.0 cohort. Two cataract-associated SNPs with an adjustment of p < 1 × 10-7 in the older groups and nine SNPs with an adjustment of p < 1 × 10-6 in the younger group were identified. Except for the reported AGMO in animal models, most variations, including rs74774546 in GJA1 and rs237885 in OXTR, were not identified before this study. Furthermore, a polygenic risk score (PRS) was created for the young and old populations to identify high-risk cataract individuals, with areas under the receiver operating curve (AUROCs) of 0.829 and 0.785, respectively, after covariate adjustments. Younger individuals had 17.45 times the risk while older people had 10.97 times the risk when comparing individuals in the highest and lowest PRS quantiles. Validation analysis on an independent TWB1.0 cohort revealed AUROCs of 0.744 and 0.659.Entities:
Keywords: Asian population; biobank; cataract; genome-wide association studies; polygenic risk score; retrospective study
Year: 2022 PMID: 36009466 PMCID: PMC9406175 DOI: 10.3390/biomedicines10081920
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Participant characteristics from TWB 2.0 and TWB 1.0.
| Discovery | Validation | Statistics and | Statistics and | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Variables | Case < 60 ( | Case ≥ 60 ( | Control ( | Case < 60 ( | Case ≥ 60 ( | Control ( | ||||||
|
| ||||||||||||
| Male (%) | 490 (25.01) | 1500 (29.30) | 4898 (36.95) | <2.2 × 10−16 | <2.2 × 10−16 | 337 (44.12) | 998 (44.43) | 2610 (52.30) | 7.65 × 10−5 | 6.86 × 10−10 | <2.2 × 10−16 | <2.2 × 10−16 |
| Female (%) | 1469 (74.99) | 3620 (70.70) | 8358 (63.05) | 420 (55.48) | 1248 (55.57) | 2380 (47.70) | ||||||
|
| 54.04 ± 5.34 | 65.11 ± 3.11 | 63.68 ± 2.84 | <2.2 × 10−16 | <2.2 × 10−16 | 53.08 ± 6.12 | 66.24 ± 3.60 | 64.4 ± 3.37 | <2.2 × 10−16 | <2.2 × 10−16 | 5.079 × 10−9 | <2.2 × 10−16 |
|
| 23.94 ± 3.81 | 24.13 ± 3.41 | 24.42 ± 3.42 | 1.46 × 10−7 | 2.071 × 10−7 | 24.3454 ± 3.713 | 24.3618 ± 3.330 | 24.4594 ± 3.231 | 0.4242 | 0.2448 | 0.09964 | 0.05096 |
|
| ||||||||||||
| No (%) | 1771 (90.40) | 4284 (83.67) | 12,028 (90.74) | <2.2 × 10−16 | <2.2 × 10−16 | 650 (85.87) | 1862 (82.90) | 4457 (89.32) | <2.2 × 10−16 | 8.636 × 10−7 | 0.04622 | 0.4888 |
| Yes (%) | 188 (9.60) | 836 (16.33) | 1228 (9.26) | 107 (14.13) | 384 (17.10) | 533 (10.68) | ||||||
|
| ||||||||||||
| No (%) | 1632 (83.31) | 3607 (70.45) | 9948 (75.05) | <2.2 × 10−16 | <2.2 × 10−16 | 632 (83.49) | 1530 (68.12) | 3634 (72.83) | <2.2 × 10−16 | <2.2 × 10−16 | 0.5577 | 0.01028 |
| Yes (%) | 327 (16.69) | 1513 (29.55) | 3308 (24.95) | 125 (16.51) | 716 (31.88) | 1356 (27.14) | ||||||
|
| ||||||||||||
| No (%) | 1691 (86.32) | 4106 (80.20) | 11,431 (86.23) | <2.2 × 10−16 | <2.2 × 10−16 | 653 (86.26) | 1806 (80.41) | 4260 (85.37) | <2.2 × 10−16 | <2.2 × 10−16 | 0.8507 | 0.2707 |
| Yes (%) | 268 (13.68) | 1014 (19.80) | 1825 (13.77) | 104 (13.74) | 440 (4.85) | 730 (14.93) | ||||||
|
| ||||||||||||
| No (%) | 1886 (96.27) | 4899 (95.68) | 12,827 (96.76) | <2.2 × 10−16 | 3.394 × 10−16 | 721 (95.24) | 2137 (95.15) | 4838 (96.95) | <2.2 × 10−16 | 0.007776 | 0.1738 | 0.03337 |
| Yes (%) | 73 (3.73) | 221 (4.32) | 429 (3.24) | 36 (4.76) | 109 (4.85) | 152 (3.05) | ||||||
|
| ||||||||||||
| >60 (%) | 1913 (97.70) | 4899 (95.78) | 12,828 (96.79) | 0.03452 | 0.0009946 | 736 (97.23) | 2115 (94.17) | 4814 (96.51) | 0.3662 | 5.97 × 10−6 | 0.02036 | 5.6 × 10−8 |
| <60 (%) | 45 (2.30) | 216 (4.22) | 426 (3.21) | 21 (2.77) | 131 (5.83) | 174 (3.59) | ||||||
1 p-values for age and BMI were calculated by Student’s t-test, whereas the other characteristics were calculated by chi-squared tests. 2 p-values for comparison between the means of the discovery cohort and the validation cohort. Abbreviations: BMI = body mass index; GFR = glomerular filtration rate.
Figure 1Manhattan plot showing the SNPs associated with cataracts identified from TWB2.0. Older cataract cases (≥60 years old, n = 5120) are shown in the top panel, while younger cataract cases (<60 years old, n = 1959) are shown in the bottom panel.
Selected cataract-associated SNPs identified by GWAS in TWB2.0 1.
| Population | SNP 1 | CHR | Position | MAF | MAF | OR | adj. P | Nearest Gene | |
|---|---|---|---|---|---|---|---|---|---|
| rs7513180 | 1 | 63874130 | 0.03579 | 0.02373 | 7.39 × 10−6 | 1.527 | 2.91 × 10−6 | ROR1 | |
| rs117994780 | 2 | 71677869 | 0.02517 | 0.01539 | 8.69 × 10−6 | 1.651 | 1.29 × 10−5 | DYSF | |
| rs237885 | 3 | 8753857 | 0.2696 | 0.305 | 7.55 × 10−6 | 0.8412 | 2.57 × 10−6 | OXTR | |
| rs3814411 | 3 | 112333058 | 0.02214 | 0.01307 | 8.50 × 10−6 | 1.709 | 1.98 × 10−5 | CD200 | |
| rs143616043 | 5 | 51456733 | 0.02783 | 0.04289 | 9.11 × 10−6 | 0.6389 | 7.82 × 10−6 | ISL1 | |
| Younger | rs146654893 | 9 | 21619135 | 0.01959 | 0.01118 | 9.11 × 10−6 | 1.766 | 2.67 × 10−5 | F2Z2F3 |
| Population | rs117753381 | 10 | 10644914 | 0.02692 | 0.01619 | 2.15 × 10−6 | 1.681 | 3.29 × 10−6 | CELF2 |
| (<60) | rs77137422 | 12 | 20324909 | 0.04933 | 0.03411 | 2.03 × 10−6 | 1.469 | 1.04 × 10−5 | PDE3A |
| rs9788929 | 16 | 16829414 | 0.1914 | 0.1625 | 6.27 × 10−6 | 1.22 | 3.30 × 10−5 | XYLT1 | |
| rs374431 | 19 | 58279347 | 0.4243 | 0.4625 | 7.44 × 10−6 | 0.8563 | 1.07 × 10−5 | ZNF8-ERVK3-1 | |
| rs13046594 | 21 | 38436779 | 0.05021 | 0.03464 | 1.51 × 10−6 | 1.473 | 3.97 × 10−6 | ERG | |
| rs738096 | 22 | 17773177 | 0.3911 | 0.4289 | 8.24 × 10−6 | 0.8552 | 1.78 × 10−5 | BID | |
| rs76079963 | 22 | 48857843 | 0.03724 | 0.02495 | 8.57 × 10−6 | 1.511 | 1.33 × 10−4 | TAFA5 | |
| rs140318176 | 2 | 125365220 | 0.01348 | 0.02041 | 9.91 × 10−6 | 0.656 | 3.95 × 10−5 | - | |
| rs11133245 | 4 | 53154174 | 0.1834 | 0.2045 | 6.53 × 10−6 | 0.8737 | 7.65 × 10−6 | SCFD2 | |
| rs145208055 | 4 | 8895796 | 0.02181 | 0.01371 | 2.82 × 10−8 | 1.604 | 4.61 × 10−7 | HMX1 | |
| rs1521224 | 6 | 121973799 | 0.06622 | 0.08051 | 4.15 × 10−6 | 0.81 | 5.45 × 10−6 | HSF2 | |
| rs9345070 | 6 | 91015542 | 0.4682 | 0.4958 | 2.05 × 10−6 | 0.8952 | 3.39 × 10−6 | MAP3K7 | |
| Older | rs74774546 | 6 | 121787961 | 0.1254 | 0.1461 | 3.32 × 10−7 | 0.8378 | 1.10 × 10−6 | GJA1 |
| Population | rs4726966 | 7 | 148387557 | 0.04967 | 0.06191 | 7.70 × 10−6 | 0.7919 | 2.23 × 10−5 | CNTNAP2 |
| (≥60) | rs148814099 | 9 | 89141883 | 0.01917 | 0.01285 | 7.57 × 10−6 | 1.501 | 2.44 × 10−5 | SHC3 |
| rs10781570 | 10 | 132372299 | 0.1387 | 0.1214 | 8.37 × 10−6 | 1.166 | 3.21 × 10−5 | LRRC27 | |
| rs28503213 | 18 | 77663436 | 0.2459 | 0.2238 | 6.25 × 10−6 | 1.131 | 3.40 × 10−5 | GALR1 | |
| rs2272537 | 19 | 35704684 | 0.1347 | 0.1173 | 6.83 × 10−6 | 1.171 | 2.36 × 10−6 | ZBTB32 | |
| rs56792854 | 19 | 35737488 | 0.1316 | 0.1147 | 8.82 × 10−6 | 1.17 | 2.96 × 10−6 | KMT2B | |
| rs60128322 | 19 | 35768908 | 0.1351 | 0.1179 | 7.63 × 10−6 | 1.169 | 1.61 × 10−6 | PROSER3 |
1 All genome-wide significant SNPs for each independent locus were identified in the TWB2.0 Biobank. For a complete list of cataract risk SNPs (p < 1 × 10−4), please refer to Table S1 (<60 years old), Table S2 (≥60 years old), and Table S3 (all). Abbreviations: SNP = single nucleotide polymorphism; CHR = chromosome; MAF = minor allele frequency; OR = odds ratio.
Comparison of the predictive performance of PRS with different tuning parameters.
| Case < 60 | Case > 60 | |||||||
|---|---|---|---|---|---|---|---|---|
| Tuning Parameters 1 | N SNPs | Mean PRS | AUC (95% CI) | Top N SNPs Included | Mean PRS | AUC (95% CI) | ||
| Case | Control | TWB2.0 | for PRS Calculation | Case | Control | TWB2.0 | ||
| 95 | 0.0733 | 0.0130 | 0.7129 (0.6996, 0.7262) | 131 | 0.0138 | −0.0250 | 0.6693 (0.6597, 0.6790) | |
| 90 | 0.0818 | 0.0238 | 0.7102 (0.6969, 0.7235) | 130 | 0.0152 | −0.0231 | 0.6697 (0.6600, 0.6793) | |
| 228 | 0.1925 | 0.0366 | 0.7874 (0.7756, 0.7993) | 292 | 0.0404 | −0.0616 | 0.7383 (0.7295, 0.7472) | |
| 218 | 0.2046 | 0.0555 | 0.7862 (0.7743, 0.7980) | 287 | 0.0415 | −0.0595 | 0.7385 (0.7296, 0.7473) | |
| 428 | 0.3733 | 0.0602 | 0.8528 (0.8430, 0.8626) | 547 | 0.1099 | −0.0896 | 0.7907 (0.7826, 0.7987) | |
| 415 | 0.3814 | 0.0810 | 0.8527 (0.8429, 08625) | 535 | 0.1134 | −0.0830 | 0.7903 (0.7822, 0.7983) | |
| 643 | 0.5352 | 0.0641 | 0.8915 (0.8833, 0.8998) | 809 | 0.2018 | −0.0912 | 0.823 (0.8156, 0.8305) | |
| 617 | 0.5358 | 0.0836 | 0.8913 (0.8831, 0.8996) | 787 | 0.2065 | −0.0810 | 0.8226 (0.8151, 0.8300) | |
| 838 | 0.6788 | 0.0521 | 0.9166 (0.9095, 0.9237) | 1024 | 0.2918 | −0.0917 | 0.8464 (0.8394, 0.8533) | |
| 804 | 0.6716 | 0.0700 | 0.9165 (0.9094, 0.9236) | 991 | 0.2942 | −0.0812 | 0.8461 (0.8391, 0.8530) | |
1 Tuning parameters, including genome-wide significance (p) and r2 for LD clumping. The table shows that the mean PRS is higher among the cases than the controls across all PRS models. Abbreviations: SNP = single nucleotide polymorphism, PRS = polygenic risk score; AUC (95% C.I) = area under curve (95% confidence interval); TWB2.0 = Taiwan Biobank 2.0.
Figure 2Comparison of cataract risks in TWB2.0 classified by PRS quantile. (A) Distribution of the polygenic risk score (PRS_younger) in younger cataract cases (<60) and controls. (B) Distribution of younger cases and controls according to PRS_younger quantiles. (C) Odds ratio for developing cataract in younger population according to PRS_younger quantiles. (D–F) are the cataract risks classified by PRS_older in older populations (≥60 years old).
Distribution of cataract cases and controls regarding PRS quantiles in younger and older groups.
| (min,Q1) | (Q1,Q2) | (Q2,Q3) | (Q3,Q4) | ||
|---|---|---|---|---|---|
| Case <60, N = 1567 | 77 | 159 | 382 | 949 | |
| Younger | (age < 60%) | 4.91% | 10.15% | 24.38% | 60.56% |
| Population | Control, N = 10,603 | 2965 | 2884 | 2660 | 2094 |
| (age < 60) 1 | (n,%) | 27.96% | 27.20% | 25.09% | 19.75% |
|
|
|
|
|
| |
| Case >60, N = 4095 | 341 | 691 | 1119 | 1944 | |
| Older | (age ≥ 60,%) | 8.33% | 16.87% | 27.33% | 47.47% |
| Population | Control, N = 10,603 | 3333 | 2984 | 2555 | 1731 |
| (age ≥ 60) 1 | (n,%) | 31.43% | 28.14% | 24.10% | 16.33% |
|
|
|
|
|
|
1 PRS_younger was used to assess the younger population (<60 years old), while PRS_older was used to assess the older population (≥60 years old) Abbreviations: OR = odds ratio with the reference being the lowest PRS quantile group (min,Q1); Q = quantile; 95% C.I. = 95% confidence interval.
Risk of high PRS groups for development of cataracts for younger cases (<60) and older cases (≥60).
| High PRS Group | Reference Group | OR for Case < 60 (95% C.I) | OR1 for Case ≥ 60 (95% C.I) |
|---|---|---|---|
| Top 25% | Remaining 75% | 6.24 (5.58, 6.98) | 4.63 (4.28, 5.02) |
| Top 20% | Remaining 80% | 6.26 (5.59, 7.00) | 4.77 (4.38, 5.20) |
| Top 10% | Remaining 90% | 7.09 (6.22, 8.08) | 5.48 (4.89, 6.14) |
| Top 5% | Remaining 95% | 9.16 (7.73, 10.85) | 6.74 (5.74, 7.94) |
Abbreviations: PRS = polygenic risk score model; OR (95% C.I.) = odds ratio (95% confidence interval).
Figure 3Receiver operating characteristic (ROC) curves for the polygenic risk score (PRS) model. (A) PRS_younger refers to younger cataract cases (<60), and (B) PRS_older refers to the older cataract cases (≥60).