| Literature DB >> 29385134 |
Takahisa Kawaguchi1, Toshihide Shima2, Masayuki Mizuno2, Yasuhide Mitsumoto2, Atsushi Umemura2, Yoshihiro Kanbara3, Saiyu Tanaka4, Yoshio Sumida4, Kohichiro Yasui5, Meiko Takahashi1, Keitaro Matsuo6, Yoshito Itoh5, Katsutoshi Tokushige7, Etsuko Hashimoto7, Kendo Kiyosawa8, Masanori Kawaguchi9, Hiroyuki Itoh10, Hirofumi Uto11, Yasuji Komorizono12, Ken Shirabe13, Shiro Takami14, Toshinari Takamura15, Miwa Kawanaka16, Ryo Yamada1, Fumihiko Matsuda1, Takeshi Okanoue2.
Abstract
The genetic factors affecting the natural history of nonalcoholic fatty liver disease (NAFLD), including the development of nonalcoholic steatohepatitis (NASH) and NASH-derived hepatocellular carcinoma (NASH-HCC), are still unknown. In the current study, we sought to identify genetic factors related to the development of NAFLD, NASH, and NASH-HCC, and to establish risk-estimation models for them. For these purposes, 936 histologically proven NAFLD patients were recruited, and genome-wide association (GWA) studies were conducted for 902, including 476 NASH and 58 NASH-HCC patients, against 7,672 general-population controls. Risk estimations for NAFLD and NASH were then performed using the SNPs identified as having significant associations in the GWA studies. We found that rs2896019 in PNPLA3 [p = 2.3x10-31, OR (95%CI) = 1.85 (1.67-2.05)], rs1260326 in GCKR [p = 9.6x10-10, OR (95%CI) = 1.38(1.25-1.53)], and rs4808199 in GATAD2A [p = 2.3x10-8, OR (95%CI) = 1.37 (1.23-1.53)] were significantly associated with NAFLD. Notably, the number of risk alleles in PNPLA3 and GATAD2A was much higher in Matteoni type 4 (NASH) patients than in type 1, type 2, and type 3 NAFLD patients. In addition, we newly identified rs17007417 in DYSF [p = 5.2x10-7, OR (95%CI) = 2.74 (1.84-4.06)] as a SNP associated with NASH-HCC. Rs641738 in TMC4, which showed association with NAFLD in patients of European descent, was not replicated in our study (p = 0.73), although the complicated LD pattern in the region suggests the necessity for further investigation. The genetic variants of PNPLA3, GCKR, and GATAD2A were then used to estimate the risk for NAFLD. The obtained Polygenic Risk Scores showed that the risk for NAFLD increased with the accumulation of risk alleles [AUC (95%CI) = 0.65 (0.63-0.67)].Entities:
Mesh:
Substances:
Year: 2018 PMID: 29385134 PMCID: PMC5791941 DOI: 10.1371/journal.pone.0185490
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Clinical characteristics of the patient populations according to histological classification and the control population enrolled in the study.
| Clinical traits | Control | Matteoni classification of NAFLD | NASH-HCC | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Type 1 | Type 2 | Type 3 | Type 4 | Control vs. Type1 | Type1 vs. Type2 | Type 2vs. Type3 | Type3 vs. Type4 | Type4 vs. HCC | |||
| Number of samples | 7672 | 130 | 134 | 104 | 476 | 58 | |||||
| Sex (Male/Female) | 3022/4650 | 79/51 | 84/50 | 45/59 | 192/284 | 41/17 | 0.85 | 4.3x10-3 | 0.66 | ||
| Age (year) | 52.0±13.4 | 50.7±15.1 | 50.9±14.9 | 52.4±14.1 | 57.9±14.6 | 71.5±9.8 | 0.42 | 0.98 | 0.46 | ||
| Physical measurement | |||||||||||
| BMI | 22.4±3.2 | 26.2±4.3 | 27.4±4.8 | 28.1±4.7 | 27.9±5.1 | 25.0±2.8 | 0.097 | 0.19 | 0.69 | 8.0x10-3 | |
| Visceral fat (cm2) | - | 145.2±65.2 | 159.7±48.1 | 137.4±47.0 | 158.8±55.7 | 160.5±81.1 | - | 0.034 | 0.015 | 0.020 | 0.68 |
| Abdominal circumscription (cm) | 80.4±9.2 | 90.1±9.9 | 93.5±10.2 | 91.8±9.9 | 95.1±12.2 | 94.0±11.0 | 0.091 | 0.37 | 0.080 | 0.87 | |
| Biochemical traits | |||||||||||
| AST (IU/L) | 23.7±10.3 | 31.5±15.5 | 37.2±18.7 | 53.7±38.7 | 60.0±37.3 | 52.8±56.0 | 1.4x10-3 | 4.4x10-4 | 3.8x10-3 | 3.4x10-3 | |
| ALT (IU/L) | 21.7±15.9 | 48.2±31.9 | 63.6±46.4 | 82.6±58.6 | 85.2±66.3 | 51.7±65.8 | 2.4x10-3 | 9.2x10-3 | 0.81 | ||
| GGT (IU/L) | 32.4±45.0 | 80.5±71.5 | 71.5±85.9 | 82.1±74.2 | 80.2±72.7 | 108.4±123.7 | 0.47 | 6.8x10-3 | 0.87 | 0.21 | |
| Albumin (g/dL) | 4.5±0.2 | 4.5 ± 0.4 | 4.4 ± 0.3 | 4.5 ± 0.4 | 4.3 ± 0.4 | 3.9±0.5 | 0.24 | 0.017 | 0.13 | 1.4x10-3 | |
| Total bilirubin (mg/dL) | 0.7±0.3 | 0.9 ± 0.5 | 0.9±0.5 | 0.9±0.4 | 0.9±0.4 | 0.9±0.4 | 0.85 | 0.61 | 0.89 | 0.27 | |
| Cholinesterase (unit) | 335.7±77.1 | 377.9±103.5 | 356.6±97.0 | 332.6±116.0 | 342.2±90.3 | 252.1±77.6 | 0.17 | 0.32 | 0.90 | ||
| Type IV collagen 7S (ng/dL) | - | 3.7±0.7 | 4.0±0.9 | 4.3±1.2 | 5.3±2.1 | 7.0±2.2 | - | 0.057 | 0.031 | ||
| Hyaluronic acid (ng/dL) | - | 27.6±25.3 | 31.6±27.9 | 50.3±71.6 | 82.5±86.2 | 160±114.5 | - | 0.18 | 0.11 | ||
| Triglycerides (mg/dL) | 97.1±67.1 | 157.3±87.0 | 153.1±84.3 | 162.0±94.0 | 161.3±83.9 | 126.0±55.7 | 0.66 | 0.49 | 0.78 | 2.1x10-3 | |
| Total cholesterol (mg/dL) | 207.3±35.1 | 210.9±36.8 | 196.0±38.1 | 206.5±37.2 | 200.3±38.1 | 176.6±28.2 | 0.21 | 9.6x10-4 | 0.048 | 0.13 | |
| HbA1c (%) | 5.4±0.5 | 6.1±1.2 | 5.8±1.0 | 6.1±1.4 | 6.2±1.2 | 6.2±1.3 | 0.21 | 0.24 | 0.26 | 0.22 | |
| IRI (μg/dL) | 5.7±6.9 | 9.4±7.7 | 11.3±8.5 | 11.7±8.1 | 14.9±9.4 | 17.7±9.1 | 0.018 | 0.13 | 0.019 | ||
| FPG (mg/dL) | 90.8±12.5 | 113.0±34.7 | 105.5±25.6 | 111.8±27.2 | 113.7±33.4 | 126.7±30.1 | 0.12 | 0.015 | 0.94 | ||
| HOMA-IR | 1.2±1.0 | 2.4 ± 1.5 | 2.9 ± 2.4 | 3.4±2.3 | 4.2±2.9 | 5.7±3.9 | 0.19 | 0.11 | 1.1x10-3 | 4.2x10-3 | |
| hs-CRP (mg/dL) | 865.6±2970.0 | 1316±2524.4 | 944.2±1180.6 | 1107.9±2062.6 | 1513.8±2267.4 | 2100.4±2508.4 | 0.89 | 0.82 | 0.016 | 0.11 | |
| Adiponectin (μg/mL) | - | 7.2±3.4 | 6.4±2.9 | 7.0±3.3 | 6.6±3.3 | 8.6±4.3 | - | 0.092 | 0.22 | 0.28 | 0.032 |
| Leptin (ng/mL) | - | 9.8±7.0 | 9.5±8.1 | 11.8±8.6 | 13.6±10.5 | 14.5±18.3 | - | 0.43 | 0.056 | 0.25 | 0.5 |
| Ferritin (ng/mL) | - | 147.7±104.8 | 182.9±135.2 | 229.3±248.9 | 227.5±204.9 | 286.4±227.3 | - | 0.038 | 0.67 | 0.49 | 0.11 |
| Uric acid (mg/dL) | - | 5.9 ± 1.5 | 5.9 ± 1.2 | 5.7±1.6 | 5.7±1.5 | 5.4±1.5 | - | 0.59 | 0.41 | 0.94 | 0.32 |
| PLT (x104/μL) | 23.4±5.2 | 22.9±5.5 | 22.9±5.5 | 22.7±6.6 | 19.7±6.1 | 14.9±6.4 | 0.27 | 0.90 | 0.43 | ||
| ANA (0/1/2/3/4) | 2403/1561 | 51/24/7/1/1 | 58/27/7/1/2 | 38/15/7/1/0 | 198/115/41/12/16 | 12/8/2/3/3 | 0.090 | 0.97 | 0.93 | 0.11 | 0.29 |
| Clinical history | |||||||||||
| Diabetes (NGT/IGT/DM) | - | 48/17/38 | 48/11/32 | 35/9/11 | 214/51/120 | 31/4/8 | 0.59 | 0.15 | 0.23 | 0.11 | |
| Hyperlipidemia (-/+) | 4133/518 | 48/73 | 54/72 | 30/48 | 174/266 | 27/20 | 0.70 | 0.63 | 0.95 | 0.027 | |
| Hypertension (-/+) | 3806/845 | 77/44 | 69/57 | 43/35 | 219/222 | 20/32 | 0.19 | 0.92 | 0.44 | 0.17 | |
| Liver biopsy features | |||||||||||
| Brunt grade (1/2/3) | - | - | - | 71/24/4 | 203/196/69 | - | - | - | - | - | |
| Brunt stage (1/2/3/4) | - | - | - | - | 184/101/144/45 | - | - | - | - | - | - |
| Fat droplet (1/2/3/4) | - | 25/57/34/12 | 6/60/33/29 | 8/32/31/22 | 17/144/193/96 | 8/18/7/2 | - | 0.53 | 0.41 | ||
| Iron deposition (0/1/2/3/4) | - | 41/18/27/10/1 | 49/24/16/9/1 | 30/13/7/8/1 | 181/86/54/41/14 | 13/3/1/6/1 | - | 0.16 | 0.94 | 0.72 | 0.93 |
Measurements are shown as mean ± standard deviation. Categorical values are shown by the count number. The distribution of each class is compared to that immediately to its left. The p-values less than the threshold are in bold text. The threshold for significant association was set at 2.4x10-3 for the comparison between control and Matteoni type 1 and at 1.6x10-3 for the other comparisons. BMI: Body Mass Index, AST: Aspartate Aminotransferase, ALT: Alanine transaminase, GGT: Gamma-glutamyl transferase, IRI: Insulin, FPG: Fasting Plasma Glucose, HOMA-IR: Homeostasis model assessment-Insulin Resistance, PLT: Platelets, ANA: Antinuclear Antibodies. P-values are calculated by logistic regression otherwise stated:
†Chi-squared test.
Fig 1Manhattan plots of the GWA data.
The p-values for genotyped SNPs from the GWA study for all NAFLD patients (A) and for NASH-HCC patients (B) compared to controls are plotted along each chromosome in −log10 scale. Horizontal line indicates the Bonferroni significance threshold (p = 5.3×10−7).
SNP markers showing significant association in the GWA studies.
| dbSNPID | Chr. | Nearest Gene | Allele | Genotype counts | NAFLD vs Control | NASH-HCC vs Control | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NAFLD | Control | OR | OR | ||||||||||
| Type 1 | Type 2 | Type 3 | Type 4 | NASH-HCC | (95%CI) | (95%CI) | |||||||
| rs2896019 | 22q13 | T/G | 27/72/31 | 31/54/49 | 25/43/36 | 62/200/214 | 6/18/34 | 2259/3790/1621 | |||||
| (0.52) | (0.57) | (0.55) | (0.66) | (0.74) | (0.46) | ||||||||
| rs1260326 | 2p23 | C/T | 17/65/48 | 19/54/61 | 9/47/48 | 72/210/194 | 9/16/33 | 1461/3666/2541 | 0.0031 | 1.84 | |||
| (0.62) | (0.66) | (0.68) | (0.63) | (0.71) | (0.57) | (1.23–2.75) | |||||||
| rs4808199 | 19p13 | G/A | 72/43/15 | 61/57/16 | 58/36/10 | 208/216/52 | 21/33/4 | 4132/2990/544 | 0.020 | 1.59 | |||
| (0.28) | (0.33) | (0.27) | (0.34) | (0.35) | (0.27) | (1.08–2.35) | |||||||
| rs17007417 | 2p13 | C/T | 85/40/5 | 99/34/1 | 79/23/2 | 329/136/11 | 25/25/8 | 5252/2207/197 | 0.47 | 0.95 | |||
| (0.19) | (0.13) | (0.13) | (0.17) | (0.35) | (0.17) | (0.83–1.09) | |||||||
SNPs showing the strongest association in each chromosomal region are indicated.
*A2 represents the risk allele.
**Genotype counts are shown as (A1A1/A1A2/A2A2).
Odds ratio (OR) with 95% confidence interval (95%CI) was calculated for the risk allele. The p-values less than the threshold (p = 5.3×10−7) are in bold text.
Fig 2Regional Manhattan plots around the SNP markers showing genome-wide significance in the GWA studies.
P-values, gene organization, and linkage disequilibrium (LD) plots according to the chromosomal position of the three significantly genome-wide associated regions for NAFLD (A-C) and the one region for NASH-HCC (D). Each figure spans 200 kb (A, B, and D) or 300 kb (C) in both the 5’ and 3’ directions from the SNP with the strongest association (shown with a red arrow) in the GWA studies. P-values are plotted for both genotyped and imputed SNPs in the upper panels, and previously reported SNPs with genome-wide significance are indicated by green arrows. The colors of the circles representing p-values correspond to the strength of LD (r2) from the most significant SNP in the GWA studies. The brightness of the red color in the LD plots in the lower panels also corresponds to the strength of LD.
Fig 3Histogram of the odds ratios for genotype distributions between different subgroups.
Each box denotes the odds ratio (OR) comparing the control or patient subgroups shown on the horizontal axes. The p-values, ORs, and 95% confidence intervals are shown in the S6 Table.
Fig 4Risk estimation according to Polygenic Risk Scores for NAFLD patients compared with controls.
The dot and bar denote the odds ratio (OR) and its 95% confidence interval in each quintile compared to the 1st quintile using rs2896019 in PNPLA3, rs1260326 in GCKR, and rs4808199 in GATAD2A (A), and the receiver operating characteristic (ROC) curve from the model (red line), and the ROC curves including previously reported SNPs (green dashed line) or candidate SNPs (p<1x10-4) identified in our GWA study (black dashed line) (B) are shown.