| Literature DB >> 31311600 |
Bahram Namjou1,2, Todd Lingren3,4, Yongbo Huang5, Sreeja Parameswaran5, Beth L Cobb5, Ian B Stanaway6, John J Connolly7, Frank D Mentch7, Barbara Benoit8, Xinnan Niu9, Wei-Qi Wei9, Robert J Carroll9, Jennifer A Pacheco10, Isaac T W Harley11, Senad Divanovic11, David S Carrell12, Eric B Larson12, David J Carey13, Shefali Verma14, Marylyn D Ritchie14, Ali G Gharavi15, Shawn Murphy16, Marc S Williams17, David R Crosslin6, Gail P Jarvik18, Iftikhar J Kullo19, Hakon Hakonarson7,20, Rongling Li21, Stavra A Xanthakos22, John B Harley5,3,23.
Abstract
BACKGROUND: Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver illness with a genetically heterogeneous background that can be accompanied by considerable morbidity and attendant health care costs. The pathogenesis and progression of NAFLD is complex with many unanswered questions. We conducted genome-wide association studies (GWASs) using both adult and pediatric participants from the Electronic Medical Records and Genomics (eMERGE) Network to identify novel genetic contributors to this condition.Entities:
Keywords: Fatty liver; GWAS; Genetic polymorphism; NAFLD; PheWAS; Polygenic risk score
Mesh:
Substances:
Year: 2019 PMID: 31311600 PMCID: PMC6636057 DOI: 10.1186/s12916-019-1364-z
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
The demographic distribution of EMR-linked eMERGE cohorts
| Case_EA | Control_EA | Mean age | ♂/♀ | Mean BMI, kg/m2 | |
|---|---|---|---|---|---|
| Pediatrics* | 396 | 846 | 13.05 (SD 5.41) | 693/549 | 22.70 (SD 7.87) † |
| Adults | 710 | 7725 | 63.50 (SD 16.86) | 3810/4625 | 32.64 (SD 8.21) |
| Total | 1106 | 8571 |
*Defined as ≤ 21 years old
†The average BMI-for-age z score in pediatric cohorts was 1.16 (95% CI = 1.03–1.20, SD = 1.39)
Major SNP association results with NAFLD (case-control), and 4 quantitative case-only GWA studies (NAS score, fibrosis, liver enzymes ALT and AST) in the eMERGE Network. All results adjusted for age, gender, site of genotyping, 3 first principal components, and BMI. For more details and results with p < 10− 5, see Additional file 1: Table S2
| NAFLD-GWAS | |||||||||
| SNP | CHR | Positiona | Gene | Minor allele | MAFb | OR | L95 | U95 |
|
| rs738409 | 22 | 44,324,727 |
| G | 0.23 | 1.79 | 1.58 | 2.02 | 1.70 × 10−20 |
| rs738408 | 22 | 44,324,730 |
| T | 0.23 | 1.79 | 1.58 | 2.02 | 1.93 × 10−20 |
| rs3747207 | 22 | 44,324,855 |
| A | 0.23 | 1.78 | 1.58 | 2.02 | 2.63 × 10−20 |
| rs2294915 | 22 | 44,340,904 |
| T | 0.25 | 1.75 | 1.55 | 1.97 | 1.40 × 10−19 |
| rs2980888 | 8 | 126,507,308 |
| T | 0.31 | 1.36 | 1.20 | 1.53 | 5.98 × 10−07 |
| rs2954038 | 8 | 126,507,389 |
| C | 0.31 | 1.35 | 1.20 | 1.52 | 8.30 × 10−07 |
| NAS score | |||||||||
| SNP | CHR | Position | Gene | Minor allele | MAF | Beta | SE |
| |
| rs5748926 | 22 | 17,649,774 |
| T | 0.34 | 0.91 | 0.16 | 3.81 × 10−08 | |
| rs738409 | 22 | 44,324,727 |
| G | 0.41 | 0.85 | 0.15 | 3.94 × 10−08 | |
| Fibrosis | |||||||||
| SNP | CHR | Position | Gene | Minor allele | MAF | Beta | SE |
| |
| rs698718 | 16 | 68,560,185 |
| A | 0.23 | 0.83 | 0.12 | 2.74 × 10−11 | |
| rs1645976 | 16 | 68,563,509 |
| T | 0.23 | 0.83 | 0.12 | 2.79 × 10−11 | |
| rs72943235 | 2 | 88,500,646 |
| A | 0.01 | 2.38 | 0.43 | 8.18 × 10−08 | |
| ALT liver enzyme | |||||||||
| SNP | CHR | Position | Gene | Minor allele | MAF | Beta | SE |
| |
| rs206833 | 2 | 31,708,616 |
| A | 0.17 | 0.26 | 0.05 | 3.41 × 10−07 | |
| rs2294915 | 22 | 44,340,904 |
| T | 0.34 | 0.20 | 0.04 | 4.04 × 10−07 | |
| rs738409 | 22 | 44,324,727 |
| G | 0.33 | 0.20 | 0.04 | 4.68 × 10−07 | |
| AST liver enzyme | |||||||||
| SNP | CHR | Position | Gene | Minor allele | MAF | Beta | SE |
| |
| rs10272006 | 7 | 21,520,132 |
| G | 0.33 | 0.25 | 0.04 | 5.83 × 10−09 | |
| rs7796796 | 7 | 21,499,857 |
| A | 0.32 | 0.25 | 0.04 | 6.29 × 10− 09 | |
| rs62141163 | 2 | 31,663,114 |
| A | 0.11 | 0.34 | 0.07 | 2.30 × 10−07 | |
Abbreviations: MAF minor allele frequency, OR odds ratio, and 95% confidence interval (CI), Beta change in quantitative case-only phenotypes (NAS score, fibrosis (235 cases), ALT and AST liver enzymes (1075 cases)) per copy of minor allele (direction of beta is for minor alleles, SE standard error of beta; aPosition = GRch37/hg19; bThe direction of all effects is for the minor allele. The minor allele frequency for case-only GWA results is for cases
Fig. 1a, b Manhattan plot (a) and Q–Q plot (b) of genome-wide markers for NAFLD in European ancestry (1106 cases and 8571 controls). A total of 1106 cases of NAFLD and 8571 controls were analyzed after quality control. Logistic regression analysis was performed for 7,261,527 variants with MAF > 1% assuming an additive genetic model, adjusted for age, sex, BMI, genotyping site, and genetic ancestry (principal components 1 through 3). Results are plotted as –log10 p values on the y-axis by position in chromosome (x-axis) (NCBI build 37)
Laboratory, clinical, and histologic characteristics of NAFLD patients included in the case-only association analyses. All individuals were of European ancestry
| Pediatrics | Adult | Overall | |
|---|---|---|---|
| Histologic characteristic—NAS score (0–8) † | 4.01(SD 1.58) | 3.45(SD 1.74) | 3.78(SD 1.76) |
| NAS score ≥ 5 | 43/107 (40%) | 36/128 (28%) | 79/235 (34%) |
| Histologic characteristic—fibrosis score (0–4) † | 0.71(SD 0.67) | 1.01(SD 1.26) | 0.88(SD 1.06) |
| ALT U/L‡ | 40 (37–45) | 63 (59–67) | 53 (49–58) |
| AST U/L‡ | 45 (42–48) | 39 (37–41) | 41 (39–43) |
| Presence of cirrhosis | |||
| Presence of hepatocellular CA |
†NAS and fibrosis score were available for 235 subjects (107 pediatrics and 128 adult subjects). For histologic score, mean and standard deviation is shown
‡ ALT and AST lab values were available for 1075 of cases. Medians and 95% CI of medians are shown
Fig. 2a–c LocusZoom plot of the associations signals in three previously known regions for NAFLD. a Confirmation at 22q13 for PNPLA3. SNP rs738409 is a missense variation (I148M) in PNPLA3 produced the best effect (p = 1.70 × 10− 20). b Detected signal at 19p12 (GATAD2A, NCAN, TM6SF2) region. The best marker in this study was rs56408111 (p = 5.26 × 10− 6). The linkage disequilibrium (LD) between rs56408111 and previously known SNP rs4808199 was r2 = 0.24, D’ = 0.74. c Detected signal at 8q24 (TRIB1) genetic region. The best marker in this study (rs2980888) is shown (see also Additional file 1: Table S2). Estimated recombination rates (from HapMap) are plotted in cyan to reflect the local LD structure. The SNPs surrounding the most significant variant are color-coded to reflect their LD with the index SNP (taken from pairwise r2 values from the HapMap CEU database, www.hapmap.org). Regional plots were generated using LocusZoom (http://csg.sph.umich.edu/locuszoom)
Fig. 3a Means and standard deviations of NAS and fibrosis score (0–12) stratified by genotype of rs738409 at PNPLA3 in 235 NAFLD cases. The results are plotted as the sum of NAS and fibrosis score (0–12) (y-axis) against the three genotypes of rs738409 C>G polymorphism (x-axis). The results are further sub-divided by age groups (pediatrics, adult, and all). Results for IL17RA (b) and ZFP90 (c) also are shown
Fig. 4a–d Regional association plots of best effects in case-only linear regression analyses for continuous traits of NAS score, fibrosis, and ALT liver enzyme, respectively. a The best observed effect near the IL17RA region for NAS score. b The most significant effects at 16q22 near ZFP90 gene for fibrosis. c The effect near FABP1 locus for fibrosis. d An effect at 2p22 near XDH for AST liver enzyme
Fig. 5NAFLD case-control gene-based results using MAGMA as a base and tissue-specific gene expression (GTEx v7 with 30 general tissue types) as a source produced specific enrichment in liver (see “Methods”). List of all MAGMA gene-based results (P < 0.05) is shown in Additional file 1: Table S5
Fig. 6a–d ROC graphical plot that illustrates the diagnostic ability of the binary classifier NAFLD (cases and controls) and NAS score (above and below 5) using weighted GRS score of ten previously published SNPs (GRS-10, see “Results”). The sensitivity and specificity and AUC measures for each plot are also shown. a ROC curve for NAFLD-1106 cases and 8571 controls. b ROC curve for NAS score (79 cases above NAS score ≥ 5 versus 156 controls with score < 5). c Adding SNP rs5748926 near IL17RA improved the ROC curves for NAS score (GRS_11); difference between areas 0.035 (SE = 0.012, p = 0.004). d Distribution of quantiles of weighted 10-SNP GRS in NAFLD (cases and controls) and NAS score (above and below 5); percentage of NAFLD risk increases by increasing GRS quantiles; for NAFLD (cases and controls) from 17% in Q1 to 36% in Q4 (OR = 2.16, 95% CI = 1.81–2.58, p < 0.0001); for NAS score above 5 (defined as case) from 10% in Q1 to 43% in Q4 (OR = 8.50, 95% CI 3.45–20.96). The weighted 10-SNP GRS was calculated by multiplying the sum of the number of risk alleles (0, 1, 2) with the allele-specific effect sizes (beta coefficients) obtained from previous publications (see “Methods”)