| Literature DB >> 32841307 |
Marina Serper1,2,3, Marijana Vujkovic1, David E Kaplan1,2, Rotonya M Carr1,2, Kyung Min Lee4,5,6, Qing Shao4, Donald R Miller4, Peter D Reaven7, Lawrence S Phillips8,9, Christopher J O'Donnell10,11, James B Meigs12, Peter W F Wilson8,9, Rachel Vickers-Smith13, Henry R Kranzler13,14, Amy C Justice15,16,17, John M Gaziano10,18, Sumitra Muralidhar19, Saiju Pyarajan10,11, Scott L DuVall6,20, Themistocles L Assimes21,22, Jennifer S Lee21,22, Philip S Tsao21,22, Daniel J Rader2,23,24,25, Scott M Damrauer1,26, Julie A Lynch6,27, Danish Saleheen1,28, Benjamin F Voight1,23,29, Kyong-Mi Chang1,2.
Abstract
BACKGROUND & AIMS: Given ongoing challenges in non-invasive non-alcoholic liver disease (NAFLD) diagnosis, we sought to validate an ALT-based NAFLD phenotype using measures readily available in electronic health records (EHRs) and population-based studies by leveraging the clinical and genetic data in the Million Veteran Program (MVP), a multi-ethnic mega-biobank of US Veterans.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32841307 PMCID: PMC7447043 DOI: 10.1371/journal.pone.0237430
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Baseline characteristics of the MVP NAFLD analytic cohort defined by the ALT-threshold definition.
| All Ancestries (n = 192,616) | European Ancestry (n = 148,354) | African Ancestry (n = 31,878) | Hispanic/Latino Ancestry (n = 12,384) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| CHARACTERISTIC | TOTAL | CASES | CONTROLS | CASES | CONTROLS | CASES | CONTROLS | CASES | CONTROLS |
| Participants, n | 192,616 | 60,542 | 132,074 | 46,653 | 101,701 | 8,019 | 23,859 | 5,870 | 6,514 |
| Age at enrollment, mean ± sd | 64.5 ± 13.1 | 61.8 ± 12.1 | 66.2 ± 13.7 | 63.1 ± 11.9 | 64.2 ± 14.8 | 58.2 ± 10.9 | 57.0 ± 14.5 | 56.5 ± 13.2 | 59.9 ± 16.0 |
| Female, n (%) | 16,209 (8.4%) | 6,107 (10.1%) | 10,102 (7.6%) | 4,325 (9.3%) | 6,169 (6.1%) | 1,230 (15.3%) | 3,508 (14.7%) | 552 (9.4%) | 425 (6.5%) |
| BMI ≥30 kg/m2, n (%) | 88,547 (50.0%) | 34,676 (57.3%) | 53,871 (40.8%) | 26,509 (56.8%) | 39,672 (39%) | 47,75 (59.5%) | 11,115 (46.6%) | 3,392 (57.8%) | 3,084 (47.3%) |
| Hypertension, n (%) | 136,862 (71.1%) | 49,423 (81.6%) | 87,439 (66.2%) | 38,375 (82.3%) | 66,946 (65.8%) | 6,921 (86.3%) | 16,668 (69.9%) | 4,127 (70.3%) | 3,825 (58.7%) |
| Diabetes, n (%) | 49,961 (25.9%) | 21,161 (35%) | 28,800 (21.8%) | 16,215 (34.8%) | 21,232 (20.9%) | 2,907 (36.3%) | 5,899 (24.7%) | 2,039 (34.7%) | 1,669 (25.6%) |
| Prediabetes, n (%) | 72,505 (37.6%) | 20,210 (33.4%) | 52,295 (39.6%) | 15,664 (33.6%) | 40,043 (39.4%) | 2,571 (32.1%) | 9,790 (41%) | 2,075 (35.3%) | 2,462 (37.8%) |
| Dyslipidemia, n (%) | 97,698 (50.7%) | 40,967 (67.7%) | 5,6731 (43%) | 31,900 (68.4%) | 44,450 (43.7%) | 4,693 (58.5%) | 8,578 (26.9%) | 4,374 (74.5%) | 3,703 (56.8%) |
| Metabolic Risk Factor, n (%) | 179,822 (93.4%) | 58,964 (97.4%) | 120,858 (91.5%) | 45,487 (97.5%) | 93,870 (92.3%) | 7,835 (97.4%) | 21,282 (89.2%) | 5,675 (96.7%) | 5,706 (87.6%) |
| Alcohol misuse, n (%) | 59,304 (30.8%) | 18,571 (30.7%) | 40,733 (30.8%) | 14,659 (31.4%) | 32,966 (32.4%) | 2,142 (26.7%) | 5,909 (18.5%) | 1,770 (30.2%) | 1,858 (28.5%) |
| Peak ALT U/L, median [IQR] | 28 [21 – 42] | 43 [33 – 57] | 23 [18 – 29] | 44 [33 – 60] | 24 [19 – 31] | 42 [31 – 57] | 23 [18 – 31] | 47 [35 – 64] | 24 [19 – 31] |
| Cirrhosis and chronic liver disease not due to alcohol | 1616 (0.8%) | 1212 (2%) | 404 (0.3%) | 574 (1.2%) | 217 (0.2%) | 169 (2.1%) | 115 (0.5%) | 469 (8%) | 72 (1.1%) |
Abbreviations: ALT = alanine aminotransferase, AST = aspartate aminotransferase, BMI = body mass index, IQR = Interquartile range. All between and within-group comparisons were P<0.001.
Previously published NAFLD risk variants with genome-wide significant association with clinical NAFLD phenotypes across all ancestries in the Million Veteran Program NAFLD analytic cohort.
| NAFLD Phenotype: ALT-Threshold (n = 192,616 Total: 60,542 cases, 132,074 controls) | NAFLD Phenotype: ALT-Metabolic (n = 191,038 Total: 58,964 cases, 132,074 controls | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model 1 (Base) | Model 2 (Base + Alcohol)) | Model 3 (Base+Alcohol+Metab) | Model 1 (Base) | Model 2 (Base + Alcohol) | |||||||||||
| Previously published NAFLD risk variants | |||||||||||||||
| Gene | rsID | Chr | Pos (Mb) | EA | EAF | OR (95% CI) | P | OR (95% CI) | P | OR (95% CI) | P | OR (95% CI) | P | OR (95% CI) | P |
| LYPLAL1 | rs12137855 | 1 | 219.4 | C | 0.80 | 1.00 (0.98–1.02) | 0.9 | 1.00 (0.98–1.02) | 0.92 | 1.00 (0.98–1.02) | 0.83 | 1.00 (0.98–1.02) | 0.87 | 1.00 (0.98–1.02) | 0.89 |
| LYPLAL1 | rs3001032 | 1 | 219.7 | T | 0.69 | 1.04 (1.02–1.06) | 1.04 (1.02–1.05) | 1.04 (1.03–1.06) | 1.04 (1.02–1.05) | 1.04 (1.02–1.05) | |||||
| GCKR | rs780094 | 2 | 27.7 | T | 0.40 | 1.02 (1.00–1.03) | 0.02 | 1.02 (1.00–1.03) | 0.04 | 1.01 (0.99–1.03) | 0.20 | 1.02 (1.00–1.03) | 0.02 | 1.01 (1.00–1.03) | 0.06 |
| HSD17B13 | rs72613567 | 4 | 88.2 | T | 0.73 | 1.09 (1.07–1.11) | 1.09 (1.07–1.11) | 1.10 (1.08–1.12) | 1.09 (1.07–1.11) | 1.09 (1.07–1.11) | |||||
| PPP1R3B | rs4240624 | 8 | 9.2 | G | 0.09 | 1.12 (1.09–1.14) | 1.12 (1.09–1.14) | 1.12 (1.10–1.15) | 1.12 (1.09–1.14) | 1.12 (1.09–1.14) | |||||
| TM6SF2 | rs2228603 | 19 | 19.3 | T | 0.08 | 1.19 (1.15–1.22) | 1.19 (1.16–1.23) | 1.24 (1.20–1.27) | 1.19 (1.15–1.22) | 1.19 (1.16–1.23) | |||||
| TM6SF2 | rs58542926 | 19 | 19.4 | T | 0.07 | 1.23 (1.19–1.26) | 1.23 (1.20–1.27) | 1.29 (1.26–1.33) | 1.23 (1.19–1.26) | 1.23 (1.20–1.27) | |||||
| PNPLA3 | rs738409 | 22 | 44.3 | G | 0.23 | 1.31 (1.29–1.33) | 1.31 (1.29–1.33) | 1.35 (1.33–1.38) | 1.31 (1.29–1.33) | 1.31 (1.29–1.34) | |||||
*Newly defined LYPLAL1 variant associated with NAFLD in the Million Veteran Program based on the regional association plot (S1 Fig).
Abbreviations: rsID: dbSNP identifier (build 151), Chr: chromosome,Pos (Mb): megabase position on human genome reference hg19, EA: effect allele, EAF: effect allele frequency among Europeans (Million Veteran Program), OR: odds ratio of risk in cases compared to controls per effect allele (additive model), CI: confidence interval. LYPLAL1: Lysophospholipase-like Protein 1, GCKR: glucokinase regulatory protein, HSD17B13: Hydroxysteroid 17-Beta Dehydrogenase 13, PPP1R3B: protein phosphatase 1, TM6SF2: Transmembrane 6 Superfamily Member 2, PNPLA3: patatin-like phospholipase domain-containing protein 3. Model 1: adjusted for age, gender, and 10 principal components (PCs). Model 2: covariates in Model 1 + alcohol consumption at enrollment measured by the Alcohol Use Disorder Identification Test (AUDIT-C). Model 3: covariates in Model 2 + metabolic risk factors (Type II diabetes/prediabetes, hypertension, dyslipidemia and BMI ≥ 30 kg/m2). P-values below 0.006 (adjusted for multiple comparisons) are shown in bold font.
Previously published ALT-associated variants with genome-wide significance and association with maximal ALT at enrollment.
| NAFLD Phenotype: ALT-Max (n = 192,616) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model 1 (Base) | Model 2 (Base + Alcohol) | Model 3 (Base+Alcohol+Metab) | ||||||||||||
| Gene | rsID | Chr | Pos | EA | EAF | Beta | SE | P | Beta | SE | P | Beta | SE | P |
| HSD17B13 | rs6834314 | 4 | 88213808 | A | 0.72 | 0.781 | 0.090 | 0.759 | 0.091 | 0.805 | 0.090 | |||
| HSD17B13 | rs72613567 | 4 | 88231392 | T | 0.73 | 0.893 | 0.096 | 0.867 | 0.097 | 0.920 | 0.096 | |||
| TRIB1 | rs2954021 | 8 | 126482077 | A | 0.50 | 0.787 | 0.080 | 0.814 | 0.08 | 0.762 | 0.080 | |||
| ERLIN1 | rs10883437 | 10 | 101795361 | T | 0.61 | 0.501 | 0.081 | 0.524 | 0.081 | 0.534 | 0.081 | |||
| ERLIN1 | rs11597390 | 10 | 101861435 | G | 0.64 | 0.789 | 0.085 | 0.797 | 0.086 | 0.801 | 0.085 | |||
| ERLIN1 | rs11597086 | 10 | 101953705 | A | 0.58 | 1.114 | 0.087 | 1.115 | 0.087 | 1.140 | 0.087 | |||
| ERLIN1 | rs11591741 | 10 | 101976501 | G | 0.58 | 1.090 | 0.086 | 1.096 | 0.086 | 1.120 | 0.086 | |||
| PNPLA3 | *rs738409 | 22 | 44324727 | G | 0.23 | 2.580 | 0.097 | 2.574 | 0.098 | 2.684 | 0.097 | |||
| PNPLA3 | rs2281135 | 22 | 44332570 | A | 0.17 | 2.263 | 0.105 | 2.240 | 0.106 | 2.357 | 0.105 | |||
| PNPLA3 | rs2143571 | 22 | 44391686 | A | 0.18 | 1.235 | 0.098 | 1.212 | 0.098 | 1.276 | 0.097 | |||
Abbreviations: rsID: dbSNP identifier (build 151), Chr: chromosome, Pos: base pair position on human genome reference hg19, EA: effect allele, EAF: effect allele frequency among Europeans (Million Veteran Program), Beta: effect size estimated increase in trait per increase copy of the effect allele (additive model). SE: Standard error on Beta, HSD17B13: Hydroxysteroid 17-Beta Dehydrogenase 13, TRIB1:Tribbles Homolog 1, ERLIN1: ER Lipid Raft Associated 1, PNPLA3: patatin-like phospholipase domain-containing protein 3. Model 1: adjusted for age, gender, and 10 principal components (PCs), Model 2: covariates in Model 1 + alcohol consumption at enrollment measured by the Alcohol Use Disorder Identification Test (AUDIT-C), Model 3: covariates in Model 2 + Type II diabetes/prediabetes, hypertension, dyslipidemia and BMI ≥ 30 kg/m2. P-values below 0.006 (adjusted for multiple comparisons) are shown in bold font.
Electronic health record validation of NAFLD phenotype.
| (n = 241) | (n = 178) | |
| PPV | 0.89 | 0.89 |
| (n = 216) | (n = 142) | |
| PPV | 0.71 | 0.84 |
* ALT > 30 for men and > 20 for women during at least two time points at least 6 months apart within a two-year period and no other chronic liver disease.
** ALT > 40 for men and > 30 for women during at least two time points at least 6 months apart within a two-year period and no other chronic liver disease irrespective of metabolic risk factors. Sample size is lower due to exclusion of n = 137 from cases/control due to intermediate ALT values between 20–30 units/L.
Abbreviations: ALT:alanine aminotransferase, M:male, F:female; NAFLD = non-alcoholic fatty liver disease, PPV = positive predictive value in the validation sample.
Previously published ALT level/NAFLD risk-associated variants with genome-wide significance and associations with advanced fibrosis/cirrhosis and platelet count at enrollment among patients with NAFLD (n = 60,542).
| FIB4 score >2.670 (n = 7,376 cases, 53,166 controls) | NAFLD fibrosis score ≥0.676 (n = 18,363 cases, 42,179 controls) | Platelet Count (n = 60,542) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GENE | rsID | Chr | Pos | EA | EAF | OR (95% CI) | P | OR (95% CI) | P | BETA | SE | P |
| LYPLAL1 | rs12137855 | 1 | 219448378 | C | 0.8 | 0.97 (0.94–1.01) | 0.11 | 0.98 (0.95–1.01) | 0.18 | 0.214 | 0.214 | 0.32 |
| LYPLAL1 | rs3001032 | 1 | 219727779 | T | 0.69 | 1.00 (0.97–1.03) | 0.91 | 0.98 (0.95–1.00) | 0.082 | -0.298 | 0.178 | 0.095 |
| *GCKR | rs780094 | 2 | 27741237 | T | 0.4 | 1.04 (1.01–1.07) | 0.0092 | 1.00 (0.97–1.02) | 0.8 | 1.605 | 0.179 | |
| HSD17B13 | rs6834314 | 4 | 88213808 | A | 0.72 | 1.05 (1.02–1.08) | 1.01 (0.98–1.04) | 0.47 | -1.504 | 0.193 | ||
| *HSD17B13 | rs72613567 | 4 | 88231392 | T | 0.73 | 1.06 (1.02–1.09) | 1.02 (0.99–1.05) | 0.25 | -1.688 | 0.206 | ||
| TRIB1 | rs2954021 | 8 | 126482077 | A | 0.5 | 0.99 (0.97–1.02) | 0.55 | 0.99 (0.96–1.01) | 0.25 | -0.259 | 0.169 | 0.13 |
| PPP1R3B | rs4240624 | 8 | 9184231 | G | 0.09 | 0.96 (0.92–1.01) | 0.084 | 1.00 (0.96–1.04) | 0.89 | -0.376 | 0.266 | 0.16 |
| ERLIN | rs10883437 | 10 | 101795361 | T | 0.61 | 1.02 (1.00–1.05) | 0.089 | 1.01 (0.99–1.04) | 0.4 | -0.183 | 0.171 | 0.29 |
| *ERLIN | rs11597390 | 10 | 101861435 | G | 0.64 | 1.01 (0.98–1.04) | 0.68 | 1.00 (0.97–1.02) | 0.72 | 0.164 | 0.18 | 0.36 |
| *ERLIN | rs11597086 | 10 | 101953705 | A | 0.58 | 1.01 (0.98–1.03) | 0.73 | 1.00 (0.97–1.03) | 0.95 | -0.068 | 0.184 | 0.71 |
| *ERLIN | rs11591741 | 10 | 101976501 | G | 0.58 | 1.00 (0.97–1.03) | 0.82 | 1.00 (0.97–1.02) | 0.87 | -0.064 | 0.182 | 0.72 |
| TM6SF2 | rs2228603 | 19 | 19329924 | T | 0.08 | 0.97 (0.92–1.03) | 0.32 | 0.97 (0.93–1.02) | 0.24 | 0.154 | 0.357 | 0.67 |
| *TM6SF2 | rs58542926 | 19 | 19379549 | T | 0.07 | 1.01 (0.96–1.06) | 0.64 | 0.97 (0.93–1.01) | 0.18 | -0.041 | 0.341 | 0.9 |
| *PNPLA3 | rs738409 | 22 | 44324727 | G | 0.23 | 1.08 (1.05–1.12) | 1.06 (1.03–1.09) | -2.88 | 0.204 | |||
| PNPLA3 | rs2281135 | 22 | 44332570 | A | 0.17 | 1.08 (1.04–1.11) | 1.05 (1.02–1.09) | -2.382 | 0.221 | |||
| PNPLA3 | rs2143571 | 22 | 44391686 | A | 0.18 | 1.05 (1.02–1.09) | 1.04 (1.01–1.07) | -1.437 | 0.206 | |||
Abbreviations: rsID: dbSNP identifier (build 151), Chr: chromosome, Pos (Mb): megabase position on human genome reference hg19, EA: effect allele, EAF: effect allele frequency among Europeans (Million Veteran Program), OR: odds ratio, increased risk in cases compared to controls per effect allele (additive model), CI: confidence interval, Beta: effect size estimated increase in trait per increase copy of the effect allele (additive model). SE: Standard error on Beta, LYPLAL1: Lysophospholipase-like Protein 1, GCKR: glucokinase regulatory protein, HSD17B13: Hydroxysteroid 17-Beta Dehydrogenase 13, PPP1R3B: protein phosphatase 1, TM6SF2: Transmembrane 6 Superfamily Member 2, PNPLA3: patatin-like phospholipase domain-containing protein 3. All analysis adjusted for age, gender, and 10 principal components (PCs); adjustments including alcohol consumption are presented in the Supplement (S6a–S6c Table in S1 File). P-values below 0.006 (adjusted for multiple comparisons) are shown in bold font.