| Literature DB >> 36042188 |
Jennifer L Halford1,2, Valerie N Morrill1,3, Seung Hoan Choi1, Sean J Jurgens1,4, Giorgio Melloni5, Nicholas A Marston5, Lu-Chen Weng1,3, Victor Nauffal1, Amelia W Hall6, Sophia Gunn7, Christina A Austin-Tse8,9,10, James P Pirruccello1,3, Shaan Khurshid1,3,11, Heidi L Rehm1,9,10, Emelia J Benjamin12,13,14, Eric Boerwinkle15, Jennifer A Brody16, Adolfo Correa17, Brandon K Fornwalt18,19,20, Namrata Gupta1, Christopher M Haggerty18,19, Stephanie Harris3, Susan R Heckbert16,21, Charles C Hong22, Charles Kooperberg23, Henry J Lin24, Ruth J F Loos25,26, Braxton D Mitchell22,27, Alanna C Morrison15, Wendy Post28, Bruce M Psaty16,21,29, Susan Redline30, Kenneth M Rice31, Stephen S Rich32, Jerome I Rotter24, Peter F Schnatz33, Elsayed Z Soliman34, Nona Sotoodehnia16,35, Eugene K Wong3,10, Marc S Sabatine5, Christian T Ruff5, Kathryn L Lunetta7, Patrick T Ellinor1,3,11, Steven A Lubitz36,37,38.
Abstract
Accurate and efficient classification of variant pathogenicity is critical for research and clinical care. Using data from three large studies, we demonstrate that population-based associations between rare variants and quantitative endophenotypes for three monogenic diseases (low-density-lipoprotein cholesterol for familial hypercholesterolemia, electrocardiographic QTc interval for long QT syndrome, and glycosylated hemoglobin for maturity-onset diabetes of the young) provide evidence for variant pathogenicity. Effect sizes are associated with pathogenic ClinVar assertions (P < 0.001 for each trait) and discriminate pathogenic from non-pathogenic variants (area under the curve 0.82-0.84 across endophenotypes). An effect size threshold of ≥ 0.5 times the endophenotype standard deviation nominates up to 35% of rare variants of uncertain significance or not in ClinVar in disease susceptibility genes with pathogenic potential. We propose that variant associations with quantitative endophenotypes for monogenic diseases can provide evidence supporting pathogenicity.Entities:
Mesh:
Year: 2022 PMID: 36042188 PMCID: PMC9427940 DOI: 10.1038/s41467-022-32009-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Baseline cohort characteristics by endophenotype in the UK Biobank
| Characteristic | LDL-C (mg/dL) | QTc (ms) | HbA1c (%) |
|---|---|---|---|
| Participants with a measurable endophenotype, | 189,652 | 33,520 | 189,741 |
| Median endophenotype value (Q1–Q3) | 136 (114–159) | 411 (396–426) | 5.4 (5.1–5.6) |
| Male, | 85,197 (44.9) | 16,582 (49.5) | 85,210 (44.9) |
| European ancestry, | 165,783 (87.4) | 28,249 (84.3) | 166,335 (87.7) |
| Mean age, years (SD) | 57.0 (8.1) | 52.6 (5.7) | 57.0 (8.1) |
| Myocardial infarction, | 2995 (1.6) | 483 (1.4) | - |
| Statin usage, | 31,909 (16.8) | - | - |
| Median high-density lipoprotein, mg/dL (Q1–Q3) | 56.0 (46.3–64.0) | - | - |
| Heart failure, | - | 74 (0.2) | - |
| Beta blocker usage, | - | 1701 (5.1) | - |
| Calcium channel blocker usage, | - | 2455 (7.3) | - |
| Type 2 diabetes medication usage, | - | - | 7090 (3.7) |
| Mean corpuscular volume, femtoliters (SD) | - | - | 91.2 (4.5) |
NB: only select relevant characteristics for the given monogenic disease of interest are displayed.
This table includes median imputed values for select clinical covariates.
Fig. 1Association between effect size and variant pathogenicity for three monogenic disease endophenotypes.
A, B, and C display data for the LDL-C, QTc, and HbA1c endophenotypes, respectively, for rare variants found in the UK Biobank. Definitive familial hypercholesterolemia (FH) genes include LDLR, APOB, PCSK9; definitive long-QT syndrome (LQTS) genes include KCNQ1, KCNH2, SCN5A; common maturity-onset diabetes of the young (MODY) genes include HNF1A, HNF1B, HNF4A, GCK. Row 1 in each panel displays the variant effect size distribution by ClinVar pathogenicity category (colored). Row 2 in each panel displays the estimated difference in endophenotype value comparing carriers of a variant in each ClinVar pathogenicity category to carriers of benign variants (circles) and 95% confidence intervals. Two-sided P values are derived from multiple linear regression model t-statistics and are annotated for variant categories with P < 0.05. Variant classification includes B benign, LB likely benign, LP likely pathogenic, P pathogenic, VUS variant of uncertain significance, C conflicting.
Fig. 2Discrimination of variant pathogenicity by effect size and percent of variants with evidence of pathogenicity.
A This includes variants within familial hypercholesterolemia (FH) genes recommended for secondary findings return (LDLR, APOB, PCSK9); B includes variants within the long-QT syndrome (LQTS) genes recommended for which the return of a secondary variant finding is endorsed (KCNQ1, KCNH2, SCN5A); C includes variants within common maturity-onset diabetes of the young (MODY) genes (HNF1A, HNF1B, HNF4A, GCK). Column 1 includes variants from the UK Biobank, the primary cohort; Column 2 includes variants from FOURIER and TOPMed, the replication cohorts. Rows 1, 3, and 5 display receiver operating characteristic (ROC) curves depicting discrimination of pathogenic variants according to effect size. Variant pathogenicity is taken from ClinVar. The red curve includes “Pathogenic” variants only; the pink curve includes “Pathogenic” and “Likely pathogenic” variants combined; area under the curve (AUC) values and variant numbers are reported in the legend. Large effect size, defined as effect size greater than 0.5 standard deviations (SD) of the trait distribution, is plotted in purple on the “Pathogenic” variant ROC curve. Rows 2, 4, and 6 display variants by ClinVar category and shows the percent of variants with evidence for pathogenicity using the large effect size threshold defined in Rows 1, 3, and 5. Variants with evidence for pathogenicity, defined as having larger effect sizes than the threshold, are in pink; variants with evidence for a benign effect, defined as having smaller effect sizes than the threshold, are in light blue.