| Literature DB >> 30140000 |
Pradeep Natarajan1,2,3, Gina M Peloso4, Seyedeh Maryam Zekavat3,5,6, May Montasser7, Andrea Ganna3,8, Mark Chaffin3, Amit V Khera1,2,3, Wei Zhou9, Jonathan M Bloom3,8, Jesse M Engreitz3,10, Jason Ernst11, Jeffrey R O'Connell7, Sanni E Ruotsalainen12, Maris Alver13, Ani Manichaikul14, W Craig Johnson15, James A Perry7, Timothy Poterba3,8, Cotton Seed3,8, Ida L Surakka12, Tonu Esko13, Samuli Ripatti12, Veikko Salomaa12, Adolfo Correa16, Ramachandran S Vasan17,18,19, Manolis Kellis3,20, Benjamin M Neale1,2,3,8, Eric S Lander3, Goncalo Abecasis21, Braxton Mitchell7, Stephen S Rich14, James G Wilson16,22, L Adrienne Cupples4,19, Jerome I Rotter23, Cristen J Willer24, Sekar Kathiresan25,26,27.
Abstract
Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits-plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30140000 PMCID: PMC6107638 DOI: 10.1038/s41467-018-05747-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Schematic of genomic variant discovery and analyses. Variants were jointly discovered in three distinct sets: (1) FHS, JHS, and OOA; (2) MESA; and (3) EST and FIN. Cohorts included in analyses are denoted by color-coded icons. Allele frequency spaces assessed are indicated for analyses. EST Estonia, FHS Framingham Heart Study, FIN Finland, JHS Jackson Heart Study, MESA Multi-Ethnic Study of Atherosclerosis, OOA Old Order Amish
Fig. 2Deep-coverage WGS identifies genomic variation across the allelic spectrum. Variant counts by allele count/frequency bin within each of the cohorts. Singletons (“AC 1”) and doubletons (“AC 2”) are separately distinguished from allele frequency bins within each cohort. Variants were jointly discovered in three distinct sets: (1) FHS, JHS, and OOA; (2) MESA; and (3) EST and FIN. AC allele count, EST Estonia, FHS Framingham Heart Study, FIN Finland, JHS Jackson Heart Study, MAF minor allele frequency, MESA Multi-Ethnic Study of Atherosclerosis, OOA Old Order Amish
Fig. 3Schematic of non-coding rare variant analyses. Four grouping schematics of rare non-coding variants (MAF <1%). (1) The sliding window approach tiles across the genome at fixed widths, only including variants overlying annotations consistent with enhancers, promoters, and DHS in non-exonic regions. All other approaches attempt to map non-coding putative functional genomic regions with discrete genes as the analytical unit. Overall, they are based on: (2) promoter, enhancer, and DHS annotations near a gene’s transcription start site, (3) co-occurrence of enhancer and DHS annotations with HepG2 gene expression, and (4) H3K27ac marks within Hi-C contact regions mapped to genes. DHS DNase hypersensitivity site, MAF minor allele frequency
Effect of monogenic mutation or polygenic score on odds for extremely high or low LDL-C
| Extremely high LDL-C | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Ancestry |
|
| Monogenic carrier ( | High polygenic score ( | Monogenic carrier OR (95% CI) | Monogenic carrier | Monogenic carrier PAF | High polygenic score OR (95% CI) | High polygenic score | Top 5th percentile of polygenic score PAF |
| EA | 5910 | 284 | 5 | 64 | 10.92 (3.71, 32.14) | 1.4 × 10−5 | 1.60 | 7.65 (5.56, 10.52) | 5.7 × 10−36 | 19.6 |
| AA | 4380 | 217 | 7 | 29 | 7.43 (3.01, 18.35) | 1.4 × 10−5 | 2.79 | 3.2 (2.1, 4.89) | 6.7 × 10−8 | 9.2 |
|
| ||||||||||
|
|
|
|
|
|
|
|
|
|
| |
| EA | 5910 | 286 | 6 | 82 | 21.73 (6.2, 76.15) | 1.5 × 10−6 | 2.00 | 10.38 (7.69, 14.02) | 1.5 × 10−52 | 25.9 |
| AA | 4380 | 218 | 11 | 32 | 13.83 (6.25, 30.62) | 9.4 × 10−11 | 4.68 | 3.7 (2.46, 5.58) | 3.9 × 10−10 | 10.7 |
Values are represented as OR [95% CI] for association with given trait. (b). Effect of monogenic mutation or polygenic score on LDL-C in mg/dl. Values are represented as beta [95% CI] in mg/dl for LDL-C. Multi-variable associations were performed with sex + age + age2 (effects not listed) with monogenic carrier status + high polygenic score using logistic regression. Polygenic risk score was derived from 2 million variants using LDpred. High polygenic score was defined as membership in the top 5th percentile of the ancestry-specific score distribution. AA, African American; EA, European American; SE, standard error.
Effect of monogenic mutation or polygenic score on LDL-C in mg/dl
| Monogenic mutation or high polygenic score | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Ancestry |
| Monogenic carrier ( | High polygenic score ( | Monogenic carrier | Monogenic carrier SE | Monogenic carrier | Highpolygenic score | High polygenic score SE | High polygenic score |
| EA | 5910 | 18 | 297 | 29.98 | 8.07 | 2.1 × 10−4 | 33.07 | 2.05 | 1.7 × 10−57 |
| AA | 4380 | 25 | 220 | 41.05 | 7.93 | 2.3 × 10−7 | 16.96 | 2.74 | 6.4 × 10−10 |
|
| |||||||||
|
|
|
|
|
|
|
|
|
|
|
| EA | 5910 | 12 | 297 | −47.25 | 9.55 | 7.7 × 10−7 | −35.00 | 2.00 | 7.9 × 10−67 |
| AA | 4380 | 28 | 220 | −41.41 | 7.47 | 3.1 × 10−8 | −20.41 | 2.74 | 1.1 × 10−13 |
Values are represented as beta [95% CI] in mg/dl for LDL-C. Multi-variable associations were performed with sex + age + age2 (effects not listed) with monogenic carrier status + high polygenic score using linear regression. Polygenic risk score was derived from 2 million variants using LDpred. High polygenic score was defined as membership in the top 5th percentile of the ancestry-specific score distribution. AA, African American; EA, European American; SE, standard error.