| Literature DB >> 30405126 |
Arthur Gilly1, Daniel Suveges1, Karoline Kuchenbaecker1,2,3, Martin Pollard1,4, Lorraine Southam1,5, Konstantinos Hatzikotoulas1,6, Aliki-Eleni Farmaki7,8, Thea Bjornland9, Ryan Waples10, Emil V R Appel11, Elisabetta Casalone12, Giorgio Melloni13, Britt Kilian1, Nigel W Rayner1,5,14, Ioanna Ntalla15, Kousik Kundu1,16, Klaudia Walter1, John Danesh1,17,18, Adam Butterworth17,18,19, Inês Barroso1, Emmanouil Tsafantakis20, George Dedoussis8, Ida Moltke10, Eleftheria Zeggini21,22.
Abstract
The role of rare variants in complex traits remains uncharted. Here, we conduct deep whole genome sequencing of 1457 individuals from an isolated population, and test for rare variant burdens across six cardiometabolic traits. We identify a role for rare regulatory variation, which has hitherto been missed. We find evidence of rare variant burdens that are independent of established common variant signals (ADIPOQ and adiponectin, P = 4.2 × 10-8; APOC3 and triglyceride levels, P = 1.5 × 10-26), and identify replicating evidence for a burden associated with triglyceride levels in FAM189B (P = 2.2 × 10-8), indicating a role for this gene in lipid metabolism.Entities:
Mesh:
Year: 2018 PMID: 30405126 PMCID: PMC6220258 DOI: 10.1038/s41467-018-07070-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Variant discovery and quality in WGS data from 100 samples. a variant discovery rate in 22.5×; b allelic r2 for SNVs and INDELs in both 15× and 22.5× calls. Depth is downsampled randomly from 30×. INDEL: insertion/deletion. SNV: single nucleotide variant. Boxes represent the interquartile range. Bold horizontal lines in boxplots represent the median, the whiskers extend to 1.5 times the interquartile range, and grey dots represent outliers outside the whisker range
Fig. 2Variant count proportions and minor allele frequency bin by functional class. a, b Data is shown for MANOLIS (a) and INTERVAL (b). Functional classes are derived from the Ensembl VEP consequences as detailed in Supplementary Table 6. The number of intergenic variants is likely to be an underestimate due to Ensembl’s most severe consequence annotation. For each panel, the bottom half represents the proportion of variants in each class relative to the total number of variants, the upper half represents the frequency makeup of variants in each class
Fig. 3Regional association plots for burdens in APOC3, FAM189B, UGT1A9, ADIPOQ, and GGT1. Red lines denote the burden P-value and extend over the tested gene. Purple lines indicate the conditioned P-value for the variant described in the text (variants rs887829, rs62625753, and rs3859862 in c, d, e. respectively). Small grey dots indicate single-point P-value for variants in the region not included in the test. Larger coloured dots represent variants included in the test, with size and colour proportional to the score used in the most significant test (Supplementary Table 1 and Supplementary Data File 1). When no weights are applied (a and b), included variants are coloured blue-green. In the gene track below the regional plots, green bars below the gene, if present, denote regulatory regions associated with the gene which were used to include variants in the burden. These are present only for genes where regulatory regions were included in the burden