| Literature DB >> 25078778 |
Elaine T Lim1, Peter Würtz2, Aki S Havulinna3, Priit Palta4, Taru Tukiainen5, Karola Rehnström6, Tõnu Esko7, Reedik Mägi8, Michael Inouye9, Tuuli Lappalainen10, Yingleong Chan11, Rany M Salem12, Monkol Lek5, Jason Flannick13, Xueling Sim14, Alisa Manning15, Claes Ladenvall16, Suzannah Bumpstead6, Eija Hämäläinen4, Kristiina Aalto17, Mikael Maksimow17, Marko Salmi18, Stefan Blankenberg19, Diego Ardissino20, Svati Shah21, Benjamin Horne22, Ruth McPherson23, Gerald K Hovingh24, Muredach P Reilly25, Hugh Watkins26, Anuj Goel26, Martin Farrall26, Domenico Girelli27, Alex P Reiner28, Nathan O Stitziel29, Sekar Kathiresan30, Stacey Gabriel15, Jeffrey C Barrett6, Terho Lehtimäki31, Markku Laakso32, Leif Groop16, Jaakko Kaprio33, Markus Perola34, Mark I McCarthy35, Michael Boehnke14, David M Altshuler13, Cecilia M Lindgren36, Joel N Hirschhorn12, Andres Metspalu8, Nelson B Freimer37, Tanja Zeller19, Sirpa Jalkanen18, Seppo Koskinen38, Olli Raitakari39, Richard Durbin6, Daniel G MacArthur5, Veikko Salomaa3, Samuli Ripatti40, Mark J Daly5, Aarno Palotie41.
Abstract
Exome sequencing studies in complex diseases are challenged by the allelic heterogeneity, large number and modest effect sizes of associated variants on disease risk and the presence of large numbers of neutral variants, even in phenotypically relevant genes. Isolated populations with recent bottlenecks offer advantages for studying rare variants in complex diseases as they have deleterious variants that are present at higher frequencies as well as a substantial reduction in rare neutral variation. To explore the potential of the Finnish founder population for studying low-frequency (0.5-5%) variants in complex diseases, we compared exome sequence data on 3,000 Finns to the same number of non-Finnish Europeans and discovered that, despite having fewer variable sites overall, the average Finn has more low-frequency loss-of-function variants and complete gene knockouts. We then used several well-characterized Finnish population cohorts to study the phenotypic effects of 83 enriched loss-of-function variants across 60 phenotypes in 36,262 Finns. Using a deep set of quantitative traits collected on these cohorts, we show 5 associations (p<5×10⁻⁸) including splice variants in LPA that lowered plasma lipoprotein(a) levels (P = 1.5×10⁻¹¹⁷). Through accessing the national medical records of these participants, we evaluate the LPA finding via Mendelian randomization and confirm that these splice variants confer protection from cardiovascular disease (OR = 0.84, P = 3×10⁻⁴), demonstrating for the first time the correlation between very low levels of LPA in humans with potential therapeutic implications for cardiovascular diseases. More generally, this study articulates substantial advantages for studying the role of rare variation in complex phenotypes in founder populations like the Finns and by combining a unique population genetic history with data from large population cohorts and centralized research access to National Health Registers.Entities:
Mesh:
Year: 2014 PMID: 25078778 PMCID: PMC4117444 DOI: 10.1371/journal.pgen.1004494
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Allele frequency spectrum in Finns and NFEs, demonstrating that Finns have proportionally more deleterious rare and low-frequency variants.
(A) Ratio of the number of LoF, missense and synonymous variants found in Finns versus NFEs with the ratios for LoF variants highlighted in red text and the ratios for synonymous variants in black. The p-values represent the probabilities of the excess of variable sites in Finns occurring by chance. The p-values in red represent the probabilities for the LoF variants, the p-values in blue represent the probabilities for the missense variants and the p-values in black represent the probabilities for the synonymous variants. (B) Percentage of variants that are LoF across the allele frequency spectrum, with the numbers indicating the percentage of LoF variants in Finns versus NFEs. The p-values represent the p-values from the hypergeometric test of whether the ratio of LoF variants differ from the ratio of synonymous variants in Finns compared to NFEs.
Figure 2Study design figure for the project.
The analysis was performed from an initial set of exome sequences from Finns and NFEs, as well as the selection and survey of the 83 LoF variants across 60 quantitative traits and 13 disease categories.
List of top association results from the discovery dataset with p<2×10−4.
| Discovery | Replication | Combined (Discovery + Replication) | ||||||||||||||||
| Trait | Gene | Chr | Position | Ref | Alt | Allele Freq | N | Beta | SE | P-value | N | Beta | SE | P-value | N | Beta | SE | P-value |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Vitamin-B12 |
| 19 | 49206674 | G | A | 0.62 | 6087 | 0.199 | 0.019 | 3.68×10−26 | ||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
| GCSF |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.272 | 0.055 | 6.98×10−7 | 2188 | −0.037 | 0.105 | 0.73 | 8848 | 0.206 | 0.049 | 2.2×10−5 |
| IL4 |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.258 | 0.055 | 2.48×10−6 | 2188 | 0.035 | 0.105 | 0.74 | 8846 | 0.209 | 0.061 | 5.91×10−4 |
| IFN-gamma |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.255 | 0.055 | 3.24×10−6 | 2188 | 0.060 | 0.105 | 0.57 | 8841 | 0.051 | 0.016 | 1.45×10−3 |
| IL6 |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.251 | 0.055 | 4.58×10−6 | 2188 | 0.073 | 0.105 | 0.49 | 8848 | 0.213 | 0.049 | 1.16×10−5 |
| Endothelin1 |
| 19 | 49206674 | G | A | 0.62 | 6146 | 0.086 | 0.019 | 5.63×10−6 | ||||||||
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
| IL12 |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.245 | 0.055 | 8.13×10−6 | 2188 | 0.042 | 0.105 | 0.69 | 8848 | 0.201 | 0.049 | 3.45×10−5 |
| IL17 |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.241 | 0.055 | 1.12×10−5 | 2188 | −0.136 | 0.105 | 0.20 | 8848 | 0.160 | 0.049 | 9.91×10−4 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| IFN-gamma |
| 11 | 73978243 | G | A | 0.32 | 6655 | 0.080 | 0.019 | 1.70×10−5 | 2186 | −0.036 | 0.032 | 0.27 | 8841 | 0.051 | 0.016 | 1.45×10−3 |
| IL17 |
| 11 | 73978243 | G | A | 0.32 | 6655 | 0.080 | 0.019 | 1.72×10−5 | 2186 | 0.015 | 0.033 | 0.63 | 8841 | 0.064 | 0.016 | 7.27×10−5 |
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
| TNF-beta |
| 8 | 38839282 | GAA | G | 0.013 | 6669 | −0.292 | 0.069 | 2.68×10−5 | 2188 | 0.378 | 0.141 | 0.73 | 8857 | −0.172 | 0.062 | 5.54×10−3 |
| IL4 |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.186 | 0.045 | 3.55×10−5 | 2189 | 0.141 | 0.086 | 0.10 | 8862 | 0.177 | 0.040 | 9.71×10−6 |
| FGF |
| 11 | 73978243 | G | A | 0.32 | 6655 | 0.076 | 0.019 | 4.58×10−5 | 2186 | 0.006 | 0.032 | 0.85 | 8841 | 0.059 | 0.016 | 2.81×10−4 |
| TNF-beta |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.184 | 0.045 | 4.65×10−5 | 2189 | 0.091 | 0.081 | 0.26 | 8862 | 0.164 | 0.039 | 3.26×10−5 |
| TNF-beta |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.223 | 0.055 | 4.69×10−5 | 2188 | −0.024 | 0.099 | 0.81 | 8848 | 0.170 | 0.048 | 3.86×10−4 |
| IFNG |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.183 | 0.045 | 4.81×10−5 | 2189 | 0.037 | 0.086 | 0.67 | 8862 | 0.152 | 0.040 | 1.45×10−4 |
| SDF1 |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.221 | 0.055 | 5.69×10−5 | 2188 | 0.057 | 0.105 | 0.59 | 8848 | 0.186 | 0.049 | 1.31×10−4 |
| TNF-beta |
| 11 | 73978243 | G | A | 0.32 | 6655 | 0.075 | 0.019 | 5.94×10−5 | 2186 | 0.007 | 0.031 | 0.81 | 8841 | 0.058 | 0.016 | 2.66×10−4 |
| FGF |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.220 | 0.055 | 5.98×10−5 | 2188 | 0.033 | 0.105 | 0.75 | 8848 | 0.180 | 0.049 | 2.11×10−4 |
| IL18 |
| 8 | 144942134 | CCTTT | C | 0.023 | 6677 | −0.232 | 0.058 | 6.20×10−5 | 2160 | −0.240 | 0.102 | 0.19 | 8837 | −0.234 | 0.050 | 3.42×10−6 |
| PDGF |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.181 | 0.045 | 6.20×10−5 | 2189 | −0.011 | 0.086 | 0.90 | 8862 | 0.139 | 0.040 | 4.84×10−4 |
| SDF1 |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.178 | 0.045 | 7.62×10−5 | 2189 | −0.118 | 0.085 | 0.17 | 8862 | 0.114 | 0.040 | 4.12×10−3 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| VEGF |
| 8 | 38839282 | GAA | G | 0.013 | 6669 | −0.274 | 0.069 | 7.86×10−5 | 2188 | −0.010 | 0.149 | 0.95 | 8857 | −0.227 | 0.063 | 3.10×10−4 |
| IL17 |
| 13 | 100518634 | C | T | 0.035 | 6671 | 0.185 | 0.047 | 8.77×10−5 | ||||||||
| IL10 |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.214 | 0.055 | 9.33×10−5 | 2188 | 0.134 | 0.105 | 0.20 | 8848 | 0.197 | 0.049 | 5.08×10−5 |
| IL6 |
| 11 | 73978243 | G | A | 0.32 | 6655 | 0.072 | 0.019 | 9.70×10−5 | 2186 | −0.016 | 0.033 | 0.62 | 8841 | 0.051 | 0.016 | 1.73×10−3 |
| IL17 |
| 8 | 38839282 | GAA | G | 0.013 | 6669 | −0.270 | 0.069 | 1.03×10−4 | 2188 | −0.008 | 0.149 | 0.96 | 8857 | −0.223 | 0.063 | 3.97×10−4 |
| IFN-gamma |
| 7 | 75401263 | C | T | 0.013 | 6663 | −0.307 | 0.080 | 1.15×10−4 | 2192 | −0.183 | 0.133 | 0.17 | 8855 | −0.274 | 0.068 | 5.94×10−5 |
| IL12 |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.174 | 0.045 | 1.20×10−4 | 2189 | 0.010 | 0.086 | 0.91 | 8862 | 0.138 | 0.040 | 5.41×10−4 |
| IL6 |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.173 | 0.045 | 1.23×10−4 | 2189 | 0.024 | 0.086 | 0.78 | 8862 | 0.141 | 0.040 | 4.15×10−4 |
| IL17 |
| 5 | 160113099 | G | A | 0.034 | 6673 | 0.173 | 0.045 | 1.24×10−4 | 2189 | −0.003 | 0.086 | 0.97 | 8862 | 0.135 | 0.040 | 7.13×10−4 |
| PDGF |
| 11 | 73978243 | G | A | 0.32 | 6655 | 0.071 | 0.019 | 1.31×10−4 | 2186 | −0.022 | 0.032 | 0.50 | 8841 | 0.048 | 0.016 | 2.86×10−3 |
| GCSF |
| 13 | 100518634 | C | T | 0.035 | 6671 | 0.180 | 0.047 | 1.41×10−4 | ||||||||
| PDGF |
| 16 | 84495318 | A | C | 0.023 | 6660 | 0.209 | 0.055 | 1.43×10−4 | 2188 | 0.048 | 0.105 | 0.65 | 8848 | 0.174 | 0.049 | 3.40×10−4 |
| GCSF |
| 17 | 60469326 | C | T | 0.043 | 6606 | 0.157 | 0.042 | 1.86×10−4 | 2192 | −0.157 | 0.079 | 0.046 | 8798 | 0.087 | 0.037 | 0.018 |
The associations that are experiment-wide significant highlighted in bold. The discovery dataset is a subset from FINRISK and the replication datasets are from the Health2000 and Young Finns studies.
* The 2 splice variants in LPA were combined to obtain a composite LPA variant for the association analyses.
Figure 3Forest plot for the LPA splice variants with cardiovascular diseases.
The cardiovascular diseases were defined as coronary heart disease (CHD), ischemic heart disease (IHD), heart failure (HF) or myocardial infarction (MI) from the various cohorts.