| Literature DB >> 28145424 |
Himanshu Chheda1, Priit Palta1,2, Matti Pirinen1, Shane McCarthy3, Klaudia Walter3, Seppo Koskinen4, Veikko Salomaa4, Mark Daly5,6,7, Richard Durbin3, Aarno Palotie1,5,6,8, Tero Aittokallio1,9, Samuli Ripatti1,3,10.
Abstract
Isolated populations with enrichment of variants due to recent population bottlenecks provide a powerful resource for identifying disease-associated genetic variants and genes. As a model of an isolate population, we sequenced the genomes of 1463 Finnish individuals as part of the Sequencing Initiative Suomi (SISu) Project. We compared the genomic profiles of the 1463 Finns to a sample of 1463 British individuals that were sequenced in parallel as part of the UK10K Project. Whereas there were no major differences in the allele frequency of common variants, a significant depletion of variants in the rare frequency spectrum was observed in Finns when comparing the two populations. On the other hand, we observed >2.1 million variants that were twice as frequent among Finns compared with Britons and 800 000 variants that were more than 10 times more frequent in Finns. Furthermore, in Finns we observed a relative proportional enrichment of variants in the minor allele frequency range between 2 and 5% (P<2.2 × 10-16). When stratified by their functional annotations, loss-of-function variants showed the highest proportional enrichment in Finns (P=0.0291). In the non-coding part of the genome, variants in conserved regions (P=0.002) and promoters (P=0.01) were also significantly enriched in the Finnish samples. These functional categories represent the highest a priori power for downstream association studies of rare variants using population isolates.Entities:
Mesh:
Year: 2017 PMID: 28145424 PMCID: PMC5346294 DOI: 10.1038/ejhg.2016.205
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Summary of SNVs studied in Finnish and British samples
| 5 copies–0.5% | 1 629 869 | 2 313 870 | 72.6 | 59.9 |
| 0.5–2% | 2 020 773 | 2 119 423 | 84.9 | 94.3 |
| 2–5% | 1 388 186 | 1 325 135 | 95.2 | 99.3 |
| >5% | 5 418 973 | 5 413 803 | 99.99 | 99.99 |
Figure 1(a) Allele frequency spectrum of variants across the whole genome in Finns compared with the Britons. The black line represents the ratio of the number of variants observed in Finns to those in Britons. (b) The number of variants seen in each population across the genome in different MAF bins. The lines in blue and red represent the number of variants for each bin observed in Finns and Britons, respectively.
Figure 2Variants shared between the two populations. The percentage of variants that are shared between the Finns and the Britons across different allele frequency bins. The histograms represent the allele frequencies of the shared variants in the other population for the MAF bin 2–5%.
Figure 3Enrichment of variants across various categories. (a) Forest plot showing the enrichment across various functional categories for the variants in the minor allele frequency range 2–5%, where we observe consistent enrichment across most categories. The sizes of the boxes correspond to the size of each category and the black horizontal lines represent the 95% confidence intervals. Proportional enrichment is calculated compared with Britons. (b) Proportional enrichment of LoFs in Finns compared with Britons. The red line represents the ratio of the number of LoF variants in Finns compared to Britons. The black line shows the baseline enrichment observed across the whole genome. (c) Proportional enrichment of the number of variants in the conserved regions in Finns compared with Britons. The red line represents the variants common between conserved regions and the coding regions. The blue line represents the variants in the conserved regions but not in the coding regions. The black line shows the baseline enrichment observed across the whole genome.
MAF-enrichment of variants in Finns and Britons
| 2–5x | 722 587 (6.9%) | 177 (7.8%) | 1704 (9.9%) | 3109 (8.7%) | 22 823 (7.7%) | 9179 (7.2%) | 421;220;1 |
| 5–10x | 561 243 (5.4%) | 225 (9.9%) | 2083 (12.2%) | 3013 (8.4%) | 19 385 (6.5%) | 7415 (5.8%) | 73;100;5 |
| 10–50x | 682 275 (6.5%) | 233 (10.3%) | 2360 (13.8%) | 3643 (10.2%) | 23 207 (7.8%) | 9129 (7.2%) | 83;125;10 |
| ≥50x | 142 354 (1.4%) | 38 (1.7%) | 366 (2.1%) | 655 (1.8%) | 4584 (1.5%) | 1888 (1.5%) | 29;23;0 |
| 2–5x | 935 392 (8.4%) | 203 (8.6%) | 1839 (10%) | 3483 (9.1%) | 28 862 (9%) | 11 242 (8.2%) | 725;305;0 |
| 5–10x | 1 319 454 (11.8%) | 405 (17.1%) | 3991 (21.7%) | 6336 (16.5%) | 44 903 (14%) | 17 498 (12.8%) | 135;375;0 |
| 10–50x | 893 522 (8%) | 235 (9.9%) | 2447 (13.3%) | 4005 (10.4%) | 29 368 (9.2%) | 11 557 (8.4%) | 94;331;0 |
| ≥50x | 25 552 (0.2%) | 12 (0.5%) | 58 (0.3%) | 125 (0.3%) | 818 (0.3%) | 382 (0.3%) | 18;4;0 |
This table summarizes the number of variants that are MAF-enriched in both populations across the whole genome and the categories in which there is a significant enrichment. The percentages refer to the proportion of enriched variants in that particular category. The first column describes the fold-change of minor allele frequency between the two population samples. The last column describes the number of variants enriched in the GWAS Catalogue, ClinVar and FDH mutations, respectively.
Figure 4Statistical power gained due to enrichment of a variant in Finns. (a) Plot showing the number of Finnish samples required to detect association if a variant is enriched twofold (blue), fivefold (purple) and 10-fold (yellow) in Finns as compared with Britons. (b) Regression coefficient (beta) desired to achieve a statistical power of 80% at genome-wide significance level as a function of minor allele frequency for a quantitative trait for variants enriched 10-fold in Finns. The red line indicates the betas in Britons and the blue line indicates the betas in Finns. (c) Odds ratio desired to achieve a statistical power of 80% at genome-wide significance level as a function of minor allele frequency for a case–control analysis for variants enriched 10-fold in Finns. The red line indicates the odds ratio in Britons and the blue line indicates the odds ratio in Finns.