| Literature DB >> 25063994 |
Doug Speed1, Terence J O'Brien2, Aarno Palotie3, Kirill Shkura4, Anthony G Marson5, David J Balding6, Michael R Johnson7.
Abstract
Epilepsy is a disease with substantial missing heritability; despite its high genetic component, genetic association studies have had limited success detecting common variants which influence susceptibility. In this paper, we reassess the role of common variants on epilepsy using extensions of heritability analysis. Our data set consists of 1258 UK patients with epilepsy, of which 958 have focal epilepsy, and 5129 population control subjects, with genotypes recorded for over 4 million common single nucleotide polymorphisms. Firstly, we show that on the liability scale, common variants collectively explain at least 26% (standard deviation 5%) of phenotypic variation for all epilepsy and 27% (standard deviation 5%) for focal epilepsy. Secondly we provide a new method for estimating the number of causal variants for complex traits; when applied to epilepsy, our most optimistic estimate suggests that at least 400 variants influence disease susceptibility, with potentially many thousands. Thirdly, we use bivariate analysis to assess how similar the genetic architecture of focal epilepsy is to that of non-focal epilepsy; we demonstrate both significant differences (P = 0.004) and significant similarities (P = 0.01) between the two subtypes, indicating that although the clinical definition of focal epilepsy does identify a genetically distinct epilepsy subtype, there is also scope to improve the classification of epilepsy by incorporating genotypic information. Lastly, we investigate the potential value in using genetic data to diagnose epilepsy following a single epileptic seizure; we find that a prediction model explaining 10% of phenotypic variation could have clinical utility for deciding which single-seizure individuals are likely to benefit from immediate anti-epileptic drug therapy.Entities:
Keywords: association studies; complex trait prediction; epilepsy; heritability analysis
Mesh:
Year: 2014 PMID: 25063994 PMCID: PMC4163034 DOI: 10.1093/brain/awu206
Source DB: PubMed Journal: Brain ISSN: 0006-8950 Impact factor: 13.501
Figure 1Manhattan plot for single SNP tests of association. Points report -log10 P-values from single-SNP tests of association for the phenotype all epilepsy (1258 cases, 5129 controls). Red/blue points correspond to genotyped SNPs, grey to imputed. The conventional threshold for genome-wide significance (5 × 10−8) is marked by a horizontal dashed line. Manhattan plots for the phenotypes focal, non-focal and generalized epilepsy are provided in Supplementary Fig. 3.
Estimates of variance explained by common SNPs
| Phenotype | Population | Sibling | Total liability | Genotyped SNPs | Imputed SNPs | |||
|---|---|---|---|---|---|---|---|---|
| Prevalence | Relative risk | Heritability, | ||||||
| All epilepsy (1258 cases) | 0.005 | 3.3 [2.5–4.3] | 32 [24–41] | 31 (6) | 23 (4) | 42 (6) | 31 (5) | 26 |
| Focal (958 cases) | 0.003 | 2.6 [1.2–5.3] | 23 [5–43] | 33 (6) | 27 (5) | 41 (7) | 33 (5) | 27 |
| Non-focal (300 cases) | 0.002 | 4.7 [2.1–10.8] | 36 [15–59] | 21 (7) | 38 (12) | 24 (8) | 46 (14) | 44 |
For each phenotypes, we report estimates of , the percentage of variance explained on the observed scale (cases 1, controls 0), and , the corresponding estimate on the liability scale (standard deviations provided in parentheses). is obtained from by subtracting the estimated inflation due to population stratification and genotyping errors (see text). For comparison, , the total liability heritability, is estimated for each phenotype based on the prevalence and reported estimates of sibling relative risk (95% CI shown in square brackets).
Figure 2Estimating the number of causal variants. We suppose heritability is distributed over causal variants either equally (black), uniformly (red), exponentially (green) or χ2 (blue). (A) As the number of causal variants increases (x-axis), the average heritability of each variant decreases, and the probability of single-SNP analysis finding no significant associations increases (y-axis). For each distribution, our point estimates (lower bounds) for the number of causal variants are the numbers required for this probability to exceed 0.5 (0.05), and are marked by vertical lines. Based on the point estimates, the histograms in B show for each distribution how much heritability each causal variant explains. The values above bars report the proportion of variance explained by causal variants within each tranche.
Expected success of single-SNP analyses
| Expected number of associations | 0.1 | 0.6 | 3.9 | 147 | 710 |
| % of variance explained | 0.0 | 0.0 | 0.1 | 4.4 | 21.4 |
| Expected number of associations | 0.1 | 0.7 | 4.6 | 142 | 570 |
| % of variance explained | 0.0 | 0.0 | 0.2 | 4.9 | 17.8 |
| Expected number of associations | 0.1 | 0.9 | 5.7 | 103 | 401 |
| % of variance explained | 0.0 | 0.0 | 0.3 | 4.3 | 12.2 |
| Expected number of associations | 0.1 | 0.9 | 5.9 | 96 | 360 |
| % of variance explained | 0.0 | 0.1 | 0.3 | 4.1 | 11.1 |
For each assumed distribution of heritability across causal variants, using the corresponding point estimates for the number of causal variants, we estimate the expected number of causal variants detected and the total proportion of liability variation these explain, for different total sample size n. We assume the case-control ratio remains fixed at 1258:5129.
Figure 3Performance of prediction models for single-seizure patients. A and B illustrate the two extreme cases for the distribution of liabilities for single-seizure individuals who do not have epilepsy. In the ‘Best Case Scenario,’ their liability distribution matches that of population controls, while in the ‘Worst Case Scenario,’ their liabilities lie just below the case/control threshold. C and D show, again for the Best and Worst Case Scenarios, how the receiver operating curve depends on the proportion of variance explained by the prediction model (varied between 5% and 30%, indicated by line colour); the AUC (area under receiver operating curve) for each line is provided in parentheses.