| Literature DB >> 30679432 |
Mitja I Kurki1,2,3, Elmo Saarentaus3, Olli Pietiläinen2,4, Padhraig Gormley1,2, Dennis Lal1,2, Sini Kerminen3, Minna Torniainen-Holm3,5, Eija Hämäläinen3, Elisa Rahikkala6,7,8, Riikka Keski-Filppula6,7,8, Merja Rauhala9, Satu Korpi-Heikkilä9, Jonna Komulainen-Ebrahim10, Heli Helander10, Päivi Vieira10, Minna Männikkö11,12, Markku Peltonen5, Aki S Havulinna3,5, Veikko Salomaa5, Matti Pirinen3, Jaana Suvisaari5, Jukka S Moilanen6,7,8, Jarmo Körkkö9, Outi Kuismin3,6,7,8, Mark J Daly1,2,3,13,14, Aarno Palotie15,16,17,18,19.
Abstract
The contribution of de novo variants in severe intellectual disability (ID) has been extensively studied whereas the genetics of mild ID has been less characterized. To elucidate the genetics of milder ID we studied 442 ID patients enriched for mild ID (>50%) from a population isolate of Finland. Using exome sequencing, we show that rare damaging variants in known ID genes are observed significantly more often in severe (27%) than in mild ID (13%) patients. We further observe a significant enrichment of functional variants in genes not yet associated with ID (OR: 2.1). We show that a common variant polygenic risk significantly contributes to ID. The heritability explained by polygenic risk score is the highest for educational attainment (EDU) in mild ID (2.2%) but lower for more severe ID (0.6%). Finally, we identify a Finland enriched homozygote variant in the CRADD ID associated gene.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30679432 PMCID: PMC6345990 DOI: 10.1038/s41467-018-08262-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1ID prevalence estimates in different municipalities in Finland. The primary NFID collection municipalities of Northern Ostrobothnia, Kainuu and Lapland are outlined in solid black. The approximate boundary between early and late settlements is shown with a dashed line
Patient clinical characteristics in the NFID sample
| ID severity | Mild | Moderate | Profound | Undefined |
|---|---|---|---|---|
| Total | 259 | 126 | 72 | 41 |
| Female % | 37% | 41% | 36% | 44% |
| Epilepsy | 28 (11%) | 19 (15%) | 37 (51%) | 9 (22%) |
| ASD | 22 (8%) | 17 (13%) | 25 (35%) | 3 (7%) |
| Behavioral impairment | 38 (15%) | 28 (22%) | 15 (21%) | 0 (0%) |
| Psychotic disorder | 24 (9%) | 16 (13%) | 8 (11%) | 0 (0%) |
| Dysmorphism | 85 (33%) | 62 (49%) | 41 (57%) | 13 (32%) |
ASD autism spectrum disorder
Cohorts used in the analyses
| Cases | Controls | ||||||
|---|---|---|---|---|---|---|---|
| Analysis cohort | Analyses |
| ID | ASD | SCZ | EPI |
|
| NFID (primary cohort) | EXOME | 442 | 442 | 62 | 47 | 92 | 2206b |
| PRS | 439 | 439 | 62 | 47 | 86 | 2195b (matched) | |
| CNV | 433 | 433 | 57 | 47 | 84 | 1100b | |
| NFNDD (Northern Finland NeuroDevelopmental Disorder) | EXOME association analysis | 314 | 17 | 40a | 239 | 26 | 1548 |
| SFNDD (Southern Finland NeuroDevelopmental Disorder) | EXOME association analysis | 322 | 14 | 73a | 211 | 33 | 1594 |
| SISU controls | EXOME variant filtering | — | — | — | — | — | 5922 |
Sample sizes are the numbers used in the analysis after quality control. The number of individuals in the analyses are after QC and after related individuals have been removed
ASD autism spectrum disorder, SCZ schizophrenia, EPI epilepsy, PRS Polygenic Risk Score, CNV copy number variant
aComorbidities were not available from ASD cohorts
bNone of the controls from the FINRISK study have any NDD (intellectual disability, autism, schizophrenia) or epilepsy and all controls are genetically matched to cases
Fig. 2Enrichment of likely pathogenic and synonymous variants in known ID genes. Enrichment of variants in 818 known ID genes (Methods) in NFID cases compared to genetically matched controls. Heterozygotes were counted only for those genes for which a dominant inheritance mode is reported. The number of carriers and total individuals are given on the left and in parenthesis the proportion of carriers. Circles indicate the odds ratio (OR) and lines indicate 95% confidence interval of the OR. The synonymous variant identification comparison was performed to assess if possible differences in the variant identification rate due to batch/capture differences were adequately controlled. PTV protein truncating variant, CADD Combined Annotation-Dependent Depletion pathogenicity score, pLI probability of loss of function intolerance. Source data are provided as a Source Data file
Fig. 3Enrichment of rare variants in genes not previously associated with NDDs. Variants not observed in GnomAD or Finnish controls in novel ID genes in cases compared to genetically matched controls (see Methods). Rate in each variant category is first estimated for all novel genes and then after subsetting to only novel high pLI genes. All missense variants are predicted to be deleterious (MPC > 2, see Methods). On the left the number of carriers and total individuals are given and in parenthesis the proportion of carriers. Circles indicate the odds ratio (OR) and lines indicate the 95% confidence interval of the OR. The synonymous variant identification comparison was performed to assess if possible differences in the variant identification rate due to batches/capture differences were adequately controlled. PTV protein truncating variant, CADD Combined Annotation-Dependent Depletion pathogenicity score, pLI probability of loss of function intolerance. Source data are provided as a Source Data file
Fig. 4Distribution of different deletion categories. a Deletion categories in ID patients showed enrichment for deletions (>100 kb) in general, and specifically in deletions covering syndrome regions (as defined by the DECIPHER database), deletions that are located in an ID-associated gene region (see above) or CNVs deleting a gene intolerant of protein truncating mutations (pLI > 0.95). b Size distribution of Deletions by size bins, showing enrichment at all sizes > 500 kb . c Size distribution of Duplications by size bins > 500 kb, showing an enrichment for sizes > 1 Mb. Source data are provided as a Source Data file
Fig. 5Comparison of the total rate of different classes of variants. Comparison was performed in cases vs. genetically matched controls and in mild vs. more severe ID individuals for which both exome and CNV data was available. On the left the number of carriers and total individuals are given and in parenthesis the proportion of carriers. Circles indicate odds ratio and lines indicate 95% confidence intervals of the odds ratio estimate. a Total genetic diagnostic rate. b Variant classes in “Likely pathogenic” variant categories. c Comparison of the rate of identifying different classes of variants in mild vs. severe (moderate and profound ID combined) patients. d Comparison of the rate of variant types in “Likely pathogenic” category between mild and more severe forms of ID (moderate, severe and profound ID combined). Constrained missense (MPC > 2) variants were analyzed in all genes instead of only high pLI genes in C and D as MPC score incorporates regional missense constraint. Source data are provided as a Source Data file
Fig. 6Regional distributions of PRSs within Finland and between cases and contols. a Locally weighted educational attainment PRS distribution in Finnish population controls whose parents birthplace is within 100 km of each other. b Educational attainment PRS in cases and all population controls and genetically matched population controls. c Locally weighted schizophrenia PRS distribution in Finnish population controls whose parents birthplace is within 100 km of each other. d Schizophrenia PRS in cases and all population controls and genetically matched population controls. e Locally weighted IQ PRS distribution in Finnish population controls whose parents birthplace is within 100 km of each other. f IQ PRS in cases and all population controls and genetically matched population controls. Error bars indicate 95% confidence intervals around mean. Y-axis is in SD units of PRS standardized to all population controls. Source data are provided as a Source Data file
Homozygous Finnish enriched variants observed ≥2 times across NFID and the Southern and Northern Finnish NDD cases and not observed in any controls as homozygous
| Variant | Gene | Previous evidence | GnomAD Finnish AF | GnomAD pop. Max AF | NFID AF | RAFT meta | NFID case homs | NFID RAFT | Population NDD case homs | Population NDD RAFT |
|---|---|---|---|---|---|---|---|---|---|---|
| 12:94243956 G:A (mis) |
| AR Lissencephaly, ID[ | 6.01E−3 | 9.15E−4 | 7.83E−3 | 5.01E−8 a | 3 | 1.86E−6 | 1 | 9.30E−3 |
| 7:81374424 G:C (mis) |
| AR hearing loss (OMIM) | 1.12E−3 | 5.48E−4 | 2.52E−3 | 1.34E−5 a | 1 | 3.20E−3 | 1 | 2.54E−3 |
| 12:15784582 T:C (mis) |
| AR deafness (OMIM) Cognition defects in mice[ | 9.03E−3 | 1.30E−3 | 6.89E−3 | 1.28E−4 | 2 | 1.28E−4 | 0 | NA |
| 1:220236134 C:T (mis) |
| - | 1.07E−2 | 4.83E−3 | 6.89E−3 | 1.34E−4 | 2 | 1.34E−4 | 0 | NA |
| 7:1520077 T:C (mis) |
| AR ID[ | 1.30E−2 | 3.05E−3 | 1.71E−2 | 1.95E−4 | 3 | 1.95E−4 | 0 | NA |
| 2:95753239 A:G (mis) |
| - | 9.87E−3 | 4.56E−3 | 8.29E−3 | 2.88E−4 | 2 | 2.88E−4 | 0 | NA |
| 10:123844296 C:A (mis) |
| - | 1.32E−2 | 1.10E−3 | 1.14E−2 | 1.05E−3 | 2 | 1.05E−3 | 0 | NA |
| 1:155028692 C:T (mis) |
| - | 8.69E−3 | 2.37E−3 | 1.18E−2 | 2.11E−3 | 1 | 4.84E−2 | 1 | 3.58E−2 |
| 18:14542688 G:A (mis) |
| - | 1.86E−2 | 6.26E−3 | 1.98E−2 | 2.11E−3 | 2 | 1.01E−2 | 1 | 1.65E−1 |
| 21:19651329 G:C (mis) |
| Enterokinase deficiency (OMIM) | 1.73E−2 | 6.31E−3 | 1.93E−2 | 2.66E−3 | 1 | 1.42E−1 | 2 | 9.85E−3 |
| 15:60789800 T:C (mis) |
| AD ID[ | 1.02E−2 | 9.14E−4 | 1.59E−2 | 4.03E−3 | 1 | 9.35E−2 | 1 | 3.38E−2 |
| 11:6023849 C:T (mis) |
| - | 1.69E−2 | 1.82E−3 | 1.61E−2 | 4.45E−3 | 2 | 4.45E−3 | 0 | NA |
| 11:3681309 G:A (mis) |
| - | 1.64E−2 | 3.68E−3 | 1.53E−2 | 5.67E−3 | 1 | 8.43E−2 | 1 | 5.76E−2 |
| 19:56424477 TC:T (frameshift) |
| - | 1.65E−2 | 1.28E−3 | 1.40E−2 | 6.07E−3 | 1 | 7.44E−2 | 1 | 8.54E−2 |
| 8:17612739 G:C (mis) |
| - | 1.46E−2 | 2.01E−3 | 1.54E−2 | 6.74E−3 | 1 | 9.22E−2 | 1 | 7.45E−2 |
| 1:183520048 A:T (mis) |
| NMD-components linked to ID[ | 2.49E−2 | 1.10E−3 | 2.62E−2 | 3.85E−2 | 1 | 2.85E−1 | 1 | 1.58E−1 |
| X:23410887 C:T (mis) |
| x-linked ID/AUTISM[ | 2.43E−4 | 2.51E−5 | 2.27E−4 | NA | 1b | NA | 1b | NAb |
AR autosomal recessive, AD autosomal dominant, ID intellectual disability, AF allele frequency, mis missense
aSignificant after multiple testing correction
bRAFT statistic not valid for X-chromosome. Both carriers are hemizygote males
Fig. 7Estimate of heritability explained by different variant categories on liability scale. a Variance explained by genetic categories in all ID cases. b Variance explained delineated by ID severity. Variance explained was estimated by Nagelkerke r2 while controlling for the first four PCs. 95% Confidence intervals of variance explained were estimated by 5000 bootstrap samples. Source data are provided as a Source Data file