| Literature DB >> 29649218 |
Jean Monlong1,2, Simon L Girard1,3,4, Caroline Meloche4, Maxime Cadieux-Dion4,5, Danielle M Andrade6, Ron G Lafreniere4, Micheline Gravel4, Dan Spiegelman7, Alexandre Dionne-Laporte7, Cyrus Boelman8, Fadi F Hamdan9, Jacques L Michaud9, Guy Rouleau7, Berge A Minassian10, Guillaume Bourque1,2,11, Patrick Cossette4.
Abstract
Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variants (CNVs) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in the gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases.Entities:
Mesh:
Year: 2018 PMID: 29649218 PMCID: PMC5978987 DOI: 10.1371/journal.pgen.1007285
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1PopSV approach.
a) Technical bias across the genome remains after stringent correction and filtering. The distribution of the bin inter-sample mean coverage in the epilepsy cohort (red) is compared to null distributions (blue: bins shuffled, green: simulated normal distribution). b) PopSV approach. First the genome is fragmented and reads mapping in each bin are counted for each sample and GC corrected (1). Next, coverage of the sample is normalized (2) and each bin is tested by computing a Z-score (3), estimating p-values (4) and identifying abnormal regions (5). c) Number and proportion of calls from a twin that was replicated in the other monozygotic twin.
Fig 2CNVs in the epilepsy and control cohorts.
a) Regions with a CNV in each epilepsy patient. b) Each CNV in the CNV catalog of the epilepsy and control cohorts was annotated with its maximum frequency in five CNV databases. c) Enrichment in exonic sequence for all CNVs (left) and rare CNVs (right), larger than 50 Kbp (top) or smaller than 50 Kbp (bottom). The fold-enrichment (y-axis) represents how many CNVs overlap coding sequences compared to control regions randomly distributed in the genome.
Real-Time PCR validation rates of PopSV calls.
| Region | Validation rate | |
|---|---|---|
| Total | 151 | 0.907 |
| CNV type | ||
| Frequency in databases | ||
| Carrier in CENet cohorts | ||
| Size (Kbp) |
Number and proportion of regions validated for CNVs of different types, sizes and frequencies.
Fig 3CNVs and epilepsy genes.
a) Number of rare CNVs in or close to exons of protein-coding genes (top) or epilepsy genes (bottom), in the epilepsy cohort. b) Number of epilepsy genes hit by exonic deletions in the epilepsy cohort and never seen in the public and internal databases (dotted line), compared to the expected distribution in all genes and size-matched genes (histograms). c) Rare non-coding CNVs in functional regions near epilepsy genes. The graph shows the cumulative number of individuals (y-axis) with a rare non-coding CNV located at X Kbp or less (x-axis) from the exonic sequence of a known epilepsy gene. We used CNVs overlapping regions functionally associated with the epilepsy gene (eQTL or promoter-associated DNase site).
Pathogenic profiles in known epilepsy genes.
| Patient | Epilepsy type | Syndrome | Copy number | Chr. | CNV start | CNV end | Epilepsy gene with exon disrupted | Taqman probe | Discovery | Replication | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Patients | Controls | Patients | Controls | |||||||||
| CNET0108 | Generalized | Eyelid myoclonia epilepsy with absence | 1 | 1 | 44195001 | 44460000 | Hs05759463_cn | 1 DEL | 0 | 0 | - | |
| CNET0159 | Generalized | Eyelid myoclonia epilepsy with absence | 1 | 8 | 141925001 | 142010000 | Hs06202928_cn | 1 DEL | 0 | 0 | - | |
| CNET0093 | Generalized | Juvenile onset; GTCs, Abs, Comp Partial | 1 | 10 | 95525001 | 95545000 | Hs02682696_cn | 1 DEL | 0 | 0 | - | |
| CNET0140 | Generalized | Idiopathic generalized epilepsies | 1 | 13 | 35750001 | 35785000 | Hs05286691_cn | 1 DEL | 0 | 0 | - | |
| CNET0144 | Generalized | Eyelid myoclonia epilepsy with absence | 1 | 15 | 22745001 | 23275000 | Hs04452887_cn | 3 DEL | 2 DEL | 4 DEL (2DUP) | 1 DEL (5 DUP) | |
| CNET0009 | Generalized | Idiopathic generalized epilepsies | 1 | 15 | 30910001 | 32445000 | Hs03909657_cn | 1 DEL | 0 | 3 DEL | (1 DUP) | |
| CNET0119 | Generalized | Eyelid myoclonia epilepsy with absence | 1 | 15 | 93300001 | 93515000 | Hs05385106_cn | 1 DEL | 0 | 0 | - | |
| CNET0143 | Generalized | Childhood absence epilepsy | 1 | 93489776 | 93494317 | Hs026436998_cn | 1 DEL | 0 | 0 | - | ||
| CNET0130 | Generalized | Eyelid myoclonia epilepsy with absence | 3 | 93445001 | 93450000 | Hs01379802_cn | 1 DUP | 0 | 0 | - | ||
| CNET0074 | Focal | Frontal Lobe Epilepsy | 1 | 22 | 32125001 | 32255000 | Hs01632214_cn | 1 DEL | 0 | 0 | - | |
The 198 epileptic patients and 301 controls represent the discovery set. The replication set contains 325 epileptic patients and 380 controls. Variants that were not tested are marked with “-”.
Recurrent CNVs with a pathogenic profile.
| Patient | Epilepsy type | Syndrome | Copy number | Chr. | CNV start | CNV end | Gene with exon disrupted | Taqman probe | Discovery | Replication | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Patients | Controls | Patients | Controls | |||||||||
| CNET0184 | Generalized | Lennox-Gastaut syndrome | 3 | 2 | 32625001 | 33335000 | Hs03387774_cn | 2 DUP | 0 | 2 DUP | 0 | |
| CNET0097 | Generalized | Eyelid myoclonia epilepsy with absence | 3 | |||||||||
| CNET0020 | Generalized | Juvenile myoclonic epilepsy | 1 | 12 | 7995001 | 8125000 | Hs04406005_cn | 2 DEL | 2 DEL | 2 DEL | 2 DEL | |
| CNET0198 | Focal | Frontal lobe epilepsy | 1 | |||||||||
| CNET0012 | Generalized | Idiopathic generalized epilepsy | 3 | 15 | 90845001 | 90955000 | Hs03895490_cn | 2 DUP | 0 | (1 DEL) | 0 | |
| CNET0167 | Generalized | Childhood absence epilepsy | 3 | |||||||||
| CNET0063 | Generalized | Idiopathic generalized epilepsies | 3 | 16 | 15460001 | 16290000 | Hs05396556_cn | 1 DUP + 1 DEL | 0 | 1 DEL | 1 DUP | |
| CNET0037 | Generalized | Idiopathic generalized epilepsies | 1 | |||||||||
The 198 epileptic patients and 301 controls represent the discovery set. The replication set contains 325 epileptic patients and 380 controls.
Fig 4Exonic CNVs in CHD2 detected by PopSV.
The ‘CNV’ panel shows the exonic deletions (blue) and duplications (red) called by PopSV. The ‘Coverage’ panel shows the read depth signal in the affected individuals (colored points/lines) and the coverage distribution in the reference samples (boxplot and grey point).