| Literature DB >> 28749477 |
Kristof Van Schil1, Sarah Naessens1, Stijn Van de Sompele1, Marjolein Carron1, Alexander Aslanidis2, Caroline Van Cauwenbergh1, Anja Kathrin Mayer3, Mattias Van Heetvelde1, Miriam Bauwens1, Hannah Verdin1, Frauke Coppieters1, Michael E Greenberg4, Marty G Yang4, Marcus Karlstetter2, Thomas Langmann2, Katleen De Preter1, Susanne Kohl3, Timothy J Cherry5,6, Bart P Leroy1,7,8, Elfride De Baere1.
Abstract
PurposePart of the hidden genetic variation in heterogeneous genetic conditions such as inherited retinal diseases (IRDs) can be explained by copy-number variations (CNVs). Here, we explored the genomic landscape of IRD genes listed in RetNet to identify and prioritize those genes susceptible to CNV formation.MethodsRetNet genes underwent an assessment of genomic features and of CNV occurrence in the Database of Genomic Variants and literature. CNVs identified in an IRD cohort were characterized using targeted locus amplification (TLA) on extracted genomic DNA.ResultsExhaustive literature mining revealed 1,345 reported CNVs in 81 different IRD genes. Correlation analysis between rankings of genomic features and CNV occurrence demonstrated the strongest correlation between gene size and CNV occurrence of IRD genes. Moreover, we identified and delineated 30 new CNVs in IRD cases, 13 of which are novel and three of which affect noncoding, putative cis-regulatory regions. Finally, the breakpoints of six complex CNVs were determined using TLA in a hypothesis-neutral manner.ConclusionWe propose a ranking of CNV-prone IRD genes and demonstrate the efficacy of TLA for the characterization of CNVs on extracted DNA. Finally, this IRD-oriented CNV study can serve as a paradigm for other genetically heterogeneous Mendelian diseases with hidden genetic variation.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28749477 PMCID: PMC5787040 DOI: 10.1038/gim.2017.97
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Ranking of genomic landscape of IRD genes according to gene size, number of total associated repeats, Alu repeats, LINE and LTR repeats, and segmental duplications, together with the number of CNVs according to literature and DGV
| 24 | 9,470 | 24 | 9,470 | 0 | 27 | 24 | 9,470 | 0 | 10 | |||||
| 15 | 6,344 | 15 | 6,344 | 2 | 1 | 15 | 6,344 | 515 | 18 | |||||
| 57 | 598 | 0 | 174 | 515 | 18 | 0 | 174 | 75 | 445 | |||||
| 0 | 174 | 4 | 192 | 0 | 6 | 57 | 598 | 0 | 10 | |||||
| 4 | 192 | 3 | 740 | 0 | 117 | 4 | 192 | 1 | 123 | |||||
| 0 | 1945 | 57 | 598 | 0 | 420 | 7 | 897 | 235 | 141 | |||||
| 7 | 897 | 7 | 897 | 0 | 3 | 0 | 59 | 0 | 100 | |||||
| 0 | 723 | 0 | 287 | 1 | 60 | 0 | 1,945 | 13 | 10 | |||||
| 3 | 740 | 0 | 117 | 0 | 3 | 0 | 53 | 0 | 24 | |||||
| 0 | 59 | 0 | 27 | 1 | 44 | 5 | 73 | 0 | 8 | |||||
| 0 | 287 | 1 | 60 | 0 | 6 | 0 | 723 | 0 | 2 | |||||
| 0 | 117 | 0 | 404 | 0 | 11 | 0 | 143 | 0 | 7 | |||||
| 0 | 404 | 0 | 59 | 0 | 2 | 0 | 148 | 0 | 0 | |||||
| 0 | 148 | 515 | 18 | 0 | 6 | 0 | 287 | 7 | 77 | |||||
| 1 | 123 | 235 | 141 | 1 | 4 | 0 | 404 | 15 | 6,344 | |||||
| 7 | 60 | 0 | 113 | 0 | 4 | 3 | 740 | 0 | 2 | |||||
| 5 | 73 | 1 | 44 | 0 | 13 | 0 | 77 | 0 | 24 | |||||
| 1 | 96 | 0 | 69 | 0 | 2 | 1 | 149 | 1 | 8 | |||||
| 0 | 43 | 0 | 11 | 235 | 141 | 0 | 117 | 0 | 3 | |||||
| 0 | 53 | 0 | 53 | 0 | 5 | 6 | 85 | 0 | 43 | |||||
| 0 | 43 | 0 | 420 | 0 | 6 | 0 | 43 | 3 | 21 | |||||
| 0 | 45 | 0 | 1,945 | 0 | 0 | 0 | 45 | 0 | 3 | |||||
| 0 | 40 | 0 | 723 | 1 | 2547 | 0 | 120 | 0 | 4 | |||||
| 30 | 435 | 1 | 45 | 0 | 2 | 7 | 77 | 0 | 55 | |||||
| 1 | 55 | 0 | 53 | 0 | 1 | 0 | 20 | 0 | 19 | |||||
| 0 | 143 | 0 | 5 | 0 | 69 | 0 | 20 | 0 | 67 | |||||
| 7 | 77 | 0 | 3 | 5 | 5 | 7 | 90 | 0 | 420 | |||||
| 0 | 88 | 0 | 13 | 0 | 2 | 0 | 7 | 1 | 163 | |||||
| 0 | 69 | 5 | 73 | 0 | 113 | 2 | 10 | 2 | 171 | |||||
| 0 | 55 | 0 | 4 | 6 | 4 | 1 | 56 | 0 | 145 | |||||
CNV, copy-number variation; DGV, respective number of CNVs according to the Database of Genomic Variants; IRD, inherited retinal diseases; LINE, long interspersed nuclear elements; Lit., respective absolute number of CNVs in patients with IRD according to literature; LTR, long terminal repeat elements.
Overview of CNVs identified in this study
| P1 | ||||||||||||
| P2 | ||||||||||||
| P3 | ||||||||||||
| P4 | Del | Hom | arRP | E1 (NC) | c.-538-?_-448+?del | c.-538-?_-448+?del | SNP chip | TLA | ? | Yes | Ref. | |
| P5 | ||||||||||||
| P5 | ||||||||||||
| P6 | ||||||||||||
| P7 | ||||||||||||
| Del | Het | adOPA | E1-29 (WG) | c.1-?_*3211+ | NA | MLPA | NA | No | PMID: 17306754 | |||
| Del | Hom | arRP | E1 (NC) | c.-395-1216883_-28-54604del | c.-395-1216883_-28-54604del | SNP chip | PCR | chr10: g.56478660_57777934del | No | Ref. | ||
| P10 | ||||||||||||
| P11 | ||||||||||||
| P12 | ||||||||||||
| P13 | ||||||||||||
| P14 | ||||||||||||
| P15 | ||||||||||||
| Dup | Het | arUS | E4-13 (fs) | c.652-23900_2809+1417dupinsTTACTTGATTATTACTG | c.2276G>T, p.(Cys759Phe) (ExAC MAF : 0.078%) | MLPA | TLA | chr1: g.216418510_216562327dupinsTTACTTGATTATTACTG | Yes | Ref. | ||
| Dup | Het | arRP | E4-13 (fs) | c.652-23900_2809+1417dupinsTTACTTGATTATTACTG | c.2276G>T, p.(Cys759Phe) (ExAC MAF : 0.078%) | MLPA | TLA | chr1: g.216418510_216562327dupinsTTACTTGATTATTACTG | No | Ref. | ||
| Dup | Het | arUS | E4-13 (fs) | c.652-23900_2809+1417dupinsTTACTTGATTATTACTG | c.2209C>T, p.(Arg737*) (ExAC MAF : 0.00083%) | MLPA | TLA | chr1: g.216418510_216562327dupinsTTACTTGATTATTACTG | No | Ref. | ||
| Dup | Het | arUS | E4-13 (fs) | c.652-23900_2809+1417dupinsTTACTTGATTATTACTG | c.6587G>C, p.(Ser2196Thr) (ExAC MAF: 1.15%, benign) | MLPA | TLA | chr1: g.216418510_216562327dupinsTTACTTGATTATTACTG | No | Ref. | ||
| Dup | Het | arRP | E4-13 (fs) | c.652-23900_2809+1417dupinsTTACTTGATTATTACTG | c.2276G>T, p.(Cys759Phe) (ExAC MAF : 0.078%) | MLPA | TLA | chr1: g.216418510_216562327dupinsTTACTTGATTATTACTG | No | Ref. | ||
| Del | Hom | arUS | E5-10 (if) | c.785-6636_1840+208del | c.785-6636_1840+208del | MLPA | PCR | chr1: g.216465309_216507632del | No | PMID: 24944099 | ||
| P22 | ||||||||||||
| Del | Het | arUS | E22 (fs) | c.4628- | c.8167C>T, p.(Arg2723*) (ExAC MAF: 0.00082%) | MLPA | NA | ? | No | PMID: 25558175 | ||
| Del | Het | arUS | E22-24 (if) | c.4627+25434_4987+657del | c.14587G>T, p.(Glu4863*) (not present in ExAC) | MLPA | TLA | chr1: g.216259404_216323160del | No | PMID: 25804404 | ||
| Del | Het | arUS | E44 (fs) | c.8682-1666_8845+139del | c.10689T>A, p.(Tyr3563*) (not present in ExAC) | MLPA | PCR | chr1: g.216040210_216042178del | No | PMID: 25404053 | ||
| Del | Het | arUS | E44 (fs) | c.8682-1666_8845+139del | c.921_922insCAGC (ExAC MAF: 0.011%) | MLPA | PCR | chr1: g.216040210_216042178del | No | PMID: 25404053 | ||
| P27 | ||||||||||||
| P28 | ||||||||||||
| P29 |
ad, autosomal dominant; ar, autosomal recessive; B, bestrophinopathy; BMD, Best macular dystrophy; CNV, copy-number variation; CRD, cone rod dystrophy; Del, deletion; Dup, duplication; end, CNV extends until 3′ UTR; fs, frameshift; Het, heterozygous; Hom, homozygous; if, in-frame; LCA, Leber congenital amaurosis; MAF, minor allele frequency; MD, macular dystrophy; MLPA, multiplex ligation-dependent probe amplification; NC, noncoding; nt, nucleotide; OPA, optic atrophy; PCR, polymerase chain reaction; qPCR, quantitative PCR; RP, retinitis pigmentosa; SNP, single-nucleotide polymorphism; TLA, targeted locus amplification; US, Usher syndrome; UTR, untranslated region; WG, whole gene.
This table presents an overview of the different CNVs identified in this study, novel CNVs are indicated in bold. The “Exons” column shows which exons of the gene are affected by the CNV and shows whether the change is in-frame, causes a frameshift, affects the end of the gene including the 3′ UTR, is noncoding, or encompasses the whole gene. Most of the CNVs have been picked up by MLPA (P1, P2, P7, P8, P12, P13, P15–29); two are found using arrEYE (P10, P11), an in-house developed customized microarray (Van Cauwenbergh et al. 2016); three have been identified by SNP chip analysis in consanguineous families (P3, P4, P9); one homozygous deletion (P14) and another apparently homozygous deletion (P5, which later turned out to be two different overlapping heterozygous deletions) have been discovered by nonamplification when performing conventional PCR; and the presence of a last heterozygous CNV has been demonstrated by qPCR (P6). Apart from conventional PCR, targeted locus amplification (TLA) has been used to delineate the CNVs at nucleotide level. The last column shows whether the identified CNVs are novel or have been previously published. Novelty has been assessed based on exon level, not on nucleotide level. The variant identified in P19 in addition to the USH2A CNV is predicted to be benign and has a high MAF (>1%). This patient carries either a second yet unidentified USH2A mutation, or is a heterozygous carrier of the USH2A CNV and has another underlying disease gene.
Reference sequences: BEST1: NM_004183.3, EYS: NM_001292009.1, KCNV2: NM_133497.3, MERTK: NM_006343.2, OPA1: NM_015560.2, PCDH15: NM_033056.3, PDE6G: NM_002602.3, PRPH2: NM_000322.4, SPATA7: NM_018418.4, USH2A: NM_206933.2.
Figure 1Identification and characterization of a noncoding deletion. (a) Location of the deletion. The homozygous 1.3-Mb deletion identified in P9 is depicted in red, removing the first noncoding exon of the PCDH15 gene, the MTRNR2L5 gene with unknown function and two PCDH15-associated CRX-bound regions (CBR). CBR1 is situated in the promoter region, while CBR2 is located approximately 100 kb upstream. (b) Delineation of the deletion. Further refinement of the deletion breakpoint regions by conventional PCR. Black, orange and blue shaded boxes and connecting lines indicate nondeleted exon and introns, deleted exon, intron and upstream region, and nondeleted upstream region, respectively. Short grey horizontal lines correspond to designed PCR amplicons, used to delineate the deletion. The black dotted line indicates the junction product. (c) Sanger sequencing of the junction product. Delineation of the deletion at the nucleotide level, chr10: g.56478660_57777934del. (d) Electroporation assays. In order to assess the cis-regulatory effect of CBR1 and CBR2, electroporation reporter assays were performed in mouse retinal explants. The first construct consisted of CBR2 cloned in a dsRed expressing vector, upstream of a Rho-basal promoter, as CBR2 is located in a more distant regulatory region. For the second construct, CBR1 was cloned in a dsRed expressing vector without basal promoter, as it is located in the promoter region. Cis-regulatory activity could be demonstrated for the CBR2 construct, while CBR1 seems to fail in driving dsRed expression on its own. (e) Cis-regulatory landscape of PCDH15. Epigenomic marks and transcription factor binding were assessed in human adult retina, shown here for the PCDH15 locus: ATAC-seq; ChIP-seq for H3K27ac, H3K4me2, CRX, OTX2, NRL, RORB and MEF2D; and RNA-seq. The region of the deletion is marked with a shaded rectangle and the putative active promoter and enhancer regions included in the deleted region are in lighter shading.
Figure 2Identification and characterization of a deletion using targeted locus amplification (TLA) on extracted genomic DNA. (a) Location of the deletion. The heterozygous 8-kb deletion identified in P1 is depicted in red, removing the first two exons of the BEST1 gene. (b) Primer design TLA. Initial real-time polymerase chain reaction (qPCR) delineation of the deletion was hampered by highly repetitive breakpoint regions, followed by further delineation using TLA. The minimal deleted region based on the qPCR delineation is depicted in red; the gray dotted line corresponds to the 5′ and 3′ breakpoint regions and the surrounding genomic region in black. In case of a deletion, the outward-oriented TLA primers indicated in orange need to be designed in an anchor region that is not deleted, here in the downstream surrounding genomic region. (c) TLA read mapping. The amplified PCR products are sequenced by next-generation sequencing (NGS), followed by mapping of the resulting reads showing one distinct peak surrounding the anchor region. (d) Delineation of the deletion. Detailed analysis of the reads in the breakpoint regions led to the identification of the deletion at nucleotide level: chr11:g.61711373_61719810del. The upper, middle, and lower lines represent the sequences of the 5′ wild-type (wt) region, the deletion junction product, and the 3′ wt region, respectively. Microhomology of 3 bp is indicated in gray. The two boxes show the mapped reads in both breakpoint regions. Green sequences are forward reads; red sequences are reverse reads. The fully colored reads are wt reads, while the reads containing shaded sequences are junction reads spanning the deletion breakpoints. The turquoise and orange boxes highlight the 5′ and 3′ sequences, separated by the 3 bp of microhomology.
Figure 3Identification and characterization of a complex noncoding deletion using targeted locus amplification (TLA). (a) Location of the deletion. The homozygous 16-kb deletion identified in P3 is depicted in red, located upstream of the EYS gene. (b) Delineation of complex deletion. Due to problems using conventional polymerase chain reaction (PCR) as an initial delineation strategy, TLA was used for further characterization of the deletion at nucleotide level. Analysis of the resulting TLA reads revealed a complex copy-number variation (CNV) consisting of a deletion of chr6: 66,631,318-66,647,641 (hatched region), together with an insertion of a 455-bp fragment consisting of a 277-bp sequence (chr6: 66,628,168-66,628,444) located upstream of the deletion (turquoise region) joined to an inverted 178-bp fragment (chr6: 66,642,009-66,642,186) situated in the deleted region (orange region). The upper part of the figure represents the wt sequence, while the lower part shows the resulting mutated sequence. The underlying mutational mechanism, represented in the middle part, is most probably a replication-based mechanism. The numbers correspond to different template switches, based on the observed microhomology in every breakpoint region. (c) Microhomology at the breakpoints. Comparison of the sequences at the three breakpoint junctions shows microhomology at every junction, supporting microhomology-mediated break-induced replication as the underlying mechanism. (d) Cis-regulatory landscape of EYS. Epigenomic marks and transcription factor binding were assessed in human adult retina, shown here for the upstream EYS region: ATAC-seq; ChIP-seq for H3K27ac, H3K4me2, CRX, OTX2, NRL, RORB and MEF2D; and RNA-seq. The region of the deletion is marked with a shaded rectangle and the putative active promoter and enhancer region included in the deleted region are in lighter shading, for both the deletion presented in this figure (insertions/inversion indicated by black vertical lines) and another regulatory EYS deletion identified in P2. For the latter, both the promoter and an enhancer region coincide with the CNV, whereas for the complex CNV no cis-regulatory epigenomic marks could be seen in the affected region.