Literature DB >> 34728537

A hidden structural variation in a known IRD gene: a cautionary tale of two new disease candidate genes.

Hilary A Scott1, Anna Larson1, Shi Song Rong1, Sudeep Mehrotra1, Rossano Butcher1, Katherine R Chao2, Janey L Wiggs1, Emily M Place1, Eric A Pierce1, Kinga M Bujakowska1.   

Abstract

Rod-cone dystrophy (RCD), also known as retinitis pigmentosa, is an inherited condition leading to vision loss, affecting 1 in 3500 people. More than 270 genes are known to be implicated in the inherited retinal degenerations (IRDs), yet genetic diagnosis for ∼30% of IRD of patients remains elusive despite advances in sequencing technologies. The goal of this study was to determine the genetic causality in a family with RCD. Family members were given a full ophthalmic exam at the Retinal Service at Massachusetts Eye and Ear and consented to genetic testing. Whole-exome sequencing (WES) was performed and variants of interest were Sanger-validated. Functional assays were conducted in zebrafish along with splicing assays in relevant cell lines to determine the impact on retinal function. WES identified variants in two potential candidate genes that segregated with disease: GNL3 (G Protein Nucleolar 3) c.1187 + 3A > C and c.1568-8C > A; and PDE4DIP (Phosphodiester 4D Interacting Protein) c.3868G > A (p.Glu1290Lys) and c.4603G > A (p.Ala1535Thr). Both genes were promising candidates based on their retinal involvement (development and interactions with IRD-associated proteins); however, the functional assays did not validate either gene. Subsequent WES reanalysis with an updated bioinformatics pipeline and widened search parameters led to the detection of a 94-bp duplication in PRPF31 (pre-mRNA Processing Factor 31) c.73_266dup (p.Asp56GlyfsTer33) as the causal variant. Our study demonstrates the importance of thorough functional characterization of new disease candidate genes and the value of reanalyzing next-generation sequencing sequence data, which in our case led to identification of a hidden pathogenic variant in a known IRD gene.
© 2022 Scott et al.; Published by Cold Spring Harbor Laboratory Press.

Entities:  

Keywords:  peripheral visual field loss; progressive retinal degeneration

Mesh:

Substances:

Year:  2022        PMID: 34728537      PMCID: PMC8958919          DOI: 10.1101/mcs.a006131

Source DB:  PubMed          Journal:  Cold Spring Harb Mol Case Stud        ISSN: 2373-2873


INTRODUCTION

Rod-cone degeneration (RCD) also known as retinitis pigmentosa (RP) is the most prevalent type of inherited retinal disease (IRD) which causes progressive vision loss affecting 1 in 3500 people worldwide (Haim 2002). IRDs can be inherited as autosomal recessive, autosomal dominant, Chromosome X–linked, and mitochondrial traits. Genetic diagnosis may additionally be complicated because of the genetic heterogeneity of IRDs with a wide spectrum of clinical phenotypes attributed to pathogenic variants in more than 270 genes (RetNet—Retinal Information Network; uth.edu). Targeted exome panels that include known IRD genes as well as whole-exome sequencing are the standard approach for determining genetic diagnosis. These are largely focused on identifying single-nucleotide variants (SNVs), small indels, and increasingly large copy-number variations, allowing for the identification of causal mutations in ∼60% of cases (Consugar et al. 2014; Carss et al. 2017; Ellingford et al. 2018). Current gene-based therapies require accurate molecular diagnosis, which provides an important incentive for determining the cause of disease in the remaining patients (Bainbridge et al. 2008; Maguire et al. 2008; MacLaren et al. 2014). A potential source of this missing causality could be found in novel candidate IRD genes, although rare, new IRD disease genes have been discovered within the last few years (e.g., ARSG, CLCC1, POMGNT1, REEP6, AHR, IFT172) (Bujakowska et al. 2015; Arno et al. 2016; Xu et al. 2016; Khateb et al. 2018; Li et al. 2018; Zhou et al. 2018). Cosegregation of the variant with the ocular phenotype in large pedigrees is often the only indicator of a novel gene initially. To classify the variant as causal, first a gene-disease association must be established, which requires further genetic and experimental investigation. Candidate genes harboring rare variants can be prioritized based on previously characterized roles in retinal development or function, interaction of the encoded protein with known IRD-associated proteins (Zeitz et al. 2005, 2006; Audo et al. 2009), in silico pathogenicity prediction scores (Siepel et al. 2005; Kircher et al. 2014; Schwarz et al. 2014; Ionita-Laza et al. 2016; Jaganathan et al. 2019), and thorough validation with model organisms to characterize the impact on retinal function (Bujakowska et al. 2015). In this study we applied the above criteria to search for novel candidate gene in a large family with three affected siblings diagnosed with RCD. Two strong candidate genes were found to cosegregate with disease. G protein nucleolar 3 (GNL3) and Phosphodiesterase 4D interacting protein (PDE4DIP) both encode proteins that are involved with retinal development or known to interact with IRD-associated proteins (Kawashima et al. 2009; Overlack et al. 2011; Paridaen et al. 2011). Possible pathogenicity of the identified variants in both genes was studied in cell and zebrafish models; however, after exhaustive investigation, both candidate genes were excluded as the cause of disease. Ultimately the cause of disease was determined to be a difficult-to-detect structural variation in a known IRD gene, PRPF31 (Vithana et al. 2001). In this case, partial penetrance displayed in the unaffected mother and sibling further complicated the genetic analysis because of a misleading inheritance pattern that was actually determined to be autosomal dominant with haploinsufficiency (Vithana et al. 2003; Abu-Safieh et al. 2006). Our study demonstrates the importance of a thorough validation of new candidate genes and the value of sequence data reanalysis.

RESULTS

A 30-yr-old female (OGI842-1649) was seen in the IRD clinic at Massachusetts Eye and Ear (MEE) with RCD. Family history revealed that five out of seven children in the family suffered from visual impairment, with no other history of retinal degeneration in the family (Fig. 1A). Subject OGI842–1649 had decreased visual acuity, restricted visual fields, and severely reduced cone responses in full-field electroretinograms (ERGs) (Table 1). Her fundus images showed characteristic signs of rod-cone degeneration including peripheral bone spicule pigmentation, vessel attenuation, and abnormal granularity in the macula retinal pigment epithelium (RPE) (Fig. 1B). Both parents were evaluated and showed no signs of retinal abnormalities, and full-field ERGs were within normal ranges (Table 1; Fig. 1B).
Figure 1.

Pedigree and phenotypic details for family OGI842. (A) Multigenerational pedigree of OGI842 family. Affected are shown with darkened symbols, the proband is denoted with an arrow, and all subjects that were sequenced have a number designation. (B) Fundus images of the proband (OGI1649) age 33, unaffected 57-yr-old mother (OGI1831), and unaffected 58-yr-old father (OGI1653).

Table 1.

Clinical phenotypes of proband and parents

Subject IDSexAge at visit (yr)RelationshipStatusVisual acuity (best corrected)Visual field (normal ≥ 120)FundusFull-field ERG Normal values 0.5-Hz amplitude: 350–700 μV 30-Hz amplitude: 50–125 μV 30-Hz implicit time: 25–32 msec
OG1842–1649F34 ProbandRCDOD: 20/30OS: 20/30  OD: 93.98  OS: 91.41Waxy pallor of optic disc, abnormal granularity of the RPE in the macula OU; peripheral bone spicule pigmentation, as well as attenuated vessels OUOD:OS:
0.5-Hz amp:NA2.70 μV
30-Hz amp:0.59 μV0.45 μV
30-Hz time:78 msec78 msec
OG1842–1831F57 MotherUnaffectedOD:20/25+3OS: 20/30  FullOptic disc, macula, vessels, and periphery all normal; minimal cortical lens opacities indicating mild cataracts; no evidence of RCDOD:OS:
0.5-Hz amp:494 μVNo data
30-Hz amp:41.2 μV65 μV
30-Hz time:31 msec33 msec
OG1842–1653M58 FatherUnaffectedOD:20/20OS: 20/200  Full   OD:100  OS: 114Optic disc, macula, and vessels are normal; nasal fundus slightly tigroid in periphery; no evidence of RCDOD:OS:
0.5-Hz amp:317.6 μV470.6 μv
30-Hz amp:41.2 μV47.1 μV
30-Hz time:31 msecNA

(ERG) Electroretinogram, (RCD) rod-cone dystrophy, (OU) both eyes, (OD) right eyes, (OS) left eye, (RPE) retinal pigment epithelium, (F) female, (M) male.

Pedigree and phenotypic details for family OGI842. (A) Multigenerational pedigree of OGI842 family. Affected are shown with darkened symbols, the proband is denoted with an arrow, and all subjects that were sequenced have a number designation. (B) Fundus images of the proband (OGI1649) age 33, unaffected 57-yr-old mother (OGI1831), and unaffected 58-yr-old father (OGI1653). Clinical phenotypes of proband and parents (ERG) Electroretinogram, (RCD) rod-cone dystrophy, (OU) both eyes, (OD) right eyes, (OS) left eye, (RPE) retinal pigment epithelium, (F) female, (M) male. Molecular analysis was performed on the parents (OGI842–1653, OGI842–1831) and four siblings for whom DNA was available (OGI842–1649, OGI842–1650, OGI842–1651, and OGI842–1652). Potential causal variants obtained from exome sequencing were filtered based on minor allele frequency (MAF) ≤ 0.1%. Whole-exome sequencing (WES) analysis identified rare biallelic variants in two candidate genes, GNL3 with c.1187 + 3A > C and c.1568-8C > A splice region variants (Fig. 2A) and PDE4DIP with two missense variants (c.3868G > A, p.Glu1290Lys and c.4603G > A, p.Ala1535Thr) (Fig. 3A). Variants in both candidate genes were rare or absent in the gnomAD database and were selected for further analysis based on a combination of evolutionary conservation, aberrant splicing, and pathogenicity predictions (Table 2; Karczewski et al. 2020).
Figure 2.

Investigating variants in GNL3, a candidate gene for inherited retinal degeneration (IRD). (A) Pedigree analysis confirms segregation of GNL3 with disease; filled symbols represent affected individuals, empty symbols represent healthy individuals, circles are females, and squares are males. (B) Next-generation sequencing (NGS) analysis of mRNA splicing of the exon 10–13 minigene construct. Read coverage and exon junctions are represented by sashimi plots for the wild-type control (red), the c.1187 + 3A > C variant (blue), and the essential splice site control c.1187 + 1G > T (green). The corresponding minigene transcript model with the position of the mutation indicated with a red arrowhead is shown below. Exon 11 skipping (5605 junction reads) and intron 10 retention with partial exon 11 inclusion and exon 12 skipping (4035 reads) are the major effects of the c.1187 + 3A > C variant. Exon 11 skipping (7523 reads) and exon 11/12 skipping (8059) reads are the major essential in the essential splice site control. Normalizing exon 11 splicing events to adjacent skipping of exon 11 occurs in 60% of transcripts (5605/9234 read ratio) as compared to wild type 9242/10,914, in which exon 12 skipping occurs in 43% (4035/9234 read ratio). (C) Sashimi plots generated from the amplicon sequencing on wild type (red), c.1568-8C > A (blue), and essential splice site mutation c.1568-1G > T (green) across regions of the minigene including exons 13–15. No difference is observed in the splicing pattern between the wild-type control and the c.1568-8C > A variant. Essential splice site control, c.1568-1G > T, showed complete intron 14 inclusion.

Figure 3.

Investigating PDE4DIP as a candidate gene for inherited retinal degeneration (IRD). (A) Pedigree analysis confirming segregation of p.Glu1290Lys and p.Ala1535Thr missense variants. (B) Schematic of the PDE4DIP gene and the encoded protein (Myomegalin, PDB: MYOME_HUMAN (Q5VU43)) with mutations indicated (red arrowheads) in exons 26 and 28. (C) Retinal histology of pde4dip morphants at 5 d postfertilization (dpf), showing no observable differences in morphology as compared to controls. (D) Visual-motor response (VMR) assay showing a significant difference in response to light stimuli in pde4dip morphants (red, n = 144) and control MO (black, n = 143) (Tukey honestly significant difference [HSD], Light OFF P = 2.73 × 10−13, Light ON P = 0.0136). (E) Acoustically evoked behavior response (AEBR) assay showing decreased response in pde4dip morphants (red) compared to control fish (black) (n = 144, Tukey HSD, P = 0.8996) indicating general locomotor abnormalities. (F) Zebrafish analyzed at 5 dpf showing absence of gas bladders in pde4dip morphants. (G) Quantification of the gas bladder presence in pde4dip morphants (red) (n = 134, 54.48%) versus control fish (black) (n = 134, 98.51%), Fisher's exact test P < 0.0001.

Table 2.

Variants identified in whole-exome sequencing

GenecDNA changeConsequenceCADDMutation tasterSplice AIgnomADPhast ConEigenClinVarACMG
GNL3 c.1187 + 3A > CExon skipping14Disease-causing0.75Absent0.99918NRLP (likely pathogenic), PVS1, PS3, PM2, PP1, PP3, BP5
GNL3 c.1568-8C > ANA7.4Disease-causing0.15Absent0.0018.1NRVUS (variant of uncertain significance), PM2, PM3, PP3, PP4, BS3, BP5
PDE4DIP c.3868G > Ap.Glu1290Lys32Disease-causing0.01.32 × 10−51.006.9NRLB (likely benign), PM2, PP3, PP4, BS3, BP5
PDE4DIP c.4603G > Ap.Ala1535Thr23Polymorphism0.03.281 × 10−40.7731.0NRLB (likely benign), PM2, PP3, PP4, BS3, BP5
PRPF31 c.73_166dupp.Asp56Glyfs Ter33NADisease-causingNAAbsentaNANALPP (pathogenic), PVS1, PM2, PP1, PP3, PP4

(CADD) Combined Annotation Dependent Depletion, (gnomAD) Genome Aggregation Database, (PhastCon) PHylogenetic Analysis with Space/Time models, (ACMG) American College of Medical Genetics and Genomics, (NR) not reported.

aAbsent from both gnomAD and gnomAD-SV.

Investigating variants in GNL3, a candidate gene for inherited retinal degeneration (IRD). (A) Pedigree analysis confirms segregation of GNL3 with disease; filled symbols represent affected individuals, empty symbols represent healthy individuals, circles are females, and squares are males. (B) Next-generation sequencing (NGS) analysis of mRNA splicing of the exon 10–13 minigene construct. Read coverage and exon junctions are represented by sashimi plots for the wild-type control (red), the c.1187 + 3A > C variant (blue), and the essential splice site control c.1187 + 1G > T (green). The corresponding minigene transcript model with the position of the mutation indicated with a red arrowhead is shown below. Exon 11 skipping (5605 junction reads) and intron 10 retention with partial exon 11 inclusion and exon 12 skipping (4035 reads) are the major effects of the c.1187 + 3A > C variant. Exon 11 skipping (7523 reads) and exon 11/12 skipping (8059) reads are the major essential in the essential splice site control. Normalizing exon 11 splicing events to adjacent skipping of exon 11 occurs in 60% of transcripts (5605/9234 read ratio) as compared to wild type 9242/10,914, in which exon 12 skipping occurs in 43% (4035/9234 read ratio). (C) Sashimi plots generated from the amplicon sequencing on wild type (red), c.1568-8C > A (blue), and essential splice site mutation c.1568-1G > T (green) across regions of the minigene including exons 13–15. No difference is observed in the splicing pattern between the wild-type control and the c.1568-8C > A variant. Essential splice site control, c.1568-1G > T, showed complete intron 14 inclusion. Investigating PDE4DIP as a candidate gene for inherited retinal degeneration (IRD). (A) Pedigree analysis confirming segregation of p.Glu1290Lys and p.Ala1535Thr missense variants. (B) Schematic of the PDE4DIP gene and the encoded protein (Myomegalin, PDB: MYOME_HUMAN (Q5VU43)) with mutations indicated (red arrowheads) in exons 26 and 28. (C) Retinal histology of pde4dip morphants at 5 d postfertilization (dpf), showing no observable differences in morphology as compared to controls. (D) Visual-motor response (VMR) assay showing a significant difference in response to light stimuli in pde4dip morphants (red, n = 144) and control MO (black, n = 143) (Tukey honestly significant difference [HSD], Light OFF P = 2.73 × 10−13, Light ON P = 0.0136). (E) Acoustically evoked behavior response (AEBR) assay showing decreased response in pde4dip morphants (red) compared to control fish (black) (n = 144, Tukey HSD, P = 0.8996) indicating general locomotor abnormalities. (F) Zebrafish analyzed at 5 dpf showing absence of gas bladders in pde4dip morphants. (G) Quantification of the gas bladder presence in pde4dip morphants (red) (n = 134, 54.48%) versus control fish (black) (n = 134, 98.51%), Fisher's exact test P < 0.0001. Variants identified in whole-exome sequencing (CADD) Combined Annotation Dependent Depletion, (gnomAD) Genome Aggregation Database, (PhastCon) PHylogenetic Analysis with Space/Time models, (ACMG) American College of Medical Genetics and Genomics, (NR) not reported. aAbsent from both gnomAD and gnomAD-SV. GNL3 encodes G protein nucleolar 3, also known as nucleostemin, which maintains cell proliferation and is necessary for the correct timing of cell cycle exit and differentiation (Tsai and McKay 2002). GNL3 is expressed in retinal progenitor cells (Schmitt et al. 2009), and zebrafish gnl3 mutants were shown to have smaller eyes and delayed retinal cell development (Paridaen et al. 2011). Therefore, we hypothesized that the two splice region variants c.1187 + 3A > C and c.1568-8C > A in GNL3 may lead to retinal dysfunction. Both changes were absent in gnomAD and TOPMed. The c.1187 + 3A > C change, located in intron 11, was predicted to affect splicing by multiple splicing prediction software, whereas c.1568-8C > A in intron 14 had an elevated Eigen score (Ionita-Laza et al. 2016), and both were predicted to be disease causing by MutationTaster (Table 2; Schwarz et al. 2014). Both variants were studied by splicing assays in HEK293T cells. The splicing constructs spanned exons surrounding the studied variants with two controls: a wild type and an essential splice site control (Fig. 2B,C). Next-generation sequencing (NGS) analysis of the midigene transcripts revealed skipping of exon 11 and 12 in the c.1187 + 3A > C variant and essential splice site control (Fig. 2B). However, the second studied variant, c.1568-8C > A, had no effect on splicing (Fig. 2C). This variant was therefore considered to be nonpathogenic. In addition, knocking down gnl3 with a splice blocking morpholino oligonucleotide (MO) in zebrafish did not show any morphological or functional effects on the eye or the retina (Supplemental Fig. S1). These findings conclusively dismissed GNL3 as being the cause of disease in this family. We therefore turned our attention toward the second candidate gene, PDE4DIP, whose product, myomegalin, is involved in cAMP-dependent signaling (Verde et al. 2001) and organization of centrosomal and Golgi-derived microtubules (Wang et al. 2014). Isoforms of myomegalin in the retina have been shown to interact with Sans, encoded by USH1G, Usher syndrome 1G (Overlack et al. 2011). Loss of USH1G causes congenital hearing loss and RCD (Mustapha et al. 2002), thus the implication of PDE4DIP playing a role in the Usher interactome made it a promising candidate gene for retinal degeneration (Weil et al. 2003). WES detected two missense variants in PDE4DIP (c.3868G > A, p.Glu1290Lys and c.4603G > A, p.Ala1535Thr; NM_001198834.3) that are rare in gnomAD and are highly conserved (Table 2; Fig. 3A,B; Karczewski et al. 2020). Zebrafish studies were carried out to determine the effect of pde4dip MO knockdown on the retina (Fig. 3C,D; Supplemental Fig. S2). Even though no structural retinal abnormalities were observed (Fig. 3C), pde4dip morphants showed reduced function in the visual motor response (VMR) (Fig. 3D). To exclude the possibility that the reduced VMR response was a result of other developmental anomalies rather than retinal malfunction, we performed an acoustically evoked behavioral response (AEBR) assay. AEBR measures the motor response after an acoustic stimulus, providing a vision-independent control. In the AEBR experiment, pde4dip morphants had lowered startle response compared to control fish, indicating other developmental defects unrelated to vision were affecting motor response (Fig. 3E). Upon a closer inspection of the fish morphology, we realized that ∼50% of the pde4dip morphants showed maldeveloped gas bladders, which indicates a general swimming impediment and explains reduced or lack of startle response upon visual or acoustic stimulation (Fig. 3F,G). Similar experiments repeated in CRISPR–Cas9 generated pde4dip mutant larvae also showed no retinal phenotype (Supplemental Figs. S3 and S4). Given this, we concluded that the pde4dip morphant phenotype is not specifically related to vision and that the loss of pde4dip in zebrafish and likely other organisms has no effect on retinal development and is unlikely the cause of disease in the studied family. As both candidate genes were disproven, WES data was reanalyzed. Removing the inheritance and variant quality filters (variant quality score recalibration) (McKenna et al. 2010) led to the identification of a 94-nt tandem duplication within the first coding exon of Pre-mRNA Processing Factor 31, PRPF31, which is expressed in all tissues, including the retina (Lonsdale et al. 2013) (https://www.gtexportal.org/home/gene/PRPF31). The PRPF31 encoded protein, U4/U6 small nuclear ribonucleoprotein Prp31, facilitates protein–RNA interactions required for spliceosome assembly. The retina may be particularly sensitive to perturbations in splicing as mutations in splicing factors (Tanackovic et al. 2011), including SNRNP200, PRPF3, PRPF8, and PRPF31 among others, are known to cause autosomal dominant rod-cone dystrophy (McKie et al. 2001; Vithana et al. 2001; Chakarova et al. 2002; Zhao et al. 2009). The duplication, c.73_166dup, leads to a frameshift that creates a stop codon, p.Asp56GlyfsTer33 (Fig. 4A) most likely resulting in nonsense mediated decay of the transcript (Fig. 4B). Polymerase chain reaction (PCR) amplification of the region surrounding the duplication confirmed that all three affected family members (OGI842–1649, OGI842–1650, OGI842–1651) carry this variant, as well as the unaffected sibling (OGI842–1652) and the mother (OGI842–1831), indicating incomplete penetrance (Fig. 4C). Sanger sequencing confirmed mapping of the duplication to hg19: Chr 19:54,621,731–54,621,824 (Fig. 4D). The disease mechanism of PRPF31 is known to be haploinsufficiency, and therefore the uncovered heterozygous loss-of-function variant was determined to be the cause of disease in this family, with two asymptomatic mutation carriers. Partial penetrance, a known feature of PRPF31-associated RCD (Vithana et al. 2003; Rivolta et al. 2006), explains the initial assumption that disease inheritance in this pedigree was autosomal recessive. The inheritance of the wild-type allele was shown to determine disease penetrance (Vithana et al. 2003; Rivolta et al. 2006), which was also seen in the presented family, in which the unaffected sibling inherited a different wild-type allele from her father than the affected siblings (Fig. 4A).
Figure 4.

Validation of PRPF31 duplication. (A) Familial segregation of the c.73_166dup, p.Asp56GlyfsTer33 variant (red bar) on maternal allele (black bar) revealing autosomal dominant inheritance pattern with partial penetrance. Segregation of the paternal wild-type allele among siblings with unaffected (white bar) and affected (gray bar). (B) A schematic diagram of the first two coding exons of PRPF31 showing primers used to validate the proposed tandem duplication below. The altered amino acid sequence (p.Asp56GlyfsTer33) of the duplication results in the creation of a stop codon (red octagon). (C) Polymerase chain reaction (PCR) amplification of gDNA from all family members showing the heterozygous duplication (two bands) in all but the father (OGI842-1653). The lower band 335 bp is expected for wild-type allele, the upper band 429 bp is expected with the inclusion of the 94-bp duplication. (D) Chromatogram for Sanger sequencing shows start point for duplication with genomic location hg19 Chr 19:54621731-54621824.

Validation of PRPF31 duplication. (A) Familial segregation of the c.73_166dup, p.Asp56GlyfsTer33 variant (red bar) on maternal allele (black bar) revealing autosomal dominant inheritance pattern with partial penetrance. Segregation of the paternal wild-type allele among siblings with unaffected (white bar) and affected (gray bar). (B) A schematic diagram of the first two coding exons of PRPF31 showing primers used to validate the proposed tandem duplication below. The altered amino acid sequence (p.Asp56GlyfsTer33) of the duplication results in the creation of a stop codon (red octagon). (C) Polymerase chain reaction (PCR) amplification of gDNA from all family members showing the heterozygous duplication (two bands) in all but the father (OGI842-1653). The lower band 335 bp is expected for wild-type allele, the upper band 429 bp is expected with the inclusion of the 94-bp duplication. (D) Chromatogram for Sanger sequencing shows start point for duplication with genomic location hg19 Chr 19:54621731-54621824.

DISCUSSION

Our study describes a search for the genetic cause of disease in an RCD family whose inheritance pattern appeared to be autosomal recessive with three affected members. WGS identified rare variants in two candidate genes that segregated with the disease phenotype and met the criteria determined in the outset of the study of being highly conserved, rare, or absent in genome aggregation databases, being predicted to be disease causing, and having relevant interactions with known IRD genes. The first candidate gene investigated, GNL3, also known as nucleostemin, belongs to a family of nucleolar GTP-binding proteins that includes members GNL3L and GNL2, which has been shown to be expressed in retinal precursors (Kawashima et al. 2009; Schmitt et al. 2009). Zebrafish models show both that gnl2−/− and gnl3−/− play a role in retinal neurogenesis with their loss leading to delayed retinal differentiation. Proteins coded by these genes have a partly redundant function with gnl3−/− displaying a milder phenotype with transient abnormalities at early stages of development at 22 h postfertilization (hpf), which resolve at later time points (Paridaen et al. 2011). However, our results showed no phenotype associated with knockdown of gnl3 using a splice blocking morpholino. Our attempts to use a translational blocking morpholino that would have arguably been a stronger knockdown and more closely comparable to the previous knockout model resulted in a “monster” phenotype and therefore could not be studied further. The lack of developmental phenotype in our gnl3 morphants may also be due to the fact that we evaluated the morphant larvae at a later developmental stage, at 5 dpf rather than 22 hpf (Paridaen et al. 2011). Ultimately, the lack of phenotype in our zebrafish knockdown, as well as the absence of splicing aberrations in in vitro assays, led us to exclude GNL3 as the cause of disease in this family. The second gene, PDE4DIP, encodes myomegalin, which has been shown to interact with SANS, an Usher interactome protein encoded by USH1G (Overlack et al. 2011). In the end, functional assays in zebrafish failed to demonstrate PDE4DIP impacted retinal function. Our results thus demonstrate the importance of not only the stringent selection criteria for candidate genes but also of rigorous functional validation to avoid falsely reporting new disease genes. Even though most of the new disease–gene associations demonstrate multiple levels of evidence for the pathogenicity of discovered variants, a recent study by Hanany and Sharon (2019) demonstrated a high false discovery rate within autosomal dominant IRD genes that was a result of mistakenly classified pathogenic variants. In our case, both genes appeared to be strong candidates and only functional analyses led to the exclusion of both candidate genes as the cause of disease. Reanalysis of sequencing data led to the detection of a causative 94-bp tandem duplication (NM_015629.4:c.73_166dup (p.Asp56fs)), which creates a premature termination codon in PRPF31. The duplication was present in the unaffected mother and an unaffected sibling. The mother displayed no clinical manifestations of RCD even though she carried the mutation. The unaffected sibling not studied clinically, however, reported no symptoms. Additionally, she inherited an alternate wild-type allele compared to her affected siblings’ further supporting partial penetrance that obscured the autosomal dominant inheritance pattern. In addition, the 94-bp duplication is too large to be included in the standard high-quality alignments and too small to be detected by read-depth copy-number variation (CNV) detection algorithms, such as gCNV (McKenna et al. 2010). This led to the PRPF31 variant being excluded in the initial variant filtering, even though loss-of-function variants in PRPF31 are a known cause of dominant RCD through haploinsufficiency (Abu-Safieh et al. 2006). Structural changes have increasingly been shown to contribute to retinal disease, with an estimated contribution of CNVs of up to 9% of IRD cases (Van Cauwenbergh et al. 2017; Ellingford et al. 2018; Zampaglione et al. 2020). Gene size has been shown to be the strongest predictor for genes enriched with structural variation followed by long interspersed nuclear elements (LINEs), long terminal repeats (LTRs), and segmental duplications (Van Schil et al. 2018). For example, USH2A is one of the largest IRD genes (800.5 kb, hg19 Chr 1:215,796,236–216,596,738), and a recent publication found potentially causal structural variants in 9% of an Usher syndrome cohort (Bonnet et al. 2016). PRPF31, however, is an outlier. It is a small gene (16.3 kb, hg19 Chr 19: 54,621,731-54,621,824) that is not in proximity to regions prone to nonallelic homologous recombination (NAHR) (Bujakowska et al. 2016), yet likely causal CNVs account for a large number of patients with PRPF31-associated disease (Sullivan et al. 2006; Zampaglione et al. 2020). PRPF31 does contain a large number of total repeats relative to its size including low copy repeats that flank unique sequences that lead to nonrecurrent CNVs (Carvalho and Lupski 2016), as well as LINEs and LTRs (Van Schil et al. 2018). In the present case, the location of the duplication is in a low-complexity region containing simple repeats, and the flanking sequences of the duplication contains a repeat (GCTATGGG) that may have facilitated DNA polymerase errors leading to the inclusion of the intervening region (Fig. 4D). The enrichment of genomic rearrangements combined with the complex inheritance pattern likely contributes to the underreporting of causal variants in PRPF31. Recent studies have shown diagnostic rates are improved by applying CNV analysis in an unbiased way in the initial analysis along with standard NGS pipelines, and this would allow for detection of causal CNVs in autosomal dominant genes such as PRPF31 (Bonnet et al. 2016; Ellingford et al. 2018). The overall contribution of PRPF31 mutations in IRD is likely underestimated and should be considered in apparent autosomal recessive pedigrees without a clear solution in a known IRD gene. This case study highlights the importance of thorough functional characterization of new candidate genes and the value of reanalysis of NGS sequence data as false discovery of novel genes can obscure true genetic diagnosis and may prevent the use of appropriate gene-based therapies in affected patients.

METHODS

Ethical Guidelines

The study was approved by the institutional review board at the MEE, an affiliate of Mass General Brigham (MGB) healthcare system (Human Studies Committee MGB), and complied with the Health Information Portability and Accessibility Act (HIPAA). All aspects of the project adhered to the tenets of the Declaration of Helsinki. Informed consent was obtained from all individuals on whom genetic testing and further molecular evaluations were performed.

Clinical Evaluation

Ophthalmic evaluations were performed by clinicians experienced in inherited retinal degenerations at MEE. Visual acuity was measured using Snellen and Early Treatment of Diabetic Retinopathy Study (ETDRS) charts. Kinetic perimetry was performed using a Goldmann perimeter. Full-field ERG was performed using Burian Allen electrodes and custom ERG system with previously described parameters (Reichel et al. 1989; Marmor et al. 2009).

Genetic Screening

DNA extracted from venous blood using the DNeasy Blood and Tissue Kit (QIAGEN) was used for all the sequencing reactions. WES was performed on OGI842–1649, OGI842–1651, OGI842–1653, and OGI842–1831, and data processing was performed by the Genomics Platform at the Broad Institute of MIT and Harvard with an Illumina exome capture (38 Mb target) and sequenced (150-bp paired reads) to cover >80% of targets at 20× and a mean target coverage of 100× (Supplemental Table S2). Exome sequencing data was processed through a pipeline based on Picard and mapping done using the BWA aligner to the human genome build 37 (hg19). The variant call sets were uploaded to variant analysis platform seqr (http://seqr.broadinstitute.org). Sanger sequencing was performed in all family members to validate potentially causal variants identified in WES including individuals who were not previously analyzed by WES (OGI842–1650 and OGI842–1652).

Splicing Assay

Variants selected for analysis with the midigene assay were evaluated for possible effects on splicing using both Alamut software (Alamut Visual; http://www.interactive-biosoftware.com/alamut-visual/) and Splice AI neural network (Jaganathan et al. 2019). Two constructs containing GNL3 exons 10–13 including intronic regions and GNL3 with exons 13–15 along with downstream 3′ regions, were amplified from patient's genomic DNA (Pfu Ultra II polymerase, Agilent Technologies, primers listed in Supplemental Table S1). The PCR products were cloned into pENTR-TOPO/D (ThermoFisher) and verified by Sanger sequencing. Clones carrying essential splice-site mutations were created by site-directed mutagenesis (QuickChange II Site Directed mutagenesis kit, Agilent Technologies) on the wild-type constructs and verified by Sanger sequencing. Clones containing the desired sequences (wild-type, essential splice site, and variant to be studied) were subcloned into pCSGW2+ vector (Jamshidi et al. 2019) by Gateway cloning using LR clonase (Invitrogen). pCSGW2+ GNL3 constructs were transfected into HEK293 cells using 3 µg of total DNA and Lipofectamine reagent (Invitrogen) into six-well plates (Corning) seeded with 5 × 105 cells. Human embryonic kidney cells (HEK293T) were grown on Dulbecco's Modified Eagle's Media (Gibco) and 10% fetal bovine serum (Thermo Fisher) at 37°C and 5% CO2. Total RNA was extracted from cells 72 h after transfection (RNAeasy mini kit, QIAGEN). cDNA was synthesized with Super Script II reverse transcriptase kit (Invitrogen) with 500 ng RNA input. Finally, PCR amplification using a primer directed toward the plasmid and a second primer specific to the insert containing flanking exons of the splice-site mutations (Supplemental Table S1) was performed followed by visualization by agarose gel electrophoresis and NGS (Illumina). STAR (version 2.5.3a) aligner (Dobin et al. 2012) was used to generate an index of the human genome (GRCh37.75.dna.primary_assembly.fa) and to align the reads for the analysis of splicing patterns from amplicon sequencing. Integrative Genomics Viewer (IGV) (Robinson et al. 2011) was used to load the aligned sequences (BAM files) for data visualization with Sashimi plots.

Knockdown in Zebrafish

This study used an in-house cross of the AB and TL strain (source: ZIRC) for all experiments. Breeding and embryo maintenance were performed in accordance with MEE Zebrafish Core Facility standard procedures and IACUC-approved protocols. Embryos were injected at the one- to eight-cell stage, where each embryo received 2.3 nL of either 0.4 mM of the standard control MO or 0.2 mM of each splice site blocking MO (Supplemental Table S1). cDNA was extracted from each condition and analyzed via PCR and Sanger sequencing to confirm the effect on splicing. Zebrafish larvae external phenotypes were evaluated at day 5. Histology was performed by sectioning zebrafish larvae in paraffin blocks and hematoxylin and eosin staining (SERI histology core, MGB). To confirm morpholino results, CRISPR–Cas9 knockdown was performed targeting pde4dip exons 4 and 13. Zebrafish embryos were collected within 10 min of the beginning of breeding to ensure the one-cell stage of the embryos. Both sgRNA (targeting exons 4 and 13; Supplemental Table S1) and Cas9 protein (EnGen Spy Cas9, NEB) were coinjected into a cell cytoplasm of one-cell staged embryos. The mosaic F0 fish were kept for 5 d and analyzed functionally as morphant fish. Genomic DNA was extracted from the larvae after each assay and analyzed using NGS to observe cutting efficiency.

Visual-Motor Response and Acoustically Evoked Behavior Response

A combined VMR and AEBR assay was used to test the visual function and general motor function, respectively (Zeddies and Fay 2005; Emran et al. 2008; Faria et al. 2019). In brief, larvae at 5 dpf were tested using the ZebraLab system (ViewPoint Life Sciences). Larvae with body deformation or other severe external phenotypes were excluded. Individual larvae were placed in individual wells of a 96-well plate and placed in the Zebralab system, which is isolated from outside vibration and light. In the AEBR test, larvae were adapted for 10 min before being challenged by three consecutive vibrations (200 Hz for 500 msec) with 5-min intervals under normal light illumination. In the VMR test, larvae were dark-adapted for 40 min before going through three consecutive trials of light onset and light offset periods that last for 30 min each (Emran et al. 2008). The light change (on or off) was abrupt and was not fading. The animal's motor response to the stimuli was recorded using infrared light and camera. All the functional tests were performed between 10 a.m. and 3 p.m.

Statistical Analysis of Zebrafish Functional Data

For the combined AEBR and VMR analysis, activity summarization and subsequent statistical analyses were performed using a customized analytical pipeline developed using R software (version 3.5.0; http://www.r-project.org/) building on previous analytical methods (Emran et al. 2008; Liu et al. 2015). The statistical analysis part of our analytical pipeline script is accessible in a GitHub repository: https://github.com/rongshisong/zebrafish_VMR_analysis. The differences between conditions and control in the mean activity level at the onset of the acoustic stimulation and during the first second after light change were tested using Tukey HSD with a P-value of 0.05 or greater considered significant. All experiments were done with at least n = 96 fish in each condition. Comparison of gas bladder presence was tested using Fisher's exact test.

ADDITIONAL INFORMATION

Data Deposition and Access

The causal variant identified in the present study has been deposited to ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/) with accession number SCV001950339.

Ethics Statement

The study was approved by the institutional review board at the Massachusetts Eye and Ear (MEE), an affiliate of Mass General Brigham (MGB) healthcare system (Human Studies Committee MGB, Boston, MA, USA) and complied with the Health Information Portability and Accessibility Act (HIPAA). All aspects of the project adhered to the tenets of the Declaration of Helsinki. Informed consent was obtained from all individuals on whom genetic testing and further molecular evaluations were performed.

Acknowledgments

The authors thank the patients and their family members for their participation in this study and the Ocular Genomics Institute Genomics Core members for their experimental assistance. The authors thank the Exome Aggregation Consortium, the Genome Aggregation Database (GnomAD), and the groups that provided exome variant data for comparison. A full list of contributing groups can be found at http://exac.broadinstitute.org/about and http://gnomad.broadinstitute.org/about.

Author Contributions

H.A.S. and A.L. performed experiments and contributed to the writing of the manuscript. R.B. assisted in animal model experiments. S.S.R. and S.M. provided data analysis. J.L.W. supported zebrafish data analysis and reviewed the manuscript. E.M.P. provided clinical data and reviewed the manuscript. E.A.P. contributed to the experimental design and reviewed the manuscript. K.R.C. analyzed sequencing data. K.M.B. guided the experimental design, provided variant analysis, and contributed to the writing of the manuscript.

Funding

This work was supported by grants from the National Eye Institute (R01EY026904 [K.M.B. and E.A.P.], R01EY012910 [E.A.P.], and P30EY014104 [MEEI core support]), the Foundation Fighting Blindness (EGI-GE-1218-0753-UCSD, K.M.B.), Research to Prevent Blindness (to K.M.B.), and the Curing Kids Foundation. Sequencing and analysis were provided by the Broad Institute of MIT and Harvard Center for Mendelian Genomics (Broad CMG) and were funded by the National Human Genome Research Institute, the National Eye Institute, and the National Heart, Lung, and Blood Institute grant UM1HG008900 and in part by National Human Genome Research Institute grant R01 HG009141.

Competing Interest Statement

The authors have declared no competing interest.

Referees

Peter N Robinson Anonymous
  58 in total

1.  Mutations in POMGNT1 cause non-syndromic retinitis pigmentosa.

Authors:  Mingchu Xu; Takeyuki Yamada; Zixi Sun; Aiden Eblimit; Irma Lopez; Feng Wang; Hiroshi Manya; Shan Xu; Li Zhao; Yumei Li; Adva Kimchi; Dror Sharon; Ruifang Sui; Tamao Endo; Robert K Koenekoop; Rui Chen
Journal:  Hum Mol Genet       Date:  2016-01-28       Impact factor: 6.150

2.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

3.  Mutations in the pre-mRNA splicing factor gene PRPC8 in autosomal dominant retinitis pigmentosa (RP13).

Authors:  A B McKie; J C McHale; T J Keen; E E Tarttelin; R Goliath; J J van Lith-Verhoeven; J Greenberg; R S Ramesar; C B Hoyng; F P Cremers; D A Mackey; S S Bhattacharya; A C Bird; A F Markham; C F Inglehearn
Journal:  Hum Mol Genet       Date:  2001-07-15       Impact factor: 6.150

4.  Variation in retinitis pigmentosa-11 (PRPF31 or RP11) gene expression between symptomatic and asymptomatic patients with dominant RP11 mutations.

Authors:  Carlo Rivolta; Terri L McGee; Thomas Rio Frio; Roderick V Jensen; Eliot L Berson; Thaddeus P Dryja
Journal:  Hum Mutat       Date:  2006-07       Impact factor: 4.878

5.  A novel locus for Usher syndrome type I, USH1G, maps to chromosome 17q24-25.

Authors:  Mirna Mustapha; Eliane Chouery; Delphine Torchard-Pagnez; Sylvie Nouaille; Awni Khrais; Fouad N Sayegh; André Mégarbané; Jacques Loiselet; Mark Lathrop; Christine Petit; Dominique Weil
Journal:  Hum Genet       Date:  2002-03-12       Impact factor: 4.132

6.  Autosomal-dominant retinitis pigmentosa caused by a mutation in SNRNP200, a gene required for unwinding of U4/U6 snRNAs.

Authors:  Chen Zhao; Deepti L Bellur; Shasha Lu; Feng Zhao; Michael A Grassi; Sara J Bowne; Lori S Sullivan; Stephen P Daiger; Li Jia Chen; Chi Pui Pang; Kanxing Zhao; Jonathan P Staley; Catharina Larsson
Journal:  Am J Hum Genet       Date:  2009-10-29       Impact factor: 11.025

7.  Effect of gene therapy on visual function in Leber's congenital amaurosis.

Authors:  James W B Bainbridge; Alexander J Smith; Susie S Barker; Scott Robbie; Robert Henderson; Kamaljit Balaggan; Ananth Viswanathan; Graham E Holder; Andrew Stockman; Nick Tyler; Simon Petersen-Jones; Shomi S Bhattacharya; Adrian J Thrasher; Fred W Fitzke; Barrie J Carter; Gary S Rubin; Anthony T Moore; Robin R Ali
Journal:  N Engl J Med       Date:  2008-04-27       Impact factor: 91.245

8.  Mutation in the intracellular chloride channel CLCC1 associated with autosomal recessive retinitis pigmentosa.

Authors:  Lin Li; Xiaodong Jiao; Ilaria D'Atri; Fumihito Ono; Ralph Nelson; Chi-Chao Chan; Naoki Nakaya; Zhiwei Ma; Yan Ma; Xiaoying Cai; Longhua Zhang; Siying Lin; Abdul Hameed; Barry A Chioza; Holly Hardy; Gavin Arno; Sarah Hull; Muhammad Imran Khan; James Fasham; Gaurav V Harlalka; Michel Michaelides; Anthony T Moore; Zeynep Hande Coban Akdemir; Shalini Jhangiani; James R Lupski; Frans P M Cremers; Raheel Qamar; Ahmed Salman; John Chilton; Jay Self; Radha Ayyagari; Firoz Kabir; Muhammad Asif Naeem; Muhammad Ali; Javed Akram; Paul A Sieving; Sheikh Riazuddin; Emma L Baple; S Amer Riazuddin; Andrew H Crosby; J Fielding Hejtmancik
Journal:  PLoS Genet       Date:  2018-08-29       Impact factor: 5.917

9.  Copy-number variation is an important contributor to the genetic causality of inherited retinal degenerations.

Authors:  Kinga M Bujakowska; Rosario Fernandez-Godino; Emily Place; Mark Consugar; Daniel Navarro-Gomez; Joseph White; Emma C Bedoukian; Xiaosong Zhu; Hongbo M Xie; Xiaowu Gai; Bart P Leroy; Eric A Pierce
Journal:  Genet Med       Date:  2016-10-13       Impact factor: 8.822

10.  Nucleostemin as a possible progenitor marker of corneal epithelial cells.

Authors:  Motoko Kawashima; Tetsuya Kawakita; Satoru Yoshida; Shigeto Shimmura; Kazuo Tsubota
Journal:  Mol Vis       Date:  2009-06-10       Impact factor: 2.367

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.