Literature DB >> 35759666

The landscape of submicroscopic structural variants at the OPN1LW/OPN1MW gene cluster on Xq28 underlying blue cone monochromacy.

Bernd Wissinger1, Britta Baumann1, Elena Buena-Atienza1, Zeinab Ravesh1, Artur V Cideciyan2, Katarina Stingl3,4, Isabelle Audo5,6, Isabelle Meunier7, Beatrice Bocquet7, Elias I Traboulsi8, Alison J Hardcastle9, Jessica C Gardner9, Michel Michaelides9,10, Kari E Branham11, Thomas Rosenberg12, Sten Andreasson13, Hélène Dollfus14, David Birch15, Andrea L Vincent16, Loreto Martorell17, Jaume Català Mora18, Ulrich Kellner19, Klaus Rüther20, Birgit Lorenz21,22, Markus N Preising21, Emanuela Manfredini23, Yuri A Zarate24, Raymon Vijzelaar25, Eberhart Zrenner26, Samuel G Jacobson2, Susanne Kohl1.   

Abstract

Blue cone monochromacy (BCM) is an X-linked retinal disorder characterized by low vision, photoaversion, and poor color discrimination. BCM is due to the lack of long-wavelength-sensitive and middle-wavelength-sensitive cone photoreceptor function and caused by mutations in the OPN1LW/OPN1MW gene cluster on Xq28. Here, we investigated the prevalence and the landscape of submicroscopic structural variants (SVs) at single-base resolution in BCM patients. We found that about one-third (n = 73) of the 213 molecularly confirmed BCM families carry an SV, most commonly deletions restricted to the OPN1LW/OPN1MW gene cluster. The structure and precise breakpoints of the SVs were resolved in all but one of the 73 families. Twenty-two families-all from the United States-showed the same SV, and we confirmed a common ancestry of this mutation. In total, 42 distinct SVs were identified, including 40 previously unreported SVs, thereby quadrupling the number of precisely mapped SVs underlying BCM. Notably, there was no "region of overlap" among these SVs. However, 90% of SVs encompass the upstream locus control region, an essential enhancer element. Its minimal functional extent based on deletion mapping in patients was refined to 358 bp. Breakpoint analyses suggest diverse mechanisms underlying SV formation as well as in one case the gene conversion-based exchange of a 142-bp deletion between opsin genes. Using parsimonious assumptions, we reconstructed the composition and copy number of the OPN1LW/OPN1MW gene cluster prior to the mutation event and found evidence that large gene arrays may be predisposed to the occurrence of SVs at this locus.

Entities:  

Keywords:  BCM; gene conversion; human visual pigment genes; locus control region; opsin gene deletion

Mesh:

Substances:

Year:  2022        PMID: 35759666      PMCID: PMC9271157          DOI: 10.1073/pnas.2115538119

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


Blue cone monochromacy (BCM, Online Mendelian Inheritance in Man [OMIM] no. 303700) is a congenital retinal disorder characterized by low vision, severe color vision abnormality, photoaversion, and a frequent occurrence of sensory defect nystagmus. Its prevalence has been estimated to about one in 100,000 in Western populations (1). BCM is inherited as an X-linked disorder and caused by mutations in the OPN1LW/OPN1MW gene cluster encoding the genes for the long-wavelength-sensitive (LWS) and the middle-wavelength-sensitive (MWS) cone opsins on Xq28 (2, 3). In healthy individuals the OPN1LW/OPN1MW gene cluster comprises a single OPN1LW gene followed by one or multiple OPN1MW gene copies organized in a tandem repeat structure. OPN1LW and OPN1MW sequences are more than 98% identical, including introns and the intergenic sequence between gene copies (4). A model of nonallelic homologous recombination (NAHR) between these conserved sequences has been proposed to explain the variability in copy number and the frequent occurrence of OPN1LW•OPN1MW hybrid genes (5). Expression of the gene copies within the OPN1LW/OPN1MW gene cluster is governed by a locus control region (LCR) upstream of the gene cluster. Through physical interaction of the LCR with the proximal promoter of the OPN1LW or the OPN1MW gene, the mutually exclusive expression of a single gene copy in an individual cone photoreceptor is determined and maintained (6–8). Yet, the probability of activation of gene expression decays with distance and only the two copies closest to the LCR are therefore expressed to an extent that influences color vision (9). BCM is due to a simultaneous loss of functional LWS and MWS opsins. Three main mutation mechanisms have been described to underlie BCM: 1) deletions or other structural variants (SVs) at the OPN1LW/OPN1MW gene cluster, 2) single or multiple gene copies carrying inactivating point mutations, and 3) single or multiple gene copies carrying rare exon 3 variant haplotypes inducing a splicing defect (2, 3, 10, 11). Deletions and SVs represent the category of mutations that has been studied least intensively. Except for the initial work by Nathans et al. (2), only a few additional SVs have been mapped at the nucleotide level (12–18), most likely due to the tandem repeat structure of the OPN1LW/OPN1MW gene cluster which hampers mapping of breakpoints located within the repeat sequence. In this study we investigated the frequency, extent, and precise breakpoint location of SVs underlying BCM in a cohort of 213 genetically confirmed BCM families collected over a period of 25 y. Seventy-three families were shown to carry an SV, and we determined its structure and breakpoints at single-base resolution in all but one family. Forty-two distinct SVs were observed, of which 40 are not reported previously. With this, we quadrupled the number of precisely mapped distinct SVs associated with BCM. This uniquely large collection of BCM-linked SVs not only includes a plethora of unique cases (such as the intrachromosomal gene conversion of an intragenic deletion or an SV undergoing a subsequent loss of a residual OPN1MW gene copy) but also allows us to extract some general features of the structural composition and insights into the mechanisms of nonrecurrent SVs at the OPN1LW/OPN1MW gene cluster. Moreover, our data suggest that such SVs often originate from subjects carrying high-number multicopy gene arrays which are predisposed to undergo genomic rearrangements.

Results

SVs at the OPN1LW/OPN1MW Gene Cluster Are a Common Cause of BCM.

Among our cohort of genetically confirmed BCM families (n = 213) which were collected and genetically investigated over a period of more than 25 y, we identified 73 families (34%) carrying an SV as a cause of the disease (Fig. 1). X-linked inheritance of the condition was commonly reported and documented in the families’ pedigree (Dataset S1). The cosegregation of the SV with the disease was tested and confirmed where affected family members were available. We only observed a single case for which we could establish a de novo event in the lineage of an individual family (BCM 17/SVar10, Table 1) (19).
Fig. 1.

SVs in BCM families: Study overview. Root diagram of the study population and the subcategorization of SVs. Numbers on the right indicate numbers of BCM families in the respective subcategory.

Table 1.

 Compilation of SVs at the OPN1LW/OPN1MW gene cluster in BCM families in this study

VariantVariant (nomenclature)*Deletion sizeOPN1copy no.No. of famsBreakpointanalysisLikely SV mechanism§Ref.
Intragenic deletions
 SVar1g.[154156356_154156497del;154193470_154193611del]142 bp + 142 bp1 x LW1 x MW1No homologyNHEJ + gene conversionThis study
LCR deletions
 SVar2g.154140509_154140919del411 bp1 x LW2 x MW14bp homology (GGGC)MMEJThis study
 SVar3g.154139421_154142962delins193,542 bp1 x LW3 x MW1No homologyReplicativeThis study
Deletions of the LCR and parts of the OPN1LW/OPN1MW gene cluster
 SVar4g.154101548_154180938del79,391 bp3 x MW1Homeology,Alu elementsAlu-AluThis study
 SVar5g.154106268_154173412delinsAAAC67,145 bp2 x Hyb2 x MW1No homologyNHEJThis study
 SVar6g.154107176_154212164del104,989 bp2 x MW1Homeology,Alu elementsAlu-AluThis study
 SVar7g.154109499_154145822del36,324 bp1 x pLW1 x Hyb1 x MW13-bp homology(ATC)MMEJThis study
 SVar8g.154109808_154237456del127,649 bp1 x MW1Homeology,Alu elementsAlu-AluThis study
 SVar9g.154112841_154214363del101,523 bp2 x MW2Homeology,Alu elementsAlu-AluThis study
 SVar10g.154118184_154266255del(de novo)148,071 bp1 x pMW1Homeology,Alu elementsAlu-Alu 19
 SVar11g.154120645_154184227delinsTAGCAGAG63,583 bp1 x Hyb3 x MW1No homologyNHEJThis study
 SVar12g.154124706_154153479del28,774 bp1 x pLW1 x MW12-bp homology(GC)NHEJThis study
 SVar13g.[154125822_154125825delCAGC;154126040_154180538delinsG]4 bp,54,498 bp1 x Hyb2 x MW1No homologyReplicative/NHEJThis study
 SVar14g.154127568_154170730del43,163 bp1 x Hyb6 x MW1No homologyNHEJThis study
 SVar15g.154127629_154239784del112,156 bp1 x MW1Homeology,Alu elementsAlu-AluThis study
 SVar16g.154130298_154239957del109,660 bp1 x Hyb2 x MW1No homologyNHEJThis study
 SVar17g.154133406_154186565del53,160 bp1 x pMW3 x MW1Homeology,Alu elementsAlu-AluThis study
 SVar18g.154134412_154175618del41,207 bp2 x MW12bp homology(CT)NHEJThis study
 SVar19g.[154135035_154135036ins26;154135114_154187762del]52,649 bp1 x pHyb1 x Hyb1 x MW4Insertion homologyReplicative 14
 SVar20g.154135236_154235350del100,115 bp1 x MW1Homeology,Alu elementsAlu-Aluthis study
 SVar21g.154136252_154225156del88,904 bp1 x pMW1 x MW24-bp homology (GTGC)MMEJ 14
 SVar22g.[154136509_154200716delinsCT;154200800_154200814del)64,208 bp,15 bp2 x MW2No homologyReplicativeThis study
 SVar23g.154136764_154145987del9,224 bp1 x pLW2 x MW23-bp homology(ATC)MMEJThis study
 SVar24g.[154136800_154218125delins51;154219480_154228735delinsGCC]81,326 bp,9,256 bp1 x pMW1 x Hyb1No homologyReplicativeThis study
 SVar25g.154136950_154167537del30,588 bp1 x Hyb3 x MW14-bp homology (CCAC)MMEJThis study
 SVar26g.154138410_154178979del40,570 bp3 x Hyb2 x MW22Homeology,Alu elementsAlu-AluThis study
 SVar27g.154139254_154194420del55,167 bp1 x pLW1 x Hyb1 x MW12-bp homology(CC)NHEJThis study
 SVar28g.154139374_154169145delins13929,772 bp3 x Hyb10 x MW3No homologyReplicativeThis study
 SVar29g.154139739_154160981del21,242 bp1 x Hyb4 x MW14-bp homology(GCTC)MMEJThis study
Deletions of parts of the OPN1LW/OPN1MW gene cluster (LCR intact)
 SVar30g.154143960_154258401del114,442 bp1 x pMW12-bp homology(AA)NHEJThis study
 SVar31g.154150103_154276841del126,739 bp1 x pLW13-bp homology(CAT)MMEJThis study
 SVar32g.154150323_154284093del133,771 bp1 x pLW13-bp homology(GAG)MMEJThis study
 SVar33g.154155559_154271811del116,253 bp1 x pLW1No homologyNHEJThis study
Complete OPN1LW/OPN1MW locus deletions
 SVar34g.154106985_154314620del207,636 bpNone13-bp homology (GGA), flanking AluMMEJ/Alu-AluThis study
 SVar35g.154113628_154285080del171,453 bpNone1No homologyNHEJThis study
 SVar36g.154120448_154281284delins180160,836 bpNone1Homeology with insertionReplicativeThis study
 SVar37g.[154129722_154137642delinsGCACT;154140507_154292032del]7,921 bp,151,526 bpNone14-bp homology (TGGG)ReplicativeThis study
 SVar38g.154135236_154273154del137,919 bpNone1Homeology,Alu elementsAlu-AluThis study
 SVar39g.154136998_154279848del142,851 bpNone12bp homology(GC)NHEJThis study
Complex SVs
 SVar40g.154111506_(154233181_154257596)delins [NC_000020.11: g.(pter_2875532)_2895538inv]121–146 kb1 x MW13-bp homology(CTG), flanking AluMMEJ/Alu-AluThis study
 SVar41g.154128542_154164019delins[g.154150904_154156875inv;AGTGCGG]35,377 bp1 x pLW3 x MW15-bp homology (ACTCC)ReplicativeThis study
 SVar42g.154143928_154338057delins[394;g.154394091_154480236inv]Del: 194 kb,Dup-Inv: 86 kbNone1No homologyReplicativeThis study

*Reference sequence: NC_000023.11 if not otherwise stated.

†Number and structure of remaining OPN1LW and OPNMW gene copies: LW, OPN1LW; MW, OPN1MW; Hyb, OPN1MW•OPN1LW hybrid; pLW/pMW, incomplete OPN1LW or OPNMW gene copy.

‡Number of BCM families.

§NHEJ, nonhomologous end joining; MMEJ, microhomology-mediated end joining; Replicative, replication-based mechanism of CNV formation (e.g., MMBIR); Alu-Alu, involving pairs of Alu repeat elements (e.g., Alu-mediated NAHR).

SVs in BCM families: Study overview. Root diagram of the study population and the subcategorization of SVs. Numbers on the right indicate numbers of BCM families in the respective subcategory. Compilation of SVs at the OPN1LW/OPN1MW gene cluster in BCM families in this study *Reference sequence: NC_000023.11 if not otherwise stated. †Number and structure of remaining OPN1LW and OPNMW gene copies: LW, OPN1LW; MW, OPN1MW; Hyb, OPN1MW•OPN1LW hybrid; pLW/pMW, incomplete OPN1LW or OPNMW gene copy. ‡Number of BCM families. §NHEJ, nonhomologous end joining; MMEJ, microhomology-mediated end joining; Replicative, replication-based mechanism of CNV formation (e.g., MMBIR); Alu-Alu, involving pairs of Alu repeat elements (e.g., Alu-mediated NAHR). Structure, extent, and composition of BCM-linked SVs observed in this study. (A) Map of the OPN1LW/OPN1MW gene array with a single OPN1LW and three downstream OPN1MW gene copies (according to the GRCh38/hg38 genome assembly). The OPN1LW and OPN1MW gene(s) are depicted by red and green arrows, respectively. The LCR is shown as a rectangle upstream of the OPN1LW gene. Flanking genes (MECP2, TEX28, and TKTL1) are shown by gray arrows. (B–F) Categories of BCM-linked SVs including deletions restricted to the LCR (B), SVs covering the LCR and parts of the OPN1LW/OPN1MW gene cluster (C), SVs covering OPN1LW/OPN1MW gene cluster but intact LCR (D), deletions of the OPN1LW/OPN1MW gene cluster (including the LCR) and extending into the downstream TEX28 and TEKTL1 genes (E), and complex structural rearrangements (F). The SV breakpoints are marked by brackets and deleted parts are indicated by lighter gray color. The presence of OPN1MW•OPN1LW hybrid genes is indicated by arrows half-colored in green and red. The blue box in SVar40 represents an interstitial insertion of chromosome 20 sequences. Additional copies of OPN1MW or OPN1MW•OPN1LW hybrid genes are indicated by the number in parentheses below the arrows. Note that the structure of SVar1, SVar2, and SVar42, which cannot be properly displayed at this scale, is displayed in , respectively.

Landscape of SVs at the OPN1LW/MW Gene Cluster in BCM Patients.

Of the 73 independent BCM families that tested positive for an SV in the initial PCR-based screening, we were able to define the exact breakpoint(s) at single-base resolution in all but one family. For breakpoint junction sequences see . In total, we identified 42 different SVs, of which 40 are not reported previously (Table 1). Seven of the SVs were observed recurrently in more than one family, including one very common deletion observed in 22 families (SVar26; see below). The 42 distinct SVs can be subclassified into 1) intragenic SVs (n = 1), 2) SVs restricted to the LCR (n = 2), 3) SVs encompassing the LCR and parts of the OPN1LW/OPN1MW gene cluster (n = 27), 4) SVs encompassing parts of the OPN1LW/OPN1MW gene cluster but excluding the LCR (n = 4), 5) SVs encompassing the LCR and the entire OPN1LW/OPN1MW gene cluster (n = 5), and 6) complex SVs (n = 3) (Table 1 and Fig. 2). All SVs involve deletions. Their size ranges from 142 bp to 207 kb with a majority between 20 and 100 kb in size and thus may escape detection by commercial CGH arrays. In fact, a benchmark experiment with DNA samples from three patients with SVs (SVar26, SVar41, and SVar42) using the Cytoscan HD probe array did only correctly call an SV at the OPN1LW/OPN1MW gene cluster for SVar42 ().
Fig. 2.

Structure, extent, and composition of BCM-linked SVs observed in this study. (A) Map of the OPN1LW/OPN1MW gene array with a single OPN1LW and three downstream OPN1MW gene copies (according to the GRCh38/hg38 genome assembly). The OPN1LW and OPN1MW gene(s) are depicted by red and green arrows, respectively. The LCR is shown as a rectangle upstream of the OPN1LW gene. Flanking genes (MECP2, TEX28, and TKTL1) are shown by gray arrows. (B–F) Categories of BCM-linked SVs including deletions restricted to the LCR (B), SVs covering the LCR and parts of the OPN1LW/OPN1MW gene cluster (C), SVs covering OPN1LW/OPN1MW gene cluster but intact LCR (D), deletions of the OPN1LW/OPN1MW gene cluster (including the LCR) and extending into the downstream TEX28 and TEKTL1 genes (E), and complex structural rearrangements (F). The SV breakpoints are marked by brackets and deleted parts are indicated by lighter gray color. The presence of OPN1MW•OPN1LW hybrid genes is indicated by arrows half-colored in green and red. The blue box in SVar40 represents an interstitial insertion of chromosome 20 sequences. Additional copies of OPN1MW or OPN1MW•OPN1LW hybrid genes are indicated by the number in parentheses below the arrows. Note that the structure of SVar1, SVar2, and SVar42, which cannot be properly displayed at this scale, is displayed in , respectively.

The centromeric breakpoints of all BCM-linked SVs are located between MECP2 and the OPN1LW/OPN1MW gene cluster or within the gene cluster, while nine SVs extend further into and disrupt downstream genes, either TEX28 or both TEX28 and TKTL1. Two SVs (SVar24 and SVar37) had single long, contiguous, and correctly oriented segments of OPN1LW/OPN1MW sequence of 1,353 bp and 2,863 bp, respectively, inserted between the proximal and distal breakpoints () and were thus formally considered as two consecutive deletions in cis (Table 1). More complex SVs were seen in SVar40, SVar41, and SVar42. SVar40 comprises a deletion of the LCR and large parts of the gene cluster combined with an interstitial insertion of >20 kb of chromosome 20 sequences, SVar41 is a deletion of the LCR and the OPN1LW gene combined with an insertion of a large segment of OPN1LW (exons 2 to 5) in reverse orientation, and SVar42 represents a deletion of the entire OPN1LW/OPN1MW gene cluster and the downstream genes TEX28 and TKTL1 combined with an inverted duplication of >80 kb of downstream sequence including eight genes (RPL10, DNASE1L1, TAZ, ATP6AP1, GDI1, FAM50A, PLXNA, and LAGE3) (). Notably, the inverted duplication is inserted about 60 kb upstream of its native copy and separated by the nonduplicated EMD and FLNA genes and their flanking low copy inverted sequence repeat which drives the frequent inversion of EMD and FLNA (20).

Breakpoint Sequence Analysis and Molecular Mechanism Underlying SVs.

The majority of BCM-linked SVs were deletions with or without few nucleotides (range 1 bp to 51 bp) inserted at the deletion breakpoint. Besides the more complex SVs with larger contiguous insertions (see above), we also observed five SVs where single (SVar3, SVar24, and SVar36) or multiple short discontiguous fragments (SVar28 and SVar42) of OPN1LW/OPN1MW gene cluster-specific sequences were inserted at the breakpoint junctions in direct or inverted orientation (). Moreover, we observed three SVs (SVar13, SVar19, and SVar22) with additional small deletions ranging from 4 bp to 26 bp in close vicinity upstream or downstream of the principal SV breakpoint (). Such small insertions of remnant sequences and close-by indels are typical features of replicative processes underlying SV formation (21, 22). We also investigated a 300-bp window of sequence upstream and downstream of the breakpoints for sequence homology and the presence of repetitive sequences which may provide clues for the underlying molecular mechanism. From this analysis we inferred about equal proportions of SVs most likely caused by nonhomologous end joining (NHEJ, n = 12), by microhomology-mediated end joining (MMEJ, n = 10), by inter-Alu deletion events (e.g., Alu-based NAHR, n = 10), and by replication-based structural rearrangements such as microhomology-mediated break-induced replication (MMBIR, n = 10) (Table 1) (23, 24). All but six of the SVs have their centromeric breakpoint located in a 39-kb region between MECP2 and the LCR. We observed some clustering of breakpoints, notably a cluster of six breakpoints in a narrow 750-bp region 7.2 to 8 kb upstream of OPN1LW. This sequence is characterized by a marked increase in GC content from below 25% to higher than 75% () and the presence of some predicted distal enhancer elements.

Evidence for the “Spread” of an Intragenic Deletion by Gene Conversion.

In family BCM 262 we detected an intraexonic deletion of 142 bp (c.807_948del) which is deduced to result in a truncated cone opsin lacking important functional domains of the polypeptide. Notably, this 142-bp deletion was present in the proximal OPN1LW gene as well as in the single distal OPN1MW gene copy (Table 1 and ). Such sequence homogenization of mutations in multiple OPN1LW/OPN1MW gene copies in the same gene array has been observed in several families with point mutations (11, 25, 26) and is explained by intrachromosomal gene conversion. The introduction of deleterious variants through gene conversion between paralogs (e.g., a pseudogene and a functional gene copy) usually involves base changes and small indels (27). To the best of our knowledge, the 142-bp deletion observed in this study is the largest reported disease-associated deletion most likely spread through intrachromosomal gene conversion. Mechanistically gene conversion relies on sequence homology between the donor and the recipient sequence. Therefore, gene conversion of a donor sequence containing a deletion in comparison to the recipient is sterically hindering and may involve a larger proportion of flanking homologous sequences.

SVar26 Is a Founder Mutation in BCM Families from the United States.

SVar26 was found recurrently in a total of 22 families, all originating from the United States. We employed haplotype marker analysis using 11 microsatellites encompassing a region of 3.9 Mb in the vicinity of the OPN1LW/OPN1MW gene cluster and demonstrated that all the 22 tested families share a common haplotype covering a physical region of 0.47 Mb (, Fig. S6). Similarly, we observed common haplotypes in families sharing SVar19 (three families investigated, all from the United States) and SVar28 (all three families investigated, all from France), respectively ().

Do SVs Still Undergo Unequal Homologous Recombination?

NAHR between the highly homologous OPN1LW and OPN1MW gene sequences has been attributed to underlie the variability in gene copy number at this locus as well as the frequent occurrence of OPN1LW•OPN1MW and OPN1MW•OPN1LW hybrid genes (5). Since NAHR relies on the homology between the recombining sequences, we asked whether NAHR can still occur on chromosomes rearranged by SVs. The occurrence of the common SVar26 deletion with multiple remaining OPN1LW/OPN1MW gene copies downstream of the deletion in 22 seemingly independent families provided the opportunity to address this question at a semipopulation level. We determined the copy number of OPN1LW and OPN1MW gene copies in all families carrying SVar26 and observed that there were five gene copies (three OPN1MW•OPN1LW hybrid gene copies and two OPN1MW gene copies as deduced from the multiplex ligation-dependent probe amplification [MLPA] results) in all tested subjects. Given the considerable age of this founder mutation—as suggested by the microsatellite marker data ()—this finding may be taken as evidence that intrachromosomal NAHR at the OPN1LW/OPN1MW gene locus is suppressed in subjects carrying SVar26. In contrast, we also noted that the centromeric breakpoints of SVar20 and SVar38 were identical while the telomeric breakpoints were fully conserved in sequence but just differ by the presence (in SVar20) or absence (in SVar38) of a single OPN1MW gene copy. Since the independent occurrence of these unique SVs at the homologous nucleotide positions is rather unlikely, we hypothesize that SVar38 derives from SVar20, or vice versa, by a subsequent intrachromosomal or interchromosomal NAHR event. In fact, we observed in these two families, both of German origin, a common haplotype including microsatellite and additional SNP markers strongly supporting our hypothesis that SVar20 and SVar38 belong to one lineage and that SVar20 underwent an intrachromosomal NAHR to give rise to SVar38 (Fig. 3).
Fig. 3.

Identical centromeric breakpoints and sequence-conserved telomeric breakpoints in SVar20 and SVar38 support a single lineage intrachromosomal NAHR event. (A) SVar20 and SVar38 share identical centromeric breakpoints while telomeric breakpoints share the same breakpoint sequence but differ by the presence/absence of a single OPN1MW gene copy. (B) Patient #11819/BCM 109 carrying SVar20 and patient #18281/BCM 89 carrying SVar38 share the same marker haplotype at Xq28. Markers including nine microsatellites and two SNPs are ordered according to their physical position (top to bottom). The localization of the OPN1LW/OPN1MW gene cluster is indicated by the arrow on the right. Shared alleles (microsatellite alleles coded in numbers) are depicted as gray squares. (C) Proposed sequence of events linking SVar20 and SVar38. Deletion of the LCR and parts of the OPN1LW/OPN1MW gene cluster results in SVar20, which retains a single OPN1MW gene copy. Subsequently, SVar20 undergoes intrachromosomal NAHR through homologous sequences downstream of the OPN1MW gene copies (relevant area of sequence homology indicated by the yellow patch; note that the intergenic sequence between OPN1MW gene copies is homologous to large parts of the TEX28 gene) which results in the loss of the terminal OPN1MW gene copy as seen in SVar38. In comparison with the structure of the normal gene cluster (Top), the SV breakpoints are marked by brackets and deleted parts are indicated by lighter gray color.

Identical centromeric breakpoints and sequence-conserved telomeric breakpoints in SVar20 and SVar38 support a single lineage intrachromosomal NAHR event. (A) SVar20 and SVar38 share identical centromeric breakpoints while telomeric breakpoints share the same breakpoint sequence but differ by the presence/absence of a single OPN1MW gene copy. (B) Patient #11819/BCM 109 carrying SVar20 and patient #18281/BCM 89 carrying SVar38 share the same marker haplotype at Xq28. Markers including nine microsatellites and two SNPs are ordered according to their physical position (top to bottom). The localization of the OPN1LW/OPN1MW gene cluster is indicated by the arrow on the right. Shared alleles (microsatellite alleles coded in numbers) are depicted as gray squares. (C) Proposed sequence of events linking SVar20 and SVar38. Deletion of the LCR and parts of the OPN1LW/OPN1MW gene cluster results in SVar20, which retains a single OPN1MW gene copy. Subsequently, SVar20 undergoes intrachromosomal NAHR through homologous sequences downstream of the OPN1MW gene copies (relevant area of sequence homology indicated by the yellow patch; note that the intergenic sequence between OPN1MW gene copies is homologous to large parts of the TEX28 gene) which results in the loss of the terminal OPN1MW gene copy as seen in SVar38. In comparison with the structure of the normal gene cluster (Top), the SV breakpoints are marked by brackets and deleted parts are indicated by lighter gray color.

High OPN1LW/OPN1MW Gene Copy Number May Predispose to the Occurrence of SVs.

Given the high prevalence of SVs in BCM we asked whether certain features of the OPN1LW/OPN1MW gene cluster predispose to the occurrence of SVs. A special characteristic of the OPN1LW/OPN1MW gene cluster is its repetitive nature with multiple copies of genes and intergenic sequences with very high sequence homology, a feature which is known to induce nonallelic recombination events. We therefore investigated whether a large copy number (prior to the SV event) may predispose to the occurrence of nonrecurrent SVs. For this purpose, we determined the copy number of remaining OPN1LW and OPN1MW genes in subjects with SVs utilizing qPCR and MLPA. We used these data to deduce—by parsimonious assumptions—the minimal number of copies of the individual ancestral array prior to the occurrence of the SV event. For instance, for a subject with a single full or partially intact OPN1MW copy we conservatively assumed an ancestral copy number of two (one OPN1LW and one OPN1MW gene copy). We excluded SVar1, the intragenic 142-bp deletion which implicates gene conversion, as an underlying mechanism as well as all SVs involving deletions of the entire gene locus or all OPN1MW genes which prohibits ancestral copy number reconstruction. We compared the data from the BCM-SV group (n = 33; i.e., distinct SVs) with those from a group of color vision-normal observers of German descent (CVNO, n = 35) (Table 2).
Table 2.

 Deduced total OPN1LW/OPN1MW copy number prior to SV in comparison with healthy controls

OPN1LW and OPN1MW copiesNo. of BCM (prior to SV)*No. of controls
n = 2716526
n = 3921
n = 471739
n = 565
n = 621
n = 700
n = 810
..00
n = 1410
Sum3335

*Number of deduced ancestral gene arrays (prior to the SV formation) with a given total OPN1LW and OPN1MW copy number among BCM-linked SVs.

†Number of subjects with experimentally determined total OPN1LW and OPN1MW copy number in healthy controls. Shadings distinguish groups of subjects with three or fewer and more than three OPN1LW/MW gene copies, respectively.

Deduced total OPN1LW/OPN1MW copy number prior to SV in comparison with healthy controls *Number of deduced ancestral gene arrays (prior to the SV formation) with a given total OPN1LW and OPN1MW copy number among BCM-linked SVs. †Number of subjects with experimentally determined total OPN1LW and OPN1MW copy number in healthy controls. Shadings distinguish groups of subjects with three or fewer and more than three OPN1LW/MW gene copies, respectively. Mean total copy number was 3.31 for the CVNO cohort and 4.03 for the BCM-SV group. However, this difference in mean total copy number is in parts inflated due to the very large gene arrays (reconstructed with n = 8 and n = 14 copies) in SVar14 and SVar28 in the BCM-SV group. Irrespective of these outliers, the distribution of the total copy number was shifted to higher copy numbers in the BCM-SV group with a statistically significant increase in the fraction of subjects with three or more gene copies in the BCM-SV group (chi-squared test, P = 0.028) and a reduced fraction of OPN1LW/OPN1MW arrays with three gene copies in the BCM-SV group (chi-squares test, P = 0.0065), with this statistical approach being rather robust against outliers. Moreover, we noted that a significant higher proportion of arrays in the BCM-SV group carry OPN1MW•OPN1LW hybrid genes (15/33, 44%) compared to the CVNO group (3/35, 8.5%). We thus hypothesize that large OPN1LW/OPN1MW gene arrays are less stable, i.e., prone to NAHR and the formation of OPN1MW•OPN1LW hybrid genes, and predisposed for the occurrence of deleterious nonrecurrent SVs.

Discussion

SVs account for 34% of the BCM index cases in our cohort of 213 molecularly confirmed families. There are only a few other genes or loci underlying inherited retinal dystrophies with a similar or exceeding fraction of SVs in their mutation spectrum, such as CHM, EYS, PRPF31, and CLN3, the latter due to the high prevalence of a founder mutation (see also ref. 28) or in a few retinal dystrophies in which SVs are the primary disease mechanisms due to chromatin topology disorganization (MCDR3, ref. 29; RP17-linked Retinitis pigmentosa, ref. 30; autosomal dominant cone dystrophy with early tritan color vision defect, ref. 31). SVs associated with BCM have been reported in a number of prior studies (2, 3, 12–18, 32–41). However, to the best of our knowledge, exact breakpoints of the SVs were only reported in a rather small subset of only 14 BCM families, all carrying different SVs except for an SV shared in two families (17). This scarce information on exact breakpoints is likely due to the complexity and repetitive structure of the OPN1LW/OPN1MW gene cluster that complicates and hampers molecular analysis. In the present study, we determined the exact breakpoints in 72 additional independent BCM families. We identified 42 different SVs of which 40 are not reported previously. Our study thereby increases the number of BCM families with precisely mapped SVs sixfold (from 14 to 86) and the number of different SVs by a factor of 4 (from 13 to 53). The knowledge of the precise SV breakpoints enabled us to design diagnostic PCR assays (primer sequences in ), which now allow easy and reliable female carrier testing in families at risk. This obviates the need for qPCR-based copy number analysis of LCR sequences or—for SVs with intact LCR—indirect marker-based segregation analysis in BCM families which are typically used for this purpose until now. The LCR upstream of the OPN1LW/OPN1MW gene cluster governs the expression of the downstream cone opsin genes (6, 7). Its absence (e.g., due to deletions) strongly impairs OPN1LW and OPN1MW gene expression and results in BCM. This offers the opportunity to indirectly map the extent of functionally relevant sequences of the human LCR in BCM patients with SVs. In the present patient series, we identified two deletions that were restricted to the LCR: SVar3 (in family BCM 215) with a deletion of 3,542 bp and SVar2 (in family BCM 74) with a deletion of 411 bp. The latter is the smallest LCR deletion ever reported in BCM. Together with an upstream breakpoint (HS102) reported by Nathans et al. (2), the present data refine the minimal crucial sequence of the LCR to 358 bp which covers the evolutionarily highest conserved sequence element in this region (, Fig. S8). Given the utility of the LCR/OPN1LW promoter (6, 42), our refinement may be important for the further development of more compact promoters to drive transgene expression in gene therapy applications aiming for strong cone photoreceptor-specific expression. The vast majority of SVs in the present study were unique and only observed in single families. However, we also identified one SV, SVar26, which was rather common and accounted for about 30% of all BCM families with SVs in the present study. Remarkably, SVar26 was found exclusively in families from the United States and our marker analysis confirmed the presence of a founder mutation. The size of the deletions and the localization of breakpoints of the SVs vary considerably. While the centromeric breakpoint is always located downstream of MECP2, the extent of deletion at the telomeric side can include parts of or the entire TEX28 and TKTL1 genes downstream of the OPN1LW/OPN1MW gene cluster. Notably, there is not a single region of overlap shared by all SVs. Although the majority of SVs did involve deletions of the LCR, it is still intact in five of the SVs. Therefore, multiple probes targeting different parts of the OPN1LW/OPNMW gene cluster are required to be tested for reliable detection or exclusion of SVs linked to BCM. Thirty-two of the SVs have at least one of the breakpoints located within the OPN1LW/OPN1MW gene cluster which represents a tandem arrangement of highly homologous sequences of 37- to 38-kb unit size composed of genic (OPN1LW or OPN1MW or OPN1LW•OPN1MW hybrid genes) and intergenic sequences. Such low copy repeat arrangements are frequently involved in the formation of recurrent copy number variations but also favor the occurrence of nonrecurrent SVs such as those reported in this study (24). In line with this, telomeric breakpoints of nonrecurrent duplications and complex SVs observed in patients with MECP2 duplication syndrome are frequently located in the OPN1LW/OPN1MW gene cluster (43–45). Such a predisposition to genomic rearrangements may further be enhanced by an increased number of repeats. Therefore, a major finding of this study is the evidence that BCM-linked SVs may occur most frequently in subjects with larger than average OPN1LW/OPN1MW gene arrays. Or, conversely, subjects with large arrays are at higher risk for the formation of a deleterious SV. While we were unable to determine the ancestral copy number (prior to the occurrence of the SV) experimentally, we applied very conservative and parsimonious assumptions for the “reconstruction” of the composition of the ancestral gene array. Thus, on average, the true OPN1LW/OPN1MW copy number on the ancestral chromosomes likely had been even higher. There is some prior evidence from human disease that a higher copy number of low copy repeat sequences predisposes to genomic rearrangements. For instance, Liu et al. observed that triplication at the CMT1A locus emerges from duplications (three copies of the CMT1A low copy repeat) at a much higher rate than the rate of de novo duplications (46). Moreover, duplication and subsequent triplication of the SCNA gene in different branches of the same family with autosomal dominant parkinsonism (47) and the unique presence of PRSS1 duplications and triplications in French families with hereditary pancreatitis that originate from a single founder allele (48) argue for an increased susceptibility for genomic rearrangement. Notably, in all three instances these changes in copy number involve intrachromosomal (i.e., interchromatidal recombination) rather than interchromosomal NAHR events suggesting some inherent instability of a chromosome carrying multiple copies of a low copy repeat sequence. All these studies involve a further gain in copy number. Yet, this likely represents an ascertainment bias since reversion events (e.g., a duplication reverted to normal single copy) will hardly be detected in such patient studies. In contrast, a predominance of copy losses over copy gains has already been noted in the rates of mutations and revertants at the Drosophila bar locus (e.g., bar to ultrabar, and vice versa) (49, 50), a prototypical locus for dosage effects due to changes in copy number (51, 52), and also observed in humans at several loci for genomic disorders upon sperm genotyping (53) as well as in the frequency of reciprocal recurrent deletions/duplication in the allelic Smith–Magenis syndrome and Potocki–Lupski syndrome (54). This predominance of deletions is at least in part due to intrachromosomal (i.e., intrachromatidal or interchromatidal) NAHR which is equal or higher than interchromosomal NAHR at all tested loci (53). Other than in these instances, our data suggest that also nonrecurrent SVs involving deletions are driven by an inherent increased instability of large, high-copy gene arrays at the OPN1LW/OPN1MW gene cluster. Although further studies are recommended for validation, such an increased instability of large arrays may explain the high proportion of SVs in subjects with BCM.

Subjects and Materials and Methods

Patient Recruitment and Clinical Evaluation.

The study was conducted pro- and retrospectively in accordance with the tenets of the World Medical Association Declaration of Helsinki and approval was obtained from the respective local research and ethical boards or dependent on the local regulatory bodies at the time the patients were recruited as part of local clinical studies or ad hoc at different centers specialized in inherited retinal diseases during routine clinical diagnostics. A clinical diagnosis of BCM was based on ophthalmologic examination according to local protocols or local clinical standards of the recruiting centers. Venous blood was taken from patients and family members after informed consent and sent to the Tuebingen group for genetic analysis. Specifically, this study was approved by the Ethics Board of the Medical Faculty, Eberhard Karls University Tuebingen under the study no. 349/2003V and 116/2015BO2.

Genotyping of the OPN1LW/OPN1MW Gene Cluster.

Genomic DNA was isolated from blood samples according to standard procedures. All patients underwent a routine screening protocol to test for the structure and integrity of the OPN1LW/OPN1MW gene cluster which includes PCR-based sequence-tagged site (STS) content mapping for amplicons covering the LCR (amplicon A), the proximal promoter and exon 1 of the OPN1LW gene and the OPN1MW gene, respectively (amplicons B and C), and amplicons D and E covering exon 4 and exon 5 not distinguishing between OPN1LW and OPN1MW (32). Each PCR assay for amplicons A to C was performed as duplex PCR including an additional primer pair for an autosomal locus as amplification control. The specificity of the screening protocol was validated in 50 males of European origin with normal color vision. Subjects with evidence for deletions or rearrangements from the initial screen were further investigated to delineate the structure, extent, and the breakpoints of the SVs. As a first step, we applied STS content mapping in unique sequences outside the OPN1LW/OPN1MW sequence repeat to refine centromeric breakpoints between MECP2 and OPN1LW, and where possible also the telomeric breakpoint downstream of the gene cluster. To cover breakpoint junctions, we used multiple approaches including 1) long-distance PCR with primers as defined by refined breakpoint mapping, 2) long-distance PCRs with a forward primer as defined by refined breakpoint mapping and a panel of reverse primers located at distinct sites within the OPN1LW/OPN1MW gene cluster, 3) genome walking applying the APAgene Gold Genome Walking Kit (Bio S&T), and 4) inverse PCR (55). Breakpoint junction PCR and long-distance PCR fragments were sequenced by primer walking using BigDye Terminator Cycle Sequencing 1.1 chemistry (Thermo Fisher Scientific GmbH) and sequencing products electrophoretically separated on an ABI 3130XL capillary sequencer instrument. Breakpoint junction sequences were used to design primers for SV-specific diagnostic PCR assays which were used for segregation analysis and carriership detection (, Table S2). The copy number of inserted chromosome 20 sequences in SVar40, and duplicated-inverted X-chromosomal sequences in SVar42 was determined by custom designed real-time qPCR assays using SYBR Green chemistry (Quantitect SYBR Green PCR Kit; Qiagen) on an ABI 7500 instrument. Prior to the copy number assay, DNA samples of patients and controls were adjusted and normalized based on Ct values obtained with a custom copy number control assay targeting a sequence at the SDC4 locus on chromosome 20q. The total number of OPN1LW/OPN1MW gene copies was determined by means of qPCR or MLPA with genomic DNA as template. For qPCR, the amount of input DNA from probands and controls was adjusted based on the result obtained with a RNaseP TaqMan Copy Number Reference Assay (Life Technologies). We used two different TaqMan assays that target different parts of the OPN1LW/OPN1MW gene cluster: the HS_01912094 assay (Thermo Fisher/Life Technologies) targeting a common sequence in exon 6 of OPN1LW and OPN1MW and a custom-designed TaqMan assay (employing RGCP_TQF [5′-CCCAACAGAAAGCTGAAAGC-3′] as forward and RGCP_TQR [5′-GTGCAAAACTTTCGGATTGG-3′] as reverse primers, respectively, and RGCP_TQP [5′-CAGCCCGAGTCCTGCCATTGG-3′] with 5′-FAM and 3′-BHQ1 modifications as probe primer) targeting a common segment of intron 1. The two qPCR assays targeting OPN1LW and OPN1MW sequences and another RNaseP TaqMan Copy Number Reference Assay—performed as triplicates for each sample—were done in parallel on the same instrument run. For MLPA we used a premarketing release version of the SALSA X080-B1 Opsin probe-mix (MRC Holland) targeting sequences of the LCR, and individual exons of OPN1LW and OPN1MW specifically. MLPA was performed according to the manufacturer’s recommendations and amplification products separated by capillary electrophoresis on an ABI 3130XL instrument. Data analysis was done using the Coffalyser.Net software (MRC Holland) which calculates ratios in comparison to reference samples coprocessed with the test samples. For both qPCR and MLPA, we used a series of male controls with defined OPN1LW/OPN1MW copy number (ranging from one to six copies) as determined by Southern blot of NotI-digested and pulsed-field electrophoretic separated genomic DNA and/or Fiber-FISH (56), to generate copy number-dependent probe/control ratio and ΔCt calibration curves, respectively.

In Silico Analysis of SV Breakpoints.

We performed in silico sequence analysis of 600 bp in the vicinity of the SV breakpoints (300 bp upstream and 300 bp downstream of the actual breakpoint). We used RepeatMasker for the identification of repetitive sequences and performed a BLAST2seq analysis of the upstream and downstream breakpoint sequences for the detection of sequence homologies. Local G/C content was calculated with a sliding window of 50 bp.

Microsatellite Markers.

Eleven microsatellite markers (including five markers from public databases and six novel designed CA- and TA-repeat markers) encompassing a 3.9-Mb region on chromosome Xq28 were used for haplotyping. Microsatellite markers were PCR-amplified from genomic DNA using AmpliTaq Gold (Thermo Fisher) or Multiplex PCR Master Kit (Qiagen) reagents and—owing to fluorescence labeling of primers—separated and detected on an ABI3130-XL capillary electrophoresis instrument.

Array-CGH Analysis.

Array-CGH analysis applying the Affymetrix Cytoscan HD array was done at Atlas Biolabs GmbH, Berlin, Germany.

Web Resources.

Resources used in this study include BLAST2Seq (https://blast.ncbi.nlm.nih.gov/Blast.cgi), GenBank (https://www.ncbi.nlm.nih.gov/genbank/), HGMD (www.hgmd.cf.ac.uk/), OMIM (https://omim.org/), and RepeatMasker (https://www.repeatmasker.org/).
  55 in total

1.  High-resolution microarray analysis unravels complex Xq28 aberrations in patients and carriers affected by X-linked blue cone monochromacy.

Authors:  S A Yatsenko; H A Bakos; K Vitullo; M Kedrov; A Kishore; B J Jennings; U Surti; M A Wood-Trageser; S Cercone; A N Yatsenko; A Rajkovic; A Iannaccone
Journal:  Clin Genet       Date:  2015-07-28       Impact factor: 4.438

2.  REVERSE MUTATION OF THE BAR GENE CORRELATED WITH CROSSING OVER.

Authors:  A H Sturtevant; T H Morgan
Journal:  Science       Date:  1923-06-29       Impact factor: 47.728

3.  A new mechanism in blue cone monochromatism.

Authors:  A S Ladekjaer-Mikkelsen; T Rosenberg; A L Jørgensen
Journal:  Hum Genet       Date:  1996-10       Impact factor: 4.132

4.  The cloning of the Bar region and the B breakpoint in Drosophila melanogaster: evidence for a transposon-induced rearrangement.

Authors:  S I Tsubota; D Rosenberg; H Szostak; D Rubin; P Schedl
Journal:  Genetics       Date:  1989-08       Impact factor: 4.562

5.  Molecular genetics of inherited variation in human color vision.

Authors:  J Nathans; T P Piantanida; R L Eddy; T B Shows; D S Hogness
Journal:  Science       Date:  1986-04-11       Impact factor: 47.728

6.  Genetic heterogeneity among blue-cone monochromats.

Authors:  J Nathans; I H Maumenee; E Zrenner; B Sadowski; L T Sharpe; R A Lewis; E Hansen; T Rosenberg; M Schwartz; J R Heckenlively
Journal:  Am J Hum Genet       Date:  1993-11       Impact factor: 11.025

7.  Deletion of the X-linked opsin gene array locus control region (LCR) results in disruption of the cone mosaic.

Authors:  Joseph Carroll; Ethan A Rossi; Jason Porter; Jay Neitz; Austin Roorda; David R Williams; Maureen Neitz
Journal:  Vision Res       Date:  2010-07-16       Impact factor: 1.886

8.  Human cone visual pigment deletions spare sufficient photoreceptors to warrant gene therapy.

Authors:  Artur V Cideciyan; Robert B Hufnagel; Joseph Carroll; Alexander Sumaroka; Xunda Luo; Sharon B Schwartz; Alfredo Dubra; Megan Land; Michel Michaelides; Jessica C Gardner; Alison J Hardcastle; Anthony T Moore; Robert A Sisk; Zubair M Ahmed; Susanne Kohl; Bernd Wissinger; Samuel G Jacobson
Journal:  Hum Gene Ther       Date:  2013-10-30       Impact factor: 5.695

9.  Mechanism, prevalence, and more severe neuropathy phenotype of the Charcot-Marie-Tooth type 1A triplication.

Authors:  Pengfei Liu; Violet Gelowani; Feng Zhang; Vivian E Drory; Shay Ben-Shachar; Erin Roney; Adam C Medeiros; Rebecca J Moore; Christina DiVincenzo; William B Burnette; Joseph J Higgins; Jun Li; Avi Orr-Urtreger; James R Lupski
Journal:  Am J Hum Genet       Date:  2014-02-13       Impact factor: 11.025

10.  Inverted genomic segments and complex triplication rearrangements are mediated by inverted repeats in the human genome.

Authors:  Claudia M B Carvalho; Melissa B Ramocki; Davut Pehlivan; Luis M Franco; Claudia Gonzaga-Jauregui; Ping Fang; Alanna McCall; Eniko Karman Pivnick; Stacy Hines-Dowell; Laurie H Seaver; Linda Friehling; Sansan Lee; Rosemarie Smith; Daniela Del Gaudio; Marjorie Withers; Pengfei Liu; Sau Wai Cheung; John W Belmont; Huda Y Zoghbi; P J Hastings; James R Lupski
Journal:  Nat Genet       Date:  2011-10-02       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.