Literature DB >> 29531775

An unclassified variant of CHD7 activates a cryptic splice site in a patient with CHARGE syndrome.

Yuko Katoh-Fukui1, Shuichi Yatsuga2, Hirohito Shima1, Atsushi Hattori1, Akie Nakamura1, Kohji Okamura3, Kumiko Yanagi4, Manami Iso4, Tadashi Kaname4, Yoichi Matsubara5, Maki Fukami1.   

Abstract

CHARGE syndrome is a rare autosomal dominant disease that is typically caused by heterozygous CHD7 mutations. A de novo variant in a CHD7 splicing acceptor site (NM_017780.3:c.7165-4A>G) was identified in a Japanese boy with CHARGE syndrome. This variant has been considered to be an "unclassified variant" due to its position outside the consensus splicing sites. In this study, abnormal splicing derived from this known variant was confirmed by cDNA sequencing.

Entities:  

Year:  2018        PMID: 29531775      PMCID: PMC5842149          DOI: 10.1038/hgv.2018.6

Source DB:  PubMed          Journal:  Hum Genome Var        ISSN: 2054-345X


CHARGE syndrome (CS) (OMIM 214800) is a rare, autosomal dominant syndromic disease that is characterized by a variety of anomalies, including choanal atresia and malformations of the heart, ears and eyes. These malformations include coloboma, microphthalmia, septal defects in the heart, growth retardation and delayed mental development, genital hypoplasia, external and middle ear abnormalities, and deafness. Furthermore, facial palsy, cleft palate and dysphagia are commonly associated with CS.[1-3] CS is estimated to occur at a rate of 1 in 8,500–12,000 live births.[2,4,5] CS is mainly caused by loss-of-function mutations within CHD7 on chromosome 8q12. DNA sequencing detects CHD7 mutations in 58–64% of patients clinically diagnosed with CS.[5-7] A large-scale study indicated that among pathogenic CHD7 mutations, 78% constitute nonsense or frameshift mutations; 8% are missense mutations; and 11% are splice site mutations.[8] Currently, 581 mutations have been classified as pathogenic, whereas 93 mutations remain unclassified (CHD7 database: https://molgenis51.gcc.rug.nl/menu/main/dataexplorer?entity=Mutations&hideselect=true#). These unclassified mutations mostly comprise missense substitutions or occur around the exon/intron junctions. This study was approved by the Institutional Review Board Committees of the National Research Institute for Child Health and Development and Kurume University. The patient was a 7-year-old Japanese boy diagnosed with CS on the basis of the phenotypic expression of the disease, namely, choanal atresia, cleft palate, dysphagia, cryptorchidism, advanced hearing loss and cardiac septal defects. No coloboma or microphthalmia has been observed thus far. An apparent retardation of growth or development has also not been identified. The clinical details of the patient will be described in a subsequent report. Informed consent was obtained from the patient’s parents. Exome sequencing was performed as described earlier.[9] Using leukocytes obtained from the patient and his parents, genomic DNA was extracted using the DNeasy blood and tissue kit (Qiagen, Hilden, Germany). DNA fragments corresponding to protein-coding sequences were enriched using commercially available oligonucleotide libraries followed by next-generation sequencing (HiSeq 2500; Illumina, San Diego, CA, USA). The read lengths of the single- and paired-end libraries were 125 bases. The sequences obtained from the patient and his parents were aligned to the reference genome (GRCh37/hg19) using Burrows-Wheeler Aligner version 0.6.2 (Geeknet, Mountain View, CA, USA). Removal of potential PCR duplicates, recalibration of base quality values, local realignments and variation calls were performed using Samtools, Picard (http://broadinstitute.github.io/picard/) and GATK version 2.3–9. Filtration of putative false-positive variations was executed using optimization criteria, quality value recalibration, indel realignment, variant quality score recalibration, variant quality score recalibration PASS filter and GATK-recommended hard filters (GATK best practice v3/v4). Nucleotide substitutions with allele frequencies above 1% in the normal population were excluded as polymorphisms. A rare CHD7 variant detected by exome sequencing was confirmed by Sanger sequencing using the CHD7-intron 33 FW1 (5′- CAGCTCTGTGCACCAGTCAT-3′) and CHD7-exon 34 RV1-2 (5′- AAGGCATCAGACACTGGTGT-3′) primer pair. We retrieved the mutation from previous reports or from public databases of normal populations (The ExAC browser, http://exac.broadinstitute.org/; 1000 Genome Browser, https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes/). Total RNA was extracted from lymphoblastoid cells from the patient’s blood sample using the RNeasy RNA extraction kit (Qiagen). cDNA was reverse-transcribed from total RNA with the EMD Millipore Novagen oligo(dT) primer (Thermo Fisher Scientific, Waltham, MA, USA) and Applied Biosystems High-Capacity RNA-to-cDNA kit (Foster City, CA, USA). cDNA was PCR-amplified using CHD7 coding sequence amplification with the CHD7-exon 33 FW2 (5′- GGCCTCATCCCAGGTTACAC-3′) and CHD7-exon 34 RV1-2 (5′- AAGGCATCAGACACTGGTGT-3′) primer pair. Amplified cDNA fragments were sub-cloned into the TOPO vector (Life Technologies, Carlsbad, CA, USA), and sequences were confirmed by Sanger sequencing. Mutation Taster (http://www.mutationtaster.org/index.html), NNSPLICE (Berkeley Drosophila Genome Project) (http://fruitfly.org/seq_tools/splice.html), Human Splicing Finder (http://www.umd.be/HSF3/index.html) and MaxEntScan::score3ss (http://genes.mit.edu/burgelab/maxent/Xmaxentscan_scoreseq_acc.html) were used for predicting splice site alterations. We detected a rare substitution that led to an alteration in the splicing acceptor and an ectopic premature termination within the thirty-third intron of CHD7 (GRCh37, Chr8:61769000, NM_017780.3:c.7165–4A>G). This genomic substitution has been reported in patients with CS (HGMD number CS061264), but has been referred to as an “unclassified variant”.[6,10] Exome trio analyses using leukocyte genomic DNA derived from the patient and his parents were performed. The NM_017780.3:c.7165–4A>G substitution was not detected in the parents; thus, this variant was considered to be a de novo mutation. Genomic DNA from the patient was sequenced directly to confirm the NM_017780.3:c.7165–4A>G substitution (Figure 1a).
Figure 1

Single-nucleotide substitution in the patient’s genome. (a) The electropherogram shows a sequence chromatogram of the genomic DNA of the patient with CHARGE syndrome. Rare substitutions in CHD7, namely, GRCh37, Chr8:61769000, NM_017780.3:c.7165–4A>G (red arrow) and the rs2272727 NM_017780.3:c.7356A/G polymorphism in exon 34 (black arrow), are shown. (b) The genomic linkage between the c.7165–4G substitution and the A polymorphism at c.7356 was confirmed using the cloned genomic sequence.

To identify the allele-specific sequence, the rs2272727 NM_017780.3:c.7356A>G polymorphism was used as a marker for each allele. Because the nucleotide position of rs2272727 is 195 bp away from the NM_017780.3:c.7165–4A>G substitution, the next-generation sequencing library could not determine the genomic linkage of these nucleotide variations. Thus, the genomic linkage between the c.7165–4G substitution and A polymorphism of rs2272727 (NM_017780.3:c.7356A/G) was confirmed by direct sequencing of genomic DNA sub-cloned into the TOPO vector (Life Technologies; Figure 1b). Exome sequencing indicated that the patient’s mother possessed the G polymorphism of rs2272727, while the A polymorphism was assumed to be of paternal origin. Moreover, to verify the single-nucleotide substitution-induced activation of cryptic splice sites, cDNA was reverse-transcribed using total RNA from lymphoblastoid cells derived from the patient’s blood sample, and the PCR-amplified cDNA was sub-cloned and sequenced. Of 12 cDNA clones, 5 contained the cryptic splice acceptor site three bases upstream of the original site, and the remaining 7 had the wild-type (WT) sequence (Figure 2a). All of the WT cDNA clones had the G polymorphic variation at c.7356, whereas all of the mutated cDNA clones had an A residue at this site (Figure 2b). These data indicate that most of the RNA transcribed from the mutated allele possessed the cryptic splice acceptor. Furthermore, the available cryptic splice site prediction methods[11-13] were examined to evaluate the impact of this variation on splicing in silico. The NM_017780.3:c.7165–4A>G substitution was predicted to lead to “WT acceptor loss/acceptor decrease” and a “cryptic acceptor site” on the basis of model prediction, whereas the calculated values of each predictor were moderate compared to those of the typical splice acceptor site mutations associated with patients with CS, such as rs587783448, which are considered pathogenic (Supplementary Table S1).
Figure 2

Single-nucleotide substitution-induced splicing abnormality in the patient. (a) CHD7 cDNA was reverse-transcribed using total RNA from blood-derived lymphoblastoid cells and was then PCR-amplified, cloned and sequenced. Sequencing revealed the activation of a cryptic splice site three bases upstream of position c.7165 in exon 34. Furthermore, the variant RNA contains the UAG (TAG) (*) termination codon immediately after the cryptic splicing acceptor. (b) The schematic shows wild-type RNA and the consequences of the substitution at the –4 position of intron 33. In the wild-type gene, splicing occurs at the wild-type splice acceptor at the beginning of exon 34. The c.7165–4A>G substitution at the end of intron 33 activates a cryptic splice site immediately upstream of position c.7165 in exon 34. Moreover, all tested cDNA clones with the A polymorphism at c.7356 possessed a cryptic splice site.

We detected the NM_017780.3:c.7165–4A>G substitution four bases upstream of the WT splice acceptor site in exon 34 of CHD7. According to cDNA analyses, this substitution leads to the loss of the normal acceptor site and gain of a cryptic splicing acceptor three bases upstream of the WT splice acceptor site. Moreover, the variant RNA contains a termination codon UAG (TAG) immediately after the cryptic splicing acceptor, leading to a loss of 610 C-terminal amino acids from the full-length CHD7 protein (2,998 amino acids). The WT exons at positions 34–38 encode a nuclear localization signal and the BRK domain, a protein module associated with the CHROMO domain and DEAD/DEAH box helicase domain.[14,15] More than 20 nonsense and frameshift mutations have been reported in patients with CS in this genomic region.[8,10] Notably, de novo heterozygous frameshift mutations in exon 38 (last exon), which lead to the C-terminal truncation of CHD7 (1–2,988), are reported to be pathogenic.[10,16,17] Therefore, we propose that premature termination of CHD7 due to the NM_017780.3:c.7165–4A>G substitution, adversely affects normal growth and development. Sequencing analyses of individual cDNA clones from the patient’s RNA indicated that the activity of nonsense-mediated mRNA decay is probably subtle, presumably because of extremely long 3′ untranslated regions.[18-20] To date, five patients with CS, including the patient in our study, have been found to harbor this substitution,[6,10] whereas this substitution is not found in the general population. In previous reports,[6,10] neither genomic DNA from parents nor RNA from patients was examined for the presence of cryptic splicing sites. Therefore, the NM_017780.3:c.7165–4A>G substitution has been considered to be an “unclassified variant”. This report is the first report of a laboratory-based assessment of this splicing variation and confirmation of the pathogenicity of the NM_017780.3:c.7165–4A>G substitution. Although reliable predictor systems have been developed, the prediction of some atypical sequence variations in border splice regions is unclear. The NM_017780.3:c.7165–4A>G substitution did not fit with the consensus sites that disrupt the splicing code well. The prediction scores of the NM_017780.3:c.7165–4A>G substitution did not show marked changes compared to a typical splice site substitution, NM_017780 c.5666-2A>C (Supplementary Table S1). In addition to the NM_017780.3:c.7165–4A>G substitution, 17 mutations at the exon/intron junction within CHD7 are also considered “unclassified variants” in the CHD7 database. These substitutions are located at 1–11 base pairs from the exon/intron junction and may therefore include additional pathogenic variants. In conclusion, we report a case of CS in a 7-year-old Japanese boy with a pathogenic de novo CHD7 mutation at an atypical splice site, which was previously described as an “unclassified variant”. Experimental assessment of splicing variants is critical and contributes to the clinical diagnosis of certain unclassified mutations. Availability of data and material The data and materials described in this report are available on request. The identified mutation will be uploaded on the ClinVar website https://www.ncbi.nlm.nih.gov/clinvar/ (the accession SCV000599799). Supplementary Information (Supplementary Table S1; NM_017780.3:c.7165–4A>G evaluation using a splice site predictor) is available on the Human Genome Variation website.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  20 in total

1.  Improved splice site detection in Genie.

Authors:  M G Reese; F H Eeckman; D Kulp; D Haussler
Journal:  J Comput Biol       Date:  1997       Impact factor: 1.479

2.  Mutations in the CHD7 gene: the experience of a commercial laboratory.

Authors:  Cynthia F Bartels; Cheryl Scacheri; Lashonda White; Peter C Scacheri; Sherri Bale
Journal:  Genet Test Mol Biomarkers       Date:  2010-12

3.  Coloboma, congenital heart disease, and choanal atresia with multiple anomalies: CHARGE association.

Authors:  R A Pagon; J M Graham; J Zonana; S L Yong
Journal:  J Pediatr       Date:  1981-08       Impact factor: 4.406

4.  Molecular analysis of the CHD7 gene in CHARGE syndrome: identification of 22 novel mutations and evidence for a low contribution of large CHD7 deletions.

Authors:  Pia Vuorela; Sirpa Ala-Mello; Carola Saloranta; Maila Penttinen; Minna Pöyhönen; Kirsi Huoponen; Wiktor Borozdin; Birke Bausch; Elke M Botzenhart; Christian Wilhelm; Helena Kääriäinen; Jürgen Kohlhase
Journal:  Genet Med       Date:  2007-10       Impact factor: 8.822

5.  Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals.

Authors:  Gene Yeo; Christopher B Burge
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

Review 6.  Leveraging Rules of Nonsense-Mediated mRNA Decay for Genome Engineering and Personalized Medicine.

Authors:  Maximilian W Popp; Lynne E Maquat
Journal:  Cell       Date:  2016-06-02       Impact factor: 41.582

7.  Solution structure of the BRK domains from CHD7.

Authors:  Mark D Allen; Tomasz L Religa; Stefan M V Freund; Mark Bycroft
Journal:  J Mol Biol       Date:  2007-06-09       Impact factor: 5.469

8.  The Drosophila kismet gene is related to chromatin-remodeling factors and is required for both segmentation and segment identity.

Authors:  G Daubresse; R Deuring; L Moore; O Papoulas; I Zakrajsek; W R Waldrip; M P Scott; J A Kennison; J W Tamkun
Journal:  Development       Date:  1999-03       Impact factor: 6.868

9.  Identification of elements in human long 3' UTRs that inhibit nonsense-mediated decay.

Authors:  Kalodiah G Toma; Indrani Rebbapragada; Sébastien Durand; Jens Lykke-Andersen
Journal:  RNA       Date:  2015-03-24       Impact factor: 4.942

10.  Polypyrimidine tract binding protein 1 protects mRNAs from recognition by the nonsense-mediated mRNA decay pathway.

Authors:  Zhiyun Ge; Bao Lin Quek; Karen L Beemon; J Robert Hogg
Journal:  Elife       Date:  2016-01-08       Impact factor: 8.140

View more
  1 in total

1.  A case of combined 21-hydroxylase deficiency and CHARGE syndrome featuring micropenis and cryptorchidism.

Authors:  Satoko Umino; Miyuki Kitamura; Yuko Katoh-Fukui; Maki Fukami; Takeshi Usui; Shuichi Yatsuga; Yasutoshi Koga
Journal:  Mol Genet Genomic Med       Date:  2019-05-06       Impact factor: 2.183

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.