Literature DB >> 26996484

Targeted Genes Sequencing Identified a Novel 15 bp Deletion on GJA8 in a Chinese Family with Autosomal Dominant Congenital Cataracts.

Han-Yi Min1, Peng-Peng Qiao, Zhi-Hui Yan, Hui-Feng Jiang, Ya-Ping Zhu, Hui-Qian Du, Qin Li, Jia-Wei Wang, Jie Zhang, Jun Sun, Xin Yi, Ling Yang.   

Abstract

BACKGROUND: Congenital cataract (CC) is the leading cause of visual impairment or blindness in children worldwide. Because of highly genetic and clinical heterogeneity, a molecular diagnosis of the lens disease remains a challenge.
METHODS: In this study, we tested a three-generation Chinese family with autosomal dominant CCs by targeted sequencing of 45 CC genes on next generation sequencing and evaluated the pathogenicity of the detected mutation by protein structure, pedigree validation, and molecular dynamics (MD) simulation.
RESULTS: A novel 15 bp deletion on GJA8 (c.426_440delGCTGGAGGGGACCCT or p. 143_147delLEGTL) was detected in the family. The deletion, concerned with an in-frame deletion of 5 amino acid residues in a highly evolutionarily conserved region within the cytoplasmic loop domain of the gap junction channel protein connexin 50 (Cx50), was in full cosegregation with the cataract phenotypes in the family but not found in 1100 control exomes. MD simulation revealed that the introduction of the deletion destabilized the Cx50 gap junction channel, indicating the deletion as a dominant-negative mutation.
CONCLUSIONS: The above results support the pathogenic role of the 15 bp deletion on GJA8 in the Chinese family and demonstrate targeted genes sequencing as a resolution to molecular diagnosis of CCs.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26996484      PMCID: PMC4819309          DOI: 10.4103/0366-6999.178966

Source DB:  PubMed          Journal:  Chin Med J (Engl)        ISSN: 0366-6999            Impact factor:   2.628


INTRODUCTION

Congenital cataract (CC) refers to cataract observable at early year of life[1] and is the leading cause of visual losses in children worldwide.[23] The incidence of CC varies from 0.01% to 0.06% in developed countries[4] to 0.05–0.15% in less developed areas of the world.[2356] CC can occur in isolation forms (nonsyndromic CC) or as part of a syndrome of ocular or systemic anomalies (syndromic CC).[7] Major causing factors of CC include metabolic disorders, intrauterine infectious, and genetic defects (chromosomal abnormalities or gene defects). Moreover, approximately 8.3–25.0% of the CC cases are thought to have a genetic basis of etiology, a majority of which are Mendelian diseases caused by monogenic mutations.[89] CC is a group of clinically and genetically heterogeneous diseases. Clinically, in addition to other abnormalities in syndromic CC, cataract itself includes a variety of morphologies, such as sutural, pulverulent, whole lens, nuclear, lamellar (also referred to as perinuclear), cortical, polar, cerulean, coralliform, and others.[910] Genetically, over 110 genes have been found associated with CC,[11] including more than 20 genes involved in nonsyndromic CC (hitherto the initial of the study).[9] Inherited patterns of nonsyndromic CC includes autosomal dominant (AD), autosomal recessive, and X-linked dominant patterns, with AD as the most common forms.[110] Of the more than 29 known nonsyndromic CC genes, at least 22 are involved in autosomal dominant congenital cataract (ADCC), including BEST1, BFSP2, CHMP4B, CRYAA, CRYAB, CRYBA1, CRYBA4,CRYBB1, CRYBB2, CRYBB3, CRYGC, CRYGD, CRYGS, EPHA2, GJA3, GJA8, HSF4, MAF, MIP, PITX3, SLC16A12, and VIM. Most of the genes mainly involve in specific processes in lens development, intercellular communication in the lens, or the organization of lens fibers.[9] Of note, the relation between cataract phenotypes and mutant genes is even more complex for nonsyndromic CC, i.e., mutations in different genes can cause similar cataracts phenotypes, mutations in a single gene can cause different types of cataracts, and one single mutation in a gene can cause different cataract phenotypes in patients. Identifying the precise genetic cause of CC is essential for providing accurate diagnostics for medical management, prognostics, and recurrence risk counseling for the patient and family.[12] However, due to the highly clinical and genetic heterogeneity, clinical genetic diagnostic practice of CC, especially for nonsyndromic CC, is greatly limited with the traditional sequencing method which sequences a few candidate genes at each time. Despite this, recently, the next generation sequencing (NGS) combined with targeted genomic enrichment has proved to be a cost-effective resolution to the genetic test of genetically heterogeneous diseases and provide a new opportunity for genetic diagnostics of CC. In this study, we performed parallel sequencing of 45 CC genes by combined NGS and targeted genomic enrichments to determine the genetic mutations in a three-generation Chinese family with congenital nuclear cataracts. We identified a novel c.426_440delGCTGGAGGGGACCCT in GJA8 that segregates with the disease phenotype in the family and evaluated potential pathogenicity of the deletion and modeled the functional impacts of the deletion on the structure of the connexin 50 (Cx50) protein. Our result demonstrates that the targeted gene sequencing using NGS can be used as an effective tool for molecular diagnosis of CC.

METHODS

Participants recruitment, blood sampling, and DNA extraction

This study was approved by the Institutional Review Boards of Beijing Genomic Institute (BGI)-Shenzhen. Nine members of a three-generation Chinese family from Guangdong with ADCCs were recruited in this study [Figure 1]. Careful ophthalmological examinations and hospital medical record reviews were performed for each affected member to confirm the clinical diagnosis. After obtained written informed consent, peripheral venous blood samples were collected for all participants. Genomic DNA was extracted using a QIAamp DNA Blood MiNi kit (Qiagen, Germany).
Figure 1

Pedigree and mutation analysis. (a) Pedigree of a three-generation Chinese family with congenital cataracts. The proband is indicated with an arrow. Squares and circles symbolize male and female individual, respectively. Black symbol indicates cataract affected status and white symbol indicates unaffected status. (b) DNA sequence chromatogram analysis. DNA sequence chromatograms of the unaffected members (top) and affected members (bottom) in the pedigree. Fifteen bases deletion in exon 2 causes a conservative deletion of LEGTL from codon 143–147 (p.143_147del). (c) Evolutionary sequence conservation analysis. Multiplex sequence alignment of connexin 50 from different species reveals that codon 143–147, where the mutation (p.143_147del) occurred, is located within a highly conserved region. (d) Allelic spectrum of reported disease-associated mutations. Schematic diagram of genomic structure of human GJA8 gene and allelic spectrum of diseases mutations are shown. The identified mutation in this study is marked with red line. CL (C1/C2/C3): cytoplasmic loop domain; TM: transmembrane domain; E1:extracellular domain 1; E2: extracellular domain 2.

Pedigree and mutation analysis. (a) Pedigree of a three-generation Chinese family with congenital cataracts. The proband is indicated with an arrow. Squares and circles symbolize male and female individual, respectively. Black symbol indicates cataract affected status and white symbol indicates unaffected status. (b) DNA sequence chromatogram analysis. DNA sequence chromatograms of the unaffected members (top) and affected members (bottom) in the pedigree. Fifteen bases deletion in exon 2 causes a conservative deletion of LEGTL from codon 143–147 (p.143_147del). (c) Evolutionary sequence conservation analysis. Multiplex sequence alignment of connexin 50 from different species reveals that codon 143–147, where the mutation (p.143_147del) occurred, is located within a highly conserved region. (d) Allelic spectrum of reported disease-associated mutations. Schematic diagram of genomic structure of human GJA8 gene and allelic spectrum of diseases mutations are shown. The identified mutation in this study is marked with red line. CL (C1/C2/C3): cytoplasmic loop domain; TM: transmembrane domain; E1:extracellular domain 1; E2: extracellular domain 2.

Capture probes design, targeted capture, and next generation sequencing

Forty-five genes implicated in the CC, including 29 nonsyndromic cataract genes (AGK, BEST1, BFSP1, BFSP2, CHMP4B, CRYAA, CRYAB, CRYBA1, CRYBA4, CRYBB1, CRYBB2, CRYBB3, CRYGC, CRYGD, CRYGS, EPHA2, FYCO1, GJA3, GJA8, HSF4, P3H2, LIM2, MAF, MIP, NHS, PITX3, SLC16A12, TDRD7, and VIM) and 16 syndromic cataract genes (ABHD12, CNBP, CTDP1, EYA1, FTL, GALK1, GCNT2, GFER, GJA1, JAM3, OPA3, PAX6, RAB3GAP2, SIL1, SIX6, and SLC33A1), were collected from careful literature and database search. The 45 genes selected contain almost all of the genes related to CC.[113] Moreover, genes related to ADCC, which are interested, were all included. Targeted sequencing capture DNA Probes were designed for exons and the flanking 30 bp intronic sequences using the Nimblegen SeqCap EZ Choice system (Roche NimbleGen, Madison, WI, USA). Targeted sequences capture and sequencing library preparation were performed as previously described.[14] Paired-end sequencing (PE100) was performed on the Illumina HiSeq2000 platform (Illumina, San Dieago, CA, USA).

Short-reads mapping, variant detection, and annotation

After filtering of reads of low-quality and potential adaptor contamination,[15] the clean reads were mapped to the reference human genome (hg19) using BWA software package (Burrows Wheeler Aligner http://sourceforge.net/projects/bio-bwa/).[16] Single nucleotide variants (SNVs) were identified using SOAPsnp software (http://soap.genomics.org.cn/),[17] and small insertion and deletions (InDels) were identified using the GATK InDel Genotyper (The Genome Analysis Toolkit, http://www.broadinstitute.org/gsa/wiki/index.php/).[18] The variants were annotated using a BGI in-house developed annotation pipeline.

Sequence conservation analysis and estimation of deleteriousness

Multiple alignments of Cx50 protein sequences between species were directly obtained from the UCSC genome browser home (http://genome.ucsc.edu/) and realigned with the Molecular Evolutionary Genetics Analysis (MEGA 6.06, http://www.megasoftware.net). The online free tool Combined Annotation-Dependent Depletion (CADD, http://cadd.gs.washington.edu/),[19] a newly developed framework that integrates diverse annotations into a single quantitative score (C-score) to measure the deleteriousness of SNVs and small InDels, was used to evaluate the pathogenicity of the deletion.

Construction of connexin 50 three-dimensional structure model

In the absence of Cx50 experimental structures, the comparative modeling methods based on high sequence identity can be employed to predict the Cx50 protein structure, which can subsequently be used to aid in the understanding of protein functional mechanisms. The amino acid sequences of wild type (WT) Cx50 and mutant Cx50 above were used to perform the sequence similarity searches using the NCBI BLAST in SWISS-MODEL server (http://swissmodel.expasy.org/).[20] The newly disclosed structure of human Cx26 (Protein Data Bank [PDB] code: 2ZW3)[21] was selected as the template to construct the Cx50 structure that shares a sequence identity of 57% with WT Cx50 sequence. The modeling result is shown in Supplementary Figure 1. (a and b) The overall structure of the connexin 50 chennal model in hydrophocipy surface representation. Click here for additional data file. The system of generating lipid bilayer structures as well as membrane-bound protein structures, CHARMM-GUI Lipid Builder[22] was used to build a Cx50 protein/membrane complex for molecular dynamics (MD) simulations. The Cx50 models (WT and mutant) were inserted into a fully hydrated palmitoyl-oleyl phosphatidylcholine (POPC) lipid bilayer of 400 molecules (set 200 on the lower leaflet and 200 on the upper leaflet) [Supplementary Figure 2]. The water model used was three-point model (TIP3P). To remove the net charge of the system, an ionic concentration of 150 mM KCl by transmuting random water molecules into K+ and Cl− was added as counterions. The illustration of homo-6-mer connexin 50 protein/membrane complex models. Click here for additional data file.

Molecular dynamics simulation

MD simulations were carried out using the parallel MD program GROMACS 4.6.5 for all protein molecules, POPC lipid molecules, and ions along with the TIP3P model for water molecules. A cutoff of 10 Å for van der Waals interactions was imposed. The particle mesh Ewald[23] technique with a short-range cutoff of 10 Å was employed to calculate long-range electrostatic forces. The periodic boundary conditions were introduced on all simulations with an integration time step of 2 fs to employ a multiple time stepping algorithms. The simulations were equilibrated as an NPT ensemble, using the Langevin dynamics[24] method to keep the pressure at 1 atm, and the temperature was maintained at 303 K using Langevin dynamics with a very weak friction coefficient. The MD simulation protocol is detailed description as follows: To remove the largest strains in the system, all simulation systems were first subjected to 5000 cycles of steepest descent, while position restraints were applied to the residues of the Cx50 models as well as the molecules of POPC lipid bilayer. Afterward, MD with position restraints applied only to the protein was performed for 0.1 ns. All the positional restraints were eliminated in the third round, and the systems were allowed to preequilibrate for 5 ns. The production simulations were finally conducted for 10 ns for all systems.

Sanger sequencing

Polymerase chain reaction (PCR) primer sets were designed to sequence the first exons of MAF,EPHA2, and NHS genes not covered in targeted sequencing, as well as to analyze the segregation of the GJA8 deletion in the family [Supplementary Table 1]. The PCR amplification was conducted in a 25 μl reaction volume containing 1X PCR buffer with 1.5 mM MgCl2, 200 μM each dNTP, 0.25 U Taq DNA polymerase (Takara, Dalian, China), 1.5 μM primers, and 50 ng genomic DNA. The PCR condition was 94°C for 4 min, 35 cycles of 94°C for 30 s, 59°C for 30 s, and 72°C for 45 s, and a final 72°C for 10 min. The PCR products were purified and sequenced at both ends on an ABI3730xl DNA sequencer (PE Applied Biosystems, Forest City, CA, USA).
Supplementary Table 1

Primers for PCR amplification of exons of candidate genes and the size of the PCR products

GeneExonPrimers sequence (5’–3’)Fragment size (bp)
GJA82F*-GTGCACTACGTCCGCATG298
R-CGAAGCAGTCCACCACATTG
MAF1F-AGCTGGTGACCATGTCTGTG407
R-AGAACTAGCAAGCCCACACC
F-AACTGGCAATGAGCAACTCC548
R-GTGGTGGTGGTGGTGGTAGT
F-GAGCGAGGGAGCACATTG352
R-CCGGTTCCTTTTTCACTTCA
F-CCGCACTACCACCACCAC432
R-CTGGTTCTTCTCCGACTCCA
EPHA21F-GACCAAGCTGAAACCGCTTA684
R-TACCAGGCTCAGAGATCCCT
NHS1F-AGGCAAGGTGAGCAGAGAAG764
R-CGCAGAAACCCATAGCCTG

*F: Forward primers; †R: Reverse primers; PCR: Polymerase chain reaction.

Primers for PCR amplification of exons of candidate genes and the size of the PCR products *F: Forward primers; †R: Reverse primers; PCR: Polymerase chain reaction.

RESULTS

Pedigree and clinical features

The family includes 5 affected and 4 unaffected members in a three-generation pedigree [Figure 1a]. The transmission of cataract from the female founder (also the proband) to four out of five of her offsprings with both genders supports a dominant inheritance pattern of the lens diseases in the family. All affected members have nuclear cataracts with no other ocular or systemic abnormalities [Figure 1a and Table 1]. The presences of cataracts at birth were confirmed by hospital records, and nonsyndromic CC was diagnosed for all affected members.
Table 1

Summary of clinical evaluations for the Chinese cataract family

MemberGenderAge at onsetAge at diagnosisCataract phenotypesOther abnormalitiesDiagnosis
I:1MaleNo65NormalNoNormal
I:2FemaleNo67NormalNoNormal
II:1MaleNo48NormalNoNormal
II:2FemaleOn birth42Nuclear, bilateralNoCongenital nuclear cataract
III:1MaleOn birth13Nuclear, bilateralNoCongenital nuclear cataract
III:2FemaleOn birth22Nuclear, bilateralNoCongenital nuclear cataract
III:3FemaleOn birth25NuclearNoCongenital nuclear cataract
III:4FemaleOn birth15NuclearNoCongenital nuclear cataract
III:5FemaleOn birth18NormalNoNormal
Summary of clinical evaluations for the Chinese cataract family

Targeted sequencing of 45 cataract genes

The coding sequence-exons and adjacent intronic sequences of 45 cataract genes, consisting of 366 exons and 63003 bp, were captured and sequenced for the proband DNA (see Materials and Methods). A total of 224,437 clean reads in 15,940,353 bp length (or ~15.94 Mb data) mapped to targeted regions were generated on Hiseq2000 [Table 2]. This formed a mean coverage depth of 218-fold on the targeted regions of the 45 cataract genes, with a general sequence coverage rate of 99.23% and a high-quality genotype assignment (>×10) rate of 97.29% [Table 2]. The uncovered targeted regions were all the first coding exons with extreme GC content [mean CC% >69%, Supplementary Table 2]. Supplementary sequencing was performed to cover the sequences of three of the uncovered first coding exons (see Materials and Methods) while the others were excluded for the analysis considering a syndromic form or a nondominant inheritance pattern of cataracts associated with the related genes [Supplementary Table 2].
Table 2

Summary statistics for targeted sequencing of 45 cataracts genes in the proband

GeneOMIM diseasesTranscriptNumber of coding exonsSizes (bp)Sequencing depth (X)Coverage rate (%)

>1X>4X>10X
ABHD12#612674 (AR, Sa)NM_015600.413121519299.2684.2884.28
AGK#614691 (AR, NSa)NM_018238.3161269270100.00100.00100.00
BEST1#193220 (AD, NS)NM_001139443.191815320100.00100.00100.00
BFSP1#611391 (AR, NS)NM_001195.38199827299.9597.0085.44
BFSP2#611597 (AD, NS)NM_003571.271248164100.00100.00100.00
CHMP4B#605387 (AD, NS)NM_176812.45675214100.00100.00100.00
CNBP#602668 (AD, S)NM_001127192451315597.8794.9293.94
CRYAA#604219 (AD, NS)NM_000394.23522154100.00100.00100.00
CRYAB#613763 (AD/AR, NS)NM_001885.13528313100.00100.00100.00
CRYBA1#600881 (AD, NS)NM_005208.46648245100.00100.00100.00
CRYBA4#610425(AD/AR, NS)NM_001886.26591141100.00100.00100.00
CRYBB1#611544 (AD/AR, NS)NM_001887.36759145100.00100.00100.00
CRYBB2#601547 (AD, NS)NM_000496.26618194100.00100.00100.00
CRYBB3#609741 (AD/AR, NS)NM_004076.36636144100.00100.00100.00
CRYGC#604307 (AD, NS)NM_020989.33525179100.00100.00100.00
CRYGD#115700 (AD, NS)NM_006891.33525179100.00100.00100.00
CRYGS#116100 (AD, NS)NM_017541.23537364100.00100.00100.00
CTDP1#604168 (AR, S)NM_004715.413288612692.3189.1289.12
EPHA2#116600 (AD, NS)NM_004431.3172931150100.0097.1097.10
EYA1#113650 (AD, S)NM_000503.4181779314100.00100.00100.00
FTL#600886 (AD, S)NM_000146.34528131100.00100.00100.00
FYCO1#610019 (AR, NS)NM_024513.3184437194100.00100.00100.00
GALK1#230200 (AR, S)NM_000154.18117980100.00100.00100.00
GCNT2#110800 (AR, S)NM_145649.451209421100.00100.00100.00
GFER#613076 (AR, S)NM_005262.23618120100.0095.6373.46
GJA1#257850 (AR, S)NM_000165.321149371100.00100.00100.00
GJA3#601885 (AD, NS)NM_021954.321308127100.00100.00100.00
GJA8#116200 (AD, NS)NM_005267.421302270100.00100.00100.00
HSF4#116800 (AD, NS)NM_001040667.2151479125100.00100.0098.24
JAM3#613730 (AR, S)NM_032801.49933275100.00100.00100.00
LEPREL1#614292 (AR, NS)NM_018192.3152127191100.0099.7692.57
LIM2#615277 (AR, NS)NM_030657.35648166100.00100.00100.00
MAF#610202 (AD, NS)NM_005360.42121210079.0476.3273.10
MIP#615274 (AD, NS)NM_012064.34792204100.00100.00100.00
NHS#302200 (XD, NS and S)NM_198270.28489335597.0695.2690.88
OPA3#165300 (AD, S)NM_001017989.2254375100.00100.00100.00
PAX6#106210 (AD, S)NM_001258462.1141311283100.00100.00100.00
PITX3#610623 (AD, NS)NM_005029.3490959100.00100.00100.00
RAB3GAP2#212720 (AR, S)NM_012414.3354182327100.00100.00100.00
SIL1#248800 (AR, S)NM_001037633.1111386193100.00100.00100.00
SIX6#212550 (AR, S)NM_007374.22741206100.00100.00100.00
SLC16A12#612018 (AD, NS)NM_213606.381551334100.00100.00100.00
SLC33A1#614482 (AR, S)NM_004733.361650391100.00100.00100.00
TDRD7#613887 (AR, NS)NM_014290.2173297368100.00100.00100.00
VIM#116300 (AD, NS)NM_003380.3101401201100.00100.00100.00

Total36663,00321899.2398.4397.29

OMIM: Online Mendelian Inheritance in Man; AR: Autosomal recessive; AD: Autosomal dominant; XD: X-linked dominant; S: Syndromic; NS: Non-syndromic.

Supplementary Table 2

Gene (exons) with poor sequencing coverage and the associated diseases

GeneRelated cataractsExonSize (bp)GC content (%)Sequencing depth (x)Coverage rate (%)Uncovered sequences

RegionLength (bp)Percentage of all exons
EPHA2#116600, cataract 6, multiple types, ADEX124074.1729.7054.12c.1_85852.14
BFSP1#611391, cataract 33, AREX141776.13100.8999.73c.1_37737718.87
ABHD12#612674, polyneuropathy, hearing loss, ataxia, retinitis pigmentosa, and cataract, AREX147078.1396.4395.29c.1_19119115.72
MAF#610202, cataract, pulverulent or cerulean, with or without microcornea, ADEX1194169.6572.1477.28c.1_1118111842.24
CTDP1#604168, congenital cataracts, facial dysmorphism, and neuropathy, AREX146178.095.3029.30c.1_3143148.37
NHS#302200, cataract 40, X-linked, XDEX190370.9979.5574.51c.1_5655656.45

AR: Autosomal recessive; AD: Autosomal dominant; XD: X-linked dominant.

Summary statistics for targeted sequencing of 45 cataracts genes in the proband OMIM: Online Mendelian Inheritance in Man; AR: Autosomal recessive; AD: Autosomal dominant; XD: X-linked dominant; S: Syndromic; NS: Non-syndromic. Gene (exons) with poor sequencing coverage and the associated diseases AR: Autosomal recessive; AD: Autosomal dominant; XD: X-linked dominant.

Identification of potential causal mutation

Mapped against the human reference genome sequences, a total of 43 variants, including forty substitutes and 3 small InDels, were identified on the 45 genes [Table 3 and Supplementary Table 3]. A vast majority of these variants (29/43) were either synonymous substitutes or from untranslated regions [Table 3]. These variants are thought to be of less potentially pathogenicity, and we put them aside first. Of the remaining variants with potential function importance (including ten nonsynonymous, 3 splice acceptor and donor site mutations, and 1 coding InDels), only the heterozygous in-frame coding on GJA8 was novel meeting the criteria as nonpolymorphic with an allele frequency <5% in any of the single nucleotide polymorphism database, HapMap, or 1000 genome project database. Multiplex protein sequences alignment shows that the deleted sequence was evolutionarily conserved among ten species with high-quality reference genome sequences available on UCSC genomes database [Figure 1c]. More importantly, the deletion was found in full cosegregation with cataract phenotypes in Sanger sequencing analysis of the mutation in the pedigree [Figure 1a and 1b] but was not detected in 1100 BGI in-house control exomes [Supplementary Table 3], genetically suggesting a potential pathogenic role of the deletion. Intriguingly, the deletion was not detected on both unaffected parents of the proband in Sanger sequencing validation analysis [Figure 1a and 1b], supporting a de novo occurrence of the deletion in the proband.
Table 3

Summary statistics for variants detected in targeted sequencing of 45 cataract genes in the proband

Mutation typeNumber
SNVs40
 NS10
 Synonymous28
 SS2
InDels3
 Coding (I)1
 SS1
 UTR1
NS/SS/I14
 Allele frequency ≤0.05 in dbSNP (snp137), HapMap or 1000 genomes project2
 Allele frequency ≤0.05 in 1100 BGI in-house control exomes1
 Frequency ≤0.05 in either1

NS: Nonsynonymous; SS: Spicing sites; UTR: Untranslated regions; dbSNP: Single nucleotide polymorphism database; BGI: Beijing Genomic Institute; SNVs: Single nucleotide variants; InDels: Insertion and deletions.

Supplementary Table 3

Variants detected in the 45 cataract genes in the proband

GeneTranscriptVariantHom/HetReadsMutation typeGene regionFunctional regionrsID*Frequency in dbSNPFrequency in HapMapFrequency in 1000 genomes§Frequency in 1100 exomes||
SLC16A12NM_213606c.49 T>GHetC99/103A;202SNVCDS1Missensers37400300.1630.380.15380.3641
PITX3NM_005029c.285 C>THetA25/22G;47SNVCDS2Synonymousrs22819830.4300.4240.3795
BEST1NM_001139443c.39 C>AHomA142;142SNVCDS1Synonymousrs11097480.0750.5470.14740.5333
BEST1NM_001139443c.1428 T>CHomC249/1T;250SNVCDS8Synonymousrs1800009000.35530.7638
JAM3NM_032801c.978−3 T>CHetC106/142T;248SNVIntron7Splicers6103820.4980.4340.4020.4342
GJA3NM_021954c.1017 G>AHetT41/19C;60SNVCDS1Synonymousrs116174150.37100.37360.1949
GJA3NM_021954c.895 C>AHomT103;103SNVCDS1Missensers9685660.96600.9671
CNBPNM_001127192c.156 C>THetA108/86GSNVCDS2Synonymousrs43038830.3810.2770.21150.2308
CTDP1NM_004715c.978 G>AHetA127/122G;249SNVCDS7Synonymousrs5995540.3260.1750.2830.1128
CTDP1NM_004715c.1461 G>AHetA54/28G;82SNVCDS8Synonymousrs21260820.30400.27560.1026
CTDP1NM_004715c.2817 T>CHomC33;33SNVCDS13Synonymousrs6261690.8460.9930.86720.9949
EPHA2NM_004431c.2874 C>THetA42/49G;91SNVCDS17Synonymousrs37543340.3550.2260.3260.241
EPHA2NM_004431c.1983 C>THetA77/94G;171SNVCDS11Synonymousrs109072230.2670.2060.20240.1949
EPHA2NM_004431c.987 C>THetA27/35G;62SNVCDS5Synonymousrs22305970.4330.1840.41580.2564
BFSP1NM_001195c.1749 A>GHomC250;250SNVCDS8Synonymousrs608071800.3580.54030.4154
BFSP1NM_001195c.1500 G>AHetT109/141C;250SNVCDS8Synonymousrs61361180.4310.4530.42950.4615
BFSP1NM_001195c.1033 G>AHetT102/108C;210SNVCDS7Missensers60807190.3780.4890.36080.4564
BFSP1NM_001195c.90 G>AHetT2/3C;5SNVCDS1Synonymoussnp1050000
ABHD12NM_015600c.1068 T>CHomG211;211SNVCDS12Synonymousrs109660.3100.35810.516
CRYAANM_000394c.6 C>THomT149;149SNVCDS1Synonymousrs8723310.2760.030.3150.2278
CRYBB3NM_004076c.337 C>GHomG144;144SNVCDS4Missensers96083780.4560.8670.49080.8718
CRYBB2NM_000496c.449+9 G>AHomA96;96SNVIntron5Splicers4049505000.61450.9026
CRYBB2NM_000496c.483 G>AHetA67/79G;146SNVCDS5Synonymousrs81409490.41200.38190.4923
CRYBA4NM_001886c.171 T>CHomC128;128SNVCDS3Synonymousrs57616370.84400.8480.9949
CRYGDNM_006891c.285 A>GHomC249;249SNVCDS3Synonymousrs23054300.6260.4530.63920.3846
CRYGDNM_006891c.51 T>CHetG37/40A;77SNVCDS2Synonymousrs20037528500.5180.4350.2974
FYCO1NM_024513c.3924 C>THomA194;194SNVCDS13Synonymousrs14636800.7390.9560.74730.8114
FYCO1NM_024513c.2036 C>THetA68/68G;136SNVCDS7Missensers37963750.4890.5190.42030.3808
FYCO1NM_024513c.1335 G>AHetT101/140C;241SNVCDS7Synonymousrs37963760.3150.5620.31680.3203
FYCO1NM_024513c.962 G>CHetG60/84C;144SNVCDS7Missensers37331000.4990.4890.4570.3879
FYCO1NM_024513c.749 G>AHomT219;219SNVCDS7Missensers4683158010.84980.8932
FYCO1NM_024513c.267 C>AHomT249;249SNVCDS3Synonymousrs46828010.62210.66670.8114
BFSP2NM_003571c.603 G>AHetA84/129G;213SNVCDS3Synonymousrs22767370.4950.5190.42770.4513
SLC33A1NM_004733c.512 A>GHetC113/137T;250SNVCDS1Missensers38047690.1740.2260.17220.2278
EYA1NM_172058c.1755 T>CHetG51/42A;93SNVCDS16Synonymousrs101033970.4970.4070.42670.388
EYA1NM_172058c.1278 C>THetA135/115G;250SNVCDS12Synonymousrs47381180.3990.4550.28110.4614
EYA1NM_172058c.813 A>GHomC248;248SNVCDS7Synonymousrs14453980.0340.2240.06320.1747
TDRD7NM_014290c.33 A>GHomG249;249SNVCDS1Synonymousrs13815320.3220.2990.31140.2811
TDRD7NM_014290c.449 T>CHomC243/5T;248SNVCDS3Missensers20457320.3220.2810.31140.2776
NHSNM_198270c.3955 T>CHetC128/121T;249SNVCDS6Missensers37472950.4690.250.13740.1692
MAFNM_005360c.–1637_−1639 delGGCHetW34/M14;48Deletion5-UTR5-UTR_No_frequencyNot_in_HapMap00.4744
GJA8NM_005267c.426_440 delGCTG GA GGGGACCCTHetW169/M78;247DeletionCDS1CDS_No_frequencyNot_in_HapMap00
RAB3GAP2NM_012414c.812−6 delTHetW67/M68;135DeletionIntron9Splice_No_frequency freqNot_in_HapMap00.3154

*NCBI dbSNP ID; †Allele frequency in NCBI dbSNP; ‡Allele frequency in HapMap; §Allele frequency in 1000 genome project; ||Allele frequency in BGI in-house control exomes. dbSNP: Single nucleotide polymorphism database; SNV: Single nucleotide variant; UTR: Untranslated regions; CDS: Coding sequence; BGI: Beijing Genomic Institute.

Summary statistics for variants detected in targeted sequencing of 45 cataract genes in the proband NS: Nonsynonymous; SS: Spicing sites; UTR: Untranslated regions; dbSNP: Single nucleotide polymorphism database; BGI: Beijing Genomic Institute; SNVs: Single nucleotide variants; InDels: Insertion and deletions. The pathogenicity of the deletion was further evaluated with CADD. The C-score of 15.52 supports a pathogenic role of the deletion (mutations with C-score >15 were supposed to be pathogenic).[19] Variants detected in the 45 cataract genes in the proband *NCBI dbSNP ID; †Allele frequency in NCBI dbSNP; ‡Allele frequency in HapMap; §Allele frequency in 1000 genome project; ||Allele frequency in BGI in-house control exomes. dbSNP: Single nucleotide polymorphism database; SNV: Single nucleotide variant; UTR: Untranslated regions; CDS: Coding sequence; BGI: Beijing Genomic Institute. This novel heterozygous deletion was concerned with a sequence deletion of 5 amino acids (or 15 bp) at the intracellular loop domain (cytoplasmic loop [CL]) of the Cx50 protein (or GJA8) [Figure 1d], which connects two transmembrane domains and was suspected to serving as binding sites of Ca2+/CaM in gap junction channel function.[2526] Mutations on GJA8 have caused multiplex ADCCs (Cataract 1, multiple types, OMIM#116200) including nuclear cataracts.

Potential functional impact of the deletion

To illustrate the dysfunction of Cx50 mutant, we performed MD simulation. At first, the best hit in PDB of Cx50 protein was used to predict the three-dimensional structure (see Materials and Methods). The overall structure of Cx50 represents a typical channel protein that contains a hydrophobic surface and hydrophilic channel [Supplementary Figure 1]. Then, the protein/membrane complex with lipid bilayer system and water was built to simulate the environment of Cx50 in cell. Both WT and mutant of Cx50 were inserted into the lipid bilayer system [Figure 2a and Supplementary Figure 2]. Finally, MD simulation was performed on both models (see Methods). As shown in Figure 2b, WT protein molecules were very stable in our system. However, the mutant molecules were more dynamic and the root-mean-square deviation increased when mutant molecules stay in the system for a longer time. Thus, the mutant Cx50 becomes unstable in the simulated system, which may also happen in real cellular environment.
Figure 2

Molecular dynamics simulation. (a) The illustration of mono connexin 50 protein/membrane complex models. The blue circle represents the lost region in connexin 50 mutant. (b) Simulation of molecular dynamics. The black curve denotes the structure dynamics of wild type and the red curve denotes mutants. X-axis is the time of simulation and Y-axis is the root-mean-square deviation of atoms.

Molecular dynamics simulation. (a) The illustration of mono connexin 50 protein/membrane complex models. The blue circle represents the lost region in connexin 50 mutant. (b) Simulation of molecular dynamics. The black curve denotes the structure dynamics of wild type and the red curve denotes mutants. X-axis is the time of simulation and Y-axis is the root-mean-square deviation of atoms.

DISCUSSION

The lens of human eye is an avascular structure, and gap junction channels encoded by Cxs function as the passage of intercellular transport of the ions and low molecular weight biomolecules. Cx50 (GJA8) and Cx46 (GJA3) are the major components of mammalian lens fiber cells, and mutations of these two genes account for approximately 20% of nonsyndromic familial cataract cases.[7] The typical structure of Cx includes a cytoplasmic N-terminal domain (NT), four transmembrane domains (TM1 to TM4), two extracellular loops (E1 and E2), a CL between TM2 and TM3, and a C-terminal domain (CT). In this study, we identified a novel in-frame deletion of 15 bp in the coding region of GJA8 gene (c.426_440delGCTGGAGGGGACCCT) associated with CCs in a three-generation Chinese family by targeted genes sequencing. Several lines of evidence support a pathogenic role of the deletion. First, the deletion was concerned with an in-frame deletion of 5 amino acids within a highly evolutionarily conserved region in the CL domain of Cx50 protein, which binds Ca2+/CaM to mediate gap junction channel in the lens of the eye.[2526] Second, the deletion was fully cosegregated with cataracts in the family but was not found in 1100 control exomes. Third, the cataract morphology of the affected members and inheritance pattern of the cataracts in the family were compatible with those of cataract caused by GJA8 mutations. Fourth, deleteriousness evaluation with CADD supported pathogenicity of the deletion. Fifth, protein structure modeling revealed that mutant protein disrupts the structure stability of Cx50 channel. To date, about 28 GJA8 mutations have been reported in CC patients [Figure 1d]. The majority of these mutations (26/28) are missense substitutes although rare cases of nonsense and frameshift mutations (2/28) have also been reported [Supplementary Table 4]. These mutations occur at the NT, TM1 to TM4, E1 and E2, or CT,[27] but none has been reported on the CL domain [Figure 1d and Supplementary Table 4]. No correlation between the mutation types or mutant region and the cataract phenotypes were observed. In vitro, functional analysis showed that these mutations triggered the formation of cataracts either through loss of normal channel functions (loss of function, altered gating, or reduced channel numbers) or gain of abnormal functions (gain of hemichannel function and formation of cytoplasmic accumulations).[927] The in-frame deletion reported here was first ever detected in the intracellular loop [Figure 1d and Supplementary Table 4] and its likely pathogenicity indicated the functional importance of the deleted sequence in the Cx50 gap junction protein. The structure modeling here showed that the deletion destabilized the structure of Cx50 protein channel. Thus, it is supposed that the mutation causes cataracts in the Chinese family as a dominant-negative function mutation due to impaired function of the channel or reduced number of normal channels.
Supplementary Table 4

Reported connexin 50 mutants and associated cataracts

MutationAmino acid changeLocationCataract typesInheritaryFamily originReferences
68G>CR23TNT*Progressive dense nuclearAutosomal dominantIranianWilloughby et al. 2003
92T>CI31TM1Nuclear cataractAutosomal dominantChineseWang et al. 2009
116C>GT39RM1Cataract and microcornea and iris hypoplasiaAutosomal dominantChineseSun et al. 2011
131T>AV44EM1Cataract and microcorneaAutosomal dominantIndianDevi and Vijayalakshmi 2006
131T>CV44AM1Suture-sparing nuclear cataractsAutosomal dominantChineseZhu et al. 2014
134G>CW45SM1Jellyfish-like bilateral and microcorneaAutosomal dominantIndianVanita et al. 2008b
136G>AG46RM1Complete and microcorneaAutosomal dominantChineseSun et al. 2011
137 G>TG46VM1Total cataractAutosomal dominantPakistaniMinogue et al. 2009
139G>AD47NE1Nuclear pulverulentAutosomal dominantBritishArora et al. 2008; Wang et al. 2011
139G>TD47YE1Nuclear cataractAutosomal dominantChineseLin et al. 2008
139G>CD47HE1Nuclear cataractAutosomal dominantChineseLi et al. 2013
142G>AE48KE1Zonular nuclear pulverulentAutosomal dominantPakistaniBerry et al. 1999
191T>GV64GE1NuclearAutosomal dominantChineseMa et al. 2005
218C>TS73FE1Dense and “star-shaped,” various locations in the nucleus or the polesAutosomal dominantDanishHansen et al. 2009
235G>CV79LM2§“Full moon” Y-sutural opacityAutosomal dominantIndianVanita et al. 2006
262C>TP88SM2Zonular pulverulentAutosomal dominantBritishShiels et al. 1998
262C>AP88QM2Lamellar pulverulentAutosomal dominantBritishArora et al. 2006
262C>AP88QM2“Balloon-like” Y-sutural opacitiesAutosomal dominantIndianVanita et al. 2008a
264C>TP88TM2Total cataractAutosomal dominantChineseGe et al. 2014
565C>TP189LE2||Cataract and microcorneaAutosomal dominantDanishHansen et al. 2007
593G>AR198QE2Cataract and microcorneaAutosomal dominantIndianDevi and Vijayalakshmi 2006
592C>TR198WE2Cataract and microcorneaAutosomal dominantChineseHu et al. 2010
601G>AE201KE2Perinuclear cataractsAutosomal dominantChineseSu et al. 2013
670insA203fsE2CataractAutosomal recessiveIndianPonnam et al. 2007
773C>TS258FCTNuclearAutosomal dominantChineseGao et al. 2010
776insGfsCTTriangularAutosomal recessiveGermanySchmidt et al. 2008
836C>AS259YCTAutosomal dominantDanishHansen et al. 2009
827C>TS276FCTNuclear pulverulentAutosomal dominantChineseYan et al. 2008
905T>CL281CCTLamellar/zonularAutosomal dominantIndianKumar et al. 2011

*Cytoplasmic amino-terminal, †First transmembrane domain, ‡Extracellular loop 1, §Second transmembrane domain, ||Extracellular loop 2, ¶Cytoplasmic carboxy-terminal. NT: N-terminal; CT: C-terminal.

Reported connexin 50 mutants and associated cataracts *Cytoplasmic amino-terminal, †First transmembrane domain, ‡Extracellular loop 1, §Second transmembrane domain, ||Extracellular loop 2, ¶Cytoplasmic carboxy-terminal. NT: N-terminal; CT: C-terminal. In recent several years, great advances have been achieved in NGS and targeted genomic enrichment technologies. The combination of these two technologies has shown considerable potential and value in clinical applications, especially in genetic diagnosis of highly heterogeneous rare genetic diseases.[28] First, targeted sequencing of a small proportion of the genome (about 1/1000–1/100 of the genome) could reduce the sequencing cost to a level acceptable in clinical context. Second, the flexibility of targeted gene panel design and the availability of deep sequencing depth on NGS allow to simultaneously sequence all known genes for certain genetic disease and to comprehensively analyze SNVs, small InDels, and copy number variations at a high accurate level. Here, we performed targeted sequencing of 45 genes involved in CC (mainly in nonsyndromic CC) to explore the utility of targeted NGS sequencing for genetic diagnosis of ADCC. The high depth and completeness of sequences coverage for these 45 genes, as well as the readily identification of potential causative mutation, indicate that the targeted genes sequencing using NGS provides a tool for genetic diagnostics of nonsyndromic CC. More importantly, this method could be expanded to all CC with the addition of more CC genes in target panel. However, a major limitation of this approach is that if the panel does not include genes responsible in tested patients, mutations will not be detected. Hence, the panel used needs to be updated when new genes are implicated in the disease. Another problem of the method is that some deep intronic mutations or mutations affecting regulatory elements may not be detected, which needs to be solved. In conclusion, clinical molecular diagnosis of CC, particularly for ADCC, remains a challenge because of highly genetic and clinical heterogeneity. In this study, by combined NGS and targeted genomic enrichment technology, we identified a novel 15 deletion in a highly conserved region of CL domain of the GJA8 gene associated with cataracts in a three-generation Chinese family with ADCC (mutations have never reported in CL domain before). Evidence supports a pathogenic role of the deletion in the family. More importantly, protein modeling indicates a dominant-negative impact of the deletion on the protein function. Our results demonstrate targeted genes sequencing on NGS as a useful tool for molecular diagnosis of CCs. Supplementary information is linked to the online version of the paper on the Chinese Medical Journal website.

Financial support and sponsorship

This work was supported by the experimental facilities and reagents of the BGI-Shenzhen, China.

Conflicts of interest

There are no conflicts of interest.
  26 in total

Review 1.  Elimination of cataract blindness: a global perspective entering the new millenium.

Authors:  D J Apple; J Ram; A Foster; Q Peng
Journal:  Surv Ophthalmol       Date:  2000-11       Impact factor: 6.048

2.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

3.  Efficient Algorithms for Langevin and DPD Dynamics.

Authors:  N Goga; A J Rzepiela; A H de Vries; S J Marrink; H J C Berendsen
Journal:  J Chem Theory Comput       Date:  2012-06-13       Impact factor: 6.006

4.  Personalized diagnosis and management of congenital cataract by next-generation sequencing.

Authors:  Rachel L Gillespie; James O'Sullivan; Jane Ashworth; Sanjeev Bhaskar; Simon Williams; Susmito Biswas; Elias Kehdi; Simon C Ramsden; Jill Clayton-Smith; Graeme C Black; I Christopher Lloyd
Journal:  Ophthalmology       Date:  2014-08-19       Impact factor: 12.079

5.  CHARMM-GUI Membrane Builder toward realistic biological membrane simulations.

Authors:  Emilia L Wu; Xi Cheng; Sunhwan Jo; Huan Rui; Kevin C Song; Eder M Dávila-Contreras; Yifei Qi; Jumin Lee; Viviana Monje-Galvan; Richard M Venable; Jeffery B Klauda; Wonpil Im
Journal:  J Comput Chem       Date:  2014-08-07       Impact factor: 3.376

Review 6.  The genetic and molecular basis of congenital cataract.

Authors:  Alessandro Santana; Mauro Waiswo
Journal:  Arq Bras Oftalmol       Date:  2011 Mar-Apr       Impact factor: 0.872

Review 7.  Cat-Map: putting cataract on the map.

Authors:  Alan Shiels; Thomas M Bennett; J Fielding Hejtmancik
Journal:  Mol Vis       Date:  2010-10-08       Impact factor: 2.367

8.  Incidence and cumulative risk of childhood cataract in a cohort of 2.6 million Danish children.

Authors:  Birgitte Haargaard; Jan Wohlfahrt; Hans Callø Fledelius; Thomas Rosenberg; Mads Melbye
Journal:  Invest Ophthalmol Vis Sci       Date:  2004-05       Impact factor: 4.799

9.  Pathogenic mutations in two families with congenital cataract identified with whole-exome sequencing.

Authors:  Yukiko Kondo; Hirotomo Saitsu; Toshinobu Miyamoto; Byung Joo Lee; Kiyomi Nishiyama; Mitsuko Nakashima; Yoshinori Tsurusaki; Hiroshi Doi; Noriko Miyake; Jeong Hun Kim; Young Suk Yu; Naomichi Matsumoto
Journal:  Mol Vis       Date:  2013-02-18       Impact factor: 2.367

10.  Connexin mutants and cataracts.

Authors:  Eric C Beyer; Lisa Ebihara; Viviana M Berthoud
Journal:  Front Pharmacol       Date:  2013-04-15       Impact factor: 5.810

View more
  3 in total

Review 1.  Inherited Congenital Cataract: A Guide to Suspect the Genetic Etiology in the Cataract Genesis.

Authors:  Olga Messina-Baas; Sergio A Cuevas-Covarrubias
Journal:  Mol Syndromol       Date:  2017-02-07

2.  Novel mutations identified in Chinese families with autosomal dominant congenital cataracts by targeted next-generation sequencing.

Authors:  Shan Li; Jianfei Zhang; Yixuan Cao; Yi You; Xiuli Zhao
Journal:  BMC Med Genet       Date:  2019-12-16       Impact factor: 2.103

3.  Identification of a New Mutation p.P88L in Connexin 50 Associated with Dominant Congenital Cataract.

Authors:  Aixia Jin; Qingqing Zhao; Shuting Liu; Zi-Bing Jin; Shuyan Li; Mengqing Xiang; Mingbing Zeng; Kangxin Jin
Journal:  Front Cell Dev Biol       Date:  2022-04-21
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.