| Literature DB >> 30177809 |
Khurram Liaqat1, Ilene Chiu2, Kwanghyuk Lee3, Imen Chakchouk3, Paula B Andrade-Elizondo3, Regie Lyn P Santos-Cortez3, Shabir Hussain4, Shoaib Nawaz1, Muhammad Ansar4, Muhammad Nasim Khan5, Sulman Basit6, Isabelle Schrauwen3, Wasim Ahmad4, Suzanne M Leal7.
Abstract
LHFPL5, the gene for DFNB67, underlies autosomal recessive nonsyndromic hearing impairment. We identified seven Pakistani families that mapped to 6p21.31, which includes the LHFPL5 gene. Sanger sequencing of LHFPL5 using DNA samples from hearing impaired and unaffected members of these seven families identified four variants. Among the identified variants, two were novel: one missense c.452 G > T (p.Gly151Val) and one splice site variant (c.*16 + 1 G > A) were each identified in two families. Two known variants: c.250delC (p.Leu84*) and c.380 A > G (p.Tyr127Cys) were also observed in two families and a single family, respectively. Nucleotides c.452G and c.*16 + 1G and amino-acid residue p.Gly151 are under strong evolutionary conservation. In silico bioinformatics analyses predicted these variants to be damaging. The splice site variant (c.*16 + 1 G > A) is predicted to affect pre-mRNA splicing and a loss of the 5' donor splice site in the 3'-untranslated region (3'-UTR). Further analysis supports the activation of a cryptic splice site approximately 357-bp downstream, leading to an extended 3'-UTR with additional regulatory motifs. In conclusion, we identified two novel variants in LHFPL5, including a unique 3'-UTR splice site variant that is predicted to impact pre-mRNA splicing and regulation through an extended 3'-UTR.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30177809 PMCID: PMC6202120 DOI: 10.1038/s10038-018-0502-3
Source DB: PubMed Journal: J Hum Genet ISSN: 1434-5161 Impact factor: 3.172
Figure 1.Pedigree drawings of the seven ARNSHI families with LHFPL5 variants. Families 4275 and 4506 segregate the known variant c.250delC, the −/− signifies that the family member is homozygous for the c.250delC variant and +/− indicates individuals that are heterozygous c.250delC variant carrier. Family 4298 segregates the known variant c.380A>G. Families 4194 and 4464 segregate the novel variant c.452G>T and families 4072A and 4072B segregate another novel variant c.*16+1G>A. It was reported that families 4072A and 4072B are distantly related but the exact relationship is unknown. Filled symbols represent individuals with HI and clear symbols hearing individuals. The six individuals with arrows indicate their audiograms are displayed in Figure 2. Haplotypes are presented for the two families 4072A and 4464, which have novel variants. A boxed haplotype carries the pathogenic variant. For the other five families, the corresponding nucleotide substitutions are presented below each sequenced individual.
Figure 2.Air conduction thresholds of six hearing impaired individuals. Circles represent the right ear and crosses the left ear. All the tested subjects have bilateral and profound HI across all frequencies.
Bioinformatics analyses results of four pathogenic mutations found in seven Pakistani families
| Family | Maximum LOD | Chr6 coordinate | Variant | rs Id | GERP++ | Allele frequency | PolyPhen-2 | SIFT | Mutation taster | LRT | Mutation assessor | FATHMM | CADD score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| gnomAD | GME variome/BRAVO[ | |||||||||||||
| 4072A[ | 5.0 | 35,787,241 | N.A | 3.49 | 0.00 | 0.00 | N.A.[ | N.A. | Disease causing | N.A. | N.A. | N.A. | 23.3 | |
| 4072B[ | 2.4 | 35,787,241 | N.A | 3.49 | 0.00 | 0.00 | N.A. | N.A. | Disease causing | N.A. | N.A. | N.A. | 23.3 | |
| 4194 | 1.7 | 35,782,362 | rs762876554 | 5.53 | 8.1e-6 | 0.00 | Probably damaging | Damaging | Disease causing | D | Functional | Damaging | 28 | |
| 4275 | 2.7 | 35,773,697 | c.250delC (p.Leu84*)[ | N.A | 5.57 | 0.00 | 0.00 | N.A. | N.A. | Disease causing | N.A. | N.A. | N.A. | 25.8 |
| 4298 | 1.9 | 35,773,827 | c.380 A > G (p.Tyr127Cys)[ | rs104893975 | 4.46 | 2.8e-5 | 0.00 | Probably damaging | Tolerated | Disease causing | D | Functional | Damaging | 24.8 |
| 4464 | 5.4 | 35,782,362 | rs762876554 | 5.53 | 8.1e-6 | 0.00 | Probably damaging | Damaging | Disease causing | D | Functional | Damaging | 28 | |
| 4506 | 4.0 | 35,773,697 | c.250delC (p.Leu84*)[ | N.A | 5.57 | 0.00 | 0.00 | N.A. | N.A. | Disease causing | N.A. | N.A. | N.A. | 25.8 |
BRAVO is a database, which is powered by TOPMed Freeze 5 Public Subset. This dataset includes 463 million variants on 62,784 individuals
Families 4072A and 4072B reported to be related but the exact relationship is unknown
Mutations in bold indicate novel findings first reported in this study
Not applicable as those predictive software (except for MutationTaster) provide functional outcomes for missense variants
These mutations were previously reported in Pakistani and Indian families (Shabbir et al. 2006)
This mutation leads to NMD (nonsense-mediated decay) predicted by MutationTaster
rs104893975 is reported in dbSNP as a clinically associated pathogenic allele
Figure 3.A) Chromatograms displaying the novel variants c.452G>T in families 4194 and 4464 and c.*16+1G>A in families 4072A and 4072B. B) ClustalW2 sequence alignment of amino acids across LHFPL5 proteins from various species with conserved amino acids indicated with an asterisk, while colons indicate conservation between groups with strongly similar properties. The glycine151 residue is indicated with an arrow, and is fully conserved across all species (Left Panel). DNA sequence alignment containing the guanine nucleotide c.*16+1, indicated with an arrow. The position is fully conserved across various species (Right Panel). C) Schematic presentation of the exon-intron structure with eleven pathogenic variants. The boxed variants indicate novel variants found in this study (Top panel). The wild type structure of exon3-intron3-exon4 of LHFPL5 (Middle panel). In the c.*16+1 mutant transcript, exon3 was extended by 357 bp due to the activation of a cryptic splice site. The regulatory element binding sites in 3’-UTR are indicated with an arrow (Bottom Panel). D) Predicted transmembrane helices in LHFPL5 (adapted from the result of TMHMM 2.0 analysis) and depiction of the amino acid positions of previously reported two missense variants and the novel variant found in this study. The dotted line arrow indicates the location of the missense variant found in hscy mice.
Splice site analyses results for the mutation, c.*16+1G>A, found in families 4072A and 4072B
| Prediction Software | Natural Donor Splice site
[ | Predicted effect of c.*16+1G>A | Cryptic Donor Splice site | 3’-UTR Length Variation in Cryptic Donor Splice site (bp) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Original Splice site[ | Chr6 Coordinate | Prediction Score [ | 3′ UTR length (bp) | New Splice site [ | Chr6 Coordinate | Prediction Score [ | 3’ UTR length (bp) | |||
| NNsplice | GGTCG | 35,787,241 | 0.98 | 1,110 | Natural Donor Splice site Broken
[ | TACAG | 35,787,598 | 0.99 | 1,467 | + 357 |
| HSF | GGTCG | 35,787,241 | 92.1 | TACAG | 35,787,598 | 90.0 | + 357 | |||
| NetGene2 | GGTCG | 35,787,241 | 0.99 | TACAG | 35,787,598 | 0.99 | + 357 | |||
Three splice site prediction software showed the highest donor splice site scores at the natural donor splice site position (NM_182548.3).
Nucleotides with capital letter indicate exon sequences and lowercase indicate intron sequences. Nucleotide in bold indicate the natural donor splice site (c.*16+1G) or the cryptic donor splice site.
Prediction score ranges 0-1, 0-100, 0-1 in NNsplice, HSF and NetGene2, respectively.
The three software predicted the mutation disrupts natural donor splice site and activate cryptic splice site.