Literature DB >> 30505812

Applying Two Different Bioinformatic Approaches to Discover Novel Genes Associated with Hereditary Hearing Loss via Whole-Exome Sequencing: ENDEAVOUR and HomozygosityMapper.

Mohammad Reza Pourreza1, Hannane Mohammadi2, Ladan Sadeghian3, Samira Asgharzadeh4, Mohammadreza Sehhati2, Mohammad Amin Tabatabaiefar1,5.   

Abstract

BACKGROUND: Hearing loss (HL) is a highly prevalent heterogeneous deficiency of sensory-neural system with involvement of several dozen genes. Whole-exome sequencing (WES) is capable of discovering known and novel genes involved with HL.
MATERIALS AND METHODS: Two pedigrees with HL background from Khuzestan province of Iran were selected. Polymerase chain reaction-sequencing of GJB2 and homozygosity mapping of 16 DFNB loci were performed. One patient of the first and two affected individuals from the second pedigree were subjected to WES. The result files were analyzed using tools on Ubuntu 16.04. Short reads were mapped to reference genome (hg19, NCBI Build 37). Sorting and duplication removals were done. Variants were obtained and annotated by an online software tool. Variant filtration was performed. In the first family, ENDEAVOUR was applied to prioritize candidate genes. In the second family, a combination of shared variants, homozygosity mapping, and gene expression were implemented to launch the disease-causing gene.
RESULTS: GJB2 sequencing and linkage analysis established no homozygosity-by-descent at any DFNB loci. Utilizing ENDEAVOUR, BBX: C.C857G (P.A286G), and MYH15: C.C5557T (P.R1853C) were put forward, but none of the variants co-segregated with the phenotype. Two genes, UNC13B and TRAK1, were prioritized in the homozygous regions detected by HomozygosityMapper.
CONCLUSION: WES is regarded a powerful approach to discover molecular etiology of Mendelian inherited disorders, but as it fails to enrich GC-rich regions, incapability of capturing noncoding regulatory regions and limited specificity and accuracy of copy number variations detection tools from exome data, it is assumed an insufficient procedure.

Entities:  

Keywords:  ENDEAVOUR; Iran; hearing loss; homozygosity mapping; whole-exome sequencing

Year:  2018        PMID: 30505812      PMCID: PMC6233028          DOI: 10.4103/abr.abr_80_18

Source DB:  PubMed          Journal:  Adv Biomed Res        ISSN: 2277-9175


Introduction

Hearing loss (HL) accounts for the most common sensory-neural defect worldwide with an incidence of 1–2 new cases/1000 newborns.[1] The disease incidence rises during childhood and adolescence lifetime to 2.83 and 3.5/1000 individuals, respectively.[23] According to the World Health Organization estimations, there are about 360 million deaf individuals throughout the world.[4] Due to the high rate of consanguineous marriages in Iran,[5] the disease frequency is much higher and stands second among disabilities in the country.[6] Patients with profound hearing difficulties suffer from poor linguistic communication, social engagement, cognitive activity, and low life quality.[7] More than 50% of the cases have a genetic background. Hereditary HL is divided into two main categories, syndromic and nonsyndromic. Syndromic HL manifests with involvement of other body organs, while deafness is the only symptom in nonsyndromic form. About 75%–85% of nonsyndromic cases have autosomal recessive mode of inheritance. Autosomal dominant inheritance manner comprises 15%–24% of the patients. Rare causes, X-linked recessive and mitochondrial patterns, include 1%–2% of the affected individuals.[8] Lesions of outer ear cause conductive HL, while defects of the middle and inner ear lead to sensory-neural HL. According to age of onset HL is divided into prelingual and postlingual. The severity of disease spectrum ranges from mild to profound based on audiogram profile.[9] More than 80 distinct loci and 60 genes have been discovered associated with autosomal recessive nonsyndromic HL (ARNSHL).[10] GJB2 mutations at DFNB1 locus are the most common cause of the disease in most part of the world.[1112] However, the role of DFNB1 locus is variable because of existence of various ethnic groups in Iran.[1314151617] Whole-exome sequencing (WES) is a high-throughput technique which gives the opportunity to detect single-nucleotide variants mainly in the coding regions of the genome.[1819] To some extent, it has the capability of finding copy number variations (CNVs) through special approaches.[20] Since the development of this technology, several novel genes have been introduced involved with ARNSHL.[21] Various genes associated with the same heritable disorder usually play a role in certain pathways or have structural and functional similarities. Hence, based on previous knowledge about HL-associated genes, discovering novel genes could be an appropriate selection. Obtaining exome data from different individuals in a family would help to detect homozygous haplotypes and common variants, with the same pattern of inheritance, among affected individuals. In this study, we selected two different approaches to discover novel disease-causing genes in two large Iranian families affected with ARNSHL. In one family, WES was performed for the proband and gene discovery was based on variant filtering and similarity searching using ENDEAVOR online software tool. In the second family, homozygosity mapping was performed by exome data of two affected individuals. Common disruptive variants, expression pattern and gene function were considered to prioritize candidate genes.

Materials and Methods

Subjects

Two large pedigrees (Ahv-17 and Sho-4) from Khuzestan province of Iran were ascertained with the history of several HL patients. Clinical investigations were accomplished including skeletal, eye, kidney, heart, and cutaneous examinations. In addition, magnetic resonance imaging and computed tomography scan were performed to detect inner ear malformations. Severity of deafness was assessed through pure-tone audiometry from 250 Hz to 8000 Hz. The study was approved by ethics committee and review boards of Isfahan University of Medical Sciences. About 5–10 mL of peripheral blood from healthy and affected members of the pedigrees were collected in ethylenediaminetetraacetic acid-containing tubes following getting informed written consent from the participants.

Genetic linkage analysis of common DFNB loci

Genomic DNA was extracted from peripheral lymphocytes using Prime Prep Genomic DNA Extraction kit from blood (GeNet Bio, Korea) according to the manufacturer's instruction. Coding mutations of the GJB2 gene were screened by polymerase chain reaction (PCR)-sequencing as described previously.[17] More than 60 Short Tandem Repeat Polymorphic (STRP) markers linked to DFNB1A/B, DFNB2, DFNB3, DFNB4, DFNB7/11, DFNB9, DFNB15, DFNB21, DFNB22, DFNB24, DFNB42, DFNB59, DFNB63, DFNB93, and DFNB101 were selected from NCBI Map Viewer and UCSC genome browser. Primers were designed using Primer3 v. 0.4.0 (http://bioinfo.ut.ee/primer3–0.4.0/) [Table 1] or selected through NCBI Probe. Specificity of the primers was checked through NCBI Primer-BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi). Touchdown PCR was carried out to amplify STRP markers by 2X Master Mix (Ampliqon®, Denmark). To genotype STRP markers, PCR products were resolved on a nondenaturing 12%–15% polyacrylamide gel and stained under standard silver nitrate protocol. Genotypes were determined by visual inspection. Haplotype reconstruction was done by HaploPainter v. 1.043.
Table 1

Primers used for co-segregation of the variants in the BBX and MYH15 genes

Primers used for co-segregation of the variants in the BBX and MYH15 genes

Whole-exome sequencing experiment, bioinformatics analysis, and variant prioritization

One affected individual from Sho-4 pedigree and two affected individuals from Ahv-17 pedigree were subjected to high-throughput sequencing. Genomic DNA was isolated from peripheral blood lymphocytes according to phenol–chloroform protocol. Using 1% agarose gel and Nanospec Cube Biophotometer (Nanolytik®, Dusseldorf, Germany) DNA concentration, purity and integrity were checked. About 1 μg of genomic DNA of each individual was utilized to prepare library sequencing in a solution-phase targeted genomic enrichment by Agilent SureSelect Human All Exon kit v6 (Agilent Technologies, CA, USA) according to the manufacturer's instruction. Sequencing was carried out by Illumina HiSeq 2000 (Illumina, San Diego, California, USA) to generate 105 bp short reads. Sequence reads were aligned against human reference genome (hg19, NCBI Build 37) by BWA v0.7.8-r455 software. Samtools v1.0 was used to sort BAM files and Picard v1.111 was utilized to remark duplicates. Variant calling was accomplished by GATK v3.1 and variant annotation was done by wANNOVAR online software tool (http://wannovar.wglab.org/). Missense, nonsense, stop loss, initiation codon change, splice site, small in-frame, and frameshift indels with CAAD score of >10 and global minor allele frequency of <1% in NCBI dbSNP human build 147, 1000 genomes project phase3, NHLBI GO exome sequencing project, exome aggregation consortium version 0.3.1 were retained for further analysis. Pathogenic effect of the variants was evaluated by SIFT, PolyPhen-2 v2.2.2r398, and MutationTaster2.

ENDEAVOUR gene prioritization and conformation

For the Sho-4 family, ENDEAVOUR (https://endeavour.esat.kuleuven.be/Endeavour.aspx) was applied for novel gene discovery. Human was selected as species. Known deafness-associated genes with various patterns of inheritance were utilized for training. Genes from exome data were used as data source to build models. Prioritized candidate genes were fully investigated for their function and probable association with HL by means of comprehensive literature review and online sources including UniProt (http://www.uniprot.org/) and GeneCards (www.genecards.org). Primers were designed encompassing exon 10 of the BBX gene and exon 39 of the MYH15 gene [Table 1]. PCR-sequencing was carried out among healthy and affected family members.

Running autozygosity mapping by HomozygosityMapper web application

A Perl-based tool was accomplished to compare variant call format (VCF) files and pinpoint shared positions which variants are located. VCF tools v0.1.13 were utilized to find shared variants causing HL between candidates. The tool merges two output file and results in a VCF file with information about shared variants and facilitates decision-making. Using HomozygosityMapper web application (http://www.homozygositymapper.org/), homozygous chromosomal segments were detected. In the following, we focused on genes with expression in the inner and outer hair cells and auditory and vestibular ear ganglion neurons.[222324]

Results

Clinical findings

Two large inbred pedigrees, each with 4 HL patients, from Khuzestan province of Iran were recruited for molecular investigations. Pedigree analysis revealed autosomal recessive mode of inheritance in both of the pedigrees [Figure 1]. Affected individuals in pedigree Sho-4 showed mixed progressive severe-to-profound HL when they were teenager. At the age of 20, all individuals suffered from mixed profound HL. Affected individuals in pedigree Ahv-17 had severe sensory-neural HL. Involvement of other body organs were not observed in any of the pedigrees.
Figure 1

Pedigrees. There are four affected individuals in each pedigree due to consanguineous marriage that suggests autosomal recessive mode of inheritance

Pedigrees. There are four affected individuals in each pedigree due to consanguineous marriage that suggests autosomal recessive mode of inheritance

DFNB loci screening and whole-exome sequencing results

Direct sequencing showed no mutation in the coding exon of the GJB2 gene. DFNB1A/B linkage analysis ruled out the role of the most common cause of ARNSHL. After genotyping the STRP markers and haplotype reconstruction, we could not establish homozygosity-by-descent at any DFNB loci. WES was implemented to recognize the molecular etiology of the disease among known deafness-associated genes or finding novel genes. The mean depth of coverage was 100X and more than 99% of targeted regions were covered. No variant was found in known deafness-associated genes after proper variant filtering.

ENDEAVOUR gene prioritization results

Four candidate genes according to their function, domain, and expression pattern were prioritized by ENDEAVOR [Table 2]. Genes and variants include BBX: C.C857G (P.A286G) and MYH15: C.C5557T (P.R1853C) were evaluated through PCR-sequencing but none of the variants to be co-segregating with the phenotype in the pedigree [Figures 2 and 3]. Primer sequences are available in Table 2.
Table 2

List of candidate genes for families Sho-4 and Ahv-17

Figure 2

Sequence electropherogram of BBX: c.C857G (P.A286G) variant. The variant does not co-segregates with ARNSHL in the family as other affected members are heterozygous and one healthy sibling is homozygous for mutant allele

Figure 3

Sequence electropherogram of MYH15: c.C5557T (P.R1853C) variant. As one of the healthy individuals is homozygous for mutant allele and other affected individuals do not have homozygous mutant allele, this variant cannot be molecular etiology of the disease

List of candidate genes for families Sho-4 and Ahv-17 Sequence electropherogram of BBX: c.C857G (P.A286G) variant. The variant does not co-segregates with ARNSHL in the family as other affected members are heterozygous and one healthy sibling is homozygous for mutant allele Sequence electropherogram of MYH15: c.C5557T (P.R1853C) variant. As one of the healthy individuals is homozygous for mutant allele and other affected individuals do not have homozygous mutant allele, this variant cannot be molecular etiology of the disease

Homozygosity Mapper results

Two variants, UNC13B: C.C714A (P.D238E) and TRAK1: C.1841-1842insGGAGGA (P.T614insEE), remained after autozygosity mapping, expression in the inner and outer hair cells, auditory and vestibular ear ganglion neurons, and shared in both affected individuals as novel candidate genes for ARNSHL [Table 2].

Discussion

By applying two different approaches, ENDEAVOUR gene prioritization and homozygosity mapping using WES data, we aimed to introduce novel genes related to ARNSHL in two large families from Southwest of Iran. Although ENDEAVOUR prioritized four interesting genes, co-segregation analysis of the first two candidates ruled them out. In the second family, a combination of homozygosity mapping by exome data of two affected individuals, ear gene expression pattern, and shared variants in both individuals suggested the UNC13B and TRAK1 genes as potential candidate genes for ARNSHL. Although WES is a high-throughput sequencing method, some practical and analysis limitations prevent finding responsible variants for a single-gene disorder. First of all, this method is poor to enrich GC-rich regions of the genome.[25] Second, noncoding regulatory regions are not routinely captured by this methodology. Using whole-genome sequencing (WGS) would help us to uncover such pathogenic alterations, although variant interpretation and prioritization occurring in these regions is problematic.[26] The last limitation is detecting CNVs from exome data. Although several software and algorithms have been developed to overcome this problem, they have limited specificity and accuracy.[27] There are two disadvantages in using exome data for linkage analysis. First, only single-nucleotide polymorphisms (SNPs) at exonic regions will be captured and second, error rate is higher in comparison with array-based technologies.[28] Smith et al. implemented linkage analysis through exome data successfully to uncover genetic causes of recessive and dominant inherited diseases in three families and the results were comparable with SNP array-based linkage analysis.[28] Biallelic mutations in the WEE2 gene have been reported to cause female infertility using the same strategy.[29] ENDEAVOUR has been introduced as a powerful tool for gene prioritization through knowledge from former identified genes and genomics data to integrate them for global ranking.[30] Recent publications have verified the efficiency of ENDEAVOUR for discovering novel genes associated with neurodevelopmental disorder and congenital anomalies.[31323334] As the families are large enough for genome-wide linkage analysis, a dense SNP array and using STRP markers could help us to limit the disease gene to an interval. In addition, WGS would be a method of choice to detect CNVs, noncoding regulatory region variants, and mutations in GC-rich regions.

Conclusion

WES has revolutionized molecular investigations in medical genetics practice. However the procedure has wet-lab and dry-lab limitations. It seems that genomic approaches like SNP arrays could help us for fine mapping of the disease interval. In addition, WGS could be method of choice for scanning CNVs, noncoding regulatory regions and GC-rich content of the identified interval.

Financial support and sponsorship

This work was financially supported by Isfahan University of Medical Sciences grant NO.394805, NO.394531 and NO.194068.

Conflicts of interest

There are no conflicts of interest.
  33 in total

1.  Survey of residential and day schools for deaf students in the United States that identify themselves as bilingual-bicultural programs.

Authors:  Carol LaSasso; Jana Lollis
Journal:  J Deaf Stud Deaf Educ       Date:  2003

2.  GJB2 mutations: passage through Iran.

Authors:  Hossein Najmabadi; Carla Nishimura; Kimia Kahrizi; Yasser Riazalhosseini; Mahdi Malekpour; Ahmad Daneshi; Mohammad Farhadi; Marzieh Mohseni; Nejat Mahdieh; Ahmad Ebrahimi; Niloofar Bazazzadegan; Anoosh Naghavi; Matthew Avenarius; Sanaz Arzhangi; Richard J H Smith
Journal:  Am J Med Genet A       Date:  2005-03-01       Impact factor: 2.802

3.  A novel pathogenic variant in an Iranian Ataxia telangiectasia family revealed by next-generation sequencing followed by in silico analysis.

Authors:  Mohammad Amin Tabatabaiefar; Paria Alipour; Azam Pourahmadiyan; Najmeh Fattahi; Laleh Shariati; Neda Golchin; Javad Mohammadi-Asl
Journal:  J Neurol Sci       Date:  2017-06-12       Impact factor: 3.181

Review 4.  Detection of structural DNA variation from next generation sequencing data: a review of informatic approaches.

Authors:  Haley J Abel; Eric J Duncavage
Journal:  Cancer Genet       Date:  2013-11-20

Review 5.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

Review 6.  Whole-exome sequencing and its impact in hereditary hearing loss.

Authors:  Tahir Atik; Guney Bademci; Oscar Diaz-Horta; Susan H Blanton; Mustafa Tekin
Journal:  Genet Res (Camb)       Date:  2015-03-31       Impact factor: 1.588

7.  Exome sequencing and disease-network analysis of a single family implicate a mutation in KIF1A in hereditary spastic paraparesis.

Authors:  Yaniv Erlich; Simon Edvardson; Emily Hodges; Shamir Zenvirt; Pramod Thekkat; Avraham Shaag; Talya Dor; Gregory J Hannon; Orly Elpeleg
Journal:  Genome Res       Date:  2011-04-12       Impact factor: 9.043

8.  Developmental profiling of spiral ganglion neurons reveals insights into auditory circuit assembly.

Authors:  Cindy C Lu; Jessica M Appler; E Andres Houseman; Lisa V Goodrich
Journal:  J Neurosci       Date:  2011-07-27       Impact factor: 6.167

9.  Haploinsufficiency of TAB2 causes congenital heart defects in humans.

Authors:  Bernard Thienpont; Litu Zhang; Alex V Postma; Jeroen Breckpot; Léon-Charles Tranchevent; Peter Van Loo; Kjeld Møllgård; Niels Tommerup; Iben Bache; Zeynep Tümer; Klaartje van Engelen; Björn Menten; Geert Mortier; Darrel Waggoner; Marc Gewillig; Yves Moreau; Koen Devriendt; Lars Allan Larsen
Journal:  Am J Hum Genet       Date:  2010-05-20       Impact factor: 11.025

10.  A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease.

Authors:  Damian Smedley; Max Schubach; Julius O B Jacobsen; Sebastian Köhler; Tomasz Zemojtel; Malte Spielmann; Marten Jäger; Harry Hochheiser; Nicole L Washington; Julie A McMurry; Melissa A Haendel; Christopher J Mungall; Suzanna E Lewis; Tudor Groza; Giorgio Valentini; Peter N Robinson
Journal:  Am J Hum Genet       Date:  2016-08-25       Impact factor: 11.025

View more
  2 in total

1.  Phenotypic and genotypic characterization of families with complex intellectual disability identified pathogenic genetic variations in known and novel disease genes.

Authors:  Hossein Darvish; Luis J Azcona; Abbas Tafakhori; Roxana Mesias; Azadeh Ahmadifard; Elena Sanchez; Arman Habibi; Elham Alehabib; Amir Hossein Johari; Babak Emamalizadeh; Faezeh Jamali; Marjan Chapi; Javad Jamshidi; Yuji Kajiwara; Coro Paisán-Ruiz
Journal:  Sci Rep       Date:  2020-01-22       Impact factor: 4.379

2.  Novel MYO15A variants are associated with hearing loss in the two Iranian pedigrees.

Authors:  Somayeh Khatami; Masomeh Askari; Fatemeh Bahreini; Morteza Hashemzadeh-Chaleshtori; Saeed Hematian; Samira Asgharzade
Journal:  BMC Med Genet       Date:  2020-11-18       Impact factor: 2.103

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.