Literature DB >> 20711175

Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome.

Sarah B Ng1, Abigail W Bigham, Kati J Buckingham, Mark C Hannibal, Margaret J McMillin, Heidi I Gildersleeve, Anita E Beck, Holly K Tabor, Gregory M Cooper, Heather C Mefford, Choli Lee, Emily H Turner, Joshua D Smith, Mark J Rieder, Koh-Ichiro Yoshiura, Naomichi Matsumoto, Tohru Ohta, Norio Niikawa, Deborah A Nickerson, Michael J Bamshad, Jay Shendure.   

Abstract

We demonstrate the successful application of exome sequencing to discover a gene for an autosomal dominant disorder, Kabuki syndrome (OMIM%147920). We subjected the exomes of ten unrelated probands to massively parallel sequencing. After filtering against existing SNP databases, there was no compelling candidate gene containing previously unknown variants in all affected individuals. Less stringent filtering criteria allowed for the presence of modest genetic heterogeneity or missing data but also identified multiple candidate genes. However, genotypic and phenotypic stratification highlighted MLL2, which encodes a Trithorax-group histone methyltransferase: seven probands had newly identified nonsense or frameshift mutations in this gene. Follow-up Sanger sequencing detected MLL2 mutations in two of the three remaining individuals with Kabuki syndrome (cases) and in 26 of 43 additional cases. In families where parental DNA was available, the mutation was confirmed to be de novo (n = 12) or transmitted (n = 2) in concordance with phenotype. Our results strongly suggest that mutations in MLL2 are a major cause of Kabuki syndrome.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20711175      PMCID: PMC2930028          DOI: 10.1038/ng.646

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Kabuki syndrome is a rare, multiple malformation disorder characterized by a distinctive facial appearance (Supplementary Fig. 1), cardiac anomalies, skeletal abnormalities, immunological defects, and mild to moderate mental retardation. Originally described by Niikawa et al.5 and Kuroki et al.6 in 1981, Kabuki syndrome has an estimated incidence of 1 in 32,0007 and about 400 cases have been reported worldwide. The vast majority of reported cases have been sporadic, but parent-to-child transmission in more than a half a dozen instances8 suggests that Kabuki syndrome is an autosomal dominant disorder. The relatively low number of cases, the lack of multiplex families, and the phenotypic variability of Kabuki syndrome have made the identification of the gene(s) underlying Kabuki syndrome intractable to conventional approaches of gene discovery, despite aggressive efforts. We sequenced the exomes of ten unrelated individuals with Kabuki syndrome, seven of European ancestry, two of Hispanic ancestry, and one of mixed European/Haitian ancestry (Supplementary Fig. 1, Supplementary Table 1). Enrichment was performed by hybridization of shotgun fragment libraries to custom microarrays, followed by massively parallel sequencing1–3. On average, 6.3 gigabases of sequence were generated per sample to achieve 40× coverage of the mappable, targeted exome (31 megabases). As previously, our analyses focused primarily on nonsynonymous (NS) variants, splice acceptor and donor site mutations (SS) and coding indels (I), anticipating that synonymous variants were far less likely to be pathogenic. We also predicted that variants underlying Kabuki syndrome are rare, and therefore likely to be novel. Novelty was defined here by absence from all datasets used for comparison, including dbSNP129, the 1000 Genomes Project, exome data from sixteen individuals previously reported by us2,3, and ten exomes sequenced as part of the Environmental Genome Project (EGP). Under a dominant model in which each case was required to have at least one novel NS/SS/I variant in the same gene, only a single candidate gene (MUC16) was shared by all ten exomes (Table 1, row 4; Supplementary Table 2). However, MUC16 was considered likely to be a false positive due to its extremely large size (14,507 aa). Potential explanations for our failure to find a compelling candidate gene in which novel variants are observed in all affected individuals included: (a) that Kabuki syndrome is genetically heterogeneous, and therefore not all affected individuals will have mutations in the same gene; (b) that we failed to identify all mutations in the targeted exome; (c) that some or all causative mutations were outside of the targeted exome, e.g., in non-coding regions or unannotated genes. To allow for a modest degree of genetic heterogeneity and/or missing data, we conducted a less stringent analysis by looking for candidate genes shared among subsets of affected individuals. Specifically, we searched for subsets of x out of 10 exomes having ≥ 1 novel variant in the same gene, for x = 1 to 10. For x = 9, 8, and 7, novel variants were shared in three genes, six genes, and sixteen genes, respectively (Table 1, row 4). However, there was no obvious way to rank these candidates.
Table 1

Number of genes common to any subset of x affected individuals

The number of genes with at least one non-synonymous variant (NS), splice-site acceptor/donor variants (SS) or coding indel (I) are listed under various filters. Variants were filtered by presence in dbSNP or 1000 genomes ("Not in dbSNP129 or 1000 genomes") and control exomes ("Not in control exomes") or both ("Not in either"); control exomes refer to those from 8 Hapmap3 4 FSS3, 4 Miller2 and 10 EGP samples. The number of genes found using the union of the intersection of x individuals is given.

a. Subset analysis (any x of 10)12345678910
NS/SS/I12,0428,7227,0846,0495,2894,5813,9403,2442,4861,459
Not in dbSNP129 or 1000 genomes7,41926971057488288192128886034
Not in control exomes7,8272865102539918490502272
Not in either6,93522277012421044416631
Is loss-of-function (nonsense/frameshift indel)7534973221000
We speculated that genotypic and/or phenotypic stratification would facilitate the prioritization of candidate genes identified by subset analysis. Specifically, we assigned a categorical rank to each Kabuki case based on a subjective assessment of the presence of, or similarity to, the canonical facial characteristics of Kabuki syndrome (Supplementary Fig. 1) and the presence of developmental delay and/or major birth defects (Supplementary Table 1). The highest ranked case was one of a pair of monozygotic twins with Kabuki syndrome. We then categorized the functional impact (i.e. nonsense versus nonsynonymous substitution, splice-site disruption, frameshift versus in-frame indel) of each novel variant in candidate genes shared by each subset of two or more ranked cases. Manual review of these data highlighted distinct, novel nonsense variants in MLL2 in each of the four highest ranked cases. On sequential analysis of phenotype-ranked cases with a loss-of-function filter, MLL2 is the only candidate gene remaining after addition of the second individual (Table 2, row 5, column "+2"). No novel variant in MLL2 was found in the Kabuki case ranked 5th, such that the number of candidate genes drops to zero after the fourth individual (Table 2, row 5). However, a 4-bp deletion was found in the case ranked 6th and nonsense variants in the cases ranked 7th and 9th. Thus, exome sequencing identified a nonsense substitution or frameshift indel in MLL2 in seven of the ten Kabuki cases.
Table 2

Number of genes common in sequential analysis of phenotypically ranked individuals

Variants were filtered as in Table 1. Exomes were added sequentially to the analysis by ranked phenotype, e.g. column "+ 3" shows the number of genes at the intersection of the three top ranked cases. (Supplementary Fig. 1). The gene with at least one NS/SS/I in all individuals is MUC16 which is very likely to be a false positive due to its extreme length (14,507 aa).

b. Sequential analysis1+2+3+4+5+6+7+8+9+10
NS/SS/I5,2823,8503,2502,3542,0281,8991,7721,6861,6001,459
Not in dbSNP129 or 1000 genomes68721414584635442403934
Not in control exomes675134502613138542
Not in either467893418984431
Is loss-of-function (nonsense/frameshift indel)25111000000
Retrospectively, if we apply a loss-of-function filter to the subset analysis of exome data (Table 1, row 5), at x = 7, MLL2 is the only candidate gene. We also developed a post hoc ranking of candidate genes based on functional impact of variants present (“variant score”) and the rank of the cases in which each variant was observed (“case score”). When applied to the exome data as a combined metric, MLL2 emerges as the top candidate (Supplementary Fig. 2). In parallel with these analyses, we applied genomic evolutionary rate profiling (GERP)9 to exome data. GERP uses mammalian genome alignments to define a rejected substitution (RS) score for each variant, regardless of functional class. We have previously shown that the quantitative ranking of candidate genes by the RS scores of their novel variants can facilitate the exome-based analysis of Mendelian disorders10. In subset analysis with GERP-based ranking, MLL2 remains on the candidate list up to x = 8, ranking 3rd in a list of 11 candidate genes at this threshold (Table 3, Supplementary Fig. 3). Interestingly, the additional MLL2 variant contributing to this analysis (such that MLL2 is still considered at x = 8) is a synonymous substitution with an RS score of 0.368 in the 5th ranked case.
Table 3

Analysis of exome variants using genomic evolutionary rate profiling

The number of genes with at least a single novel variant with an rejected substitution (RS) score10 > 0 in at least x individuals is given. A gene rank is assigned based on the average GERP score9 over all observed novel variants with RS score > 0 in all affected individuals.

c. GERP Score analysis (at least x of 10)12345678910
Variant RS score > 07,1762,36075426910639201131
MLL2 Rank3,7321,232399136471463NANA
We sought to confirm all novel variants identified in MLL2, particularly because loss-of-function variants identified through massively parallel sequencing have a higher prior probability of being false positives. All seven loss-of-function variants in MLL2 were validated by Sanger sequencing. We further analyzed the three cases in which we did not initially find a loss-of-function variant in MLL2, first by array comparative genomic hybridization (aCGH) to determine any gross structural changes, and then by Sanger sequencing of all exons of MLL2 in case of false negatives by exome sequencing. Since an average of 96% of coding bases in MLL2 were called at sufficient quality and coverage for single-nucleotide variant detection, we anticipated that any missed variants were more likely to be indels instead, because of the higher coverage required for confident indel detection in short-read sequence data. Indeed, although aCGH did not find any structural variants in the region, Sanger sequencing did identify frameshift indels in two of these three cases (ranked 8th and 10th). Ultimately, loss-of-function mutations in MLL2 were identified in nine out of ten cases in the discovery cohort (Fig. 1), making it a compelling candidate for Kabuki syndrome. For validation, we screened all 54 exons of MLL2 in 43 additional cases by Sanger sequencing. Novel nonsynonymous, nonsense or frameshift mutations in MLL2 were found in 26 of these 43 cases (Fig. 1 and Supplementary Table 3). In total, through either exome sequencing or targeted sequencing of MLL2, 33 distinct MLL2 mutations were identified in 35 of 53 families (66%) with Kabuki syndrome (Fig. 1 and Supplementary Table 3). In each of twelve cases for which DNA from both parents was available, the MLL2 variant was found to have occurred de novo. Three mutations were found in two cases each: one mutation was confirmed to have arisen de novo in one of the cases, indicating that some mutations are recurrent. Novel MLL2 mutations (K4527X and T5464M) were also identified in each of two families in which Kabuki syndrome was transmitted from parent-to-child. None of the additional MLL2 mutations were found in 190 control chromosomes from individuals of matched geographical ancestry.
Figure 1

Genomic structure and allelic spectrum of MLL2 mutations that cause Kabuki syndrome

MLL2 is composed of 54 exons that encode untranslated regions (orange) and protein coding sequence (blue) including 7 PHD fingers (yellow), FYRN (green), FYRC (green), and a SET domain (red). Arrows indicate the locations of 32 different mutations found in 53 families with Kabuki syndrome including: 20 nonsense, 7 indels, and 5 amino acid substitutions. Asterisks indicate mutations that were confirmed to be de novo and crosses indicate cases for which parental DNA was unavailable.

Our results strongly suggest that mutations in MLL2 are a major cause of Kabuki syndrome. MLL2 encodes a large 5,262 residue protein that is part of the SET family of proteins, of which Trithorax, the Drosophila homologue of MLL, is the best characterized11. The SET domain of MLL2 confers strong histone 3 lysine 4 methyltransferase activity and is important in the epigenetic control of active chromatin states12. Murine loss of Mll2 on a mixed 129Sv/C57BL/6 background slows growth, increases apoptosis and retards development leading to early embryonic lethality, due in part to mis-regulation of homeobox gene expression13. However, no morphological defects have been reported in Mll2+/− mice13. Most of the MLL2 variants identified in Kabuki cases are predicted to truncate the polypeptide chain before translation of the SET domain. Accordingly, though it is not certain whether Kabuki syndrome results from haploinsufficiency or a gain-of function at MLL2, haploinsufficiency seems to be the more likely mechanism. Deletion of chromosome 12q12-q13.2, which encompasses MLL2, has been reported in a child with characteristics of Noonan syndrome14. However, we re-analyzed this case using oligo aCGH (including 21 probes that cover MLL2) and found the distal breakpoint to be located ~700 kb proximal of MLL2 (data not shown). Interestingly, all of the pathogenic missense variants identified herein are located in regions of MLL2 that encode C-terminal domains. This suggests that missense variants elsewhere in MLL2 could be better tolerated or, alternatively, are embryonic lethal. For the 18 of 53 cases for which no novel protein-altering variant was found, it is possible that non-coding or other missed mutations in MLL2 are responsible instead. Alternatively, Kabuki syndrome could be genetically heterogeneous, and further analysis of these cases by exome sequencing may elucidate additional genes for Kabuki syndrome and potentially explain some of the phenotypic heterogeneity seen in this disease. Notably, 9 of 10 individuals in the discovery cohort (90%), but only 26 of 43 individuals in the replication cohort (60%), were ultimately found to have mutations in MLL2. It is therefore possible that the careful selection of canonical Kabuki cases for the discovery cohort enriched for a shared genetic basis. This underscores the importance of access to deeply phenotyped and well-characterized cases. In summary, we applied exome sequencing of a small number of unrelated cases to discover that mutations in MLL2 underlie Kabuki syndrome. As predicted in previous analyses2,3, allowing for even a small degree of genetic heterogeneity or missing data significantly confounds exome analysis by increasing the number of candidate genes consistent with the model of inheritance. To facilitate the prioritization of genes under such criteria, we stratified data by ranked phenotypes and found that MLL2 was prominent in the higher ranked cases. However, nine of the ten Kabuki cases in the discovery cohort were ultimately found to have MLL2 mutations, such that stratification by phenotype was of less importance than originally appeared to be the case. Nonetheless, the sequential analysis of ranked cases may have reduced the probability of confounding due to genetic heterogeneity. All of the MLL2 mutations found in the discovery set via exome sequencing were loss-of-function variants. As a result, MLL2 ranked highly among candidates assessed by predicted functional impact. Such a pattern will likely occur for some, but not all, Mendelian phenotypes subjected to this approach. We anticipate that the further development of strategies to stratify data at both the genotypic and phenotypic level will be critical for exome and whole genome sequencing to reach their full potential as tools for discovery of genes underlying Mendelian and complex diseases.
  14 in total

1.  Distribution and intensity of constraint in mammalian genomic sequence.

Authors:  Gregory M Cooper; Eric A Stone; George Asimenos; Eric D Green; Serafim Batzoglou; Arend Sidow
Journal:  Genome Res       Date:  2005-06-17       Impact factor: 9.043

2.  Patient with del(12)(q12q13.12) manifesting abnormalities compatible with Noonan syndrome.

Authors:  H Tonoki; S Saitoh; K Kobayashi
Journal:  Am J Med Genet       Date:  1998-02-03

3.  Single-nucleotide evolutionary constraint scores highlight disease-causing mutations.

Authors:  Gregory M Cooper; David L Goode; Sarah B Ng; Arend Sidow; Michael J Bamshad; Jay Shendure; Deborah A Nickerson
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

Review 4.  Kabuki make-up (Niikawa-Kuroki) syndrome: a study of 62 patients.

Authors:  N Niikawa; Y Kuroki; T Kajii; N Matsuura; S Ishikiriyama; H Tonoki; N Ishikawa; Y Yamada; M Fujita; H Umemoto
Journal:  Am J Med Genet       Date:  1988-11

5.  MLL2: A new mammalian member of the trx/MLL family of genes.

Authors:  K T FitzGerald; M O Diaz
Journal:  Genomics       Date:  1999-07-15       Impact factor: 5.736

6.  Further evidence for autosomal dominant inheritance and ectodermal abnormalities in Kabuki syndrome.

Authors:  W Courtens; A Rassart; J J Stene; E Vamos
Journal:  Am J Med Genet       Date:  2000-07-31

7.  Structure and expression pattern of human ALR, a novel gene with strong homology to ALL-1 involved in acute leukemia and to Drosophila trithorax.

Authors:  R Prasad; A B Zhadanov; Y Sedkov; F Bullrich; T Druck; R Rallapalli; T Yano; H Alder; C M Croce; K Huebner; A Mazo; E Canaani
Journal:  Oncogene       Date:  1997-07-31       Impact factor: 9.867

8.  A new malformation syndrome of long palpebral fissures, large ears, depressed nasal tip, and skeletal anomalies associated with postnatal dwarfism and mental retardation.

Authors:  Y Kuroki; Y Suzuki; H Chyo; A Hata; I Matsui
Journal:  J Pediatr       Date:  1981-10       Impact factor: 4.406

9.  Kabuki make-up syndrome: a syndrome of mental retardation, unusual facies, large and protruding ears, and postnatal growth deficiency.

Authors:  N Niikawa; N Matsuura; Y Fukushima; T Ohsawa; T Kajii
Journal:  J Pediatr       Date:  1981-10       Impact factor: 4.406

10.  Multiple epigenetic maintenance factors implicated by the loss of Mll2 in mouse development.

Authors:  Stefan Glaser; Julia Schaft; Sandra Lubitz; Kristina Vintersten; Frank van der Hoeven; Katharina R Tufteland; Rein Aasland; Konstantinos Anastassiadis; Siew-Lan Ang; A Francis Stewart
Journal:  Development       Date:  2006-03-15       Impact factor: 6.868

View more
  554 in total

1.  Deep sequencing reveals 50 novel genes for recessive cognitive disorders.

Authors:  Hossein Najmabadi; Hao Hu; Masoud Garshasbi; Tomasz Zemojtel; Seyedeh Sedigheh Abedini; Wei Chen; Masoumeh Hosseini; Farkhondeh Behjati; Stefan Haas; Payman Jamali; Agnes Zecha; Marzieh Mohseni; Lucia Püttmann; Leyla Nouri Vahid; Corinna Jensen; Lia Abbasi Moheb; Melanie Bienek; Farzaneh Larti; Ines Mueller; Robert Weissmann; Hossein Darvish; Klaus Wrogemann; Valeh Hadavi; Bettina Lipkowitz; Sahar Esmaeeli-Nieh; Dagmar Wieczorek; Roxana Kariminejad; Saghar Ghasemi Firouzabadi; Monika Cohen; Zohreh Fattahi; Imma Rost; Faezeh Mojahedi; Christoph Hertzberg; Atefeh Dehghan; Anna Rajab; Mohammad Javad Soltani Banavandi; Julia Hoffer; Masoumeh Falah; Luciana Musante; Vera Kalscheuer; Reinhard Ullmann; Andreas Walter Kuss; Andreas Tzschach; Kimia Kahrizi; H Hilger Ropers
Journal:  Nature       Date:  2011-09-21       Impact factor: 49.962

Review 2.  Exome sequencing: a transformative technology.

Authors:  Andrew B Singleton
Journal:  Lancet Neurol       Date:  2011-10       Impact factor: 44.182

3.  How genetically heterogeneous is Kabuki syndrome?: MLL2 testing in 116 patients, review and analyses of mutation and phenotypic spectrum.

Authors:  Siddharth Banka; Ratna Veeramachaneni; William Reardon; Emma Howard; Sancha Bunstone; Nicola Ragge; Michael J Parker; Yanick J Crow; Bronwyn Kerr; Helen Kingston; Kay Metcalfe; Kate Chandler; Alex Magee; Fiona Stewart; Vivienne P M McConnell; Deirdre E Donnelly; Siren Berland; Gunnar Houge; Jenny E Morton; Christine Oley; Nicole Revencu; Soo-Mi Park; Sally J Davies; Andrew E Fry; Sally Ann Lynch; Harinder Gill; Susann Schweiger; Wayne W K Lam; John Tolmie; Shehla N Mohammed; Emma Hobson; Audrey Smith; Moira Blyth; Christopher Bennett; Pradeep C Vasudevan; Sixto García-Miñaúr; Alex Henderson; Judith Goodship; Michael J Wright; Richard Fisher; Richard Gibbons; Susan M Price; Deepthi C de Silva; I Karen Temple; Amanda L Collins; Katherine Lachlan; Frances Elmslie; Meriel McEntagart; Bruce Castle; Jill Clayton-Smith; Graeme C Black; Dian Donnai
Journal:  Eur J Hum Genet       Date:  2011-11-30       Impact factor: 4.246

4.  Multiplexed array-based and in-solution genomic enrichment for flexible and cost-effective targeted next-generation sequencing.

Authors:  Magdalena Harakalova; Michal Mokry; Barbara Hrdlickova; Ivo Renkens; Karen Duran; Henk van Roekel; Nico Lansu; Mark van Roosmalen; Ewart de Bruijn; Isaac J Nijman; Wigard P Kloosterman; Edwin Cuppen
Journal:  Nat Protoc       Date:  2011-11-03       Impact factor: 13.491

Review 5.  Genetic variation in the epigenetic machinery and mental health.

Authors:  Chris Murgatroyd; Dietmar Spengler
Journal:  Curr Psychiatry Rep       Date:  2012-04       Impact factor: 5.285

Review 6.  Genetic [corrected] insights into the causes and classification of [corrected] cerebral palsies.

Authors:  Andres Moreno-De-Luca; David H Ledbetter; Christa L Martin
Journal:  Lancet Neurol       Date:  2012-01-18       Impact factor: 44.182

Review 7.  Applications of targeted gene capture and next-generation sequencing technologies in studies of human deafness and other genetic disabilities.

Authors:  Xi Lin; Wenxue Tang; Shoeb Ahmad; Jingqiao Lu; Candice C Colby; Jason Zhu; Qing Yu
Journal:  Hear Res       Date:  2012-01-14       Impact factor: 3.208

8.  Exome sequencing and diffusion tensor imaging in developmental disabilities.

Authors:  Senthil Sundaram; A H M Huq; Tammy Hsia; Harry Chugani
Journal:  Pediatr Res       Date:  2013-12-06       Impact factor: 3.756

9.  Mirror-image asymmetry in monozygotic twins with kabuki syndrome.

Authors:  A Riess; A Dufke; O Riess; S Beck-Woedl; B Fode; H Skladny; R Klaes; A Tzschach
Journal:  Mol Syndromol       Date:  2012-07-25

10.  Exploring the utility of whole-exome sequencing as a diagnostic tool in a child with atypical episodic muscle weakness.

Authors:  Neil A Hanchard; David R Murdock; Pilar L Magoulas; Matthew Bainbridge; Donna Muzny; YuanQing Wu; Min Wang; James R Lupski; Richard A Gibbs; Chester W Brown
Journal:  Clin Genet       Date:  2012-09-11       Impact factor: 4.438

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.