| Literature DB >> 20711175 |
Sarah B Ng1, Abigail W Bigham, Kati J Buckingham, Mark C Hannibal, Margaret J McMillin, Heidi I Gildersleeve, Anita E Beck, Holly K Tabor, Gregory M Cooper, Heather C Mefford, Choli Lee, Emily H Turner, Joshua D Smith, Mark J Rieder, Koh-Ichiro Yoshiura, Naomichi Matsumoto, Tohru Ohta, Norio Niikawa, Deborah A Nickerson, Michael J Bamshad, Jay Shendure.
Abstract
We demonstrate the successful application of exome sequencing to discover a gene for an autosomal dominant disorder, Kabuki syndrome (OMIM%147920). We subjected the exomes of ten unrelated probands to massively parallel sequencing. After filtering against existing SNP databases, there was no compelling candidate gene containing previously unknown variants in all affected individuals. Less stringent filtering criteria allowed for the presence of modest genetic heterogeneity or missing data but also identified multiple candidate genes. However, genotypic and phenotypic stratification highlighted MLL2, which encodes a Trithorax-group histone methyltransferase: seven probands had newly identified nonsense or frameshift mutations in this gene. Follow-up Sanger sequencing detected MLL2 mutations in two of the three remaining individuals with Kabuki syndrome (cases) and in 26 of 43 additional cases. In families where parental DNA was available, the mutation was confirmed to be de novo (n = 12) or transmitted (n = 2) in concordance with phenotype. Our results strongly suggest that mutations in MLL2 are a major cause of Kabuki syndrome.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20711175 PMCID: PMC2930028 DOI: 10.1038/ng.646
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Number of genes common to any subset of x affected individuals
The number of genes with at least one non-synonymous variant (NS), splice-site acceptor/donor variants (SS) or coding indel (I) are listed under various filters. Variants were filtered by presence in dbSNP or 1000 genomes ("Not in dbSNP129 or 1000 genomes") and control exomes ("Not in control exomes") or both ("Not in either"); control exomes refer to those from 8 Hapmap3 4 FSS3, 4 Miller2 and 10 EGP samples. The number of genes found using the union of the intersection of x individuals is given.
| a. Subset analysis (any | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| NS/SS/I | 12,042 | 8,722 | 7,084 | 6,049 | 5,289 | 4,581 | 3,940 | 3,244 | 2,486 | 1,459 |
| Not in dbSNP129 or 1000 genomes | 7,419 | 2697 | 1057 | 488 | 288 | 192 | 128 | 88 | 60 | 34 |
| Not in control exomes | 7,827 | 2865 | 1025 | 399 | 184 | 90 | 50 | 22 | 7 | 2 |
| Not in either | 6,935 | 2227 | 701 | 242 | 104 | 44 | 16 | 6 | 3 | 1 |
| Is loss-of-function (nonsense/frameshift indel) | 753 | 49 | 7 | 3 | 2 | 2 | 1 | 0 | 0 | 0 |
Number of genes common in sequential analysis of phenotypically ranked individuals
Variants were filtered as in Table 1. Exomes were added sequentially to the analysis by ranked phenotype, e.g. column "+ 3" shows the number of genes at the intersection of the three top ranked cases. (Supplementary Fig. 1). The gene with at least one NS/SS/I in all individuals is MUC16 which is very likely to be a false positive due to its extreme length (14,507 aa).
| b. Sequential analysis | 1 | +2 | +3 | +4 | +5 | +6 | +7 | +8 | +9 | +10 |
|---|---|---|---|---|---|---|---|---|---|---|
| NS/SS/I | 5,282 | 3,850 | 3,250 | 2,354 | 2,028 | 1,899 | 1,772 | 1,686 | 1,600 | 1,459 |
| Not in dbSNP129 or 1000 genomes | 687 | 214 | 145 | 84 | 63 | 54 | 42 | 40 | 39 | 34 |
| Not in control exomes | 675 | 134 | 50 | 26 | 13 | 13 | 8 | 5 | 4 | 2 |
| Not in either | 467 | 89 | 34 | 18 | 9 | 8 | 4 | 4 | 3 | 1 |
| Is loss-of-function (nonsense/frameshift indel) | 25 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
Analysis of exome variants using genomic evolutionary rate profiling
The number of genes with at least a single novel variant with an rejected substitution (RS) score10 > 0 in at least x individuals is given. A gene rank is assigned based on the average GERP score9 over all observed novel variants with RS score > 0 in all affected individuals.
| c. GERP Score analysis (at least | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
|---|---|---|---|---|---|---|---|---|---|---|
| Variant RS score > 0 | 7,176 | 2,360 | 754 | 269 | 106 | 39 | 20 | 11 | 3 | 1 |
| MLL2 Rank | 3,732 | 1,232 | 399 | 136 | 47 | 14 | 6 | 3 | NA | NA |
Figure 1Genomic structure and allelic spectrum of MLL2 mutations that cause Kabuki syndrome
MLL2 is composed of 54 exons that encode untranslated regions (orange) and protein coding sequence (blue) including 7 PHD fingers (yellow), FYRN (green), FYRC (green), and a SET domain (red). Arrows indicate the locations of 32 different mutations found in 53 families with Kabuki syndrome including: 20 nonsense, 7 indels, and 5 amino acid substitutions. Asterisks indicate mutations that were confirmed to be de novo and crosses indicate cases for which parental DNA was unavailable.