| Literature DB >> 19915526 |
Sarah B Ng1, Kati J Buckingham, Choli Lee, Abigail W Bigham, Holly K Tabor, Karin M Dent, Chad D Huff, Paul T Shannon, Ethylin Wang Jabs, Deborah A Nickerson, Jay Shendure, Michael J Bamshad.
Abstract
We demonstrate the first successful application of exome sequencing to discover the gene for a rare mendelian disorder of unknown cause, Miller syndrome (MIM%263750). For four affected individuals in three independent kindreds, we captured and sequenced coding regions to a mean coverage of 40x and sufficient depth to call variants at approximately 97% of each targeted exome. Filtering against public SNP databases and eight HapMap exomes for genes with two previously unknown variants in each of the four individuals identified a single candidate gene, DHODH, which encodes a key enzyme in the pyrimidine de novo biosynthesis pathway. Sanger sequencing confirmed the presence of DHODH mutations in three additional families with Miller syndrome. Exome sequencing of a small number of unrelated affected individuals is a powerful, efficient strategy for identifying the genes underlying rare mendelian disorders and will likely transform the genetic analysis of monogenic traits.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19915526 PMCID: PMC2847889 DOI: 10.1038/ng.499
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Clinical characteristics of an individual with Miller syndrome (A,B) and an individual with methotrexate embryopathy (C,D)
A 9 year-old boy with Miller syndrome (A and B) caused by mutations in DHODH. Facial anomalies (A) include cupped ears, coloboma of the lower eyelids, prominent nose, micrognathia and absence of the 5th digits of the feet (B). A 26 year-old man with methotrexate embryopathy (C and D). Note the cupped ears, hypertelorism, sparse eyebrows, and prominent nose (C) accompanied by absence of the 4th and 5th digits of the feet (D). C and D are reprinted with permission from Bawle et al. Teratology 57:51-55 (1978).
Direct identification of the gene for a Mendelian disorder by exome resequencing
Each cell indicates the number of genes with nonsynonymous (NS) variants, splice acceptor and donor site mutations (SS), and coding indels (I). Filtering by either requiring the presence of NS/SS/I variants in siblings (kindred 1 (A+B)), multiple unrelated individuals (columns), or by excluding annotated variants (rows) identifies 26 and 8 candidate genes under a dominant model and only a single candidate gene, DHODH, under a recessive model (light grey cells). Exclusion of mutations predicted to be benign using PolyPhen (row 5) increases sensitivity under a dominant model but excludes DHODH under a recessive model because a variant in kindred 1 is predicted to be benign. A single candidate gene is identified in kindred 1 under a recessive model (dark grey cell) but this candidate is excluded in comparisons with unrelated cases of Miller syndrome. Mutations in this candidate, DNAH5, were found to cause a primary ciliary dyskinesia in kindred 1. The asterisk indicates that a second gene, CDC27, was also identified as a candidate gene but on manual review it appears to be multiple copies of a processed pseudogene that cause a false positive.
| Filter | Kindred 1-A | Kindred 1-B | Kindred1 (A + B) | Kindreds 1+2 | Kindred 1+2+3 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Dominant | Recessive | Dominant | Recessive | Dominant | Recessive | Dominant | Recessive | Dominant | Recessive | |
| 4670 | 2863 | 4687 | 2859 | 3940 | 2362 | 3099 | 1810 | 2654 | 1525 | |
| … not in dbSNP129 | 641 | 102 | 647 | 114 | 369 | 53 | 105 | 25 | 63 | 21 |
| … notin HapMap 8 | 898 | 123 | 923 | 128 | 506 | 46 | 117 | 7 | 38 | 4 |
| … not in either | 456 | 31 | 464 | 33 | 228 | 9 | 26 | 1* | 8 | 1* |
| … AND predicted damaging | 204 | 6 | 204 | 12 | 83 | 1 | 5 | 0 | 2 | 0 |
Summary statistics for exome sequencing of four individuals with Miller Syndrome
The total number of unpaired 76-bp sequencing reads per individual is reported (“Total”), along with the number that map uniquely to the human genome (“Uniquely Mapping”, Maq map score > 0), the number that overlap at least one base of the target space (“Overlapping Target”), and the number left after removing reads with duplicate start sites (“Non-duplicated”). Mean coverage over the whole of CCDS2008 is also given. Called bases refer to bases passing quality and coverage thresholds (Maq consensus quality >= 20 and read depth >= 8X). % of CCDS refers to the fraction of the mappable 26.6 megabases of CCDS2008 (i.e. masked for poorly mappable coordinates, as described in Ng et al. 2009) that is called in each exome.
| Sequencing reads | Called coverage | ||||||
|---|---|---|---|---|---|---|---|
| Kindred-Sibling | Total | Uniquely Mapping | Overlapping Target | Non-duplicated | Mean Coverage | Called Bases | % of CCDS |
| 1-A | 62,974,440 | 52,854,115 | 25,267,592 | 17,872,660 | 36.85 | 25,720,216 | 97 |
| 1-B | 72,539,306 | 61,940,123 | 40,335,280 | 21,971,509 | 44.24 | 25,825,104 | 97 |
| 2 | 63,839,828 | 55,022,098 | 29,987,198 | 19,686,779 | 40.31 | 25,790,427 | 97 |
| 3 | 68,690,600 | 57,970,901 | 36,180,596 | 19,649,281 | 39.81 | 25,617,361 | 96 |
Number of candidate genes identified based on different filtering strategies
Under the dominant model, at least one non-synonymous variant, splice acceptor or donor site variant or coding indel (NS/SS/I) in a gene was required in the gene. Under the recessive model, at least two novel variants were required, and these could be either at the same position (i.e. a homozygous variant) or at two different positions in the same gene (i.e. a potential compound heterozygote, though we were unable to ascertain phase at this stage). In each column is the range for the number of candidate genes for exomes considered individually (column 1) and all combinations of 2 to 4 exomes (columns 2-4). Note that the upper bound on the ranges may be inflated relative to what would be the case if four unrelated, affected individuals had been used because the comparisons in which the two siblings were included provided reduced power compared to unrelated individuals. Columns 5-9 show the number of candidate genes when at least 1, 2, or 3 individuals is required to have one variant in a gene (dominant model) or two or more variants in a gene (recessive model). This is a simple model of genetic heterogeneity or incomplete data. For example, the total number of candidate genes common to any 3 of all 4 exomes is shown in column 9. For columns 5-6, one of the siblings (Kindred 1-B) was not included in the analysis as sibs share 50% of variants.
| Dominant Model | Number of Affected Exomes | Subsets of 3 Exomes | Subsets of all 4 Exomes | |||||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | Any 1 | Any 2 | Any 1 | Any 2 | Any 3 | |
| NS/SS/I | 4645-4687 | 3358-3940 | 2850-3099 | 6658 | 4489 | 6943 | 5167 | 3920 |
| … not in dbSNP129 | 634-695 | 136-369 | 72-105 | 1617 | 274 | 1829 | 553 | 172 |
| … not in HapMap 8 | 898-979 | 161-506 | 55-117 | 2336 | 409 | 2628 | 835 | 222 |
| … not in either | 453-528 | 40-228 | 10-26 | 1317 | 109 | 1516 | 333 | 44 |
| … AND predicted damaging | 204-284 | 10-83 | 3-6 | 682 | 37 | 787 | 126 | 11 |
| Number of Affected Exomes | Subsets of 3 Exomes | Subsets of all 4 Exomes | ||||||
| 1 | 2 | 3 | Any 1 | Any 2 | Any 1 | Any 2 | Any 3 | |
| NS/SS/I | 2780-2863 | 1993-2362 | 1646-1810 | 4097 | 2713 | 4293 | 3172 | 2329 |
| … not in dbSNP129 | 92-115 | 30-53 | 22-31 | 226 | 61 | 270 | 90 | 42 |
| … not in HapMap 8 | 111-133 | 13-46 | 5-13 | 329 | 32 | 397 | 75 | 19 |
| … not in either | 31-45 | 2-9 | 2-3 | 100 | 6 | 121 | 14 | 4 |
| … AND predicted damaging | 6-16 | 0-2 | 0-1 | 35 | 2 | 44 | 4 | 1 |
Summary of DHODH mutations in kindreds with Miller syndrome
| Kindred | Mutation | Exon | Amino acid change | Location |
|---|---|---|---|---|
| 1 | c.454G>A | 4 | p.G152R | chr16:70608443 |
| c.605G>C | 5 | p.G202A | chr16: 70612611 | |
| 2 | c.403C>T | 3 | p.R135C | chr16: 70606041 |
| c.1036C>T | 8 | p.R346W | chr16: 70614936 | |
| 3 | c.595C>T | 5 | p.R199C | chr16: 70612601 |
| c.611delT | 5 | p.L204PfsX8 | chr16: 70612617 | |
| 4 | c.605G>A | 5 | p.G202D | chr16: 70612611 |
| c.730C>T | 6 | p.R244W | chr16: 70613786 | |
| 5 | c.56G>A | 2 | p.G19E | chr16: 70603484 |
| c.1036C>T | 8 | p.R346W | chr16: 70614936 | |
| 6 | c.851C>T | 7 | p.T284I | chr16: 70614596 |
| c.1175A>G | 9 | p.D392G | chr16: 70615586 |
denotes kindreds in which mutations were originally identified by exome resequencing
Figure 2Genomic structure of the exons encoding the open reading frame of DHODH
DHODH is composed of 9 exons that encode untranslated regions (orange) and protein coding sequence (blue). Arrows indicate the locations of 11 different mutations found in 6 families with Miller syndrome.