| Literature DB >> 33368787 |
Phillip A Richmond1, Tamar V Av-Shalom1, Oriol Fornes1, Bhavi Modi1, Alison M Elliott2, Wyeth W Wasserman1.
Abstract
Mendelian rare genetic diseases affect 5%-10% of the population, and with over 5300 genes responsible for ∼7000 different diseases, they are challenging to diagnose. The use of whole-genome sequencing (WGS) has bolstered the diagnosis rate significantly. The effective use of WGS relies on the ability to identify the disrupted gene responsible for disease phenotypes. This process involves genomic variant calling and prioritization, and is the beneficiary of improvements to sequencing technology, variant calling approaches, and increased capacity to prioritize genomic variants with potential pathogenicity. As analysis pipelines continue to improve, careful testing of their efficacy is paramount. However, real-life cases typically emerge anecdotally, and utilization of clinically sensitive and identifiable data for testing pipeline improvements is regulated and limiting. We identified the need for a gene-based variant simulation framework that can create mock rare disease scenarios, utilizing known pathogenic variants or through the creation of novel gene-disrupting variants. To fill this need, we present GeneBreaker, a tool that creates synthetic rare disease cases with utility for benchmarking variant calling approaches, testing the efficacy of variant prioritization, and as an educational mechanism for training diagnostic practitioners in the expanding field of genomic medicine. GeneBreaker is freely available at http://GeneBreaker.cmmt.ubc.ca.Entities:
Keywords: benchmarking; genomics; rare disease; simulation; variant calling; variant interpretation
Mesh:
Year: 2021 PMID: 33368787 PMCID: PMC8247879 DOI: 10.1002/humu.24163
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1GeneBreaker overview. (a) Overview of GeneBreaker design framework showing user interaction with the website (light blue), connected MySQL tables (red), underlying variant subclasses (dark blue), and output VCF file (yellow). The user interacts with the GeneBreaker website (light blue) which is connected to hidden components for gene description and variant creation/selection. (b) Downstream benchmarking operations enabled by GeneBreaker including splitting variant amongst VCF files according to user‐designed pedigree (yellow), and then either spiking‐in the variant within open source trios for annotation and prioritization testing or simulating the proband variant as a full synthetic simulation via VarSim (purple). VCF, variant call format
A set of rare disease scenarios were designed covering a range of Mendelian inheritance patterns and are comprised of either one or two variants affecting a single gene depending on the inheritance pattern
| Inheritance | Proband sex | Var1 class | Var1 impact | Source | Var2 class | Var2 impact | Source | Gene | Exomiser rank (noninheritance matched) |
|---|---|---|---|---|---|---|---|---|---|
| Autosomal dominant maternal | Male | SNV | Missense | Published: 28111307 | – | – | – | JAK1 | 2 |
| Autosomal dominant de novo | Male | SNV | Missense | ClinVar: 91028 | – | – | – | MSH2 | 1 |
| Autosomal recessive homozygous | Male | SNV | Missense | Published: 24332264 | – | – | – | MALT1 | 1 |
| Autosomal recessive compound heterozygous | Female | SNV | Missense | Novel | Indel | Frameshift | Novel | CFTR | 1 |
| Autosomal recessive compound heterozygous de novo | Male | SNV | Stop‐loss | Novel | SNV | Missense | ClinVar: 217654 | INPP5E | N/A (7) |
| X‐linked dominant de novo | Female | Indel | Frameshift | Novel | SNV | Nonsense | Novel | MECP2 | 1 |
| X‐linked recessive homozygous | Male | SNV | Missense | Published: 32202653 | – | – | – | WAS | 1 |
| X‐linked recessive compound heterozygous de novo | Female | Indel | Frameshift | Novel | SNV | Nonsense | ClinVar: 11696 | SLC6A8 | N/A (2) |
| X‐linked recessive hemizygous de novo | Male | Indel | Frameshift | ClinVar: 11303 | – | – | – | ABCD1 | 2 |
| Y‐linked de novo | Male | Indel | Coding | Novel | – | – | – | SRY | N/A (N/A) |
Note: As many inheritance patterns are sex‐specific, the intended proband sex is included. Each variant has the class (SNV or indel), impact on the gene, and source. For variants from the literature or ClinVar, the ClinVar ID or PubMed ID is provided. Exomiser rank is provided, counting the rank of the simulated gene amongst its matching inheritance gene category, and in parentheses is the rank within other inheritance categories.
Simulated scenarios for noncoding SNVs and indels; CNVs, MEIs, and STRs; and variants in the dark genome
| Inheritance | Proband sex | Var1 class | Var1 impact | Source | Var2 class | Var2 impact | Source | Gene |
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| Autosomal recessive compound heterozygous | Male | SNV | Intronic | ClinVar: 99243 | Indel | Frameshift | Novel |
|
| Autosomal recessive compound heterozygous | Female | SNV | Intronic | ClinVar: 6770 | SNV | Nonsense | Novel |
|
| Autosomal dominant de novo | Male | Indel | Intronic | ClinVar: 411336 | – | – | – |
|
| Autosomal dominant de novo | Female | SNV | UTR | ClinVar: 556698 | – | – | – |
|
| Autosomal recessive compound heterozygous | Male | SNV | UTR | ClinVar: 412264 | SNV | Nonsense | ClinVar: 523079 |
|
|
| ||||||||
| Autosomal dominant de novo | Female | CNV (partial DEL) | Coding | Novel | – | – | – |
|
| Autosomal recessive compound heterozygous | Female | CNV (full DEL) | Coding | Novel | Indel | Frameshift | Novel |
|
| Autosomal recessive homozygous | Female | MEI | Intron | Published: 28187749 | – | – | – |
|
| X‐linked dominant de novo | Female | MEI | Coding | Novel | – | – | – |
|
| Autosomal recessive homozygous | Female | STR | UTR | Published: 30970188 | – | – | – |
|
| X‐linked recessive compound heterozygous | Female | CNV | Coding | Novel | SNV | Nonsense | ClinVar: 282841 |
|
|
| ||||||||
| Autosomal recessive homozygous | Female | CNV (partial DEL) | Coding | Published: 32066871 | – | – | – |
|
| Autosomal dominant de novo | Female | Indel | Frameshift | Novel | – | – | – |
|
| Autosomal dominant de novo | Female | Indel | Frameshift | Novel | – | – | – |
|
| Autosomal dominant de novo | Female | Indel | Frameshift | Novel | MEI | Coding | Novel |
|
Note: The inheritance, sex, and variant information for variant 1 and variant 2 are shown, including their class, impact, and source (novel, ClinVar ID, or PubMed ID).
Rare disease scenarios created for teaching purposes, labeled as case 1–10, with the sex (M = male, F = female), inheritance model, gene and disease association, ClinVar ID for variant 1, ClinVar ID for variant 2 (if applicable, otherwise “–”), HPO IDs, and HPO phenotype terms
| Case No. | Sex | Inheritance | Gene; disease | Var 1 ClinVar ID | Var 2 ClinVar ID | HPO terms | Features (matched to HPO terms) |
|---|---|---|---|---|---|---|---|
| 1 | M | X‐linked recessive hemizygous |
| 424355 | – | “HP:0000964,” “HP:0001873,” “HP:0005537,” “HP:0000388,” and “HP:0002573” | Eczema, thrombocytopenia, small platelets, otitis media, and hematochezia. The family history is significant for a maternal uncle with autoimmune problems and a recent diagnosis of lymphoma. |
| 2 | M | Autosomal dominant de novo |
| 91055 | – | “HP:0040275,” “HP:0012378,” and “HP:0002027” | Adenocarcinoma of the large intestine, fatigue, and abdominal pain. Negative family history. |
| 3 | F | Autosomal recessive homozygous |
| 662739 | – | “HP:0000964,” “HP:0001581,” “HP:0004386,” “HP:0002090,” and “HP:0002205” | Eczema, recurrent skin infections, gastrointestinal inflammation, pneumonia, and recurrent respiratory infections. Parents are consanguineous (first cousins). |
| 4 | M | Autosomal recessive compound heterozygous |
| 618928 | 554293 | “HP:0002024,” “HP:0002206,” “HP:0002205,” and “HP:0001738” | Malabsorption, pulmonary fibrosis, recurrent respiratory infections, and exocrine pancreatic insufficiency. Parents are unrelated and of British ancestry. |
| 5 | F | X‐linked dominant de novo |
| 424171 | – | “HP:0001250,” “HP:0001257,” “HP:0005484,” and “HP:0002376” | Seizure, spasticity, postnatal microcephaly, and developmental regression. Negative family history. |
| 6 | M | X‐linked recessive hemizygous de novo |
| 458629 | – | “HP:0001250,” “HP:0000709,” “HP:0008207,” “HP:0002180,” and “HP:0002500” | Seizure, psychosis, primary adrenal insufficiency, neurodegeneration, and abnormality of the cerebral white matter. Negative family history. |
| 7 | F | Autosomal dominant paternal |
| 666310 | – | “HP:002342,” “HP:0002553,” “HP:0000494,” “HP:0000448,” “HP:0000347,” and “HP:0011304” | Moderate intellectual disability, arched eyebrows, downslanting palpebral fissures, prominent nose, micrognathia, and broad thumbs. Father with significant learning problems and similar facial features. |
| 8 | M | Autosomal dominant de novo |
| 40488 | – | “HP:0004322,” “HP:0032318,” “HP:0000475,” “HP:0000368,” “HP:0000316,” “HP:0000445,” and “HP:0003196” | Short stature, congenital heart disease, wide neck, low set posteriorly rotated ears, widely spaced eyes, and short and broad nose. Negative family history. |
| 9 | F | Autosomal dominant paternal |
| 449397 | – | “HP:0011800,” “HP:0008625,” “HP:0000545,” and “HP:0000175” | Flat midface, sensorineural hearing loss, myopia, and cleft palate. Father with severe myopia. |
| 10 | M | Autosomal dominant maternal |
| 658274 | – | “HP:0001249,” “HP:0004322,” “HP:0000252,” “HP:0000325,” “HP:0000400,” “HP:0000316,” “HP:0000637,” and “HP:0001572” | Intellectual disability, short stature, microcephaly, triangular face, large and prominent ears, hypertelorism, long palpebral fissures, and macrodontia. Mother similarly affected. |