| Literature DB >> 28487880 |
Christopher A Cassa1,2, Sebastian Akle3, Daniel M Jordan4, Jill A Rosenfeld5.
Abstract
The expanding use of genomic sequencing promises to improve clinical diagnostics and to drive the discovery of new disease genes. Candidate genes are increasingly being identified through recurrent cases (e.g., two or more independent cases ["N of 2"] in which variants are present in the same gene). These second case hits provide statistical evidence of an association, which may then be combined with functional validation or familial segregation studies to bolster the evidence that a gene is truly causal. Here, we discuss how to integrate different forms of functional evidence with human genetics case and segregation data to improve the significance of new disease-gene associations.Entities:
Mesh:
Year: 2017 PMID: 28487880 PMCID: PMC5411689 DOI: 10.1101/mcs.a001099
Source DB: PubMed Journal: Cold Spring Harb Mol Case Stud ISSN: 2373-2873
Modes of new gene discovery that use clinical sequencing data from multiple cases to identify potential gene–phenotype associations
| Mode of discovery | Description |
|---|---|
| Clinical case series ( | In large case series, several patients with similar phenotypes may have similar variants in the same gene, allowing the identification of possible new disease–gene associations. These associations should be statistically and functionally validated before assigning causality, including consideration of variant type (e.g., repeated rare variants vs. repeated de novo variants). |
| Matches identified through the Matchmaker Exchange and/or clinical collaborations | Cases identified across institutions through matchmaking services or collaborations highlight potential disease–gene associations. These too must be checked for potential false-positive associations, given the small sample size, and functionally validated before assigning causality. |
| Large-scale population studies | These studies identify variants in large population cohorts where phenotypic data are available. These cohorts may include populations at large medical centers or consanguineous families that are enriched in identity by descent. Consanguineous population studies are particularly enriched for rare variant types (e.g., nonsense or canonical splicing variants), which may help generate new phenotypic associations at higher rates. |
Evidence for causality from aggregate functional validation assay data in a set of recurrent genes identified in a clinical exome-sequencing program
| Number of genes with importance in functional model | Essential in KBM7 human cell assay | Essential in gene trap assay | Lethal in IMPC mouse knockout |
|---|---|---|---|
| Recurrent genes associated with AD disorders from clinical exome-sequencing case data found to be essential for cellular or embryonic development | 27 | 15 | 4 |
| Expected number of genes in a similarly sized set of unannotated genes found to be essential for cellular or embryonic development | 12.35 | 7.83 | 2.58 |
| χ2
| 0.303 | ||
| Recurrent genes associated with AR disorders from clinical exome sequencing case data found to be essential for cellular or embryonic development | 11 | 7 | 6 |
| Expected number of genes in a similarly sized set of unannotated genes found to be essential for cellular or embryonic development | 7.75 | 4.91 | 1.62 |
| χ2
| 0.172 | 0.271 | |
Statistically significant results are in bold. The unannotated gene set included any gene without a ClinVar or Human Gene Mutation Database (HGMD) annotation (N = 10,719) and was adjusted in size for each gene group. It represents a candidate set of novel genes that might be associated with disease in the future. If a gene is required for cell essentiality, it is significantly more likely to be associated with new autosomal dominant disease genes than a gene with no disease annotations. Conversely, if a gene is required for mouse embryonic development, it is significantly more likely to be lethal in a mouse knockout than any unannotated gene.
IMPC, International Mouse Phenotyping Consortium.
Figure 1.(Left) Histogram showing the number of Human Phenotype Ontology (HPO) categories associated with each patient in the Baylor adult exome-sequencing cohort (Posey et al. 2015). (Right) Histogram showing the number of patients associated with each HPO category in the Baylor adult exome-sequencing cohort.