| Literature DB >> 34906245 |
Vladimir Avramović1,2, Simona Denise Frederiksen1,2, Marjana Brkić3,4, Maja Tarailo-Graovac5,6.
Abstract
BACKGROUND: Genetic variation databases provide invaluable information on the presence and frequency of genetic variants in the 'untargeted' human population, aggregated with the primary goal to facilitate the interpretation of clinically important variants. The presence of somatic variants in such databases can affect variant assessment in undiagnosed rare disease (RD) patients. Previously, the impact of somatic mosaicism was only considered in relation to two Mendelian disease-associated genes. Here, we expand the analyses to identify additional mosaicism-prone genes in blood-derived reference population databases.Entities:
Keywords: Blood-derived reference population databases; Cell proliferation; Clonal hematopoiesis of indeterminate potential (CHIP); Genome sequencing; Hematopoietic genes; Rare diseases
Mesh:
Year: 2021 PMID: 34906245 PMCID: PMC8670043 DOI: 10.1186/s40246-021-00371-y
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Fig. 1Clonal expansion in Clonal Hematopoiesis of Indeterminate Potential. A Somatic driver mutation acquired by a hematopoietic stem cell leads to clonal expansion in the peripheral blood. The mutant clone becomes more abundant in the blood samples, taken as the source of DNA for sequencing. B Allelic imbalance as seen in the Integrative Genomic Viewer [37], compared with the examples of true homozygosity and heterozygosity. Recent studies [7, 15] showed that allelic imbalance can be used as an indicator of variant somatic origin
Fig. 2Variant extraction and processing workflow. The extracted variants, present in Exome Aggregation Consortium (ExAC) [3], were assessed for supporting evidence in the current literature (PubMed peer-reviewed articles) and subsequently, for the ratio of reads using the Integrative Genomic Viewer (IGV; used as an indicator of somatic evidence) [37]. For the final nine genes associated with early onset, severe Mendelian diseases and with good evidence of somatic mosaicism, the assessment of allelic imbalance was compared with gnomAD data [4] (Additional file 1: Table S4)
Summary of analyzed genes, genetic variants and the associated autosomal dominant (AD) diseases
| Characteristics | AD conditions | ||
|---|---|---|---|
| All* | Severe and early onset** | Severe and early onset, allelic imbalance*** | |
| Number of gene–disease associations | 394 | 77 | 23 |
| Number of AD diseases | 390 | 76 | 23 |
| Number of genes | 353 | 72 | 21 |
| Number of genes per AD disease | 1–2 | 1–2 | 1 |
| 1–1 | 1.8 | 9.1 | 17.4 |
| 1–2 | 13.5 | 68.8 | 60.9 |
| 2–1 | 0.5 | 2.6 | 4.3 |
| 2–2 | 3.8 | 19.5 | 17.4 |
| Number of variants | 664 | 115 | 30 |
| Number of variants per AD disease | 1–16 | 1–9 | 1–3 |
| AD variants, % | |||
| > 1 peer-reviewed publication | 60.4 | 67.0 | 80.0 |
| 1 peer-reviewed publication | 39.6 | 33.0 | 20.0 |
| Intron | 0.3 | - | - |
| Missense | 76.7 | 80.0 | 80.0 |
| Nonsense | 16.4 | 14.8 | 16.7 |
| Splice acceptor | 2.4 | 3.5 | - |
| Splice donor | 3.3 | 1.7 | 3.3 |
| Synonymous | 0.8 | - | - |
| Non-coding transcript exon | 0.1 | - | - |
| Pathogenic | 79.7 | 83.5 | 90.0 |
| Pathogenic/likely pathogenic | 7.1 | 8.7 | 10.0 |
| Likely pathogenic | 13.2 | 7.8 | - |
*AD conditions with ClinVar pathogenic and likely pathogenic ClinVar variants in ExAC (Additional file 1: Table S2)
**Severe and early onset AD diseases with ClinVar pathogenic and likely pathogenic variants in ExAC (Additional file 1: Table S3)
***Severe and early onset AD diseases with ClinVar pathogenic and likely pathogenic variants in ExAC with some evidence of allelic imbalance (at least one allele was found with < 35% of read support; Additional file 1: Table S3). Nine of these genes have evidence of mosaicism (at least two alleles show signs of somatic origin (allelic imbalance))
βAge of onset—Severity. Age of onset score 1 means congenital or very early (< 2 years), age of onset score 2 means mostly early (< 18 years) + variable, severity score 1 means severe, significantly reduced mobility or increased mortality in early life, and severity score 2 means severe plus variable expressivity
Analyzed genes and variants with their involvement in blood-related functions and cancer
| Gene** | AD condition | Variant*** | Variant type | Allele count (< 35% of reads) | COSMIC | ||||
|---|---|---|---|---|---|---|---|---|---|
| ExAC | gnomAD | AA mutation | Confirmed somatic | No. of samples | Samples being ‘hematopoietic and lymphoid’ | ||||
| Bohring–Opitz syndrome | NM_015338.5:c.1210C>T► | Nonsense | 3 | 3 | p.R404* | Yes | 10 | 90.0% | |
| NM_015338.5:c.2893C>T► | Nonsense | 1 | 2 | p.R965* | Yes | 17 | 70.6% | ||
| NM_015338.5:c.1117C>T► | Nonsense | 0 | 2 | – | – | – | – | ||
| Cardio-facio-cutaneous | NM_004333.5:c.1799T>A► | Missense | 1 | 2 | p.V600E | Yes | 29,274 | 3.0% | |
| syndrome | NM_004333.5:c.1406G>A | Missense | 1 | 0 | p.G469E | Yes | 28 | 0.0% | |
| Noonan syndrome-like disorder with or without juvenile myelomonocytic leukemia | NM_005188.3:c.1186T>C | Missense | 1 | 0 | p.C396R | Yes | 13 | 100.0% | |
| NM_005188.3:c.1259G>A► | Missense | 2 | 2 | p.R420Q | Yes | 27 | 77.8% | ||
| NM_005188.3:c.1111T>C► | Missense | 1 | 2 | p.Y371H | Yes | 30 | 96.7% | ||
| Tatton–Brown–Rahman syndrome | NM_022552.5:c.2312G>A► | Missense | 3 | 2 | p.R771Q | Yes | 8 | 50.0% | |
| NM_022552.5:c.2644C>T► | Missense | 4 | 3 | p.R882C | Yes | 398 | 98.5% | ||
| NM_022552.5:c.2536C>T | Nonsense | 1 | 0 | p.Q846* | No | 1 | 100.0% | ||
| LADD syndrome/Thanatophoric dysplasia, type I | NM_000142.5:c.1537G>A | Missense | 1 | 0 | – | – | – | – | |
| NM_000142.5:c.746C>G► | Missense | 1 | 0 | p.S249C | Yes | 1,525 | 0.0% | ||
| D-2-hydroxyglutaric aciduria 2 | NM_001289910.1:c.263G>A | Missense | 4 | 3 | – | – | – | – | |
| Noonan syndrome 3/RAS-associated autoimmune leukoproliferative disorder | NM_004985.4:c.40G>A► | Missense | 1 | 0 | p.V14I | Yes | 34 | 14.71% | |
| NM_004985.4:c.35G>A► | Missense | 1 | 1 | p.G12D | Yes | 15,834 | 1.67% | ||
| Noonan syndrome 1 | NM_002834.5:c.1471C>T► | Missense | 1 | NA | – | – | – | – | |
| NM_002834.5:c.794G>A► | Missense | 1 | 0 | p.R265Q | Yes | 4 | 100.0% | ||
| NM_002834.5:c.188A>G► | Missense | 0 | 1 | p.Y63C | No | 4 | 100.0% | ||
| Schinzel–Giedion midface retraction syndrome | NM_015559.3:c.2608G>A► | Missense | 2 | 1 | p.G870S | Yes | 72 | 98.61% | |
A total of 16 variants (80.0% of variants reported here) residing in genes with good evidence of somatic mosaicism were also present in the Catalogue of Somatic Mutations in Cancer (COSMIC; info obtained on February 4, 2021). Across the genes, the variants were found in 1 to 29,274 samples. It is expected that the more frequent a mutation is in cancer samples, the higher the chance is that it is a driver mutation (the mechanism that makes those cells and the variants they carry more abundant in blood). A driver mutation is by definition a genetic change that gives an advantage to the cell. The advantage enables the cell to grow and proliferate better than other cells, which is a hallmark in cancer
AA, amino acid; AD, autosomal dominant; ASXL1, ASXL transcriptional regulator 1; BRAF, B-Raf proto-oncogene, serine/threonine kinase; CBL, Cbl proto-oncogene; COSMIC, Catalogue of Somatic Mutations in Cancer; DNMT3A, DNA methyltransferase 3 alpha; FGFR3, fibroblast growth factor receptor 3; IDH2, isocitrate dehydrogenase (NADP(+)) 2;; LADD, Lacrimo-auriculo-dento-digital; KRAS, KRAS proto-oncogene, GTPase; PTPN11, protein tyrosine phosphatase non-receptor type 11
▲Evidence of somatic mosaicism involving the germline reported by Erickson [39] or Bedoukian et al.[30]
∆Genes involved in stem cell and/or cell population proliferation
►Variant reported to affect or probably affect function based on Leiden Open Variation Database (LOVD; hg19/GRCh37) version 3.0 [40]
*Genes with evidence of mosaicism that overlap with the list of 156 hematopoietic genes provided by Jaiswal et al. [13] in their Additional file 1: Table S2 (n = 7)
**A gene shows good evidence of mosaicism when at least two alleles show signs of somatic origin (allelic imbalance)
***Rare disease- and/or cancer-related known pathogenic or likely pathogenic variants (in ClinVar and/or COSMIC)
Fig. 3Venn diagram illustrating the classification genes according to the selected feature categories. A total of 16, 9 and 25 genes were considered as hematopoietic, mosaic and proliferative, respectively, based on our categorization [13, 38] and assessment of allelic imbalance [7, 15]. The genes with evidence of mosaicism, BRAF, IDH2 and KRAS, belonged to each of the categories (Additional file 1: Table S3). BRAF, B-Raf proto-oncogene, serine/threonine kinase; IDH2, isocitrate dehydrogenase (NADP(+)) 2; KRAS, KRAS proto-oncogene, GTPase;