| Literature DB >> 35955564 |
Ionut-Florin Iancu1,2, Irene Perea-Romero1,2, Gonzalo Núñez-Moreno1,2,3, Lorena de la Fuente1,3, Raquel Romero1,2, Almudena Ávila-Fernandez1,2, María José Trujillo-Tiebas1,2, Rosa Riveiro-Álvarez1,2, Berta Almoguera1,2, Inmaculada Martín-Mérida1,2, Marta Del Pozo-Valero1,2, Alejandra Damián-Verde1, Marta Cortón1,2, Carmen Ayuso1,2, Pablo Minguez1,2,3.
Abstract
The introduction of NGS in genetic diagnosis has increased the repertoire of variants and genes involved and the amount of genomic information produced. We built an allelic-frequency (AF) database for a heterogeneous cohort of genetic diseases to explore the aggregated genomic information and boost diagnosis in inherited retinal dystrophies (IRD). We retrospectively selected 5683 index-cases with clinical exome sequencing tests available, 1766 with IRD and the rest with diverse genetic diseases. We calculated a subcohort's IRD-specific AF and compared it with suitable pseudocontrols. For non-solved IRD cases, we prioritized variants with a significant increment of frequencies, with eight variants that may help to explain the phenotype, and 10/11 of uncertain significance that were reclassified as probably pathogenic according to ACMG. Moreover, we developed a method to highlight genes with more frequent pathogenic variants in IRD cases than in pseudocontrols weighted by the increment of benign variants in the same comparison. We identified 18 genes for further studies that provided new insights in five cases. This resource can also help one to calculate the carrier frequency in IRD genes. A cohort-specific AF database assists with variants and genes prioritization and operates as an engine that provides a new hypothesis in non-solved cases, augmenting the diagnosis rate.Entities:
Keywords: carrier frequency; gene prioritization; genetic rare diseases; inherited retinal dystrophies; variant prioritization; variants of uncertain significance
Mesh:
Year: 2022 PMID: 35955564 PMCID: PMC9368980 DOI: 10.3390/ijms23158431
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Figure 1A heterogeneous cohort of rare diseases with the diagnostic status and variant composition in IRD cases. (A) The cohort of patients with suspected rare genetic diseases at the Genetics Department of HU-FJD was divided into three subcohorts. An IRD subcohort of 1766 samples, other-eye-related diseases (OERD) of 386 cases, and a pseudocontrol subcohort of non-eye-related diseases (NRD) with 3531 samples. (B) IRD diagnostic status of the samples included in IRD subcohort was solved and non-solved. The subcohort of non-solved IRD cases with no candidate variants represent the 25%. C) Flow chart of different filters applied to variants according to quality control (QC) and population (POP) filters. (D) Summary of the variants included in the database in an average IRD case; values represent the average of all IRD samples. (E) Proportion of pathogenic and VUS variants detected in IRD cases in the genes with more pathogenic variants in IRD solved cases; top 5 genes are shown.
Figure 2Comparison of the percentage of deleterious variants in prioritized and non-prioritized variants in solved and non-solved IRD patients. Variant AFs are compared in inherited retinal dystrophies (IRD) solved (A) and non-solved (B) subcohorts against pseudocontrols (PC) using fold changes (FC). IRD more frequent variants (IRD-MFVs), highlighted in dark, were defined using FC thresholds at 0.90 percentiles of all FC in each comparison (A,B). Proportion of deleterious and benign variants in both solved (C) and non-solved IRD cases (D) and the p-values representing the enrichment of deleterious variants in IRD-MFVs. Enrichment analyses are also performed dividing the IRD-MFVs according to the genes in which they are located; they are grouped into IRD genes, other eye-related diseases (OERD genes), and other non-eye-related diseases (NRD genes) (E,F). Total number of deleterious variants in each group is noted at the top of the red bars. Non-significant p-values are marked as “ns”.
Identification of most frequent variants in IRD (IRD-MFV) allowed for the reevaluation of non-solved IRD cases. A total of 8 cases (one per row) gained knowledge in their reassessment. The gene panel column refers to the type of diseases that the gene has been associated with, inherited retinal dystrophies (IRD), other eye-related diseases (OERD), and other non-eye-related diseases (NRD). In the phenotype, abbreviations are RCD: rod-cone dystrophy, RP: retinitis pigmentosa, and MD: macular dystrophy. In variant column: HGVSc, HGVSp, and ACMG classification are included. In “Other Variants”, for those biallelic cases, the second variant found after reviewing the case is included. Genotype column has information about the genotype of the variant(s) in the sample. Diagnostic status column has the new diagnostic status of the case after the reanalysis, as (1) “solved”, when the case has a conclusive diagnosis thanks to the variant(s) identified; or (2) “non-solved”, in any other case. Non-solved cases were annotated according to findings that may help in a future diagnosis as (2.1) “partially solved”, if a heterozygous pathogenic or likely pathogenic variant is found within a recessive gene that fits the observed phenotype; and (2.2) “with evidence”, when the variants identified in the analysis are: (2.2.1) pathogenic/likely pathogenic variants in genes not yet associated with the disease but with some evidence published in the literature or with overlapping phenotypes, or (2.2.2) VUS in a gene associated with the phenotype when one VUS in dominant or 1–2 VUS in recessive genes were found.
| Gene | Gene Panel | Phenotype | Prioritized Variant (HGVSc, HGVSp, and ACMG Classification) | Other variants | Genotype | Diagnostic Status |
|---|---|---|---|---|---|---|
|
| IRD | RCD | NM_022124.6: c.6050–9G > A | - | Monoallelic 0/1 | Partially solved |
|
| IRD | Usher | NM_000260.4: c.1996C > T | NM_000260.4: c.3764del | Biallelic 0/1, 0/1 | Solved |
|
| ONRD | RP | NM_001318491.2: c.538G > T | - | Monoallelic 0/1 | With evidence |
|
| NRD | MD | NM_001008537.3: c.4385del | - | Monoallelic 0/1 | With evidence |
|
| IRD | RP | NM_000370.3: c.227_235del | - | Monoallelic 0/1 | With evidence |
|
| IRD | MD | NM_000370.3: c.227_235del | - | Monoallelic 0/1 | With evidence |
|
| IRD | RP | NM_033100.4: c.2410_2485del | - | Monoallelic 0/1 | Partially solved |
|
| IRD | RP | NM_033100.4: c.2410_2485del | NM_033100.4: c.476C > A (p. Ala159Glu) | Biallelic 0/1, 0/1 | Partially solved |
Reclassification of VUS, with information added about the difference in their frequency in cases and controls. The ACMG classification of VUS with log2 fold change (FC) higher than 1.5 is reevaluated. Gene code, variants in nucleotide code, and protein code are provided. The inheritance mode is annotated as autosomal recessive (AR), autosomal dominant (AD), and x-linked (XL). In the “Varsome” column, we add the automatic ACMG classification of the variants provided by the Varsome database at the time of writing. Previous ACMG criteria fulfilled by the cases are annotated in the “Criteria” column. Column “PS4” provides the new ACMG classification reached after including PS4 criteria. In column “Status”, reclassified variants are marked.
| Gene | HGVSc | HGVSp | Inheritance | log2(FC) | Varsome | Criteria | PS4 | Status |
|---|---|---|---|---|---|---|---|---|
|
| NM_001256789.3: | - | XL | 2.7 | 3 | PM2, BP4 | 4 | Reclassified |
|
| NM_022124.5: | p. Glu1411Lys | AR | 2.0 | 3 | PM2, PP3 | 4 | Reclassified |
|
| NM_001171971.3: | p. Thr530Ser | AR | 2.6 | 3 | PM1, PM2, and BP1 | 4 | Reclassified |
|
| NM_080629.2: | p. Thr1613Asn | AD | 1.6 | 3 | PM2, PP2 | 4 | Reclassified |
|
| NM_001001557.4: | p. Gly42Val | AR/AD | 2.0 | 3 | PM1, PM2, PP5, and BP6 | 4 | Reclassified |
|
| NM_016247.4: | p. His487Leu | AR/AD | 2.7 | 3 | PM2, BP4 | 4 | Reclassified |
|
| NM_006343.3: | p. Tyr812Cys | AR | 2.0 | 3 | PM1, PM2, PP3, and BP6 | 4 | Reclassified |
|
| NM_022567.2: c.505A>G | p. Asn169Asp | XL | 2.9 | 3 | PM2 | 3 | Not reclassified |
|
| NM_003611.2: | p. Asp29Glu | XL | 2.7 | 3 | PM2, PP3, and BP1 | 4 | Reclassified |
|
| NM_006269.2: | p. Phe833Leu | AR/AD | 1.7 | 3 | PM2, BP4 | 4 | Reclassified |
|
| NM_001145853.1: | p. Pro533Ser | AR/AD | 3.7 | 3; 4 | PM2, PP3 | 4 | Reclassified |
Figure 3Genes with a higher accumulated pathogenicity in solved and non-solved IRD cases compared to pseudocontrols. Mean fold changes (FCs) in log2 scale for deleterious (Y-axis) and benign variants (X-axis) are shown for each gene. For IRD solved cases, significative genes with at least eight deleterious variants are shown in the plot (A). For IRD non-solved cases, all significative genes are shown (B).
Genes with more frequent deleterious variants in IRD non-solved cases than in pseudocontrols. The genes are sorted by the gene panel where they are present (IRD, OERD, and NRD) and by number of deleterious variants. The gene panel column refers to the type of diseases that the gene has been associated with, inherited retinal dystrophies (IRD), other eye-related diseases (OERD), and other non-eye-related diseases (NRD). Number of deleterious and benign variants in IRD non-solved cases is shown. FDR stands for false discovery rate of the Wilcoxon rank sum test performed. Inheritance mode associated with each gene is annotated in the “Inheritance” column as recessive, dominant, or R/D (either recessive or dominant).
| Gene | Deleterious | Benign | FDR | Gene Panel | Inheritance |
|---|---|---|---|---|---|
|
| 10 | 42 | 1.08 × 10−3 | IRD | Recessive |
|
| 9 | 43 | 9.81 × 10−4 | IRD | Recessive |
|
| 7 | 22 | 3.56 × 10−2 | IRD | Recessive |
|
| 6 | 43 | 2.75 × 10−2 | IRD | Dominant |
|
| 12 | 59 | 4.55 × 10−3 | OERD | Recessive |
|
| 11 | 12 | 1.32 × 10−2 | OERD | Recessive |
|
| 8 | 11 | 1.87 × 10−2 | OERD | Recessive |
|
| 7 | 13 | 1.32 × 10−2 | OERD | Recessive |
|
| 6 | 44 | 1.87 × 10−2 | OERD | R/D |
|
| 6 | 43 | 1.87 × 10−2 | OERD | Recessive |
|
| 6 | 80 | 1.87 × 10−2 | OERD | Dominant |
|
| 5 | 16 | 2.70 × 10−2 | OERD | Recessive |
|
| 5 | 8 | 1.32 × 10−2 | OERD | Recessive |
|
| 5 | 15 | 1.87 × 10−2 | OERD | R/D |
|
| 9 | 91 | 1.32 × 10−2 | NRD | Recessive |
|
| 7 | 7 | 1.28 × 10−2 | NRD | Recessive |
|
| 6 | 22 | 1.32 × 10−2 | NRD | Recessive |
|
| 6 | 7 | 1.87 × 10−2 | NRD | Recessive |
New diagnostic status produced by the reassessment of non-solved IRD cases with deleterious variants in the genes prioritized. Details about five IRD non-solved cases with variants adding knowledge to the phenotype. The gene panel column refers to the type of diseases that the gene has been associated with, inherited retinal dystrophies (IRD), and other eye-related diseases (OERD). In the phenotype, abbreviations are MD: macular dystrophy. In variant column, HGVSc, HGVSp, and ACMG classification are included. Genotype column has information about the genotype of the variant(s) in the sample. Diagnostic status column has the new diagnostic status of the case after the reanalysis, as (1) “solved”, when the case has a conclusive diagnosis thanks to the variant(s) identified; or (2) “non-solved”, in any other case. Non-solved cases were annotated according to findings that may help in a future diagnosis as (2.1) “partially solved”, if a heterozygous pathogenic or likely pathogenic variant is found within a recessive gene that fits the observed phenotype; and (2.2) “with evidence”, when the variants identified in the analysis are: (2.2.1) pathogenic/likely pathogenic variants in genes not yet associated with the disease but with some evidence published in the literature or with overlapping phenotypes, or (2.2.2) VUS in a gene associated with the phenotype when one VUS in dominant or 1–2 VUS in recessive genes were found.
| Gene | Gene Panel | Phenotype | Variant (HGVSc, HGVSp, ACMG Classification) | Genotype | Diagnostic Status |
|---|---|---|---|---|---|
|
| IRD | MD | NM_001080463.2: c.3793C>T | Biallelic 0/1, 0/1 | With evidence |
|
| IRD | MD | NM_001080463.2: c.988C>T | Monoallelic 0/1 | Partially solved |
|
| IRD | MD | NM_001080463.2: c.7966C>T | Monoallelic 0/1 | Partially solved |
|
| IRD | Usher syndrome | NM_000260.4: c.1996C>T | Biallelic 0/1,0/1 | Solved |
|
| OERD | Lens luxation | NM_001288607.2: c.2594G>A | Biallelic 0/1,0/1 | Solved |
Figure 4Carrier frequency of pathogenic variants in recessive IRD genes in pseudocontrols. The carrier frequency (CF) was calculated for all IRD genes with a recessive inheritance pattern, and at least one solved case in our cohort. Green represents the CF of the genes and orange the frequency in our IRD subcohort. The pseudocontrol subcohort was composed of 3531 cases, and the IRD subcohort had 1766 cases. The top 10 genes with higher CF are ordered decreasingly.