| Literature DB >> 33851121 |
Cole Shulman1,2, Emerald Liang1,2, Misato Kamura3, Khalil Udwan1,2, Tony Yao1,2, Daniel Cattran1,2,4,5, Heather Reich1,2,4,5, Michelle Hladunewich1,2,4,5, York Pei1,2,4,5, Judy Savige6, Andrew D Paterson7,8, Mary Ann Suico3, Hirofumi Kai3, Moumita Barua1,2,4,5.
Abstract
RATIONALE &Entities:
Keywords: ARUP; Alport syndrome; ClinVar; FSGS; LOVD; computational predictions; genomics; gnomAD; in silico predictions; type IV collagen variants
Year: 2021 PMID: 33851121 PMCID: PMC8039416 DOI: 10.1016/j.xkme.2020.12.007
Source DB: PubMed Journal: Kidney Med ISSN: 2590-0595
Figure 1Number of rare missense variants predicted to be pathogenic in: (A) the focal segmental glomerulosclerosis (FSGS) study cohort and (B) Genome Aggregation Database (gnomAD). For rare missense COL4A3, COL4A4, and COL4A5 variants in our FSGS cohort, 43% (6/14), 40% (4/10), and 33% (2/6) were predicted to be deleterious by at least 10 of 12 programs, respectively. For rare missense COL4A3, COL4A4, and COL4A5 variants identified in gnomAD, 35% (301/851), 32% (306/949), and 41% (197/483) were predicted to be deleterious by at least 10 of 12 programs, respectively.
Figure 2Comparison of COL4A3, COL4A4, and COL4A5 in silico predictions with disease database categorization. For ARUP COL4A5 pathogenic variants, there was 97% (317/327) concordance with in silico predictions. For ClinVar COL4A3/A4/A5 pathogenic variants, there was 75% (12/16), 100% (9/9), and 89% (258/287) concordance with in silico predictions, respectively. For LOVD COL4A3/A4/A5 pathogenic variants, there was 82% (28/34), 86% (42/49), and 94% (650/699) concordance. Congruency of in silico predictions was similar for variants categorized as benign, with the exception of COL4A5 variants documented in ARUP and ClinVar, in which the effects were overestimated by in silico programs, though there were fewer variants to interrogate. In ARUP, 57% (4/7) of COL4A5 variants were classified correctly by in silico predictions. In ClinVar, 100% (6/6), 100% (9/9), and 71% (5/7) of COL4A3/A4/A5 variants, respectively, were correctly assigned. Finally, for LOVD, 100% (2/2), 85% (23/37), and 100% (6/6) of COL4A3/A4/A5 variants were correctly classified.
Figure 3Spearman correlation coefficient heatmap comparing results of various prediction models. Most programs had similar prediction scores when comparing with each other except for FATHMM and M-CAP. Dark blue signifies a strong direct correlation while dark red signifies a strong indirect correlation. Squares that are lighter in color signify a weak correlation between the results of the 2 prediction models. Figure created using the corrplot package available in Rstudio.
Comparison of Functional Annotation With In Silico Predictions for Pathogenic COL4A3 and COL4A5 Variants Identified in the FSGS Cohort
| Gene | No. of Pathogenic Variants | No. Predicted Deleterious by 10/12 Programs | No. of Variants With Secretory Defect | Congruence |
|---|---|---|---|---|
| 3 | 3 | 3 | 100% | |
| 6 | 6 | 6 | 100% |
Note: All pathogenic COL4A3 and COL4A5 variants were categorized as such as a result of being identified in other kidney disease cases. All pathogenic variants demonstrated a secretory defect with functional characterization and were correctly assigned by in silico predictions.
Abbreviation: FSGS, focal segmental glomerulosclerosis.
Figure 4Functional characterization of COL4A3 and COL4A5 using the split-luciferase assay. Scatterplots of the intracellular/secreted relative light unit (RLU) ratio from human embryonic kidney 293 (HEK293T) cells expressing (A, B) mutant α3 chain or (C, D) mutant α5 chain compared to wild type (WT) using N-terminal and C-terminal split-luciferase tagged constructs. Pathogenic α3 and α5 chain mutants showed clearer secretory defect with N-terminal tagged constructs. Solid line: Y = X, dotted line: Y = X + 50, Y = X – 50. Square, WT; red circles are pathogenic variants and grey circles are variants of uncertain significance. Any data point under the −50 line was considered as a significant secretory defect. Experiments were performed in triplicate. Data presented are representative of 2 independent experiments.
Comparison of Functional Annotation With In Silico Predictions for Variants of Uncertain Significance in COL4A3 and COL4A5 Identified in the FSGS Cohort
| Gene | No. of Variants of Uncertain Significance | No. Predicted Deleterious by 10/12 Programs | No. of Variants With Secretory Defect | Congruence |
|---|---|---|---|---|
| 4 | 3 | 0 | 0% | |
| 4 | 2 | 1 | 50% |
Note: Only 1 variant of uncertain significance in COL4A5 was found to have a secretory defect, which was accurately predicted by in silico predictions. However, 1 variant of uncertain significance without evidence of a secretory defect was also predicted to be deleterious.
Abbreviation: FSGS, focal segmental glomerulosclerosis.