| Literature DB >> 35373174 |
James Casaletto1,2, Michael Parsons3, Charles Markello1, Yusuke Iwasaki4, Yukihide Momozawa4, Amanda B Spurdle3, Melissa Cline1.
Abstract
More than 40% of the germline variants in ClinVar today are variants of uncertain significance (VUSs). These variants remain unclassified in part because the patient-level data needed for their interpretation is siloed. Federated analysis can overcome this problem by "bringing the code to the data": analyzing the sensitive patient-level data computationally within its secure home institution and providing researchers with valuable insights from data that would not otherwise be accessible. We tested this principle with a federated analysis of breast cancer clinical data at RIKEN, derived from the BioBank Japan repository. We were able to analyze these data within RIKEN's secure computational framework without the need to transfer the data, gathering evidence for the interpretation of several variants. This exercise represents an approach to help realize the core charter of the Global Alliance for Genomics and Health (GA4GH): to responsibly share genomic data for the benefit of human health.Entities:
Year: 2022 PMID: 35373174 PMCID: PMC8975122 DOI: 10.1016/j.xgen.2022.100109
Source DB: PubMed Journal: Cell Genom ISSN: 2666-979X
Summary of the variant data
| Gene | BRCA2 | BRCA2 | BRCA2 | BRCA1 | BRCA2 | BRCA2 | BRCA2 |
|---|---|---|---|---|---|---|---|
| Variant (cDNA HGVS) | c.6325G>A | c.7052C>G | c.943T>A | c.4729T>C | c.4365A>G | c.6131G>T | c.964A>C |
| Variant (protein HGVS) | p.A2351G | p.A2351G | p.C315S | p.S1577P | p.A2351G | p.G2044V | p.K322Q |
| ClinVar classification (May 1, 2021) | B/LB | B/LB | B/LB | B/LB | LB | Conflict | Conflict |
| gnomAD 2.1.1 exome frequency (EAS) | 2.55E–03 | 1.87E–03 | 5.30E–03 | 2.65E–04 | Absent | 4.52E–04 | 4.31E–04 |
| gnomAD 3.1.1 genome frequency (EAS) | 2.39E–03 | 2.02E–03 | 5.03E–03 | 2.02E–04 | 2.01E–03 | 4.52E–03 | 2.41E–03 |
| ACMG/AMP code from gnomAD | BA1 | BA1 | BA1 | BS1 | BS1 | BA1 | BA1 |
| Biobank Japan frequency (Controls) | 1.46E–02 | 3.16E–03 | 1.56E–03 | 1.14E–02 | 4.64E–04 | 3.29E–02 | 2.31E–03 |
| ACMG/AMP frequency from BioBank Japan | BA1 | BA1 | BA1 | BA1 | BS1 | BA1 | BA1 |
| BayesDel score | −0.61 | −0.24 | −0.41 | 0.03 | −0.52 | −0.16 | −0.08 |
| Bioinformatic code | BP4 | BP4 | BP4 | BP4 | BP4 | BP4 | BP4 |
| ACMG/AMP class based on frequency and bioinformatics | B | B | B | B | LB | B | B |
| Gene | BRCA1 | BRCA1 | BRCA2 | BRCA2 | BRCA2 | BRCA2 | N/A |
| Variant (cDNA HGVS) | c.154C>T | c.811G>A | c.5969A>C | c.3395A>G | c.9733T>G | c.5660C>T | N/A |
| Variant (protein HGVS) | p.L52F | p.V271M | p.D1990A | p.K1132R | p.S3245A | p.T1887M | N/A |
| ClinVar classification (May 1, 2021) | Conflict | Conflict | Conflict | VUS | VUS | VUS | N/A |
| gnomAD 2.1.1 exome frequency (EAS) | 1.36E–03 | 1.32E–03 | 0 | Absent | Absent | 1.13E–04 | N/A |
| gnomAD 3.1.1 genome frequency (EAS) | 4.03E-04 | 1.21E-03 | 4.03E-04 | 0.000201 | Absent | Absent | N/A |
| ACMG/AMP code from gnomAD | BA1 | BA1 | BS1 | BS1 | PM2 | BS1 | N/A |
| Biobank Japan frequency (Controls) | 6.78E–03 | 6.28E–03 | 2.61E–03 | 3.75E–03 | 1.01E–03 | 1.69E–04 | N/A |
| ACMG/AMP frequency from BioBank Japan | BA1 | BA1 | BA1 | BA1 | BA1 | BS1 | N/A |
| BayesDel score | 0.14 | 0.06 | −0.08 | −0.2 | −0.47 | −0.29 | N/A |
| Bioinformatic code | BP4 | BP4 | BP4 | BP4 | BP4 | BP4 | N/A |
| ACMG/AMP class based on frequency and bioinformatics | B | B | B | B | B | LB | N/A |
| Gene | BRCA2 | BRCA2 | BRCA2 | BRCA2 | BRCA2 | BRCA2 | N/A |
| Variant (cDNA HGVS) | c.2672T>A | c.587G>T | c.8040C>G | c.358G>A | c.3983G>A | c.6637T>C | N/A |
| Variant (protein HGVS) | p.V891D | p.S196I | p.D2680E | p.V120M | p.S1328N | p.S2213P | N/A |
| ClinVar classification (May 1, 2021) | VUS | VUS | VUS | Absent | Conflict | Conflict | N/A |
| gnomAD 2.1.1 exome frequency (EAS) | Absent | 1.78E-04 | Absent | Absent | 0 | Absent | N/A |
| gnomAD 3.1.1 genome frequency (EAS) | Absent | Absent | 0.000202 | Absent | Absent | Absent | N/A |
| ACMG/AMP code from gnomAD | PM2 | BS1 | BS1 | PM2 | PM2 | PM2 | N/A |
| Biobank Japan frequency (Controls) | 9.69E–04 | 4.64E–04 | 9.69E–04 | 0 | 0 | 0 | N/A |
| ACMG/AMP frequency from BioBank Japan | BS1 | BS1 | BS1 | PM2 | PM2 | PM2 | N/A |
| BayesDel score | −0.05 | −0.22 | −0.05 | −0.48 | −0.57 | −0.06 | N/A |
| Bioinformatic code | BP4 | BP4 | BP4 | BP4 | BP4 | BP4 | N/A |
| ACMG/AMP class based on frequency and bioinformatics | LB | LB | LB | VUS | VUS | VUS | N/A |
The HGVS terms reflect the NM_007294.3 transcript for BRCA1 and NM_000059.3 for BRCA2. Variants are designated as B (Benign), B/LB (Benign or Likely Benign), LB (Likely Benign), Conflict (Conflicting Interpretations), VUS (Uncertain Significance), or Absent (Not Found). All variants scored against the BayesDel in silico predictor with a score of less than 0.3, within the BP4 scoring range. Additionally, two variants were observed to co-occur with two more pathogenic variants in the same gene, indicating that at least one of these co-occurrences must be in trans, which meets the standards of BS2 evidence. In BRCA1, we observed co-occurrences of c.4729T>C with c.1518del and c.188T>A and in BRCA2, we observed co-occurrences of c.964A>C with c.6952C>T, c.5645C>A, and c.6244G>T. While these VUSs had sufficient evidence for classification on allele frequencies only, these co-occurrences add further support to benign classification. We further observed co-occurrences of BRCA2 c.5660C>T with c.1261C>T and c.4365A>G with c.7480C>T, evidence that could support a benign classification if these variants are observed in co-occurrences with different pathogenic variants in other patient cohorts.
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Deposited data | ||
| Sequence and phenotype data | Japanese Genotype-Phenotype Archive | Japanese Genotype-Phenotype Archive: JGAS00000000140 |
| Software and algorithms | ||
| Co-occurrence GitHub repository | This manuscript |
|
| Co-occurrence Dockstore repository | This manuscript |
|
| Python 3.7.3 | Python Software Foundation |
|
| Scikit-allel 1.3.1 | Miles et al.[ |
|
| Pandas 1.3.2 | Pandas development team[ |
|
| Bcftools 1.10.2 | Danecek et al.[ |
|
| Pyensembl 1.8.5 | N/A |
|