| Literature DB >> 30204945 |
Leslie J Burke1, Jan Sevcik1,2, Gaetana Gambino1,3, Emma Tudini1,4, Eliseos J Mucaki5, Ben C Shirley6, Phillip Whiley1,4, Michael T Parsons4, Kim De Leeneer7, Sara Gutiérrez-Enríquez8, Marta Santamariña9, Sandrine M Caputo10, Elizabeth Santana Dos Santos10,11,12, Jana Soukupova2, Marketa Janatova2, Petra Zemankova2, Klara Lhotova2, Lenka Stolarova2, Mariana Borecka2, Alejandro Moles-Fernández8, Siranoush Manoukian13, Bernardo Bonanni14, Stacey L Edwards4, Marinus J Blok15, Thomas van Overeem Hansen16, Maria Rossing16, Orland Diez8,17, Ana Vega9, Kathleen B M Claes7, David E Goldgar18, Etienne Rouleau19, Paolo Radice20, Paolo Peterlongo21, Peter K Rogan5,6, Maria Caligo3, Amanda B Spurdle4, Melissa A Brown1.
Abstract
The widespread use of next generation sequencing for clinical testing is detecting an escalating number of variants in noncoding regions of the genome. The clinical significance of the majority of these variants is currently unknown, which presents a significant clinical challenge. We have screened over 6,000 early-onset and/or familial breast cancer (BC) cases collected by the ENIGMA consortium for sequence variants in the 5' noncoding regions of BC susceptibility genes BRCA1 and BRCA2, and identified 141 rare variants with global minor allele frequency < 0.01, 76 of which have not been reported previously. Bioinformatic analysis identified a set of 21 variants most likely to impact transcriptional regulation, and luciferase reporter assays detected altered promoter activity for four of these variants. Electrophoretic mobility shift assays demonstrated that three of these altered the binding of proteins to the respective BRCA1 or BRCA2 promoter regions, including NFYA binding to BRCA1:c.-287C>T and PAX5 binding to BRCA2:c.-296C>T. Clinical classification of variants affecting promoter activity, using existing prediction models, found no evidence to suggest that these variants confer a high risk of disease. Further studies are required to determine if such variation may be associated with a moderate or low risk of BC.Entities:
Keywords: BRCA1; BRCA2; breast cancer; promoter; transcription; variants of unknown clinical significance (VUS)
Mesh:
Substances:
Year: 2018 PMID: 30204945 PMCID: PMC6282814 DOI: 10.1002/humu.23652
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Overview of study design. Outline of the workflow of variant collection, prioritization and analysis
Samples used in this study
| Location | Institution | Samples | Gene region |
|---|---|---|---|
| Paris | Institut Curie, Saint Cloud | 686 cases |
|
| Milan | IFOM, Fondazione Instituto FIRC di Oncologia Molecolare |
772 cases 661 controls |
|
| Pisa | Department of Translational Research and New Technologies in Medicine, University of Pisa | 80 cases |
|
| Santiago de Compostela | Fundación Pública Galega de Medicina Xenómica‐SERGAS, Grupo de Medicina Xenómica‐USC, CIBERER, IDIS |
270 cases 130 controls |
|
| Copenhagen | Center for Genomic Medicine | 1157 cases |
|
| Ghent | Center for Medical Genetics, Ghent University Hospital | 357 cases |
|
| Barcelona | Vall d'Hebron Institute of Oncology | 192 cases |
|
| Prague | CZECANCA – CZEch CAncer panel for Clinical Aplication, Institute of Biochemistry and Experimental Oncology |
2961 cases 312 controls |
|
| Maastricht | Department of Clinical Genetics, Maastricht University Medical Centre | 900 cases |
|
Figure 2Variants identified in the 5′ regions of BRCA1 and BRCA2 map to predicted regulatory elements. Snapshots of the UCSC genome browser showing regions of BRCA1 (a) and BRCA2 (b) analyzed by targeted sequencing with available ENCODE regulatory marks derived from MCF7 cells. Chromatin segregation states from regulatory region annotation are shown (MCF7 states). The BRCA1 and BRCA2 genomic regions used for functional analyses are highlighted in grey. Prioritized variants within these regions are indicated
BRCA1 prioritized variants
| Gene | hg19 position (chr17) | Variant name | rsID | Global MAF in dbSNP | TF motif (ENCODE) | Bioinformatic priority |
|---|---|---|---|---|---|---|
|
| g.41277676A>T | c.‐408T>A | Novel | CEBPB | High/medium | |
|
| g.41277648C>T | c.‐380G>A | Novel | RXRA | High/medium | |
|
| g.41277646G>T | c.‐378C>A | rs186775935 | 0.00040 | RXRA | High/medium |
|
| g.41277583del | c.‐315del | rs901029407 | 0.00003 | ATF1,2,3, CREB1 | Medium |
|
| g.41277555G>A | c.‐287C>T | Novel | NFYA, NFYB | High/medium | |
|
| g.41277541C>T | c.‐273G>A | rs112960339 | 0.00499 | Medium | |
|
| g.41277532A>C | c.‐264T>G | rs904148166 | 0.00003 | Medium | |
|
| g.41277488G>T | c.‐220C>A | Novel | Medium | ||
|
| g.41277460A>G | c.‐192T>C | rs113323025 | 0.00519 | Medium |
TF, transcription factors.
Based on NM_007294.3.
Overlap with TF motif in ENCODE TF‐ChIP datasets from all cells.
Variant overlaps this motif, but the deletion does not alter the motif sequence.
BRCA2 prioritized variants
| Gene | hg19 Position (Chr13) | Variant name | rsID | Global MAF in dbSNP | TF motif (ENCODE) | Bioinformatic priority |
|---|---|---|---|---|---|---|
|
| g.32889437G>A | c.‐407G>A | rs36221751 | 0.0018 | Medium | |
|
| g.32889449C>T | c.‐395C>T | Novel | Medium | ||
|
| g.32889548C>T | c.‐296C>T | rs563971900 | 0.0004 | PAX5 | High/medium |
|
| g.32889564delG | c.‐280del | Novel | ELF1, GABPA, ELK1,4 | High | |
|
| g.32889576C>G | c.‐268C>G | Novel | High/medium | ||
|
| g.32889626G>A | c.‐218G>A | Novel | Medium | ||
|
| g.32889644C>T | c.‐200C>T | Novel | MAZ | Medium | |
|
| g.32889647A>C | c.‐197A>C | rs370721506 | NA | MAZ | Medium |
|
| g.32889669C>T | c.‐175C>T | rs55880202 | 0.0058 | Medium | |
|
| g.32889711T>G | c.‐133T>G | Novel | Medium | ||
|
| g.32889757T>G | c.‐87T>G | Novel | Medium/low | ||
|
| g.32889762G>C | c.‐82G>C | Novel | Medium/low |
NA, no data available, TF, transcription factors.
Based on NM_000059.3.
Overlap with TF motif in ENCODE TF‐ChIP datasets from all cells.
Figure 3Variants mapping to the 5′ regions of BRCA1 and BRCA2 alter promoter activity in MCF7 and MDA‐MB‐468 breast cancer cells. MCF7 (a and c) and MDA‐MB‐468 cells (b and d) were transfected with pGL3 vectors where luciferase expression is controlled by a portion of the BRCA1 (B1) (a and b) or BRCA2 (B2) (c and d) promoter. Cells were transfected with plasmids containing the wild‐type (WT) promoter sequence (grey bars), positive control (B1‐Ets or B2‐Ets; striped bars) or the indicated variants (black bars). Luciferase expression was normalized to a cotransfected pRL‐TK plasmid. Data represent the average of three independent biological replicates ± standard deviation (SD). The horizontal dotted line represents WT promoter activity set at 1.0‐fold. The vertical dotted lines demarcate individual experiments that include WT, positive control, and variant containing plasmids. (* P ˂ 0.05; ** P ˂ 0.01, *** P ˂ 0.005, **** P ˂ 0.0001)
Information theory analysis of prioritized BRCA1/2 variants
| Variant name | TF motif (ENCODE) | Consequences |
|---|---|---|
|
| CEBPB | CEBPB site weakened (did not meet stringent filtering thresholds) |
|
| RXRA | Weak RXRA and IRF3 sites weakened, HNF4G site weakened. |
|
| RXRA | RXR unchanged, HSF1 site lost and GR site created |
|
| ATF1,2,3, CREB1 | TCF7L2 site lost and POU2F2 created |
|
| NFYA, NFYB | NFYA and NFYB sites lost, weak PBX3 site created |
|
| Altered TF strength did not fulfill stringent filtering thresholds | |
|
| BHLHE32 and MYC sites created. | |
|
| Altered TF strength did not fulfill stringent filtering thresholds | |
|
| ETS1 site created, weak RFX5 site strengthened. | |
|
| Weak MEF2A site strengthened, GATA2 site lost. | |
|
| TEAD4 site lost. | |
|
| PAX5 | PAX5 site weakened. |
|
| ELF1, GABPA, ELK1,4 | GABPA site unchanged, MXI1 andTCF3 sites lost. |
|
| Altered TF strength did not meet filtering thresholds | |
|
| Altered TF strength did not meet filtering thresholds | |
|
| MAZ | KLF1 site abolished. |
|
| MAZ | SP4 weakened, GR site weakened, TCF3 site created |
|
| Altered TF strength did not fulfill stringent filtering thresholds | |
|
| Altered TF strength did not fulfill stringent filtering thresholds | |
|
| Altered TF strength did not fulfill stringent filtering thresholds | |
|
| Altered TF strength did not fulfill stringent filtering thresholds |
Variant overlaps this motif, but the deletion does not alter the motif sequence.
Change in information did not fulfill stringent filtering criteria, where [A] site R i < R sequence–1 standard deviation of TF model, or [B] where ΔR i < 4 bits.
No MAZ binding model available.
Figure 4Variants in the 5′ regions of BRCA1 alter DNA:protein complex formation. Electrophoretic mobility shift assay (EMSA) reactions were performed with 3′ biotinylated double‐stranded DNA probes from the BRCA1 5′ region and nuclear extracts (NE) from (a) MCF7 or (b) MDA‐MB‐468 cells. DNA probes contained either wild‐type (WT) or variant (Var) sequences. Free unbound probe (FP) and probe bound by nuclear proteins (BP) are indicated
Figure 5Variant sequences in the BRCA1 5′ region alter specific DNA:protein complex formation. Competition electrophoretic mobility shift assay (EMSAs) were performed using 3′ biotinylated double‐stranded DNA probes containing sequences from the BRCA1 5′ region surrounding the B1:c.‐315del (a and b) and B1:c.‐287C>T (c) variants. DNA probes containing the wild‐type (WT) or variant (Var) sequence were incubated with nuclear extracts from MCF7 cells (MCF7 NE) or MDA‐MB 468 cells (468 NE) in the presence (+) or absence (–) of unlabeled WT, Var, or nonspecific (NS) competitor (Comp) DNA. Free unbound probe (FP) and specific DNA:protein complexes (arrowheads) are indicated. Supershift experiments (d) were performed with the BRCA1:c.‐287C (WT) probe and antibodies to NFYA, Oct‐2 (POU2F2) and PAX5. The supershifted NFYA complex is indicated by asterisk (*)
Figure 6Variants in the 5′ region of BRCA2 alter specific DNA:protein complex formation. Competition electrophoretic mobility shift assay (EMSAs; a) were performed using 3′ biotinylated double‐stranded (ds) DNA probes containing sequences from the BRCA2 5′ region surrounding the BRCA2:c.‐296C>T variant. DNA probes containing the wild‐type (WT) or variant (Var) sequence were incubated with nuclear extracts from MCF7 cells (MCF7 NE) in the presence (+) or absence (–) of unlabeled WT, Var, or nonspecific (NS) competitor (Comp) DNA. Cross‐competition EMSAs (b) contained BRCA2 WT sequences and increasing concentrations of ds competitor DNA containing unlabeled WT, Var, or PAX5 binding sites from the hCD19 gene and D‐amino acid oxidase gene (hDAO). Free unbound probe (FP) and specific DNA:protein complexes (arrowheads) are indicated
Classification of prioritized variants
| Gene | Genomic location (hg19) | HGVS c. nomenclature | Luciferase result | Combined interpretation of frequency data & multifactorial analysis | Highest MAF (population, database) | Prior probability of pathogenicity | Segregation Bayes score (# families) | Tumor histopathology likelihood ratio (# tumors) | Combined odds for causality | Posterior probability of pathogenicity |
|---|---|---|---|---|---|---|---|---|---|---|
|
| g.41277676A>T | c.‐408T>A | No effect | Uncertain | 0.02 | |||||
|
| g.41277648C>T | c.‐380G>A | No effect | Uncertain | 0.02 | 1.67 (1) | 1.67 | NA | ||
|
| g.41277646G>T | c.‐378C>A | No effect | Uncertain | 0.0015 (African, 1,000 Genomes) | 0.02 | ||||
|
| g.41277583del | c.‐315del | Decrease | Uncertain | 0.02 | |||||
|
| g.41277555G>A | c.‐287C>T | Increase | Uncertain | 0.02 | 0.64 (1) | 0.64 | NA | ||
|
| g.41277541C>T | c.‐273G>A | No effect | Not pathogenic | 0.0159 (African, 1,000 Genomes) | 0.02 | ||||
|
| g.41277532A>C | c.‐264T>G | No effect | Uncertain | 0.02 | 0.51 (1) | 0.51 | NA | ||
|
| g.41277488G>T | c.‐220C>A | No effect | Uncertain | 0.02 | |||||
|
| g.41277460A>G | c.‐192T>C | Decrease | Not pathogenic | 0.0159 (African, 1,000 Genomes) | 0.02 | ||||
|
| g.32889437G>A | c.‐407G>A | No effect | Not pathogenic | 0.0080 (Prague, this study) | 0.02 | 0.55 (6) | 0.55 | NA | |
|
| g.32889449C>T | c.‐395C>T | No effect | Uncertain | 0.02 | |||||
|
| g.32889548C>T | c.‐296C>T | Decrease | Not pathogenic | 0.0080 (Prague, this study) | 0.02 | 3.07 (1) | 1.91 (8) | 5.87 | 0.1069 |
|
| g.32889564delG | c.‐280del | No effect | Uncertain | 0.02 | 0.69 (1) | 0.69 | NA | ||
|
| g.32889576C>G | c.‐268C>G | No effect | Uncertain | 0.02 | |||||
|
| g.32889626G>A | c.‐218G>A | No effect | Likely not pathogenic | 0.02 | 0.52 (1) | 0.72 (1) | 0.38 | 0.0076 | |
|
| g.32889644C>T | c.‐200C>T | No effect | Likely not pathogenic | 0.02 | 0.37 (1) | 0.37 | 0.0075 | ||
|
| g.32889647A>C | c.‐197A>C | No effect | Not pathogenic | 0.0014 (African, FLOSSIES) | 0.02 | 1.08 (1) | 1.08 | NA | |
|
| g.32889669C>T | c.‐175C>T | No effect | Not pathogenic | 0.0197 (African, FLOSSIES) | 0.02 | ||||
|
| g.32889711T>G | c.‐133T>G | No effect | Uncertain | 0.02 | |||||
|
| g.32889757T>G | c.‐87T>G | No effect | Uncertain | 0.02 | |||||
|
| g.32889762G>C | c.‐82G>C | No effect | Uncertain | 0.02 |
NA, not applicable: multifactorial classification not assigned as the combined odds of causality were insufficient (≥0.5 and ≤2) to derive a posterior probability of pathogenicity (Vallee et al., 2016).
Not pathogenic based on frequency > 1% in an outbred sampleset.
Variant allele assigned a low prior probability of pathogenicity of 0.02 assuming conservatively that 2/100 of such variants might be associated with a high risk of cancer and allele frequency ≥0.001 and < 0.01 in outbred sample set.
Posterior probabilities used to assign IARC 5‐tier class as described in Plon et al., 2008.