| Literature DB >> 31905201 |
Peter Grešner1, Ewa Jabłońska2, Jolanta Gromadzińska3.
Abstract
A case-control study was conducted in which we evaluated the association between genetic variability of DNA repair proteins belonging to the Rad51 family and breast cancer (BrC) risk. In the study, 132 female BrC cases and 189 healthy control females were genotyped for a total of 14 common single nucleotide polymorphisms (SNPs) within Rad51 and Xrcc3. Moreover, our previously reported Rad51C genetic data were involved to explore the nonlinear interactions among SNPs within the three genes and effect of such interactions on BrC risk. The rare rs5030789 genotype (-4601AA) in Rad51 was found to significantly decrease the BrC risk (OR = 0.5, 95% CI: 0.3-1.0, p<0.05). An interaction between this SNP, rs2619679 and rs2928140 (both in Rad51), was found to result in a two three-locus genotypes -4719AA/-4601AA/2972CG and -4719AT/-4601GA/2972CC, both of which were found to increase the risk of BrC (OR = 8.4, 95% CI: 1.8-38.6, p<0.0001), instead. Furthermore, rare Rad51 rs1801320 (135CC) and heterozygous Xrcc3 rs3212057 (10343GA) genotypes were found to respectively increase (OR = 10.6, 95% CI: 1.9-198, p<0.02) and decrease (OR = 0.0, 95% CI: 0.0-NA, p<0.05) the risk of BrC. Associations between these SNPs and BrC risk were further supported by outcomes of employed machine learning analyses. In Xrcc3, the 4541A/9685A haplotype was found to be significantly associated with reduced BrC risk (OR = 0.5; 95% CI: 0.3-0.9; p<0.05). Concluding, our study indicates a complex role of SNPs within Rad51 (especially rs5030789) and Xrcc3 in BrC, although their significance with respect to the disease needs to be further clarified.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31905201 PMCID: PMC6944361 DOI: 10.1371/journal.pone.0226976
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of the groups of subjects involved in the study.
| Feature | BrC cases | Control subjects |
|---|---|---|
| Total number | 132 | 189 |
| Age [years] | 57 [36–86] | 43 [35–54] |
| Smoking status [never/ever] | 55/63 (0.47/0.53) | 120/68 (0.64/0.36) |
| Pack-years | 11.3 [6.0–20.0] | 2.0 [1.0–4.6] |
| Tumor staging | ||
| T [1/2/3/4/x] | 44/45/1/6/1 | - |
| N [0/1/2/3/x] | 44/24/10/2/20 | - |
| Tumor grade | ||
| G [1/2/3/x] | 7/37/40/13 | - |
Numerical data for age and pack-years presented as median [range]. Smoking status [never/ever-smokers] expressed as absolute [relative] counts.
a p << 0.001; BrC cases vs. controls; Mann-Whitney U test
b p < 0.005; BrC cases vs. controls; two-sided Mid-P test
c Tumor staging classification (absolute counts): T—describes the size of the primary tumor and its invasiveness, N—describes the regional lymph nodes involved, x—data unavailable.
Fig 1Rad51 and Xrcc3 gene and protein structures.
Rad51 and Xrcc3 gene and protein structures are shown with graphic representation of the localization of all 14 SNPs investigated in this study. The positions of individual SNPs in respective genes are indicated by arrows annotated with respective working codes used in this study. Localizations of rs3212057 (X4), rs861539 (X6) and rs28903081 (X7) are indicated also within the Xrcc3 protein structure, as these were the only three SNPs localized within coding regions of exons. Rad51 and Xrcc3 are placed on chromosomes 15 and 14 and span 37 and 18 kb, respectively. Their protein products consist of 339 and 346 amino acids, respectively, showing high degree of conservativeness. They both share highly conserved Rad51-like C-terminal domain with approx. 250 amino acids (responsible for DNA binding) within which two subdomains can be distinguished: the RecA-like ATP-binding domain (responsible for ATP binding and hydrolysis) and the RecA-like monomer-monomer interface. The RecA-like ATP-binding domain embodies two highly conserved consensus motifs (Walker A and B) conferring ATP-binding and -hydrolysis activities. Distributed within this domain is also the multimere BRC interface (pink lines in the protein structure) facilitating interaction with DNA. X4, X6 and X7 are all localized within Rad51-like C-terminal domain, while the rs861539 (X6) hits exactly the threonine-241 taking part in interaction with DNA. Rad51 consists additionally of the helicase-hairpin-helicase (HhH) domain responsible for nonspecific DNA binding [42,45].
Resume of SNPs genotyped in the study.
| Working code | rs-code | Designation | Other designations | SNP position | MAF |
|---|---|---|---|---|---|
| RA | rs2619679 | c.-1389T>A | -4719A/T | 5' near gene (promoter) | T: 0.49 |
| RB | rs5030789 | c.-1271A>G | -4601A/G | 5' near gene (promoter) | A: 0.42 |
| RC | rs1801320 | c.−98G>C | 135G/C | Exon 2 (5'UTR) | C: 0.08 |
| RD | rs1801321 | c.−61G>T | 172G/T | Exon 2 (5'UTR) | T: 0.42 |
| RE | rs2619680 | c.−3+795C>A | 1037A/C | Intron 3 | A: 0.50 |
| RF | rs2619681 | c.−3+1398T>C | 1640C/T | Intron 3 | T: 0.15 |
| RG | rs2928140 | c.−2−602G>C | 2972C/G | Intron 3 | G: 0.49 |
| X1 | rs1799794 | c.−316A>G | 4541A/G | Exon 2 (5'UTR) | G: 0.22 |
| X2 | rs45603942 | c.−281C>T | 4576C/T | Exon 2 (5'UTR) | |
| X3 | rs861530 | c.194−571A>G | 9685A/G | Intron 5 | A: 0.31 |
| X4 | rs3212057 | c.281G>A | 10343G/A; p.Arg94His | Exon 6 | A: 0.03 |
| X5 | rs1799796 | c.562-14A>G | 17893A/G | Intron 7 (IVS6'14) | G: 0.30 |
| X6 | rs861539 | c.722C>T | 18067C/T; p.Thr241Met | Exon 8 | T: 0.39 |
| X7 | rs28903081 | c.905G>A | p.Arg302His | Exon 10 |
SNP nomenclature, assigned working symbols, localization and minor allele frequencies (MAF) are provided for general European population according to data obtained from the dbSNP database [42]. Table partially taken from [46], nevertheless some corrections have been introduced due to data actualization in dbSNP database.
Predicted effects of SNPs analyzed in the study.
| #rs | Fold-change in transcription factor binding affinity caused by given SNP | Polyphen-2 | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Code | Bmal-1 | Clock | Creb-3 | E2F-2 | E2F-4 | AP-1 | jun-D | Myc | N-myc | NF-κB1 | NRF1 | p53 | p63 | Pax-6 | Pax-8 | RAR-α | RAR-γ | Sp1 | Wt1 | class | score | |
| rs2619679 | RA | - | - | 0.33 | - | 3.27 | 4.68 | 4.45 | - | - | 4.41 | - | - | - | 3.37 | 3.47 | - | - | - | - | - | - |
| rs5030789 | RB | - | 0.18 | - | 0.21 | 0.08 | - | - | 0.24 | 0.22 | - | 3.57 | 9.02 | 3.95 | 5.38 | - | 3.72 | - | - | - | - | - |
| rs1801320 | RC | 9.86 | 3.95 | - | - | 0.32 | - | - | 15.91 | 21.22 | - | - | 0.26 | - | - | - | - | 0.30 | 0.11 | 0.26 | - | - |
| rs1801321 | RD | - | - | 0.11 | 0.05 | 6.24 | - | - | 0.16 | - | 7.32 | 0.10 | - | - | 3.61 | 22.98 | 0.32 | 3.72 | 0.09 | 21.65 | - | - |
| rs2619680 | RE | - | 0.27 | - | - | - | - | - | 4.58 | - | 0.10 | 11.45 | - | 0.21 | 0.13 | - | 4.28 | 6.24 | - | - | - | - |
| rs2619681 | RF | 0.07 | - | - | - | - | 0.16 | 0.09 | 0.17 | 0.29 | 0.24 | - | - | - | - | - | - | - | - | - | - | - |
| rs2928140 | RG | - | - | - | - | - | 0.24 | 0.27 | - | - | 0.25 | - | - | - | - | - | - | 3.87 | 0.26 | 0.22 | - | - |
| rs1799794 | X1 | - | - | 0.17 | - | - | - | - | 0.30 | - | 3.61 | - | - | - | - | 3.61 | - | 3.17 | 9.02 | - | - | - |
| rs45603942 | X2 | - | - | 5.59 | - | - | 0.29 | - | - | - | - | - | - | - | - | - | 3.50 | 14.68 | - | - | - | - |
| rs861530 | X3 | 0.04 | 0.29 | - | 0.26 | 3.02 | - | - | 0.20 | 0.09 | - | - | 8.93 | 5.65 | - | - | - | - | - | - | - | - |
| rs3212057 | X4 | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | neutral | 0.334 |
| rs1799796 | X5 | 0.17 | 0.13 | 6.00 | - | - | - | - | - | - | - | - | - | - | - | 5.43 | - | 0.21 | - | - | - | - |
| rs861539 | X6 | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | deleterious | 0.541 |
| rs28903081 | X7 | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | - | neutral | 0.001 |
Results of PERFECTOS-APE and Polyphen-2 in-silico analyses for non-coding and non-synonymous SNPs, respectively, analyzed in the study. Effects of non-coding SNP were predicted for several well-known transcription factors (TF) against the HOCOMOCO-11 [69] database and are provided as fold changes in respective TF binding affinity caused by given SNP, calculated as the ratio of probability of finding a better scoring TF binding motif by chance given the wild-type and variant SNP allele, respectively [43]. Only 3-fold or larger changes are shown, values above 3.0 indicate TF binding affinity upregulation, values below 0.33 indicate downregulation. All provided values are statistically significant at the level of p<0.05 or higher. For each non-synonymous SNP the classification of SNP provided by the Polyphen-2 algorithm together with respective probability value is provided [44].
Summary on primer sequences, amplicons, PCR cycling conditions and enzymatic digestion for individual SNPs genotyped in the study [46].
| rs# of SNP | Primer sequences (Forward/Reverse) | Amplicon length | PCR | Enzymatic digestion |
|---|---|---|---|---|
| rs2619679 | 402 bp | 95°C(45s) 60°C(45s) 72°C(60s) 28x | ||
| rs5030789 | ||||
| rs1801320 | 399 bp | |||
| rs1801321 | TaqMan Pre-Designed SNP Genotyping Assay (Life Technologies; Assay ID: C___7482700_10) | 95°C(15s) 61.5°C(60s) 50x | ||
| rs2619680 | 485 bp | 95°C(45s) 65°C(45s) 72°C(60s) 29x | ||
| rs2619681 | 245 bp | 95°C(45s) 63°C(45s) 72°C(60s) 28x | ||
| rs2928140 | 332 bp | |||
| rs1799794 | 505 bp | 95°C(45s) 64°C(45s) 72°C(60s) 28x | ||
| rs45603942 | 515 bp | |||
| rs861530 | 497 bp | |||
| rs3212057 | 467 bp | |||
| rs1799796 | 400 bp | |||
| rs861539 | 452 bp | |||
| rs28903081 | 445 bp | |||
a PCR conditions are provided as temperature (duration) of denaturation, annealing and extension phase, respectively, followed by the number of cycles for a given SNP. Each PCR reaction was finished by a final 10 min-long extension phase.
b Digestion/enzyme deactivation temperatures, respectively, are provided for each enzyme. Digestion was always performed for 20 minutes. In the case of MvaI, the reaction was stopped by adding 20 mM of EDTA according to the manufacturer’s instruction.
c rs2619679 and rs5020789 were co-amplified in one PCR reaction tube using only one pair of primers and following the PCR reaction were divided into separate tubes and subjected to endonuclease digestion separately.
d rs1801321 in Rad51 was genotyped by means of the real-time PCR technique using commercially available TaqMan SNP Genotyping Assay Kit and detailed conditions of the reaction are provided elsewhere [46].
Observed genotype and haplotype frequencies together with respective odds ratios (ORs) and corresponding 95% confidence intervals for Rad51 polymorphisms in breast cancer cases and control subjects.
| AA | 32 (0.250) | 49 (0.265) | A: 1.0 [0.7–1.4] | ||
| AT | 73 (0.570) | 98 (0.530) | D: 1.0 [0.6–1.7] | All: 1.0 [0.7–1.4] | |
| TT | 23 (0.180) | 38 (0.205) | R: 1.0 [0.5–1.7] | W/R: 0.9 [0.5–1.9] | |
| GG | 41 (0.320) | 51 (0.277) | A: 0.7 [0.5–1.0] | ||
| GA | 71 (0.555) | 90 (0.489) | D: 0.7 [0.4–1.3] | All: 0.7 [0.5–1.0] | |
| AA | 16 (0.125) | 43 (0.234) | |||
| GG | 93 (0.830) | 109 (0.845) | A: 1.5 [0.9–2.6] | ||
| GC | 11 (0.098) | 19 (0.147) | D: 1.2 [0.6–2.5] | All: 1.6 [0.9–2.8] | |
| CC | 8 (0.071) | 1 (0.008) | |||
| GG | 43 (0.326) | 72 (0.381) | A: 1.1 [0.8–1.6] | ||
| GT | 72 (0.545) | 89 (0.471) | D: 1.2 [0.7–2.0] | All: 1.1 [0.8–1.5] | |
| TT | 17 (0.129) | 28 (0.148) | R: 1.0 [0.5–2.0] | W/R: 1.2 [0.5–2.4] | |
| AA | 30 (0.244) | 49 (0.288) | A: 1.0 [0.7–1.5] | ||
| AC | 71 (0.577) | 89 (0.524) | D: 1.1 [0.6–1.9] | All: 1.1 [0.8–1.5] | |
| CC | 22 (0.179) | 32 (0.188) | R: 1.0 [0.5–1.8] | W/R: 1.0 [0.5–2.2] | |
| CC | 100 (0.787) | 143 (0.769) | A: 0.8 [0.5–1.3] | ||
| CT | 27 (0.213) | 40 (0.215) | D: 0.8 [0.5–1.5] | All: 0.8 [0.5–1.4] | |
| TT | 0 (0.000) | 3 (0.016) | R: 0.0 [ | W/R: 0.0 [ | |
| CC | 28 (0.217) | 49 (0.265) | A: 1.0 [0.7–1.5] | ||
| CG | 77 (0.597) | 97 (0.524) | D: 1.2 [0.7–2.2] | All: 1.1 [0.8–1.4] | |
| GG | 24 (0.186) | 39 (0.211) | R: 0.8 [0.5–1.5] | W/R: 1.0 [0.5–2.1] | |
| (RD,RE,RF): TAC | 104 (0.405) | 142 (0.377) | 1.1 [0.9–1.3] | ||
| (RD,RE,RF): GCC | 93 (0.362) | 126 (0.336) | 1.1 [0.8–1.6] | ||
| (RD,RE,RF): GAC | 32 (0.124) | 59 (0.156) | 0.8 [0.5–1.2] | ||
| (RD,RE,RF): GCT | 28 (0.108) | 46 (0.122) | 0.9 [0.5–1.4] | ||
| (RD,RE,RF): TCC | 0 (0.000) | 2 (0.006) | |||
| (RD,RE,RF): GAT | 0 (0.000) | 1 (0.003) | |||
A) Single-site analyses showed statistically significant BrC risk-reducing effect for rs5030789 and rs1801320 under recessive genetic model, only. B) Results of haplotype analysis for Rad51 SNPs showed no observable associations between Rad51 haplotypes and BrC. Data presented as absolute (relative) frequencies of individual genotypes observed in BrC cases and controls; NA—not applicable;
a Genetic model employed in order to analyze the association between genotype distribution and cancer: A—additive genetic model; D—dominant genetic model; R—recessive genetic model; All—direct comparison of the frequency of alleles; W/R—comparison of wild-type and rare homozygous genotypes.
b OR values adjusted for age and smoking status, together with 95% CI determined by logistic regression.
c Significant differences in genotype distributions between the BrC and control groups were sought for by two-sided exact mid-P test.
d p < 0.05.
e p = 0.06.
f p = 0.07.
Observed genotype and haplotype frequencies together with respective odds ratios (ORs) and corresponding 95% confidence intervals for Xrcc3 polymorphisms in breast cancer cases and control subjects.
| AA | 68 (0.535) | 100 (0.610) | Add: 1.1 [0.8–1.6] | ||
| AG | 51 (0.402) | 52 (0.317) | Dom: 1.2 [08–2.0] | All: 1.2 [0.8–1.7] | |
| GG | 8 (0.063) | 12 (0.073) | Rec: 0.9 [0.3–2.2] | W/R: 0.9 [0.3–2.5] | |
| CC | 120 (1.000) | 186 (1.000) | Add: | ||
| CT | 0 (0.000) | 0 (0.000) | Dom: | All: | |
| TT | 0 (0.000) | 0 (0.000) | Rec: | W/R: | |
| GG | 53 (0.438) | 79 (0.434) | Add: 0.9 [0.6–1.4] | ||
| GA | 59 (0.488) | 88 (0.484) | Dom: 1.0 [0.6–1.6] | All: 1.0 [0.7–1.4] | |
| AA | 9 (0.074) | 15 (0.082) | Rec: 0.8 [0.3–1.9] | W/R: 0.8 [0.3–1.9] | |
| GG | 125 (1.000) | 173 (0.961) | Add: 0.0 [ | ||
| GA | 0 (0.000) | 7 (0.039) | Dom: 0.0 [ | ||
| AA | 0 (0.000) | 0 (0.000) | Rec: | W/R: | |
| AA | 50 (0.417) | 75 (0.412) | Add: 1.2 [0.8–1.6] | ||
| AG | 48 (0.400) | 84 (0.462) | Dom: 1.0 [0.6–1.7] | All: 1.1 [0.8–1.6] | |
| GG | 22 (0.183) | 23 (0.126) | Rec: 1.7 [0.9–3.4] | W/R: 1.6 [0.8–3.3] | |
| CC | 57 (0.449) | 71 (0.436) | Add: 1.0 [0.7–1.5] | ||
| CT | 56 (0.441) | 74 (0.454) | Dom: 0.9 [0.6–1.5] | All: 1.0 [0.7–1.4] | |
| TT | 14 (0.110) | 18 (0.110) | Rec: 1.3 [0.6–2.8] | W/R: 1.4 [0.6–3.2] | |
| GG | 121 (1.000) | 188 (1.000) | Add: | ||
| GA | 0 (0.000) | 0 (0.000) | Dom: | All: | |
| AA | 0 (0.000) | 0 (0.000) | Rec: | W/R: | |
| (X1,X3): AG | 171 (0.684) | 246 (0.669) | 1.1 [0.8–1.5] | ||
| (X1,X3): GA | 65 (0.259) | 79 (0.215) | 1.3 [0.9–1.9] | ||
| (X1,X3): AA | 14 (0.056) | 41 (0.110) | |||
| (X1,X3): GG | 0 (0.000) | 2 (0.006) | |||
| (X5,X6): GC | 93 (0.370) | 128 (0.348) | 1.1 [0.8–1.5] | ||
| (X5,X6): AT | 82 (0.327) | 125 (0.341) | 0.9 [0.7–1.3] | ||
| (X5,X6): AC | 73 (0.294) | 113 (0.309) | 0.9 [0.6–1.3] | ||
| (X5,X6): GT | 2 (0.009) | 1 (0.002) | 2.9 [0.3–32.6] | ||
A) Single-site analyses showed statistically significant BrC risk-reducing effect for variant rs3212057 (10343A) allele at allelic level, only. B) Results of haplotype analysis for Rad51 SNPs revealed statistically significant BrC risk reducing effect for 4541A/9685A (X1/X3; rs1799794/rs861530) Xrcc3 haplotype. Data presented as absolute (relative) frequencies of individual genotypes observed in BrC cases and controls; NA—not applicable.
a Genetic model employed in order to analyze the association between genotype distribution and cancer: A—additive genetic model; D—dominant genetic model; R—recessive genetic model; All—direct comparison of the frequency of alleles; W/R—comparison of wild-type and rare homozygous genotypes.
b OR values adjusted for age and smoking status, together with 95% CI determined by logistic regression.
c Significant differences in genotype distributions between the BrC and control groups were sought for by two-sided exact mid-P test.
d p < 0.05.
Fig 2The map of LD among analyzed SNPs in (A) Rad51 and (B) Xrcc3.
In the case of Xrcc3, rs45603942 (X2) and rs28903081 (X7) are not shown as these SNPs were not considered in haplotype analysis due to lack of any genetic variability observed in these loci. For a given pair of SNP, the values in the map are the values of normalized measure of allelic association (|D’|, provided as percentages) and the color scheme represents the corresponding confidence bounds: dark gray—strong evidence of LD; light-gray—inconclusive; white—strong evidence of recombination [48]. Identified LD blocks are indicated by solid lines encompassing respective SNPs.
Associations of Rad51/Xrcc3 SNP combinations with breast cancer.
| Rates | OR [95% CI] | |||
|---|---|---|---|---|
| (RB) rs5030789 (-4601GG/-4601GA) | (RC) rs1801320 (135CC) | (X4) rs3212057 (10343GG) | ||
| x | x | 8/16 vs. 0/36 | 7.3 [2.1–25.8] | |
| x | x | 109/16 vs. 130/46 | 2.2 [1.2–4.3] | |
| x | x | 7/0 vs. 1/6 | 184.9 [3.1–11172.8] | |
| x | x | x | 7/16 vs. 0/38 | 8.1 [2.3–28.9] |
Only those SNPs for which significant associations with BrC risk were found in previous single-site analyses were included. Data presented as number of cases vs. controls carrying the combination of high-risk/low-risk genotypes in cross-marked SNP loci. Respective high-risk genotypes are provided in the header of the table.
a Subjects carrying the genotype combinations containing 1 high-risk genotype used as reference group.
b p < 0.005
c p < 0.05
Ranking of all analyzed predictors (SNPs, covariates) according to their VIMP-based association with BrC.
| Predictor | w/out covariates | with covariates | ||||
|---|---|---|---|---|---|---|
| VIMP | pVIMP | Rank | VIMP | pVIMP | Rank | |
| RB | 0.0181 | 1 | 0.0108 | 1 | ||
| RD | 0.0093 | 2 | 0.0070 | 2 | ||
| RC | 0.0075 | 3 | 0.0063 | 4 | ||
| X3 | 0.0071 | 0.0742 | 4 | 0.0054 | 0.0859 | 5 |
| RG | 0.0059 | 0.1068 | 5 | 0.0067 | 0.0603 | 3 |
| X1 | 0.0052 | 0.1310 | 6 | 0.0041 | 0.1237 | 7 |
| RcH | 0.0038 | 0.1282 | 7 | 0.0019 | 0.1660 | 11 |
| RA | 0.0035 | 0.1428 | 8 | 0.0021 | 0.1981 | 10 |
| RE | 0.0029 | 0.1671 | 9 | 0.0003 | 0.3995 | 14 |
| RcB | 0.0029 | 0.1418 | 10 | 0.0036 | 0.0890 | 8 |
| X4 | 0.0017 | 11 | 0.0010 | 0.0720 | 12 | |
| RcE | 0.0009 | 0.3006 | 12 | -0.0020 | 0.8351 | 18 |
| RcD | 0.0007 | 0.3196 | 13 | -0.0002 | 0.4791 | 16 |
| X5 | 0.0002 | 0.4097 | 14 | -0.0018 | 0.6128 | 17 |
| X6 | -0.0001 | 0.4373 | 15 | 0.0045 | 0.1252 | 6 |
| RF | -0.0004 | 0.4488 | 16 | 0.0007 | 0.3068 | 13 |
| RcG | -0.0009 | 0.5739 | 17 | 0.0002 | 0.3839 | 15 |
| RcC | -0.0042 | 0.8095 | 18 | -0.0046 | 0.9041 | 21 |
| RcA | -0.0050 | 0.8714 | 19 | -0.0024 | 0.7049 | 19 |
| Age | - | - | - | 0.0031 | 0.1407 | 9 |
| Smoking | - | - | - | -0.0042 | 0.9355 | 20 |
Table presents the variable importance (VIMP) value for each predictor, together with respective significance level (pVIMP) obtained by permutation test using 10,000 permutations and corresponding VIMP-based ranking, as obtained in RF-based analysis with/without age and smoking status as covariates. The higher the VIMP value the stronger the association of a given predictor with BrC. PVIMP values in bold indicate statistically significant association.
Fig 3Results of bootstrap analysis of the VIMP-based ranking of predictors.
Bootstrap estimate of distribution of the VIMP-based ranks of analyzed predictors obtained either (A) without or (B) with subjects’ age (AgeB) and smoking status (SmoB) as covariates. Individual SNPs and/or covariates are ordered according to their weighted average ranks as revealed by the resampling with replacement procedure inherent to the RF methodology following 10,000 runs. Color of each rectangle indicate the total number of bootstrap samples in which a given rank in the VIMP-based ranking was recorded for respective predictor. The more yellowish the color, the higher the total number of samples with respective rank. (Numerical data are provided in S3 Table).
Epistatic SNP interactions with the strongest association with BrC.: Results of MB-MDR and pRF analysis.
| BrC | Controls | Risk group | p | |||||
| AA | CG | 9 (0.070) | 1 (0.006) | High | 0.0706 | |||
| AT | CC | 5 (0.039) | 1 (0.006) | High | 0.0153 | |||
| Cases | Controls | WaldH | ORH | pW | p10000 | |||
| High | 14 (0.109) | 2 (0.012) | ||||||
| Low/Neutr. | 114 (0.891) | 180 (0.988) | 9.0074 | 11.3 [2.5–49.5] | 0.0027 | 0.0001 | ||
| 21,51% | 19,70% | 1,81% | ||||||
| Cases | Controls | Risk group | p | |||||
| AA | AA | CG | 6 (0.047) | 1 (0.006) | High | 0.0865 | ||
| AT | GA | CC | 5 (0.039) | 1 (0.006) | High | 0.0127 | ||
| Cases | Controls | WaldH | ORH | pW | p10000 | |||
| High | 11 (0.086) | 2 (0.011) | ||||||
| Low/Neutr. | 117 (0.914) | 179 (0.989) | 9.0305 | 8.4 [1.8–38.6] | 0.0027 | << 0.0001 | ||
| 25,24% | 21,68% | 3,57% |
Results of analysis of associations between (A) 2-way or (B) 3-way epistatic interactions and BrC are summarized. Absolute (relative) counts of BrC and control subjects are provided for each individual and multi-locus genotype. MB-MDR: p indicates the level of significance for association between individual genotypes, while BrC, WaldH, ORH, pW and p10000 stand for the value of Wald statistic, resulting odds ratio, raw and corrected level of significance for the 2- or 3-way multi-locus genotypes with the strongest association with BrC, respectively. Individuals with given multi-locus genotype of interest were compared against the rest of the individuals, which are considered as reference group. High—high risk of BrC; Low—low risk of BrC; Neutr.–non-informative individuals. pRF: and denote the classification errors for testing dataset with and without the interaction among analyzed predictors, while ΔE denotes the differential error for the analyzed interaction.
Summary of inferred effects of individual SNPs on BrC risk.
| #rs | code | gene | genotype/allele | effect on BrC risk |
|---|---|---|---|---|
| rs5030789 | RB | -4601AA | protective | |
| rs1801321 | RC | 135CC | detrimental | |
| rs1799794 | X1 | 4541A | protective | |
| rs861530 | X3 | 9685A | protective | |
| rs3212057 | X4 | 10343GA | protective | |
| rs2619679 | RA | -4719AA | detrimental | |
| -4719AT | detrimental | |||
| rs5030789 | RB | -4601AA | detrimental | |
| -4601GA | detrimental | |||
| rs2928140 | RG | 2972CG | detrimental | |
| 2972CC | detrimental | |||
| rs1801321 | RD | - | unclear | |
| rs2619680 | RE | - | - | |
| rs2619681 | RF | - | - | |
| rs45603942 | X2 | - | - | |
| rs1799796 | X5 | - | - | |
| rs861539 | X6 | - | - | |
| rs28903081 | X7 | - | - |
Summary of effects of individual SNPs on BrC risk inferred based on statistically significant outcomes of analyses conducted in the study. Main or epistatic effects were inferred if statistically significant outcomes were revealed by conventional (single-site, SNP combinations, haplotype) or ML-based analyses, respectively. Of note is the rs5030789 (RB) SNP in Rad51, for which protective main effect as well as detrimental epistatic effect was predicted.
a as part of the risk-reducing X1/X3 haplotype;
b as part of the risk-increasing -4719AA/-4601AA/2927CG three-locus genotype;
c as part of the risk-increasing -4719AT/-4601GA/2927CC three-locus genotype;
d pure epistatic effect with no observable main effects;
e statistically significant 2nd place in the VIMP-based ranking with no significant outcomes in any of conventional or ML-based methods.