Literature DB >> 25956309

Genome-wide association study of susceptibility loci for breast cancer in Sardinian population.

Grazia Palomba1, Angela Loi2, Eleonora Porcu3, Antonio Cossu4, Ilenia Zara5, Mario Budroni6, Mariano Dei7, Sandra Lai8, Antonella Mulas9, Nina Olmeo10, Maria Teresa Ionta11, Francesco Atzori12, Gianmauro Cuccuru13, Maristella Pitzalis14, Magdalena Zoledziewska15, Nazario Olla16, Mario Lovicu17, Marina Pisano18, Gonçalo R Abecasis19, Manuela Uda20, Francesco Tanda21, Kyriaki Michailidou22, Douglas F Easton23,24, Stephen J Chanock25, Robert N Hoover26, David J Hunter27, David Schlessinger28, Serena Sanna29, Laura Crisponi30, Giuseppe Palmieri31,32.   

Abstract

BACKGROUND: Despite progress in identifying genes associated with breast cancer, many more risk loci exist. Genome-wide association analyses in genetically-homogeneous populations, such as that of Sardinia (Italy), could represent an additional approach to detect low penetrance alleles.
METHODS: We performed a genome-wide association study comparing 1431 Sardinian patients with non-familial, BRCA1/2-mutation-negative breast cancer to 2171 healthy Sardinian blood donors. DNA was genotyped using GeneChip Human Mapping 500 K Arrays or Genome-Wide Human SNP Arrays 6.0. To increase genomic coverage, genotypes of additional SNPs were imputed using data from HapMap Phase II. After quality control filtering of genotype data, 1367 cases (9 men) and 1658 controls (1156 men) were analyzed on a total of 2,067,645 SNPs.
RESULTS: Overall, 33 genomic regions (67 candidate SNPs) were associated with breast cancer risk at the p <  0(-6) level. Twenty of these regions contained defined genes, including one already associated with breast cancer risk: TOX3. With a lower threshold for preliminary significance to p < 10(-5), we identified 11 additional SNPs in FGFR2, a well-established breast cancer-associated gene. Ten candidate SNPs were selected, excluding those already associated with breast cancer, for technical validation as well as replication in 1668 samples from the same population. Only SNP rs345299, located in intron 1 of VAV3, remained suggestively associated (p-value, 1.16 x 10(-5)), but it did not associate with breast cancer risk in pooled data from two large, mixed-population cohorts.
CONCLUSIONS: This study indicated the role of TOX3 and FGFR2 as breast cancer susceptibility genes in BRCA1/2-wild-type breast cancer patients from Sardinian population.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25956309      PMCID: PMC4434540          DOI: 10.1186/s12885-015-1392-9

Source DB:  PubMed          Journal:  BMC Cancer        ISSN: 1471-2407            Impact factor:   4.430


Background

Breast cancer is the most common malignancy in women in western countries, currently accounting for one-third of all female cancer cases [1]. A family history of breast cancer is the principal risk factor for developing the disease [2]. Linkage studies in families have identified several high-penetrance mutations in BRCA1, BRCA2 and other genes as causative of disease in 5 %-10 % of cases [3, 4]. Additionally, a combined approach of family-based and case–control studies revealed that mutations in several genes encoding proteins involved in DNA repair and functionally interacting with the BRCA1/2 proteins are associated with a moderate risk of breast cancer, contributing to another 10 %–15 % of cases [5]. Genome-wide association (GWA) studies have so far identified at least 72 common lower penetrance alleles associated with breast cancer [6, 7]. A large fraction of these susceptibility alleles are associated with increased risk in persons with a family history of breast cancer despite the absence of mutations in BRCA1 or BRCA2, accounting for about 30 % of the familial risk of the disease [7]. Other alleles, such as those that map to the FGFR2 and TOX3 genes, act as risk modifiers in BRCA1/2-mutation carriers [3, 8, 9]. Results from a meta-analysis of GWA studies suggest that a substantial fraction of the residual familial aggregation cases can be explained by other common single nucleotide polymorphisms (SNPs) not yet identified [7]. In particular, the authors hypothesized that more than 1000 additional loci may be involved in breast cancer susceptibility. Because of their low penetrance and the small fraction of familial cases, it is unlikely that other susceptibility genes will be identified through additional family-based studies. A promising approach could be to conduct new studies in non-familial cases, such as case–control in populations with less genetic heterogeneity. One population with notable genetic inter-relatedness is that of the Mediterranean island of Sardinia (Italy). In Sardinia, breast cancer represents the principal death-causing malignancy among women, with an incidence similar to that observed in other western populations [10]. The Sardinian population (1.67 million in 2010, according to the Italian National Institute of Statistics) is isolated, with considerable inter-relatedness and founder effects for several genetic diseases (e.g. thalassemia) [11, 12]. The relatively homogeneous genetic make-up of the Sardinian population offered an opportunity to search for genetic determinants of breast cancer, requiring fewer cases to establish association with a susceptibility locus than do mixed populations. We have conducted such a case–control GWA study for breast cancer risk in our collection of Sardinian breast cancer patients who are negative for BRCA1 or BRCA2 mutations.

Methods

Breast cancer cases and controls

From January 1998 to December 2006, we recruited 1698 patients with breast cancer from the four main oncology units in the Region of Sardinia (Azienda Ospedaliero Universitaria of Sassari, Azienda Sanitaria Locale of Sassari, Businco Oncologic Institute, and University of Cagliari). This cohort includes 1085 patients recruited in 1998-2003 [13]. Inclusion criteria were: (i) a histopathological diagnosis of any type of breast cancer, and (ii) self-reported Sardinian origin, defined as both biological parents and all four biological grandparents born on the island. No exclusion criteria were applied; in particular, patients were not selected for age, gender, grade or stage of cancer, or family history of any cancer. Family history for cancer was evaluated through specific questionnaires during the follow-up visits at the different departments of the participating institutions. Cases were classified as non-familial when less than three (0, 1, or 2) affected members with breast or ovarian cancer were present in first- and second-degree relatives. A sample of peripheral blood was obtained for DNA extraction. Pathological TNM (tumor, node, metastasis) classification and immuno-histochemistry profile for estrogen receptors (ER), progesterone receptors (PR) and HER2 (receptor tyrosine-protein kinase erbB-2) were also obtained, when available. Controls consisted of 2171 healthy persons recruited at community blood donation centers across the island and at the transfusion center of Azienda Ospedaliera Brotzu in Cagliari. Controls were included if at least three out of four grandparents were born in Sardinia and if they reported no type of cancer for their first-degree relatives. Overall, 1503 (69.2 %) were males, in line with the male preponderance among Italian blood donors [CENSIS at http://www.censis.it/] (Fig. 1). Since the present study was aimed at detecting low penetrance alleles at autosomic level, no significant differences were expected by the use of control males. In fact, allelic transmission is identical in males and females as the entire population should comply with the Hardy Weinberg law. As some randomness is expected, we specifically assessed our best SNPs for the impact of males in the controls set, excluding bias. This was added to a genome-wide check of allele frequency distribution between genders. Of note, the same approach of including males among controls has also been adopted by previous studies [14-16], all of them reporting a genome-wide check of differences between males and females, with no significant diversity in alleles’ distribution. In such latter studies, percentages of male controls were reported to up to 78 %, which is thus consistent with the frequency (69 %) reported in our series.
Fig. 1

Flow chart of the selection of cases and controls throughout the study

Flow chart of the selection of cases and controls throughout the study Both cases and controls gave written informed consent for their biological samples and clinical data to be used for research purposes. The study protocol was reviewed and approved by the Ethics Committee of the ASL8 Cagliari and the Bioethics Committee of the Sassari Healthcare District.

BRCA mutation analysis

From our cohort of 1698 breast cancer patients, we selected 1452 non-familial cases who were then tested for BRCA1 and BRCA2 germline mutations. The entire coding sequences and intron-exon boundaries of the BRCA1 and BRCA2 genes were screened by denaturing high-performance liquid chromatography (DHPLC) followed by direct sequencing on an automated DNA sequencer (ABI Prism 3100 Genetic Analyzer, Applied Biosystems, Foster City, USA). Protocols for PCR-based amplification and mutation analysis of exons and exon-intron boundaries were as previously reported [17]. Familial cases (N = 246) were excluded on the basis of the presence of at least three family members (the proband and at least two other first- or second-degree relatives) having with either breast or ovarian cancer. Overall, 21 cases (1.4 %) had a mutation in BRCA1 or BRCA2 and were excluded, leaving 1431 cases for the GWA study. The median age of the 1431 cases at diagnosis was 60 (range, 23–98); 11 cases (0.8 %) were male (Fig. 1).

Genotyping and quality controls

DNA was isolated from peripheral blood and stored at -80 °C. Genotyping was performed at the Institute of Genetic and Biomedical Research using Affymetrix technology according to the manufacturer’s protocols. At first, 740 cases and 412 controls were genotyped using the GeneChip Human Mapping 500 K Array Set (analyzed from 2005 to 2006). The rest, 691 cases and 1759 controls, were assayed in the same center with the newer Genome-Wide Human SNP Array 6.0 (analyzed from 2007 to 2009) (Fig. 1). Genotypes for individuals assessed with the 500 K Array and the 6.0 array set were called respectively with the BRLMM algorithm and with Birdseed v2 [18]. The latter algorithm was applied to a unique cluster containing all cases and controls, given its sensitivity to plate bias. A methodological limitation of the current study came from the use of two different microarray genotyping platforms. This choice was due to the adoption of the larger Array 6.0 (permitting the testing of 900 K SNPs) when it became available, but it was not feasible to retest the initial 740 cases and 412 controls with the larger panel. As a result, we analyzed directly only those SNPs that were represented on both platforms (240742 SNPs). Individuals with a SNP call rate <90 % were excluded from the analysis, as were individuals whose recorded gender was different from that predicted by the genetic data. SNPs showing significant (p < 1x10−6) deviation from Hardy-Weinberg equilibrium in controls, minor allele frequency (MAF) <5 % or sample call rate <95 %, were filtered out. Moreover, for SNPs tested on both platforms, those with allele frequencies differing by >10 % in controls were excluded. We then left out all SNPs that were not represented on both platforms. Finally, to ensure that the dataset contained only unrelated persons, we used RELPAIR software [19] to estimate genotype sharing between all possible pairs of individuals based on a subset of 10000 quality-checked SNPs. When two persons were found to be first-degree relatives, we excluded the one with lower call rate, except when the pair consisted of one case and one control, in which we excluded the control. To avoid bias introduced by population stratification, we performed principal component analysis (PCA) using Eigensoft 3.0 software [20, 21]. Individuals flagged as outliers in the PCA analyses (>6 standard deviations from the mean) were excluded. The principal component eigenvectors for the remaining individuals were recalculated and the axes were then used as covariates to calculate adjusted p values for association with breast cancer; the genomic control parameter was 1.149 (Additional file 1: Figure S1). For all SNPs with adjusted p < 10−6, we visually inspected the discrimination plots and kept only those with good plots (three distinct data clusters). A PCA supporting the overall homogeneity of our Sardinian sample in respect to general Europeans is showed in Additional file 2: Figure S2.

Imputation and identification of candidate SNPs

To improve coverage of the genome, we increased the set of SNPs tested for association through imputation. MACH software (version 1.0) was used to impute non-genotyped markers based on the phased haplotypes from HapMap Phase II (CEU, release 22) and the set of 270742 quality-controlled markers represented on both platforms. Imputation increased the SNP coverage to a total of 2067645 markers (Additional file 3: Table S1), though without direct scoring of SNPs that might have had higher p-values. After imputation, we considered only markers with MAF > 1 % and imputation quality (RSQR) >0.3 (RSQR infers r2 between true and estimated allele counts [22]). These markers were assessed with a likelihood ratio test to identify those with additive effects on modifying the risk of breast cancer; this test was implemented in mach2dat, in the MACH package, using allele dosages and the estimated eigenvectors as covariates in the model. We selected SNPs with p < 1x10−6 and examined the discrimination plots of all the other genotyped SNPs in the surrounding 200 kb genomic region. SNPs residing in genomic regions where all other genotyped SNPs had good discrimination plots were considered candidate markers.

Validation of candidate markers

Validation of the SNPs classified as candidate markers, was done by regenotyping them with custom TaqMan SNP genotyping assays (Life Technologies). This validation step was done using DNA from a subset of the Sardinian cases (1362) and controls (1514) in this study who passed quality control filtering at the sample level in microarray genotyping (SNP call rate >90 % and no error in gender determination). We attempted to replicate promising signals in a wider set of DNAs consisting in additional 201 Sardinian cases and 1467 controls (630 females and 1038 males) collected after December 2006 within the same study protocol and with the same features as the original set. Replication analyses were performed using data from a combined analyses of eleven GWAS in other populations of European ancestry, comprising 16,195 cases and 18,980 controls, and data from 45,290 cases and 41,880 controls of European ancestry from 41 studies collaborating in the Breast Cancer Association Consortium (BCAC), which were genotyped with a custom array (iCOGS) ([7]; http://gameon.dfci.harvard.edu/gameon). For all studies except BCFR, BPC3 and TNBCC, genotypes were estimated by imputation, using IMPUTE2 [23] and the 1000 genomes March 2012 release as a reference panel, after prephasing with SHAPEIT [24]. Per-allele odds ratios (ORs) and standard errors for individual studies were generated using SNPTEST [25]. BCFR, BPC3 and TNBCC performed imputation using MACH and Minimac. Estimated ORs for the combined analysis were generated using a fixed-effect meta-analysis, using METAL [22]. For the combined analysis of the GWAS and iCOGS, we reanalyzed the iCOGS data to remove samples also included in a GWAS, to generate independent datasets.

Genotype associations with clinical data

Chi-square and Fisher’s exact tests were used to evaluate possible associations between tumor phenotype (ER, PR, HER2, pT, pN, M) and the genotypes of candidate SNPs. Statistical tests were performed using SPSS statistical software, version 15.0. All tests were two-tailed and a p < 0.05 indicated significance.

Evaluation of GWAS-identified breast cancer risk variants in our Sardinian cohort

GWA studies have so far identified at least 72 common lower penetrance alleles associated with a mild increase in the risk of breast cancer [6, 7]. We evaluated index SNPs (when not available, proxies) in all 72 breast cancer susceptibility loci identified to date in our subset cohort (1367 cases and 1658 controls).

Results

To search for new loci associated with breast cancer risk, we genotyped germline DNA of 1431 Sardinian patients with sporadic breast cancer (BRCA-mutation-negative) and a set of 2171 healthy blood donor controls. After quality control filtering of genotype data at the sample level, 1367 cases and 1658 controls were analyzed. For 921 cases (67 %), we had data regarding TNM classification and receptor status (Table 1). Among the 775 patients tested for all three receptors, the predominant molecular subtype was ER+/PR+/HER2-, found in 70.3 % of cases. Moreover, the percentage of triple-negative (ER-/PR-/HER2-) cases was low, 7.2 %.
Table 1

Tumor characteristics at the time of diagnosis, for 921 patients (all women) with sporadic, BRCA-mutation-negative breast cancer

CharacteristicPatients, n (%)
Pathological TNM classification
 pT1-2830 (90.1)
 pT3-491 (9.9)
 pN0533 (57.9)
 pN1388 (42.1)
 M0751 (81.5)
 M143 (4.7)
 Mx127 (13.8)
Receptor status
 ER-234 (25.4)
 ER+687 (74.6)
 PR-260 (28.2)
 PR+661 (71.8)
 HER2-601 (65.3)
 HER2+174 (18.9)
 Not tested146 (15.8)
Molecular subtypea
 ER+/PR+/HER2-545 (70.3)
 ER+/PR+/HER2+142 (18.3)
 ER-/PR-/HER2+32 (4.1)
 Triple negative (ER-/PR-/HER2-)56 (7.2)

ER, estrogen receptor; PR, progesterone receptor.

a For the 775 cases tested for all three receptor markers.

Tumor characteristics at the time of diagnosis, for 921 patients (all women) with sporadic, BRCA-mutation-negative breast cancer ER, estrogen receptor; PR, progesterone receptor. a For the 775 cases tested for all three receptor markers. The genome wide analysis conducted on a set of 2067645 markers (Methods and Additional file 4: Table S6), revealed 33 genomic regions on 20 chromosomes that were suggestively associated with breast cancer risk at p < 1x10−6 (Fig. 2). In particular, 7 regions reached the genome wide significance threshold (p = 5x10−8; Table 2). The 33 suggestive genomic regions contained a total of 67 SNPs with p < 10−6 (Table 2, Additional file 3: Table S2).
Fig. 2

Manhattan plot for this genome-wide association study of sporadic breast cancer in Sardinian population. Data shown are the negative logarithm of the association p-value for each single nucleotide polymorphism. The horizontal line indicates the significance cut-off at p < 10−6

Table 2

Most significant SNPs in 33 genomic regions associating at the p < 10−6 level with sporadic, BRCA-mutation-negative breast cancer cases in the Sardinian population

Chromosomal regionSNPPositionAllelesFrequencyRSQROR (CI 95%) p Gene
CasesControls
1p35.1rs942597734238042C/T0.0820.0590.3792.597 (1.817 - 3.709)9.51E-08 CSMD2
1p34.1rs247761844271050C/G0.0770.0580.4162.312 (1.647 -3.245)9.13E-07 -
1p13.3rs345299108249656A/C0.4650.5340.9570.755 (0.676 - 0.845)9.40E-07 VAV3
1q32.1rs6661074203605742A/G0.4830.4250.7311.408 (1.235 - 1.605)3.10E-07 LEMD1
2p25.3rs133937913239073C/T0.0740.0390.9891.862 (1.469 - 2.361)1.99E-07 TSSC1
2p21rs1703295745134208A/G0.7010.7470.4620.597 (0.500 - 0.712)9.17E-09 SIX2/SIX3
2q37.1rs838436232291757A/G0.9080.9260.3080.415 (0.291 - 0.592)8.45E-07 -
3p14.1rs981081667158548C/G0.9350.9550.4040.368 (0.251 - 0.541)1.86E-07 -
5q35.2rs17076993173564400C/T0.0710.0490.4472.507 (1.768 - 3.553)1.29E-07 -
6q23.1rs3777428130495535C/T0.9260.9520.7200.509 (0.387 - 0.668)8.48E-07 L3MBTL3
7p14.2rs696800236531229C/T0.9380.9580.5330.431 (0.308 - 0.603)6.32E-07 AOAH
8q24.3rs11785598144537061C/T0.9480.9290.3283.420 (2.223 - 5.266)6.40E-09 RHPN1
9q22.32rs1051224397815280A/G0.0930.0560.9131.754 (1.411 - 2.181)2.78E-07 -
9q31.2rs10979327110160516G/T0.1120.0600.6323.851 (2.955 - 5.016)2.92E-25 -
9q33.3rs1928482125472269C/T0.3610.3030.9781.362 (1.206 - 1.538)4.87E-07 DENND1A
9q34.13rs1633769134869098C/G0.7690.8030.3930.558 (0.452 - 0.690)6.07E-08 -
9q34.3rs3811159136828478A/G0.6080.6610.6830.706 (0.616 - 0.810)6.07E-07 COL5A1
10q21.1rs190397453549083A/G0.0470.0770.8650.527 (0.409 - 0.680)4.38E-07 PRKG1
11p12rs794738738902742A/G0.0500.0690.3330.347 (0.230 - 0.523)2.45E-07 -
12q21.1rs1117874870021794A/G0.1000.0780.8971.673 (1.366 - 2.046)5.15E-07 TSPAN8
12q23.3rs7488485105789959C/T0.0970.0730.4042.348 (1.722 - 3.198)3.17E-08 RIC8B
13q21.1rs956952856234110C/T0.0740.0510.4922.481 (1.782 - 3.456)5.27E-08 -
14q23.1rs1709737359896563C/T0.0760.0580.7701.861 (1.448 - 2.391)9.43E-07 -
15q12rs1765137525763160C/T0.0840.0560.9341.761 (1.417 - 2.189)2.93E-07 OCA2
16q12.1rs219309451123955G/T0.5570.4780.9821.337 (1.198 - 1.491)1.70E-07 TOX3
16q21rs804551360720397A/T0.0590.0880.6060.430 (0.326 - 0.569)9.49E-10 -
18p11.22rs1743681110080771C/T0.0840.0580.6442.002 (1.536 - 2.608)2.20E-07 -
19q13.31rs1772553149018018G/T0.1400.0990.9301.560 (1.311 - 1.858)5.82E-07 -
20p12.2rs603994210376146A/G0.1000.0680.6232.005 (1.565 - 2.565)2.13E-08 C20orf94
21q22.11rs283342431774071A/G0.2930.2390.9851.365 (1.206 - 1.544)8.42E-07 TIAM1
21q22.11rs96395033438577C/T0.9300.8830.9861.742 (1.432 - 2.119)1.24E-08 IFNAR2
21q22.13rs85798938042001C/G0.1540.1110.8961.537 (1.301 - 1.816)3.72E-07 KCNJ6
22q12.3rs600035135340813A/G0.0930.0690.4452.100 (1.556 - 2.835)7.44E-07 CACNG2

The table reports, for each SNP, the genomic cytoband, the rs name, the position in build36, the corresponding alleles, the frequency in cases and controls, the imputation quality, the OR and its confidence interval, the pvalue and the most candidate gene within 200 kb. Additional SNPs in the same genomic regions are listed in Additional file 3: Table S2.

Manhattan plot for this genome-wide association study of sporadic breast cancer in Sardinian population. Data shown are the negative logarithm of the association p-value for each single nucleotide polymorphism. The horizontal line indicates the significance cut-off at p < 10−6 Most significant SNPs in 33 genomic regions associating at the p < 10−6 level with sporadic, BRCA-mutation-negative breast cancer cases in the Sardinian population The table reports, for each SNP, the genomic cytoband, the rs name, the position in build36, the corresponding alleles, the frequency in cases and controls, the imputation quality, the OR and its confidence interval, the pvalue and the most candidate gene within 200 kb. Additional SNPs in the same genomic regions are listed in Additional file 3: Table S2. One of the identified regions, on chromosome 16q12.1, includes TOX3, a gene already associated with breast cancer risk. For this gene, 19 SNPs had p < 10−6 (Additional file 3: Table S2), supporting a significant susceptibility role for TOX3 in the Sardinian as well as other populations. For a second gene already associated with breast cancer, FGFR2 on chromosome 10q26.13, no SNP had p < 10−6 but 11 were associated at a less restrictive p < 10−5 (Additional file 3: Table S2). No additional SNP with p < 1x10−5 lay near a gene already associated with breast cancer. Finally, the four SNPs tested in BRCA1 and those in BRCA2 all had p > 0.05, implying no association with breast cancer in this series. This negative result is consistent with our study’s explicit exclusion of patients with BRCA mutations in order to focus on other, low-penetrance loci. We then attempted to validate the results for ten candidate SNPs, selected among those in genes not already associated with breast cancer, by regenotyping them by TaqMan Assays in a subset of the Sardinian cases and controls (1362 cases and 1514 controls) (Table 3). These markers included nine SNPs in a known gene in the surrounding 200 kb (rs345299, rs6661074, rs13393791, rs17032957, rs1928482, rs1903974, rs11178748, rs963950, rs857989) and one additional SNP (rs10979327) in a desert region but with p = 2.92x10−25. Of these SNPs, only one, rs345299 reached an association of p = 5.38x10−5, while it reached p = 9.40x10−7 in the original analysis (Tables 2 and 3). The difference between the two p-values could be partially explained by the call rate, equal to 97 % when using TaqMan. The concordance between genotypes and dosages is consistent with the potential of the signal as a candidate (Table 3). Furthermore, extending the validation to an additional 201 cases and 1467 controls (leading to a total set of 1563 BC cases/2981controls for analyses) the p-value reached p = 1.16x10−5, increasing the robustness of the signal. SNP rs345299 resides in intron 1 of VAV3, an oncogene that encodes a guanine nucleotide exchange factor. Both microarray and TaqMan assays suggested that the C allele at this position is associated with disease risk.
Table 3

Association results for the 10 candidate SNPs

A
SNPcorrFREQ1 casesFREQ1_CTNp-valuen.a. p-value*
rs8579890.8110.1310.10628663.99E-034.02E-03
rs111787480.8240.0830.07928480.2890.560
rs19039740.9400.0430.07028375.94E-061.34E-05
rs19284820.8950.3510.31327711.09E-032.24E-03
rs109793270.6970.0780.06928400.1780.220
rs170329570.6510.7490.74828160.8790.910
rs133937910.7930.0510.04128470.0930.068
rs66610740.8060.4740.43428553.62E-032.23E-03
rs3452990.8850.4770.53027895.38E-059.05E-05
rs176339860.8090.8900.88228220.5530.331
B
MARKERALLELESFREQ1_casesFREQ1_CTNn.a. p-value*
rs857989C/G0.1300.10644698.02E-04
rs11178748A/G0.0830.08844320.499
rs1903974A/G0.0440.06344402.40E-04
rs1928482C/T0.3500.31943443.42E-03
rs10979327G/T0.0780.07444560.506
rs17032957A/G0.7510.75044140.865
rs13393791C/T0.0500.04544410.365
rs6661074A/G0.4700.44244160.010
rs345299A/C0.4810.53143431.16E-05
rs17633986C/T0.8890.87744510.100

A. Results on Taqman genotypes for 1362 breast cancer cases and 1514 controls included in the GWAS.

B. Results on Taqman genotypes for all 1563 breast cancer cases and 2981 controls.

*n.a. p-value: not adjusted p-value, not corrected for population stratification.

Association results for the 10 candidate SNPs A. Results on Taqman genotypes for 1362 breast cancer cases and 1514 controls included in the GWAS. B. Results on Taqman genotypes for all 1563 breast cancer cases and 2981 controls. *n.a. p-value: not adjusted p-value, not corrected for population stratification. To assess further the significance of this result, we tried to replicate the association of rs345299 with breast cancer risk in a combined analysis of eleven GWAS, comprising 16,195 cases and 18,980 controls, together with data from a custom array (iCOGS) genotyped on 45,290 cases and 41,880 controls of European origin. rs345299 was present on only one of the GWAS arrays but was well imputed in the other GWAS (r2 = 0.94 to 0.99) and on the iCOGS array with r2 = 0.63. No evidence of association was found in either analyses, nor was there any evidence of association in the iCOGS when analyses were restricted to ER-positive or ER-negative disease (Additional file 3: Table S3). Unfortunately, replication failed suggesting that further analyses are necessary to distinguish whether it is a false positive or an effect specific for the Sardinian population. Testing this hypothesis requires a larger set of patients. Finally, in a post-hoc analysis, we took advantage of the availability of TNM classification and receptor status data for 921 cases (Table 1) to look for associations between six cancer phenotypes and the cases’ genotypes at 34 candidate SNPs (those in Table 2 plus rs11200014 for FGFR2). In all cases, the statistical tests of association gave p > 0.05, suggesting that the candidate markers for breast cancer risk are not associated with clinical or molecular subtypes. We also evaluated the association of the 72 breast cancer susceptibility variants identified so far in population meta-analysis in our Sardinian cohort. We were able to detect only rs2981579 (FGFR2) and rs3803662 (TOX3) with p-values of 3.5x10−6 and 5.18x10−6, respectively (Additional file 3: Table S4). However, if the effect sizes in Sardinian are similar to those reported in other Europeans our sample size is likely underpowered to find associations in the other known genes.

Discussion

In our Sardinian cohort of breast cancer patients, the predominant molecular subtype was ER+/PR+/HER2- and the percentage of triple-negative cases (7.2 % among those tested) was low. Rates of triple-negative breast cancer have been variably reported in the range of 10 %–20 % in different studies [26]. The somewhat lower rate reported here is in line with findings from two other recent Italian studies: 8.7 % among 2347 patients in Modena [27] and 4.8 % among 2112 patients in Trentino [28]. Our study includes males in the control population, who were recruited for parallel projects. While we have demonstrated that their inclusion in the GWAS does not introduce a bias, we acknowledge this as one of the limitations of the study, along with the low number of cases and controls compared to other reported GWAS in Europeans. Nevertheless, we proceeded with GWAS considering that, in addition to its inter-relatedness, the Sardinian population is relatively stable in towns, with a largely shared lifestyle and diet across the island; thus, both epidemiological and genetic factors are less heterogeneous than in cosmopolitan European populations, with an expected increase in the power to detect associations. Furthermore, GWAS has been successfully done in this population to identify disease-associated alleles in another instance with a limited number of cases/controls [29]. Finally, with our sample size we are completely underpowered to detect low penetrance alleles (OR ~1.05) as those recently described by others. We are instead powered to find alleles with moderate effect size (>1.3) that are poorly tagged in other populations by HapMap SNPs but well captured in Sardinians. We used SNP genotyping microarrays and identified 33 genomic regions (67 SNPs) associated (at the p < 10−6 level) with the risk of sporadic, BRCA-mutation-negative breast cancer. Of these genomic regions, 20 contained known genes. However, only one of these genes, TOX3, has already been associated with breast cancer in other population-based studies. When we lowered the criterion for significance to p < 10−5, we also identified 11 SNPs in FGFR2, another known breast cancer-associated gene. The attempt to validate 10 of the 67 SNPs selected as candidate markers by singleplex genotyping for technical validation in the original GWA and then in additional samples one, rs345299, that reached a p-value of 10−5 (p = 1.16x10−5). The robustness of the signal is supported by the high concordance and by the increase in significance when additional samples are included. This marker, in intron 1 of VAV3, was not confirmed to be associated with breast cancer in two other mixed-population cohorts, so that further studies are necessary to clarify whether this marker nevertheless represents a susceptibility locus specific to Sardinians. However, involvement of VAV3 in carcinogenesis is further supported by other recent studies. VAV3 is a well-characterized guanine nucleotide exchange factor that, upon phosphorylation by receptor tyrosine kinases, participates in signal transduction pathways, resulting in changes in gene expression, cell cycle and cytoskeleton rearrangement [30]. Its activation is thought to be involved in both prostate [31, 32] and breast [33, 34] cancer development and progression, in some cases through stimulation of androgen and ER receptors, respectively [35, 36]. Interestingly, the vast majority (about 75%) of breast cancer patients in our series was positive for ER expression (see Table 1). On the basis of such findings, further studies, such as VAV3 gene and protein expression in breast cancer samples from Sardinian cases in relation to the genotype, will help in better understanding the association of the SNP with the disease. Overall, even when the results of analyses in different populations are not mutually confirmatory, they can provide valuable information of wider importance. As an example, FGFR2 and TOX3, which were previously demonstrated to act as risk modifiers in BRCA1/2 mutation carriers [9, 37], were shown to be associated here in patients without germline mutations in BRCA1/2. In this context, VAV3 may possibly lack a range of gene variations significant for cancer risk in other populations. This would be consistent with the common view that breast cancer patients from different areas may have different genetic backgrounds that influence the impact of low-penetrance susceptibility genes on disease risk.

Conclusions

In the present study, a case–control GWA study for breast cancer risk was carried out in a large collection of Sardinian breast cancer patients negative for BRCA1 or BRCA2 mutations. Among the disease-associated genomic regions, TOX3 and FGFR2 genes have been identified as breast cancer susceptibility genes in BRCA1/2-wild-type breast cancer patients from Sardinia. Future functional studies on such candidate genes will provide further details about their role in pathogenesis of breast cancer in Sardinian population.

Availability of supporting data

Genetic results can be downloaded in bulk or searched for SNPs or genes at the web site of the Istituto di Ricerca Genetica e Biomedica (IRGB), National Research Council (CNR), Cagliari, Italy (http://www.irgb.cnr.it/facs/facs.php).
  36 in total

1.  Low penetrance alleles as risk modifiers in familial and sporadic breast cancer.

Authors:  Eva Esteban Cardeñosa; Inmaculada de Juan Jiménez; Sarai Palanca Suela; Isabel Chirivella González; Angel Segura Huerta; Ana Santaballa Beltran; María Casals El Busto; Eva Barragán González; Oscar Fuster Lluch; José Bermúdez Edo; Pascual Bolufer Gilabert
Journal:  Fam Cancer       Date:  2012-12       Impact factor: 2.375

2.  Improved whole-chromosome phasing for disease and population genetic studies.

Authors:  Olivier Delaneau; Jean-Francois Zagury; Jonathan Marchini
Journal:  Nat Methods       Date:  2013-01       Impact factor: 28.547

Review 3.  BRCA1 and BRCA2: different roles in a common pathway of genome protection.

Authors:  Rohini Roy; Jarin Chun; Simon N Powell
Journal:  Nat Rev Cancer       Date:  2011-12-23       Impact factor: 60.716

Review 4.  Triple-negative breast cancer: epidemiological considerations and recommendations.

Authors:  P Boyle
Journal:  Ann Oncol       Date:  2012-08       Impact factor: 32.976

5.  Vav3 oncogene is overexpressed and regulates cell growth and androgen receptor activity in human prostate cancer.

Authors:  Zhongyun Dong; Yin Liu; Shan Lu; Amy Wang; Kiwon Lee; Lu-Hai Wang; Monica Revelo; Shan Lu
Journal:  Mol Endocrinol       Date:  2006-06-08

Review 6.  Breast cancer genome-wide association studies: there is strength in numbers.

Authors:  D Fanale; V Amodeo; L R Corsini; S Rizzo; V Bazan; A Russo
Journal:  Oncogene       Date:  2011-09-26       Impact factor: 9.867

7.  The rho exchange factors vav2 and vav3 control a lung metastasis-specific transcriptional program in breast cancer cells.

Authors:  Carmen Citterio; Mauricio Menacho-Márquez; Ramón García-Escudero; Romain M Larive; Olga Barreiro; Francisco Sánchez-Madrid; Jesús M Paramio; Xosé R Bustelo
Journal:  Sci Signal       Date:  2012-10-02       Impact factor: 8.192

8.  Genome-wide association studies identify four ER negative-specific breast cancer risk loci.

Authors:  Montserrat Garcia-Closas; Fergus J Couch; Sara Lindstrom; Kyriaki Michailidou; Marjanka K Schmidt; Mark N Brook; Nick Orr; Suhn Kyong Rhie; Elio Riboli; Heather S Feigelson; Loic Le Marchand; Julie E Buring; Diana Eccles; Penelope Miron; Peter A Fasching; Hiltrud Brauch; Jenny Chang-Claude; Jane Carpenter; Andrew K Godwin; Heli Nevanlinna; Graham G Giles; Angela Cox; John L Hopper; Manjeet K Bolla; Qin Wang; Joe Dennis; Ed Dicks; Will J Howat; Nils Schoof; Stig E Bojesen; Diether Lambrechts; Annegien Broeks; Irene L Andrulis; Pascal Guénel; Barbara Burwinkel; Elinor J Sawyer; Antoinette Hollestelle; Olivia Fletcher; Robert Winqvist; Hermann Brenner; Arto Mannermaa; Ute Hamann; Alfons Meindl; Annika Lindblom; Wei Zheng; Peter Devillee; Mark S Goldberg; Jan Lubinski; Vessela Kristensen; Anthony Swerdlow; Hoda Anton-Culver; Thilo Dörk; Kenneth Muir; Keitaro Matsuo; Anna H Wu; Paolo Radice; Soo Hwang Teo; Xiao-Ou Shu; William Blot; Daehee Kang; Mikael Hartman; Suleeporn Sangrajrang; Chen-Yang Shen; Melissa C Southey; Daniel J Park; Fleur Hammet; Jennifer Stone; Laura J Van't Veer; Emiel J Rutgers; Artitaya Lophatananon; Sarah Stewart-Brown; Pornthep Siriwanarangsan; Julian Peto; Michael G Schrauder; Arif B Ekici; Matthias W Beckmann; Isabel Dos Santos Silva; Nichola Johnson; Helen Warren; Ian Tomlinson; Michael J Kerin; Nicola Miller; Federick Marme; Andreas Schneeweiss; Christof Sohn; Therese Truong; Pierre Laurent-Puig; Pierre Kerbrat; Børge G Nordestgaard; Sune F Nielsen; Henrik Flyger; Roger L Milne; Jose Ignacio Arias Perez; Primitiva Menéndez; Heiko Müller; Volker Arndt; Christa Stegmaier; Peter Lichtner; Magdalena Lochmann; Christina Justenhoven; Yon-Dschun Ko; Taru A Muranen; Kristiina Aittomäki; Carl Blomqvist; Dario Greco; Tuomas Heikkinen; Hidemi Ito; Hiroji Iwata; Yasushi Yatabe; Natalia N Antonenkova; Sara Margolin; Vesa Kataja; Veli-Matti Kosma; Jaana M Hartikainen; Rosemary Balleine; Chiu-Chen Tseng; David Van Den Berg; Daniel O Stram; Patrick Neven; Anne-Sophie Dieudonné; Karin Leunen; Anja Rudolph; Stefan Nickels; Dieter Flesch-Janys; Paolo Peterlongo; Bernard Peissel; Loris Bernard; Janet E Olson; Xianshu Wang; Kristen Stevens; Gianluca Severi; Laura Baglietto; Catriona McLean; Gerhard A Coetzee; Ye Feng; Brian E Henderson; Fredrick Schumacher; Natalia V Bogdanova; France Labrèche; Martine Dumont; Cheng Har Yip; Nur Aishah Mohd Taib; Ching-Yu Cheng; Martha Shrubsole; Jirong Long; Katri Pylkäs; Arja Jukkola-Vuorinen; Saila Kauppila; Julia A Knight; Gord Glendon; Anna Marie Mulligan; Robertus A E M Tollenaar; Caroline M Seynaeve; Mieke Kriege; Maartje J Hooning; Ans M W van den Ouweland; Carolien H M van Deurzen; Wei Lu; Yu-Tang Gao; Hui Cai; Sabapathy P Balasubramanian; Simon S Cross; Malcolm W R Reed; Lisa Signorello; Qiuyin Cai; Mitul Shah; Hui Miao; Ching Wan Chan; Kee Seng Chia; Anna Jakubowska; Katarzyna Jaworska; Katarzyna Durda; Chia-Ni Hsiung; Pei-Ei Wu; Jyh-Cherng Yu; Alan Ashworth; Michael Jones; Daniel C Tessier; Anna González-Neira; Guillermo Pita; M Rosario Alonso; Daniel Vincent; Francois Bacot; Christine B Ambrosone; Elisa V Bandera; Esther M John; Gary K Chen; Jennifer J Hu; Jorge L Rodriguez-Gil; Leslie Bernstein; Michael F Press; Regina G Ziegler; Robert M Millikan; Sandra L Deming-Halverson; Sarah Nyante; Sue A Ingles; Quinten Waisfisz; Helen Tsimiklis; Enes Makalic; Daniel Schmidt; Minh Bui; Lorna Gibson; Bertram Müller-Myhsok; Rita K Schmutzler; Rebecca Hein; Norbert Dahmen; Lars Beckmann; Kirsimari Aaltonen; Kamila Czene; Astrid Irwanto; Jianjun Liu; Clare Turnbull; Nazneen Rahman; Hanne Meijers-Heijboer; Andre G Uitterlinden; Fernando Rivadeneira; Curtis Olswold; Susan Slager; Robert Pilarski; Foluso Ademuyiwa; Irene Konstantopoulou; Nicholas G Martin; Grant W Montgomery; Dennis J Slamon; Claudia Rauh; Michael P Lux; Sebastian M Jud; Thomas Bruning; Joellen Weaver; Priyanka Sharma; Harsh Pathak; Will Tapper; Sue Gerty; Lorraine Durcan; Dimitrios Trichopoulos; Rosario Tumino; Petra H Peeters; Rudolf Kaaks; Daniele Campa; Federico Canzian; Elisabete Weiderpass; Mattias Johansson; Kay-Tee Khaw; Ruth Travis; Françoise Clavel-Chapelon; Laurence N Kolonel; Constance Chen; Andy Beck; Susan E Hankinson; Christine D Berg; Robert N Hoover; Jolanta Lissowska; Jonine D Figueroa; Daniel I Chasman; Mia M Gaudet; W Ryan Diver; Walter C Willett; David J Hunter; Jacques Simard; Javier Benitez; Alison M Dunning; Mark E Sherman; Georgia Chenevix-Trench; Stephen J Chanock; Per Hall; Paul D P Pharoah; Celine Vachon; Douglas F Easton; Christopher A Haiman; Peter Kraft
Journal:  Nat Genet       Date:  2013-04       Impact factor: 38.330

9.  Large-scale genotyping identifies 41 new loci associated with breast cancer risk.

Authors:  Kyriaki Michailidou; Per Hall; Anna Gonzalez-Neira; Maya Ghoussaini; Joe Dennis; Roger L Milne; Marjanka K Schmidt; Jenny Chang-Claude; Stig E Bojesen; Manjeet K Bolla; Qin Wang; Ed Dicks; Andrew Lee; Clare Turnbull; Nazneen Rahman; Olivia Fletcher; Julian Peto; Lorna Gibson; Isabel Dos Santos Silva; Heli Nevanlinna; Taru A Muranen; Kristiina Aittomäki; Carl Blomqvist; Kamila Czene; Astrid Irwanto; Jianjun Liu; Quinten Waisfisz; Hanne Meijers-Heijboer; Muriel Adank; Rob B van der Luijt; Rebecca Hein; Norbert Dahmen; Lars Beckman; Alfons Meindl; Rita K Schmutzler; Bertram Müller-Myhsok; Peter Lichtner; John L Hopper; Melissa C Southey; Enes Makalic; Daniel F Schmidt; Andre G Uitterlinden; Albert Hofman; David J Hunter; Stephen J Chanock; Daniel Vincent; François Bacot; Daniel C Tessier; Sander Canisius; Lodewyk F A Wessels; Christopher A Haiman; Mitul Shah; Robert Luben; Judith Brown; Craig Luccarini; Nils Schoof; Keith Humphreys; Jingmei Li; Børge G Nordestgaard; Sune F Nielsen; Henrik Flyger; Fergus J Couch; Xianshu Wang; Celine Vachon; Kristen N Stevens; Diether Lambrechts; Matthieu Moisse; Robert Paridaens; Marie-Rose Christiaens; Anja Rudolph; Stefan Nickels; Dieter Flesch-Janys; Nichola Johnson; Zoe Aitken; Kirsimari Aaltonen; Tuomas Heikkinen; Annegien Broeks; Laura J Van't Veer; C Ellen van der Schoot; Pascal Guénel; Thérèse Truong; Pierre Laurent-Puig; Florence Menegaux; Frederik Marme; Andreas Schneeweiss; Christof Sohn; Barbara Burwinkel; M Pilar Zamora; Jose Ignacio Arias Perez; Guillermo Pita; M Rosario Alonso; Angela Cox; Ian W Brock; Simon S Cross; Malcolm W R Reed; Elinor J Sawyer; Ian Tomlinson; Michael J Kerin; Nicola Miller; Brian E Henderson; Fredrick Schumacher; Loic Le Marchand; Irene L Andrulis; Julia A Knight; Gord Glendon; Anna Marie Mulligan; Annika Lindblom; Sara Margolin; Maartje J Hooning; Antoinette Hollestelle; Ans M W van den Ouweland; Agnes Jager; Quang M Bui; Jennifer Stone; Gillian S Dite; Carmel Apicella; Helen Tsimiklis; Graham G Giles; Gianluca Severi; Laura Baglietto; Peter A Fasching; Lothar Haeberle; Arif B Ekici; Matthias W Beckmann; Hermann Brenner; Heiko Müller; Volker Arndt; Christa Stegmaier; Anthony Swerdlow; Alan Ashworth; Nick Orr; Michael Jones; Jonine Figueroa; Jolanta Lissowska; Louise Brinton; Mark S Goldberg; France Labrèche; Martine Dumont; Robert Winqvist; Katri Pylkäs; Arja Jukkola-Vuorinen; Mervi Grip; Hiltrud Brauch; Ute Hamann; Thomas Brüning; Paolo Radice; Paolo Peterlongo; Siranoush Manoukian; Bernardo Bonanni; Peter Devilee; Rob A E M Tollenaar; Caroline Seynaeve; Christi J van Asperen; Anna Jakubowska; Jan Lubinski; Katarzyna Jaworska; Katarzyna Durda; Arto Mannermaa; Vesa Kataja; Veli-Matti Kosma; Jaana M Hartikainen; Natalia V Bogdanova; Natalia N Antonenkova; Thilo Dörk; Vessela N Kristensen; Hoda Anton-Culver; Susan Slager; Amanda E Toland; Stephen Edge; Florentia Fostira; Daehee Kang; Keun-Young Yoo; Dong-Young Noh; Keitaro Matsuo; Hidemi Ito; Hiroji Iwata; Aiko Sueta; Anna H Wu; Chiu-Chen Tseng; David Van Den Berg; Daniel O Stram; Xiao-Ou Shu; Wei Lu; Yu-Tang Gao; Hui Cai; Soo Hwang Teo; Cheng Har Yip; Sze Yee Phuah; Belinda K Cornes; Mikael Hartman; Hui Miao; Wei Yen Lim; Jen-Hwei Sng; Kenneth Muir; Artitaya Lophatananon; Sarah Stewart-Brown; Pornthep Siriwanarangsan; Chen-Yang Shen; Chia-Ni Hsiung; Pei-Ei Wu; Shian-Ling Ding; Suleeporn Sangrajrang; Valerie Gaborieau; Paul Brennan; James McKay; William J Blot; Lisa B Signorello; Qiuyin Cai; Wei Zheng; Sandra Deming-Halverson; Martha Shrubsole; Jirong Long; Jacques Simard; Montse Garcia-Closas; Paul D P Pharoah; Georgia Chenevix-Trench; Alison M Dunning; Javier Benitez; Douglas F Easton
Journal:  Nat Genet       Date:  2013-04       Impact factor: 38.330

10.  Outcome evaluation in pre-trastuzumab era between different breast cancer phenotypes: a population-based study on Italian women.

Authors:  Laura Cortesi; Elisabetta De Matteis; Claudia Cirilli; Luigi Marcheselli; Manuela Proietto; Massimo Federico
Journal:  Tumori       Date:  2012-11
View more
  7 in total

1.  Breast Cancer Family History and Contralateral Breast Cancer Risk in Young Women: An Update From the Women's Environmental Cancer and Radiation Epidemiology Study.

Authors:  Anne S Reiner; Julia Sisti; Esther M John; Charles F Lynch; Jennifer D Brooks; Lene Mellemkjær; John D Boice; Julia A Knight; Patrick Concannon; Marinela Capanu; Marc Tischkowitz; Mark Robson; Xiaolin Liang; Meghan Woods; David V Conti; David Duggan; Roy Shore; Daniel O Stram; Duncan C Thomas; Kathleen E Malone; Leslie Bernstein; Jonine L Bernstein
Journal:  J Clin Oncol       Date:  2018-04-05       Impact factor: 44.544

Review 2.  Vav1: A Dr. Jekyll and Mr. Hyde protein--good for the hematopoietic system, bad for cancer.

Authors:  Shulamit Katzav
Journal:  Oncotarget       Date:  2015-10-06

3.  The precision relationships between eight GWAS-identified genetic variants and breast cancer in a Chinese population.

Authors:  Yazhen Chen; Fangmeng Fu; Yuxiang Lin; Lin Qiu; Minjun Lu; Jiantang Zhang; Wei Qiu; Peidong Yang; Na Wu; Meng Huang; Chuan Wang
Journal:  Oncotarget       Date:  2016-11-15

4.  BRCA1/2-negative, high-risk breast cancers (BRCAX) for Asian women: genetic susceptibility loci and their potential impacts.

Authors:  Joo-Yeon Lee; Jisun Kim; Sung-Won Kim; Sue K Park; Sei Hyun Ahn; Min Hyuk Lee; Young Jin Suh; Dong-Young Noh; Byung Ho Son; Young Up Cho; Sae Byul Lee; Jong Won Lee; John L Hopper; Joohon Sung
Journal:  Sci Rep       Date:  2018-10-15       Impact factor: 4.379

5.  Impact on breast cancer susceptibility and clinicopathological traits of common genetic polymorphisms in TP53, MDM2 and ATM genes in Sardinian women.

Authors:  Matteo Floris; Giovanna Pira; Paolo Castiglia; Maria Laura Idda; Maristella Steri; Maria Rosaria De Miglio; Andrea Piana; Andrea Cossu; Antonio Azara; Caterina Arru; Giovanna Deiana; Carlo Putzu; Valeria Sanna; Ciriaco Carru; Antonello Serra; Marco Bisail; Maria Rosaria Muroni
Journal:  Oncol Lett       Date:  2022-08-08       Impact factor: 3.111

Review 6.  Fibroblast Growth Factor Receptor 2 Signaling in Breast Cancer.

Authors:  Haipeng Lei; Chu-Xia Deng
Journal:  Int J Biol Sci       Date:  2017-09-05       Impact factor: 6.580

7.  TIGAR-V2: Efficient TWAS tool with nonparametric Bayesian eQTL weights of 49 tissue types from GTEx V8.

Authors:  Randy L Parrish; Greg C Gibson; Michael P Epstein; Jingjing Yang
Journal:  HGG Adv       Date:  2021-11-04
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.