Breast Cancer (BCa) genome-wide association studies revealed allelic frequency differences between cases and controls at index single nucleotide polymorphisms (SNPs). To date, 71 loci have thus been identified and replicated. More than 320,000 SNPs at these loci define BCa risk due to linkage disequilibrium (LD). We propose that BCa risk resides in a subgroup of SNPs that functionally affects breast biology. Such a shortlist will aid in framing hypotheses to prioritize a manageable number of likely disease-causing SNPs. We extracted all the SNPs, residing in 1 Mb windows around breast cancer risk index SNP from the 1000 genomes project to find correlated SNPs. We used FunciSNP, an R/Bioconductor package developed in-house, to identify potentially functional SNPs at 71 risk loci by coinciding them with chromatin biofeatures. We identified 1,005 SNPs in LD with the index SNPs (r(2)≥0.5) in three categories; 21 in exons of 18 genes, 76 in transcription start site (TSS) regions of 25 genes, and 921 in enhancers. Thirteen SNPs were found in more than one category. We found two correlated and predicted non-benign coding variants (rs8100241 in exon 2 and rs8108174 in exon 3) of the gene, ANKLE1. Most putative functional LD SNPs, however, were found in either epigenetically defined enhancers or in gene TSS regions. Fifty-five percent of these non-coding SNPs are likely functional, since they affect response element (RE) sequences of transcription factors. Functionality of these SNPs was assessed by expression quantitative trait loci (eQTL) analysis and allele-specific enhancer assays. Unbiased analyses of SNPs at BCa risk loci revealed new and overlooked mechanisms that may affect risk of the disease, thereby providing a valuable resource for follow-up studies.
Breast Cancer (BCa) genome-wide association studies revealed allelic frequency differences between cases and controls at index single nucleotide polymorphisms (SNPs). To date, 71 loci have thus been identified and replicated. More than 320,000 SNPs at these loci define BCa risk due to linkage disequilibrium (LD). We propose that BCa risk resides in a subgroup of SNPs that functionally affects breast biology. Such a shortlist will aid in framing hypotheses to prioritize a manageable number of likely disease-causing SNPs. We extracted all the SNPs, residing in 1 Mb windows around breast cancer risk index SNP from the 1000 genomes project to find correlated SNPs. We used FunciSNP, an R/Bioconductor package developed in-house, to identify potentially functional SNPs at 71 risk loci by coinciding them with chromatin biofeatures. We identified 1,005 SNPs in LD with the index SNPs (r(2)≥0.5) in three categories; 21 in exons of 18 genes, 76 in transcription start site (TSS) regions of 25 genes, and 921 in enhancers. Thirteen SNPs were found in more than one category. We found two correlated and predicted non-benign coding variants (rs8100241 in exon 2 and rs8108174 in exon 3) of the gene, ANKLE1. Most putative functional LD SNPs, however, were found in either epigenetically defined enhancers or in gene TSS regions. Fifty-five percent of these non-coding SNPs are likely functional, since they affect response element (RE) sequences of transcription factors. Functionality of these SNPs was assessed by expression quantitative trait loci (eQTL) analysis and allele-specific enhancer assays. Unbiased analyses of SNPs at BCa risk loci revealed new and overlooked mechanisms that may affect risk of the disease, thereby providing a valuable resource for follow-up studies.
Apart from a few examples of genetic mutations with high penetrance, such as found in BRCA1 & 2 genes [1], most genetic risk of breast cancer (BCa) resides at multiple low penetrance loci, more recently identified by genome-wide association studies (GWASs) [2]. In general, GWASs utilize single nucleotide polymorphisms (SNPs) to tag common genetic variation in linkage disequilibrium (LD) blocks in order to identify genome-wide risk loci for complex diseases. To date, 71 replicated and independent BCa risk loci have been identified [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16]. There are thousands of SNPs in each LD block, and many of these SNPs are candidates to exert functionality in BCa risk. At the 71 BCa risk loci, at least 320,000 SNPs are associated with BCa risk. Due to this plethora of SNPs in LD, much of the heritability of complex diseases, such as BCa, remains unknown [17]. Identification of underlying mechanisms that explain how SNPs affect risk will provide a better understanding of the genetic risk of complex diseases, such as breast cancer, which is described in this study.In contrast to Mendelian disorders, where most disease-causing mutations result in absent or non-function proteins, many complex disease-associated variants, such as for BCa are mainly found in non-coding regions of the genome. Since >90% of the genome is non-coding and risk mechanisms of complex diseases are likely due to subtle regulation of gene expression, risk-SNPs are more often found in non-coding regions. Knowledge of the non-coding regions is rudimentary compared to the protein coding part. However, recent ENCODE data dramatically demonstrated that the non-coding part of the genome is much more than simply ‘junk’ DNA and contains well-demarcated gene regulatory regions, in particular enhancers [18].We have recently formulated a roadmap to address the functionality of risk SNPs in non-coding regions by characterizing gene regulatory regions with nucleosome and transcription factor occupancy and histone modifications [19]. Moreover, several research groups annotated genomic regions (coding and non-coding) to identify candidate functional SNPs involved in complex diseases [20], [21], [22], [23], [24], [25], [26]. However, as more next generation sequencing (NGS) data (of chromatin annotations from consortia such as ENCODE), more loci (from meta and primary GWASs), and more SNPs at ever lower minor allele frequencies (from the 1000 genomes project) become available, further analyses utilizing updated data and methods are needed for specific diseases such as BCa.In the present study, we addressed the hypothesis that BCa risk SNPs reside in functional genomic regions such as coding exons, TSS regions, and enhancers. In order to identify potentially functional SNPs, we conducted a comprehensive analysis on 656,895 SNPs from the 1000 genomes project data released in May 2012, at the 71 BCa risk loci by measuring LD and annotating them with 11 NGS datasets, all in primary breast epithelial cells. Thus, we found 1,005 potentially functional high LD SNPs. From these, we were able to frame specific hypotheses involving 547 SNPs in terms of novel biological mechanisms; 2 SNPs were at non-benign codon changes in one gene, 42 and 503 SNPs were within response elements of known transcription factors in TSS regions and enhancers, respectively. This shortlist of potentially functional SNPs will not only aid in prioritizing a manageable number of likely functional SNPs, but also reveal hidden biological mechanisms for the etiology of breast cancer.
Results and Discussion
One-thousand-and-five Potentially Functional High LD SNPs in Seventy-one Breast Cancer Risk Loci
To date, 71 replicated risk loci for BCa have been identified primarily using GWASs [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16]. The index SNPs identified by GWASs occur mainly in non-coding DNA (33 intergenic, 33 in introns, 1 in a 3′UTR) and only 4 in coding exons (Fig. 1A, Table S1). Although index SNPs such as rs11571833 (Lys3326Term in BRCA2 gene) [15] seem to be involved in known genetic mechanism of breast cancer tumorigenesis [1], the mechanisms for most of the other index SNPs are hidden. Additionally, these index SNPs are most likely surrogates of many other SNPs in LD, since most of the GWAS arrays were designed based on the Hapmap data to capture a large fraction of common genetic variation [27]. When we extracted SNP data for Europeans from the 1000 genomes project released in May 2012 [28], we found 308,010 very low LD (0≤r2<0.1), 11,438 low LD (0.1≤r2<0.5), and 3,508 high LD (r2≥0.5) SNPs at the 71 BCa risk loci (in a 1 MB window surrounding each index SNP) (Fig. 1B).
Figure 1
Identification of potential functional SNPs in 71 Breast cancer risk loci.
(A) Genomic distribution of 71 replicated index SNPs for breast cancer risk loci. (B) SNPs residing in 1 MB windows around breast cancer risk index SNPs were categorized into the indicated four different groups by measuring LD in EUR ethnic groups. (C) SNPs in each LD group were further analyzed by their locations coinciding with biofeatures. (D) High LD SNPs within biofeatures were categorized to three groups; exon, TSS region, and enhancers. (E) The entire process was summarized in a flow diagram.
Identification of potential functional SNPs in 71 Breast cancer risk loci.
(A) Genomic distribution of 71 replicated index SNPs for breast cancer risk loci. (B) SNPs residing in 1 MB windows around breast cancer risk index SNPs were categorized into the indicated four different groups by measuring LD in EUR ethnic groups. (C) SNPs in each LD group were further analyzed by their locations coinciding with biofeatures. (D) High LD SNPs within biofeatures were categorized to three groups; exon, TSS region, and enhancers. (E) The entire process was summarized in a flow diagram.In order to identify potentially functional SNPs, we hypothesized that risk SNPs occur at sites with functionality of some form or another. Candidates are in coding exons, regulatory regions near TSS (TSS regions), and enhancers. To assist in assigning potential functionality, we performed a FunciSNP (Functional Integration of SNPs) analysis [29]. FunciSNP is an R/Bioconductor package developed in-house to evaluate positional overlap between correlated SNPs at any disease or trait locus, and available chromatin biofeatures. Here, we chose exons, TSS regions (including promoters), and enhancers as biofeatures to annotate the genome comprehensively.Coding exon data were downloaded from the UCSC genome table browser [30]. TSS regions were defined as 3 kb windows centered on the annotated transcription start sites of genes including one or more of the following biofeatures, all in human mammary epithelial cells (HMEC): nucleosome depletion [DNase1-sensitivity and/or Formaldehyde-Assisted Isolation of Regulatory Elements (FAIRE) signals] and/or histone modifications as diagnostics of promoters (H3K4me3, H3K4me2, H3K9ac and/or H3K27ac) [31], [32], [33]. Enhancers were defined as regions in introns and intergenic regions (>1.5 kb from TSS) in HMEC, containing one or more of the following biofeatures: nucleosome depletion (DNase1-sensitivity and/or FAIRE signals) and/or histone modifications as diagnostics of enhancers (H3K4me1, H3K4me2, H3K9ac and/or H3K27ac) [31], [32], [33].In order to identify correlated risk SNPs, a FunciSNP evaluation of each index SNP was applied by extracting all known SNPs from the 1000 genomes project database (1 Mb windows, spanning each index SNP) [28]. Biofeatures were then aligned with the positions of all curated SNPs at each region. Each SNP that overlaps with a biofeature was used to calculate the r2 and distance to the associated index SNP. Among 322,954 correlated SNPs (r2>0), 22 percent were at biofeatures (Fig. 1C). Several issues may be considered to define risk SNPs in LD. One is that low LD SNPs may be the functional risk SNP, poorly measured by the index SNP. On the other hand, high LD SNPs are more likely to be the risk SNP, since this is based on the hypothesis that the underlying functional alleles are common. We identified 1,005 SNPs in relatively high LD (r2≥0.5); 21 in exons, 76 in TSS regions, and 921 in enhancers (Fig. 1D) at 60 of the 71 BCa risk loci. The selection process of potentially functional variants is summarized in Fig. 1E.
Twenty-one High LD SNPs in Exons: Two Non-benign Coding Variants in the ANKLE1 Gene
Twenty-one high LD SNPs (r2≥0.5) were annotated in exons (Fig. 2A). The majority (fifteen) results in synonymous variants. Among the six missense variants, 2 variants: rs8100241 and rs8108174 (both in the gene ANKLE1 at locus 19p13) (Fig. 2B), are predicted to result in a non-benign change as revealed by SIFT and PolyPhen protein function prediction software [34], [35] (Fig. 2C, Table S2). The first of these is in exon 2 (causing A31T) and the other in exon 3 (causing L94Q). Both SNPs are equally and highly correlated (r2 = 0.94) with the original GWAS index SNP, rs2363956, which in turn also results in another non-benign amino acid change (L184W) in exon 5 of ANKLE1 as revealed by PolyPhen analysis (Table S3). Thus, the three SNPs collectively result in two main haplotypes, which in turn create two main protein isoforms, A - L - L and T - Q - W (Fig. S1) with most likely functional consequences as revealed by SIFT and PolyPhen analyses. ANKLE1 is expressed in breast epithelial cells [36], [37] (Fig. S2). It contains an ankyrin repeat likely involved in protein-protein interactions. Also, it is an evolutionary conserved non-membrane-bound LEM protein that shuttles between the nucleus/cytoplasm and has an enzymatically active GIY-YIG endonuclease domain [36]. This multifunctional protein has the potential of affecting many cellular phenotypes and thus cancer risk. The two allelic variants need to be modeled in protein structure-function assays to precisely determine the risk mechanisms involving them. A final interesting genomic feature of the two correlated SNPs is that their locations appear to have histone H3K4me1, -me2 and -me3 signals (Fig. 2B), pointing to possible additional potential roles in regulatory components that in turn may affect expression levels of ANKLE1 and/or the other nearby gene, BABAM1. Such multi-functional SNPs will add to the complexity of BCa disease risk. Interestingly, the same locus was identified in a GWAS of ovarian cancer [38], indicating that ANKLE1 may be generally involved in womencancers, perhaps via hormonal-mediated mechanisms.
Figure 2
21 High LD SNPs in exon and effect of each variant to the respective protein.
(A) The list of high LD SNPs (r2≥0.5) in exons. The risk region number was derived from Table S1 and ordered by chromosome number. Index SNP of each corrSNP and the value of r2 between these SNPs were listed. The distance from the index SNP to each corrSNP was shown along with the name of the nearest gene. The type of each exon variant was also annotated. (B) A genomic browser view of two high LD SNPs, rs8100241 and rs8108174. The first track showed FunciSNP results for the exons. The name of the correlated SNP (rsnumber – r2 value) was shown in blue. The index SNP was shown in black. The bottom tracks were biofeature tracks, RefSeq genes/mRNA/Pseudogene tracks from UCSC Genes, common SNPs (version 137), and Linkage Disequilibrium (LD) blocks. LD block, which was measured by r2 value in phased CEU is shown. Allele frequencies of each SNP (in all populations) and zoomed in view of the genome browser for each SNP were shown, including the amino acid changes by missense variants. (C) The effect of amino acid changes by missense variants of the respective protein was predicted by SIFT and PolyPhen [34], [35].
21 High LD SNPs in exon and effect of each variant to the respective protein.
(A) The list of high LD SNPs (r2≥0.5) in exons. The risk region number was derived from Table S1 and ordered by chromosome number. Index SNP of each corrSNP and the value of r2 between these SNPs were listed. The distance from the index SNP to each corrSNP was shown along with the name of the nearest gene. The type of each exon variant was also annotated. (B) A genomic browser view of two high LD SNPs, rs8100241 and rs8108174. The first track showed FunciSNP results for the exons. The name of the correlated SNP (rsnumber – r2 value) was shown in blue. The index SNP was shown in black. The bottom tracks were biofeature tracks, RefSeq genes/mRNA/Pseudogene tracks from UCSC Genes, common SNPs (version 137), and Linkage Disequilibrium (LD) blocks. LD block, which was measured by r2 value in phased CEU is shown. Allele frequencies of each SNP (in all populations) and zoomed in view of the genome browser for each SNP were shown, including the amino acid changes by missense variants. (C) The effect of amino acid changes by missense variants of the respective protein was predicted by SIFT and PolyPhen [34], [35].
Seventy-six High LD SNPs in TSS Regions
Next, we studied 76 high LD SNPs, which resided at TSS regions of 25 genes (Table S4). Fifty-two percent of these genes are not only expressed in breast tissues, but their expression levels are changed during breast carcinogenesis [39], [40], [41], [42], [43], [44], [45] (Table S5). The TSS regions were defined as containing not only proximal promoters but also distal ones and perhaps also close-by (proximal) enhancers in the 3 kb windows centered at annotated TSS. These genomic regions are likely involved in gene expression regulation of the gene, primarily by altering transcription factor (TF) binding. There are approximately 2,600 proteins in the human genome that bind to DNA [46], and recently, a large number of ChIP-seq datasets were published involving many TFs [18]. However, due to the availability of a limited number of good antibodies and the requirement of high numbers of cells for ChIP assays, ChIP data are often biased towards a subgroup of TFs. As a more broader approach, we performed in silico searches of finding TF REs by utilizing 4 different softwares: HOMER (ChIP-seq known motifs), FIMO, Genome Trax (ChIP-seq TFBS), Haploreg (TRANSFAC, JASPAR, and PBM) [47], [48], [49], [50]. In this way, we established datasets that contain thousands of TF motifs. Among the 76 high LD SNPs in TSS regions, 42 likely affect known transcription factor binding by altering their REs as revealed by our analyses. These SNPs were located at 82 different TF motifs’ REs (Table S6). We ranked the TFs by the number of SNPs affecting their REs across the risk loci, and noted the top 10 motifs, defined as containing 2 or more SNPs affected the motifs in question (Table S7). The top motif was for Specificity Protein 1 (SP1) followed by the motif for Early Growth Response 1 (EGR1). REs of SP1 were affected at 6 TSS regional SNPs from 5 risk loci, and its binding was likely altered by the SNP alleles (Table S6). SP1 is known to be involved in many cellular processes including cell differentiation, cell growth, apoptosis, response to DNA damage, and chromatin remodeling, and its expression is up-regulated in breast cancer cells [51]. Therefore, it is reasonable to suggest that the perturbed REs by our newly identified risk SNPs may alter the binding activity of SP1 and thereby change the expression patterns of the genes, regulated by SP1.One example of a TSS regional SNP is rs2303696 (at 19p13.11 risk locus), which likely alters a SP1 RE. This SNP is highly correlated (r2 = 0.81) with a known index SNP, rs1353747, which is located 22 kb downstream from it. The correlated SNP is located in the promoter region of Inositol-3-phosphate synthase 1 (ISYNA1) gene, which catalyzes the de novo synthesis of myoinositol 1-phosphate from glucose 6-phosphate (Fig. 3, Fig. S3A). Seelan et al [52] reported that E2F1 and SP1 interaction at ISYNA1 gene promoter regulates ISYNA1 expression level. Additionally, it is expressed in breast tissue and decreases 5–6 fold during invasive breast carcinogenesis (Table S5) [39], [45]. We propose here that the SNP may influence the regulatory activity of this gene’s promoter and thus influencing risk.
Figure 3
An example of TSS regional SNPs, rs2303696, in the promoter region of ISYNA1.
The genomic browser view was shown of a TSS regional high LD SNP, rs2303696. First track shows FunciSNP results for TSS region. The name of correlated SNP (rsnumber – r2 value) was shown and color-coded to indicate the number of biofeatures (Fig. S3A). The index SNP was shown in black. The bottom tracks were biofeature tracks, RefSeq genes/mRNA/Pseudogene tracks from UCSC Genes, common SNPs (version 137), and Linkage Disequilibrium (LD) blocks. LD block, which was measured by r2 value in phased CEU is shown. Allele frequencies of rs2303696 (in all populations) and the location of this SNP in SP1 RE were shown.
An example of TSS regional SNPs, rs2303696, in the promoter region of ISYNA1.
The genomic browser view was shown of a TSS regional high LD SNP, rs2303696. First track shows FunciSNP results for TSS region. The name of correlated SNP (rsnumber – r2 value) was shown and color-coded to indicate the number of biofeatures (Fig. S3A). The index SNP was shown in black. The bottom tracks were biofeature tracks, RefSeq genes/mRNA/Pseudogene tracks from UCSC Genes, common SNPs (version 137), and Linkage Disequilibrium (LD) blocks. LD block, which was measured by r2 value in phased CEU is shown. Allele frequencies of rs2303696 (in all populations) and the location of this SNP in SP1 RE were shown.Additionally, expression quantitative trait locus (eQTL) analyses were performed to examine whether these TSS regional SNPs are associated with messenger RNA (mRNA) level by using publicly available datasets [53], [54], [55], [56], [57], [58], [59], [60], [61] (Table S8). Among 76 high LD SNPs in TSS regions, 30 SNPs are significantly associated with nearby gene mRNA level (P<10−5). As an example, rs832552 (at MAP3K1 promoter region) changes the expression level of C5orf35 gene in estrogen receptor positive breast cancer tissues as its allele changes (Table 1).
Table 1
eQTL analyses on high LD SNPs in breast cancer cells.
Index SNP
High LD SNP
r2
Target Gene
eQTL P-value
Cell type
Reference
rs889312
rs832552
0.61
C5orf35
2.46e-6
Estrogen receptor positive breast cancer
(Li et al., 2013) [53]
rs889312
rs252913
0.59
C5orf35
1.36e-8
Estrogen receptor positive breast cancer
(Li et al., 2013) [53]
rs889312
rs331499
0.56
C5orf35
1.16e-11
Estrogen receptor positive breast cancer
(Li et al., 2013) [53]
rs889312
rs331499
0.56
MIER3
7.75e-6
Estrogen receptor positive breast cancer
(Li et al., 2013) [53]
Nine-hundred-and-twenty-one High LD SNPs at Enhancers
Nine-hundred-and-twenty-one high correlated SNPs (r2≥0.5) were annotated at enhancers (Table S9). To verify the activity of identified enhancers, we performed in vitro enhancer assays by cloning approximately 1.2 kb regions in which the SNPs reside. We selected the best 11 SNP regions for cloning, based on the number of chromatin biofeatures (5 or more coinciding biofeatures), and named them breast cancer enhancer 1 (BCE1) through BCE11 (Table S10). By performing dual luciferase assays in normal and breast cancer cells, we found that 9 out of the 11 regions retained enhancer activities over background (CT1 and CT2) in either normal or breast cancer cells, or in both cells types (Fig. 4A, Table S10 and S11). Among 9 active enhancers, BCE4, -5, and -8 had enhancer activities in both normal (HMEC and MCF10A) and breast cancer cells (MCF7 and MDAMB231). On the other hand, BCE1, -2, and -11 revealed enhancer activities only in normal HMEC. BCE7 had enhancer activity only in MCF7, estrogen receptor (ER) positive breast cancer epithelial cells. BCE3 retained enhancer activity in ER negative breast epithelial cells: MDAMB231, MCF10A, HMEC. These BCE enhancers were either in intron or intergenic region: BCE 3, -6, -9, -10 and -11 were in introns. Regardless of SNP location in genome, they retained enhancer activity. As an example of enhancer in intron region, BCE9 was located in intron 9 of RAD51L1 gene, which showed differential expression level between breast cancer cells (MDAMB231) and normal breast epithelial cells (HMEC) (Fig. S4).
Figure 4
Novel enhancers including high LD SNPs were identified in breast epithelial cells.
(A) Eleven enhancer regions, which included FunciSNP identified BCa high LD SNPs in epigenetically defined enhancers, were cloned and analyzed using the dual luciferase assays in MCF7 (blue), MDAMB231 (red), MCF10A (orange) and HMEC (blue). Each luciferase activity was divided by average luciferase activity of two negative controls, CT1 and 2. The average value of two negative controls was shown as a horizontal line across the breast cancer enhancers (BCEs) (gray). (B) The location of three SNPs (blue: rs4871782, green: rs28759353, black: rs10087810) at BCE5 and breast cancer risk tagSNP (red: rs13281615) in 8q24.21 region. (C) Allele-specific FAIRE assays were performed near three candidate SNPs for breast cancer risk at BCE5. Sequence results of Input DNA and FAIRE DNA are shown. The colors of the nucleotides from DNA sequencing: blue is C, green is A, black is G and red is T. Sequences near SNP were shown in a double-strand DNA (bottom). (D) Linkage Disequilibrium (LD) plot (r2) and haplotypes of three SNPs (in EUR) with breast cancer risk tagSNP, rs13281615 were shown [5]. Allele-specific in vitro dual luciferase assays were performed in HMEC (E) and MDAMB231 cells (F). The Analysis of variance statistical test (ANOVA) was used to confirm the difference and two-side p-values between alleles were calculated using the student t-test.
Novel enhancers including high LD SNPs were identified in breast epithelial cells.
(A) Eleven enhancer regions, which included FunciSNP identified BCa high LD SNPs in epigenetically defined enhancers, were cloned and analyzed using the dual luciferase assays in MCF7 (blue), MDAMB231 (red), MCF10A (orange) and HMEC (blue). Each luciferase activity was divided by average luciferase activity of two negative controls, CT1 and 2. The average value of two negative controls was shown as a horizontal line across the breast cancer enhancers (BCEs) (gray). (B) The location of three SNPs (blue: rs4871782, green: rs28759353, black: rs10087810) at BCE5 and breast cancer risk tagSNP (red: rs13281615) in 8q24.21 region. (C) Allele-specific FAIRE assays were performed near three candidate SNPs for breast cancer risk at BCE5. Sequence results of Input DNA and FAIRE DNA are shown. The colors of the nucleotides from DNA sequencing: blue is C, green is A, black is G and red is T. Sequences near SNP were shown in a double-strand DNA (bottom). (D) Linkage Disequilibrium (LD) plot (r2) and haplotypes of three SNPs (in EUR) with breast cancer risk tagSNP, rs13281615 were shown [5]. Allele-specific in vitro dual luciferase assays were performed in HMEC (E) and MDAMB231 cells (F). The Analysis of variance statistical test (ANOVA) was used to confirm the difference and two-side p-values between alleles were calculated using the student t-test.Among these enhancers, we investigated BCE5 in more detail as a proof-of-principle. FunciSNP analysis identified three-correlated risk SNPs (r2≥0.5) at the active regulatory element within BCE5 (rs4871782, rs28759353, and rs10087810) (Fig. 4B). In order to determine whether the alleles of these three SNPs participated in nucleosome depletion (i.e. as measured by FAIRE), we performed allele-specific FAIRE using a HMEC cell strain, which was heterozygous for the three SNPs. Allele-specific FAIRE can determine functional regulatory polymorphisms [62]. Here, allele-specific FAIRE for the three SNPs was performed by sequencing across the interested SNP region of FAIRE samples and comparing the sequence of peaks with that of input DNA (as control). For rs4871782, the FAIRE sample contained about the same relative amount of the two alleles, compared to input. In contrast, for rs28759353, the FAIRE sample had clearly more of the G allele, compared to the input signal. Similarly, for rs10087810, more of the T allele was detected in the FAIRE sample, compared to the input (Fig. 4C). Note the high fidelity of the sequence reactions between the FAIRE and input samples as reflected by the almost identical relative sizes of the peaks surrounding the SNP. These results may indicate that the rs28759353, G allele and rs10087810, T allele (i.e. the GT haplotype) had a more open chromatin structure than the other alleles and perhaps consequently a higher enhancer activity, which we tested next (see below).We analyzed the haplotype of rs28759353, rs4871782, and rs10087810 SNPs relative to the risk tagSNP, rs13281615 [5]. The GGTA haplotype (Fig. 4D) had lower risk of breast cancer because it correlated with the risk allele of rs13281615. The other haplotypes and relative percentages are shown in Europeans. In order to relate allele-specific FAIRE results to enhancer activity, we next performed allele-specific in vitro enhancer assays by generating plasmids, which contain different versions of each SNP in BCE5 region (Fig. 4E and F). Overall, we found that the risk versions of each SNP independently had lower enhancer activities. These results together with the allele specific FAIRE data indicated that rs28759353 and rs10087810 were functional SNPs, with the risk allele having more nucleosome depletion and higher enhancer activity in the in vitro assay. Although we do not understand the disparity between the two assays for SNP rs4871782, it is probably related to the sensitivity of the two assays. For this particular SNP, allele-specific FAIRE is less sensitive to be picked up in the allele-specific FAIRE analysis.
Transcription Factors, which Likely Bind to High LD SNPs at Enhancers
Among 921 SNPs in enhancer regions, 503 SNPs likely affect known transcription factor binding by altering their REs (Table S12). By performing in silico searches of TF REs as described above for TSS regions, we identified 455 different transcription factor REs where the TF binding will likely to be altered by the risk-correlated SNP. Among the motifs, we ranked them by the number of SNPs affecting their RE. The top 18 motifs were selected for further analysis (see below) (Table S13). The top motif was for the T-cell acute lymphocytic leukemia 1 (TAL1; aka SCL); 28 enhancer SNPs at 16 BCa risk loci were thus identified. The next ranked motifs most often likely affected in this manner were in order, Eomesoderim (EOMES), Foxhead box P1 (FOXP1) and SP1. TAL1 is a transcription factor that acts in hemopoiesis, anti-apoptosis, angiogenesis, and other activities [63], [64], [65]. It is expressed in breast tissue and decreases 2–3 fold during invasive breast carcinogenesis [42], [45] (Fig. S5). It also inhibits the expression level of GATA3, a transcription factor, which inhibits breast cancer metastasis [66], [67].One example of a likely TAL1-affecting SNP is rs76969790 at the 5q11 risk locus (Fig. 5, Fig. S3B). The SNP is highly correlated (r2 = 0.88) with a GWAS index SNP, rs1353747, which is located 58 kb upstream from it. This correlated SNP is located in the large intron 10 of the PDE4D gene. PDE4D encodes for an enzyme that has 3′, 5′-cyclic-AMP phosphodiesterase activity and degrades cAMP, resulting in regulation of multiple signaling pathways and metabolism (i.e. GPCR and TOR signaling, cAMP metabolism) [68], [69]. The intron 10 of PDE4D gene is large, 140 kb in length and contains several histone marks of enhancers with nucleosome depletion signals (i.e. DNaseI and FAIRE). The rs76969790 is in close proximity with FAIRE and DNase1 signals and coincides exactly with enhancer histone marks (H3K4me1, H3K27ac, H3K4me2, and H3K9ac) (Fig. 5).
Figure 5
An example of enhancer SNPs, rs76969790 likely alters a TAL1 response element.
The genomic browser view was shown of an enhancer SNP, rs2303696. First track showed FunciSNP results for enhancers. The name of correlated SNP (rsnumber – r2 value) was shown and color-coded to indicate the number of biofeatures (Fig. S3B). The index SNP was shown in black. The bottom tracks were biofeature tracks, RefSeq genes/mRNA/Pseudogene tracks from UCSC Genes, common SNPs (version 137), and Linkage Disequilibrium (LD) blocks. LD block, which was measured by r2 value in phased CEU is shown. Allele frequencies of rs76969790 (in all populations) and the location of this SNP in TAL1 RE are shown.
An example of enhancer SNPs, rs76969790 likely alters a TAL1 response element.
The genomic browser view was shown of an enhancer SNP, rs2303696. First track showed FunciSNP results for enhancers. The name of correlated SNP (rsnumber – r2 value) was shown and color-coded to indicate the number of biofeatures (Fig. S3B). The index SNP was shown in black. The bottom tracks were biofeature tracks, RefSeq genes/mRNA/Pseudogene tracks from UCSC Genes, common SNPs (version 137), and Linkage Disequilibrium (LD) blocks. LD block, which was measured by r2 value in phased CEU is shown. Allele frequencies of rs76969790 (in all populations) and the location of this SNP in TAL1 RE are shown.Additionally, expression quantitative trait locus (eQTL) analyses on the 921 high LD SNPs in enhancers were conducted using published data as we described above for TSS regions [53], [54], [55], [56], [57], [58], [59], [60], [61]. Since the eQTL analyses were detecting relationships between SNP and nearby genes (cis-eQTL), a relatively small number of enhancer high LD SNPs (65 SNPs) were associated with mRNA levels (Table S14). This is unlike the eQTL results in TSS regional high LD SNPs referred to above. Alternatively, the sample number for eQTL analyses could have been too low to detect the association signal between risk loci and affected genes.
Interactions among Breast Cancer Risk Loci
In order to investigate the interactions among genes at the breast cancer risk loci, we further highlighted 32 genes, which contained functional SNPs either in their exons or within their TSS regions. Using these 32 genes plus the BRCA2 gene, in which rs11571833, a nonsense index SNP resided, we executed an ingenuity pathway analysis (IPA, www.ingenuity.com). When we examined interactions among these genes and/or their protein products by using data from published papers, we found only one direct interaction and two indirect interactions [70], [71], [72] (Fig. 6).
Figure 6
Interactions among breast cancer risk loci.
32 proteins coded by genes, which contained functional SNPs either in their exons or within their TSS regions plus the BRCA2, in which a nonsense index SNP resided, were laid out using subcellular localization annotation. Each molecule was shown in green hexagon. Interactions among these 33 genes/proteins were shown in green arrows (direct: solid line, indirect: dashed line). Each circle colored in yellow represents each TF. The interactions between the group, containing 33 genes/proteins and another group, containing top 18 TFs (yellow circle) that affected by high LD SNPs were shown in black arrows (direct: solid line, indirect: dashed line).
Interactions among breast cancer risk loci.
32 proteins coded by genes, which contained functional SNPs either in their exons or within their TSS regions plus the BRCA2, in which a nonsense index SNP resided, were laid out using subcellular localization annotation. Each molecule was shown in green hexagon. Interactions among these 33 genes/proteins were shown in green arrows (direct: solid line, indirect: dashed line). Each circle colored in yellow represents each TF. The interactions between the group, containing 33 genes/proteins and another group, containing top 18 TFs (yellow circle) that affected by high LD SNPs were shown in black arrows (direct: solid line, indirect: dashed line).We next analyzed the relationships among the top 18 TF motifs affected by 10 or more enhancer SNPs and proteins encoded by the above 33 genes. Although some of genes have been understudied and currently lack information about their functions and locations, we observed that a number of proteins interacted with each other, and these TFs mediated interactions among the 33 BCa risk genes/proteins (Fig. 6). For instance, SP1 binds to the promoter (−329bp to 324bp) of the FOSL1 gene, whereas SP1 binds directly to another BCa risk protein, BRCA2 [73], [74]. BRCA2 binds to several fragments of the AR protein (1aa-556aa, 627aa-919aa) [75]. In turn, AR binds to RANBP9 and CASP8 [76], [77], [78]. In prostate cancer cell lines, RANBP9 increases activity of AR protein. CASP8 protein level increases cleavage of AR proteins [76], [77], [79], [80]. These functional networks among identified genes/proteins and motifs at the 71 BCa risk loci may be key interactions, which affect genetic risk for BCa.Recently, Cowper-Sal Lari et al [23] reported that FOXA1 binding to high LD SNPs in BCa are more frequent than other transcription factors. However, that study was performed using a limited number of transcription factors (ChIP-seq data in ER positive breast cancer cells of only 16 transcription factors) and a relatively small number of SNPs (obtained from the limited Hapmap datasets) in only 44 BCa risk loci. For an updated, more comprehensive and unbiased analysis, we assessed high LD SNPs in TF REs within TSS regions and enhancers using the 1000 genomes database, which contained not only rare variants but also un-tagged SNPs from the Hapmap project [27], [28], [81]. We further interrogated thousands of TF motifs in known datasets [47], [48], [49], [50]. Our top potentially affected TFs were SP1 and TAL1, at TSS regions and enhancers, respectively. SNPs in FOXA1 REs were ranked only 51th in our priority list (Table S12). The difference between the Cowper-Sal Lari et al [23] study and the work reported here, is likely due to our more comprehensive analysis coupled with the limited number of TFs and SNPs assessed in the Cowper-Sal Lari et al study.Recently, it was reported that the average number of distal elements interacting with a TSS was 3.9, and the average number of TSSs interacting with a distal element was 2.5 [18]. Another study on genome structure also revealed that active chromatin regions formed inter-chromosomal contacts and blocks of each chromosome interacted with blocks in different chromosomes, composing a spatial nuclear structure [82]. Therefore, a large number of chromosomal contacts and interactions likely are orchestrated by the three-dimensional organization of the nucleus. Through eQTL analyses, we identified the precise genomic loci (SNPs) that regulated expression level of mRNAs. However, it did not demonstrate direct interactions among regulatory elements. Looping interactions between enhancers and target genes can be detected by 3C (chromatin chromosome capture) assays [18]. To scan the interactions genome-wide, 3C derivative methods (3C-seq, 4C-seq, 5C-seq, ChIA-PET and HiC-seq) may be applied [83], [84]. Targets of regulatory elements can be also identified in vitro and in vivo by knock-out DNA method such as transcription activator-like effector nucleases (TALEN) [85] and transgenicmouse modeling by knocking in conserved regulatory elements [86].Newly identified regulatory elements, coinciding with high LD SNPs are not necessarily targeting protein-coding genes. For instance, they can interact with long noncoding RNAs (lncRNA) [87]. Each SNP identified by FunciSNP [29] was further annotated by us for proximity to the nearest known lncRNA (Table S4 and S9). We also identified potentially functional high LD SNPs in regulatory elements that intersect with lncRNA using LNCpedia database version 1.2 [88] (Table S15).
Conclusions
Since 2005, over 1,600 variants have been identified at p-value ≤5×10−8 for over 250 traits. Most of the identified index SNPs from GWASs are in noncoding DNA regions, making the assignment of functionality difficult [27]. Despite the controversy surrounding the utility of GWAS, post-GWAS identification of mechanisms have become valuable for the identification of genomic targets of diseases. Here, we provide functional rationales for 21 SNPs in exons, 76 SNPs in TSS regions and 921 SNPs in putative enhancers at 60 of the 71 BCa risk loci. These annotations are based on the assumption that functional alleles are common. This short list out of more than 320,000 correlated risk SNPs can be used in follow-up fine-mapping and functional studies on identifying disease-causing SNPs.
Materials and Methods
Cell Culture
HMEC cells were obtained from Lonza (Lonza, Walkersville, MD) and cultured under recommended conditions. MDAMB231, MCF10A and MCF7 cells were obtained from American Type Culture Collection (ATCC, Manassas, VA). MDAMB231 and MCF7 cells were cultured in DMEM with 5% FBS. MCF10A cells were cultured in DMEM/F12 with 5% horse serum, 100 units/ml penicillin, 0.1 mg/ml streptomycin, 0.5 µg/ml hydrocortisone, 100 ng/ml cholera toxin, 10 µg/ml insulin, and 20 ng/ml epidermal growth factor (EGF).
FAIRE-seq Library Construction and Sequencing
FAIRE assays were performed as described [89], with a number of modifications. Briefly, the method was as follows: (1) intact cells were crosslinked (1% formaldehyde in PBS); (2) nuclei were extracted from cells and re-suspended in SDS lysis buffer; (3) chromatin DNA was fragmented by sonication; (4) FAIRE DNA samples and reverse-crosslinked input DNA were purified by phenol-chloroform extraction. Two independent libraries were made for each sample by using bar-coded adapters. Each library was PCR amplified and confirmed by quantitative real-time PCR (qPCR). Single-end DNA sequencing (Illumina Hi-Seq 50 cycles) was performed at the USC Epigenome Center. Two independent assays were analyzed separately and then the data were combined in order to increase the depth of coverage (Table S16 and Fig. S6). More than 82% of the merged FAIRE peaks intersected. FAIRE-seq data were deposited in the NCBI GEO under accession number GSE46074.
Identification of FAIRE-seq Peaks
Each bam file was filtered using a quality filter score of 30 after removing PCR artifacts and duplicates by the Samtools [90]. The identification of FAIRE-seq peaks was performed using the findPeaks from HOMER (http://biowhat.ucsd.edu/homer) [47]. Peaks were identified by using a triangle-based distribution with a median length of 150bp. In order to find the peaks, which are not false positive, we used input with an alpha value of 0.01; 99.0% confidence interval for peak pairs, which are unequal between sample and input was used. A subpeak value of 0.6 with a trim float value of 0.3 was used to perform peak separation. After peak identification, we calculated a p-value for each peak between sample and input. To be most stringent, functional peaks [47] at a p-value of 10−9 were used as a cut off to select significantly enriched peaks. FAIRE-seq data within 3 kb windows centered on the annotated TSS of genes were used to define TSS regions. The data >1.5 kb from TSS were utilized to define enhancer regions for the FunciSNP analysis.
Histone Modification ChIP-seq Data
Histone modification ChIP-seq data (H3K4me1, me2, me3, H3K9Ac and H3K27Ac) in HMEC were obtained from accession number [GSE29611] through the NCBI Gene Expression Omnibus portal. [GSE29611] was published as part of the ENCODE project. ChIP assay protocol as well as data processing details may be seen here (http://genome.ucsc.edu/cgi-bin/hgTrackUi?db=hg19&g=wgEncodeBroadHistone).Chromatin State Segmentataion HMM data generated by using above ChIP-seq data were obtained from accession number [GSE38163] and included for the FunciSNP analyses of regulatory elements. NGS data within 3 kb windows centered on the annotated transcription start sites of genes were used for TSS regions. For putative enhancer regions, NGS data >1.5 kb from TSS were utilized.
DNaseI-seq Data
DNaseI-seq data in HMEC were obtained from accession number [GSE32970] through the NCBI Gene Expression Omnibus portal. Additional DNaseI-seq data generated by University of Washington as part of the ENCODE project were downloaded from here (http://hgdownload.cse.ucsc.edu/goldenPath/hg19/encodeDCC/wgEncodeUwDnase/). Detailed protocols may be seen at following websites (http://genome.ucsc.edu/cgi-bin/hgTrackUi?hgsid=307403817&c=chr1&g=wgEncodeOpenChromDnase and http://genome.ucsc.edu/cgi-bin/hgTrackUi?hgsid=307403817&c=chr1&g=wgEncodeUwDnase). NGS data within 3 kb windows centered on the annotated transcription start sites of genes were used to define TSS regions for FunciSNP analysis. For putative enhancer regions, NGS data >1.5 kb from TSS were utilized.
FunciSNP
FunciSNP is an in-house developed R/Bioconductor package for the Functional Integration of SNPs with phenotype by coincidence with chromatin biofeatures. All statistical tests were done using R software (R version 2.9.2, 2009-08-24, (R Development Core Team, 2009)). FunciSNP version 0.99 was used to find correlated SNPs, which coincide with 11 independent ChIP-seq/FAIRE-seq/DNaseI-seq data sets in TSS regions and putative enhancer regions. All the SNPs from the 1000 genomes project (up to May 2012 data release) [28] residing in 1 Mb windows around breast cancer risk index SNP and within EUR ethnic groups (original GWAS), were analyzed with an r2 value 0.5 as a cut-off (Table S4 and S9).
Plasmid Construction and Luciferase Reporter Assays
Eleven potential enhancer regions (∼1200bp sequence surrounding the nucleosome depleted regions with FunciSNP identified correlated SNP) were amplified from genomic DNA using High Fidelity Platinum Tag DNA polymerase (Invitrogen Corp., Carlsbad, CA). The amplified sequences were then subcloned using SacII, EcoRI, BglII or KpnI restriction sites upstream of a thymidine kinase (TK) minimal promoter-firefly-luciferase vector. All clones were confirmed by sequencing. The primer sequences for subcloning are listed in Table S11. HMEC, MCF10A, MDAMB231, MCF7 cells were transfected with reporter plasmids along with constitutively active pRL-TK Renilla luciferase plasmid (Promega Corp., Madison, WI) using LipofectamineLTX Reagent (Invitrogen Corp., Carlsbad, CA) under recommended protocol. Dual luciferase activities were measured as previously described [91].
Allele-specific FAIRE
PCR reactions were performed on FAIRE-isolated and input DNA using High Fidelity Platinum Taq DNA polymerase (Invitrogen Corp., Carlsbad, CA) for 15 cycles after which products were purified and re-PCRed for 20 cycles to minimize the PCR artifacts due to over-cycling. Purified DNA from these reactions was sequenced, using primers near the SNP locations by the DNA Core Facility at the University of Southern California (Table S11). Each experiment was independently performed more than twice.
Allele-specific Luciferase Reporter Assays
Point mutations were introduced to create enhancer-reporter constructs with specific SNP allele using QuikChange site-directed mutagenesis kit (Agilent Technologies Inc., Santa Clara, CA). In order to avoid the bias from miniprep procedures, six independent clones of each construct were made and confirmed by sequencing. Each of the six independent clones of each construct were transfected in four wells and two luciferase assays per well were performed in order to record luciferase-reading variation. Allele-specific fold activities were presented and values shown are means ± SEM of the six independent clones of each allele. The analysis of variance statistical test (ANOVA) was used to confirm the difference and two-side p-values between alleles were calculated using the student t-test.
Gene Expression Analysis between Breast Cancer and Normal Breast Tissues
We compared gene expression levels between breast cancer and normal tissues using the Oncomine database, released in Sep 2012 [45]. This database currently contained more than 674 datasets and information on 73,327 samples tissues, including datasets with over 593 samples for breast cancer [39], [40], [41], [42], [43], [44], [45]. For the differential expression analyses, t-test with false discovery rates as a corrected measure of significance was performed and following cut-off thresholds were utilized: p-value <10−4, fold change >2.0, within top 10% gene rank. The result of this analysis for the genes, which high LD TSS regional SNPs reside in, is listed in Table S5. As an example, TAL1 gene expression level change between normal and breast cancer tissues were shown in detail as boxplots (Fig. S5).
RNA-seq Data for the ANKLE1 Gene
Long RNA-seq from ENCODE/Cold Spring Harbor Lab in HMEC and MCF7 cells were obtained through the UCSC genome browser tracks [92]. In addition to profiling Poly-A+ and Poly-A- RNA from whole cells, RNA-seq data from the cytosol and nucleus were performed in MCF7 cells. These expression data at the ANKLE1 gene were shown in Fig. S2.
Gene Expression Analysis between HMEC and MDAMB231 Cells
We compared gene expression levels between HMEC and MDAMB231 cells by using the affymetrix HG-U133 plus2 microarrays obtained from the accession number [GSE33167] [93]. RAD51L1 gene expression values for both cells were processed and its bar plots were graphed by using the GEO2R [94] (Fig. S4).
eQTL Analyses
We performed expression quantitative trait locus (eQTL) analyses on FunciSNP identified SNPs to examine whether these SNPs were associated with messenger RNA (mRNA) level of nearby genes. We assessed eQTL for all SNPs by using the RegulomeDB, the GTEX database (http://www.ncbi.nlm.nih.gov/gtex/GTEX2/gtex.cgi), University of Chicago eQTL Browser (http://eqtl.uchicago.edu), the Genevar (http://www.sanger.ac.uk/resources/software/genevar/), and The Cancer Genome Atlas (TCGA) breast cancer datasets in 15 breast cancer risk loci [53], [54], [55], [56], [57], [58], [59], [60], [61]. To be most stringent, a p-value of 10−5 was used as a cut-off (Table S8 and S14). Posterior probability and the Bayes factor were used to analyze the eQTL data from Veyrieras et al and Mangravite et al [59].
Motif Discovery
In order to annotate SNP effects on regulatory motifs, sets of position weight matrices (PWMs) were used from FIMO, HOMER (ChIP-seq known motifs), Genome Trax (ChIP-seq TFBS), Haploreg (TRANSFAC, JASPAR, and PBM) [47], [48], [49], [50]. FIMO analysis was performed using the motif database, called JASPAR CORE 2009 vertebrates, downloaded from the MEME suite (http://tools.genouest.org/tools/meme/meme-download.html) [48]. P-value for output threshold utilized for FIMO was 1e-4. FindMotif analysis was executed by using known motifs generated from HOMER. Each motif matrix was established after collecting strong binding sites of each TF genome wide from published human ChIP-seq data. Log odds score of the motif matrix cut-off value 5 was used for findMotif analysis. Predicted ChIP-seq TFBS analysis from Genome Trax was utilized with the motif score cut-off 0.7. Its database contains motif matrices from best-scoring TF binding sites identified with a ChIP-chip or ChIP-seq fragment. A stringent threshold of p<4−8 was applied for the PWM score of each instance for Haploreg. The change in log-odds (LOD) score as alleles change was calculated and listed in Table S6 and S9. Each identified motif RE was organized by SNP id, and the number of SNPs affecting regulatory motif was counted to rank the TFs (Table S6 and S12).
Transcription Factor and Gene/protein Interaction Analysis
We obtained information of the top 18 TFs and 33 genes/proteins using an Ingenuity Pathway Analysis (IPA, www.ingenuity.com). IPA Path Explore tools were used to identify direct and indirect interactions among molecules. IPA Path Designer tools were utilized to map the annotated subcellular location of each molecule.Linkage Disequilibrium block and haplotype analysis of 2 corrSNPs, rs8100241 and rs8108174, and their index SNP, rs2363956. (A) Linkage Disequilibrium block (in EUR) showing two high LD SNPs, rs8100241 and rs8108174, and index SNP, rs2363956, found in exons of ANKLE1. (B) Haplotypes of these SNPs (in EUR) and protein isoforms, containing different amino acid compositions. Antoniou et al reported that T allele of rs2363956 is associated with breast cancer risk [4](TIF)Click here for additional data file.The UCSC genome browser near the
gene, showing breast epithelial cell RNA-seq data. Long RNA-seq from ENCODE/Cold Spring Harbor Lab in HMEC and MCF7 cells were used [92]. For MCF7 cells, in addition to profiling Poly-A+ and Poly-A- RNA from whole cells, RNA-seq data from the cytosol and nucleus were performed. Two replicates for each condition were conducted. Contigs and signals from each replicate were shown in the above tracks.(TIF)Click here for additional data file.Overlap count keys for FunciSNP results. The name of correlated SNP is colored based on the number of biofeatures. (A) Overlap count key for FunciSNP results for TSS regions. (B) Overlap count key for FunciSNP results for enhancers.(TIF)Click here for additional data file.gene expression value in HMEC and MDAMB231.
RAD51L1 gene expression value for HMEC and MDAMB231 were obtained from accession number [GSE33167]. Three replicates for each cell type were generated by using the affymetrix HG-U133 plus2 arrays [93]. Expression bar plots were graphed by using the GEO2R [94](TIF)Click here for additional data file.expression level in breast tissues. The expression value of TAL1 gene was obtained from The Cancer Genome Atlas (TCGA) breast tissues [42]. (A) TAL1 expression level comparison between normal breast tissues and invasive breast carcinoma (B) comparison between normal breast tissues and invasive ductal breast carcinoma (C) comparison between normal breast tissues and mixed lobular and ductal breast carcinoma (D) comparison between normal breast tissues and invasive lobular breast carcinoma. The analysis was performed by using the Oncomine database [95].(TIF)Click here for additional data file.HMEC FAIRE peaks from two replicates.(TIF)Click here for additional data file.71 Breast cancer risk index SNPs and high LD SNPs genomic locations.(DOC)Click here for additional data file.Protein function prediction results for missense variants of high LD SNPs.(XLS)Click here for additional data file.Protein function prediction of index SNPs in exons.(XLS)Click here for additional data file.FunciSNP results for TSS regional high LD SNPs.(XLS)Click here for additional data file.Differential expression analysis of the genes, which high LD TSS regional SNPs reside in.(XLS)Click here for additional data file.TSS regional high LD SNP motif analysis result.(XLS)Click here for additional data file.Top 10 TF motifs for TSS regional high LD SNPs.(DOC)Click here for additional data file.eQTL analyses on 76 TSS regional high LD SNPs.(DOC)Click here for additional data file.FunciSNP results for high LD SNPs in enhancers.(XLS)Click here for additional data file.Breast Cancer Enhancer (BCE) regions used for luciferase assays.(DOC)Click here for additional data file.Oligonucleotide sequences used for cloning and qPCR.(DOC)Click here for additional data file.high LD SNPs in enhancer motif analysis result.(XLS)Click here for additional data file.Top 18 TF motifs for high LD SNPs in enhancers.(DOC)Click here for additional data file.eQTL analyses on high LD SNPs in enhancers.(DOC)Click here for additional data file.lncRNA which intersect with high LD SNPs in regulatory elements.(XLS)Click here for additional data file.FAIRE-seq statistics.(XLS)Click here for additional data file.
Authors: Lingyun Song; Zhancheng Zhang; Linda L Grasfeder; Alan P Boyle; Paul G Giresi; Bum-Kyu Lee; Nathan C Sheffield; Stefan Gräf; Mikael Huss; Damian Keefe; Zheng Liu; Darin London; Ryan M McDaniell; Yoichiro Shibata; Kimberly A Showers; Jeremy M Simon; Teresa Vales; Tianyuan Wang; Deborah Winter; Zhuzhu Zhang; Neil D Clarke; Ewan Birney; Vishwanath R Iyer; Gregory E Crawford; Jason D Lieb; Terrence S Furey Journal: Genome Res Date: 2011-07-12 Impact factor: 9.043
Authors: Christopher A Haiman; Gary K Chen; Celine M Vachon; Federico Canzian; Alison Dunning; Robert C Millikan; Xianshu Wang; Foluso Ademuyiwa; Shahana Ahmed; Christine B Ambrosone; Laura Baglietto; Rosemary Balleine; Elisa V Bandera; Matthias W Beckmann; Christine D Berg; Leslie Bernstein; Carl Blomqvist; William J Blot; Hiltrud Brauch; Julie E Buring; Lisa A Carey; Jane E Carpenter; Jenny Chang-Claude; Stephen J Chanock; Daniel I Chasman; Christine L Clarke; Angela Cox; Simon S Cross; Sandra L Deming; Robert B Diasio; Athanasios M Dimopoulos; W Ryan Driver; Thomas Dünnebier; Lorraine Durcan; Diana Eccles; Christopher K Edlund; Arif B Ekici; Peter A Fasching; Heather S Feigelson; Dieter Flesch-Janys; Florentia Fostira; Asta Försti; George Fountzilas; Susan M Gerty; Graham G Giles; Andrew K Godwin; Paul Goodfellow; Nikki Graham; Dario Greco; Ute Hamann; Susan E Hankinson; Arndt Hartmann; Rebecca Hein; Judith Heinz; Andrea Holbrook; Robert N Hoover; Jennifer J Hu; David J Hunter; Sue A Ingles; Astrid Irwanto; Jennifer Ivanovich; Esther M John; Nicola Johnson; Arja Jukkola-Vuorinen; Rudolf Kaaks; Yon-Dschun Ko; Laurence N Kolonel; Irene Konstantopoulou; Veli-Matti Kosma; Swati Kulkarni; Diether Lambrechts; Adam M Lee; Loïc Le Marchand; Timothy Lesnick; Jianjun Liu; Sara Lindstrom; Arto Mannermaa; Sara Margolin; Nicholas G Martin; Penelope Miron; Grant W Montgomery; Heli Nevanlinna; Stephan Nickels; Sarah Nyante; Curtis Olswold; Julie Palmer; Harsh Pathak; Dimitrios Pectasides; Charles M Perou; Julian Peto; Paul D P Pharoah; Loreall C Pooler; Michael F Press; Katri Pylkäs; Timothy R Rebbeck; Jorge L Rodriguez-Gil; Lynn Rosenberg; Eric Ross; Thomas Rüdiger; Isabel dos Santos Silva; Elinor Sawyer; Marjanka K Schmidt; Rüdiger Schulz-Wendtland; Fredrick Schumacher; Gianluca Severi; Xin Sheng; Lisa B Signorello; Hans-Peter Sinn; Kristen N Stevens; Melissa C Southey; William J Tapper; Ian Tomlinson; Frans B L Hogervorst; Els Wauters; JoEllen Weaver; Hans Wildiers; Robert Winqvist; David Van Den Berg; Peggy Wan; Lucy Y Xia; Drakoulis Yannoukakos; Wei Zheng; Regina G Ziegler; Afshan Siddiq; Susan L Slager; Daniel O Stram; Douglas Easton; Peter Kraft; Brian E Henderson; Fergus J Couch Journal: Nat Genet Date: 2011-10-30 Impact factor: 38.330
Authors: Maya Ghoussaini; Olivia Fletcher; Kyriaki Michailidou; Clare Turnbull; Marjanka K Schmidt; Ed Dicks; Joe Dennis; Qin Wang; Manjeet K Humphreys; Craig Luccarini; Caroline Baynes; Don Conroy; Melanie Maranian; Shahana Ahmed; Kristy Driver; Nichola Johnson; Nicholas Orr; Isabel dos Santos Silva; Quinten Waisfisz; Hanne Meijers-Heijboer; Andre G Uitterlinden; Fernando Rivadeneira; Per Hall; Kamila Czene; Astrid Irwanto; Jianjun Liu; Heli Nevanlinna; Kristiina Aittomäki; Carl Blomqvist; Alfons Meindl; Rita K Schmutzler; Bertram Müller-Myhsok; Peter Lichtner; Jenny Chang-Claude; Rebecca Hein; Stefan Nickels; Dieter Flesch-Janys; Helen Tsimiklis; Enes Makalic; Daniel Schmidt; Minh Bui; John L Hopper; Carmel Apicella; Daniel J Park; Melissa Southey; David J Hunter; Stephen J Chanock; Annegien Broeks; Senno Verhoef; Frans B L Hogervorst; Peter A Fasching; Michael P Lux; Matthias W Beckmann; Arif B Ekici; Elinor Sawyer; Ian Tomlinson; Michael Kerin; Frederik Marme; Andreas Schneeweiss; Christof Sohn; Barbara Burwinkel; Pascal Guénel; Thérèse Truong; Emilie Cordina-Duverger; Florence Menegaux; Stig E Bojesen; Børge G Nordestgaard; Sune F Nielsen; Henrik Flyger; Roger L Milne; M Rosario Alonso; Anna González-Neira; Javier Benítez; Hoda Anton-Culver; Argyrios Ziogas; Leslie Bernstein; Christina Clarke Dur; Hermann Brenner; Heiko Müller; Volker Arndt; Christa Stegmaier; Christina Justenhoven; Hiltrud Brauch; Thomas Brüning; Shan Wang-Gohrke; Ursula Eilber; Thilo Dörk; Peter Schürmann; Michael Bremer; Peter Hillemanns; Natalia V Bogdanova; Natalia N Antonenkova; Yuri I Rogov; Johann H Karstens; Marina Bermisheva; Darya Prokofieva; Elza Khusnutdinova; Annika Lindblom; Sara Margolin; Arto Mannermaa; Vesa Kataja; Veli-Matti Kosma; Jaana M Hartikainen; Diether Lambrechts; Betul T Yesilyurt; Giuseppe Floris; Karin Leunen; Siranoush Manoukian; Bernardo Bonanni; Stefano Fortuzzi; Paolo Peterlongo; Fergus J Couch; Xianshu Wang; Kristen Stevens; Adam Lee; Graham G Giles; Laura Baglietto; Gianluca Severi; Catriona McLean; Grethe Grenaker Alnaes; Vessela Kristensen; Anne-Lise Børrensen-Dale; Esther M John; Alexander Miron; Robert Winqvist; Katri Pylkäs; Arja Jukkola-Vuorinen; Saila Kauppila; Irene L Andrulis; Gord Glendon; Anna Marie Mulligan; Peter Devilee; Christie J van Asperen; Rob A E M Tollenaar; Caroline Seynaeve; Jonine D Figueroa; Montserrat Garcia-Closas; Louise Brinton; Jolanta Lissowska; Maartje J Hooning; Antoinette Hollestelle; Rogier A Oldenburg; Ans M W van den Ouweland; Angela Cox; Malcolm W R Reed; Mitul Shah; Ania Jakubowska; Jan Lubinski; Katarzyna Jaworska; Katarzyna Durda; Michael Jones; Minouk Schoemaker; Alan Ashworth; Anthony Swerdlow; Jonathan Beesley; Xiaoqing Chen; Kenneth R Muir; Artitaya Lophatananon; Suthee Rattanamongkongul; Arkom Chaiwerawattana; Daehee Kang; Keun-Young Yoo; Dong-Young Noh; Chen-Yang Shen; Jyh-Cherng Yu; Pei-Ei Wu; Chia-Ni Hsiung; Annie Perkins; Ruth Swann; Louiza Velentzis; Diana M Eccles; Will J Tapper; Susan M Gerty; Nikki J Graham; Bruce A J Ponder; Georgia Chenevix-Trench; Paul D P Pharoah; Mark Lathrop; Alison M Dunning; Nazneen Rahman; Julian Peto; Douglas F Easton Journal: Nat Genet Date: 2012-01-22 Impact factor: 38.330
Authors: Jacob F Degner; Athma A Pai; Roger Pique-Regi; Jean-Baptiste Veyrieras; Daniel J Gaffney; Joseph K Pickrell; Sherryl De Leon; Katelyn Michelini; Noah Lewellen; Gregory E Crawford; Matthew Stephens; Yoav Gilad; Jonathan K Pritchard Journal: Nature Date: 2012-02-05 Impact factor: 49.962
Authors: Timothy R Dreszer; Donna Karolchik; Ann S Zweig; Angie S Hinrichs; Brian J Raney; Robert M Kuhn; Laurence R Meyer; Mathew Wong; Cricket A Sloan; Kate R Rosenbloom; Greg Roe; Brooke Rhead; Andy Pohl; Venkat S Malladi; Chin H Li; Katrina Learned; Vanessa Kirkup; Fan Hsu; Rachel A Harte; Luvina Guruvadoo; Mary Goldman; Belinda M Giardine; Pauline A Fujita; Mark Diekhans; Melissa S Cline; Hiram Clawson; Galt P Barber; David Haussler; W James Kent Journal: Nucleic Acids Res Date: 2011-11-15 Impact factor: 16.971
Authors: Kate R Rosenbloom; Timothy R Dreszer; Jeffrey C Long; Venkat S Malladi; Cricket A Sloan; Brian J Raney; Melissa S Cline; Donna Karolchik; Galt P Barber; Hiram Clawson; Mark Diekhans; Pauline A Fujita; Mary Goldman; Robert C Gravell; Rachel A Harte; Angie S Hinrichs; Vanessa M Kirkup; Robert M Kuhn; Katrina Learned; Morgan Maddren; Laurence R Meyer; Andy Pohl; Brooke Rhead; Matthew C Wong; Ann S Zweig; David Haussler; W James Kent Journal: Nucleic Acids Res Date: 2011-11-09 Impact factor: 16.971
Authors: Dennis J Hazelett; David V Conti; Ying Han; Ali Amin Al Olama; Doug Easton; Rosalind A Eeles; Zsofia Kote-Jarai; Christopher A Haiman; Gerhard A Coetzee Journal: Cell Cycle Date: 2016 Impact factor: 4.534
Authors: Huihuang Yan; Shulan Tian; Geffen Kleinstern; Zhiquan Wang; Jeong-Heon Lee; Nicholas J Boddicker; James R Cerhan; Neil E Kay; Esteban Braggio; Susan L Slager Journal: Hum Mol Genet Date: 2020-09-29 Impact factor: 6.150
Authors: Ye Feng; Suhn Kyong Rhie; Dezheng Huo; Edward A Ruiz-Narvaez; Stephen A Haddad; Christine B Ambrosone; Esther M John; Leslie Bernstein; Wei Zheng; Jennifer J Hu; Regina G Ziegler; Sarah Nyante; Elisa V Bandera; Sue A Ingles; Michael F Press; Sandra L Deming; Jorge L Rodriguez-Gil; Yonglan Zheng; Song Yao; Yoo-Jeong Han; Temidayo O Ogundiran; Timothy R Rebbeck; Clement Adebamowo; Oladosu Ojengbede; Adeyinka G Falusi; Anselm Hennis; Barbara Nemesure; Stefan Ambs; William Blot; Qiuyin Cai; Lisa Signorello; Katherine L Nathanson; Kathryn L Lunetta; Lara E Sucheston-Campbell; Jeannette T Bensen; Stephen J Chanock; Loic Le Marchand; Andrew F Olshan; Laurence N Kolonel; David V Conti; Gerhard A Coetzee; Daniel O Stram; Olufunmilayo I Olopade; Julie R Palmer; Christopher A Haiman Journal: Cancer Epidemiol Biomarkers Prev Date: 2017-04-04 Impact factor: 4.254
Authors: Kristen S Purrington; Susan Slager; Diana Eccles; Drakoulis Yannoukakos; Peter A Fasching; Penelope Miron; Jane Carpenter; Jenny Chang-Claude; Nicholas G Martin; Grant W Montgomery; Vessela Kristensen; Hoda Anton-Culver; Paul Goodfellow; William J Tapper; Sajjad Rafiq; Susan M Gerty; Lorraine Durcan; Irene Konstantopoulou; Florentia Fostira; Athanassios Vratimos; Paraskevi Apostolou; Irene Konstanta; Vassiliki Kotoula; Sotiris Lakis; Meletios A Dimopoulos; Dimosthenis Skarlos; Dimitrios Pectasides; George Fountzilas; Matthias W Beckmann; Alexander Hein; Matthias Ruebner; Arif B Ekici; Arndt Hartmann; Ruediger Schulz-Wendtland; Stefan P Renner; Wolfgang Janni; Brigitte Rack; Christoph Scholz; Julia Neugebauer; Ulrich Andergassen; Michael P Lux; Lothar Haeberle; Christine Clarke; Nirmala Pathmanathan; Anja Rudolph; Dieter Flesch-Janys; Stefan Nickels; Janet E Olson; James N Ingle; Curtis Olswold; Seth Slettedahl; Jeanette E Eckel-Passow; S Keith Anderson; Daniel W Visscher; Victoria L Cafourek; Hugues Sicotte; Naresh Prodduturi; Elisabete Weiderpass; Leslie Bernstein; Argyrios Ziogas; Jennifer Ivanovich; Graham G Giles; Laura Baglietto; Melissa Southey; Veli-Matti Kosma; Hans-Peter Fischer; Malcom W R Reed; Simon S Cross; Sandra Deming-Halverson; Martha Shrubsole; Qiuyin Cai; Xiao-Ou Shu; Mary Daly; Joellen Weaver; Eric Ross; Jennifer Klemp; Priyanka Sharma; Diana Torres; Thomas Rüdiger; Heidrun Wölfing; Hans-Ulrich Ulmer; Asta Försti; Thaer Khoury; Shicha Kumar; Robert Pilarski; Charles L Shapiro; Dario Greco; Päivi Heikkilä; Kristiina Aittomäki; Carl Blomqvist; Astrid Irwanto; Jianjun Liu; Vernon Shane Pankratz; Xianshu Wang; Gianluca Severi; Arto Mannermaa; Douglas Easton; Per Hall; Hiltrud Brauch; Angela Cox; Wei Zheng; Andrew K Godwin; Ute Hamann; Christine Ambrosone; Amanda Ewart Toland; Heli Nevanlinna; Celine M Vachon; Fergus J Couch Journal: Carcinogenesis Date: 2013-12-09 Impact factor: 4.944
Authors: Kristen S Purrington; Seth Slettedahl; Manjeet K Bolla; Kyriaki Michailidou; Kamila Czene; Heli Nevanlinna; Stig E Bojesen; Irene L Andrulis; Angela Cox; Per Hall; Jane Carpenter; Drakoulis Yannoukakos; Christopher A Haiman; Peter A Fasching; Arto Mannermaa; Robert Winqvist; Hermann Brenner; Annika Lindblom; Georgia Chenevix-Trench; Javier Benitez; Anthony Swerdlow; Vessela Kristensen; Pascal Guénel; Alfons Meindl; Hatef Darabi; Mikael Eriksson; Rainer Fagerholm; Kristiina Aittomäki; Carl Blomqvist; Børge G Nordestgaard; Sune F Nielsen; Henrik Flyger; Xianshu Wang; Curtis Olswold; Janet E Olson; Anna Marie Mulligan; Julia A Knight; Sandrine Tchatchou; Malcolm W R Reed; Simon S Cross; Jianjun Liu; Jingmei Li; Keith Humphreys; Christine Clarke; Rodney Scott; Florentia Fostira; George Fountzilas; Irene Konstantopoulou; Brian E Henderson; Fredrick Schumacher; Loic Le Marchand; Arif B Ekici; Arndt Hartmann; Matthias W Beckmann; Jaana M Hartikainen; Veli-Matti Kosma; Vesa Kataja; Arja Jukkola-Vuorinen; Katri Pylkäs; Saila Kauppila; Aida Karina Dieffenbach; Christa Stegmaier; Volker Arndt; Sara Margolin; Rosemary Balleine; Jose Ignacio Arias Perez; M Pilar Zamora; Primitiva Menéndez; Alan Ashworth; Michael Jones; Nick Orr; Patrick Arveux; Pierre Kerbrat; Thérèse Truong; Peter Bugert; Amanda E Toland; Christine B Ambrosone; France Labrèche; Mark S Goldberg; Martine Dumont; Argyrios Ziogas; Eunjung Lee; Gillian S Dite; Carmel Apicella; Melissa C Southey; Jirong Long; Martha Shrubsole; Sandra Deming-Halverson; Filomena Ficarazzi; Monica Barile; Paolo Peterlongo; Katarzyna Durda; Katarzyna Jaworska-Bieniek; Robert A E M Tollenaar; Caroline Seynaeve; Thomas Brüning; Yon-Dschun Ko; Carolien H M Van Deurzen; John W M Martens; Mieke Kriege; Jonine D Figueroa; Stephen J Chanock; Jolanta Lissowska; Ian Tomlinson; Michael J Kerin; Nicola Miller; Andreas Schneeweiss; William J Tapper; Susan M Gerty; Lorraine Durcan; Catriona Mclean; Roger L Milne; Laura Baglietto; Isabel dos Santos Silva; Olivia Fletcher; Nichola Johnson; Laura J Van'T Veer; Sten Cornelissen; Asta Försti; Diana Torres; Thomas Rüdiger; Anja Rudolph; Dieter Flesch-Janys; Stefan Nickels; Caroline Weltens; Giuseppe Floris; Matthieu Moisse; Joe Dennis; Qin Wang; Alison M Dunning; Mitul Shah; Judith Brown; Jacques Simard; Hoda Anton-Culver; Susan L Neuhausen; John L Hopper; Natalia Bogdanova; Thilo Dörk; Wei Zheng; Paolo Radice; Anna Jakubowska; Jan Lubinski; Peter Devillee; Hiltrud Brauch; Maartje Hooning; Montserrat García-Closas; Elinor Sawyer; Barbara Burwinkel; Frederick Marmee; Diana M Eccles; Graham G Giles; Julian Peto; Marjanka Schmidt; Annegien Broeks; Ute Hamann; Jenny Chang-Claude; Diether Lambrechts; Paul D P Pharoah; Douglas Easton; V Shane Pankratz; Susan Slager; Celine M Vachon; Fergus J Couch Journal: Hum Mol Genet Date: 2014-06-13 Impact factor: 6.150
Authors: Ignacio Blanco; Karoline Kuchenbaecker; Daniel Cuadras; Xianshu Wang; Daniel Barrowdale; Gorka Ruiz de Garibay; Pablo Librado; Alejandro Sánchez-Gracia; Julio Rozas; Núria Bonifaci; Lesley McGuffog; Vernon S Pankratz; Abul Islam; Francesca Mateo; Antoni Berenguer; Anna Petit; Isabel Català; Joan Brunet; Lidia Feliubadaló; Eva Tornero; Javier Benítez; Ana Osorio; Teresa Ramón y Cajal; Heli Nevanlinna; Kristiina Aittomäki; Banu K Arun; Amanda E Toland; Beth Y Karlan; Christine Walsh; Jenny Lester; Mark H Greene; Phuong L Mai; Robert L Nussbaum; Irene L Andrulis; Susan M Domchek; Katherine L Nathanson; Timothy R Rebbeck; Rosa B Barkardottir; Anna Jakubowska; Jan Lubinski; Katarzyna Durda; Katarzyna Jaworska-Bieniek; Kathleen Claes; Tom Van Maerken; Orland Díez; Thomas V Hansen; Lars Jønson; Anne-Marie Gerdes; Bent Ejlertsen; Miguel de la Hoya; Trinidad Caldés; Alison M Dunning; Clare Oliver; Elena Fineberg; Margaret Cook; Susan Peock; Emma McCann; Alex Murray; Chris Jacobs; Gabriella Pichert; Fiona Lalloo; Carol Chu; Huw Dorkins; Joan Paterson; Kai-Ren Ong; Manuel R Teixeira; Frans B L Hogervorst; Annemarie H van der Hout; Caroline Seynaeve; Rob B van der Luijt; Marjolijn J L Ligtenberg; Peter Devilee; Juul T Wijnen; Matti A Rookus; Hanne E J Meijers-Heijboer; Marinus J Blok; Ans M W van den Ouweland; Cora M Aalfs; Gustavo C Rodriguez; Kelly-Anne A Phillips; Marion Piedmonte; Stacy R Nerenstone; Victoria L Bae-Jump; David M O'Malley; Elena S Ratner; Rita K Schmutzler; Barbara Wappenschmidt; Kerstin Rhiem; Christoph Engel; Alfons Meindl; Nina Ditsch; Norbert Arnold; Hansjoerg J Plendl; Dieter Niederacher; Christian Sutter; Shan Wang-Gohrke; Doris Steinemann; Sabine Preisler-Adams; Karin Kast; Raymonda Varon-Mateeva; Andrea Gehrig; Anders Bojesen; Inge Sokilde Pedersen; Lone Sunde; Uffe Birk Jensen; Mads Thomassen; Torben A Kruse; Lenka Foretova; Paolo Peterlongo; Loris Bernard; Bernard Peissel; Giulietta Scuvera; Siranoush Manoukian; Paolo Radice; Laura Ottini; Marco Montagna; Simona Agata; Christine Maugard; Jacques Simard; Penny Soucy; Andreas Berger; Anneliese Fink-Retter; Christian F Singer; Christine Rappaport; Daphne Geschwantler-Kaulich; Muy-Kheng Tea; Georg Pfeiler; Esther M John; Alex Miron; Susan L Neuhausen; Mary Beth Terry; Wendy K Chung; Mary B Daly; David E Goldgar; Ramunas Janavicius; Cecilia M Dorfling; Elisabeth J van Rensburg; Florentia Fostira; Irene Konstantopoulou; Judy Garber; Andrew K Godwin; Edith Olah; Steven A Narod; Gad Rennert; Shani Shimon Paluch; Yael Laitman; Eitan Friedman; Annelie Liljegren; Johanna Rantala; Marie Stenmark-Askmalm; Niklas Loman; Evgeny N Imyanitov; Ute Hamann; Amanda B Spurdle; Sue Healey; Jeffrey N Weitzel; Josef Herzog; David Margileth; Chiara Gorrini; Manel Esteller; Antonio Gómez; Sergi Sayols; Enrique Vidal; Holger Heyn; Dominique Stoppa-Lyonnet; Melanie Léoné; Laure Barjhoux; Marion Fassy-Colcombet; Antoine de Pauw; Christine Lasset; Sandra Fert Ferrer; Laurent Castera; Pascaline Berthet; François Cornelis; Yves-Jean Bignon; Francesca Damiola; Sylvie Mazoyer; Olga M Sinilnikova; Christopher A Maxwell; Joseph Vijai; Mark Robson; Noah Kauff; Marina J Corines; Danylko Villano; Julie Cunningham; Adam Lee; Noralane Lindor; Conxi Lázaro; Douglas F Easton; Kenneth Offit; Georgia Chenevix-Trench; Fergus J Couch; Antonis C Antoniou; Miguel Angel Pujana Journal: PLoS One Date: 2015-04-01 Impact factor: 3.240
Authors: Anja Rudolph; Roger L Milne; Thérèse Truong; Julia A Knight; Petra Seibold; Dieter Flesch-Janys; Sabine Behrens; Ursula Eilber; Manjeet K Bolla; Qin Wang; Joe Dennis; Alison M Dunning; Mitul Shah; Hannah R Munday; Hatef Darabi; Mikael Eriksson; Judith S Brand; Janet Olson; Celine M Vachon; Emily Hallberg; J Esteban Castelao; Angel Carracedo; Maria Torres; Jingmei Li; Keith Humphreys; Emilie Cordina-Duverger; Florence Menegaux; Henrik Flyger; Børge G Nordestgaard; Sune F Nielsen; Betul T Yesilyurt; Giuseppe Floris; Karin Leunen; Ellen G Engelhardt; Annegien Broeks; Emiel J Rutgers; Gord Glendon; Anna Marie Mulligan; Simon Cross; Malcolm Reed; Anna Gonzalez-Neira; José Ignacio Arias Perez; Elena Provenzano; Carmel Apicella; Melissa C Southey; Amanda Spurdle; Lothar Häberle; Matthias W Beckmann; Arif B Ekici; Aida Karina Dieffenbach; Volker Arndt; Christa Stegmaier; Catriona McLean; Laura Baglietto; Stephen J Chanock; Jolanta Lissowska; Mark E Sherman; Thomas Brüning; Ute Hamann; Yon-Dschun Ko; Nick Orr; Minouk Schoemaker; Alan Ashworth; Veli-Matti Kosma; Vesa Kataja; Jaana M Hartikainen; Arto Mannermaa; Anthony Swerdlow; Graham G Giles; Hermann Brenner; Peter A Fasching; Georgia Chenevix-Trench; John Hopper; Javier Benítez; Angela Cox; Irene L Andrulis; Diether Lambrechts; Manuela Gago-Dominguez; Fergus Couch; Kamila Czene; Stig E Bojesen; Doug F Easton; Marjanka K Schmidt; Pascal Guénel; Per Hall; Paul D P Pharoah; Montserrat Garcia-Closas; Jenny Chang-Claude Journal: Int J Cancer Date: 2014-09-23 Impact factor: 7.396