Literature DB >> 18588689

Linkage disequilibrium mapping of a breast cancer susceptibility locus near RAI/PPP1R13L/iASPP.

Bjørn A Nexø1, Ulla Vogel, Anja Olsen, Mette Nyegaard, Zuzanna Bukowy, Eszter Rockenbauer, Xiuqing Zhang, Cemile Koca, Mette Mains, Bettina Hansen, Anne Hedemand, Anette Kjeldgaard, Magdalena J Laska, Ole Raaschou-Nielsen, Søren Cold, Kim Overvad, Anne Tjønneland, Lars Bolund, Anders D Børglum.   

Abstract

BACKGROUND: Previous results have suggested an association of the region of 19q13.3 with several forms of cancer. In the present study, we investigated 27 public markers within a previously identified 69 kb stretch of chromosome 19q for association with breast cancer by using linkage disequilibrium mapping. The study groups included 434 postmenopausal breast cancer cases and an identical number of individually matched controls. METHODS AND
RESULTS: Studying one marker at a time, we found a region spanning the gene RAI (alias PPP1R13L or iASPP) and the 5' portion of XPD to be associated with this cancer. The region corresponds to a haplotype block, in which there seems to be very limited recombination in the Danish population. Studying combinations of markers, we found that two to four neighboring markers gave the most consistent and strongest result. The haplotypes with strongest association with cancers were located in the gene RAI and just 3' to the gene. Coinciding peaks were seen in the region of RAI in groups of women of different age. In a follow-up to these results we sequenced 10 cases and 10 controls in a 44 kb region spanning the peaks of association. This revealed 106 polymorphisms, many of which were not in the public databases. We tested an additional 44 of these for association with disease and found a new tandem repeat marker, called RAI-3'd1, located downstream of the transcribed region of RAI, which was more strongly associated with breast cancer than any other marker we have tested (RR = 2.44 (1.41-4.23, p = 0.0008, all cases; RR = 6.29 (1.49-26.6), p = 0.01, cases up to 55 years of age).
CONCLUSION: We expect the marker RAI-3'd1 to be (part of) the cause for the association of the chromosome 19q13.3 region's association with cancer.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18588689      PMCID: PMC2474586          DOI: 10.1186/1471-2350-9-56

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

The search for genetic determinants influencing the risk of cancer has gained momentum due to the very detailed maps of the human genome now available. We and others have previously shown that a region of chromosome 19q13.3 seems associated with a number of cancers, including cancer of the skin, breast, lung and brain [1-12], but not testis and colorectal cancer [13,14] Central in this chromosomal region is a 69 kb stretch, which contains two genes of importance for DNA repair, XPD (also known as ERCC2) and ERCC1, one gene presumably relating to apoptosis, RAI (also known as iASPP or PPP1R13L), and one presumably involved in ribosomal RNA transcription, ASE1 alias CD3EAP (see Figure 1). Thus three of four genes are related to the removal of damaged DNA, an important mechanism in preventing cancer.
Figure 1

Schematic representation of the relevant gene region. The crimson arrow marks a chromosomal position. The dark blue arrows represent the 4 genes of the region. The numbers below indicate the position relative to the marker XPD-exon23.

Schematic representation of the relevant gene region. The crimson arrow marks a chromosomal position. The dark blue arrows represent the 4 genes of the region. The numbers below indicate the position relative to the marker XPD-exon23. In the present study, involving 434 breast cancer cases and an identical number of controls, we initially investigated 27 public markers within the 69 kb stretch for association with breast cancer, using linkage disequilibrium mapping based on single markers, as well as on haplotypes combining several neighboring markers. We found that the region formed two haplotype blocks, and that associations were concentrated on the gene RAI in one of these haplotype blocks. We resequenced 44 kb encompassing this block in 10 cases and 10 controls. Among the 106 found polymorphisms (including the 27 described above) we tested an additional 44 polymorphisms for association with disease and identified a 5-bases tandem repeat polymorphism, NT_011109.15 g.18147012dupATTTT(2_11), here called RAI-3'd1, which discriminates cases from controls better than any previously investigated polymorphism in this region, and which may well be the effector causing this haplotype block to associate with cancer.

Results

This investigation deals with breast cancer in women. The cohort consisted of 24 697 Danish postmenopausal women age 50 – 64 at inclusion in 1993 – 1997. 434 of these women developed breast cancer before 2001. The cases and a set of individually matched controls (see Methods for details) were analyzed as a nested case-control study. [9]. All subjects were Caucasians. We investigated DNA polymorphisms in a 69 kb sub-region of 19q13.3 for association with breast cancer. To improve readability each polymorphism investigated was given a trivial name in accordance with its positions relative to the nearest gene followed by a hyphen and a serial number, for instance RAI intron8-3 denotes the third SNP investigated in the eighths intron of RAI. In the relative few polymorphisms that were in/dels or repeat polymorphisms, the serial number was preceded by a d, for instance RAI-3'd1. The markers genotyped in this study are listed in Table 1. We initially typed the ones labelled L1 and T in the "Series" column. There was no statistical significant departure from Hardy-Weinberg equilibrium of any marker among the controls.
Table 1

The polymorphisms used in this study, their kind and their identification

New trivial name1Systematic name2Series3rs#4contig pos5relativ position6
XPD-exon23ERCC2 c.2282A>CL113181181231370
XPD-exon10ERCC2 c.965G>AL117997931813547712340
XPD-exon6ERCC2 c.499A>CL1 + S2384061813652713390
XPD-intron5-1ERCC2 c.492-62T>AS2384071813669613559
XPD-intron5-2ERCC2 c492-892A>GS39168091813742614289
XPD-intron5-3ERCC2 c.391+311A>GS1813982416687
XPD-intron3-1ERCC2 c.278-35G>AL117997831814025417117
XPD-5'1ERCC2 c.-35-113G>CS38103661814216019023
XPD-5'd1ERCC2 c.-35-298_-294delGACAL1 + S39167911814234519208
XPD-5'4ERCC2 c.-35-583A>CL239167881814263019493
XPD-5'd2ERCC2 c.-35-980_-979ins80A39167871814302719890
XPD-5'2ERCC2 c.-35-1958A>GL1 + S20972151814400520868
XPD-5'3g.18145185T>CL1 + S118786441814518522048
RAI-3'd2g. 18145752_18145773del(22)A1814576822631
RAI-3'8g.18142233C>AS1814623323096
RAI-3'7g.18146823C>TL172525671814682323686
RAI-3'd1g.18147012ATTTT(2_11)A72557921814701223875
RAI-3'9g.18147126T>CL223773291814712623989
RAI-3'd3g.18147199delTTA30475601814719224055
RAI-3'10g.18147886T>GS + L2104224891814788624749
RAI-3'11g.18148193C>TL2104267011814819325056
RAI-3'4g.18150199A>GL1 + S45443431815019927062
RAI-exon 13-2PPP1R13L c.*507T>GS1815115828021
RAI-exon13-1PPP1R13L c.*475T>AT + S69661815118028043
RAI-intron12-1PPP1R13L c.2515-68G>AS1815177228635
RAI-intron12-4PPP1R13L c.2515-330G>CS1815203428897
RAI-intron12-3PPP1R13L c.2515-467G>AL1104172351815217129034
RAI-intron12-5PPP1R13Lc.2515-1150G>AL2128762521815276429627
RAI-intron12-2PPP1R13L c.2514+505T>AL181127231815349730360
RAI-intron11-2PPP1R13L c.2315-594T>GS1815479631659
RAI-intron11-1PPP1R13L c.2315-1281G>AL1 + S20171041815548332346
RAI-intron11-2PPP1R13L c.2248+1167G>TS1815587132734
RAI-intron8-1PPP1R13L c.1882-1425A>GL119707641815909135954
RAI-intron8-10PPP1R13L c.1882-1263A>GS1815926336126
RAI-intron8-9PPP1R13L c.1882-1697A>GL21816036337226
RAI-intron8-4PPP1R13L c.1882-2270T>GS1816093637799
RAI-intron8-5PPP1R13L c.1882-2271G>CS1816093737800
RAI-intron8-11PPP1R13l c.1881+1912G>TS1816143338296
RAI-intron8-6PPP1R13L c.1881+1514A>GS1816184138704
RAI-intron8-2PPP1R13L c. 1881+1149B>AL165092101816220639069
RAI-intron8-12PPP1R13l c.1881+756C>TL2129862721816259939462
RAI-intron8-7PPP1R13L c 1881+452T>GS1816290339766
RAI-intron8-8PPP1R13L c.1881+389T>CS1816298639849
RAI-intron8-3PPP1R13L c.1881+385T>GL11816297039833
RAI-intron7-1PPP1R13L c.1421-1236G>CS1816505241915
RAI-intron3-2PPP1R13L c.264-404G>AL148038141816894445807
RAI-intron3-3PPP1R13L c.264-408G>AL148038151816894845811
RAI-intron1-5PPP1R13L c.-21-1304insTAAGS1817111947982
RAI-intron1-7PPP1R13L c.-21-1931G>TS104025841817174648609
RAI-intron1-10PPP1R13Lc.-21-2062G>AS104012931817181548678
RAI-intron1-1PPP1R13L c.-21-2169A>GL1 + S45725141817198448847
RAI-intron1-4PPP1R13L c.-21-3291G>AL2 + S48038161817310649969
RAI-intron1-8PPP1R13L c.-21-3291A>GS1817310749970
RAI-intron1-2PPP1R13L c.-22+1148C>TL122269491817532752190
RAI-intron1-3PPP1R13L c.-22+734A>GL29594571817573952602
RAI-intron1-9PPP1R13L c.-22+379T>CS1817609652959
RAI-intron1-5PPP1R13L c.-22+297T>CL2 + S48038171817617853041
RAI-5'1PPP1R13L c-54-177T>CL1 + S104127611817668053543
ASE1-exon1CD3EAP c.-468-21G>AL19675911817815255015
ASE1-exon2CD3EAP c.22+29A>CS1817822255085
ASE1-exon3-4CD3EAP c.564G>CS1817862355486
ASE1-exon3-1CD3EAP c.1264A>GT7354821818022057083
ASE1-exon3-2CD3EAP c.1605A>GL1 + S7625621818056157424
ASE1-exon3-3CD3EAP c.1668G>AT + S23362191818062457487
ASE1-exon3-d1CD3EAP c.1751TTC(5_6)A32129871818070757570
ASE1-exon3-6CD3EAP c.1998C>AL1 + S32129861818095457817
ASE1-exon3-4CD3EAP c.*1228T>CL232129831818220559068
ERCC1-exon4ERCC1 c.500T>CL1116151819187168734

1) The trivial names have been changed to conform to the new model of the RAI gene, which involves a major 5' extension of the gene relative to older models. Specifically, the polymorphism previously called RAI intron1 is now called RAI-intron8-1. Similarly RAI exon6 is now called RAI-exon13-1.

2) The systematic name according to

3) The series of assays to which the polymorphism belongs

L1: First series of Lightcycler assays

L2: Later series of Lightcycler assays

A: Assays on ABI3100

T: Assays on Taqman

S: Assays on Sequenom

4) The rs-number of the polymorphisms in NCBI's database dbSNP

5) Positions of the polymorphisms in the contig [GenBank: NT_011109.15]

6) Positions of the polymorphisms relative to XPD-exon23

The polymorphisms used in this study, their kind and their identification 1) The trivial names have been changed to conform to the new model of the RAI gene, which involves a major 5' extension of the gene relative to older models. Specifically, the polymorphism previously called RAI intron1 is now called RAI-intron8-1. Similarly RAI exon6 is now called RAI-exon13-1. 2) The systematic name according to 3) The series of assays to which the polymorphism belongs L1: First series of Lightcycler assays L2: Later series of Lightcycler assays A: Assays on ABI3100 T: Assays on Taqman S: Assays on Sequenom 4) The rs-number of the polymorphisms in NCBI's database dbSNP 5) Positions of the polymorphisms in the contig [GenBank: NT_011109.15] 6) Positions of the polymorphisms relative to XPD-exon23 Next, we calculated the relative risks of breast cancer and the corresponding confidence limits for each marker comparing the variant homozygotes to homozygotes for the wildtype allele. However, if the calculation gave an RR value less than 1, we used the reciprocal value of the RR and the reciprocal value of the upper confidence limit instead. This analysis has the advantage that no a priori assumptions about the distribution of genotypes are needed. Figure 2 depicts the relative risks and the lower confidence limits of the relative risks for postmenopausal breast cancer at all ages as a function of the location of the markers. The figure also shows the logarithm of the p-values for trend of association using all 3 genotypes for each marker. Significant associations (i.e. p-values less than 0.05) of markers with cancer existed in the region from 13 000 to 40 000 bases, covering most of the gene RAI and the 5' region of the gene XPD. The singular values for the polymorphism RAI-3'd1 will be discussed later. The corresponding curves for women aged less than 55 showed an even higher relative risk in the same region with a peak value of RR = 6.25 (1.72 – 20), p(trend) = 0.005 (results not shown).
Figure 2

Association of single polymorphisms with breast cancer, all ages. The relative risk for cancer of the two homozygotes (Dark blue), the lower confidence limit for the relative risk(Pink), and the negative logarithm to the p-trend for association of all 3 genotypes(Light blue) of each marker with cancer is depicted against the position of the marker. The singular points represent the values for RAI-3'd1.

Association of single polymorphisms with breast cancer, all ages. The relative risk for cancer of the two homozygotes (Dark blue), the lower confidence limit for the relative risk(Pink), and the negative logarithm to the p-trend for association of all 3 genotypes(Light blue) of each marker with cancer is depicted against the position of the marker. The singular points represent the values for RAI-3'd1. We also used the analytical technique known as haplotype trend regression [15], more specifically the program HelixTree, to associate sets of markers with the individual diseases. We used the entire set of controls for the analyses. This broke the matching of the data sets, but programs performing haplotype trend regression on individually matched cohort data are not available, and matching on other criteria than ethnicity and age may well be irrelevant for genetic association studies [16]. Figure 3 shows the overall distribution of p-values for sets of markers plotted against the position on the chromosome for breast cancer at all ages. The abscissa of each short haplotype was defined as the median position of the markers included. Each curve corresponds to a given size of the marker set, i.e., the number of markers in the haplotypes. The figure suggests that the association with breast cancer had two peaks. One was located at roughly 24 kb and one at 39 kb. The peaks were clearly present in curves for haplotypes of 2, 3, and 4 neighbouring markers. Larger sets were less informative, presumably because of increased degrees of freedom, but they essentially corroborated the results. When the cases were broken down into cases before and after age 55, the peak at 24 kb was present in both groups, whereas the peak at 39 kb only was present in the older group (results not shown). For comparison, the figure also contains a singular point corresponding to the best haplotype, which included the new marker RAI-3'd1, to be described later.
Figure 3

Association of sets of neighbouring SNPs with breast cancer. The overall p-value for the association of a set of markers as calculated by HelixTree is depicted against the median position of the markers. The singular point represent the value for the haplotype RAI-3'd1 RAI-3'd3 RAI-3'4. (Blue), haplotypes made up of 2 neighboring polymorphisms. (Red), haplotypes made up of 3 neighboring polymorphisms. (Green), haplotypes made up of 4 neighboring polymorphisms.

Association of sets of neighbouring SNPs with breast cancer. The overall p-value for the association of a set of markers as calculated by HelixTree is depicted against the median position of the markers. The singular point represent the value for the haplotype RAI-3'd1 RAI-3'd3 RAI-3'4. (Blue), haplotypes made up of 2 neighboring polymorphisms. (Red), haplotypes made up of 3 neighboring polymorphisms. (Green), haplotypes made up of 4 neighboring polymorphisms. Finally, we calculated the pair-wise linkage disequilibrium of all markers in the controls (Figure 4). The region consisted of two major haplotype blocks with a few interspersed markers. One haplotype block spanned 16 markers and 26 kb from XPD exon6 to RAI intron8-3, corresponding to the region with association to cancer, whereas the other haplotype block spanned 6 markers and 9 kb from RAI intron1-1 to ASE1 exon3-3.
Figure 4

Pair-wise linkage disequilibrium between all markers in the controls. The values above the diagonal are the correlations of two polymorphisms, R. The values below the diagonal are Nowontin's normalised linkage disequilibrium, D'. The red color represent high values, blue represents low value.

Pair-wise linkage disequilibrium between all markers in the controls. The values above the diagonal are the correlations of two polymorphisms, R. The values below the diagonal are Nowontin's normalised linkage disequilibrium, D'. The red color represent high values, blue represents low value. Based on all these results we chose to sequence PCR fragments spanning the region from relative position 13 to 59 kb in breast cancer cases and controls. Initially, we selected 10 cases and 10 controls for analysis. The actual number of reliable sequences obtained varied somewhat from one part of the region to the other. Table 9 lists the 106 detected polymorphisms including their positions, their rs-numbers if assigned, and the surrounding sequences [See Additional File 1]. Essentially all previously analyzed polymorphisms were included in the list. Based on the sequencing results, we analyzed 44 new polymorphisms using ABI3100, Lightcycler and Sequenom (designated A, L2 and S in the "Series" column of Table 1). Among the remainder 18 could not be reproduced and were probably sequencing errors rather than polymorphisms. Most of the remaining could not be assayed for due to the repetitive nature of the DNA around them. Fifteen of the polymorphisms were analyzed by two methods. The concordance of the two methods was very high. One polymorphism had a concordance of 98 percent, and the remainder had concordance rates of 99 percent or above. A new tandem repeat polymorphism, RAI-3'd1 was particularly interesting. It was located close to minimum p value of the haplotype trend regression analysis and it had the potential to modify gene expression by virtue of the relative large change in the sequence. To simplify the scoring we subdivided the fragments of RAI-3'd1 into two groups, 6 repeats or less, called S for short, and 7 repeats or more, called L for long. The results are shown in Table 2. RAI-3'd1, showed a stronger association with disease than any other polymorphism we have tested (singular point in figure 2). The calculated relative risk was higher (RR = 2.44 (1.41–4.23) and the associated p-value was lower (p = 0.0008) than any we had observed before. When the effect was broken down into age brackets, RAI-3'd1 was marginally better (RR = 6.29 (1.49–26.6)) than other markers in the youngest age group and showed a clearer tendency in the other age groups (see Table 2).
Table 2

Association of breast cancer with two additional polymorphisms selected from those detected by sequencing.

PolymorphismN1 casesN1 controlsIRR2 (95% CI3)P-trend4
RAI-3'd1

All ages0.0008
 LL38591
 SL1271651.39 (0.81–2.38)
 SS2001502.44 (1.41–4.23)

<55 yr0.01
 LL5151
 SL18251.46 (0.33–6.47)
 SS33146.29 (1.49–26.61)

55–60 yr0.19
 LL12171
 SL45631.72 (0.63–4.73)
 SS61492.57 (0.88–7.48)

>60 yr0.12
 LL21271
 SL64771.09 (0.52–2.60)
 SS106871.73 (0.83–3.64)

RAI-3'd2

All ages0.005
 LL1491131
 SL1681990.62 (0.44–0.88)
 SS66880.50 (0.32–0.79)

1) Number of persons

2) Incidence rate ratio

3) Confidence limit

4) P-value for trend among the IRRs for all 3 genotypes

Association of breast cancer with two additional polymorphisms selected from those detected by sequencing. 1) Number of persons 2) Incidence rate ratio 3) Confidence limit 4) P-value for trend among the IRRs for all 3 genotypes Another interesting polymorphism, RAI-3'd2, was also associated with disease, albeit to a lesser extent. Moreover, when adjusting RAI-3'd2 for RAI-3'd1 we lost all association of the former with cancer, whereas RAI-3'd1 remained associated with cancer after adjustment for RAI-3'd2. This suggested that RAI-3'd2 was only indirectly associated with cancer, for instance through its linkage disequilibrium with RAI-3'd1. We also performed haplotype trend regression on RAI-3'd1 and it's neighboring SNPs. One of the haplotypes, RAI-3'd1 RAI-3'd3 RAI-3'4, gave a lower overall p-value (p= 0.0002) than any we had seen (indicated as the singular value in Figure 3). To better understand the nature of the association, we tabulated the frequencies of individual haplotypes made up of the 3 neighbouring markers centring on the position at 24 kb with maximal evidence of association with breast cancer (RAI-3'd1 RAI-3'd3 RAI-3'4, see above and Table 3). We also tabulated p-values associated with the individual frequencies. The same haplotypes in young and older cancer cases differed in frequency from the controls (RAI-3'd1LRAI-3'd3S RAI-3'4C protective; whereas all haplotypes containing RAI-3'd1S seemed inductive).
Table 3

Frequency of haplotypes in breast cancer cases and controls and p-values for their association with cancer as determined by haplotype trend regression.

Breast cancer casesBreast cancer controls



Haplotype/AgegroupYoung1)Old2)All agesAll ages
RAI-3'd1s RAI-3'd3s RAI-3'4c0.0353 (0.37)40.054 (0.002)0.051 (0.003)0.022
RAI-3'd1s RAI-3'd3s RAI-3'4t0.001 (-)0.002 (0.12)0.002 (0.09)0.000
RAI-3'd1s RAI-3'd3l RAI-3'4c0.008 (0.12)0.006 (0.11)0.006 (0.07)0.001
RAI-3'd1s RAI-3'3dl RAI-3'4t0.712 (0.02)0.631 (0.17)0.642 (0.06)0.597
RAI-3'd1l RAI-3'd3s RAI-3'4c0.244 (0.006)0.297 (0.003)0.290 (0.0005)0.379
RAI-3'd1l RAI-3'd3s RAI-3'4t0.000 (-)0.005 (0.04)0.005 (0.01)0.000
RAI-3'd1l RAI-3'd3l RAI-3'4c0.000 (-)0.000 (0.61)0.000 (0.11)0.000
RAI-3'd1l RAI-3'd3l RAI-3'4t0.000 (0.69)0.003 (0.48)0.003 (0.57)0.001
RAI intron8-2a RAI intron8-3a0.259 (0.78)0.209 (0.01)0.216 (0.02)0.271
RAI intron8-2a RAI intron8-3c0.000 (0.89)0.007 (0.13)0.006 (0.13)0.000
RAI intron8-2g RAI intron8-3a0.69 (0.12)0.701 (0.001)0.699 (0.0007)0.615
RAI intron8-2g RAI intron8-3c0.051 (0.05)0.084 (0.11)0.079 (0.04)0.113

1) Breast cancer before age 55

2) Breast cancer at age 55 or older

3) Haplotype frequency

4) P-value for haplotype frequency in comparison to the haplotype frequency of controls, all ages.

Frequency of haplotypes in breast cancer cases and controls and p-values for their association with cancer as determined by haplotype trend regression. 1) Breast cancer before age 55 2) Breast cancer at age 55 or older 3) Haplotype frequency 4) P-value for haplotype frequency in comparison to the haplotype frequency of controls, all ages. Finally, as a preliminary check of our results we have tested RAI-3'd1 in a small independent sample of Danish women from Funen with breast cancers (Cold, unpublished). We found the persons that were RAI-3'd1SS to have an increased risk of breast cancer between 50 and 60 years of age (Table 4). A similar, but non-significant tendency was present in subjects under age 50, whereas no association between alleles of RAI-3'd1 and breast cancer was present after age 60.
Table 4

Breast cancer risk in an independent small cohort of Danish Women in relation to RAI-3'd1.

Age at diagnosisRAI-3'd1SSRAI-3'd1SL + RAI-3'd1LLOR1 (95% CI2) (one-sided)P-value3 (one-sided)
< 50 yearscases9141.36 (0.51 -)0.30
controls919
50 – 60 yearscases1392.50 (1.02 -)0.04
controls1526
>= 60 yearscases30440.89 (0.46 -)0.88
controls3238

The table lists the number of persons in each group, the odds ratio and the lower 5 percent confidence limit for the distribution of persons, and the p-value for the odds ratio.

1) Odds Ratio for cases and controls combined with genotypes

2) One-sided lower 95% Confidence Limit for the OR

3) One-sided P-value for the OR

Breast cancer risk in an independent small cohort of Danish Women in relation to RAI-3'd1. The table lists the number of persons in each group, the odds ratio and the lower 5 percent confidence limit for the distribution of persons, and the p-value for the odds ratio. 1) Odds Ratio for cases and controls combined with genotypes 2) One-sided lower 95% Confidence Limit for the OR 3) One-sided P-value for the OR

Discussion

This study continues our studies of the association of breast cancer with one particular chromosomal region, 19q13.3, more specifically the gene RAI and its immediate surroundings. The present results improve the chromosomal resolution and suggest a possible effector, i.e. a causative variant. In addition, the haplotype analysis presented is the first evidence for association of RAI with breast cancer after age 55. It does so with p-values around 10-3 at the same location as the analysis of younger persons, a testament to the efficiency of analyzing haplotypic associations rather than single SNPs. With this finding our results become relevant for a much larger group of cancer patients. Most of our data come from a population-based follow-up study in Denmark with very limited dropout. Although this nested case-control study was originally individually matched, we have sometimes treated it as an unmatched case-control study in order to be able to use current computer programs. This opens the possibility of confounding from the matching parameters. For two reasons, however, we find such confounding highly unlikely. First, the observed effects are generally too strong for confounding to be a reasonable explanation. Second, confounders must correlate with the primary variables. We have no reason to believe that any of the match criteria correlated with the constitution on chromosome 19. We used several ways to try to locate the effector(s). The overall conclusion was clear: One effector is located in the haplotype block between the markers XPD exon6 and RAI intron8-3. Details of the results varied: Methods handling one marker at a time indicated that the effector could be located anywhere from the 5' portion of XPD to the 5' portion of RAI, whereas analyses combining information from multiple markers indicated the presence of two effectors; one was located just 3' to the gene RAI, and another was located in the 5' portion of the gene (at 39 kb). We resequenced the region of strongest association identified by haplotype trend regression. Here we discovered a new 5-base tandem repeat polymorphism, RAI-3'd1, that showed even stronger association with cancer (p = 0.0008). Bonferroni adjustment for the multiplicity of testing indicates that with an average correlation of 0.32 among a total of 68 polymorphisms, any p-value below 0.003 will make the whole set significant [17]). Thus, the result cannot be considered a result of the multiplicity of testing. The polymorphism RAI-3'd1 may well be the effector. The haplotype data of Table 3 indirectly supports this view. All haplotypes in Table 3 containing RAI-3'd1S were associated with increased risk of cancer, irrespective of the surrounding polymorphisms. We would not expect this result if RAI-3'd1 only was a correlate of the effector. However, mapping inside the haplotype block with the use of data from the present population is not very sensitive. For instance, the markers XPD-5'2 and RAI intron11, spaced 12 kb apart, were synonymous in 799 of 813 persons. Studies of this polymorphism in populations of different ethnicity could prove very useful, but functional assays will be necessary before a final conclusion can be reached. It may be useful to compare aspects of the breast cancer risk caused by the RAI region with that caused by the BRCA genes. The RAI region is either recessive or suffers from strong haplo-insufficiency, and it roughly doubles the risk of breast cancer, thus the penetrance is low. The BRCA genes on the other hand are dominant, and they increase the breast cancer risk about 8 fold, i.e. penetrance is high. However, the risk allele of RAI is very common. In our study approximately 40 percent of the controls were homozygotes for the risk allele. A conservative estimate of the ethiological fraction suggests that at least 20 percent of breast cancers is caused by RAI-3'd1, and the number may be twice as high. Thus in terms of the number of cancers caused in the Danish population, RAI seems to exceed the BRCA genes in importance. From Table 3 it is apparent that in the Danish population, and presumably in other Caucasian populations as well, the risk allele, RAI-3'd1S, occurred with different neighboring SNP alleles at reasonable frequencies. In contrast the protective allele RAI-3'd1L almost exclusively occurred together with RAI-3'd3S and RAI-3'4C. Further investigation has shown that this tendency was reproduced with a number of the surrounding SNPs. Generally, the less frequent haplotypes involving RAI-3'd1S were approximately 10 fold more common than the less frequent haplotypes involving RAI-3'd1L were. This was true for 12 out of 14 SNPs in the 23 kb interval from XPD exon6 to RAI intron8-1. The exceptions were RAI-e13 and RAI-3'7. In accordance, combined analysis of the 12 SNPs using HelixTree to form a 12 point haplotype indicated that 13 less frequent haplotypes involving RAI-3'd1S constituted 23 percent of the chromosomes, corresponding to 56 percent of the major haplotype with RAI-3'd1S. In contrast, 7 less frequent alleles involving RAI-3'd1L only constituted 2 percent of the chromosomes, corresponding to only 8 percent of the major haplotype with RAI-3'd1L (results not shown). Thus, the haplotypes containing RAI-3'd1S seem more diverse than their counterparts containing RAI-3'd1L do. The simple interpretation of this phenomenon would be that RAI-3'd1S is the ancestral allele in Caucasians. There are at present no known polymorphisms affecting the amino acid sequence of the RAI protein. Presumably, RAI-3'd1 influences the expression of RAI. Our hypothesis is that RAI-3'd1 modulates a silencer or enhancer. The influence of RAI-3'd1 may not be limited to RAI. It is a curious fact that this short stretch of chromosome 19, comprising only 69 kb, contains two DNA repair genes and one apoptosis gene. One could imagine some sort of higher-order control common for this cluster of related genes. Studies of the expression of XPD, RAI and ERCC1 in lymphocytes from humans have indeed shown that their mRNA levels are tightly coordinated [18,19]. If a primary effect of RAI-3'd1 is modified RAI expression, one result should be modified apoptosis. RAI protein is an inhibitor of RelA, a subunit of the transcription factor NF-κB [20,21]. NF-κB has long been implicated in both cell proliferation and apoptosis. Modulation of NF-κB may well be part of a "crunch-time scenario", invoked when the cell has to muster its forces and make life-and-death decisions. A recent report suggested that the choice between cell survival and death is regulated by the relative activity of the two subunits encoded by RelA and c-rel [22]. By neutralising the RelA product, RAI protein would presumably shift this balance towards apoptosis. In accordance with this view, we have found that RAI mRNA increases dramatically when cells undergo apoptosis. We have also found that simultaneous transfection with siRNAs destabilizing RAI mRNA reduces apoptosis in non-transformed cells. Thus, we believe that RAI expression is instrumental for apoptosis [23]. However, RAI protein may also block p53 activity and thus have the opposite effect [24]. For instance, RAI is over-expressed in certain tumors and may in this situation block apoptosis. We have previously found that the expression of RAI is ca. 8-fold higher in both preneoplastic lesions such as adenomas and in tumour tissue such as adenocarcinomas compared to healthy tissue from the same person [25]. Furthermore, we have found that over-expression of RAI enhances the malignant character of oncogene-transformed cells (M. Laska et al, manuscript in preparation). Thus, RAI may alternatively accelerate or inhibit cell growth depending on the circumstances. The previous reports, that have associated our region of interest with the risk of getting basal cell carcinoma, melanoma, lung cancer, glioma, breast cancer, bladder cancer, and possibly head and neck cancer [1-10,26,27], suggest that the effector may influence the risk of getting a wide variety of cancers of different etiology. This conclusion makes perfect sense if the functional elements modulate a downstream cellular response, such as apoptosis.

Conclusion

We found the region around the RAI gene to be strongly associated with breast cancer, in particular during the early menopause. The marker RAI-3'd1 may be the cause for the extra risk for breast cancer. We think it may modulate an enhancer or silencer influencing RAI and possibly other genes.

Methods

Ethical compliance

This study was performed in compliance with the Helsinki declaration. The project was approved by the Science Ethical Committee of Copenhagen (j.nr: 01-345/93/(KF) 11-037/01/11-124/01). Written and verbal informed consent was obtained from all participants.

Study groups

"Diet, Cancer and Health" is a Danish prospective follow-up study [28]. Individuals eligible for inclusion were born in Denmark, living in the Copenhagen or Aarhus areas, and at the time of inclusion had not been registered as having cancer (including non-melanoma skin cancer) in the Danish Cancer Registry. Invited to participate were 160 725 individuals, aged 50 to 64 years, of whom 57 053 were recruited. Among these, 542 were later registered with the Danish Cancer Register with a cancer diagnosed before the date of enrolment and were therefore excluded. At enrolment (1993–1997), detailed information was collected on diet, smoking habits, lifestyle, weight, height, children, medical treatment, and other socio-economic characteristics and environmental exposures. For women, information regarding hormone replacement therapy (HRT) and menopausal status was also recorded. Moreover, blood, urine, fat tissue, and other biological materials were sampled and stored at -150°C. Cohort members were identified by a unique identification number, allocated to every Danish citizen by the Central Population Registry. Cohort members were linked to the Central Population Registry for information on vital status and immigration. Information on cancer occurrence among cohort members was obtained through record linkage to the Danish Cancer Registry, which collects information on all inhabitants in Denmark who develop cancer [29]. Linkage was performed by use of the personal identification number. The study sample for breast cancer included the 29 549 women recruited within "Diet, Health and Cancer". Eight women were excluded from the study because they did not fill in the lifestyle questionnaire. Because the present analysis aimed at the subgroup of women who were post-menopausal at study entry, we excluded 4844 supposedly pre-menopausal women, including 4798 women who had reported at least one menstruation no more than 12 months before entry and no use of HRT, nine women who gave a lifetime history of no menstruations, and 37 women who did not answer the questions about current or previous use of HRT, leaving 24 697 postmenopausal women. Each cohort member was followed up for breast cancer occurrence from the date of entry, i.e. date of visit to the study centre until the date of diagnosis of any cancer (except for non-melanoma skin cancer), date of death, date of emigration, or 31 December 2000, whichever came first. A total of 434 women were diagnosed with incident breast cancer during the follow up period. Controls were selected among the remaining cohort participants using a nested case-control approach. One control was selected for each of the 434 cases. The control was cancer-free at the exact age at diagnosis of the case and was further matched on age at inclusion into the cohort (half-year intervals), certainty of postmenopausal status (known/probably postmenopausal), and use of HRT at inclusion into the cohort (current/former/newer). Of the 434 pairs (866 women; 434 cases and 434 controls, including two cases), 19 pairs were excluded due to lack of blood sample in either case or control, leaving 415 pairs for study. Additional pairs were excluded due to problems with the genotyping procedure in one or both members of the pair.(RAI-3'd1 105 pairs; RAI-3d'2 62 pairs).

Resequencing of genomic DNA

Resequencing was performed on PCR fragments from DNA of persons selected for different haplotypes among the known markers in the region. 7 of the cases were further selected for having gotten breast cancer before age 55. The DNA sequences to be amplified and resequenced were obtained from databases. Repetitive sequences were determined and masked using RepeatMasker software. Primers were chosen using Primer3 software. PCR amplification was performed in a 10 μL volume and the product was purified with Centricon 100 (Ambion). Each sequencing reaction contained 2.5 μL of the purified PCR product, 6.5 μL of water, 2.0 μL of BigDye version 3 mix (Applied Biosystems), and 1.0 μL of 5× sequencing buffer, according to the protocol of the dye manufacturer. Electrophoresis and sequencing detection were performed using an ABI PRISM 3730 × l DNA Analyzer (Applied Biosystems).

Typing of SNPs

Table 1 lists the polymorphisms used in this study, their nature, the numbers in the NCBI database dbSNP, the method and time of typing, and their position therein, and their relative position within the region of interest. Typing was performed on a Lightcycler, a Taqman and a Sequenom. Three polymorphism, RAI-3'd1, RAI-3'd3, and RAI-3'd2 were determined as length polymorphisms on an ABI3100 (Applied Biosystems, Nærum, Denmark). Information about the assay conditions can be found in the Additional File 1: SNP typing and SNP identification.

Statistics

Data recording, calculations and tests of allele frequencies were performed in SPSS and Excel. Calculation of the relative risk and confidence intervals for the single polymorphisms was done using the procedure phrec in the program package SAS (SAS Institute, Cary, NC, USA). Simultaneous analysis of multiple SNPs employing haplotype trend regression [15] was performed with HelixTree (GoldenTree, Bozeman, MT, USA). Three programs were used for assigning haplotypes to individuals on the bases of genotype data: HelixTree, Arlequin [30] and Phase [31]. Arlequin like HelixTree is a maximum likelihood algorithm, whereas Phase also includes a penalty for each new haplotype that is brought into play. Furthermore, Arlequin includes missing values in its table of frequencies. To compensate for the latter, we normalised the values corresponding to fully defined haplotypes before including them in the analysis. All three were run under Windows 2000. HelixTree and Arlequin produced almost identical data. The data from Phase were also similar to those of the other programs. Helix Tree's values were used in the further calculations. The figure of the linkage disequilibrium among the controls (Figure 4) was also derived from HelixTree.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

BAN provided concepts and ideas, performed part of the data analysis and drafted the manuscript. UV contributed to the concepts and performed part of the marker analyses. AO performed part of the data analysis. MN, BH, ZB, ER, CK, MM, AK and ML established and performed most of the marker analyses. QZ performed the sequencing and part of the associated analysis. OR-N, SC, KO and AT established the cohorts. LB and ADB provided ideas and analyses.

Pre-publication history

The pre-publication history for this paper can be accessed here:

Additional file 1

SNP typing and SNP identification. The conditions for the SNP typing and the description of the SNPs found by sequencing are stored as a .pdf file in the Additional File 1. Click here for file
  29 in total

1.  Nucleotide excision repair gene XPD polymorphisms and genetic predisposition to melanoma.

Authors:  D Tomescu; G Kavanagh; T Ha; H Campbell; D W Melton
Journal:  Carcinogenesis       Date:  2001-03       Impact factor: 4.944

2.  Association of chromosome 19q13.2-3 haplotypes with basal cell carcinoma: tentative delineation of an involved region using data for single nucleotide polymorphisms in two cohorts.

Authors:  Eszter Rockenbauer; Mette H Bendixen; Zuzanna Bukowy; Jiaoyang Yin; Nicklas R Jacobsen; Mohammad Hedayati; Ulla Vogel; Lawrence Grossman; Lars Bolund; Bjørn A Nexø
Journal:  Carcinogenesis       Date:  2002-07       Impact factor: 4.944

3.  Proper controls for SNP studies?

Authors:  Kari Hemminki; Asta Försti
Journal:  Carcinogenesis       Date:  2002-08       Impact factor: 4.944

4.  Modulation of nucleotide excision repair capacity by XPD polymorphisms in lung cancer patients.

Authors:  M R Spitz; X Wu; Y Wang; L E Wang; S Shete; C I Amos; Z Guo; L Lei; H Mohrenweiser; Q Wei
Journal:  Cancer Res       Date:  2001-02-15       Impact factor: 12.701

5.  DNA repair capacity: inconsistency between effect of over-expression of five NER genes and the correlation to mRNA levels in primary lymphocytes.

Authors:  U Vogel; M Dybdahl; G Frentz; B A Nexo
Journal:  Mutat Res       Date:  2000-11-09       Impact factor: 2.433

6.  Multiple single nucleotide polymorphisms on human chromosome 19q13.2-3 associate with risk of Basal cell carcinoma.

Authors:  Jiaoyang Yin; Eszter Rockenbauer; Mohammad Hedayati; Nicklas Raun Jacobsen; Ulla Vogel; Lawrence Grossman; Lars Bolund; Børn A Nexø
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2002-11       Impact factor: 4.254

7.  XPD/ERCC2 polymorphisms and risk of head and neck cancer: a case-control analysis.

Authors:  E M Sturgis; R Zheng; L Li; E J Castillo; S A Eicher; M Chen; S S Strom; M R Spitz; Q Wei
Journal:  Carcinogenesis       Date:  2000-12       Impact factor: 4.944

8.  Differential roles of RelA (p65) and c-Rel subunits of nuclear factor kappa B in tumor necrosis factor-related apoptosis-inducing ligand signaling.

Authors:  Xufeng Chen; Karthikeyan Kandasamy; Rakesh K Srivastava
Journal:  Cancer Res       Date:  2003-03-01       Impact factor: 12.701

9.  iASPP oncoprotein is a key inhibitor of p53 conserved from worm to human.

Authors:  Daniele Bergamaschi; Yardena Samuels; Nigel J O'Neil; Giuseppe Trigiante; Tim Crook; Jung-Kuang Hsieh; Daniel J O'Connor; Shan Zhong; Isabelle Campargue; Matthew L Tomlinson; Patricia E Kuwabara; Xin Lu
Journal:  Nat Genet       Date:  2003-01-13       Impact factor: 38.330

10.  A haplotype of polymorphisms in ASE-1, RAI and ERCC1 and the effects of tobacco smoking and alcohol consumption on risk of colorectal cancer: a Danish prospective case-cohort study.

Authors:  Rikke D Hansen; Mette Sørensen; Anne Tjønneland; Kim Overvad; Håkan Wallin; Ole Raaschou-Nielsen; Ulla Vogel
Journal:  BMC Cancer       Date:  2008-02-20       Impact factor: 4.430

View more
  12 in total

1.  Increased expression of iASPP correlates with poor prognosis in FIGO IA2-IIA cervical adenocarcinoma following a curative resection.

Authors:  Fanming Kong; Xiaofeng Shi; Huzi Li; Pu Li; Jianchun Yu; XiaoJiang Li; Jun Chen; Yiyu Sun; Yingjie Jia
Journal:  Am J Cancer Res       Date:  2015-02-15       Impact factor: 6.166

2.  In vitro effect of iASPP on cell growth of oral tongue squamous cell carcinoma.

Authors:  Yu Chen; Wangxiang Yan; Shuqi He; Jiechun Chen; Dan Chen; Zhaoqiang Zhang; Zhiguo Liu; Xueqiang Ding; Anxun Wang
Journal:  Chin J Cancer Res       Date:  2014-08       Impact factor: 5.087

3.  GLTSCR1, ATM, PPP1R13L and CD3EAP Genetic Variants and Lung Cancer Risk in a Chinese Population.

Authors:  Jiao-Yang Yin; Ye-Gang Ma; Ulla Vogel; Dong-Hui Liu; Zhen-Xiang Sun
Journal:  Curr Med Sci       Date:  2018-08-20

4.  MicroRNA124 regulate cell growth of prostate cancer cells by targeting iASPP.

Authors:  Jun Chen; Hengjun Xiao; Zhansen Huang; Zhiming Hu; Tao Qi; Bin Zhang; Xin Tao; Song-Hao Liu
Journal:  Int J Clin Exp Pathol       Date:  2014-04-15

5.  PPP1R13L variant associated with prognosis for patients with rectal cancer.

Authors:  Yee Soo Chae; Jong Gwang Kim; Byung Woog Kang; Soo Jung Lee; Hyo-Sung Jeon; Jun Seok Park; Gyu Seog Choi; Won Kee Lee
Journal:  J Cancer Res Clin Oncol       Date:  2012-11-21       Impact factor: 4.553

6.  Genetic Polymorphisms in XRCC1, CD3EAP, PPP1R13L, XPB, XPC, and XPF and the Risk of Chronic Benzene Poisoning in a Chinese Occupational Population.

Authors:  Ping Xue; Lin Gao; Sha Xiao; Guopei Zhang; Mingyang Xiao; Qianye Zhang; Xiao Zheng; Yuan Cai; Cuihong Jin; Jinghua Yang; Shengwen Wu; Xiaobo Lu
Journal:  PLoS One       Date:  2015-12-17       Impact factor: 3.240

7.  Haplotype frequencies in a sub-region of chromosome 19q13.3, related to risk and prognosis of cancer, differ dramatically between ethnic groups.

Authors:  Mikkel H Schierup; Thomas Mailund; Heng Li; Jun Wang; Anne Tjønneland; Ulla Vogel; Lars Bolund; Bjørn A Nexø
Journal:  BMC Med Genet       Date:  2009-03-03       Impact factor: 2.103

8.  ProSim: A Method for Prioritizing Disease Genes Based on Protein Proximity and Disease Similarity.

Authors:  Gamage Upeksha Ganegoda; Yu Sheng; Jianxin Wang
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411

9.  copy number variation analysis in familial BRCA1/2-negative Finnish breast and ovarian cancer.

Authors:  Kirsi M Kuusisto; Oyediran Akinrinade; Mauno Vihinen; Minna Kankuri-Tammilehto; Satu-Leena Laasanen; Johanna Schleutker
Journal:  PLoS One       Date:  2013-08-13       Impact factor: 3.240

10.  Expression pattern of the apoptosis-stimulating protein of p53 family in p53+ human breast cancer cell lines.

Authors:  Changsong Wang; Chunfang Gao; Yanping Chen; Jian Yin; Ping Wang; Xuexia Lv
Journal:  Cancer Cell Int       Date:  2013-11-18       Impact factor: 5.722

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.