Literature DB >> 17903305

A genome-wide association study of breast and prostate cancer in the NHLBI's Framingham Heart Study.

Joanne M Murabito1, Carol L Rosenberg, Daniel Finger, Bernard E Kreger, Daniel Levy, Greta Lee Splansky, Karen Antman, Shih-Jen Hwang.   

Abstract

BACKGROUND: Breast and prostate cancer are two commonly diagnosed cancers in the United States. Prior work suggests that cancer causing genes and cancer susceptibility genes can be identified.
METHODS: We conducted a genome-wide association study (Affymetrix 100K SNP GeneChip) of cancer in the community-based Framingham Heart Study. We report on 2 cancer traits--prostate cancer and breast cancer--in up to 1335 participants from 330 families (54% women, mean entry age 33 years). Multivariable-adjusted residuals, computed using Cox proportional hazards models, were tested for association with qualifying SNPs (70, 987 autosomal SNPs with genotypic call rate > or =80%, minor allele frequency > or =10%, Hardy-Weinberg test p > or = 0.001) using generalized estimating equations (GEE) models and family based association tests (FBAT).
RESULTS: There were 58 women with breast cancer and 59 men with prostate cancer. No SNP associations attained genome-wide significance. The top SNP associations in GEE models for each trait were as follows: breast cancer, rs2075555, p = 8.0 x 10(-8) in COL1A1; and prostate cancer, rs9311171, p = 1.75 x 10(-6) in CTDSPL. In analysis of selected candidate cancer susceptibility genes, two MSR1 SNPs (rs9325782, GEE p = 0.008 and rs2410373, FBAT p = 0.021) were associated with prostate cancer and three ERBB4 SNPs (rs905883 GEE p = 0.0002, rs7564590 GEE p = 0.003, rs7558615 GEE p = 0.0078) were associated with breast cancer. The previously reported risk SNP for prostate cancer, rs1447295, was not included on the 100K chip. Results of cancer phenotype-genotype associations for all autosomal SNPs are web posted at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?id=phs000007 webcite.
CONCLUSION: Although no association attained genome-wide significance, several interesting associations emerged for breast and prostate cancer. These findings can serve as a resource for replication in other populations to identify novel biologic pathways contributing to cancer susceptibility.

Entities:  

Mesh:

Year:  2007        PMID: 17903305      PMCID: PMC1995609          DOI: 10.1186/1471-2350-8-S1-S6

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

Breast and prostate cancer are the most frequently diagnosed cancers in women and men respectively with over 200,000 cases each of new breast and prostate cancer estimated for 2006 in the United States [1]. Furthermore, prostate cancer is the second leading cause of cancer-related deaths in men and breast cancer is the second leading cause of cancer-related deaths in women. Family history is a well established risk factor for both breast and prostate cancer providing evidence for underlying genetic factors contributing to cancer occurrence. Accumulating research has identified a number of candidate genes and biologic pathways associated with increased susceptibility to cancer. However, even the most penetrant mutations, such as in BRCA1 and BRCA2, account for only 5–10% of cases and are present in <1% of the general population. Genome-wide association studies (GWAS) provide a comprehensive approach to identification of genetic variants associated with cancer risk unconstrained by existing knowledge and may permit detection of common genetic variants each with small associated cancer risk but great public health impact. Reports from two recent GWAS demonstrated the importance of this approach with the discovery of novel loci for breast cancer susceptibility [2,3]. Four SNPs in the FGFR2 gene were strongly associated with breast cancer and the association was confirmed in a sample of cases and controls derived from three additional studies [3]. We used the Framingham Heart Study (FHS) Affymetrix 100K SNP genotyping resource for GWAS of breast and prostate cancer phenotypes. The FHS offers the advantage of a prospective longitudinal family-based community sample with participants who have been well-characterized throughout adulthood with respect to risk factors and diseases, including cancer. We report results of two complementary strategies to identify genome-wide associations with cancer phenotypes: 1) a simple low p-value SNP ranking strategy; and 2) 100K SNP associations within candidate genes and regions previously reported to be associated with these cancers in humans.

Methods

Study sample

The genotyped study sample comprised 1345 Original cohort (n = 258) and Offspring (n = 1087) participants from the 330 largest FHS families. The Overview [4] provides further details of this sample. There were 250 participants in the sample with cancer (excluding non-melanoma skin cancer) including 58 women with breast cancer, and 59 men with prostate cancer. The Boston University Medical Center Institutional Review Board approved the examination content of Original Cohort and Offspring examinations. All participants provided written informed consent including consent for genetic studies.

Cancer phenotype definitions and residual creation

The 5209 Original Cohort participants have been examined biennially since study inception in 1948 and the 5124 Offspring Cohort participants (children of the Original Cohort and spouses of the children) have been examined approximately every 4 years since enrollment in 1971. Cancer cases were identified at routine examinations or by health-history updates for participants who did not attend an examination. Medical records were reviewed by two independent reviewers (BEK, GLS). The vast majority of cancers were confirmed by pathology reports; <3.4% of cancer cases were based on death certificate or clinical diagnosis alone. The 1976 World Health Organization ICD-O coding was used to classify all primary cancers. Hence, topography, location (subdivision of site), histology or morphology (cell histopathology), behavior (degree of malignancy), and grade (histological grading & differentiation) were recorded along with date of diagnosis. Cancer cases reviewed through December 31, 2005 were included in this study. The proportion of women and men in the study sample with breast (8%) and prostate cancer (9%) respectively was similar to that in the full FHS sample. Cox proportional hazards models were used to generate martingale residuals using the PHREG procedure in SAS to perform the regression analysis of time from study entry to cancer diagnosis or last contact free of cancer. Breast cancer was examined in women only and models were cohort-specific and adjusted for 1) age at entry and 2) age, parity, and body mass index at study entry. For prostate cancer, in men only, models were cohort-specific and adjusted for age at entry.

Genotyping

Affymetrix 100K SNP GeneChip genotyping and the Marshfield STR genotyping performed by the Mammalian Genotyping Service are described in the Overview [4]. SNPs were excluded if minor allele frequency <0.10 (n = 38062); genotypic call rate <0.80 (n = 2346); Hardy Weinberg equilibrium test p < 0.001 (n = 1595). There were 70,987 autosomal SNPs available for analysis after the exclusions.

Statistical Analysis

The statistical methods for genome-wide association analyses are described in detail in the Overview [4]. While there are various suggested methods for interpretation of genome-wide significance, we chose to use a conservative (p < 0.05/10-6 = 5 × 10-8) threshold to define genome-wide significance for this report.

Association

All cancer residual traits listed in Table 1 were computed using Cox proportional hazards models. The full set of FHS participants with the phenotype were used to create the residuals. The residuals were used to test for association between the genotyped subset of participants and the SNPs using family-based association test (FBAT) and generalized estimating equation (GEE) models. FBAT analyses were restricted to at least 10 informative families. The GEE tests tended to give an excess of very small p-values over what would be expected (see Overview [4]).
Table 1

Cancer Phenotypes for the Framingham Heart Study 100K Analyses

Phenotype*TraitsNumber of participantsEventsAdjustment (database phenotype variable name)
All Cancer* (excluding non-melanoma skin cancer)21335250Cohort- and sex-specific1. age at study entry (allcancer1)2. multivariable adjusted for age, body mass index, and cigarette smoking at study entry (allcancer2)
Breast Cancer (women)272358Cohort-specific1. age at study entry (breastcancer1)2. multivariable adjusted for age, parity, and body mass index at study entry (breastcancer2)
Prostate Cancer (men)161759Cohort-specific1. age at study entry (prostatecancer1)

Residuals from these models were used as traits to test for association with SNP genotypes. All residuals created using Cox Proportional Hazards regression. Cancers ascertained through December 31, 2005

*All SNP associations for all phenotypes in the table including all cancer are available on the web

Cancer Phenotypes for the Framingham Heart Study 100K Analyses Residuals from these models were used as traits to test for association with SNP genotypes. All residuals created using Cox Proportional Hazards regression. Cancers ascertained through December 31, 2005 *All SNP associations for all phenotypes in the table including all cancer are available on the web

SNP prioritization

We used several strategies to prioritize SNPs associated with cancer traits. First, we used an untargeted approach whereby SNP associations were ranked according to the strength of the p-value for each trait. Next we identified candidate genes reported to be associated with each cancer trait from review of the literature. Candidate genes were selected by searching PubMed (using susceptibility, gene, cancer and (breast or prostate) as keywords, last accessed 08-15-06), and the Entrez Gene and Online Mendelian Inheritance in Man resources, as well as recent text books. All available 100K SNPs in or near the a priori selected candidate genes were investigated for association with cancer traits. Finally for prostate cancer, we also examined SNP associations in the region on chromosome 8 (8q24) previously reported to be associated with prostate cancer in Icelandic families and confirmed in three case-control series [5] and African American men [6]. Further, for prostate cancer we examined the overlap in SNP associations in our study and the top 500 ranked SNPs from the Cancer Genetics Markers of Susceptibility (CGEMS) project sponsored by the National Cancer Institute. Because CGEMS used an Illumina platform for genotyping and the genotyping used in this study was performed with an Affymetrix platform, the gene_symbol from the UCSC annotation was used to link with CGEMS top 500 SNP list. Using this method, 1487 SNPs in FHS 100K (including SNPs with MAF < 0.1) correlated with the CGEMS top 500 SNPs related to a known gene. SNPs were annotated with the UCSC genome browser tables using the May 2004 assembly [7,8]. All genes within 60 kb of the top ranked SNPs were identified. The physical location of the SNPs was based on Build 35 of Genome for this report; however, the 100K web browser was based on Build 36.

Results

The cancer phenotypes available in the FHS 100K SNP resource, including details of the sample size, number of cancer events, and covariate adjustment for each trait are listed in Table 1. In this report, we consider only two phenotypes: breast cancer in women (multivariable-adjusted) and prostate cancer in men. Among participants in the 100K sample the mean age at breast cancer diagnosis was 59 years (range 35 to 83 years) in Offspring Cohort women and 70 years (range 35 to 97 years) in Original Cohort women; the mean age at prostate cancer diagnosis was 66 years (range 43 to 85 years) in Offspring Cohort men and 76 years (range 53 to 95 years) in Original Cohort men. For each of the cancer phenotypes, Table 2 provides the top 15 SNPs ranked in order by lowest p-value for the GEE models and for the FBAT models (all SNP associations can be viewed on the web) [9]. None of the SNP associations achieved genome-wide significance (p < 5 × 10-8) [4]. However, for prostate cancer, the top SNP in GEE models, rs9311171, is in CTDSPL (CTD {carboxy-terminal domain, RNA polymerase II, polypeptide A} small phosphatase-like), a gene that may play a role in tumor suppression [10].
Table 2

Cancer Phenotypes for FHS 100K Project: Results of Association Analyses*

2a. GEE results ranked by p-value
SNPChrPhysical PositionGEE p-valueFBAT p-valueGene region (within 60 kb)

BREAST CANCER

rs20755551745,629,2908.3 × 10-80.157COL1A1
rs65567565163,821,8585.0 × 10-70.114
rs11548651272,276,1046.6 × 10-70.052
rs19785031851,815,2809.66 × 10-70.287
rs19266571394,672,9571.9 × 10-60.072ABCC4
rs10263639766,503,4172.7 × 10-60.590
rs10490113259,410,9984.7 × 10-60.077
rs4586852130,099,3826.0 × 10-60.064GRIK1
rs18762061546,687,8786.0 × 10-60.742FBN1
rs93140335163,822,7841.2 × 10-50.058
rs10486490725,846,7891.3 × 10-50.245
rs93250245146,185,9791.7 × 10-50.112PPP2R2B
rs12942551229,808,0931.9 × 10-50.470
rs105137543178,681,8622.8 × 10-50.189
rs105010931127,946,6953.0 × 10-50.066KIF18A

PROSTATE CANCER

rs9311171337,971,4811.8 × 10-60.392CTDSPL
rs152927613102,726,0081.8 × 10-60.071
rs10498792651,774,5903.0 × 10-60.285PKHD1
rs4466137583,021,4953.1 × 10-60.305HAPLN1
rs3450133146,656,4865.1 × 10-60.002
rs105051378111,210,7851.3 × 10-50.002
rs18730383175,020,3431.8 × 10-50.317NLGN1
rs80199321429,676,1771.9 × 10-50.019
rs3449853146,619,4752.1 × 10-50.002
rs19206761115,789,3232.2 × 10-50.196
rs1051120610127,209,3652.4 × 10-50.028
rs10375695119,954,2042.8 × 10-50.165
rs21093122128,292,0613.0 × 10-50.786WDR33
rs13524163174,963,4093.4 × 10-50.201NLGN1
rs14407141115,,790,3993.7 × 10-50.223

2b. FBAT results ranked by p-value

SNPChrPhysical PositionGEE p-valueFBAT p-valueGene Region (within 60 kb)

BREAST CANCER

rs77119905180,307,5040.0108.4 × 10-5
rs14511254137,449,9950.0089.5 × 10-5
rs42663524109,823,9510.0011.2 × 10-4
rs67209182122,873,2760.0271.9 × 10-4
rs61011832059,025,3550.1322.0 × 10-4
rs9307064490,503,8270.1592.2 × 10-4
rs105122879101,538,5230.8072.3 × 10-4GRIN3A
rs7190881167,368,4240.4752.9 × 10-4
rs22244021229,359,0760.4183.1 × 10-4C1orf57
rs10516690484,822,7920.9543.7 × 10-4
rs104879207145,895,7270.0853.9 × 10-4CNTNAP2
rs6479347991,348,1210.0174.5 × 10-4
rs19245871396,195,3800.0204.6 × 10-4HS6ST3
rs2059273165,991,1440.0054.9 × 10-4A2BP1
rs28226692114,752,4490.0295.7 × 10-4SAMSN1

PROSTATE CANCER

rs657057656,402,2410.1093.4 × 10-5DST
rs66914821170,274,5210.0074.0 × 10-5SLC9A11
rs4378061910,930,1270.6336.6 × 10-5
rs20563871234,250,1530.1591.1 × 10-4RYR2
rs22746261176,253,5890.0051.4 × 10-4NPHS2
rs1812471753,562,7300.5523.2 × 10-4OR4D1|OR4D2
rs782734883,020,2220.0543.6 × 10-4
rs255561582,386,5660.0403.9 × 10-4XRCC4
rs15048584137,063,5550.0103.9 × 10-4
rs9547094167,317,4240.0494.2 × 10-4
rs21794432037,450,5160.0574.4 × 10-4
rs35239568,064,8340.6574.6 × 10-4
rs477516816,758,3620.0575.1 × 10-4
rs595725128,993,3310.2395.3 × 10-4OPRD1
rs105153225101,539,5430.0515.4 × 10-4SLCO4C1

*Autosomal SNPs with genotypic call rate ≥80%, minor allele frequency ≥10%, Hardy-Weinberg test p ≥ 0.001, and ≥10 informative families for FBAT. The physical location of the SNPs was based on Build 35 of Genome; however, the 100K web browser was based on Build 36

Cancer Phenotypes for FHS 100K Project: Results of Association Analyses* *Autosomal SNPs with genotypic call rate ≥80%, minor allele frequency ≥10%, Hardy-Weinberg test p ≥ 0.001, and ≥10 informative families for FBAT. The physical location of the SNPs was based on Build 35 of Genome; however, the 100K web browser was based on Build 36 There were several additional associations not listed in Table 2 that were of interest. For prostate cancer, in GEE models rs906304 (rank 27, p = 0.000067), is in NCOR2 also known as SMRT. SMRT levels have been reported to be elevated in prostate cancer cells, and result in suppression of anti-proliferative target gene actions for the vitamin D receptor [11,12]. In FBAT models, for prostate cancer SNP rs255561 (rank 17, p = 0.00039), is near XRCC4, a gene that plays a role in DNA repair and rs1897676 (rank 50, p = 0.0012), is in PTPRD. Protein tyrosine phosphatases are signaling molecules involved in the regulation of a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation [13]. For breast cancer in GEE models rs4146372 (rank 31, p = 0.00007), is near RAB21, SNP rs9307561 (rank 40, p = 0.0001), is near FAT4, and rs10512849 (rank 46, p = 0.00014), is in FGF10. These genes appear to play biologic roles in a variety of processes including tumor growth and suppression [13,14]. In FBAT models, for breast cancer rs2836391 (rank 46, p = 0.0012), is in ERG, an oncogene important in the development of prostate cancer [15,16]. All SNP Associations within Selected Breast and Prostate Candidate Genes (up to 60 kb) Our second strategy was to identify from the literature candidate genes implicated in breast and prostate cancer susceptibility (see Additional data file 1). For prostate cancer, we identified 63 candidate genes. Twenty of these candidate genes had from 1 to 20 SNPs on the 100K chip whereas the remaining genes had no SNP coverage on the chip. For breast cancer, 75 potential candidate genes were identified, 28 of these genes had between 1 and 35 SNPs on the 100K chip and the remaining candidate genes were not covered on the chip. Two SNPs in MSR1 (rs9325782, GEE p = 0.008 and rs2410373, FBAT p = 0.021) were associated with prostate cancer and three SNPs in ERBB4 (rs905883 GEE p = 0.0002, rs7564590 GEE p = 0.003, rs7558615 GEE p = 0.0078) were associated with breast cancer (Table 3). For prostate cancer, a region on chromosome 8q24 was recently reported to be associated with prostate cancer risk in Icelandic men and confirmed in three case-control series of men of European ancestry and African American men [5,6]. There were a total of 64 SNPs on the 100K chip in this 8q24 region (128 to 129.3 Mb interval). However, the reported risk SNP, rs1447295, was not included on the 100K chip and none of the 64 available SNPs were in linkage disequilibrium with the risk SNP. Five other SNPs in this region were associated with prostate cancer with a GEE or FBAT p-value < 0.01 (Table 4).
Table 3

All SNP Associations within Selected Breast and Prostate Candidate Genes (up to 60 kb)

Associations within Candidate Genes Selected by Cancer Type: FBAT or GEE p-value < 0.01
Cancer TraitGeneSNPChrPhysical PositionGEE p-valueFBAT p-value

Breast cancerERBB4rs9058832213,013,9762.0 × 10-40.8517
Breast cancerERBB4rs75645902213,213,4060.00320.2567
Breast cancerERBB4rs75586152212,771,6550.0080.8457
Prostate cancerMSR1rs9325782816,134,8448.2 × 10-40.3448
Prostate cancerMSR1rs2410373815,968,8770.0140.02051
Table 4

SNPs in the Chromosome 8q24 region Associated with Prostate Cancer:GEE or FBAT p-value < 0.01

SNPPhysical PositionDistance from reported markerMAFGEE p-valueFBAT p-value
rs7001069128179828-2532680.01883.0 × 10-50.39322
rs10505483128194377-2387190.02234.0 × 10-50.32697
rs1562871128470954378580.16960.166180.00328
rs10505474128486686535900.4060.009950.0121
rs105055061291144736813770.32750.008290.92941
SNPs in the Chromosome 8q24 region Associated with Prostate Cancer:GEE or FBAT p-value < 0.01 The National Cancer Institute commenced the CGEMS [17] initiative to conduct genome-wide association studies to identify genetic factors related to prostate and breast cancer. We examined overlap between the top 500 ranked SNPs for prostate cancer in CGEMS phase 1a [18] and the results of the FHS 100K GWAS analysis for prostate cancer. The physical position of the SNP was used to detect overlapping associations and the results are shown in Table 5. Of note, many of the associations in Table 5 are in SNPs with very low minor allele frequencies and the results are presented according to minor allele frequency. WWOX gene, a tumor suppressor gene, that has been reported to play a role in prostate cancer [19], showed evidence of association (rs3751832, p = 0.0009) in our study sample.
Table 5

Prostate Cancer SNP Associations Common to Both CGEMS Top 500 Ranked SNPs and FHS 100K SNPs

CGEMS Phase 1aFHS 100K
DBSNP_IDChrPhysical PositionCGEMS p-valueCGEMS Rank100K SNPPhysical PositionGEE p-valueFBAT p-valueMAFNearest Gene

Minor Allele Frequency ≥10%

rs30171831851,319,2410.0002410085rs379488951,212,9350.0170.7990.224TCF4
rs30171831851,319,2410.0002410085rs480114951,214,2270.0440.9830.221TCF4
rs22533192135,109,9160.00041100137rs283464535,109,5560.0090.2930.206RUNX1
rs3927151624,023,3230.00041800141rs1049279724,122,4530.0360.4070.175PRKCB1
rs1025650473,516,7960.00092200295rs44182483,194,1811.8 × 10-40.0560.109SDK1
rs61029122040,636,3490.00111000350rs98683140,594,8920.0090.0940.407PTPRT
rs47827421681,573,1510.00127400395rs25431582,313,9137.1 × 10-40.0110.141CDH13
rs23714382212,791,0370.00128500401rs10497958212,771,4650.0550.0470.164ERBB4
rs65776488135,555,6880.00144900445rs10505624135,607,8624.9 × 10-40.0070.147ZFAT1
rs93278865102,931,6460.00154600468rs10515347102,950,8330.0060.0620.354NUDT12
rs655549157,799,7920.00158700478rs105129207,455,5610.0090.6470.244ADCY2

Minor Allele Frequency <10%

rs38276714,078,2400.00055700181rs1048603114,258,7031.7 × 10-50.7590.032DGKB
rs216666280,745,0150.00057000184rs1052024779,693,4522.8 × 10-50.2400.065CTNNA2
rs216666280,745,0150.00057000184rs1052024679,693,3241.5 × 10-40.1980.050CTNNA2
rs1261256522,490,8530.00060200193rs1052088022,634,0038.5 × 10-50.3560.065CDH12
rs76105843171,635,8850.00061900196rs10513681171,624,0107.8 × 10-70.4020.025CLDN11
rs73296591397,872,5640.00064700209rs798905097,680,2996.0 × 10-60.2580.014FARP1
rs20835472,554,1040.00065000211rs104875772,583,7192.0 × 10-7N/A0.012GNA12
rs37518321677,771,7730.00079900257rs1051444377,353,3869.3 × 10-4N/A0.008WWOX

CGEMS = Cancer Genetics Markers of Susceptibility; MAF = minor allele frequency; N/A= not available, less than 10 informative families for FBAT

Prostate Cancer SNP Associations Common to Both CGEMS Top 500 Ranked SNPs and FHS 100K SNPs CGEMS = Cancer Genetics Markers of Susceptibility; MAF = minor allele frequency; N/A= not available, less than 10 informative families for FBAT

Discussion

Breast and prostate cancer are the two most frequently diagnosed cancers in the United States and result in substantial morbidity and mortality [1]. A number of breast and prostate cancer susceptibility genes and chromosomal regions have been identified [2,3,5,20-34]. However, currently known genes account for only a fraction of the familial aggregation of breast cancer [25] and few prostate cancer susceptibility genes have even been identified. Risk for these cancers is likely mediated through variation in many genes, each conferring a relatively small risk for the disease. Genome-wide association studies provide an opportunity to discover novel genes and pathways that play a causal role in cancer occurrence and in turn may lead to new therapies for the prevention and treatment of cancer. Finding genetic associations with breast and prostate cancer risk that are robust across multiple studies may facilitate the identification of high risk individuals who can be targeted for early screening and preventive interventions. We report GWAS results for breast cancer and prostate cancer phenotypes in a community-based sample of adults from two generations of the same families. Although none of the SNP associations achieved genome-wide significance in GEE or FBAT models, this resource has the potential to detect novel cancer susceptibility genes and to explore the relevance of promising candidate gene associations to human cancer. Our results can be compared to those from other genome-wide association studies such as the National Cancer Institute's CGEMS [17]. Although the two studies used different genotyping platforms limiting overlap in the SNPs examined, we were able to determine the physical position of the SNPs. Using this strategy, SNPs in the ERRB4 gene (CGEMS DSSNP_ID rs2371438 and FHS 100K SNP rs10497958) were associated with prostate cancer. ErbB proteins are widely expressed in prostate cells [35] and may play a role in tumor development, growth and progression in human prostate cancer [36,37]. We also examined the 8q24 region previously associated with prostate cancer risk. CGEMS investigators recently reported a second independent risk SNP (rs6983267) within the 8q24 strongly associated with prostate cancer [34]. The Affymetrix 100K GeneChip did not include either of the previously reported risk SNPs; however, we did identify five other SNPs in this region associated with prostate cancer. The underlying biologic mechanism mediating prostate cancer risk associated with the SNPs and chromosomal region remains unknown. A two-stage approach, genome-wide association followed by selective genotyping of SNPs with suggestive evidence of association, may provide an efficient strategy for pursuing initial genome-wide results [2,38,39]. Several important limitations merit comment. First, this study used cancer cases identified through surveillance of a multigenerational community-based sample. The enrollment and examination of Original Cohort and Offspring Cohort participants began years before DNA collection occurred. Thus, a survival bias may have been introduced. Our cases may be comprised of early-staged and less lethal cancers. To address this potential bias, we adjusted for covariates using the full Framingham sample, and used the residual traits for the subset of individuals genotyped using the 100K Affymetrix GeneChip to test for association with the SNPs in linear regression models. Residual traits from Cox models typically are not ideally distributed for linear regression models, but our adjustment method using the full Framingham sample precludes the testing of SNP associations with cancer traits using Cox models. Second, we had a small number of cancer events (250 all cancer cases, 58 breast cancer cases and 59 prostate cancer cases) limiting our ability to detect SNP associations. In a recent small GWAS of age-related macular degeneration that included 96 cases and 50 controls, an association with the CFH gene was identified [40] and confirmed in larger studies [41-43]. However, in that report, individuals homozygous for the CFH risk allele had a sevenfold increased likelihood of age-related macular degeneration [40]. It is very unlikely that common genetic variants for cancer phenotypes will confer a risk for cancer susceptibility of that magnitude. For example, the odds ratio associated with the risk marker identified for prostate cancer in region 8q24 was 1.72 in the combined Icelandic sample [5]. Furthermore, the associations between prostate cancer and the SNPs with low minor allele frequency (Table 5) are likely to be false positive associations given the small number of prostate cancer cases in our sample. Third, the 100K Affymetrix GeneChip provides limited coverage of the genome; many of our a priori candidate genes did not have any SNP coverage on the chip and coverage of some candidate genes that were present on the chip was suboptimal. Importantly, the replicated risk SNP, rs1447295, for prostate cancer [5] was not included on the chip. NHLBI has committed funds for a 550 K genome-wide scan on all FHS participants. This will enable us to confirm our initial 100K SNP associations in a larger sample with a greater number of cancer cases and with denser coverage of the genome. We did not examine epistasis or gene-environment interactions which may modify the associations noted in this study. Lastly, most of our associations are likely to be due to chance. Replication studies are needed to determine if any of the results we report are indicative of true associations. It is important that our data be used in conjunction with data from other samples given the high probability of false positive associations.

Conclusion

In summary, the untargeted genome-wide approach to detect genetic associations for cancer traits provides an opportunity to identify novel biologic pathways related to cancer occurrence and to direct future study of candidate genes that hold the most promise for relevance to cancer risk in humans. Enhancing our understanding of the mechanisms responsible for cancer susceptibility may in turn identify novel strategies for early detection, prevention, and treatment of breast and prostate cancers. These data serve as a resource for replication in other population-based samples.

Abbreviations

CGEMS = cancer genetics markers of susceptibility; FBAT = family-based association test; FHS = Framingham Heart Study; GEE = generalized estimating equations; GWAS = genome-wide association study; ICD-O = international classification of diseases for oncology; MAF = minor allele frequency; SNP = single nucleotide polymorphism.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

All authors have made substantial contributions to conception and design or acquisition of phenotypic data. JMM, SJW, DF, DL, CLR contributed to the analysis and interpretation of data. JMM, CLR, DL, DF, and SJW have been involved in drafting the manuscript or revising it critically for important intellectual content and all authors have read and approved the final manuscript.

Additional file 1

List of selected candidate genes for breast and prostate cancer. Click here for file
  38 in total

1.  Complement factor H polymorphism in age-related macular degeneration.

Authors:  Robert J Klein; Caroline Zeiss; Emily Y Chew; Jen-Yue Tsai; Richard S Sackler; Chad Haynes; Alice K Henning; John Paul SanGiovanni; Shrikant M Mane; Susan T Mayne; Michael B Bracken; Frederick L Ferris; Jurg Ott; Colin Barnstable; Josephine Hoh
Journal:  Science       Date:  2005-03-10       Impact factor: 47.728

2.  Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer.

Authors:  Scott A Tomlins; Daniel R Rhodes; Sven Perner; Saravana M Dhanasekaran; Rohit Mehra; Xiao-Wei Sun; Sooryanarayana Varambally; Xuhong Cao; Joelle Tchinda; Rainer Kuefer; Charles Lee; James E Montie; Rajal B Shah; Kenneth J Pienta; Mark A Rubin; Arul M Chinnaiyan
Journal:  Science       Date:  2005-10-28       Impact factor: 47.728

3.  Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies.

Authors:  Andrew D Skol; Laura J Scott; Gonçalo R Abecasis; Michael Boehnke
Journal:  Nat Genet       Date:  2006-01-15       Impact factor: 38.330

4.  Localization of a prostate cancer predisposition gene to an 880-kb region on chromosome 22q12.3 in Utah high-risk pedigrees.

Authors:  Nicola J Camp; James M Farnham; Lisa A Cannon-Albright
Journal:  Cancer Res       Date:  2006-10-15       Impact factor: 12.701

5.  Somatic mutations of the ERBB4 kinase domain in human cancers.

Authors:  Young Hwa Soung; Jong Woo Lee; Su Young Kim; Young Pil Wang; Keon Hyun Jo; Seok Whan Moon; Won Sang Park; Suk Woo Nam; Jung Young Lee; Nam Jin Yoo; Sug Hyung Lee
Journal:  Int J Cancer       Date:  2006-03-15       Impact factor: 7.396

Review 6.  Models of genetic susceptibility to breast cancer.

Authors:  A C Antoniou; D F Easton
Journal:  Oncogene       Date:  2006-09-25       Impact factor: 9.867

7.  Complement factor H polymorphism and age-related macular degeneration.

Authors:  Albert O Edwards; Robert Ritter; Kenneth J Abel; Alisa Manning; Carolien Panhuysen; Lindsay A Farrer
Journal:  Science       Date:  2005-03-10       Impact factor: 47.728

8.  Large-scale association study identifies ICAM gene region as breast and prostate cancer susceptibility locus.

Authors:  Stefan Kammerer; Richard B Roth; Richard Reneland; George Marnellos; Carolyn R Hoyal; Nathan J Markward; Florian Ebner; Marion Kiechle; Ulrike Schwarz-Boeger; Lyn R Griffiths; Christian Ulbrich; Korbinian Chrobok; Gerhard Forster; Georg M Praetorius; Peter Meyer; Joachim Rehbock; Charles R Cantor; Matthew R Nelson; Andreas Braun
Journal:  Cancer Res       Date:  2004-12-15       Impact factor: 12.701

9.  Spectrum of mutations in BRCA1, BRCA2, CHEK2, and TP53 in families at high risk of breast cancer.

Authors:  Tom Walsh; Silvia Casadei; Kathryn Hale Coats; Elizabeth Swisher; Sunday M Stray; Jake Higgins; Kevin C Roach; Jessica Mandell; Ming K Lee; Sona Ciernikova; Lenka Foretova; Pavel Soucek; Mary-Claire King
Journal:  JAMA       Date:  2006-03-22       Impact factor: 56.272

10.  Commonly studied single-nucleotide polymorphisms and breast cancer: results from the Breast Cancer Association Consortium.

Authors: 
Journal:  J Natl Cancer Inst       Date:  2006-10-04       Impact factor: 13.506

View more
  59 in total

1.  Analysis of the specific pathways and networks of prostate cancer for gene expression profiles in the Chinese population.

Authors:  Jia-hong Chen; Hui-chan He; Fu-neng Jiang; Julia Militar; Petor-yang Ran; Guo-qiang Qin; Chao Cai; Xi-Bin Chen; Jin Zhao; Zi-yao Mo; Yan-ru Chen; Jian-guo Zhu; Xingyin Liu; Wei-de Zhong
Journal:  Med Oncol       Date:  2011-10-30       Impact factor: 3.064

Review 2.  A compendium of genome-wide associations for cancer: critical synopsis and reappraisal.

Authors:  John P A Ioannidis; Peter Castaldi; Evangelos Evangelou
Journal:  J Natl Cancer Inst       Date:  2010-05-26       Impact factor: 13.506

3.  Variants at IRX4 as prostate cancer expression quantitative trait loci.

Authors:  Xing Xu; Wasay M Hussain; Joseph Vijai; Kenneth Offit; Mark A Rubin; Francesca Demichelis; Robert J Klein
Journal:  Eur J Hum Genet       Date:  2013-09-11       Impact factor: 4.246

Review 4.  Molecular alterations in prostate cancer as diagnostic, prognostic, and therapeutic targets.

Authors:  Bora Gurel; Tsuyoshi Iwata; Cheryl M Koh; Srinivasan Yegnasubramanian; William G Nelson; Angelo M De Marzo
Journal:  Adv Anat Pathol       Date:  2008-11       Impact factor: 3.875

5.  Pharmacogenomics of membrane transporters: past, present and future.

Authors:  Sook Wah Yee; Ligong Chen; Kathleen M Giacomini
Journal:  Pharmacogenomics       Date:  2010-04       Impact factor: 2.533

Review 6.  Cancer evolution and individual susceptibility.

Authors:  Jesús Pérez-Losada; Andrés Castellanos-Martín; Jian-Hua Mao
Journal:  Integr Biol (Camb)       Date:  2011-01-24       Impact factor: 2.192

7.  Breast Cancer-Related Low Penetrance Genes.

Authors:  Daehee Kang; Ji-Yeob Choi
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

Review 8.  Role of ErbB4 in breast cancer.

Authors:  Maria Sundvall; Kristiina Iljin; Sami Kilpinen; Henri Sara; Olli-Pekka Kallioniemi; Klaus Elenius
Journal:  J Mammary Gland Biol Neoplasia       Date:  2008-05-03       Impact factor: 2.673

9.  Single and multivariate associations of MSR1, ELAC2, and RNASEL with prostate cancer in an ethnic diverse cohort of men.

Authors:  Joke Beuten; Jonathan A L Gelfond; Jennifer L Franke; Stacey Shook; Teresa L Johnson-Pais; Ian M Thompson; Robin J Leach
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2010-01-19       Impact factor: 4.254

10.  Comparison of genetic variation of breast cancer susceptibility genes in Chinese and German populations.

Authors:  David Barzan; Marlon R Veldwijk; Carsten Herskind; Yang Li; Bo Zhang; Elena Sperk; Wei-Dong Du; Xue-Jun Zhang; Frederik Wenz
Journal:  Eur J Hum Genet       Date:  2013-03-13       Impact factor: 4.246

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.