| Literature DB >> 27207650 |
Stephanie L Schmit1,2,3, Fredrick R Schumacher1,2, Christopher K Edlund1,2, David V Conti1,2, Ugonna Ihenacho1,2, Peggy Wan1,2, David Van Den Berg1, Graham Casey1,2, Barbara K Fortini4, Heinz-Josef Lenz1,2, Teresa Tusié-Luna5,6, Carlos A Aguilar-Salinas5, Hortensia Moreno-Macías7, Alicia Huerta-Chagoya5,6, María Luisa Ordóñez-Sánchez7, Rosario Rodríguez-Guillén7, Ivette Cruz-Bautista7, Maribel Rodríguez-Torres7, Linda Liliana Muñóz-Hernández7, Olimpia Arellano-Campos5, Donají Gómez7, Ulices Alvirde7, Clicerio González-Villalpando8,9, María Elena González-Villalpando9, Loic Le Marchand10, Christopher A Haiman1,2, Jane C Figueiredo1,2.
Abstract
Genome-wide association studies (GWAS) have identified 58 susceptibility alleles across 37 regions associated with the risk of colorectal cancer (CRC) with P < 5×10(-8) Most studies have been conducted in non-Hispanic whites and East Asians; however, the generalizability of these findings and the potential for ethnic-specific risk variation in Hispanic and Latino (HL) individuals have been largely understudied. We describe the first GWAS of common genetic variation contributing to CRC risk in HL (1611 CRC cases and 4330 controls). We also examine known susceptibility alleles and implement imputation-based fine-mapping to identify potential ethnicity-specific association signals in known risk regions. We discovered 17 variants across 4 independent regions that merit further investigation due to suggestive CRC associations (P < 1×10(-6)) at 1p34.3 (rs7528276; Odds Ratio (OR) = 1.86 [95% confidence interval (CI): 1.47-2.36); P = 2.5×10(-7)], 2q23.3 (rs1367374; OR = 1.37 (95% CI: 1.21-1.55); P = 4.0×10(-7)), 14q24.2 (rs143046984; OR = 1.65 (95% CI: 1.36-2.01); P = 4.1×10(-7)) and 16q12.2 [rs142319636; OR = 1.69 (95% CI: 1.37-2.08); P=7.8×10(-7)]. Among the 57 previously published CRC susceptibility alleles with minor allele frequency ≥1%, 76.5% of SNPs had a consistent direction of effect and 19 (33.3%) were nominally statistically significant (P < 0.05). Further, rs185423955 and rs60892987 were identified as novel secondary susceptibility variants at 3q26.2 (P = 5.3×10(-5)) and 11q12.2 (P = 6.8×10(-5)), respectively. Our findings demonstrate the importance of fine mapping in HL. These results are informative for variant prioritization in functional studies and future risk prediction modeling in minority populations.Entities:
Mesh:
Year: 2016 PMID: 27207650 PMCID: PMC4876992 DOI: 10.1093/carcin/bgw046
Source DB: PubMed Journal: Carcinogenesis ISSN: 0143-3334 Impact factor: 4.944
Characteristics of Hispanic/Latino (HL) colorectal cancer cases and controls in a genome-wide association study of 5941 participants
| Casesa | Controlsb |
| ||
|---|---|---|---|---|
|
|
| |||
| Age [mean (SD)] | 61.2 (12.3) | 62.4 (10.2) | <0.01 | |
| BMI [mean (SD)] | 29.2 (6.1) | 27.5 (4.2) | <0.01 | |
| Sex (%) | Male | 910 (56.5) | 1845 (42.6) | <0.01 |
| Female | 701 (43.5) | 2485 (57.4) | ||
| Place of birth | United States | 680 (42.2) | 1100 (25.4) | <0.01 |
| Mexico | 585 (36.3) | 2947 (68.1)c | ||
| Central or South Americad | 222 (13.8) | 277 (6.4) | ||
| Europe | 5 (0.3) | 5 (0.1) | ||
| Cuba or Caribbean Islands | 46 (2.9) | 0 (0.0) | ||
| Ancestry estimates [mean (SD)]e | ||||
| European | 0.50 (0.24) | 0.39 (0.27) | <0.01 | |
| East Asian | 0.05 (0.10) | 0. 03 (0.07) | <0.01 | |
| African | 0.02 (0.06) | 0.01 (0.02) | <0.01 | |
| Amerindian | 0.43 (0.25) | 0.57 (0.28) | <0.01 | |
| Diabetes | No | 1151 (74.2) | 4330 (100.0) | <0.01 |
| Yes | 400 (25.8) | 0 (0.0) | ||
| Family history of CRC (first degree relative) | Nof | 1274 (79.1) | 1675 (38.7) | <0.01 |
| Yes | 146 (9.1) | 114 (2.6) | ||
| Cancer site | Colon | 987 (61.3) | — | |
| Rectum | 347 (21.5) | — | ||
| Other | 14 (0.9) | — | ||
| Stage at diagnosisg | 0 | 7 (0.4) | — | |
| 1 | 436 (27.1) | — | ||
| 2 | 259 (16.1) | — | ||
| 3 | 347 (21.5) | — | ||
| 4 | 122 (7.6) | — |
aFrom the Hispanic Colorectal Cancer Study and the Multiethnic Cohort (California).
bFrom the Slim Initiative in Genomic Medicine for the Americas (California and Mexico).
cAll non-MEC SIGMA participants were assumed to have been born in Mexico.
dArgentina, Belize, Bolivia, Brazil, Chile, Colombia, Costa Rica, Ecuador, El Salvador, Guatemala, Honduras, Nicaragua, Panama, Peru or Uruguay.
e% European, Asian and African ancestries were estimated using fastSTRUCTURE with HapMap3 European, Asian and African samples (k = 4).
f2224 controls were missing family history information.
g440 cases were missing stage information.
Figure 1.Comparison of association effect sizes for previously published CRC risk SNPs (n = 57) in the original GWAS population and in Hispanic/Latinos. Red shading denotes P < 0.05 in the HL study. GWAS = genome-wide association study. OR = odds ratio. Pearson correlation coefficient = 0.13.
Known colorectal cancer susceptibility regions harboring a variant that is statistically significantly associated with CRC with P < region-specific threshold in HL (N = 5941)
| Region | rsID | Chr | Position (hg19) | Eff | Alt | Frq Eff | Info | Unconditional | Conditional | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| OR | SE | 95% LCL | 95% UCL |
| OR | SE | 95% LCL | 95% UCL |
| ||||||||
| 3q26.2 | rs185423955 | 3 | 1699950156 | C | T | 0.04 | 0.96 | 1.61 | 0.11 | 1.29 | 2.01 | 3.2E-05 | 1.57 | 0.11 | 1.26 | 1.97 | 7.5E−05 |
| 11q12.2 | rs60892987a | 11 | 61982418 | A | G | 0.10 | 1.00 | 1.34 | 0.07 | 1.17 | 1.53 | 2.7E-05 | 1.32 | 0.07 | 1.15 | 1.51 | 6.6E−05 |
| Borderline significant variants | |||||||||||||||||
| 2q32.3 | rs7604359a | 2 | 192335294 | C | A | 0.05 | 1.00 | 1.43 | 0.09 | 1.19 | 1.71 | 1.2E-04 | 1.40 | 0.09 | 1.17 | 1.68 | 2.6E−04 |
Results are derived from unconditional and conditional logistic regression adjusted for age, sex and 10 PCs for global ancestry. The index or lead variant adjusted for in conditional analyses had r 2 ≥ 0.2 with the index in the original discovery population.
aDirectly genotyped.
Figure 2.LocusZoom regional plots (±500kb from the index SNP and/or the region’s most strongly associated variant in LD (r 2 ≥ 0.2) with the index) for 2q32.3, 3q26.2 and 11q12.2 based on analyses using best genotype calls. A–C represent association results from logistic regression adjusted for age, sex and global ancestry. D–F represent association results from logistic regression adjusted for age, sex, global ancestry and allelic dosage of the known region’s lead variant. Linkage disequilibrium shading is based on 1000 Genomes Project Phase 3 AMR samples. Diamond-shaped points in purple represent these regions’ lead variants.