| Literature DB >> 30594178 |
Yosr Hamdi1, Mariem Ben Rekaya2, Shan Jingxuan3, Majdi Nagara2, Olfa Messaoud2, Amel Benammar Elgaaied4, Ridha Mrad5, Lotfi Chouchane3, Mohamed Samir Boubaker2, Sonia Abdelhak2, Hamouda Boussen6, Lilia Romdhane2,7.
Abstract
BACKGROUND: Breast cancer is the most common cancer in women worldwide. Around 50% of breast cancer familial risk has been so far explained by known susceptibility alleles with variable levels of risk and prevalence. The vast majority of these breast cancer associated variations reported to date are from populations of European ancestry. In spite of its heterogeneity and genetic wealth, North-African populations have not been studied by the HapMap and the 1000Genomes projects. Thus, very little is known about the genetic architecture of these populations.Entities:
Keywords: Breast cancer susceptibility; Functional analysis; Haplotype analysis; Population genetics
Mesh:
Year: 2018 PMID: 30594178 PMCID: PMC6310952 DOI: 10.1186/s12885-018-5133-8
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Distribution, minor allele frequencies and functional prediction of the selected breast cancer common variants
| Locus | Marker ID | Alleles | GMA | GMAF | TUN (freq) | Location | Score | eQTL associations | Predicted function |
|---|---|---|---|---|---|---|---|---|---|
| 1q32.1 | rs4245739 | A/C | C | 0.2141 | 0,333 | Intergenic | 6 | No association | 3UTR MDM4 miR-191 target site and results in decreased MDM4 expression |
| 2p24.1 | rs12710696 | T/C | T | 0.4455 | 51,1 | Intergenic | 4 | No association | TF binding and DNase peak |
| 2q31.1 | rs1550623 | A/G | G | 0.1711 | 0,24 | Intergenic | 4 | No association | TF binding and DNase peak |
| 4q21 | rs1494961 | C/T | C | 0.3355 | 52,6 | Exonic | 1f | No association | Tolerated |
| 4q21 | rs11099601 | C/T | C | 0.33613 | 53,4 | 3 UTR | 1f | DAE eQTL and TF binding / DNase peak | |
| 4q34.1 | rs6828523 | C/A | A | 0.2468 | 0,179 | Intronic, | No data | No association | No data |
| 6p23 | rs204247 | A/G | G | 0.4321 | 0,37 | Intergenic (11 kb 5′- | 6 | No association | Minimal functional evidence |
| 6q25.1 | rs2046210 | G/A | A | 0.4121 | 0,45 | Intergenic, 6 kb 3’- | 6 | No association | Minimal functional evidence |
| 7q35 | rs720475 | G/A | A | 0.1478 | 0,263 | Intronic; | 5 | TF binding or DNase peak | |
| 9q31.2 | rs10759243 | C/A | A | 0.4607 | 0,477 | Intergenic (53 kb 5′- | No data | No association | No data |
| 10p12.31 | rs7072776 | G/A | A | 0.3055 | 0,441 | Intergenic (382 bp 3′ | 5 | No association | TF binding or DNase peak |
| 10p15.1 | rs2380205 | C/T | T | 0.3750 | 0,48 | Intergenic, 2.6 kb 5′- | 3a | No association | TF binding, any motif and DNase peak |
| 10q22.3 | rs704010 | C/T | T | 0.2674 | 0,322 | Intronic; | 2b | No association | TF binding, any motif, DNase Footprint and DNase peak |
|
|
| A/G | G | 0.4089 | 0,466 | Intronic; | No data | No association | No data |
|
|
| G/A | A | 0.4038 | 0,463 | Intronic, | 5 | No association | TF binding or DNase peak |
| 11q13.1 | rs3903072 | G/T | T | 0.3165 | 0,467 | Intergenic (7.4 kb 3′- | 4 | TF binding and DNase peak | |
| 11p15.5 | rs3817198 | T/C | C | 0.2155 | 0,277 | 3UTR | 5 | No association | TF binding or DNase peak |
| 12q24.21 | rs1292011 | A/G | G | 0.4211 | 0,469 | Intergenic | 4 | No association | TF binding and DNase peak |
| 14q24.1 | rs2588809 | C/T | T | 0.1831 | 0,298 | Intronic | No data | No association | No data |
| 14q32.11 | rs941764 | A/G | G | 0.4193 | 0,468 | Intronic | 4 | No association | TF binding and DNase peak |
| 16q12.1 | rs3803662 | G/A | A | 0.4403 | 0,414 | Intergenic, 5′ to | 5 | No association | TF binding or DNase peak |
|
|
| C/T | T | 0.3133 | 0,396 | Intronic | 5 | No association | TF binding or DNase peak |
| 16q23.2 | rs13329835 | A/G | G | 0.2957 | 0,376 | Intronic | 4 | No association | TF binding and DNase peak |
| 17q21 | rs9911630 | A/G | G | 0.4972 | 0,426 | 3’of | 1b | eQTL, TF binding, any motif, DNase Footprint and DNase peak | |
| 17q21 | rs799916 | T/G | T | 0.4976 | 0,404 | Intronic | 6 | Minimal functional evidence | |
| 18q11.2 | rs1436904 | T/G | G | 0.3568 | 0,292 | Intronic | No data | No association | No data |
| 19p13.11 | rs4808801 | A/G | G | 0.4521 | 0,404 | Intronic | 1f | p = 1.6 × 10−5 for | eQTL, TF binding and/or DNase peak |
| 22q13.1 | rs6001930 | T/C | C | 0.1414 | 0,109 | Intronic | 5 | No association | TF binding or DNase peak |
-GMA Global Minor Allele, GMAF Global Minor Allele Frequency, TUN (freq) the frequency of the global minor allele in the Tunisian population, Score: from the RegulomeDB database and score significance provided in the predicted function column, eQTL association provided by the GTEx database, p the p value of the variants’ eQTL association, TF Transcription factor
-The highlighted rows indicate polymorphisms that showed the highest RegulomeDB scores, significant eQTL associations and other functional evidence
-rs1494961 is the only exonic variant in this list, we provided its predicted functional significance using the Sift software
-In bold, SNPs previously identified as associated with breast cancer risk in the Tunisian population
Fig. 1Distribution of breast cancer at risk haplotypes in the general Tunisian population. After phasing the identified haplotypes in the Tunisian population, the frequencies of at risk haplotypes (in red; identified as haplotypes carrying at risk alleles, those that have been identified as associated with the disease and usually considered as the minor alleles) have been calculated. At risk haplotypes with frequency ≥ 5% are considered as relatively common haplotypes
Fig. 2eQTL analysis of the rs9911630- BRCA1 variant in different populations. a rs9911630 was tested for correlation with BRCA1 gene expression using the eQTL database GENe Expression VARiation (Genevar) from Sanger institute plateform. eQTLs in 8 different HapMap populations are showed in this figure (CEU: Utah residents with Northern and Western European ancestry from the CEPH, CHB: Han Chinese in Beijing, China; JPT: Japanese in Tokyo, Japan; GIH: Gujarati Indians in Houston, MEX: Mexican ancestry in Los Angeles, California; YRI: Yoruba in Ibadan, Nigeria; LWK: Luhya in Webuye, Kenya; MKK: Maasai in Kinyawa, Kenya). The 8 Box plots represent BRCA1 expression levels on the y axis with respect to the rs9911630 genotypes showed on the x axis. P values (P) < 0.05 means significant eQTL associations. b eQTL association results across different HapMap populations are shown on the y axis (log10). The chromosomal position of the BRCA1 gene (chr17:38449840–38,530,994 according to NCBI36/hg18 assembly) is shown on the x axis. eQTL peaks represent the significant eQTL correlation that have been observed for rs9911630 in Caucasian, Chinese and Japanese samples. c Haplotype blocks constructed with variants identified in the Tunisian subjects showing a frequency ≥ 5%. Tagging SNPs identified on a block-by-block basis are denoted with an arrow below the SNP number. Tunisian haplotype frequencies are displayed on the right of each haplotype combination, while the level of recombination is displayed below the connections between the blocks. Thick connections represent haplotypes with frequencies ≥5%, while frequencies below 5% are represented by thin lines. Variant #12 is rs9911630
Fig. 3Principle component analysis showing population distribution based on the MAF of the 28 selected variants. a Tunisian and HapMap populations. b Other sub-populations from North Africa and other ethnic groups