| Literature DB >> 33003314 |
Marcin Słomka1,2, Marta Sobalska-Kwapis1,2, Małgorzata Korycka-Machała3, Jarosław Dziadek3, Grzegorz Bartosz4, Dominik Strapagiel1,2.
Abstract
ATP-binding cassette sub-family G member 2 (ABCG2), also known as breast cancer resistance protein (BCRP), is one of the key efflux ATP-binding cassette (ABC) transporters of xenobiotics, their metabolites and endogenous compounds such as urate. Some of its genetic variants have been found to influence protein functioning, resulting in serious clinical implications concerning chemotherapy response, as well as gout or blood group phenotype Jr(a-). Previous reports have suggested that the frequencies of certain crucial polymorphisms, such as c.34G>A (p.Val12Met) and c.421C>A (p.Gln141Lys) differ significantly between the Polish population and other Caucasian populations. Thus, to clarify this issue, the present study performs a complete analysis of the genetic variation of ABCG2 coding sequence in the Polish population. The genetic variation in 14 out of 15 coding exons of the ABCG2 gene, as well as their flanking intron sequences, were examined among 190 healthy representatives of the Polish population using scanning with High Resolution Melting (HRM). HRM scanning revealed 17 polymorphisms: eight in the exons (including five missense variants and one point-nonsense mutation) and nine in the intron sequences (eight single nucleotide polymorphisms (SNPs) and one deletion variant). These included variants correlating with the presence of gout and phenotype Jr(a-). Linkage disequilibrium, haplotype blocks and haplotype analyses were also performed. The frequencies of the most common polymorphisms in the Polish population did not differ significantly to those observed for other Caucasian populations, but demonstrated divergence from non-Caucasian populations. We hope that our findings may be helpful for other researchers and clinicians, evaluating the pharmacogenetic role of ABCG2.Entities:
Keywords: ABCG2; BCRP; High Resolution Melting; Polish population; gene scanning
Mesh:
Substances:
Year: 2020 PMID: 33003314 PMCID: PMC7600124 DOI: 10.3390/genes11101144
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Pairs of primers used for High Resolution Melting (HRM) screening and sequencing of ABCG2.
| Exon | Number of Scanned bp Upstream from the Exon (without Primer) | Forward Primer (5′→3′) | Reverse primer (5′→3′) | Number of Scanned bp Downstream from the Exon (without Primer) | Size (bp) Including Primers | Polymorphism Detected in HRM Scanning |
|---|---|---|---|---|---|---|
| HRM scanning | ||||||
| 1 | - | - | - | - | - | - |
| 2 | 9 (28*) | TGGACTATCAACTTACTATTGCTTTTC | CAGCTCCTTCAGTAAATGCC | 161 | c.34G>A (rs2231137) | |
| GACAGCTTCCAATGACCTGA | TGGAAATAGCCAAAACCTGTG | 54 | 213 | c.203+36A>G (rs4148152) | ||
| 3 | 23 | GAACCTGACTTAAAATGGAATAGTAAA | AAAAGTGCACAGAAAACGCT | 23 | 153 | c.263+10A>G (rs2231138) |
| 4 | 16 | TCTTATAGGTTATTAGACCCACAACAT | GAAACAGAAAATGCAAACCCA | 11 | 190 | c.335C>A (rs199473672) |
| 5 | 32 | CATGGTCTTAGAAAAGACTCATTATCA | TGACCCTGTTAATCCGTTCG | 174 | c.421C>A (rs2231142) | |
| GAAAACTTACAGTTCTCAGCAGCTC | CCACACAGGGAAAGTCCTACT | 37 | 178 | - | ||
| 6 | 22 | GATAATGACTGGTTGTTATCATTGAC | GCTCCATTCCTATACTAGTCCTTTTT | 115 | c.532-16A>G (rs2231144) | |
| TTTATCCGTGGTGTGTCTGG | CCCCAAGAATATCTGGGACA | 49 | 215 | - | ||
| 7 | 51 | GGCAAGAATAGAGTATTTTACTGAGAA | TATCGAGGCTGATGAATGGA | 129 | c.706C>T/C>A (rs140207606) | |
| GCAGGGACGAACAATCATCT | AGGCCCGTGGAACATAAGTC | 109 | - | |||
| TGATAGCCTCACCTTATTGGC | CTATTAATGAAGCATTTTACAGCATAA | 15 | 125 | - | ||
| 8 | 32 | TCTTTGTTTTCCAAGACCATCA | AGGCAAAAATCTGGGACTGT | 15 | 191 | - |
| 9 | 22 | TGTTGTTATCTTGTTTGTGTTTCC | GGAGTTGACATAAATCTCCGC | 120 | - | |
| CCAAGCAGGATAAGCCACTC | GCTGATCTCCTTGAAGACTGTG | 140 | c.1060G>A (rs138606116) | |||
| AATTACATCAACTTTCCGGGG | AACCACATTGTTCCCATTTGA | 26 | 199 | - | ||
| 10 | 24 | TCTCTAATTGAAACTCTTCCCCTTT | TGAAGAAAGTAACAGCATTTTCTGA | 24 | 181 | - |
| 11 | 60 | TGTGGAAAGAGTTTTGTGGGTA | CTAACCAATAGCCCCTGCTG | 37 | 229 | c.1278-28G>A (rs771435451) |
| 12 | 37 | TGCCCTGTGGCTTCTTAAAC | CTTGGTAACATCCTCATGGGT | 147 | c.1368-21A>T (rs772239662) | |
| TCAGCGGATACTACAGAGTGTCA | CTGACTTCACCCATAGGCAA | 40 | 174 | c.1492+38G>A (rs2231155) | ||
| 13 | 26 | GTGACCTAGGCAGTTGGGTT | AACCACACTCTGACCTGCTG | 156 | - | |
| TGTTTACCCTTATGATGGTGGC | TCAGAGCAAACACAGTTCAGAC | 145 | - | |||
| AGCAGGTCAGAGTGTGGTTT | TGTGCAGGAAGAAATGAAGGA | 42 | 127 | c.1647+40T>C (rs2231157) | ||
| 14 | 10 | GCAGGCCTGACTTTTAGTATTTG | TGTTCCTAGCTTGGGAATGC | 29 | 172 | c.1714A>C (rs200894058) |
| 15 | 15 | CTGTTTACCTTGCCCTGCTC | CAAAACCCATTTTGACACTGAA | 11 | 151 | - |
| 16 | - | - | - | - | - | - |
| Direct sequencing | ||||||
| 1 | - | - | - | |||
| 2 | GTTGTGCCTGTCTTCCCATT | TGACTTCATGATCTGCCTGC | 719 | |||
| 3 | ATGCATGACCTGTTTTGTTTGTT | AAAGCAAGACACCATTGGCT | 650 | |||
| 4 | AGCCAATGGTGTCTTGCTTT | ACCAGCAGACATTTCCAAACT | 716 | |||
| 5 | AGTTTGGAAATGTCTGCTGGT | ACGTACAACACCACATTGCC | 649 | |||
| 6 | AGTACAGTATGTGGGCCGGAAT | CCCCTACACCCTCATCACAG | 718 | |||
| 7 | TGCAGATTACCTGGGTTGCT | CACATGCATGCACATTGAAA | 721 | |||
| 8 | CGTGGGAAGAAGAGAGAAAGAA | GCAAATGCATATATTGCCAAG | 535 | |||
| 9 | GGCCTGTTATACTTTTCTAATGACAGA | CTGAGTTCATGCCACCACAC | 967 | |||
| 10 | TTTTCCAGCAGTGTGCTTTG | AGGCAGGAGAATTGCTTGAA | 575 | |||
| 11 | AGGGCCCATCTTCAAATACC | TTGCTTGCTCTCTCCAACATT | 732 | |||
| 12 | GAGAGGGAGGATGTGTGGAA | GGTTGAGGCTGCAGTGAGAT | 727 | |||
| 13 | TTCCTGTGAAAAGAGAAGGCA | TCAAGAGAATCTGCCGATCA | 797 | |||
| 14 | CAGTGCTTCTTGACCTTCCA | TGAACTGAAATGCAACAATGTG | 709 | |||
| 15 | ATTGGGGAGAGAAGGAGGAA | CAGGAACCATAAGCCCTTGA | 731 | |||
| 16 | - | - | - | |||
28*—upstream from the START codon. Due to observation of multiple melting domains for coding fragment of exon 16 in HRM scanning, despite several attempts, resulting no possibility to successively study its genetic variation, primers sequences for this fragments have been omitted.
Summary of detected ATP-binding cassette sub-family G member 2 (ABCG2) genetic variation using HRM.
| Exon Scanned by HRM | dbSNP ID | Variant Position NM_004827.3: | Intron/Amino Acid Residue NP_004818.2: | Observed Genotypes a,b ( | HWE Exact Test | MAF d | ||
|---|---|---|---|---|---|---|---|---|
| R/R | R/V | V/V | ||||||
| 2 | rs2231137 | c.34G>A | p.Val12Met | 175 | 12 | 0 | 1 | (A) 0.032 |
| 2 | rs4148152 | c.203+36A>G | Intron | 175 | 14 | 0 | 1 | (G) 0.037 |
| 3 | rs2231138 | c.263+10A>G | Intron | 177 | 10 | 0 | 1 | (G) 0.027 |
| 4 | rs199473672 | c.335C>A | p.Pro112Gln | 184 | 1 | 0 | 1 | (A) 0.003 |
| 5 | rs2231142 | c.421C>A | p.Gln141Lys | 147 | 38 | 3 | 0.954 | (A) 0.117 |
| 6 | rs2231144 | c.532-16A>G | Intron | 174 | 13 | 0 | 1 | (G) 0.035 |
| 7 | rs140207606 | c.706C>T | p.Arg236Ter | 186 | 1 | 0 | 1 | (T) 0.003 |
| 7 | rs140207606 | c.706C>A | p.Arg236= | 186 | 1 | 0 | 1 | (A) 0.003 |
| 7 | rs753256449 | c.690-19_690-17delTGT | Intron | 186 | 1 | 0 | 1 | (delTGT) 0.003 |
| 9 | rs138606116 | c.1060G>A | p.Gly354Arg | 187 | 1 | 0 | 1 | (A) 0.003 |
| 11 | rs771435451 | c.1278-28G>A | Intron | 187 | 1 | 0 | 1 | (A) 0.003 |
| 11 | rs781367109 | c.1302G>A | p.Thr434= | 187 | 1 | 0 | 1 | (A) 0.003 |
| 11 | rs2231153 | c.1367+20G>A | Intron | 0 | 11 | 177 | 1 | (G) 0.029 |
| 12 | rs772239662 | c.1368-21A>T | Intron | 188 | 1 | 0 | 1 | (T) 0.003 |
| 12 | rs2231155 | c.1492+38G>A | Intron | 187 | 2 | 0 | 1 | (A) 0.005 |
| 13 | rs2231157 | c.1647+40T>C | Intron | 68 | 93 | 27 | 0.734 | (C) 0.391 |
| 14 | rs200894058 | c.1714A>C | p.Ser572Arg | 186 | 1 | 0 | 1 | (C) 0.003 |
a Number of genotypes detected during this study, R—reference allele, V—variant allele. b Total number of examined samples was 190, however during scanning single samples were excluded due to accidental problems with amplification during PCR and hence, total number of genotypes does not equal precisely 190. c p-value is consistent with Hardy–Weinberg equilibrium if p > 0.001. d Minor allele shown in brackets with its frequency.
Figure 1Structure of ABCG2 transporter in plasma membrane with highlighted amino acids for which genetic variants were detected in this study (based on the NP_004818.2 reference sequence). Amino acids encoded by subsequent exons are described and typed by different font color, and amino acids located on exons borders as splicing effect have background colored in yellow. Conserved motifs in Nucleotide Binding Domain: Walker A and B, the ABC signature C motif, Q-, D- and H-loops are described and indicated with orange squares. Amino acids involved in forming intra-molecular (C592-C608) and inter-molecular disulfide bonds (C603), and functional N-glycosylation site (N596) have background colored in blue. Amino acids with colored background and marks mean variants detected in this study: purple—missense, green—silent, red—nonsense mutation.
Inter-population comparison of minor allele frequency (MAF) values of ABCG2 polymorphisms.
| Population |
| rs2231137 | rs4148152 | rs2231138 | rs2231142 | rs2231144 | rs138606116 | rs2231153 | rs2231157 |
|---|---|---|---|---|---|---|---|---|---|
| Polish (current study) | 190 | 0.032 | 0.037 | 0.027 | 0.117 | 0.035 | 0.003 | 0.029 | 0.391 |
| Chinese [ | 27 | 0.204 | 0.259 | 0.204 | 0.222 | - | - | 0.315 |
|
| Japanese [ | 177 | 0.192 | 0.192 | 0.068 | 0.319 |
| 0.003 | 0.161 | 0.398 |
| Korean [ | 0.230 ( | 0.250 ( | 0.098 ( | 0.280 ( | - | - | 0.240 ( | 0.326 ( | |
| Dutch [ | 100 | 0.065 | 0.065 | - | 0.120 | - | - | 0.045 | 0.465 |
| Vietnamese [ | 140 | 0.360 | - | - | 0.310 | - | - | - | - |
| Chinese [ | 191 | 0.330 | - | - | 0.290 | - | - | - | - |
| Swedish [ | 60 | 0.020 | - | - | 0.100 | - | - | - | - |
| Hungarian [ | 149 | 0.040 | - | - | 0.094 | - | - | - | - |
| Czech [ | 100 | 0.015 | - | - | 0.075 | - | - | - | - |
| German | 0.030 ( | - | - | 0.104 ( | - | - | - | - | |
| Han Chinese [ | 95 | - | - | - | 0.342 ( | - | - | - | - |
| American Caucasian [ | 88 | - | - | - | 0.119 ( | - | - | - | - |
| European Caucasian [ | 84 | - | - | - | 0.107 ( | - | - | - | - |
| African American [ | 94 | - | - | - |
| - | - | - | - |
| African (sub-Sahara) [ | 938 | - | - | - |
| - | - | - | - |
Shaded cells mean significantly higher MAF than in the Polish population; bolded values mean significantly lower MAF than in the Polish population.
Figure 2Linkage disequilibrium analysis of ABCG2 polymorphisms detected in this study using |D’| parameter. Pairwise linkage demonstrated as |D’| values (100×) by graded pink and black colors, denser color means closer linkage and lack of value means |D’| = 1 × 100. Bolded triangles show haplotype blocks.
Figure 3Linkage disequilibrium analysis of ABCG2 polymorphisms detected in this study using r2 parameter. Pairwise linkage demonstrated as r2 values (×100) by graded black colors, denser color means closer linkage. Bolded triangles show haplotype blocks.
Haplotype block analysis.
| Block 1 | Block 2 | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Variant residue a,b | c.34G>A | c.203+36A>G | c.263+10A>G | c.335C>A | c.421C>A | c.532-16A>G | c.706C>T | c.706C>A | c.690-19_690-17delTGT e | c.1060G>A | Haplotype frequency | Recombination | c.1278-28G>A | c.1302G>A | c.1367+20G>A | c.1368-21A>T | c.1492+38G>A | c.1647+40T>C | c.1714A>C | Haplotype frequency |
| Haplotypes c | G | A | A | C | C | A | C | C | TGT | G | 0.804 | 0.40 | G | G | A | A | G | T | A | 0.584 |
| G | A | A | C | A | A | C | C | TGT | G | 0.113 | G | G | A | A | G | C | A | 0.376 | ||
| G | A | A | C | C | G | C | C | TGT | G | 0.031 | G | G | G | A | G | T | A | 0.020 | ||
| A | G | G | C | C | A | C | C | TGT | G | 0.024 | others | 0.020 | ||||||||
| others | 0.028 | |||||||||||||||||||
a Residues followed by reference sequences for coding nucleotide NM_004827.3 and amino acid position NP_004818.2. b Bolded polymorphisms mean haplotype tag SNPs (htSNPs). c Major alleles in white boxes, minor alleles in shaded boxes; all the haplotypes with their frequencies in population; haplotypes below 1% not shown and grouped like “others”. d Level of recombination between blocks as a value of multiallelic D’. e TGT allele in haplotype means no deletion.