| Literature DB >> 30245840 |
Qinan Yin1, Kshitij Srivastava1, Amha Gebremedhin2, Addisalem Taye Makuria1,3, Willy Albert Flegel1.
Abstract
The human ACKR1 gene encodes a glycoprotein expressing the Duffy blood group antigens (Fy). The Duffy protein acts as a receptor for distinct pro-inflammatory cytokines and malaria parasites. We determined the haplotypes of the ACKR1 gene in a population inhabiting a malaria-endemic area. We collected blood samples from 60 healthy volunteers in Ethiopia's southwestern low-altitude tropical region. An assay was devised to amplify the ACKR1 gene as a single amplicon and determine its genomic sequence. All haplotypes were resolved at 5178 nucleotides each, covering the coding sequence (CDS) of the ACKR1 gene and including the 5'- and 3'-untranslated regions (UTR), intron 1, and the 5'- and 3'-flanking regions. When necessary, allele-specific PCR with nucleotide sequencing or length polymorphism analysis was applied. Among the 120 chromosomes analyzed, 18 ACKR1 alleles were confirmed without ambiguity. We found 18 single-nucleotide polymorphisms (SNPs); only one SNP was novel. The non-coding sequences harbored 14 SNPs. No SNP, other than c.-67T>C, indicative of a non-functional allele, was detected. We described haplotypes of the ACKR1 gene in an autochthonous East-African population and found 18 distinct ACKR1 alleles. These long-range alleles are useful as templates to phase and analyze next-generation sequencing data, thus enhancing the reliability of clinical diagnostics.Entities:
Year: 2018 PMID: 30245840 PMCID: PMC6138691 DOI: 10.1038/s41439-018-0024-8
Source DB: PubMed Journal: Hum Genome Var ISSN: 2054-345X
Fig. 1Structure of the ACKR1 gene and SNPs found.
The sequenced region included the two exons (yellow) and intron 1 (green) along with the 5′- and 3′-flanking regions (blue), covering most of the nested amplicon. The primary amplicon was much larger (dotted line, not to scale). The 18 SNPs (bars; Table S2) were located in the coding (red bar) and non-coding sequence regions (black bar)
Genetic variations detected in the ACKR1 gene
| Location | Nucleotide changea | dbSNP reference no. | Protein residue changeb | Observations ( | Global VAFc | HWE ( | |||
|---|---|---|---|---|---|---|---|---|---|
| Homozygote reference | Heterozygote | Homozygote variant | VAF | ||||||
| 5′ Flanking region | −2872_−2871TG>del | rs5778112 | NA | 17 | 24 | 19 | 0.517 | 0.141 | 0.123 |
| −2456T>G | rs35432289 | NA | 56 | 4 | 0 | 0.033 | 0.005 | 0.789 | |
| −2212C>G | rs149599957 | NA | 59 | 1 | 0 | 0.008 | 0.003 | 0.948 | |
| −1982C>T | rs34190692 | NA | 48 | 11 | 1 | 0.108 | 0.006 | 0.693 | |
| −1310T>C | rs867811805 | NA | 59 | 1 | 0 | 0.008 | <0.01 | 0.948 | |
| 5′ UTR | −847C>T | rs114349581 | NA | 59 | 1 | 0 | 0.008 | 0.007 | 0.948 |
| −655A>G | rs3027011 | NA | 45 | 15 | 0 | 0.125 | 0.027 | 0.268 | |
| −436C>T | Novel | NA | 59 | 1 | 0 | 0.008 | NA | 0.948 | |
| −399_−398CT>del | rs71782098 | NA | 59 | 1 | 0 | 0.008 | 0.027 | 0.948 | |
| −67T>C | rs2814778 | NA | 0 | 2 | 58 | 0.983 | 0.266 | 0.896 | |
| Intron 1 | +115T>C | rs7550207 | NA | 43 | 14 | 3 | 0.208 | 0.118 | 0.215 |
| +150C>T | rs863002 | NA | 59 | 1 | 0 | 0.008 | 0.201 | 0.948 | |
| −243T>del | rs17838198 | NA | 0 | 0 | 60 | 1 | 0.222 | NA | |
| Exon 2 | 125G>A | rs12075 | Gly42Asp | 0 | 0 | 60 | 1 | 0.460 | NA |
| 265C>T | rs34599082 | Arg89Cys | 59 | 1 | 0 | 0.008 | 0.005 | 0.948 | |
| 298G>A | rs13962 | Ala100Thr | 59 | 1 | 0 | 0.008 | 0.069 | 0.948 | |
| 602C>T | rs758176489 | Thr201Met | 59 | 1 | 0 | 0.008 | 0.000 | 0.948 | |
| 3′ Flanking region | +268A>G | rs863003 | NA | 59 | 1 | 0 | 0.008 | 0.126 | 0.948 |
VAF variant allele frequency, HWE Hardy–Weinberg equilibrium, NA not applicable
aNucleotide substitutions are shown relative to the reference sequence (NG_011626.3). Nucleotide positions are defined using the first nucleotide of the coding sequence (CDS) of NM_002036.3 isoform as nucleotide position 1
bRelative to NCBI Reference Sequence NP_002027.2
cGlobal VAF from 1000Genome, TOPMed (nhlbiwgs.org), and gnomAD (http://gnomad.broadinstitute.org/) databases
ACKR1 allele distribution in southwest Ethiopian individuals
| GenBank number | Allele (confirmed haplotype)a | Observations ( | Allele frequency (%) | |
|---|---|---|---|---|
| Meanb | 95% CIc | |||
| NG_011626.3 | tgtcctcacctttctGCGCa | NA | NA | NA |
| MG932622 | tgtcctcacctctc-ACGCa | 36 | 30.0 | 21.2–40.6 |
| MG932623 | --tcctcacctctc-ACGCa | 41 | 34.3 | 24.1–45.8 |
| MG932624 | --tcttcacctctc-ACGCa | 9 | 7.5 | 3.7–14.0 |
| MG932625 | --gcctcacctctc-ACGCa | 3 | 2.5 | 0.7–6.8 |
| MG932626 | --tcctcacctccc-ACGCa | 2 | 1.7 | 0.3–5.6 |
| MG932627 | --tcttcacctccc-ACGCa | 2 | 1.7 | 0.3–5.6 |
| MG932628 | tgtcctcacctccc-ACGCa | 5 | 4.3 | 1.6–9.3 |
| MG932629 | tgtcctcgcctccc-ACGCa | 12 | 10.0 | 5.6–16.9 |
| MG932630 | --gcctcatctctc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932631 | --tgcttacctctc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932632 | tgtccccacctctc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932633 | tgtcctcacctttt-ATACa | 1 | 0.8 | 0.1–4.4 |
| MG932634 | tgtcctcgcctccc-ACGCg | 1 | 0.8 | 0.1–4.4 |
| MG932635 | tgtcctcacctttc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932636 | --tcttcac--ccc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932637 | --tcctcgcctccc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932638 | tgtcctcgcctctc-ACGCa | 1 | 0.8 | 0.1–4.4 |
| MG932639 | --tcttcacctctc-ACGTa | 1 | 0.8 | 0.1–4.4 |
| Total | 120 | 100 | NA | |
NA not applicable
aThe nucleotides at the 16 SNP and two dinucleotide repeat (rs5778112 and rs71782098) positions are shown in 5′- to 3′-orientation (see Table S2). The remaining 5158 nucleotide positions that we determined had no variation relative to the reference sequence NG_011626.3. The upper case nucleotides are located in the coding sequence while the lowercase nucleotides are located in the non-coding sequence of the ACKR1 gene
bNumber of observed alleles × 100/Total number of alleles
c95% confidence interval (CI), Poisson distribution, two sided[36]
Functional significance of non-synonymous SNPs predicted by PredictSNP
| dbSNP reference number | Variation | Computational analysis results | ||
|---|---|---|---|---|
| Nucleotide changea | Amino acid substitutionb | Classification | Expected accuracy (%)c | |
| rs12075 | c.125G>A | p.Gly42Asp | Neutral | 83 |
| rs34599082 | c.265C>T | p.Arg89Cys | Neutral | 74 |
| rs13962 | c.298G>A | p.Ala100Thr | Neutral | 68 |
| rs758176489 | c.602C>T | p.Thr201Met | Neutral | 83 |
aRelative to NCBI Reference Sequence NM_002036.3
bRelative to NCBI Reference Sequence NP_002027.2
cNormalized confidence as calculated by the software (PredictSNP)
MaCH prediction of the ACKR1 haplotypes
| Two haplotypes per individuala | MaCH prediction of | Rate (%) | |
|---|---|---|---|
| Haplotypes ( | Individuals ( | ||
| Both correct | 112 | 56 | 93.4 |
| Both incorrect | 8 | 4 | 6.6 |
| Total | 120 | 60 | 100 |
aAs compared to the 18 physically confirmed alleles (MG932622 to MG932639)