| Literature DB >> 26366667 |
Christina A Eichstaedt1, Tiago Antao2, Alexia Cardona3, Luca Pagani4, Toomas Kivisild3, Maru Mormina5.
Abstract
Arsenic is a carcinogen associated with skin lesions and cardiovascular diseases. The Colla population from the Puna region in Northwest Argentinean is exposed to levels of arsenic in drinking water exceeding the recommended maximum by a factor of 20. Yet, they thrive in this challenging environment since thousands of years and therefore we hypothesize strong selection signatures in genes involved in arsenic metabolism. We analyzed genome-wide genotype data for 730,000 loci in 25 Collas, considering 24 individuals of the neighbouring Calchaquíes and 24 Wichí from the Gran Chaco region in the Argentine province of Salta as control groups. We identified a strong signal of positive selection in the main arsenic methyltransferase AS3MT gene, which has been previously associated with lower concentrations of the most toxic product of arsenic metabolism monomethylarsonic acid. This study confirms recent studies reporting selection signals in the AS3MT gene albeit using different samples, tests and control populations.Entities:
Keywords: Arsenic drinking water; Calchaquíes; Collas; Methyltransferase; Puna
Mesh:
Substances:
Year: 2015 PMID: 26366667 PMCID: PMC4896383 DOI: 10.1016/j.mrfmmm.2015.07.007
Source DB: PubMed Journal: Mutat Res ISSN: 0027-5107 Impact factor: 2.433
Fig. 1Sampling locations and arsenic levels in the province of Salta, Argentina.
Stars denote sampling locations, circles levels of arsenic. Sampling locations of the Wichí population in the Gran Chaco region are (left to right): Embarcación, Carboncito, Misión Chacheña, Dragones (purple stars). Arsenic concentrations in surrounding locations were measured to be: Las Varas 0 µg/l, Pinchanal 19.5 µg/l, General Ballivián 4 µg/l, Tartagal 2.3 µg/l [22]. Calchaquíes originated from Cachi (turquoise star) with an arsenic level of 3.1 µg/l [22]. Collas (pink stars) were sampled in: San Antonio de los Cobres (arsenic level: 214 µg/l), Tolar Grande (arsenic level: 4 µg/l) and Olacapato (arsenic level of 12 µg/l) [14].
Arsenic detoxification associated candidate genes.
| Source | Gene | Name |
|---|---|---|
| AmiGO: “arsen” associated ontology | ATP-binding cassette sub-family C member 2 | |
| Methylarsonite methyltransferase (arsenite methyltransferase) | ||
| Arsa (Bacterial) arsenite transporter, Atp-binding, homolog 1 | ||
| Cyclin-dependent kinase inhibitor 1 | ||
| Cytoplasmic polyadenylation element-binding protein 2 | ||
| Coproporphyrinogen oxidase | ||
| Cytochrome P1-450, dioxin-inducible | ||
| DEAD (Asp-Glu-Ala-Asp) box helicase 3, X-linked | ||
| Errochelatase | ||
| Glutamate-cysteine ligase, catalytic subunit | ||
| Glutaredoxin 2 | ||
| Glutathione S-transferase omega-1 (monomethylarsonic acid reductase) | ||
| Glutathione S-transferase omega-2 (monomethylarsonic acid reductase) | ||
| Heme Oxygenase 1 | ||
| MAP kinase interacting serine/threonine kinase 2 | ||
| Peptidylprolyl isomerase F (cyclophilin F) | ||
| Phosphatidylinositol-3,4,5-trisphosphate 3-phosphatase | ||
| RNA-binding motif protein 4 | ||
| RING finger protein 4 | ||
| Solute Carrier family 34 member 1 | ||
| Serrate RNA effector molecule homolog (arsenite-resistance protein 2) | ||
| Tumor necrosis factor receptor superfamily member 11B | ||
| Uroporphyrinogen-Ill synthase | ||
| Zinc finger, AN1-type domain 1 | ||
| Zinc finger, AN1-type domain 2A (arsenite inducible RNA Associated protein) | ||
| Zinc finger, AN1-type 2B (arsenite-inducible RNA-associated protein-like protein) | ||
| Gene cards: “arsen” associated gene name | Methyltransferase-like protein 18 (arsenic-transactivated protein 2) | |
| Polymerase (DNA directed), epsilon 3 (arsenic-transactivated protein) | ||
| Serine (or cysteine) proteinase inhibitor, clade H (arsenic-transactivated protein 3) | ||
| Spindly homolog (drosophila) (arsenite-related gene 1 protein) | ||
| Literature: arsenic associated methyl-transferases | Betaine-homocysteine S-methyltransferase [ | |
| DNA (cytosine-5)-methyltransferase 1 [ | ||
| DNA (cytosine-5)-methyltransferase 3B [ | ||
| N-6 Adenine-specific DNA methyltransferase 1 [ | ||
| Phosphatidylethanolamine N-methyltransferase [ |
Fig. 2Window PBS scores across the genome in Collas.
The blue line indicates the top 1% of hits. The fourth highest cluster overall is found on chromosome 10. The green circles indicate the PBS hits ±1 Mb of AS3MT. The highest scoring SNP overall lies on chromosome 11 and falls within a gene free region. The hit on chromosome 21 is located within CBS, which regulates cerebral blood flow velocity.
Fig. 3Distribution of beneficial CG alleles in Argentinean populations.
The majority of Collas has two copies of the beneficial CG alleles (rs1046778, rs7085104) within and near AS3MT, while Calchaquíes mainly carry one copy of the specific alleles. In Wichí most individuals have no copy of the beneficial alleles. Allele frequencies differ significantly between Collas and Wichí and Calchaquíes and Wichí (p <0.001).
Top 15 windows of PBS in Collas.
| Rank | Genes in window | Window location | PBS | Max. score position |
|---|---|---|---|---|
| 1 | 11: 67400,000-67500000 | 1.188 | 23 kb Downstream | |
| 2 | 21: 44400000-44500000 | 1.005 | Within | |
| 3 | 3: 112800000-112900000 | 0.993 | Within | |
| 4 | 10: 104500000-104600000 | 0.965 | Within | |
| 5 | 11: 44700000-44800000 | 0.955 | 27 kb Upstream of TSPAN18 | |
| 6 | 14:97900000-98000000 | 0.950 | 36 kb Upstream of | |
| 7 | No gene | 13: 103800000-103900000 | 0.934 | n.a. |
| 8 | 3: 10600000-10700000 | 0.929 | Within | |
| 9 | 3: 155100000-155200000 | 0.925 | Within | |
| 10 | 10: 104700000-104800000 | 0.906 | Within | |
| 11 | 10: 104600000-104700000 | 0.906 | Within | |
| 12 | 13: 40100000-40200000 | 0.893 | Within | |
| 13 | 21: 44500000-44600000 | 0.872 | Within | |
| 14 | 10: 80300000-80400000 | 0.871 | Within | |
| 15 | 10: 26900000-27000000 | 0.870 | 617 bp Upstream of |
Fig. A1Haplotype analysis of SNPs ±1 Mb around AS3MT.
The region includes SNPs located 1 Mb up and downstream of AS3MT on chromosome 10 between positions 104, 629, 210–104, 661, 656. The central largest triangle includes AS3MT and spans 499 kb. SNPs are represented on the top and the linkage disequilibrium (LD) degree is displayed below by colour. Red squares represent strong LD (D’ = 1) with a high logarithm of odds (LOD) score, i.e. the probability of linkage between 2 loci is very high; purple: high D’, low LOD score; white: low D’ and low LOD score. Black triangles indicate haplotype blocks calculated by the programme Haploview [29]. Only heterozygous SNPs are displayed.