| Literature DB >> 23786662 |
Irving Hsu1, Rong Chen, Aditya Ramesh, Erik Corona, Hyunseok Peter Kang, David Ruau, Atul J Butte.
Abstract
BACKGROUND: Long-term environmental variables are widely understood to play important roles in DNA variation. Previously, clinical studies examining the impacts of these variables on the human genome were localized to a single country, and used preselected DNA variants. Furthermore, clinical studies or surveys are either not available or difficult to carry out for developing countries. A systematic approach utilizing bioinformatics to identify associations among environmental variables, genetic variation, and diseases across various geographical locations is needed but has been lacking.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23786662 PMCID: PMC3699381 DOI: 10.1186/1471-2350-14-62
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
Figure 1GeoWAS methodology used to identify SNPs associated with UV radiation. Once obtained, the radiation SNPs were further analyzed for positive selection in populations, enrichment in disease-related polymorphisms, and enrichment in human genes.
Figure 2UV radiation global distribution. UV radiation levels are measured in Joules of ambient energy per square meter (J/m2). Darker hues correspond to higher levels of radiation.
Figure 3Scatterplot correlating ancestral allele frequencies against the radiation levels in each country for the rs16891982 SNP. Each point on the plot represents a single country. The rs16891982 SNP is found in the SLC45A2 gene, and is known to be associated with phenotypes such as pigmentation and skin cancer. Specific country allele frequencies are shown in Figure 5.
Figure 5HGDP population ancestral allele frequency distributions for the rs16891982 SNP [16]. The distribution trends across the populations are analogous to those of the continents in Figure 4.
Different phenotypes and diseases associated with radiation SNPs, identified through VARIMED
| 10508503 | 0.841646 | Intergenic | - | C | 1.10 × 10-7 | |
| 10809808 | 0.800649 | Intergenic | - | T | 5.80 × 10-12 | |
| 10922300 | 0.805829 | Intergenic | - | C | 7.44 × 10-11 | |
| 11856835 | 0.811979 | Intron | G | 1.10 × 10-7 | ||
| 1263173 | 0.815426 | Intergenic | - | G | 2.13 × 10-7 | |
| 1263173 | 0.815426 | Intergenic | - | G | 6.80 × 10-14 | |
| 12913832 | 0.801066 | Intron | A | 1.00 × 10-11 | ||
| 12913832 | 0.801066 | Intron | A | 8.51 × 10-10 | ||
| 16891982 | -0.83108 | Missense | G | 1.60 × 10-12 | ||
| 16891982 | -0.83108 | Missense | G | 1.48 × 10-12 | ||
| 16891982 | -0.83108 | Missense | G | 3.89 × 10-11 | ||
| 16891982 | -0.83108 | Missense | G | 2.00 × 10-8 | ||
| 16891982 | -0.83108 | Missense | G | 8.30 × 10-15 | ||
| 16891982 | -0.83108 | Missense | G | 1.70 × 10-9 | ||
| 16891982 | -0.83108 | Missense | G | 5.02 × 10-8 | ||
| 16891982 | -0.83108 | Missense | G | 1.00 × 10-7 | ||
| 28777 | 0.805006 | Intron | C | 1.10 × 10-8 | ||
| 2153271 | 0.800255 | Intron | C | 3.98 × 10-10 | ||
| 2286963 | 0.813366 | Missense | T | 3.10 × 10-12 | ||
| 2470893 | -0.80281 | Near Gene5 | T | 2.90 × 10-9 | ||
| 2899472 | 0.81232 | Intron | C | 1.90 × 10-12 | ||
| 3829241 | 0.831729 | Missense | G | 6.20 × 10-14 | ||
| 496300 | 0.813658 | Intergenic | - | T | 3.90 × 10-7 | |
| 7196459 | 0.841939 | Intron | G | 3.10 × 10-15 | ||
| 7259371 | 0.812093 | Intron | A | 2.20 × 10-7 | ||
| 798489 | 0.808614 | Intron | C | 1.90 × 10-8 | ||
| 884205 | 0.806085 | Intergenic | - | C | 9.40 × 10-9 |
Empty values in the gene column indicate that the corresponding SNP is located outside of a gene region. Cancerous phenotypes are highlighted with an asterisk. As many of the same SNPs have been repeatedly identified for varying but similar diseases and conditions, these are not independent traits.
Figure 4Ancestral allele frequency distributions for rs16891982 across various HGDP continents. For each continent, the distribution was calculated by aggregating the allele frequencies of all the countries within it.
Native populations under strong positive selection for radiation SNPs
| Russian | Russia | 413 | 0.99 | 2.1 × 10-6 |
| French | Southern Europe | 411 | 0.97 | 1.5 × 10-6 |
| Basque | Southern Europe | 375 | 0.96 | 1.4 × 10-5 |
| Sardinian | Southern Europe | 385 | 0.90 | 3.7 × 10-4 |
| Orcadian | Northern Europe | 397 | 0.89 | 6.5 × 10-4 |
| Tuscan | Southern Europe | 360 | 0.85 | 1.9 × 10-2 |
| Druze | Israel | 326 | 0.86 | 3.9 × 10-2 |
| Mozabite | Northern Africa | 304 | 0.85 | 4.6 × 10-2 |
Ontology of genes in which the radiation SNPs were significantly enriched
| Glutamate receptor activity | 5 | 13.6 | 1.7 × 10-5 | 0.031 |
| Calcium ion binding | 40 | 2.0 | 2.3 × 10-5 | 0.041 |
After performing a Bonferroni correction, genes with a p-value under 0.05 were further studied for relation to UV radiation.