| Literature DB >> 25849546 |
Carlos Eduardo G Amorim1, Josephine T Daub2, Francisco M Salzano3, Matthieu Foll4, Laurent Excoffier2.
Abstract
Tropical forests are believed to be very harsh environments for human life. It is unclear whether human beings would have ever subsisted in those environments without external resources. It is therefore possible that humans have developed recent biological adaptations in response to specific selective pressures to cope with this challenge. To understand such biological adaptations we analyzed genome-wide SNP data under a Bayesian statistics framework, looking for outlier markers with an overly large extent of differentiation between populations living in a tropical forest, as compared to genetically related populations living outside the forest in Africa and the Americas. The most significant positive selection signals were found in genes related to lipid metabolism, the immune system, body development, and RNA Polymerase III transcription initiation. The results are discussed in the light of putative tropical forest selective pressures, namely food scarcity, high prevalence of pathogens, difficulty to move, and inefficient thermoregulation. Agreement between our results and previous studies on the pygmy phenotype, a putative prototype of forest adaptation, were found, suggesting that a few genetic regions previously described as associated with short stature may be evolving under similar positive selection in Africa and the Americas. In general, convergent evolution was less pervasive than local adaptation in one single continent, suggesting that Africans and Amerindians may have followed different routes to adapt to similar environmental selective pressures.Entities:
Mesh:
Year: 2015 PMID: 25849546 PMCID: PMC4388690 DOI: 10.1371/journal.pone.0121557
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Manhattan plots of the physical position of SNPs (x-axis) and corresponding q-values in log10 scale (y-axis) for inferring selection.
Based on a sliding-window approach, SNPs are color-coded according to the best supported model of selection, namely neutrality (black or grey), in Africa (blue), in the Americas (green), and in both continents (convergent evolution, red) with a False Discovery Rate of 0.1 (dashed line). Different sets of populations were used in the analysis yielding four different population sets (PS1–4). Congruent clusters of outlier SNPs considering all four PSs are highlighted with a grey box.
Coordinates, size, and genic content of the seven clusters of outlier SNPs occurring in all four population sets (PSs) considered in the analysis.
| Cluster | Coordinates in hg19 | Size | SNPs | Genes |
|---|---|---|---|---|
| 1 | chr1:53,476,720–53,520,376 | 43.7 kb | rs6679819, rs10437066, rs6588459, rs7550236 |
|
| 2 | chr2:184,608,065–184,633,769 | 25.7 kb | rs17715017, rs1733497, rs1439771, rs2119047 | - |
| 3 | chr4:48,960,613–48,972,901 | 12.3 kb | rs2605267, rs2572363 |
|
| 4 | chr4:52,758,044–52,935,931 | 117.9 kb | rs4865414, rs1519590, rs178724, rs1460554 |
|
| 5 | chr5:43,497,655–43,862,944 | 365.3 kb | rs10062920, rs4449542, rs7721405, rs6875400 |
|
| 6 | chr6:122,738,019–122,869,764 | 131.7 kb | - |
|
| 7 | chr7:72,722,731–72,750,595 | 27.9 kb | rs1880948, rs1178970 |
|
aOutlier SNPs co-occurring in all PSs.
bNearest genes located at least 50 kb away from an outlier SNP in at least two PSs.
cFor PSs 2 and 4, cluster 5 is defined more broadly, starting at position 43,416,999, including rs4264950 and rs7720858.
dNo outlier SNP at this cluster co-occurred in all four PSs, but in each case four outlier SNPs were found (rs2816141, rs1741820, rs487098, and rs197686 for PSs 1 and 3; and rs3778348, rs3778348, rs9490478, rs9320878 for PSs 2 and 4).
Genes with signals of positive selection suggesting human adaptations to tropical forests in Africa and the Americas.
| Gene | Cluster | Biological function in mammalians or associated human diseases |
|---|---|---|
|
| 1 | Involved in cholesterol trafficking and metabolism [ |
|
| 3 | Enhance lipid remodeling to ceramides [ |
|
| 4 | Unknown. |
|
| 4 | Unknown. |
|
| 4 | Is located in a genomic region where a microdeletion causes Limb-girdle muscular dystrophy type 2E with joint hyperlaxity and contractures [ |
| C5orf34 | 5 | Unknown. |
|
| 5 | Modulate immunity to viral infection [ |
|
| 5 | Produces high concentrations of NADPH at mitochondria and the resulting energy is used for biosynthesis and in free-radical detoxification [ |
|
| 6 | Involved in the activation of heat-shock response genes under conditions of heat [ |
|
| 6 | Associated to the aggressive phenotype of prostate cancer [ |
|
| 6 | Unknown. |
|
| 7 | May play a role in modifying the susceptibility to idiopathic spermatogenic impairment [ |
|
| 7 | Deleted in Williams-Beuren syndrome (vascular system and calcium metabolism problems) [ |
|
| 7 | May be involved in the Williams-Beuren syndrome [ |
Fig 2Gene sets significantly enriched for selection signals.
Nodes are gene sets that score a q-value ≤ 20% after pruning in the gene set enrichment test (See S7 Fig. for the equivalent plot regarding the results before pruning). Gene sets are connected when at least one of them shares >33% of its genes with the other set before pruning. The size of a node scales with the size of the gene set. Each quadrant of a node represents results on one of the four selection scores, while the four rings in a node correspond to the four population sets (different combination of populations; see Methods). The color of each quarter of a ring corresponds to the significance of the test result (dark colored: q≤10%, light colored: 10%20%).
20%).
20%). Note that gene sets that score high in the test using the probability for 'any selection' are not automatically scoring significant in the separate selection models and vice versa. A gene can have a high probability for 'any selection' but this can be the sum of relatively low probabilities of the three selection models 'selection in Africa', 'selection in America' and 'convergent selection'.