Literature DB >> 29658973

Searching for ancient balanced polymorphisms shared between Neanderthals and Modern Humans.

Lucas Henriques Viscardi1, Vanessa Rodrigues Paixão-Côrtes2, David Comas3, Francisco Mauro Salzano1, Diego Rovaris1, Claiton Dotto Bau1, Carlos Eduardo G Amorim4,5, Maria Cátira Bortolini1.   

Abstract

Hominin evolution is characterized by adaptive solutions often rooted in behavioral and cognitive changes. If balancing selection had an important and long-lasting impact on the evolution of these traits, it can be hypothesized that genes associated with them should carry an excess of shared polymorphisms (trans- SNPs) across recent Homo species. In this study, we investigate the role of balancing selection in human evolution using available exomes from modern (Homo sapiens) and archaic humans (H. neanderthalensis and Denisovan) for an excess of trans-SNP in two gene sets: one associated with the immune system (IMMS) and another one with behavioral system (BEHS). We identified a significant excess of trans-SNPs in IMMS (N=547), of which six of these located within genes previously associated with schizophrenia. No excess of trans-SNPs was found in BEHS, but five genes in this system harbor potential signals for balancing selection and are associated with psychiatric or neurodevelopmental disorders. Our approach evidenced recent Homo trans-SNPs that have been previously implicated in psychiatric diseases such as schizophrenia, suggesting that a genetic repertoire common to the immune and behavioral systems could have been maintained by balancing selection starting before the split between archaic and modern humans.

Entities:  

Year:  2018        PMID: 29658973      PMCID: PMC5901502          DOI: 10.1590/1678-4685-GMB-2017-0308

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

Many of the fundamental processes at the core of complex human behavior and cognitive abilities, including sensory processing, recognition of self and others, emotions, motivation, learning, memory, attention, vocalization and speech processing, executive function, as well as neural development, may be characterized by at least some degree of heritability (Vallender, 2011; Gilissen ; Vissers ; Johnson ). Assuming that genes are partly responsible for these phenotypes, genetic variation between and within species can be expected to give rise to the large behavioral repertoire observed in nature (Bendesky and Bargmann, 2011). Furthermore, the diversity of these traits can be expected to be shaped by the fundamental forces of microevolution, including genetic drift, directional natural selection (either positive or negative), and balancing selection. While directional selection tends to reduce variability close to the selected site (Lachance and Tishkoff, 2013), balancing selection results in the persistence of variation in the population or species, even in the face of loss due to drift, leading to an excess of polymorphisms with intermediate frequencies (Nielsen ) and increasing genetic diversity around the site of selection (Charlesworth, 2006; Fijarczyk and Babik, 2015). Balancing selection can result from different processes, such as heterozygote advantage (overdominance), negative frequency-dependent selection, and heterogeneity in selective advantage across time or space – all possibly acting in changing environments requiring a fast rate of adaptation (Boon ; Wolf ; Bendesky and Bargmann, 2011; Pruitt and Riechert, 2011; Schaschl ; Taub and Page, 2016). While many cases of directional selection have been reported in the literature, only a handful of examples of balancing selection have been described. This may be due to several factors. On the one hand, balancing selection may be a transient state, leaving marks so subtle that their detection may be difficult using current tests, leading to a large number of false negatives (Fijarczyk and Babik, 2015). On the other hand, most methods rely on the fact that balancing selection will lead to a decreased inter-population diversity. In populations that diverged not long ago, or that are experiencing some level of admixture, however, this pattern will be found at most neutral loci, thereby leading to a large number of false positives. One way of circumventing this issue is by comparison of different species, considering ancient balancing selection, in which most loci should indicate moderate to high divergence, and thus the number of false positives is expected to be smaller. One of the best-studied targets of balancing selection is the major histocompatibility complex (MHC), which includes many examples of long-term maintenance of trans-species polymorphisms (trans-SNPs), i.e. ancient polymorphisms that survived in derived taxa (Takahata and Nei, 1990; Clark, 1997; Grimsley ; Ségurel ; Azevedo ). The study of these trans-SNPs has revealed a common theme, where individuals heterozygous for genes with key roles in the immune system seem to be more effective in their defense against pathogens, while at the same time presenting only a moderate inflammatory response (Cagliani ; Leffler ; Azevedo ; Teixeira ). Because of the expected loss of shared polymorphic sites over time due to genetic drift, polymorphisms shared between species that diverged a long time ago are rare under neutrality (Clark, 1997). Trans-SNPs common to species separated by a relatively deep evolutionary split, and without recent admixture, are therefore probably adaptive and maintained by balancing selection. Examples of such adaptive trans-SNPs were reported by Leffler and Teixeira in their comparison between humans and the Pan genus, which are thought to have diverged about 8 million years ago (Moorjani ). In more recently-diverged species, the presence of trans-SNPs must be interpreted with greater caution. For instance, for humans, which have an estimated effective population size (Ne) of ~10,000 individuals, 1% of neutral trans-SNPs will be preserved in the genome even after 53,000 generations (~1,6 million years) (Clark, 1997). Assuming that the split between Homo sapiens and Homo neanderthalensis occurred around 400,000-275,000 years ago (Endicott ; Prüfer ), some trans-SNPs occurring in these species are expected to be neutral and occur due to stochastic events. Additionally, the retention of ancestral polymorphisms can be due to introgression, the exchange of genetic material between different species due to hybridization (Fijarczyk and Babik, 2015). This has been described for the hybridization between archaic humans (including H. neanderthalensis and the closely related Denisovans) and some modern human populations (Green ; Reich ; Meyer ). Despite these difficulties, the investigation of trans-SNPs in the genus Homo is an exciting research goal. Hominin evolution is characterized by adaptive solutions rooted in behavioral and cognitive changes. For instance, creative thinking promotes change from prevailing modes of thought or expression, a change that can be associated with fitness gain (Nettle, 2006). In addition to the benefits of change over time, there are adaptive advantages to the parallel maintenance of different behavioral strategies within a species (Sih ; Korte ; Cagliani ; Taub and Page, 2016). Assuming a genetic basis for these behavioral strategies, their parallel persistence can be seen as the result of balancing selection. In support of the idea for balancing selection, there have been several reports of polymorphisms in genes with known roles in modulating complex behavior in modern human and other mammals, which have likely persisted through balancing selection (Cagliani ; Schaschl ; Taub and Page, 2016). Expanding on this idea, it is of great interest to investigate the role of balancing selection in the evolution of hominin, including human, behavior on a greater scale. If balancing selection has indeed had an important and long-lasting impact on the evolution of behavior in hominins, it can be hypothesized that genes associated with behavior should carry an excess of trans-SNPs across hominin species. Based on this hypothesis, we investigated the role of balancing selection in the evolution of behavior in hominins by studying the pattern of trans-SNPs in genes relevant to these traits in Neanderthals and modern humans. Importantly, as previously mentioned, many available methods used to detect balancing selection were originally designed to target balancing selection operating over more than 4Ne generations (Clark, 1997). Due to the relatively recent split between modern humans and Neanderthals, these methods are ineffective in detecting balancing selection when studying these two species. To overcome this limitation, we implemented an approach that enables the identification of an excess of trans-SNPs in groups of genes of interest in comparison to the exome background (i.e. the null distribution), which serves as a control for demographic effects, while we added control for gene length biases, GC content (and thus indirectly mutation rate), number of polymorphisms per gene and background selection. Using this approach, we were able to identify polymorphisms shared between modern humans and Neanderthals, many of which located in genes related to immunology and a few in genes playing a potential role in behavior, including genes that may contribute to personality traits and psychiatric disorders.

Material and Methods

Defining gene sets

To find genes underlying immune and behavioral systems, we defined two target gene sets, which we named IMMS (genes related to the immune system) and BEHS (genes related to behavior). We populated these gene sets by searching the AmiGO database (http://amigo.geneontology.org/cgi-bin/amigo/browse.cgi) using GeneOntology terms directly related to immune system (for IMMS) and behavior (for BEHS; supplementary material Table S1; http://amigo.geneontology.org/cgi-bin/amigo/browse.cgi). We further added to the BEHS gene set, genes associated with autism spectrum disorder (Iossifov ; Yuen ), schizophrenia (Carrera ; Li ; Srinivasan ), major depression (Cagliani ), and finally the OMIM database (https://www.omim.org) was also consulted for psychopathology associated genes (i.e. schizophrenia, major depressive disorder, autism spectrum disorder, asperger syndrome, attention-deficit disorder, antisocial behavior, and obsessive-compulsive disorder) expanding our total dataset. We then excluded genes that were not available for Neanderthal exome analysis, making for a final count of 1,780 genes in IMMS and 278 in BEHS, with a total of 17,246 analyzed genes including target (IMMS or BEHS, accordingly) and control genes.

Genetic datasets

The high quality exomes of three Neanderthals were retrieved from the Max Planck Institute database (http://cdna.eva.mpg.de/neandertal/exomes/; Castellano ). Modern human exome data were obtained from phase 3 of the 1000 Genomes Project (The 1000 Genomes Project Consortium, 2015). To avoid any confounding effects due to interbreeding among archaic humans and modern Eurasians (as reported by Green ; Reich ; Condemi ) in our analysis of balancing selection, we used only Yoruba genomes, as they are assumed to have no admixture history with Neanderthals or Denisovans. We included only autosomal single nucleotide biallelic loci, therefore excluding insertions and deletions (InDels), polymorphic sites with more than two alleles, variants on the sex chromosomes and mitochondrial variants. Only loci found to be heterozygous in the Neanderthal exomes were considered for shared and non-shared polymorphisms; this observed heterozygosity at the individual level was assumed to reflect population-wide polymorphism. Regarding the Yoruba (YRI) sample of 108 individuals, we only considered polymorphisms that both ancestral and derived alleles were segregating in this population. Importantly, as these Neanderthal samples from Croatia and Spain dated to more than 40 Kya (Castellano ), we do not expect that any Neanderthal polymorphism is a result of modern human introgression. Due to the spontaneous deamination of 5-methylcytosine, methylated CpG sites are more prone to mutation than other sites, which raises the probability of allelic identity by state rather than by descent (Azevedo ). Because of this, we performed all analyses both including and excluding SNPs located within putatively methylated CpG sites (similarly to Leffler ). The SIFT4G Software (Ng and Henikoff, 2003; Vaser ) was used to classify SNPs into synonymous vs. nonsynonymous substitutions and to perform a phenotype prediction for the disruption effect of mutations, allied with known references in literature. Polymorphisms within untranslated regions (UTRs) were excluded from all further analyses.

Evaluating the excess of trans-SNPs in gene sets

We searched in each gene for polymorphisms shared between Neanderthal and Yoruba exomes (i.e. trans-SNPs). We then compared the number of trans-SNPs in each one of the two target gene sets (IMMS and IBHS) to that of the 10,000 gene sets made according to random permutations of all remaining human genes for which Neanderthal exome sequences were available (total of 17,246 genes). In doing so, we always removed genes already accounted for in the target gene set accordingly. For instance, IMMS has 1,780 genes in its dataset, therefore 10,000 random gene sets were built using 15,466 genes as control. Each of the 10,000 random gene sets consisted of as many genes as the target gene set it was simulating, namely 1,780 genes for the comparison to IMMS and 278 to BEHS. The rationale behind the construction of random gene sets, was to generate a null genomic distribution for trans-SNPs, against which each target gene set was then tested. Statistical significance was determined by assessing the deviation in the number of trans-SNPs in the target gene sets in comparison to the background genomic distribution of trans-SNPs generated with the random gene sets. The bash script was used to run this analysis (https://github.com/cegamorim/excess_transSNPs). Because all loci in each genome were subject to the same demographic history, this approach implicitly controls for demography. However, it does not automatically control for possible effects of background selection, varying mutation rates across sites, and gene size. These effects are known to affect genetic diversity and may therefore bias our results. To control for these effects, genes in the control sets were matched, on a gene-by-gene basis, to those in the IMMS and BEHS target gene sets for background selection, gene length and GC content as follows: Background selection was measured with the B statistic developed by Mcvicker , which was computed based on the expected reduction in nucleotide diversity at a neutral site due to purifying selection at other sites, as a function of recombination rates, selected site locations, deleterious mutation rate, and the distribution of selection strengths, and indicates the expected fraction of neutral diversity that is present at a given site. A value close to 0 represents near complete removal of diversity as a result of selection, while a value close to 1 indicates that selection has had little effect on diversity (McVicker ). To be matched with a target gene, the value of B for a control gene needed to be within 0.1 units of the value of B for the target gene. Gene size was measured as total exonic length, in accordance with the USCS build hg19 refGene table (https://genome.ucsc.edu/). To be matched, the length of target and control genes needed to be within 400 bp of each other. GC content was calculated considering the gene coordinates described in the refGene table of UCSC build hg19. In a first step, we used BEDTools (Quinlan and Hall 2010) to extract the coding exon sequence based on these coordinates, and then used in-house scripts to determine the GC percentage. To be matched, the GC percentages of target and control genes needed to be within 5% of each other. The criterion for thresholds applied was chosen after many trials where at least one control gene in the exome was found for at least 98% of the target genes in IMMS and BEHS datasets. Those target genes that could not be matched to at least one control gene in the exome were excluded from all further analyses in both the target and control gene sets (Table S2). All data were handled with vcftools 0.1.13 (Danecek ) and bcftools, as well as using in- house Python and bash scripts. Plots and analyses were implemented in the R environment (www.r- project.org).

Population genetics analyses

In addition to the analysis of trans-SNPs, we considered the classical population genetics statistics Tajima’s D and Fst as potential markers of balancing selection, indicated by an excess of polymorphisms with low population differentiation. While shared polymorphisms can detect long-term balancing selection, Tajima’s D highlights regions with an excess of (not necessarily shared) polymorphisms, due to balancing selection or population size change. On the other hand, Fst estimates genetic differentiation among populations. Both, Tajima’s D and Fst are of interest, since together they indicate potential targets for balancing selection over an intermediate time span, as evaluated using the interval between 0.4 Ne and 4 Ne (Fijarczyk and Babik, 2015). Tajima’s D for the YRI population and intercontinental Fst scores (Global Fst) for Yoruba versus Europeans (CEU) and Asians (ASN) were obtained from the 1000 Genomes Selection Browser 1.0 (http://hsb.upf.edu/)(Pybus ). It should be stressed that, despite the use of just H. sapiens populations, the polymorphisms selected were those shared with Neanderthals. Negative Fst values were interpreted as 0. Furthermore, since intermediate allele frequencies are a hallmark of balancing selection (Nielsen ), we retrieved the average heterozygosity and standard error for all trans-SNPs from the dbSNP database (https://www.ncbi.nlm.nih.gov/projects/SNP/). A threshold for Fst of 0.04, as used by Cagliani to represent low values for Fst among human populations, and an average heterozygosity greater than 0.400 (as employed by Pakstis ) were considered to flag up SNPs potentially affected by balancing selection. All relevant SNPs identified in our analyses were queried for known associations with psychiatric disease using the GWAS catalog implemented in the UCSC Table Browser and the available literature.

Results

We retrieved 22,832 heterozygous sites from the Neanderthal exome (Castellano ), which 4,117 were trans-SNPs (Table S3) found within 2,519 genes. Here trans-SNPs are defined as heterozygous sites in Neanderthals that were also polymorphic in modern humans, represented by Yoruba people, according to the 1000 Genomes phase 3 data (The 1000 Genomes Project Consortium, 2015). We note that when loci located within CpG sites are included the numbers of such polymorphisms and genes almost doubles (Table S4), which is expected due to the high mutability of such sites (Kong ), some of which will potentially not be trans-SNPs but only identical by state. We sought to evaluate if gene sets related to the immune system (IMMS; 1,754 genes) and behavior (BEHS; 271 genes) were enriched for these trans-SNPs in comparison to the genome as a whole. To do so, we built and permutated 10,000 random sets of genes to be compared to these, by matching each gene in these two sets of genes to others in the genome, controlling for background selection, gene size, and mutation rate (see Material and Methods), since these factors are known to affect the number of polymorphisms in each gene. Implicitly we were also controlling for demography, since we were comparing target genes to others in the same genome, which were therefore subject to the same demographic history. Moreover, because there is no evidence up to date of interbreeding between archaic and modern humans before H. sapiens migrated out of Africa, we used Yoruba samples to control for the effect of archaic introgression (Green ; Reich ; Meyer ). Below we describe our findings using this approach for each gene set individually (IMMS and BEHS).

Signals of balancing selection in the Immune System

We identified 547 trans-SNPs in IMMS genes. This number was significantly higher than the null distribution for trans-SNP observed in the 10,000 random permutations of genes from the control set (p-value= 0.016); Table 1, Figure 1 and Figure S1; see Methods). This pattern remains even if we exclude from the analysis the CpG transitions, which are known to present a higher mutation rate (Table S3). This pattern is borderline significant when considering only shared non- synonymous substitutions (297 polymorphisms; p-value = 0.05). These results suggest that in the genus Homo, genes underlying immune system function are more likely than non-immune-genes to maintain long- term shared polymorphisms, possibly through balancing selection. Additionally, we found that Neanderthals IMMS genes harbor more heterozygous sites (2,024 polymorphisms; shared or not with modern humans) than the random sets generated by permutation (p-value < 0.001; Figure 1, column “Polymorphisms”; Table S5). Many of these loci have SNP ID numbers (rs) and have thus been found to be at least biallelic in a modern human population. Due to the known hybridization between Neanderthals and some non-African H. sapiens populations, it is difficult to determine whether they represent polymorphisms shared since their split from the common ancestral; nevertheless, these findings reinforce that genes of the immune system maintain a high level of heterozygosity in the genus Homo.
Table 1

Percentiles of the distribution of the mean values of polymorphism per gene in 10 000 random combinations simulating each of the proposed target gene systems (IMMS and BEHS) .

PercentileIMMSBEHS
%SNPsTrans-SNPs NonSynTrans-SNPsSNPsTrans-SNPs NonSynTrans-SNPs
50.7699320.0837130.1662870.988971 0.099265 0.209559
100.7773350.0865600.1702731.0073500.1066180.220588
150.7823750.0888380.1731211.0220600.1102940.227941
200.7864460.0905470.1753991.0367600.1139710.235294
250.7898630.0916860.1776771.0441200.1176470.238971
300.7927110.0933940.1793851.0551500.1213240.246324
350.7961280.0945330.1810931.0625000.1250000.250000
400.7984050.0956720.1828021.0698500.1286760.253676
450.8012530.0968110.1845101.0772100.1323530.257353
500.8041000.0979500.1856491.0845600.1323530.261029
550.8063780.0985190.1873581.0919100.1360290.264706
600.8092260.0996580.1884971.0992600.1397060.272059
650.8115030.1007970.1902051.1102900.1433820.275735
700.8143510.1019360.1919131.1176500.1470590.279412
750.8177680.1036450.1941911.1250000.1507350.286765
800.8211850.1053530.1964691.1360300.1544120.290441
850.8251710.1064920.1981781.1470600.1580880.297794
900.8302960.1093390.2010251.1617600.1654410.308824
950.8382690.112187 0.206150 1.1875000.1727940.319853
100 0.882688 0.1338270.2312071.3272100.2279410.400735

Values close to the mean number of polymorphism per gene for each of the target gene sets (IMMS and BEHS) are in italic and underlined, while significant values are in bold. Mean values for the IMMS gene set: Total Neanderthal SNPs = 0.894647, non-synonymous trans-SNPs = 0.115604, trans-SNPs = 0.220957. Mean value for the BeHS gene set: Total Neanderthal SNPs = 1.1875, non-synonymous trans-SNPs = 0.07353, trans-SNPs = 0.169118.

Figure 1

- Density distribution of the average number of polymorphisms per gene observed for random sets genes (blue shade) in the Neanderthal samples matched to those included in the immune system (IMMS) and behavioral system (BEHS) target gene sets (red bars).

Values close to the mean number of polymorphism per gene for each of the target gene sets (IMMS and BEHS) are in italic and underlined, while significant values are in bold. Mean values for the IMMS gene set: Total Neanderthal SNPs = 0.894647, non-synonymous trans-SNPs = 0.115604, trans-SNPs = 0.220957. Mean value for the BeHS gene set: Total Neanderthal SNPs = 1.1875, non-synonymous trans-SNPs = 0.07353, trans-SNPs = 0.169118. Six of the 547 trans-SNPs shared between Neanderthals and Yoruba that we identified in IMMS (rs2240464, rs56318802, rs5899, rs118014438, rs377657111, and rs14178; Table 2) are located within genes that were previously associated with schizophrenia by the Schizophrenia Working Group of the Psychiatric Genomics Consortium (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014). It is noteworthy that another two of the trans-SNPs in IMMS are found in genes that have been associated with this disorder according to another study: rs28919579 is located in the CD4 gene, a locus that has been linked to schizophrenia through an imbalance of CD4 cell subtypes, and rs374886374 is located within the C4A gene, a potential MHC locus that has recently been associated with schizophrenia (Sekar ). C4 is a fundamental element of the classical complement cascade pathway, which rapidly recognizes and eliminates pathogens and cellular debris. In the brain, C4A is expressed in neurons and promotes synaptic pruning, which is impaired in schizophrenic patients (Sekar ). The possibility that these trans-SNPs have influenced the behavioral plasticity of Neanderthal and modern humans is speculative, potentially a lot so, but these findings suggest that at least part of the common genetic repertoire that links the IMMS with schizophrenia in modern humans has been maintained polymorphic for thousands of years. We note, however, that immune system genes are known to be highly pleiotropic (Cotsapas ; Andreassen ; Wang ), and this picture may also be true for other traits besides those related to psychiatric disorders. Still, we identified another 60 trans-SNPs located within these 108 loci associated with schizophrenia (Table 2), but that were neither part of our set for IMMS nor BEHS, suggesting that other systems may have trans-SNPs in pleiotropy with schizophrenia.
Table 2

Trans-SNPs within loci associated with schizophrenia according to the Schizophrenia Working Group of the Psychiatric Genomics Consortium (2014).

Gene nameChromosomePositionID(rs)Ancestral alleleDerived allele
RERE 18419873rs376434590TC
RERE 18419874novelTG
RERE 18421203rs13596TC
BRINP2 1177247854rs31764431CG
CYP26B1 272361960rs2241057AG
MARS2 2198570253rs200490327GA
GIGYF2 2233712227rs7563724AG
STAB1 352546872rs150311081GA
STAB1 352548136rs74491782TC
PBRM1 352643685rs3755806TC
ITIH4 352850999rs2535621AG
ITIH4 352863155rs150181495GA
MUSTN1 352867718rs2276820CT
ATXN7 363982082rs3774729GA
KDM3B 5137754695rs7726234TC
DND1 5140052271rs2563333AG
PCDHA1 5140168070rs2240696AG
PCDHA3 5140182101rs7701755GT
PCDHA4 5140188383rs2337987TG
PCDHA4 5140188401rs2879086AG
PCDHA8 5140221139rs3756331GA
PCDHA8 5140221195rs199713478GC
GPX5 628497245novelCT
GPX5 628497279rs60523386GA
SCAND3 628541036novelCT
SCAND3 628543089rs41270593GT
SRPK2 7 104782888 rs2240464 A T
SRPK2 7 104844229 rs56318802 C T
DGKZ 1146387868rs1317826AG
DGKZ 1146393138novelCT
F2 11 46747662 rs5899 C T
ZDHHC5 1157463496rs140343860CT
ZDHHC5 1157466653novelGA
STAT6 12 57499258 rs118014438 C T
LRP1 1257548364rs34574998TC
LRP1 1257575070rs199672493CT
LRP1 1257592090rs370217380GA
NXPH4 1257619362rs10783816GA
PITPNM2 12123480138novelCT
CDK2AP1 12123749780rs150530930GA
XRCC3 14104165239novelGA
XRCC3 14104169630rs138987760CT
PLCB2 1540590134rs2229690GA
PLCB2 1540594191rs373064934AG
ADAMTSL3 1584706461rs950169CT
FES 15 91428302 rs377657111 C T
THAP11 1667876823rs28647874AG
THAP11 1667876826rs3982383GA
THAP11 1667876835rs151159352GA
THAP11 1667876844novelGA
CTRL 1667964203rs1134760TC
PSMB10 16 67969531 rs14178 A G
SLC12A4 1667979051rs373093291GA
SLC12A4 1667980969rs11860125GC
PLA2G15 1668293320rs3743739TC
SMG6 172203025rs1885987TG
SMG6 172203356rs35172468CG
GID4 1717948475rs2955355GA
MYO15A 1718022235rs200234990CT
MYO15A 1718023897rs2955365GA
TSSK6 1919625547rs7250893AG
ZNF536 1931038940rs199936097GA
ZNF536 1931038995rs1469705TC
ACTR5 2037377139rs2254105CT
L3MBTL2 2241610024rs139451GA

Potential signals of balancing selection in Behavior System

No excess of trans-SNPs (shared polymorphisms) was found in BEHS genes (51 SNPs in 41 genes; Table 3; Figure 1 and Figure S1). Likewise, and in contrast to the immune system, the number of heterozygous loci found in Neanderthals, shared and non-shared with Yoruba, corresponds to that expected by chance (p-value = 0.1; Figure 1, column “Polymorphisms”). Additionally, Tajima’s D was not significant for positive values (Table 3). In this regard, significant positive values for Tajima’s D would reflect more pairwise differences than segregating sites due to the increased diversity of the region surrounding the selected site, indicating old balancing selection (Nordborg ). On the other hand, we found low Fst and high average heterozygosity, both important indicators of potential signals of balancing selection, in five trans-SNPs (rs11176013, rs12628, rs310617, rs438042, and rs362331 (Table 3) shared between Neanderthals and Yoruba. All of these SNPs are associated with psychiatric and neurodevelopmental disorders, including schizophrenia. For instance, rs438042, located near the intron/exon boundary of THBS4 exon 3, is associated with Alzheimer Disease and might be important for splicing, since THBS4 acts in inflammatory responses and synaptogenesis (Cagliani ; Cocchi ). Another trans-SNP identified in the BEHS gene set, rs310617, located in the EEF1A gene, has been found to be heterozygous in the Denisova specimen. Although the specific role of rs310617 is not known, other substitutions in this gene have been associated with severe intellectual disability and epileptic encephalopathy (Nakajima ; Inui ).
Table 3

Trans-species polymorphisms identified in genes of the BEHS gene set.

GeneSNPAverage heterozygosityFunction Fst Tajima’ D PhenotypeReference1Denisova heterozygote
ALS2 rs32191560.185 +/- 0.240missense0.1642-0.558111591Amyotrophic lateral sclerosis/parkinsonism-dementia complex Tomiyama et al.,2008 NO
ALS2 rs32191680.158 +/- 0.233synonymous0.1191-0.093457107Amyotrophic lateral sclerosis Kress et al.,2005 NO
ANK2 rs339669110.104 +/- 0.203synonymous0.1239-0.750076422NANANO
ANKRD11 rs728213560.058 +/- 0.160synonymous0.0255-0.931794107KBG syndrome (OMIM 611192)NANO
ANKRD11 rs605203020.072 +/- 0.176missense0.0019-0.931794107KBG syndrome (OMIM 611192)NANO
APOL2 rs1180973500.006 +/- 0.054missense0.00050.301905912NANANO
BCL11A rs75699460.282 +/- 0.248synonymous0.0742-0.928248907Fetal Hemoglobin Level Bauer et al.,2013 NO
CACNA2D3 rs170547850.181 +/- 0.240synonymous0.0766-0.12464999NANANO
CCDC108 rs134038020.028 +/- 0.115missense0.0467-0.365570142NANANO
CHD1 rs1619410.417 +/- 0.186Synonymous0.3542-0.774153927NANANO
CHI3L1 rs1408571840.013 +/- 0.080missense0.0346-0.860993564NANANO
CHL1 rs1162613680.028 +/- 0.115missense0.0409-0.478853152NANANO
CHL1 rs22725220.428 +- /0.175missense0.133-0.478853152Schizophrenia Shaltout et al.,2013 NO
CTNS rs1614000.269 +/- 0.249missense0.64690.188849119Nephropatic cystinosis Shahkarami et al.,2013 NO
CTNS rs774538390.130 +/- 0.219synonymous0.29470.383526916Nephropatic cystinosis Shahkarami et al.,2013 NO
DBH rs11085800.490 +/- 0.069synonymous0.26280.111473168Schizophrenia, TDAH, cocaine dependence, fetal growth, bipolar disorder Shaltout et al.,2013 NO
DISC1 rs557959500.004 +/- 0.042missense0.0094-0.209468866Schizophrenia Khan et al.,2013 NO
DRD5 rs1842888060.002 +/- 0.032synonymousNANANANANO
EEF1A2 rs3106170.483 +/- 0.090synonymous0.01020.516242391Mental retardation, Epilectic encephalopathy (OMIM 602959)NAYES
EIF4EBP2 rs37507670.029 +/- 0.117synonymous0.0332-0.133346514NANANO
GPR153 rs1405188560.005 +/- 0.049synonymousNANANANANO
GRM5 rs23061530.091 +/- 0.193synonymous0.0516-0.65655565NANANO
HOXB8 rs454414920.114 +/- 0.210synonymous0.1385-0.957471816NANANO
HRAS rs126280.418 +/- 0.185synonymous0.0217-0.20012551Costello syndrome Gripp et al.,2011 NO
HTT rs3631250.307 +/- 0.243missense0.3302-0.83813619Huntington disease (OMIM 613004) Kay et al.,2014 NO
HTT rs3623310.493 +/- 0.058missense0.0336-0.718349527Huntington disease (OMIM 613004) Lombardi et al.,2009 NO
HTT rs1401245040.002 +/- 0.032synonymousNANAHuntington disease (OMIM 613004)NANO
HTT rs1384891390.002 +/- 0.035synonymousNANAHuntington disease (OMIM 613004)NANO
KDM6B rs110787090.490 +/- 0.070synonymous0.3901-1.068036831NANANO
KLF12 rs773775450.019 +/- 0.096synonymous0.026-0.747818207Panic disorder syndrome 1 (OMIM 167870)NANO
LRRK2 rs111760130.485 +/- 0.084synonymous0.02460.777713125Parkinson disease 8 (OMIM 609007) Mata et al.,2006 NO
LRRTM1 rs67338710.465 +/- 0.128missense0.1323-1.03399393Schizophrenia Ludwig et al.,2009 NO
MC4R rs22296160.032 +/- 0.122missense0.01-1.339833048Obesity Heid et al.,2008 NO
MKKS rs178526250.310 +/- 0.243synonymous0.03930.147132768Obesity Rouskas et al.,2008 NO
MKKS rs169915470.363 +/- 0.223synonymous0.10670.147132768NANANO
NMUR2 rs49585320.215 +/- 0.248missense0.1464-0.701022375NANANO
OR4C46 rs112466060.363 +/- 0.223missense-0.0025-1.30558338NANANO
PAK7 rs557737190.013 +/- 0.081synonymous0.0145-0.339650099NANANO
PHF2 rs561347530.040 +/- 0.135synonymous0.0199-0.264535787NANANO
PLCL2 rs76538340.496 +/- 0.047synonymous0.054-0.63849307NANANO
PRODH rs1399030090.004 +/- 0.047missense0.00050.292869235Schizophrenia (OMIM 606810)NANO
PRODH rs48197560.342 +/- 0.232missense0.31820.309595729Schizophrenia Kempf et al.,2008 Ota et al.,2014 NO
PRODH rs18083200.393 +/- 0.205synonymous0.14090.369538366Autism, Schizophreniaautismkb.cbi.pku.edu.cn; Prata et al.,2006 NO
RGS12 rs802518440.088 +/- 0.191missense0.1524-0.829372734NANANO
RGS12 rs1474164500.002 +/- 0.028missenseNANANANANO
RIMS1 rs771212180.019 +/- 0.097synonymous0.0092-1.576071446Cone-rod dystrophy (OMIM 606629)NAYES
SCN9A rs43698760.064 +/- 0.167missense0.0735-0.16967483Basal Pain Sensitivity Duan et al.,2015 NO
SHANK1 rs37455210.405 +/- 0.196missense0.2752-0.048981111Specifc language impairment (OMIM 606712); Mental retardation (OMIM 611097)NANO
TCF3 rs118828210.033 +/- 0.123synonymous0.0467-1.767706279NANANO
TEKT5 rs1481857510.005 +/- 0.051missense0.00050.183653042NANANO
THBS4 rs4380420.490 +/- 0.069synonymous0.03340.252034645Alzheimer Disease Cagliani et al.,2013 NO

Discussion

The evolutionary history of hominins is characterized by several notable and peculiar features. Of particular importance to the successful evolutionary trajectory of the genus Homo was the emergence of a large brain capable of sustaining complex and plastic adaptive behaviors, as well as cognitive skills. In recent years, a growing body of research has countered the notion that the Neanderthals were devoid of symbolism and presented lower cognitive abilities and less sophisticated behavior than the early H. sapiens of the Paleolithic (Akazawa ; Zilhão and Trinkaus, 2002; Zilhão ; Pike ; Rendu ). This is in line with the findings of recent genetic and paleoneurological research. For instance, research by Mounier and Ponce-de-León revealed strong similarities between modern humans and Neanderthals in both endocranial anatomy and general brain development, while a study of 162 genes related to cognition by Paixão-Côrtes identified a genetic repertoire shared between extinct archaic humans and modern humans. Assuming that the use of cognitive skills and complex behavior as an adaptive strategy represent a central element of the human evolutionary trajectory (Cagliani ; Schaschl ; Taub and Page, 2016), we sought to evaluate whether natural selection, particularly in the form of balancing selection, played any role in the evolution of genes potentially related to human behavior. Examples of a role for balancing selection in behavioral plasticity have been reported from primates (Babb ; Claw ; Dobson and Brent, 2013; Goto ; Taub and Page, 2016), rodents (Lonn ) and even arthropods (Fitzpatrick ). To detect balanced polymorphisms in genes related to behavior, we implemented an approach based on the search for an excess of shared polymorphisms (trans-SNPs) between archaic and modern humans, in comparison to the genome as a whole. Our analyses revealed no excess of trans-SNPs in genes known to underlie behavioral traits (captured in our BEHS gene set, see Material and Methods). These findings are consistent with the current knowledge about the genetics underlying H. sapiens brain function. Genes expressed in the brain have a large number of functions and the interactions between them are complex (giving rise to basal and specific behavioral phenotypes). Therefore, these genes are subject to functional constraint (Nielsen ). Recently, Aggarwala and Voight (2016) developed the concept of genic tolerance to assess the probability of nucleotide substitution in the human genome, based on factors such as population history and selection, among others. In their analysis, genes playing a role in neurodevelopmental and psychiatric disorders were found to have a strong genic intolerance to nucleotide substitution. In the context of functional complexity and constraint, relatively few, key genetic changes can lead to larger effects on certain phenotypes, in response to a specific selective pressure, while at the same time maintaining original functions. Among the results of our analyses, five trans-SNPs located within our set of BEHS (rs11176013, rs12628, rs310617, rs438042, and rs362331; Table 3) presented high average heterozygosity and low Fst values, suggesting a homogeneous distribution of both alleles between populations. These results imply that balancing selection did not have a significant role in the evolution of genes implicated in human behavior as a whole, but may have been important for the evolution of particular genes within this set. Alternatively, our approach may not have had enough statistical power to detect the effect of balancing selection on the evolution of human behavior. That could be the case if frequency dependent selection, rather than overdominance, was the main mode of balancing selection, since our test is best suited to detect overdominance. Alternatively, the signals for balancing selection in the BEHS set may significantly pre- or postdate the evolutionary split between Neanderthals and modern humans. Another possibility that deserves to be discussed in the light of our results comes from recent findings suggesting an association between behavioral traits and genes previously implicated in the immune response (Power ; Sekar ). Through our inter-Homo exonic trans-SNP approach, we found that genes underlying immune function (found in the IMMS gene set) contain more ancestral polymorphisms than expected by chance in both Neanderthal and modern humans. These genes have played an important immunological defense role during speciation and migration of the genus Homo in a probable similar context of their hominin common ancestral. Our results support the idea that the variability of immune genes is both a target and an outcome of balancing selection (Grimsley ; Ségurel ). Interestingly, and perhaps surprisingly, several studies have revealed a connection between the genetics of the immune system and human behavior. For instance, some of the strongest genetic associations found for schizophrenia at the population level involve variation in the immune system loci (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014; Sekar ). It has been suggested that some proteins of the immune system work to promote synaptic pruning (Sekar ), which is impaired in schizophrenic patients (Lips ). Other mechanisms involved both in the etiology of schizophrenia and in the immune system have also been suggested, including deviant immune responses (Pandarakalam 2013). Some of the IMMS trans-SNPs identified here have previously been found to be associated with schizophrenia in modern humans (rs2240464, rs56318802, rs5899, rs118014438, rs377657111, rs14178, Table 2). Furthermore, the trans-SNP rs374886374 is located in the gene C4A, at the MHC locus, which has recently also been associated with schizophrenia in a landmark study using several Psychiatric Genomics Consortium cohorts (Sekar ; nearly 65,000 individuals; p-value < 10-8). Schizophrenia is known to have a high heritability of around 60-80%, and interestingly, it is frequently contextualized in hypotheses that attempt to explain the evolution of modern human complex behavior (Srinivasan ). One hypothesis aiming to reconcile the relatively high prevalence (~0.5 to 1%) of this disorder across human populations with its negative effect on fitness is that balancing selection is maintaining several alleles at loci contributing to creative thinking, a trait that increases fitness. Under unfavorable circumstances, however, the same alleles are thought to increase vulnerability to psychiatric disorders, including schizophrenia (Power ; Srinivasan ). Our findings contribute to this hypothesis and suggest that some components of the immune genetic repertoire that were maintained polymorphic in both archaic and modern humans could have indirectly influenced the evolution of human behavior. This would represent an extraordinary case of evolutionary co-option, in which IMMS genes under balancing selection harbor ancestral adaptive polymorphisms related to the behavioral plasticity of the genus Homo. Allied to our conclusion, recent studies have contributed to unveil the physiological process of autoimmunity in cognition, being proposed as the driving force of cognitive evolution in genus Homo (Nataf, 2017). Finally, a range of other molecular and biological processes certainly play an important role in the evolution of the behavioral plasticity characteristic of Homo species, such as gene regulation and epigenetic mechanisms. Moreover, beyond the role of the heterozygote advantage in maintaining these polymorphisms, other forms of natural selection (frequency-dependent, directional, etc.) at multiple levels (i.e., individual, kin, and/or group levels; Polimeni and Reiss, 2003; Wilson and Hölldobler, 2005; Zhang and Perc, 2016), together with the unequivocal role of culture (Mesoudi, 2016), have shaped and, in the case of our species, continue to shape, human evolution. A full exploration of these topics is well beyond the scope of the present study, which intends only to explore and discuss some genetic and evolutionary pieces of this complex puzzle. Future studies may help to build a more complete picture of the evolution of hominin behavior.
  90 in total

1.  The interaction between personality, offspring fitness and food abundance in North American red squirrels.

Authors:  Adrienne K Boon; Denis Réale; Stan Boutin
Journal:  Ecol Lett       Date:  2007-09-17       Impact factor: 9.492

2.  Hippocampal Pruning as a New Theory of Schizophrenia Etiopathogenesis.

Authors:  Enrico Cocchi; Antonio Drago; Alessandro Serretti
Journal:  Mol Neurobiol       Date:  2015-04-24       Impact factor: 5.590

3.  Genes with de novo mutations are shared by four neuropsychiatric disorders discovered from NPdenovo database.

Authors:  Jinchen Li; Tao Cai; Yi Jiang; Huiqian Chen; Xin He; Chao Chen; Xianfeng Li; Qianzhi Shao; Xia Ran; Zhongshan Li; Kun Xia; Chunyu Liu; Zhong Sheng Sun; Jinyu Wu
Journal:  Mol Psychiatry       Date:  2015-04-07       Impact factor: 15.992

Review 4.  Detecting balancing selection in genomes: limits and prospects.

Authors:  Anna Fijarczyk; Wiesław Babik
Journal:  Mol Ecol       Date:  2015-06-15       Impact factor: 6.185

5.  Variation in the molecular clock of primates.

Authors:  Priya Moorjani; Carlos Eduardo G Amorim; Peter F Arndt; Molly Przeworski
Journal:  Proc Natl Acad Sci U S A       Date:  2016-09-06       Impact factor: 11.205

6.  U-series dating of Paleolithic art in 11 caves in Spain.

Authors:  A W G Pike; D L Hoffmann; M García-Diez; P B Pettitt; J Alcolea; R De Balbín; C González-Sainz; C de las Heras; J A Lasheras; R Montes; J Zilhão
Journal:  Science       Date:  2012-06-15       Impact factor: 47.728

7.  Using genetic evidence to evaluate four palaeoanthropological hypotheses for the timing of Neanderthal and modern human origins.

Authors:  Phillip Endicott; Simon Y W Ho; Chris Stringer
Journal:  J Hum Evol       Date:  2010-05-26       Impact factor: 3.895

8.  A majority of Huntington's disease patients may be treatable by individualized allele-specific RNA interference.

Authors:  Maria Stella Lombardi; Leonie Jaspers; Christine Spronkmans; Cinzia Gellera; Franco Taroni; Emilio Di Maria; Stefano Di Donato; William F Kaemmerer
Journal:  Exp Neurol       Date:  2009-03-13       Impact factor: 5.330

9.  The complete genome sequence of a Neanderthal from the Altai Mountains.

Authors:  Kay Prüfer; Fernando Racimo; Nick Patterson; Flora Jay; Sriram Sankararaman; Susanna Sawyer; Anja Heinze; Gabriel Renaud; Peter H Sudmant; Cesare de Filippo; Heng Li; Swapan Mallick; Michael Dannemann; Qiaomei Fu; Martin Kircher; Martin Kuhlwilm; Michael Lachmann; Matthias Meyer; Matthias Ongyerth; Michael Siebauer; Christoph Theunert; Arti Tandon; Priya Moorjani; Joseph Pickrell; James C Mullikin; Samuel H Vohr; Richard E Green; Ines Hellmann; Philip L F Johnson; Hélène Blanche; Howard Cann; Jacob O Kitzman; Jay Shendure; Evan E Eichler; Ed S Lein; Trygve E Bakken; Liubov V Golovanova; Vladimir B Doronichev; Michael V Shunkov; Anatoli P Derevianko; Bence Viola; Montgomery Slatkin; David Reich; Janet Kelso; Svante Pääbo
Journal:  Nature       Date:  2013-12-18       Impact factor: 49.962

10.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

View more
  2 in total

1.  Diverse functions associate with non-coding polymorphisms shared between humans and chimpanzees.

Authors:  Keila Velazquez-Arcelay; Mary Lauren Benton; John A Capra
Journal:  BMC Ecol Evol       Date:  2022-05-23

2.  Analysis of whole exome sequencing in severe mental illness hints at selection of brain development and immune related genes.

Authors:  Jayant Mahadevan; Ajai Kumar Pathak; Alekhya Vemula; Ravi Kumar Nadella; Biju Viswanath; Sanjeev Jain; Meera Purushottam; Mayukh Mondal
Journal:  Sci Rep       Date:  2021-10-26       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.