Literature DB >> 34132815

DLX5/6 GABAergic Expression Affects Social Vocalization: Implications for Human Evolution.

Giovanni Levi1, Camille de Lombares1, Cristina Giuliani2, Vincenzo Iannuzzi3, Rym Aouci1, Paolo Garagnani4,5, Claudio Franceschi4,6, Dominique Grimaud-Hervé7, Nicolas Narboux-Nême1.   

Abstract

DLX5 and DLX6 are two closely related transcription factors involved in brain development and in GABAergic differentiation. The DLX5/6 locus is regulated by FoxP2, a gene involved in language evolution and has been associated with neurodevelopmental disorders and mental retardation. Targeted inactivation of Dlx5/6 in mouse GABAergic neurons (Dlx5/6VgatCre mice) results in behavioral and metabolic phenotypes notably increasing lifespan by 33%. Here, we show that Dlx5/6VgatCre mice present a hyper-vocalization and hyper-socialization phenotype. While only 7% of control mice emitted more than 700 vocalizations/10 min, 30% and 56% of heterozygous or homozygous Dlx5/6VgatCre mice emitted more than 700 and up to 1,400 calls/10 min with a higher proportion of complex and modulated calls. Hyper-vocalizing animals were more sociable: the time spent in dynamic interactions with an unknown visitor was more than doubled compared to low-vocalizing individuals. The characters affected by Dlx5/6 in the mouse (sociability, vocalization, skull, and brain shape…) overlap those affected in the "domestication syndrome". We therefore explored the possibility that DLX5/6 played a role in human evolution and "self-domestication" comparing DLX5/6 genomic regions from Neanderthal and modern humans. We identified an introgressed Neanderthal haplotype (DLX5/6-N-Haplotype) present in 12.6% of European individuals that covers DLX5/6 coding and regulatory sequences. The DLX5/6-N-Haplotype includes the binding site for GTF2I, a gene associated with Williams-Beuren syndrome, a hyper-sociability and hyper-vocalization neurodevelopmental disorder. The DLX5/6-N-Haplotype is significantly underrepresented in semi-supercentenarians (>105 years of age), a well-established human model of healthy aging and longevity, suggesting their involvement in the coevolution of longevity, sociability, and speech.
© The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  DLX5/6 genes; aging; human evolution; self-domestication; social behavior; ultrasonic vocalization

Mesh:

Substances:

Year:  2021        PMID: 34132815      PMCID: PMC8557472          DOI: 10.1093/molbev/msab181

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

Within groups of animals, vocal calls play an essential role in directing social behavior, in transmitting information about food availability and predators and for reproductive and maternal activities. In humans, vocal communication and speech constitute major socialization determinants; their dysfunction represents a core trait of neurodevelopmental conditions such as Autism Spectrum Disorders (ASDs). Several mouse models characterized by genetic lesions associated with human neurodevelopmental disorders have been used to study the genetic and neurological mechanisms supporting the interplay between vocalization and socialization (Caruso, et al. 2020). Mice social behaviors are accompanied by the production of ultrasonic vocalizations (USVs) (Sales 1972) which, in the case of most laboratory strains, present a frequency included between 35 and 110 kHz (Holy and Guo 2005). Mice USVs profiles vary with the type of social activity performed (e.g., courtship, aggression, and fear), with the animal’s developmental stage, with their genetic background and can be altered by induced genetic modifications. Although the link between USVs profiles and social activity is still incompletely analyzed, recent studies have implemented artificial intelligence algorithms to associate USVs of individual mice to specific social actions (Ey, et al. 2020; Sangiamo, et al. 2020), unveiling a functional correlation between specific USVs and social behaviors. The generation of voluntary structured vocalizations requires the coordination of different levels of cognitive and motor activities and involves a complex and diffuse neural network ranging from the prefrontal and motor cortical areas for voluntary initiation of vocalization to several forebrain and brainstem regions for the control of laryngeal movements and respiration (Jurgens 2002; Simonyan 2014). The inputs/outputs to this network are integrated in the midbrain periaqueductal gray (PAG), the central neural region controlling vocalization in most mammalians (Jurgens 2009; Tschida, et al. 2019). The study of developmental disorders affecting vocal communication such as stuttering, verbal dyspraxia, and some types of ASDs has led to the identification of genes involved in the control of speech and/or socio/cognitive faculties (Konopka and Roberts 2016). The most intensively studied of them, the transcription factor FOXP2, is associated with inherited dyspraxia and poor control of the facial musculature. Other “language-associated” genes include for example SHANK1/2/3, FOXP1, GNPTAB, GNPTG, CNTNAP2, and NAGPA; for most of these genes mouse models have been generated resulting almost invariably in alteration of USV and social behavior (Wohr 2014; Caruso, et al. 2020). These animal models suggest that a certain level of conservation exists between the mechanisms controlling vocalization. Homozygous disruption of Foxp2 in the mouse cerebral cortex results in abnormal social approach and altered USVs (Medvedeva, et al. 2019). One of the targets of Foxp2 is the Dlx5/6 locus: the mouse line Foxp2-S321X, which does not produce Foxp2 protein, displays a more than 200-fold increased expression of Dlx6as1 (also known as Evf1/2 or Shhrs) (Vernes, et al. 2006) a long noncoding RNA generated from the Dlx5/6 locus known to play a central role in the control of Dlx gene expression (Faedo, et al. 2004; Feng, et al. 2006). DLX5/6 are pleiotropic genes that control multiple aspects of embryonic development (Merlo, et al. 2000), including limb, craniofacial, and laryngeal morphogenesis (Beverdam, et al. 2002; Depew, et al. 2002; Robledo, et al. 2002; Conte, et al. 2016) and GABAergic neuronal differentiation and function (Perera, et al. 2004; Wang, et al. 2010; Cho, et al. 2015; de Lombares, et al. 2019). In humans, mutations in a 1.5 Mb genomic region including DLX5/6 are associated with Split Hand/Foot Malformation type 1 (SHFM1) (Scherer, et al. 1994; Crackower, et al. 1996), a form of ectrodactyly which can be accompanied by sensorineural deafness, craniofacial malformations with or without intellectual disability (Birnbaum, et al. 2012; Rasmussen, et al. 2016). Recently, a large-scale transcriptomic study on postmortem brains has suggested that the DLX5/6 locus participates to genetic modules involved in ASDs and schizophrenia (Gandal, et al. 2018). During aging, the methylation state of many genes increases or diminishes linearly giving rise to the “methylation clock” which permits to estimate the biological age of organisms, including humans (Guarasci, et al. 2019). The methylation status of the DLX5/6 gene locus increases linearly with age in several mouse and human tissues including the brain (Bell, et al. 2012), suggesting that DLX5/6 expression could be associated with phenotypic changes in aging and lifespan. The general inactivation of Dlx5/6 in the mouse results in perinatal death with a SHFM1 phenotype associated with severe craniofacial defects (Beverdam, et al. 2002; Depew, et al. 2002) while their targeted inactivation only in cephalic neural crest cells (CNCCs) results in more limited craniofacial defects (Shimizu, et al. 2018). In the brain, Dlx5/6 are selectively expressed by GABAergic neurons. Recently, by crossing Dlx5/6 mice with Vgat transgenic animals in which the “Cre-recombinase” gene is expressed selectively by GABAergic neurons, we have generated mice in which the Dlx5/6 DNA-binding capacity is reduced (Dlx5/6) or suppressed (Dlx5/6) only in GABAergic neurons (de Lombares, et al. 2019). Dlx5/6 and Dlx5/6 mice present reduced anxiety, compulsivity, and adiposity, associated with a 33% life expectancy extension (de Lombares, et al. 2019). In this study, we show that Dlx5/6 and Dlx5/6 mice also present different social behaviors and vocalization patterns when compared to their littermates. It has long been noted that morphological differences between modern and archaic Homo is similar to those of domesticated species (Boas 1938). The set of characters affected by Dlx5/6 in the mouse is reminiscent of those distinguishing domesticated species from their wild counterparts (Wilkins, et al. 2014), the so-called “domestication syndrome” characterized by increased socialization and by morphological changes in brain-case shape and size, retraction of the face, and decreased tooth size (Sánchez-Villagra and van Schaik 2019). A similar set of morphological and behavioral changes also occurred in wild house mice that adapted to life in close contact to humans (Geiger, et al. 2018). The “self-domestication hypothesis” suggests that human evolution (Zanella, et al. 2019) can be better understood in the light of a domestication process leading to increased socialization and communication accompanied by progressive changes in morphological characters such as craniofacial features (for a review, see Šimić, et al. 2020). Modern humans, the sole survivor of the group of hominins, differ from other Hominidae by many anatomical and physiological characters and, most remarkably, for their cognitive and linguistic capacities which reflect the singularity of our brain. Comparison of modern and archaic Homo genomes has played a pivotal role in the quest of Homo sapiens evolutionary history and in defining what makes us modern humans (see e.g., Prufer, et al. 2014; Racimo, et al. 2014; Racimo 2016; Peyregne, et al. 2017; Hajdinjak, et al. 2018). Comparison of Neanderthal, Denisovan, and modern human genomes has shown that interbreeding occurred between these different human groups until 45,000–65,000 years ago (Meyer, et al. 2012; Sankararaman, et al. 2014; Dannemann, et al. 2016; Vernot, et al. 2016). Negative selection against introgression is expected to have had the strongest impact on genome-wide Neanderthal ancestry during the few early generations after admixture, while some introgressed regions were conserved, and were even under positive selection. Moreover, drift and demographic events (such as admixture) can determine shifts in ancestry proportions (Harris and Nielsen 2016; Petr, et al. 2019). There is increasing interest to decode the biological, phenotypic, and medical consequences of Neanderthal legacy in the modern human genome (Simonti, et al. 2016; Dolgova and Lao 2018; Rotival and Quintana-Murci 2020). To understand the coevolution of human traits, a particular attention has been given to pleiotropic genes, which can simultaneously control the development of numerous characters. For example, in prospective studies trying to identify genes capable of affecting simultaneously craniofacial shape, osteogenesis, and language abilities Boeckx and Benitez-Burraco have proposed a network of physically or functionally interconnected genes including the RUNX, FOXP, and DLX families and their regulators (Boeckx and Benitez-Burraco 2014b, 2015). Here, we show that the inactivation of Dlx5/6 in mouse GABAergic neurons results in a hyper-vocalization phenotype and in a sharp increase of dynamic social interactions. Furthermore, we identify an introgressed Neanderthal haplotype (DLX5/6-N-Haplotype) representing 6.3% of European alleles, which covers DLX6 and DLX5 coding and regulatory sequences including SNPs associated with either hyper- (Williams–Beuren syndrome) or hypo- (ASDs) sociability and vocalization syndromes (Poitras, et al. 2010).

Results

Dlx5/6 Expression in GABAergic Neurons Affects Mouse Social Ultrasonic Vocalizations

To analyze the social ultrasonic vocalization phenotypes (UVs) resulting from the simultaneous deletion of Dlx5/6 homeodomains in adult GABAergic neurons, we applied the resident/intruder experimental paradigm to groups of control, Dlx5/6 (heterozygous deletion) and Dlx5/6 (homozygous deletion) adult female mice (Moles, et al. 2007). In this experimental setting, the animals to be tested (resident) are isolated 3 days before being exposed to an unfamiliar group-housed control adult mouse (intruder). Although UVs are simultaneously recorded for the pair of mice (resident and intruder), previous studies have shown that the contribution of the new comer is minor in comparison to that of the occupant (Hammerschmidt, et al. 2012). The number and types of vocalizations emitted during a 10 min test period (v/10 min) by 10 Dlx5/6, 9 Dlx5/6, and 15 control females confronted to a control female intruder were individually recorded. Most (14/15; 93%) control mice emitted less than 700 v/10 min and only one emitted 849 v/10 min. In contrast, 30% (3/10) of the Dlx5/6 and 56% (5/9) of the Dlx5/6 mice presented a unique vocalization profile emitting more than 700 v/10 min, with two heterozygous and four homozygous emitting more than 1,000 v/10 min and up to 1,400 v/10 min (fig. 1).
Fig. 1.

Number of calls/10 min emitted by Dlx5/6 and Dlx5/6 mice. Number of calls emitted by of Dlx5/6 (n = 10), Dlx5/6 (n = 9), and control (n = 15) 6 months old female mice confronted to a genetically normal female mouse intruder for 10 min. The number and types of vocalization were quantified. (A) 93% of control mice emitted less than 700 vocalizations during the 10 min tests, only one outlier emitted 849 calls. In contrast, 30% of Vgat (3/10) and 56% of Vgat (5/9) mice emitted between 850 and 1400 calls/10 min. (B) Percentage of mice emitting less (pink) or more (red) than 700 calls/10 min. The difference between each genotype is highly significant (***P < 0.001, Chi-square test).

Number of calls/10 min emitted by Dlx5/6 and Dlx5/6 mice. Number of calls emitted by of Dlx5/6 (n = 10), Dlx5/6 (n = 9), and control (n = 15) 6 months old female mice confronted to a genetically normal female mouse intruder for 10 min. The number and types of vocalization were quantified. (A) 93% of control mice emitted less than 700 vocalizations during the 10 min tests, only one outlier emitted 849 calls. In contrast, 30% of Vgat (3/10) and 56% of Vgat (5/9) mice emitted between 850 and 1400 calls/10 min. (B) Percentage of mice emitting less (pink) or more (red) than 700 calls/10 min. The difference between each genotype is highly significant (***P < 0.001, Chi-square test). This observation demonstrates the presence of an incompletely penetrant vocalization phenotype associated with the reduction of the level of expression of Dlx5/6 in GABAergic neurons, characterized by an allele-dosage-dependent increase in the number of calls per minute. Some illustrative examples of vocalization profiles recorded from normal and mutant resident mice are presented in figure 2. Control animals (fig. 2) and low-vocalizing mutants (fig. 2) emitted bursts of vocal signals separated by silent intervals or did not emit any call for long periods of time (fig. 2). In contrast, the high-vocalizing Dlx5/6 mice emitted uninterrupted series of calls extending for many seconds without any detectable interruption or pause (fig. 2), this pattern of vocalization being never observed in control animals.
Fig. 2.

Spectrograms of control and Dlx5/6 mice. Representative spectrograms from four controls (A–D) and four Dlx5/6 (E–H) mice. In each group of animals, some individuals (e.g., D, H) emitted very few vocalizations while others emitted sequences of calls separated by silent intervals (A–C, G). Hyper-vocalizing Dlx5/6 mice emitted long, uninterrupted, sequences of calls (E, F) which were never observed in controls or low-vocalizing animals.

Spectrograms of control and Dlx5/6 mice. Representative spectrograms from four controls (A–D) and four Dlx5/6 (E–H) mice. In each group of animals, some individuals (e.g., D, H) emitted very few vocalizations while others emitted sequences of calls separated by silent intervals (A–C, G). Hyper-vocalizing Dlx5/6 mice emitted long, uninterrupted, sequences of calls (E, F) which were never observed in controls or low-vocalizing animals. The total time spent vocalizing by mice emitting a high number of vocalizations was about 4 times longer than that of low-vocalizing littermates (fig. 3). We categorized the types of vocalization emitted in five major groups short, simple, complex, frequency jumps, and unstructured as previously described (Ey, et al. 2013). The analysis of the specific types of calls emitted by each animal showed that high-vocalizing animals presented a significant increase in the number of complex vocalizations (“modulated + complex” according to Ey, et al. (2013)) which are the longest vocalization types emitted by mice and can last up to 0.15 ms (fig. 3). Complex vocalizations have been recently associated with more social behaviors (Ey, et al. 2020). Therefore, in this experimental paradigm, designed to detect social vocalization, we observed a new partially penetrant phenotype characterized by an increase in the number calls, and differences in vocal repertoire and “prosody” (Lahvis, et al. 2011).
Fig. 3.

Total vocalization time and call types emitted by hyper-vocalizing and low-vocalizing mice. (A) Total time spent vocalizing by mice emitting less or more than 700 calls/10 min. (B) Percentage of call types produced by mice emitting less or more than 700 calls/10 min. Five types of emissions have been considered as in Ey, et al. (2013): Short, Simple (Flat + Upward + Downward), Complex (Modulated + Complex), Unstructured, and Frequency jumps. Hyper-vocalizing animals vocalize, in average, four times more than low vocalizing animals and emit significantly more complex and modulated calls. (***P < 0.001; *P < 0.05, Mann–Whitney U test).

Total vocalization time and call types emitted by hyper-vocalizing and low-vocalizing mice. (A) Total time spent vocalizing by mice emitting less or more than 700 calls/10 min. (B) Percentage of call types produced by mice emitting less or more than 700 calls/10 min. Five types of emissions have been considered as in Ey, et al. (2013): Short, Simple (Flat + Upward + Downward), Complex (Modulated + Complex), Unstructured, and Frequency jumps. Hyper-vocalizing animals vocalize, in average, four times more than low vocalizing animals and emit significantly more complex and modulated calls. (***P < 0.001; *P < 0.05, Mann–Whitney U test).

Dlx5/6 Expression in Brain Regions Which Control Vocalization

Vocal communication depends on a complex neural network that includes regions of the forebrain, of the basal ganglia, and of the brainstem (Jurgens 2002). These multiple inputs are integrated by the periaqueductal gray (PAG), a midbrain territory that is at the interface between the telencephalon and phonation/respiration motor centers. As all these brain territories include GABAergic neurons, we determined in which of them Dlx5 is expressed. To this end, we analyzed the distribution of ß-galactosidase staining in Dlx5adult mouse brains (fig. 4) (Acampora, et al. 1999). We found that Dlx5 is not expressed in the PAG or other midbrain regions (fig. 4) nor in posterior motor areas. On the contrary, Dlx5 is strongly expressed in the cerebral cortex (fig. 4), in the hypothalamus (fig. 4), and in the striatum (fig. 4). These observations suggest that Dlx5/6 affect central cognitive and emotional inputs to vocal communication but do not affect the “mechanics” of phonation.
Fig. 4.

Expression of Dlx5 in different adult brain regions involved in vocalization. X-gal staining on serial sections of Dlx5 brains. ß-Galactosidase activity was not detected in the brainstem (A), in the cerebellum and in the midbrain including the ventro-lateral periacqueductal gray (vlPAG) a region which integrates sensory and cognitive inputs and controls vocalization motor centers. Dlx5 was expressed in the prefrontal cortex (B) and in other cortical regions (cx), in the striatum (C) and in hypothalamic nuclei (D). AHP, anterior hypothalamic area; cc, corpus callosum; cx, cortex; DM, Dorsomedial hypothalamus; PVN, para-ventricular nucleus; SCh, suprachiasmatic nucleus; VMH, ventro medial hypothalamus.

Expression of Dlx5 in different adult brain regions involved in vocalization. X-gal staining on serial sections of Dlx5 brains. ß-Galactosidase activity was not detected in the brainstem (A), in the cerebellum and in the midbrain including the ventro-lateral periacqueductal gray (vlPAG) a region which integrates sensory and cognitive inputs and controls vocalization motor centers. Dlx5 was expressed in the prefrontal cortex (B) and in other cortical regions (cx), in the striatum (C) and in hypothalamic nuclei (D). AHP, anterior hypothalamic area; cc, corpus callosum; cx, cortex; DM, Dorsomedial hypothalamus; PVN, para-ventricular nucleus; SCh, suprachiasmatic nucleus; VMH, ventro medial hypothalamus.

Social Behavior of Hyper-Vocalizing Mice

The social behavior of the mice on which we had quantified UVs was analyzed with the resident/intruder experimental setting described above. Behavior and social interactions were filmed and analyzed as previously described (de Chaumont, et al. 2019). Social behavior between individuals of the same genotype was very variable, resulting in no significant difference in sociability between control, Dlx5/6 and Dlx5/6 mice (de Lombares, et al. 2019). However, when we confronted the groups of hyper-vocalizing (>700 calls/10 min) and low-vocalizing (<700 calls/10 min) animals, we found a highly significant association between hyper-vocalization and hyper-sociability. Hyper-vocalizing animals spent more than twice the time following or approaching the intruder than the low-vocalizing individuals (fig. 5).
Fig. 5.

Social behavior of hyper-vocalizing and low-vocalizing mice. The social interaction behavior of Dlx5/6 (n = 10), Dlx5/6 (n = 9), and control (n = 15) mice placed in the same cage with a control visitor was monitored using a protocol (de Chaumont, et al. 2019; de Lombares, et al. 2019), which associates computer vision and artificial intelligence to identify and categorize individual social interactions. The types of behavior recorded were classified as follows: 1) isolated behavior (moving alone, rearing alone, stopped alone, jump alone, huddling alone), 2) static interactions (rearing in contact, oral–oral contact, oro-genital contact, side-by-side in same or opposite orientation, 3) dynamic interactions (moving in contact, train of 2 mice, following the same path), 4) Social approach (making contact, approaching in rearing, approaching), 5) social escape (breaking contact, escaping), a detailed description of each behavior is found in (de Chaumont, et al. 2019). The time spent by each mouse (s) adopting a specific behavior was measured over a 10 min recording. (A) Normalized results: for each behavior, the average time spent by low-vocalizing animals was set as one (pink) and the ratio with the time spent by hyper-vocalizing mice shown (red). For example, “Dynamic Interaction” was adopted 2.3 times more frequently by hyper-vocalizing mice than by low-vocalizing animals. (B) Actual values in seconds recorded for each social behavioral trait. (***P < 0.001; *P < 0.05, Mann–Whitney U test).

Social behavior of hyper-vocalizing and low-vocalizing mice. The social interaction behavior of Dlx5/6 (n = 10), Dlx5/6 (n = 9), and control (n = 15) mice placed in the same cage with a control visitor was monitored using a protocol (de Chaumont, et al. 2019; de Lombares, et al. 2019), which associates computer vision and artificial intelligence to identify and categorize individual social interactions. The types of behavior recorded were classified as follows: 1) isolated behavior (moving alone, rearing alone, stopped alone, jump alone, huddling alone), 2) static interactions (rearing in contact, oral–oral contact, oro-genital contact, side-by-side in same or opposite orientation, 3) dynamic interactions (moving in contact, train of 2 mice, following the same path), 4) Social approach (making contact, approaching in rearing, approaching), 5) social escape (breaking contact, escaping), a detailed description of each behavior is found in (de Chaumont, et al. 2019). The time spent by each mouse (s) adopting a specific behavior was measured over a 10 min recording. (A) Normalized results: for each behavior, the average time spent by low-vocalizing animals was set as one (pink) and the ratio with the time spent by hyper-vocalizing mice shown (red). For example, “Dynamic Interaction” was adopted 2.3 times more frequently by hyper-vocalizing mice than by low-vocalizing animals. (B) Actual values in seconds recorded for each social behavioral trait. (***P < 0.001; *P < 0.05, Mann–Whitney U test). Therefore, the same animals in which the mutation of Dlx5/6 in GABAergic neurons induced hyper-vocalization presented also a hyper-sociability phenotype.

Introgression of a Neandertal-like Haplotype in the DLX5/6 Locus

Patients carrying mutations in the DLX5/6 locus or mouse models in which these genes are deregulated present abnormalities in at least five sets of characters which differentiate H. sapiens from Homo Neanderthalensis: 1) brain size and shape, 2) limb morphology, 3) craniofacial shape, 4) middle ear morphology, and 5) bone density. Comparing the DLX5/6 locus of modern humans (fig. 6) to those of Neanderthal (Altaï, Vindija), Denisovan, and other Hominidae, we identified 298 SNPs with rare alleles that are shared between modern humans and Neanderthals and are absent from African populations in an interval of 400 kb centered on Dlx5 coding sequences. Among them we identified, in the proximal region of DLX5/6, five SNPs (rs12704907, rs4729325, rs1005169, rs1724297, and rs1207733), all located in noncoding regions, that cosegregate at more than 80% in individuals from the worldwide 1000 Genomes database (The 1000 Genomes Project Consortium), forming a “core” 37,160 bp haplotype (DLX5/6-N-Haplotype) defined by consecutive sites with Neanderthal-like alleles (table 1, fig. 6). This haplotype includes DLX6 coding and regulatory sequences, the intergenic region between DLX6 and DLX5 and DLX5 intron 3 and exon 3.
Fig. 6.

Linkage disequilibrium analysis of the distribution of the DLX5/6-N-Haplotype in different populations. (A) For each population, the width of the color bar represents the allele frequency for each SNP. SNPs linked by a solid bar cosegregate; if cosegregation is less than 100%, the width of the bar diminishes accordingly. The allele frequency of the most represented Neanderthal SNP of the DLX5/6-N-Haplotype is indicated for each population. The major SNPs constituting the haplotype are indicated. The scale bar represents a 5% allelic frequency within a population. EAS, East Asia populations; SAS, South Asia populations; EUR, European populations; AMR, American populations; AMR/AFR, Afro-americans; AFR, Africans. (B) Geographical distribution of the DLX5/6-N-Haplotype. The allele frequency of the most represented SNP constituting the DLX5/6-N-Haplotype is represented on the earth map. The higher frequency of the DLX5/6-N-Haplotype is seen in Europe and South Asia, the haplotype is not present in African populations and in Peruvians from Lima. CHB, Han Chinese in Beijing, China; JPT, Japanese in Tokyo, Japan; CHS, Southern Han Chinese; CDX, Chinese Dai in Xishuangbanna, China; KHV, Kinh in Ho Chi Minh City, Vietnam; CEU, Utah Residents (CEPH) with Northern and Western European Ancestry; TSI, Tuscans in Italy; FIN, Finnish in Finland; GBR, British in England and Scotland; IBS, Iberian Population in Spain; YRI, Yoruba in Ibadan, Nigeria; LWK, Luhya in Webuye, Kenya; GWD, Gambian in Western Divisions in the Gambia; MSL, Mende in Sierra Leone; ESN, Esan in Nigeria; ASW, Americans of African Ancestry in South-West USA; ACB, African Caribbeans in Barbados; MXL, Mexican Ancestry from Los Angeles USA; PUR, Puerto Ricans from Puerto Rico; CLM, Colombians from Medellin, Colombia; PEL, Peruvians from Lima, Peru; GIH, Gujarati Indian from Houston, Texas; PJL: Punjabi from Lahore, Pakistan; BEB: Bengali from Bangladesh; STU: Sri Lankan Tamil from the UK; ITU, Indian Telugu from the UK. (C) Neighbor-joining tree computed on five modern human haplotypes (I–V) identified through haplotype network analysis, three archaic human haplotypes (Vindija, Denisovan, and Altai) and the chimpanzee. The values in blue boxes represent the support for each branch (1,000 bootstraps). The pie charts point out the frequency in the five continental population groups and in the semi-supercentenarians of each modern human core haplotype. The number reported near each pie-chart indicates the frequency of each core-haplotype among 1000 Genomes individuals and, in brackets, the frequency of the core haplotypes among the semi-supercentenarians.

Table 1.

SNPs Included in the DLX5/6-N-Haplotype.

rs_IDIntrogressedAltaiVindijaDenisovaChimpBononboGorillaOrang
rs12704907CCCCCTTTTTT
rs4729325CTTCCTTTTTT
rs1005169ACCCACCCCCC
rs1724297AGGAGGGGGGG
rs1207733AAAAAGGGGGG
Linkage disequilibrium analysis of the distribution of the DLX5/6-N-Haplotype in different populations. (A) For each population, the width of the color bar represents the allele frequency for each SNP. SNPs linked by a solid bar cosegregate; if cosegregation is less than 100%, the width of the bar diminishes accordingly. The allele frequency of the most represented Neanderthal SNP of the DLX5/6-N-Haplotype is indicated for each population. The major SNPs constituting the haplotype are indicated. The scale bar represents a 5% allelic frequency within a population. EAS, East Asia populations; SAS, South Asia populations; EUR, European populations; AMR, American populations; AMR/AFR, Afro-americans; AFR, Africans. (B) Geographical distribution of the DLX5/6-N-Haplotype. The allele frequency of the most represented SNP constituting the DLX5/6-N-Haplotype is represented on the earth map. The higher frequency of the DLX5/6-N-Haplotype is seen in Europe and South Asia, the haplotype is not present in African populations and in Peruvians from Lima. CHB, Han Chinese in Beijing, China; JPT, Japanese in Tokyo, Japan; CHS, Southern Han Chinese; CDX, Chinese Dai in Xishuangbanna, China; KHV, Kinh in Ho Chi Minh City, Vietnam; CEU, Utah Residents (CEPH) with Northern and Western European Ancestry; TSI, Tuscans in Italy; FIN, Finnish in Finland; GBR, British in England and Scotland; IBS, Iberian Population in Spain; YRI, Yoruba in Ibadan, Nigeria; LWK, Luhya in Webuye, Kenya; GWD, Gambian in Western Divisions in the Gambia; MSL, Mende in Sierra Leone; ESN, Esan in Nigeria; ASW, Americans of African Ancestry in South-West USA; ACB, African Caribbeans in Barbados; MXL, Mexican Ancestry from Los Angeles USA; PUR, Puerto Ricans from Puerto Rico; CLM, Colombians from Medellin, Colombia; PEL, Peruvians from Lima, Peru; GIH, Gujarati Indian from Houston, Texas; PJL: Punjabi from Lahore, Pakistan; BEB: Bengali from Bangladesh; STU: Sri Lankan Tamil from the UK; ITU, Indian Telugu from the UK. (C) Neighbor-joining tree computed on five modern human haplotypes (I–V) identified through haplotype network analysis, three archaic human haplotypes (Vindija, Denisovan, and Altai) and the chimpanzee. The values in blue boxes represent the support for each branch (1,000 bootstraps). The pie charts point out the frequency in the five continental population groups and in the semi-supercentenarians of each modern human core haplotype. The number reported near each pie-chart indicates the frequency of each core-haplotype among 1000 Genomes individuals and, in brackets, the frequency of the core haplotypes among the semi-supercentenarians. SNPs Included in the DLX5/6-N-Haplotype. To find evidences supporting the DLX5/6-N-Haplotype introgression, we firstly used the introgression probabilities for each SNP reported by Sankararaman, et al. (2014) for all European and all Asian individuals. The average posterior probabilities computed along the putatively introgressed haplotype for Europeans and Asians are equal to 0.0431 and 0.0423, respectively. Both these values are significantly higher (t-test P value < 2.2E−16) than the chromosomal average posterior probability that is equal to 0.0252 for both populations. Moreover, a haplotype (hg38 - chr7:96991129-97022094) of Neanderthal (Vindija) origin has been recently identified applying a two-state hidden Markov model to search for archaic fragments (Denisovan, Altai, and Vindija Neanderthal) in the whole genome of the Icelandic population (see Supplementary Materials in Skov, et al. 2020). The identified haplotype corresponds to the DLX5/6-N-Haplotype. Although the DLX5/6-N-Haplotype is the most represented, some of the Icelanders present larger haplotypes which include several additional SNPs of Neanderthal origin. Similarly, in the 1000 Genomes database, the five “core SNPs,” present in every population except Africa, cosegregate with other SNPs most of which correspond to those described in Skov et al. (2020) (supplementary table 2S, Supplementary Material online; supplementary fig. 1S, Supplementary Material online). Further evidence in favor of introgression is provided by the geographic distribution of archaic-like core haplotypes. The proportion of individuals carrying the DLX5/6-N-Haplotype is variable between populations, from 1.5% in East Asia to 6.3% in Europe and absent in continental Africa (fig. 6). Overall, the proportion of the DLX5/6-N-Haplotype is low in East Asia and America (less than 4%) and completely absent from Africans and Peruvians from Lima. In South Asia and Europe, the DLX5/6-N-Haplotype is much more frequent reaching up to 8.2% in the British population of England and Scotland and in Sri Lankan Tamil in the UK. In African-American and Caribbean, the haplotype is poorly represented (fig. 6). The five reported SNPs are in LD in the European populations and the r2 are reported in supplementary table 1S, Supplementary Material online for different populations of the 1000 Genomes Project. To find further evidences in favor of introgression in this genomic area we first determined the haplotype structure of this region (hg38 - chr7:96991129-97022094). We used the inferred 1000 Genomes phase III haplotypes for 2,504 individuals from 26 populations. We reconstructed 5,008 individual haplotypes and 1,020 unique haplotypes. By clustering haplotypes (see Materials and Methods) we identified 5 different clusters (core haplotypes) as reported in figure 6. We then built a rooted neighbor-joining tree for the region (hg38 - chr7:96991129-97022094) computing the pairwise genetic distances between the sequences of five modern haplotypes together with the Vindija, Altai, and Denisovan sequences. Branch support values were derived using 1,000 bootstraps (fig. 6). We included the orthologous sequence for chimpanzee. The analyses showed that haplotype I is the only haplotype similar to the archaic genome sequences and it is the most similar to the Vindija Neanderthal genome. Haplotype I overlaps with putatively “core” introgressed haplotype (DLX5/6-N-Haplotype) and it is predominantly found in the European individuals (6.3%) and largely restricted to populations outside Africa. No other modern human haplotypes (II, III, IV, and V) are similar to archaic genomes. Allele sharing with archaic genomes at polymorphic sites in non-African populations can result either from incomplete lineage sorting or admixture. To distinguish between these two scenarios, we looked at the length of the haplotypes. Introgressed haplotypes present fewer recombinations than those predicted from the time interval that separates modern humans from the common ancestral population. The DLX5/6-N-Haplotype is significantly longer than expected under the null hypothesis of incomplete lineage sorting (P value = 1.08e−5), supporting an archaic origin of the haplotype.

Association of the DLX5/6 Locus with Human Cognitive Impairment and Speech Defects

Deletions or mutations at the DLX5/6 locus are associated with Split Hand Foot Malformation Type 1 (SHFM1), a form of ectrodactyly in which limb malformations can be associated with cognitive, craniofacial, or hearing defects. To evaluate the possible implications of the DLX5/6 locus and of the DLX5/6-N-Haplotype in particular for human development and evolution, we reviewed the cognitive phenotypes reported for SHFM1-patients carrying known genetic alterations within this region (Rasmussen, et al. 2016). This analysis permitted to associate specific genomic subregions to syndromic traits such as mental retardation, language delay, and autism (fig. 7).
Fig. 7.

Association of different pathological conditions with the DLX5/6 genomic region. The central black line represents the 7q21.3 locus, which includes at least 5 genes DLX5, DLX6, SEM1, SLC25A13, and DYNC1I1. A long noncoding antisense RNA, DLX6-AS, is also transcribed from this region. Purple boxes in the upper part of the figure represent the phenotypic consequences observed in Split Hand Foot Malformation syndrome (SHFM) patients carrying deletion/mutations in this locus. While all SHFM patients present ectrodactyly, in certain cases this malformation is accompanied by craniofacial defects, hearing loss and/or cognitive defects. The closer the deletion are to DLX5/6 coding sequences the higher is the frequency of patients presenting also cognitive impairment. Point mutations within the DLX5/6 region and proximal or intragenic enhancers have been associated with autism spectrum disorders (ASD) without ectrodactyly or craniofacial defects and are included in the DLX5/6-N-Haplotype. Three conserved enhancers (I56i, I56ii, and Mef2-BA) that direct DLX5/6 expression in the brain (indicated in yellow) are also part of the haplotype. A point mutation within the I56i enhancer, prevents binding of GTF2I, a transcription factor deleted in Williams–Beuren syndrome, a condition characterized by hyper-vocalization and hyper-sociability (Poitras, et al. 2010).

Association of different pathological conditions with the DLX5/6 genomic region. The central black line represents the 7q21.3 locus, which includes at least 5 genes DLX5, DLX6, SEM1, SLC25A13, and DYNC1I1. A long noncoding antisense RNA, DLX6-AS, is also transcribed from this region. Purple boxes in the upper part of the figure represent the phenotypic consequences observed in Split Hand Foot Malformation syndrome (SHFM) patients carrying deletion/mutations in this locus. While all SHFM patients present ectrodactyly, in certain cases this malformation is accompanied by craniofacial defects, hearing loss and/or cognitive defects. The closer the deletion are to DLX5/6 coding sequences the higher is the frequency of patients presenting also cognitive impairment. Point mutations within the DLX5/6 region and proximal or intragenic enhancers have been associated with autism spectrum disorders (ASD) without ectrodactyly or craniofacial defects and are included in the DLX5/6-N-Haplotype. Three conserved enhancers (I56i, I56ii, and Mef2-BA) that direct DLX5/6 expression in the brain (indicated in yellow) are also part of the haplotype. A point mutation within the I56i enhancer, prevents binding of GTF2I, a transcription factor deleted in Williams–Beuren syndrome, a condition characterized by hyper-vocalization and hyper-sociability (Poitras, et al. 2010). Most families carrying mutations limited to the genomic region encompassing only limb-specific enhancers (Birnbaum, et al. 2012) present SHFM1 with no other physical or mental problems, with the exception of one case of mild cognitive defects. Families with deletions or translocations including craniofacial-specific enhancers present SHFM1 with hearing loss with or without other craniofacial anomalies and a high incidence of cognitive defects (Rasmussen, et al. 2016). Large heterozygous deletions covering both regulatory and coding DLX5/6 sequences present SHFM1 with craniofacial defects associated in 5 out of 7 cases to severe mental retardation. Patients carrying mutations restricted to DLX5/6 intergenic region or their coding sequences have a higher probability to include members with ASDs or speech delay but do not present the limb deformities of SHFM1 (Hamilton, et al. 2005; Nakashima, et al. 2010; Poitras, et al. 2010; Chang, et al. 2011). The DLX5/6-N-Haplotype includes enhancers (I56i, I56ii, and MEF2) capable to direct DLX5/6 expression in the brain (Fazel Darbandi, et al. 2016; Assali, et al. 2019). Interestingly, a polymorphism in enhancer I56i is associated with the binding of GTF2I, a transcription factor deleted in Williams–Beuren syndrome, a complex condition characterized by craniofacial, hyper-vocalization, and hyper-sociability (fig. 7), signs which has been considered an entry point to understand the evolution of the modern human face and prosociability (Zanella, et al. 2019). Through PheWeb, we examined the association between the SNPs constituting the DLX5/6-N-Haplotype and the phenomes of the Michigan genomics initiative and of the UK Biobank. No significant P values (<5 × 10−8) were reported. Results with P values < 0.01 are reported in supplementary tables 3S–6S, Supplementary Material online. The GeneATLAS analysis showed that the most significant phenotype is “Standing height” (P value = 8.92E−08 for rs1207733-G). A detailed list of all phenotypes associated with each SNPs is reported in supplementary tables 7S–11S, Supplementary Material online. ClinVar, GWAS Catalog, Gwas Central, GTEX PORTAL, and clinGen were examined for the same SNPs, but no results were found. Moreover, we searched if the list of the five variants included in the core haplotype (or variants in LD with those variants) have been previously associated with a trait or disease using the function LDtrait of the R package LDlinkR (Myers, et al. 2020) and no association were reported.

The DLX5/6-N-Haplotype Allele is Under-represented in Semi-supercentenarians

As the DLX5/6 locus has been associated with ASDs and mental retardation (Gandal, et al. 2018 and fig. 7), we analyzed the DLX5/6-N-Haplotype frequency in the MSSNG database of ASDs families (Yuen et al. 2017), and we found a nonsignificant association (P = 0.16) with the disease. This finding, however, cannot exclude an association of the haplotype to specific mental traits not necessarily represented in the ASDs database. We then focused on other traits that can be associated with social vocalization. Notably, mortality risk has been correlated to social environment in humans and other social mammals (Snyder-Mackler, et al. 2020). Since inactivation of Dlx5/6 in GABAergic neurons results in a 33% life span extension in mice (de Lombares, et al. 2019), we focused on the population of semi-supercentenarians (CEN), that is people over 105 years of age (105+) characterized by exceptional longevity (Garagnani, et al. 2021). The frequency of the DLX5/6-N-Haplotype was calculated for all the European populations present in the 1000 Genomes database and estimated for a cohort of 81 Italian semi-supercentenarians. The 105+ (CEN) presented a 50% under-representation of the haplotype when compared to any other European population with the only exception of IBS, and constituted the most divergent group (fig. 8).
Fig. 8.

Allele frequency of the DLX5/6-N-Haplotype in the semi-supercentenarian population. (A) The allele frequency of DLX5/6-N-Haplotype European populations (1000 Genomes Project) is confronted to the CEN population of 81 semi-supercentenarians (105+ yo) and to the ITA population (Tuscany 1000 Genomes population [TSI] + 36 controls genomes matching the CEN). While the ITA population does not show any significant difference from the Europeans, a significant reduction of the DLX5/6-N-Haplotype allele frequency is observed in the semi-supercentenarian population (*P < 0.05, Fst distribution). (B) Density plot build considering all the Fst values calculated considering all the archaic haplotypes described in Skov, et al. (2020) for CEN versus ITA groups. The blue arrow indicates the value of Fst calculated for the DLX5/6-N-Haplotype.

Allele frequency of the DLX5/6-N-Haplotype in the semi-supercentenarian population. (A) The allele frequency of DLX5/6-N-Haplotype European populations (1000 Genomes Project) is confronted to the CEN population of 81 semi-supercentenarians (105+ yo) and to the ITA population (Tuscany 1000 Genomes population [TSI] + 36 controls genomes matching the CEN). While the ITA population does not show any significant difference from the Europeans, a significant reduction of the DLX5/6-N-Haplotype allele frequency is observed in the semi-supercentenarian population (*P < 0.05, Fst distribution). (B) Density plot build considering all the Fst values calculated considering all the archaic haplotypes described in Skov, et al. (2020) for CEN versus ITA groups. The blue arrow indicates the value of Fst calculated for the DLX5/6-N-Haplotype. We found a significant underrepresentation of the DLX5/6-N-Haplotype (P value = 0.026) in the CEN population when compared to the ITA control group composed by TSI from 1000 Genomes Project and 36 Italian healthy individuals described in Sazzini, et al. (2020). This reduction was also confirmed in the total European population (P < 0.05) (fig. 8). The same pattern was confirmed for the haplotypes reported in the Neighbor-joining tree (fig. 6): the frequency of haplotype I (that overlap with DLX5/6-N-Haplotype) in semi-supercentenarians is 3.0%, whereas the frequencies of core haplotypes II, III, IV, and V are 41.4%, 19.8%, 8.0%, and 27.8%, respectively. Principal component analysis (PCA) was performed to control for population stratification considering semi-supercentenarians and ITA and it shows that the two groups cluster together (supplementary fig. S2A, Supplementary Material online), even if considered in the context of other European populations from 1000 Genomes Project (supplementary fig. S2B, Supplementary Material online). We also calculated Fst between CEN, ITA, and European populations from 1000 Genomes Project (TSI, GBR, FIN, IBS, and CEU), considering 110,692 archaic haplotypes located in all chromosomes from Skov, et al. (2020). We performed a distribution of Fst computed between all population pairs (CEN, ITA and TSI, GBR, FIN, IBS, CEU of 1000 Genomes Project) and for each distribution we checked if the Fst calculated for DLX5/6-N-Haplotype is significantly different from the reported distribution. The comparisons of CEN with ITA showed a significant difference (P value = 0.018). The difference is still significant (P value = 0.03) if the same analyses is performed only including low-frequency haplotypes (<10% in the ITA group). The Fst of DLX5/6-N-Haplotype in other comparisons including CEN (CEN vs. FIN, CEN vs. CEU, CEN vs. GBR, and CEN vs. TSI) is always greater than the 90th percentile of the empirical distribution with the only exception of CEN versus IBS. Fst distribution of all archaic haplotypes for CEN versus ITA is reported in figure 8 where the Fst value related to DLX5/6-N-Haplotype is indicated by a blue arrow, suggesting that this haplotype significantly differentiates the two populations (Fst = 0.093; P value = 0.018). Moreover, to exclude that the identified signal is related to other haplotypes including nonarchaic variants in LD with variants in our region we performed two analyses: we analyzed haplotypes in the surrounding regions selecting haplotypes that include nonarchaic variants in even weak LD with our archaic variants (downstream our region hg38: chr7:96,991,129-97,022,094); we analyzed a new haplotype built with six non-archaic variants in LD (r2 > 0.7) with all the archaic variants within the analyzed region (hg38: chr7:96,991,129-97,022,094). First, we performed a LD analyses of the area to identify all the SNPs in LD with archaic variants (we reported in supplementary fig. 3S, Supplementary Material online a map with pattern of LD in the ITA population, measured by the R2 statistic including a wide region around DLX5/6-N-Haplotype). Archaic-variants are in weak LD with non-archaic variants located downstream. These variants define two non-archaic haplotypes (indicated with green and red segments in supplementary fig. 3S, Supplementary Material online). We thus computed the Fst values between CEN and ITA for both the non-archaic haplotypes (Fst = 0.006 and Fst = 0.005, respectively), and we checked if the calculated Fst values are significant from the reported Fst distribution. No significant P values were observed (P value = 0.364 and P value = 0.382, respectively), supporting the relevance of inferences on this specific haplotype. Secondly, we selected all the nonarchaic variants in LD (r2 > 0.7) with all the archaic variants in the same region (segment in blue in supplementary fig. 3S, Supplementary Material online) and we calculated Fst between CEN and ITA that is 0.073 (P value = 0.029), lower than the Fst computed for the archaic variants. These results suggest that the larger Fst effect is observed for the haplotype defined by the archaic variants.

Discussion

Here, we have shown that reduction of DLX5/6 expression in mouse GABAergic neurons results in a hyper-vocalization and hyper-socialization phenotype. Furthermore, we identified a DLX5/6-Neanderthal-derived haplotype present in up to 8% of non-African alleles that includes enhancers controlling gene expression in the central nervous system (Fazel Darbandi, et al. 2016) and sequences associated with either hyper- or hypo-socialization/vocalization pathologies such as Williams-Buren syndrome (Poitras, et al. 2010) or ASDs (Gandal, et al. 2018). Interestingly, the DLX5/6-N-Haplotype is underrepresented in the genomes of healthy semi-supercentenarians suggesting an association between DLX5/6 brain expression and healthy aging reminiscent of what has been shown in mouse models (de Lombares, et al. 2019). In social mammals, such as mice and humans, vocal communication is essential for the survival of the group. It is needed to convey diverse information such as location, social status, reproductive availability, emotional context (fear, aggression), food availability, and presence of predators. Such a vast set of inputs and outputs is coordinated by a complex neural network that combines regions of the forebrain, of the basal ganglia and of the brainstem (Jurgens 2002). A key brain region involved in the integration of these neuronal activities is the periaqueductal gray (PAG), a midbrain territory that receives multiple telencephalic inputs and transmit them to phonation and respiration motor centers. As Dlx5/6 are not expressed in the PAG, but only in telencephalic interneurons, their effect on vocalization must derive from cognitive initiation and modulation and not from impairment of laryngeal or respiratory motor centers. In comparison to their normal controls, many Dlx5/6 and Dlx5/6mice emit a higher number of longer “complex” calls often not separated by any interruption. These long-modulated USVs are emitted prevalently when mice are engaged in socially intense behaviors and more rarely during isolated behavior (Ey, et al. 2020). Therefore, the high-vocalization profile of Dlx5/6 mice is predictive of an enhanced social behavior and is in line with our finding of an increase in dynamic socialization of high-vocalizing individuals. Many mouse models have been generated to study the effects of genetic lesions associated with human neurodevelopmental disorders. In particular, the simultaneous study of ultrasonic vocalizations and behavior has permitted to evaluate the impact of specific mutations on social communication and interaction in mouse models of ASDs (Caruso, et al. 2020). For example, female–female social interactions and USV have been analyzed in Shank2 and Shank3 mutant mice showing reduction in the number and duration of USVs associated with diminished sociability (Schmeisser, et al. 2012; Ey, et al. 2013; Wohr 2014). Extrapolation of our results to the human condition would suggest that reduced DLX5/6 expression in GABAergic neurons could be associated with increased social vocalization, a phenotype opposite to ASDs. Our understanding of the origin of psychiatric pathologies such as schizophrenia and ASDs has recently progressed substantially thanks to large genome-wide association studies (GWAs). They have shown that, beyond some specific cases where monogenic mutations cause highly penetrant phenotypes, psychiatric conditions are complex polygenic disorders associated with hundreds of common genetic loci involving genes often associated with synaptic function and neural development (Huguet, et al. 2016; Dennison, et al. 2019). Module analyses of gene-networks implicated in interneuron-dependent ASD and schizophrenia have identified gene expression variations and splicing of DLX5, DLX6, and asDLX6nc. Indeed, the modules including these DLX genes and other key genetic components of GABAergic neurons constitute the major hub for both ASD and schizophrenia (Gandal, et al. 2018). Remarkably, a point mutation within the I56i enhancer is associated with binding of the transcription factor GTF2I, which is deleted in Williams–Beuren syndrome characterized by hyper-vocalization and hyper-sociability (Poitras, et al. 2010). GTF2I is located in 7q11.23, a genomic region which deletion or microduplication provokes complex syndromes with opposite social and speech language behaviors including sociability, speech language, and cognitive alteration associated with neural crest-associated facial features (López-Tobón, et al. 2020). These genetic networks, which include DLX5/6 and control concomitant modifications of craniofacial morphology, cognition, and sociability are proposed to have contributed to human self-domestication (Zanella, et al. 2019). Furthermore, DLX5/6 were proposed as potential contributors to brain and skull globularization and acquisition of human-specific language capacities (Boeckx and Benitez-Burraco 2014a). The origin of vocal language in the genus Homo has been approached through anatomical analyses of the brain and larynx and, more recently, through the comparison of genes involved in language acquisition such as FOXP2 (Dediu and Levinson 2013; Dediu and Christiansen 2016). Remarkably, FOXP2 is located in 7q31, close to the DLX5/6 locus (7q21), and regulates the expression of a large antisense noncoding RNA of DLX6 (asDLX6nc or Evf1/2) (Vernes, et al. 2006, 2011), which is directly involved in the control of Dlx5 and Dlx6 expression (Feng, et al. 2006). The DLX5/6-N-Haplotype covers the complete sequence of DLX6, the DLX5/6 intergenic region, the second intron and third exon of DLX5, three known enhancers for brain expression of DLX5/6: I56i, I56ii, and Mef2-BA (Birnbaum, et al. 2012), and the three first exons of the long noncoding RNA asDLX6nc, which has been implicated in GABAergic neuronal differentiation (Cajigas, et al. 2018). The major SNPs composing the DLX5/6-N-Haplotype are located in noncoding sequences in proximity of the known enhancers and splice sites of DLX5 and asDLX6nc and could well alter their expression profile. Whole-genome analyses of Neanderthal and Denisovan introgressions in 27,566 Icelanders identify the presence of DLX5/6-N-Haplotype comprising the five core SNPs with a 1.8% allelic frequency identified as having a Neanderthal origin. This introgression coexists with other, rare, longer haplotypes (Skov, et al. 2020). The comparison of human genomes with those of other vertebrate species and, in particular, with other Hominidae, has revealed the presence of Human Accelerated Regions (HARs), short sequences presenting an important number of Single Nucleotide Changes (SNCs) specific to H. sapiens (Lindblad-Toh, et al. 2011; Capra, et al. 2013). Interestingly, the DLX5/6 locus includes at the same time 5 HARs and, as shown in this report, an introgressed Neanderthal region. This paradoxical situation might suggest that enhancers implicated in nonneural functions of DLX5/6 such as bone remodeling and reproduction could have been strongly counter-selected, whereas brain-expression driving enhancers could have been conserved in the human population. Interestingly, two of these HARS are located immediately in 5′ from DLX5/6-N-Haplotype (2.1 and 3.1 kb), whereas the other three are more distant (81.5, 634, 1088 kb upstream although in the range of the multiple known modifiers of DLX5/6 expression). The presence of the proximal HARs may have limited the extension of the introgressed haplotype in 5′. Vocal communication and socialization are pivotal traits of the evolution of modern human society. The conservation of the DLX5/6-N-Haplotype in 6.3% of EUR alleles (which represent about 12.6% of the population as the DLX5/6-N-Haplotype is almost invariably heterozygous), might have contributed to the sustainability of modern human society. The analyses of the phenotype/genotype associations which we have performed using UK Biobank and other platforms (supplementary tables 3–11, Supplementary Material online) present fairly high P values and do not permit to draw, at present, any solid correlation. As DLX5/6 are pleiotropic genes with multiple interconnected functions including craniofacial morphogenesis, neuronal differentiation, bone development, and reproduction (Merlo, et al. 2000) one could speculate that the presence of the DLX5/6-N-Haplotype might also affect other traits characteristic of the evolution of H. sapiens, such as metabolism, skull shape and bone density. In particular, one of the main differences between modern humans and Neanderthals has been the composition and size of social groups: while Neanderthal groups were small, composed of 10–30 individuals, modern humans developed complex societies (Duveau, et al. 2019). It is possible, therefore, that the DLX5/6-N-Haplotype could influence characters affecting the structure of modern human society. However, the DLX5/6-N-Haplotype seems to correlate with different aspects of human health in modern human populations and some of them are pathological conditions. This observation is in line with data that demonstrates that a high proportion of Neanderthal introgression might be deleterious for modern humans (Juric, et al. 2016). Thus, we can speculate that the frequency of this haplotype in the modern human population was the result of balancing selection shaped by different evolutionary forces. Future studies are needed to address this point. Interestingly, the DLX5/6-N-Haplotype is underrepresented in semi-supercentenarians, a population considered as a paradigm of healthy aging as it has the capacity to slow the progression of pathological conditions determining reduced mental and physical functions (Vacante, et al. 2012). This observation could be related to the 33% lifespan extension described for Dlx5/6 mice, suggesting that the DLX5/6-N-Haplotype could regulate DLX5/6 neuronal expression (de Lombares, et al. 2019). These data on semi-supercentenarians could be seen in the light of George Williams antagonistic pleiotropy theory (Williams 1957), who proposes that genes controlling more than one phenotypic trait can have been positively selected being favorable early in life and in adulthood but can become detrimental later in life (old age and extreme longevity), phases of life subjected to weaker evolutionary pressures (Williams 1957; De Benedictis and Franceschi 2006; Giuliani, et al. 2017). Our findings could well be integrated within current ideas on the biology of aging which suggest that aging rates, and consequently lifespans, evolve as a function of trade-offs with other characters including social, cognitive, and reproductive capacities (Yang, et al. 2016; Giuliani, et al. 2018; Snyder-Mackler, et al. 2020).

Materials and Methods

Animals

Procedures involving animals were conducted in accordance with European Community and French Ministry of Research directives. The project was reviewed and approved by the “Cuvier” ethical committee of the MNHN (CEEA Cuvier n°68) and validated by the French Ministry of Research (ref. 3917). Mice were housed in light, temperature, and humidity-controlled conditions. Food and water were available ad libitum. Mice were individually identified by a microchip implanted 3 weeks postnatal. Litter sizes and genotypes were kept in record. Dlx5/6 and Dlx5/6 mice, were generated using a Cre/lox strategy to simultaneously remove Dlx5/6 homeodomain-coding sequences specifically in GABAergic neurons (de Lombares, et al. 2019): Dlx5/6 mice (Bellessort, et al. 2016) were crossed with Slc32a1 knock-in mice (here referred to as Vgat mice) in which the “Cre-recombinase” gene is expressed only by GABAergic neurons on a mixed C57BL6/N X DBA/2 genetic background. Phenotypes of Dlx5/6 and Dlx5/6 mice were compared to those of control littermates.

Vocalization and Social Behavior Analysis

Test females were isolated 3 days before the trial to increase their motivation for social interactions. On trial day, the tested female mouse was placed in a test cage (Plexiglas, 50 cm × 25 cm × 30 cm, 100 lux, with fresh bedding) for 30 min habituation. After this time, an unfamiliar group-housed control adult female was introduced. Ultrasonic vocalizations were recorded with a condenser ultrasound microphone Polaroid/CMPA, the interface UltraSoundGate 416-200, and the software Avisoft-SASLab Pro Recorded from Avisoft Bioacoustics (sampling frequency: 300 kHz; FFT-length: 1024 points; 16-bit format). Ultrasonic vocalizations were recorded for the pair of mice tested (one control newcomer and a habituated control, Dlx5/6 or Dlx5/6 mouse). All analyzed animals were 6 months old. Previous studies suggest that the contribution of the newcomer to the vocalization is minor in comparison with that of the occupant (Hammerschmidt, et al. 2012). We measured the latency for the first ultrasonic vocalizations and the total number of ultrasonic vocalizations emitted. We manually established the distribution of ultrasonic vocalizations among the following call types (see Ey, et al. 2013): 1) short, call duration less than 5 ms; 2) simple, call duration longer than 5 ms and a frequency range either smaller than 6.25 kHz (simple flat) or with a frequency modulation in only one direction (simple upward or downward); 3) complex, frequency modulations in more than one direction and frequency range larger than 6.25 kHz (modulated), or inclusion of one or more additional frequency components (harmonic or nonlinear phenomena, but no saturation); 4) unstructured, no pure tone component and 5) frequency jumps, presence of one or more jump(s) in frequency, without time gap between the consecutive elements. We used unpaired nonparametric Wilcoxon tests to compare the latency for the first call and the call rate between genotypes. We used χ2 (with a Bonferroni correction for multiple testing) tests to compare the proportions of calls between genotypes. During vocalization analysis, animals were filmed for 10 min. Social behavior was measured using a real-time approach that couples computer vision, machine learning, and triggered-RFID identification to track and monitor animals (de Chaumont, et al. 2019).

Histological Analysis

Mice were fixed by transcardiac perfusion with 4% paraformaldehyde, brains were cryoprotected in 30% sucrose and frozen. Cryoprotected brains were embedded in OCT (Leica, France) and 60-μm-thick free-floating cryostat sections were prepared. For lacZ expression analysis, X-gal staining was performed as described (Acampora, et al. 1999) on sections from Dlx5 brains. Pictures were acquired using a light microscope.

Identification of DLX5/6-Neanderthal Haplotype Introgression

To investigate whether polymorphisms identified in the DLX5/6 locus belong to a Neanderthal introgressed haplotype, we analyzed individuals from the 1000 Genomes data set (The 1000 Genomes Project Consortium phase 3, 2015) and looked for Neandertal-like alleles in a window of 200 kb up and downstream from DLX5, using continental African populations as outgroup. Among these SNPs, we identified an haplotype of 37,160 bp (GRCh 38–7: 96,991,129–97,022,094) defined by five consecutive SNPs (rs12704907, rs4729325, rs1005169, rs1724297, rs1207733) with Neandertal-like alleles shared by at least 80% of individuals worldwide. To explore the relationship between haplotypes in this genomic region (Hg38: chr7:96991129-97022094), we constructed a haplotype network based on the haplotypes of all 1000 Genomes individuals (phase III). We used 654 SNPs where archaic genomes were homozygous. We used SHAPEIT to phase together the haplotypes of the 2,504 individuals from 1,000 Genomes, the haplotypes of the 81 semi-super centenarians, and the haplotypes of the 36 matched controls. We identified 5,242 individual haplotypes and 1,020 unique haplotypes. We merged the resulting 1020 unique modern haplotypes into five different clusters (core haplotypes) using an AGlomerative NESting hierarchical cluster analysis (AGNES), implemented in the R package cluster (Maechler 2019). The optimal number of clusters was defined using the silhouette method (Rousseeuw 1987). The mean pairwise nucleotide difference observed in each group is less than 12, whereas the mean pairwise nucleotide difference between each group exceeds 12. We generated a consensus sequence for each core haplotype, corresponding to the sequence of the allele with the highest frequency in each cluster. Then, we computed a rooted neighbor-joining tree for the entire 30.96 kb region based on the nucleotide distance between the sequences of the five modern human core haplotypes together with the Altai, Denisovan, Vindija, chimpanzee (outgroup of the tree) using the R package ape (Paradis and Schliep 2019). We computed the probability of generating such a long haplotype under a scenario of incomplete lineage sorting by assuming the haplotype length follows a gamma distribution (Huerta-Sanchez, et al. 2014) using the parameters from (Dannemann, et al. 2016) and the local recombination rate of 2.046 cM/Mb (Hinch, et al. 2011). The haplotype is significantly longer than expected under the null hypothesis of incomplete lineage sorting (P value = 1.08e−05), supporting an Archaic origin. The DLX5/6-N-Haplotype covers part of DLX5, all DLX6, the intergenic region, and regulatory region upstream of DLX6. The proportion of each SNP in the different geographic regions was derived from the 1000 Genomes database phase 3, which included 694 American, 1,008 East-Asian, 1,006 European, 978 South-Asian, and 1,322 African genomes.

DLX5/6-N-Haplotype Allelic Frequency

The allelic frequency of each SNP composing the DLX5/6-N-Haplotype and their linkage disequilibrium was determined in populations of different geographical and ethnical origins present in the 1000 Genomes data set (The 1000 Genomes Project Consortium 2015). These frequencies were used to reconstruct the haplotype using the linkage disequilibrium R2 between the different SNPs; their variations suggested the coexistence of several haplotypes of different sizes (unphased reconstruction). Genome-wide phased data (Skov, et al. 2020) also identified the introgression of the DLX5/6-N-Haplotype and the other haplotypes of different size from Neanderthal. We searched if the list of the 5 variants included in the core haplotype (or variants in LD with those variants) have been previously associated with a trait or disease using the function LDtrait of the R package LDlinkR (Myers, et al. 2020) (https://cran.r-project.org/web/packages/LDlinkR/index.html) setting r2 of LD > 0.1 in the Europeans and a window of 500,000 bp. No association was reported excluding the presence of non-archaic variants with phenotypic effects in the neighborhood. DLX5/6-N-Haplotype representation in autistic populations was determined using the MSSNG database (Yuen et al. 2017) and the probability was calculated by transmission disequilibrium test (TDT). PheWeb was examined to identify phenotypes associated with the five SNPs (rs12704907, rs4729325, rs1005169, rs1724297, and rs1207733) (Gagliano Taliun, et al. 2020). We examined UK Biobank data set through PheWeb based on SAIGE analysis of ICD -derived traits (http://pheweb.sph.umich.edu/SAIGE-UKB/) (Zhou, et al. 2018) and on the first Neale lab analysis (https://pheweb.org/UKB-Neale/). PheWeb was examined to identify associations considering participants of the Michigan Genomics Initiative (http://pheweb.org/MGI-freeze2/). PhenoScanner v2 (http://www.phenoscanner.medschl.cam.ac.uk/), ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/), GWAS Catalog (https://www.ebi.ac.uk/gwas/), Gwas Central (https://www.gwascentral.org/), GTEX Portal (https://gtexportal.org/home/), and clinGen (https://clinicalgenome.org/) were examined to identify in silico potential functional impact of the five SNPs (rs12704907, rs4729325, rs1005169, rs1724297, and rs1207733). Then, we analyzed phenotypic associations in the UK Biobank using the GeneATLAS tool (Canela-Xandri, et al. 2018). In brief, the Gene ATLAS provides phenotypic associations based on Mixed Linear Models using 452,264 Britons with European descent. The model includes sex, array batch, UK Biobank Assessment Centre, age, and 20 genomic principal components. Results are reported in supplementary tables 7S–11S, Supplementary Material online). The genomes from a cohort of 81 healthy semi-supercentenarians and supercentenarians [105+/110+] older than 105 years (mean age 106.6 ± 1.6) recruited in North, Center, and South of the Italian peninsula (CEN) were compared to a population (ITA) obtained combining the Tuscany 1000 Genomes population (TSI) with 36 controls genomes matching the different geographical origin of CEN. European population correspond to TSI, CEU, FIN, GBR, and IBS people present in 1000 Genomes. Haplotypes were used to compute the Fst statistic (Nei 1973) between all the combinations of population pairs. To evaluate the significance of the Fst observed values, we performed 500 simulations where individuals were randomly assigned to one of the seven studied populations. For each simulation, Fst statistic was computed. At the end of this procedure, we were able to fit a probability density function for the Fst statistic and to compute an empirical P value for the Fst using the GHap (Utsunomiya, et al. 2016) R package. The significance of allelic differences between European population and semi-supercentenarians was also determined with a χ2 test. In order to test if the Fst calculated for the DLX5/6-N-Haplotype is within the distribution of other archaic haplotypes or is an outlier, we built a density plot considering all the Fst values calculated including all the 110,692 archaic haplotypes described by Skov, et al. (2020) for CEN versus ITA groups. The 110,692 archaic haplotypes (Skov, et al. 2020) are located on all chromosomes and were phased by applying SHAPEIT software v.2.17 (Delaneau, et al. 2013; O'Connell, et al. 2014). For each archaic haplotype, the Fst statistic (Nei 1973) was computed between all population pairs (CEN, ITA and CEU, FIN, GBR, and IBS from 1000 Genomes Project). Then, for each pairwise comparison, we fitted a probability density function for the Fst statistic and an empirical P value was computed for the Fst value calculated on our haplotype. To explore the extent of the DLX5/6-N-Haplotype, we computed the Linkage Disequilibrium (LD) between 278 SNPs (MAF > 0.05 in ITA) located in a wide region (hg38 - chr7:96935616-97130527) encompassing the DLX5/6-N-Haplotype (supplementary fig. 3S, Supplementary Material online). We defined two haplotypes (indicated with green and red segments in supplementary fig. 3S, Supplementary Material online) using the –blocks flag of the software PLINK (Purcell, et al. 2007). These haplotypes include nonarchaic variants in weak LD with archaic variants and located downstream the core-haplotype. We computed the Fst value between CEN and ITA for such defined nonarchaic haplotypes, and we checked if the calculated Fst value is significant from the reported Fst distribution. The same Fst analysis was then performed considering nonarchaic variants included in the core-haplotype (indicated with the blue segment in supplementary fig. 3S, Supplementary Material online) and in high LD (r2 > 0.7) with all the archaic variants present in the same region. The Fst analysis was implemented in R software (https://www.R-project.org/) using the GHap package (Utsunomiya, et al. 2016).

Supplementary Material

Supplementary data are available at Molecular Biology and Evolution online. Click here for additional data file.
  99 in total

1.  The phenotypic legacy of admixture between modern humans and Neandertals.

Authors:  Corinne N Simonti; Benjamin Vernot; Lisa Bastarache; Erwin Bottinger; David S Carrell; Rex L Chisholm; David R Crosslin; Scott J Hebbring; Gail P Jarvik; Iftikhar J Kullo; Rongling Li; Jyotishman Pathak; Marylyn D Ritchie; Dan M Roden; Shefali S Verma; Gerard Tromp; Jeffrey D Prato; William S Bush; Joshua M Akey; Joshua C Denny; John A Capra
Journal:  Science       Date:  2016-02-12       Impact factor: 47.728

Review 2.  Social determinants of health and survival in humans and other animals.

Authors:  Noah Snyder-Mackler; Joseph Robert Burger; Lauren Gaydosh; Daniel W Belsky; Grace A Noppert; Fernando A Campos; Alessandro Bartolomucci; Yang Claire Yang; Allison E Aiello; Angela O'Rand; Kathleen Mullan Harris; Carol A Shively; Susan C Alberts; Jenny Tung
Journal:  Science       Date:  2020-05-22       Impact factor: 47.728

3.  An atlas of genetic associations in UK Biobank.

Authors:  Oriol Canela-Xandri; Konrad Rawlik; Albert Tenesa
Journal:  Nat Genet       Date:  2018-10-22       Impact factor: 38.330

4.  Expression analysis and mutation detection of DLX5 and DLX6 in autism.

Authors:  Naomi Nakashima; Takanori Yamagata; Masato Mori; Mari Kuwajima; Kiyotaka Suwa; Mariko Y Momoi
Journal:  Brain Dev       Date:  2009-02-04       Impact factor: 1.961

Review 5.  Multiple functions of Dlx genes.

Authors:  G R Merlo; B Zerega; L Paleari; S Trombino; S Mantero; G Levi
Journal:  Int J Dev Biol       Date:  2000       Impact factor: 2.203

6.  A high-resolution map of human evolutionary constraint using 29 mammals.

Authors:  Kerstin Lindblad-Toh; Manuel Garber; Or Zuk; Michael F Lin; Brian J Parker; Stefan Washietl; Pouya Kheradpour; Jason Ernst; Gregory Jordan; Evan Mauceli; Lucas D Ward; Craig B Lowe; Alisha K Holloway; Michele Clamp; Sante Gnerre; Jessica Alföldi; Kathryn Beal; Jean Chang; Hiram Clawson; James Cuff; Federica Di Palma; Stephen Fitzgerald; Paul Flicek; Mitchell Guttman; Melissa J Hubisz; David B Jaffe; Irwin Jungreis; W James Kent; Dennis Kostka; Marcia Lara; Andre L Martins; Tim Massingham; Ida Moltke; Brian J Raney; Matthew D Rasmussen; Jim Robinson; Alexander Stark; Albert J Vilella; Jiayu Wen; Xiaohui Xie; Michael C Zody; Jen Baldwin; Toby Bloom; Chee Whye Chin; Dave Heiman; Robert Nicol; Chad Nusbaum; Sarah Young; Jane Wilkinson; Kim C Worley; Christie L Kovar; Donna M Muzny; Richard A Gibbs; Andrew Cree; Huyen H Dihn; Gerald Fowler; Shalili Jhangiani; Vandita Joshi; Sandra Lee; Lora R Lewis; Lynne V Nazareth; Geoffrey Okwuonu; Jireh Santibanez; Wesley C Warren; Elaine R Mardis; George M Weinstock; Richard K Wilson; Kim Delehaunty; David Dooling; Catrina Fronik; Lucinda Fulton; Bob Fulton; Tina Graves; Patrick Minx; Erica Sodergren; Ewan Birney; Elliott H Margulies; Javier Herrero; Eric D Green; David Haussler; Adam Siepel; Nick Goldman; Katherine S Pollard; Jakob S Pedersen; Eric S Lander; Manolis Kellis
Journal:  Nature       Date:  2011-10-12       Impact factor: 49.962

7.  Altitude adaptation in Tibetans caused by introgression of Denisovan-like DNA.

Authors:  Emilia Huerta-Sánchez; Xin Jin; Zhuoma Bianba; Benjamin M Peter; Nicolas Vinckenbosch; Yu Liang; Xin Yi; Mingze He; Mehmet Somel; Peixiang Ni; Bo Wang; Xiaohua Ou; Jiangbai Luosang; Zha Xi Ping Cuo; Kui Li; Guoyi Gao; Ye Yin; Wei Wang; Xiuqing Zhang; Xun Xu; Huanming Yang; Yingrui Li; Jian Wang; Jun Wang; Rasmus Nielsen
Journal:  Nature       Date:  2014-07-02       Impact factor: 49.962

8.  Globularity and language-readiness: generating new predictions by expanding the set of genes of interest.

Authors:  Cedric Boeckx; Antonio Benítez-Burraco
Journal:  Front Psychol       Date:  2014-11-25

9.  Dlx5 and Dlx6 expression in GABAergic neurons controls behavior, metabolism, healthy aging and lifespan.

Authors:  Camille de Lombares; Eglantine Heude; Gladys Alfama; Anastasia Fontaine; Rim Hassouna; Cécile Vernochet; Fabrice de Chaumont; Christophe Olivo-Marin; Elodie Ey; Sébastien Parnaudeau; François Tronche; Thomas Bourgeron; Serge Luquet; Giovanni Levi; Nicolas Narboux-Nême
Journal:  Aging (Albany NY)       Date:  2019-09-12       Impact factor: 5.682

10.  A general approach for haplotype phasing across the full spectrum of relatedness.

Authors:  Jared O'Connell; Deepti Gurdasani; Olivier Delaneau; Nicola Pirastu; Sheila Ulivi; Massimiliano Cocca; Michela Traglia; Jie Huang; Jennifer E Huffman; Igor Rudan; Ruth McQuillan; Ross M Fraser; Harry Campbell; Ozren Polasek; Gershim Asiki; Kenneth Ekoru; Caroline Hayward; Alan F Wright; Veronique Vitart; Pau Navarro; Jean-Francois Zagury; James F Wilson; Daniela Toniolo; Paolo Gasparini; Nicole Soranzo; Manjinder S Sandhu; Jonathan Marchini
Journal:  PLoS Genet       Date:  2014-04-17       Impact factor: 5.917

View more
  2 in total

1.  Dlx5/6 Expression Levels in Mouse GABAergic Neurons Regulate Adult Parvalbumin Neuronal Density and Anxiety/Compulsive Behaviours.

Authors:  Rym Aouci; Mey El Soudany; Zakaria Maakoul; Anastasia Fontaine; Hiroki Kurihara; Giovanni Levi; Nicolas Narboux-Nême
Journal:  Cells       Date:  2022-05-25       Impact factor: 7.666

2.  Human Cerebral Organoid Implantation Alleviated the Neurological Deficits of Traumatic Brain Injury in Mice.

Authors:  Zhongyuan Bao; Kaiheng Fang; Zong Miao; Chong Li; Chaojuan Yang; Qiang Yu; Chen Zhang; Zengli Miao; Yan Liu; Jing Ji
Journal:  Oxid Med Cell Longev       Date:  2021-11-22       Impact factor: 6.543

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.