Literature DB >> 28158547

IBD Sharing between Africans, Neandertals, and Denisovans.

Gundula Povysil1, Sepp Hochreiter1.   

Abstract

Interbreeding between ancestors of humans and other hominins outside of Africa has been studied intensively, while their common history within Africa still lacks proper attention. However, shedding light on human evolution in this time period about which little is known, is essential for understanding subsequent events outside of Africa. We investigate the genetic relationships of humans, Neandertals, and Denisovans by identifying very short DNA segments in the 1000 Genomes Phase 3 data that these hominins share identical by descent (IBD). By focusing on low frequency and rare variants, we identify very short IBD segments with high confidence. These segments reveal events from a very distant past because shorter IBD segments are presumably older than longer ones. We extracted two types of very old IBD segments that are not only shared among humans, but also with Neandertals and/or Denisovans. The first type contains longer segments that are found primarily in Asians and Europeans where more segments are found in South Asians than in East Asians for both Neandertal and Denisovan. These longer segments indicate complex admixture events outside of Africa. The second type consists of shorter segments that are shared mainly by Africans and therefore may indicate events involving ancestors of humans and other ancient hominins within Africa. Our results from the autosomes are further supported by an analysis of chromosome X, on which segments that are shared by Africans and match the Neandertal and/or Denisovan genome were even more prominent. Our results indicate that interbreeding with other hominins was a common feature of human evolution starting already long before ancestors of modern humans left Africa.

Entities:  

Keywords:  identity by descent; interbreeding; Denisova; human evolution; gene flow; Neandertal

Mesh:

Substances:

Year:  2016        PMID: 28158547      PMCID: PMC5381509          DOI: 10.1093/gbe/evw234

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

One of the most fundamental questions of humanity is: “Where do we come from?” Recent advances in biotechnology made whole genome sequencing feasible and therefore helped us to get closer to an answer. In the 1000 Genomes Project (1000 Genomes Project Consortium 2010), thousands of individuals were sequenced, which gave new insights into the population structure of humans. Application of these sequencing technologies was extended to ancient DNA and made it possible to assemble the genome of hominins that lived tens of thousands of years ago (Green et al. 2010; Reich et al. 2010; Meyer et al. 2012; Prüfer et al. 2014). The outcomes of whole genome sequencing of Neandertals (Green et al. 2010; Prüfer et al. 2014) and a Denisovan (Reich et al. 2010; Meyer et al. 2012) changed the view on the history of humans and other hominins. Previous evidence from mtDNA (Krings et al. 1997; Currat and Excoffier 2004; Serre et al. 2004) and the Y chromosome (Krause et al. 2007) had suggested that Neandertals lived isolated in Europe and Asia until they were replaced by anatomically modern humans. However, current findings hint at numerous admixture events between hominin groups (Green et al. 2010; Reich et al. 2010; Meyer et al. 2012; Sankararaman et al. 2012, 2014, 2016; Yang et al. 2012; Lohse and Frantz 2014; Prüfer et al. 2014). Several studies have found that non-Africans, especially Asians, share more alleles with the Neandertal genome than sub-Saharan Africans (Green et al. 2010; Wall et al. 2013; Khrameeva et al. 2014; Prüfer et al. 2014). For the Denisovan genome, enrichment only within Oceanian populations was initially reported (Reich et al. 2010, 2011; Mendez et al. 2012, 2013). Later studies also detected DNA regions of Denisovan origin in East and Southeast Asians as well as in populations of the Americas, but only very few such regions in Europeans (Skoglund and Jakobsson 2011; Meyer et al. 2012; Prüfer et al. 2014). The controversy over the origin of these differences in allele sharing between particular modern populations and the genomes of Neandertals and Denisovans is ongoing. While the differences may be attributed to the existence of an ancient population substructure within Africa (Eriksson and Manica 2012, 2014; Lowery et al. 2013), ancient admixture events between Neandertals, Denisovans and anatomically modern humans outside of Africa are considered as a more plausible explanation (Green et al. 2010; Sankararaman et al. 2012, 2014, 2016; Yang et al. 2012; Lohse and Frantz 2014; Prüfer et al. 2014). Since African populations show less allele sharing with Neandertals and Denisovans than non-Africans, they are often used as a baseline for the absence of Neandertal or Denisovan ancestry (Yang et al. 2012; Sankararaman et al. 2014, 2016; Vernot and Akey 2014). Only few studies have looked at Neandertal ancestry among African populations (Lachance et al. 2012; Sanchez-Quinto et al. 2012; Wang et al. 2013; Llorente et al. 2015) and mostly came to the conclusion that Neandertal ancestry in Africans is probably due to Eurasian backflow admixture. However, archaic hominins lived longer side by side within than outside of Africa and this led to plentiful opportunities for gene flow within Africa that should not be dismissed (Hammer et al. 2011). Knowing DNA regions of earlier admixture would both help to better identify later gene flow events and shed light on adaptation processes that were supported by interbreeding between hominin groups at different points in time. We use HapFABIA (Hochreiter 2013), our recently developed method for detecting DNA segments that are identical by descent (IBD), on the 1000 Genomes Phase 3 data with the aim to gain insights into the genetic relationships between humans, Neandertals and Denisovans (see “HapFABIA for Extracting Short IBD Segments” section). IBD detection methods have already been successful in inferring population structure (Gusev et al. 2012; Palamara et al. 2012; Botigué et al. 2013; Carmi et al. 2013; Gravel et al. 2013; Ralph and Coop 2013). Previous studies were limited to detecting relatively recent genetic relationships because the methods used were not able to reliably identify short IBD segments, which are presumably older than longer ones (Chapman and Thompson 2003). HapFABIA accurately detects these very short IBD segments via low frequency and rare variants that tag them because rare minor alleles are highly unlikely to arise independently (Strachan and Read 2004, Ch. 15.3, p. 441). In the Phase 3 data of the 1000 Genomes Project, we found IBD-based indications of interbreeding between ancestors of humans and other ancient hominins within Africa (see examples of an IBD segment that matches the Denisovan and Neandertal genome, respectively, in figs. 1 and 2). First, we present results of IBD segment sharing between Africans and the ancient genomes of Neandertal and Denisovan. Then, we show that these shared IBD segments are older than those originating from introgression events outside of Africa. Finally, in the “Discussion” section, we consider different theories concerning the origin of IBD segments shared between modern Africans, Neandertals and Denisovans. Example of a Denisovan-matching IBD segment shared among Africans. The rows represent all individuals that have the IBD segment, and columns represent consecutive variants. Major alleles are shown in yellow, minor alleles of variants that tag the IBD segment (tagSNVs) in violet, and minor alleles of other variants in cyan. The row labeled “model L” indicates tagSNVs identified by HapFABIA in violet. The rows “Ancestor”, “Neandertal”, and “Denisova” show bases of the respective genomes in violet if they match the minor allele of the tagSNVs (in yellow otherwise). For the “Ancestor genome”, we used the reconstructed common ancestor sequence that was provided as part of the 1000 Genomes Project data. Example of a Neandertal-matching IBD segment shared among Africans. This segment is tagged by more than 100 minor alleles. See figure 1 for a detailed description.
F

Example of a Denisovan-matching IBD segment shared among Africans. The rows represent all individuals that have the IBD segment, and columns represent consecutive variants. Major alleles are shown in yellow, minor alleles of variants that tag the IBD segment (tagSNVs) in violet, and minor alleles of other variants in cyan. The row labeled “model L” indicates tagSNVs identified by HapFABIA in violet. The rows “Ancestor”, “Neandertal”, and “Denisova” show bases of the respective genomes in violet if they match the minor allele of the tagSNVs (in yellow otherwise). For the “Ancestor genome”, we used the reconstructed common ancestor sequence that was provided as part of the 1000 Genomes Project data.

Material and Methods

Data

We used HapFABIA to extract short IBD segments from the 1000 Genomes Project Phase 3 data (1000 Genomes Project Consortium 2015) (ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/, last accessed 31 October 2014). This dataset consists of low-coverage whole genome sequences from 2,504 individuals (661 Africans, 347 Admixed Americans, 504 East Asians, 489 South Asians, and 503 Europeans). Compared with the 1000 Genomes Phase 1 data (1000 Genomes Project Consortium 2012) this dataset includes additional subpopulations from Africa, the Americas and East Asia, as well as South Asian populations (see supplementary table S2, Supplementary Material online, for details about the subpopulations). We removed 11 individuals because they showed cryptic first degree relatedness to others (see “Removal of Related Individuals” section). The final dataset consisted of 2,493 individuals (654 Africans—AFR, 347 Admixed Americans—AMR, 504 East Asians—EAS, 485 South Asians—SAS, and 503 Europeans—EUR). We also excluded Admixed American individuals from the group of individuals that share an IBD segment in order to avoid confounding influences from their admixed ancestry. For the same reason we removed individuals with African Ancestry in Southwest US and African Caribbeans in Barbados for most of the analyses. Since previous analyses revealed many phasing errors (Hochreiter 2013), we applied HapFABIA with default parameters to the unphased genotype data. Since short indels and larger copy number variants can still not be called as reliably as SNVs we removed them from the provided VCF files. Furthermore, we split multiallelic variants into multiple lines. We also filtered out repeat regions and CpGs using bed files downloaded from the UCSC genome browser (Meyer et al. 2013). HapFABIA is based on low frequency and rare variants, therefore we removed common and private variants prior to the analysis. Afterwards, all chromosomes were divided into intervals of 10,000 variants with adjacent intervals overlapping by 5,000 variants. About 31× coverage whole genome sequencing data of the Denisovan and the Altai Neandertal genome of 52× coverage were provided by the Max Planck Institute for Evolutionary Anthropology (Meyer et al. 2012; Prüfer et al. 2014) (http://cdna.eva.mpg.de/denisova/, last accessed 2 February 2012 and http://cdna.eva.mpg.de/neandertal/altai/, last accessed 23 May 2013). If we only consider variants that are present in at least one individual of the 1000 Genomes Project Phase 3, 0.9% of the Denisovan bases and 0.4% of the Neandertal bases were marked as not determined. Of the remaining Denisovan and Neandertal bases 96.0% and 95.6%, respectively, matched bases of the human reference, and 4.0% and 4.4%, respectively, matched either the human minor allele or were different from human alleles. As additional information in the 1000 Genomes Project data, bases of the reconstructed common ancestor of human, chimpanzee, gorilla, orang-utan, macaque, and marmoset genomes were included. These are named ancestral genome of humans and other primates in the following.

HapFABIA for Extracting Short IBD Segments

In large sequencing data, HapFABIA (Hochreiter 2013) identifies very short IBD segments that are tagged by low frequency or rare variants (so-called tagSNVs) with a minor allele frequency (MAF) of ≤5%. A DNA segment is identical by state (IBS) in two or more individuals if they all have identical nucleotide sequences in this segment. An IBS segment is identical by descent (IBD) in two or more individuals if they have inherited it from a common ancestor, that is, the segment has the same ancestral origin in these individuals. Rare variants can be utilized to distinguish IBD from IBS without IBD because independent origins are highly unlikely for such variants. In other words, IBS generally implies IBD for rare variants, which is not true for common variants (Strachan and Read 2004; Browning SR and Browning BL 2012). Furthermore, rare variants make juxtapositions of smaller IBD segments unlikely, which precludes their aggregation into one long IBD segment. Consequently, the length of IBD segments is estimated more accurately than with previous methods. HapFABIA identifies 100 times smaller IBD segments than current state-of-the-art methods: 10 kb for HapFABIA versus 1 Mb for state-of-the-art methods. This was verified in experiments with artificial, simulated, and real genotyping data, where HapFABIA outperformed its competitors in detecting short IBD segments (Hochreiter 2013). It was also shown that HapFABIA is very robust to sequencing errors, therefore, the low coverage data of the 1000 Genomes Project do not pose a problem. HapFABIA is based on biclustering (Hochreiter et al. 2010), which in turn uses modern machine learning techniques derived from maximizing the posterior in a Bayes framework (Hochreiter et al. 2006; Clevert et al. 2011; Klambauer et al. 2012; Munro et al. 2014; Su et al. 2014). Preliminary results showing the ability of HapFABIA to detect IBD segments shared with Neandertals and Denisovans were already published in Hochreiter (2013).

Exponentially Distributed IBD Lengths

The length of an IBD segment is exponentially distributed with a mean of 100/(2g) cM (centiMorgans), where g is the number of generations which separate the two haplotypes that share a segment from their common ancestor (Thomas et al. 1994, 2008; Browning 2008; Gusev et al. 2012; Palamara et al. 2012). Therefore, on average, shorter IBD segments are older than longer ones. We exploit this relation in order to distinguish between older admixture within Africa and more recent interbreeding outside of Africa. Ulgen and Li (2005) recommended using a recombination rate (cM-to-Mb ratio) of 1, but it varies from 0 to 9 along a chromosome (Yu et al. 2001). For chromosome X, the sex-averaged recombination rate is 2/3 of the female recombination rate on the X chromosome (Hedrick 2007). The female recombination rate on the X chromosome has approximately the same value as the sex-averaged autosomal recombination rate (Kong et al. 2002). Thus, we assume that the sex-averaged recombination rate on the X chromosome is 2/3 times the autosomal recombination rate. The average length of IBD segments on chromosome X should therefore be 3/2 times the average length of the segments on autosomes (Povysil and Hochreiter 2014). Our length distributions show this expected outcome (see supplementary fig. S1, Supplementary Material online). This corroboration of our theoretical considerations justifies using the length for a relative estimation of the age of IBD segments. However, we are not confident in absolute age estimations of the IBD segments based on their length (see explanation in Povysil and Hochreiter 2014).

Removal of Related Individuals

The amount of IBD sharing between two individuals reflects their degree of relatedness. For a first degree relationship approximately 50% of the IBD segments are expected to be shared between two individuals. Therefore, we looked for the fraction of IBD segments each pair of individuals in the 1000 Genomes data shares. Pairs of individuals that have more than 50% of their IBD segments in common on several chromosomes were considered to be related. One individual of each pair was randomly selected to be removed and the IBD detection algorithm was rerun using the reduced data.

IBD Segments Matching Ancient Genomes

First we identified IBD segments in the 1000 Genomes data for human populations only. Then we checked whether the identified human IBD segments match ancient genomes. This matching is indicated by the presence of human minor alleles of tagSNVs in the corresponding ancient genome. In general, not the whole IBD segment is observed in ancient genomes but only a part thereof. The part of the IBD segment which carries the minor alleles that are observed in an ancient genome is called “ancient part” of the IBD segment. We define matching of an IBD segment and an ancient genome by three criteria: (1) at least 15% of the tagSNVs of the IBD segment matches the ancient genome, that is, also the ancient genome carries the human minor allele, (2) the “ancient part” of the IBD segment contains at least 8 tagSNVs, and (3) 30% of the tagSNVs in the “ancient part” of the IBD segment match the ancient genome. Finally, we corrected the IBD segment lengths to obtain the length of the “ancient part” that matches a particular ancient genome (see details in Povysil and Hochreiter 2014). We tried different thresholds for determining whether an IBD segment matches a particular ancient genome. Although the resulting number of shared IBD segments depends on these thresholds the distribution over different populations and the densities of the length of the IBD segments show no recognizable differences (see supplementary figs. S2 and S3 and table S1, Supplementary Material online). Since the high coverage ancient genomes are only compared with previously detected human IBD segments, it is not a problem that the 1000 Genomes Project whole genome sequences are of low coverage. On the contrary, we have a high coverage verification that the IBD segments found are not random findings but indeed present in the genomes. Thus, the ancient genomes not only serve to look up ancient DNA segments (similar to a database) but additionally support IBD segments that are found in human genomes.

Removal of Ancestral IBD Segments

Human IBD segments that are found in ancient genomes may have already been present in the ancestral genome of humans and other primates. These IBD segments would confound the interbreeding analysis based on IBD sharing between modern human populations and Neandertals or Denisovans. Thus for most of the analyses, we removed IBD segments of which at least 30% of the tagSNVs that are shared by an ancient genome also match the reconstructed common ancestor sequence that was provided as part of the 1000 Genomes Project data. Again, adjusting this threshold did not affect the overall results. As can be seen in supplementary figure S4, Supplementary Material online, IBD segments shared with ancient genomes that are also present in the ancestral sequence are shorter than segments shared with ancient genomes that do not match the ancestral sequence. This confirms that, by removing IBD segments that are already present in the ancestral genome, we get rid of older IBD segments that originate from a common ancestor of all primates.

Results

We report results from our analysis of the 1000 Genomes Project Phase 3 (1000 Genomes Project Consortium 2015) whole genome sequencing data, where we focus on very short IBD segments that are tagged by low frequency and rare variants (tagSNVs). First, we extracted IBD segments shared by continental populations from Africa, East Asia, South Asia, and Europe. In the next step, we examined whether the IBD segments match the Neandertal and/or Denisovan sequence using the high-coverage whole genome sequencing data of the Denisovan and the Altai Neandertal (Meyer et al. 2012; Prüfer et al. 2014). For more details on the data, see “Data” section. We removed IBD segments that match the reconstructed common ancestor sequence of humans and other primates that was provided as part of the 1000 Genomes Project data (see “Removal of Ancestral IBD Segments” section), since they would confound the interbreeding analysis based on IBD sharing between modern human populations and Neandertals or Denisovans. We especially investigated IBD segments shared between Africans and Neandertals and/or Denisovans in contrast to segments shared between non-Africans and these ancient genomes. By comparing the average length of IBD segments that particular populations share with these ancient genomes, we were able to distinguish between very early events within Africa and later admixture outside of Africa as the origin of shared segments (see “Exponentially Distributed IBD Lengths” section). Further, differences between the autosomes and chromosome X were of particular interest because chromosome X is under especially high selective pressure and may show different levels of introgression if gene flow is primarily male-driven (Nielsen et al. 2005; Lambert et al. 2010; Veeramah et al. 2014). In total, more than 25,000 IBD segments that match ancient genomes are shared predominantly by Africans. On chromosome X, there are more than 1,000 of these segments which amounts to more than half of the X-chromosomal segments that match ancient genomes. First, we investigate the frequency of IBD segments that are shared with ancient genomes across different continental populations. An IBD segment is assigned to an ancient genome if at least 30% of it matches the respective genome and to a population if more than 50% of the individuals with the corresponding IBD segment belong to it (see “IBD Segments Matching Ancient Genomes” section for more details). We visualize the results in figure 3, where different pie charts reflect the different genomes and the color reflects the population to which more than 50% of the individuals with the corresponding IBD segment belong. Gray slices represent IBD segments where neither of the populations has a share of more than 50%. Compared with all IBD segments found in the 1000 Genomes Project data (labeled “All”), those that match ancient genomes are enriched in Europeans and Asians. Interestingly, South Asians do not only share more segments with the Denisovan genome than East Asians and Europeans, but also the number of segments shared with the Neandertal genome is higher for South Asians than for other non-African populations. Furthermore, many segments matching ancient genomes are shared predominantly by Africans — especially on chromosome X. About 13,106 of 47,987 Neandertal-matching IBD segments on the autosomes and 614 of 1,479 on chromosome X are primarily found in Africans. Denisovan-matching IBD segments are predominantly shared by Africans in 14,098 out of 28,157 cases for the autosomes and 670 out of 983 for chromosome X.
F

Predominant population for an IBD segment and genome. For each IBD segment, the population in which it is predominantly found is determined. “All” stands for all IBD segments found, while “Neandertal” and “Denisovan” denote all IBD segments matching the Neandertal and Denisovan sequence, respectively. The color indicates the population to which more than 50% of the individuals with the corresponding IBD segment are assigned to. Gray slices indicate IBD segments for which neither of the populations has a share of more than 50%. The pie charts show prominent IBD sharing between Africans and ancient genomes, visualized by the green slices of the pies.

Predominant population for an IBD segment and genome. For each IBD segment, the population in which it is predominantly found is determined. “All” stands for all IBD segments found, while “Neandertal” and “Denisovan” denote all IBD segments matching the Neandertal and Denisovan sequence, respectively. The color indicates the population to which more than 50% of the individuals with the corresponding IBD segment are assigned to. Gray slices indicate IBD segments for which neither of the populations has a share of more than 50%. The pie charts show prominent IBD sharing between Africans and ancient genomes, visualized by the green slices of the pies. The IBD segments we considered above are often observed in more than one continental population, which means that, although they are found primarily in Africans, some Asians or Europeans may also share the same segment. Later admixture events between human populations may thus be the reason for parts of ancient genomes being observed in Africans. Therefore, we additionally investigated IBD segments that are observed exclusively in one continental population. Table 1 shows the number of Neandertal- and Denisovan-matching IBD segments per chromosome that are shared exclusively by different continental populations. Neandertal-matching IBD segments are most often exclusive to Africans across all chromosomes. About 20% of all Neandertal-matching IBD segments on the autosomes are shared exclusively by Africans, compared with less than 7%, 11% and 4% for East Asians, South Asians and Europeans, respectively. On chromosome X, almost 35% of all Neandertal-matching IBD segments are in fact shared exclusively by Africans. The results are even more impressive for sharing with the Denisovan genome. Almost 40% of all Denisovan-matching IBD segments on the autosomes are found exclusively in Africans followed by South Asians with less than 14% of the shared segments. Europeans share less than 300 IBD segments exclusively with Denisovans indicating that IBD segments shared between Europeans and Denisovans probably were introduced into European genomes via admixture with other human populations. Interestingly, on chromosome X, East Asians and Europeans have very few exclusive IBD segments that are shared with the Denisovan individual. About 563 out of 983 IBD segments on chromosome X that match the Denisovan genome are exclusive to the African population, and a large proportion of the remaining segments is also observed in Africans. Explanations for these findings are: (1) that little Denisovan DNA entered the human X chromosome outside of Africa or (2) that due to selective pressure, modern Eurasians lost most of the Denisovan sequence on this chromosome. IBD segments that are exclusively shared between Africans and Neandertals or Denisovans occur in similar quantities in all African subpopulations (see table 2). The uniform distribution of ancient genome sharing across Africa contradicts a recent Eurasian backflow to Africa as sole origin of IBD segments shared between Africans and ancient genomes.
Table 1

Number of Neandertal- and Denisovan-Matching IBD Segments Shared Exclusively by a Particular Continental Population for Each Chromosome

Chr.Neandertal
Denisovan
ALLEASSASEURAFRALLEASSASEURAFR
13,906271 (6.9%)332 (8.5%)130 (3.3%)751 (19.2%)2,284165 (7.2%)219 (9.6%)15 (0.7%)865 (37.9%)
24,094219 (5.3%)398 (9.7%)125 (3.1%)943 (23.0%)2,477106 (4.3%)354 (14.3%)24 (1.0%)997 (40.3%)
33,299203 (6.2%)453 (13.7%)75 (2.3%)679 (20.6%)1,81182 (4.5%)273 (15.1%)13 (0.7%)709 (39.1%)
43,675283 (7.7%)397 (10.8%)93 (2.5%)830 (22.6%)2,332251 (10.8%)382 (16.4%)14 (0.6%)892 (38.3%)
53,168225 (7.1%)366 (11.6%)153 (4.8%)573 (18.1%)1,978159 (8.0%)260 (13.1%)34 (1.7%)679 (34.3%)
63,424268 (7.8%)270 (7.9%)128 (3.7%)533 (15.6%)1,993146 (7.3%)219 (11.0%)28 (1.4%)692 (34.7%)
72,607146 (5.6%)202 (7.7%)83 (3.2%)627 (24.1%)1,47264 (4.3%)129 (8.8%)10 (0.7%)707 (48.0%)
82,29575 (3.3%)228 (9.9%)136 (5.9%)675 (29.4%)1,38039 (2.8%)180 (13.0%)27 (2.0%)649 (47.0%)
91,893123 (6.5%)211 (11.1%)76 (4.0%)335 (17.7%)1,24545 (3.6%)269 (21.6%)12 (1.0%)472 (37.9%)
102,393144 (6.0%)242 (10.1%)74 (3.1%)400 (16.7%)1,47076 (5.2%)259 (17.6%)11 (0.7%)536 (36.5%)
112,778116 (4.2%)275 (9.9%)171 (6.2%)445 (16.0%)1,58782 (5.2%)227 (14.3%)42 (2.6%)555 (35.0%)
122,298139 (6.0%)275 (12.0%)59 (2.6%)298 (13.0%)1,05877 (7.3%)133 (12.6%)5 (0.5%)406 (38.4%)
131,891127 (6.7%)242 (12.8%)55 (2.9%)338 (17.9%)1,08097 (9.0%)152 (14.1%)5 (0.5%)352 (32.6%)
141,898181 (9.5%)190 (10.0%)72 (3.8%)256 (13.5%)90455 (6.1%)94 (10.4%)9 (1.0%)321 (35.5%)
151,350126 (9.3%)142 (10.5%)42 (3.1%)230 (17.0%)77331 (4.0%)130 (16.8%)2 (0.3%)289 (37.4%)
161,21355 (4.5%)131 (10.8%)41 (3.4%)326 (26.9%)78221 (2.7%)112 (14.3%)3 (0.4%)382 (48.8%)
171,01846 (4.5%)137 (13.5%)42 (4.1%)266 (26.1%)63730 (4.7%)71 (11.1%)3 (0.5%)298 (46.8%)
181,42370 (4.9%)135 (9.5%)58 (4.1%)308 (21.6%)81744 (5.4%)135 (16.5%)1 (0.1%)309 (37.8%)
191,06163 (5.9%)98 (9.2%)33 (3.1%)247 (23.3%)67743 (6.4%)63 (9.3%)2 (0.3%)277 (40.9%)
201,11646 (4.1%)115 (10.3%)55 (4.9%)287 (25.7%)63111 (1.7%)66 (10.5%)9 (1.4%)318 (50.4%)
2162835 (5.6%)72 (11.5%)19 (3.0%)166 (26.4%)37824 (6.3%)62 (16.4%)0 (0.0%)153 (40.5%)
2255930 (5.4%)43 (7.7%)17 (3.0%)94 (16.8%)39130 (7.7%)85 (21.7%)1 (0.3%)127 (32.5%)
1-2247,9872,991 (6.2%)4,954 (10.3%)1,737 (3.6%)9,607 (20.0%)28,1571,678 (6.0%)3,874 (13.8%)270 (1.0%)10,985 (39.0%)
X1,47984 (5.7%)204 (13.8%)83 (5.6%)512 (34.6%)9838 (0.8%)123 (12.5%)23 (2.3%)563 (57.3%)

Note.—The column labeled “Chr.” gives the chromosome, and “Neandertal” and “Denisovan” group IBD segments matching the Neandertal and Denisovan genomes, respectively. “ALL” gives the total number of IBD segments matching the respective ancient genome, “EAS”, “SAS”, “EUR”, and “AFR” report the number of matching IBD segments shared exclusively by East Asians, South Asians, Europeans, and Africans, respectively, and the percentage compared to the total number of IBD segments matching the respective ancient genome.

Table 2

Number of Neandertal- and Denisovan-Matching IBD Segments Shared Exclusively by Africans that are Observed in a Particular Subpopulation

AFRESNGWDLWKMSLYRI
Neandertal 1–229,6072,9873,1352,9943,1913,133
Neandertal X512139117148123132
Denisovan 1–2210,9853,3123,4943,4103,6023,489
Denisovan X563167154187163166

Note.—The rows present results for Neandertal on the autosomes and chromosome X, as well as, results for the Denisovan on the autosomes and chromosome X, respectively. The column labeled “AFR” gives the total number of IBD segments that are exclusively shared by Africans and match the respective genome, “ESN” (Esan, Nigeria), “GWD” (Gambian, Western Division), “LWK” (Luhya, Kenya), “MSL” (Mende, Sierra Leone), and “YRI” (Yoruba, Nigeria) report the number of IBD segments exclusively shared by Africans that are also shared by at least one individual from the respective subpopulation. See Supplementary Material for details about the subpopulations.

Number of Neandertal- and Denisovan-Matching IBD Segments Shared Exclusively by a Particular Continental Population for Each Chromosome Note.—The column labeled “Chr.” gives the chromosome, and “Neandertal” and “Denisovan” group IBD segments matching the Neandertal and Denisovan genomes, respectively. “ALL” gives the total number of IBD segments matching the respective ancient genome, “EAS”, “SAS”, “EUR”, and “AFR” report the number of matching IBD segments shared exclusively by East Asians, South Asians, Europeans, and Africans, respectively, and the percentage compared to the total number of IBD segments matching the respective ancient genome. Number of Neandertal- and Denisovan-Matching IBD Segments Shared Exclusively by Africans that are Observed in a Particular Subpopulation Note.—The rows present results for Neandertal on the autosomes and chromosome X, as well as, results for the Denisovan on the autosomes and chromosome X, respectively. The column labeled “AFR” gives the total number of IBD segments that are exclusively shared by Africans and match the respective genome, “ESN” (Esan, Nigeria), “GWD” (Gambian, Western Division), “LWK” (Luhya, Kenya), “MSL” (Mende, Sierra Leone), and “YRI” (Yoruba, Nigeria) report the number of IBD segments exclusively shared by Africans that are also shared by at least one individual from the respective subpopulation. See Supplementary Material for details about the subpopulations. Next, we consider the lengths of IBD segments of different continental populations in order to compare their age. Since, as mentioned in “Exponentially Distributed IBD Lengths” section, shorter segments are assumed to be older than longer ones, IBD segments that originate from events involving ancestors of modern humans and ancestors of Neandertals and/or Denisovans within Africa should be shorter than those from later interbreeding outside of Africa. Figure 4 shows densities of lengths of IBD segments that match the Neandertal/Denisovan genome and are private to Africans versus IBD segments that are not observed in Africans. With the major density peak (the mode of the density) at about 5,700 and 5,000 bp, respectively, Neandertal- and Denisovan-matching IBD segments are shorter when they are shared by Africans than when they are shared by non-Africans, where the peaks are at about 12,500 and 12,000 bp, respectively. Neandertal- and Denisovan-matching IBD segments that are only shared by non-African populations are clearly longer and therefore younger than those shared between Africans and the ancient genomes. This supports previous reports of interbreeding after migration out of Africa (Green et al. 2010; Reich et al. 2010; Meyer et al. 2012; Sankararaman et al. 2012, 2014, 2016; Yang et al. 2012; Lohse and Frantz 2014; Prüfer et al. 2014). On the other hand, the short Neandertal- and Denisovan-matching IBD segments found exclusively in Africans are considerably older and therefore probably originate from within Africa.
F

Population specific IBD segment lengths for Neandertal- and Denisovan-matching segments. Densities of lengths of IBD segments on the autosomes that match the Neandertal (left) or Denisovan (right) genome and are private to Africans (red) versus IBD segments matching ancient genomes that are not observed in Africans (blue). The dotted lines emphasize peaks of the densities. Compared with those of non-Africans, African IBD segments that match an ancient genome are enriched in regions of shorter segment lengths.

Population specific IBD segment lengths for Neandertal- and Denisovan-matching segments. Densities of lengths of IBD segments on the autosomes that match the Neandertal (left) or Denisovan (right) genome and are private to Africans (red) versus IBD segments matching ancient genomes that are not observed in Africans (blue). The dotted lines emphasize peaks of the densities. Compared with those of non-Africans, African IBD segments that match an ancient genome are enriched in regions of shorter segment lengths. If we compare the densities of lengths of IBD segments that match the Neandertal/Denisovan genome and are private to Africans versus IBD segments that are observed in Africans but not exclusively, we also see a peak at 12,000 and 11,000 bp for Neanderthal- and Denisovan-matching IBD segments, respectively, that are observed in Africans but not exclusively (see supplementary fig. S5, Supplementary Material online). These peaks may represent segments that stem from later interbreedings outside of Africa but entered the African genome via a recent Eurasian backflow as reported previously (Wang et al. 2013; Prüfer et al. 2014; Llorente et al. 2015). Similarly, if we compare the densities of lengths of IBD segments that match the Neandertal/Denisovan genome and are private to Africans versus IBD segments that are observed outside of Africa but not exclusively, we see that the densities of IBD segments observed outside of Africa are shifted to the left if compared with densities of IBD segments not observed in Africans (see supplementary fig. S6, Supplementary Material online vs. fig. 4). The shorter segments that cause this shift may indicate remnants of the older segments that originate from within Africa.

Discussion

There are different explanations for our results that concern the origin of segments that are shared between Africans and the Neandertal and/or Denisovan genome. One interpretation of our results is that ancestors of humans and ancient hominins interbred within Africa. The hypothesis of ancient substructures in Africa with limited gene flow between subpopulations of hominins (Plagnol and Wall 2006; Slatkin 2008; Yang et al. 2012) does not contradict this interbreeding. Neandertals and Denisovans could be more closely related to Africans than to out-of-Africa populations because of more interactions between their ancestors. In this case, since the ancestors of Africans and Neandertals/Denisovans were not clearly separated, this could be considered “admixture” rather than “interbreeding”. Another interpretation of the extensive IBD sharing between Africans and ancient genomes is that these shared IBD segments originate from a common ancestor of Neandertals/Denisovans and humans. They can only be found in modern Africans due to incomplete lineage sorting. According to this scenario, the detected IBD segments arose first in the population that existed prior to the ancient separation of Neandertals, Denisovans and modern humans, but were relatively rare. Consequently they survived in both archaic humans and in present-day Africans, while drifting to a very low frequency in non-Africans. On average, these IBD segments are tagged by more than 20 rare variants. Surprisingly the individuals analyzed possess either all of these variants or none of them while individuals with only some of these rare variants are missing. In our opinion, it is unlikely that, for thousands of IBD segments, only these extremes survived while all the intermediate cases died out completely. Consequently, we assume that the source population was separated from all the other populations for a long time and, therefore, acquired such a high number of mutations. We do not know, whether the separated population was already a Neandertal, a Denisovan, their ancestor, or a different hominin. We cannot rule out, that the IBD segments also existed in ancestors of modern Eurasians and were lost due to strong genetic drift. However, our results suggest an interbreeding within Africa that involved a population that was isolated for an extended period of time. This early interbreeding can still be detected via IBD segments that are shared between Africans and Neandertals and/or Denisovans. Another explanation without introgression in Africa is back-to-Africa gene flow as suggested by Prüfer et al. (2014), Wang et al. (2013), and Llorente et al. (2015) for Neandertal ancestry in Africans. However, (1) the large number of IBD segments shared exclusively between the ancient genomes and Africans, (2) their even distribution across all African populations analyzed, and (3) their short length contradict this hypothesis. Besides the short IBD segments shared with Africans, the other interesting finding is, that there are more IBD segments shared between South Asians and Neandertals and/or Denisovans than between other non-African populations and these ancient genomes. Recent investigations found that South Asians share a surprisingly high amount of DNA with the Denisovan genome (Sankararaman et al. 2016). In our analysis the amount is even higher. Possible explanations for this finding are as follows: (1) additional interbreeding events with ancestors of South Asians, (2) introduction of IBD segments from ancient genomes into other non-African populations via South Asians and not directly, and (3) combinations of bottlenecks, genetic drift, and different selective pressures. As supplementary figure S7, Supplementary Material online, shows, the length distributions of IBD segments shared between Neandertals or Denisovans and different non-African populations are very similar. Thus, a separate interbreeding event at a different time point seems to be unlikely. However, based on the IBD segment length distributions, we cannot rule out multiple interbreeding events with different human populations in a relatively short time period. Because the IBD segment lengths are determined by the number of crossovers since the interbreeding, a single cannot be distinguishing from multiple events. According to the second explanation, Denisovans and Neandertals interbred only with the ancestors of South Asians. In the following, South Asians introduced parts of the ancient genomes into the genomes of Europeans and East Asians via later admixture events. This explanation might account for the low number of Denisovan-matching segments found in Europeans. However, we think it is unlikely that it is the reason for the considerably larger number of IBD segments shared exclusively between Neandertals and Europeans/East Asians and between Denisovans and East Asians. The third explanation is based on the idea that the interbreeding occurred between the ancient hominins and the common ancestors of Eurasians. Consequently, different populations went through different bottlenecks, experienced various selective pressures and admixtures with other human populations. Because of the aforementioned factors, ancient DNA that had entered the human gene pool survived in one population while it has been lost in another population, or has been diluted in the third population. This would explain why some of the detected IBD segments are present in all three non-African populations, some only in two out of three, and others are exclusively found in a single continental population. Previous results (Vernot and Akey 2014, 2015; Kim and Lohmueller 2015) showed that above factors probably are not the only reason for varying levels of Neandertal- and Denisovan-ancestry in different non-African populations, however, we assume that at least they affected IBD sharing between modern humans and ancient genomes.

Conclusion

Using the 1000 Genomes Phase 3 data to represent the human genome and the sequenced genomes of a Neandertal and a Denisovan, we analyzed genetic relationships between these hominins. We characterized these relations by means of shared DNA segments that are identical by descent, focusing on low frequency and rare variants to identify very short IBD segments with high confidence. Since shorter IBD segments are presumably older than longer ones, the segments we extracted reveal events from the very distant past. We found short IBD segments that match the Neandertal and/or Denisovan genome and are shared mainly by Africans. These segments may either stem from a common ancestor with subsequent incomplete lineage sorting or more likely from an interbreeding of ancestors of humans and other ancient hominins within Africa. DNA segments which originate from a common history within Africa can also help to improve the reconstruction of human evolution outside of Africa. Furthermore, these segments have the potential to lead to new insights into adaptation processes at different time points in human history.

Supplementary Material

Supplementary figures S1–S7 and tables S1 and S2 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).

Acknowledgements

We thank Günter Klambauer and Ulrich Bodenhofer for helpful discussions, proofreading, and comments on this manuscript. Furthermore, we thank the NVIDIA corporation for the GPU donation that facilitated our work. Funding for open access charge: Funds from the Institute of Bioinformatics, Johannes Kepler University Linz. Click here for additional data file.
  61 in total

1.  Archaic human ancestry in East Asia.

Authors:  Pontus Skoglund; Mattias Jakobsson
Journal:  Proc Natl Acad Sci U S A       Date:  2011-10-31       Impact factor: 11.205

Review 2.  Linkage disequilibrium--understanding the evolutionary past and mapping the medical future.

Authors:  Montgomery Slatkin
Journal:  Nat Rev Genet       Date:  2008-06       Impact factor: 53.242

3.  The derived FOXP2 variant of modern humans was shared with Neandertals.

Authors:  Johannes Krause; Carles Lalueza-Fox; Ludovic Orlando; Wolfgang Enard; Richard E Green; Hernán A Burbano; Jean-Jacques Hublin; Catherine Hänni; Javier Fortea; Marco de la Rasilla; Jaume Bertranpetit; Antonio Rosas; Svante Pääbo
Journal:  Curr Biol       Date:  2007-10-18       Impact factor: 10.834

4.  Gene flow from North Africa contributes to differential human genetic diversity in southern Europe.

Authors:  Laura R Botigué; Brenna M Henn; Simon Gravel; Brian K Maples; Christopher R Gignoux; Erik Corona; Gil Atzmon; Edward Burns; Harry Ostrer; Carlos Flores; Jaume Bertranpetit; David Comas; Carlos D Bustamante
Journal:  Proc Natl Acad Sci U S A       Date:  2013-06-03       Impact factor: 11.205

5.  cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate.

Authors:  Djork-Arné Clevert; Andreas Mitterecker; Andreas Mayr; Günter Klambauer; Marianne Tuefferd; An De Bondt; Willem Talloen; Hinrich Göhlmann; Sepp Hochreiter
Journal:  Nucleic Acids Res       Date:  2011-04-12       Impact factor: 16.971

6.  The complete genome sequence of a Neanderthal from the Altai Mountains.

Authors:  Kay Prüfer; Fernando Racimo; Nick Patterson; Flora Jay; Sriram Sankararaman; Susanna Sawyer; Anja Heinze; Gabriel Renaud; Peter H Sudmant; Cesare de Filippo; Heng Li; Swapan Mallick; Michael Dannemann; Qiaomei Fu; Martin Kircher; Martin Kuhlwilm; Michael Lachmann; Matthias Meyer; Matthias Ongyerth; Michael Siebauer; Christoph Theunert; Arti Tandon; Priya Moorjani; Joseph Pickrell; James C Mullikin; Samuel H Vohr; Richard E Green; Ines Hellmann; Philip L F Johnson; Hélène Blanche; Howard Cann; Jacob O Kitzman; Jay Shendure; Evan E Eichler; Ed S Lein; Trygve E Bakken; Liubov V Golovanova; Vladimir B Doronichev; Michael V Shunkov; Anatoli P Derevianko; Bence Viola; Montgomery Slatkin; David Reich; Janet Kelso; Svante Pääbo
Journal:  Nature       Date:  2013-12-18       Impact factor: 49.962

7.  HapFABIA: identification of very short segments of identity by descent characterized by rare variants in large sequencing data.

Authors:  Sepp Hochreiter
Journal:  Nucleic Acids Res       Date:  2013-10-29       Impact factor: 16.971

8.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  Neanderthal ancestry drives evolution of lipid catabolism in contemporary Europeans.

Authors:  Ekaterina E Khrameeva; Katarzyna Bozek; Liu He; Zheng Yan; Xi Jiang; Yuning Wei; Kun Tang; Mikhail S Gelfand; Kay Prufer; Janet Kelso; Svante Paabo; Patrick Giavalisco; Michael Lachmann; Philipp Khaitovich
Journal:  Nat Commun       Date:  2014-04-01       Impact factor: 14.919

View more
  4 in total

1.  Global Picture of Genetic Relatedness and the Evolution of Humankind.

Authors:  Gennady V Khvorykh; Oleh A Mulyar; Larisa Fedorova; Andrey V Khrunin; Svetlana A Limborska; Alexei Fedorov
Journal:  Biology (Basel)       Date:  2020-11-10

2.  Temporal population structure, a genetic dating method for ancient Eurasian genomes from the past 10,000 years.

Authors:  Sara Behnamian; Umberto Esposito; Grace Holland; Ghadeer Alshehab; Ann M Dobre; Mehdi Pirooznia; Conrad S Brimacombe; Eran Elhaik
Journal:  Cell Rep Methods       Date:  2022-08-22

3.  Rectified factor networks for biclustering of omics data.

Authors:  Djork-Arné Clevert; Thomas Unterthiner; Gundula Povysil; Sepp Hochreiter
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

4.  Multiple selective sweeps of ancient polymorphisms in and around LTα located in the MHC class III region on chromosome 6.

Authors:  Michael C Campbell; Bryan Ashong; Shaolei Teng; Jayla Harvey; Christopher N Cross
Journal:  BMC Evol Biol       Date:  2019-12-02       Impact factor: 3.260

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.