Literature DB >> 30642962

Cross-species hybridization and the origin of North African date palms.

Jonathan M Flowers1,2, Khaled M Hazzouri1,3, Muriel Gros-Balthazard1, Ziyi Mo1, Konstantina Koutroumpa4, Andreas Perrakis5, Sylvie Ferrand1, Hussam S M Khierallah6, Dorian Q Fuller7, Frederique Aberlenc8, Christini Fournaraki5, Michael D Purugganan9,2.   

Abstract

Date palm (Phoenix dactylifera L.) is a major fruit crop of arid regions that were domesticated ∼7,000 y ago in the Near or Middle East. This species is cultivated widely in the Middle East and North Africa, and previous population genetic studies have shown genetic differentiation between these regions. We investigated the evolutionary history of P. dactylifera and its wild relatives by resequencing the genomes of date palm varieties and five of its closest relatives. Our results indicate that the North African population has mixed ancestry with components from Middle Eastern P. dactylifera and Phoenix theophrasti, a wild relative endemic to the Eastern Mediterranean. Introgressive hybridization is supported by tests of admixture, reduced subdivision between North African date palm and P. theophrasti, sharing of haplotypes in introgressed regions, and a population model that incorporates gene flow between these populations. Analysis of ancestry proportions indicates that as much as 18% of the genome of North African varieties can be traced to P. theophrasti and a large percentage of loci in this population are segregating for single-nucleotide polymorphisms (SNPs) that are fixed in P. theophrasti and absent from date palm in the Middle East. We present a survey of Phoenix remains in the archaeobotanical record which supports a late arrival of date palm to North Africa. Our results suggest that hybridization with P. theophrasti was of central importance in the diversification history of the cultivated date palm.
Copyright © 2019 the Author(s). Published by PNAS.

Entities:  

Keywords:  archaeobotany; crop wild relative; domestication; introgression; range expansion

Mesh:

Substances:

Year:  2019        PMID: 30642962      PMCID: PMC6358688          DOI: 10.1073/pnas.1817453116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


Domesticated crops are among the most evolutionarily successful species in the world. From geographically restricted centers of origin, many domesticated species have dramatically and rapidly expanded their ranges, adapting to new environments and cultures within the span of a few hundred to thousands of generations. The precise genetic and evolutionary mechanisms that allow crop species to adapt to multiple environments remain unclear, but understanding these mechanisms provides insights into the nature of species range expansion and the dynamics of human cultural evolution since the Neolithic. Identifying how crop species range expansion has occurred in the past may also suggest new approaches for future crop breeding efforts, especially in light of climate change. The date palm (Phoenix dactylifera L.) is a dioecious species in the Arecaceae (formerly Palmae) family and the most important fruit-bearing crop in arid regions of the Middle East and North Africa. Date palms grow primarily in hot arid habitats including desert oases or well-irrigated small farms or plantations where they are propagated via a mixed clonal–sexual system. The traditional range of date palm cultivation extends from Morocco to Egypt in North Africa; the Arabian Peninsula, Iraq, and Iran in the Middle East; and Pakistan in South Asia (1) (Fig. 1). Population genetic analysis using microsatellites (2–6) and whole-genome sequences (6–8) indicates that Middle Eastern date palms are genetically differentiated from North African P. dactylifera, with a possible hybrid zone in Egypt and the Sudan (5, 7).
Fig. 1.

The geographic distribution and phylogeny of cultivated date palm (P. dactylifera) and its wild relatives. (A) Approximate geographic distribution of Phoenix species included in this study (27). (B) Maximum-likelihood phylogeny of Phoenix species based on a subset of SNPs from whole-genome sequencing data. Support values are the percentage of RAxML bootstrap replicates that support each node. The tree is midpoint rooted.

The geographic distribution and phylogeny of cultivated date palm (P. dactylifera) and its wild relatives. (A) Approximate geographic distribution of Phoenix species included in this study (27). (B) Maximum-likelihood phylogeny of Phoenix species based on a subset of SNPs from whole-genome sequencing data. Support values are the percentage of RAxML bootstrap replicates that support each node. The tree is midpoint rooted. Date palms are one of the earliest domesticated tree crops. The oldest evidence of exploitation comes from seed remains excavated from Dalma Island, Abu Dhabi, and Kuwait that date to the Arabian Neolithic ∼7,000 y before present (yBP) (9). Evidence of date palm cultivation appears somewhat later in Mesopotamia at ∼6,700–6,000 yBP (10, 11) and in the Levant at ∼5,700–5,500 yBP (12). The early evidence of cultivation at the eastern edge of the Fertile Crescent and the Upper Arabian Gulf supports an ancient center of origin of date palm domestication in this region (1, 9, 12, 13). Evidence for morphological change in archaeological date stones suggests selection of larger-fruited domesticated forms took place between ∼5,000 yBP and 2,000 yBP in the Near East (14). The sugar date palm, Phoenix sylvestris (L.) Roxb., is the sister species of P. dactylifera (1, 15), but unlikely to be the wild progenitor of domesticated date palm (15, 16). A recent study identified wild P. dactylifera populations in Oman, which may represent a relictual population of the wild progenitor of cultivated date palms in the Middle East (6). Alternate origin hypotheses propose a domestication center of date palm in North Africa (1, 17, 18). These hypotheses are poorly supported by the archaeobotanical record, however, as evidence of date palm cultivation appears ∼3,000 y later in the records of this region compared with the Middle East. P. dactylifera remains in the Nile Valley are found as early as the predynastic period, more than 5,000 yBP (19, 20), but cultivation may not have begun until the Middle Kingdom, ∼4,000 yBP (14, 21). In Libya, cultivation began ∼2,800–2,400 yBP (18, 22). Date palm is unknown from elsewhere in the Saharan Maghreb and the sub-Saharan Sahel until much later (23–25). The archaeobotanical record therefore is consistent with a late arrival of date palm to North Africa and suggests a model where date palm was domesticated in the Near or Middle East and later expanded to the African continent (26). Paradoxically, however, nucleotide diversity in North African date palm is at least 20% higher than that in the Middle East (6, 7) and diversity at microsatellite loci is comparable between the two populations (5). These observations are inconsistent with a bottleneck associated with founder-effect colonization in North Africa. Evidence of an unknown ancestry component in North African cultivated date palm further implies a more complex history in this region (6). The genetic distinctiveness of North African date palms, their absence from early archaeological sites in North Africa, and elevated levels of nucleotide diversity perpetuate the enigma of the origins of date palms in the western part of their range. To examine the origin of North African date palms, we resequenced a large sample of cultivars from Morocco to Pakistan and five wild Phoenix relatives that occur either peripatrically or allopatrically to cultivated date palm (Fig. 1). Here we present evidence that the North African population is the product of introgressive hybridization between cultivated date palm and the Cretan date palm Phoenix theophrasti Greuter, a species endemic to Crete and the Eastern Mediterranean. We demonstrate that introgression has been central in shaping patterns of diversity genome-wide, which supports introgression as being an important factor that shaped the domestication history of date palm. The growing list of examples of interspecific hybridization associated with domestication suggests that hybridization is a common evolutionary genetic mechanism for the adaptation of both annual and perennial crops.

Results

Plant Samples, Nucleotide Diversity, and the Phylogeny of Phoenix Species.

We resequenced the genomes of 71 cultivated date palm varieties and multiple genomes from each of its 5 closest wild relatives in the genus Phoenix to address questions about the origin of the domesticated date palm (P. dactylifera). Our sample set included common varieties of dates from its traditional range of cultivation (7) and 2–18 individuals from wild relatives that include P. sylvestris; P. theophrasti; Phoenix atlantica A. Chev.; Phoenix canariensis hort. ex Chabaud; and a single member of a putative outgroup species from sub-Saharan Africa, Phoenix reclinata Jacq. (). Samples were sequenced to moderate to deep coverage (5–56, mean = 22; ) using 2 100 bp paired-end Illumina sequencing. The close relationships of the wild relatives to date palm allowed us to align >96% of short reads of each sequenced genome () to the draft assembly of the date palm genome (28). We identified 14,402,469 single-nucleotide polymorphisms (SNPs) across Phoenix species which we used in population and phylogenomic analysis. Reconstruction of the phylogeny of date palm and its wild relatives using the whole-genome SNP data supported the close relationship of P. sylvestris, P. theophrasti, P. atlantica, and domesticated date palm P. dactylifera, (the “date palm group”) (1). P. canariensis and P. reclinata are more distant relatives (Fig. 1), consistent with previous analyses (1, 15) and the Cape Verde Islands endemic, P. atlantica, is a member of a well-supported P. dactylifera clade consistent with the samples in our analysis being feral date palms (1, 6). Within the date palm group, P. sylvestris is the sister species of date palm, and P. theophrasti is a more diverged species (Fig. 1). Estimates of nucleotide diversity in the Phoenix wild relatives suggest P. canariensis ( = 0.0117) > P. sylvestris ( = 0.0105) > P. theophrasti ( = 0.0072), while estimates in the two populations of date palm indicate higher diversity in North Africa compared with the Middle East () as previously reported (6, 7).

Population Clustering Suggests That North African Date Palms Are Admixed.

To identify genetic clusters in P. dactylifera and its wild relatives, we conducted model-based clustering with the program STRUCTURE (29). STRUCTURE runs with the independent allele-frequency model produced clusters at K = 2 to K = 4 that showed evidence of mixed ancestry of North African date palm. Approximately 5–15% of the North African sample genomes were shared with P. theophrasti and the remainder with Middle Eastern date palm (Fig. 2 and ). Higher K values did not assign samples to any additional meaningful sources.
Fig. 2.

North African date palm has mixed ancestry from Middle Eastern date palm and a wild relative, P. theophrasti. Shown is a STRUCTURE (29) diagram for K = 2 to K = 4 using the independent allele-frequency model.

North African date palm has mixed ancestry from Middle Eastern date palm and a wild relative, P. theophrasti. Shown is a STRUCTURE (29) diagram for K = 2 to K = 4 using the independent allele-frequency model. Running STRUCTURE with the independent model may underestimate the number of clusters. We therefore repeated the analysis with the correlated allele-frequency model (). K = 2 to K = 4 produced qualitatively similar results as above. However, At K = 5 and higher, a third ancestry component appears in the North African population which does not trace to any population in our analysis (). This could represent an additional source of variation in North Africa or could be attributable to known artifacts of the correlated frequencies model, which is subject to oversplitting at higher K values (29, 30). Finally, we conducted a set of analyses restricted to species pairs (“hierarchical” analysis) (31). Pairwise STRUCTURE runs limited to date palm samples and those from a single wild relative species yielded results qualitatively similar to the corresponding correlated and independent allele-frequency analyses that included all samples. The analysis restricted to P. theophrasti and date palm also supported the mixed ancestry of North African samples and this wild relative (). STRUCTURE analyses are therefore consistent with the North African population being admixed between cultivated date palm and this wild relative or a P. theophrasti-like population.

Tests of Admixture Indicate North African Date Palms Are Products of Interspecific Hybridization.

Evidence of mixed ancestry of date palms in the STRUCTURE analysis is suggestive of hybridization, but interpretation must be treated with caution (30). We therefore conducted explicit tests of admixture to establish whether North African date palms are indeed the product of interspecific hybridization between Middle Eastern P. dactylifera and the wild Cretan species, P. theophrasti. First, we performed ABBA–BABA, or D tests (32, 33), to test for excesses of shared derived polymorphism that can be attributed to gene flow between species. We conducted all combinations of tests among date palm and its wild relatives by assuming the population configuration D(P1,P2,P3,O), where P1 and P2 are Middle Eastern and North African populations of domesticated date palms, respectively; P3 is a wild relative; and P. reclinata is the outgroup (O). A positive D statistic in this configuration indicates gene flow between North African date palm (P2) and a wild relative (P3), while negative tests indicate gene flow between Middle Eastern date palm (P1) and a wild species. We observe a positive D statistic for the test with P. theophrasti as the wild relative (D = 0.58) (SE 0.02; Z = 37.24; Fig 3 and ), which indicates an excess of shared derived alleles between P. theophrasti and North African date palm and suggests interspecific hybridization between this wild relative and date palm. This result is robust to the choice of outgroup, as replacing P. reclinata with P. canariensis produces a similar test outcome (Fig. 3 and ). Tests where samples from the Maghreb countries are replaced with cultivars from Egypt also show an excess of derived allele sharing with P. theophrasti although the degree of sharing is less than in cultivars sampled from elsewhere in North Africa (). D tests including P. sylvestris and P. canariensis are negative and positive, respectively, possibly supporting additional gene flow with date palm and other wild relatives. However, these are minor contributions compared with those of P. theophrasti and may represent hybridization in anthropogenic contexts (Fig. 3 and ).
Fig. 3.

Admixture between North African date palm and P. theophrasti. (A) D statistics SE. (B) statistics SE. (C and D) Maximum-likelihood tree based on TreeMix with block size of 3,000 SNPs for (C) zero (m = 0) and (D) one migration (m = 1) event. These models explain 98.6% and 99.9% of the variance in relatedness among populations. The migration edge pointing from P. theophrasti to North African date palm has a mixture weight of 15.7%. Abbreviations: Af, North Africa; can, P. canariensis; ME, Middle East; rec, P. reclinata; syl, P. sylvestris; and the, P. theophrasti.

Admixture between North African date palm and P. theophrasti. (A) D statistics SE. (B) statistics SE. (C and D) Maximum-likelihood tree based on TreeMix with block size of 3,000 SNPs for (C) zero (m = 0) and (D) one migration (m = 1) event. These models explain 98.6% and 99.9% of the variance in relatedness among populations. The migration edge pointing from P. theophrasti to North African date palm has a mixture weight of 15.7%. Abbreviations: Af, North Africa; can, P. canariensis; ME, Middle East; rec, P. reclinata; syl, P. sylvestris; and the, P. theophrasti. Second, we tested for admixture between date palm and its wild relatives with tests (34, 35). The statistic tests whether a population is the product of admixture between two reference, or source, populations with a significantly negative statistic supporting admixture and all other outcomes being uninformative about the admixture history (34). The test with North African date palm as the test population and P. theophrasti and Middle Eastern date palm as reference populations was negative and suggests that North African date palms have a mixed ancestry from these two populations [(North Africa; Middle East, P. theophrasti) = −0.15; = 19.8; Fig. 3 and ). All other tests either did not differ significantly from zero or were positive (Fig. 3 and ). The negative (North Africa; Middle East, P. theophrasti) test supports admixture between date palm and a wild species, and also suggests that the Middle Eastern population or a Middle Eastern-like population is a source of alleles segregating in North Africa. We modeled the population history of date palm and its relatives with the maximum-likelihood method TreeMix (36). This modeling approach infers the population tree based on the covariance in allele frequencies among populations and incorporates admixture through the addition of migration edges between populations that are poor fits to the strict tree model. To determine whether gene flow between date palm and P. theophrasti is supported by the data, we first inferred the maximum-likelihood tree without migration (Fig. 3). A model rooted with P. reclinata yielded the same relationships among Phoenix species as the maximum-likelihood phylogeny reported above (Fig. 1) and explained 98.6% of the variance in relatedness between populations. Addition of a single migration event to the model predicted gene flow between P. theophrasti and the lineage leading to the North African population of date palm (Fig. 3) and increased the percentage of variance in relatedness explained to 99.9%. This migration edge was stable in independent runs of TreeMix with both different species included in the model and with different block sizes to account for linkage disequilibrium (LD) (). Together, the admixture tests and TreeMix results suggest that the ancestry of North African date palm can be traced to Middle Eastern date palm and P. theophrasti.

Mixed Ancestry of North African Date Palm.

We estimated the percentage of ancestry of North African date palms derived from P. theophrasti and Middle Eastern date palm using statistics (35) and TreeMix mixture weights (36). First, the ratio of appropriate statistics can provide an estimate of the proportion of North African ancestry that is derived from the Middle East and P. theophrasti by assuming the phylogenetic relationships in Fig. 1. We estimated the percentage of ancestry () derived from Middle Eastern date palm at 82% (SE 0.01; ) and the P. theophrasti fraction as 1 – = 18%. Repeating the estimation of by replacing samples from the Maghreb with Egyptian samples as the test population yielded a smaller P. theophrasti and larger Middle Eastern component in these samples ( = 95%) (SE 0.007; ). Second, TreeMix models with gene flow provide mixture weights estimates for each migration edge, which approximate ancestry proportions (36). A mixture weight of 15.7% on the migration edge from P. theophrasti to North African date palm with m = 1 (Fig. 3) is comparable to our estimate from the -ratio approach. These results suggest that 5–18% of the North African date palm genome is derived from the Cretan date palm with varieties from countries west of Egypt sharing greater ancestry with this wild relative.

Introgressed Regions in the North African Date Palm Genome.

We identified introgressed genomic segments in the North African population using a combination of approaches. We traced individual alleles in this population to their most likely source population by characterizing variant sites that are fixed for alternate alleles in the Middle Eastern population and P. theophrasti. This analysis yielded 1,556,435 nucleotide fixations, of which 90.2% are polymorphic in North African P. dactylifera. Of the sites segregating for a Middle Eastern-like and P. theophrasti-like allele in this population, the latter is typically the minor allele with most observed at low to moderate frequency. A total of 1,252,999 of 1,404,273, or 89.2% of these polymorphic sites in North Africa, have a P. theophrasti-like allele frequency of 30% or less. Only 9.8% of sites that are fixed between Middle Eastern date palm and P. theophrasti are fixed for the Middle Eastern-like allele in North Africa and none are fixed for the P. theophrasti-like variant. To characterize regions with introgressed haplotypes that have risen to moderate-to-high frequency, we estimated the introgression fraction (fd) for the population configuration D(Middle East, North Africa, P. theophrasti, P. reclinata) (above and and ref. 37). A positive fd implies introgression from P. theophrasti into North African date palm. We identified blocks of two or more consecutive 5-kb intervals with an fd in the upper 10th percentile of the genome-wide distribution of this statistic. Applying this criterion, we identified 1,281 introgressed segments of 10 kb or larger, which totaled 24.6 Mb of the draft assembly. The median length of tracts with outlier fd was 15 kb and the largest region was 105 kb on scaffold NW_008246809.1. This approach yields an underestimate of tract lengths owing, in part, to the current fragmented state of the draft assembly. Moreover, the signature of introgression is not limited to these outlier regions as evidenced by an elevated introgression fraction and altered patterns of diversity throughout much of the genome (below). Fig. 4 shows fd in sliding windows across the longest genome scaffolds. Fig. 4 shows an example 3-Mb region with both elevated fd and Tajima’s D in North Africa and the incidence of high-frequency theophrasti-like alleles in this region. A gene tree reconstructed from phased genotypes confirmed the existence of shared haplotypes between P. theophrasti and North African date palm cultivars in this region (Fig. 4).
Fig. 4.

Introgressed regions in North African date palm. (A) The introgression fraction (fd) in 5-kb nonoverlapping windows from the 30 longest scaffolds in the draft assembly. (B) Population genetic summary statistics of North African date palm for scaffold NW_008246512.1 in 5-kb windows. B, Bottom shows the frequency of P. theophrasti-like alleles (main text) segregating in North Africa. Each circle represents a SNP. Curves were fitted to the data using local polynomial regression (LOESS). (C) Neighbor-joining 50% majority rule tree based on phased haplotypes in a 5-kb interval at position 1,100 kb of the scaffold in B. Colored circles represent the population origin of phased sequences. The single haplotype from the Middle East in the P. theophrasti/North Africa clade bearing the shared haplotype is from a Pakistani variety (Aseel) with a mixed Middle Eastern/North African ancestry (7). Single haplotypes from each of two probable hybrids (a P. theophrasti sample from Almyros, Crete and a P. sylvestris sample from Faisalabad, Pakistan) are nested within P. dactylifera clades as are haplotypes from two P. atlantica samples.

Introgressed regions in North African date palm. (A) The introgression fraction (fd) in 5-kb nonoverlapping windows from the 30 longest scaffolds in the draft assembly. (B) Population genetic summary statistics of North African date palm for scaffold NW_008246512.1 in 5-kb windows. B, Bottom shows the frequency of P. theophrasti-like alleles (main text) segregating in North Africa. Each circle represents a SNP. Curves were fitted to the data using local polynomial regression (LOESS). (C) Neighbor-joining 50% majority rule tree based on phased haplotypes in a 5-kb interval at position 1,100 kb of the scaffold in B. Colored circles represent the population origin of phased sequences. The single haplotype from the Middle East in the P. theophrasti/North Africa clade bearing the shared haplotype is from a Pakistani variety (Aseel) with a mixed Middle Eastern/North African ancestry (7). Single haplotypes from each of two probable hybrids (a P. theophrasti sample from Almyros, Crete and a P. sylvestris sample from Faisalabad, Pakistan) are nested within P. dactylifera clades as are haplotypes from two P. atlantica samples. We conducted a phylogenetic analysis of chloroplast (cpDNA) and mitochondrial (mtDNA) genomes to examine patterns of introgression in organellar genomes and evaluate whether cultivated date palm P. dactylifera or P. theophrasti served as the maternal progenitor in the interspecific cross. Both neighbor-joining and maximum-likelihood analyses of cpDNA genome sequences largely supported the species phylogeny recovered from the whole-genome analysis (). P. theophrasti samples formed a distinct clade indicating that cpDNA from this species is not introgressed in North Africa. Middle Eastern and North African date palms appear as distinct and well-supported lineages, although the Middle Eastern haplotype is also found in some North African samples, a result consistent with a previous report (5). While the North African clade is distinct and well supported, it shares a most recent common ancestor with P. sylvestris. Single P. theophrasti (Gölköy, Turkey) and P. sylvestris (Faisalabad, Pakistan) samples that were admixed with date palm in the STRUCTURE analysis based on nuclear genotypes (Fig. 2) possess P. dactylifera cpDNA haplotypes () consistent with their being hybrid samples. Similar phylogenetic relationships were apparent in the analysis of the mtDNA genome (). The distinctness of the P. theophrasti cpDNA and mtDNA haplotypes and their absence from North African date palm samples suggest an asymmetry in the direction of the interspecific cross and indicate that P. theophrasti was the paternal (pollen) contributor.

Impact of Introgression on Genetic Diversity in North Africa.

Introgression has impacted genome-wide patterns of genetic diversity in North African date palm. Higher nucleotide diversity in the North African population (P 2.2 × 10−16; two-tailed Wilcoxon signed-rank test, Fig. 5 and ref. 7) could be explained by introgression. To determine whether introgression has contributed to elevated diversity in North Africa, we rank ordered the genome based on fd and found that highly introgressed regions show higher levels of nucleotide diversity in North Africa (). However, regions with little or no evidence of introgression on average also show higher diversity in North Africa, an observation that suggests that hybridization alone may be insufficient to account for elevated diversity in North Africa. An effect of introgression on Tajima’s D is also apparent. At the whole-genome level, Tajima’s D is higher in North Africa than in the Middle East (P 2.2 × 10−16, ). This appears to be driven at least in part by introgression as elevated Tajima’s D is most pronounced in regions with highest fd ().
Fig. 5.

Patterns of genetic diversity in North African and Middle Eastern date palm populations. (A) Boxplot distributions of Fst based on 5-kb nonoverlapping intervals between pairs of populations (M.E., Middle East; N.A., North Africa; the, P. theophrasti). (B) Distribution of nucleotide diversity in Middle Eastern and North African date palm populations. (C) Decay of LD calculated separately for each population. Symbols on decay curves represent the half-decay distance at 20.5 kb and 30.8 kb for Middle Eastern and North African populations.

Patterns of genetic diversity in North African and Middle Eastern date palm populations. (A) Boxplot distributions of Fst based on 5-kb nonoverlapping intervals between pairs of populations (M.E., Middle East; N.A., North Africa; the, P. theophrasti). (B) Distribution of nucleotide diversity in Middle Eastern and North African date palm populations. (C) Decay of LD calculated separately for each population. Symbols on decay curves represent the half-decay distance at 20.5 kb and 30.8 kb for Middle Eastern and North African populations. The impact of introgression is also apparent in measures of population differentiation and LD. Using Fst as a measure of population subdivision, we observe that the two regional populations of date palm are moderately diverged (Fst = 0.085). However, the North African population is less diverged from P. theophrasti (Fst = 0.369) than is the Middle Eastern population (Fst = 0.403; P 2.2 × 10–16, two-tailed Wilcoxon signed-rank test, Fig. 5). When we compare Fst across genomic intervals ranked by fd, we observe that the reduction in Fst between North African date palm and P. theophrasti is most pronounced in regions with a high introgression fraction (). Moreover, Fst between Middle Eastern and North African populations is elevated in these same regions (), thus suggesting that population structure observed between these populations is at least partially due to introgression from P. theophrasti in North Africa. If these patterns are the result of recent introgression from a wild relative, we expect the North African population to show evidence of admixture LD. At equilibrium, a population with larger effective population size (Ne) should have a faster rate of LD decay. However, although the North African population has higher nucleotide diversity and therefore larger Ne at equilibrium (7), LD decays at a slower rate in this population (Fig. 5). LD in North Africa reaches half its maximum at ∼30 kb, while in the Middle East the half-decay distance is ∼20 kb, a pattern consistent with recent admixture of North African date palm with a distant population. These observations suggest that the mixed ancestry of North African date palm has profoundly impacted genome diversity in this population.

Private Alleles in Date Palm Populations.

Our analysis raises questions about the possibility of an unsampled population that may have been a source of ancestry in the North African population. We examined the distribution of private polymorphisms among populations and Phoenix wild relatives. When measured as a fraction of total SNPs within each group, we find that the North African population does not have a large class of private polymorphisms (5.7%) relative to Middle Eastern date palm (13.4%) (). This suggests any additional unsampled source of variation in the North African population is unlikely to be a divergent lineage.

Archaeobotanical Records Are Consistent with a Late Appearance of Date Palm in North Africa.

We assembled archaeobotanical records from the published literature to address questions about the historical distribution of date palm ( and Dataset S1). Inspection of these records indicates that date palm appears in the archaeobotanical record of North Africa later than in the Near and Middle East (Dataset S1 and ). A single pair of date stones recovered from Takarkori, Southwest Libya, might be as old as 9,000 yBP (38), but is here rejected as plausibly recent and intrusive. The earliest records on the African continent date to predynastic Egypt (ca. 5,500 yBP) and Sudan (ca. 4,000 yBP). In the Maghreb, the oldest reliable records are from Libya (ca. 3,000 yBP). West of Libya, there is one record of date palm at Volubilis, Morocco dating to the Classical Period (ca. 2,000 yBP), but otherwise no archaeobotanical records before the Islamic Period (ca. 700 yBP) when seed remains appear at the medieval sites of Igîlîz, Volubilis, and Sijilmasa, Morocco and Essouk and Gao Saney, Mali. These observations are consistent with the view that date palm may have only recently expanded to the African continent.

Discussion

We have presented evidence that the North African date palm population has a mixed ancestry that can be traced to hybridization between date palm and a wild relative P. theophrasti, the Cretan date palm. P. theophrasti is a distinct species (Figs. 1 and 3 and ref. 15) with a present-day distribution that is restricted primarily to geographically isolated locations on the island of Crete, the East Aegean Islands, and the Anatolian coast of Turkey (Fig. 1 and refs. 39–42). Evidence that the North African population has mixed ancestry between this Aegean Sea endemic and cultivated date palm suggests that P. dactylifera on the African continent has a hybrid origin and that P. theophrasti may be the unknown source of variation in North Africa identified by Gros-Balthazard et al. (6). The geographic context of hybridization between P. theophrasti and P. dactylifera is obscured by the fact that the present-day distribution of P. theophrasti does not overlap with the traditional range of date palm cultivation (Fig. 1). One model is that hybridization occurred in the Levant or elsewhere in the Eastern Mediterranean where the ranges of the two species may have once overlapped. Phoenix stones identified as P. theophrasti have been found at a site in northern Israel from ∼9,000 yBP, suggesting that this species had a wider distribution in the past and may have been exploited (38, 43). P. dactylifera was also distributed historically in the Levant including the Jordan River valley and the Dead Sea until the Middle Ages when the Judean date palm population went extinct (44). Whether the two species historically occurred sympatrically in the Eastern Mediterranean or elsewhere is unclear. The late appearance of date palm in the archaeobotanical record of North Africa suggests a model where Middle Eastern date palm expanded its range to the African continent. This model predicts a bottleneck associated with founder event colonization that was not supported by previous genome-wide analysis showing elevated genetic diversity in North Africa (6, 7). Evidence of hybridization, while appearing to explain much of the elevated diversity in this population, also does not appear to fully account for differences in nucleotide diversity between populations. These observations suggest that a simple expansion plus hybridization model may be insufficient to account for patterns in the genomic data. More complex expansion models such as those including postexpansion inbreeding (7) or additional bottlenecking in the Middle East could account for our observations. Alternatives to the expansion model include those that propose an additional source of variation in addition to P. theophrasti and Middle Eastern date palm in the North African gene pool. For example, it is possible that the archaeobotanical record provides an incomplete picture of Phoenix in North Africa and that a wild population of P. dactylifera similar to the relictual population recently discovered on the Arabian Peninsula (6) once existed on the African continent. This population either could have been domesticated independently or was not domesticated, but served as an additional source of variation for an extant cultivated population with P. theophrasti and Middle Eastern date palm ancestry. A distinct haplotype found at high frequency in the organellar genomes of the North African population () (45) provides some support for such a lineage. However, we note that the relatively small numbers of private alleles in North Africa (5.7% of SNPs in North Africa vs. 13.4% in the Middle East) suggest that if such a population exists, it is unlikely to be a divergent lineage or has not contributed significantly to the genomic constitution of North African date palms. Phoenix species are known to hybridize and produce viable offspring (16, 27, 46, 47) and either natural hybridization or horticultural practices could account for the introgression of interspecific alleles into P. dactylifera. For example, the practice of hand pollination of date palm was widely known throughout the ancient world (48) and existing practices of fertilization with interspecific pollen (49) and seedling propagation (50) suggest a possible mechanism for hybrid origins. Alternatively, putative natural hybrids between P. theophrasti and date palm have been reported (40) and are also apparent in our analysis. Although P. theophrasti samples appear as a distinct cluster in our STRUCTURE results (Fig. 2), individuals with some P. dactylifera ancestry include a sample from a reported hybrid population in Gölköy, Turkey (40); two samples from Almyros, Crete; and two samples from Epidaurus, Peloponesse. These samples may represent instances of crop-to-wild gene flow (51) or be relicts from an earlier hybridization event. Crop species frequently retain the capacity to hybridize with their wild relatives and hybrid genotypes are sources of novelty, superior quality, and adaptive traits. Evidence of introgressive hybridization in annual crops such as rice (52) and maize (53) and perennial tree crops including citrus (54), almonds (55), and apricots (56) supports a role for hybridization in the domestication process. The adaptive significance of hybridization and the traits subject to selection are often unknown except in exceptional cases (53). However, it is becoming increasingly clear that introgressive hybridization is an important source of diversity (53) that frequently accompanies expansion of domesticated species such as apples (57) and grapevine (58). Discovery that a regional population of domesticated date palm is also the product of hybridization helps resolve long-standing questions about the origins of date palm and indicates that introgression may be an important factor that dramatically alters the genome of domesticated species, thus providing novel diversity for adaptation during domestication.

Materials and Methods

A detailed description of materials and methods is provided in . Briefly, we obtained varieties of P. dactylifera and Phoenix wild relatives from various sources, including collections from wild populations, germplasm collections, research facilities, and ornamental gardens (). Whole-genome 2 100 paired-end sequencing was conducted on an Illumina HiSeq 2500 Sequencer with one-half to one lane per sample. Data were aligned to the date palm draft assembly (28) available at the National Center for Biotechnology Information (NCBI) with Burrows–Wheeler aligner (59). SNPs were called following standard protocols (60) and filtered accordingly. Phylogenomic analysis was conducted on a subset of SNPs separately for nuclear, cpDNA, and mtDNA genomes using a combination of randomized axelerated maximum likelihood (RAxML) (61) and R packages phangorn (62) and ape (63). Model-based clustering of population genomic data was conducted with STRUCTURE (29). A model of population splits and mixtures was fitted to the data using TreeMix (36). Admixture tests with and D statistics were conducted with Popstats (64). Estimates of ancestry proportions were obtained from TreeMix mixture weights and with the -ratio approach as implemented in Popstats. Population and introgression summary statistics were estimated directly from sample alignments with analysis of next generation sequencing data (ANGSD) (65) or from SNPs with vcftools (66) and a script from Martin et al. (37). Statistical analysis was conducted with the R Statistical Programing Language (67). Archaeobotanical records of date palm were gathered from published reports, based on finding monograph chapters based on previous regional reviews (e.g., refs. 68 and 69), a database search of “literature on archaeological remains of cultivated plants 1981–2004” (70), and a Google scholar search for additional recent journal articles. It includes all reports, most of which are dated based on associated artifactal and archaeological evidence (indicated as “ass.”), in some cases by associated radiocarbon dates (C14) or direct accelerator mass spectrometry (AMS) dates on crop remains. Few of these are directly dated Phoenix stones themselves, but those available are indicated as “AMS*.” Calibrated radiocarbon dates have been summed and the 1-sigma range has been taken to represent the phase of the sample, from a which a median has been calculated. In the case of associated dating evidence a standard period age range has been taken from recent archaeological literature (based on the expertise of D.Q.F.) and a median taken from this range. Where there are concerns over the true antiquity of remains, such as uncarbonized Phoenix stones of large size from Takrakori, Libya, attributed to a 8,000-yBP context (38), these are flagged in our database (Dataset S1) and excluded from the synthesis of date palm spread. Additional details are provided in . DNA sequence data new to this study have been deposited in the Sequence Read Archive (71). SNP data have been deposited in the Dryad Digital Repository (72).
  24 in total

1.  Signatures of Introgression across the Allele Frequency Spectrum.

Authors:  Simon H Martin; William Amos
Journal:  Mol Biol Evol       Date:  2021-01-23       Impact factor: 16.240

2.  SALAI-Net: species-agnostic local ancestry inference network.

Authors:  Benet Oriol Sabat; Daniel Mas Montserrat; Xavier Giro-I-Nieto; Alexander G Ioannidis
Journal:  Bioinformatics       Date:  2022-09-16       Impact factor: 6.931

3.  Extent of wild-to-crop interspecific introgression in grapevine (Vitis vinifera) as a consequence of resistance breeding and implications for the crop species definition.

Authors:  Serena Foria; Gabriele Magris; Irena Jurman; Rachel Schwope; Massimo De Candido; Elisa De Luca; Dragoslav Ivanišević; Michele Morgante; Gabriele Di Gaspero
Journal:  Hortic Res       Date:  2022-01-18       Impact factor: 7.291

4.  863 genomes reveal the origin and domestication of chicken.

Authors:  Ming-Shan Wang; Mukesh Thakur; Min-Sheng Peng; Yu Jiang; Laurent Alain François Frantz; Ming Li; Jin-Jin Zhang; Sheng Wang; Joris Peters; Newton Otieno Otecko; Chatmongkon Suwannapoom; Xing Guo; Zhu-Qing Zheng; Ali Esmailizadeh; Nalini Yasoda Hirimuthugoda; Hidayat Ashari; Sri Suladari; Moch Syamsul Arifin Zein; Szilvia Kusza; Saeed Sohrabi; Hamed Kharrati-Koopaee; Quan-Kuan Shen; Lin Zeng; Min-Min Yang; Ya-Jiang Wu; Xing-Yan Yang; Xue-Mei Lu; Xin-Zheng Jia; Qing-Hua Nie; Susan Joy Lamont; Emiliano Lasagna; Simone Ceccobelli; Humpita Gamaralalage Thilini Nisanka Gunwardana; Thilina Madusanka Senasige; Shao-Hong Feng; Jing-Fang Si; Hao Zhang; Jie-Qiong Jin; Ming-Li Li; Yan-Hu Liu; Hong-Man Chen; Cheng Ma; Shan-Shan Dai; Abul Kashem Fazlul Haque Bhuiyan; Muhammad Sajjad Khan; Gamamada Liyanage Lalanie Pradeepa Silva; Thi-Thuy Le; Okeyo Ally Mwai; Mohamed Nawaz Mohamed Ibrahim; Megan Supple; Beth Shapiro; Olivier Hanotte; Guojie Zhang; Greger Larson; Jian-Lin Han; Dong-Dong Wu; Ya-Ping Zhang
Journal:  Cell Res       Date:  2020-06-25       Impact factor: 25.617

Review 5.  Back to the Origins: Background and Perspectives of Grapevine Domestication.

Authors:  Fabrizio Grassi; Gabriella De Lorenzis
Journal:  Int J Mol Sci       Date:  2021-04-26       Impact factor: 5.923

6.  The genomes of ancient date palms germinated from 2,000 y old seeds.

Authors:  Muriel Gros-Balthazard; Jonathan M Flowers; Khaled M Hazzouri; Sylvie Ferrand; Frédérique Aberlenc; Sarah Sallon; Michael D Purugganan
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-11       Impact factor: 11.205

7.  Genome ancestry mosaics reveal multiple and cryptic contributors to cultivated banana.

Authors:  Guillaume Martin; Céline Cardi; Gautier Sarah; Sébastien Ricci; Christophe Jenny; Emmanuel Fondi; Xavier Perrier; Jean-Christophe Glaszmann; Angélique D'Hont; Nabila Yahiaoui
Journal:  Plant J       Date:  2020-02-28       Impact factor: 6.417

8.  Deletion of beta-fructofuranosidase (invertase) genes is associated with sucrose content in Date Palm fruit.

Authors:  Joel A Malek; Sweety Mathew; Lisa S Mathew; Shameem Younuskunju; Yasmin A Mohamoud; Karsten Suhre
Journal:  Plant Direct       Date:  2020-05-27

9.  Novel subpopulations in date palm (Phoenix dactylifera) identified by population-wide organellar genome sequencing.

Authors:  Yasmin A Mohamoud; Lisa S Mathew; Maria F Torres; Shameem Younuskunju; Robert Krueger; Karsten Suhre; Joel A Malek
Journal:  BMC Genomics       Date:  2019-06-17       Impact factor: 3.969

Review 10.  Apple whole genome sequences: recent advances and new prospects.

Authors:  Cameron P Peace; Luca Bianco; Michela Troggio; Eric van de Weg; Nicholas P Howard; Amandine Cornille; Charles-Eric Durel; Sean Myles; Zoë Migicovsky; Robert J Schaffer; Evelyne Costes; Gennaro Fazio; Hisayo Yamane; Steve van Nocker; Chris Gottschalk; Fabrizio Costa; David Chagné; Xinzhong Zhang; Andrea Patocchi; Susan E Gardiner; Craig Hardner; Satish Kumar; Francois Laurens; Etienne Bucher; Dorrie Main; Sook Jung; Stijn Vanderzande
Journal:  Hortic Res       Date:  2019-04-05       Impact factor: 6.793

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.