Literature DB >> 33361817

A genetic history of the pre-contact Caribbean.

Daniel M Fernandes1,2, Kendra A Sirak3,4, Harald Ringbauer3,4, Jakob Sedig3,4, Nadin Rohland3,5, Olivia Cheronet1, Matthew Mah3,4,5,6, Swapan Mallick3,4,5,6, Iñigo Olalde3,7, Brendan J Culleton8, Nicole Adamski3,6, Rebecca Bernardos3,6, Guillermo Bravo1,9, Nasreen Broomandkhoshbacht3,6,10, Kimberly Callan3,6, Francesca Candilio11, Lea Demetz1, Kellie Sara Duffett Carlson1, Laurie Eccles12, Suzanne Freilich1, Richard J George13, Ann Marie Lawson3,6, Kirsten Mandl1, Fabio Marzaioli14, Weston C McCool13, Jonas Oppenheimer3,6,15, Kadir T Özdogan1, Constanze Schattke1, Ryan Schmidt16, Kristin Stewardson3,6, Filippo Terrasi14, Fatma Zalzala3,6, Carlos Arredondo Antúnez17, Ercilio Vento Canosa18, Roger Colten19, Andrea Cucina20, Francesco Genchi21, Claudia Kraan22, Francesco La Pastina21, Michaela Lucci23, Marcio Veloz Maggiolo24, Beatriz Marcheco-Teruel25, Clenis Tavarez Maria26, Christian Martínez26, Ingeborg París27, Michael Pateman28,29, Tanya M Simms30, Carlos Garcia Sivoli27, Miguel Vilar31, Douglas J Kennett13, William F Keegan32, Alfredo Coppa33,34,35, Mark Lipson3,4, Ron Pinhasi36, David Reich37,38,39,40.   

Abstract

Humans settled the Caribbean about 6,000 years ago, and ceramic use and intensified agriculture mark a shift from the Archaic to the Ceramic Age at around 2,500 years ago1-3. Here we report genome-wide data from 174 ancient individuals from The Bahamas, Haiti and the Dominican Republic (collectively, Hispaniola), Puerto Rico, Curaçao and Venezuela, which we co-analysed with 89 previously published ancient individuals. Stone-tool-using Caribbean people, who first entered the Caribbean during the Archaic Age, derive from a deeply divergent population that is closest to Central and northern South American individuals; contrary to previous work4, we find no support for ancestry contributed by a population related to North American individuals. Archaic-related lineages were >98% replaced by a genetically homogeneous ceramic-using population related to speakers of languages in the Arawak family from northeast South America; these people moved through the Lesser Antilles and into the Greater Antilles at least 1,700 years ago, introducing ancestry that is still present. Ancient Caribbean people avoided close kin unions despite limited mate pools that reflect small effective population sizes, which we estimate to be a minimum of 500-1,500 and a maximum of 1,530-8,150 individuals on the combined islands of Puerto Rico and Hispaniola in the dozens of generations before the individuals who we analysed lived. Census sizes are unlikely to be more than tenfold larger than effective population sizes, so previous pan-Caribbean estimates of hundreds of thousands of people are too large5,6. Confirming a small and interconnected Ceramic Age population7, we detect 19 pairs of cross-island cousins, close relatives buried around 75 km apart in Hispaniola and low genetic differentiation across islands. Genetic continuity across transitions in pottery styles reveals that cultural changes during the Ceramic Age were not driven by migration of genetically differentiated groups from the mainland, but instead reflected interactions within an interconnected Caribbean world1,8.

Entities:  

Mesh:

Year:  2020        PMID: 33361817      PMCID: PMC7864882          DOI: 10.1038/s41586-020-03053-2

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


Prior to European colonization, the Caribbean was a mosaic of archaeologically-distinct communities connected by networks of interaction since the first human occupations in Cuba, Hispaniola, and Puerto Rico around 6,000 years ago[3,7]. The pre-contact Caribbean is divided into three archaeological Ages that denote shifts in material cultural complexes[1,9]. The Lithic and Archaic Ages are defined by distinct stone-tool technologies[10-11], while the Ceramic Age, beginning ~2,500–2,300 years ago, featured an agricultural economy and intensive pottery production. Technological and stylistic changes in material culture across these Ages reflect local developments by connected Caribbean people and also migration from the American continents, although the geographic origins, trajectories, and numbers of migratory waves remain under debate[1,3,12] (Table 1; Supplementary Information section 1).
Table 1.

Archaeological debates addressed by our analyses.

Genetic data provide new insight into open debates inspired by archaeological research.

DebatesGenetic inferences
Archaic Age migration(s)Archaic-associated individuals have ancestry more closely related to published Central and South Americans than to North Americans. Archaic-related ancestry was >98% replaced by Ceramic-related ancestry in most of the Greater Antilles but persisted with minimal admixture in Cuba for over 2,500 years. All Archaic-associated individuals are consistent with deriving from a single source, contrary to a claim of additional migration with affinity to North Americans.
Ceramic Age migration(s)The great majority of Ceramic-associated individuals are genetically homogeneous with a connection to northeastern South America, now the homeland of Arawak-speakers. A south-to-north migratory movement of genetically-homogenous people is most parsimonious, although we cannot rule out multiple migrations by genetically similar groups.
Stylistic transitions and migrationsGenetic homogeneity across changes in ceramic styles provides evidence against a scenario of multiple waves of migration of genetically differentiated people from South America. We document over a millennium of genetic continuity in a small region of the southeast coast of Hispaniola.
Archaic/Ceramic interactionsArchaic- and Ceramic-associated admixture was extremely rare; we identify it in 3 of 201 ceramic-using Caribbean individuals. Unadmixed Archaic-related ancestry persisted as late as 700 BP in Cuba, but was replaced by Ceramic-related ancestry in Hispaniola beginning at least a millennium before.
Demographic historyEffective population sizes (Ne) for Ceramic-associated sites were larger (~500–1500) than for Archaic-associated sites (~200–300) and are estimated at ~1500–8000 across islands. A small pan-Caribbean gene pool and interconnected population is also evidenced by 19 cross-island relative pairs and very low genetic differentiation across the Ceramic Age Caribbean. As census size is unlikely to be >10x larger than Ne, population estimates in the hundreds of thousands are likely too large. Ancient Caribbean people avoided unions of first cousins or closer.
Persistence of ancestry todayWe identify up to ~14% Ceramic-related ancestry in present-day Puerto Ricans and Cubans and identify a new mtDNA haplogroup unique to the Caribbean present in pre-contact times as well as today.
We screened 195 individuals and generated genome-wide data passing authenticity criteria for 174 individuals (Supplementary Data 1, 2) who lived ~3100–400 calibrated years before present (calBP; based on 45 new radiocarbon dates, Extended Data Fig. 1a; Supplementary Data 3; Supplementary Information section 3) in The Bahamas, Hispaniola (Haiti and the Dominican Republic), Puerto Rico, Curaçao, and Venezuela (Fig. 1a; Supplementary Information section 2). These individuals had a median of 700,689 SNPs covered (range: 20,063–977,658 SNPs, median of 2.2× coverage of targeted positions (range: 0.02–9.95×), Supplementary Data 1). We co-analyzed the new data alongside 89 previously-published individuals[4] (Supplementary Information section 4). In what follows, we denote sites with stone tools or radiocarbon dates predating intensive ceramic use as ‘Archaic’ and sites with a preponderance of ceramics as ‘Ceramic’; we use ‘-related’ to refer to ancestry and ‘-associated’ for archaeological affiliation.
Extended Data Fig. 1:

Temporal distribution of newly-reported individuals and overview of population structure.

(a) Numbers represent individuals from each site; thick lines denote direct 14C dates (95.4% calibrated confidence intervals); thin lines denote archaeological context dating; grey area identifies the first arrivals of ceramic-users in the Caribbean. Colors and labels are consistent with Fig. 1. (b) PCA plot with ancient individuals shown as solid squares or circles (Archaic- or Ceramic-associated individuals, respectively). Newly-reported individuals are outlined in black, genetic outliers are outlined in red, and individuals with <30,000 SNPs are outlined in blue. Individuals are separated by sub-clades, and three individuals from the site of Cueva Roja (Dominican Republic) who were excluded from clading analysis are labeled “Dominican Cueva Roja Archaic” and colored magenta. Individual PDI009, assessed elsewhere as an outlier[11], is denoted with an asterisk. Three previously-published ancient Caribbean individuals[9,10] are shown as inverted triangles outlined in gray and colored for the sub-clade that encompasses the geographic region with which they are associated. This plot focuses on ancient individuals and does not show some present-day populations; a full plot is provided as Fig. S17. (c) ADMIXTURE analysis best supports K=6 ancestral elements. Newly-reported and co-analyzed individuals are clustered by sub-clade; all newly-reported individuals are identified by a black bar to the side of the plot. The same three previously-published individuals[9,10] shown in Extended Data Fig. 1b are included, and three modern-day populations are shown for reference (Suruí, Cabécar, Piapoco).

Fig. 1:

Geography and significant genetic structure.

(a) Newly-reported data shown as large bordered shapes; co-analyzed data[4] shown as small non-bordered shapes. Asterisk (*) denotes Archaic-associated site of Cueva Roja (excluded due to low-coverage); hash (#) denotes sites with admixed individuals. Andrés is represented as SECoastDR_Ceramic and Dominican_Archaic. Numbers of individuals and temporal distribution in Extended Data Fig. 1a. Map generated with the R package “maps” (R Core Team (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/). (b) Relationships reconstructed from allele sharing (Supplementary Information section 8). Solid lines connect sub-groupings comprising a larger group; dashed lines represent admixture. Colored boxes represent final sub-clades with the color scheme matching Fig. 1a.

Ethics

We acknowledge the ancient individuals whose skeletal remains we analyzed, present-day people who have an Indigenous legacy, and Caribbean-based scholars who were centrally involved in this work. Permission to perform ancient DNA analysis was documented through authorization letters signed by a custodian who represented the remains from each site. Results were discussed prior to submission with members of Indigenous communities who trace their legacy to the pre-contact Caribbean and their feedback was incorporated. Genetic data are a form of knowledge that contributes to understanding the past; they co-exist with oral traditions and other Indigenous knowledge. Genetic ancestry should not be conflated with perceptions of identity, which cannot be defined by genetics alone. A full ethics statement is in the Supplementary Information.

Genetic structure of the pre-contact Caribbean

We performed principal component analysis (PCA), projecting ancient individuals onto axes computed using present-day Indigenous American groups[13] (Extended Data Fig. 1b; Supplementary Data 4). Ceramic- and Archaic-associated individuals project in separate clusters, while ancient Venezuelans relate to present-day Chibchan-speakers (like Cabécar) in PCA and ADMIXTURE analysis (Extended Data Figs. 1b, 1c; Supplementary Information sections 5, 6; population self-denominations in Supplementary Data 5). Individuals from Curaçao and Haiti (who are admixed, discussed below) mostly overlap the Ceramic-associated cluster. An exception to within-site genetic homogeneity is at Andrés (a primarily Ceramic-associated site, Dominican Republic), where individual I10126 is dated to the Archaic Age (~3140–2950 calBP, Supplementary Data 3) and appears genetically similar to other Archaic-associated individuals (Extended Data Figs. 1b, 1c). We exclude from subsequent analyses three Archaic-associated individuals from Cueva Roja (~1900 calBP, Dominican Republic) with low coverage (<~0.05×) who are qualitatively similar to other Archaic-associated individuals, and one individual from three pairs of first-degree relatives (Supplementary Data 1). To study genetic structure independent of archaeologically-based assignments (Supplementary Information section 2), we grouped individuals with increasing resolution based on allele sharing, starting with major ‘clades’ and then ‘sub-clades’ (Supplementary Information section 8). Our nomenclature combined the geographic location encompassing sites in the cluster plus ‘Archaic’ or ‘Ceramic’ (Fig. 1b). We identified three significantly differentiated major clades. GreaterAntilles_Archaic included 50 individuals from Cuba spanning ~3200–700 calBP[4] and individual I10126 from Andrés (Dominican Republic). Caribbean_Ceramic comprised 194 individuals from Ceramic-associated sites dating ~1700–400 calBP. Venezuela_Ceramic comprised eight individuals dated ~2350 calBP. Two Haiti_Ceramic and five Curacao_Ceramic individuals fit as mixtures of major clades (below). We next identified sub-clades and substructure within them (Supplementary Data 6; Table S6). Within Caribbean_Ceramic, SECoastDR_Ceramic comprised four sites along 50 kilometers of the southeast coast of the Dominican Republic (from west to east: La Caleta, Andrés, Juan Dolio, and El Soco) (Table S7). These sites were occupied for ~1,400 years, documenting genetic continuity across changes in ceramic styles. All Ceramic-associated sites from The Bahamas and Cuba (spanning ~700 years) grouped as BahamasCuba_Ceramic, and further substructure was present in each of five Bahamian islands and two Cuban sites. The two sites in the Lesser Antilles grouped as LesserAntilles_Ceramic, and the remaining sites from Caribbean_Ceramic grouped as EasternGreaterAntilles_Ceramic, showing no cross-site substructure. Pairwise FST<~0.01 indicates a striking degree of homogeneity among these Caribbean_Ceramic sub-clades (compared to FST ~0.1 between Ceramic- and Archaic-related clades), reflecting high migration rates among islands (discussed below; Extended Data Fig. 2).
Extended Data Fig. 2|

FST distances.

Average pairwise FST distances and standard errors (x100) between (a) clades and (b) sites with more than two unrelated individuals, demonstrating both overall high levels of genetic similarity between the Caribbean_Ceramic sub-clades and the sites composing them, as well as the magnitude of genetic differentiation between those and the groups with Archaic- and Venezuela-related ancestries.

To identify Caribbean_Ceramic individuals who had an excess of Archaic-related ancestry relative to others within each sub-clade, we used f-statistics (Supplementary Information section 8; Supplementary Data 8). Individual I16539 from La Caleta (Dominican Republic) and the two individuals comprising Haiti_Ceramic showed significant evidence of Ceramic-/Archaic-related admixture (Z=−5.5; Table S8). In contrast to a previous claim[11], we did not detect significant Archaic-related admixture in individual PDI009 from Paso del Indio (Puerto Rico) (Z=0.6; Supplementary Information section 4; Table S3).

Archaic-associated Caribbean people

The GreaterAntilles_Archaic clade shares the most genetic drift with Indigenous groups from Central and northern South America belonging to seven language families: Arawakan, Cariban, Chibchan, Chocoan, Guajiboan, Mataco-Guaicuru, and Tupian[14,15] (Fig. 2a; Supplementary Data 10; Supplementary Information section 11). There is no evidence of excess allele sharing with people from one language family relative to the others or evidence of genetic drift specifically shared with present-day populations from Mesoamerica or North America (Fig. 2a, 2b; Supplementary Data 11). Archaic-associated individuals from Cuba share more alleles with each other than with Dominican individual I10126 (Table S6), demonstrating Archaic substructure; we separate individual I10126 as Dominican_Andres_Archaic for some analyses.
Fig. 2:

Genetic affinities of ancient Caribbean people.

(a) Outgroup f-statistics measuring the relatedness of the clades GreaterAntilles_Archaic, Caribbean_Ceramic, and Venezuela_Ceramic to present-day populations (squares). Map generated with the R package “maps” (R Core Team (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.R-project.org/). (b) We computed f(Mbuti, Test; LanguageGroup1Pop, LanguageGroup2Pop) evaluating if each Test sub-clade is more closely related to populations belonging to one language family or another. Points represent the average Z-scores among all populations from each pair of language groups tested; horizontal lines show the range across such comparisons. Vertical lines represent a significance threshold corresponding to a 99.5% CI. (c) Admixture graph modelling of representative ancient Caribbean groupings and select non-Caribbean populations. We fit 12 groups, including the clades LesserAntilles_Ceramic and GreaterAntilles_Archaic, without mixture; the other three Caribbean_Ceramic sub-clades and the clade Venezuela_Ceramic fit as mixtures. The worst Z-score comparing observed to expected f-statistics is |3.6|, which is not significant after correcting for multiple hypothesis testing.

We could not replicate a previous claim that a migration by people with affinity to North Americans also contributed ancestry to some Archaic Age Caribbean individuals[4] (Supplementary Information section 17). This claim was based on a finding of affinity between Early Period individuals from California’s Channel Islands (USA_CA_Early_SanNicolas) and individual CIP009 from Cueva del Perico (Cuba) relative to individual GUY002 from Guayabo Blanco (Cuba). First, in the symmetry test f4(GUY002, CIP009; USA_CA_Early_SanNicolas, Bahamas_Taino), the deviation is non-significant (Z=−0.9; Table S25). Second, a key statistic underlying this claim was that a qpWave-based symmetry test involving CIP009 and GUY (three individuals from Guayabo Blanco) yielded p=0.013; however, this is not significant after correcting for the number of sample pairs tested. Third, we computed f(Outgroup, CIP009; USA_CA_Early_SanNicolas, Bahamas_Taino), whose negative value was interpreted as evidence for affinity between CIP009 and USA_CA_Early_SanNicolas; while we replicated the non-significant statistic (Z=−1.3; Table S23), it became positive when we replaced the Mbuti outgroup with diverse Eurasians or Bahamas_Taino[16] with ancient Bahamian shotgun data newly generated for this study, which should give qualitatively similar results (Tables S24 and S26). Fourth, the (non-significant) Z-scores for attraction to CIP009 were as strong when South American ancient genomes were placed in the position of USA_CA_Early_SanNicolas, showing no evidence of a North American-specific relationship (Table S27). Fifth, CIP009 fits best in a simplified version of our qpGraph tree on the same node as other Archaic-associated individuals (Supplementary Information section 17; Fig. S34). Thus, to the limits of the resolution of allele sharing methods, all Archaic-associated Caribbean ancestry is consistent with deriving from a single source. In qpGraph, we fit GreaterAntilles_Archaic in an early splitting branch containing most ancient Caribbean, Belizean, Brazilian, and Argentinian populations (Fig. 2c). In a maximum likelihood tree allowing admixture events[17], GreaterAntilles_Archaic also fits as a divergent Native American group (Extended Data Fig. 3). We could not obtain further evidence of specific affinities to mainland groups using qpAdm (Supplementary Information section 9; Table S16) or f-statistics (Table S17).
Extended Data Fig. 3:

Maximum likelihood population tree from allele frequencies using Treemix.

The Caribbean_Ceramic sub-clades are shown on the same branch as modern Arawak-speaking groups (Palikur, Jamamadi). Orange arrows represent admixture events, although observations from other analyses (e.g., qpAdm admixture modeling) suggest that the indicated direction of admixture may be inaccurate (e.g., we believe it is more likely that there is GreaterAntilles_Archaic admixture into Haiti_Ceramic than the reverse scenario; Supplementary Information section 9).

The arrival of ceramic users displaced Archaic-related ancestry in much of the Caribbean. An exception is western Cuba, where Archaic lineages persisted with minimal mixture for >2,500 years, resonating with archaeological[18] and historical[19] accounts that this region was home to people with a distinct language and cultural traditions as late as the Contact Period.

The spread of ceramic users

Previous analyses have found that Caribbean Ceramic-associated people have genetic affinities to Arawak-speakers in northeastern South America[16,20,21] (Supplementary Information section 1). Although we are not able to support this conclusion with symmetry f4-statistics which show no significant evidence of closer relatedness to Arawak- than to Cariban- or Tupian-speaking populations (Fig. 2b; Supplementary Data 11; Supplementary Information section 11), ADMIXTURE suggests an Arawak affinity, as individuals from each Caribbean_Ceramic sub-clade are almost entirely composed of a component found in the highest proportion in modern Arawak speakers (e.g., Piapoco in Extended Data Fig. 1c). We also find support for an Arawak connection in a maximum likelihood tree allowing admixture events, which places all Caribbean_Ceramic sub-clades on the same branch as Arawak-speaking Piapoco and Palikur (Extended Data Fig. 3). Further evidence comes from a successful fit with Piapoco as the single source for Caribbean_Ceramic in qpAdm (Tables S18, S19), and qpGraph (Fig. 2c). We estimate ~0.5–2.0% Archaic-related ancestry in the Ceramic-associated people of the Greater Antilles and The Bahamas when modeled in qpAdm as a mixture of LesserAntilles_Ceramic and Dominican_Andres_Archaic (Table S21). We reject reverse models of LesserAntilles_Ceramic deriving from Greater Antilles or Bahamas/Cuba-based sub-clades which fail when Archaic-associated people are included in the reference set (p=0.001–0.008, Table S21). This supports a scenario of south-to-north movement of ceramic using ancestors into the Caribbean, whereby ancestry like that in the 1000–650 BP ancient Lesser Antilles individuals (plausibly descended from the first ceramic users of the Lesser Antilles) spread into the Greater Antilles and The Bahamas, displacing the people that lived there with no more than ~2.0% mixture with resident groups. We found only three individuals from two Ceramic-associated sites in Hispaniola with significant Archaic-related admixture, who we estimate using qpAdm to have Archaic-related ancestry in proportions ranging between 11.8±1.9% (I16539 from La Caleta, Dominican Republic; Table S9) and 18.5±2.1% (two individuals from Diale 1, Haiti; Tables S12, S13). Using DATES[22], we estimate that admixture occurred ~16±3 generations (~350–500 years) before these individuals from Haiti lived (Supplementary Information section 14). Venezuela_Ceramic’s affinities with Chibchan speakers in ADMIXTURE and f-statistics (Fig. 2a, 2b; Extended Data Fig. 1c) are confirmed in qpAdm where Venezuela_Ceramic fits as a clade with Cabécar (Tables S18, S19). Thus, although Las Locas is located in a hypothesized source region for the Ceramic expansion and the individuals date to near the beginning of the Ceramic Age, our analysis increases the weight of evidence that this expansion had more easterly origins. We model ceramic users from Curaçao as 74.5±3.7% LesserAntilles_Ceramic-related ancestry and 25.5±3.7% Venezuela_Ceramic-related ancestry (Table S15), suggesting that Curaçao’s Ceramic Age population was derived from the admixture of two groups: one related to the population that also spread to the Antillean Caribbean at the onset of the Ceramic Age, and the other associated with the Dabajuroid ceramic styles linking sites like Las Locas to Curaçao. Although a study of cranial morphology suggested a possible Carib migration from western Venezuela ~1,150 years ago[23], we find no evidence of a new ancestry, as might be expected for such an event. In simulations using Venezuela_Ceramic, LesserAntilles_Ceramic, or present-day Cariban-speaking Arara as proxies for Caribs, we can detect as little as ~2–8% ancestry from such groups (Supplementary Information section 13). The genetic data shows no evidence for a separate migration, although we cannot rule out migration from an unsampled continental group genetically more similar to Caribbean ceramic people than the proxies we used for simulation, or who contributed less than 2% of their ancestry.

Social structure and population size estimates

We screened 202 individuals from our co-analysis dataset with >400,000 SNPs covered for runs of homozygosity (ROH) >4 centimorgan (cM)[24] (Supplementary Data 12; Supplementary Information section 7; Fig. S21). Large sums of long ROH (>20cM) indicate parental relatedness within the last few generations, whereas an abundance of shorter ROH signals background parental relatedness and restricted mating pools[25]. Only two out of 202 individuals had more than 100cM of their genome in ROH>20cM blocks (~135cM is the average in offspring of first cousins), indicating that close kin unions were rare. In contrast, 48 individuals had at least one ROH>20cM, indicating that many unions took place between individuals as close as second or third cousins, suggesting limited local population sizes. As further evidence of low population sizes, we detected abundant short and mid-size ROH across the Caribbean. We estimated effective population size (Ne) using the length distribution of all ROH 4–20cM, which arise from co-ancestry mostly within the last ~50 generations (Figs. 3a, 3b). Ne estimates can be used to infer census population size, which in humans is typically three- and up ten-fold greater[26,27]. Ne for Ceramic-associated Caribbean sites are larger (Ne ~500–1500, similar to previous estimates[16,20]) than for Archaic-associated sites (Ne~200–300) (Extended Data Fig. 4a; Extended Data Table 1), pointing to increased population density with the intensification of agriculture. This is also reflected in higher heterozygosity in Ceramic- than Archaic-associated groups (Extended Data Fig. 5).
Fig. 3:

Estimates of effective population size from shared haplotypes.

Details in Supplementary Information section 7. (a) Number of generations since two chromosomes with a shared segment of a specific size shared a common ancestor, assuming a constant population size N=1000. (b) Average rate of ROH segments in different length bins after excluding highly consanguineous individuals (defined as having a sum of ROH>20 >50cM). (c) Rates of IBD segments shared on the X chromosome between pairs of males within length bins after excluding closely related individuals (defined as sum of IBD X>20 >25cM). For the Ne estimates quoted in the paper we use the pool of 12–20cM segments; for comparisons between the two major clades SECoastDR_Ceramic and EasternGreaterAntilles_Ceramic this gives Ne=3082 (95% CI 1530–8150). In (b) and (c) confidence intervals correspond to one standard deviation (68% coverage) assuming a Poisson distribution in each bin (vertical bars). Point estimates (circles) placed at the center of each 2cM bin, with jitter added for visual separation. Gray lines depict expectations for panmictic populations of various sizes.

Extended Data Fig. 4:

Estimated effective population sizes.

(a) Estimates per site are based on ROH blocks 4–20 cM long using a likelihood model (Supplementary Information section 7). Colors as per sub-clades, numbers denote the count of analyzed individuals. Highly consanguineous individuals with a sum of ROH>20 above 50 cM were excluded. (b) Same as (a) but for IBD segments 8–20cM long shared on the X chromosome between all pairs of males. Closely related pairs of individuals with a sum of IBD X>20 above 25 cM were excluded. Numbers denote counts of all remaining pairs. In (a) and (b) points represent maximum likelihood estimate and vertical bars represent 95% CI.

Extended Data Table 1:

Ne estimates for each site.

Table includes all individuals where ROH analysis is possible and excludes individuals with more than 50cM sum of 20cM long ROH.

NeEstimateNeSTDCl(low)Cl(high)nLocalityCountryClade
503933216843Abaco IslandBahamasBahamasCuba_Ceramic
562943777474South Andros IslandBahamasBahamasCuba_Ceramic
6101513149062Crooked IslandBahamasBahamasCuba_Ceramic
87318151912284Eleuthera IslandBahamasBahamasCuba_Ceramic
79314051810685Cueva de los EsqueletosCubaBahamasCuba_Ceramic
6753460874253La CaletaDominican RepublicSECoastDR_Ceramic
83717050411704AndresDominican RepublicSECoastDR_Ceramic
141628086719667Juan DolioDominican RepublicSECoastDR_Ceramic
962126715120811El SocoDominican RepublicSECoastDR_Ceramic
83983677100217AtajadizoDominican RepublicEasternGreaterAntilles_Ceramic
105027451215883La UnionDominican RepublicEasternGreaterAntilles_Ceramic
6121513159092El FrancesDominican RepublicEasternGreaterAntilles_Ceramic
105133639117102MacaoDominican RepublicEasternGreaterAntilles_Ceramic
104927451215873Cueva JuanaDominican RepublicEasternGreaterAntilles_Ceramic
104927451215873Santa ElenaPuerto RicoEasternGreaterAntilles_Ceramic
74420234811412Canas/Collores/MonserratePuerto RicoEasternGreaterAntilles_Ceramic
123830364318324Paso del IndoPuerto RicoEasternGreaterAntilles_Ceramic
95329138215242Diale 1HaitiHaiti_Ceramic
4691032676702de SavaanCuracaoCuracao_Ceramic
127522483617158LavoutteSt. LuciaLesserAntilles_Ceramic
2731524430220Canimar AbajoCubaCuba_Archaic
216271622703Playa del MangoCubaCuba_Archaic
268461783572Guayabo BlancoCubaCuba_Archaic
432912546102Cueva CaleroCubaCuba_Archaic
Extended Data Fig. 5:

Conditional heterozygosity by clade.

Conditional heterozygosity in the ancient Caribbean was similar to that of contemporaneous groups from Peru[70], except for the Archaic-associated groups and Venezuela_Ceramic. First- and second-degree relatives were excluded from the analysis, including the pair of related individuals representing Haiti_Ceramic. Colored circles represent point estimates (color scheme matching Fig. 1); bars represent three standard errors.

Ne estimates from the ROH signal represent lower bounds on pan-Caribbean effective population size as they could reflect restricted gene pools for people living just at those sites, rather than interconnected gene pools. We therefore also analyzed long shared segments (IBD blocks) between the X chromosomes of pairs of males (Supplementary Information section 7). Focusing on shared segments of long IBD 12–20cM, which reflect the size of the shared ancestor pool from within the last ~20 generations (Fig. 3a), we find that the rate of such segments decreases with geographic distance (Fig. 3c), as expected if people exchange more genes with people living closer to them. However, we still detect 19 pairs of individuals who share segments of at least 8.7cM across islands (Extended Data Table 2), revealing that people across the Caribbean shared common ancestors in the hundreds of years prior to the time they lived (as expected given a small pan-Caribbean population size). A comparison between the two major clades in Hispaniola and Puerto Rico gives an estimate of Ne=3082 (1530–8150, 95% CI; estimates in Fig. 3 legend). This provides an upper bound for the recent effective size of the joint population living in Hispaniola and Puerto Rico, as limited migration reduces the rate of distant cousins and IBD sharing across sites. Multiplying Ne estimates by three- to ten-fold to obtain census size, we infer that pre-contact population size estimates of hundreds of thousands or even millions for large islands such as Hispaniola[5] (based on outdated reports or poorly-documented population counts[6]) are too large.
Extended Data Table 2:

Subset of cross-site relatives from different islands, identified through IBD analysis.

We measured the X chromosome length and IBD map lengths as ⅔ of the map length of female X. Complete table including cross-site distant relatives within islands in Supplementary Data 13.

ID1ID2EvidenceSite 1Site 2
113320115973X chromosome IBD segment of 10.0 cMBahamas, Abaco IslandDominican Republic, La Caleta
113318PDI010X chromosome IBD segment of 14.0 cMBahamas, Crooked IslandPuerto Rico, Vega Baja, Paso delIndio
113321112344X chromosome IBD segment of 12.7 cMBahamas, Eleuthera IslandDominican Republic, El Soco
113321113196X chromosome IBD segment of 10.7 cMBahamas, Eleuthera IslandDominican Republic, Juan Dolio
113321113326X chromosome IBD segment of 12.0 cMBahamas, Eleuthera IslandPuerto Rico, Monserrate
113737CDE001X chromosome IBD segment of 10.7 cMBahamas, Long Island, Clarence Town, Rolling Heads SiteCuba, Camaguey, Sierra de Cubitas, Cueva de los Esqueletos 1
114880112344X chromosome IBD segment of 8.7 cMBahamas, South Andros, SanctuaryBlue HoleDominican Republic, El Soco
114879115963X chromosome IBD segment of 10.0 cMBahamas, South Andros, SanctuaryBlue HoleDominican Republic, La Caleta
I8549114879X chromosome IBD segment of 10.0 cMDominican Republic, AndresBahamas, South Andros, SanctuaryBlue Hole
117903114875X chromosome IBD segment of 14.7 cMDominican Republic, AtajadizoBahamas, Abaco, Bill Johnson’s Cave, Lubber’s Quarters
113441114880X chromosome IBD segment of 10.7 cMPuerto Rico, Cabo Rojo 11Bahamas, South Andros, SanctuaryBlue Hole
113441113189X chromosome IBD segment of 10.0 cMPuerto Rico, Cabo Rojo 11Dominican Republic, El Soco
113441115676X chromosome IBD segment of 10.0 cMPuerto Rico, Cabo Rojo 11Dominican Republic, La Caleta
113441114992X chromosome IBD segment of 9.3 cMPuerto Rico, Cabo Rojo 11Dominican Republic, Los Muertos
113326112344X chromosome IBD segment of 11.3 cMPuerto Rico, MonserrateDominican Republic, El Soco
PDI012013115963X chromosome IBD segment of 9.3 cMPuerto Rico, Vega Baja, Paso delIndioDominican Republic, La Caleta
113318114880X chromosome IBD segment of 22.7 cMBahamas, Crooked IslandBahamas, South Andros, SanctuaryBlue Hole
113318114879X chromosome IBD segment of 10.0 cMBahamas, Crooked IslandBahamas, South Andros, SanctuaryBlue Hole
113321113320X chromosome IBD segment of 12.0 cMBahamas, Eleuthera IslandBahamas, Abaco
We also identified 57 pairs of closely related individuals (up to third- to fourth-degree relatives; Extended Data Fig. 6; Supplementary Information section 7). Most were within La Caleta (Dominican Republic), where 37 out of 63 individuals studied had one or several close relatives, although the rate was not significantly greater than within other sites (95% CI 1.5%−2.8% for La Caleta versus 1.4%−4.6% for other sites). As further evidence of an interconnected population, we identified male relatives buried ~75 kilometers apart in the southern Dominican Republic: a father/son pair from Atajadizo and their second and third-degree relative from La Caleta.
Extended Data Fig. 6:

Pairwise kinship estimates for all individuals from sites where close relatives were identified using autosomal data.

Dotted lines identify family clusters and inter-site relationships; bottom rows correspond to relationships per individual.

Pre-contact ancestry persists in the present-day Caribbean

We tested for genetic affinity between the Indigenous ancestry found in present-day[21] and ancient Caribbean people by computing f(European, Test; Cuba_Archaic, Caribbean_Ceramic). We obtained a signal for relatedness between Puerto Ricans and Ceramic-associated individuals (|Z|= 3.4 and 4.6 for two datasets) (Supplementary Data 14). Our results are consistent with entirely Ceramic-related but not entirely Archaic-related ancestry (Supplementary Information section 14). We carried out the same test separately for 15 provinces of Cuba[28] and found two provinces and eight municipalities with weakly significant evidence of Ceramic-related ancestry (2.0<|Z|<3.4) and only a single municipality (Guines, western Cuba) with marginally significant evidence of Archaic-related ancestry (Z=2.0) (Supplementary Data 14). Thus while the available ancient data show the perpetuation of unadmixed Archaic-related ancestry in parts of Cuba into the last millennium, it was substantially replaced by Ceramic-related ancestry prior to the present day. Previous reports have also found pre-contact Indigenous ancestry in present-day Caribbean people in uniparental haplogroups[29-32]. We add to this by identifying a previously undocumented deep branch of mitochondrial DNA (mtDNA) haplogroup C1d at a frequency of ~7% across Caribbean_Ceramic sub-clades as well as in a modern Puerto Rican individual from the 1000 Genomes Project dataset[33] (Supplementary Data 9; Supplementary Information section 10). This provides direct evidence that Indigenous matrilineal ancestry persisted in the Caribbean since pre-contact times and cannot be explained by colonial-era movements from the American continents.

Discussion

This study addresses multiple debates about the people of the pre-contact Caribbean (Table 1). First, the ancestry present in the Greater Antilles during the Archaic Age was consistent with deriving from a single source, with only subtle differences among Archaic-associated individuals spanning ~2,500 years. We cannot distinguish between a Central or South American origin for the source population of Archaic-associated people, but find a North American origin to be unlikely (though we note that there is a paucity of comparative genetic data from North America). Second, our data are consistent with a migratory movement accompanying the introduction and spread of intensive ceramic use in the Caribbean[34]. Ceramic-associated individuals show an affinity to present-day Arawak speakers, consistent with archaeological and linguistic evidence of northeastern South American origin[35]. In line with hypotheses that Arawak-speaking populations split as they migrated northeast from Amazonian South America, with some groups moving further along the Orinoco and into the Antilles and others toward the western Venezuela coast[29], Curaçao individuals have ancestry related to that in LesserAntilles_Ceramic. While the earliest ceramic sites in the Caribbean are in Puerto Rico and the northern Lesser Antilles, and there is no archaeological evidence that the Windward Islands of the Lesser Antilles were settled until ~1,800 years ago, the sharing of some ancestry between individuals from Curaçao and those from the Lesser Antilles but not the Greater Antilles supports a south-to-north stepping stone trajectory into the Caribbean[4]. Third, we find no association between our Caribbean_Ceramic sub-clades and the traditional Caribbean ceramic typologies (Saladoid, Ostionoid, Meillacoid, Chicoid), providing no support for a culture-history model that views stylistic transitions as the result of major movements of new people. Instead, the ancestry profile in regions such as the southeastern coast of the Dominican Republic spans more than a millennium across stylistic transitions in material culture. While we cannot rule out that migrations of populations from the Americas genetically similar to Caribbean people drove some of the cultural changes, our findings increase the weight of evidence that connectivity among ceramic using groups within the Caribbean catalyzed stylistic transitions. Fourth, we provide the first evidence of admixture between Archaic-/Ceramic-related ancestry in three individuals in Hispaniola. This finding also confirms a previous inference[4] that admixture between people of Archaic- and Ceramic-associated ancestry in the Caribbean was extremely rare (seen here in only three out of 201 ceramic-using Caribbean individuals). Fifth, we confirm that people living in some parts of the Caribbean (especially Puerto Rico and Cuba) today carry proportions of pre-contact Indigenous ancestry. In Cuba, Archaic-related ancestry persisted nearly until the Contact Period; however, the Indigenous ancestry in Cuba today is mostly not derived from this source. This could reflect post-colonial movement of Indigenous people, although at least some of it likely reflects pre-contact events as Ceramic-related ancestry was present in individuals from western and central Cuba dated to ~500 calBP. Sixth, our data provide insights into social structure and demography. Analyzing ROH, we document an avoidance of unions between close relatives during both the Archaic and Ceramic Ages and detect large proportions of cumulative ROH across most of the Caribbean, reflecting a small population size[36]. We identify male relatives buried ~75 kilometers apart, suggesting networks of connectivity between archaeological sites analyzed today as separate entities. As further evidence of connectivity, we observe shared haplotypes across islands (19 distant cousin pairs) at a rate expected for an effective population size of Ne=3082 (95% CI 1530–8150) across the large islands of Hispaniola and Puerto Rico. Although these estimates represent the last ~20 generations since the analyzed individuals lived, they point to a census size across these large islands being substantially less than estimates of hundreds of thousands to millions at contact suggested in some literature[1,37]. While our population size estimates are lower than those from historical reports and population counts[5,6], the devastating impact that European colonization, expropriation, and systematic killing of Indigenous people had on Caribbean populations is indisputable. The ancestry and legacy of pre-contact Caribbean people persists today, and the study of ancient DNA helps us to better appreciate this. Present-day Caribbean people harbor mixtures of genetic ancestry in different proportions, primarily comprising pre-contact Indigenous populations (~4% on average in Cuba, ~6% in the Dominican Republic, and ~14% in Puerto Rico according to our estimation by qpAdm), immigrant Europeans (~70% in Cuba, ~56% in the Dominican Republic, and ~68% in Puerto Rico), and Africans who were brought to this region during the course of the trans-Atlantic slave trade (~26% in Cuba, ~38% in the Dominican Republic, and ~18% in Puerto Rico) (Extended Data Table 3). All three groups contributed in central ways to the present-day people of the Caribbean and continue to shape the legacy of the interconnected Caribbean world.
Extended Data Table 3:

Ancestry proportion estimates with qpAdm in present-day Caribbean individuals from Cuba (and its provinces), Dominican Republic, and Puerto Rico[21,28].

Top half, proportions across countries.

CountryCaribbean_Ceramic1000 Genomes CEU1000 Genomes YRI

ProportionSEProportionSEProportionSE
Cuba(SGDP)0.0290.0020.7220.0040.2490.002
Cuba(1000G1)0.0420.0020.7030.0020.2550.001
Dominican Republic (SGDP)0.0580.0030.5580.0060.3840.004
Dominican Republic (1000G1)0.0620.0020.5580.0040.3790.003
Puerto Rico (SGDP)0.1320.0040.6860.0060.1820.003
Puerto Rico (1000G1)0.1400.0030.6760.0030.1840.002

CEU = European source; YRI = African source; CHB = East Asian source; SGDP = Simons Genome Diversity Project outgroup populations Karitiana, Mixe, Yakut, Ulchi, Papuan, Mursi, and Mbuti; 1000G1 = 1000 Genomes outgroup populations PEL, PJL, JPT, and MSL. Bottom half, proportions across different Cuban provinces. 1000G2 = 1000 Genomes outgroup populations PEL, PJL, JPT, MSL and GIH.

METHODS

No statistical methods were used to predetermine sample size. The experiments were not randomized, and the investigators were not blinded to allocation during experiments and outcome assessment.

Ancient DNA analysis

We generated powder from the skeletal remains of all individuals excavated from sites throughout the Caribbean (see Supplementary Information section 2 for archaeological site information and Figures S1-S11 for maps showing the location of the islands and/or sites studied). Powder was produced from a cochlea[38,39], tooth, phalanx, or ossicle[40] from each individual in a clean room facility at Harvard Medical School (Boston, USA), University College Dublin (Dublin, Ireland), or the University of Vienna (Vienna, Austria); see Supplementary Data 2 for the skeletal element used for each individual and location of powder preparation. We extracted DNA in dedicated ancient DNA laboratories at Harvard Medical School or the University of Vienna following published protocols[41-43]. From the extracts, we prepared dual-barcoded double-stranded[44] or dual-indexed single-stranded libraries[45], both treated with uracil-DNA glycosylase (n class="Gene">UDG) to reduce the rate of characteristic ancient DNA damage[46]. Double-stranded libraries were treated in a modified partial UDG preparation[44] (‘half’), leaving a reduced damage signal at both ends (5’ C-to-T, 3’ G-to-A). Single-stranded libraries were treated with E. coli UDG (USER from NEB) that inefficiently cuts the 5’ Uracil and does not cut the 3’ Uracil. For a subset of individuals, we increased coverage by preparing multiple libraries; see Supplementary Data 2 for the number of libraries analyzed for each individual. To generate SNP capture data, we used in-solution target hybridization to enrich for sequences that overlap the mitochondrial genome and ~1.24 million genome-wide SNPs[47-50] (“1240k”), either in two separate enrichments or simultaneously (Supplementary Data 2). We then added two 7-base-pair indexing barcodes to the adapters of each double-stranded library (single-stranded libraries are already indexed from the library preparation) and sequenced libraries using either an Illumina NextSeq500 instrument with 2×76 cycles or an Illumina HiSeqX10 instrument with 2×101 cycles and reading the indices with 2×7 cycles (double-stranded libraries) or 2×8 cycles (single-stranded libraries). Prior to alignment, we merged paired-end sequences, retaining reads that exhibited no more than one mismatch between the forward and reverse base if base quality was ≥20, or 3 mismatches if base quality was <20. A custom toolkit was used for merging and trimming adapters and barcodes (available at https://github.com/DReichLab/ADNA-Tools). Merged sequences were mapped to the reconstructed human mtDNA consensus sequence (RSRS)[51] and the n class="Species">human reference genome version hg19 using the samse command in BWA v.0.7.15-r1140[52] with the parameters -n 0.01, -o 2, and -l 16500. Duplicate molecules (those exhibiting the same mapped start and end position and same stand orientation) were removed after alignment using the Broad Institute’s Picard MarkDuplicates tool (available at http://broadinstitute.github.io/picard/). We trimmed two terminal bases from UDG-half libraries to reduce damage-induced errors. We evaluated the authenticity of the isolated DNA by retaining individuals with a minimum of 3% of cytosine-to-n class="Chemical">thymine substitutions at the end of the sequenced fragments[44] for double stranded libraries and 10% for single-stranded libraries, point estimates of mitochondrial DNA (mtDNA) contamination below 5% using contamMix v.1.0–12[47], and point estimates of X chromosome contamination (in males) below 3%[53]; we also used contamLD[54] to confirm low contamination rates (<~6%) (Supplementary Data 2). Eight single-stranded libraries from Ceramic Age individuals did not reach our 10% cytosine-to-thymine substitution threshold but had at least an 8% substitution rate, and therefore assessed as authentic given the relatively recent dates for these individuals; all eight libraries also were within the expected range for the other two authenticity metrics and had <1% contamination as assessed by contamLD. Multiple libraries from I10333 and I10334 as well as one library from I12341 showed poor match rates to the mtDNA consensus sequence, but this is likely due to low mtDNA coverage (0.5–2.1×). Two libraries from I7977 and one from I15596 were also slightly below this threshold (6–10% mismatch rate), but also surpassed thresholds for the other two metrics and had ~1.1% contamination as assessed by contamLD. We determined SNPs by randomly sampling an overlapping read with minimum mapping quality of ≥10 and base quality of ≥20. Individuals with <20,000 covered SNPs were excluded from quantitative analyses. One individual from each of three pairs of first-degree relatives in the dataset was excluded from population genetics analysis; in all cases, we retained the higher coverage individual; see Supplementary Data 1. We also generated shotgun sequencing data for two Ceramic-associated individuals from The Bahamas, I14922 (Abaco Island) and I14879 (South Andros) using the same system of data generation and processing, although the capture step was not included (Supplementary Data 2). For shotgun data, we report thresholds of mapping quality ≥30 and base quality ≥ 20.

Radiocarbon dates

We report 45 new radiocarbon (14C) dates on bone fragments generated using accelerator mass spectrometry (AMS) (Supplementary Data 3). Most dates (n=41) were generated at the Pennsylvania State University (PSU) Radiocarbon Laboratory, and the remainder (n=4) were generated at the Center for Isotopic Research on Cultural and Environmental heritage (CIRCE). The sample preparation methodology at PSU was carried out as previously reported[22], where bone collagen was extracted and purified using a modified Longin method with ultrafiltration[55] (>30 kDa gelatin); if collagen yields were low, a modified XAD process[56] (XAD amino acids) was used. Carbon and nitrogen isotope ratios were then measured (Supplementary Information section 3) as a quality control measure; all C:N ratios fell between 3.15 and 3.44, indicating good collagen or amino acid preservation[55]. We also evaluated diet in these individuals (e.g., marine vs. terrestrial) and compared the results to reference data from 242 ancient Caribbean and Maya individuals (Figures S12-S14). Attenuated Total Reflectance Fourier Transform Infrared (ATR-FTIR) spectra were generated to assess postmortem changes in the apatite crystal structure of the bone samples; ATR-FTIR spectra of all samples are displayed in Figure S15 and quality control parameters are reported in Table S1. Ultimately, all calibrated 14C ages were computed using OxCal v4.4[57] using the IntCal20[58] after our stable isotope analysis detected minimal consumption of marine resources. Sample preparation at CIRCE was carried out following the lab-adapted Longin method[59]; isotopic information was not generated for these individuals. Supplementary Data 3 lists the preparation method used for each individual and Supplementary Information section 3 describes the generation of isotopic data in more detail and its use in calibrating the 14C dates generated for the Caribbean individuals.

Dataset assembly

We merged genome-wide data for 93 previously-reported individuals[4] with newly-generated data from 174 ancient individuals for co-analysis, retaining 89 of them for a final co-analysis dataset comprising 263 individuals (details of merging in Supplementary Information section 4). We leverage these previously published data to revisit statistics and analyses reported in that work[4] (Tables S2, S23, S29) and carry out additional analyses using these data (Tables S3, S24, S25, S26, S27, S28, Figures S33, S34). We merged these 263 ancient individuals that passed screening into a base dataset that included 61 previously published ancient American individuals[16,20,60-63], and 36 modern Indigenous American groups sourced from single nucleotide polymorphism (SNP) array genotyping datasets or whole genome sequencing datasets (Supplementary Data 5): ‘1240K SNPs’, whole genome sequencing data restricted to a canonical set of 1,233,013 SNPs[47-50,64,65] Human Origins dataset’, 597,573 SNPs[66-68] ‘Illumina dataset’ (unmasked/unadmixed individuals only), 352,432 SNPs[13] All comparative analyses involving present-day Indigenous American populations were performed on the Illumina dataset, whereas for qpAdm and qpWave’s set of outgroup populations (“Right”) we used the Human Origins dataset for increased coverage. All genome-wide analyses were performed on autosomal data.

Uniparental haplogroups

We determined mtDNA haplogroups for all individuals using bam files, restricting to reads with MAPQ ≥ 30 and base quality ≥ 20. We constructed a consensus sequence with samtools and bcftools version 1.3.1 using a majority rule and then determined the haplogroup with HaploGrep2, using Phylotree version 17. We determined Y chromosome haplogroups using sequences mapping to 1240K Y-chromosome targets, restricting to sequences with MAPQ ≥ 30 and base quality ≥ 30. We called haplogroups by determining the most derived mutation for each individual, using the nomenclature of the International Society of Genetic Genealogy (ISOGG; http://www.isogg.org) version 14.76 (April 2019). Mutational differences and corresponding mtDNA haplogroups, and Y chromosome haplogroups and their supporting derived mutations are found in Supplementary Data 9. A discussion of mtDNA and Y chromosome haplogroup distribution in the Caribbean is found in Supplementary Information section 10; see Figures S29 for distribution of mtDNA haplogroups, Figure n class="Gene">S30 for details of three mtDNA mutations diagnostic of previously unobserved mtDNA haplogroup which is a variant of C1d, and Figure S31 for distribution of Y chromosome haplogroups.

Kinship

We assessed kinship for every pair of individuals newly-reported here as those that we co-analyze[4] (including individuals from different sites and islands) using a previously described method[69], and we present results for 1st-, 2nd-, and 3rd-/4th-degree (‘close’) relatives in Table S5 (Supplementary Information section 7). In our newly-reported dataset of 174 ancient individuals, we identified 49 individuals sharing 49 unique pairwise kin relationships. Three pairs of individuals were identified as 1st-degree relatives, while 21 pairs were 2nd-degree relatives, and 25 pairs were 3rd-degree or higher. For the data that we co-analyze[4], we identified 13 individuals who were part of eight relationships (four 2nd-degree and four 3rd-degree or higher). No close relatives were identified between the datasets. Distant cousins detected using IBD analysis are presented elsewhere (Extended Data Table 2; Supplementary Data 13).

Analysis of shared genomic segments

We identified Runs of Homozygosity (ROH) within our ancient dataset using the Python package hapROH (https://test.pypi.org/project/hapsburg/). Following a previously described method[24], we used 5008 global haplotypes from the 1000 Genomes Project haplotype panel[33] as the reference panel. As recommended for datasets with genotypes for 1240K SNPs, we applied our method to ancient individuals with at least 400,000 SNPs covered and ran the method on the pseudo-haploid data to identify ROH longer than 4 centimorgan (cM). We used the default parameters of hapROH, which are optimized for ancient data genotyped at 1240K SNPs. For each individual, we group the inferred ROH into four length categories: 4–8cM, 8–12cM, 12–20cM and >20cM and report the total sum in these bins (Supplementary Data 12; Fig. S21). To estimate effective population size Ne from ROH, we applied a maximum likelihood inference framework (for derivation of the likelihood see Supplementary Information section 7). We fit the lengths of all genome-wide ROH lengths 4–20cM, and infer the effective population size that maximizes the likelihood for ROH lengths observed in a set of individuals. Estimation uncertainties are obtained from the likelihood profile (95% CIs correspond to values within 1.92 units down from the maximum of the log-likelihood function). Tests on simulated data confirmed the ability of our estimator to recover Ne estimates from genome-wide ROH of few individuals (Figs. S22, S23). We also analyzed shared genomic segments on the X chromosome between pairs of male individuals (“IBD_X”). To call such n class="Disease">IBD blocks, we paired pseudo-haploid data of two X chromosomes and ran hapROH on read counts of the resulting artificial diploid individual; see Figure S24 for example of IBD segment shared between two individuals. We inferred population sizes from IBD with the same likelihood approach as described for ROH, applying it to all pairs of individuals between two groups of individuals. See Supplementary Information section 7 for details.

Conditional Heterozygosity

We used popstats[68] to compute conditional heterozygosity for all clades and sub-clades, which we compared with contemporaneous groups from continental South America, such as from the Peruvian Middle and Late Horizon periods[70]. As previously described[71,72], we restricted the analysis to transversion SNPs ascertained in a Yoruba individual; see Extended Data Fig. 5.

PCA

We performed principal component analysis (PCA) with smartpca v181623[73], using the 1240K + Illumina merged dataset and using the option ‘lsqproject: YES’ to project ancient individuals onto the eigenvectors computed from modern individuals in the version shown in the main manuscript. The approach of projecting each ancient individual onto patterns of variation learned from modern individuals enables us to use data from a large fraction of SNPs covered in each individual and thereby maximize the information about ancestry that would be lost in approaches that require restriction to a potentially smaller number of SNPs for which there is intersecting data across lower coverage ancient individuals. We used the option ‘newshrink: YES’ to remap the points for the individuals used to generate the PCA onto the positions where they would be expected to fall if they had been projected, thereby allowing the projected and non-projected individuals to be appropriately co-visualized. We projected 92 previously published ancient individuals[4,16,20] and 174 new ancient individuals onto the first two principal components computed using 61 individuals from 23 present-day populations (Extended Data Fig. 1b). See Supplementary Data 4 for all individuals included in PCA and values of PCs 1 and 2 for the main manuscript PCA. For the PCA presented as Fig. S19 (Supplementary Information section 5), we used non-related, non-outlier ancient individuals from Cuba_Archaic, Venezuela_Ceramic, EasternGreaterAntilles_Ceramic, BahamasCuba_Ceramic, and SECoastDR_Ceramic with >500K SNPs to compute the eigenvectors and projected all other ancient individuals. We again used the ‘lsqproject: YES’ and ‘newshrink: YES’ options. Individuals used to compute eigenvectors are listed in Supplementary Data 4. For PCA by archaeological site, non-zoomed PCA, PCA excluding CpG sites, and PCA with axes computed using ancient individuals, see Figs. S16-S19.

Unsupervised analysis of population structure

We used the software ADMIXTURE v1.3.0[74,75] to perform unsupervised structure analysis on a dataset comprised of autosomal SNPs that overlap between the 1240k and Illumina dataset and pruned in PLINK1.9[76] using --indep-pairwise 200 25 0.4. This left 273,245 SNPs for the analysis. We ran five random-seeded replicates for each K in the interval between 2 and 10 with cross-validation enabled (--cv flag) to identify the runs with the low cross-validation errors (Table S4). For each value of K, we plotted the replicate with the lowest cross-validation error and compared the results. We choose to present K=6 as Extended Data Fig. 1c, as we found that the model with six components had a low cross-validation error and differentiated the components in a useful way for visualization. Results for the other values of K are presented as Fig. S20 in Supplementary Information section 6.

Estimation of FST coefficients

To measure pairwise genetic differentiation between two groups of individuals, we estimated average pairwise FST and its standard error via block-jackknife using smartpca v.181623 and the options ‘fstonly: YES’ and ‘inbreed: YES.’ We removed the individual with lower coverage of each pair of first degree relatives, as well as ancestry outliers (see main text); we excluded Haiti_Ceramic, which comprises only two individuals who share a second-degree relationship as well as Macao, a site in the Dominican Republic from which all four individuals analyzed are 2nd-3rd-degree relatives of at least one other individual from the site. See results in Extended Data Fig. 2.

Clade grouping framework with qpWave, TreeMix and f-statistics

We used a multi-step framework involving qpWave, TreeMix, and f-statistics to group sites and individuals, and considered this information together with admixture profiles and proportions from qpAdm to produce Fig. 1b (detailed methodology in Supplementary Information section 8). We started by using qpWave to identify major clades based on shared ancestry and then used TreeMix and f-statistics to investigate the existence of sub-clades. Once all sub-clades were identified, we used f-statistics to investigate further substructure between sites within each clade. Geographic and chronological information such as island or cultural affiliation was not considered for these analyses, ensuring all clades and subclades were based solely on genetic information. We examined the association between genetic data and archaeological cultural complexes only after considering the genetic and archaeological information separately, following a previously published example[77]. The software qpWave[13] from ADMIXTOOLS v6.0[68] estimates the minimum number of ancestry sources needed to form a group of test populations (“Left”), relative to a set of differentially related reference populations (“Right”). If the “Left” group contains two populations, qpWave will evaluate if they can be modelled as descending from the same sources, and hence will determine whether they form a clade. We used 12 present-day Indigenous American populations from the Human Origins dataset[67] plus Yukpa[64] representing different language families and ancestries from the American continent as our “Right” reference population set: Chipewyan, Zapotec, Mixe, Mixtec, Suruí, Cabécar, Piapoco, Karitiana, Yukpa, Quechua, Wayuu, Apalai, Arara The argument ‘allsnps: NO’ was used, which restricts the analysis SNP set to intersection of all SNPs among all populations and maximizes the reliability of the analysis[78]. The ‘allsnps: YES’ option was developed to increase the number of SNPs analyzed in cases where very little SNP overlap exists between all populations included in a qpWave model[79]. While it is commonly used when low coverage data results in the loss of the majority of sites in the initial datasets[78], there is a risk that this option introduces unreliability in the analysis, particularly in cases where the base population is highly diverged. In this dataset, a high depth of coverage and relatively large sample sizes made it unnecessary for us to use the ‘allsnps: YES’ option. We ran two consecutive steps of qpWave analyses, starting with the identification of major groupings (step 1; Figure S25), or clades, and then reassessed the relationships between members within those clades by running the same tests in a “model competition” approach where individuals from other sites from within the same clade were added to the “Right” set (step 2; Figure S26). A significance threshold of p>0.01 was set for accepting a clade between two sites or individuals. The range of covered SNPs was 170,927–827,039, with a median of 672,888. After identifying the major clades and/or pairs of sites that uniquely formed a clade with one another, we ran TreeMix with these clades and 27 previously published present-day Indigenous populations[13] (Supplementary Data 5) to identify within-clade site structure (step 3; Figures S27, S28) by generating a maximum likelihood tree. We excluded four Chibchan, Chocoan and Arawak-speaking populations possibly admixed with each other from this analysis. We ran TreeMix, grouping the SNPs in windows of 500 (flag -k 500) to account for linkage disequilibrium, setting Chipewyan as root (-root), allowing random migration events (-m), and disabling sample size correction (-noss) in order to include sites or populations represented by a single-individual. We note that single-individual populations still present artifactually long branches that do not truly represent population-specific drift. By running TreeMix and allowing consecutive random migration/admixture events, we identified nodes and branches that maintained the same ancient Caribbean sites among the different runs. We then used f-statistics to evaluate if they formed a sub-clade to the exclusion of the other sites by following the tree’s structure. For each identified intact node among all TreeMix runs we used each downstream pair of site(s) as Test1 and Test2 and investigated their relationship to upstream sites or pools of sites (step 4). If an upstream node was unchanged in all runs, the sites composing it were pooled. However, once the first inconsistency was identified in an upstream node, all sites beyond that node were pooled together. A combination of three statistics per relationship allowed us to evaluate the TreeMix structure of the sites being tested: With Test1 and Test2 expected to be closer to each other than to Pool, the tested relationship finds support if the first test is statistically non-significant and at least one of the other two are significant. We used a Z-score threshold of 2.8 (associated with a 99.5% CI) to assess significance. These sites were then merged into a sub-clade inside the major Ceramic clade for further analysis. We did not include the sites of Cueva del Perico I, Los Indios, Punta Candelero, and Tibes in the TreeMix and f due to reduced coverage, but evaluated these sites separately to see if they shared closer affinities to any sub-clades relative to the others (Supplementary Data 7; Supplementary Information section 8). After this clading analysis, we used f-statistics to further investigate potential substructure between sites within each sub-clade (step 5). For each pairwise site comparison, we randomly divided each site into two groups of individuals, and used a statistic of the form f(Site1_subset1, Site2_subset1; Site1_subset2, Site2_subset2) to identify positive statistics suggesting substructure within the same clade. This randomization step was repeated 10 times, and the average Z-score was calculated. If a site was composed of a single individual we instead computed statistics of the form f(Mbuti, Site1_subset1; Site2_singleIndividual, Site1_subset2), intended to evaluate if individuals within Site1 were closer to each other than to the single individual from Site2. No statistics were computed if both sites being tested contained only one individual. We also used f-statistics to test if any specific sub-clade within the Caribbean_Ceramic clade had more Archaic-related ancestry than another. Specifically we used the statistic f and interpreted results as significant based on a |Z|>2.8; results are presented in Table S20.

qpAdm

We used qpAdm[49] from ADMIXTOOLS v6.0[66] with ‘allsnps: NO’ to identify the most likely sources of ancestry and admixture for our populations/clades. First, we investigated if the possible outliers SECoastDR_Ceramic16539, SECoastDR_Ceramic16520 and EasternGreaterAntilles_Ceramic7969, as well as the individuals comprising the sub-clades LesserAntilles_Ceramic, Haiti_Ceramic and Curacao_Ceramic, could be modelled as admixed between the major ancestries represented by GreaterAntilles_Archaic (composed of all Archaic-associated individuals Cuba and I10126), Caribbean_Ceramic (composed of BahamasCuba_Ceramic, EasternGreaterAntilles_Ceramic and SECoastDR_Ceramic, as well as LesserAntilles_Ceramic where relevant), and Venezuela_Ceramic (see Tables S9, S10, S12-S15). We used this information to complete Fig. 1b. We also used qpAdm to evaluate the presence of Archaic-related ancestry in Caribbean_Ceramic. Then, based on this admixture information, we attempted to obtain more detailed admixture models using the sub-clades from within Caribbean_Ceramic and GreaterAntilles_Archaic as possible sources. Lastly, we attempted to identify more distal sources of ancestry by using previously published ancient individuals from the Americas[60-63], in this case for qpWave’s three major clades/groups. The base “Right” set used was the same used for qpWave. We also tested all 1-, 2-, and 3-way models using these “Right” present-day populations as sources by moving them to the “Left” as necessary, and confirmed the results with the same unmasked/unadmixed populations from the Illumina dataset.

qpGraph

We used qpGraph and an edited skeleton tree of previously published ancient American populations[63] to construct an admixture tree representing the relationships of the new populations analysed in this study along with ref.[4] and present-day Piapoco, which our other analyses showed to be closely related to Caribbean_Ceramic (Fig. 2c). Detailed methodology is provided in Supplementary Information section 12.

Admixture simulations

We investigated the sensitivity of qpWave in detecting Carib-related ancestry in the Caribbean_Ceramic sub-clades by generating artificially admixed individuals with Caribbean_Ceramic ancestry mixed with increasing amounts (1, 2, 5, 8, 10, 20, 30, 40, and 50%) of a plausibly Carib-associated ancestry. For the Carib-associated ancestry we tested Arara (present-day Indigenous Carib speakers), Venezuela_Ceramic (inhabitants of a possible region of origin for this ancient Carib migration), and also LesserAntilles_Ceramic (possibly representing Island Caribs), and then assessed at what admixture threshold we were able to reliably detect the latter ancestry type (Supplementary Information section 13; Fig. S32). To generate these admixed individuals, we identified common SNPs between the two sources, randomly selected genotypes from the Arara individuals from the Human Origins and Illumina SNP array datasets corresponding to each of the nine percentages to be tested, and added the remaining SNPs from a random individual from Bahamas_Ceramic, EasternGreaterAntilles_Ceramic, SECoastDR_Ceramic, and LesserAntilles_Ceramic with over 800,000 SNPs. We then ran qpWave with each of the simulated admixed individuals on the “Left” plus their correspondent sub-clade, while using the default 12 “Right” populations (excluding Arara), as described in Supplementary Information section 8, plus the Carib proxy population used to generate those individuals.

Dating admixture

We used the method DATES (Distribution of Ancestry Tracts of Evolutionary Signals[22] v3520 (Chintalapati, M., Neel, A., Patterson, N. & Moorjani, P. Reconstructing the spatio-temporal patterns of admixture in human history. In Preparation.) to estimate the dates of admixture in admixed individuals from Haiti. This method measures the decay of ancestry covariance to infer the time since mixture and estimates jackknife standard errors. Details of DATES analysis are found in Supplementary Information section 14; results for Haiti_Ceramic are found in Table S22.

Relatedness of ancient individuals to present-day admixed Caribbean populations

We computed relative allele-sharing between present-day admixed Caribbean populations (via their Indigenous ancestry) and ancient Archaic-associated versus Ceramic-associated individuals with ADMIXTOOLS 2 (Maier R., Reich D., Patterson N. Rapid inference of demographic history using ADMIXTOOLS 2. In Preparation.) through the statistic f(European, Test; Cuba_Archaic, Caribbean_Ceramic). In order to evaluate statistical power, we compared results for present-day Cubans alone to results obtained by adding one ancient individual from either the GreaterAntilles_Archaic or Caribbean_Ceramic clade to the Cuban test population. Full details are found in Supplementary Information section 15.

Analysis of phenotypically-relevant SNPs

Analyzing SNPs previously known to be relevant to phenotypic traits allows us to explore their frequencies in the pre-contact Caribbean and Venezuela. We used mpileup in samtools[80] version 1.3.1 with the settings -B -q30 -Q30 to obtain information about each SNP covered by reads from the bam files of our individuals (after trimming 2 base pairs from the molecule ends) and used the fasta file from human genome GRCh37 (hg19) as a reference file for the pileup. We counted the number of reference and alternate alleles, combining counts on the forward and reverse strands. Data are provided in Supplementary Data 15, with a discussion of results in Supplementary Information section 16.

Testing for an Australasian link

We tested for a signal of relatedness to present-day Australasian populations[64,68] (“Population Y” signal), using the statistic f(Mbuti, Onge/Papuan; Mixe, Archaic/Ceramic) and testing all final sub-clades as Archaic/Ceramic. Here, Mixe is representative of a population that harbors no Population Y signal. When Onge was used as the Australasian proxy, several of the ancient groups showed weakly positive statistics (Z between 2 and 3), but only the Archaic individual I10126 from the site of Andrés (Dominican Republic) was significant at Z = 3.4. While this signal is significant at p=0.0030 even after performing a Bonferroni correction for the nine hypotheses tested in Extended Data Table 4, the signal is non-significant when Papuan is used as the Australasian proxy (Z=2.2). We also caution that all Population Y statistics are likely to be overinflated in their significance because the original discovery of the Population Y signal carried out extensive hypothesis testing to identify a population in the third position of the statistic f(Mbuti, Onge/Papuan; Mixe, Archaic/Ceramic) (Mixe) that maximized the value of the statistic when any other Native American group in was used in the fourth position; thus, there is a further multiple hypothesis testing issue for which our analysis does not correct. The lack of a clear population Y signal is consistent with prior studies that also have not found this signal in ancient individuals from this region[16] and other areas of South America[63].
Extended Data Table 4:

Statistics testing for an Australasian link.

Testf4(Mbuti, Onge; Mixe, Test)Z-scoreSNPs used
Cuba_Archaic0.0006062.3301115829
Domincan_Andres_Archaic0.0012913.380741742
BahamasCuba_Ceramic0.0005902.4971104937
EasternGreaterAntilles_Ceramic0.0005282.3581110135
SECoastDR_Ceramic0.0005482.4201112602
Haiti_Ceramic0.0007202.1021015357
Curacao_Ceramic0.0005952.180984268
LesserAntilles_Ceramic0.0004902.0981096317
Venezuela_Ceramic0.0006332.447957964

Testf4(Mbuti, Papuan; Mixe, Test)Z-scoreSNPs used

Cuba_Archaic0.0003251.3151116502
Domincan_Andres_Archaic0.0006961.853742248
BahamasCuba_Ceramic0.0003831.8061105601
EasternGreaterAntilles_Ceramic0.0004452.1921110808
SECoastDR_Ceramic0.0004011.9501113277
Haiti_Ceramic0.0003771.2431015971
Curacao_Ceramic0.0003991.573984884
Lesser_Antilles_Ceramic0.0003381.5991096963
Venezuela_Ceramic0.0002250.923958591

Temporal distribution of newly-reported individuals and overview of population structure.

(a) Numbers represent individuals from each site; thick lines denote direct 14C dates (95.4% calibrated confidence intervals); thin lines denote archaeological context dating; grey area identifies the first arrivals of ceramic-users in the Caribbean. Colors and labels are consistent with Fig. 1. (b) PCA plot with ancient individuals shown as solid squares or circles (Archaic- or Ceramic-associated individuals, respectively). Newly-reported individuals are outlined in black, genetic outliers are outlined in red, and individuals with <30,000 SNPs are outlined in blue. Individuals are separated by sub-clades, and three individuals from the site of Cueva Roja (Dominican Republic) who were excluded from clading analysis are labeled “Dominican Cueva Roja Archaic” and colored magenta. Individual PDI009, assessed elsewhere as an outlier[11], is denoted with an asterisk. Three previously-published ancient Caribbean individuals[9,10] are shown as inverted triangles outlined in gray and colored for the sub-clade that encompasses the geographic region with which they are associated. This plot focuses on ancient individuals and does not show some present-day populations; a full plot is provided as Fig. S17. (c) ADMIXTURE analysis best supports K=6 ancestral elements. Newly-reported and co-analyzed individuals are clustered by sub-clade; all newly-reported individuals are identified by a black bar to the side of the plot. The same three previously-published individuals[9,10] shown in Extended Data Fig. 1b are included, and three modern-day populations are shown for reference (Suruí, Cabécar, Piapoco).

FST distances.

Average pairwise FST distances and standard errors (x100) between (a) clades and (b) sites with more than two unrelated individuals, demonstrating both overall high levels of genetic similarity between the Caribbean_Ceramic sub-clades and the sites composing them, as well as the magnitude of genetic differentiation between those and the groups with Archaic- and Venezuela-related ancestries.

Maximum likelihood population tree from allele frequencies using Treemix.

The Caribbean_Ceramic sub-clades are shown on the same branch as modern Arawak-speaking groups (Palikur, Jamamadi). Orange arrows represent admixture events, although observations from other analyses (e.g., qpAdm admixture modeling) suggest that the indicated direction of admixture may be inaccurate (e.g., we believe it is more likely that there is GreaterAntilles_Archaic admixture into Haiti_Ceramic than the reverse scenario; Supplementary Information section 9).

Estimated effective population sizes.

(a) Estimates per site are based on ROH blocks 4–20 cM long using a likelihood model (Supplementary Information section 7). Colors as per sub-clades, numbers denote the count of analyzed individuals. Highly consanguineous individuals with a sum of ROH>20 above 50 cM were excluded. (b) Same as (a) but for IBD segments 8–20cM long shared on the X chromosome between all pairs of males. Closely related pairs of individuals with a sum of IBD X>20 above 25 cM were excluded. Numbers denote counts of all remaining pairs. In (a) and (b) points represent maximum likelihood estimate and vertical bars represent 95% CI.

Conditional heterozygosity by clade.

Conditional heterozygosity in the ancient Caribbean was similar to that of contemporaneous groups from Peru[70], except for the Archaic-associated groups and Venezuela_Ceramic. First- and second-degree relatives were excluded from the analysis, including the pair of related individuals representing Haiti_Ceramic. Colored circles represent point estimates (color scheme matching Fig. 1); bars represent three standard errors.

Pairwise kinship estimates for all individuals from sites where close relatives were identified using autosomal data.

Dotted lines identify family clusters and inter-site relationships; bottom rows correspond to relationships per individual.

Ne estimates for each site.

Table includes all individuals where ROH analysis is possible and excludes individuals with more than 50cM sum of 20cM long ROH.

Subset of cross-site relatives from different islands, identified through IBD analysis.

We measured the X chromosome length and IBD map lengths as ⅔ of the map length of female X. Complete table including cross-site distant relatives within islands in Supplementary Data 13.

Ancestry proportion estimates with qpAdm in present-day Caribbean individuals from Cuba (and its provinces), Dominican Republic, and Puerto Rico[21,28].

Top half, proportions across countries. CEU = European source; YRI = African source; CHB = East Asian source; SGDP = Simons Genome Diversity Project outgroup populations Karitiana, Mixe, Yakut, Ulchi, Papuan, Mursi, and Mbuti; 1000G1 = 1000 Genomes outgroup populations PEL, PJL, JPT, and MSL. Bottom half, proportions across different Cuban provinces. 1000G2 = 1000 Genomes outgroup populations PEL, PJL, JPT, MSL and GIH. Statistics testing for an Australasian link.
  50 in total

1.  Effective population size/adult population size ratios in wildlife: a review.

Authors:  Richard Frankham
Journal:  Genet Res       Date:  2007-12       Impact factor: 1.588

2.  Genomic insights into the early peopling of the Caribbean.

Authors:  Kathrin Nägele; Cosimo Posth; Miren Iraeta Orbegozo; Yadira Chinique de Armas; Silvia Teresita Hernández Godoy; Ulises M González Herrera; Maria A Nieves-Colón; Marcela Sandoval-Velasco; Dorothea Mylopotamitaki; Rita Radzeviciute; Jason Laffoon; William J Pestle; Jazmin Ramos-Madrigal; Thiseas C Lamnidis; William C Schaffer; Robert S Carr; Jane S Day; Carlos Arredondo Antúnez; Armando Rangel Rivero; Antonio J Martínez-Fuentes; Edwin Crespo-Torres; Ivan Roksandic; Anne C Stone; Carles Lalueza-Fox; Menno Hoogland; Mirjana Roksandic; Corinne L Hofman; Johannes Krause; Hannes Schroeder
Journal:  Science       Date:  2020-06-04       Impact factor: 47.728

3.  Ancient DNA Reconstructs the Genetic Legacies of Precontact Puerto Rico Communities.

Authors:  Maria A Nieves-Colón; William J Pestle; Austin W Reynolds; Bastien Llamas; Constanza de la Fuente; Kathleen Fowler; Katherine M Skerry; Edwin Crespo-Torres; Carlos D Bustamante; Anne C Stone
Journal:  Mol Biol Evol       Date:  2020-03-01       Impact factor: 16.240

4.  The formation of human populations in South and Central Asia.

Authors:  Vagheesh M Narasimhan; Nick Patterson; Priya Moorjani; Nadin Rohland; Rebecca Bernardos; Swapan Mallick; Iosif Lazaridis; Nathan Nakatsuka; Iñigo Olalde; Mark Lipson; Alexander M Kim; Luca M Olivieri; Alfredo Coppa; Massimo Vidale; James Mallory; Vyacheslav Moiseyev; Egor Kitov; Janet Monge; Nicole Adamski; Neel Alex; Nasreen Broomandkhoshbacht; Francesca Candilio; Kimberly Callan; Olivia Cheronet; Brendan J Culleton; Matthew Ferry; Daniel Fernandes; Suzanne Freilich; Beatriz Gamarra; Daniel Gaudio; Mateja Hajdinjak; Éadaoin Harney; Thomas K Harper; Denise Keating; Ann Marie Lawson; Matthew Mah; Kirsten Mandl; Megan Michel; Mario Novak; Jonas Oppenheimer; Niraj Rai; Kendra Sirak; Viviane Slon; Kristin Stewardson; Fatma Zalzala; Zhao Zhang; Gaziz Akhatov; Anatoly N Bagashev; Alessandra Bagnera; Bauryzhan Baitanayev; Julio Bendezu-Sarmiento; Arman A Bissembaev; Gian Luca Bonora; Temirlan T Chargynov; Tatiana Chikisheva; Petr K Dashkovskiy; Anatoly Derevianko; Miroslav Dobeš; Katerina Douka; Nadezhda Dubova; Meiram N Duisengali; Dmitry Enshin; Andrey Epimakhov; Alexey V Fribus; Dorian Fuller; Alexander Goryachev; Andrey Gromov; Sergey P Grushin; Bryan Hanks; Margaret Judd; Erlan Kazizov; Aleksander Khokhlov; Aleksander P Krygin; Elena Kupriyanova; Pavel Kuznetsov; Donata Luiselli; Farhod Maksudov; Aslan M Mamedov; Talgat B Mamirov; Christopher Meiklejohn; Deborah C Merrett; Roberto Micheli; Oleg Mochalov; Samariddin Mustafokulov; Ayushi Nayak; Davide Pettener; Richard Potts; Dmitry Razhev; Marina Rykun; Stefania Sarno; Tatyana M Savenkova; Kulyan Sikhymbaeva; Sergey M Slepchenko; Oroz A Soltobaev; Nadezhda Stepanova; Svetlana Svyatko; Kubatbek Tabaldiev; Maria Teschler-Nicola; Alexey A Tishkin; Vitaly V Tkachev; Sergey Vasilyev; Petr Velemínský; Dmitriy Voyakin; Antonina Yermolayeva; Muhammad Zahir; Valery S Zubkov; Alisa Zubova; Vasant S Shinde; Carles Lalueza-Fox; Matthias Meyer; David Anthony; Nicole Boivin; Kumarasamy Thangaraj; Douglas J Kennett; Michael Frachetti; Ron Pinhasi; David Reich
Journal:  Science       Date:  2019-09-06       Impact factor: 47.728

5.  Inference of population splits and mixtures from genome-wide allele frequency data.

Authors:  Joseph K Pickrell; Jonathan K Pritchard
Journal:  PLoS Genet       Date:  2012-11-15       Impact factor: 5.917

6.  Reconstructing Native American population history.

Authors:  David Reich; Nick Patterson; Desmond Campbell; Arti Tandon; Stéphane Mazieres; Nicolas Ray; Maria V Parra; Winston Rojas; Constanza Duque; Natalia Mesa; Luis F García; Omar Triana; Silvia Blair; Amanda Maestre; Juan C Dib; Claudio M Bravi; Graciela Bailliet; Daniel Corach; Tábita Hünemeier; Maria Cátira Bortolini; Francisco M Salzano; María Luiza Petzl-Erler; Victor Acuña-Alonzo; Carlos Aguilar-Salinas; Samuel Canizales-Quinteros; Teresa Tusié-Luna; Laura Riba; Maricela Rodríguez-Cruz; Mardia Lopez-Alarcón; Ramón Coral-Vazquez; Thelma Canto-Cetina; Irma Silva-Zolezzi; Juan Carlos Fernandez-Lopez; Alejandra V Contreras; Gerardo Jimenez-Sanchez; Maria José Gómez-Vázquez; Julio Molina; Angel Carracedo; Antonio Salas; Carla Gallo; Giovanni Poletti; David B Witonsky; Gorka Alkorta-Aranburu; Rem I Sukernik; Ludmila Osipova; Sardana A Fedorova; René Vasquez; Mercedes Villena; Claudia Moreau; Ramiro Barrantes; David Pauls; Laurent Excoffier; Gabriel Bedoya; Francisco Rothhammer; Jean-Michel Dugoujon; Georges Larrouy; William Klitz; Damian Labuda; Judith Kidd; Kenneth Kidd; Anna Di Rienzo; Nelson B Freimer; Alkes L Price; Andrés Ruiz-Linares
Journal:  Nature       Date:  2012-08-16       Impact factor: 49.962

7.  Origins and genetic legacies of the Caribbean Taino.

Authors:  Hannes Schroeder; Martin Sikora; Shyam Gopalakrishnan; Lara M Cassidy; Pierpaolo Maisano Delser; Marcela Sandoval Velasco; Joshua G Schraiber; Simon Rasmussen; Julian R Homburger; María C Ávila-Arcos; Morten E Allentoft; J Víctor Moreno-Mayar; Gabriel Renaud; Alberto Gómez-Carballa; Jason E Laffoon; Rachel J A Hopkins; Thomas F G Higham; Robert S Carr; William C Schaffer; Jane S Day; Menno Hoogland; Antonio Salas; Carlos D Bustamante; Rasmus Nielsen; Daniel G Bradley; Corinne L Hofman; Eske Willerslev
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-20       Impact factor: 11.205

8.  Faces Divulge the Origins of Caribbean Prehistoric Inhabitants.

Authors:  Ann H Ross; William F Keegan; Michael P Pateman; Colleen B Young
Journal:  Sci Rep       Date:  2020-01-10       Impact factor: 4.379

9.  Accurate Non-parametric Estimation of Recent Effective Population Size from Segments of Identity by Descent.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2015-08-20       Impact factor: 11.025

10.  Reconstructing the population genetic history of the Caribbean.

Authors:  Andrés Moreno-Estrada; Simon Gravel; Fouad Zakharia; Jacob L McCauley; Jake K Byrnes; Christopher R Gignoux; Patricia A Ortiz-Tello; Ricardo J Martínez; Dale J Hedges; Richard W Morris; Celeste Eng; Karla Sandoval; Suehelay Acevedo-Acevedo; Paul J Norman; Zulay Layrisse; Peter Parham; Juan Carlos Martínez-Cruzado; Esteban González Burchard; Michael L Cuccaro; Eden R Martin; Carlos D Bustamante
Journal:  PLoS Genet       Date:  2013-11-14       Impact factor: 5.917

View more
  17 in total

1.  Archaeogenomic distinctiveness of the Isthmo-Colombian area.

Authors:  Marco Rosario Capodiferro; Bethany Aram; Alessandro Raveane; Nicola Rambaldi Migliore; Giulia Colombo; Linda Ongaro; Javier Rivera; Tomás Mendizábal; Iosvany Hernández-Mora; Maribel Tribaldos; Ugo Alessandro Perego; Hongjie Li; Christiana Lyn Scheib; Alessandra Modi; Alberto Gòmez-Carballa; Viola Grugni; Gianluca Lombardo; Garrett Hellenthal; Juan Miguel Pascale; Francesco Bertolini; Gaetano Salvatore Grieco; Cristina Cereda; Martina Lari; David Caramelli; Luca Pagani; Mait Metspalu; Ronny Friedrich; Corina Knipper; Anna Olivieri; Antonio Salas; Richard Cooke; Francesco Montinaro; Jorge Motta; Antonio Torroni; Juan Guillermo Martín; Ornella Semino; Ripan Singh Malhi; Alessandro Achilli
Journal:  Cell       Date:  2021-03-23       Impact factor: 41.582

2.  Reconstructing the history of founder events using genome-wide patterns of allele sharing across individuals.

Authors:  Rémi Tournebize; Gillian Chu; Priya Moorjani
Journal:  PLoS Genet       Date:  2022-06-23       Impact factor: 6.020

3.  Archaeological occurrences of terrestrial herpetofauna in the insular Caribbean: cultural and biological significance.

Authors:  Corentin Bochaton
Journal:  R Soc Open Sci       Date:  2022-07-13       Impact factor: 3.653

4.  Ancient Maltese genomes and the genetic geography of Neolithic Europe.

Authors:  Bruno Ariano; Valeria Mattiangeli; Emily M Breslin; Eóin W Parkinson; T Rowan McLaughlin; Jess E Thompson; Ronika K Power; Jay T Stock; Bernardette Mercieca-Spiteri; Simon Stoddart; Caroline Malone; Shyam Gopalakrishnan; Lara M Cassidy; Daniel G Bradley
Journal:  Curr Biol       Date:  2022-05-18       Impact factor: 10.900

5.  Ethics of DNA research on human remains: five globally applicable guidelines.

Authors:  Songül Alpaslan-Roodenberg; David Anthony; Hiba Babiker; Eszter Bánffy; Thomas Booth; Patricia Capone; Arati Deshpande-Mukherjee; Stefanie Eisenmann; Lars Fehren-Schmitz; Michael Frachetti; Ricardo Fujita; Catherine J Frieman; Qiaomei Fu; Victoria Gibbon; Wolfgang Haak; Mateja Hajdinjak; Kerstin P Hofmann; Brian Holguin; Takeshi Inomata; Hideaki Kanzawa-Kiriyama; William Keegan; Janet Kelso; Johannes Krause; Ganesan Kumaresan; Chapurukha Kusimba; Sibel Kusimba; Carles Lalueza-Fox; Bastien Llamas; Scott MacEachern; Swapan Mallick; Hirofumi Matsumura; Ana Y Morales-Arce; Giedre Motuzaite Matuzeviciute; Veena Mushrif-Tripathy; Nathan Nakatsuka; Rodrigo Nores; Christine Ogola; Mercedes Okumura; Nick Patterson; Ron Pinhasi; Samayamantri P R Prasad; Mary E Prendergast; Jose Luis Punzo; David Reich; Rikai Sawafuji; Elizabeth Sawchuk; Stephan Schiffels; Jakob Sedig; Svetlana Shnaider; Kendra Sirak; Pontus Skoglund; Viviane Slon; Meradeth Snow; Marie Soressi; Matthew Spriggs; Philipp W Stockhammer; Anna Szécsényi-Nagy; Kumarasamy Thangaraj; Vera Tiesler; Ray Tobler; Chuan-Chao Wang; Christina Warinner; Surangi Yasawardene; Muhammad Zahir
Journal:  Nature       Date:  2021-10-20       Impact factor: 49.962

Review 6.  Peopling of the Americas as inferred from ancient genomics.

Authors:  Eske Willerslev; David J Meltzer
Journal:  Nature       Date:  2021-06-16       Impact factor: 49.962

7.  Tropical islands of the Anthropocene: Deep histories of anthropogenic terrestrial-marine entanglement in the Pacific and Caribbean.

Authors:  Scott M Fitzpatrick; Christina M Giovas
Journal:  Proc Natl Acad Sci U S A       Date:  2021-10-05       Impact factor: 11.205

8.  Genetic ancestry, admixture, and population structure in rural Dominica.

Authors:  Monica H Keith; Mark V Flinn; Harly J Durbin; Troy N Rowan; Gregory E Blomquist; Kristen H Taylor; Jeremy F Taylor; Jared E Decker
Journal:  PLoS One       Date:  2021-11-03       Impact factor: 3.240

9.  TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data.

Authors:  Daniel M Fernandes; Olivia Cheronet; Pere Gelabert; Ron Pinhasi
Journal:  Sci Rep       Date:  2021-10-28       Impact factor: 4.379

Review 10.  Clinical Features, Immunopathogenesis, and Therapeutic Strategies in Vitiligo.

Authors:  Yinghan Wang; Shuli Li; Chunying Li
Journal:  Clin Rev Allergy Immunol       Date:  2021-07-20       Impact factor: 8.667

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.