Literature DB >> 24671218

AmericaPlex26: a SNaPshot multiplex system for genotyping the main human mitochondrial founder lineages of the Americas.

Alexandra Coutinho1, Guido Valverde1, Lars Fehren-Schmitz2, Alan Cooper1, Maria Inés Barreto Romero3, Isabel Flores Espinoza3, Bastien Llamas1, Wolfgang Haak1.   

Abstract

Phylogeographic studies have described a reduced genetic diversity in Native American populations, indicative of one or more bottleneck events during the peopling and prehistory of the Americas. Classical sequencing approaches targeting the mitochondrial diversity have reported the presence of five major haplogroups, namely A, B, C, D and X, whereas the advent of complete mitochondrial genome sequencing has recently refined the number of founder lineages within the given diversity to 15 sub-haplogroups. We developed and optimized a SNaPshot assay to study the mitochondrial diversity in pre-Columbian Native American populations by simultaneous typing of 26 single nucleotide polymorphisms (SNPs) characterising Native American sub-haplogroups. Our assay proved to be highly sensitive with respect to starting concentrations of target DNA and could be applied successfully to a range of ancient human skeletal material from South America from various time periods. The AmericaPlex26 is a powerful assay with enhanced phylogenetic resolution that allows time- and cost-efficient mitochondrial DNA sub-typing from valuable ancient specimens. It can be applied in addition or alternative to standard sequencing of the D-loop region in forensics, ancestry testing, and population studies, or where full-resolution mitochondrial genome sequencing is not feasible.

Entities:  

Mesh:

Year:  2014        PMID: 24671218      PMCID: PMC3966882          DOI: 10.1371/journal.pone.0093292

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Population genetic studies on modern-day Native American populations have described the presence of five haplogroups (hgs), termed A, B, C, D and X [1]–[3]. These five hgs are shared with East Asian populations and support an entry route to the Americas via the Bering landmass. However, Native American populations can be distinguished from their East Siberian source populations by exhibiting distinct sub-haplogroups (sub-hgs), which can only be found in the Americas. These so-called ‘founder lineages’ have been used to describe the demographic history of Native American populations and to shed light on the timing of the entry into and spread throughout the Americas [4]–[6]. The fact that the mtDNAs of all human populations native to the Americas can be assigned to one of the founder lineages pertains to stochastic events that would have affected the initial colonizers of the Americas [7]. The low genetic variation found in modern Native American groups is believed to be due to either population bottlenecks or genetic drift [8], [9]. Most mitochondrial DNA (mtDNA) studies on prehistoric American populations involve sequencing of the D-loop, which contains Hypervariable Regions 1 and 2 (HVR1 and HVR 2 respectively), to describe a sequence haplotype, from which the hg can be inferred [10], [11]. Sequencing of the HVR regions of mtDNA was relatively cost-effective and less time consuming than full mtDNA sequencing, and is therefore still the method of choice for many labs which study human populations [11]. Yet not all lineages harbour enough variation in the D-loop from which to infer a sub-hgs at a deeper level than the overall hg, let alone a specific founder lineage [11]–[14]. As a result, many past and present studies on Native American population history have been restricted to the information gained from the distribution of the major five Pan-American hgs. The coupling of multiplex polymerase chain reaction (PCR) with a Single Base Extension (SBE) reactions, based on the established SNaPshot (Applied Biosystems) or minisequencing principle, has been widely used to design panels of single nucleotide polymorphisms (SNPs) for forensic and anthropological studies [15]. It has also found wide use in population genetic studies focussing on mtDNA and Y-chromosome SNPs, either including SNPs with a global representation or via a targeted selection of characteristic SNPs representing specific geographic regions [13], [16]–[21]. The design of a SNP panel including those markers defining the 15 American founder lineages described by Perego et al. [22] and more had not been attempted, although ‘Multiplex 3’ in van Oven et al. 2011 [23] covered 12 out of these 15. The primary aim of this study was therefore to design a novel SNaPshot assay that enables a fast and cost-efficient high-resolution typing of the majority of known Native American sub-hgs by targeting 26 characteristic SNPs. Our goal was to develop an assay that is universally applicable to accommodate the specific needs of damaged and degraded DNA in ancient DNA work and forensics. Selective sequencing of the SNP regions of interest not only allows for flexibility in the number and choice of SNP sites but also allows (with reservations) the design of ultra-short amplicon lengths (50–80 bp) suitable for degraded DNA typing [13], while using far less DNA than traditional sequencing methods or SNP-typing in individual singleplex PCRs. This is of great importance in forensic and ancient DNA studies where sample DNA is a limited resource [23], [24]. The secondary aim was to develop an assay that could complement an established assay with a global set of SNPs (GenoCore22, see [24] but also [23]) and at the same time provide a fast and efficient screening tool that allows the assessment of overall sample quality (presence of very short fragments of endogenous mtDNA and absence of contaminant hgs) for further use in mitochondrial genome sequencing via DNA library preparation and targeting enrichment techniques, e.g. [25]–[27].

Materials and Methods

AmericaPlex26 SNP selection

We developed a multiplex SNaPshot reaction targeting 26 SNP sites in total including characteristic SNPs of the four major Pan-American sub-hgs A2, B2, C1 and D1, as well as SNP sites for the minor Pan-American lineages C4c, D2a, D4h3a and X2a [6]. The initial choice of SNPs was based on a study by Perego et al. [22] describing 15 American founder lineages. Additional SNP sites were chosen for sub-hgs within each major hg based on the most up-to-date mtDNA phylogeny available at the time (phylotree.org, mtDNA tree Build 13, 28 Dec 2011) in order to enhance the discriminating power of the assay. For sub-hgs defined by more than one characteristic SNP we employed selection criteria during the primer design stage based on the ability to design primers with high specificity in the short flanking region around the SNPs, and under a consensus-melting temperature for all pairs in a multiplex environment. Presented below is a summary of each major Native American sub-hg, their distribution throughout the Americas, as well as the SNP sites chosen to represent the hg and their respective sub-hgs. The representative SNPs typed in the AmericaPlex26 are given in parentheses and a simplified tree illustrating the phylogenetic relationship is shown in Figure 1.
Figure 1

Simplified phylogeny in related to the Reconstructed Sapiens Reference Sequence (RSRS [52]) and typing scheme of the 26 SNPs targeted in the AmericaPlex26.

Sub-haplogroups, which can be unambiguously assigned, are shown in blue (blue). Basal hgs, which cannot be unambiguously assigned, are also shown (black) in order to illustrate the phylogenetic relationship, but also the inherent limitations of our assay. The phylogenetic position of the revised Cambridge Reference Sequence (rCRS [53]) is indicated within macro-hg N. SNPs with a SBE primer in reverse direction targeting the opposite strand are given in Italics.

Simplified phylogeny in related to the Reconstructed Sapiens Reference Sequence (RSRS [52]) and typing scheme of the 26 SNPs targeted in the AmericaPlex26.

Sub-haplogroups, which can be unambiguously assigned, are shown in blue (blue). Basal hgs, which cannot be unambiguously assigned, are also shown (black) in order to illustrate the phylogenetic relationship, but also the inherent limitations of our assay. The phylogenetic position of the revised Cambridge Reference Sequence (rCRS [53]) is indicated within macro-hg N. SNPs with a SBE primer in reverse direction targeting the opposite strand are given in Italics. Haplogroup A2 (G12007A) is found throughout the Americas, but its derivatives A2a (C3330T) and A2b (T11365C) are mainly found in the Northern parts of North America in Inuit, Na-Dené and Siberian populations such as Koryaks and Chukchi [22], [28]–[30], whereas particular subgroups of A2a were also reported from Athapaskan territories in the Southwest [30]. A16265G (defining A2b in [22]) was also added to the assay as it was further resolved to represent sub-hg A2b1, which can be found in Eskimoan-speaking populations (such as the Inuit and Yupik) across the Arctic [31]. Haplogroup B2 (C11177T) is widely distributed throughout the Americas and is nested within hg B4b (G4820A), which has a largely Asian distribution. Studies on modern-day populations describe sub-hgs within B2, which are prevalent in specific geographic areas within the American continents and thus phylogeographically informative [32]. We included diagnostic SNPs for sub-hgs B2a (G16483A, North/Central), B2b (G6755A, ubiquitous), B2c (A7241G, predominantly Central), B2d (T8875C, Central/South), B2e (C6119T, South) and B2f (T10535C, North/Central) to monitor this phylogeographic pattern in ancient Native American populations through time. Haplogroup C is represented in the Americas by sub-hgs C1b, C1c and C1d [22], [32], whereas sub-hgs C1a and C1e are Siberian/East Asian and European sister-clades, respectively. Sub-hg C1b (A493G) can be found throughout South America. Sub-hg C1c is most frequent in Mexico [22] and was split into sub-hgs C1c1a (A12978G) and C1c2 (C14356T), since the immediate flanking region of the two C1c SNPs defining (G1888A and G15930A) were not suitable for primer design. A recent study by Perego et al. [22] has further resolved Central American sub-hg C1d (A16051G), which now includes sub-hg C1d1 (G7697A). Minor Pan-American hg C4c (C14433T) was recently discovered in an ancient sample from British Columbia, and was found to be one of the founding lineages of the Americas based on coalescent age estimates [33], [34]. Haplogroup D is found in the Americas as four distinct sub-hgs D1 (C2092T), D2a, D3 and D4h3a [6], [35]. Sub-hg D2a has been found in Sireniki, Yuit, Aleut and Chukchi populations in Siberia and Aleuts from the Commander Islands [31], [36], [37], and has been further resolved into sub-hgs D2a1 (A9667G; including the Saqqaq Paleo-Eskimo [31]), D2a2 (G4991A) and D2b (A9181G) [22]. Recent studies have further revealed sub-hg D3 to be more derived than previously thought and nested within the larger branch D4b1 (C10181T), which currently encompasses three sub-hgs within North American and Siberian native populations: D4b1a, D4b1b′d, and D4b1c [31]. Therefore, we chose the ancestral SNP defining hg D4b1 for our panel (See mtDNA tree Build 13, 2011). In addition, sub-hg D4h3a (G6285A) is distributed along the Pacific coast of the American continents [6] but most frequent in South America. Haplogroup X2a (A8913G) has only been found in a limited number of samples in North American populations as compared to those of A2, B2, C1 and D1, and is therefore described as minor founding lineage in this paper [8], [35]. Lastly, SNP site T14783C was included as control to define macro-hg M, which encompasses hgs C and D. In contrast, this SNP retains the ancestral state in hgs A, B and X, which belong to macro-hg N.

Primer and probe design

PCR and SBE primers were designed and quality-controlled using default settings and features in the software package Geneious v5.2 (Geneious version (5.2) created by Biomatters. Available from http://www.geneious.com/) and Batchprimer3 v1.0 [38], both based on the program primer3 [39], generally following the guidelines set out in Sanchez et al. 2006 [40]. Amplicon sizes were deliberately kept smaller than 90 bp in length to allow amplification of highly fragmented DNA as typical in forensics and ancient DNA studies. Given how short the flanking regions of each SNP were, which already constrained our selection of suitable SNPs, we could not consider potential polymorphic sites in these areas nor nuclear insertions, and relied on empirical testing of PCR primer efficiency. SBE primers were then ranked according to quality score and orientation (forward or reverse) for efficient use of fluorescent dyes and fragment length spacing. The latter was adjusted to 4 bp by adding poly-CT tails to the 5′end of each SBE primer (Table 1) [41].
Table 1

Table showing the details of multiplex primers and SBE probes used in the AmericaPlex26 assay.

PCR AmplificationSingle-Base Extension
SitehgPrimerPrimer Sequences (5′ to 3′)Conc (μM)length (bp)SBE Probe Sequences (5′ to 3′)Conc (μM)ntAlleles (Dyes) as detected
12007 A2L11984 CTAGTCACAGCCCTATACTCCCTCT 0.02573(ct)26cCCTCTACATATTTACCACAACACAATG0.01580G (blue), A (green)
H12009 TGTTAATGTGGTGGGTGAGTGAGC 0.025
3330 A2aL03326 ACATACCCATGGCCAACCTCCT 0.02260(ct)14cCCATGGCCAACCTCCTACT0.01548C (yellow), T (red)
H03339 GGAATGCCATTGCGATTAGAATGGGT 0.022
11365 A2bL11359 GCCAACAACTTAATATGACTAGCTTACACA 0.03380(ct)25CAACTTAATATGACTAGCTTACACAATAGC0.02080T (red), C (yellow)
H11381 GGGAGTCATAAGTGGAGTCCGTAAAGAGG 0.033
16265 A2b1L16262 ACTGCAACTCCAAAGCCACCCC 0.01156(ct)15ACTCCAAAGCCACCCCTC0.01548A (green), G (blue)
H16272 GGGTGGGTAGGTTTGTTGGTATCCT 0.011
11177 B2L11163 CCCACCTTGGCTATCATCACCCG 0.02563(ct)7cCCCGATGAGGCAACCAG0.02032T (red), C (yellow)
H11182 GTATGTGCCTGCGTTCAGGCGT 0.025
16483 B2aL16471 GCTCCGGGCCCATAACACTTGG 0.03070(ct)21ACCAGATGTCGGATACAGTTCA0.02064C (yellow), T (red)
H16494 ACCCTGAAGTAGGAACCAGATGTCGG 0.030
6755 B2bL06750 GTCTGAGCTATGATATCAATTGGCTTCC 0.03290 (ct)16TGGTGTGCTCACACGATAAA 0.01552C (yellow), T (red)
H06789 TGCTCGTGTGTCTACGTCTATTCC 0.032
7241 B2cL07224 TCCGGAATGCCCCGACGTTACT 0.02368(ct)9cTCGGACTACCCCGATGC0.01536A (green), G (blue)
H07243 ACAGATGATAGGATGTTTCATGTGGTGT 0.023
8875 B2dL08864 TCCCCTTATGAGCGGGCACAGT 0.02976(ct)18CGGGCACAGTGATTATAGGC0.01556T (red), C (yellow)
H08896 TGTGGTAAGAAGTGGGCTAGGGC 0.029
6119 B2eL06094 TCGTCACAGCCCATGCATTTGT 0.03176 (ct)22CCAAAGCCTCCGATTATGAT 0.01564G (blue), A (green)
H06133 AGTCAGTTGCCAAAGCCTCCGA 0.031
10535 B2fL10528 AGCATTTACCATCTCACTTCTAGGAATACT 0.03162 (ct)16GAGGATATGAGGTGTGAGCG 0.02052A (green), G (blue)
H10537 GTAGGGAGGATATGAGGTGTGAGC 0.031
4820 B4bL04816 GCCCCCTTTCACTTCTGAGTCCC 0.03356 (ct)11AGGGGTGCCTTGGGTAAC 0.02040C (yellow), T (red)
H04828 CCGGATGTCAGAGGGGTGCCTT 0.033
8913 X2aL08905 CGCTCTAAGATTAAAAATGCCCTAGCCC 0.00359(ct)26AATGCCCTAGCCCACTTCTT0.01572A (green), G (blue)
H08915 AGGGGTGTAGGTGTGCCTTGTG 0.003
14783 ML14774 ACCCCAATACGCAAAACTAACCCCC 0.03176(ct)26CGCAAAACTAACCCCCTAATAAAA0.02076T (red), C (yellow)
H14804 TGTTGGATGGGGTGGGGAGGTC 0.031
493 C1bL00474 TTTCCCCTCCCACTCCCATACT 0.03373ctcACTCCCATACTACTAATCTCATCAATACA0.02532A (green), G (blue)
H00511 TAGCAGCGGTGTGTGTGTGCTG 0.033
12978 C1c1aL12972 ACGCTAATCCAAGCCTCACCCCA 0.01865(ct)18CCAAGCCTCACCCCACTACT0.01556A (green), G (blue)
H12993 TTGGGCTGATTTGCCTGCTGCT 0.018
14356 C1c2L14348 ACCACAACCACCACCCCATCAT 0.01567 (ct)24GTAGGATTGGTGCTGTGGGT 0.01568G (blue), A (green)
H14371 TGGGGTTAGCGATGGAGGTAGGA 0.015
16051 C1dL16049 TCTTTCATGGGGAAGCAGATTTGGG 0.01962(ct)11GGGGAAGCAGATTTGGGT0.01540A (green), G (blue)
H16065 AGCGGTTGTTGATGGGTGAGTC 0.019
7697 C1d1L07684 TGATCACGCCCTCATAATCATTTTCCTT 0.03373 (ct)23GTTAGGAAAAGGGCATACAGGA 0.02068C (yellow), T (red)
H07705 TGTTGTGAGTGTTAGGAAAAGGGCA 0.033
14433 C4cL14431 GACCTCAACCCCTGACCCCCAT 0.01957(ct)10ACCCCTGACCCCCATG0.01536C (yellow), T (red)
H14443 ACTACAGCGATGGCTATTGAGGAG 0.019
2092 D1L02079 GCCCACAGAACCCTCTAAATCCCC 0.03375 (ct)26cAGCTGTTCCTCTTTGGACTAACA 0.02076G (blue), A (green)
H02106 TCCTAGTGTCCAAAGAGCTGTTCCT 0.033
9667 D2a1L09652 GGAGTATCAATCACCTGAGCTCACCA 0.03270 (ct)24cCTTGAATTATTTGGTTTCGGTTG 0.02072T (red), C (yellow)
H09670 GCAGTGCTTGAATTATTTGGTTTCGGT 0.032
4991 D2a2L04976 TCATAGCAGGCAGTTGAGGTGGA 0.03288(ct)20GGTGGATTAAACCAAACCCA0.01560G (blue), A (green)
H05007 TCCTATGTGGGTAATTGAGGAGTATGC 0.032
9181 D2bL09159 TCGCTGTCGCCTTAATCCAAGCC 0.03268 (ct)12cTTGTCGTGCAGGTAGAGGC 0.01544T (red), C (yellow)
H09183 TGTGTTGTCGTGCAGGTAGAGG 0.032
10181 D4b1L10177 AATCCACCCCTTACGAGTGCGG 0.00663 (ct)13GGCGGGGGATATAGGGTC 0.01544G (blue), A (green)
H10197 TTATGGAGAAAGGGACGCGGGC 0.006
6285 D4h3aL06282 GCCGGAGCAGGAACAGGTTGAA 0.03275 (ct)2°CCCTGCTAAGGGAGGGTAGA 0.02060C (yellow), T (red)
H06314 GTCTACGGAGGCTCCAGGGTGG 0.032

SBE probes in reverse direction are shown in Italics.

hg haplogroup; conc. concentration; bp base pairs; nt nucleotides.

SBE probes in reverse direction are shown in Italics. hg haplogroup; conc. concentration; bp base pairs; nt nucleotides.

Ethics statement

All necessary permits were obtained for the described study, which complied with all relevant regulations. Permissions to collect, export and analyze ancient Peruvian specimens from the Huaca Pucllana site were granted by the Ministry of Culture (the former National Institute of Cultural Heritage – INC) and the National Museum of Archaeology, Anthropology and History of Peru (MNAAHP) and are available on request (ACTA No 017-2010-ARMC-MNAAHP-MC and Resolución Viceministerial No. 120-2010-VMPCIC-MC). No specific permits were required for the modern control samples when solely used for methodological validation (waiver from The Human Research Ethics Committee (HREC) at the University of Adelaide). Swab samples were nevertheless collected using written informed consent.

Samples and DNA extractions

DNA from modern control samples (AC, GV) was extracted from cheek swabs using QIAamp DNA Mini Kit (Qiagen) a following the manufacturer's instructions. Ancient samples were collected by MIBR, BL and WH under DNA-free conditions at the Museo de Sitio Huaca Pucllana, Calle General Borgoño cuadra 8 s/n, Miraflores, Lima, Perú, where the samples are stored. Sample preparation, DNA extractions and PCR amplification from ancient samples were performed at the Australian Centre for Ancient DNA in Adelaide, Australia, applying established methods and authentication criteria as described previously [26], [42], [43]. In brief, we used an in-house silica extraction method, detailed in [26], to extract DNA from two independent samples per individual. PCR amplifications from each extract and direct sequencing of the HVR-I were performed using four overlapping primer pairs with reaction conditions described in [42], [44]. Details of the four primer pairs are given in (Table 2).

Multiplex PCR amplification

PCR amplifications were carried out in a final reaction volume of 12.5 μl consisting of 0.5 μL DNA sample (3 μL for ancient DNA), 1x PCR Gold Buffer, 6.5 mM MgCl2, 0.1 U AmpliTaq Gold DNA polymerase (all Applied Biosystems) 1.25 mM dNTP solution (Bioline Pty Ltd), (0.8 μg RSA for ancient DNA samples), and a primer mix consisting of 26 primer pairs, with concentrations given in Table 1. PCR was carried out on a Tetrad 2 Peltier Thermal Cycler (Bio-Rad Laboratories) using the following conditions: 95°C for 6 min and 30 cycles (45 cycles for ancient DNA samples) of 95°C for 30 s, 55°C for 30 s, 65°C for 30 s, and a final extension time at 65°C for 6 min. Amplification success was monitored via gel electrophoresis on an 3.5% agarose gel (100 V for 40 min; Hyperladder V DNA size ladder (Bioline Pty Ltd)). PCR products were purified by mixing 5 μl of PCR reaction with 1 U ExoSAP-IT (Thermo Fisher Scientific Australia Pty Ltd), followed by incubation at 37°C for 50 min, 80°C for 15 min and 15°C for 10 min. Single Base Extension reactions consisted of a final volume of 5 μL containing 1 μL PCR product, 2.5 μL SNaPshot ready reaction mix (Applied Biosystems), and 0.5 μL extension primer mix (individual concentrations are given in Table 1). Thermocycling of the SBE reactions was performed in a Tetrad 2 Peltier Thermal Cycler (Bio-Rad Laboratories) with the following conditions: 96°C for 10 s; followed by 35 cycles of 55°C for 5 s and 60°C for 30 s. SBE products were purified by adding 1 U Shrimp Alkaline Phosphatase (Thermo Fisher Scientific) to the reaction solution and incubating it at 37°C for 50 min, 80°C for 15 min and 15°C for 10 min. Capillary electrophoresis was performed on a 3130 xl Genetic Analyser (Applied Biosystems) using POP-6 polymer and a customised run module, by adding 1 μL sample DNA to 18.5 μL Hi-Di Formamide and 0.5 μL GeneScan-120 LIZ internal size standard (Applied Biosystems). Electropherograms were analysed using the software Genemapper ID version 3.2.1 software (Applied Biosystems) applying custom panel and bin settings available on request.

Sensitivity tests

We performed sensitivity studies using serial dilutions of 1, 1∶10, 1∶100, 1∶1000 and 1∶10,000 of DNA from a buccal swab sample from a lab member (AC). The mtDNA copy number of the modern sample was determined through qPCR using the SYBR-Green kit (Qiagen), targeting a short 77 bp fragment of human mitochondrial DNA with primer pair L13258 and H13295 [45]. Serial dilutions were treated as separate samples, and each sample was analysed in triplicate. The qPCR reaction was performed in a total reaction volume of 10 μL consisting of 1 μL of each sample dilution, 2x Brilliant SYBR Green Master Mix and 0.1 μM of each primer. The qPCR were carried out on a Rotor-Gene Q Real-Time PCR cycler (Qiagen) with thermocycling conditions as follows: 95°C for 5 min, followed by 45 cycles of 95°C for 10 s, 58°C for 20 s and 72°C for 20 s.

Results and Discussion

Optimization of the multiplex protocol

The AmericaPlex26 assay was initially tested with default concentrations of 0.017 μM for each primer (3 μL of 25 μM stock) and 0.015 μM for each SBE primer (3 μL of 50 μM stock) to assess the generic efficiency of primers or probes when used in the multiplex assay. Twenty-two out of 26 SNP sites could be readily amplified, albeit with highly variable peak heights across the assay. Primers and probes for the four problematic SNP sites were each tested in singleplex PCR and SNaPshot reactions to ensure they performed individually as expected. If the SNP fragment were successfully amplified in the singleplex PCR, concentrations of the primer would be doubled in the following multiplex PCR reaction mix. We chose 3000 relative fluorescence units (rfu) as a default average peak height based on the ancestral allele status observed in our European modern control sample (AC), and calculated the percentage difference between peaks and the 3000 rfu average. Multiplex primer concentrations were adjusted according to this percentage difference to allow amplification of problematic SNP sites and to balance the peak heights of those that did amplify. Based on poor performance of the primer pair chosen to amplify the C1b SNP site (A493G), we performed a second round of balancing primer concentration with a new primer pair for this site. To further refine the balance in peak height, the concentration of some SBE extension primers was adjusted to the final recommended concentrations given in Table 1. Changes to probe concentrations resulted in a more balanced electropherogram and amplification of all 26 SNP sites using modern buccal swab and ancient DNA samples (Figure 2).
Figure 2

Electropherograms of two ancient examples representing the optimized AmericaPlex26.

Panel A shows a South American sample and panel B an ancient European sample illustrating the ancestral state of all 26-haplogroups B2a (16483), B2b (6755), B2c (7241), B2d (8875), B2e (6119) and B2f (10535).

Electropherograms of two ancient examples representing the optimized AmericaPlex26.

Panel A shows a South American sample and panel B an ancient European sample illustrating the ancestral state of all 26-haplogroups B2a (16483), B2b (6755), B2c (7241), B2d (8875), B2e (6119) and B2f (10535).

Sensitivity studies

The amount of mitochondrial DNA was measured for a modern sample (1,171,699 copies/μL) and four serial dilutions of 1∶10, 1∶100, 1∶1000 and 1∶10,000 using real-time quantitative PCR (Figure 3). A near complete SNP profile could be observed for serial dilutions up to 28,278 copies/μL DNA (1∶100), which is similar to other published multiplex assays [23], [24], [46].
Figure 3

Electropherograms from the sensitivity tests showing four serial dilutions of template DNA with a starting copy number of 1,171,699 copies/μL.

Note the increasing number of locus dropout as template DNA concentration decreases.

Electropherograms from the sensitivity tests showing four serial dilutions of template DNA with a starting copy number of 1,171,699 copies/μL.

Note the increasing number of locus dropout as template DNA concentration decreases.

Method application on ancient samples

The AmericaPlex26 assay was tested on ancient samples from three successive pre-Columbian cultures from the Huaca Pucllana archaeological site in Lima, Peru. They included samples from the Early Intermediate (n = 20; 200–600 AD), the Middle Horizon (n = 20; 600–1000 AD) and the Late Intermediate (n = 12; 1000–1476 AD) [47] plus an Early Medieval European samples as control for the ancestral state. Samples from each period varied in the state of preservation, due to differences in mortuary customs. From our test dataset of 52 samples in total, we were able to unambiguously type 29 samples (56%) (Table 3, Figure 4). A typing result was considered reliable when two samples from the same individual could be unambiguously assigned to the same sub-hg in two independent experiments.
Table 3

Direct comparison of results for HVR-I sequencing and AmericaPlex26 SNP typing assay for samples unambiguously typed using the AmericaPlex26 assay.

IndividualMuseum no.SamplesHVR I hgAmericaPlex26Consensus
EI 1A06 95/96 10802A ?B2B2
10803A ?B2
EI 2A06 79/01 Ind2 10804A D1D1?D1
10805A D1M
EI 3A06 76/96 10809A C1C1bC1b
10810A C1C1b
EI 4A15 06/00 10813A B4B2B2
10814A B4B2
EI 5A06 01/02 10817A C1C1bC1b
10818A ?C1b
EI 6A6 68/96 10821A B4B2B2
10822A B4B2
EI7A06 90 Ind1 10787A ?B2B2
10788A ?B2
EI 8A20 05/09 10789A B4B2bB2b
10790A ?B2b
EI 9A20 03/07 Ind1 10791A A2A2A2
10792A A2A2
EI 10A06 82/96 10793A B4B2B2
10794A B4B2
EI 11A06 77/96 10796A B4B2bB2
10797A ?B2
EI 12A06 79/96 Ind1 10800A B4B2bB2
10801A B4B2
MH 1A20 08/08 Ind2 10733A ?C1bC1b
10734A ?C1b
MH 2A20 CF003/09 10741A ?B2B2
10742A ?B2
MH 3A20 07/08 Ind4 10749A ?B2B2
10750A ?B2
MH 4A20 05/08 10753A ?B2B2
10754A B4?B2
MH 5A20 18/08 10765A -C1bC1b
10766A -C1b
MH 6A20 01/09 Ind1 10772A C1C1bC1b
10773A ?C1b
MH 7A20 01/09 Ind2 10774A ?A2A2
10775A -A2
MH 8A20 04/07 Ind2 10778A D?A2A2
10779A -A2
LI 1A15 01/02 10709A C1MC1
10710A C1C1b
LI 2A0 cf14 ind-1/98 10712A B4B2B2
10713A B4B2
LI 3A01 CF16/98 10715A ?B2B2
10716A -B2
LI 4A0 CF15/01 10717A C1MM
10718A ?M
LI 5A15 CF36/01 10719A B4B2B2
10720A B4B2b
LI 6A15 Sin Contexto 10724A C1C1bC1b
10725A C1C1b
LI 7A3 CF01/04 10726A ?B2bB2
10727A ?B2
LI 8A0 08/98 10728A B4B2B2
10729A B4B2
LI 9A15 02/02 10730A B4B2B2
10730Y B4B2b
LI 10A0 56/97 10731A C1C1bC1b
10731Y C1C1b

Consensus haplogroups were called based on last common SNP from both replicates from independent extractions, and minimum peak size >50 rfu. (?)/(−): Insufficient or no sequence information; EI: Early Intermediate; MH: Middle Horizon; LI: Late Intermediate.

Figure 4

Genotyping success of the AmericaPlex26 assay compared to standard HVR-I sequencing via four overlapping amplicons.

The success rate is given in percentage of unambiguous genotype calls for each of the two methods per cultural horizon and all results combined (black bars, p<0.0001).

Genotyping success of the AmericaPlex26 assay compared to standard HVR-I sequencing via four overlapping amplicons.

The success rate is given in percentage of unambiguous genotype calls for each of the two methods per cultural horizon and all results combined (black bars, p<0.0001). Consensus haplogroups were called based on last common SNP from both replicates from independent extractions, and minimum peak size >50 rfu. (?)/(−): Insufficient or no sequence information; EI: Early Intermediate; MH: Middle Horizon; LI: Late Intermediate. We subsequently compared the AmericaPlex26 assay results to our previous attempts at amplifying and sequencing the mitochondrial HVR-I with four overlapping primer pairs and found that the AmericaPlex26 assay improved the typing efficiency from ancient samples (Table 3, Figure 4). For example, the AmericaPlex26 assay allowed reliable SNP typing for eleven Early Intermediate (55%) and ten Late Intermediate samples (83%), whereas HVR-I sequencing gave reliable sequence haplotypes for seven (35%) and seven (58%) samples, respectively. For example, HVR-I sequencing for samples 10802A and 10803A failed, while the AmericaPlex26 assay revealed specific hg B2 (Table 3). Samples from the Middle Horizon culture were in general less well preserved, resulting in eight consensus sub-hg calls (40%) using the AmericaPlex26 assay, whereas HVR-I sequencing did not produce any reliable sequence haplotype from the sample replicates (0%). This highlights the genotyping power of our assay when dealing with challenging samples. Importantly, SNP typing with the AmericaPlex26 assay also gave a higher resolution compared to traditional HVR-I sequencing. For example, samples 10809A and 10810A of the Early Intermediate period were assigned to the major hg C1 by HVR-I sequencing, yet the AmericaPlex26 assay allowed further resolution to sub-hg C1b. In addition, while many of the HVR-I results from Late Intermediate samples remained tentative, i.e. non-reproducible, the AmericaPlex26 assay provided reliable and specific sub-hgs for both replicates (Table 3, Figure 4). Taking all results together, the AmericaPlex26 assay showed a significantly higher success rate when compared to the standard HVR-I sequencing (p<0.0001, Wilcoxon matched-pairs signed rank test; Figure 4). This is likely due to the difference in amplicon sizes between the two methods.

The overall effectiveness of the assay

Overall, the effectiveness of the multiplex SNaPshot method in analysing ancient DNA lies in the fact that it only requires minimal flanking regions either side of the SNP, which in theory allows the design of very short overall amplicon sizes [40], [48], 56–90 bp in our case. This makes it suitable for the extremely fragmented and damaged state of ancient DNA, while the multiplex approach maximises the amount of information that can be gained per PCR [15], [49], [50]. As such, SNaPshot typing is able to generate results for samples for which traditional sequencing methods often fail with ancient and/or degraded DNA, as they require longer fragment lengths to be cost-effective [51]. Multiplexing also allows the combination of many informative SNP sites into one reaction, which are otherwise spread across longer sequence regions. On its own, the multiplex PCR and SNaPshot method is time- and cost-effective, and requires substantially smaller amounts of valuable DNA extract compared to HVR-I sequencing, as fewer individual reactions are needed from preparation of the multiplex PCR to capillary electrophoresis [48], [50]. We show that the AmericaPlex26 can be used to complement or expand upon standard mtDNA sequencing approaches for ancient Native American populations, and especially for ancient samples where DNA preservation does not allow amplification of longer (>100 bp or more) DNA molecules. It efficiently and economically targets characteristic SNPs from the coding region of mtDNA [15] in order to corroborate HVR-I sequencing results and to define a particular sub-hg [14]. Alternatively, the AmericaPlex26 can be used in addition to global mtDNA SNP multiplexes, such as the GenoCore22 and others [23], [24]. Moreover, it is flexible enough to add newly discovered SNPs/lineages in order to enhance sub-regional resolution (see e.g. [32], [35]. In our experience, the new method provided an extremely useful one-reaction test to screen larger numbers of degraded samples allowing the assessment of the general state of preservation, the authenticity of the result (i.e. absence of potential contaminating lineages), while at the same time allowing a categorisation of potentially interesting sub-hgs. We are currently using this approach in order to further dissect the phylogenetic resolution via DNA library creation and targeted mtDNA enrichment and Next Generation Sequencing [25], [26].

Conclusions

We present a powerful, optimized SNP assay, which allows unambiguous typing of Native American mtDNA ‘founder lineages’ and additional SNPs for further resolution. This short-amplicon AmericaPlex26 assay is highly efficient, time and cost-effective compared to classical HVR-I sequencing, and allows highly resolved SNP typing of degraded DNA samples in forensic and ancient DNA work. It is suitable as a qualitative ‘screening’ method to identify samples with sufficient DNA preservation, free of contaminants that complicate full mitochondrial sequencing (and beyond) via Next Generation Sequencing techniques.
Table 2

Details of primers used for standard HVR-I amplification and sequencing.

PrimerPrimer Sequences 5′ to 3′Length in bp (incl./excl. primer)Reference
L16055 GAAGCAGATTTGGGTACCAC 126 (87) [54]
H16142 ATGTACTACAGGTGGTCAAG [55]
L16117 TACATTACTGCCAGCCACCAT 162 (115) [42]
H16233 GCTTTGGAGTTGCAGTTGATGTGT [42]
L16209 CCCCATGCTTACAAGCAAGT 179 (138) [54]
H16348 ATGGGGACGAGAAGGGATTTG [42]
L16287 CACTAGGATACCAACAAACC 162 (122) [54]
H16410 GCGGGATATTGATTTCACGG [54]
  45 in total

1.  Native American mitochondrial DNA analysis indicates that the Amerind and the Nadene populations were founded by two independent migrations.

Authors:  A Torroni; T G Schurr; C C Yang; E J Szathmary; R C Williams; M S Schanfield; G A Troup; W C Knowler; D N Lawrence; K M Weiss
Journal:  Genetics       Date:  1992-01       Impact factor: 4.562

2.  The initial peopling of the Americas: a growing number of founding mitochondrial genomes from Beringia.

Authors:  Ugo A Perego; Norman Angerhofer; Maria Pala; Anna Olivieri; Hovirag Lancioni; Baharak Hooshiar Kashani; Valeria Carossa; Jayne E Ekins; Alberto Gómez-Carballa; Gabriela Huber; Bettina Zimmermann; Daniel Corach; Nora Babudri; Fausto Panara; Natalie M Myres; Walther Parson; Ornella Semino; Antonio Salas; Scott R Woodward; Alessandro Achilli; Antonio Torroni
Journal:  Genome Res       Date:  2010-06-29       Impact factor: 9.043

3.  A single multiplex PCR and SNaPshot minisequencing reaction of 42 SNPs to classify admixture populations into mitochondrial DNA haplogroups.

Authors:  Greiciane G Paneto; Stephan Köhnemann; Joyce A Martins; Regina M B Cicarelli; Heidi Pfeiffer
Journal:  Mitochondrion       Date:  2010-12-21       Impact factor: 4.160

4.  Subtyping mtDNA haplogroup H by SNaPshot minisequencing and its application in forensic individual identification.

Authors:  P Grignani; G Peloso; A Achilli; C Turchi; A Tagliabracci; M Alù; G Beduschi; U Ricci; L Giunti; C Robino; S Gino; C Previderè
Journal:  Int J Legal Med       Date:  2005-12-07       Impact factor: 2.686

5.  Reconciling migration models to the Americas with the variation of North American native mitogenomes.

Authors:  Alessandro Achilli; Ugo A Perego; Hovirag Lancioni; Anna Olivieri; Francesca Gandini; Baharak Hooshiar Kashani; Vincenza Battaglia; Viola Grugni; Norman Angerhofer; Mary P Rogers; Rene J Herrera; Scott R Woodward; Damian Labuda; David Glenn Smith; Jerome S Cybulski; Ornella Semino; Ripan S Malhi; Antonio Torroni
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-12       Impact factor: 11.205

6.  Brief communication: mitochondrial haplotype C4c confirmed as a founding genome in the Americas.

Authors:  Ripan S Malhi; Jerome S Cybulski; Raul Y Tito; Jesse Johnson; Harold Harry; Carrie Dan
Journal:  Am J Phys Anthropol       Date:  2010-03       Impact factor: 2.868

7.  The retrieval of ancient human DNA sequences.

Authors:  O Handt; M Krings; R H Ward; S Pääbo
Journal:  Am J Hum Genet       Date:  1996-08       Impact factor: 11.025

8.  Complete mitochondrial DNA genome sequences from the first New Zealanders.

Authors:  Michael Knapp; K Ann Horsburgh; Stefan Prost; Jo-Ann Stanton; Hallie R Buckley; Richard K Walter; Elizabeth A Matisoo-Smith
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-22       Impact factor: 11.205

9.  Rapid coastal spread of First Americans: novel insights from South America's Southern Cone mitochondrial genomes.

Authors:  Martin Bodner; Ugo A Perego; Gabriela Huber; Liane Fendt; Alexander W Röck; Bettina Zimmermann; Anna Olivieri; Alberto Gómez-Carballa; Hovirag Lancioni; Norman Angerhofer; Maria Cecilia Bobillo; Daniel Corach; Scott R Woodward; Antonio Salas; Alessandro Achilli; Antonio Torroni; Hans-Jürgen Bandelt; Walther Parson
Journal:  Genome Res       Date:  2012-02-14       Impact factor: 9.043

10.  Ancient DNA reveals prehistoric gene-flow from siberia in the complex human population history of North East Europe.

Authors:  Clio Der Sarkissian; Oleg Balanovsky; Guido Brandt; Valery Khartanovich; Alexandra Buzhilova; Sergey Koshel; Valery Zaporozhchenko; Detlef Gronenborn; Vyacheslav Moiseyev; Eugen Kolpakov; Vladimir Shumkin; Kurt W Alt; Elena Balanovska; Alan Cooper; Wolfgang Haak
Journal:  PLoS Genet       Date:  2013-02-14       Impact factor: 5.917

View more
  7 in total

Review 1.  Forensically relevant SNaPshot® assays for human DNA SNP analysis: a review.

Authors:  Bhavik Mehta; Runa Daniel; Chris Phillips; Dennis McNevin
Journal:  Int J Legal Med       Date:  2016-11-14       Impact factor: 2.686

2.  Ancient DNA Analysis Suggests Negligible Impact of the Wari Empire Expansion in Peru's Central Coast during the Middle Horizon.

Authors:  Guido Valverde; María Inés Barreto Romero; Isabel Flores Espinoza; Alan Cooper; Lars Fehren-Schmitz; Bastien Llamas; Wolfgang Haak
Journal:  PLoS One       Date:  2016-06-01       Impact factor: 3.240

3.  Multiplex APLP System for High-Resolution Haplogrouping of Extremely Degraded East-Asian Mitochondrial DNAs.

Authors:  Tsuneo Kakuda; Hideki Shojo; Mayumi Tanaka; Phrabhakaran Nambiar; Kiyoshi Minaguchi; Kazuo Umetsu; Noboru Adachi
Journal:  PLoS One       Date:  2016-06-29       Impact factor: 3.240

4.  A mass sacrifice of children and camelids at the Huanchaquito-Las Llamas site, Moche Valley, Peru.

Authors:  Gabriel Prieto; John W Verano; Nicolas Goepfert; Douglas Kennett; Jeffrey Quilter; Steven LeBlanc; Lars Fehren-Schmitz; Jannine Forst; Mellisa Lund; Brittany Dement; Elise Dufour; Olivier Tombret; Melina Calmon; Davette Gadison; Khrystyne Tschinkel
Journal:  PLoS One       Date:  2019-03-06       Impact factor: 3.240

5.  Interaction between mitochondrial NADH dehydrogenase subunit-2 5178 C > A and clinical risk factors on the susceptibility of essential hypertension in Chinese population.

Authors:  Xi Chen; Xiang-Yu He; Chao Zhu; Yusong Zhang; Zongbin Li; Yuqi Liu; Yuxiao Zhang; Tong Yin; Yang Li
Journal:  BMC Med Genet       Date:  2019-07-05       Impact factor: 2.103

6.  Ancient Human Genomes and Environmental DNA from the Cement Attaching 2,000-Year-Old Head Lice Nits.

Authors:  Mikkel W Pedersen; Catia Antunes; Binia De Cahsan; J Víctor Moreno-Mayar; Martin Sikora; Lasse Vinner; Darren Mann; Pavel B Klimov; Stuart Black; Catalina Teresa Michieli; Henk R Braig; M Alejandra Perotti
Journal:  Mol Biol Evol       Date:  2022-02-03       Impact factor: 16.240

7.  Ancient DNA reveals selection acting on genes associated with hypoxia response in pre-Columbian Peruvian Highlanders in the last 8500 years.

Authors:  Lars Fehren-Schmitz; Lea Georges
Journal:  Sci Rep       Date:  2016-03-21       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.