| Literature DB >> 26561991 |
Alberto Gómez-Carballa1,2, Laura Catelli3, Jacobo Pardo-Seco1,2, Federico Martinón-Torres2,4, Lutz Roewer5, Carlos Vullo3, Antonio Salas1,2.
Abstract
In 1985, a frozen mummy was found in Cerro Aconcagua (Argentina). Archaeological studies identified the mummy as a seven-year-old Inca sacrifice victim who lived >500 years ago, at the time of the expansion of the Inca Empire towards the southern cone. The sequence of its entire mitogenome was obtained. After querying a large worldwide database of mitogenomes (>28,000) we found that the Inca haplotype belonged to a branch of haplogroup C1b (C1bi) that has not yet been identified in modern Native Americans. The expansion of C1b into the Americas, as estimated using 203 C1b mitogenomes, dates to the initial Paleoindian settlements (~18.3 thousand years ago [kya]); however, its internal variation differs between Mesoamerica and South America. By querying large databases of control region haplotypes (>150,000), we found only a few C1bi members in Peru and Bolivia (e.g. Aymaras), including one haplotype retrieved from ancient DNA of an individual belonging to the Wari Empire (Peruvian Andes). Overall, the results suggest that the profile of the mummy represents a very rare sub-clade that arose 14.3 (5-23.6) kya and could have been more frequent in the past. A Peruvian Inca origin for present-day C1bi haplotypes would satisfy both the genetic and paleo-anthropological findings.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26561991 PMCID: PMC4642457 DOI: 10.1038/srep16462
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The Aconcagua mummy.
The inset shows a picture of a portion of dissected lung from the mummy. A small piece of 350 mg was used for DNA extraction. The photo of the mummy has been taken from49 and it is reproduced here with the permission of the University of Cuyo Publisher (Argentina).
Variants found in the mitogenome of the Inca mummy using the rCRS17 as a reference.
| Variant | Substitution | Location | Aminoacidchange | Type |
|---|---|---|---|---|
| 56 (P) | A-T | HVS-2 | − | R14 |
| 57 (P) | T-C | HVS-2 | − | H15–T2b3a1–W5a1a1a |
| 60 + T (P) | − | HVS-2 | − | R0a2'3–R04–W5a1a1a |
| 64 (P) | C-T | HVS-2 | − | L0a2–M15–M27b–M74a–M52b1a–A8a–S3–X2n–R0a’b–H1b1g–H57–F1a1b–F2e1–K1a19a |
| 73 | A-G | HVS-2 | − | R0–R |
| 249d | − | HVS-2 | − | M8–CZ |
| 263 | A-G | HVS-2 | − | H2a2a–H2a2 |
| 290–291d | − | HVS-2 | − | C–C1 |
| 309 + C | − | HVS-2 | − | Hotspot |
| 315 + C | − | HVS-2 | − | Hotspot |
| 455 + T (P) | − | HVS-2 | − | L5a–M5b2b1–M7c1c3i–I1–N1b1a1–B4c1b1–U1a2 |
| 489 | T-C | HVS-2 | − | L3–M |
| 493 | A-G | HVS-2 | − | C1–C1b |
| 523–524d | − | HVS-2 | − | Hotspot |
| 662 (P) | T-C | 12 S | − | – |
| 750 | A-G | 12 S | − | H2a2–H2a |
| 1438 | A-G | 12 S | − | H2–H |
| 2563 (P) | T-C | 16 S | − | – |
| 2706 | A-G | 16 S | − | H2–HV |
| 3552 | T-A | ND1 | − | CZ–C |
| 4715 | A-G | ND2 | − | M–M8 |
| 4769 | A-G | ND2 | − | H2a–H2 |
| 5135 (P) | C-T | ND2 | − | N7b |
| 7028 | C-T | CO1 | − | H2–HV |
| 7196 | C-A | CO1 | − | M–M8 |
| 8584 | G-A | ATP6 | A-T | M–M8 |
| 8701 | A-G | ATP6 | T-A | N–L3 |
| 8725 (P) | A-G | ATP6 | T-A | L0d2c2 |
| 8860 | A-G | ATP6 | T-A | H2a2a–H2a2 |
| 9540 | T-C | CO3 | − | N–L3 |
| 9545 | A-G | CO3 | − | CZ–C |
| 10398 | A-G | ND3 | T-A | N–L3 |
| 10400 | C-T | ND3 | − | L3–M |
| 10873 | T-C | ND4 | − | N–L3 |
| 11719 | G-A | ND4 | − | R0–R |
| 11914 | G-A | ND4 | − | CZ–C |
| 12705 | C-T | ND5 | − | R–N |
| 13263 | A-G | ND5 | − | CZ–C |
| 14318 | T-C | ND6 | N-S | CZ–C |
| 14766 | C-T | CYTB | T-I | HV–R0 |
| 14783 | T-C | CYTB | − | L3–M |
| 15043 | G-A | CYTB | − | L3–M |
| 15301 | G-A | CYTB | − | N–L3 |
| 15326 | A-G | CYTB | T-A | H2a2a–H2a2 |
| 15487 | A-T | CYTB | − | M–M8 |
| 16124 (P) | T-C | HVS-1 | − | L3b–L3d–M3c1b1–C4b2–M69a–H2a1e1a–H5t–R9b1b |
| 16223 | C-T | HVS-1 | − | R–N |
| 16298 | T-C | HVS-1 | − | M–M8 |
| 16325 | T-C | HVS-1 | − | C–C1 |
| 16327 | C-T | HVS-1 | − | CZ–C |
| 16519 | T-C | HVS-1 | − | Hotspot |
Private variants within the C1b phylogeny are indicated as (P) in the first column. The “Type” column indicates whether this variant had been detected previously in other haplogroup contexts within the worldwide phylogeny (according to Phylotree Build 16), or whether it is a hotspot (thus present in a large number of different branches of the mtDNA phylogeny). Transition T2563C is not in the skeleton of Phylotree Build 16 but it appeared sporadically in some mitogenome (e.g. HQ713446 [haplogroup A4]).
Figure 2Skeleton of the global C1b phylogeny.
The C1b1i clade represented by the haplotype found in the Inca mummy is also located in the phylogeny. There is one mitogenome (JX413043) that belongs to haplogroup C1b13b sampled in a Spanish individual, although born in Talagante (Chile); therefore we labeled it here as originating in America. TMRCA are indicated above haplogroup labels. An asterisk to the right of the haplogroup labels identifies sub-clades that were newly identified in this study, compared to the last version of Phylotree (Build 16). The position of the revised Cambridge reference sequence (rCRS) is indicated for reading sequence motifs17. Mitochondrial DNA variants are indicated along the branches of the phylogenetic tree. Mutations are transitions unless a suffix A, C, G, or T indicates a transversion. Other suffixes indicate insertions (+), synonymous substitution (s), mutational changes in tRNA (−t), mutational changes in rRNA (−r), noncoding variant located in the mtDNA coding region (−nc) and an amino acid replacement (indicated in round brackets). Variants underlined represent recurrent mutations in this tree, while a prefix ‘@’ indicates a back mutation. Mutational hotspot variants at positions 16182, 16183, and 16519, as well as variation around position 310 and length or point heteroplasmies were not considered for the phylogenetic reconstruction. The numbers in small squares attached above haplogroup labels indicate the number of occurrences (mitogenomes) of the corresponding haplogroups available in the public domain (literature and/or GenBank); the color of the squares indicates their geographic origin according to the legend inset. More details on the geographic or ethnic origin of all the mitogenomes used in this network are provided in Supplemental Data Table S1. The bottom-right inset shows a network of HVS-I sequences that potentially belong to haplogroup C1bi (left) and a map of South America showing their geographic location (right). The map was built using a blank map based on GPS coordinates and the SAGA v. 2.1.1 (http://www.saga-gis.org/; see methods).
Haplogroup coalescence time estimates in kya for C1b and its sub-clades based on ML and the computation of the averaged distance (AD) of the haplotypes of a given clade to the respective root haplotype.
| AD | AD (only synonymous mutations) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 95% CI | TMRCA | 95% CI | ρ | σ | ρ/(n×σ2) | TMRCA | 95% CI | ρ | σ | ρ/(n×σ2) | |||
| C1b | 200 | 18.29 | 16.22−20.39 | 17.27 | 14.1–26.63 | 6.39 | 0.50 | 0.1 | 19.67 | 16.86–22.48 | 2.51 | 0.36 | 0.1 |
| C1b + 16311 | 18 | 17.10 | 14.39−19.84 | 18.85 | 12.55–25.35 | 6.94 | 1.14 | 0.3 | 21.02 | 15.28–26.77 | 2.67 | 0.73 | 0.3 |
| C1b7 | 7 | 13.30 | 8.65−18.06 | 11.39 | 5.93–17.03 | 4.29 | 1.03 | 0.6 | 11.26 | 5.75–16.78 | 1.43 | 0.70 | 0.4 |
| C1b7a | 4 | 6.92 | 2.20−11.77 | 7.89 | 2.38–13.59 | 3.00 | 1.06 | 0.7 | 1.97 | 0.00–3.94 | 0.25 | 0.25 | 1.0 |
| C1b10 | 7 | 9.44 | 6.60−12.33 | 12.97 | 5.35–20.09 | 4.86 | 1.43 | 0.3 | 14.64 | 7.61–21.68 | 1.86 | 0.89 | 0.3 |
| C1b1 | 5 | 11.42 | 7.56−15.37 | 11.16 | 4.08–18.53 | 4.20 | 1.34 | 0.5 | 20.5 | 12.01–28.99 | 2.60 | 1.08 | 0.4 |
| C1b11 | 9 | 13.43 | 10.65−16.25 | 13.68 | 6,16–21.5 | 5.11 | 1.41 | 0.3 | 14.02 | 8.07–19.96 | 1.78 | 0.75 | 0.4 |
| C1b12 | 5 | 6.29 | 2.91−9.75 | 5.22 | 1.37–9.16 | 2.00 | 0.75 | 0.7 | 4.73 | 1.2–8.26 | 0.60 | 0.45 | 0.6 |
| C1b13 | 22 | 11.81 | 8.56−15.12 | 11.48 | 7.99–15.05 | 4.32 | 0.65 | 0.5 | 12.18 | 8.57–15.8 | 1.55 | 0.46 | 0.3 |
| C1b13a | 5 | 10.59 | 7.36−13.88 | 15.61 | 8.2–23.31 | 5.80 | 1.37 | 0.6 | 12.61 | 6.71–18.51 | 1.60 | 0.75 | 0.6 |
| C1b13a1 | 3 | 8.62 | 5.30−12.00 | 15.23 | 7.82–22.94 | 5.67 | 1.37 | 1.0 | 13.14 | 7.26–19.02 | 1.67 | 0.75 | 1.0 |
| C1b13b | 3 | 8.24 | 4.21−12.37 | 9.7 | 3.89–15.7 | 3.67 | 1.11 | 1.0 | 10.51 | 5.26–15.77 | 1.33 | 0.67 | 1.0 |
| C1b13c | 6 | 6.92 | 3.16−10.77 | 6.55 | 1-76-11.49 | 2.50 | 0.93 | 0.5 | 6.57 | 1.15–11.99 | 0.83 | 0.69 | 0.3 |
| C1b13c1 | 4 | 4.37 | 1.62−7.18 | 4.55 | 0.72–8.49 | 1.75 | 0.75 | 0.8 | 1.97 | 0.00–3.94 | 0.25 | 0.25 | 1.0 |
| C1b13d | 2 | 4.50 | 0.00−9.65 | 3.9 | 0.00–8.42 | 1.50 | 0.87 | 1.0 | 3.94 | 0.00–7.88 | 0.50 | 0.50 | 1.0 |
| C1b13e | 4 | 6.11 | 2.01−10.31 | 5.22 | 1.58–8.94 | 2.00 | 0.71 | 1.0 | – | – | – | – | – |
| C1b14 | 3 | 12.46 | 7.71−17.33 | 14.3 | 7.14–21.74 | 5.33 | 1.33 | 1.0 | 10.51 | 5.26–15.77 | 1.33 | 0.67 | 1.0 |
| C1b2 | 32 | 2.11 | 1.28−2.94 | 12.18 | 2.23–22.71 | 4.57 | 1.89 | 0.0 | 17.8 | 6.94–28.65 | 2.26 | 1.38 | 0.0 |
| C1b3 | 3 | 15.12 | 10.05−20.33 | 14.3 | 7.14-21-74 | 5.33 | 1.33 | 1.0 | 5.26 | 1.54–8.97 | 0.67 | 0.47 | 1.0 |
| C1b4 | 10 | 1.56 | 0.27−2.87 | 1.55 | 0.12–2.99 | 0.60 | 0.28 | 0.8 | 1.58 | 0.46–2.69 | 0.20 | 0.14 | 1.0 |
| C1b5 | 8 | 12.00 | 7.14−17.00 | 11.64 | 5.67–17.81 | 4.37 | 1.12 | 0.4 | 15.77 | 10.02–21.51 | 2.00 | 0.73 | 0.5 |
| C1b5a | 4 | 6.79 | 2.56−11.14 | 6.55 | 2.46–10.75 | 2.50 | 0.79 | 1.0 | 7.88 | 3.94–11.83 | 1.00 | 0.50 | 1.0 |
| C1b5b | 4 | 7.99 | 4.81−11.23 | 8.57 | 2.32–15.05 | 3.25 | 1.20 | 0.6 | 15.77 | 8.39–23.14 | 2.00 | 0.94 | 0.6 |
| C1b6 | 3 | 1.93 | 0.00−4.81 | 1.72 | 0.00–4.13 | 0.67 | 0.47 | 1.0 | 2.63 | 0.00–5.26 | 0.33 | 0.33 | 1.0 |
| C1b8 | 5 | 16.44 | 12.85−20.09 | 22.47 | 11.88–33.59 | 8.20 | 1.91 | 0.4 | 20.5 | 11.72–29.28 | 2.60 | 1.11 | 0.4 |
| C1b8a | 3 | 4.68 | 0.97−8.48 | 7 | 1.04–13.18 | 2.67 | 1.15 | 0.7 | 7.88 | 2.01–13.76 | 1.00 | 0.75 | 0.6 |
| C1b9 | 7 | 4.43 | 1.68−7.24 | 5.98 | 0.99–11.12 | 2.29 | 0.97 | 0.3 | 4.51 | 2.25–6.76 | 0.57 | 0.29 | 1.0 |
We used all the mitogenomes from Table S1. Estimates were additionally obtained for the complete genomes and considering only the synonymous variants. The summand n refers to the complete mtDNA sequences considered in each clade. Computation of TMRCA of C1b13 includes one European individual of South American descent.
Figure 3BSP indicating the median of the hypothetical effective population size through time based on data from the C1b mitogenomes.
The maximum time is the median posterior estimate of the genealogy root-height.
Figure 4(A) ML phylogeny and TMRCA of the main C1b clades analyzed in the present study and the American chronology (LGM: Last Glacial Maximum). (B) Spatial-frequency distribution of haplogroup C1b. The map was built as indicated in Figure 2 and based on control region information. Note there are two main peaks of haplogroup C1b frequencies, one centered in Mexico and another one in Peru. In addition, there is a third peak in Puerto Rico (n = 23); this high frequency on the island projects over the North-East of South America (i.e. Venezuela and North of Brazil) where in reality C1b is virtually absent. The map was built using a blank map based on GPS coordinates and the SAGA v. 2.1.1 (see methods).
Haplotypes that are closely related to, or belong to, C1bi clade.
| Ref. | Population | Region/Archeological site | Country(Province) | Variants(from C1b root) | Sequencerange | |
|---|---|---|---|---|---|---|
| This study | 1 | Inca (AD 1480) | Cerro Aconcagua | Argentina (Mendoza) | 16124 | 16024–16569 |
| [ | 1 | Aymara | Andahuayalas | Peru (Apurimac) | 16124 16362 | 16024–16383 |
| [ | 1 | Post-Wari (AD 1000–1450) | Huari | Peru (Ayacucho) | 16124 16189 16222 16316 | 16011–16382 |
| [ | 3 | Aymara | − | Bolivia (La Paz) | 16124 16183C 16189 | 16024–16383 |
| [ | 1 | Bolivian | Pucarani | Bolivia (Cochabamba) | 16111 16124 | 16024–16569 |
Ref.: [1]: Barbieri et al.50. [2]: Kemp et al.15. [3]: Batai et al.51. [4]: Taboada-Echalar et al.52.