| Literature DB >> 33182715 |
Gennady V Khvorykh1, Oleh A Mulyar2, Larisa Fedorova2, Andrey V Khrunin1, Svetlana A Limborska1, Alexei Fedorov2,3.
Abstract
We performed an exhaustive pairwise comparison of whole-genome sequences of 3120 individuals, representing 232 populations from all continents and seven prehistoric people including archaic and modern humans. In order to reveal an intricate picture of worldwide human genetic relatedness, 65 million very rare single nucleotide polymorphic (SNP) alleles have been bioinformatically processed. The number and size of shared identical-by-descent (IBD) genomic fragments for every pair of 3127 individuals have been revealed. Over 17 million shared IBD fragments have been described. Our approach allowed detection of very short IBD fragments (<20 kb) that trace common ancestors who lived up to 200,000 years ago. We detected nine distinct geographical regions within which individuals had strong genetic relatedness, but with negligible relatedness between the populations of these regions. The regions, comprising nine unique genetic components for mankind, are the following: East and West Africa, Northern Europe, Arctica, East Asia, Oceania, South Asia, Middle East, and South America. The level of admixture in every studied population has been apportioned among these nine genetic components. Genetically, long-term neighboring populations are strikingly similar to each other in spite of any political, religious, and cultural differences. The topmost admixture has been observed at the center of Eurasia. These admixed populations (including Uyghurs, Azerbaijanis, Uzbeks, and Iranians) have roughly equal genetic contributions from the Middle East, Europe, China, and India, with additional significant traces from Africa and Arctic. The entire picture of relatedness of all the studied populations unfolds and presents itself in the form of shared number/size of IBDs.Entities:
Keywords: computational biology; genealogy; genetic variation; genomics; inheritance; polymorphism
Year: 2020 PMID: 33182715 PMCID: PMC7696950 DOI: 10.3390/biology9110392
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Average number of shared identical-by-descent (IBD) segments between populations.
| Identifiers | Populations | Studied Populations from Americas | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cac | Cha | CLM | Col | Kar | May | Mix | MXL | PEL | Pia | Pim | PUR | Que | Sur | Wic | Zap | ||
| Cac_AMR | Cachi_Argentina |
| 4.5 | 1.3 | 59.3 | 3.8 | 6.7 | 6.0 | 2.7 | 4.3 | 8.3 | 5.4 | 1.1 | 30.3 | 9.0 | 9.2 | 5.2 |
| Cha_AMR | S_Chane:Argentina | 5.3 |
| 0.8 | 9.1 | 13.7 | 12.3 | 11.8 | 2.8 | 4.8 | 24.8 | 6.8 | 0.6 | 9.0 | 19.8 | 80.8 | 10.6 |
| CLM_AMR | Colombians_Medellin_Colombia | 1.2 | 0.7 |
| 1.0 | 0.9 | 0.8 | 1.8 | 12.0 | 8.3 | 1.2 | 0.9 | 12.7 | 1.2 | 0.8 | 0.6 | 1.9 |
| Col_AMR | Colla_Argentina | 60.6 | 7.8 | 1.0 |
| 6.2 | 6.2 | 6.2 | 2.4 | 4.3 | 6.7 | 6.7 | 0.6 | 31.6 | 6.2 | 10.1 | 6.1 |
| Kar_AMR | B_Karitiana:Brazil | 4.4 | 14.2 | 1.1 | 6.2 |
| 9.5 | 8.2 | 2.5 | 4.3 | 16.3 | 4.9 | 0.5 | 7.3 | 66.9 | 6.7 | 7.1 |
| May_AMR | S_Mayan:Mexico | 7.1 | 11.5 | 1.0 | 6.6 | 9.8 |
| 19.4 | 4.9 | 5.9 | 13.2 | 17.2 | 0.9 | 11.8 | 11.2 | 5.9 | 20.9 |
| Mix_AMR | B_Mixe:Mexico | 6.5 | 12.4 | 1.9 | 6.5 | 8.3 | 20.2 |
| 6.2 | 5.2 | 10.9 | 17.2 | 1.5 | 8.7 | 7.9 | 6.7 | 41.1 |
| MXL_AMR | Mexican_Ancestry_Los_Angeles | 2.2 | 1.9 | 11.9 | 2.1 | 1.9 | 3.5 | 4.1 |
| 11.2 | 2.4 | 3.8 | 10.7 | 1.8 | 2.2 | 2.0 | 5.9 |
| PEL_AMR | Peruvians_Lima_Peru | 2.5 | 2.6 | 8.0 | 2.6 | 3.2 | 4.0 | 3.5 | 10.5 |
| 3.1 | 3.4 | 6.9 | 5.0 | 2.4 | 2.2 | 4.5 |
| Pia_AMR | S_Piapoco:Colombia | 9.2 | 25.6 | 1.4 | 6.7 | 16.0 | 12.8 | 10.9 | 3.4 | 5.4 |
| 7.8 | 0.5 | 12.4 | 20.6 | 5.3 | 9.9 |
| Pim_AMR | S_Pima:Mexico | 5.4 | 8.3 | 1.1 | 7.3 | 5.1 | 18.3 | 17.3 | 6.2 | 5.3 | 8.9 |
| 0.3 | 9.8 | 8.5 | 7.1 | 17.1 |
| PUR_AMR | Puerto_Ricans_from_Puerto_Rico | 1.1 | 0.4 | 12.8 | 0.6 | 0.4 | 0.8 | 1.4 | 10.9 | 7.1 | 0.5 | 0.3 |
| 0.8 | 0.5 | 0.4 | 1.2 |
| Que_AMR | S_Quechua:Peru | 32.3 | 9.6 | 1.3 | 33.4 | 6.9 | 12.0 | 8.1 | 2.5 | 9.1 | 11.9 | 8.6 | 0.9 |
| 9.1 | 5.9 | 9.2 |
| Sur_AMR | S_Surui:Brazil | 9.9 | 21.5 | 0.8 | 6.2 | 65.9 | 11.2 | 8.2 | 2.8 | 4.2 | 21.0 | 8.9 | 0.6 | 8.8 |
| 7.0 | 9.5 |
| Wic_AMR | Wichi_Argentina | 9.6 | 80.3 | 0.6 | 10.2 | 6.3 | 5.7 | 6.2 | 2.7 | 3.5 | 5.3 | 6.9 | 0.4 | 5.9 | 6.8 |
| 10.2 |
| Zap_AMR | S_Zapotec:Mexico | 5.4 | 9.0 | 2.2 | 6.1 | 7.1 | 20.9 | 39.6 | 8.0 | 6.4 | 10.3 | 16.0 | 1.2 | 9.0 | 9.1 | 10.2 |
|
| Ale_ARC | S_Aleut:Russia | 1.0 | 0.8 | 2.7 | 1.4 | 3.2 | 4.8 | 2.3 | 3.3 | 2.4 | 3.7 | 4.7 | 2.3 | 2.5 | 4.1 | 2.0 | 2.5 |
| Chu_ARC | S_Chukchi:Russia | 0.8 | 0.7 | 1.3 | 0.6 | 1.1 | 1.6 | 1.7 | 1.6 | 1.1 | 0.6 | 2.2 | 1.3 | 0.6 | 0.8 | 0.8 | 0.9 |
| Esk_ARC | S_Eskimo_Sireniki:Russia | 1.3 | 2.7 | 0.4 | 1.1 | 1.8 | 2.2 | 2.5 | 1.5 | 1.6 | 2.2 | 5.5 | 0.4 | 1.8 | 1.9 | 1.8 | 3.1 |
| Ite_ARC | S_Itelman:Russia | 0.3 | 2.9 | 0.6 | 0.0 | 1.4 | 2.1 | 1.3 | 1.5 | 1.3 | 0.7 | 2.6 | 0.6 | 1.4 | 2.1 | 0.0 | 4.1 |
| Kor_ARC | Koryaks | 0.3 | 0.4 | 0.1 | 0.3 | 0.2 | 0.6 | 0.4 | 0.3 | 0.3 | 0.4 | 0.8 | 0.1 | 0.4 | 0.2 | 0.5 | 0.3 |
| Tli_ARC | S_Tlingit:Russia | 1.5 | 0.8 | 4.4 | 1.7 | 0.5 | 2.6 | 2.6 | 4.2 | 3.2 | 1.5 | 3.5 | 4.2 | 2.6 | 1.5 | 0.8 | 1.1 |
| Ulc_ARC | S_Ulchi:Russia | 0.6 | 0.7 | 0.3 | 0.7 | 0.4 | 1.7 | 0.4 | 0.9 | 1.0 | 0.0 | 0.9 | 0.2 | 0.2 | 0.7 | 0.2 | 1.7 |
| Alt_CAS | Altaian | 0.1 | 0.2 | 1.3 | 0.5 | 0.4 | 0.6 | 0.4 | 1.3 | 1.0 | 0.1 | 0.4 | 1.1 | 0.2 | 0.1 | 0.1 | 0.4 |
| Ish_CAS | Ishkasim_Tajikistan | 0.4 | 0.0 | 2.9 | 0.0 | 0.0 | 0.4 | 0.4 | 2.2 | 1.0 | 0.0 | 0.0 | 2.6 | 0.3 | 0.0 | 0.2 | 0.4 |
| Kal_CAS | Kalmyk | 0.4 | 1.8 | 1.1 | 0.5 | 1.8 | 0.0 | 0.0 | 1.1 | 0.9 | 0.0 | 0.0 | 1.1 | 0.6 | 0.0 | 0.0 | 0.9 |
| Kaz_CAS | Kazakhs_Kazakhstan | 0.3 | 0.0 | 1.9 | 0.3 | 0.2 | 0.0 | 0.2 | 1.4 | 0.8 | 0.3 | 0.0 | 1.7 | 0.0 | 0.0 | 0.2 | 0.3 |
| Kyr_CAS | S_Kyrgyz:Kyrgyzystan | 0.3 | 0.0 | 1.4 | 0.2 | 0.3 | 0.3 | 0.4 | 1.4 | 1.0 | 0.2 | 0.4 | 1.5 | 0.4 | 0.1 | 0.1 | 0.7 |
| Rus_CAS | Rushan-Vanch_Tajikistan | 0.2 | 0.0 | 3.4 | 0.2 | 0.0 | 0.0 | 0.6 | 2.6 | 2.3 | 0.4 | 0.7 | 3.3 | 0.6 | 0.0 | 0.2 | 0.0 |
| Shu_CAS | Shugnan_Tajikistan | 1.1 | 0.0 | 3.4 | 0.0 | 0.0 | 0.0 | 0.3 | 3.2 | 1.6 | 0.0 | 0.0 | 3.5 | 0.0 | 0.0 | 0.0 | 0.0 |
| Taj_CAS | Tajiks | 0.1 | 0.5 | 4.2 | 0.5 | 0.0 | 0.5 | 0.6 | 3.9 | 2.0 | 0.0 | 0.2 | 4.0 | 0.3 | 0.3 | 0.0 | 0.7 |
| TKm_CAS | Turkmens_Uzbekistan | 0.1 | 0.0 | 2.4 | 0.2 | 0.0 | 0.6 | 0.4 | 2.4 | 1.4 | 0.0 | 0.3 | 2.4 | 0.0 | 0.3 | 0.2 | 0.8 |
| Uyg_CAS | Uygurs_Kazakhstan | 0.2 | 0.0 | 2.5 | 0.1 | 0.0 | 0.0 | 0.4 | 2.1 | 1.3 | 0.0 | 0.4 | 2.4 | 0.3 | 0.2 | 0.1 | 0.5 |
| Uzb_CAS | Uzbek | 0.2 | 0.6 | 2.7 | 0.1 | 0.2 | 0.5 | 0.5 | 2.5 | 1.5 | 0.0 | 0.0 | 2.6 | 0.7 | 0.3 | 0.9 | 0.0 |
| Yag_CAS | Yaghnobi_Tajikistan | 0.0 | 0.0 | 3.1 | 1.3 | 0.0 | 1.6 | 0.9 | 3.0 | 1.8 | 0.0 | 0.0 | 3.7 | 0.5 | 0.0 | 0.0 | 0.0 |
This table represents an illustrative fragment of the complete version of 243 × 243 Supplementary Table S1. The number in the crossing for a row, presenting a population and a column, presenting population shows the average number of shared IBD fragments for a pair of individuals in which one person belongs to population while the other person from population . In a vast majority of cases, when and represent the same population, the number of shared IBD fragments between individuals of this population is the highest compared to the number of shared IBD fragments between this population and another one. In rare cases, a population in our study is represented by a sole individual (e.g., Chane from Argentina). Because the number of shared IBD fragments for the same person does not make biological sense, the table shows N/A value for intra-population data for single individual.
Figure 1Heat map for the Table S1. Blue color—no genetic relatedness between populations, red—the highest relatedness. The scale represents numbers of IBD fragments per pair of individuals from 0 (dark blue) to 300 (red). AFR—Africa; AMR—America; ARC—Arctic; CAS—Central Asia; CAU—Caucasus; EUR—Europe; EAS—East Asia; SAS—South Asia; OCE—Oceania and Australia; MDE—Middle East; SIB—Siberia; XXX—archaic and prehistoric modern humans described in the Section 5.
Example of ranking data for numbers of shared IBD fragments for Russian population (identifier: Rus_EUR).
| #IBDs | Pop ID | Population Name |
|---|---|---|
| 27.29 | Vep_EUR | Vepsas_Russia |
| 23.92 | Kar_EUR | Karelians |
| 23.39 | Rus_EUR | S_Russian:Russia |
| 23.11 | Est_EUR | S_Estonian:Estonia |
| 23.00 | Lat_EUR | Latvians |
| 22.86 | Fin_EUR | S_Finnish:Finland |
| 22.37 | Ing_EUR | Ingrians_Russia_North |
| 21.43 | Kom_EUR | Komis_Russia |
| 21.15 | Lit_EUR | Lithuanians |
| 20.04 | FIN_EUR | Finnish_in_Finland |
| 19.58 | Bel_EUR | Belarusians |
| 19.37 | Pol_EUR | S_Polish:Poland |
| 19.36 | Saa_EUR | S_Saami:Finland |
| 18.91 | Ukr_EUR | Ukrainians |
| 18.20 | Tli_ARC | S_Tlingit:Russia |
| 17.81 | Cos_EUR | Cossacks_Ukraine |
| 17.79 | Mor_EUR | Mordvins_Russia |
| 17.08 | Nor_EUR | S_Norwegian:Norway |
| 16.03 | Hun_EUR | S_Hungarian:Hungary |
| 15.88 | Kry_EUR | Kryashen-Tatars_Russia |
| 15.19 | Swe_EUR | Swedes_Sweden |
| … | … | … |
| 5.52 | Tur_MDE | S_Turkish:Turkey |
| 5.42 | NOc_CAU | S_North_Ossetian:Russia |
| 5.30 | Chu_ARC | S_Chukchi:Russia |
| 5.20 | Haz_SAS | S_Hazara:Pakistan |
| 5.15 | Che_CAU | S_Chechen:Russia |
| 5.03 | Cir_CAU | Circassians_Russia_Caucaz |
| 4.92 | Kum_CAU | Kumyks_Russia |
| 4.92 | Sar_EUR | B_Sardinian:Italy |
| 4.79 | CLM_AMR | Colombians_Colombia |
| 4.78 | Sel_SIB | Selkups_Russia |
| 4.73 | Rom_EUR | Roma_Bosnia-Herzegovina |
| 4.71 | Tub_SIB | S_Tubalar:Russia |
| 4.64 | Taj_CAS | Tajiks |
| … | … | … |
| 0.03 | Dus_OCE | S_Dusun:Brunei |
| 0.03 | Igo_OCE | S_Igorot:Philippines |
| 0.01 | Luh_AFR | S_Luhya:Kenya |
| 0 | Con_AFR | Congo-pygmies_ |
| 0 | Esa_AFR | S_Esan:Nigeria |
| 0 | Cha_AMR | S_Chane:Argentina |
| 0 | Ami_EAS | S_Ami:Taiwan |
| 0 | Haw_OCE | S_Hawaiian:USA |
| 0 | Gon_SAS | Gond_India |
The first column shows the average numbers of shared IBD fragments (per pair of individuals) between Russian population with others. The second column shows population identifier, and the third—population name. Table 2 shows the first 21 rows with the highest numbers of IBDs, 12 rows from the middle of this table and 9 last rows with the least numbers of IBDs. The entire set of ranking tables for all populations are presented in the Supplementary File S1.
Nine Distinct Human Genetics Regions (DHGR), each characterized by three reference populations that share the least number of IBD fragments with the rest of the world.
|
| Unique Genetic Regions | 3 Reference Populations |
|---|---|---|
| AFE | East Africa | Luhya Webuye Kenya (LWK); Dinka (Sudan); Masai (Kenya) |
| AFW | West Africa | Yoruba (YRI); Esan Nigeria (ESN); Mende Sierra Leone (MSL) |
| AMR | America | Piapoco (Columbia); Wichi (Argentina); Karitiana (Brazil) |
| ARC | Arctica (N Eurasia + N. America) | Koryaks (Russia); Eskimo Sireniki (Russia); Chukchi (Russia) |
| EAS | East Asia | Han China South (CHS); Japanese Tokyo (JPT); Miao (China) |
| EUR | North Europe | Swedes (Sweden); Estonians (Estonia); Germans (Germany) |
| SAS | Hindustan Peninsula | Sri Lankans from UK (STU); Gujarati from Texas (GIH); Mala (India) |
| OCE | Oceania + Australia | Australians (Australia); Papuan (Papua New Guinea); Agta (Philippines) |
| MDE | Middle East | Arabs (Israel); Saudi-Arabians; Palestinian (Israel Central) |
Figure 2The graphical stacked-bar representation for the Supplementary Table S4. Each bar is a population. The color and the height correspond to the percentage of IBD shared with the corresponding geographical region. The colors denoting geographical regions are given at the top of the graph. The populations are grouped into 12 geographical regions which IDs are given at the bottom of the graph. The populations are ordered within each geographical region by the percentage of shared IBD fragments.
Figure 3Relative abundance (in percentages) of IBD fragments from prehistoric people in modern people from major geographic regions.
Median lengths of shared IBD fragments between populations from major geographical regions (in 1000 nucleotides).
| AFR | AMR | ARC | CAS | CAU | EAS | EUR | MDE | OCE | SAS | SIB | NEA | ANC | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 157 | 128 | 100 | 52 | 96 | 29 | 117 | 108 | 17 | 47 | 54 | 22 | 200 |
|
| 128 | 635 | 408 | 322 | 415 | 330 | 435 | 315 | 52 | 316 | 334 | 80 | N/A |
|
| 100 | 408 | 1253 | 536 | 475 | 398 | 609 | 409 | 180 | 384 | 683 | 60 | 531 |
|
| 52 | 322 | 536 | 641 | 492 | 504 | 522 | 422 | 288 | 457 | 668 | 62 | 412 |
|
| 96 | 415 | 475 | 492 | 704 | 428 | 491 | 469 | 44 | 404 | 518 | 57 | 487 |
|
| 29 | 330 | 398 | 504 | 428 | 558 | 395 | 354 | 39 | 392 | 466 | 63 | N/A |
|
| 117 | 435 | 609 | 522 | 491 | 395 | 670 | 443 | 61 | 453 | 580 | 58 | 585 |
|
| 108 | 315 | 409 | 422 | 469 | 354 | 443 | 473 | 25 | 372 | 407 | 57 | 465 |
|
| 17 | 52 | 180 | 288 | 44 | 39 | 61 | 25 | 482 | 54 | 254 | 77 | N/A |
|
| 47 | 316 | 384 | 457 | 404 | 392 | 453 | 372 | 54 | 478 | 440 | 63 | 403 |
|
| 54 | 334 | 683 | 668 | 518 | 466 | 580 | 407 | 254 | 440 | 996 | 62 | 692 |
|
| 22 | 80 | 60 | 62 | 57 | 63 | 58 | 57 | 77 | 63 | 62 | N/A | N/A |
|
| 200 | N/A | 531 | 412 | 487 | N/A | 585 | 465 | N/A | 403 | 692 | N/A | N/A |
Time (in 1000 years) for last common ancestors of shared IBD fragments between populations from major geographical regions.
| AFR | AMR | ARC | CAS | CAU | EAS | EUR | MDE | OCE | SAS | SIB | NEA | ANC | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 13.5 | 16.6 | 21.2 | 40.7 | 22.1 | 73.1 | 18.1 | 19.6 | 124 | 45.1 | 39.2 | 192 | 21.2 |
|
| 16.6 | 3.3 | 5.2 | 6.6 | 5.1 | 6.4 | 4.9 | 6.7 | 40.7 | 6.7 | 6.3 | 53.0 | N/A |
|
| 21.2 | 5.2 | 1.7 | 4.0 | 4.5 | 5.3 | 3.5 | 5.2 | 11.8 | 5.5 | 3.1 | 70.6 | 8.0 |
|
| 40.7 | 6.6 | 4.0 | 3.3 | 4.3 | 4.2 | 4.1 | 5.0 | 7.4 | 4.6 | 3.2 | 68.3 | 10.3 |
|
| 22.1 | 5.1 | 4.5 | 4.3 | 3.0 | 5.0 | 4.3 | 4.5 | 48.2 | 5.2 | 4.1 | 74.3 | 8.7 |
|
| 73.1 | 6.4 | 5.3 | 4.2 | 5.0 | 3.8 | 5.4 | 6.0 | 54.3 | 5.4 | 4.5 | 67.3 | N/A |
|
| 18.1 | 4.9 | 3.5 | 4.1 | 4.3 | 5.4 | 3.2 | 4.8 | 34.7 | 4.7 | 3.7 | 73.1 | 7.2 |
|
| 19.6 | 6.7 | 5.2 | 5.0 | 4.5 | 6.0 | 4.8 | 4.5 | 84.7 | 5.7 | 5.2 | 74.3 | 9.1 |
|
| 124 | 40.7 | 11.8 | 7.4 | 48.2 | 54.3 | 34.7 | 84.7 | 4.4 | 39.2 | 8.3 | 55.0 | N/A |
|
| 45.1 | 6.7 | 5.5 | 4.6 | 5.2 | 5.4 | 4.7 | 5.7 | 39.2 | 4.4 | 4.8 | 67.3 | 10.5 |
|
| 39.2 | 6.3 | 3.1 | 3.2 | 4.1 | 4.5 | 3.7 | 5.2 | 8.3 | 4.8 | 2.1 | 68.3 | 6.1 |
|
| 192 | 53.0 | 70.6 | 68.3 | 74.3 | 67.3 | 73.1 | 74.3 | 55.0 | 67.3 | 68.3 | N/A | N/A |
|
| 21.2 | N/A | 8.0 | 10.3 | 8.7 | N/A | 7.2 | 9.1 | N/A | 10.5 | 6.1 | N/A | N/A |
Figure 4Heat map for estimated age of IBD fragments which are shared between populations from major geographical regions. Black color represents N/A data.
Characteristics of ancient DNA samples from prehistoric people.
| Sample | Species | Site/Region | Dating (YA) | Coverage | Reference |
|---|---|---|---|---|---|
| Altai, Denisova5 |
| Denisova Cave, | 52,000 | 50 | [ |
| Chagyrskaya 8 |
| Chagyrskaya Cave | 60,000 | 27 | [ |
| Vindija 33.19 |
| Vindija Cave, | 45,000 | 30 | [ |
| Ustishim |
| Settlement of Ust’-Ishim in western Siberia | 45,000 | 40 | [ |
| Denisova |
| Denisova Cave, | 40,000 | 30 | [ |
| Stuttgart |
| Germany | 7000 | 19 | [ |
| Loschbour |
| Loschbour rock shelter, | 8000 | 22 | [ |
* In brackets the dates inferred using the DNA-based method are presented [55].