| Literature DB >> 24614536 |
Thirsa Kraaijenbrink1, Kristiaan J van der Gaag1, Sofia B Zuniga1, Yali Xue2, Denise R Carvalho-Silva2, Chris Tyler-Smith2, Mark A Jobling3, Emma J Parkin3, Bing Su4, Hong Shi4, Chun-Jie Xiao5, Wen-Ru Tang5, V K Kashyap6, R Trivedi6, T Sitalaximi6, Jheelam Banerjee6, Nirmal M Tuladhar7, Jean-Robert M L Opgenort8, George L van Driem8, Guido Barbujani9, Peter de Knijff1.
Abstract
The greater Himalayan region demarcates two of the most prominent linguistic phyla in Asia: Tibeto-Burman and Indo-European. Previous genetic surveys, mainly using Y-chromosome polymorphisms and/or mitochondrial DNA polymorphisms suggested a substantially reduced geneflow between populations belonging to these two phyla. These studies, however, have mainly focussed on populations residing far to the north and/or south of this mountain range, and have not been able to study geneflow patterns within the greater Himalayan region itself. We now report a detailed, linguistically informed, genetic survey of Tibeto-Burman and Indo-European speakers from the Himalayan countries Nepal and Bhutan based on autosomal microsatellite markers and compare these populations with surrounding regions. The genetic differentiation between populations within the Himalayas seems to be much higher than between populations in the neighbouring countries. We also observe a remarkable genetic differentiation between the Tibeto-Burman speaking populations on the one hand and Indo-European speaking populations on the other, suggesting that language and geography have played an equally large role in defining the genetic composition of present-day populations within the Himalayas.Entities:
Mesh:
Year: 2014 PMID: 24614536 PMCID: PMC3948894 DOI: 10.1371/journal.pone.0091534
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Group names and numbers of samples collected in Nepal and Bhutan.
| Group/Pool | Country | Linguistic phylum and -cluster | n males | n females | Coordinates used in spatial analyses | Code |
|
|
|
|
|
|
|
|
| Bahun (Brahmin) | Nepal | IE, Eastern Pahādī | 25 | 8 | 29,1667 N/81,1667 E | BHU |
| Barām | Nepal | TB, Newaric | 32 | 6 | 28,0667 N/84,6667 E | BAR |
| Black Mountain Mönpa | Bhutan | TB, East Bodish | 40 | 18 | 27,2167 N/90,2167 E | MON |
| Bodo | N-India | TB, Brahmaputran | 37 | 2 | 26,6667 N/90,3333 E | BOD |
| Brokkat | Bhutan | TB, Central/South Bodish | 24 | 5 | 27,7333 N/90,4333 E | KAT |
| Brokpa (Bj'op) | Bhutan | TB, Central/South Bodish | 40 | 10 | 27,4 N/91,7167 E | BRP |
| Bumthang | Bhutan | TB, East Bodish | 50 | 10 | 27,6667 N/90,55 E | BUM |
|
|
|
|
|
|
|
|
| Chali | Bhutan | TB, East Bodish | 50 | 11 | 27,3833 N/91,0167 E | CHL |
| Chantyal | Nepal | TB, Tamangic | 21 | 2 | 28,4 N/83,3667 E | CHN |
| Chepang (Praja) | Nepal | TB, Magaric | 20 | 7 | 27,5833 N/84,7 E | CHP |
| Chetri (Kshetriya) | Nepal | IE, Eastern Pahādī | 37 | 10 | 29,1667 N/81,2 E | CHE |
| Dakpa (Dwagspo) | Bhutan | TB, East Bodish | 49 | 10 | 27,4667 N/91,5167 E | DAK |
|
|
|
|
|
|
|
|
| Dhimal | Nepal | TB, Dhimalish | 20 | 2 | 26,5 N/87,7 E | DHI |
| Dura | Nepal | TB | 27 | 8 | 28,2833 N/84,2 E | DUR |
| Dzala | Bhutan | TB, East Bodish | 51 | 11 | 27,9 N/91,15 E | DZA |
|
|
|
|
|
|
|
|
| Ghale | Nepal | TB, Tamangic | 17 | 8 | 28,2833 N/84,7333 E | GHL |
| Gongduk | Bhutan | TB | 46 | 10 | 27,0833 N/90,9333 E | GNG |
| Gurung | Nepal | TB, Tamangic | 40 | 6 | 28,3 N/84,1167 E | GUR |
| High caste Newar | Nepal | TB, Newaric | 24 | 6 | 27,6167 N/85,4333 E | HCN |
| Kham (Magar) | Nepal | TB, Magaric | 13 | 1 | 28,5 N/83 E | KHM |
| Khengpa | Bhutan | TB, East Bodish | 52 | 10 | 27,1333 N/90,6833 E | KHG |
| Kumal | Nepal | IE, Eastern Pahādī, IA with suspected TB substrate | 21 | 5 | 28,05 N/84,45 E | KUM |
| Kurtöp | Bhutan | TB, East Bodish | 51 | 13 | 27,8167 N/90,8167 E | KUR |
| Lakha | Bhutan | TB, Central/South Bodish | 50 | 10 | 27,6833 N/90,15 E | LAK |
| Layap | Bhutan | TB, Central/South Bodish | 25 | 5 | 28,0667 N/89,6833 E | LAY |
| Lhokpu (Lhop, Doya) | Bhutan | TB | 39 | 8 | 26,95 N/89,1167 E | LHP |
|
|
|
|
|
|
|
|
| Magar | Nepal | TB, Magaric | 40 | 6 | 28,0833 N/83,8333 E | MGR |
| Majhi (Bote) | Nepal | IE, Eastern Pahādī, IA with suspected TB substrate | 21 | 6 | 27,8333 N/83,6667 E | MAJ |
| Mangde ('Nyenkha, Henke) | Bhutan | TB, East Bodish | 54 | 10 | 27,4167 N/90,2167 E | MNG |
| Newar | Nepal | TB, Newaric | 44 | 10 | 27,6167 N/85,4 E | NWR |
| 'Ngalop (Dzongkha) | Bhutan | TB, Central/South Bodish | 50 | 10 | 27,5333 N/89,4833 E | NGA |
| Nup (Trongsap) | Bhutan | TB, East Bodish | 27 | 10 | 27,5833 N/90,3333 E | NUP |
| Sherpa (Solu-Khumbu) | Nepal | TB, Central/South Bodish | 20 | 5 | 27,7333 N/86,5833 E | SHE |
| Tamang | Nepal | TB, Tamangic | 41 | 9 | 27,8833 N/85,4167 E | TMG |
| Thakali | Nepal | TB, Tamangic | 20 | 9 | 28,8167 N/83,75 E | THK |
| Thangmi | Nepal | TB, Newaric | 16 | 2 | 27,75 N/86 E | THG |
| Tharu | Nepal | IE, Maithili & Bhojpuri, IA with suspected TB substrate | 28 | 7 | 27,4167 N/83,3333 E | THR |
| Toto | N-India | TB, Dhimalish | 54 | 16 | 26,6667 N/89 E | TOT |
| Tshangla (Shâchop) | Bhutan | TB | 50 | 11 | 27,1833 N/91,3167 E | TSH |
|
|
|
|
|
|
|
|
Names of pools are shown in bold print, and names of groups in normal print. Alternative group-names are shown between brackets. For information about the populations included in the pools, see table S2.
Classification according to van Driem [1]; TB: Tibeto-Burman, IE: Indo-European, IA: Indo-Aryan. For more detailed cluster data, see figure S1.
Codes used in figures to indicate the populations.
Figure 1Distribution of ethnolinguistic groups/pools sampled in Nepal and Bhutan.
Blue dots indicate the approximate geographical centres of the Indo-European groups/pools and red dots indicate the approximate geographical centres of the Tibeto-Burman groups/pools. Populations from Nepal: 1 = Kham, 2 = Chantyal, 3 = Thakali, 4 = Magar, 5 = Dura, 6 = Gurung, 7 = Ghale, 8 = Barām, 9 = Chepang, 10 = Tamang, 11 = Newar, 12 = High caste Newar, 13 = Thangmi, 14 = Sherpa, 15 = Western Kiranti (pool), 16 = Central Kiranti (pool), 17 = Eastern Kiranti (pool), 18 = Limbu (pool), 19 = Dhimal, 20 = Artisanal caste Indo-Aryan (pool), 21 = Bahun, 22 = Chetri, 23 = Tharu, 24 = Majhi, 25 = Kumal, 26 = Danuwar & Kachadiya Danuwar (pool). Populations from Bhutan and India: 27 = Toto (India), 28 = Lhokpu, 29 = Layap, 30 = 'Ngalop, 31 = Lakha, 32 = Mangde, 33 = Black Mountain Mönpa, 34 = Nup, 35 = Bodo (India), 36 = Brokkat, 37 = Bumthang, 38 = Khengpa, 39 = Kurtöp, 40 = Gongduk, 41 = Chali, 42 = Dzala, 43 = Tshangla, 44 = Dakpa, 45 = Brokpa. © John Benjamins Publishing Company, Amsterdam/Philadelphia.
Figure 2Results of significance tests for values.
values are grouped by linguistic phylum with each dot representing one population. The median per phylum is indicated by a red line, the boxes surrounding the medians indicate the 25th and 75th percentiles and the 5th and 95th percentiles are indicated as error-bars. Non-significant results of the Kruskal-Wallis/Dunn's multiple comparison test are indicated by lines with the text ns below the values and significant results are indicated by lines with p-values above the values. For the population abbreviations used in the figure, see tables 1 and S1.
Summary of Admixture analysis.
| Nepal TB | Nepal IE | Bhutan | India TB | |
|
| 0.0482±0.0554 | 0.2047±0.0544 | 0.1565±0.0229 | –0.4276±0.4911 |
|
| 0.6244±0.0490 | 0.1342±0.0424 | 0.5750±0.0181 | 1.1053±0.3807 |
|
| 0.1711±0.0374 | 0.4261±0.0562 | 0.0203±0.0209 | 0.8141±0.3815 |
|
| 0.1563±0.0000 | 0.2350±0.0000 | 0.2482±0.0000 | –0.4917±0.0000 |
IE: Indo-European, TB: Tibeto-Burman, DR: Dravidian, AL: Altaic.
Figure 3Results of STRUCTURE analyses.
The colours represent the proportion of inferred ancestry from K ancestral populations. 3A: Unsupervised run for K = 2. The distribution pattern of inferred ancestry seems to indicate the presence of subtle population substructures within the Himalayas, with the Indo-European speaking Himalayan populations clustering more closely to the Indo-European and Dravidian reference populations (predominantly light blue) and the Tibeto-Burman speaking Himalayan populations clustering more closely to the Tibeto-Burman and Altaic reference populations (predominantly brown). 3B: Unsupervised run for K>2. The clustering pattern as observed for K = 2 in the Himalayan populations is mostly lost in favour of the separation of the Lhokpu (LHP), Black Mountain Mönpa (MON) and Toto (TOT), although some differences can still be observed between the Tibeto-Burman and Indo-European populations (this figure shows the results for K = 4).
Figure 4MDS plot.
Populations have been colour coded according to linguistic phylum and geographical origin, as is explained within the figure. The population-codes used in this plot are explained in table 1 and table S1. The stress values for the dimensions used to construct this plot were 0.333475 and 0.216317 respectively.
Summary of Mantel tests in zt.
| Genetic distance | Linguistic distance | Geographical distance | |
| Genetic distance | - | rp = 0.144210 (p = 0.000010) ri = 0.049613 (p = 0.000100) | rp = 0.134967 (p = 0.012230) ri = 0.009021 (p = 0.030597) |
| Linguistic distance | rp = 0.115407 (p = 0.000010) ri = 0.049190 (p = 0.000100) | - | rp = 0.247616 (p = 0.000010) ri = 0.302360 (p = 0.000100) |
| Geographical distance | rp = 0.103531 (p = 0.045580) ri = –0.006282 (p = 0.102290) | rp = 0.232692 (p = 0.000010) ri = 0.302297 (p = 0.000100) | - |
The results of the simple Mantel tests are shown above the diagonal; the results of the partial Mantel tests are shown below the diagonal. rp: correlation coefficient for Mantel test on population level with 100 thousand randomisations, ri: correlation coefficient for Mantel test on individual level with 10 thousand randomisations. Corresponding p-values are shown between brackets.