Muhammad Tariq, Habib Ahmad, Brian E Hemphill, Umar Farooq, Theodore G Schurr.
Abstract
Northwest Pakistan has served as a point of entry to South Asia for different populations since ancient times. However, relatively little is known about the population genetic history of the people residing within this region. To better understand human dispersal in the region within the broader history of the subcontinent, we analyzed mtDNA diversity in 659 individuals and Y-chromosome diversity in 678 individuals from five ethnic groups (Gujars, Jadoons, Syeds, Tanolis and Yousafzais) in Swabi and Buner Districts, Khyber Pakhtunkhwa Province, Pakistan. The mtDNAs of all individuals were subjected to control region sequencing and SNP genotyping, while Y-chromosomes were analyzed using 54 SNPs and 19 STR loci. The majority of the mtDNAs belonged to West Eurasian haplogroups, with the rest belonging to either South or East Asian lineages. Four of the five Pakistani populations (Gujars, Jadoons, Syeds, Yousafzais) possessed strong maternal genetic affinities with other Pakistani and Central Asian populations, whereas one (Tanolis) did not. Four haplogroups (R1a, R1b, O3, L) among the 11 Y-chromosome lineages observed among these five ethnic groups contributed substantially to their paternal genetic makeup. Gujars, Syeds and Yousafzais showed strong paternal genetic affinities with other Pakistani and Central Asian populations, whereas Jadoons and Tanolis had close affinities with Turkmen populations from Central Asia and ethnic groups from northeast India. We evaluate these genetic data in the context of historical and archeological evidence to test different hypotheses concerning their origins and biological relationships.
Year: 2022 PMID: 35046511 PMCID: PMC8770644 DOI: 10.1038/s41598-022-05076-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1. Geospatial map of mtDNA haplogroup frequencies in Khyber Pakhtunkhwa Province (KPP) ethnic groups and comparative populations. See the Methods section for a description of the mapping process, and Table S3 for the data on which the projection is based. The abbreviations in the figure panels are as follows: “MEA” indicates mtDNAs belonging to East Asian haplogroups deriving from macrohaplogroup M (C, D, G, M7-M9, Z), while “MSA” denotes those belonging to South Asian haplogroups derived from M (M2-M10, M12, M18, etc.). Similarly, “RSA” indicates mtDNAs derived from R haplogroups arising in South Asia (e.g., R2, R5, R6, R7, R9, R30, R31), “UWE” denotes mtDNAs from U haplogroups common to West Eurasian populations (U2e, U3-U5, U7, U8), and “USA” denotes mtDNAs from U haplogroups identified in South Asian populations (U*, U1, U2a-c).
Summary statistics for KPP ethnic groups and other Pakistani populations based on mtDNA HVS-1 haplotypes.
| Population | n | # | HD | ND | PD | RI | Tajima’s D | Fu’s FS |
|---|---|---|---|---|---|---|---|---|
| Gujars | 122 | 56 | 0.975 | 0.022 ± 0.012 | 5.71 ± 2.76 | 0.016 | −1.805 | −25.194 |
| Jadoons | 99 | 66 | 0.988 | 0.022 ± 0.012 | 5.67 ± 2.74 | 0.011 | −2.003 | −25.268 |
| Syeds | 127 | 93 | 0.992 | 0.023 ± 0.012 | 5.89 ± 2.83 | 0.009 | −2.094 | −25.117 |
| Tanolis | 134 | 59 | 0.970 | 0.022 ± 0.012 | 5.76 ± 2.78 | 0.012 | −1.664 | −25.147 |
| Yousafzais | 177 | 131 | 0.994 | 0.022 ± 0.012 | 5.61 ± 2.71 | 0.011 | −2.107 | −25.055 |
| Kashmiri | 317 | 211 | 0.993 | 0.023 ± 0.012 | 5.89 ± 2.82 | 0.011 | −2.117 | −24.675 |
| Makrani | 100 | 60 | 0.984 | 0.027 ± 0.014 | 7.01 ± 3.32 | 0.006 | −1.742 | −24.936 |
| Pathan | 230 | 153 | 0.992 | 0.022 ± 0.012 | 5.61 ± 2.70 | 0.010 | −2.225 | −24.934 |
| Saraiki | 85 | 47 | 0.957 | 0.023 ± 0.012 | 5.98 ± 2.88 | 0.013 | −1.637 | −25.238 |
| Sindhi | 115 | 81 | 0.992 | 0.024 ± 0.013 | 6.16 ± 2.95 | 0.013 | −1.579 | −25.069 |
| Balti | 49 | 32 | 0.979 | 0.020 ± 0.019 | 5.12 ± 2.53 | 0.016 | −1.772 | −22.570 |
| Bangash | 25 | 17 | 0.973 | 0.012 ± 0.011 | 5.10 ± 2.56 | 0.045 | −1.162 | −7.036 |
| Khattak | 25 | 14 | 0.932 | 0.021 ± 0.011 | 5.34 ± 2.66 | 0.073 | −0.413 | −2.867 |
| Mahsuds | 25 | 10 | 0.917 | 0.019 ± 0.011 | 4.96 ± 2.50 | 0.028 | −0.680 | −0.142 |
| Orakzai | 25 | 18 | 0.967 | 0.022 ± 0.012 | 5.61 ± 2.79 | 0.045 | −1.186 | −7.811 |
| Brahui | 38 | 22 | 0.952 | 0.018 ± 0.010 | 4.63 ± 2.32 | 0.032 | −1.619 | −10.643 |
| Hazara | 23 | 21 | 0.992 | 0.022 ± 0.012 | 5.76 ± 2.86 | 0.018 | −1.638 | −15.567 |
| Hunza | 44 | 32 | 0.980 | 0.024 ± 0.013 | 6.13 ± 2.97 | 0.016 | −1.912 | −21.877 |
| Kalash | 44 | 11 | 0.830 | 0.015 ± 0.009 | 3.86 ± 1.98 | 0.064 | −0.041 | −0.249 |
| Parsi | 44 | 20 | 0.943 | 0.018 ± 0.010 | 4.53 ± 2.27 | 0.058 | −1.501 | −6.759 |
n: number of samples; #: number of haplotypes; HD: haplotype diversity; ND: nucleotide diversity; PD: pairwise differences; RI: raggedness index. Citations and references for the comparative populations are provided in Table S9.
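The diversity columns in the table above can be illustrated with a short sketch. It assumes the standard estimators (Nei's unbiased haplotype diversity, the mean number of pairwise differences, and nucleotide diversity as pairwise differences per site). This record does not state which software or exact formulas produced the tabulated values, and the function names and toy sequences below are hypothetical.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity: n/(n-1) * (1 - sum(p_i^2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two samples")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def mean_pairwise_differences(sequences):
    """Average number of differing sites over all pairs of aligned sequences."""
    pairs = list(combinations(sequences, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)


def nucleotide_diversity(sequences):
    """Mean pairwise differences divided by the aligned sequence length."""
    return mean_pairwise_differences(sequences) / len(sequences[0])


# Hypothetical toy alignment (not data from the study): 4 samples, 3 haplotypes.
toy = ["ACGT", "ACGA", "ACGT", "TCGA"]
hd = haplotype_diversity(toy)
pd_mean = mean_pairwise_differences(toy)
nd = nucleotide_diversity(toy)
print(f"HD={hd:.3f} PD={pd_mean:.3f} ND={nd:.3f}")
```

With real data, each input sequence would be one individual's aligned HVS-1 haplotype, and the sample size n and haplotype count # in the table correspond to `len(sequences)` and `len(set(sequences))`.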
Figure 2. A Neighbor-Joining tree showing the genetic relationships between KPP ethnic groups and 77 world populations based on FST estimates from mtDNA HVS1 sequence data (Table S4).
AMOVA results for mtDNA HVS-1 sequences in KPP and comparative populations.
| Source of variation | d.f. | Sum of squares | Variance component | % of variance | Fixation Indices | P-value |
|---|---|---|---|---|---|---|
| No Groups | | | | | | |
| Among populations | 72 | 1782.704 | 0.15213 Va | 4.79 | FST: 0.04786 | 0.000 ± 0.000 |
| Within populations | 10,496 | 31,766.373 | 3.02652 Vb | 95.21 | ||
| Total | 10,568 | 33,549.077 | 3.17865 | |||
| Among groups | 6 | 865.080 | 0.08299 Va | 2.60 | FSC (Va): 0.026 | 0.000 ± 0.000 |
| Among populations within groups | 66 | 917.624 | 0.08129 Vb | 2.55 | FST (Vb): 0.051 | 0.000 ± 0.000 |
| Within populations | 10,496 | 31,766.373 | 3.02652 Vc | 94.85 | FCT (Vc): 0.026 | 0.000 ± 0.000 |
| Total | 10,568 | 33,549.077 | 3.19080 | |||
| Among groups | 15 | 977.099 | 0.08197 Va | 2.57 | FSC (Va): 0.025 | 0.000 ± 0.000 |
| Among populations within groups | 57 | 805.604 | 0.07913 Vb | 2.48 | FST (Vb): 0.051 | 0.000 ± 0.000 |
| Within populations | 10,496 | 31,766.373 | 3.02652 Vc | 94.95 | FCT (Vc): 0.026 | 0.000 ± 0.000 |
| Total | 10,568 | 33,549.077 | 3.18762 | |||
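The fixation indices in the table above can be recomputed from the variance components. The following is a minimal sketch, assuming the standard hierarchical AMOVA definitions (FCT = Va / total, FSC = Vb / (Vb + Vc), FST = (Va + Vb) / total). The function name is hypothetical, and the inputs are the variance components from the last block of the table (15 d.f. among groups).

```python
def amova_fixation_indices(va, vb, vc):
    """Return fixation indices and % of variance for a three-level AMOVA.

    va: variance among groups
    vb: variance among populations within groups
    vc: variance within populations
    """
    total = va + vb + vc
    return {
        "FCT": va / total,          # groups relative to total
        "FSC": vb / (vb + vc),      # populations relative to their group
        "FST": (va + vb) / total,   # populations relative to total
        "pct": [100 * v / total for v in (va, vb, vc)],
    }


# Variance components copied from the last block of the table above.
result = amova_fixation_indices(0.08197, 0.07913, 3.02652)
print({k: round(v, 3) for k, v in result.items() if k != "pct"})
print([round(p, 2) for p in result["pct"]])
```

Under these definitions the recomputed values agree with the tabulated FSC, FST and FCT after rounding, which suggests that the Fixation Indices column lists the three indices in a fixed order rather than pairing each index with the variance component on its row. For the "No Groups" block, which has only two components, FST reduces to Va / (Va + Vb).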
Figure 3. A geospatial map of NRY haplogroup frequencies in KPP ethnic groups and comparative populations. See the Methods section for a description of the mapping process, and Table S6 for the data on which the projection is based.
Summary statistics for KPP ethnic groups and other Pakistani populations based on Y-STR haplotypes.
| Populations | n | # | HD | PD |
|---|---|---|---|---|
| Gujars | 124 | 37 | 0.910 | 5.47 ± 2.65 |
| Jadoons | 114 | 36 | 0.796 | 3.07 ± 1.61 |
| Syeds | 129 | 31 | 0.794 | 3.05 ± 1.60 |
| Tanolis | 134 | 32 | 0.795 | 3.21 ± 1.67 |
| Yousafzais | 177 | 85 | 0.905 | 4.32 ± 2.15 |
| Yousafzais_Old | 146 | 90 | 0.966 | 5.07 ± 2.48 |
| Gujars_Swat | 20 | 10 | 0.758 | 5.29 ± 2.67 |
| Kohestani | 20 | 14 | 0.890 | 6.13 ± 3.04 |
| Tarklani | 20 | 9 | 0.837 | 2.65 ± 1.47 |
| Utmankhail | 20 | 6 | 0.684 | 2.27 ± 1.30 |
| Yousafzais_Swat | 20 | 10 | 0.800 | 3.43 ± 1.83 |
| Pathan_Pakistan | 270 | 152 | 0.973 | 5.61 ± 2.70 |
| Kashmiri | 101 | 68 | 0.981 | 6.38 ± 3.05 |
| Hazara_Pakistan | 153 | 73 | 0.910 | 4.15 ± 2.07 |
| Punjabi | 394 | 266 | 0.995 | 6.10 ± 2.91 |
| Sheikh | 180 | 100 | 0.984 | 6.15 ± 2.94 |
| Gujars_Punjab | 176 | 84 | 0.971 | 6.23 ± 2.97 |
| Baluch | 59 | 48 | 0.988 | 6.68 ± 3.20 |
| Brahui | 110 | 80 | 0.968 | 5.66 ± 2.73 |
| Burusho | 86 | 55 | 0.990 | 6.17 ± 2.96 |
| Kalash | 44 | 23 | 0.946 | 5.55 ± 2.72 |
| Makrani | 58 | 52 | 0.996 | 6.75 ± 3.23 |
| Parsi_Pak | 90 | 56 | 0.969 | 6.09 ± 2.93 |
| Sindhi | 122 | 97 | 0.990 | 5.97 ± 2.87 |
n: number of samples; #: number of haplotypes; HD: haplotype diversity; PD: pairwise differences. Citations and references for the comparative populations are provided in Table S10.
Figure 4. A Neighbor-Joining tree showing the genetic relationships between KPP ethnic groups and 82 world populations based on RST estimates from Y-STR haplotype data (Table S8).
AMOVA results for Y-STR haplotypes in KPP and comparative populations.
| Source of variation | d.f. | Sum of squares | Variance component | % of variance | Fixation Indices | P-value |
|---|---|---|---|---|---|---|
| No Groups | | | | | | |
| Among populations | 79 | 18,852.353 | 1.59547 Va | 18.23 | FST: 0.182 | 0.000 ± 0.000 |
| Within populations | 11,641 | 83,324.558 | 7.15785 Vb | 81.77 | ||
| Total | 11,720 | 102,176.911 | 8.75333 | |||
| Among groups | 9 | 6701.154 | 0.33529 Va | 3.81 | FSC (Va): 0.153 | 0.000 ± 0.000 |
| Among populations within groups | 70 | 12,151.198 | 1.29605 Vb | 14.75 | FST (Vb): 0.186 | 0.000 ± 0.000 |
| Within populations | 11,641 | 83,324.558 | 7.15785 Vc | 81.44 | FCT (Vc): 0.038 | 0.000 ± 0.000 |
| Total | 11,720 | 102,176.911 | 8.78919 | |||
| Among groups | 14 | 8879.462 | 0.44807 Va | 5.10 | FSC (Va): 0.141 | 0.000 ± 0.000 |
| Among populations within groups | 65 | 9972.890 | 1.17945 Vb | 13.43 | FST (Vb): 0.185 | 0.000 ± 0.000 |
| Within populations | 11,641 | 83,324.558 | 7.15785 Vc | 81.47 | FCT (Vc): 0.051 | 0.000 ± 0.000 |
| Total | 11,720 | 102,176.911 | 8.78537 | |||
Figure 5. A map of Pakistan showing the locations of fieldwork in the Khyber Pakhtunkhwa Province. DNA samples were collected from areas in which each ethnic group was highly concentrated. Gujars, Syeds and Yousafzais were sampled from both Buner and Swabi Districts, while the Jadoons and Tanolis were sampled only in the Swabi District. The map was created with ArcGIS software, v10.3.1, based on a source map from ESRI (https://www.esri.com/en-us/home).