| Literature DB >> 31235771 |
A Urnikyte1, A Flores-Bello2, M Mondal3, A Molyte1, D Comas2, F Calafell2, E Bosch4, V Kučinskas5.
Abstract
The analysis of geographically specific regions and the characterization of fine-scale patterns of genetic diversity may facilitate a much better understanding of the microevolutionary processes affecting local human populations. Here we generated genome-wide high-density SNP genotype data in 425 individuals from six geographical regions in Lithuania and combined our dataset with available ancient and modern data to explore genetic population structure, ancestry components and signatures of natural positive selection in the Lithuanian population. Our results show that Lithuanians are a homogenous population, genetically differentiated from neighbouring populations but within the general expected European context. Moreover, we not only confirm that Lithuanians preserve one of the highest proportions of western, Scandinavian and eastern hunter-gather ancestry components found in European populations but also that of an steppe Early to Middle Bronze Age pastoralists, which together configure the genetic distinctiveness of the Lithuanian population. Finally, among the top signatures of positive selection detected in Lithuanians, we identified several candidate genes related with diet (PNLIP, PPARD), pigmentation (SLC24A5, TYRP1 and PPARD) and the immune response (BRD2, HLA-DOA, IL26 and IL22).Entities:
Mesh:
Year: 2019 PMID: 31235771 PMCID: PMC6591479 DOI: 10.1038/s41598-019-45746-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Map of Lithuanian ethnolinguistic groups. Six regions based on dialect are distinguished in Lithuania: three groups from Aukstaitija (west, south and east) and three groups from Zemaitija (north, west and south).
Figure 2Principal component analysis (PCA) of the six ethnolinguistic groups of the Lithuanian population. (A) Principal components 1 and 2 (after removing outliers) are shown. NZ, North Zemaitija; SZ, South Zemaitija; WZ, West Zemaitija; EA, East Aukstaitija; SA, South Aukstaitija; WA, West Aukstaitija. (B) Centroids of the PC scores by ethnolinguistic groups. Bars represent standard errors.
Weir and Cockerham F estimates between pairs of ethnolinguistic groups in Lithuania.
| Population | NZ | SZ | WZ | EA | WA | SA |
|---|---|---|---|---|---|---|
| NZ | 0 | |||||
| SZ | 0.001886 | 0 | ||||
| WZ | 0.003293 | 0.00333 | 0 | |||
| EA | 0.002236 | 0.002485 | 0.003764 | 0 | ||
| WA | 0.002006 | 0.00217 | 0.003641 | 0.001986 | 0 | |
| SA | 0.002738 | 0.002586 | 0.003884 | 0.002765 | 0.002201 | 0 |
NZ, North Zemaitija; SZ, South Zemaitija; WZ, West Zemaitija; EA, East Aukstaitija; SA, South Aukstaitija; WA, West Aukstaitija.
Figure 3Lithuania within Western Eurasia. (A) PCA based on present-day samples from West Eurasia and North Africa. (B) PCA based on present-day samples, with ancient samples projected. Present-day samples are shown as grey points, with the exception of Lithuanians (light green), Basques (dark green) and Sardinians (purple). E, Early; M, Middle; L, Late; HG, Hunter-gatherer; N, Neolithic; ChL, Chalcolithic; BA, Bronze Age; IA, Iron Age. Western HG, Scandinavian HG and Eastern HG are circled.
Figure 4Geographic distribution of outgroup f3-statistics. Outgroup f3 values for the top ancient ancestries detected in the Lithuanian population were plotted across contemporary populations in western Eurasia: (A) f3(Mbuti; X, WHG). (B) f3(Mbuti; X, EHG). (C) f3(Mbuti; X, SHG). (D) f3(Mbuti; X, Europe_EN). (E) f3(Mbuti; X, Steppe_EMBA), being X a modern population.
Candidate regions under recent positive selection in the Lithuanian population as detected by F and XP–EHH.
| Genome coordinates | Genes | Population (SNPsa) |
|---|---|---|
| chr1:35484861–35635453 |
| LT–YRI (3) |
| chr1:36549157–36562551 |
| LT–YRI (3) |
| chr1:231866461–231908347 |
| LT–CEU (3) |
| chr1:245363423–247401645 |
| LT–FIN (4) |
| chr2:56070352–56161538 |
| LT–CEU (4) |
| chr3:2347409–2355167 |
| LT–FIN (5) |
| chr3:112511598–112816192 |
| LT–FIN (5) |
| chr3:130096881–131445951 |
| LT–FIN (2), LT-YRI (31) |
| chr4:2389513–23901275 |
| LT-CEU (4) |
| chr4:106552414–106594625 |
| LT–YRI (3), FIN–YRI (3) |
| chr4:14879847–148916437 |
| LT-YRI (2), CEU-YRI (10) |
| chr4:89629000–89669852 |
| LT–FIN (4) |
| chr4:181637915–181691738 |
| LT–FIN (4) |
| chr5:10073956–10109791 |
| LT–CEU (4) |
| chr5:133177588–133638034 |
| LT–FIN (4) |
| chr5: 165752621–165758259 |
| LT–FIN (2) |
| chr6: 3205325–3207258 |
| LT–FIN (2) |
| chr6:11710405–11717934 |
| LT–FIN (4) |
| chr6:27811815–28096280 |
| LT–FIN (43) |
| chr6:32961621–32965942 |
| LT-CEU (3) |
| chr6:46596286–46834685 |
| LT–CEU (8) |
| chr7:21298242–21346861 |
| LT-CEU (2) |
| chr7:30276161–30415102 |
| LT–YRI (5) |
| chr7:112994204–113020044 |
| LT–FIN (4) |
| chr7:129187430–129350170 |
| LT–CEU (11) |
| chr9:12352971–12537279 |
| LT–CEU (29) |
| chr9:129633328–129646775 |
| LT–CEU (2) |
| chr10:116362835–116377962 |
| LT–FIN (4) |
| chr11:41897826–42307166 |
| LT–YRI (10) |
| chr11:60047410–60107105 |
| LT–CEU (3) |
| chr11:129457972–129458962 |
| LT–CEU (2) |
| chr11:129832270–130645506 |
| LT–YRI (10), CEU–YRI (7) |
| chr12:68640583–68654780 |
| LT–CEU (6) |
| chr12:103326468–103394540 |
| LT–CEU (5) |
| chr13:101364651–101375663 |
| LT-CEU (2) |
| chr15:48387088–48686175 |
| LT–YRI (8), CEU–YRI (16) |
| chr15:51639821–51673630 |
| LT–CEU (9) |
| chr15:58579956–58588116 |
| LT–CEU (4) |
| chr16:66962411–67028008 |
| LT–FIN (6) |
| chr17:4722606–4943176 |
| LT–FIN (15), CEU-FIN (11) |
| chr20:19702049–19706617 |
| LT–FIN (6), CEU–FIN (13) |
| chr21:21389233–21568418 |
| LT–CEU (8) |
aNumber of significant SNPs that were located at the extreme 0.1% of the empirical distribution for the XP–EHH and at least one SNP in the region had F value < 0.01.
Figure 5Shared signals of adaptation between Lithuanians and the CEU and FIN populations from the 1000 Genomes Project Phase3 dataset. Venn diagrams showing the overlap of candidate regions for recent and older events of positive selection as detected by the combined F and XP-EHH approach and the Tajima’s D statistic, respectively.
Candidate regions under positive selection in the Lithuanian population as detected by the Tajima’s D statistic.
| Genome coordinates | Windows | P-value | Genes | Shared signal |
|---|---|---|---|---|
| chr1:35818960–35948959 | 4 | 0.0008 |
| |
| chr1:49988960–50728959 | 6 | 0.0003 |
| |
| chr1:188788960–188968959 | 9 | 0.0003 |
| CEU |
| chr2:21728675–21888674 | 7 | 0.0004 |
| FIN |
| chr2:179468675–179648674 | 9 | 0.0003 |
| CEU |
| chr3:50343412–51893411 | 11 | 0.0006 |
| CEU |
| chr3:128763412–128903411 | 5 | 0.0006 |
| CEU, FIN |
| chr3:143543412–143683411 | 5 | 0.0001 |
| CEU, FIN |
| chr4:171930684–172360683 | 4 | 0.0002 |
| |
| chr4:176190684–176400683 | 11 | 0.0005 |
| FIN |
| chr5:50531164–50691163 | 6 | 0.0003 |
| |
| chr5:126311164–126441163 | 4 | 0.0005 |
| |
| chr6:35265879–35395878 | 4 | 0.0005 |
| FIN |
| chr6:97855879–98005878 | 6 | 0.0007 |
| |
| chr7:30280729–30470728 | 8 | 0.0006 |
| CEU |
| chr7:151730729–151870728 | 5 | 0.0002 |
| CEU, FIN |
| chr8:48621077–48801076 | 9 | 0.0007 |
| |
| chr8:93731077–93901076 | 8 | 0.0002 |
| CEU |
| chr9:38474202–38614201 | 5 | 0.0001 |
| FIN |
| chr9:125434202–125574201 | 5 | 0.0006 |
| |
| chr10:66065709–66215708 | 6 | 0.0002 |
| FIN |
| chr10:118195709–118325708 | 4 | 0.0008 |
| |
| chr11:71554229–71734228 | 9 | 0.0004 |
| |
| chr12:1296235–1426234 | 4 | 0.0009 |
| CEU, FIN |
| chr12:15736235–15886234 | 6 | 0.0003 |
| CEU, FIN |
| chr13:34108565–34278564 | 8 | 0.0005 |
| |
| chr14:64046743–64236742 | 10 | 0.0005 |
| |
| chr15:48357093–48477092 | 3 | 0.0005 |
| CEU, FIN |
| chr15:69617093–69737092 | 3 | 0.0002 |
| CEU |
| chr16:67321264–67501263 | 7 | 0.0005 |
| CEU |
| chr17:29252345–29392344 | 5 | 0.0002 |
| CEU, FIN |
| chr18:30389383–30559382 | 8 | 0.0001 |
| CEU |
| chr19:50580913–50700912 | 3 | 0.0005 |
| |
| chr20:58399095–58539094 | 5 | 0.0006 |
| FIN |
| chr21:44859932–44989931 | 3 | 0.0005 |
|