Literature DB >> 33638983

Improving Selection Detection with Population Branch Statistic on Admixed Populations.

Burak Yelmen^1,2, Davide Marnetto¹, Ludovica Molinaro^1,2, Rodrigo Flores¹, Mayukh Mondal¹, Luca Pagani^1,3.

Abstract

Detecting natural selection signals in admixed populations can be problematic since the source of the signal typically dates back prior to the admixture event. On one hand, it is now possible to study various source populations before a particular admixture thanks to the developments in ancient DNA (aDNA) in the last decade. However, aDNA availability is limited to certain geographical regions and the sample sizes and quality of the data might not be sufficient for selection analysis in many cases. In this study, we explore possible ways to improve detection of pre-admixture signals in admixed populations using a local ancestry inference approach. We used masked haplotypes for population branch statistic (PBS) and full haplotypes constructed following our approach from Yelmen et al. (2019) for cross-population extended haplotype homozygosity (XP-EHH), utilizing forward simulations to test the power of our analysis. The PBS results on simulated data showed that using masked haplotypes obtained from ancestry deconvolution instead of the admixed population might improve detection quality. On the other hand, XP-EHH results using the admixed population were better compared with the local ancestry method. We additionally report correlation for XP-EHH scores between source and admixed populations, suggesting that haplotype-based approaches must be used cautiously for recently admixed populations. Additionally, we performed PBS on real South Asian populations masked with local ancestry deconvolution and report here the first possible selection signals on the autochthonous South Asian component of contemporary South Asian populations.

Entities: Chemical

Keywords: PBS; South Asia; XP-EHH; admixed populations; local ancestry inference; natural selection

Mesh：

Year: 2021 PMID： 33638983 PMCID： PMC8046333 DOI： 10.1093/gbe/evab039

Source DB: PubMed Journal: Genome Biol Evol ISSN： 1759-6653 Impact factor: 3.416

Significance

Detecting natural selection in recently admixed populations can be difficult due to the obscurity of the source of selection signals. In this study, we used local ancestry inference methods to obtain ancestry assigned haplotypes out of simulated admixed populations and reported improvement in detecting selection signals before the admixture event using these haplotypes instead of the admixed ones. We additionally demonstrated that methods utilizing haplotype structure to detect selection must be used cautiously for recently admixed populations. Finally, we applied our approach to real South Asian genomes to report first possible selection signals for autochthonous South Asian populations.

Introduction

Investigating signatures of selection in admixed populations is challenging due to the fact that independent signals from source populations may be obscured by each other (Huerta-Sánchez et al. 2014; Galaverni et al. 2017; Pierron et al. 2018). Although recent advancements in ancient DNA (aDNA) grants us with the opportunity to independently study source populations prior to the admixture event, detecting selection signals in populations with admixture background is still widely an unexplored and challenging endeavor, especially in areas where aDNA preservation is limited. When conventional methods like population branch statistics (PBS; Yi et al. 2010) or cross-population extended haplotype homozygosity (XP-EHH; Sabeti et al. 2007) are utilized to detect possible signals on admixed populations, it is not easy to resolve whether the candidate signals are due to selection acting after the admixture event or before and in the latter case, on which source population the selection event took place. Here, we particularly concentrated on detecting selection signals before admixture. We postulate that applying local ancestry inference as a preliminary step and then searching for signatures of selection within each set of source haplotypes may greatly decrease false positives and may help assign the observed sweeps to the correct ancestral population. In a previous study, we used a local ancestry deconvolution-based approach to create surrogates of the two main ancestral components of contemporary South Asian human genomes to shed light into the demographic history of these highly admixed human populations (Yelmen et al. 2019) and to address patterns of selection after admixture. Here, we test our idea through forward simulations and investigate the possible use of these surrogates for frequency based (PBS) and haplotype based (XP-EHH) selection tests in humans or any other diploid organism.

Results

Comparison of PBS Scores Based on Forward Simulations

We used forward simulations to create an admixed human population by mixing available genomes from European (French) and East Asian (Han) individuals for 200 generations (assuming 30 years per generations, which will be around six thousand years ago). Notably, European-East Asian population splits may mimic the split between the North and South genomic components found within contemporary South Asian populations (see Yelmen et al. for a full description of South Asian demography). For this reason, we decided to respectively label “N” and “S” the European and East Asian ancestries retrieved from this simulation test, to keep consistency with the subsequent real case scenario applied on South Asian human genomes. Utilizing both PCAdmix (Brisbin et al. 2012) and ELAI (Guan 2014) and using German and Japanese genomes (two populations deemed to be genetically close to French and Han, respectively) as reference populations for local ancestry deconvolution, we formed MASK_S (German-assigned chunks masked out) and MASK_N (Japanese-assigned chunks masked out) populations from the simulated data. Then, we performed PBS on both (masked and naive) and compared with the results obtained from the original source populations (French and Han), which were used as a true dataset, set as our standard (see Materials and Methods). Overall SNP by SNP correlation showed significantly higher correlation between the PBS scores of original source populations and masked populations compared to the unmasked admixed populations (Supplementary figures S1 and S2). However, when we only analyzed SNPs with PBS scores above the 99th percentile (with SNPs selected based on Han and French source population PBS scores), the ELAI approach retained higher correlation whereas PCAdmix performed poorly (figure 1, Supplementary figure S2b). The poor performance of PCAdmix was due to false negatives, as can be seen from the low PBS values accumulating along the x axis. Additionally, both masking approaches performed better compared with the naive populations when we concentrated only on the top 50 scoring SNPs except for MASK_N_ELAI (Supplementary table S1a).

Fig. 1.

SNP by SNP PBS comparison for SNPs with PBS values above 99% threshold (SNPs selected based on Han and French source population scores) using Spearman’s correlation. (a) Han vs naive (correlation coefficient: 0.409, 95% confidence interval: 0.377–0.441, p-value <2.2e−16, n = 2,529), Han vs MASK_S_ELAI (correlation coefficient: 0.510, 95 percent confidence interval: 0.479–0.542, p-value <2.2e−16, n = 2,529), Han versus MASK_S_PCAdmix (correlation coefficient: 0.210, 95% confidence interval: 0.169–0.252, p-value <2.2e−16, n = 2,529). (b) French vs naive (correlation coefficient: 0.400, 95% confidence interval: 0.367–0.436, p-value <2.2e−16, n = 2,530), French versus MASK_N_ELAI (correlation coefficient: 0.452, 95% confidence interval: 0.422–0.483, p-value <2.2e−16, n = 2,530), French versus MASK_N_PCAdmix (correlation coefficient: 0.394, 95% confidence interval: 0.354–0.432, p-value <2.2e−16, n = 2,530). Aside from SNP by SNP comparison, we also compared window-based mean PBS scores as widely used in selection studies (Huerta-Sánchez et al. 2013). Overall, precision and true positive rate indicators were higher in masked populations compared with the naive population (table 1), showing that our approach helps retrieving a lower fraction of false signals compared with simply studying the admixed genomes. Although setting signal threshold to 99.9% reduced the hits significantly which, in return, made the comparison difficult (Supplementary table S2), 99.5% threshold results mostly showed improvement using masked populations.

Table 1

Comparison of Possible Selection Signals Based on Mean PBS Scores (Above 99.5% Noted as Positive) for 50 kb Windows with TP, FP, FN, TPR (Measuring the Fraction of Correctly identified positives), and FDR, Measuring Expected Fraction of FPs) Indicators for Each Tested Population (Admixed Naive along with PCAdmix and ELAI-Masked Populations, see Materials & Methods for details) Compared with True Source Population

	TP	FP	FN	TPR	FDR
Naive	25	39	29	0.46	0.61
MASK_S_ELAI	27	23	27	0.50	0.46
MASK_S_PCAdmix	25	18	29	0.46	0.42
Naive	50	33	60	0.45	0.40
MASK_N_ELAI	41	21	69	0.37	0.34
MASK_N_PCAdmix	68	38	42	0.62	0.36

Comparison of XP-EHH Scores Based on Forward Simulations

Since XP-EHH cannot be performed with missing chunks, masked populations could not be utilized in a straightforward manner. We, therefore, created ancestral random breeders (ARBs, see methods for more details), full genomes with no gaps made by combining together masked haplotypes from multiple individuals, as described in Yelmen et al. (2019), resulting in ARB_N and ARB_S individuals. This approach assigns ancestries to the chunks of admixed genomes and combines matching ancestries to create whole genomes, bypassing the lack of a proper imputation panel when either of the ancestry is no longer available in unadmixed form. Therefore, they can be seen as “Frankensteins” of stitched haplotypes of the same ancestry. Similar to the PBS analysis, we performed XP-EHH on ARB (filled in after local ancestry deconvolution) and naive (prior to any type of processing) genomes, and compared these with the results from French and Han source populations (see Materials and Methods). This time, ARB populations performed worse in comparison to admixed naive population for position by position XP-EHH scores (Supplementary fig. S3), although both ARB and naive populations performed very poorly when we checked top 50 scoring SNPs in comparison to the source populations (Supplementary table S1b).

PBS on Real South Asian Populations

Given that applying XP-EHH on admixed populations proved to be imprecise using either ARB or admixed genomes, we then resorted to PBS as our only viable approach to detect signatures of selection in a case study: contemporary South Asian human populations. Genomic composition of South Asians can be characterized in a broad perspective as an admixture of West Eurasian and South Asian components (Reich et al. 2009; Chaubey et al. 2011; Metspalu et al. 2011; Moorjani et al. 2013; Basu et al. 2016; Lazaridis et al. 2016; Damgaard et al. 2018; Pathak et al. 2018; Yelmen et al. 2019). Even though recent studies shed more light into this composition (Narasimhan et al. 2019), there is still no available unadmixed aDNA attributed to the South Asian component for a selection scan to be carried out. In our previous study, we checked for local admixture imbalance between ancestral components of South Asians and detected some possible selection signals pointing to selective pressures acting after the admixture event (Yelmen et al. 2019). In this study, we instead focused on events that took place prior to the admixture of the two genomic components, and reported the first time possible selection signals for the South Asian component of contemporary South Asians (fig. 2). We additionally report the genes present within 50-kb windows with possible positive signals (Supplementary table 3) and top 20 scoring SNPs with related genes (Supplementary table S4). We stress that the PBS analysis could also detect postadmixture selection, for example, by searching for overlap between the signals found on each ancestry independently; however, postadmixture imbalance (Yelmen et al. 2019) remains the strategy of choice given the short evolutionary time elapsed after the admixture event.

Fig. 2.

PBS on masked South Asian genomes (MASK_S) with the dashed line marking 99.9% threshold. Genes within 50-kb range of the highest peaks are annotated.

Discussion

In our study, we explored the possibility of improving selection tests on admixed populations using local ancestry deconvolution approaches. Following our previous work (Yelmen et al. 2019), we created surrogates of components from a simulated admixed population both in the form of masked and reconstructed genomes. Knowing the true source populations which we merged to create the admixture, we were able to compare selection scores and observed an increased performance in the site-specific PBS using masked populations instead of the admixed one. However, it is important to note that even with this improvement, the scores are far from matching the ones obtained from the true source population in our simulations (Supplementary Table 1). Additionally, although we calculated the true positive rate and precision for window-based comparison, due to the vast number of true negatives, it was not possible to obtain other meaningful indicators such as accuracy or false positive rate. On the other hand, haplotype aware XP-EHH analyses revealed that both reconstructed and admixed genomes performed poorly compared with the benchmark (Supplementary table 1b), with the naive approach being slightly better than our proposed solution in the case of haplotype-based methods. This might be an expected outcome since combining chunks from different individual genomes disrupts haplotype integrity (Yelmen et al. 2019), although it should still be seen as a strategy to be preferred over simple imputation in the absence of a suitable reference panel. Nonetheless, we found some regions above the threshold level (>2) in the admixed dataset, though there we never simulated any selection (after admixture). This suggests that the selection signal found in an admixed population using XP-EHH (or similar selection tests) does not necessarily advocate for the selection to have happened in that admixed population. It might be a remnant effect coming from source populations where the variant was already selected. Moreover, some signals detected in the admixed populations were not present in the source populations (Supplementary fig. 4). These findings suggest that XP-EHH and haplotype-aware methods are probably not a good option for selection analysis in admixed populations. In addition to the analyses on simulated data, we also performed PBS on real masked genomes and reported the first possible selection signals for the South Asian component of contemporary South Asian populations (fig. 1, Supplementary tables S2 and S3). Some possible signals included high-scoring SNPs from chromosome 6 within 50 kb of HLA-G and HLA-F and TRIM31 and TRIM40 genes related to immune system (Ishitani et al. 2003; Rajagopalan and Long 2012; Fu et al. 2017; Liu et al. 2017; Zhao et al. 2017; Lin and Yan 2019) along with KCTD6 related to sweet taste signaling pathway (Liu et al. 2013), ACOX2 related to branched fatty acid processing (Bjørklund et al. 2015; Vilarinho et al. 2016), DOK5 coding for adapter proteins involved in signal transduction (Favre et al. 2003), and CDH4 that is thought to be involved in brain segmentation and neuronal growth (Babb et al. 2005). These highlight for the first time putative selection signals that took place in the autochthonous South Asian population, as yet unsampled in its unadmixed form. However, it is important to note that signals related to HLA genes should be interpreted very cautiously due to difficulties related to variant calling and genotyping for that region. Furthermore, in our previous work (Yelmen et al. 2019), we found this region to be under unbalanced admixture between “North” and “South” South Asian components, with “North” haplotypes to appear preferred over “South” haplotypes. In conclusion, our work shows that applying selection scans on admixed populations of any diploid organism with no prior deconvolution yields several off target results, while a preliminary local ancestry deconvolution step may help improve the detection of true signals, in the case of PBS. On the other hand, more research is required to assess the effect of different local ancestry methods on detecting true selection signals since we detected varying results between PCAdmix and ELAI applications. Our results also show that haplotype-aware methods to detect selection may be severely impaired in presence of recent admixture between highly divergent populations. Overall, our results should inform future studies in the field to be cautious when reporting selection scans from human or other organisms where recent admixture has been detected, with particular reference to those populations where at least one of the admixing sources is no longer available in its unadmixed form due to complete assimilation or extinction (in case of wild species) of that particular group.

Materials and Methods

Samples and Simulations

We used SNP array data of French, Han, Japanese, Yoruba (Li et al. 2008), and German (Yunusbayev et al. 2015) populations along with masked South Asian populations (Brahmin Gujarati, Gujarati, Khatri, Maratha, Pallan, Chamar, Dharkar, Kanjar, Gujjar, and Ror) (Metspalu et al. 2011; Basu et al. 2016; Pathak et al. 2018) with same sample sizes as described in Yelmen et al. 2019 (see Supplementary table S5). A simulated admixed population consisting of 500 samples was created via admix-simu (Williams 2016) using French and Han as admixing groups. First, we only kept positions whose minor allele frequency is more than 1% (using vcftools –maf command), remove all the indels and kept only SNPs (using vcftools –remove-indels) and kept only the first biallelic position in case of multiple allele (using bcftools –norm -d all). After filtering, we used admix-simu to simulate the admixed populations, which were admixed 200 generations ago with 50/50 contributions from the two sources.

Local Ancestry Inference

Local ancestry inference was performed with PCAdmix (Brisbin et al. 2012) setting window size to ten SNPs after default LD pruning (r2 > 0.8, based on a built-in window size), a value suggested by the software’s developers as the smallest size for reliable resolution. Additionally, we only used windows assigned to ancestral proxies with posterior probabilities (provided in the PCAdmix .fbk output) higher than 0.95, based on simulation tests described in Yelmen et al. 2019. For the simulated admixture events we used Germans and Japanese as reference groups, given that French and Han were used as mixing populations. For the South Asian real populations, we used French and Paniya as proxies for “North” South Asian (N) and “South” South Asian (S) ancestry donors, respectively, following Yelmen et al. 2019. For ELAI (Guan 2014), we used -mg 100 to define the admixture event that happened 100 generations ago. We find that using -mg 200 gives spurious results, although the true admixture in this case was 200 generations ago. Further we used -s 20 for having 20 steps for EM. We ran 10 times independently for ELAI and then averaged it out, and used the same source populations described for the PCAdmix approach.

Haplotype Masking

For each local ancestry method, we kept the regions assigned to either ancestry with at least 95% confidence and marked all the other regions as unknown or unassigned. Based on ancestry assigned genomic chunks, two sets (PCAdmix and ELAI) of 500 MASK_N and 500 MASK_S individuals were created to be used in PBS analysis. These haploid individuals retained information only for the confidently assigned sites and showed gaps or “NA” for the rest of their genome.

Ancestral Random Breeders

Additionally, to overcome issues introduced by the missing data and by the lack of suitable imputation panels for the “South” South Asian component, we generated 20 ARBs for each ancestry, using ELAI as the local ancestry method, to be used in XP-EHH analysis. ARBs were created for each N and S ancestry, by taking all the MASK_N or MASK_S individuals and replenishing the masked-out haplotypes by randomly picking (with replacement) a nonmasked haplotype from another donor within the same ancestry assignation at that locus. This process hence created a number of ARB haploid individuals which feature the genetic makeup of the original individual used as a scaffold, and, where not available, a random set of haplotypes from a given ancestry drawn from that individual’s population. The reconstructed ARB population can therefore be seen as a set of random breeders, to be considered as the best available proxy to the actual ancestry source within the studied population. To control for fluctuations in allele frequencies introduced by this drawing approach, we worked only on a small number of recipient ARBs (20 individuals, from the simulated populations or from the whole set of available South Asian genomes) by retaining only positions for which a minimum number of available donor haplotypes were available in a given ancestry status to minimize the donation of the same haplotype to multiple ARBs, and by maximizing the length of the donated haplotype and its affinity to the surrounding sequences of a given receiving ARB, to minimize the number of ancestry switches artificially introduced by the ARB making process. This method has been initially described in Yelmen et al. (2019) to which we refer for further details and testing.

Population Branch Statistic

We used scikit-allel package (Miles et al. 2020) for calculating PBS score for each available position using allel.pbs function with window_size = 1 and window_step = 1. Two different sets of analyses were performed for simulated data. PBS for MASK_N_ELAI, MASK_N_PCAdmix, naive (as an admixture of French and Han), and French group was performed using Yoruba and Japanese outgroups [PBS(N, Japanese, Yoruba)]. PBS for MASK_S_ELAI, MASK_S_PCAdmix, naive (as an admixture of French and Han) and Han group was performed using Yoruba and German outgroups [PBS(S, German, Yoruba)]. For window-based analysis, genomes were divided into 50-kb regions and mean PBS was calculated for each region. We defined 99.9% and 99.5% thresholds as possible selection signals for each region.

Cross -Population Extended Haplotype Homozygosity

We used scikit-allel package (Miles et al. 2020) for calculating XP-EHH score for each available position using default parameters with allel.xpehh function. Two different sets of analyses were performed for simulated data. XP-EHH for ARB_N, naive (as an admixture of French and Han), and French group was performed using Japanese outgroup. XP-EHH for ARB_S, naive (as an admixture of French and Han), and Han group was performed using German outgroup.

Computing Correlation Coefficients and Sensitivity/Specificity Measures

We calculated Spearman’s correlation coefficient with R version 3.6.3 (Development Core Team R 2020) to assess the SNP by SNP correlation of PBS and XP-EHH scores. We used bootstrapping with 1,000 replicates to acquire 95 percent confidence intervals. For 50 kb window analyses, we calculated the true positive rate (TPR) and false discovery rate (FDR) as where TP is true positives, FN is false negatives, and FP is false positives.

Data Availability

All data used in this article are publicly available from the original publications and on https://evolbio.ut.ee link. All data used for the analyses were obtained from literature, no new sample data were collected. Click here for additional data file.

31 in total

1. Sequencing of 50 human exomes reveals adaptation to high altitude.

Authors: Xin Yi; Yu Liang; Emilia Huerta-Sanchez; Xin Jin; Zha Xi Ping Cuo; John E Pool; Xun Xu; Hui Jiang; Nicolas Vinckenbosch; Thorfinn Sand Korneliussen; Hancheng Zheng; Tao Liu; Weiming He; Kui Li; Ruibang Luo; Xifang Nie; Honglong Wu; Meiru Zhao; Hongzhi Cao; Jing Zou; Ying Shan; Shuzheng Li; Qi Yang; Peixiang Ni; Geng Tian; Junming Xu; Xiao Liu; Tao Jiang; Renhua Wu; Guangyu Zhou; Meifang Tang; Junjie Qin; Tong Wang; Shuijian Feng; Guohong Li; Jiangbai Luosang; Wei Wang; Fang Chen; Yading Wang; Xiaoguang Zheng; Zhuo Li; Zhuoma Bianba; Ge Yang; Xinping Wang; Shuhui Tang; Guoyi Gao; Yong Chen; Zhen Luo; Lamu Gusang; Zheng Cao; Qinghui Zhang; Weihan Ouyang; Xiaoli Ren; Huiqing Liang; Huisong Zheng; Yebo Huang; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Yingrui Li; Yong Zhang; Xiuqing Zhang; Ruiqiang Li; Songgang Li; Huanming Yang; Rasmus Nielsen; Jun Wang; Jian Wang
Journal: Science Date: 2010-07-02 Impact factor: 47.728

2. ACOX2 deficiency: A disorder of bile acid synthesis with transaminase elevation, liver fibrosis, ataxia, and cognitive impairment.

Authors: Sílvia Vilarinho; Sinan Sari; Francesca Mazzacuva; Kaya Bilgüvar; Güldal Esendagli-Yilmaz; Dhanpat Jain; Gülen Akyol; Buket Dalgiç; Murat Günel; Peter T Clayton; Richard P Lifton
Journal: Proc Natl Acad Sci U S A Date: 2016-09-19 Impact factor: 11.205

3. Zebrafish R-cadherin (Cdh4) controls visual system development and differentiation.

Authors: Sherry G Babb; Shannon M Kotradi; Bijal Shah; Christin Chiappini-Williamson; Lauren N Bell; Glen Schmeiser; Elbert Chen; Qin Liu; James A Marrs
Journal: Dev Dyn Date: 2005-07 Impact factor: 3.780

4. DOK4 and DOK5: new Dok-related genes expressed in human T cells.

Authors: C Favre; A Gérard; E Clauzier; P Pontarotti; D Olive; J A Nunès
Journal: Genes Immun Date: 2003-01 Impact factor: 2.676

5. Worldwide human relationships inferred from genome-wide patterns of variation.

Authors: Jun Z Li; Devin M Absher; Hua Tang; Audrey M Southwick; Amanda M Casto; Sohini Ramachandran; Howard M Cann; Gregory S Barsh; Marcus Feldman; Luigi L Cavalli-Sforza; Richard M Myers
Journal: Science Date: 2008-02-22 Impact factor: 47.728

6. The first horse herders and the impact of early Bronze Age steppe expansions into Asia.

Authors: Peter de Barros Damgaard; Rui Martiniano; Jack Kamm; J Víctor Moreno-Mayar; Guus Kroonen; Michaël Peyrot; Gojko Barjamovic; Simon Rasmussen; Claus Zacho; Nurbol Baimukhanov; Victor Zaibert; Victor Merz; Arjun Biddanda; Ilja Merz; Valeriy Loman; Valeriy Evdokimov; Emma Usmanova; Brian Hemphill; Andaine Seguin-Orlando; Fulya Eylem Yediay; Inam Ullah; Karl-Göran Sjögren; Katrine Højholt Iversen; Jeremy Choin; Constanza de la Fuente; Melissa Ilardo; Hannes Schroeder; Vyacheslav Moiseyev; Andrey Gromov; Andrei Polyakov; Sachihiro Omura; Süleyman Yücel Senyurt; Habib Ahmad; Catriona McKenzie; Ashot Margaryan; Abdul Hameed; Abdul Samad; Nazish Gul; Muhammad Hassan Khokhar; O I Goriunova; Vladimir I Bazaliiskii; John Novembre; Andrzej W Weber; Ludovic Orlando; Morten E Allentoft; Rasmus Nielsen; Kristian Kristiansen; Martin Sikora; Alan K Outram; Richard Durbin; Eske Willerslev
Journal: Science Date: 2018-05-09 Impact factor: 47.728

7. The genetic legacy of the expansion of Turkic-speaking nomads across Eurasia.

Authors: Bayazit Yunusbayev; Mait Metspalu; Ene Metspalu; Albert Valeev; Sergei Litvinov; Ruslan Valiev; Vita Akhmetova; Elena Balanovska; Oleg Balanovsky; Shahlo Turdikulova; Dilbar Dalimova; Pagbajabyn Nymadawa; Ardeshir Bahmanimehr; Hovhannes Sahakyan; Kristiina Tambets; Sardana Fedorova; Nikolay Barashkov; Irina Khidiyatova; Evelin Mihailov; Rita Khusainova; Larisa Damba; Miroslava Derenko; Boris Malyarchuk; Ludmila Osipova; Mikhail Voevoda; Levon Yepiskoposyan; Toomas Kivisild; Elza Khusnutdinova; Richard Villems
Journal: PLoS Genet Date: 2015-04-21 Impact factor: 5.917

8. Ancestry-Specific Analyses Reveal Differential Demographic Histories and Opposite Selective Pressures in Modern South Asian Populations.

Authors: Burak Yelmen; Mayukh Mondal; Davide Marnetto; Ajai K Pathak; Francesco Montinaro; Irene Gallego Romero; Toomas Kivisild; Mait Metspalu; Luca Pagani
Journal: Mol Biol Evol Date: 2019-08-01 Impact factor: 16.240

9. The KCTD family of proteins: structure, function, disease relevance.

Authors: Zhepeng Liu; Yaqian Xiang; Guihong Sun
Journal: Cell Biosci Date: 2013-11-24 Impact factor: 7.133

10. Genomic insights into the origin of farming in the ancient Near East.

Authors: Iosif Lazaridis; Dani Nadel; Gary Rollefson; Deborah C Merrett; Nadin Rohland; Swapan Mallick; Daniel Fernandes; Mario Novak; Beatriz Gamarra; Kendra Sirak; Sarah Connell; Kristin Stewardson; Eadaoin Harney; Qiaomei Fu; Gloria Gonzalez-Fortes; Eppie R Jones; Songül Alpaslan Roodenberg; György Lengyel; Fanny Bocquentin; Boris Gasparian; Janet M Monge; Michael Gregg; Vered Eshed; Ahuva-Sivan Mizrahi; Christopher Meiklejohn; Fokke Gerritsen; Luminita Bejenaru; Matthias Blüher; Archie Campbell; Gianpiero Cavalleri; David Comas; Philippe Froguel; Edmund Gilbert; Shona M Kerr; Peter Kovacs; Johannes Krause; Darren McGettigan; Michael Merrigan; D Andrew Merriwether; Seamus O'Reilly; Martin B Richards; Ornella Semino; Michel Shamoon-Pour; Gheorghe Stefanescu; Michael Stumvoll; Anke Tönjes; Antonio Torroni; James F Wilson; Loic Yengo; Nelli A Hovhannisyan; Nick Patterson; Ron Pinhasi; David Reich
Journal: Nature Date: 2016-07-25 Impact factor: 49.962

2 in total

Review 1. Human genetic admixture through the lens of population genomics.

Authors: Shyamalika Gopalan; Samuel Pattillo Smith; Katharine Korunes; Iman Hamid; Sohini Ramachandran; Amy Goldberg
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2022-04-18 Impact factor: 6.671

2. Adaptive eQTLs reveal the evolutionary impacts of pleiotropy and tissue-specificity while contributing to health and disease.

Authors: Melanie H Quiver; Joseph Lachance
Journal: HGG Adv Date: 2021-12-24

2 in total