| Literature DB >> 32999407 |
Begoña Dobon1,2, Rob Ter Horst3,4, Hafid Laayouni1,5, Mayukh Mondal6, Erica Bianco1, David Comas1, Mihai Ioana7, Elena Bosch1,8, Jaume Bertranpetit9, Mihai G Netea10,11,12.
Abstract
The Roma people are the largest transnational ethnic minority in Europe and can be considered the last human migration of South Asian origin into the continent. They left Northwest India approximately 1,000 years ago, reaching the Balkan Peninsula around the twelfth century and Romania in the fourteenth century. Here, we analyze whole-genome sequencing data of 40 Roma and 40 non-Roma individuals from Romania. We performed a genome-wide scan of selection comparing Roma, their local host population, and a Northwestern Indian population, to identify the selective pressures faced by the Roma mainly after they settled in Europe. We identify under recent selection several pathways implicated in immune responses, among them cellular metabolism pathways known to be rewired after immune stimulation. We validated the interaction between PIK3-mTOR-HIF-1α and cytokine response influenced by bacterial and fungal infections. Our results point to a significant role of these pathways for host defense against the most prevalent pathogens in Europe during the last millennium.Entities:
Mesh:
Year: 2020 PMID: 32999407 PMCID: PMC7528012 DOI: 10.1038/s41598-020-73182-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Ancestry of Roma people from Romania. (A) Principal component analysis of Roma (ROM) from Romania in the context of other worldwide populations. The graph shows the two principal components and the variance explained by them. Roma fall in a cline between Indian and European populations. (B) Proportion of shared genetic drift between Roma and extant worldwide populations measured using outgroup f3 statistic in the form f3(Roma; X, YRI). (C) Clustering analysis showing K = 3 and K = 4. Roma show their own component in K = 4 (best supported model). Roma share more drift with Europeans than with South East Asian populations. See Supplementary Notes for the description of the population abbreviations; some coordinates were slightly displaced to avoid overlap.
Figure 2Runs of homozygosity in worldwide populations. Total number of runs of homozygosity (NROH) versus the sum of the total length of ROH in Mb (SROH) for ROH > 2 Mb in worldwide populations. Each dot represents population means. Except for Roma who are colored in orange, dots color denotes their geography: Europe (pink), South Asia (blue) and East Asia (green). Right panel is an inset of the main plot to better show the separation between populations. Roma show a similar profile to Indian tribal populations (RIA, ILA and BIR) and to populations with high endogamy, such as VLR. See Supplementary Notes for the description of the population abbreviations. See Supplementary Fig. 10d for a comparison generated by down sampling all populations to the lowest sample size available.
Figure 3Genomic distribution, local ancestry, and enrichment analysis of the selection signals. (A). Enrichment of SNP functional categories. Square size is proportional to the contribution of each SNPs functional category to the total χ2 score indicated by the Pearson residuals. Positive values (in blue) indicate that the proportion of that category is higher than expected in the signals whereas negative values (in red) indicate that that signals are depleted in that functional category. (B) Pathway enrichment analysis based on the genes under selection. Pathways with an excess of European local ancestry are marked with an asterisk (p-value < 0.05, Benjamini and Hochberg—False Discovery Rate (BH—FDR)). Bar length is proportional to the percentage of genes in the term found within our signals, while the number of genes found in each term is shown. Pathways are colored depending on their main biological function. All terms are statistically significant (p-value < 0.05, BH—FDR). (C) Modulation of cytokine production by PI3K-mTOR-HIF-1α. Heatmap of cytokine production after pathogen stimulation and inhibition of mTOR (by Rapamycine) or HIF-1α (by ascorbate). Significant p-values are indicated with colors: red indicates an increase in cytokine production after inhibition, whereas blue indicates a decrease. P-values were corrected for multiple testing by FDR.
Figure 4Enrichment of cQTLs in the selection signals. (A) Local Manhattan plot of the region in which the 8 cQTLs found in the selection signals are located. The left plot shows a region of 1 Mb, with R2 values calculated relative to rs28413898 (selected as reference point as it is in the center of the region). The right plot shows a zoomed in region of 70 Kb, where for each of the 8 SNPs the other SNPs in the same region most strongly in linkage disequilibrium are shown in the same color scheme. Both plots were created using LocusZoom[74]. LD-values are based on hg19 EUR 1000 Genomes 2014. (B) Number of cQTLs found in the selection signals (blue line) compared to permutation results (grey histogram). Overlap between the top cytokine QTLs (cQTLs, p-value < 1e-7) and pruned selection signals. The overlap was compared to 10,000 random permutations. cQTL results for different stimuli and cytokines were merged by taking the lowest p-value for each SNP.