| Literature DB >> 32933484 |
Christelle Vangenot1, José Manuel Nunes1,2, Gaby M Doxiadis3, Estella S Poloni1,2, Ronald E Bontrop3, Natasja G de Groot3, Alicia Sanchez-Mazas4,5.
Abstract
BACKGROUND: Many species are threatened with extinction as their population sizes decrease with changing environments or face novel pathogenic threats. A reduction of genetic diversity at major histocompatibility complex (MHC) genes may have dramatic effects on populations' survival, as these genes play a key role in adaptive immunity. This might be the case for chimpanzees, the MHC genes of which reveal signatures of an ancient selective sweep likely due to a viral epidemic that reduced their population size a few million years ago. To better assess how this past event affected MHC variation in chimpanzees compared to humans, we analysed several indexes of genetic diversity and linkage disequilibrium across seven MHC genes on four cohorts of chimpanzees and we compared them to those estimated at orthologous HLA genes in a large set of human populations.Entities:
Keywords: Balancing selection; Demographic history; HLA; Human populations; Linkage disequilibrium; MHC; Nucleotide diversity; Patr; Population bottleneck; Selective sweep; Western chimpanzees
Mesh:
Substances:
Year: 2020 PMID: 32933484 PMCID: PMC7491122 DOI: 10.1186/s12862-020-01669-6
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Map of the human and chimpanzee MHC region showing average physical distances between the 7 loci under study in both species. The distances between loci (in Kb = kilobases) slightly vary between the two species but they have the same order of magnitude. ~ 80 Kb stands for “physical distance between DQB1 and DRB1 is about 80 Kb”
Results of Hardy-Weinberg equilibrium (HWE) and Ewens-Watterson-Slatkin (EWS) tests at seven MHC loci in chimpanzees (pooled cohort) and humans (multiple populations)
| Chimpanzees (pooled cohort) | |||||||
|---|---|---|---|---|---|---|---|
| 44 | 48 | 29 | 46 | 51 | 51 | 50 | |
| HWE | 1 | 0.456 | 0.627 | 0.81 | 0.983 | 0.226 | 1 |
| EWS | 0.079 | 0.873 | 0.100 | 0.031 | 0.645 | 0.811 | 0.069 |
| EWS | 0.491 | 0.970 | 0.707 | 0.473 | 0.907 | 0.945 | 0.572 |
| Humans (multiple populations) | |||||||
| 50 | 79 | 52 | 89 | 80 | 59 | 81 | |
| 87.8 (45.2) | 100.6 (55.3) | 90.7 (45.5) | 105.5 (111.8) | 124 (134.6) | 127.9 (141.1) | 127.6 (133.6) | |
| % HWE rejections (after correction) | 0 | 0 | 0 | 1.1 (0) | 0 | 0 | 0 |
| % EWS rejections (excess heterozygotes) | 0 | 16.46 (2.5) | 28.84 (0) | 30.34 (7.9) | 30 (0) | 33.9 (0) | 16.05 (3.7) |
| % EWS rejections (excess homozygotes) | 22 (2) | 1.27 (0) | 0 | 2.25 (0) | 5 (0) | 1.69 (0) | 7.41 (0) |
N: sample size (in number of individuals); k: number of populations; : mean sample size (in number of individuals); s.d.: standard deviation; Holm correction for multiple testing is given within brackets for the % of HWE and EWS tests. Significance tests were done without prior assumptions, thus two-tailed rejection at the 5% level either occurs below 0.025 (excess of heterozygotes) or above 0.975 (excess of homozygotes). The order of loci corresponds to their position on the chromosome from centromere (left) to telomere (right). The pooled cohort does not include Texascb (see Text)
Allele frequencies at each Patr locus in the pooled cohort of chimpanzeesa
| Locus | Locus | Locus | Locus | Locus | Locus | Locus | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DPB1*01:09 | 0.279 | DQB1*03:02 | 0.5937 | DQA1*20:04 | 0.4138 | DRB1*02:01 | 0.2283 | B*01:01 | 0.2807 | C*04:01 | 0.3075 | A*03:01 | 0.1505 |
| DPB1*01:11 | 0.2514 | DQB1*06:02 | 0.2917 | DQA1*01:01 | 0.2759 | DRB1*02:04 | 0.2283 | B*05:01 | 0.1764 | C*06:01 | 0.2629 | A*04:01 | 0.126 |
| DPB1*01:12 | 0.1705 | DQB1*06:01 | 0.0521 | DQA1*05:02 | 0.1552 | DRB1*03:07 | 0.1528 | B*13:01 | 0.0784 | C*09:01 | 0.1211 | A*09:01 | 0.126 |
| DPB1*01:07 | 0.1073 | DQB1*15:01 | 0.0312 | DQA1*20:01 | 0.1034 | DRB1*03:02 | 0.0913 | B*24:02 | 0.0783 | C*02:03 | 0.0686 | A*07:01 | 0.1193 |
| DPB1*01:13 | 0.1023 | DQB1*03:05 | 0.0208 | DQA1*05:03 | 0.0517 | DRB1*03:05 | 0.087 | B*17:01 | 0.0686 | C*11:01 | 0.049 | A*01:01 | 0.0983 |
| DPB1*01:18 | 0.0114 | DQB1*06:07 | 0.0104 | DRB1*03:09 | 0.0462 | B*14:01 | 0.0588 | C*05:01 | 0.0392 | A*14:01 | 0.07 | ||
| DPB1*03:04 | 0.0114 | DRB1*10:01 | 0.0435 | B*24:01 | 0.0392 | C*12:01 | 0.033 | A*06:01 | 0.0668 | ||||
| blank | 0.0668 | DRB1*03:11 | 0.0326 | B*04:01 | 0.0294 | C*03:01 | 0.0196 | A*05:01 | 0.0422 | ||||
| DRB1*07:01 | 0.0217 | B*16:01 | 0.0294 | C*05:02 | 0.0196 | A*02:01 | 0.0319 | ||||||
| blank | 0.0684 | B*20:01 | 0.0294 | C*01:01 | 0.0114 | A*11:01 | 0.03 | ||||||
| B*03:01 | 0.0214 | C*02:02 | 0.0098 | A*06:02 | 0.0106 | ||||||||
| B*09:01 | 0.0214 | C*08:01 | 0.0098 | A*03:02 | 0.01 | ||||||||
| B*29:01 | 0.0196 | C*09:02 | 0.0098 | A*04:02 | 0.01 | ||||||||
| B*05:02 | 0.0098 | C*13:02 | 0.0098 | A*04:04 | 0.01 | ||||||||
| B*08:02 | 0.0098 | blank | 0.0288 | A*08:02 | 0.01 | ||||||||
| B*10:01 | 0.0098 | A*08:03 | 0.01 | ||||||||||
| B*16:02 | 0.0098 | A*17:01 | 0.01 | ||||||||||
| B*19:01 | 0.0098 | blank | 0.0683 | ||||||||||
| B*22:01 | 0.0098 | ||||||||||||
| B*23:01 | 0.0098 | ||||||||||||
| blank | 0.0002 | ||||||||||||
Alleles in grey have a frequency lower than 0.05
aThe allele frequencies for the individual chimpanzee cohorts are in Additional Table S1
bFor locus DQA1, only data for cohort BPRCwb are available
Genetic diversity at 7 MHC loci in chimpanzees (average on all chimpanzee cohorts and in the pooled cohort) and human populations (averaged on multiple populations)
| Number of samples ( | Allelic richness ( | Heterozygosity ( | Nucleotide diversity ( | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chimpanzees | Humans | Chimpanzees | Humans | Chimpanzees | Humans | Chimpanzees | Humans | ||||||
| average | pooleda | average | pooleda | average | pooleda | ||||||||
| 2 | 44 | 50 | 87.8 | 6.3 | 7.4 | 9.8 | 77.9 | 80.3 | 72.5 | 1.3 | 1.3 | 2.4 | |
| 3 | 48 | 79 | 100.6 | 5.6 | 9.6 | 55.8 | 79 | 3.9 | 6.0 | ||||
| 1 | 29 | 52 | 90.6 | 5 | 5 | 6.4 | 71.5 | 71.5 | 74.3 | 7.8 | 7.8 | 6.9 | |
| 3 | 46 | 89 | 105.5 | 10.1 | 9.7 | 14.3 | 84.7 | 84.6 | 86.4 | 8.1 | 7.0 | 7.0 | |
| 3 | 51 | 80 | 124 | 13.9 | 17.1 | 19.2 | 84.1 | 86.4 | 90.3 | 5.2 | 4.2 | ||
| 2 | 51 | 59 | 127.9 | 9.7 | 11.5 | 12.4 | 79.6 | 81 | 85.2 | 2.1 | 2.6 | ||
| 3 | 50 | 81 | 127.6 | 13 | 14.2 | 11.9 | 88.6 | 90.3 | 79.7 | 2.7 | 3.4 | ||
k: number of samples; N: size of pooled cohort; (s.d): average size of human population samples and standard deviation
(s.d): average allelic richness in the chimpanzee samples and standard deviation; ar: allelic richness of the pooled cohort; (s.d): average allelic richness in human population samples (estimated relatively to the pooled cohort) and standard deviation. * value (in bold) significantly different between human populations and chimpanzee cohorts (Wilcoxon test, p = 0.0342) for allelic richness at DQB1. Test without the Texascb cohort: Wilcoxon two-sided test: p = 0.0604, single-sided test (“less”): p = 0.0302
(s.d): average expected heterozygosity in the chimpanzee samples and in human population samples and standard deviation; H%: expected heterozygosity of the pooled cohort. * value (in bold) significantly different between human populations and chimpanzee cohorts (Wilcoxon test, p = 0.0316) for heterozygosity at DQB1. Test without the Texascb cohort: Wilcoxon two-sided test: p = 0.0371, single-sided test (“less”): p = 0.0185
(s.d): average expected nucleotide diversity in the chimpanzee samples and in human population samples and standard deviation; П: expected nucleotide diversity of the pooled cohort. * values (in bold) significantly different between human populations and chimpanzee cohorts (Wilcoxon test, p = 0.0426, 0.0175, 0.0225,0.0371) for nucleotide diversity at DQB1, B, C and A, respectively. Test without the Texascb cohort: Wilcoxon two-sided test: p = 0.0825, 0.0255, 0.0944, single-sided test: p = 0.0412, 0.0128, 0.0472 at DQB1, B and A, respectively
The order of loci corresponds to their position on the chromosome from centromere (top) to telomere (bottom)
adoes not include Texascb (see Text)
Fig. 2Genetic diversity indexes estimated in chimpanzee cohorts and human populations. Left panels: allelic richness (top), heterozygosity (middle) and nucleotide diversity (bottom) at the seven studied MHC loci in the pooled cohort of chimpanzees (in red) and averaged on multiple human populations (in blue). The pooled cohort includes all cohorts except Texascb. Middle panels: allelic richness (top), heterozygosity (middle) and nucleotide diversity (bottom) at the seven studied MHC loci in each cohort of chimpanzees (in red) and for the human populations (in blue) represented as violin plots. The values calculated for each chimpanzee cohort are indicated by filled and unfilled shapes for cohorts of wild-born and captive-born chimpanzees, respectively. The values calculated for the human populations (average number of k = 70 (s.d 15.9) samples of average size N = 109.2 (s.d 17.31)) are shown as violin plots. The width of the violin varies so as to represent the probability density of the data, the thick black bar in the centre represents the interquartile range, the thin black line extended from it represents the 95% confidence intervals, and the blue dot is the median. Right panel: allelic richness (top), heterozygosity (middle) and nucleotide diversity (bottom) at the seven studied MHC loci in each cohort of chimpanzees (in red) and for the human populations (in two shades of blue) represented as violin plots. The values calculated for each chimpanzee cohort are indicated by filled and unfilled shapes for cohorts of wild-born and captive-born chimpanzees, respectively. The values calculated for the human population are plotted as violin plots, in light blue for small sized and isolated populations that likely experienced rapid genetic drift (RGD) and in dark blue for large outbred populations with slow genetic drift (SGD).
Ordering of the MHC loci based on decreasing values of three genetic diversity indexes in chimpanzees (pooled cohort) and humans (average on multiple populations)
| Genetic diversity | Species | Ordering of values |
|---|---|---|
| Allelic richness | Chimpanzees (pooled cohort&) | |
| Humans (average#) | ||
| Heterozygosity | Chimpanzees (pooled cohort&) | |
| Humans (average#) | ||
| Nucleotide diversity | Chimpanzees (pooled cohort&) | |
| Humans (average#) |
Perfect matches in locus ordering between chimpanzees and humans are highlighted in bold; # estimated on multiple human populations; &: for chimpanzees, the pooled cohort has an average sample size of 45.57 on the different loci; #: for humans, the averages were calculated on an average number of 70.14 populations with an average sample size of 109.46. The pooled cohort does not include Texascb (see Text)
Results of Global Linkage Disequilibrium (GLD) significance test (PRS resampling procedure) between different pairs of MHC loci in chimpanzees (BPRC cohort) and humans (multiple populations, further subdivided into RGD and SGD populations)
| Chimpanzees | Humans | |||||||
|---|---|---|---|---|---|---|---|---|
| BPRCWB | Multiple populations | RGD | SGD | |||||
| Loci pairs | ||||||||
| 25 | 0.106 | 40 | 30 (12.5) | 15 | 47 (27) | 25 | 24 (4) | |
| 25 | 0.0697 | 33 | 27.3 (15.1) | 16 | 44 (25) | 17 | 12 (5.9) | |
| 25 | 0.328 | 31 | 25.8 (9.7) | 16 | 37 (12.5) | 15 | 13 (6.7) | |
| 25 | 1 | 10 | 10 (10) | 4 | 25 (25) | 6 | 0 | |
| 25 | 0.999 | 8 | 12.5 (0) | 2 | 50 (0) | 6 | 0 | |
| 24 | 0.982 | 12 | 0 | 5 | 0 | 7 | 0 | |
| 29 | 46 | 20 | 26 | |||||
| 29 | 51 | 23 | 28 | |||||
| 29 | 1 | 13 | 61.5 (53.8) | 7 | 86 (86) | 6 | 33 (16.7) | |
| 29 | 0.975 | 10 | 40 (40) | 5 | 60 (60) | 5 | 20 (20) | |
| 28 | 0.98 | 16 | 12.5 (12.5) | 9 | 22 (22) | 7 | 0 | |
| 29 | 38 | 21 | 17 | |||||
| 29 | 1 | 9 | 44.4 (44.4) | 5 | 80 (80) | 4 | 0 | |
| 29 | 1 | 6 | 33.3 (33.3) | 3 | 67 (67) | 3 | 0 | |
| 28 | 0.831 | 12 | 8.3 (0) | 7 | 14 (0) | 5 | 0 | |
| 29 | 1 | 39 | 51.3 (35.9) | 22 | 73 (54.5) | 17 | 23 (11.8) | |
| 29 | 0.998 | 30 | 63.3 (63.3) | 18 | 78 (78) | 12 | 42 (42) | |
| 28 | 0.991 | 39 | 28.2 (25.6) | 24 | 33 (33) | 15 | 20 (13.3) | |
| 29 | 59 | 21 | 38 | 63 (63) | ||||
| 28 | 0.931 | 76 | 35.5 (25) | 27 | 63 (40.7) | 49 | 20 (16.3) | |
| 28 | 0.892 | 58 | 53.4 (37.9) | 23 | 65 (47.8) | 35 | 46 (31.4) | |
The PRS resampling procedure was done with 10′000 simulations for the chimpanzee cohorts (significant results are in bold). The PRS resampling procedure was done with 1′000 simulations in each human population, and the table reports the percentage of significant results for each pair of loci (in brackets after Holm correction for multiple testing). Pairs in significant global linkage disequilibrium in more than 70% of human populations are in bold. N: number of chimpanzee individuals; p: p-value; k: total number of human populations; k1: number of RGD populations; k2: number of SGD populations. The table includes results obtained for the BPRCwb cohort, i.e. the cohort including the greatest number of animals and the only one for which all loci were tested. Results for the other cohorts are in Additional Table S5
Fig. 3Nucleotide diversity at MHC loci and other genomic regions in Western chimpanzees (A) and in different sub-species of chimpanzees and bonobos (P. paniscus) (B). R1: Non-coding autosomal regions [66]; R2: Non-coding autosomal regions [67]; R3: Xq13.3 [95]; R4: Non-coding autosomal regions [82]; R5: Mitogenome [82]; R6: Mitogenome [54]; Patr/Papa-B, C, A: average nucleotide diversity for genes Patr/Papa-B, −C, −A: this study, [61, 62]. No data is available for R1, R2 and R3 in P.t.ellioti, for R3 in P.paniscus and for R3 in P.t schweinfurthii. Values are given in Additional Table S10
Fig. 4Schematic representation of the evolutionary mechanisms explaining the genetic diversity observed in Patr genes. For each diversity index, the Patr loci are plotted according to the values given in Table 3 for the pooled cohort of chimpanzees. The pooled cohort includes all cohorts except Texascb
Summary of the chimpanzee and human population data at Patr and HLA genes, respectively
| Chimpanzees (individual and pooled cohorts) | ||||||||
| BPRCwb | 25 | 29 | 29 | 29 | 29 | 29 | 28 | |
| Yerkescb | 22 | 22 | 22 | |||||
| Texascb | 16 | 17 | 23 | 23 | ||||
| Kumawb | 19 | 19 | 17 | |||||
| Pooled cohorta | 44 | 48 | 29 | 46 | 51 | 51 | 50 | |
| Humans (multiple populations) | ||||||||
| Africa | 7 | 19 | 8 | 12 | 11 | 8 | 11 | |
| Europe | 14 | 21 | 15 | 16 | 6 | 3 | 6 | |
| Asia | 9 | 12 | 6 | 33 | 38 | 30 | 36 | |
| N/S America | 11 | 19 | 17 | 18 | 9 | 5 | 9 | |
| Australia | 3 | 2 | 2 | 3 | 4 | 4 | 4 | |
| Pacific | 6 | 6 | 4 | 6 | 5 | 4 | 8 | |
| Otherb | 0 | 0 | 0 | 1 | 7 | 5 | 7 | |
| 87.8 (45.2) | 100.6 (55.3) | 90.7 (45.5) | 105.5 (111.8) | 124 (134.6) | 127.9 (141.1) | 127.6 (133.6) | ||
N: sample size (in number of individuals); k: number of populations per geographic region; : mean sample size (in number of individuals); s.d.: standard deviation. The order of loci corresponds to their position on the chromosome from centromere (left) to telomere (right)
adoes not include Texascb (see Text)
bPopulations were allocated to one of six geographic regions, and we considered an additional category Other for known admixed populations