| Literature DB >> 25126587 |
Alicia Sanchez-Mazas1, Diogo Meyer2.
Abstract
Next generation sequencing (NGS) is currently being adapted by different biotechnological platforms to the standard typing method for HLA polymorphism, the huge diversity of which makes this initiative particularly challenging. Boosting the molecular characterization of the HLA genes through efficient, rapid, and low-cost technologies is expected to amplify the success of tissue transplantation by enabling us to find donor-recipient matching for rare phenotypes. But the application of NGS technologies to the molecular mapping of the MHC region also anticipates essential changes in population genetic studies. Huge amounts of HLA sequence data will be available in the next years for different populations, with the potential to change our understanding of HLA variation in humans. In this review, we first explain how HLA sequencing allows a better assessment of the HLA diversity in human populations, taking also into account the methodological difficulties it introduces at the statistical level; secondly, we show how analyzing HLA sequence variation may improve our comprehension of population genetic relationships by facilitating the identification of demographic events that marked human evolution; finally, we discuss the interest of both HLA and genome-wide sequencing and genotyping in detecting functionally significant SNPs in the MHC region, the latter having also contributed to the makeup of the HLA molecular diversity observed today.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25126587 PMCID: PMC4122113 DOI: 10.1155/2014/971818
Source DB: PubMed Journal: J Immunol Res ISSN: 2314-7156 Impact factor: 4.818
Figure 1Plot comparisons of two different measures of genetic distances among populations. Each point corresponds to a pair of populations (taken from a large database of 90 populations typed for HLA-B and 106 populations typed for HLA-DRB1, resp.) for which two values are reported: the genetic distance estimated by taking into account both the allele frequencies and the molecular distance between the alleles (“Φst-derived” Reynold's genetic distance based on PBR sequences, X axis) and the genetic distance estimated only from allele frequencies (“F st-derived” Reynolds' genetic distance based on allele frequencies, Y axis). (a) Locus HLA-B; in this plot, the population pairs formed by the Samoans and all other populations are highlighted by red squares. (b) HLA-DRB1; in this plot, the population pairs formed by the Lebanese Arabs and all other populations are highlighted by red squares. For Samoans at HLA-B, genetic distances are clearly skewed toward higher values with Reynold's genetic distance based on PBR sequences, while the opposite situation is observed for the Lebanese Arabs at HLA-DRB1. Taking into account the molecular information has thus substantial effects on the estimation of population relationships (from [21]).
Number (#) and proportion (%) of HLA alleles defined at the second, third, and fourth fields of the current HLA nomenclature (IMGT/HLA database release 3.15.0, 2014-01-17). The third field (highlighted in bold) is defined by synonymous nucleotide substitutions in coding regions.
| Locus | Total | Second field | Third field | Fourth field | |||
|---|---|---|---|---|---|---|---|
| # | # | % | # | % | # | % | |
| A | 2579 | 1862 | 72.2% |
|
| 26 | 1.0% |
| B | 3285 | 2404 | 73.2% |
|
| 23 | 0.7% |
| C | 2133 | 1486 | 69.7% |
|
| 31 | 1.5% |
| DRA | 7 | 0 | 0.0% |
|
| 3 | 42.9% |
| DRB1 | 1411 | 972 | 68.9% |
|
| 10 | 0.7% |
| DQA1 | 51 | 23 | 45.1% |
|
| 15 | 29.4% |
| DQB1 | 509 | 318 | 62.5% |
|
| 10 | 2.0% |
| DPA1 | 37 | 15 | 40.5% |
|
| 5 | 13.5% |
| DPB1 | 248 | 198 | 79.8% |
|
| 4 | 1.6% |
Figure 2Distributions of the number of nucleotide differences between pairs of sequences (mismatch distributions) in two Asiatic populations of close sample sizes, one Taiwanese (Ami, N = 98, top) and one continental (Uyghur, N = 105, bottom), based on DNA sequences of HLA-A, HLA-B, and HLA-DRB1 alleles defined at high resolution (sequences downloaded from IMGT/HLA database). While the distributions exhibit a likely unimodal shape for the Uyghur, suggesting recent demographic expansion, they are very irregular for the Ami, reflecting population stationarity or contraction (note that the zero class corresponds to identical sequences, reflecting a high proportion of homozygosity). Besides these demographic signatures, Tajima's D are positive for all distributions, in agreement with balancing selection acting on these loci (not shown).