| Literature DB >> 34786601 |
Tatsuhiko Naito1,2, Yukinori Okada3,4,5.
Abstract
Variations of human leukocyte antigen (HLA) genes in the major histocompatibility complex region (MHC) significantly affect the risk of various diseases, especially autoimmune diseases. Fine-mapping of causal variants in this region was challenging due to the difficulty in sequencing and its inapplicability to large cohorts. Thus, HLA imputation, a method to infer HLA types from regional single nucleotide polymorphisms, has been developed and has successfully contributed to MHC fine-mapping of various diseases. Different HLA imputation methods have been developed, each with its own advantages, and recent methods have been improved in terms of accuracy and computational performance. Additionally, advances in HLA reference panels by next-generation sequencing technologies have enabled higher resolution and a more reliable imputation, allowing a finer-grained evaluation of the association between sequence variations and disease risk. Risk-associated variants in the MHC region would affect disease susceptibility through complicated mechanisms including alterations in peripheral responses and central thymic selection of T cells. The cooperation of reliable HLA imputation methods, informative fine-mapping, and experimental validation of the functional significance of MHC variations would be essential for further understanding of the role of the MHC in the immunopathology of autoimmune diseases.Entities:
Keywords: Autoimmune diseases; Fine-mapping; HLA; HLA imputation; MHC
Mesh:
Substances:
Year: 2021 PMID: 34786601 PMCID: PMC8837514 DOI: 10.1007/s00281-021-00901-9
Source DB: PubMed Journal: Semin Immunopathol ISSN: 1863-2297 Impact factor: 9.623
Fig. 1Structure of the MHC region and nomenclature of HLA alleles. a The MHC region is categorized into class I, II, and III. Only classical HLA genes are illustrated along with their positions for simplicity. b The nomenclature of HLA alleles. HLA alleles are named hierarchically as four fields based on the resolution of sequences. The last letter denotes expression status, e.g., “N” indicates “not to be expressed.”
Fig. 2An illustration of HLA imputation using a reference panel. An HLA reference panel contains individual data for which both SNP genotypes and HLA typing information are available. Based on LD information from a reference panel, it is possible to infer HLA allelic information of target individuals for whom only SNP genotype information is available.
Comparison of HLA imputation software
| Name | Type | Methods | URL | Reference |
|---|---|---|---|---|
| HLA*IMP:02 | Stand-alone software | Haplotype-graph model | NA | [ |
| HLA*IMP:03 | Web application | Random forest model | [ | |
| SNP2HLA | Shell script | Beagle with considering markers as binary alleles | [ | |
| HIBAG | R package | Bagging of multiple classifiers of EM algorithm | [ | |
| CookHLA | Python script | Beagle with considering markers as binary alleles and embedding of markers on exons | [ | |
| DEEP*HLA | Python script | Multi-task convolutional deep neural networks | [ |
Fig. 3The architecture and imputation strategy of DEEP*HLA. DEEP*HLA is a multi-task deep convolutional neural network model that takes SNP information and outputs genotype probabilities of HLA genes. In the training phase, DEEP*HLA learns the relationship between SNP genotype and HLA alleles from an HLA reference panel consisting of many individuals (a). In the imputation phase, a trained DEEP*HLA can perform HLA imputation from SNP genotype data without a reference panel (b).
A list of existing HLA reference panels
| Population | HLA typing method | Sample size | URL | Year | Reference |
|---|---|---|---|---|---|
| Europeans | SSOP | 5225 | 2013 | [ | |
| Korean | NGS | 413 | 2014 | [ | |
| East and South Asians | SSOP | 530 | 2014 | [ | |
| Japanese | SSOP | 908 | 2015 | [ | |
| Han-Chinese | NGS | 10,689 | 2016 | [ | |
| Japanese | NGS | 1120 | 2019 | [ | |
| Finnish | SSOP, SSP, SBT | 1150 | NA | 2020 | [ |
| Europeans | NGS | 401 | NA | 2020 | [ |
| Taiwanese | NGS | 1012 | NA | 2020 | [ |
| Multi-ethnicity | NGS | 21,546 | 2021 | [ |
SSOP sequence-specific oligonucleotide probe, NGS next-generation sequencing, SSP sequence-specific primer, SBT sequencing-based typing
Major findings on HLA associations in autoimmune diseases obtained from MHC fine-mapping studies
| Disease | HLA class (top association) | HLA loci associated with risk | Independently associated HLA variants | Population | Reference |
|---|---|---|---|---|---|
| Rheumatoid arthritis | II | HLA-DRβ1 AA 11 and 13, 71, 74; HLA-B AA 9; HLA-DPβ1 AA 9 | European | [ | |
| II | HLA-DRβ1 AA 11 and 13, 57, 74; HLA-B AA 9; HLA-DPβ1 AA 9 | East Asian | [ | ||
| II | HLA-DRβ1 AA 11 and 13 | East Asian | [ | ||
| II | HLA-DPβ1 AA 84; rs378352 (HLA-DOA); HLA-B*40:02 | East Asian | [ | ||
| Systemic lupus erythematous | II | HLA-DRβ1 AA 11 and 13 | East Asian | [ | |
| II | HLA-DRB1∗15:03; HLA-DQB1∗02:02, 03:19; HLA-DQA1∗05:01 02:01, 05:05; HLA-B∗08:01; HLA-C∗17:01 | African | [ | ||
| II | HLA-DQB1∗02:01; HLA-B∗08:01, 18:01; HLA-DRB3∗02; HLA-DQA1∗01:02 | European | [ | ||
| Dermatomyositis | II | HLA-DPB1*17 | East Asian | [ | |
| Idiopathic inflammatory myositis | II | HLA-DRB1*03:01; HLA-B*08:01; HLA-DQβ1 AA 57; HLA-DQB1*04:02 | European | [ | |
| Juvenile idiopathic arthritis | II | HLA-DRβ1 AA 13; HLA-DPB1*02:01; HLA-A AA 95; HLA-B AA 152 | European | [ | |
| Sjögren's syndrome | II | HLA-DQB1*0201 | European | [ | |
| Granulomatosis with polyangiitis | II | HLA-DPB1*04 | European | [ | |
| Eosinophilic granulomatosis with polyangiitis | II | HLA-DRB1*08:01; HLA-DQA1*02:01; HLA-DRB1*01:03 | European | [ | |
| Ankylosing spondylitis | I | HLA-B*27, 07:02 and 57:01; HLA-A*02:01; rs1126513 (HLA-DPB1); HLA-DRB1*01:03 | European | [ | |
| Psoriasis | I | HLA-C*06:02, 07:04; HLA-B AA 9, 67; HLA-DPB1*05:01 | East Asian | [ | |
| I | HLA-A*02:07; HLA-C*06:02; HLA-DQβ1 AA 57 | East Asian | [ | ||
| Celiac disease | II | HLA-DQβ1 AA 74, 57; HLA-DQα1 AA 47, 25 | European | [ | |
| Type I diabetes | II | HLA-DQβ1 AA 57, HLA-DRβ1 AA 13, 71; HLA-B*39:06; HLA-DPB1*04:02; HLA-A AA 62 | European | [ | |
| II | rs1770 (HLA-DQB1); HLA-DRβ1 AA 74, 11; HLA-A AA 9; HLA-C AA 275 | East Asian | [ | ||
| II | HLA-DQβ1 AA 185, 30, 70; HLA-DRβ1 AA 71, 74; HLA-B*54:01; HLA-A AA 62 | East Asian, European | [ | ||
| Grave's disease | II | HLA-DPβ1 AA 35, 9; HLA-A AA 9; HLA-B AA 45, 67; HLA-DRβ1 AA 74 | East Asian | [ | |
| Crohn's disease | II | HLA-DRB1*01:03; HLA-C*06:02 | European | [ | |
| II | HLA-DRβ1 AA 37, 57, HLA-DRB1*04:03 | East Asian | [ | ||
| Ulcerative colitis | II | rs6927022(HLA-DQA1); HLADRB1*01:03; HLA-C*12:02 | European | [ | |
| Pulmonary alveolar proteinosis | II | HLA-DRB1*08:03; HLA-DPβ1 AA 8 | East Asian | [ | |
| Primary biliary cholangitis | II | HLA-DRB1*08 and 14; HLA-DPB1*03:01 | European | [ | |
| II | HLA-DPβ1 AA 11; HLA-DRβ1 AA 74; HLA-DQβ1 AA 57; HLA-C AA 155; HLA-DQα1 AA 13 | European | [ | ||
| II | HLA-DRβ1 AA 74; HLA-DQβ1 AA 55; HLA-DPβ1 AA 85, 55 | East Asian | [ | ||
| Multiple sclerosis | II | HLA-DRB1*15:01, 03:01, 13:03, 04:04, 04:01, 14:01; HLA-A*02:01; HLA-DPβ1 AA 65; rs2516489; HLA-B*37:01, 38:01 | European | [ | |
| II | HLA-DRB1*15:01, *03:01, *13:03, *08:01; HLA-A*02:01, rs9277565 (HLA-DPB1); HLA-B*44:02, *38:01, *55:01; rs2229029 (LTA) | European | [ |
AA amino acid