Literature DB >> 34462158

HLA repertoire of 115 UAE nationals infected with SARS-CoV-2.

Halima Alnaqbi1, Guan K Tay2, Herbert F Jelinek3, Amirtharaj Francis4, Eman Alefishat5, Sarah El Haj Chehadeh6, Amna Tahir Saeed7, Mawada Hussein7, Bassam H Mahboub7, Maimunah Uddin8, Nawal Alkaabi8, Habiba S Alsafar9.   

Abstract

The class I and class II Human Leucocyte Antigens (HLA) are an integral part of the host adaptive immune system against viral infections. The characterization of HLA allele frequency in the population can play an important role in determining whether HLA antigens contribute to viral susceptibility. In this regard, global efforts are currently underway to study possible correlations between HLA alleles with the occurrence and severity of SARS-CoV-2 infection. Specifically, this study examined the possible association between specific HLA alleles and susceptibility to SARS-CoV-2 in a population from the United Arab Emirates (UAE). The frequencies of HLA class I (HLA-A, -B, and -C) and HLA class II alleles (HLA-DRB1 and -DQB1); defined using Next Generation Sequencing (NGS); from 115 UAE nationals with mild, moderate, and severe SARS-CoV-2 infection are presented here. HLA alleles and supertypes were compared between hospitalized and non-hospitalized subjects. Statistical significance was observed between certain HLA alleles and supertypes and the severity of the infection. Specifically, alleles HLA-B*51:01 and HLA-A*26:01 showed a negative association (suggestive of protection), whilst genotypes HLA-A*03:01, HLA-DRB1*15:01, and supertype B44 showed a positive association (suggestive of predisposition) to COVID-19 severity. The results support the potential use of HLA testing to differentiate between patients who require specific clinical management strategies.
Copyright © 2021 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  COVID-19; Human Leucocyte Antigen; Major histocompatibility complex; SARS-CoV-2; United Arab Emirates

Mesh:

Substances:

Year:  2021        PMID: 34462158      PMCID: PMC8391094          DOI: 10.1016/j.humimm.2021.08.012

Source DB:  PubMed          Journal:  Hum Immunol        ISSN: 0198-8859            Impact factor:   2.850


Introduction

The Human Leucocyte Antigens (HLA) class I (HLA-A, HLA-B, and HLA-C) and class II (HLA-DRB1, HLA-DRA1, HLA-DQB1, HLA-DQA1, HLA-DPB1, and HLA-DPA1) genes of the Major Histocompatibility Complex (MHC) are important elements of the immune system. Proteins encoded by these genes play a role in initiating the host response against viral infections [1]. In this regard, studies have shown that specific HLA alleles and HLA haplotypes play a role in the severity as well as the progression of several autoimmune diseases [2] as well as viral diseases, including Human Immunodeficiency Virus (HIV) [3], [4], [5], [6], [7], hepatitis B [8], [9], hepatitis C [10], [11], [12] and influenza [13], [14], [15]. Moreover, HLA class I and class II homozygosity or heterozygosity appear to play a role in viral disease susceptibility. In genetically isolated populations, homozygosity has been shown to lead to greater susceptibility to infection, whereas heterozygosity can increase resistance to infection [16], [17]. The highly polymorphic nature of the HLA class I and HLA class II genes encode diverse protein variants that control the adaptive immune system. HLA class I molecules have two linked polypeptide chains, α and β–2 microglobulin. A peptide-binding groove exists between the α1 and α2 domains. The peptide-binding groove of the class I molecule accommodates peptides of 8 to 10 amino acids in length. When a virus infects a host cell, its proteins are degraded into small peptides. These viral peptides then bind to the binding groove of these HLA class I molecules and are exposed on the surface of the cell. The presence of those viral antigens is then recognized by leukocytes, and a cascade of immune response events is initiated to eliminate the virus [18]. HLA class II molecules have two polypeptides chains, an α and a β chain. The peptide-binding groove of HLA class II is made up of the α1 and β1 domains. It can accommodate up to 12 to 24 amino acids where the peptide binds in an extended conformation, with about a third of the peptide being approachable by the CD8 + T-cells [1]. Although limited, information on the association between HLA and COVID19 is gradually accumulating. Insights can also be gleaned from previous reports on the closely related SARS-CoV and MERS-CoV [18] strains of coronavirus. To date, HLA-B is the most polymorphic class I gene, followed by HLA-A and HLA-C [19]. Polymorphic sites situated in locations that encode the peptide-binding groove, which interacts with T-cell receptors, impacts the binding specificity of amino acids in the binding groove [1]. The frequencies of HLA variants differ between populations as a result of historical migrations leading to admixture and different selection pressures of past epidemics [1]. These factors, among others, have contributed to the selection of the HLA alleles with antigen peptide-binding properties for specific viral peptides. Previous research describing antigen presentation of SARS-CoV has revealed that several HLA variants contribute to the susceptibility of the infection. These HLA variants have included HLA-B*46:01, HLA-B*07:03, HLA-DRB1*12:02 [20], and HLA-C*08:01, whereas HLA-DRB1*03:01, HLA-C*15:02, and HLA-A*02:01 alleles were found to be protective against infection [18]. Genome sequencing of SARS-CoV-2 and SARS-CoV isolates has shown a high degree of homology at the nucleotide level between the two viruses, but there is a noticeable difference at the protein level [21]. Protein sequence alignment analysis revealed that there are two proteins; encoded by orf8 and orf10 in SARS-CoV-2; that have no homologous proteins in SARS-CoV, which may contribute to the differences in the HLA response to SARS-CoV and SARS-CoV-2 [21]. The highly infectious nature of SARS-CoV-2 and its rapid spread warrants the study of factors that contributed to the differences observed in the susceptibility and severity of SARS-CoV-2 among individuals. Severity risk factors include age, cardiovascular disease, and metabolic disorders comorbidity [22]. However, the role of genetic variations in individuals that account for the different clinical manifestations of the disease remains elusive [23]. Epidemiological studies have shown that while most cases of COVID-19 are mild in nature, the disease appears to be highly fatal in severe cases. A fatality rate of around 26% among hospitalized patients and 37% among those who have needed invasive mechanical ventilation has been reported in the United Kingdom [24]. Studies have shown that HLA class I genes or those that are in linkage disequilibrium with HLA class I genes might have an impact on the pathogenesis of the severe acute respiratory syndrome (SARS) in 2003 [20] and SARS-CoV-2 in 2019 [25], [26], [27]. Furthermore, our research has implicated HLA-B*35 and HLA-C*07, as well as the 2-locus haplotype: HLA -C*04-B*35 to SARS-CoV-2 infection in a mixed population that resides in the UAE. The latest research points to genetic variants that contribute to immunity factors may also be playing a role in the manifestation of the observed variable clinical outcomes [28]. Knowledge of genetic biomarkers, including HLA, could assist in prioritizing public health intervention for vulnerable individuals in addition to providing individualized therapeutic targets [28]. Using Next Generation Sequencing (NGS) based HLA typing, this study investigated a possible correlation between the occurrence and severity of COVID-19 and certain HLA alleles and haplotypes in 115 COVID-19 patients from the UAE.

Materials and methods

Enrolment and sample collection

Blood samples of 115 unrelated patients with COVID-19 were collected during their COVID-19 case review at the Sheikh Khalifa Medical Centre (SKMC), Abu Dhabi, UAE. All 115 patients were nationals of the UAE. Information about the study was provided to potential participants. Those who agreed to be part of the study provided their written consent on forms that were approved by the Abu Dhabi Health COVID-19 Research Ethics Committee (DOH/DQD/2020/538). Reciprocal approval was also granted by the Dubai Scientific Research Ethics Committee (DSREC-04/2020_09) and SEHA Research Ethics Committee (SEHA-IRB-005). Samples from minors (age < 18 years) were collected after obtaining signed assent forms from their parents. Blood was collected by an experienced phlebotomy nurse. A total of 2 ml of blood was collected from the cubital vein in an ethylenediaminetetraacetic acid (EDTA) tube. Samples were stored in a sealed biohazard bag and transported at 4 °C to the testing laboratories at Khalifa University, using a cool transport container.

Demographic data collection

Demographic data, including age, gender, and medical history (Table 1 ), were obtained with the aid of questionnaires. Clinical assessments of the participants provided by the healthcare providers included the determination of the level of severity (mild, moderate, and severe) and diagnosis of pneumonia which was confirmed using a chest x-ray. The group of participants that presented with mild or no disease symptoms did not require hospitalization (n = 82). The patients from the moderate group exhibited symptoms such as fever, cough, and pneumonia, which required hospitalization (n = 12). The severe group presented with critical clinical features such as high temperature, cough, pneumonia, shortness of breath, and required intensive care (n = 21).
Table 1

Demographic data of the COVID-19 Abu Dhabi cohort, United Arab Emirates.

Mild(n = 82)Moderate and severe(n = 33)
Age mean in years34 ± 1458 ± 15
Gender
Female – count (percentage)27 (32.9%)9 (27.2%)
Male– count (percentage)55 (67%)24 (72.7%)
Clinical outcome
Pneumonia – count (percentage)18 (22.2 %)33 (100 %)
Death - count0 (0%)10 (30.3 %)
Demographic data of the COVID-19 Abu Dhabi cohort, United Arab Emirates.

DNA extraction and HLA sequencing

DNA was extracted from the EDTA tubes using the MagPurix system (Zinexts, Taiwan) as per the recommendations by the manufacturer. The quantity of the genomic DNA was determined by the dsDNA broad range fluorescence-based quantitation method (Denovix, Wilmington, USA). HLA genotyping was performed using the Holotype HLA 96/11 library kit (Omixon, Budapest, Hungary) according to the manufacturer’s protocol. The first step involved amplification of HLA class I and HLA class II loci by long-range Polymerase Chain Reaction (PCR). The PCR products of each individual were subsequently pooled for cleanup. The third step was the library preparation phase, which involved fragmentation, end repair, and ligation with indexed adaptors. Libraries were then combined into a single pooled library, and size selection was carried out using AMPure XP beads (Beckman Coulter, Massachusetts, USA). Quantification of the single pooled library was performed on the Viia 7 real-time PCR instrument (Applied Biosystems, Foster City, USA) using KAPA library quantification ROX low kit (Kappa Biosystems, Wilmington, USA). The final pooled library was then loaded onto the Illumina Miseq system (Illumina, San Diego, USA). FASTQ sequencing files were uploaded into the HLA Twin Software v4.2.0 (Omixon, Budapest, Hungary) for analysis. The software runs two independent computational algorithms to provide high confidence allele calling.

Statistical analysis

The samples were genotyped at a resolution of up to 4-fields using NGS. However, due to the sample size, the results in the second field resolution (subtype) were used for analysis and reported to minimize the number of tests, as analysis with high resolution would have generated too many unique 3-field allele assignments. Further, restricting the analysis to 2-field resolution allowed comparisons with published results. The cohort was divided into two groups, hospitalized (patients with moderate to severe conditions) and non-hospitalized (patients who were asymptotic or had very mild symptoms). Rare and very rare alleles were checked using the rare allele detector server (score < 4) available at (http://www.allelefrequencies.net), which detects rare and very rare alleles based on the established common and well-documented alleles in world populations [29]. Expected heterozygosity (He) and inbreeding coefficient (Fis) for each locus were determined using GenePop (stand-alone version 4.7.0 with the default settings) [31]. Hardy Weinberg Equilibrium test (HWE) was obtained using Genepop and Pypop [30] (the Guo and Thompson Hardy Weinberg test) [32]. Hardy-Weinberg equilibrium (HWE) was further tested using Genepop when H1 = excess heterozygosity and when H1 = defect heterozygosity using default settings (Markov chain parameters: dememorization = 10,000, batches = 20, iterations per batch = 5000). Based on the proposed approach for analyzing immunogenetic data in case-control studies by Hollenbach, et al. [33], the Bridging ImmunoGenomic Data-Analysis Workflow Gaps (BIGDAWG) R package was used for case-control association analyses of individual HLA loci and amino acid level analysis [34]. HLA allelic association analyses was estimated using chi-square testing. Calculations of odds ratio (95% confidence interval), and P-values were performed using the R epicalc package implemented in BIGDAWG. The BIGDAWG software was designed for the analysis of highly polymorphic HLA data. The software combines rare alleles for multi-locus analyses with expected counts less than five (or degree of freedom do not allow for a test) into a common class (binning) for each locus and performs a goodness-of-fit test. The corrected P values (Pc) for multiple comparisons were calculated using the Bonferroni method. The significance threshold (P = 0.05) was divided by the number of tested alleles after binning by BIGDAWG for the allelic association analysis (5 for HLA-A, 3 for HLA-C, 4 for HLA-B, 5 for HLA-DRB1, and 8 for HLA-DQB1), and the number of supertypes groups for the supertype association analysis (7 for HLA-A, 3 for HLA-C and 8 for HLA-B including the unclassified group). All associations reported herein are based on the corrected P-value.

In silico prediction of HLA class I binding affinity to viral peptides

In order to perform in silico peptide binding prediction for the complete SARS-CoV-2 proteome (9,744 amino acids), we obtained FASTA-formatted protein sequence data from the National Center of Biotechnology Information (NCBI) database (https://www.ncbi.nlm.nih.gov/genome/browse#!/overview/Sars-cov-2). Specifically, the following 11 protein viral products were retrieved: ORF1ab (YP_009724389.1), surface glycoprotein (S) (YP_009724390.1), ORF3a (YP_009724391.1), envelope (E) (YP_009724392.1), membrane glycoprotein (M) (YP_009724393.1), ORF6 (YP_009724394.1), ORF7a (YP_009724395.1), ORF7b (YP_009725318.1), ORF8 (YP_009724396.1), nucleocapsid (N) (YP_009724397.2), and ORF10 (YP_009725255.1). Those sequences were k-meirzed to 8-, 9-, 10-, 11- and 12-mers and uploaded into the NetMHCpan v4.1 server [35].

Results

The Abu Dhabi COVID-19 cohort in this study consisted of 115 UAE nationals infected with SARS-CoV-2. The age of the patients ranged from 16 to 80 years. Around seventy-one percent of the cohort were asymptomatic, while 28.6% had a moderate or severe infection. In total, 51 patients developed pneumonia: 18 out of 82 mild cases and 33 out of 33 moderate and severe cases. A summary of the demographics of the cohort can be found in Table 1. The correlation between age and COVID-19 severity was examined to assess the effect of age as a confounding factor in the study (see Fig. 1 ).
Fig. 1

A moderately positive correlation between Covid-19 severity and age in 115 UAE national patients with SARS-Cov-2 infection. Each point in the graph represents one individual. Four patients, 2 with mild, 1 with moderate, and 1 with severe symptoms, were excluded from the analysis due to the lack of necessary data (Age or date of birth).

A moderately positive correlation between Covid-19 severity and age in 115 UAE national patients with SARS-Cov-2 infection. Each point in the graph represents one individual. Four patients, 2 with mild, 1 with moderate, and 1 with severe symptoms, were excluded from the analysis due to the lack of necessary data (Age or date of birth). Table 2 lists the HLA class I (HLA-A, -B, -C) and class II (HLA-DRB1, -DQB1) allele counts of the non-hospitalized (n = 82) compared to the hospitalized (n = 33) group.
Table 2

HLA Class I and class II allele count and statistical association in non-hospitalized and hospitalized SARS-CoV2 infected patients.

HLA AlleleNon-hospitalized(n = 82)Hospitalized(n = 33)ORP-value95% CIEffectSB(<50 nM)WB(<500 nM)
HLA-A*02:018112.280.0929(0.77–7.01)243728
HLA-A*03:012108.750.0015(1.7384.34)Risk249729
HLA-A*24:02920.310.1200(0.03–1.57)328964
HLA-A*26:011300.000.0019(0.00–0.44)Protective347986
HLA-C*04:01981.920.2035(0.60–6.01)248752
HLA-C*06:022080.760.5551(0.27–2.00)239726
HLA-B*08:01373.310.0787(0.71–20.62)220722
HLA-B*50:01960.850.7707(0.23–2.88)193621
HLA-B*51:013100.00<0.0001(0.000.10)Protective205652
HLA-DRB1*03:011670.860.7502(0.28–2.39)
HLA-DRB1*07:011670.860.7502(0.28–2.39)
HLA-DRB1*11:011130.520.3229(0.09–2.09)
HLA-DRB1*15:01289.220.0012(1.7291.09)Risk
HLA-DQB1*02:012030.310.0611(0.06–1.15)
HLA-DQB1*02:021350.910.8657(0.24–2.95)
HLA-DQB1*03:011671.050.9140(0.34–2.97)
HLA-DQB1*03:021220.370.1911(0.04–1.78)
HLA-DQB1*04:02861.910.2502(0.51–6.72)
HLA-DQB1*05:01673.110.0442(0.83–11.87)
HLA-DQB1*05:021450.840.7463(0.22–2.67)

Alleles that showed statistical significance (corrected P-value < 0.05) are denoted in bold. OR: odds ratio, CI: confidence interval.

HLA Class I and class II allele count and statistical association in non-hospitalized and hospitalized SARS-CoV2 infected patients. Alleles that showed statistical significance (corrected P-value < 0.05) are denoted in bold. OR: odds ratio, CI: confidence interval. In total, we identified 40 HLA-A alleles, 32 HLA-C alleles, 38 HLA-B alleles, 41 HLA-DRB1 alleles, and 17 HLA-DQB1 alleles. However, due to low counts of some alleles, 35 alleles of HLA-A, 29 of HLA-C, 34 of HLA-B, 36 of HLA-DRB1, and nine of HLA-DQB1 were combined (binned) into 5 common groups (HLA-A binned, HLA-C binned, HLA-B binned, HLA-DRB1 binned, and HLA-DQB1 binned) by BIGDAWG prior to computing the χ2 statistic. This resulted in a total of 5 alleles for HLA-A, 3 for HLA-C, 4 for HLA-B, 5 for HLA-DRB1, and 8 for HLA-DQB1. Statistical analysis incorporating the binned groups resulted in some significant allele associations between the hospitalized and non-hospitalized groups (Table 2). From class I, HLA-A*03:01 (P = 0.0015) was observed more frequently in the hospitalized groups while HLA-A*26:01 (P = 0.0019) and HLA-B*51:01 (P < 0.0001) genotypes were common among non-hospitalized patients and absent in the hospitalized group, indicating a possible protective effect. From class II, HLA-DRB1*15:01 (P = 0.0021) was significantly higher in the non-hospitalized group. Analysis for strong binding (SB) and weak binding (WB) of SARS-CoV-2 peptide binding affinity for each HLA class I allele are presented in Table 2. Strong binders (SB) are defined as having a percentile binding affinity rank (%rank) ≤ 0.5, while weak binders (WB) are defined as having a %Rank ≤ 2%. The total number of different peptides recognized as SB or WB for each HLA allele was then calculated to identify the HLA allele with the greater number of possible strong and weak viral peptide recognition. However, when the predicted peptide binding results were compared to the genotypic data of this study cohort, no direct correlation to severity was found (Table 2). The extreme polymorphism of HLA alleles results in a vast diversity of peptide-binding HLA specificities and low population coverage for any given peptide-binding HLA specificity [36]. Several researchers have found that aggregating HLA alleles into supertypes is useful in studies of disease association [36], [37], and in designing specific cytotoxic T-lymphocyte (CTL) vaccine candidates for SARS [38]. This increases the power of the studies by grouping large numbers of rare alleles according to their functional relevance [37]. In this regard, the differences between the hospitalized and the non-hospitalized group based on HLA class I supertypes were estimated. To test for HLA supertype association, the revised and updated classification of HLA class I supertypes for locus A and B described by Sidney, et al. [36], which covers most of the HLA class I polymorphism, was used. Only those subtypes [36] that were identified based on the stringency of selection (i.e. reference panel) were considered in the analysis. Unlike HLA class I, class II supertypes have been less extensively studied due to the complexity of their structure. In this regard, this paper only discusses class I supertype associations. After correcting for multiple comparison, the results of this study showed that supertype B44 (including HLA-B*18:01, -B*40:02, -B*41:01, -B*44:02, -B*44:03, B*45:01, -B*50:01) is significantly associated to SARS-CoV-2 severity (see Fig. 2 ).
Fig. 2

The association of HLA-B supertypes on the severity of SARS-CoV-2 infection in the current cohort. The dots represent uncorrected P-values and the dotted line represents the P-corrected treshhold (Pcr = 0.006). Only B44 supertypes are associated with SARS-CoV-2 infection.

The association of HLA-B supertypes on the severity of SARS-CoV-2 infection in the current cohort. The dots represent uncorrected P-values and the dotted line represents the P-corrected treshhold (Pcr = 0.006). Only B44 supertypes are associated with SARS-CoV-2 infection. In the current study, an overall higher than expected level of homozygosity for HLA-A, HLA-B, HLA-C, HLA-DRB1, and HLA-DQB1 was observed in the 115 infected patients after applying the Hardy-Weinberg test using PyPop (Table 3 ). Further, we compared the heterozygosity level between hospitalized and non-hospitalized patients and found that HLA-A and HLA-B loci showed a statistically significant association with severity (Fig. 3 ). Others have highlighted associations between homozygosity of HLA class I alleles with age at death in COVID-19 patients [39].
Table 3

Hardy-Weinberg test for heterozygote deficiency and excess for each HLA locus in COVID-19 group using Genepop.

Fis estimation

LocusP-valueS.E.W&CR&H
HLA-A0.00000.00000.19350.1159
HLA-B0.00000.00000.24500.1621
HLA-C0.00000.00000.22340.2690
HLA-DRB10.00000.00000.35050.3725
HLA-DQB10.00010.00010.30240.2826



Hardy Weinberg test when H1 = heterozygote deficit
Fis estimation

LocusP-valueS.E.W&CR&H

HLA-A0.99970.00030.19350.1159
HLA-B1.00000.00000.24500.1621
HLA-C1.00000.00000.22340.2690
HLA-DRB11.00000.00000.35050.3725
HLA-DQB11.00000.00000.30240.2826
Hardy Weinberg test when H1 = heterozygote excess
Fis estimation

LocusP-valueS.E.W&CR&H

HLA-A0.00170.00170.19350.1159
HLA-B0.00000.00000.24500.1621
HLA-C0.00000.00000.22340.2690
HLA-DRB10.00000.00000.35050.3725
HLA-DQB10.00290.00050.30240.2826

Fis estimation: inbreeding coefficient. S.E.: standard error, R&H: Robertson & Hill value, W&C: Weir & Cockerham value.

Fig. 3

The association of heterozygosity level in HLA loci to the severity of COVID-19. heterozygosity is defined as having the same 2-field allele in a specific locus and was tested using Pypop.

Hardy-Weinberg test for heterozygote deficiency and excess for each HLA locus in COVID-19 group using Genepop. Fis estimation: inbreeding coefficient. S.E.: standard error, R&H: Robertson & Hill value, W&C: Weir & Cockerham value. The association of heterozygosity level in HLA loci to the severity of COVID-19. heterozygosity is defined as having the same 2-field allele in a specific locus and was tested using Pypop. The inbreeding coefficient (Fis) for the cohort ranged from 0.19 to 0.35 for the Weir, and Cockerham estimates and from 0.12 to 0.37 with Robertson and Hill estimates, where an estimate up to 0.1 demonstrates a low degree of inbreeding and a zero represents individuals with arbitrarily remote common ancestors (Table 4 ).
Table 4

Guo and Thompson Hardy-Weinberg test of 115 COVID-19 patients using PyPop.

LocusObserved HeterozygotesExpected HeterozygotesObserved HomozygotesExpected HomozygotesP-value
HLA-A49101596.650.0000*
HLA-C83103298.620.0000*
HLA-B44915360.0000*
HLA-DRB182109358.180.0000*
HLA-DQB1699031100.0000*
Guo and Thompson Hardy-Weinberg test of 115 COVID-19 patients using PyPop. Locus level association analysis results showed that HLA-A (P < 0.001) and HLA-B (P < 0.001) are significantly associated with COVID-19 severity (Table 5 ).
Table 5

Summarized locus association testing results obtained using BIGDAWG software. Loci denoted in bold are significant after correcting for multiple comparisons.

Locusχ2P-valueSignificance
HLA-A22.73<0.001S
HLA-C1.770.413NS
HLA-B33.82<0.001S
HLA-DRB111.000.027NS
HLA-DQB19.960.191NS

NS: Not significant, S: Significant

Summarized locus association testing results obtained using BIGDAWG software. Loci denoted in bold are significant after correcting for multiple comparisons. NS: Not significant, S: Significant From the amino acid position association analysis, multiple positions from locus HLA-A and HLA-B were found in a significant association with the disease (P < 0.001) while HLA-C had one position, namely position.121, that was marginally significant. From class II locus, two positions from HLA-DRB1 [position.13 (P = 0.007), position.72 (P = 0.0012)] and one position from HLA-DQB1 [position.57 (P = 0.028)] were associated with the disease. Table 6 summarizes all amino acid associations.
Table 6

Summarized significant amino acid position association analysis results obtained using BIGDAWG.

LocusPositionχ2P-valueLocusPositionχ2P-value
APosition.912.670.005BPosition.1211.220.001
APosition.655.540.019BPosition.2435.94<0.001
APosition.704.520.034BPosition.3711.380.001
APosition.746.580.010BPosition.4611.380.001
APosition.7618.79<0.001BPosition.679.260.002
APosition.7718.42<0.001BPosition.7010.880.001
APosition.7914.68<0.001BPosition.7110.880.001
APosition.8014.68<0.001BPosition.7231.33<0.001
APosition.8114.68<0.001BPosition.8611.180.001
APosition.8214.68<0.001BPosition.9338.77<0.001
APosition.8314.68<0.001BPosition.9438.77<0.001
APosition.9723.42<0.001BPosition.10818.62<0.001
APosition.1057.930.005BPosition.1266.770.009
APosition.1076.580.010BPosition.14415.64<0.001
APosition.1099.090.003BPosition.1654.770.029
APosition.11432.77<0.001BPosition.16927.32<0.001
APosition.1436.480.011BPosition.17615.040.001
APosition.14530.79<0.001BPosition.18421.57<0.001
APosition.1466.480.011BPosition.1909.900.002
APosition.15214.03<0.001BPosition.1939.900.002
APosition.17617.83<0.001DRB1Position.1314.000.007
CPosition.1216.210.045DRB1Position.7210.880.012
BPosition.98.820.012DQB1Position.579.130.028
BPosition.116.260.012

Loci denoted in bold are significant associations after correcting for multiple comparisons.

Summarized significant amino acid position association analysis results obtained using BIGDAWG. Loci denoted in bold are significant associations after correcting for multiple comparisons.

Discussion

Since the outbreak of SARS-CoV-2 in 2019, HLA variability analysis has been the focus of many groups to identify possible genetic variations that contribute to the severity and progression of the disease [25], [26], [27]. However, most of these findings do not align, making it difficult to draw any conclusions about the role of HLA in COVID-19 infection. In this regard, it is important to consider the fact that different allele frequencies observed in association with COVID-19 might vary between racial backgrounds [25], [26], [27], [28]. Thus, bioinformatics models such as peptide binding prediction could potentially be used to obtain more conclusive findings [27], [40]. The present study was subject to multiple methodological limitations, including sample size, uneven age distribution, and existing comorbidities in some of the patients. Due to the limited sample size, adjustment for age between hospitalized and the non-hospitalized group was not possible. Furthermore, haplotype frequency calculation resulted in many single count haplotypes due to the small sample size. Thus, risk haplotypes could not be discussed. The UAE population has been shown to be admixed [41]. However, controlling for population stratification was not done in this study due to the small sample size. Nevertheless, HLA allelic associations must be analyzed with caution due to the number of alleles under investigation, as there is a high risk of obtaining a significant allele by chance. Therefore, adjustment for multiple comparisons is necessary. Most HLA allelic associations lost significance after statistical correction for multiple testing in this study. Similarly, a large study on 835 patients from Italy and 775 patients from Spain revealed no significant allele associations with either COVID-19 disease occurrence or severity after statistical adjustment [42]. A study by Littera, et al. [43] on 182 Sardinian COVID-19 patients showed no substantial differences in HLA alleles between subjects and the control group [37]. Conversely, in this study, several interesting and clinically relevant observations can be made from the statistical comparison between the hospitalized and non-hospitalized groups. Among the most frequent alleles (subtype) in this cohort is HLA-B*35 (13%) (Including HLA-B*35:01/02/03/05/08/36) and HLA-C*07 (Including HLA-C*07:01/02/10/18/23/29/31) (30%). Similarly, in a recent report to characterize the COVID-19 disease in the UAE, our group observed higher than expected frequencies of HLA-B*35 and HLA-C*07 in infected patients with ethnic backgrounds from South Asia. The same allele, HLA-B*35, has been identified in other viral respiratory diseases, including influenza A (H3N2) in Northeast India [15] and the development of isolated pulmonary hypertension in patients with scleroderma in Europeans [44]. Additionally, more than half of the HLA-B*35 carriers (60%) in the current cohort carried this allele in the haplotype HLA-B*35 ~ C*04 (6%). In a study by our group, the same haplotype was the most frequent in a multiethnic cohort and was observed in 11 out of the 15 nationalities (Tay et al., 2021 submitted). To increase the power and confidence of HLA viral association studies, it is important to combine HLA typing with case-control studies and T cell binding assays in addition to systematic prediction of the peptide-binding groove of different alleles [27], [40]. On this basis, genotypic data from this study were compared with theoretical viral peptide-binding predictions using in silico analysis methods. Bioinformatic peptide-binding models of HLA class I have been successfully employed in multiple SARS-CoV-2 studies [26], [45], [39], [46]. Unfortunately, HLA class II peptide binding prediction models still have problems, and that is why they have not been discussed in this study [27]. In theory, those who present alleles that are predicted to have a high affinity to bind to viral peptides would only develop mild symptoms of the disease, as their HLA molecules will be more efficient in presenting the peptides to the immune system. Greater susceptibility to severe SARS-CoV-2 infection was observed in subjects who carried the class I allele HLA-B*51:01 (P < 0.0001). Interestingly, HLA-B*51:01 is the second common HLA-B allele in the general population of Abu Dhabi with a frequency as high as 13.5% after HLA-B*50:01 [19]. However, no patient from the hospitalized group carried this allele, whilst it was frequent in the non-hospitalized cohort. Nevertheless, our SARS-CoV-2 peptide binding prediction showed that HLA-B*51:01 binding capacity is limited to 205 for SB and 652 for WB. Although some studies have shown a link between the peptide binding affinity and the severity of the disease [26], [39], [46], [47], this study did not reveal a clear link between the degree of affinity between individual HLA molecules and the evolution of the disease in this study. Similarly, other reports did not confirm a link between in silico peptide binding prediction and genotypic data of patients [45]. NetMHCpan 4.1 server [35] is a validated algorithm for T-cell epitope prediction and has been recently updated. A combination of binding affinity thresholds that offers the best precision and specificity was used [35], [48]. However, it is important to acknowledge the limitation of this computational method. The peptide binding recognition is complex and is affected by many factors, including the relative expression of individual viral proteins [48]. Furthermore, there is currently limited data on experimentally determined SARS-CoV-2 immunogenic T-cell epitope and peptide HLA restriction [48], and our analysis relied on the available information in published literature. Iturrieta-Zuazo, et al. [27] suggested that the degree of HLA homozygosity could play a role in the severity of the SARS-CoV-2 infection as their results showed a higher percentage of HLA homozygosity in Spanish patients with severe infection in loci A and C when compared to moderate and mild patients [27]. This is based on the evidence that individuals with heterozygosity at certain HLA loci may have better resistance to infectious diseases when compared to the corresponding homozygotes [49]. A study of 6,311 HIV-1 infected individuals reported better resistance of HLA-B heterozygosity against HIV-1 [17]. The study suggested that heterozygosity in HLA alleles increases the probability of carrying the most protective HLA alleles [5]. Similarly, a statistically significant association between COVID-19 severity and homozygosity at HLA-A and HLA-B (Fig. 3) was observed in this study. In conclusion, the present study investigated the association of HLA allele and haplotype variations in Emirati patients with respect to COVID-19 severity. Alleles HLA-B*51:01, HLA-A*26:01 were significantly higher in the non-hospitalized patients, suggesting a protective association, whilst genotypes HLA-A*03:01, HLA-DRB1*15:01, and supertype B44 showed a significant correlation with disease severity. Overall, we could not establish a clear link between in silico peptide binding prediction of HLA class I and clinical outcome. Suffice to say, many other genetic and epigenetic factors independent of the HLA could play a role in the immunopathogenesis of SARS-CoV-2. Despite some shortcomings, this study contributes to the global effort to understand the immunological pathway of SARS-CoV-2 severity and occurrence. Further investigation with a larger cohort and diverse populations is necessary to clarify the role of HLA in SARS-CoV-2 infection. This will further assist in developing specific vaccination and personalizing treatments.

Availability of data

The data that support the findings of this study are available on request from the corresponding author.

The UAE COVID-19 Collaborative Partnership

Juan Acuna, Ernesto Damiani, Samuel Feng, Andreas Henschel, Abdulrahim Sajini, Ahmed Yousef (Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates); Bassam Ali, Farah Mustafa, Tahir A. Rizvi (United Arab Emirates University, Al Ain, United Arab Emirates); Hiba Alhumaidan, Hala Imambabaccus, Stefan Weber (Sheikh Khalifa Medical City and SEHA, Abu Dhabi, United Arab Emirates); Mohammad Tahseen Al Bataineh, Rabih Halwani, Rifat Akram Hamoudi (University of Sharjah, Sharjah, United Arab Emirates); Abdulmajeed Al Khajeh, Laila Salameh (Dubai Health Authority, Dubai, United Arab Emirates) Braulio Peramo (Al Ain Fertility Center, Al Ain, United Arab Emirates).

Author contribution

H.A.; a graduate student at Khalifa University; was primarily responsible for managing this project and developed the objectives with her advisors, G.K.T. and H.S.S. H.A. collated the data and performed the analyses outlined in this study. H.A. and G.K.T. analyzed, and constructed the figures, with H.A. preparing the first draft of the manuscript. H.A. and S.E carried out the laboratory assays used in the study at the Center for Biotechnology. M.U. and N.A were responsible for the recruitment of the patients who contributed data to the study by collecting demographic data. H. F. J., A. F., E. A., S. E., A.T. S., M. M. A. H., B. H. M., M. U., N. A. provided critical review during manuscript preparation. All authors on the primary list contributed to the data interpretation or critically reviewed the manuscript and approved the final manuscript for submission. All investigators of the UAE COVID-19 Collaborative Partnership provided comments on the study in some form or were involved in developing overall research questions for a larger project that was conceived to study the role of the virus and host in COVID-19 in the United Arab Emirates.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  46 in total

1.  Association between human leukocytes antigen alleles and chronic hepatitis C virus infection in the Korean population.

Authors:  Seung Kew Yoon; Joon Yeol Han; Chul-Woo Pyo; Jin Mo Yang; Jeong Won Jang; Chang Wook Kim; U Im Chang; Si Hyun Bae; Jong Young Choi; Kyu Won Chung; Hee Sik Sun; Hee Baeg Choi; Tai-Gyu Kim
Journal:  Liver Int       Date:  2005-12       Impact factor: 5.828

2.  KIR and a specific HLA-C gene are associated with susceptibility and resistance to hepatitis B virus infection in a Brazilian population.

Authors:  Patricia Araujo; Giovana Gonçalves; Flavia Latini; Orlando Ferreira; Luis Cristovão Porto; Jose Augusto Barreto; Manoel João Castelo Girao; Ricardo Sobhie Diaz
Journal:  Cell Mol Immunol       Date:  2014-05-19       Impact factor: 11.530

3.  Hla-Cw7 allele as predictor of favorable therapeutic response to interferon-alpha in patients with chronic hepatitis C.

Authors:  Ivo Ivić; Nikola Bradarić; Neira Puizina-Ivić; Dragan Ledina; Boris Luksić; Roko Martinić
Journal:  Croat Med J       Date:  2007-12       Impact factor: 1.351

4.  Associations between MHC class I and susceptibility to HIV-2 disease progression.

Authors:  Khady Diouf; Abdoulaye Dieng Sarr; Geoffrey Eisen; Stephen Popper; Souleymane Mboup; Phyllis Kanki
Journal:  J Hum Virol       Date:  2002 Jan-Feb

5.  Allele frequency net 2015 update: new features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations.

Authors:  Faviel F González-Galarza; Louise Y C Takeshita; Eduardo J M Santos; Felicity Kempson; Maria Helena Thomaz Maia; Andrea Luciana Soares da Silva; André Luiz Teles e Silva; Gurpreet S Ghattaoraya; Ana Alfirevic; Andrew R Jones; Derek Middleton
Journal:  Nucleic Acids Res       Date:  2014-11-20       Impact factor: 19.160

6.  Human Leukocyte Antigen Susceptibility Map for Severe Acute Respiratory Syndrome Coronavirus 2.

Authors:  Austin Nguyen; Julianne K David; Sean K Maden; Mary A Wood; Benjamin R Weeder; Abhinav Nellore; Reid F Thompson
Journal:  J Virol       Date:  2020-06-16       Impact factor: 5.103

7.  Association of HLA Class I Genotypes With Severity of Coronavirus Disease-19.

Authors:  Maxim Shkurnikov; Stepan Nersisyan; Tatjana Jankevic; Alexei Galatenko; Ivan Gordeev; Valery Vechorko; Alexander Tonevitsky
Journal:  Front Immunol       Date:  2021-02-23       Impact factor: 7.561

8.  Effect of human leukocyte antigen heterozygosity on infectious disease outcome: the need for allele-specific measures.

Authors:  Marc Lipsitch; Carl T Bergstrom; Rustom Antia
Journal:  BMC Med Genet       Date:  2003-01-24       Impact factor: 2.103

9.  Estimating the Binding of Sars-CoV-2 Peptides to HLA Class I in Human Subpopulations Using Artificial Neural Networks.

Authors:  Caterina A M La Porta; Stefano Zapperi
Journal:  Cell Syst       Date:  2020-09-10       Impact factor: 10.304

Review 10.  Genetic variants of the human host influencing the coronavirus-associated phenotypes (SARS, MERS and COVID-19): rapid systematic review and field synopsis.

Authors:  Emilio Di Maria; Andrea Latini; Paola Borgiani; Giuseppe Novelli
Journal:  Hum Genomics       Date:  2020-09-11       Impact factor: 4.639

View more
  2 in total

1.  Distribution of HLA-A, -B, -C, -DRB1, -DQB1, -DPB1 allele frequencies in patients with COVID-19 bilateral pneumonia in Russians, living in the Chelyabinsk region (Russia).

Authors:  Tatiana A Suslova; Mikhail N Vavilov; Svetlana V Belyaeva; Alexander V Evdokimov; Daria S Stashkevich; Alexander Galkin; Ilya A Kofiadi
Journal:  Hum Immunol       Date:  2022-04-28       Impact factor: 2.211

Review 2.  The association of human leucocyte antigen (HLA) alleles with COVID-19 severity: A systematic review and meta-analysis.

Authors:  Zorana Dobrijević; Nikola Gligorijević; Miloš Šunderić; Ana Penezić; Goran Miljuš; Sergej Tomić; Olgica Nedić
Journal:  Rev Med Virol       Date:  2022-07-12       Impact factor: 11.043

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.