| Literature DB >> 36229378 |
Vishal Rao1, Nagasuma Chandra2.
Abstract
Differences in outcome to COVID-19 infection in different individuals is largely attributed to genetic heterogeneity leading to differential immune responses across individuals and populations. HLA is one such genetic factor that varies across individuals leading to differences in how T-cell responses are triggered against SARS-CoV-2, directly influencing disease susceptibility. HLA alleles that influence COVID-19 outcome, by virtue of epitope binding and presentation, have been identified in cohorts worldwide. However, the heterogeneity in HLA distribution across ethnic groups limits the generality of such association. In this study, we address this limitation by comparing the recognition of CTL epitopes across HLA genotypes and ethnic groups. Using HLA allele frequency data for ethnic groups from Allele Frequency Net Database (AFND), we construct synthetic populations for each ethnic group and show that CTL epitope strength varies across HLA genotypes and populations. We also observe that HLA genotypes, in certain cases, can have high CTL epitope strengths in the absence of top-responsive HLA alleles. Finally, we show that the theoretical estimate of responsiveness and hence protection offered by a HLA allele is bound to vary across ethnic groups, due to the influence of other HLA alleles within the HLA genotype on CTL epitope recognition. This emphasizes the need for studying HLA-disease associations at the genotype level rather than at a single allele level.Entities:
Keywords: COVID-19; CTL epitopes; Ethnic group; HLA; Population
Year: 2022 PMID: 36229378 PMCID: PMC9550298 DOI: 10.1016/j.humimm.2022.09.008
Source DB: PubMed Journal: Hum Immunol ISSN: 0198-8859 Impact factor: 2.211
Top and bottom 10 responding HLA alleles, genotypes and ethnic groups. Response of HLA alleles is measured as the number of epitopes recognized. Response of HLA genotypes and ethnic groups are measured in terms of CTLi and CTLpop respectively. Since CTLpop is the CTL epitope strength averaged over the entire population, it is rounded off to the nearest integer value.
| A*02:11 | 353 | A*25:04 | 1 |
| A*02:50 | 325 | B*38:12 | 1 |
| B*15:62 | 324 | B*38:20 | 1 |
| A*02:22 | 323 | A*66:01 | 1 |
| A*02:104 | 323 | B*27:14 | 1 |
| A*02:122 | 280 | B*40:110 | 1 |
| C*12:19 | 272 | B*51:93 | 1 |
| B*15:156 | 270 | B*51:64 | 1 |
| B*15:132 | 270 | C*02:17 | 1 |
| B*15:03 | 270 | B*57:07 | 1 |
| A*68:01|A*02:05|B*15:03|B*41:01|C*14:03|C*07:01 | 699 | A*66:01|A*66:01|B*58:02|B*37:01|C*04:01|C*04:01 | 1 |
| A*68:01|A*02:05|B*15:03|B*41:01|C*14:03|C*02:02 | 698 | A*74:01|A*74:01|B*14:02|B*58:02|C*04:01|C*04:01 | 1 |
| A*02:03|A*11:01|B*15:25|B*55:02|C*14:02|C*03:04 | 691 | A*74:01|A*74:01|B*15:10|B*58:02|C*04:01|C*04:01 | 1 |
| A*02:03|A*11:01|B*15:25|B*13:01|C*14:02|C*03:04 | 682 | A*74:01|A*74:01|B*49:01|B*58:02|C*04:01|C*04:01 | 1 |
| A*02:03|A*11:01|B*15:25|B*51:01|C*14:02|C*03:04 | 682 | A*66:01|A*66:01|B*58:02|B*58:02|C*04:01|C*04:01 | 1 |
| A*02:03|A*11:01|B*15:25|B*46:01|C*14:02|C*03:04 | 680 | A*66:01|A*74:01|B*58:02|B*58:02|C*04:01|C*04:01 | 1 |
| A*68:01|A*68:02|B*15:03|B*41:01|C*14:03|C*07:01 | 677 | A*66:01|A*66:01|B*37:01|B*37:01|C*04:01|C*04:01 | 1 |
| A*68:01|A*68:02|B*15:03|B*41:01|C*14:03|C*02:02 | 676 | A*66:03|A*66:03|B*58:02|B*37:01|C*04:01|C*04:01 | 2 |
| A*02:03|A*11:01|B*58:01|B*40:01|C*14:02|C*03:02 | 675 | A*66:03|A*66:03|B*37:01|B*37:01|C*04:01|C*04:01 | 2 |
| A*02:22|A*68:01|B*15:39|B*35:05|C*03:04|C*04:01 | 669 | A*66:03|A*66:01|B*58:02|B*58:02|C*04:01|C*04:01 | 2 |
| Paraguay Argentina Ache NA-DHS 24 | 417 | Singapore SGVP. Indian INS | 167 |
| India Khandesh Region Pawra | 398 | Colombia North Wiwa El Encanto | 166 |
| Colombia Waunana NA-DHS 20 (G) | 378 | Singapore Javaneses | 162 |
| Paraguay Argentina Guarani NA-DHS 23 (G) | 372 | Colombia North Chimila Amerindians | 162 |
| Brazil Terena | 358 | Papua New Guinea Madang | 161 |
| Costa Rica Guaymi NA-DHS 10 (G) | 342 | Malaysia Peninsular Malay | 160 |
| Colombia Inga NA-DHS 11 (G) | 331 | Georgia Tibilisi Kurd | 158 |
| Colombia Zenu NA-DHS 18 (G) | 327 | Singapore Riau Malay | 155 |
| Mexico Oaxaca Mixe | 326 | India West Coast Parsi | 153 |
| Mali Bandiagara | 309 | USA NMDP Southeast Asian | 151 |
| Paraguay Argentina Ache NA-DHS 24 | 417 | Singapore SGVP. Indian INS | 167 |
Fig. 1Modeling CTL epitope strength across individuals(CTL (A) Using the input HLA allele frequency data from each ethnic group annotated in the Allele Frequency Net Database (AFND), hypothetical genotypes (individuals) were constructed by defining a threshold frequency beyond which a particular HLA genotype which is a combination of 6 HLA alleles is considered to be a member of the population. The frequency was then scaled with respect to the least frequent genotype in the population so that the scaled frequency represented the number of individuals with the particular genotype. (B) A barplot representing the number of HLA genotypes present in each ethnic group. (C) CTLi across global HLA genotypes. The top, middle and bottom 1% HLA genotypes, sorted based on their CTLi, are marked as ‘high’, ‘medium’ and ‘low’ responding genotypes, respectively. (D) A bubble plot representing the frequency of occurrence of the top 20 responding HLA alleles in the low, medium and high responding HLA genotypes.
Fig. 2Comparing CTL epitope strength across ethnic groups (CTL (A) CTL epitope strength for an ethnic group (CTLpop) was computed as the sum of epitopes recognized by each individual within the synthetic population (CTLi), weighted by the scaled frequency of occurrence of genotype i. This was divided by the number of individuals within the population to obtain CTLpop. (B-G) Pairs of representative ethnic groups for (B-C) low, (D-E) medium and (F-G) high responders were considered and the variation of CTLi within these ethnic groups across individuals is shown. From each of these ethnic groups, the 5 highest and 5 lowest responding HLA genotypes (dotted arrows) are represented below. Each circle represents a HLA allele and a string of 6 circles represents a HLA genotype. High-responding HLA alleles (>200 epitopes) are shaded in green while low-responding ones (1 epitope) are shaded in orange. Blank circles represent HLA alleles that show intermediate response, as they neither belong to the highest nor the least responding HLA allele groups. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)