| Literature DB >> 33527770 |
Christin Paech1, Viviane Albrecht1, Kathrin Putke1, Gerhard Schöfl1, Bianca Schöne1, Alexander H Schmidt1,2, Vinzenz Lange1, Anja Klussmeier1.
Abstract
HLA-E is a member of the nonclassical HLA class Ib genes. Even though it is structurally highly similar to the classical HLA class Ia genes, it is less diverse and only 45 alleles and 12 proteins were known in December 2019 (IPD-IMGT/HLA, release 3.38.0). Since 2017, we have genotyped over 3 million voluntary stem cell donors for HLA-E by sequencing the most relevant allele-determining bases of exons 2 and 3. As expected, most donors harbor the two predominant alleles HLA-E*01:01 and/or HLA-E*01:03. However, in 1666 (0.05%) of our samples we detected 345 distinct novel HLA-E sequences. The most frequent one was identified in 162 samples and has by now been named HLA-E*01:114. To characterize these novel alleles in full-length, we used both short-read Illumina and long-read PacBio sequencing to obtain fully phased and highly accurate sequences. This resulted in 234 submissions to IPD-IMGT/HLA comprising 170 novel HLA-E alleles, which encode for 93 novel HLA-E proteins, as well as 64 confirmations or sequence extensions. Consequently, the number of HLA-E alleles in the database (release 3.42.0) has now increased to 256 HLA-E alleles and 110 HLA-E proteins.Entities:
Keywords: HLA-E; genotyping; next-generation sequencing; novel allele
Mesh:
Substances:
Year: 2021 PMID: 33527770 PMCID: PMC8247977 DOI: 10.1111/tan.14195
Source DB: PubMed Journal: HLA ISSN: 2059-2302 Impact factor: 4.513
FIGURE 1HLA‐E PCR amplification strategy. Arrows indicate the primer locations for the full‐gene characterization workflow (3.3 kb amplicon, blue) and high‐throughput workflow (535 bp amplicon, red). Exons are depicted as yellow boxes. The location of the HLA‐E*01:01/01:03 distinguishing SNP in exon 3 is colored in green
FIGURE 2HLA‐E sequence submissions to IPD‐IMGT/HLA. A, Submitted HLA‐E sequences include 170 distinct novel alleles (null alleles, novel proteins, synonymous exonic variations, intronic variations) and 64 sequence confirmations or extensions. Sequences with more than one variation are included only once and categorized by their most relevant variation. B, Growth of the IPD‐IMGT/HLA database for HLA‐E alleles and HLA‐E proteins from January 2019 to October 2020. The submissions described in this study are colored in red, submissions from other laboratories are colored in gray
Novel HLA‐E alleles with nonsynonymous alterations
| Allele name | Reference allele | Exon | AA position (mature protein) | Reference > novel AA | AA 107 | Number of observations | Frequency (%) |
|---|---|---|---|---|---|---|---|
| 01:18 | 01:01:01:01 | 3 | 121 | K > N | R | 1 | 0.000016 |
| 01:19 | 01:01:01:01 | 2 | 89 | E > D | R | 3 | 0.000047 |
| 01:20 | 01:01:01:01 | 3 | 98 | M > I | R | 14 | 0.000219 |
| 01:21N | 01:03:02:01 | 3 | 96 | Q > STOP | G | 9 | 0.000141 |
| 01:22 | 01:01:01:01 | 2 | 78 | L > V | R | 5 | 0.000078 |
| 01:23 | 01:03:02:01 | 2 | 82 | R > P | G | 1 | 0.000016 |
| 01:24 | 01:01:01:01 | 3 | 137 | D > E | R | 4 | 0.000063 |
| 01:25N | 01:03:02:01 | 2 | 85 | Y > STOP | G | 4 | 0.000063 |
| 01:26 | 01:01:01:01 | 3 | 99 | H > Q | R | 5 | 0.000078 |
| 01:27 | 01:01:01:01 | 3 | 99 | H > D | R | 6 | 0.000094 |
| 01:28 | 01:01:01:01 | 3 | 111 | R > H | R | 3 | 0.000047 |
| 01:29 | 01:01:01:01 | 3 | 158 | A > G | R | 5 | 0.000078 |
| 01:30 | 01:01:01:01 | 3 | 114 | E > D | R | 2 | 0.000031 |
| 01:31 | 01:01:01:01 | 3 | 124 | L > V | R | 5 | 0.000078 |
| 01:32 | 01:01:01:01 | 3 | 112 | G > R | R | 28 | 0.000438 |
| 01:33 | 01:03:01:01 | 3 | 104 | G > V | G | 3 | 0.000047 |
| 01:34 | 01:03:02:01 | 3 | 108 | R > H | G | 6 | 0.000094 |
| 01:35 | 01:01:01:01 | 3 | 106 | D > G | R | 3 | 0.000047 |
| 01:36 | 01:01:01:01 | 3 | 104 | G > R | R | 4 | 0.000063 |
| 01:37 | 01:03:02:01 | 3 | 115 | Q > L | G | 2 | 0.000031 |
| 01:38 | 01:03:02:01 | 3 | 96 | Q > P | G | 27 | 0.000422 |
| 01:39 | 01:01:01:01 | 3 | 129 | D > N | R | 2 | 0.000031 |
| 01:40 | 01:01:01:01 | 3 | 96 | Q > P | R | 4 | 0.000063 |
| 01:41 | 01:01:01:01 | 3 | 105 | P > S | R | 16 | 0.000250 |
| 01:42 | 01:01:01:01 | 2 | 84 | Y > S | R | 31 | 0.000485 |
| 01:43 | 01:01:01:01 | 2 | 89 | E > K | R | 2 | 0.000031 |
| 01:44 | 01:03:01:01 | 3 | 131 | R > S | G | 2 | 0.000031 |
| 01:45 | 01:01:01:01 | 3 | 138 | T > R | R | 4 | 0.000063 |
| 01:46 | 01:03:02:01 | 3 | 128 | E > K | G | 3 | 0.000047 |
| 01:47 | 01:03:02:01 | 2 | 84 | Y > H | G | 18 | 0.000281 |
| 01:48 | 01:01:01:01 | 3 | 107 | R > K | K | 45 | 0.000704 |
| 01:49 | 01:01:01:01 | 2 | 87 | Q > E | R | 3 | 0.000047 |
| 01:50 | 01:01:01:01 | 2 | 83 | G > D | R | 2 | 0.000031 |
| 01:51 | 01:01:01:01 | 3 | 148 | N > D | R | 11 | 0.000172 |
| 01:52 | 01:03:02:01 | 3 | 98 | M > L | G | 15 | 0.000235 |
| 01:53 | 01:01:01:01 | 3 | 131 | R > C | R | 12 | 0.000188 |
| 01:54 | 01:03:02:01 | 3 | 102 | E > Q | G | 47 | 0.000735 |
| 01:55N | 01:03:02:01 | 3 | 101 | C > STOP | G | 1 | 0.000016 |
| 01:56 | 01:01:01:01 | 2 | 85 | Y > H | R | 6 | 0.000094 |
| 01:57 | 01:03:01:01 | 3 | 106 | D > E | G | 2 | 0.000031 |
| 01:58 | 01:01:01:01 | 3 | 94 | T > A | R | 12 | 0.000188 |
| 01:59 | 01:03:02:01 | 3 | 99 | H > Q | G | 2 | 0.000031 |
| 01:60 | 01:03:02:01 | 3 | 140 | A > V | G | 6 | 0.000094 |
| 01:61 | 01:01:01:01 | 3 | 114 | E > K | R | 2 | 0.000031 |
| 01:62 | 01:01:01:01 | 3 | 122 | D > N | R | 6 | 0.000094 |
| 01:63 | 01:03:02:01 | 3 | 105 | P > T | G | 7 | 0.000109 |
| 01:64 | 01:01:01:01 | 3 | 137 | D > Y | R | 2 | 0.000031 |
| 01:65 | 01:01:01:01 | 3 | 106 | D > E | R | 19 | 0.000297 |
| 01:66 | 01:03:01:01 | 3 | 93 | H > R | G | 34 | 0.000532 |
| 01:67 | 01:01:01:01 | 3 | 110 | L > V | R | 3 | 0.000047 |
| 01:68Q | 01:01:01:01 | 3 | 101 | C > G | R | 19 | 0.000297 |
| 01:69 | 01:03:01:01 | 3 | 95 | L > P | G | 2 | 0.000031 |
| 01:70 | 01:03:02:01 | 2 | 85 | Y > H | G | 3 | 0.000047 |
| 01:71 | 01:01:01:01 | 3 | 102 | E > D | R | 1 | 0.000016 |
| 01:72 | 01:01:01:01 | 3 | 99 | H > Y | R | 13 | 0.000203 |
| 01:73 | 01:03:02:01 | 3 | 118 | Y > C | G | 2 | 0.000031 |
| 01:74 | 01:01:01:01 | 3 | 138 | T > A | R | 3 | 0.000047 |
| 01:75 | 01:03:02:01 | 3 | 131 | R > L | G | 12 | 0.000188 |
| 01:76 | 01:03:02:01 | 3 | 119 | D > N | G | 2 | 0.000031 |
| 01:77 | 01:01:01:01 | 2 | 84 | Y > H | R | 23 | 0.000360 |
| 01:78 | 01:01:01:01 | 3 | 113 | Y > C | R | 2 | 0.000031 |
| 01:79 | 01:01:01:01 | 3 | 123 | Y > F | R | 1 | 0.000016 |
| 01:80 | 01:03:02:01 | 3 | 94 | T > N | G | 2 | 0.000031 |
| 01:81 | 01:01:01:01 | 3 | 149 | D > H | R | 4 | 0.000063 |
| 01:82 | 01:03:02:01 | 3 | 93 | H > P | G | 4 | 0.000063 |
| 01:83 | 01:01:01:01 | 3 | 141 | Q > H | R | 1 | 0.000016 |
| 01:84 | 01:01:01:01 | 3 | 138 | T > K | R | 1 | 0.000016 |
| 01:85 | 01:03:02:01 | 3 | 121 | K > R | G | 2 | 0.000031 |
| 01:86 | 01:03:02:01 | 3 | 108 | R > L | G | 1 | 0.000016 |
| 01:87 | 01:01:01:01 | 3 | 123 | Y > C | R | 3 | 0.000047 |
| 01:88 | 01:01:01:01 | 3 | 107 | R > S | S | 1 | 0.000016 |
| 01:89 | 01:03:02:01 | 2 | 83 | G > S | G | 2 | 0.000031 |
| 01:90 | 01:01:01:01 | 3 | 109 | F > L | R | 1 | 0.000016 |
| 01:91N | 01:03:02:01 | 3 | 113 | Y > STOP | G | 1 | 0.000016 |
| 01:92 | 01:03:01:01 | 3 | 112 | G > R | G | 1 | 0.000016 |
| 01:93 | 01:03:02:01 | 3 | 92 | S > Y | G | 1 | 0.000016 |
| 01:94 | 01:03:02:01 | 2 | 87 | Q > H | G | 2 | 0.000031 |
| 01:95 | 01:03:02:01 | 3 | 156 | Q > H | G | 1 | 0.000016 |
| 01:96 | 01:01:01:01 | 3 | 151 | S > F | R | 2 | 0.000031 |
| 01:97 | 01:01:01:01 | 2 | 83 | G > C | R | 5 | 0.000078 |
| 01:98 | 01:01:01:01 | 3 | 151 | S > P | R | 1 | 0.000016 |
| 01:99 | 01:01:01:01 | 3 | 111 | R > G | R | 1 | 0.000016 |
| 01:100 | 01:01:01:01 | 2 | 79 | R > Q | R | 4 | 0.000063 |
| 01:101 | 01:03:02:01 | 3 | 91 | G > V | G | 1 | 0.000016 |
| 01:102 | 01:01:01:01 | 3 | 121 | K > R | R | 1 | 0.000016 |
| 01:103 | 01:03:01:01 | 2 | 79 | R > W | G | 1 | 0.000016 |
| 01:104 | 01:01:01:01 | 3 | 100 | G > S | R | 1 | 0.000016 |
| 01:105 | 01:01:01:01 | 3 | 102 | E > A | R | 1 | 0.000016 |
| 01:106 | 01:03:02:01 | 3 | 151 | S > C | G | 1 | 0.000016 |
| 01:107 | 01:03:01:01 | 3 | 150 | A > T | G | 1 | 0.000016 |
| 01:108 | 01:03:02:01 | 3 | 99 | H > R | G | 3 | 0.000047 |
| 01:109 | 01:01:01:01 | 3 | 125 | T > A | R | 1 | 0.000016 |
| 01:110 | 01:01:01:01 | 3 | 117 | A > T | R | 5 | 0.000078 |
| 01:111 | 01:03:02:01 | 3 | 125 | T > I | G | 4 | 0.000063 |
| 01:112 | 01:01:01:01 | 3 | 103 | L > R | R | 3 | 0.000047 |
| 01:114 | 01:01:01:01 | 3 | 94 | T > S | R | 162 | 0.002533 |
| 01:115 | 01:01:01:01 | 3 | 144 | E > G | R | 7 | 0.000109 |
| 01:116 | 01:03:02:01 | 2 | 83 | G > R | G | 1 | 0.000016 |
| 01:117N | 01:01:01:01 | 3 | 96 | Q > STOP | R | 3 | 0.000047 |
Notes: Position and nature of the amino acid (AA) change are reported using the respective reference allele. AA 107 defines HLA‐E*01:01‐ or HLA‐E*01:03‐allele groups. Number of observations and frequency values are based on exon 2/3 data. Alleles are reported with two‐field resolution. The data of this table is also available as Table S2.
FIGURE 3Phylogenetic tree. The 250 HLA‐E alleles of IPD‐IMGT/HLA release 3.42.0 with available full‐length sequence information are displayed in a maximum likelihood phylogenetic tree. Colors indicate the amino acid at position 107 of the mature HLA‐E protein