| Literature DB >> 29760718 |
Kim N T Ton1, Simone L Cree1, Sabine J Gronert-Sum2, Tony R Merriman3, Lisa K Stamp4, Martin A Kennedy1.
Abstract
The human leukocyte antigen (HLA) system encodes the human major histocompatibility complex (MHC). HLA-B is the most polymorphic gene in the MHC class I region and many HLA-B alleles have been associated with adverse drug reactions (ADRs) and disease susceptibility. The frequency of such HLA-B alleles varies by ethnicity, and therefore it is important to understand the prevalence of such alleles in different population groups. Research into HLA involvement in ADRs would be facilitated by improved methods for genotyping key HLA-B alleles. Here, we describe an approach to HLA-B typing using next generation sequencing (NGS) on the MinION™ nanopore sequencer, combined with data analysis with the SeqNext-HLA software package. The nanopore sequencer offers the advantages of long-read capability and single molecule reads, which can facilitate effective haplotyping. We developed this method using reference samples as well as individuals of New Zealand Māori or Pacific Island descent, because HLA-B diversity in these populations is not well understood. We demonstrate here that nanopore sequencing of barcoded, pooled, 943 bp polymerase chain reaction (PCR) amplicons of 49 DNA samples generated ample read depth for all samples. HLA-B alleles were assigned to all samples at high-resolution with very little ambiguity. Our method is a scaleable and efficient approach for genotyping HLA-B and potentially any other HLA locus. Finally, we report our findings on HLA-B genotypes of this cohort, which adds to our understanding of HLA-B allele frequencies among Māori and Pacific Island people.Entities:
Keywords: HLA-B; Māori; Pacific Island; Polynesian; nanopore sequencing; pharmacogenetics
Year: 2018 PMID: 29760718 PMCID: PMC5936980 DOI: 10.3389/fgene.2018.00152
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Primers used for amplifying and sequencing.
| SBT | HLA-B_1710_F | TGTCGGGTCCTTCTTCCAGG | Abbott et al., |
| HLA-B_1710_R | GAAAATTCAGGCGCTTT | ||
| HLA-B_943_F | GCAGGCGGGGGCGCAGGACC | Cotton et al., | |
| HLA-B_943_R | GGAGATGGGGAAGGCTCCCCACT | ||
| HLA-B_Seq1 | GGAGCCGCGCCGGGAGGAGGGTC | ||
| HLA-B_Seq2 | GGATGGGGAGTCGTGACCT | ||
| HLA-B_Seq3 | ACKG | ||
| HLA-B_Seq4 | CGGGGTCACTCACCG | ||
| HLA-B_Seq5 | GG | ||
| HLA-B_Seq6 | ACTGCCCCTGGTACC | ||
| MinION | HLA-B_MinION_F | TTTCTGTTGGTGCTGATATTGCGGGAGGAG | |
| HLA-B_MinION_R | ACTTGCCTGTCGCTCTATCTTCGGAGGCCATCCCCGGCGACCTAT |
Underlined letters are IUPAC codes indicating base redundancies at positions corresponding to known HLA-B variants.
Figure 1Agarose gel electrophoresis of two representative PCR amplicons (a) before and (b) after PCR barcoding step. DNA molecular size marker obtained from KAPA Universal Ladder (KAPA Biosystems, Boston, MA).
DNA quantity and number of mapped reads of poorly amplified samples.
| PI_G2 | BC15 | 66.84 | 473 |
| PI_C3 | BC19 | 20.4 | 80 |
| PI_D3 | BC20 | 57.96 | 869 |
| PI_H3 | BC22 | 64.44 | 333 |
| PI_C6 | BC42 | 33.36 | 158 |
Figure 2Histogram of read length with read N50 metric.
Figure 3Bivariate plot shows with a kernel density estimate the read length compared to the average read basecall Phred quality.
Figure 4Number of mapped and unmapped reads per individual. Blue bars indicate the assigned reads that are mapped to the regions of interest. Red bars are the aligned reads that pass the filter and are used for allele assignment.
Assignment result of HLA-B obtained from MinION sequencing and SBT.
| NA16688 | Ch | B*15:07:01G | *35:01:01G | ||
| NA17019 | Ch | B*15:02:01G | *15:11:01G | B*15:02:01G | *15:11:01G |
| NA17240 | C | B*07:02:01G | *57:01:01G | B*07:02:01G | *57:01:01G |
| NA19834 | A | B*35:01:01G | *39:10:01 | ||
| UDRUGS01 | C | B*40:02:01G | *44:02:01G | B*40:02:01G | *44:02:01G |
| UDRUGS02 | C | B*15:01:01G | *44:02:01G | B*15:01:01G | *44:02:01G |
| UDRUGS29 | C | B*18:01:01G | *27:09 | B*18:01:01G | *27:09 |
| UDRUGS41 | C | B*15:01:01G | *51:01:01G | B*15:01:01G | *51:01:01G |
| UDRUGS44 | C | B*15:01:01G | *44:03:01G | B*15:01:01G | *44:03:01G |
| PI_A1 | 0.25 NM 0.75 C | B*37:01:01G | *39:01:01G | B*37:01:01G | *39:01:01G |
| PI_A2 | 0.5 NM 0.5 C | B*27:05:02G | *57:01:01G | B*27:05:02G | *57:01:01G |
| PI_A3 | 0.5 CM 0.5 P | B*39:05:01 | *52:01:01G | B*39:05:01 | *52:01:01G |
| PI_A5 | 0.125 NM 0.25 P 0.625 C | B*08:01:01G | *35:01:01G | ||
| PI_B1 | 1.0 S | B*46:01:01G | *55:02:01G | ||
| PI_B2 | 0.5 NM 0.5 C | B*07:02:01G | *57:01:01G | ||
| PI_B3 | U | B*27:07:01G | *35:03:01G | ||
| PI_B4 | 0.333 NM 0.667 P | B*40:01:01G | *44:02:01G | B*40:01:01G | *44:02:01G |
| PI_B5 | 0.5 NM 0.5 C | B*07:02:01G | *55:02:01G | ||
| PI_B6 | 1.0 S | B*40:01:01G | *56:02:01 | ||
| PI_C1 | 0.5 NM | B*44:02:01G | *55:02:01G | ||
| PI_C2 | U | B*52:02:01 | *52:02:01 | ||
| PI_C3 | 0.25 NM 0.25 C | B*44:04 | *56:02:01 | ||
| PI_C4 | 0.25 NM 0.75 C | B*08:156 | *42:01:01 | ||
| PI_C5 | 0.25 NM 0.75 N | B*55:01:01G | *55:02:01G | ||
| PI_C6 | 0.25 NM 0.75 C | B*44:02:01G | *55:01:01G | ||
| PI_D1 | 0.5 NM 0.5 C | B*15:01:01G | *49:01:01G | ||
| PI_D2 | 0.125 NM 0.875 C | B*08:01:01G | *44:03:01G | ||
| PI_D3 | 0.25 NM 0.75 C | B*40:01:01G | *51:01:01G | ||
| PI_D4 | 0.5 NM 0.5 C | B*13:02:01G | *39:01:01G | ||
| PI_D5 | 0.25 NM 0.5 N 0.25 C | B*40:01:01G | *40:01:01G | ||
| PI_D6 | 0.375 NM 0.125 C | B*35:60 | *56:09 | ||
| PI_E1 | 0.5 NM 0.5 C | B*35:03:01G | *40:01:01G | ||
| PI_E2 | 0.5 S 0.5 N | B*56:02:01 | *56:02:01 | ||
| PI_E4 | 0.5 NM 0.5 C | B*14:02:01G | *56:01:01G | ||
| PI_E5 | 0.5 NM 0.5 S | B*39:01:01G | *55:01:01G | ||
| PI_F1 | 0.5 NM 0.5 C | B*44:02:01G | *50:01:01G | ||
| PI_F2 | 0.75 NM 0.25 C | B*07:02:01G | *40:01:01G | ||
| PI_F3 | 0.125 CM 0.875 C | B*18:01:01G | *44:02:01G | ||
| PI_F4 | 0.5 NM 0.5 C | B*44:02:01G | *43:03:01G | ||
| PI_F5 | 0.125 NM 0.875 C | B*08:01:01G | *40:01:01G | ||
| PI_F6 | 0.25 NM 0.75 C | B*35:01:01G | *48:01:01G | ||
| PI_G1 | 0.5 NM 0.5 C | B*15:17:01G | *37:01:01G | ||
| PI_G2 | 1.0 U | B*07:02:01G | *53:17:02 | ||
| PI_G4 | 0.125 NM 0.875 C | B*07:02:01G | *07:02:01G | ||
| PI_G5 | 0.25 NM 0.25 C | B*35:01:01G | *40:01:01G | ||
| PI_G6 | 0.333 NM 0.583 N 0.083 C | B*40:01:01G | *44:03:01G | ||
| PI_H1 | 0.5 NM 0.5 C | B*27:10 | *55:01:01G | ||
| PI_H2 | 0.5 NM 0.5 C | B*07:02:01G | *40:10:01G | ||
| PI_H3 | 1.0 NM | B*40:01:01G | *48:01:01G | ||
A, African; Ch, Chinese; C, Caucasian; NM, NZ Maori; CM, Cook Island Maori; S, Samoan; N, Niuean; P, others; U, unknown.
Samples selected for validation.
Samples required manual allele assignment.
Figure 5SeqPilot screenshot from mapping sequencing reads with reference sequences. Bias phasing position (black box) followed by deletion errors.
Figure 6SeqPilot screenshot from analysis of sample PI_A2. This illustrates the ambiguous results of sample PI_A2 with several possible alleles and their corresponding mismatched locations. HLA-B*27:05:02G was not confidently called due to two mismatches at nucleotide position 133 and 136 of exon 2. The software provides possible allele pairs with number of mismatches and their corresponding positions (red box). The mismatched nucleotides from the assigned allele are shown in each sequence read (ellipses).
Figure 7Comparison of consensus sequences of NA17240 (BC47) obtained from Sanger sequencing with those from Nanopolish. Yellow bar indicates the region where mismatched alignment happens (nt 130-136). HLA-B*07:02:01 is the reference sequence; polished.B0702 and polished B5701 are consensus sequences generated by Nanopolish. Data was visualized on Geneious v.9.1.5 (Biomatters Ltd., New Zealand).