| Literature DB >> 26634332 |
William J Lane1,2, Connie M Westhoff3, Jon Michael Uy1, Maria Aguad1, Robin Smeland-Wagman1, Richard M Kaufman1, Heidi L Rehm1,2,4,5, Robert C Green6,2,5, Leslie E Silberstein7.
Abstract
BACKGROUND: There are 346 serologically defined red blood cell (RBC) antigens and 33 serologically defined platelet (PLT) antigens, most of which have known genetic changes in 45 RBC or six PLT genes that correlate with antigen expression. Polymorphic sites associated with antigen expression in the primary literature and reference databases are annotated according to nucleotide positions in cDNA. This makes antigen prediction from next-generation sequencing data challenging, since it uses genomic coordinates. STUDY DESIGN AND METHODS: The conventional cDNA reference sequences for all known RBC and PLT genes that correlate with antigen expression were aligned to the human reference genome. The alignments allowed conversion of conventional cDNA nucleotide positions to the corresponding genomic coordinates. RBC and PLT antigen prediction was then performed using the human reference genome and whole genome sequencing (WGS) data with serologic confirmation.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26634332 PMCID: PMC5019240 DOI: 10.1111/trf.13416
Source DB: PubMed Journal: Transfusion ISSN: 0041-1132 Impact factor: 3.157
Figure 1Approach for mapping conventional cDNA reference sequence positions to genomic coordinates. (A) Process developed to convert conventional CDS positions to genomic coordinates with FY as an example. (B) CDS positions referenced to cDNA sequence. (C and D) Genome transcript and genomic coordinates according to the human reference genome. (E) UCSC genomic sequence in which each exon and intron is annotated as separate sequence entry preceded by the genomic coordinates. The sequence regions are colored: 3′ and 5′ (gray), CDS (green uppercase), and intron (blue lowercase). (F) FY gene conversion between genomic coordinates and cDNA reference sequence.
FY alleles, cDNA and genomic coordinates, bases(s) found, WGS coverage, and result
| Allele | CDS | Gene | Genome | Base(s) found | Coverage | Result |
|---|---|---|---|---|---|---|
|
| c.−67 | g.881 | chr1:159,174,683 | T | 28× | Absent |
|
| c.125 | g.1552 | chr1:159,175,354 | G/A | 15/15× | Fy(a+b+) |
|
| c.145 | g.1572 | chr1:159,175,374 | G | 24× | Absent |
|
| c.265 | g.1692 | chr1:159,175,494 | C | 27× | Absent |
|
| c.281_295 | g.1708_1722 | chr1:159,175,510_chr1:159,175,524 | CTGGCT GGCCTGTCC | 31‐36× | Absent |
|
| c.287 | g.1714 | chr1:159,175,516 | G | 34× | Absent |
|
| c.298 | g.1725 | chr1:159,175,527 | G | 32× | Absent |
|
| c.327 | g.1754 | chr1:159,175,556 | C | 37× | Absent |
|
| c.407 | g.1834 | chr1:159,175,636 | G | 27× | Absent |
|
| c.408 | g.1835 | chr1:159,175,637 | G | 28× | Absent |
Differences encountered when aligning the conventional cDNA with the human reference genome (shown as conventional reference > reference genome)
| Symbol | Gene | CDS nucleotide (genomic coordinate) [amino acid] | Differences |
|---|---|---|---|
| ABO |
| c.261delG (no genomic coordinate) |
Genome: inactive enzyme |
| MNS |
| c.38(chr4:145,041,741)C>A [p.Ala13Glu]; c.59(chr4:145,041,720)C>T [p.Ser20Lue], c.71(chr4:145,041,708)G>A, c.72(chr4:145,041,707)T>G [p.Gly24Glu]; c.93(chr4:145,041,686)C>T [p.Thr31Thr] |
Probable nonrelevant missense change in cleaved N‐term; |
| MNS |
| c.251(chr4:144,918,712)C>G [p.Thr84Ser] | Presumed non‐relevant missense change c.251G is part of the S−s−U+w [GYPB.NY] allele which has additional changes c.208G>T and c.230C>T not present in the reference genome |
| P1PK |
| Two different cDNA reference sequences | |
| RH |
| c.1136(chr1:25,643,553)C>T [p.Thr379Met] | Genome: common African black allele |
| RH |
| c.48(chr1:25,747,230)G>C [p.Trp16Cys] | Genome: common African black allele |
| LU |
| c.1615(chr19:45,322,744)G>A [p.Ala539Thr] | Antigenic difference: Au(a−b+) > Au(a+b−) |
| LE |
| c.202(chr19:5,844,649)T>C [p.Trp68Arg], c.314(chr19:5,844,537)C>T [p.Thr105tMet] | Genome: inactive enzyme associated with a Le(a−b−) phenotype |
| FY |
| Conventional: reference is alternative splice form with different numbering than the alleles | |
| DI |
| c.357(chr17:42,337,900)T>C [p.Val119Val] | Silent change |
| YT |
| Conventional reference is alternative splice form with different numbering and splice form was not deposited into GenBank | |
| DO |
| c.378(chr12:14,993,854)C>T [p.Tyr126Tyr], c.624(chr12:14,993,608)T>C [p.Leu208Leu]; c.793(chr12:14,993,439)A>G [p.Asn265Asp] |
Silent changes; |
| H |
| Conventional reference is numbered to alternative splice form rather than the deposited long isoform | |
| KN |
| c.4828(chr1:207,782,916)T>A [p.Ser1610Thr] | Genome: rare Sl3– phenotype |
| IN |
| c.326(chr11:35,201,913)A>C [p.Tyr109Ser] | Found in association with rare In(a+b−) phenotype but with additional changes c.137G>C p.Arg46Pro and c.716G>A p.Gly239Glu. It is unclear if c.326A>C alone can lead to antigenic change. |
| OK |
| c.537(chr19:581,407)T>C [p.Asp179Asp] | Silent change |
| RAPH |
| c.579(chr11:837,582)A>G [p.Gly193Gly] | Silent change |
| I |
| c.816(chr6:10,587,038)G>C [p.Glu272Asp] | Genome: uncommon allele with unclear phenotype |
| GIL |
| c.61(chr9:33,447,468)T>C [p.Leu21Leu], c.105(chr9:33,447,424)C>G [p.Leu35Leu], c.390(chr9:33,442,952)T>C [p.Phe130Phe], c.543(chr9:33,442,466)T>C [p.Pro181Pro] | Silent changes |
| RHAG |
| c.724(chr6:49,582,483)G>A [p.Asn242Asp] | Conventional reference sequence sequencing error |
Figure 2WGS‐based RBC and PLT gene sequencing. Circos plot 61 of the WGS data that has been filtered to only show the RBC and PLT genes with a circular plot of the sequence coverage (100‐bp bins).
Comprehensive RBC antigen prediction from a patient's whole genomea
| System | Gene | Average coverage | Phenotypes |
|---|---|---|---|
| 001 ABO |
| 27×, 31× | A1 |
| 002 MNS |
| 41× | M+N+, Vr−, Mt(a−), Ri(a−), Ny(a−), Or−, ERIK−, Os(a−), ENEP+, ENEH+, ENAV+, ENEV+, MNTD− |
|
| 45× | S+s+, U+, En(a+), He−, Mi(a−), Mur−, Mv−, s(D−), Mit−, Dantu− | |
| 003 P1PK |
| 29× | P1+/P1−, pk+, NOR− |
| 004 RH | RHD | 34× | D+, Tar− |
|
| 33× | C−c+E+e−, CW−, CX−, EW−, V−, VS−, Rh26+LOCR−, Be(a−), DAK−, Go(a−), Rh32−, Crawford−CELO+, JAL−CEST+, STEM−, JAHK− | |
| 005 LU |
| 24×, 21×, 25× | Lu(a−b+), LURC+, Lu4+, Lu5+, Lu6+, Lu7+, Lu8+, Lu13+, Lu16+, Lu17+, Au(a+b−), Lu20+, Lu21+ |
| 006 KEL |
| 33× | K−k+, Kp(a−b+c−), Js(a−b+), Ul(a−), K11+, K12+, K13+, K14+, K18+, K19+, K22+, K23−, VLAN−VONG−, TOU+, RAZ+, KALT+, KTIM+, KYO−, KUCI+, KANT+, KASH+, KELP+, KETI+, KHUL+ |
| 007 LE |
| 28×, 28×, 27×, 23× | Le(a+b−) |
| 008 FY |
| 28× | Fy(a+b+) |
| 009 JK |
| 37× | Jk(a+b+) |
| 010 DI |
| 26× | Di(a−b+), Wr(a−b+), Wd(a−), Rb(a−), WARR−, ELO−, Bp(a−), Mo(a−), Hg(a−), Vg(a−), Sw(a−), BOW−, NFLD−, Jn(a−), KREP−, Tr(a−), Fr(a−), SW1−, Wu−DISK+ |
| 011 YT |
| 23× | Yt(a+b−) |
| 013 SC |
| 37× | Sc1+Sc2−, Rd−, STAR+, SCER+, SCAN+ |
| 014 DO |
| 40× | Do(a+b+), Jo(a+), DOYA+, Hy+, DOMR+, DOLG+ |
| 015 CO |
| 27× | Co(a+b−), Co4+ |
| 016 LW |
| 23× | LW(a+b−) |
| 017 CH/RG |
| 25× | Ch1+, Ch2+, Ch3+, Ch4+, Ch5+, Ch6+, Rg1−, Rg2− |
| 017 CH/RG |
| 25× | Ch1−, Ch2−, Ch3−, Ch4−, Ch5−, Ch6−, Rg1+, Rg2+ |
| 018 H |
| 27×, 28×, 29× | H+ |
| 019 XK |
| 39× | Kx+ |
| 020 GE |
| 32× | Ge2+, Ge3+, Ge4+, Es(a+), Wb−, An(a−), Dh(a−), GEIS−, GELP+, GEAT+, GETI+ |
| 021 CROM |
| 43× | Cr(a+), Tc(a+b−c−), Dr(a+), Es(a+), WES(a−b+), UMC+, GUTI+, SERF+, ZENA+, CROV+, CRAM+, CROZ+ |
| 022 KN |
| 35× | Kn(a+b−), McC(a+b−), Sla+Vil−, Yk(a+), Sl3+, KCAM+/KCAM− |
| 023 IN |
| 39× | In(a−b+), INFI+, INJA+ |
| 024 OK |
| 22× | Ok(a+), OKGV+, OKVM+ |
| 025 RAPH |
| 22× | MER2+ |
| 026 JMH |
| 27× | JMHK+, JMHL+, JMHG+, JMHM+, JMHQ+ |
| 027 I |
| 37× | I+ |
| 028 GLOB |
| 40× | P+ |
| 029 GIL |
| 27× | GIL+ |
| 030 RHAG |
| 39× | Duclos+, Ol(a−), DSLK+, RHAG4− |
| 031 FORS |
| 27× | FORS+ |
| 032 JR |
| 37× | Jra+ |
| 033 LAN |
| 29× | Lan+ |
| 034 VEL |
| 22× | Vel+ |
| 035 CD59 |
| 36× | CD59.1+ |
| 036 AT |
| 28× | At(a+) |
Serologic RBC confirmation = A+, B−, D+, C−, c+, E+, e−, K−, k+, Fy(a+b+), Jk(a+b+), M+, N+, S+, s+.
FUT1 = active; FUT2 = inactive; FUT3 = active; SLC35C1 = active; ABO = active A1; B3GALNT1 = active; GCNT2 = active; GBGT1 = inactive; KLF1 = active; GATA1 = active.
Comprehensive PLT antigen prediction from a patient's whole genome
| Gene | Average coverage | Predicted HPA phenotypes |
|---|---|---|
|
| 34× | 1a+, 1b+, 4a+, 4b−, 6bw−, 7bw−, 8bw−, 10bw−, 11bw−, 14bw−, 16bw−, 17bw−, 19w−, 21w−, 23bw−, 26bw− |
|
| 25× | 2a+, 2b− |
|
| 27× | 3a+, 3b−, 9bw−, 20w−, 22bw−, 24bw−, 27bw−, 28bw− |
|
| 40× | 5a+, 5b−, 13bw−, 18w−, 25bw− |
|
| 21× | 12bw− |
|
| 39× | 15a−, 15b+ |
Changes not known to encode new or altered antigenic epitopes
| Gene | CDS nucleotide (genomic coordinate) and [amino acid] |
|---|---|
|
|
hom c.38(chr4:145,041,741)A>C [missense p.Glu13Ala] |
|
| het c.109(chr22:43,089,849)A>G [missense p.Met37Val] |
|
| het c.3623(chr1:207,753,621)A>G [missense p.His1208Arg]; het c.5480(chr1:207,790,088)C>G [missense p.Pro1827Arg]; hom c.5905(chr1:207,795,320)A>G [missense p.Thr1969Ala] |
|
| hom c.3722(chr6:74,521,947)C>T [missense p.Thr1241Met] |
hom = homozygous, het = heterozygous