Justin H J Ng, Mary Tachedjian, Lin-Fa Wang, Michelle L Baker.
Abstract
BACKGROUND: Bats are an extremely successful group of mammals and possess a variety of unique characteristics, including their ability to co-exist with a diverse range of pathogens. The major histocompatibility complex (MHC) is the most gene-dense and polymorphic region of the genome, and MHC class II (MHC-II) molecules play a vital role in the presentation of antigens derived from extracellular pathogens and in the activation of the adaptive immune response. Characterisation of the MHC-II region of bats is crucial for understanding the evolution of the MHC and the role of pathogens in shaping the immune system.
Keywords: Australian black flying-fox; Bat; Comparative genomics; MHC-II; Pteropus alecto
Year: 2017 PMID: 28521747 PMCID: PMC5437515 DOI: 10.1186/s12864-017-3760-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1 Detailed map of the bat MHC-II region (1,262,706 bp) identified on scaffold202 of the bat genome. Maroon arrows represent MHC-II genes, blue arrows represent annotated genes, and green arrows represent predicted, unannotated open reading frames. Red blocks represent gaps (>4000 bp) in the scaffold. Dotted arrows represent the extended class II sub-region and solid arrows represent the classical class II sub-region. ψ represents putative pseudogenes
List of annotated genes in the bat MHC-II region
| Description | Gene | Start | End | Strand | Accession | Locus Tag^a |
|---|---|---|---|---|---|---|
| NA | - | 54622 | 54407 | - | ELK16616 | PAL_GLEAN10007084 |
| ψ MHC Class II DR β chain 2 | | 124020 | 127163 | + | ELK16617 | PAL_GLEAN10007085 |
| Promotilin | | 171900 | 176944 | + | ELK16618 | PAL_GLEAN10007086 |
| LEM domain-containing protein 2 | | 184083 | 200005 | + | ELK16619 | PAL_GLEAN10007087 |
| Inositol hexakisphosphate kinase 3 | | 233132 | 244985 | + | ELK16620 | PAL_GLEAN10007088 |
| Uncharacterised protein C6orf125 homolog | - | 251970 | 264591 | + | ELK16621 | PAL_GLEAN10007089 |
| Inositol 1,4,5-trisphosphate receptor type 3 | | 330989 | 267163 | - | ELK16622 | PAL_GLEAN10007090 |
| Putative Bcl-2 homologous antagonist/killer 2 | | 358514 | 361666 | + | ELK16623 | PAL_GLEAN10007091 |
| Zinc finger and BTB domain-containing protein 9 | | 435884 | 437284 | - | ELK16624 | PAL_GLEAN10007093 |
| Ras GTPase-activating protein SynGAP | | 444762 | 473798 | - | ELK16625 | PAL_GLEAN10007094 |
| Protein CutA | | 475923 | 477426 | + | ELK16626 | PAL_GLEAN10007095 |
| PHD finger protein 1 | | 483072 | 477988 | - | ELK16627 | PAL_GLEAN10007096 |
| Kinesin-like protein KIFC1 | | 493984 | 483931 | - | ELK16628 | PAL_GLEAN10007097 |
| Death domain-associated protein 6 | | 547565 | 550867 | + | ELK16629 | PAL_GLEAN10007098 |
| Zinc finger and BTB domain-containing protein 22 | | 552438 | 554336 | + | ELK16630 | PAL_GLEAN10007099 |
| Tapasin | | 555260 | 564216 | + | ELK16631 | PAL_GLEAN10007100 |
| Ral guanine nucleotide dissociation stimulator-like 2 | | 566433 | 572764 | + | ELK16632 | PAL_GLEAN10007101 |
| Prefoldin subunit 6 | | 575085 | 574122 | - | ELK16633 | PAL_GLEAN10007102 |
| WD repeat-containing protein 46 | | 575753 | 582904 | + | ELK16634 | PAL_GLEAN10007103 |
| Beta-1,3-galactosyltransferase 4 | | 585083 | 583725 | - | ELK16635 | PAL_GLEAN10007104 |
| 40S ribosomal protein S18 | | 590495 | 585718 | - | ELK16636 | PAL_GLEAN10007105 |
| Vacuolar protein sorting-associated protein 52 homolog | | 590542 | 608895 | + | ELK16637 | PAL_GLEAN10007106 |
| E3 ubiquitin-protein ligase RING1 | | 632961 | 629730 | - | ELK16638 | PAL_GLEAN10007107 |
| Estradiol 17-beta-dehydrogenase 8 | | 636805 | 635286 | - | ELK16639 | PAL_GLEAN10007108 |
| Zinc transporter SLC39A7 | | 640599 | 638121 | - | ELK16640 | PAL_GLEAN10007109 |
| Retinoic acid receptor RXR-beta | | 641371 | 646657 | + | ELK16641 | PAL_GLEAN10007110 |
| Collagen alpha-2(XI) chain | | 674830 | 675093 | + | ELK16642 | PAL_GLEAN10007112 |
| ψ MHC Class II DP β chain 1 (Fragment) | | 703197 | 701801 | - | ELK16643 | PAL_GLEAN10007114 |
| RNA-binding protein Raly | | 730049 | 730573 | + | ELK16644 | PAL_GLEAN10007116 |
| MHC Class II DO α chain | | 734481 | 736943 | + | ELK16645 | PAL_GLEAN10007117 |
| Bromodomain-containing protein 2 | | 762249 | 755609 | - | ELK16646 | PAL_GLEAN10007118 |
| MHC Class II DM α chain | | 779647 | 782542 | + | ELK16647 | PAL_GLEAN10007119 |
| MHC Class II DM β chain | | 788588 | 791667 | + | - | - |
| Proteasome subunit beta type-9 | | 868896 | 864164 | - | ELK16648 | PAL_GLEAN10007120 |
| Antigen peptide transporter 1 | | 869430 | 876791 | + | ELK16649 | PAL_GLEAN10007121 |
| Proteasome subunit beta type-8 | | 878304 | 881075 | + | ELK16650 | PAL_GLEAN10007122 |
| Antigen peptide transporter 2 | | 883417 | 891866 | + | ELK16651 | PAL_GLEAN10007123 |
| MHC Class II DO β chain | | 904258 | 907879 | + | ELK16652 | PAL_GLEAN10007124 |
| MHC Class II DQ β chain 1 | | 924949 | 931213 | + | ELK16653 | PAL_GLEAN10007125 |
| MHC Class II DQ α chain 1 | | 948995 | 947759 | - | - | PAL_GLEAN10007126 |
| MHC Class II DQ β chain 2 | | 1039500 | 1044135 | + | - | - |
| MHC Class II DQ α chain 2 | | 1064301 | 1059023 | - | ELK16654 | PAL_GLEAN10007127 |
| MHC Class II DR β chain 1 | | 1184173 | 1187159 | + | - | PAL_GLEAN10007128 |
| MHC Class II DR α chain | | 1213079 | 1209726 | - | ELK16655 | PAL_GLEAN10007129 |
| Butyrophilin-like protein 2 | | 1230144 | 1232101 | + | ELK16656 | PAL_GLEAN10007130 |
| Butyrophilin-like protein 2 | | 1239555 | 1241047 | + | ELK16657 | PAL_GLEAN10007131 |
| Butyrophilin subfamily 2 member A3 | | 1260527 | 1250684 | - | ELK16658 | PAL_GLEAN10007132 |

^a Locus tags refer to annotations in the P. alecto whole genome
ψ represents putative pseudogenes
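
The coordinate columns of the gene table can be checked mechanically. The following is a minimal sketch, not part of the original analysis: the `Gene` record and the helper names `span_bp` and `order_matches_strand` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the table does not state. Most minus-strand rows list Start greater than End, but a few (for example Zinc finger and BTB domain-containing protein 9) list ascending coordinates with a "-" strand; the sketch only flags such rows and does not decide which column is correct.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    description: str
    start: int   # "Start" column of the table
    end: int     # "End" column of the table
    strand: str  # "+" or "-"


def span_bp(gene: Gene) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(gene.end - gene.start) + 1


def order_matches_strand(gene: Gene) -> bool:
    """True when Start < End on "+" or Start > End on "-"."""
    ascending = gene.start < gene.end
    return ascending == (gene.strand == "+")


# Three rows copied from the table above.
genes = [
    Gene("Tapasin", 555260, 564216, "+"),
    Gene("Prefoldin subunit 6", 575085, 574122, "-"),
    Gene("Zinc finger and BTB domain-containing protein 9", 435884, 437284, "-"),
]

for g in genes:
    flag = "" if order_matches_strand(g) else "  <- check orientation"
    print(f"{g.description}: {span_bp(g)} bp{flag}")
```

Taking the absolute difference keeps the length calculation independent of which way round a row lists its coordinates.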
Fig. 2 Comparative gene maps of the bat MHC-II region (centre) against human, horse and pig MHC-II regions. Red arrows represent classical MHC-II genes, green arrows represent non-classical MHC-II genes, blue arrows represent flanking MHC-II region genes and purple arrows represent essential AP genes [37]. The areas highlighted in purple represent the antigen-processing (AP) cluster. ψ represents putative pseudogenes. The direction and orientation of the MHC-II regions relative to the telomere and centromere are shown. The human, horse and pig gene maps were adapted from the Ensembl annotation
Fig. 3 Simplified genomic maps of MHC-II genes in human, mouse, pig and bat. Red and blue boxes represent α and β chains respectively. Light red and light blue boxes represent pseudogenes for α and β chains respectively. Only human, mouse and pig were included as the MHC-II region of these species has been well characterised (The MHC Sequencing Consortium 1999; Velten et al. 1999; Kumánovics et al. 2003; Horton et al. 2004; Renard et al. 2006)
Fig. 4 Phylogenetic trees of MHC-II genes. (a) MHC-II A gene maximum likelihood phylogeny based on alignment of nucleotide sequences corresponding to the α1 and α2 domains. (b) MHC-II B gene maximum likelihood phylogeny based on nucleotide sequences corresponding to the β1 and β2 domains. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 1.5470)). The tree is drawn to scale, with branch lengths representing the number of substitutions per site. Branch support is indicated as percentage of trees out of 1000 bootstrap replicates that produce the same branching order. ψ represents putative pseudogenes. HLA – human leucocyte antigen; SLA – swine leucocyte antigen; Eqca – Equus caballus; Ovar – Ovis aries; Mumu – Mus musculus; Orcu – Oryctolagus cuniculus; Gaga – Gallus gallus
Coordinates of Bat class II S-X-Y motifs within the MHC-II genomic region on scaffold202
| Gene Name | Strand | Gene Start | S-X-Y Start | S-X-Y End | S-X-Y Relative Position (S-X-Y End to Gene Start) |
|---|---|---|---|---|---|
| | + | 779647 | 779475 | 779555 | −92 |
| | + | 788588 | Not found | Not found | n/a |
| | + | 734481 | 734305 | 734363 | −118 |
| | + | 904258 | 904074 | 904121 | −137 |
| | - | 703197 | Not found | Not found | n/a |
| | - | Not found | Not found | Not found | n/a |
| | - | 1064301 | 1064483 | 1064419 | 118 |
| | + | 924949 | 924738 | 924799 | −150 |
| | + | 1039500 | 1039285 | 1039348 | −152 |
| | - | 1213079 | 1213260 | 1213211 | 132 |
| | + | Not found | Not found | Not found | n/a |
| | + | Not found | Not found | Not found | n/a |
ψ represents putative pseudogenes
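
The last column of the S-X-Y table is consistent with a plain difference, S-X-Y End minus Gene Start, which is negative for plus-strand genes and positive for minus-strand genes; in both cases the motif lies upstream of the gene start. The sketch below reproduces the tabulated values under that reading. The function names are hypothetical, and `upstream_distance` is an added convenience that is not a column in the table.

```python
def sxy_relative_position(gene_start: int, sxy_end: int) -> int:
    """S-X-Y End minus Gene Start, as tabulated (sign depends on strand)."""
    return sxy_end - gene_start


def upstream_distance(strand: str, gene_start: int, sxy_end: int) -> int:
    """Strand-aware distance in bp from the motif end to the gene start.

    Positive values mean the motif is upstream of the gene on its own strand.
    """
    offset = sxy_end - gene_start
    return -offset if strand == "+" else offset


# (strand, gene start, S-X-Y end, tabulated relative position) for the
# seven rows of the table in which a motif was found.
rows = [
    ("+", 779647, 779555, -92),
    ("+", 734481, 734363, -118),
    ("+", 904258, 904121, -137),
    ("-", 1064301, 1064419, 118),
    ("+", 924949, 924799, -150),
    ("+", 1039500, 1039348, -152),
    ("-", 1213079, 1213211, 132),
]

for strand, gene_start, sxy_end, tabulated in rows:
    computed = sxy_relative_position(gene_start, sxy_end)
    print(strand, computed, computed == tabulated,
          upstream_distance(strand, gene_start, sxy_end))
```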
Fig. 5 Comparison of the S-X-Y motifs in the (a) MHC-II A genes and (b) MHC-II B genes of bat and human. Logos of the corresponding position-specific scoring matrix models are presented. The height of each stack of symbols (y-axis) represents the information content, in bits (log2), at each position of the DNA sequence (x-axis), with a maximum value of 2
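
The maximum of 2 in the logo follows from the four-letter DNA alphabet: a fully conserved position carries log2(4) = 2 bits, and a position with equal base frequencies carries 0. A minimal sketch of the standard per-column calculation is given below. It assumes relative base frequencies that sum to 1 and applies no small-sample correction; the source does not say whether the logo software used one, and the function name is hypothetical.

```python
import math
from typing import Sequence


def information_content(freqs: Sequence[float]) -> float:
    """Information content (bits) of one alignment column.

    freqs holds the relative frequencies of A, C, G and T and is assumed
    to sum to 1. No small-sample correction is applied (an assumption).
    """
    # Shannon entropy of the column; zero-frequency bases contribute nothing.
    entropy = -sum(p * math.log2(p) for p in freqs if p > 0)
    # Maximum possible entropy minus observed entropy.
    return math.log2(len(freqs)) - entropy


print(information_content([1.0, 0.0, 0.0, 0.0]))      # fully conserved position
print(information_content([0.25, 0.25, 0.25, 0.25]))  # uninformative position
print(information_content([0.5, 0.5, 0.0, 0.0]))      # two equally likely bases
```

In a sequence logo, each base is then drawn with a height equal to its frequency multiplied by the column's information content.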