Literature DB >> 33126081

Cartilaginous fish class II genes reveal unprecedented old allelic lineages and confirm the late evolutionary emergence of DM.

Tereza Almeida1, Arnaud Gaigher2, Antonio Muñoz-Mérida2, Fabiana Neves2, L Filipe C Castro3, Martin F Flajnik4, Yuko Ohta4, Pedro J Esteves5, Ana Veríssimo6.   

Abstract

Cartilaginous fish (chimaeras, rays and sharks) are the most basal extant jawed vertebrates with an adaptive immune system based on the Major Histocompatibility Complex (MHC). Despite being a key taxon in the evolution of vertebrate adaptive immunity, no comprehensive characterization of MHC class II genes has been undertaken for the group. We performed extensive bioinformatic searches on a taxonomically diverse dataset of transcriptomes and genomes of cartilaginous fish targeting MHC class II sequences. Class IIα and IIβ sequences were retrieved from all taxa analyzed and showed typical features of classical class II genes. Phylogenetic trees of the immunoglobulin superfamily domain showed two divergent and remarkably ancient lineages of class II genes in Selachians (sharks), originating >350 million years ago. Close linkage of lineage-specific pairs of IIα and IIβ genes was found, confirming previous results, with genes from distinct lineages segregating as alleles. Nonclassical class II DM sequences were not retrieved from these data and classical class II sequences lacked the conserved residues shown to interact with DM molecules, supporting claims that the DM system arose only in the lobe-finned fish lineage leading to tetrapods. Based on our search methods, other divergent class II genes are unlikely in cartilaginous fish.
Copyright © 2020 The Author(s). Published by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Adaptive immunity; Antigen presentation; Basal jawed vertebrates; MHC; Sharks and rays

Mesh:

Substances:

Year:  2020        PMID: 33126081      PMCID: PMC8010645          DOI: 10.1016/j.molimm.2020.10.003

Source DB:  PubMed          Journal:  Mol Immunol        ISSN: 0161-5890            Impact factor:   4.407


Introduction

The major histocompatibility complex (MHC) is a genomic region encoding genes involved in antigen presentation and processing, which are essential for immune defense mechanisms. One of the key immune gene families is MHC class II, which initiates an adaptive immune response by presenting exogenously acquired antigens in the form of peptides to CD4+ T-cells. Class II molecules are heterodimers of α and β chains (IIα and IIβ) encoded by separate genes. Each chain is composed of two extracellular domains, namely the peptide-binding region (PBR, i.e. α1 and β1) and the immunoglobulin superfamily domain (IgSF, i.e. α2 and β2), in addition to a signal peptide, a connecting piece (CP), a transmembrane domain (TM), and the intracellular cytoplasmic tail (CYT) (Klein and Sato, 2000). A hallmark of most class II genes in jawed vertebrates is the high level of polymorphism and evolutionary positive selection on the codons encoding the peptide binding residues, resulting in high diversity at the PBR and allowing the recognition of a wider array of pathogen-derived peptides conferring broad pathogen resistance (Radwan et al., 2020). Binding of antigenic peptides by classical MHC class II molecules is preceded by several steps associated with heterodimer assembly and association with the invariant chain (Ii) in the endoplasmic reticulum (ER). The class II-Ii complex transits from the ER to a specific late endosomal compartment where Ii breakdown occurs, leaving the class II–associated invariant chain peptide (CLIP) in the class II PBR. The exchange of CLIP by antigenic peptides is facilitated by the nonclassical class II molecule DM, that interacts most closely with the α1 domain of the CLIP-class II complex (Pos et al., 2012). The DM system also promotes binding of high-affinity peptides that form stable complexes with class II proteins (Lazarski et al., 2006; Schulze and Wucherpfennig, 2012). Class II peptide processing is, in turn, regulated by another nonclassical class II molecule found in mammals, DO, that works in concert with the DM molecules (Denzin et al., 1997; Kropshofer et al., 1998). Classical MHC class II genes have been described in all jawed vertebrate lineages, including mammals (reviewed in Yeager and Hughes, 1999), birds (reviewed in Burri et al., 2010 and Parker and Kaufman, 2017), amphibians (Flajnik et al., 1991; Kiemnec-Tyburczy et al., 2010), bony fishes (reviewed in Dijkstra et al., 2013), and cartilaginous fishes (Bartl and Weissman, 1994; Kasahara et al., 1992). The Ii genes have also been reported for all jawed vertebrate lineages (Criscitiello et al., 2012), while DM genes have been found in sarcopterygians (i.e. lobe-finned fishes), from lungfish to mammals (Dijkstra et al., 2018; Fling et al., 1994; Kaufman, 1999; O’hUigin et al., 1998; Ohta et al., 2006), but so far not in basal jawed vertebrates such as cartilaginous and actinopterygians (i.e. ray-finned fishes), including teleosts (Dijkstra et al., 2013). Teleost fish genomes show many divergent features from other jawed vertebrates, with tetrapods and cartilaginous fish genomes showing comparatively more similarities (reviewed in Ravi and Venkatesh, 2018). In fact, teleosts lack the primordial linkage between classical class I and II genes observed in all other jawed vertebrates (Klein and Sato, 1998; Ohta et al., 2000). Given the paucity of immunogenetic studies on Chondrichthyan taxa, it remains to be confirmed if the loss of DM is (another) derived feature of teleosts, or if the absence of the DM is a common feature in basal jawed vertebrates implying its late emergence in the lobe-finned fish leading to tetrapods. Cartilaginous fish (Class Chondrichthyes) are the oldest living jawed vertebrates (~500 MY old) possessing an adaptive immune system and represent a key evolutionary group to study the evolution of vertebrate adaptive immunity. The group includes Elasmobranchs (sharks and rays) and Holocephalans (chimaeras) comprising a diverse assemblage of species occupying all aquatic ecosystems (marine, estuarine and freshwaters) and exploiting a variety of habitats (e.g. coastal, open-ocean, or deep waters) (Carrier et al., 2010), and whose lineages date back several hundred million years (Heinicke et al., 2009). Thus, exposure to different environmental stressors and pathogenic agents, both currently and through time, may have shaped putatively different immune response mechanisms and genetic makeup in this group. However, our current knowledge of Chondrichthyan immunobiology has relied mostly on a single model species, the nurse shark Ginglymostoma cirratum (reviewed in Smith et al., 2014). Here we provide the first comprehensive assessment of MHC class II gene diversity in cartilaginous fishes, making use of accessible genetic, genomic and transcriptomic resources made available for sharks, rays and chimaeras in the last few years. Chondrichthyan class II sequences were compared with those from classical and nonclassical class II genes of other vertebrates regarding their biochemical features and phylogenetic relationships. By doing so, we also objectively searched for DM-like homologs in cartilaginous fish. Furthermore, we assessed class II gene diversity among sharks, rays and chimaeras to infer the number and diversity of class II lineages in each group. Ultimately, this study serves as a baseline of Chondrichthyan MHC class II immunogenetics and aims to promote additional studies exploring the evolutionary history of MHC genes in the group at the origins of adaptive immunity. These data also set the stage for future studies of the drivers of MHC diversity within and among species exhibiting remarkable variety in biology and ecology, as well as having long evolutionary histories.

Material and methods

Taxonomic breadth and data availability

Sequence Read Archive (SRA), Transcriptome Shotgun Assembly (TSA) and Whole Genome Sequence (WGS) databases publicly accessible on NCBI were screened for MHC class II-like sequences on available Chondrichthyan taxa. A total of 33 species were included in the analysis (Supplemental Table 1), covering the Holocephalans (or chimaeras; 1 species) and the two sister lineages of Elasmobranchs (Selachii, or sharks: 23 species, and Batoidea, or rays: 9 species). All but one order of Batoids (sensu Aschliman, 2011) and five out of the nine orders of Selachii (sensu Naylor et al., 2012) are represented in the dataset. Some SRA datasets from RNAseq projects on Selachian taxa were excluded when referring to species whose genus had multiple datasets and if including non-immunity-related tissues (e.g. retina, pectoral fin bud or ampullary receptor cells). In turn, Batoid and Holocephalan taxa were underrepresented in public databases and thus all available data were retained for MHC screening.

Bioinformatic searches and filtering

The different “omic” databases available on NCBI (i.e. TSA; SRA; WGS) were screened using different protocols to extract class II-like sequences. Briefly, for TSA data, the available transcripts were downloaded and blasted against a set of five class IIα and five IIβ reference sequences (species name followed by accession number: Homo sapiens HLA-DR ARB08440.1 and AAB60387.1, and HLA-DM CAA54169.1 and ARB08367.1; elephant shark Callorhinchus milii AFK10583.1 and AFP09377.1, nurse shark Ginglymostoma cirratum AAF66123.1 and AAF82681.1, and winter skate Leucoraja ocellata GEZH01010955.1 and GEZH01040536.1). We only retained transcripts with a blast e-value under 0.001 and covering more than 50 % of each reference sequence. For SRA data, raw reads were downloaded, cleaned using Trimmomatic, and assembled using SPAdes to obtain the final transcripts. The resulting transcripts were screened as mentioned above for TSA data. Regarding WGS data, available contigs were screened using the same reference sequences mentioned above through tblastn, to locate the positions in the contigs using stringent e-values (10−7). The corresponding hits were visually inspected and filtered to avoid overlaps and redundancy, and the corresponding gene sequence was bioinformatically extracted from target contigs using a flanking region of 200 bp to maximize inclusion of the whole gene sequence. To refine our search and retain only class IIα and IIβ proteins, all retrieved sequences were translated to amino acids and only those showing the basic features of class II genes were retained, namely (at least) full sequences containing the PBR (α1/β1) and IgSF domains (α2/β2). This Chondrichthyan sequence dataset was complemented with additional class II sequences deposited in the GenBank nt database (see Supplemental Files 1 and 2 for full list of sequences and references). Furthermore, classical and nonclassical class IIα and IIβ sequences from other jawed vertebrate lineages (ray-finned fishes, amphibians, reptiles/birds and mammals) were downloaded from GenBank to compare and contrast MHC diversity of Chondrichthyans to that of other vertebrates.

Structural and biochemical features of MHC II genes

Amino acid alignments of full MHC IIα and IIβ proteins were performed separately, using the MUSCLE algorithm (Edgar, 2004) implemented in Geneious Prime v2.1. Sequence alignments were manually edited to highlight typical features of functional class IIα and IIβ genes following Brown et al. (1993); Kaufman et al. (1994); Wang et al. (2001) and Dijkstra et al. (2018, 2013). These structural and biochemical features were checked in all retrieved cartilaginous class II sequences and compared against representative lineages of other vertebrate groups (as listed above) including both classical and nonclassical MHC II genes.

Phylogenetic analyses

To infer the phylogenetic relationships of Chondrichthyan class II genes and those of other vertebrate representatives, including classical and nonclassical genes (e.g. DO and DM genes), we built neighbor-joining trees with Jones-Taylor-Thornton model as implemented in MEGAX v10.1.18 (Kumar et al., 2018; branch support evaluated by 1000 bootstrap replicates). MHC IIα and IIβ trees were performed independently based on amino acid sequence alignments of the PBR and IgSF domain (exons 2 and 3), including representative sequences of each Chondrichthyan genus in the dataset in addition to classical and nonclassical sequences representative of different vertebrate lineages (i.e. Actinopterygii/ray-finned fishes: spotted gar Lepisosteus oculatus; stickleback Gasterosteus aculeatus; salmon Salmo salar; zebrafish Danio rerio Sarcopterygii/lobe-finned fishes: coelacanth Latimeria chalumnae, lungfish Lepidosiren paradoxa, frog Xenopus laevis, lizard Sphenodon punctatus, chicken Gallus gallus, rat Rattus norvegicus and human Homo sapiens). As different evolutionary dynamics are expected between class II domains, neighbor-joining trees were built independently for PBR and IgSF domains (i.e. exon 2 vs. exon 3) of IIα and IIβ genes of Chondrichthyan taxa, based on nucleotide sequence alignments and using the Kimura 2-parameter distance implemented in MEGAX (branch support evaluated by 1000 bootstrap replicates). An expanded Chondrichthyan sequence dataset was used for this purpose, i.e. including multiple sequences from each species screened, where available, to allow a better perspective on the sequence diversity and divergence patterns in the group. With this in mind, we retained only sequences with less than 97 % identity at the within-species level.

Selection analysis

To infer codon-specific footprints of evolutionary positive and negative selection across the Chondrichthyan phylogeny we used four Maximum Likelihood (ML) methods available in the HYPHY package (Pond et al., 2005) implemented in the Datamonkey web server (Weaver et al., 2018), namely Single Likelihood Ancestor Counting (SLAC), Fixed-Effect Likelihood (FEL), Random Effects Likelihood (REL) and Fast Unconstrained Bayesian AppRoximation (FUBAR). All these methods are based on the estimation of nonsynonymous (dN) and synonymous (dS) substitutions rates for each site, with dN/dS < 1 being indicative of negative selection while dN/dS > 1 of positive selection. The selection inferences were conducted independently for each domain (α1, α2, β1 and β2) by using the tree topology described above. To avoid false positives, codons were considered evolving under selection if detected by at least two different methods.

Amplification of MHC class II genes and pedigree analyses

Based on the results from the phylogenetic analysis, two lineages were found for class IIα (i.e. DAA and DBA) and IIβ (DAB and DBB) (see more details in the Results section). One family of nurse sharks was previously used in demonstrating linkage between class II αDAA and βDAB genes (family#1; Ohta et al., 2000), with two pups exhibiting paternal alleles for two αDBA genes (the mother was negative for DBA genes, but multiple paternity was found in the litter). Here, the same family was screened for the presence/absence of the MHC class II αDBA and βDBB genes to infer possible linkage between specific pairs of α and β genes. In addition to nurse shark family#1, 15 wild nurse sharks were also analyzed to assess consistency of the association between class IIα and IIβ genes in unrelated individuals. Screening was based on lineage-specific amplification with the polymerase chain reaction (PCR) using primers anchored in the IgSF domains of IIα and IIβ genes, namely DAA forward 5’ CCTGAAGTCTCTGTGTATTCTG 3’ and reverse 5’ TACTGGATCCTGTAGGCTCGA 3’; DBA forward CCTCAGATTGCCATGTATCCTG 3’ and reverse 5’ CGTTGGATCCTGCAGCCCCTC 3’; DAB forward 5’ ATCCGAACAAAAGAATCGAC 3’ and reverse 5’ ATCCACACTCATTGGTGAGG 3’; DBB forward 5’ ATCCGACCTAAAGCCTCTCA 3’ and reverse 5’ TTCCACACTCCTAGGACTCC 3’. PCR amplification was performed in 15 μL reactions using 7.5 μL Qiagen Multiplex master mix, 7.5 pmol of each primer, 3 μL autoclaved water, and 15 ng of genomic DNA. The temperature profile included an initial denaturation at 95 °C for 15 min, followed by 35 cycles of 95 °C for 1 min, annealing at 58 °C for DAA and DAB, 59 °C for DBB, and 62 °C for DBA for 45 s, 72 °C for 1 min, and a final extension at 72 °C for 5 min. The resulting amplicons were sequenced directly.

Results

Chondrichthyan sequences showed conserved features of vertebrate class II genes

Our bioinformatic searches retrieved MHC class II-like sequences from 33 species of sharks, rays and chimaeras for which data were available. In total, 97 and 83 sequences exhibited the typical domains in class IIα and IIβ chains, and showed high similarity to previously described class II sequences of Chondrichthyan taxa (e.g. nurse shark G. cirratum, Bartl and Weissman, 1994 and Kasahara et al., 1992; elephant shark C. milii, Venkatesh et al., 2014; Fig. 1). Several structural features typical of the PBR domains (α1 and β1) of class II genes were found across the Chondrichthyan sequences (Fig. 1). Specifically, as expected, there were no intradomain disulfide bonds in α1, while there were the two canonical cysteines in β1 (Cβ15 and Cβ79). The α1 domain had a putative N-linked glycosylation site (NXS/T) conserved at Nα75 across Elasmobranchs (except in the requiem sharks - Order Carcharhiniformes, and stingrays - Order Myliobatiformes), although this was shifted from the mammal-like position (i.e. DR Nα78) as noted by Kasahara et al. (1992). In Carcharhiniformes, the glycosylation site is generally conserved as Nα78, while in Myliobatiformes the α75 was polymorphic (N, Q, K or I) (Fig. 1a, Supplemental Files 3 and 4). In contrast, all sequences retrieved from the Holocephalan C. milii lacked the N-linked glycan residues in the α1 domain. In turn, the N-linked glycosylation site reported for the β1 domain of higher vertebrates was conserved across all Chondrichthyans (Nβ19; Fig. 1b). Most class IIα sequences exhibited the highly conserved salt bonds in the distal domain (Hα5/Dα27; Fig. 1a), with few sequences exhibiting the Yα5 substitution originally described for the nurse shark (Kasahara et al., 1992); thus, the α1 salt bond is the rule rather than the exception.
Fig. 1.

Amino acid alignment of full protein of MHC class II genes. Alignments for a) alpha and b) beta genes, for representative Chondrichthyan taxa (one representative sequence per genus) including sharks (Selachii), rays (Batoidea) and chimaeras (Holocephali), as well as other vertebrate taxa including classical and nonclassical (DM) class II sequences. Taxon names for Chondrichthyan representatives are shown as the first letter for genus name (in capital letter) followed by the first three letters of the species name, and a number for cross referencing with the full nucleotide sequences in Supplemental Files 1–2 (alpha sequences) and 3–4 (beta sequences), except for Cmil alpha sequence (GenBank Accession JW875535.1). Abbreviation and full species name are indicated in Supplemental Table 2. GenBank Accessions for non-Chondrichthyan sequences are as follows: human HLA-DRA1 AAA59785.1, HLA-DRB1 BAO73158.1, DMA CAA54169.1, DMB ARB08367.1; chicken BLA1 AAY40298.1, BLB1 BAF62996.1, M alpha CAA18966, M beta BAG69311; frog DAA AAL58430.1, DAB BAA02842.1, DM alpha AAH61681, DM beta XP 002942059; stickleback DAA AAU01917.1, DAB AAU01918.1. The salt bonds in class IIα sequences are highlighted by double bars. Residues highlighted in color represent putative N-glycosylation sites (green); disulfide bonds (grey); Ig-like domains (blue); putative residues involved in CD4-binding sites (yellow; Wang et al., 2011; Zhang et al., 2020); conserved peptide binding residues (black); DM specific residues (red; Dijkstra et al., 2019); residues in red font are important for classical class II α chain interaction with DM (Zhang et al., 2020). Predicted locations of β-strands (S1–11 in alpha, and S1–13 in beta) and α-helices (H-2 in alpha, and H1–3 in beta) are shown as double lines below alignments. Numbers above the alignment refer to positions in the HLA-DRA1 and HLA-DRB1, respectively. Residues in Chondrichthyan sequences under positive (+) or negative (*) selection are shown below alignment positions. Residues assumed to interact with peptides (p or P), or with the TCR (t) are indicated below the alignment (Kasahara et al., 1992).

The PBR of classical class II genes are characterized by the presence of eight conserved peptide-binding residues, of which four are particularly important in establishing hydrogen-bonds and are often perfectly conserved in tetrapods: αN62, αN69, Wβ61, and Nβ82 (Kaufman et al., 1994; Painter and Stern, 2012). Three of these four residues were generally conserved in Chondrichthyans, except α62 which was highly variable and under positive selection (Table 1). The remaining peptide-binding residues were either highly variable (e.g. α51, α53), or generally conserved for a residue distinct from those in higher vertebrates resulting in amino acid substitutions with very distinct properties (α76, β81; Table 1).
Table 1

Peptide binding residues for MHC class II molecules, with conserved residues responsible for binding to mainchain atoms highlighted in grey shade (letter in parentheses shows conserved residue found in tetrapods, following Kaufman et al., 1994). Residues under positive selection are marked with + on top of their corresponding positions in the alignment. Taxon names for Chondrichthyan representatives are shown as the first letter for genus name (in capital letter) followed by the first three letters of the species name. Abbreviation and full species name indicated in Supplemental Table 2.

αl
Taxon++++++++
791122242631324351(F)53(S)54555862(N)656869(N)7276(R)
Sharks
 HzebEQYFLVVLATRGGGQQQNVL
 CcarNLYFVVVLAAQGGGNQQSVL
 CtauYQYFLEVFAGQGGGNQNNVL
 IoxyDLYFVVVFATQGGGDQQSVL
 CperDLSWNLVFAAQGGADQQNIR
 PglaDLSFMVVFVGQGGAQQQNVL
 ScanNLYFLVVLAARGGRTQNNVL
 CgriQSYFLVIVGYQRGGNQHNIR
 GcirFQYFVEIFAYQGGANQNNVL
 RtypEQYFVVVLAVRGGGNQNNVL
 EspiEQYFLVVFAFQAGGNQQNVL
 SacaHLYFVEVFAARGGAQVQNVL
Rays
 LeriDLYFLVVLATRGGRQFQNVL
 OkenEQYFLVILATRGG.NVQNVL
 NkuhDMFFVDRFPGEGGTIQQNVL
 UjamDMYFVDRFPVQGGGQQQNVL
Chimaeras
 CmilGLWFVVLFVTEGGGDVNNGD
Expected structural features of IgSF domain of MHC class II molecules were also detected across Chondrichthyan sequences (Fig. 1), namely two Cys residues in α2 (Cα107, Cα163) and β2 (Cβ117, Cβ173). Minisatellite motifs were found across all Chondrichthyan β2 sequences (codons 150–156, Fig. 1b), but also across Elasmobranch α1 sequences (codons 40–44, Fig. 1a). Sites associated with putative CD4-binding residues in higher tetrapods were generally conserved in Chondrichthyans but showed alternative residues to those reported in birds and mammals (Fig. 1; Wang et al., 2011; Zhang et al., 2020), supporting previous claims that cartilaginous fish class II genes do not show an obvious tetrapod-like CD4 binding site (Bartl, 2001; Dijkstra et al., 2013). Additional features associated with intra- and interdomain contact sites typical of classical class II molecules were also found in Chondrichthyan sequences (Fig. 1). Highly conserved residues characteristic of IgSF molecules at Dα29/Eα30 and Rβ93 were conserved across taxa, while β33 was mostly conserved as an Asp except in the elephant shark C. milii, stingrays (Myliobatiformes), and some shark species (H. zebra and C. taurus), where it showed the mammalian condition as Nβ33. The CP, TM and CYT domains were generally conserved in size (with 1–2 amino acid variations) across sharks, rays and elephant shark, and also exhibited some typical features of vertebrate class II molecules (Fig. 1). In the beta CP, the residue Kβ198, a C-terminal ectodomain binding between α and β chains, is conserved in Elasmobranchs although the elephant shark C. milii exhibits a conservative substitution as Rβ198. In the TM domain, all residues involved in helix packing and binding are highly conserved (e.g. α: C195, G198, G202, G205 and G209; and β: G202, G205, G209 and G216), with minor exceptions. In both IIα and IIβ sequences, the CYT of Chondrichthyans differ in length from those of other vertebrates (α:2–7 aa; β:3–16 aa), as well as the CP but to a less extent (1–2 aa difference only).

Chondrichthyan class II sequences show signs of positive selection and increased diversity at the PBR

When comparing the diversity of MHC class II domains across Chondrichthyan α and β sequences, the TM (average percentage of pairwise amino acid differences: 10 % and 18 %, respectively) and CYT (20 % and 21 %, respectively) were the most conserved, followed by the CP which was more conserved in the IIα than in the IIβ sequences (26 % vs. 41 %). The PBR and IgSF domains were the most variable, with the former exhibiting higher average amino acid differences (alpha: 36 % vs. 32 %, beta: 42 % vs. 27 %). The PBR domains had a higher proportion of non-synonymous substitutions compared to the IgSF domains (average values in PBR vs. IgSF: alpha: 20 % vs. 16 %, beta: 27 % vs. 16 %). Both the PBR and IgSF domains of Chondrichthyan class IIα and IIβ proteins showed residues under negative selection, although these were most prevalent in the latter (α1: 20 sites, α2: 39 sites, β1: 13 sites, β2: 39 sites). Positions under negative selection generally coincided with important structural residues such as the cysteines involved in disulfide bridges, or the Hα5 residue essential in establishing a salt bridge (Fig. 1). Sites under positive selection were detected only in the PBR domains (α1: 10 sites, β1: 6 sites) and largely coincide with polymorphic residues that are used for peptide binding in mammalian class II molecules (Fig. 1, Table 1).

Elasmobranchs show remarkably ancient class II lineages

All Chondrichthyan sequences of MHC IIα and IIβ genes formed well-supported monophyletic clades, with classical and nonclassical class II gene sequences from other jawed vertebrates forming separate clusters (Fig. 2). Within Chondrichthyans, MHC IIα and IIβ trees had congruent topologies with sequences clustering into well-supported hierarchical clades consistent with the currently accepted taxonomic arrangement in the group, namely two sub-classes: Holocephali (chimaeras) and Elasmobranchii (sharks and rays), and two sister lineages of Elasmobranchs: Batoidea (rays) and Selachii (sharks) (Supplemental Fig. 1). Furthermore, taxon-based clustering was also observed towards the inner tree branches, with sequences generally clustering according to taxonomic order (but see below for discordances).
Fig. 2.

Phylogenetic trees of MHC class II genes. Trees for a) alpha and b) beta genes are based on amino acid sequences from the PBR and IgSF domains (exons 2 and 3, respectively) retrieved for Chondrichthyan taxa (names listed on Fig. 1 legend, amino acid sequences provided in Supplemental File 2 and 4 for alpha and beta, respectively), and including other vertebrate representatives. GenBank Accessions for non-Chondrichthyan sequences are as follows: human HLA-DRA1 AAA59785.1, HLA-DRB1 BAO73158.1, HLA-DQA1 AAA59760.1, HLA-DQB1 AAA59770.1, HLA-DPA1 AAH09956.1, HLA-DPB1 AAA59837.1; DOA P06340, DOB P13765, DMA CAA54169.1, DMB ARB08367.1; rat H2-DOA NP 898874, H2-DOB NP 001008846; chicken BLA1 AAY40298.1, BLB1 BAF62996.1, BLB2 BAF62998.1, M alpha CAA18966, M beta BAG69311; tuatara (lizard) DAA AKG62148.1, DAB AAZ77712.1; clawed frog DAA AAL58430.1, DAB BAA02842.1, DM alpha AAH61681, DM beta XP 002942059; coelacanth IIA XP006014228.1, IIB XP006010591.1; lungfish IIA GEHZ01055957.1, IIB GEHZ01012825.1; DMA1 GEHZ01056980.1, DMB1 GEHZ01000527.1; stickleback DAA AAU01917.1, DBA AAU01919.1, DAB AAU01918.1, DBB AAU01920.1; spotted gar IIA 501A2 JH591501, IIB 501B1 JH591501; salmon DAA AAL40122.1, DBA ABX44764.1, DAB CAA49725.1, DBB ABX44766.1; zebrafish DAA AAA72019, DBA D8.45A2, DAB AAA50043, DBB AAA87894. Bootstrap support values (10000 replicates) are shown only if above 50 %.

Topologies of the NJ trees differed when the PBR and IgSF domain were analyzed independently (Fig. 3 and 4, respectively). The PBR trees had short inner branches (weakly supported) and long outer branches, with well-supported clusters generally coincided with Order-level (i.e. taxon-based) clades. In contrast, the IgSF trees exhibited longer inner branches and shorter outer branches with good support at different levels of the tree, but the major clusters were not fully consistent with taxon-based clades. In this case, sequences from the same species may occur in very divergent clusters not including their closest relatives. This was particularly evident among shark taxa where two divergent and well supported lineages were found for IIα and IIβ genes (Fig. 3 and 4, respectively). Both phylogenetic trees imply that the lineages within each gene are remarkably old considering previous molecular-based divergence time estimates (Heinicke et al., 2009; Fig. 5). The lineages of class IIα and IIβ genes were found across several long-diverged shark orders implying a lineage split in the Selachian ancestor, i.e. after the divergence between sharks and rays and prior to the radiation of sharks between 350–393 Mya (Heinicke et al., 2009). Based on the placement of previously reported nurse shark sequences on the phylogenetic trees, we will refer to the lineages of class IIα genes as DAA and DBA following Kasahara et al. (1993), and of class IIβ genes as DAB and DBB (for lineage 1 of Bartl and Weissman, 1994; and lineage 2 of Bartl, 2001, respectively).
Fig. 3.

Phylogenetic trees of MHC class IIα genes. Trees are based on a) the PBR domain (α1, exon 2) and b) the IgSF domain (α2, exon 3), retrieved for Chondrichthyans. Lineages based on IgSF domain differences are framed by straight lines (DAA) and dotted lines (DBA). Major Chondrichthyan taxonomic groups are highlighted by colored bars: white – Holocephalans (chimaeras), grey – Selachians (sharks), black – Batoids (rays). Taxon names follow description in Fig. 1 legend (full names listed in Supplemental Table 2A), preceded by a reference number to allow direct comparison between the PBR and IgSF trees. Chondrichthyan nucleotide sequences are provided in Supplemental File 1.

Fig. 4.

Phylogenetic trees of MHC class IIβ genes. Trees are based on a) the PBR domain (β1, exon 2) and b) the IgSF domain (β2, exon 3), retrieved for Chondrichthyans. Lineages based on IgSF domain differences are framed by straight lines (DAB) and dotted lines (DBB). Major Chondrichthyan taxonomic groups are highlighted by colored bars: white – Holocephalans (chimaeras), grey – Selachians (sharks), black – Batoids (rays). Taxon names follow description in Fig. 1 legend (full names listed in Supplemental Table 2B), preceded by a reference number to allow direct comparison between the PBR and IgSF trees. Chondrichthyan nucleotide sequences are provided in Supplemental File 3.

Fig. 5.

Schematic representation of MHC class II lineage split in Chondrichthyans and the hypothetical timings of gene divergence. Dotted grey line indicates divergence of class IIα and IIβ lineages predating the radiation within sharks (represented as a blue triangle), and dashed grey line indicates the split between sharks and rays (radiation within rays represented as a green triangle). Time tree of Chondrichthyan evolution adapted from Heinicke et al. (2009).

Class II genes form lineage-specific pairs of linked α and β genes

A consistent feature between the class IIα and IIβ datasets is the unbalanced sequence numbers and unequal taxonomic representation between lineages. One lineage in each gene (αDBA and βDBB) included the vast majority of the shark sequences retrieved, and representatives from all Selachian taxonomic orders present in the dataset. The second lineage (αDAA and βDAB) included comparatively fewer sequences and representatives of only three shark orders (Orectolobiformes - G. cirratum, Lamniformes - C. taurus and Squaliformes - S. acanthias and E. spinax; Figs. 3 and 4). The association of lineage-specific pairs of α and β chains was confirmed by PCR amplification using lineage-specific primers, consistent with the previously reported linkage between DAA and DAB genes (Ohta et al., 2000) but now also between DBA and DBB genes (Fig. 6A). Indeed, results from family#1 of nurse sharks showed absence of DBB genes in all of the offspring except in the two pups previously shown to have two DBA genes (Fig. 6B; Ohta et al., 2000), with the same results also found in additional wild sharks.
Fig. 6.

Association of MHC class IIα and IIβ lineages into haplotypes in the nurse shark G. cirratum. A) Lineage-specific PCRs in family #1 from Ohta et al. (2000) (numbers refer to pups; M: mother), and in 15 unrelated individuals except no. 1 and 2 which are siblings from family#4 (mother was positive for all lineages; Y. Ohta, unpublished data). Lineage-specific PCR amplification was performed on genomic B) Schematic representation of class IIα and IIβ genomic organization in nurse sharks, with linkage between specific sets of IIα (boxes with thick black lines) and IIβ genes (boxes with light black lines). Two DBA genes were previously found in nurse sharks (Kasahara et al., 1993; Ohta et al., 2000) but copy number may vary in other species. Presence of DAA and DAB genes (shown for the Mother of family#1) was found across all individuals analyzed, while presence of DBA and DBB (shown for pups 10 & 13) was less common. NTC - no template control.

Insights into lineage-specific genetic diversity and among-lineage genetic divergence were obtained considering sequence alignments of the PBR and IgSF domains of the class IIα and IIβ genes of three shark species exhibiting both lineages per gene (Table 2). In general, αDBA and βDBB showed higher genetic diversity compared to αDAA and βDAB. As expected, this diversity was mostly distributed within the PBR domain while the IgSF was more conserved within lineages. Despite its lower polymorphism levels, among-lineage divergence was more pronounced at the IgSF domain (Table 2; Fig. 3 and 4) where several lineage-specific amino acid differences were found (Fig. 7). The between-lineage amino acid differences changed the hydrophobicity profile of the corresponding proteins, particularly of the beta chain (Fig. 7), suggesting conformational changes in the resulting heterodimers of class II α and β chains. Other fixed amino acid differences found between lineages included the absence of the Hα5 in DAA (replaced by Yα5) and its associated salt bonds, and conservation of alternative residues for β150 between DAB (Nβ150) and DBB (Dβ150) at the conserved minisatellite motif on the β2 domain (codons 140–144).
Table 2

Estimates of average evolutionary divergence over sequence pairs within and between MHC class IIα and IIβ lineages, estimated in MEGAX v10.1.18 assuming uniform rates and pairwise deletions for indels. Values within lineages correspond to average pairwise amino acid differences; values between lineages correspond to net average amino acid differences between sequences from the two lineages per gene. Values are given for the PBR and IgSF domains separately (exon 2 vs 3, respectively). N – number of sequences used in calculations, including three shark species and corresponding to those shown in Fig. 5.

MHC class IINPBRIgSF
AlphaDAA322%16 %
DBA619 %19 %
between lineages4%12 %
BetaDAB527 %9%
DBB434%20 %
between lineages13 %21 %
Fig. 7.

Amino acid alignments of the PBR (exon 2) and IgSF (exon 3) domains highlighting the differences between lineages. Alignments are shown for MHC a) IIα and b) IIβ genes for a subset of Elasmobranch taxa. Taxon names follow description in Fig. 1 legend (full names listed in Supplemental Table 2A and 2B for alpha and beta, respectively), preceded by a reference number to allow direct comparison between the PBR and IgSF domain alignments.

Chondrichthyan fish lack DM homologues

Nonclassical class II DM molecules exhibit characteristic residues, such as Nα125 (here Nα111) which directly interacts with the classical class II Wα43 residue (Pos et al., 2012), or the presence of an endosomal sorting motif (i.e. YXXΦ, X can be any residue, Φ denotes a hydrophobic residue) in the β CYT required for transport of DM to the cellular compartments where classical class II proteins are loaded with antigen (Dijkstra et al., 2013). Here, none of the Chondrichthyan class II sequences exhibited the required Nα111 residue, which was generally conserved as Gly (Fig. 1, Supplemental File 3). Likewise, none of the β sequences had CYT with the Tyr residue required for the endosomal sorting motif. In turn, all Chondrichthyan class II alpha sequences lacked the conserved Wα43 residue (Fig. 1a) and instead showed high variability of amino acids at this site across taxa, none of which was a Trp or had hydrogen-bonding abilities. Other classical class IIα1 residues have been proposed as important for the human HLA-DR and HLA-DM interaction and show some conservation across higher vertebrates (highlighted in Fig. 1a; Dijkstra and Yamaguchi, 2019; Pos et al., 2012; Zhang et al., 2020). However, these residues were either not conserved (in fact, some were under positive selection, e.g. α51; Fig. 1a), or were entirely different in Chondrichthyan class II. Among the variable sites, some exhibited conservative amino acid substitutions (e.g. αV34 M, αV42I) but lacked the required hydrogen-bonding ability, which may compromise DM binding.

Discussion

MHC class II sequences were obtained from all Chondrichthyan lineages, i.e. sharks, rays and chimaeras, showing features typical of those described for other vertebrates. These sequences also exhibited highly conserved residues binding peptide main-chain atoms although for alternative amino acids and at fewer residues than those previously described for tetrapods (Table 1; Kaufman et al., 1994; Painter and Stern, 2012), similar to previous reports on classical class II genes of teleosts (Dijkstra et al., 2013). The presence of conserved peptide-binding residues in addition to other features found here and in previous studies indicate that Chondrichthyan class II sequences conform to expectations of classical MHC genes, such as ubiquitous expression across tissues, linkage to the MHC and individual polymorphism. Ubiquitous tissue expression is suggested by the large amount of sequences retrieved from transcriptome data from a variety of Chondrichthyan taxa and tissues (including brain, heart, kidney, spleen, pancreas, liver, gonads, skin, eye; Supplemental Table 1). This pattern is consistent with studies showing MHC class II α and β genes expression in several tissues of the nurse shark G. cirratum and of the whitespotted bamboo shark Chiloscyllium plagiosum (Criscitiello et al., 2012; Ma et al., 2013; Ohta et al., 2004). It is likely, but not proven, that the ubiquitous class II expression in non-lymphoid tissues is due to the infiltration of hematopoietic cells like lymphocytes and antigen-presenting cells. Previous studies showed very high expression of class II on splenic dendritic cells (Rumfelt et al., 2004). Previous work has also shown linkage between MHC class I and II genes in the nurse shark G. cirratum (Ohta et al., 2000), and the same linkage was found in the recently assembled genome of the bamboo shark C. plagiosum (Zhang et al., 2019). Data on the polymorphism levels of Chondrichthyan MHC II genes is still limited, but we found several sequences for both alpha and beta genes within the same individual (e.g. 3–11 sequences per lineage of alpha or beta genes per individual; Supplemental Files 1 and 2) suggesting the presence of multiple alleles and gene copies. These observations are in line with studies of a few shark species showing copy number variation and allelic diversity within species at MHC class II genes (e.g. IIα Kasahara et al., 1993; IIβ Ma et al., 2013). Moreover, we also found higher genetic diversity and residues under positive selection in the PBR of IIα and IIβ genes, in contrast to the more conserved IgSF domain, as expected in classical class II genes (Radwan et al., 2020). Chondrichthyan sequences comprised a monophyletic group of class II genes clustering separately from those of bony fishes, amphibians, birds/reptiles, and mammals (both classical and nonclassical). Within Selachians (i.e. sharks), two distinct and well supported lineages of class IIα and IIβ genes were found based on sequence divergence at the IgSF domain. Most notably, the class II gene trees imply a remarkably old age of lineage divergence, with the split of class IIα and class IIβ lineages occurring earlier than 350 MYA (sensu Heinicke et al., 2009), i.e. prior to the major shark radiation (Fig. 5). Such an old age of classical class II lineages is unprecedented among vertebrates and highlights not only the long evolutionary history of cartilaginous fish but likely their slow evolution rates (Martin, 1999; Martin and Palumbi, 1993; Renz et al., 2013). However, dedicated molecular phylogenetic analyses including higher taxonomic coverage are advised to ascertain the evolutionary history of MHC class II genes in Elasmobranchs. Phylogenetic trees based on the PBR did not show similar lineage splits and, instead, exhibited a taxon-based sequence clustering. Despite its higher genetic diversity compared to the IgSF domain, the PBR showed lower between-lineage divergence. Such contrasting topologies between the two functionally distinct extracellular domains are in line with expectations of different selective pressures and recombination levels: the PBR may be under selection by taxon-specific pathogen communities and may evolve to better fit with the species’ antigen repertoires, including a higher recombination rate (via gene conversion), while differences at IgSF domain may be more limited given its conserved function. Indeed, the IgSF domain has been shown to provide a better evolutionary signal among MHC class IIβ genes compared to the PBR (e.g. in birds; Burri et al., 2010). Although similar functions and selective pressures may be expected for class I genes, lineage distinction in this case is generally based on the genetic divergence levels and distinct structural features of the PBR, while the IgSF domain shows among-lineage conversion (Almeida et al., 2020; Flajnik et al., 1993; Ohta et al., 2019) perhaps due to the close interaction with the conserved β2-microglobulin. Previous studies in the nurse shark G. cirratum showed linkage between IIα and IIβ genes, as well as linkage to class I genes (Ohta et al., 2000). These results were based on Southern blot analyses of family#1 (used here) using full cDNA probes of DAA and DAB at high stringency, and also showed the presence of paternal DBA “alleles” in two siblings (pups 10 and 13). Here we build on previous data to show that IIα and IIβ genes are found as lineage-specific pairs, namely the presence of DAA genes was exclusively associated with the presence of DAB genes, while the same is true for DBA and DBB genes. In line with previous observations of linked DAA and DAB genes, our results also suggest linkage between DBA and DBB genes. Based on the current results, it is expected that (at least) sharks may exhibit one set of class II genes while others may exhibit an extra set (Fig. 6). Genes in the two lineages segregate as alleles despite belonging to separate loci (Fig. 6B; Kasahara et al., 1993; Ohta et al., 2000). Given the stable nature of Chondrichthyan genomes and evidence of an MHC region linking genes in class I, class II and class III regions (Ohta et al., 2011, 2002, 2000; Terado et al., 2003), coupled with reports of specific sets of genes organized into MHC haplotypes (this study; Ohta et al., 2002), it may be hypothesized that organization into haplotypes may extend to the whole MHC region. In nurse sharks, DAA and DAB genes were found in all individuals analyzed here and appear to be most common gene set, in line with previous reports (Kasahara et al., 1993; Ohta et al., 2000). However, this may not be true across taxa as our results suggest that the most common gene set refers to DBA and DBB genes whose sequences were predominantly detected for the vast majority of the shark taxa analyzed (Fig. 3 and 4). Additional confirmation on genomic DNA are needed to ascertain if both lineages (and gene sets) are also present across shark taxa. In fact, one important caveat of the present dataset is its reliance on genes being expressed in high enough amounts to be detected by non-targeted RNAseq projects (such as those used here); such an approach may limit detection of non-expressed/underexpressed genes that may provide a distinct perspective of the MHC class II evolution in Elasmobranchs. In fact, preliminary data from gene expression analysis on the bull shark C. taurus, for which sequences from all IIα and IIβ lineages were retrieved, showed higher relative expression of DBA and DBB genes compared to DAA and DAB genes (A. Veríssimo, unpublished data). The particular genomic arrangement of class II genes in Elasmobranchs and their segregation as alleles (pseudoalleles) may perhaps explain the absence of between-lineage gene conversion of the IgSF domain, in contrast to that found in class I genes (see details above). In turn, given the linkage between specific lineages of IIα and IIβ genes, we think it is likely that each has co-evolved to better interact with each other. Similarly, Burri et al. (2010) report on a gene duplication event of IIβ genes prior to major avian radiations in which positive selection lead to changes in residues associated with IIα and IIβ chain interaction, suggesting co-evolution of the two genes. Given that between-chain interaction is mostly centered on the α2 and β2 domains (Bondinas et al., 2007; Brown et al., 1993), it may be expected that between-lineage differences may be found predominantly in the IgSF domains, as shown here (Table 2). Our search identified neither the nonclassical class II gene DM from the sequences retrieved from the Chondrichthyan “omic” datasets, nor did the classical class II proteins display the typical residues that interact with DM in mammals (sensu Dijkstra et al., 2013). The absence of sequences showing classical MHC class II features required for DM binding coupled to the lack of typical DM sequences in the present Chondrichthyan dataset are consistent with the absence of a DM-regulating system in basal jawed vertebrates. These results are reinforced by the fact that, although we specifically searched for MHC class II sequences in the cartilaginous “omics” datasets, other divergent sequences were also retrieved such as class I and immunoglobulin sequences. Based on our search strategy, it is likely that no other class II genes are present in cartilaginous fish genomes aside from the classical sequences described above. The evidence gathered so far in this and previous studies indicates that the existence of DM is a derived feature of the lobe-finned fish lineage leading to tetrapods (Dijkstra and Yamaguchi, 2019). Dijkstra et al. (2013) proposed that the critical residues involved in DM function, i.e. Wα43 and Nα125, may have evolved early in the lobe-finned fish lineage as suggested by their simultaneous presence in classical class II sequences of the coelacanth, a basal lobe-fin fish. On the other hand, the Holocephalan C. milii differed from Elasmobranchs by exhibiting Cys at α13 and α66, described as being important to form disulfide bonds in the nonclassical DM (van Lith and Benham, 2006). However, these Cys are present not only in DM molecules but also in the DA, DB and DE lineages of teleosts and in the classical class IIα of the coelacanth (Dijkstra et al., 2013; Dijkstra and Yamaguchi, 2019) and may represent an ancestral trait lost in most vertebrates. It remains to be clarified how peptide loading onto classical MHC class II proteins occurs without a DM molecule, a condition now observed in cartilaginous and bony fish. Several alternative mechanisms assisting peptide-CLIP exchange without DM have been proposed, such as the presence of Ii isoforms with a thyroglobulin domain, reported in basal jawed vertebrates, that may facilitate CLIP dissociation from and peptide binding to MHC class II molecules (Criscitiello et al., 2012); expression of classical class II molecules that bind CLIP with low affinity at low pH or existence of other types of nonclassical class II molecules that may interact with class II proteins in a similar way to the DM (Dijkstra et al., 2013). Along this line, recent studies have shown that small molecules may interact with MHC class II in the endosomal compartment by enhancing CLIP dissociation and peptide binding (reviewed in Call, 2011); these may serve as a simpler alternative to a more complex DM-based system. A final possibility is that fish class II molecules themselves might act in a DM-like fashion as ‘dimers of dimers’ that aid each other in peptide exchange. If this scenario is possible, then it would make sense that the catalyst for peptide exchange in tetrapods is a class II duplicate (DM) rather than any other enzyme found in endosomes.

Conclusions

We provide the first comprehensive overview of MHC class II diversity in the most basal jawed vertebrate lineage, a key taxon to infer ancestral traits in adaptive immunity. All shark, ray and chimaera species analyzed here showed MHC class II sequences exhibiting features reported as conserved across vertebrates, and consistent with classical class II molecules. Notably, our data showed ancient lineage diversification of IIα and IIβ genes in Elasmobranchs in addition to linkage between lineage-specific pairs of IIα and IIβ genes. Finally, our results also support previous claims on the absence of a DM system in basal jawed vertebrates and instead support its late emergence in the lineage leading to tetrapods.
  52 in total

Review 1.  Evolution of the mammalian MHC: natural selection, recombination, and convergent evolution.

Authors:  M Yeager; A L Hughes
Journal:  Immunol Rev       Date:  1999-02       Impact factor: 12.988

Review 2.  Birth of the major histocompatibility complex.

Authors:  J Klein; A Sato
Journal:  Scand J Immunol       Date:  1998-03       Impact factor: 3.487

3.  Negative regulation by HLA-DO of MHC class II-restricted antigen processing.

Authors:  L K Denzin; D B Sant'Angelo; C Hammond; M J Surman; P Cresswell
Journal:  Science       Date:  1997-10-03       Impact factor: 47.728

Review 4.  What chickens might tell us about the MHC class II system.

Authors:  Aimée Parker; Jim Kaufman
Journal:  Curr Opin Immunol       Date:  2017-04-21       Impact factor: 7.486

Review 5.  The Divergent Genomes of Teleosts.

Authors:  Vydianathan Ravi; Byrappa Venkatesh
Journal:  Annu Rev Anim Biosci       Date:  2018-02-15       Impact factor: 8.923

6.  Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1.

Authors:  J H Brown; T S Jardetzky; J C Gorga; L J Stern; R G Urban; J L Strominger; D C Wiley
Journal:  Nature       Date:  1993-07-01       Impact factor: 49.962

7.  HLA-DMA and -DMB genes are both required for MHC class II/peptide complex formation in antigen-presenting cells.

Authors:  S P Fling; B Arp; D Pious
Journal:  Nature       Date:  1994-04-07       Impact factor: 49.962

Review 8.  The mechanism of HLA-DM induced peptide exchange in the MHC class II antigen presentation pathway.

Authors:  Monika-Sarah E D Schulze; Kai W Wucherpfennig
Journal:  Curr Opin Immunol       Date:  2011-12-02       Impact factor: 7.486

9.  Proteasome, transporter associated with antigen processing, and class I genes in the nurse shark Ginglymostoma cirratum: evidence for a stable class I region and MHC haplotype lineages.

Authors:  Yuko Ohta; E Churchill McKinney; Michael F Criscitiello; Martin F Flajnik
Journal:  J Immunol       Date:  2002-01-15       Impact factor: 5.422

10.  The spectrum of HLA-DQ and HLA-DR alleles, 2006: a listing correlating sequence and structure with function.

Authors:  George P Bondinas; Antonis K Moustakas; George K Papadopoulos
Journal:  Immunogenetics       Date:  2007-05-12       Impact factor: 2.846

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.