| Literature DB >> 29931472 |
John C Schwartz1, John A Hammond2.
Abstract
The leukocyte receptor complex (LRC) encodes numerous immunoglobulin (Ig)-like receptors involved in innate immunity. These include the killer-cell Ig-like receptors (KIR) and the leukocyte Ig-like receptors (LILR) which can be polymorphic and vary greatly in number between species. Using the recent long-read genome assembly, Sscrofa11.1, we have characterized the porcine LRC on chromosome 6. We identified a ~ 197-kb region containing numerous LILR genes that were missing in previous assemblies. Out of 17 such LILR genes and fragments, six encode functional proteins, of which three are inhibitory and three are activating, while the majority of pseudogenes had the potential to encode activating receptors. Elsewhere in the LRC, between FCAR and GP6, we identified a novel gene that encodes two Ig-like domains and a long inhibitory intracellular tail. Comparison with two other porcine assemblies revealed a second, nearly identical, non-functional gene encoding a short intracellular tail with ambiguous function. These novel genes were found in a diverse range of mammalian species, including a pseudogene in humans, and typically consist of a single long-tailed receptor and a variable number of short-tailed receptors. Using porcine transcriptome data, both the novel inhibitory gene and the LILR were highly expressed in peripheral blood, while the single KIR gene, KIR2DL1, was either very poorly expressed or not at all. These observations are a prerequisite for improved understanding of immune cell functions in the pig and other species.Entities:
Keywords: Annotation; Immunoglobulin domains; Leukocytes; Natural killer cells; Pseudogenes; Sus scrofa
Mesh:
Substances:
Year: 2018 PMID: 29931472 PMCID: PMC6182393 DOI: 10.1007/s00251-018-1067-1
Source DB: PubMed Journal: Immunogenetics ISSN: 0093-7711 Impact factor: 2.846
Fig. 1Structure and organization of the LRC in a humans and b–d pigs. a Organization of the human LRC, with KIR haplotype A, in the current genome assembly GRCh38.p12. b Recurrence plot comparing the Sscrofa10.2 (y-axis) and Sscrofa11.1 (x-axis) assemblies. Sequence gaps are indicated as black boxes above and at right. LRC-associated genes and their respective orientations in the Sscrofa11.1 assembly are indicated below the recurrence plot. Red shading indicates a region of approximately 197 kb that is present in the Sscrofa11.1 assembly, but absent from the Sscrofa10.2 assembly. c Organization of the LRC in Sscrofa11.1 with the contig inversion at the centromeric end corrected. The entire region spanning VSTM1 to GP6 is approximately 849 kb, excluding the poly-N sequence gap. d Organization of genes on the BAC clone PigE-173F2 (GenBank: CR853303), which was previously described by Sambrook et al. (2006) and re-annotated here. Parts a, c, and d are drawn to the same scale (shown at bottom) and the centromeric/telomeric orientation is the same in pigs and humans. Gene orientation is indicated with arrows pointing in the direction of transcription. Gene families are differentially colored to ease visualization. Ig-like domains are indicated as either hexagons or triangles above the respective gene, and sub-group domain phylogeny is indicated by its vertical position (e.g., human LILRB4 is lacking both the second and third Ig domains). Open symbols indicate non-functional domains and pseudogenes, whereas closed symbols indicate that they are putatively functional. Long- or short-tailed intracellular domains are also shown to indicate whether a gene encodes an inhibitory or activating receptor, respectively
Fig. 2Phylogenetic analysis of Ig-like genes in the LRC using nucleotide extracellular coding region sequences. All intact porcine LRC Ig-like genes are shown to illustrate that the LILR and novel Ig-like genes form distinct clades. For comparison, all functional human LILR are shown, as well as the novel inhibitory genes from human, Sunda flying lemur, horse, little brown bat, cattle, sheep, and goat. Branch node values indicate the percentage of replicate trees in which the associated sequences clustered together (based on 1000 bootstrap replications) and only values > 50% are indicated. Branch length units are the number of substitutions per site. Except for the novel short-tailed gene sequence, which is derived from PigE-173F2, all porcine sequences are derived from the Sscrofa11.1 assembly. Genome coordinates for the non-porcine novel inhibitory genes are described in the “Methods” section. The GenBank accession numbers for the human LILR genes depicted here are as follows: LILRB1 (NM_006669.6), LILRB2 (NM_005874.4), LILRA1 (NM_006863.3), LILRA2 (NM_001130917.2), LILRA3 (NM_006865.4), LILRB3 (NM_001081450.2), LILRB4 (NM_001278426.3), LILRB5 (NM_001081442.2), LILRA4 (NM_012276.4), LILRA5 (NM_021250.3), and LILRA6 (NM_024318.3)
Fig. 3Expression of LRC Ig-like genes in porcine peripheral blood. Read numbers were normalized to fragments per kilobase of transcript per million sequenced reads (FPKM) in order to account for differences in transcript lengths and sequencing depth. Error bars indicate one standard deviation based on FPKM values for each animal. The values for LILR are based on reads that map to either of the two clades shown in Fig. 2. The values for the novel Ig-like genes are based on reads that map only to the region of the gene encoding the intracellular tail, as this region is unique between the two genes. All of the data, including numbers of reads, are shown in Supplementary Table 2