| Literature DB >> 29544443 |
Sophie Mißbach1,2, Denis Aleksic1, Lisa Blaschke1, Timm Hassemer1,3, Kyung Jin Lee1,4, Martin Mansfeld1, Jana Hänske1, Johannes Handler5, Robert Kammerer6,7.
Abstract
BACKGROUND: The CEA gene family is one of the most rapidly evolving gene families in the human genome. The founder gene of the family is thought to be an ancestor of the inhibitory immune checkpoint molecule CEACAM1. Comprehensive analyses of mammalian genomes showed that the CEA gene family is subject to tremendous gene family expansion and contraction events in different mammalian species. While in some species (e.g. rabbits) less than three CEACAM1 related genes exist, were in others (certain microbat species) up to 100 CEACAM1 paralogs identified. We have recently reported that the horse has also an extended CEA gene family. Since mechanisms of gene family expansion and diversification are not well understood we aimed to analyze the equine CEA gene family in detail.Entities:
Keywords: Alternative splicing; CEA gene family; CEACAM; Evolution; Gene duplication; Horse; Signaling motifs
Mesh:
Substances:
Year: 2018 PMID: 29544443 PMCID: PMC5856374 DOI: 10.1186/s12862-018-1145-x
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Comparison of equine CEACAM TM domains. Phylogenetic trees were constructed from transmembrane domain exon nucleotide sequences (a), and amino acid sequences (b) using the UPGMA (a) and ML (b) method (MEGA 6.0 software). The reliability of a phylogenetic tree was assessed using the Bootstrap test applying 500 replicates. The statistical support for selected nodes is shown. Boxes group CEACAMs with the indicated properties. Note that on amino acid level the association of the TM with cytoplasmic signaling motifs is not visible. The transmembrane domain of Equine CEACAM19 was used as an out group. c Amino acid sequences of transmembrane domains of equine CEACAM-1 related CEACAMs. Predicted sequence of transmembrane helices by the TMHMM Server v. 2.0 http://www.cbs.dtu.dk/services/TMHMM-2.0/ are underlined. Stop codons within the transmembrane domain exon are indicated in red. Amino acids are depicted in single letter code. TM, transmembrane domain; CC, CEACAM; ITAM, immunoreceptor tyrosine-based activation motif; ITIM, immunoreceptor tyrosine-based inhibition motif
Expression of membrane anchored equine CEACAMs in various tissues of the horse
| Tissue | CC1 | CC19 | CC41 | CC42 | CC43 | CC45 | CC50 | CC53 | CC54 |
|---|---|---|---|---|---|---|---|---|---|
| Salivary gland | + | ||||||||
| Tongue | ++ | ++ | |||||||
| Oesophagus | ++ | ++ | + | ||||||
| Liver | +++ | ++ | + | ||||||
| Duodenum | ++ | ++ | +++ | +++ | ++ | ++ | |||
| Jejunum | + | ++ | ++ | ||||||
| Ileum | +++ | ++ | ++ | +++ | ++ | ||||
| Caecum | + | ++ | + | +++ | ++ | +++ | |||
| Colon | ++ | + | ++ | + | ++ | ||||
| Rectum | + | ++ | + | ++ | |||||
| Trachea | ++ | ++ | |||||||
| Lung | + | ||||||||
| Kidney | + | ++ | |||||||
| Bladder | |||||||||
| Vulva | + | + | +++ | ++ | +++ | ||||
| Spleen | + |
number of animals = 2
+++ = strong expression; ++ = moderate expression; + = weak expression
Expression of membrane anchored equine CEACAMs in naïve and IL-2 stimulated immune cells
| Cell type | CC1 | CC19 | CC41 | CC42 | CC43 | CC45 | CC50 | CC53 | CC54 |
|---|---|---|---|---|---|---|---|---|---|
| PBMC | + | ||||||||
| PBMC 7d_IL2 | ++ | ++ | + | +++ | ++ | ||||
| PBMC 14d_IL2 | ++ | + | ++ | ||||||
| PBMC 19d_IL2 | ++ | + | +++ | ||||||
| Granulocytes | +++ | +a | ++ |
n = 3
+++ = strong expression; ++ = moderate expression; + = weak expression
aonly in one sample positive
Gene-specific oligonucleotides for expression analyses and cDNA cloning of horse CEA gene family members
| Gene | Oligonucleotide sequence | Location of primers (exon) | Size of PCR product (bp) |
|---|---|---|---|
| CEACAM1 | For: TGCATCATATAAGATAGGCCCAG | N domain | 367 |
| Rev: AGTGAGAGTCCTCTTGTCCAGG | A1 domain | ||
| CEACAM41 | For: CATTGCATGTGATAGAGCGAC | N domain | 246 |
| Rev: CGTCCTTCTGTTCTGTGACTGT | A2 domain | ||
| CEACAM42 | For: AGGGGAAGGAATAGATCCCG | N domain | 392 |
| Rev: GAGTCCTGTTGTCCGAGGATAG | A1 domain | ||
| CEACAM43 | For: CCCATCAAGAAATTGTGTCCT | N domain | 276 |
| Rev: ATGTTAACACTACAGGGTCCCTG | A1 domain | ||
| CEACAM44 | For: GCTGTTGTAGGGACCGATGTTA | N domain | 503 |
| Rev: CCTCCTTCCTGATGATGTGTGT | A2 domain | ||
| CEACAM45 | For: GCGATAGGGCAACAAGAAATTAT | N domain | 254 |
| Rev: CATGTTAACACTACAGGGTCCTCA | A1 domain | ||
| CEACAM46 | For: AGTCCCACCCAATGGTATCC | N domain | 525 |
| Rev: CCCAAGTATTGCCCCTTCTGT | A1 domain | ||
| CEACAM47 | For: ACAGACCAAGTCCCAAAACC | N domain | 256 |
| Rev: TTACGTCTTTTGACGATTCCAG | A1 domain | ||
| CEACAM48 | For: GACCAGCTCGCAAACAAA | N domain | 279 |
| Rev: AACAAGCTTCTTATCTGGCATTT | A2 domain | ||
| CEACAM49 | For: TGGAGCACGTCCACATAAAC | N domain | 232 |
| Rev: GGAGGTATTTGACCCTGGATT | A2 domain | ||
| CEACAM50 | For: AGATGCTCTTGAAGGAACGGAT | N domain | 453 |
| Rev: GACAGCTTCAGCCAGGTCCTA | A1 domain | ||
| CEACAM51 | ND | NA | NA |
| CEACAM52 | For: ATGCTGCTGCAGGGGATA | N domain | 508 |
| Rev: CATCCTCCCTCCTGACACAT | A2 domain | ||
| CEACAM53 | For: TTCAAAGGGGAAATAGATTCCA | N domain | 394 |
| Rev: GAGTCCTGTTGTCTGGGGAC | A1 domain | ||
| CEACAM54 | For: ATCAGTCCCTGGCTTCAGA | N domain | 409 |
| Rev: TACACGGAGCTGTATACTTC | A domain | ||
| CEACAM55 | For: CCCTACTAGTCACGAGGAAGAAC | N domain | 222 |
| Rev: CATCCTCTCGGTCAGTCACA | A2 domain | ||
| CEACAM1 | For: GTCAGTAAGCTTCAGCAGACACCATGGAACTC | 5’UTR | 1624 |
| Rev: GTCAGTTCTAGACAGTGAACAGGGCAGGATATG | 3’UTR | ||
| CEACAM41 | For: GAGCAGTGCTTGTGAGCATT | 5’UTR | 1131 |
| Rev: TAAGGGGAAGTTCCTGAAGG | 3’UTR | ||
| CEACAM42 | For: TCACAGAGGGAGGGACAGAG | 5’UTR | |
| Rev: GGGTAAAGGGATCCTTCCAG | 3’UTR | 1390 | |
| Rev: TCACAGAGGGAGGGACAGAG | TM domain | 1121 | |
| CEACAM43 | For: GTCAGTAAGCTTGACAGAGCAGGCAGCAGAC | 5’UTR | 1613 |
| Rev: GTCAGTTCTAGAGAACAGGGCAGGTTGCAT | 3’UTR | ||
| CEACAM45 | For: TCACAGAGGAAAGGACAGAGC | 5’UTR | |
| Rev: CCTTGATTCCTGGACATTGAA | 3’UTR | 1539 | |
| Rev: CCTTGATTCCTGGACATTGAA | TM domain | 1379 | |
| CEACAM50 | For: CAGGAGTGCTTGTGAGAGTT | 5’UTR | 1139 |
| Rev: AGGGAAGAGGCTTCGTCTTC | 3’UTR | ||
| CEACAM52 | For: GTCAGTAAGCTTGCAGAAGCTCATCTCACAGAG | 5’UTR | 583 |
| Rev: GTCAGTTCTAGATTGTGAGGCAGATCAGATCC | 3’UTR | ||
| CEACAM53 | For: CAGAGGGAGGGACAGAACAG | 5’UTR | 1163 |
| Rev: ATGGCAGTTAGCCTTGGAGA | 3’UTR | ||
| CEACAM54 | For: TCTCACAGAGGGAGGAGCCA | 5’UTR | 916 |
| Rev: AGTCAGCAGTGCAGGAAACA | 3’UTR |
ND not done, NA not applicable, For forward primer, Rev. reverse primer
Fig. 2mRNA of equine CEACAMs. From all indicated CEACAMs the genomic exon structure is shown on top and below the different transcript variants observed in this study were depicted. Differential splicing was found for CEACAM1 (4 transcripts), CEACAM41, CEACAM45 and CEACAM54 (each 2 different transcripts). Transcripts coding for secreted CEACAMs were found for CEACAM41 and CEACAM54. No transcript coding for a CEACAM with an ITAM signaling motif was identified. Transcripts coding for CEACAMs with ITIM/ITSM motifs were found only for two CEACAM genes. The majority of transcripts code for CEACAMs with a short cytoplasmic tail lacking immunotyrosine-based motifs. No indication for GPI-anchored CEACAMs was found. Red stars indicate the presence of a stop codon, mutated splice sites are marked with a red tilde
Fig. 3CEACAM42 has an extended transmembrane domain. The exon structure of the CEACAM42 gene indicates that its gene product may contain an ITIM signaling motif. However as shown in a the cloned mRNA demonstrate that an alternative splice donor site of the transmembrane exon is used and that the exons C1 was not integrated into the mRNA. The stop codon which exists in the extended transmembrane exon is indicated by the red arrow and the “stop”. The mutated splice donor sites of C1 is indicated with a red tilde. b shows the possible but unused splice donor site (underlined). c Amino acid sequence of the transmembrane and cytoplasmic part of CEACAM42. The PKA phosphorylation site is highlighted in blue, the predicted Y-based sorting signal in red. d Expression of CEACAM42 was detected in various parts of the intestine
Fig. 4CEACAM50 has unique signaling motifs within the CEA gene family. The exon structure of the CEACAM50 gene indicates that its gene product may contain an ITIM signaling motif. However as shown in a the cloned mRNA demonstrate that an alternative splice donor site of the transmembrane exon is used and that the exons C1 and C2 were not integrated into the mRNA. b shows the possible but unused splice donor site (underlined). c Amino acid sequence of the transmembrane and cytoplasmic part of CEACAM50. The PKA phosphorylation site is highlighted in blue, further phosphorylation sites were indicated in bold. d Expression of CEACAM50 was detected in various parts of the intestine. e Expression of CEACAM50 fused to a FLAG-tag at the N-terminus by transfected Cos7L. Red stars indicate the presence of a stop codon
Fig. 5CEACAM54 contains a P-T-R rich membrane proximal extracellular region. a Exon structure of the CEACAM54 gene and the two transcript variants identified in the present study. The simple sequence inserted in front of the transmembrane domain and the translation into the amino acid sequence (blue box) are depicted in (b). c Expression of transcript variant 1 (916 bp) in the intestine and trachea and of variant 2 (650 bp) in the vulva mucosa. Cartoon of the Flag-tag-CEACAM54 fusion protein (d). Expression of the Flag-tagged CEACAM54 at the cell surface of Cos7L cells as detected by flow cytometry (e). Red stars indicate the presence of a stop codon, mutated splice sites are marked with a red tilde
Fig. 6CEACAM41 is a secreted protein. a PBMC were cultured with 500 U/ml rhIL-2 for the indicated times. RT-PCR using primers located in the leader and the N-domain suggested that CEACAM41 is preferentially expressed by stimulated PBMC. Full length amplification of CEACAM41 cDNA identified two different transcripts (b). The two transcripts differ from each other by a short sequence, inserted between exon C2 and C3 further named C2’ exon (c). The stop codon which exists in the extended C1 exon is indicated by the red arrow and the “stop” (c). Both transcripts code for the same protein (d). e Comparison of the amino acid sequence of the cytoplasmic tail of CEACAM41 as predicted from the exon sequence of the CEACAM41 gene and the peptide sequence encoded by the cytoplasmic exons in transcripts without transmembrane domains