| Literature DB >> 15647114 |
Andrew S McLellan1, Beate Fischer, Gabriela Dveksler, Tomomi Hori, Freda Wynne, Melanie Ball, Katsuzumi Okumura, Tom Moore, Wolfgang Zimmermann.
Abstract
BACKGROUND: The pregnancy-specific glycoprotein (Psg) genes encode proteins of unknown function, and are members of the carcinoembryonic antigen (Cea) gene family, which is a member of the immunoglobulin gene (Ig) superfamily. In rodents and primates, but not in artiodactyls (even-toed ungulates / hoofed mammals), there have been independent expansions of the Psg gene family, with all members expressed exclusively in placental trophoblast cells. For the mouse Psg genes, we sought to determine the genomic organisation of the locus, the expression profiles of the various family members, and the evolution of exon structure, to attempt to reconstruct the evolutionary history of this locus, and to determine whether expansion of the gene family has been driven by selection for increased gene dosage, or diversification of function.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15647114 PMCID: PMC546212 DOI: 10.1186/1471-2164-6-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of mouse PSG nomenclature and sequence accession numbers
| Psg16 | bCEA | AC148976.2 (RC 40000–60000) | predicted CDS: join (1878–1941, 4115–4462, 7291–7650, 8750–9109, 11758–12041); |
| Psg17 | Cea2, mmCGM5 | NM_007677 | |
| Psg18 | Cea3, mmCGM6 | NM_011963 | |
| Psg19 | Cea4 | NM_011964 | |
| Psg20 | Cea7 | AC079497.1 (113793–127892) | predicted CDS: join (1770–1836, 2989–3345, 4997–5356, 6587–6937, 13114–13397) |
| Psg21 | Cea8 | NM_027403 | |
| Psg22 | Cea9 | NM_001004152.1 | |
| Psg23 | Cea11 | NM_020261 | |
| Psg24 | Cea12 | AC079526 (115000–131000) | predicted CDS: join (1648–1696, 2771–3130, 5965–6324, 9351–9710, 10943–11302, 12844–13191, 14196–14479) |
| Psg25 | Cea13 | NW_000292.1 (RC 890000–910000) | predicted CDS: join (4905–4968, 7121–7480, 10406–10765, 11988–12347, 15508–15791) |
| Psg26 | Cea14 | join (CAAA01217140.1 {RC 1–6315}, CAAA01213459.1 {557–4715}, CAAA01175422.1 {155–2891}) | predicted CDS: join (2148–2211, 3292–3651, 5836–6195, 7507–7866, 10823–11106) |
| Psg27 | Cea15 | AC087156.1 (RC 139366–153050) | predicted CDS: join (240–303, 2037–2393, 5271–5630, 6669–7028, 10039–10322) |
| Psg28 | Cea16 | NM_054063 | |
| Psg29 | Cea17 | AC079526 (183285–194009) | predicted CDS: join (1459–1522, 2658–3005, 6275–6634, 8128–8487, 9428–9700) |
| Psg30 | XM_145406 | GNOMON prediction in NCBI | |
| Psg31 | AC134475.3 (10000–70000) | predicted CDS: join (3923–3986, 5262–5621, 19366–19725, 34382–34741, 36822–37172, 40760–41119, 42413–42763, 47310–47669, 49090–49443, 50473–50756) | |
| Psg32 | Psg-ps1 | XR_000250 | GNOMON prediction in NCBI |
a Where nucleotide start and end positions are shown in parenthesis after accession numbers, they refer to the start and end positions of the genomic sequence excerpt (encompassing the PSG exons) that is included in Additional file 1. RC indicates that the sequence in Additional file 1 is the reverse complement. b Where we have predicted the full CDS of a PSG (based on common structure and splice sites), the numbers shown refer to exon start and end positions within the excerpted sequence included in Additional file 1.
Figure 1Domain organization of mouse PSGs. Mouse PSGs are composed of 3 – 8 IgV-like N domains and one IgC-like A domain. The relative position of potential N-glycosylation sites (consensus amino acid sequence: asparagine-X-threonine / serine; X any amino acid except proline) were identified using the NetNGlyc 1.0 Server online software and indicated by lollipops. Although PSG32 is probably not routed through the endoplasmic reticulum, the putative N-glycosylation sites are shown for comparison. Of the two PSG16 splice variants, only the variant expressed in the placenta is shown.
Figure 2Evolutionary relationships between mouse PSG IgV-like domains. An unrooted evolutionary tree based on ClustalX amino acid sequence alignments showing the relationships between all mouse PSG N-domains. The three main groups N1, N2 and N3 have been ringed for clarity. The scale bar represents 0.1 amino acid substitutions per site.
Figure 3Domain expansion of . A. NJ-trees based on ClustalX amino acid sequence alignments showing: (i) the evolution of PSG24 IgV-like domains compared to those of PSG17; (ii) the evolution of PSG30 IgV-like domains compared to those of PSG17; (iii) the evolution of PSG31 IgV-like domains compared to those of PSG17. The trees were rooted using an outgroup consisting of the N-domain amino acid sequences of human PSG1, PSG2 and PSG3. Alignments were bootstrapped 1000 times yielding node values which are represented as follows < 50%: no mark; 50–74%: marked *; 75–94%: marked **; ≥ 95%: marked ***. The scale bar represents 0.1 amino acid substitutions per site. B. The arrangement of domains represented by boxes shaded: cyan for leader (L) peptides; light pink for the N1-domains; dark pink for N2 and N2-like domains; red for N3 and N3-like domains; blue for A-domains. (i) Comparison of Psg17 and Psg24 exon arrangement including identities of amino acid sequence alignments. (ii) Comparison of Psg30 and Psg31 exon arrangements including identities of amino acid sequence alignments. C. Predicted model of IgV-like domain expansion by exon duplications in (i) Psg24 and (ii) Psg30 and Psg31.
Figure 4Expression of . Total RNA (1 μg) from day 10.5, 12.5, 15.5 and 17.5 BALB/c placentae was reverse transcribed using an oligo (dT) oligonucleotide (reverse PCR primer). After addition of the degenerate Psg-all oligonucleotide (forward PCR primer), which anneals to the cDNA of all known members of the mouse Psg family, Psg cDNAs were amplified by PCR (see schematic diagram depicting generalised mouse Psg cDNA amplification). Aliquots were size-separated by agarose gel electrophoresis. a, PCR products were visualised by ethidium bromide staining. b-o, the amplification products were blotted onto nylon membranes and individual blots were hybridised with single gene-specific 32P-labelled oligonucleotides from the N1 domain regions (Table 2). The location of the primers used for amplification of the Psg cDNAs and the region from which the sequences of the gene-specific oligonucleotides were derived are shown together with a schematic representation of mouse Psg mRNA. The 5'- and 3'-untranslated regions are shown as bold lines. L, leader; N1-N3, IgV-like domains; A, IgC-like domain.
Oligonucleotides used in this study
| Psg17A5' | 5'-CTTGCCACACAGCCCGTCAT-3' | Psg17 A domain | |
| Psg17A3' | 5'-TCATCACAGCCAGGATGACT-3' | Psg17 A domain | |
| mPsg-5' | 5'-AWCCTSYTGSYTCCTGC-3'a | N1 domain | binds to several mouse Psg cDNA sequences |
| mPsg-3' | 5'-TGMARGWAYAKGGATGT-3'a | N1 domain | binds to several mouse Psg cDNA sequences |
| PsgN1-F | 5'-GA | intron 1/N1 exon | for the amplification of all known |
| PsgN1-R | 5'-CC | N1 exon/intron 2 | for the amplification of all known |
| Psg32N1-F | 5'-GA | N1 domain | |
| Psg32-exon1 | 5'-GAGGTGTCCTTGGTGCTTCTC-3' | exon 1 | Psg32-specific |
| oligo (dT) | 5'-TTCTAGAATTCAGCGGCCGC(T)30 VN-3'a | poly(A) tail | |
| Psg-all | 5'-CCTCCMTYTTDDCCTRCTGS-3'a | N1 domain | binds to all known Psg cDNA sequences except Psg32 |
| bCEAN/2 | 5'-GCAAATGTACAGTGGTAG-3' | N1 domain | Psg16-specific |
| Psg17N | 5'-GTGGAATTCTTACCTCCC-3' | N1 domain | Psg17-specific |
| Psg18N | 5'-GGCTGTACTACTATAGTG-3' | N1 domain | Psg18-specific |
| BK07 | 5'-AAAGTGCCACCCGGGAA-3' | N1 domain | Psg19-specific |
| Psg20N | 5'-TGCCAAGGTCACTATCCA-3' | N1 domain | Psg20-specific |
| Psg21N | 5'-GCTCTGCATTTTCTGGAC-3' | N1 domain | Psg21-specific |
| 35N | 5'-GTCTGGTATAGAGGGGTG-3' | N1 domain | Psg22-specific |
| 53N | 5'-GCTGTGTATTTACTGGAC-3' | N1 domain | Psg23-specific |
| 9.3N1 | 5'-ATAGCAGAGGTGTGACG-3' | N1 domain | Psg24-specific |
| 11.2N1 | 5'-ATCTTCTAGGCCTTGCC-3' | N1 domain | Psg25-specific |
| 189N | 5'-CATTCGCTGTACTATAGTG-3' | N1 domain | Psg26-specific |
| 214N | 5'-CGAGTCACCATCCATTCA-3' | N1 domain | Psg27-specific |
| 2128N | 5'-GCACTATAGTTTAACAGCG-3' | N1 domain | Psg28-specific |
| 9140N | 5'-TGCAGTGGTGTCTGACTT-3' | N1 domain | Psg29-specific |
| Psg-ps1N | 5'-TTAGTGCCACCACAAGTG-3' | N1 domain | Psg32-specific |
a Standard IUB/IUPAC nucleic acid codes codes have been used to indicate degeneracy where: R = G/A; Y = T/C; K = G/T; M = A/C; S = G/C; W = A/T; B = G/T/C; D = G/A/T; H = A/C/T; V = G/C/A; N = A/C/G/T.
Figure 5Virtual Northern analysis of the mouse . The nucleotide sequences of the Psg exons encoding the N1 or the A domains were used in NCBI-BLAST searches of the GenBank mouse EST database (March 16, 2004) for the presence of Psg transcripts (virtual Northern analysis). A hit was registered when a 100% match for a sequence > 150 bp was observed. Obvious mismatches such as unidentified nucleotides (N) or single nucleotide insertions or deletions (especially at the end of a sequence run) were ignored.
Figure 6Physical map of mouse . A. The order of the Psg genes was inferred from the presence of the various genes on overlapping cosmid, BAC and YAC clones. The position of Psgs represented by filled boxes is unequivocal, whereas the position of those represented by open boxes is ambiguous. Arrows between pairs of genes indicate that their order remains unresolved. The distances between individual genes are not shown to scale. Chimeric YACs mapping to separate chromosomes are indicated by stippled and solid lines. The solid lines correspond to chromosome 7 regions containing the Psg genes indicated above. The locations of the non-chromosome 7 regions are not known. Only the sizes of non-chimeric YACs have been determined and are shown (size bar corresponds to 100 kb). The centromere (cen) / telomere (ter) order and the relative orientation of the two Psg gene subclusters were resolved by FISH mapping. B. Two-colour FISH prophase mapping of relative orientation of the two Psg gene subclusters using mouse m5S cells and C57BL/6CrSlc mouse lymphocytes. (i) FISH pattern representative of 38 experiments where BAC 310D2 in subcluster 1, labelled with rhodamine (R), is centromeric to BAC 600E2 from subcluster 2, labelled with fluorosceine (F). (ii) FISH pattern representative of 38 experiments where BAC 310D2 in subcluster 1, labelled with rhodamine, is centromeric to YAC F10104 from subcluster 2, labelled with fluorosceine. (iii) Orientation of subcluster 2 determined by relative positions of BAC 572D4, labelled with rhodamine, which is telomeric to YAC F10104, labelled with fluorosceine.