| Literature DB >> 16507138 |
Alistair R R Forrest1, Darrin F Taylor, Mark L Crowe, Alistair M Chalk, Nic J Waddell, Gabriel Kolle, Geoffrey J Faulkner, Rimantas Kodzius, Shintaro Katayama, Christine Wells, Chikatoshi Kai, Jun Kawai, Piero Carninci, Yoshihide Hayashizaki, Sean M Grimmond.
Abstract
BACKGROUND: Alternative transcripts of protein kinases and protein phosphatases are known to encode peptides with altered substrate affinities, subcellular localizations, and activities. We undertook a systematic study to catalog the variant transcripts of every protein kinase-like and phosphatase-like locus of mouse http://variant.imb.uq.edu.au.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16507138 PMCID: PMC1431701 DOI: 10.1186/gb-2006-7-1-r5
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Protein kinase and phosphatase loci of mouse
| Classification | |
| Kinase-like | 527 |
| Phosphatase-like | 160 |
| Transcript evidence | |
| Observed transcript | 680 |
| Gene predictions | 7 |
| Gene architecture | |
| Multi-exon | 679 |
| Single exon | 8 |
| Total | 687 |
cDNA evidence
| Transcript support | 5' end | 3' end |
| FANTOM3 | 3,211 | 3,211 |
| PUBLIC | 2,666 | 2,666 |
| 5' ESTs | 20,866 | - |
| 3' ESTs | - | 32,166 |
| Public ESTs | 41,543 | 15,989 |
| GIS | 1,279 | 1,279 |
| GSC | 27,616 | 27,616 |
| CAGE | 162,707 | - |
| Total count | 259,888 | 82,927 |
Breakdown of supporting transcript evidence used in the paper: full-length cDNAs (FANTOM3, public), expressed sequence tags (ESTs; public ESTs, and RIKEN 5' and 3' ESTs), capped analysis of gene expression (CAGE) tags, and DiTags (gene identification signature [GIS] and Genome Sciences Centre [GSC]).
Support for alternative transcription starts and stops within the phosphoregulator set
| End | 5 counts | 10 counts | 20 counts | 50 counts | |
| 5' | 5' exon clusters | 1086/612 (1.8) | 852/576 (1.5) | 730/543 (1.3) | 577/480 (1.2) |
| TSS clusters | 1289/609 (2.1) | 924/572 (1.6) | 742/533 (1.4) | 550/472 (1.2) | |
| 3' | 3' exon clusters | 976/611 (1.6) | 750/564 (1.3) | 576/495 (1.2) | 335/307 (1.1) |
| TTS clusters | 1600/620 (2.6) | 1054/566 (1.9) | 685/483 (1.4) | 307/262 (1.2) |
Number of 5' or 3' ends are shown for thresholds of 5, 10, 20 or 50 supporting tags. Shows the number of ends divided by the number of genes, and the ratio in brackets Note that at a threshold of 50, the number of genes with 3' end support is almost half that with 5' support. TSS, transcription start site; TTS, transcription termination site.
Three types of alternative transcription starts identified in this study. (a) ME-Exon: mutually exclusive starting exons (Sgk; GenBank:AK132234 and GenBank:AK086892). (b) Intronic: starts within introns that run into the next exon (Egfr; GenBank:AF275367 [longer form] and GenBank:AK087861 [shorter intronic start form]). (c) Exonic: starts within exon of longer transcript (Ntrk1; GenBank:AK081588 and GenBank:AK148691; supported by a CpG island and 5'-RACE). 5'-RACE, 5' rapid amplification of cDNA ends.
Loci with well supported alternative 5' exons
| Intron | Type | Count | MGI symbol |
| 1 | ME_exon | 16 | Abl1, Adck1, Brd4, Dusp14, Mark2, Pak1, Pdp1, Pkn3, Prkacb, Prkar1a, Ptp4a3, Ptprs, Raf1, Riok2, Sgk, Srpk2 |
| Intronic | 9 | Acvrl1, Ccrk, Cdk9, Ntrk2, Pim3, Ppp4c, Prkcn, Prkwnk1 | |
| 2 | ME_exon | 1 | Sgk3 |
| Intronic | 1 | Ptp4a2 | |
| 3-4 | ME_exon | 6 | Mast3, Limk2, Pak6, Pftk1, Pkn1, Prkcz |
| Intronic | 0 | ||
| 5> | ME_exon | 6 | Dcamkl1, Lats2, Plk1, Ptprd, Tns1, Tns3, Ttn |
| Intronic | 2 | Mylk, Ptpro |
The Intron column refers to the intron where alternative transcript begins, and the Count column shows the number of loci in each class. Intronic, starts in intron runs into next exon; ME_exon, mutually exclusive first exons.
Breakdown of transcript and peptide sets used in the variant analyses
| Total set | Full-length cDNAs | Transcript isoforms | Peptide encoding transcripts | Peptide isoforms | Domain combinations | |
| Loci | 687 | 676 | 676 | 639 | 639 | 639 |
| Variants | - | 5,877 | 4,496 | 2,358 | 1,469 | 1,080 |
Unique transcripts and unique peptides were identified by the Isoform Transcript Set (ITS) and Isoform Peptide Set (IPS) sequences identified by Carninci and coworkers [20].
Figure 2Relationship between transcript isoforms, peptide isoforms, and domain combinations.
Catalytic variants lacking all accessory domains
| MGD symbol | Transcripts | Catalytic | Accessory domains removed |
| B230120H23Rik | AB049732 | + | SAM, H+ transporter IPR000194 |
| Bmp2k | AK046752 | + | IPR011051 RmlC-like cupin |
| Camk2d | AK032524 | + | NTF2 |
| Dcamkl1 | AK032424 | + | Doublecortin domain |
| Ddr2 | AK132504 | + | Ligand binding ectodomain |
| Irak2 | AY162380 | + | Death domain |
| Jak1 | BC031297 | + | SH2, Band4.1/Ferm |
| Map3k14 | AF143094 | omega toxin-like. (SSF57059) | |
| Mapk8 | AB005663 | + | H+ transporter IPR000194 |
| Mast1 | AK141034 | + | PDZ domain (IPR001478). |
| Pik3r4 | AK042361 | + | ARM repeat fold, WD40 repeats and HEAT repeats. |
| Plk4 | AK045082 | + | C-terminal polo-box domain |
| Ppm1a | AF369981 | + | SSF81601 Protein serine/threonine phosphatase 2C, C-terminal |
| Prkx | AK039088 | + | Protein kinase c terminal domain(IPR000961) |
| Ptpn21 | D83072 | + | Band4.1/Ferm |
| Ptprb | AF157628 | + | Ligand binding ectodomain |
| Ptprd | BC025145 | + | Ligand binding ectodomain |
| Ptpre | U36758 | + | Ligand binding ectodomain |
| Ptprg | AK144283 | + | Ligand binding ectodomain |
| Ptprs | AK159320 | + | Ligand binding ectodomain |
| Ptpro | U37466 | + | Ligand binding ectodomain |
| Rps6kc1 | BC058403 | + | MIT, PX |
| Stk36 | AK007188 | + | ARM repeat fold |
| Tns1 | AK053112 | + | SH2 and pleckstrin homology/phosphotyrosine interaction domain |
| Zap70 | AB083210 | + | SH2 |
Noncatalytic variants with the full set of accessory domains
| MGD symbol | Transcripts | Catalytic | Accessory domains in noncatalytic form |
| Araf | AK133797 | - | Ras-binding domain (IPR003116), PKC PE/DAG binding domain (IPR002219) |
| Camk2a | X87142 | - | C-terminal SSF54427 domain |
| Cwf19l1 | AK088543 | - | CwfJ domain only |
| D10Ertd802e | AK139747 | - | ARM repeat fold only |
| Dcamkl1 | AK043874 | - | Doublecortin domain |
| Dusp16 | AK035652 | - | Rhodanese domain only |
| Egfr | BC023729 | - | Ligand binding ectodomain |
| Eif2ak3 | AK010397 | - | Quinonprotein alcohol dehydrogenase-like motif (IPR011047) |
| Ksr | AK164833 | - | PKC PE/DAG (IPR002219) |
| Map2k5 | BC013697 | - | Octicosapeptide/Phox/Bem1p domain (IPR000270). |
| Map3k14 | AK006468 | - | Omega toxin-like (SSF57059) |
| Mark3 | AK075742, BC026445 | - | Ubiquitin associated domain and kinase associated c-terminal domain |
| Mast2 | AK004728 | - | PDZ |
| Mtm1 | AK149997 | - | Gram |
| Prkwnk1 | BB619950 | - | TONB box, site specific DNA methyltransferase |
| Ptpn14 | AF170902 | - | Band4.1/Ferm and Pleckstrin homology |
| Syk | AK036736 | SH2 | |
| Tns1 | AK004758 | - | SH2 and pleckstrin homology/phosphotyrosine interaction domain |
Variant kinase and phosphatase receptor forms of mouse
| Type | Loci | Novel | Knowna |
| Secreted | Alk, Csf1ra, Egfrab, Epha1b, Epha3a, Epha5, Epha7b, Epha10a, Ephb1, Flt1ab, Flt3b, Insr, Insrr, Kdr, Met, Ptk7, Ptprc, Ptprdb, Ptprgb, Ptprkab, Ptprn, Ptprn2, Ptprob, Ptprr, Ptprsb, Ptprz1ab | 19 | 7 |
| Tethered | Axlb, Bmpr1a, Csf1r, Epha4, Epha5, Epha6, Epha7ab, Ntrk2ab, Ntrk3a, Pdgfraab, Ptprk, Ptprm, Ptpru | 9 | 4 |
| Tmcat | Axl, Ddr2, Epha6, Igf1r, Kit, Ntrk1, Ptprb, Ptprea, Ptproa, Ptprra, Ptpru, Ror2, Tgfbr1 | 10 | 3 |
| Catalytic | Acvr1c, Csf1r, Epha10, Fgfr1, Fgfr2, Kita, Mertk, Ptprea, Ptprgb, Ptprm, Ptproa, Ptprs | 9 | 3 |
aPreviously reported variants [37,38,1,82-92]. bDetected by massively parallel signature sequencing (MPSS) or Genomics Institute of the Novartis Research Foundation (GNF).
Figure 3Alternative splice forms of the Csf1 receptor (c-fms). (a) Genomic alignment (mm5; chr18:61616977..61647364) of full-length and variant receptors displaying exon structure and peptide features. Also shown are subcellular localizations of variant receptors transiently expressed in HeLa cells: (b) full-length Csf1r (GenBank:AK076215); (c) Tethered1 (GenBank:AK155565); (d) Tethered3 (GenBank:AK171543); and (e) Secreted (GenBank:AK171241). Tethered forms are produced by exon skipping (Tethered1; c), termination within an intron (Tethered2), and a mutually exclusive alternative 3' exon (Tethered3; d). Tethered forms 1 and 3 exhibit similar localizations to that of the full-length receptor (panel b; cell surface and perinuclear puncta). The form lacking the transmembrane (TM) domain is absent from the cell surface and displays a secretory pathway-like localization.
Figure 4Expression of variant Csf1r transcripts relative to the full-length isoform. BMM, bone marrow derived macrophages; dCT, differences in cycle numbers between variant and full-length isoforms; LPS, lipopolysaccharide.