Literature DB >> 11058137

Analysis of canonical and non-canonical splice sites in mammalian genomes.

M Burset1, I A Seledtsov, V V Solovyev.   

Abstract

A set of 43 337 splice junction pairs was extracted from mammalian GenBank annotated genes. Expressed sequence tag (EST) sequences support 22 489 of them. Of these, 98.71% contain canonical dinucleotides GT and AG for donor and acceptor sites, respectively; 0.56% hold non-canonical GC-AG splice site pairs; and the remaining 0.73% occurs in a lot of small groups (with a maximum size of 0.05%). Studying these groups we observe that many of them contain splicing dinucleotides shifted from the annotated splice junction by one position. After close examination of such cases we present a new classification consisting of only eight observed types of splice site pairs (out of 256 a priori possible combinations). EST alignments allow us to verify the exonic part of the splice sites, but many non-canonical cases may be due to intron sequencing errors. This idea is given substantial support when we compare the sequences of human genes having non-canonical splice sites deposited in GenBank by high throughput genome sequencing projects (HTG). A high proportion (156 out of 171) of the human non-canonical and EST-supported splice site sequences had a clear match in the human HTG. They can be classified after corrections as: 79 GC-AG pairs (of which one was an error that corrected to GC-AG), 61 errors that were corrected to GT-AG canonical pairs, six AT-AC pairs (of which two were errors that corrected to AT-AC), one case was produced from non-existent intron, seven cases were found in HTG that were deposited to GenBank and finally there were only two cases left of supported non-canonical splice sites. If we assume that approximately the same situation is true for the whole set of annotated mammalian non-canonical splice sites, then the 99.24% of splice site pairs should be GT-AG, 0.69% GC-AG, 0.05% AT-AC and finally only 0.02% could consist of other types of non-canonical splice sites. We analyze several characteristics of EST-verified splice sites and build weight matrices for the major groups, which can be incorporated into gene prediction programs. We also present a set of EST-verified canonical splice sites larger by two orders of magnitude than the current one (22 199 entries versus approximately 600) and finally, a set of 290 EST-supported non-canonical splice sites. Both sets should be significant for future investigations of the splicing mechanism.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 11058137      PMCID: PMC113136          DOI: 10.1093/nar/28.21.4364

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  40 in total

Review 1.  RNA-RNA interactions in the spliceosome: unraveling the ties that bind.

Authors:  T W Nilsen
Journal:  Cell       Date:  1994-07-15       Impact factor: 41.582

2.  Ab initio gene finding in Drosophila genomic DNA.

Authors:  A A Salamov; V V Solovyev
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

3.  The unusual 5' splicing border GC is used in myrosinase genes of the Brassicaceae.

Authors:  J Xue; L Rask
Journal:  Plant Mol Biol       Date:  1995-10       Impact factor: 4.076

4.  dbEST--database for "expressed sequence tags".

Authors:  M S Boguski; T M Lowe; C M Tolstoshev
Journal:  Nat Genet       Date:  1993-08       Impact factor: 38.330

5.  Requirement of U12 snRNA for in vivo splicing of a minor class of eukaryotic nuclear pre-mRNA introns.

Authors:  S L Hall; R A Padgett
Journal:  Science       Date:  1996-03-22       Impact factor: 47.728

6.  Unusual splice sites revealed by mutagenic inactivation of an authentic splice site of the rabbit beta-globin gene.

Authors:  B Wieringa; F Meyer; J Reiser; C Weissmann
Journal:  Nature       Date:  1983-01-06       Impact factor: 49.962

Review 7.  Organization and expression of eucaryotic split genes coding for proteins.

Authors:  R Breathnach; P Chambon
Journal:  Annu Rev Biochem       Date:  1981       Impact factor: 23.643

8.  A catalogue of splice junction sequences.

Authors:  S M Mount
Journal:  Nucleic Acids Res       Date:  1982-01-22       Impact factor: 16.971

9.  Ovalbumin gene: evidence for a leader sequence in mRNA and DNA sequences at the exon-intron boundaries.

Authors:  R Breathnach; C Benoist; K O'Hare; F Gannon; P Chambon
Journal:  Proc Natl Acad Sci U S A       Date:  1978-10       Impact factor: 11.205

10.  Human pre-mRNA splicing signals.

Authors:  F E Penotti
Journal:  J Theor Biol       Date:  1991-06-07       Impact factor: 2.691

View more
  193 in total

1.  Efficient use of a 'dead-end' GA 5' splice site in the human fibroblast growth factor receptor genes.

Authors:  Simon Brackenridge; Andrew O M Wilkie; Gavin R Screaton
Journal:  EMBO J       Date:  2003-04-01       Impact factor: 11.598

2.  Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping.

Authors:  Wei Zhu; Shannon D Schlueter; Volker Brendel
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

Review 3.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

4.  eShadow: a tool for comparing closely related sequences.

Authors:  Ivan Ovcharenko; Dario Boffelli; Gabriela G Loots
Journal:  Genome Res       Date:  2004-06       Impact factor: 9.043

5.  Natural selection affects frequencies of AG and GT dinucleotides at the 5' and 3' ends of exons.

Authors:  S T Eskesen; F N Eskesen; A Ruvinsky
Journal:  Genetics       Date:  2004-05       Impact factor: 4.562

6.  Over 20% of human transcripts might form sense-antisense pairs.

Authors:  Jianjun Chen; Miao Sun; W James Kent; Xiaoqiu Huang; Hanqing Xie; Wenquan Wang; Guolin Zhou; Run Zhang Shi; Janet D Rowley
Journal:  Nucleic Acids Res       Date:  2004-09-08       Impact factor: 16.971

Review 7.  Using bioinformatics to predict the functional impact of SNVs.

Authors:  Melissa S Cline; Rachel Karchin
Journal:  Bioinformatics       Date:  2010-12-15       Impact factor: 6.937

8.  Three new alternative splicing variants of human cytochrome P450 2D6 mRNA in human extratumoral liver tissue.

Authors:  Jian Zhuge; Ying-Nian Yu
Journal:  World J Gastroenterol       Date:  2004-11-15       Impact factor: 5.742

9.  Tyrosinaemia type I--de novo mutation in liver tissue suppressing an inborn splicing defect.

Authors:  Y T Bliksrud; E Brodtkorb; P A Andresen; I E T van den Berg; E A Kvittingen
Journal:  J Mol Med (Berl)       Date:  2005-03-10       Impact factor: 4.599

10.  The effect of temperature on Natural Antisense Transcript (NAT) expression in Aspergillus flavus.

Authors:  Carrie A Smith; Dominique Robertson; Bethan Yates; Dahlia M Nielsen; Doug Brown; Ralph A Dean; Gary A Payne
Journal:  Curr Genet       Date:  2008-09-24       Impact factor: 3.886

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.