Literature DB >> 11125105

SpliceDB: database of canonical and non-canonical mammalian splice sites.

M Burset1, I A Seledtsov, V V Solovyev.   

Abstract

A database (SpliceDB) of known mammalian splice site sequences has been developed. We extracted 43 337 splice pairs from mammalian divisions of the gene-centered Infogene database, including sites from incomplete or alternatively spliced genes. Known EST sequences supported 22 815 of them. After discarding sequences with putative errors and ambiguous location of splice junctions the verified dataset includes 22 489 entries. Of these, 98.71% contain canonical GT-AG junctions (22 199 entries) and 0.56% have non-canonical GC-AG splice site pairs. The remainder (0.73%) occurs in a lot of small groups (with a maximum size of 0.05%). We especially studied non-canonical splice sites, which comprise 3.73% of GenBank annotated splice pairs. EST alignments allowed us to verify only the exonic part of splice sites. To check the conservative dinucleotides we compared sequences of human non-canonical splice sites with sequences from the high throughput genome sequencing project (HTG). Out of 171 human non-canonical and EST-supported splice pairs, 156 (91.23%) had a clear match in the human HTG. They can be classified after sequence analysis as: 79 GC-AG pairs (of which one was an error that corrected to GC-AG), 61 errors corrected to GT-AG canonical pairs, six AT-AC pairs (of which two were errors corrected to AT-AC), one case was produced from a non-existent intron, seven cases were found in HTG that were deposited to GenBank and finally there were only two other cases left of supported non-canonical splice pairs. The information about verified splice site sequences for canonical and non-canonical sites is presented in SpliceDB with the supporting evidence. We also built weight matrices for the major splice groups, which can be incorporated into gene prediction programs. SpliceDB is available at the computational genomic Web server of the Sanger Centre: http://genomic.sanger.ac. uk/spldb/SpliceDB.html and at http://www.softberry. com/spldb/SpliceDB.html.

Entities:  

Mesh:

Year:  2001        PMID: 11125105      PMCID: PMC29840          DOI: 10.1093/nar/29.1.255

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  7 in total

Review 1.  A reappraisal of non-consensus mRNA splice sites.

Authors:  I J Jackson
Journal:  Nucleic Acids Res       Date:  1991-07-25       Impact factor: 16.971

2.  GenBank.

Authors:  D A Benson; M S Boguski; D J Lipman; J Ostell; B F Ouellette; B A Rapp; D L Wheeler
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

3.  INFOGENE: a database of known gene structures and predicted genes and proteins in sequences of genome sequencing projects.

Authors:  V V Solovyev; A A Salamov
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

4.  Ab initio gene finding in Drosophila genomic DNA.

Authors:  A A Salamov; V V Solovyev
Journal:  Genome Res       Date:  2000-04       Impact factor: 9.043

5.  Analysis of canonical and non-canonical splice sites in mammalian genomes.

Authors:  M Burset; I A Seledtsov; V V Solovyev
Journal:  Nucleic Acids Res       Date:  2000-11-01       Impact factor: 16.971

6.  A clean data set of EST-confirmed splice sites from Homo sapiens and standards for clean-up procedures.

Authors:  T A Thanaraj
Journal:  Nucleic Acids Res       Date:  1999-07-01       Impact factor: 16.971

7.  Human pre-mRNA splicing signals.

Authors:  F E Penotti
Journal:  J Theor Biol       Date:  1991-06-07       Impact factor: 2.691

  7 in total
  83 in total

1.  PALS db: Putative Alternative Splicing database.

Authors:  Y-H Huang; Y-T Chen; J-J Lai; S-T Yang; U-C Yang
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

2.  An allelic series of mutations in Smad2 and Smad4 identified in a genotype-based screen of N-ethyl-N- nitrosourea-mutagenized mouse embryonic stem cells.

Authors:  Jay L Vivian; Yijing Chen; Della Yee; Elizabeth Schneider; Terry Magnuson
Journal:  Proc Natl Acad Sci U S A       Date:  2002-11-13       Impact factor: 11.205

3.  Refined annotation of the Arabidopsis genome by complete expressed sequence tag mapping.

Authors:  Wei Zhu; Shannon D Schlueter; Volker Brendel
Journal:  Plant Physiol       Date:  2003-06       Impact factor: 8.340

4.  Splice variation in mouse full-length cDNAs identified by mapping to the mouse genome.

Authors:  Mihaela Zavolan; Erik van Nimwegen; Terry Gaasterland
Journal:  Genome Res       Date:  2002-09       Impact factor: 9.043

5.  Complexity: an internet resource for analysis of DNA sequence complexity.

Authors:  Y L Orlov; V N Potapov
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

6.  The IgE gene in primates exhibits extraordinary evolutionary diversity.

Authors:  Pheidias C Wu; Jiun-Bo Chen; Shoji Kawamura; Christian Roos; Stefan Merker; Chih-Chin Shih; Ban-Dar Hsu; Carmay Lim; Tse Wen Chang
Journal:  Immunogenetics       Date:  2011-11-10       Impact factor: 2.846

7.  Evaluation of five ab initio gene prediction programs for the discovery of maize genes.

Authors:  Hong Yao; Ling Guo; Yan Fu; Lisa A Borsuk; Tsui-Jung Wen; David S Skibbe; Xiangqin Cui; Brian E Scheffler; Jun Cao; Scott J Emrich; Daniel A Ashlock; Patrick S Schnable
Journal:  Plant Mol Biol       Date:  2005-02       Impact factor: 4.076

8.  U1-like snRNAs lacking complementarity to canonical 5' splice sites.

Authors:  Christina Kyriakopoulou; Pontus Larsson; Lei Liu; Jens Schuster; Fredrik Söderbom; Leif A Kirsebom; Anders Virtanen
Journal:  RNA       Date:  2006-07-07       Impact factor: 4.942

9.  Novel diagnostic tool for prediction of variant spliceogenicity derived from a set of 395 combined in silico/in vitro studies: an international collaborative effort.

Authors:  Raphaël Leman; Pascaline Gaildrat; Gérald Le Gac; Chandran Ka; Yann Fichou; Marie-Pierre Audrezet; Virginie Caux-Moncoutier; Sandrine M Caputo; Nadia Boutry-Kryza; Mélanie Léone; Sylvie Mazoyer; Françoise Bonnet-Dorion; Nicolas Sevenet; Marine Guillaud-Bataille; Etienne Rouleau; Brigitte Bressac-de Paillerets; Barbara Wappenschmidt; Maria Rossing; Danielle Muller; Violaine Bourdon; Françoise Revillon; Michael T Parsons; Antoine Rousselin; Grégoire Davy; Gaia Castelain; Laurent Castéra; Joanna Sokolowska; Florence Coulet; Capucine Delnatte; Claude Férec; Amanda B Spurdle; Alexandra Martins; Sophie Krieger; Claude Houdayer
Journal:  Nucleic Acids Res       Date:  2018-09-06       Impact factor: 16.971

10.  Targeted knockout and lacZ reporter expression of the mouse Tmhs deafness gene and characterization of the hscy-2J mutation.

Authors:  Chantal M Longo-Guess; Leona H Gagnon; Bernd Fritzsch; Kenneth R Johnson
Journal:  Mamm Genome       Date:  2007-09-18       Impact factor: 2.957

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.