Literature DB >> 14576308

Splice site prediction with quadratic discriminant analysis using diversity measure.

Lirong Zhang1, Liaofu Luo.   

Abstract

Based on the conservation of nucleotides at splicing sites and the features of base composition and base correlation around these sites we use the method of increment of diversity combined with quadratic discriminant analysis (IDQD) to study the dependence structure of splicing sites and predict the exons/introns and their boundaries for four model genomes: Caenorhabditis elegans, Arabidopsis thaliana, Drosophila melanogaster and human. The comparison of compositional features between two sequences and the comparison of base dependencies at adjacent or non-adjacent positions of two sequences can be integrated automatically in the increment of diversity (ID). Eight feature variables around a potential splice site are defined in terms of ID. They are integrated in a single formal framework given by IDQD. In our calculations 7 (8) base region around the donor (acceptor) sites have been considered in studying the conservation of nucleotides and sequences of 48 bp on either side of splice sites have been used in studying the compositional and base-correlating features. The windows are enlarged to 16 (donor), 29 (acceptor) and 80 bp (either side) to improve the prediction for human splice sites. The prediction capability of the present method is comparable with the leading splice site detector--GeneSplicer.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 14576308      PMCID: PMC275452          DOI: 10.1093/nar/gkg805

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  26 in total

1.  EID: the Exon-Intron Database-an exhaustive database of protein-coding intron-containing genes.

Authors:  S Saxonov; I Daizadeh; A Fedorov; W Gilbert
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The prediction of the structural class of protein: application of the measure of diversity.

Authors:  Q Z Li; Z Q Lu
Journal:  J Theor Biol       Date:  2001-12-07       Impact factor: 2.691

3.  Modeling splicing sites with pairwise correlations.

Authors:  Masanori Arita; Koji Tsuda; Kiyoshi Asai
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

Review 4.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

5.  Improved splice site detection in Genie.

Authors:  M G Reese; F H Eeckman; D Kulp; D Haussler
Journal:  J Comput Biol       Date:  1997       Impact factor: 1.479

6.  The current status and portability of our sequence handling software.

Authors:  R Staden
Journal:  Nucleic Acids Res       Date:  1986-01-10       Impact factor: 16.971

7.  Analysis of donor splice sites in different eukaryotic organisms.

Authors:  I B Rogozin; L Milanesi
Journal:  J Mol Evol       Date:  1997-07       Impact factor: 2.395

8.  A weight array method for splicing signal analysis.

Authors:  M Q Zhang; T G Marr
Journal:  Comput Appl Biosci       Date:  1993-10

9.  Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence information.

Authors:  S M Hebsgaard; P G Korning; N Tolstrup; J Engelbrecht; P Rouzé; S Brunak
Journal:  Nucleic Acids Res       Date:  1996-09-01       Impact factor: 16.971

10.  Predicting internal exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames.

Authors:  V V Solovyev; A A Salamov; C B Lawrence
Journal:  Nucleic Acids Res       Date:  1994-12-11       Impact factor: 16.971

View more
  11 in total

1.  Prediction of the beta-hairpins in proteins using support vector machine.

Authors:  Xiu Zhen Hu; Qian Zhong Li
Journal:  Protein J       Date:  2008-02       Impact factor: 2.371

2.  Eukaryotic and prokaryotic promoter prediction using hybrid approach.

Authors:  Hao Lin; Qian-Zhong Li
Journal:  Theory Biosci       Date:  2010-11-03       Impact factor: 1.919

3.  Calculation of nucleosomal DNA deformation energy: its implication for nucleosome positioning.

Authors:  Jian-Ying Wang; Jingyan Wang; Guoqing Liu
Journal:  Chromosome Res       Date:  2012-12-05       Impact factor: 5.239

4.  Prediction of nucleosome DNA formation potential and nucleosome positioning using increment of diversity combined with quadratic discriminant analysis.

Authors:  Xiujuan Zhao; Zhiyong Pei; Jia Liu; Sheng Qin; Lu Cai
Journal:  Chromosome Res       Date:  2010-10-16       Impact factor: 5.239

5.  iNuc-PhysChem: a sequence-based predictor for identifying nucleosomes via physicochemical properties.

Authors:  Wei Chen; Hao Lin; Peng-Mian Feng; Chen Ding; Yong-Chun Zuo; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-10-29       Impact factor: 3.240

6.  Prediction for human transcription start site using diversity measure with quadratic discriminant.

Authors:  Jun Lu; Liaofu Luo
Journal:  Bioinformation       Date:  2008-04-28

7.  Prediction of HIV-1 and HIV-2 proteins by using Chou's pseudo amino acid compositions and different classifiers.

Authors:  Juan Mei; Ji Zhao
Journal:  Sci Rep       Date:  2018-02-05       Impact factor: 4.379

8.  The organization of nucleosomes around splice sites.

Authors:  Wei Chen; Liaofu Luo; Lirong Zhang
Journal:  Nucleic Acids Res       Date:  2010-01-21       Impact factor: 16.971

9.  Fast splice site detection using information content and feature reduction.

Authors:  A K M A Baten; S K Halgamuge; B C H Chang
Journal:  BMC Bioinformatics       Date:  2008-12-12       Impact factor: 3.169

Review 10.  Bioinformatics in China: a personal perspective.

Authors:  Liping Wei; Jun Yu
Journal:  PLoS Comput Biol       Date:  2008-04-25       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.