Prediction whether a human cDNA sequence contains initiation codon by combining statistical information and similarity with protein sequences.

T Nishikawa, T Ota, T Isogai.

Abstract

MOTIVATION: In previous work, we developed ATGpr, a computer program for predicting the fullness of a cDNA, i.e. whether or not it contains an initiation codon. The prediction algorithm fully exploited statistical information on short nucleotide fragments. However, it did not use sequence similarities to known proteins, which are becoming increasingly available owing to the recent rapid growth of protein databases. In this work, we present a new prediction algorithm based on both statistical and similarity information, which provides better sensitivity and specificity.
RESULTS: We evaluated the accuracy of ATGpr in predicting the fullness of cDNA sequences from human clustered ESTs of UniGene, measuring the specificity, sensitivity, and correlation coefficient of the prediction. Specificity and sensitivity crossed at 46% at an ATGpr score threshold of 0.33, and the maximum correlation coefficient of 0.34 was obtained at this threshold. Even without ATGpr, alignments with known proteins proved effective for predicting the fullness of cDNA sequences: specificity increased monotonically with similarity (identity of the alignments) and exceeded 80% when identity was greater than 40%. To predict fullness more effectively, we combined the similarity (identity of query sequence) with known proteins and the ATGpr score. As a result, specificity exceeded 80% when identity was greater than 20%.
AVAILABILITY: The prediction program, called ATGpr_sim, is available at http://www.hri.co.jp/atgpr/ATGpr_sim.html
CONTACT: nisikawa@crl.hitachi.co.jp
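The three measures reported in RESULTS can be illustrated with a minimal sketch. The abstract does not define them, so this sketch assumes the convention common in gene-prediction work: sensitivity = TP/(TP+FN), specificity = TP/(TP+FP), and the correlation coefficient taken as the Matthews correlation coefficient. The function names and the toy scores and labels are hypothetical and are not data from the study; only the threshold value 0.33 is taken from the abstract.

```python
from math import sqrt

def confusion_counts(scores, labels, threshold):
    """Count TP, FP, TN, FN when a cDNA is called 'full' if score >= threshold."""
    tp = fp = tn = fn = 0
    for score, is_full in zip(scores, labels):
        predicted_full = score >= threshold
        if predicted_full and is_full:
            tp += 1
        elif predicted_full and not is_full:
            fp += 1
        elif not predicted_full and is_full:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn

def evaluate(tp, fp, tn, fn):
    """Return (sensitivity, specificity, correlation coefficient).

    Assumption: specificity is TP/(TP+FP), as is usual in gene prediction,
    and the correlation coefficient is the Matthews correlation coefficient.
    """
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tp / (tp + fp) if (tp + fp) else 0.0
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    cc = (tp * tn - fp * fn) / denom if denom else 0.0
    return sensitivity, specificity, cc

# Hypothetical toy data: prediction scores and whether each cDNA is truly full-length.
toy_scores = [0.9, 0.6, 0.4, 0.35, 0.2, 0.1]
toy_labels = [True, True, False, True, False, False]

counts = confusion_counts(toy_scores, toy_labels, threshold=0.33)
sensitivity, specificity, cc = evaluate(*counts)
print(counts, sensitivity, specificity, round(cc, 3))
```

Sweeping the threshold over a range of values and recording these three numbers at each step is one way to locate a crossing point of sensitivity and specificity like the one described above.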

Year:  2000        PMID: 11159307     DOI: 10.1093/bioinformatics/16.11.960

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  25 in total

Review 1.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

2.  A novel long non-coding RNA in the rheumatoid arthritis risk locus TRAF1-C5 influences C5 mRNA levels.

Authors:  T C Messemaker; M Frank-Bertoncelj; R B Marques; A Adriaans; A M Bakker; N Daha; S Gay; T W Huizinga; R E M Toes; H M M Mikkers; F Kurreeman
Journal:  Genes Immun       Date:  2015-12-17       Impact factor: 2.676

3.  155R is a novel structural protein of bovine adenovirus type 3, but it is not essential for virus replication.

Authors:  Ahmed O Hassan; Sai V Vemula; Anurag Sharma; Dinesh S Bangari; Krishna K Mishra; Suresh K Mittal
Journal:  J Gen Virol       Date:  2017-04-27       Impact factor: 3.891

4.  Two Isoforms of the RNA Binding Protein, Coding Region Determinant-binding Protein (CRD-BP/IGF2BP1), Are Expressed in Breast Epithelium and Support Clonogenic Growth of Breast Tumor Cells.

Authors:  Saja A Fakhraldeen; Rod J Clark; Avtar Roopra; Emily N Chin; Wei Huang; John Castorino; Kari B Wisinski; TaeWon Kim; Vladimir S Spiegelman; Caroline M Alexander
Journal:  J Biol Chem       Date:  2015-04-10       Impact factor: 5.157

5.  Corticosteroid treatment exacerbates nephrotic syndrome in a zebrafish model of magi2a knockout.

Authors:  Tilman Jobst-Schwan; Charlotte A Hoogstraten; Caroline M Kolvenbach; Johanna Magdalena Schmidt; Amy Kolb; Kaitlyn Eddy; Ronen Schneider; Shazia Ashraf; Eugen Widmeier; Amar J Majmundar; Friedhelm Hildebrandt
Journal:  Kidney Int       Date:  2019-03-05       Impact factor: 10.612

6.  Predicting cross-reactive immunological material (CRIM) status in Pompe disease using GAA mutations: lessons learned from 10 years of clinical laboratory testing experience.

Authors:  Deeksha S Bali; Jennifer L Goldstein; Suhrad Banugaria; Jian Dai; Joanne Mackey; Catherine Rehder; Priya S Kishnani
Journal:  Am J Med Genet C Semin Med Genet       Date:  2012-01-17       Impact factor: 3.908

7.  Expression of the prospective mesoderm genes twist, snail, and mef2 in penaeid shrimp.

Authors:  Jiankai Wei; Richard Samuel Elliot Glaves; Melony J Sellars; Jianhai Xiang; Philip L Hertzler
Journal:  Dev Genes Evol       Date:  2016-04-29       Impact factor: 0.900

8.  Hypothalamic expression of Eap1 is not directly controlled by ovarian steroids.

Authors:  Valerie Matagne; Claudio Mastronardi; Robert A Shapiro; Daniel M Dorsa; Sergio R Ojeda
Journal:  Endocrinology       Date:  2008-11-20       Impact factor: 4.736

9.  Clinical and genetic spectrum in limb-girdle muscular dystrophy type 2E.

Authors:  Claudio Semplicini; John Vissing; Julia R Dahlqvist; Tanya Stojkovic; Luca Bello; Nanna Witting; Morten Duno; France Leturcq; Cinzia Bertolin; Paola D'Ambrosio; Bruno Eymard; Corrado Angelini; Luisa Politano; Pascal Laforêt; Elena Pegoraro
Journal:  Neurology       Date:  2015-04-10       Impact factor: 9.910

10.  A long noncoding RNA critically regulates Bcr-Abl-mediated cellular transformation by acting as a competitive endogenous RNA.

Authors:  G Guo; Q Kang; X Zhu; Q Chen; X Wang; Y Chen; J Ouyang; L Zhang; H Tan; R Chen; S Huang; J-L Chen
Journal:  Oncogene       Date:  2014-05-19       Impact factor: 9.867
