Literature DB >> 8877510

A probabilistic classification system for predicting the cellular localization sites of proteins.

P Horton1, K Nakai.   

Abstract

We have defined a simple model of classification which combines human provided expert knowledge with probabilistic reasoning. We have developed software to implement this model and have applied it to the problem of classifying proteins into their various cellular localization sites based on their amino acid sequences. Since our system requires no hand tuning to learn training data, we can now evaluate the prediction accuracy of protein localization sites by a more objective cross-validation method than earlier studies using production rule type expert systems. 336 E. coli proteins were classified into 8 classes with an accuracy of 81% while 1484 yeast proteins were classified into 10 classes with an accuracy of 55%. Additionally we report empirical results using three different strategies for handling continuously valued variables in our probabilistic reasoning system.

Entities:  

Mesh:

Substances:

Year:  1996        PMID: 8877510

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  28 in total

1.  Predicting subcellular localization via protein motif co-occurrence.

Authors:  Michelle S Scott; David Y Thomas; Michael T Hallett
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

2.  Similarity-dissimilarity plot for visualization of high dimensional data in biomedical pattern classification.

Authors:  Muhammad Arif
Journal:  J Med Syst       Date:  2010-08-24       Impact factor: 4.460

3.  Identification of novel genes expressed during mouse tooth development by microarray gene expression analysis.

Authors:  Trevor J Pemberton; Fang-Yuan Li; Shoji Oka; Gustavo A Mendoza-Fandino; Ya-Hsuan Hsu; Pablo Bringas; Yang Chai; Malcolm L Snead; Ruty Mehrian-Shai; Pragna I Patel
Journal:  Dev Dyn       Date:  2007-08       Impact factor: 3.780

4.  Organization and structural evolution of four multigene families in Arabidopsis thaliana: AtLCAD, AtLGT, AtMYST and AtHD-GL2.

Authors:  R Tavares; S Aubourg; A Lecharny; M Kreis
Journal:  Plant Mol Biol       Date:  2000-03       Impact factor: 4.076

5.  Cell wall and membrane-associated exo-beta-D-glucanases from developing maize seedlings.

Authors:  J B Kim; A T Olek; N C Carpita
Journal:  Plant Physiol       Date:  2000-06       Impact factor: 8.340

6.  Mosquito has a single multisubstrate deoxyribonucleoside kinase characterized by unique substrate specificity.

Authors:  Wolfgang Knecht; Gitte Ebert Petersen; Michael Paolo Bastner Sandrini; Leif Søndergaard; Birgitte Munch-Petersen; Jure Piskur
Journal:  Nucleic Acids Res       Date:  2003-03-15       Impact factor: 16.971

7.  Discriminative motif finding for predicting protein subcellular localization.

Authors:  Tien-ho Lin; Robert F Murphy; Ziv Bar-Joseph
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2011 Mar-Apr       Impact factor: 3.710

8.  Genome sequence of Symbiobacterium thermophilum, an uncultivable bacterium that depends on microbial commensalism.

Authors:  Kenji Ueda; Atsushi Yamashita; Jun Ishikawa; Masafumi Shimada; Tomo-o Watsuji; Kohji Morimura; Haruo Ikeda; Masahira Hattori; Teruhiko Beppu
Journal:  Nucleic Acids Res       Date:  2004-09-21       Impact factor: 16.971

9.  Plasmodium falciparum PfA-M1 aminopeptidase is trafficked via the parasitophorous vacuole and marginally delivered to the food vacuole.

Authors:  Omid Azimzadeh; Cissé Sow; Marc Gèze; Julius Nyalwidhe; Isabelle Florent
Journal:  Malar J       Date:  2010-06-30       Impact factor: 2.979

10.  Arabidopsis thaliana GLN2-encoded glutamine synthetase is dual targeted to leaf mitochondria and chloroplasts.

Authors:  Masakazu Taira; Ulrika Valtersson; Brad Burkhardt; Robert A Ludwig
Journal:  Plant Cell       Date:  2004-07-23       Impact factor: 11.277

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.