| Literature DB >> 17537825 |
Swetlana Nikolajewa1, Rainer Pudimat, Michael Hiller, Matthias Platzer, Rolf Backofen.
Abstract
BioBayesNet is a new web application that allows the easy modeling and classification of biological data using Bayesian networks. To learn Bayesian networks the user can either upload a set of annotated FASTA sequences or a set of pre-computed feature vectors. In case of FASTA sequences, the server is able to generate a wide range of sequence and structural features from the sequences. These features are used to learn Bayesian networks. An automatic feature selection procedure assists in selecting discriminative features, providing an (locally) optimal set of features. The output includes several quality measures of the overall network and individual features as well as a graphical representation of the network structure, which allows to explore dependencies between features. Finally, the learned Bayesian network or another uploaded network can be used to classify new data. BioBayesNet facilitates the use of Bayesian networks in biological sequences analysis and is flexible to support modeling and classification applications in various scientific fields. The BioBayesNet server is available at http://biwww3.informatik.uni-freiburg.de:8080/BioBayesNet/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17537825 PMCID: PMC1933181 DOI: 10.1093/nar/gkm292
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The general workflow of the BioBayesNet web-server.
An example of user-given feature vectors describing potentially discriminative features of alternatively and constitutively spliced exons
| File 1: Class labels and feature names | File 2: Data samples |
|---|---|
The first line of the first file has to contain the class labels (alternative and constitutive). The next lines of this file specify feature names and their value ranges. Each line of the second file contains one data sample. The features are given in the order of the first file and the class label is given at the end of each line.
Figure 2.Graphical overview of a BN and the dependencies between feature variables.