| Literature DB >> 17597889 |
Luv Kashyap1, Mohammad Tabish.
Abstract
Computational gene prediction and identifying alternatively spliced isoforms have always been a challenging task. In this paper, we describe the performance of three gene/exon finding programmes namely Fex, Gen view2 and Gene builder capable of predicting open reading frames or exons for a given set of sequences from C. elegans genome. The predicted exons were compared with the 'sequencing consortium' identified exons and degree of consensus among them is discussed. We found that exon prediction by Fex was similar to the consortium prediction as compared to Gen view2 and Gene builder results. Interestingly, some exons (six exons in five genes) predicted positive only by Fex and not by the 'sequencing consortium' are found at the C. elegans EST database. This data is critical for further debate and discussion on gene finding in C. elegans.Entities:
Year: 2006 PMID: 17597889 PMCID: PMC1891687
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1A plot between number of exons and number of genes. A chart showing number of genes having varying number of exons ranging from one to six. Genes containing one (6), two (42), three (40), four (24), five (6) and six (2) exons are shown.
Comparison of various parameters for three exon predicting programmes: A comparative view of essential parameters including positives, negatives and degree of consensus for three gene/exon predicting programmes to ensure the accuracy in their predictions
| Parameters | FEX | GEN VIEW2 | GENE BUILDER |
|---|---|---|---|
| Actual exons | 348 | 348 | 348 |
| Predicted exons | 384 | 72 | 149 |
| Positives | 283 | 55 | 77 |
| Negatives | 63 | 230 | 162 |
| New exons | 101 | 17 | 72 |
| Degree of consensus | 0.73 | 0.76 | 0.51 |
Figure 2Homology of the Fex predicted exons with cDNA sequences of C. elegans EST database. Sequences were compared by TBLASTN computer-based sequence analysis. [16] Hypothetical gene name designated by C. elegans sequencing consortium is given in bold letters and amino acid sequence predicted by Fex is given next to the hypothetical gene name in bracket. Newly predicted exons were compared with cDNA sequences separately. Numbers on the left and right side of amino acid sequences indicate the position of these amino acid residues in the exons predicted by Fex (query) and cDNA hits (yk series) during TBLASTN search. Names of each cDNA clone are mentioned on the left side and their EMBL accession numbers are given on the right side of the aligned sequences.