| Literature DB >> 19346323 |
Ying Huang1, Paul Gilna, Weizhong Li.
Abstract
MOTIVATION: Identification of genes coding for ribosomal RNA (rRNA) is considered an important goal in the analysis of data from metagenomics projects. Here, we report the development of a software program designed for the identification of rRNA genes from metagenomic fragments based on hidden Markov models (HMMs). This program provides rRNA gene predictions with high sensitivity and specificity on artificially fragmented genomic DNAs. AVAILABILITY: Supplementary files, scripts and sample data are available at http://tools.camera.calit2.net/camera/meta_rna.Entities:
Mesh:
Year: 2009 PMID: 19346323 PMCID: PMC2677747 DOI: 10.1093/bioinformatics/btp161
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Prediction sensitivities for different fragment lengths
| Prediction method | hmm_fs | BLASTN | ||||
|---|---|---|---|---|---|---|
| Length of reads | 5S | 16S | 23S | 5S | 16S | 23S |
| 100 | 91.9 | 98.2 | 96.2 | 79.4 | 89.9 | 94.8 |
| 200 | 95.8 | 97.9 | 98.6 | 85.7 | 96.7 | 97.8 |
| 300 | 96.8 | 99.3 | 99.0 | 88.3 | 99.0 | 98.2 |
| 400 | 97.6 | 98.3 | 99.2 | 89.1 | 97.5 | 98.5 |
| 500 | 98.2 | 99.2 | 99.1 | 89.2 | 99.2 | 98.4 |
| 600 | 98.0 | 98.8 | 99.1 | 89.5 | 98.4 | 98.5 |
| 700 | 98.7 | 99.5 | 99.3 | 90.3 | 99.5 | 98.7 |
| 800 | 98.2 | 99.2 | 99.6 | 90.8 | 99.2 | 99.1 |
Here, hmm_fs represents our algorithm. Sensitivities are represented in percentage (%).
Prediction specificities for different fragment lengths
| Prediction method | hmm_fs | BLASTN | ||||
|---|---|---|---|---|---|---|
| Length of reads | 5S | 16S | 23S | 5S | 16S | 23S |
| 100 | 88.6 | 92.7 | 94.5 | 92.8 | 91.5 | 94.8 |
| 200 | 90.4 | 91.2 | 94.0 | 93.0 | 88.1 | 94.6 |
| 300 | 91.7 | 93.5 | 94.4 | 94.9 | 86.9 | 94.8 |
| 400 | 92.3 | 95.4 | 94.3 | 94.2 | 88.6 | 94.9 |
| 500 | 93.7 | 91.9 | 93.3 | 95.0 | 84.4 | 94.1 |
| 600 | 92.0 | 91.4 | 94.2 | 94.1 | 86.5 | 94.6 |
| 700 | 93.9 | 91.0 | 94.9 | 95.6 | 85.5 | 95.6 |
| 800 | 92.6 | 89.6 | 94.5 | 94.1 | 82.3 | 94.9 |
Here, hmm_fs represents our algorithm. Specificities are represented in percentage (%).