Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads.

Literature DB >> 23782615

A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads.

Abstract

MOTIVATION: Protein domain classification is an important step in functional annotation for next-generation sequencing data. For RNA-Seq data of non-model organisms that lack quality or complete reference genomes, existing protein domain analysis pipelines are applied to short reads directly or to contigs that are generated using de novo sequence assembly tools. However, these strategies do not provide satisfactory performance in classifying short reads into their native domain families.
RESULTS: We introduce SALT, a protein domain classification tool based on profile hidden Markov models and graph algorithms. SALT carefully incorporates the characteristics of reads that are sequenced from the domain regions and assembles them into contigs based on a supervised graph construction algorithm. We applied SALT to two RNA-Seq datasets of different read lengths and quantified its performance using the available protein domain annotations and the reference genomes. Compared with existing strategies, SALT showed better sensitivity and accuracy. In the third experiment, we applied SALT to a non-model organism. The experimental results demonstrated that it identified more transcribed protein domain families than other tested classifiers. AVAILABILITY: The source code and supplementary data are available at https://sourceforge.net/projects/salt1/

Entities: Chemical

Mesh：

Substances：

Year: 2013 PMID： 23782615 DOI： 10.1093/bioinformatics/btt357

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

8 in total

A Sensitive and Accurate protein domain cLassification Tool (SALT) for short reads.

Review 1. Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

2. UProC: tools for ultra-fast protein domain classification.

3. Xander: employing a novel method for efficient gene-targeted metagenomic assembly.

4. A scalable and accurate targeted gene assembly tool (SAT-Assembler) for next-generation sequencing data.

5. Metagenome and Metatranscriptome Analyses Using Protein Family Profiles.

6. In silico approach to designing rational metagenomic libraries for functional studies.

7. A multi-source domain annotation pipeline for quantitative metagenomic and metatranscriptomic functional profiling.

8. A sensitive short read homology search tool for paired-end read sequencing data.