| Literature DB >> 20478825 |
Po-Jung Huang1, Yi-Chung Liu, Chi-Ching Lee, Wei-Chen Lin, Richie Ruei-Chi Gan, Ping-Chiang Lyu, Petrus Tang.
Abstract
DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20478825 PMCID: PMC2896168 DOI: 10.1093/nar/gkq392
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.DSAP workflow. DSAP follows several analysis steps: (a) Cleanup to remove adaptors and poly-A/T/C/G/N nucleotides. (b) Clustering to group cleaned sequence tags into unique sequence clusters. (c) ncRNA matching to map unique sequence clusters against a transcribed sequence library of ncRNA (Rfam). (d) Known miRNA matching to detect known miRNAs in miRBase based on sequence homology. (e) Comparative miRNAomics to show differential miRNA expression profiles from different jobs, and cross-species distributions of identified miRNAs.
Figure 2.Optimized observation of isomiRs. The alignment of unique sequence clusters with the corresponding miRNA hairpin is optimized for the observation of isomiRs. Unique sequence clusters and precursor miRNA hairpin sequences were first aligned using word matching, and then we appended the leading and trailing sequences to get a MSA. The alignment can be sorted by expression levels, sequence length or sequence homology.
Benchmarking of DSAP
| Datasets | Number of sequence tags | Processing time (hh:mm:ss) |
|---|---|---|
| CE5 | 329 334 | 00:03:42 |
| CE7 | 220 166 | 00:01:58 |
| CE9 | 153 406 | 00:01:20 |
| Protist1 | 754 059 | 00:04:38 |
| Protist2 | 736 939 | 00:05:02 |
| Protist3 | 395 939 | 00:02:50 |
| Protist4 | 697 983 | 00:03:56 |
| Plant1 | 1 643 030 | 00:11:34 |
| Plant2 | 2 090 730 | 00:13:53 |
Benchmarking is done on a Linux CentOS 64bit server housing two quad-core Intel® Xeon® 5300 Series Processors and 16 GB RAM with different numbers of sequence tags.
Comparison of DSAP, miRExpress, miRAnalyzer and miRDeep
| DSAP | miRExpress | miRanalyze | miRDeep | |
|---|---|---|---|---|
| Service type | Web server | Local server (command-line) | Web server | Local server (command-line) |
| Installation | NA | Requires knowledge on biocomputing | NA | Requires knowledge on biocomputing |
| Input file format | Tab-delimited | Tab-delimited, Fastq | Tab-delimited, Fasta | Fasta |
| miRNA database | miRBase v14 (pre-installed) | miRBase v14 | miRBase v12 (pre-installed) | miRBase v14 |
| Number of organisms available | 115 | 115 | 10 | Limited to sequenced genomes |
| Additional database | Rfam v9.1 | NA | Rfam v9.1, RepBase | UCSC Genome Browser database |
| Classification of small RNA | Yes | NA | Yes | Yes |
| Classification of miRNA | Yes | Yes | Yes | Yes |
| Iso-miRNA alignment | Yes | NA | Yes | NA |
| Cross-species distribution of miRNA | Yes (Graphic) | Yes (text) | NA | NA |
| Differential miRNA expression | Yes (Graphic) | Yes (text) | NA | NA |
| New miRNA prediction | NA | NA | Limited to 10 organisms | Limited to sequenced genomes |
| Target prediction | NA | NA | Limited to 10 organisms | Limited to sequenced genomes |
| Result output format | Table, graphic | Text file | Table | Text file |
| Result link to miRBase | Yes | NA | NA | NA |
ahttp://dsap.cgu.edu.tw
bhttp://mirexpress.mbc.nctu.edu.tw/
chttp://web.bioinformatics.cicbiogune.es/microRNA/
dhttp://www.mdc-berlin.eu/en/research/research_teams/systems_biology_of_gene_regulatory_elements/projects/miRDeep/index.html
NA, not available.