| Literature DB >> 23190475 |
Lin Dai1, Xin Gao, Yan Guo, Jingfa Xiao, Zhang Zhang.
Abstract
UNLABELLED: As advances in life sciences and information technology bring profound influences on bioinformatics due to its interdisciplinary nature, bioinformatics is experiencing a new leap-forward from in-house computing infrastructure into utility-supplied cloud computing delivered over the Internet, in order to handle the vast quantities of biological data generated by high-throughput experimental technologies. Albeit relatively new, cloud computing promises to address big data storage and analysis issues in the bioinformatics field. Here we review extant cloud-based services in bioinformatics, classify them into Data as a Service (DaaS), Software as a Service (SaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS), and present our perspectives on the adoption of cloud computing in bioinformatics. REVIEWERS: This article was reviewed by Frank Eisenhaber, Igor Zhulin, and Sandor Pongor.Entities:
Mesh:
Year: 2012 PMID: 23190475 PMCID: PMC3533974 DOI: 10.1186/1745-6150-7-43
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1Illustration of bioinformatics cloud. Cloud-based services in bioinformatics are grouped into Data as a Service (DaaS), Software as a Service (SaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS).
Cloud resources in bioinformatics
| AWS Public Datasets | Cloud-based archives of GenBank, Ensembl, 1000 Genomes, Model Organism Encyclopedia of DNA Elements, Unigene, Influenza Virus, etc.; |
| BGI Cloud (unpublished) | Cloud-based implementations of various genomic analysis applications; |
| CloudAligner [ | Fast and full-featured MapReduce-based tool for sequence mapping; |
| CloudBLAST [ | A cloud-based implementation of NCBI BLAST; |
| CloudBurst [ | Highly sensitive short read mapping with MapReduce; |
| Contrail (unpublished) | Cloud-based |
| Crossbow [ | Read Mapping and SNP calling using cloud computing; |
| EasyGenomics (unpublished) | Cloud-based NGS pipelines for whole genome resequencing, exome resequencing, RNA-Seq, small RNA and de novo assembly; |
| eCEO [ | Cloud-based identification of large-scale epistatic interactions in genome-wide association study (GWAS); |
| FX [ | RNA-Seq analysis tool; |
| Gaea (unpublished) | Cloud-based genome re-sequencing assembly; |
| Hecate (unpublished) | Cloud-based |
| Jnomics (unpublished) | Cloud-scale sequence analysis suite based on Apache Hadoop; |
| Myrna [ | Differential gene expression tool for RNA-Seq; |
| PeakRanger [ | Cloud-enabled peak caller for ChIP-seq data; |
| RSD [ | Reciprocal smallest distance algorithm for ortholog detection using Amazon's Elastic Computing Cloud; |
| VAT [ | Variant annotation tool to functionally annotate variants from multiple personal genomes at the transcript level; |
| YunBe [ | Pathway-based or gene set analysis of expression data; |
| Eoulsan [ | Cloud-based platform for high throughput sequencing analyses; |
| Galaxy Cloud [ | Cloud-scale Galaxy for large-scale data analysis; |
| Cloud BioLinux [ | A publicly accessible virtual machine for high performance bioinformatics computing using cloud platforms; |
| CloVR [ | A portable virtual machine for automated sequence analysis using cloud computing; |