Michael C Schatz, Ben Langmead, Steven L Salzberg.
Year: 2010 PMID: 20622843 PMCID: PMC2904649 DOI: 10.1038/nbt0710-691
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Bioinformatics Cloud Resources
| Application | Description |
|---|---|
| CloudBLAST | Scalable BLAST in the Clouds |
| CloudBurst | Highly Sensitive Short Read Mapping |
| Cloud RSD | Reciprocal Smallest Distance Ortholog Detection |
| Contrail | De novo assembly of large genomes |
| Crossbow | Alignment and SNP Genotyping |
| Myrna | Differential expression analysis of mRNA-seq |
| Quake | Quality-guided correction of short reads |
Figure 1. Map-Shuffle-Scan framework used by Crossbow
Users begin by uploading sequencing reads to cloud storage. Hadoop, running on a cluster of virtual machines in the cloud, then maps the unaligned reads to the reference genome using many parallel instances of Bowtie. Hadoop then automatically shuffles the alignments into sorted bins determined by chromosome region. Finally, many parallel instances of SOAPsnp scan the sorted alignments in each bin. The final output is a stream of SNP calls stored within the cloud that can be downloaded to the user's local computer.
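The three stages in the caption can be illustrated with a minimal single-process sketch in Python. It is not Crossbow's implementation: a brute-force mismatch search stands in for Bowtie, a majority-vote pileup stands in for SOAPsnp, and an in-memory dictionary stands in for Hadoop's distributed shuffle. Every name here (`map_read`, `shuffle`, `scan_bin`, `call_snps`, `BIN_SIZE`, the demo data) is hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict

BIN_SIZE = 20  # hypothetical width of a chromosome region, in bases

# Hypothetical demo data: three reads carry a G->T substitution at chr1:8.
DEMO_REFERENCE = {"chr1": "GATTACAGGCTTCAAGTCCGATAGCATGCA"}
DEMO_READS = ["ACAGTCTT", "CAGTCTTC", "AGTCTTCA", "ATAGCATG", "TTTTTTTT"]


def map_read(read, reference, max_mismatches=1):
    """Map step (toy stand-in for an aligner): return (chrom, pos, read) or None."""
    best = None
    for chrom, seq in reference.items():
        for pos in range(len(seq) - len(read) + 1):
            window = seq[pos:pos + len(read)]
            mismatches = sum(a != b for a, b in zip(read, window))
            if mismatches <= max_mismatches and (best is None or mismatches < best[0]):
                best = (mismatches, chrom, pos)
    return None if best is None else (best[1], best[2], read)


def shuffle(alignments):
    """Shuffle step: bin alignments by (chrom, region) and sort each bin by position."""
    bins = defaultdict(list)
    for chrom, pos, read in alignments:
        bins[(chrom, pos // BIN_SIZE)].append((pos, read))
    return {key: sorted(values) for key, values in sorted(bins.items())}


def scan_bin(chrom, alignments, reference, min_depth=2):
    """Scan step (toy stand-in for a SNP caller): majority vote per position.

    Simplification: reads are binned by start position, so a read that
    crosses a bin boundary contributes all of its bases to its own bin.
    """
    pileup = defaultdict(Counter)
    for pos, read in alignments:
        for offset, base in enumerate(read):
            pileup[pos + offset][base] += 1
    calls = []
    for pos in sorted(pileup):
        base, count = pileup[pos].most_common(1)[0]
        ref_base = reference[chrom][pos]
        if count >= min_depth and base != ref_base:
            calls.append((chrom, pos, ref_base, base))
    return calls


def call_snps(reads, reference):
    """Run map, shuffle, and scan in sequence; unaligned reads are dropped."""
    mapped = (map_read(read, reference) for read in reads)
    alignments = [a for a in mapped if a is not None]
    calls = []
    for (chrom, _region), binned in shuffle(alignments).items():
        calls.extend(scan_bin(chrom, binned, reference))
    return calls
```

Calling `call_snps(DEMO_READS, DEMO_REFERENCE)` reports the single planted substitution at 0-based position 8 of `chr1`. In this sketch each read in the map step and each bin in the scan step is processed independently of the others, which is the property that lets a framework such as Hadoop run many instances in parallel.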