| Literature DB >> 19126223 |
Yannick Wurm1, Paolo Uva, Frédéric Ricci, John Wang, Stephanie Jemielity, Christian Iseli, Laurent Falquet, Laurent Keller.
Abstract
BACKGROUND: Fourmidable is an infrastructure to curate and share the emerging genetic, molecular, and functional genomic data and protocols for ants. DESCRIPTION: The Fourmidable assembly pipeline groups nucleotide sequences into clusters before independently assembling each cluster. Subsequently, assembled sequences are annotated via Interproscan and BLAST against general and insect-specific databases. Gene-specific information can be retrieved using gene identifiers, searching for similar sequences or browsing through inferred Gene Ontology annotations. The database will readily scale as ultra-high throughput sequence data and sequences from additional species become available.Entities:
Mesh:
Year: 2009 PMID: 19126223 PMCID: PMC2639375 DOI: 10.1186/1471-2164-10-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Data Content in Fourmidable (October 2008)
| 28,006 input sequences including: |
| • Tracefiles from 21,715 ESTs (some were multiply sequenced) |
| • 1,496 additional ESTs and mRNA sequences from GenBank |
| 12,859 putative transcripts: |
| • 4,958 contigs |
| • 7,263 singlets |
| Sequence annotation: |
| • 14,222 annotating GO terms on 2,818 putative transcripts |
| • 599 Interproscan annotations |
| • Blast comparisons against the non-redundant protein database, as well as proteomes and genomes of |
| Microarray data: |
| • Two public experiments |
| • 66 hybridizations |
| 709 input sequences which are: |
| • Tracefiles from 615 EST clones |
| 403 putative transcripts: |
| • 147 contigs |
| • 256 singlets |
| 21 Interproscan annotations |
| 8 molecular biology protocols |
Figure 1Table of sequence search results. For each result, the following are shown from left to right if applicable: sequence identifiers for raw and assembled sequence (these respectively link to the raw datafiles and assembly information); links to raw and assembled sequence in Fasta format; links to the results of BLAST against different databases (red buttons if E-value ≤ 10-5; blue buttons if 0.01 > E-value > 10-5; bx, tbx and bn respectively indicate BLASTX, TBLASTX and BLASTN algorithms); a description as inferred from BLASTX against the non-redundant protein database; and a link to Interproscan annotation.