| Literature DB >> 20015948 |
Abstract
SUMMARY: Recent parallel pyrosequencing methods and the increasing number of finished genomes encourage the sequencing and investigation of closely related strains. Although the sequencing itself becomes easier and cheaper with each machine generation, the finishing of the genomes remains difficult. Instead of the desired whole genomic sequence, a set of contigs is the result of the assembly. In this applications note, we present the tool r2cat (related reference contig arrangement tool) that helps in the task of comparative assembly and also provides an interactive visualization for synteny inspection.Entities:
Mesh:
Year: 2009 PMID: 20015948 PMCID: PMC2820676 DOI: 10.1093/bioinformatics/btp690
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Synteny plots produced by r2cat. The contigs of C.urealyticum (NCBI number: NC_010545) are mapped onto the reference sequence of C.jeikeium (NC_007164).
Times for matching a set of contigs on a reference genome
| Time (s) | Unmatched | Time (s) | Unmatched | |
|---|---|---|---|---|
| blast | 20.0 | 0 | 162.1 | 0 |
| blat | 46.9 | 94 | 700.8 | 84 |
| nucmer | 9.8 | 109 | 45.6 | 92 |
| 6.2 | 102 | 45.4 | 75 | |
Additionally, the number of contigs is given that could not be matched. The employed programs are BLAST (Altschul et al., 1990, blastall v. 2.2.19), BLAT (Kent, 2002, blat v. 15), MUMmer (Kurtz et al., 2004, nucmer v. 3.06), and our matching routine implemented within r2cat. The experiments were performed on a sparcv9 processor operating at 1593 MHz.