| Literature DB >> 21693004 |
Marco Galardini1, Emanuele G Biondi, Marco Bazzicalupo, Alessio Mengoni.
Abstract
Recent developments in sequencing technologies have given the opportunity to sequence many bacterial genomes with limited cost and labor, compared to previous techniques. However, a limiting step of genome sequencing is the finishing process, needed to infer the relative position of each contig and close sequencing gaps. An additional degree of complexity is given by bacterial species harboring more than one replicon, which are not contemplated by the currently available programs. The availability of a large number of bacterial genomes allows geneticists to use complete genomes (possibly from the same species) as templates for contigs mapping.Here we present CONTIGuator, a software tool for contigs mapping over a reference genome which allows the visualization of a map of contigs, underlining loss and/or gain of genetic elements and permitting to finish multipartite genomes. The functionality of CONTIGuator was tested using four genomes, demonstrating its improved performances compared to currently available programs.Our approach appears efficient, with a clear visualization, allowing the user to perform comparative structural genomics analysis on draft genomes. CONTIGuator is a Python script for Linux environments and can be used on normal desktop machines and can be downloaded from http://contiguator.sourceforge.net.Entities:
Year: 2011 PMID: 21693004 PMCID: PMC3133546 DOI: 10.1186/1751-0473-6-11
Source DB: PubMed Journal: Source Code Biol Med ISSN: 1751-0473
Figure 1CONTIGuator. a) Program flowchart: the input contigs are mapped to the reference using the combination of Blast and MUMmer, generating a map (viewable with ACT), Primer3 provides a series of PCR primers that can be used to generate a new set of contigs, from which the process can be iterated again; b-c-d-e) ACT visualization: the reference genome is on top, pseudocontig on bottom; b) a putative primer placement over two near contigs; c) a section of no synteny in an otherwise fully syntenic contig; d) a region of the reference genome with no homolog in the mapped contig; e) the same region in d) highlighted, since it harbors a tblastn hit against the excluded contigs; f-g) verification of structural clues in ACT: the closed genome of Sinorhizobium meliloti BL225C is on top, the reference genome in the middle and the contigs on bottom.
Comparison between ABACAS and CONTIGuator performances over the four test genomes.
| 3291978 | 4164667 | 1912883 | 6350458 | ||
| 3256036 | 3876600 | 1803685 | 5754197 | ||
| 3267649 | 3090612 | 1761837 | 6897232 | ||
| 3286027 | 4236992 | 2020582 | 6906017 | ||
| 3271308 | 3160370 | 1661752 | 262344 | ||
| 61 | 304 | 89 | 73 | ||
| 49 | 255 | 78 | 62 | ||
| 71 | 278 | 367 | 20 | ||
| 58 | 171 | 63 | 39 | ||
| 42 | 158 | 56 | 35 | ||
| 64 | 90 | 323 | 4 | ||
The base pairs derived from those contigs appearing in more than one replicon were divided by two.