| Literature DB >> 20576622 |
Christian Baudet1, Claire Lemaitre, Zanoni Dias, Christian Gautier, Eric Tannier, Marie-France Sagot.
Abstract
SUMMARY: Genomes undergo large structural changes that alter their organization. The chromosomal regions affected by these rearrangements are called breakpoints, while those which have not been rearranged are called synteny blocks. Lemaitre et al. presented a new method to precisely delimit rearrangement breakpoints in a genome by comparison with the genome of a related species. Receiving as input a list of one2one orthologous genes found in the genomes of two species, the method builds a set of reliable and non-overlapping synteny blocks and refines the regions that are not contained into them. Through the alignment of each breakpoint sequence against its specific orthologous sequences in the other species, we can look for weak similarities inside the breakpoint, thus extending the synteny blocks and narrowing the breakpoints. The identification of the narrowed breakpoints relies on a segmentation algorithm and is statistically assessed. Here, we present the package Cassis that implements this method of precise detection of genomic rearrangement breakpoints. AVAILABILITY: Perl and R scripts are freely available for download at http://pbil.univ-lyon1.fr/software/Cassis/. Documentation with methodological background, technical aspects, download and setup instructions, as well as examples of applications are available together with the package. The package was tested on Linux and Mac OS environments and is distributed under the GNU GPL License.Entities:
Mesh:
Year: 2010 PMID: 20576622 PMCID: PMC2905553 DOI: 10.1093/bioinformatics/btq301
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Sequence S is defined by the boundaries of two consecutive synteny blocks A and B on the genome G. S (S) is defined by the boundaries of the orthologous block A (B) and of the previous/next synteny block (according to the orientation of the blocks) in the genome G. To perform the segmentation, the package considers the extended version of the sequences S, S and, S which includes the first/last genes of the synteny blocks.