| Literature DB >> 19146671 |
Zhen Wang1, Guohui Ding, Zhonghao Yu, Lei Liu, Yixue Li.
Abstract
BACKGROUND: The identification of chromosomal homologous segments (CHS) within and between genomes is essential for comparative genomics. Various processes including insertion/deletion and inversion could cause the degeneration of CHSs.Entities:
Year: 2009 PMID: 19146671 PMCID: PMC2647922 DOI: 10.1186/1748-7188-4-2
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Figure 1Definition of CHS. In this application, CHS is defined as two genomic regions that share a set of homologous (matched) genes, regardless of gene order and orientation. A limited number of unmatched genes can be allowed between two adjacent matched genes, but are restricted to be no more than a predefined constant, i.e. the maximal gap size.
Figure 2Graphical display of CHS. (A) CHSMiner organizes all identified CHSs as a table. It can generate two types of images for them. (B) Visualization of individual selected CHS, where homologous genes linked in the CHS are matched and labelled. (C) Visualization of a whole chromosomal pattern, where all homologous regions in a given chromosome are marked. The image is interactive and users can zoom in on a specific region.
Summary of the four programmes for comparison
| Programme | CHS definition | Search algorithm | Statistical evaluation |
| CHSMiner | Gene content | Bottom-up | Analytical calculation |
| LineUp | Gene colinearity | Bottom-up | Monte Carlo simulation |
| CloseUp | Gene content | Bottom-up | Monte Carlo simulation |
| HomologyTeams | Gene content | Top-down | Not available |
Figure 3Performance comparison on simulated data. The extent of noise was controlled by the background similarity R and the number of inversions F. For each combination of R and F, 10 samples were simulated. Both sensitivity and specificity were calculated for the result of each sample (see Methods). The data point and error bar represent the mean value and the standard error of every percentage.
Number of orthologs covered by Ensembl synteny map and CHSMiner result
| CHSMiner result (by maximal gap size) | ||||||
| One gene | Five genes | Total | ||||
| Present | Absent | Present | Absent | |||
| Ensembl synteny map | Present | 15866 | 2887 | 18135 | 618 | 18753 |
| Absent | 447 | 3071 | 1209 | 2309 | 3518 | |
| Total | 16313 | 5958 | 19344 | 2927 | ||