| Literature DB >> 16640774 |
Tong Zhou1, Jianhong Weng, Xiao Sun, Zuhong Lu.
Abstract
BACKGROUND: Meiotic double-strand breaks occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Hotspots and coldspots are receiving increasing attention in research into the mechanism of meiotic recombination. However, predicting hotspots and coldspots from DNA sequence information is still a challenging task.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16640774 PMCID: PMC1463011 DOI: 10.1186/1471-2105-7-223
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The performance of SVM for differentiating the hot ORFs from the cold ORFs. The SVM models were trained using codon composition, codon bias and amino acid composition as sequence attributes respectively.
| Attributes | Sensitivity (%) | Specificity (%) | OP (%) | Accuracy (%) |
| FCU | 86.6 | 75.0 | 80.8 | 85.0 |
| RSCU | 84.2 | 71.9 | 78.1 | 82.5 |
| FAAU | 66.3 | 62.5 | 64.4 | 65.8 |
The performance of SVM with exclusion of certain amino acids. The SVM models were trained using codon composition, codon bias and amino acid composition as sequence attributes respectively, excluding His, Arg, Asn and Ser from the training datasets.
| Attributes | Sensitivity (%) | Specificity (%) | OP (%) | Accuracy (%) |
| FCU | 86.1 | 68.8 | 77.5 | 83.8 |
| RSCU | 84.2 | 65.6 | 74.9 | 81.6 |
| FAAU | 61.4 | 50.0 | 55.7 | 59.8 |
Figure 1Plot of the two most dominant axes generated by PCA. Hot ORFs are represented by the red asterisks, while cold ORFs are represented by the filled blue circles.
Figure 2A comparision of the average RSCU values of hot ORFs with that of cold ORFs. From left to right, the first 26 codons are statistically more frequent in the hot ORFs while the last 21 triplets are used at the higher frequency in the cold ORFs (P < 0.05). The RSCU values of hot group are marked with filled red squares, while those of cold group are marked with filled blue circles.
The performance of SVM only trained on the key codons. The SVM models were trained using FCU and RSCU values of the codons listed in Figure 2 as sequence attributes respectively. These codons are thought to play important roles in classification.
| Attributes | Sensitivity (%) | Specificity (%) | OP (%) | Accuracy (%) |
| FCU | 86.6 | 68.8 | 77.7 | 84.2 |
| RSCU | 84.6 | 65.6 | 75.4 | 82.1 |
Figure 3The performance of SVM for predicting hot ORFs from the full genome. The numerical value stuck on each bar cluster denotes the recombination rate cutoff of hot ORFs.
Figure 4The performance of SVM for predicting cold ORFs from the full genome. The numerical value stuck on each bar cluster denotes the recombination rate cutoff of cold ORFs.