| Literature DB >> 31074373 |
Jialu Hu1,2, Jingru Wang3, Jianan Lin3, Tianwei Liu3, Yuanke Zhong3, Jie Liu3, Yan Zheng3, Yiqun Gao3, Junhao He3, Xuequn Shang3.
Abstract
BACKGROUND: Transcription factors (TFs) play important roles in the regulation of gene expression. They can activate or block transcription of downstream genes in a manner of binding to specific genomic sequences. Therefore, motif discovery of these binding preference patterns is of central significance in the understanding of molecular regulation mechanism. Many algorithms have been proposed for the identification of transcription factor binding sites. However, it remains a challengeable problem.Entities:
Keywords: Binding site preference; Multiple instance learning; Support vector machine; Transcription factor
Mesh:
Substances:
Year: 2019 PMID: 31074373 PMCID: PMC6509868 DOI: 10.1186/s12859-019-2735-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1An example of MIL model for DNA fragments
Binary codes for each nucleotide
| Nucleotide | Code |
|---|---|
| A | (1, 0, 0, 0) |
| T | (0, 1, 0, 0) |
| C | (0, 0, 1, 0) |
| G | (0, 0, 0, 1) |
Each nucleotide was encoded in a 4-dimensional vector.
Fig. 2Comparison of motifs discovered by MD-SVM and JASPAR. Here, sequence motifs are graphically displayed in seq-logos. The height of each logo position reflects the degree of sequence conservation in multiple alignments. We compared our seq-logos of eight transcription factors to that extracted from the JASPAR database. Results show that MD-SVM can acurately identify most of the eight transcription factors
Performance comparison between MI-SVM and MD-SVM
| Transcription factor | MI-SVM | MD-SVM |
|---|---|---|
| Zscan10-3 | 0.802262 | 0.802638 |
| Sox14 | 0.918162 | 0.918175 |
| Irf2 | 0.966050 | 0.966175 |
| Nkx2-9 | 0.937225 | 0.937575 |
| Foxg1 | 0.896850 | 0.897000 |
| Mlx | 0.999125 | 0.999475 |
| Sdccag8 | 0.996550 | 0.996575 |
| Mecp2 | 0.930225 | 0.930125 |
| Zfp202 | 0.913325 | 0.920475 |
| Egr2 | 0.899875 | 0.911275 |
| Dmrtc2 | 0.968725 | 0.966925 |
| Pou1f1 | 0.997725 | 0.998575 |
| Pou3f1 | 0.993062 | 0.993063 |
| Foxo1 | 0.930800 | 0.930325 |
| Oct1 | 0.989450 | 0.994550 |
| Pit1 | 0.994875 | 0.994775 |
| Foxp2 | 0.925825 | 0.926425 |