| Literature DB >> 28155714 |
Quan Zou1, Shixiang Wan1,2, Ying Ju3, Jijun Tang1,4, Xiangxiang Zeng5.
Abstract
BACKGROUND: It is necessary and essential to discovery protein function from the novel primary sequences. Wet lab experimental procedures are not only time-consuming, but also costly, so predicting protein structure and function reliably based only on amino acid sequence has significant value. TATA-binding protein (TBP) is a kind of DNA binding protein, which plays a key role in the transcription regulation. Our study proposed an automatic approach for identifying TATA-binding proteins efficiently, accurately, and conveniently. This method would guide for the special protein identification with computational intelligence strategies.Entities:
Keywords: Dimensionality reduction; Machine learning; Protein sequence features; Support vector machine; TATA binding protein
Mesh:
Substances:
Year: 2016 PMID: 28155714 PMCID: PMC5259984 DOI: 10.1186/s12918-016-0353-5
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1PSIPRED graphical output from prediction of a TBP (CASP3 target Q8CII9) produced by PSIPRED View—a Java visualization tool that produces two-dimensional graphical representations of PSIPRED predictions
Fig. 2Optimal dimensionality searching based on MRMD
Fig. 3Five classifier sensitivities (SN)
Fig. 4Five classifier specificities (SP)
Fig. 5Five classifier accuracies (ACC)
Fig. 6SN, SP, and ACC of the primary step
Fig. 7SN, SP, and ACC of the secondary step
Fig. 8SN, SP, and ACC of the primary step with high quality negative samples
Fig. 9SN, SP, and ACC of the secondary step with high quality negative samples
Comparison with the searching tools
| sn | sp | acc | |
|---|---|---|---|
| Our method | 89.60% | 91.10% | 90.46% |
| BLASTP | 86.26% | 78.96% | 82.89% |
| PSI-BLAST | 88.62% | 81.60% | 84.36% |