Literature DB >> 31944107

Identification of Highest-Affinity Binding Sites of Yeast Transcription Factor Families.

Zongyu Wang1, Wenying He1, Jijun Tang1,2,3, Fei Guo1.   

Abstract

Transcription factors (TFs) play a crucial role in controlling key cellular processes and responding to the environment. Yeast is a single-cell fungal organism that is a vital biological model organism for studying transcription and translation in basic biology. The transcriptional control process of yeast cells has been extensively calculated and studied using traditional methods and high-throughput technologies. However, the identities of transcription factors that regulate major functional categories of genes remain unknown. Due to the avalanche of biological data in the post-genomic era, it is an urgent need to develop automated computational methods to enable accurate identification of efficient transcription factor binding sites from the large number of candidates. In this paper, we analyzed high-resolution DNA-binding profiles and motifs for TFs, covering all possible contiguous 8-mers. First, we divided all 8-mer motifs into 16 various categories and selected all sorts of samples from each category by setting the threshold of E-score. Then, we employed five feature representation methods. Also, we adopted a total of four feature selection methods to filter out useless features. Finally, we used Extreme Gradient Boosting (XGBoost) as our base classifier and then utilized the one-vs-rest tactics to build 16 binary classifiers to solve this multiclassification problem. In the experiment, our method achieved the best performance with an overall accuracy of 79.72% and Mathew's correlation coefficient of 0.77. We found the similarity relationship among each category from different TF families and obtained sequence motif schematic diagrams via multiple sequence alignment. The complexity of DNA recognition may act as an important role in the evolution of gene regulation. Source codes are available at https://github.com/guofei-tju/tfbs.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31944107     DOI: 10.1021/acs.jcim.9b01012

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  9 in total

1.  Identification of Sub-Golgi protein localization by use of deep representation learning features.

Authors:  Zhibin Lv; Pingping Wang; Quan Zou; Qinghua Jiang
Journal:  Bioinformatics       Date:  2020-12-26       Impact factor: 6.937

2.  Identification and Classification of Enhancers Using Dimension Reduction Technique and Recurrent Neural Network.

Authors:  Qingwen Li; Lei Xu; Qingyuan Li; Lichao Zhang
Journal:  Comput Math Methods Med       Date:  2020-10-18       Impact factor: 2.238

3.  6mA-Pred: identifying DNA N6-methyladenine sites based on deep learning.

Authors:  Qianfei Huang; Wenyang Zhou; Fei Guo; Lei Xu; Lichao Zhang
Journal:  PeerJ       Date:  2021-02-03       Impact factor: 2.984

4.  prPred: A Predictor to Identify Plant Resistance Proteins by Incorporating k-Spaced Amino Acid (Group) Pairs.

Authors:  Yansu Wang; Pingping Wang; Yingjie Guo; Shan Huang; Yu Chen; Lei Xu
Journal:  Front Bioeng Biotechnol       Date:  2021-01-21

5.  Analysis of Cyclin-Dependent Kinase 1 as an Independent Prognostic Factor for Gastric Cancer Based on Statistical Methods.

Authors:  Xu Zhang; Hua Ma; Quan Zou; Jin Wu
Journal:  Front Cell Dev Biol       Date:  2020-12-07

6.  Gene-Based Testing of Interactions Using XGBoost in Genome-Wide Association Studies.

Authors:  Yingjie Guo; Chenxi Wu; Zhian Yuan; Yansu Wang; Zhen Liang; Yang Wang; Yi Zhang; Lei Xu
Journal:  Front Cell Dev Biol       Date:  2021-12-16

7.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

8.  Discrimination of Thermophilic Proteins and Non-thermophilic Proteins Using Feature Dimension Reduction.

Authors:  Zifan Guo; Pingping Wang; Zhendong Liu; Yuming Zhao
Journal:  Front Bioeng Biotechnol       Date:  2020-10-22

Review 9.  Recent Advances in Predicting Protein S-Nitrosylation Sites.

Authors:  Qian Zhao; Jiaqi Ma; Fang Xie; Yu Wang; Yu Zhang; Hui Li; Yuan Sun; Liqi Wang; Mian Guo; Ke Han
Journal:  Biomed Res Int       Date:  2021-02-09       Impact factor: 3.411

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.