Literature DB >> 27916703

OOgenesis_Pred: A sequence-based method for predicting oogenesis proteins by six different modes of Chou's pseudo amino acid composition.

Maryam Rahimi1, Mohammad Reza Bakhtiarizadeh2, Abdollah Mohammadi-Sangcheshmeh1.   

Abstract

Regarding to critical roles of oogenesis in formation of ova or unfertilized eggs from the oogonia by mitotic division and subsequent differentiation, the identification of oogenesis-related proteins is of great interest. However, the experimental determination of proteins involved in oogenesis is expensive, time consuming and labor-intensive. Therefore, a new powerful discriminating model is indispensable for classifying oogenesis/non-oogenesis-related proteins with high accuracy and precision. Hereby, for the first time we developed a support vector machine based oogenesis protein prediction method which differentiates oogenesis from non-oogenesis proteins. By means of informative protein physicochemical properties and in addition parameter optimization scheme, our method yields a robust and consistent performance. Our model achieved 87.68% and 84.82% prediction accuracy by five-fold cross validation test for datasets with 90% and 50% identity, respectively. The prediction model was also assessed using the independent dataset and yielded 91.62% and 85.38% prediction accuracy for datasets with 90% and 50% identity, respectively, which further demonstrates the effectiveness of our method. Moreover, by applying 10 different feature weighting methods, the more important protein features for oogenesis/non-oogenesis-related proteins discrimination, including serine and glycine frequency, quasi-sequence-order, pseudo-amino acid composition, distribution and conjoint triad, were determined. The success rates revealed that our model can be considered as a new encouraging and strong model for predicting proteins involved in oogenesis with appropriate performance. To enhance the value of the practical applications of the proposed method, we developed a standalone software for predicting oogenesis candidate proteins called OOgenesis_Pred. This software is the first predictor ever established for identifying oogenesis proteins. We also showed the capability of OOgenesis_Pred by making oogenesis-related proteins prediction for some of the oogenesis candidate proteins. It is anticipated that OOgenesis_Pred will become a powerful tool for future proteomic studies related to oogenesis.
Copyright © 2016 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Data mining; Feature selection; Oogenesis; Protein features; SVM

Mesh:

Substances:

Year:  2016        PMID: 27916703     DOI: 10.1016/j.jtbi.2016.11.028

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  11 in total

Review 1.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

2.  Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate.

Authors:  Chun Yan Yu; Xiao Xu Li; Hong Yang; Ying Hong Li; Wei Wei Xue; Yu Zong Chen; Lin Tao; Feng Zhu
Journal:  Int J Mol Sci       Date:  2018-01-08       Impact factor: 5.923

3.  Estimating the effects of transcription factors binding and histone modifications on gene expression levels in human cells.

Authors:  Lu-Qiang Zhang; Qian-Zhong Li
Journal:  Oncotarget       Date:  2017-06-20

4.  iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition.

Authors:  Wang-Ren Qiu; Shi-Yu Jiang; Zhao-Chun Xu; Xuan Xiao; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2017-06-20

5.  2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function.

Authors:  Bin Liu; Fan Yang; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2017-04-13

6.  Detecting Succinylation sites from protein sequences using ensemble support vector machine.

Authors:  Qiao Ning; Xiaosa Zhao; Lingling Bao; Zhiqiang Ma; Xiaowei Zhao
Journal:  BMC Bioinformatics       Date:  2018-06-25       Impact factor: 3.169

7.  iRSpot-Pse6NC: Identifying recombination spots in Saccharomyces cerevisiae by incorporating hexamer composition into general PseKNC.

Authors:  Hui Yang; Wang-Ren Qiu; Guoqing Liu; Feng-Biao Guo; Wei Chen; Kuo-Chen Chou; Hao Lin
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

8.  Prediction of presynaptic and postsynaptic neurotoxins by combining various Chou's pseudo components.

Authors:  Haiyan Huo; Tao Li; Shiyuan Wang; Yingli Lv; Yongchun Zuo; Lei Yang
Journal:  Sci Rep       Date:  2017-07-19       Impact factor: 4.379

9.  PrESOgenesis: A two-layer multi-label predictor for identifying fertility-related proteins using support vector machine and pseudo amino acid composition approach.

Authors:  Mohammad Reza Bakhtiarizadeh; Maryam Rahimi; Abdollah Mohammadi-Sangcheshmeh; Vahid Shariati J; Seyed Alireza Salami
Journal:  Sci Rep       Date:  2018-06-13       Impact factor: 4.379

10.  2L-PCA: a two-level principal component analyzer for quantitative drug design and its applications.

Authors:  Qi-Shi Du; Shu-Qing Wang; Neng-Zhong Xie; Qing-Yan Wang; Ri-Bo Huang; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2017-08-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.