Literature DB >> 25792441

Improved feature-based prediction of SNPs in human cytochrome P450 enzymes.

Li Li1, Yi Xiong, Zhuo-Yu Zhang, Quan Guo, Qin Xu, Hien-Haw Liow, Yong-Hong Zhang, Dong-Qing Wei.   

Abstract

Single nucleotide polymorphisms (SNPs) make up the most common form of mutations in human cytochrome P450 enzymes family, and have the potential to bring with different drug responses or specific diseases in individual patients. Here, based on machine learning technology, we aim to explore an effective set of sequence-based features for improving prediction of SNPs by using support vector machine algorithms. The features are derived from the target residues and flanking protein sequences, such as amino acid types, sequences composition, physicochemical properties, position-specific scoring matrix, phylogenetic entropy and the number of possible codons of target residues. In order to deal with the imbalance data with a majority of non-SNPs and a minority of SNPs, a preprocessing strategy based on fuzzy set theory was applied to the datasets. Our final model achieves the performance of 93.8% in sensitivity, 88.8% in specificity, 91.3% in accuracy and 0.971 of AUC value, which is significantly higher than the previous DNA sequence-based or protein sequence-based methods. Furthermore, our study also suggested the roles of individual features for prediction of SNPs. The most important features consist of the amino acid type, the number of available codons, position-specific scoring matrix and phylogenetic entropy. The improved model will be a promising tool for SNP predictions, and assist in the research of genome mutation and personalized prescriptions.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25792441     DOI: 10.1007/s12539-014-0257-2

Source DB:  PubMed          Journal:  Interdiscip Sci        ISSN: 1867-1462            Impact factor:   2.233


  5 in total

1.  Prediction of Effective Drug Combinations by an Improved Naïve Bayesian Algorithm.

Authors:  Li-Yue Bai; Hao Dai; Qin Xu; Muhammad Junaid; Shao-Liang Peng; Xiaolei Zhu; Yi Xiong; Dong-Qing Wei
Journal:  Int J Mol Sci       Date:  2018-02-05       Impact factor: 5.923

2.  Predicting Active NBA Players Most Likely to Be Inducted into the Basketball Hall of Famers Using Artificial Neural Networks in Microsoft Excel: Development and Usability Study.

Authors:  Po-Hsin Chou; Tsair-Wei Chien; Ting-Ya Yang; Yu-Tsen Yeh; Willy Chou; Chao-Hung Yeh
Journal:  Int J Environ Res Public Health       Date:  2021-04-16       Impact factor: 3.390

3.  An app to classify a 5-year survival in patients with breast cancer using the convolutional neural networks (CNN) in Microsoft Excel: Development and usability study.

Authors:  Cheng-Yao Lin; Tsair-Wei Chien; Yen-Hsun Chen; Yen-Ling Lee; Shih-Bin Su
Journal:  Medicine (Baltimore)       Date:  2022-01-28       Impact factor: 1.889

4.  PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine.

Authors:  Balachandran Manavalan; Tae H Shin; Gwang Lee
Journal:  Front Microbiol       Date:  2018-03-16       Impact factor: 5.640

5.  Predicting the 14-Day Hospital Readmission of Patients with Pneumonia Using Artificial Neural Networks (ANN).

Authors:  Shu-Farn Tey; Chung-Feng Liu; Tsair-Wei Chien; Chin-Wei Hsu; Kun-Chen Chan; Chia-Jung Chen; Tain-Junn Cheng; Wen-Shiann Wu
Journal:  Int J Environ Res Public Health       Date:  2021-05-12       Impact factor: 3.390

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.