EnsembleGASVR: a novel ensemble method for classifying missense single nucleotide polymorphisms.

Trisevgeni Rapakoulia1, Konstantinos Theofilatos1, Dimitrios Kleftogiannis1, Spiros Likothanasis1, Athanasios Tsakalidis1, Seferina Mavroudi2.   

Abstract

MOTIVATION: Single nucleotide polymorphisms (SNPs) are considered the most frequently occurring DNA sequence variations. Several computational methods have been proposed for classifying missense SNPs as neutral or disease-associated. However, existing computational approaches select features arbitrarily, without sufficient documentation of their relevance. Moreover, they are limited by missing values and by imbalance between the learning datasets, and most of them do not support their predictions with confidence scores.
RESULTS: To overcome these limitations, a novel ensemble computational methodology is proposed. EnsembleGASVR uses a two-step algorithm. In the first step, a novel evolutionary embedded algorithm locates close-to-optimal Support Vector Regression models. In the second step, these models are combined into a universal predictor that is less prone to overfitting, systematizes the rebalancing of the learning sets and uses an internal approach to handle missing values without loss of information. Confidence scores support all predictions, and the model can be tuned by modifying the classification thresholds. An extensive study was performed to collect the most relevant features for classifying SNPs, and a superset of 88 features was constructed. Experimental results show that the proposed framework outperforms well-known algorithms in classification performance on the examined datasets. Finally, the framework uncovered the significant role of certain features, such as solvent accessibility, and the top-scored predictions were further validated by linking them with disease phenotypes.
AVAILABILITY AND IMPLEMENTATION: Datasets and code are freely available on the Web at http://prlab.ceid.upatras.gr/EnsembleGASVR/dataset-codes.zip. All the required information about the article is available at http://prlab.ceid.upatras.gr/EnsembleGASVR/site.html.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
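
The second step described in the abstract combines several regression models into one predictor with a confidence score and a tunable classification threshold. The Python sketch below illustrates that idea only. It is not the authors' implementation: the member models are hypothetical hand-written functions standing in for trained Support Vector Regression models, plain averaging is an assumed combination rule, and the names `ensemble_score`, `classify` and `MODELS`, as well as the default threshold of 0.5, are illustrative choices. The evolutionary search, the rebalancing of the learning sets and the missing-value handling are not shown.

```python
from statistics import mean
from typing import Callable, List, Sequence

# Hypothetical stand-in for a trained regression model: it maps a feature
# vector to a real-valued score (higher = more likely disease-associated).
Model = Callable[[Sequence[float]], float]

# Toy members with made-up coefficients; real members would be fitted models.
MODELS: List[Model] = [
    lambda x: 0.2 + 0.6 * x[0],
    lambda x: 0.9 * x[1],
    lambda x: 0.5 * x[0] + 0.5 * x[1],
]


def ensemble_score(models: Sequence[Model], features: Sequence[float]) -> float:
    """Average the member outputs and clip the result to [0, 1].

    Averaging is an assumption made for this sketch; the abstract does not
    state how the member models are combined.
    """
    raw = mean(model(features) for model in models)
    return min(1.0, max(0.0, raw))


def classify(score: float, threshold: float = 0.5) -> str:
    """Turn a confidence score into a label using a tunable threshold."""
    return "disease-associated" if score >= threshold else "neutral"


if __name__ == "__main__":
    variant = [0.5, 1.0]  # made-up feature values for one variant
    score = ensemble_score(MODELS, variant)
    print(classify(score))                 # default threshold
    print(classify(score, threshold=0.8))  # stricter threshold
```

With these toy members the variant is labelled disease-associated at the default threshold and neutral at the stricter one, which shows how moving the threshold trades sensitivity against specificity.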

Year:  2014        PMID: 24771561     DOI: 10.1093/bioinformatics/btu297

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

Review 1.  Clinical applications of artificial intelligence and machine learning in cancer diagnosis: looking into the future.

Authors:  Muhammad Javed Iqbal; Zeeshan Javed; Haleema Sadia; Ijaz A Qureshi; Asma Irshad; Rais Ahmed; Kausar Malik; Shahid Raza; Asif Abbas; Raffaele Pezzani; Javad Sharifi-Rad
Journal:  Cancer Cell Int       Date:  2021-05-21       Impact factor: 5.722

Review 2.  Artificial intelligence perspective in the future of endocrine diseases.

Authors:  Mandana Hasanzad; Bagher Larijani; Hamid Reza Aghaei Meybodi; Negar Sarhangi
Journal:  J Diabetes Metab Disord       Date:  2022-01-11

Review 3.  Artificial Intelligence-Based Data-Driven Strategy to Accelerate Research, Development, and Clinical Trials of COVID Vaccine.

Authors:  Ashwani Sharma; Tarun Virmani; Vipluv Pathak; Anjali Sharma; Kamla Pathak; Girish Kumar; Devender Pathak
Journal:  Biomed Res Int       Date:  2022-07-06       Impact factor: 3.246

Review 4.  Towards Increasing the Clinical Relevance of In Silico Methods to Predict Pathogenic Missense Variants.

Authors:  David L Masica; Rachel Karchin
Journal:  PLoS Comput Biol       Date:  2016-05-12       Impact factor: 4.475

5.  TELS: A Novel Computational Framework for Identifying Motif Signatures of Transcribed Enhancers.

Authors:  Dimitrios Kleftogiannis; Haitham Ashoor; Vladimir B Bajic
Journal:  Genomics Proteomics Bioinformatics       Date:  2018-12-19       Impact factor: 7.691
