Literature DB >> 21168527

Implementing ReliefF filters to extract meaningful features from genetic lifetime datasets.

Lorenzo Beretta1, Alessandro Santaniello.   

Abstract

BACKGROUND: The analysis of survival data allows to evaluate whether in a population the genetic exposure is related to the time until an event occurs. Owing to the complexity of common human diseases, there is the incipient need to develop bioinformatics tools to properly model non-linear high-order interactions in lifetime datasets. These tools, such as the survival dimensionality reduction algorithm, may suffer from extreme computational costs in large-scale datasets. Herein, we address the problem of estimating the quality of attributes, so as to extract relevant features from lifetime datasets and to scale down their size.
METHODS: The ReliefF algorithm was modified and adjusted to compensate for the loss of information due to censoring, introducing reclassification and weighting schemes. Synthetic lifetime two-locus epistatic datasets of 500 attributes, 400-800 individuals and different degrees of cumulative heritability and censorship were generated. The capability of the survival ReliefF algorithm (sReliefF) and of a tuned sReliefF approach to properly select the causative pair of attributes was evaluated and compared to univariate selection based on Cox scores. RESULTS/
CONCLUSIONS: sReliefF methods efficiently scaled down the simulated datasets, whilst univariate selection performed no better than random choice. These approaches may help to reduce the computational cost and to improve the classification task of algorithms that model high-order interactions in presence of right-censored data. AVAILABILITY: http://sourceforge.net/projects/sdrproject/files/sReliefF/.
Copyright © 2010 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2010        PMID: 21168527     DOI: 10.1016/j.jbi.2010.12.003

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  Colorectal Cancer Diagnostic Algorithm Based on Sub-Patch Weight Color Histogram in Combination of Improved Least Squares Support Vector Machine for Pathological Image.

Authors:  Kai Yang; Bi Zhou; Fei Yi; Yan Chen; Yingsheng Chen
Journal:  J Med Syst       Date:  2019-08-14       Impact factor: 4.460

Review 2.  Relief-based feature selection: Introduction and review.

Authors:  Ryan J Urbanowicz; Melissa Meeker; William La Cava; Randal S Olson; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-18       Impact factor: 6.317

3.  Ensemble learning for detecting gene-gene interactions in colorectal cancer.

Authors:  Faramarz Dorani; Ting Hu; Michael O Woods; Guangju Zhai
Journal:  PeerJ       Date:  2018-10-29       Impact factor: 2.984

4.  Investigating the relationship between the SNCA gene and cognitive abilities in idiopathic Parkinson's disease using machine learning.

Authors:  Mehrafarin Ramezani; Pauline Mouches; Eunjin Yoon; Deepthi Rajashekar; Jennifer A Ruskey; Etienne Leveille; Kristina Martens; Mekale Kibreab; Tracy Hammer; Iris Kathol; Nadia Maarouf; Justyna Sarna; Davide Martino; Gerald Pfeffer; Ziv Gan-Or; Nils D Forkert; Oury Monchi
Journal:  Sci Rep       Date:  2021-03-01       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.