GMDH-based feature ranking and selection for improved classification of medical data.

R E Abdel-Aal.

Abstract

Medical applications are often characterized by a large number of disease markers and a relatively small number of data records. We demonstrate that complete feature ranking followed by selection can lead to appreciable reductions in data dimensionality, with significant improvements in the implementation and performance of classifiers for medical diagnosis. We describe a novel approach for ranking all features according to their predictive quality using properties unique to learning algorithms based on the group method of data handling (GMDH). An abductive network training algorithm is repeatedly used to select groups of optimum predictors from the feature set at gradually increasing levels of model complexity specified by the user. Groups selected earlier are better predictors. The process is then repeated to rank features within individual groups. The resulting full feature ranking can be used to determine the optimum feature subset by starting at the top of the list and progressively including more features until the classification error rate on an out-of-sample evaluation set starts to increase due to overfitting. The approach is demonstrated on two medical diagnosis datasets (breast cancer and heart disease) and comparisons are made with other feature ranking and selection methods. Receiver operating characteristics (ROC) analysis is used to compare classifier performance. At default model complexity, dimensionality reduction of 22 and 54% could be achieved for the breast cancer and heart disease data, respectively, leading to improvements in the overall classification performance. For both datasets, considerable dimensionality reduction introduced no significant reduction in the area under the ROC curve. GMDH-based feature selection results have also proved effective with neural network classifiers.
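The subset-selection step described in the abstract (start at the top of the ranked list and add features until the out-of-sample error begins to rise) can be sketched as follows. This is a minimal illustration, not the author's implementation: the function name `select_top_features`, the `eval_error` callback, and the toy error values are all hypothetical, and the GMDH-based ranking itself is assumed to have been produced elsewhere.

```python
from typing import Callable, List, Sequence


def select_top_features(
    ranked: Sequence[str],
    eval_error: Callable[[List[str]], float],
) -> List[str]:
    """Walk down a ranked feature list and stop once evaluation error rises.

    `ranked` is ordered from best to worst predictor (assumed given).
    `eval_error` is a hypothetical callback that trains a classifier on the
    given features and returns its error rate on a held-out evaluation set.
    """
    if not ranked:
        return []

    best_k = 1
    best_error = eval_error(list(ranked[:1]))
    prev_error = best_error

    for k in range(2, len(ranked) + 1):
        error = eval_error(list(ranked[:k]))
        if error > prev_error:
            # Error started to increase: treat this as the onset of overfitting.
            break
        if error < best_error:
            best_k, best_error = k, error
        prev_error = error

    return list(ranked[:best_k])


# Toy usage with made-up error rates keyed by subset size (illustrative only,
# not results from the paper).
toy_errors = {1: 0.20, 2: 0.12, 3: 0.10, 4: 0.13, 5: 0.11}
ranked_features = ["f1", "f2", "f3", "f4", "f5"]
selected = select_top_features(ranked_features, lambda fs: toy_errors[len(fs)])
print(selected)
```

Stopping at the first increase follows the wording of the abstract; a more cautious variant might scan the whole list and keep the global minimum, at the cost of more classifier training runs.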

Year: 2005 | PMID: 16337569 | DOI: 10.1016/j.jbi.2005.03.003

Source DB: PubMed | Journal: J Biomed Inform | ISSN: 1532-0464 | Impact factor: 6.317


  6 in total

1.  Classification and Progression Based on CFS-GA and C5.0 Boost Decision Tree of TCM Zheng in Chronic Hepatitis B.

Authors:  Xiao Yu Chen; Li Zhuang Ma; Na Chu; Min Zhou; Yiyang Hu
Journal:  Evid Based Complement Alternat Med       Date:  2013-01-27       Impact factor: 2.629

2.  Enhancement of early cervical cancer diagnosis with epithelial layer analysis of fluorescence lifetime images.

Authors:  Jun Gu; Chit Yaw Fu; Beng Koon Ng; Lin Bo Liu; Soo Kim Lim-Tan; Caroline Guat Lay Lee
Journal:  PLoS One       Date:  2015-05-12       Impact factor: 3.240

3.  Multiparametric quantitative and texture 18F-FDG PET/CT analysis for primary malignant tumour grade differentiation.

Authors:  Mykola Novikov
Journal:  Eur Radiol Exp       Date:  2019-12-18

4.  Deep learning for early detection of pathological changes in X-ray bone microstructures: case of osteoarthritis.

Authors:  Livija Jakaite; Jiří Hladůvka; Sergey Minaev; Aziz Ambia; Wojtek Krzanowski; Vitaly Schetinin
Journal:  Sci Rep       Date:  2021-01-27       Impact factor: 4.379

5.  Time series prediction of under-five mortality rates for Nigeria: comparative analysis of artificial neural networks, Holt-Winters exponential smoothing and autoregressive integrated moving average models.

Authors:  Daniel Adedayo Adeyinka; Nazeem Muhajarine
Journal:  BMC Med Res Methodol       Date:  2020-12-03       Impact factor: 4.615

6.  Systems biology and machine learning approaches identify drug targets in diabetic nephropathy.

Authors:  Maryam Abedi; Hamid Reza Marateb; Mohammad Reza Mohebian; Seyed Hamid Aghaee-Bakhtiari; Seyed Mahdi Nassiri; Yousof Gheisari
Journal:  Sci Rep       Date:  2021-12-06       Impact factor: 4.379

