Literature DB >> 19645418

New variable selection method using interval segmentation purity with application to blockwise kernel transform support vector machine classification of high-dimensional microarray data.

Li-Juan Tang1, Wen Du, Hai-Yan Fu, Jian-Hui Jiang, Hai-Long Wu, Guo-Li Shen, Ru-Qin Yu.   

Abstract

One problem with discriminant analysis of microarray data is representation of each sample by a large number of genes that are possibly irrelevant, insignificant, or redundant. Methods of variable selection are, therefore, of great significance in microarray data analysis. A new method for key gene selection has been proposed on the basis of interval segmentation purity that is defined as the purity of samples belonging to a certain class in intervals segmented by a mode search algorithm. This method identifies key variables most discriminative for each class, which offers possibility of unraveling the biological implication of selected genes. A salient advantage of the new strategy over existing methods is the capability of selecting genes that, though possibly exhibit a multimodal distribution, are the most discriminative for the classes of interest, considering that the expression levels of some genes may reflect systematic difference in within-class samples derived from different pathogenic mechanisms. On the basis of the key genes selected for individual classes, a support vector machine with block-wise kernel transform is developed for the classification of different classes. The combination of the proposed gene mining approach with support vector machine is demonstrated in cancer classification using two public data sets. The results reveal that significant genes have been identified for each class, and the classification model shows satisfactory performance in training and prediction for both data sets.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19645418     DOI: 10.1021/ci900032q

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  2 in total

1.  Chemometric Analysis of the Volatile Compounds Generated by Aspergillus carbonarius Strains Isolated from Grapes and Dried Vine Fruits.

Authors:  Zhan Cheng; Menghua Li; Philip J Marriott; Xiaoxu Zhang; Shiping Wang; Jiangui Li; Liyan Ma
Journal:  Toxins (Basel)       Date:  2018-02-06       Impact factor: 4.546

2.  Wnt/β-Catenin, Carbohydrate Metabolism, and PI3K-Akt Signaling Pathway-Related Genes as Potential Cancer Predictors.

Authors:  Pengliang Chen; Pengwei Shi; Gang Du; Zhen Zhang; Liang Liu
Journal:  J Healthc Eng       Date:  2019-10-20       Impact factor: 2.682

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.