Xiaobo Li1, Sihua Peng2. 1. Department of Computer Science and Technology, College of Engineering, Lishui University, Lishui 323000, China; ; School of Science and Technology, Zhejiang International Studies University, Hangzhou 310012, China; 2. Department of Biological Technology, School of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China.
Abstract
OBJECTIVE: Identification of colorectal cancer (CRC) metastasis genes is one of the most important issues in CRC research. For the purpose of mining CRC metastasis-associated genes, an integrated analysis of microarray data was presented, by combined with evidence acquired from comparative genomic hybridization (CGH) data. METHODS: Gene expression profile data of CRC samples were obtained at Gene Expression Omnibus (GEO) website. The 15 important chromosomal aberration sites detected by using CGH technology were used for integrated genomic and transcriptomic analysis. Significant Analysis of Microarray (SAM) was used to detect significantly differentially expressed genes across the whole genome. The overlapping genes were selected in their corresponding chromosomal aberration regions, and analyzed by using the Database for Annotation, Visualization and Integrated Discovery (DAVID). Finally, SVM-T-RFE gene selection algorithm was applied to identify metastasis-associated genes in CRC. RESULTS: A minimum gene set was obtained with the minimum number [14] of genes, and the highest classification accuracy (100%) in both PRI and META datasets. A fraction of selected genes are associated with CRC or its metastasis. CONCLUSIONS: Our results demonstrated that integration analysis is an effective strategy for mining cancer-associated genes.
OBJECTIVE: Identification of colorectal cancer (CRC) metastasis genes is one of the most important issues in CRC research. For the purpose of mining CRC metastasis-associated genes, an integrated analysis of microarray data was presented, by combined with evidence acquired from comparative genomic hybridization (CGH) data. METHODS: Gene expression profile data of CRC samples were obtained at Gene Expression Omnibus (GEO) website. The 15 important chromosomal aberration sites detected by using CGH technology were used for integrated genomic and transcriptomic analysis. Significant Analysis of Microarray (SAM) was used to detect significantly differentially expressed genes across the whole genome. The overlapping genes were selected in their corresponding chromosomal aberration regions, and analyzed by using the Database for Annotation, Visualization and Integrated Discovery (DAVID). Finally, SVM-T-RFE gene selection algorithm was applied to identify metastasis-associated genes in CRC. RESULTS: A minimum gene set was obtained with the minimum number [14] of genes, and the highest classification accuracy (100%) in both PRI and META datasets. A fraction of selected genes are associated with CRC or its metastasis. CONCLUSIONS: Our results demonstrated that integration analysis is an effective strategy for mining cancer-associated genes.
Entities:
Keywords:
Colorectal cancer metastasis; Database for Annotation, Visualization and Integrated Discovery (DAVID); SVM-T-RFE gene selection algorithm; Significant Analysis of Microarray (SAM); comparative genomic hybridization (CGH); integrated analysis
Authors: Glynn Dennis; Brad T Sherman; Douglas A Hosack; Jun Yang; Wei Gao; H Clifford Lane; Richard A Lempicki Journal: Genome Biol Date: 2003-04-03 Impact factor: 13.583
Authors: M F Di Renzo; M Olivero; A Giacomini; H Porte; E Chastre; L Mirossay; B Nordlinger; S Bretti; S Bottardi; S Giordano Journal: Clin Cancer Res Date: 1995-02 Impact factor: 12.531
Authors: Peter J Park; Sek Won Kong; Toma Tebaldi; Weil R Lai; Simon Kasif; Isaac S Kohane Journal: Bioinformatics Date: 2009-09-28 Impact factor: 6.937
Authors: K L Bowlt Blacklock; Z Birand; L E Selmic; P Nelissen; S Murphy; L Blackwood; J Bass; J McKay; R Fox; S Beaver; M Starkey Journal: Sci Rep Date: 2019-04-24 Impact factor: 4.379