Literature DB >> 36018512

Classification prediction of early pulmonary nodes based on weighted gene correlation network analysis and machine learning.

Guang Li1, Meng Yang2, Longke Ran3, Fu Jin4.   

Abstract

OBJECTIVE: To use weighted gene correlation network analysis (WGCNA) and machine learning algorithm to predict classification of early pulmonary nodes with public databases.
METHODS: The expression data and clinical data of lung cancer patients were firstly extracted from public database (GTEx and TCGA) to study the differentially expressed genes (DEGs) of lung adenocarcinoma (LUAD). The intersection of three R packages (Dseq2, Limma, EdgeR) methods were selected as candidate DEGs for further study. WGCNA was used to obtain relevant modules and key genes of lung cancer classification, GO and KEGG enrichment analysis was performed. The model was built using two machine learning methods, Least Absolute Shrinkage and Selection Operator (LASSO) regression and tumor classification was also predicted with extreme Gradient Boosting (XGBoost) algorithm.
RESULTS: DEGs analysis revealed that there were 1306 LUAD genes. WGCNA module analysis showed that a total of 116 genes were significantly related to classification, and module genes were mainly related to 14 KEGG pathways. The machine learning algorithm identified 10 target genes by LASSO regression analysis of differential genes, and 18 genes were identified by XGBoost model. A total of 6 genes were found from the intersection of the above methods as classification signatures of early pulmonary nodules, including "HMGB3" "ARHGAP6" "TCF21" "FCN3" "COL6A6" "GOLM1".
CONCLUSION: Using DEGs analysis, WGCNA method and machine learning algorithm, six gene signatures related to early stage of LUAD, which can assist clinicians in disease classification prediction.
© 2022. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Entities:  

Keywords:  DEGs; LASSO; Lung cancer; WGCNA; XGBoost

Year:  2022        PMID: 36018512     DOI: 10.1007/s00432-022-04312-7

Source DB:  PubMed          Journal:  J Cancer Res Clin Oncol        ISSN: 0171-5216            Impact factor:   4.322


  5 in total

1.  Overexpression of golgi membrane protein 1 promotes non-small-cell carcinoma aggressiveness by regulating the matrix metallopeptidase 13.

Authors:  Li Mei Li
Journal:  Am J Cancer Res       Date:  2018-03-01       Impact factor: 6.166

2.  LncRNA LINC00163 upregulation suppresses lung cancer development though transcriptionally increasing TCF21 expression.

Authors:  Xiaotong Guo; Youlei Wei; Zhe Wang; Wenyi Liu; Yikun Yang; Xin Yu; Jie He
Journal:  Am J Cancer Res       Date:  2018-12-01       Impact factor: 6.166

3.  CRISPR/Cas9-mediated knock-in of alligator cathelicidin gene in a non-coding region of channel catfish genome.

Authors:  Rhoda Mae C Simora; Max R Bangs; Wenwen Wang; Xiaoli Ma; Baofeng Su; Mohd G Q Khan; Zhenkui Qin; Cuiyu Lu; Veronica Alston; Darshika Hettiarachchi; Andrew Johnson; Shangjia Li; Michael Coogan; Jeremy Gurbatow; Jeffery S Terhune; Xu Wang; Rex A Dunham
Journal:  Sci Rep       Date:  2020-12-17       Impact factor: 4.379

4.  The landscape of gene co-expression modules correlating with prognostic genetic abnormalities in AML.

Authors:  Chao Guo; Ya-Yue Gao; Qian-Qian Ju; Chun-Xia Zhang; Ming Gong; Zhen-Ling Li
Journal:  J Transl Med       Date:  2021-05-29       Impact factor: 5.531

5.  Metabolic and Evolutionary Engineering of Diploid Yeast for the Production of First- and Second-Generation Ethanol.

Authors:  Yang Sun; Meilin Kong; Xiaowei Li; Qi Li; Qian Xue; Junyan Hou; Zefang Jia; Zhipeng Lei; Wei Xiao; Shuobo Shi; Limin Cao
Journal:  Front Bioeng Biotechnol       Date:  2022-01-28
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.