Sirorat Pattanapairoj1, Atit Silsirivanit2, Kanha Muisuk3, Wunchana Seubwai3, Ubon Cha'on2, Kulthida Vaeteewoottacharn2, Kanlayanee Sawanyawisuth2, Danaipong Chetchotsak4, Sopit Wongkham5. 1. Department of Industrial Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen, Thailand; System Modeling for Industry Research Group, Faculty of Engineering, Khon Kaen University, Khon Kaen, Thailand. 2. Department of Biochemistry, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Liver Fluke and Cholangiocarcinoma Research Center, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand. 3. Department of Forensic Medicine, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Liver Fluke and Cholangiocarcinoma Research Center, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand. 4. Department of Industrial Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen, Thailand; System Modeling for Industry Research Group, Faculty of Engineering, Khon Kaen University, Khon Kaen, Thailand. Electronic address: cdanai@kku.ac.th. 5. Department of Biochemistry, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand; Liver Fluke and Cholangiocarcinoma Research Center, Faculty of Medicine, Khon Kaen University, Khon Kaen, Thailand. Electronic address: sopit@kku.ac.th.
Abstract
OBJECTIVE: Cholangiocarcinoma (CCA) is usually fatal because of the absence of tests for early detection and lack of effective therapy. Tumor markers with adequate diagnostic values are of clinical significance. This study is aimed to improve the diagnostic power of serum markers using the computational data mining technique to develop a combined diagnostic model that yielded the best diagnostic values for CCA. DESIGN AND METHODS: Eight CCA-associated markers-carcinoembryonic antigen, carbohydrate antigen 19-9, alkaline phosphatase (ALP), and gamma glutamyl transferase, biliary-ALP, mucin5AC, CCA-associated carbohydrate antigen (CCA-CA) and CA-S27-were used as the inputs for the C4.5 decision tree classification model and the selected model was confirmed by ANN analyses. Eight serum markers for CCA were determined in the training set of 85 histologically proven-CCA patients and 82 control subjects. The chosen set of combined markers that gave the best diagnostic values for CCA was then validated in the testing set of 22 CCA patients and 60 controls. RESULTS: A decision tree diagram built by the C4.5 algorithm suggested the serial analysis of CCA-CA and ALP for distinguishing CCA patients from non-CCA subjects with all diagnostic parameters ≥95%. The combined tests showed a precise diagnosis in the testing set. CONCLUSIONS: The C4.5 model indicates the combined markers of CCA-CA and ALP that produced the more precise diagnosis for CCA.
OBJECTIVE:Cholangiocarcinoma (CCA) is usually fatal because of the absence of tests for early detection and lack of effective therapy. Tumor markers with adequate diagnostic values are of clinical significance. This study is aimed to improve the diagnostic power of serum markers using the computational data mining technique to develop a combined diagnostic model that yielded the best diagnostic values for CCA. DESIGN AND METHODS: Eight CCA-associated markers-carcinoembryonic antigen, carbohydrate antigen 19-9, alkaline phosphatase (ALP), and gamma glutamyl transferase, biliary-ALP, mucin5AC, CCA-associated carbohydrate antigen (CCA-CA) and CA-S27-were used as the inputs for the C4.5 decision tree classification model and the selected model was confirmed by ANN analyses. Eight serum markers for CCA were determined in the training set of 85 histologically proven-CCA patients and 82 control subjects. The chosen set of combined markers that gave the best diagnostic values for CCA was then validated in the testing set of 22 CCA patients and 60 controls. RESULTS: A decision tree diagram built by the C4.5 algorithm suggested the serial analysis of CCA-CA and ALP for distinguishing CCA patients from non-CCA subjects with all diagnostic parameters ≥95%. The combined tests showed a precise diagnosis in the testing set. CONCLUSIONS: The C4.5 model indicates the combined markers of CCA-CA and ALP that produced the more precise diagnosis for CCA.