| Literature DB >> 34738844 |
Zarrukh Baig1, Nawaf Abu-Omar1, Rayyan Khan1, Carlos Verdiales2, Ryan Frehlick2, John Shaw1, Fang-Xiang Wu1, Yigang Luo1.
Abstract
Background: The purpose of this project is to identify prognostic features in resectable pancreatic head adenocarcinoma and use these features to develop a machine learning algorithm that prognosticates survival for patients pursuing pancreaticoduodenectomy.Entities:
Keywords: machine learning; pancreatic cancer; pancreaticoduodenectomy; prognosis; supervise learning model; whipple procedure
Mesh:
Year: 2021 PMID: 34738844 PMCID: PMC8573477 DOI: 10.1177/15330338211050767
Source DB: PubMed Journal: Technol Cancer Res Treat ISSN: 1533-0338
List of 50 potential factors that could affect survival after pancreaticoduodenectomy selected by a team of Hepatopancreaticobiliary Surgeons in Saskatoon. These factors were then subsequently analyzed to determine 11 specific features that can predict survival < 2 years. List of abbreviations: ASA (American Society of Anesthesiologists classification), T2DM (Type 2 Diabetes Mellitus), HTN (Hypertension), CAD (Coronary Artery Disease), AFP (alpha fetoprotein), CA19 to 9 (Carbohydrate Antigen 19-9 tumour marker), ERCP (Endoscopic Retrograde Cholangiopancreatography Procedure), CT (Computerized Tomography), MRI (Magnetic Resonance Imaging), CBD (Common Bile Duct), SMA (Superior Mesenteric Artery), and SMV (Superior Mesenteric Vein).
| Demographics: | Gender | Age | Type pf Cancer | Family History of Caner | Type of Familial Cancer | ||
|---|---|---|---|---|---|---|---|
| Comorbidities: | BMI | ASA | T2DM | HTN | CAD | Dyslipidemia | MEN 1 mutation |
| Smoking | Alcohol | ||||||
| Presenting Symptoms: | New Diagnosis of DM | Abdominal/Back Pain | Jaundice | Weight loss (> 15 lb) | |||
| Laboratory Investigations: | Total Bilirubin | Lipase | Albumin | Pre-albumin | AFP | CA19 - 9 | |
| Histopathology: | T-Stage | N-stage | M-Stage | Pathological Differentiation | Additional pathology | Tumor Site and Invasion | Tumor size on pathology |
| Dysplasia of Portal Tissue | |||||||
| Imaging: | ERCP | Tumor size -CT | Tumor Size- MRI | CBD Dilation | CBD Stricutre | Pancreatic Duct Dilation | SMA Involvement |
| SMV Involvement | Portal Vein Involvement | Perineural Involvement | Other Involvement | SMV Involvement | |||
| Surgical Resection: | Margin (P or N) | Portal Vein Resection | |||||
| Other Treatments & Outcome: | Neoadjuvant Therapy | Adjuvant Therapy | Hospital Stay (d) | Post-op Complications | Recurrence | Site of Recurrence or Metastasis |
Figure 1.Selection of participants after pancreaticoduodenectomy for PDAC.
11 prognostic factors that predict survival in pancreatic ductal adenocarcinoma. These were selected using 3 separate statistical tests that included statistical significance testing, recursive feature elimination, and stable selection techniques.
| Prognostic Factors (Mean ± SD) | ||
|---|---|---|
| Variables | Survival < 2 Years | Survival > 2 Years |
|
| ||
| T2DM | 0.487 ±0.506 | 0.272 ± 0.449 |
| FHx of Cancer | 0.315 ± 0.471 | 0.527 ± 0.503 |
| Number of Family Members with Cancer | 0.324 ± 0.474 | 0.545 ± 0.502 |
| Bile duct Stricture | 0.315 ± 0.471 | 0.145 ± 0.355 |
| Perineural involvement | 0.736 ± 0.446 | 0.309 ± 0.466 |
| Margins (positive) | 0.605 ± 0.495 | 0.218 ± 0.416 |
| Portal vein tissue resection | 0.447 ± 0.503 | 0.072 ± 0.262 |
| Neoadjuvant therapy | 0.081 ± 0.276 | 0.076 ± 0.269 |
| Adjuvant therapy | 0.815 ± 0.392 | 0.962 ± 0.192 |
|
| ||
| Size of tumor based on MRI | 1.797 ± 1.327 | 2.725 ± 2.125 |
| Size of tumor based on pathology | 1.793 ± 1.054 | 1.574 ± 1.319 |
Comparison of various machine learning methods and their respective performances. In this study, an SVM algorithm was utilized given it provided the highest testing accuracy.
| Classifier | Training | Testing |
|---|---|---|
| SVM | 0.7527 | 0.7526 |
| Random Forest | 0.6844 | 0.6733 |
| Naïve Bayes | 0.7526 | 0.7469 |
Testing data confusion matrix. Based on the confusion matrix, this algorithm has a sensitivity of 41.9% and specificity of 97.5% in predicting survival < 2. The algorithm was designed using 90% of the data. 10% of the data was used for testing the sensitivity and specificity of the algorithm. The model was also cross-validated using the entire subset of 93 subjects by applying 300 runs of the leave-one-out cross validation method. This achieved a mean training accuracy of 75.26%.
|
| ||||
|---|---|---|---|---|
| Survival < 2 years | Survival > 2 years | |||
|
| Survival < 2 years (n = 38) 4 (10% of 38) | TP = 1.6766 | FN = 2.3233 | Sensitivity = 41.9% |
| Survival > 2 years (n = 55) 6 (10% of 55) | FP = 0.150 | TN = 5.850 | Specificity = 97.5% | |
| 2.756 | 7.242 | ACC = .7526 | ||