Literature DB >> 30568493

Development of a prediction model for pancreatic cancer in patients with type 2 diabetes using logistic regression and artificial neural network models.

Meng Hsuen Hsieh¹, Li-Min Sun², Cheng-Li Lin^3,4, Meng-Ju Hsieh⁵, Chung-Y Hsu⁶, Chia-Hung Kao^6,7,8.

Abstract

OBJECTIVES: Patients with type 2 diabetes (T2DM) are suggested to have a higher risk of developing pancreatic cancer. We used two models to predict pancreatic cancer risk among patients with T2DM.
METHODS: The original data used for this investigation were retrieved from the National Health Insurance Research Database of Taiwan. The prediction models included the available possible risk factors for pancreatic cancer. The data were split into training and test sets: 97.5% of the data were used as the training set and 2.5% of the data were used as the test set. Logistic regression (LR) and artificial neural network (ANN) models were implemented using Python (Version 3.7.0). The F 1, precision, and recall were compared between the LR and the ANN models. The areas under the receiver operating characteristic (ROC) curves of the prediction models were also compared.
RESULTS: The metrics used in this study indicated that the LR model more accurately predicted pancreatic cancer than the ANN model. For the LR model, the area under the ROC curve in the prediction of pancreatic cancer was 0.727, indicating a good fit.
CONCLUSION: Using this LR model, our results suggested that we could appropriately predict pancreatic cancer risk in patients with T2DM in Taiwan.

Entities: Chemical

Keywords: artificial neural network; logistic regression; pancreatic cancer; type 2 diabetes

Year: 2018 PMID： 30568493 PMCID： PMC6267763 DOI： 10.2147/CMAR.S180791

Source DB: PubMed Journal: Cancer Manag Res ISSN： 1179-1322 Impact factor: 3.989

Study highlights

What is current knowledge? Type 2 diabetes has a higher risk of pancreatic cancer. What is new here? We used logistic and ANN models to predict pancreatic cancer.

Introduction

Pancreatic cancer is one of the most lethal malignancies because its early diagnosis is difficult, and most patients have already progressed to unresectable and incurable statuses at diagnosis.1,2 According to the GLOBOCAN 2012 estimates, pancreatic cancer ranked as the 11th most common cancer and the seventh leading cause of cancer death in both genders globally in 2012.3 In Taiwan, although it is not one of the top 10 cancers, the age-adjusted incidence rate steadily increased from 4.63/100,000 persons in 2005 to 6.23/100,000 persons in 2015.4 Moreover, it was the sixth and eighth leading cause of mortality from cancer among women and men in 2016, respectively.5 Early detection and treatment are vital, considering the relatively poor survival rate compared with its incidence. Identification of the risk factors of pancreatic cancer and regular surveillance of high-risk groups may increase the opportunity of early diagnosis, which could lead to improvements in treatment outcome. Risk factors of pancreatic cancer, namely, smoking, obesity, chronic pancreatitis, unhealthy diet, and heavy alcohol consumption, have been well documented,6–8 but some studies have suggested that patients with type 2 diabetes (T2DM) are also more likely to develop pancreatic cancer.6–12 The exact mechanisms that link this possible association have still not been fully determined. Li11 stated that insulin resistance and associated hyperglycemia, hyperinsulinemia, and inflammation may play a role in the underlying mechanisms, thereby contributing to the development of diabetes-associated pancreatic cancer. In this study, we used data from the National Health Insurance Research Database (NHIRD) of Taiwan and attempted to create a suitable model to help physicians evaluate and predict the risk of development of pancreatic cancer in patients with T2DM. Logistic regression (LR) and artificial neural network (ANN) models have been used to predict medical outcomes.13,14 This study aims to compare the effectiveness of LR and ANN models in predicting the development of pancreatic cancer.

Methods

Data source

The study cohort was selected from the Longitudinal Cohort of Diabetes Patients (LHDB) of the National Health Insurance (NHI) program. The database is anonymized. The LHDB comprises data of 1,700,000 randomly selected newly diagnosed T2DM (ICD-9 code 250.x0 and 250.x2) patients with longitudinally linked data available from 1997 to 2013. Patients who had at least two diagnoses of T2DM within a year were eligible for inclusion in the LHDB. Diseases in the claims data were coded using the ICD, ninth revision, clinical modification (ICD-9-CM). The study was approved by the Research Ethics Committee of China Medical University and Hospital in Taiwan (CMUH104-REC2-115-CR3).

Participants

Patients with newly diagnosed T2DM were identified from the period of 2000 to 2012 from the data set of the LHDB. The first diagnosis date was defined as the index date of T2DM. T2DM patients with a history of pancreatic cancer (ICD-9 code 157) before the index date, aged <20 years, or with incomplete demographic information were excluded.

Comorbidities and medications

The baseline comorbidities considered in this study were acute pancreatitis, chronic pancreatitis, alcohol-related illness, gallstone, cholecystectomy, cirrhosis, COPD, Helicobacter pylori infection, hepatitis B, hepatitis C, hypertension, hyperlipidemia, nephropathy, and obesity. The Charlson comorbidity index (CCI) was also determined for each participant from claims data of outpatient visits or hospitalizations before the index date. The CCI is a scoring system that weighs factors on crucial concomitant diseases; it has been validated for use with the ICD-9-CM-coded administrative database.15,16 We categorized CCI into the following four levels: 0, 1, 2, and 3 or more. To measure the severity of T2DM, we used the adapted Diabetes Complication Severity Index (aDCSI).17 The aDCSI had seven categories, namely, retinopathy, nephropathy, neuropathy, cerebrovascular, cardiovascular, peripheral vascular disease, and metabolic. The progression of diabetes was defined as a yearly increase in aDCSI score from the date of T2DM diagnosis to the end of follow-up. Three progression groups were defined as having a yearly increase in scores less than 0–0.1, 0.1–0.3, and >0.3 per year. Medications that may be associated with pancreatic cancer were also evaluated, including statin and antidiabetic drugs. Antidiabetic drugs included insulin, sulfonylureas, metformin, and thiazolidinediones and other antidiabetic drugs.

Constructing training and data sets

The data comprised 1,358,634 data points, each of which represented one patient. The data were cleaned and one-hot encoded using RStudio. After data cleaning, 22 input features and two output features were obtained. The features included patient’s age, underlying diseases, aDCSI score, and medications. The positive output class represented diagnosis of pancreatic cancer, whereas the negative output class represented no diagnosis. The data were split into training and test sets: 97.5% of the data were used as the training set and 2.5% of the data were used as the test set. Table 1 presents the allocation between the two data sets.

Table 1

Distribution of train and test sets

All patients	Training set	Test set
1,358,634	1,324,669	33,965

Algorithm and training

The average k-fold cross-validation accuracy, with a k-value of 10, was used as the metric to determine the optimal hyperparameters for the prediction models. The LR model used an L2 regularization penalty with primal formulation. The LIBLINEAR algorithm was used for the optimization problem.18 The one-versus-rest scheme was used as the loss function. The LR model was trained for 100 iterations before convergence. The ANN model was a multilayer perceptron deep neural network. The model consisted of an input layer of 22 dimensions, two hidden layers of 22 dimensions, and an output layer of two dimensions. The model was trained using the stochastic gradient descent, with a mini batch size of 1. The model was optimized using Adam with the default parameters outlined by Kingma et al, with a learning rate of 0.01, the β1 value of 0.9, the β2 value of 0.999, and no decay rate.19 The input and hidden layers used a scaled exponential linear unit activation function,20 and the output layer used the Softmax activation function. Dropout of 20% was applied at the input layer and 50% at the output layer.21 The categorical cross entropy function was used as the loss function. The neuron weights were initialized using normalized He initialization.22 The ANN model was trained for 3,600 epochs. Nondiagnosis of pancreatic cancer was prevalent in the output data. The ratio between patients with and without pancreatic cancer was 1:438.40. For the LR and ANN models, each data point in the positive class was weighted as 438.40 times greater than each data point in the negative class to ensure that the output of the prediction was not unbalanced. The software was implemented using Python (version 3.7.0). The LR model was created and trained with the scikit-learn library (version 0.19.1)23 and trained on an Intel Core i5 CPU. The ANN model was created and trained with the Tensorflow framework (version 1.8.0)24 on an NVIDIA® Tesla K80 graphics processing unit through Google Cloud.

Statistical analyses

The baseline characteristics, comorbidities, and medications of the pancreatic cancer group and nonpancreatic cancer group were compared. The Chi-squared test and Student’s t-test were used to test the differences of categorical and continuous variables, respectively. All risk factors outlined in Table 2 were included in the model.

Table 2

Baseline characteristics of T2DM patients with and without pancreatic cancer

Variable	Pancreatic cancer				P-value
	No		Yes
	N=1,355,542		N=3,092
	n	%	n	%

Age group (years)					<0.001
≤49	422,146	31.1	389	12.6
50–64	523,033	38.6	1,197	38.7
65+	410,363	30.3	1,506	48.7
Mean (SD) (years)*	57.3	14.2	63.8	11.4	<0.001
Gender					<0.001
Women	642,176	47.4	1,341	43.4
Men	713,366	52.6	1,751	56.6
Underlying disease
Acute pancreatitis	40,578	2.99	331	10.7	<0.001
Chronic pancreatitis	13,124	0.97	182	5.89	<0.001
Alcohol-related illness	143,856	10.6	307	9.93	0.22
Gallstone	147,231	10.9	596	19.3	<0.001
Cholecystectomy	53,533	3.95	179	5.79	<0.001
Cirrhosis	632,546	46.7	1,681	54.4	<0.001
COPD	383,509	28.3	894	28.9	0.25
Helicobacter pylori infection	21,838	1.61	57	1.84	0.31
Hepatitis B	129,275	9.54	290	9.38	0.77
Hepatitis C	74,671	5.51	162	5.24	0.51
Hypertension	1,001,683	73.9	2,279	73.7	0.81
Hyperlipidemia	912,371	67.3	1,784	57.7	<0.001
Nephropathy	26,796	1.98	40	1.29	0.006
Obesity	71,808	5.30	86	2.78	<0.001
CCI score*					<0.001
0	845,298	62.4	1,774	57.4
1	245,041	18.1	595	19.2
2	129,974	9.59	378	12.2
3 or more	135,231	9.98	345	11.2
Diabetes complication (components of the aDCSI)
Retinopathy	279,890	20.7	487	15.8	<0.001
Nephropathy	489,087	36.1	881	28.5	<0.001
Neuropathy	405,625	29.9	785	25.4	<0.001
Cerebrovascular	248,489	18.3	523	16.9	<0.001
Cardiovascular	686,634	50.7	1,622	52.5	0.045
Peripheral vascular disease	371,646	27.4	658	21.3	<0.001
Metabolic	61,492	4.54	125	4.04	0.19
Change in aDCSI score per year					<0.001
0–0.1	691,408	51.1	1,831	59.2
0.1–0.3	3,733,385	27.6	535	17.3
>0.3	290,749	21.5	726	23.5
Mean aDCSI score (SD)*
Onset	1.44	1.70	1.42	1.62	0.60
End of follow-up	2.62	2.18	2.23	1.99	<0.001
Medications
Statin	716,701	52.9	1,183	38.3	<0.001
Insulin	449,011	33.1	1,044	33.7	0.45
Sulfonylureas	782,389	57.7	1,970	63.7	<0.001
Metformin	868,824	64.1	1,985	64.2	0.90
Other antidiabetic drugs	371,333	27.4	786	25.4	<0.001
TZD	226,441	16.7	471	15.2	<0.001

Notes: Chi-squared test.

t-Test comparing subjects with and without pancreatic cancer.

Abbreviations: aDCSI, adapted Diabetes Complication Severity Index; CCI, Charlson comorbidity index; T2DM, type 2 diabetes; TZD, thiazolidinediones.

We used the weighted average recall (sensitivity), precision (positive predictive value), and F1 (harmonic mean of recall and precision) values to evaluate the predictor performance instead of accuracy due to an unbalanced data distribution.25 The F1, precision, and recall values were calculated for the test set and for all data using the scikit-learn library. In addition, the receiver operating characteristic (ROC) curve was used as a metric to measure predictor performance. The ROC was calculated between the outcome and the predicted probability of outcome by the prediction model. The F1, precision, and recall, and area under the ROC curve were compared between the LR and ANN models. The area under the ROC curve of both prediction models was also compared with the ideal value of 1.26

Results

Demographic features of patients

Overall, 1,358,634 participants were selected for this retrospective cohort study, including 3,092 pancreatic cancer patients and 1,355,542 nonpancreatic cancer patients (Table 2). The age distribution was different in both groups, with the mean age higher in the pancreatic cancer group than in the nonpancreatic cancer group (63.8 [SD=11.4] vs 57.3 [SD=14.2] years). Compared with the nonpancreatic cancer group, the patients with pancreatic cancer had more prevalent comorbidities, including acute pancreatitis, chronic pancreatitis, gallstone, cholecystectomy, and cirrhosis. The proportion of those with a CCI score of 3 or above was 11.2% in the pancreatic cancer group compared with 9.98% in the nonpancreatic cancer group. The major T2DM-related complications (namely retinopathy, nephropathy, neuropathy, cerebrovascular, and peripheral vascular disease) were more prevalent in the nonpancreatic cancer group than in the pancreatic cancer group. The mean aDCSI score was 2.23 (SD =1.99) in the pancreatic cancer group and 2.62 (SD=2.18) in the nonpancreatic cancer group. The mean follow-up periods were 3.84 (SD=3.44) years in the pancreatic cancer group and 6.87 (SD=3.87) years in the nonpancreatic cancer group.

Evaluation of predictor performance

The F1, precision, and recall values of the LR and ANN models across all data are outlined in Table 3. The F1 and recall values of the LR model were greater than all of those of the ANN model, whereas the precision values of the ANN model were greater than those of the LR model across all the data. The weighted k-fold cross-validation accuracies (k=10) of the LR and ANN models were 0.996 and 0.907, respectively.

Table 3

Accuracy analysis of LR and ANN models across all data set

Data set	Model	F₁	Precision	Recall	AUROC	SE of AUC	95% CI of AUC

All data (n=1,358,634)	LR	0.997	0.995	0.998	0.727	0.004	0.718–0.735
All data (n=1,358,634)	ANN	0.871	0.996	0.775	0.605	0.005	0.595–0.615
Training set (n=1,324,669)	LR	0.997	0.995	0.998	0.726	0.004	0.718–0.735
Training set (n=1,324,669)	ANN	0.932	0.996	0.876	0.606	0.005	0.596–0.617
Test set (n=33,965)	LR	0.996	0.995	0.998	0.707	0.029	0.650–0.765
Test set (n=33,965)	ANN	0.930	0.995	0.873	0.642	0.034	0.576–0.708

Abbreviations: ANN, artificial neural network; AUROC, area under the receiver operating characteristic curve; LR, logistic regression; SE, standard error; AUC, area under the curve.

Figures 1 and 2 present the ROC curves of the LR and ANN models, respectively. The area under the ROC curve across all data for the LR and ANN models were 0.727 (95% CI: 0.718–0.735, standard error [SE]: 0.004) and 0.605 (95% CI: 0.595–0.615, SE: 0.05), respectively. The areas under ROC curves of both prediction models were significantly better than the null hypothesis area of 0.5. The area under the ROC curve of the LR model was significantly greater than the area under the ROC curve of the ANN model.

Figure 1

The ROC curve of the LR model.

Note: The AUC across all data for the LR model is 0.727.

Abbreviations: AUC, area under the ROC curve; LR, logistic regression; ROC, receiver operating characteristic.

Figure 2

The ROC curve of the ANN model.

Note: The AUC curve across all data for the ANN model is 0.605.

Abbreviations: ANN, artificial neural network; AUC, area under the ROC curve; ROC, receiver operating characteristic.

Discussion

In this study, we created two models to predict the risk of developing pancreatic cancer among patients with T2DM in Taiwan. The metrics used in this study indicated that the LR model achieved superior results to the ANN model in the prediction of pancreatic cancer. Studies have suggested that patients with T2DM possess an elevated risk of developing pancreatic cancer.6–12 In Taiwan, researchers used a traditional Cox proportional hazard model and the NHIRD to evaluate the pancreatic cancer risk among patients with T2DM and antidiabetic therapies and revealed a positive association;27–30 however, Tseng31 indicated that this relationship was likely due to detection bias and confounders. Based on changes in glucose level, changes in weight, and age at the onset of diabetes, Sharma et al32 developed a model to determine the risk of pancreatic cancer among patients with new-onset diabetes. The current study attempted to use predictive models to evaluate their possible linkage. ANN is a mathematical model imitating the structure and function of a biological neural network and is used to evaluate functions or approximate operations. It is the most commonly used “model” of artificial intelligence and can be used for prediction, forecasting, diagnosis, and decision making.33,34 By using the NHIRD, researchers have revealed that ANN is a suitable model to predict some diseases.34,35 However, our results indicated that the area under the ROC curve across all data for the ANN model was only 0.605; by contrast, the LR model achieved a superior performance in predicting pancreatic cancer in patients with T2DM. Furthermore, the F1 and recall values also indicated that the LR model was superior. LR may be used to predict the risk of developing a given disease as well. The outcome can be binomial, ordinal, or multinomial. Steyerberg et al13 suggested that LR analysis can be used to develop a statistical model for a binary outcome. As most of our variables were categorical, LR was a suitable choice for modeling. However, Tu14 suggested that although neural networks generally have an accurate predictor performance, the performance of prediction models depends upon the characteristics of the data set. One characteristic of the current data set was that its outcome distribution was asymmetric. As a result, the prediction model may have overfitted the data, which could be solved by adding regularization to the model. In the present study, although the LR model outperformed the ANN model in the area under the ROC curve, F1 value, and recall value, only the precision value of the ANN model was higher than that of the LR model. This suggested that the ANN model may have overfitted the data despite dropout regularization and class weighting. Ayer et al36 noted that ANNs are particularly useful when implicit interactions and complex relationships exist in the data, whereas LR models are the superior choice when statistical inferences must be drawn from the outputted data. To our knowledge, this is the first national population-based study that used LR and ANN models to predict the risk of pancreatic cancer in patients with T2DM. This study was based on a nationwide representative sample, which increased its generalizability. In addition, the diagnoses of T2DM and pancreatic cancer were highly reliable because all the NHI claims were scrutinized by medical reimbursement specialists and peer reviewed to prevent errors and overutilization of medical resources. However, several limitations must still be addressed before the results are interpreted. First, we used outpatient and inpatient records of the ICD-9 code diagnosis of pancreatic cancer and defined patients with at least five consensus diagnoses to ensure the validity of diagnosis. We do not have the information in regard to the registry for catastrophic illness patients file and the registry for drug prescriptions file to maximize the accuracy of diagnosis of the pancreatic cancer. However, the reimbursement policy of NHIRD is universal and operated by a single buyer, the government in Taiwan. All the NHI claims were scrutinized by medical reimbursement specialists and peer reviewed to prevent errors and overutilization of medical resources, and medical providers face administrative sanction and high financial penalties if diagnostic claims do not agree with the standard diagnostic criteria used for medical reimbursement. Therefore, the diagnosed validity of “pancreatic cancer” based on ICD-9 codes in this study is highly reliable. In addition, some related studies with the same diagnostic method and criteria by ICD-9 coding were already been published.27,29,30 Second, unlike the traditional Cox proportional hazard model, our predictive models could not provide valued levels (95% CIs and P-values) to assess the overall statistical significance of the predictions made by the prediction models. Instead, we used recall, precision, F1, and area under the ROC as metrics to evaluate the performance of the prediction models. Third, certain health-related behaviors such as smoking and drinking alcohol have been suggested to increase the risk of pancreatic cancer6–8; however, the NHIRD did not contain any information regarding this. Therefore, we cannot confirm if adding these factors might have improved the values of the metrics or the ROC curves for the prediction models. Instead, we used alcohol-related illness to decrease the effect of alcohol in possible associations. Fourth, some undetermined factors, such as family history of pancreatic cancer, diet, and physical exercise, which may be related to pancreatic cancer,7,8 were also unavailable in the NHIRD for data extraction. We could not control these factors in the analyses either. Finally, the prediction models in this study did not take into account time series information, which means that we did not track the progression of particular subjects over time. We only considered individual subject data from 2000 to 2012.

Conclusion

This study compared models for the prediction of pancreatic cancer risk in patients with T2DM. Our analysis indicated that the LR model rather than ANN model provided a more appropriate method for predicting pancreatic cancer in patients with T2DM in Taiwan. Our findings may increase the prognosis of pancreatic cancer through surveillance, early diagnosis, and treatment in people with certain risk factors. Further investigations from other countries are required to determine if our findings are applicable elsewhere.

26 in total

1. Prognostic modelling with logistic regression analysis: a comparison of selection and estimation methods in small data sets.

Authors: E W Steyerberg; M J Eijkemans; F E Harrell; J D Habbema
Journal: Stat Med Date: 2000-04-30 Impact factor: 2.373

2. Diabetes and pancreatic cancer.

Authors: Donghui Li
Journal: Mol Carcinog Date: 2012-01 Impact factor: 4.784

3. Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases.

Authors: R A Deyo; D C Cherkin; M A Ciol
Journal: J Clin Epidemiol Date: 1992-06 Impact factor: 6.437

4. Diabetes mellitus correlates with increased risk of pancreatic cancer: a population-based cohort study in Taiwan.

Authors: Kuan-Fu Liao; Shih-Wei Lai; Chia-Ing Li; Wen-Chi Chen
Journal: J Gastroenterol Hepatol Date: 2012-04 Impact factor: 4.029

5. Risk factors for pancreatic cancer: case-control study.

Authors: Manal M Hassan; Melissa L Bondy; Robert A Wolff; James L Abbruzzese; Jean-Nicolas Vauthey; Peter W Pisters; Douglas B Evans; Rabia Khan; Ta-Hsu Chou; Renato Lenzi; Li Jiao; Donghui Li
Journal: Am J Gastroenterol Date: 2007-08-31 Impact factor: 10.864

6. Informatics in radiology: comparison of logistic regression and artificial neural network models in breast cancer risk estimation.

Authors: Turgay Ayer; Jagpreet Chhatwal; Oguzhan Alagoz; Charles E Kahn; Ryan W Woods; Elizabeth S Burnside
Journal: Radiographics Date: 2009-11-09 Impact factor: 5.333

Review 7. Present treatment and future expectations in advanced pancreatic cancer.

Authors: George P Stathopoulos; Nikos Androulakis; John Souglakos; John Stathopoulos; Vassilis Georgoulias
Journal: Anticancer Res Date: 2008 Mar-Apr Impact factor: 2.480

8. Diabetes complications severity index and risk of mortality, hospitalization, and healthcare utilization.

Authors: Bessie Ann Young; Elizabeth Lin; Michael Von Korff; Greg Simon; Paul Ciechanowski; Evette J Ludman; Siobhan Everson-Stewart; Leslie Kinder; Malia Oliver; Edward J Boyko; Wayne J Katon
Journal: Am J Manag Care Date: 2008-01 Impact factor: 2.229

9. Artificial neural networks for diagnosis and survival prediction in colon cancer.

Authors: Farid E Ahmed
Journal: Mol Cancer Date: 2005-08-06 Impact factor: 27.401

10. Suicides among Danish cancer patients 1971-1999.

Authors: U Yousaf; M-Lm Christensen; G Engholm; H H Storm
Journal: Br J Cancer Date: 2005-03-28 Impact factor: 7.640

14 in total

Review 1. Early detection of pancreatic cancer.

Authors: Stephen P Pereira; Lucy Oldfield; Alexander Ney; Phil A Hart; Margaret G Keane; Stephen J Pandol; Debiao Li; William Greenhalf; Christie Y Jeon; Eugene J Koay; Christopher V Almario; Christopher Halloran; Anne Marie Lennon; Eithne Costello
Journal: Lancet Gastroenterol Hepatol Date: 2020-03-02

2. Representation of EHR data for predictive modeling: a comparison between UMLS and other terminologies.

Authors: Laila Rasmy; Firat Tiryaki; Yujia Zhou; Yang Xiang; Cui Tao; Hua Xu; Degui Zhi
Journal: J Am Med Inform Assoc Date: 2020-10-01 Impact factor: 4.497

3. An Artificial Neural Network Approach to Predicting Stroke in Postmenopausal Women.

Authors: Hyejin Park; Kisok Kim
Journal: Iran J Public Health Date: 2022-04 Impact factor: 1.479

4. Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults.

Authors: Anita L Lynam; John M Dennis; Katharine R Owen; Richard A Oram; Angus G Jones; Beverley M Shields; Lauric A Ferrat
Journal: Diagn Progn Res Date: 2020-06-04

5. PDAC-ANN: an artificial neural network to predict pancreatic ductal adenocarcinoma based on gene expression.

Authors: Palloma Porto Almeida; Cristina Padre Cardoso; Leandro Martins de Freitas
Journal: BMC Cancer Date: 2020-01-31 Impact factor: 4.430

6. Pancreatic Cancer Prediction Through an Artificial Neural Network.

Authors: Wazir Muhammad; Gregory R Hart; Bradley Nartowt; James J Farrell; Kimberly Johung; Ying Liang; Jun Deng
Journal: Front Artif Intell Date: 2019-05-03

7. A data mining based clinical decision support system for survival in lung cancer.

Authors: Beatriz Pontes; Francisco Núñez; Cristina Rubio; Alberto Moreno; Isabel Nepomuceno; Jesús Moreno; Jon Cacicedo; Juan Manuel Praena-Fernandez; German Antonio Escobar Rodriguez; Carlos Parra; Blas David Delgado León; Eleonor Rivin Del Campo; Felipe Couñago; Jose Riquelme; Jose Luis Lopez Guerra
Journal: Rep Pract Oncol Radiother Date: 2021-12-30

8. Risk Prediction of Barrett's Esophagus in a Taiwanese Health Examination Center Based on Regression Models.

Authors: Po-Hsiang Lin; Jer-Guang Hsieh; Hsien-Chung Yu; Jyh-Horng Jeng; Chiao-Lin Hsu; Chien-Hua Chen; Pin-Chieh Wu
Journal: Int J Environ Res Public Health Date: 2021-05-17 Impact factor: 3.390

9. Nomogram for Predicting Risk of Digestive Carcinoma Among Patients with Type 2 Diabetes.

Authors: Lu-Huai Feng; Kun-Peng Bu; Shuang Ren; Zhenhua Yang; Bi-Xun Li; Cheng-En Deng
Journal: Diabetes Metab Syndr Obes Date: 2020-05-21 Impact factor: 3.168

10. Low ambient temperatures correlate with increased risk of hypoglycemia in patients with type 2 diabetes: An ecological study in Taiwan.

Authors: Shih-Wei Lai; Wan-Chi Chang; Cheng-Li Lin; I-Ching Chou; Fuu-Jen Tsai; Yen-Jen Lai
Journal: Medicine (Baltimore) Date: 2020-02 Impact factor: 1.817