Literature DB >> 33193628

The Key Genes for Perineural Invasion in Pancreatic Ductal Adenocarcinoma Identified With Monte-Carlo Feature Selection Method.

Jin-Hui Zhu1, Qiu-Liang Yan2, Jian-Wei Wang3, Yan Chen1, Qing-Huang Ye1, Zhi-Jiang Wang1, Tao Huang4.   

Abstract

BACKGROUND: Pancreatic ductal adenocarcinoma (PDAC) is the most aggressive form of pancreatic cancer. Its 5-year survival rate is only 3-5%. Perineural invasion (PNI) is a process of cancer cells invading the surrounding nerves and perineural spaces. It is considered to be associated with the poor prognosis of PDAC. About 90% of pancreatic cancer patients have PNI. The high incidence of PNI in pancreatic cancer limits radical resection and promotes local recurrence, which negatively affects life quality and survival time of the patients with pancreatic cancer.
OBJECTIVES: To investigate the mechanism of PNI in pancreatic cancer, we analyzed the gene expression profiles of tumors and adjacent tissues from 50 PDAC patients which included 28 patients with perineural invasion and 22 patients without perineural invasion.
METHOD: Using Monte-Carlo feature selection and Incremental Feature Selection (IFS) method, we identified 26 key features within which 15 features were from tumor tissues and 11 features were from adjacent tissues.
RESULTS: Our results suggested that not only the tumor tissue, but also the adjacent tissue, was informative for perineural invasion prediction. The SVM classifier based on these 26 key features can predict perineural invasion accurately, with a high accuracy of 0.94 evaluated with leave-one-out cross validation (LOOCV).
CONCLUSION: The in-depth biological analysis of key feature genes, such as TNFRSF14, XPO1, and ATF3, shed light on the understanding of perineural invasion in pancreatic ductal adenocarcinoma.
Copyright © 2020 Zhu, Yan, Wang, Chen, Ye, Wang and Huang.

Entities:  

Keywords:  Monte-Carlo feature selection; incremental feature selection; pancreatic ductal adenocarcinoma; perineural invasion; support vector machine

Year:  2020        PMID: 33193628      PMCID: PMC7593847          DOI: 10.3389/fgene.2020.554502

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Pancreatic cancer is a type of common malignant tumor of the digestive tract, the most aggressive form of which is pancreatic ductal adenocarcinoma (PDAC), which has a 5-year survival rate of only 3–5% (Huang et al., 2014). The poor prognosis of PDAC is largely due to the lack of early symptoms, explosive outcomes, and resistance to treatment (Pour et al., 2003).Currently, there is no effective method to detect pancreatic cancer in its early stages. However, with the increasing insight into the mechanism of this cancer over time, novel therapies are being researched and developed (Rossi et al., 2014). Pancreatic cancer has poor responses to conventional therapies, such as chemotherapy and irradiation (Rossi et al., 2014). Although surgery has been indicated to be an effective therapeutic approach to eliminate cancer cells, 70–81% of patients are rendered unresectable because of locally advanced disease or distant metastatic lesions (White et al., 2001; Mossner, 2010; Cai et al., 2013) and most patients who have undergone surgery experience recurrence and comorbidities (Pour et al., 2003). In the last few decades, Gemcitabine has been the preferred treatment option for PDAC. However, recent studies suggested that FOLFIRINOX (a regimen combining fluorouracil, leucovorin, oxaliplatin, and irinotecan) has shown a significant therapeutic advantage in patients with advanced PDAC (Kleger et al., 2014; Ferrone et al., 2015). In addition, the curative effect of oral fluorouracil in Asian patients with PDAC has been proven (Cid-Arregui and Juarez, 2015). Most studies have focused on biomarkers to predict the progression or recurrence of PDAC. It has been reported that about 90% of the later stage pancreatic cancers have point mutations of KRAS, indicating that KRAS may be used as a diagnostic marker of PDAC (Campbell et al., 2007; De Oliveira et al., 2012; Zhang et al., 2014). SLIT2-ROBO signaling in PDAC has also been reported to enhance metastasis and predispose PDAC cells to neural invasion (Gohrig et al., 2014). There have also been some important and highly penetrative genes identified, such as CEACAM1, MCU, VDAC1, PKM2, CYCS, C15ORF52, TMEM51, LARP1, and ERLIN2 (Calabretta et al., 2016; Giulietti et al., 2016). Although many biomarkers have now been shown to be associated with PDAC, their effectiveness in the early detection of cancer still require verification. Perineural invasion (PNI) is a process in which cancer cells invade the surrounding nerves and perineural spaces (Ceyhan et al., 2008), which is associated with recurrence (Dai et al., 2007; Gil et al., 2010) and poor outcome (Bapat et al., 2011). PNI also contributes to the severe pain syndrome in patients with advanced PDAC (Zhu et al., 1999; Esposito et al., 2008). It is estimated that the incidence of PNI reaches up to 90% in pancreatic cancer (Nakao et al., 1996). The high incidence of PNI in pancreatic cancer limits radical resection and promotes local recurrence, which negatively affects life quality and survival time of the patients with pancreatic cancer (Hirai et al., 2002). Among the factors influencing the prognosis of pancreatic cancer, PNI has gradually become an independent prognostic factor and pathological feature. Therefore, further studies are urgently needed to investigate the mechanism of PNI in pancreatic cancer, thus providing a theoretical basis for the treatment of pancreatic cancer. Adjacent tissues are important parts of a tumor microenvironment, and Existing studies have taken adjacent tissues as normal tissues for control to study the difference between cancer tissues and normal tissues. However, present studies have indicated that there will still be some physiological changes in adjacent tissues affected by the tumor tissues of patients (Casbas-Hernandez et al., 2015; Yamakawa et al., 2019). A number of studies have included adjacent tissues in cancer research, and researchers have found that adjacent tissues can also serve as a marker of tumor prognosis (Lee et al., 2019). In this study, PNI was studied in combination with the differences between tumor tissues and adjacent tissues of patients to find prognostic biomarkers affecting PNI. In this work, we analyzed the gene expression profiles of 28 pancreatic ductal adenocarcinoma patients with perineural invasion and 22 pancreatic ductal adenocarcinoma patients without perineural invasion. Both tumor and adjacent tissues were profiled. With Monte-Carlo feature selection and Incremental Feature Selection (IFS) method, 26 key features were identified. Interestingly, 15 of them were from tumor tissues but the other 11 features were from adjacent tissues. Our results proved that the microenvironment of the tumor is important for perineural invasion. Based on these 26 key features, a Support Vector Machine (SVM) predictor was constructed and its accuracy, evaluated with Leave-One-Out Cross Validation (LOOCV), was 0.94, which needs to be validated in another independent large dataset. But many key feature genes, such as TNFRSF14, XPO1, and ATF3, showed great promise on explaining perineural invasion in pancreatic ductal adenocarcinoma.

Materials and Methods

Datasets

We downloaded the gene expression profiles of tumors and adjacent tissues in 50 pancreatic ductal adenocarcinoma patients from GEO (Gene Expression Omnibus) under accession number GSE102238 (Yang et al., 2020). In this dataset, 28 patients had perineural invasion while 22 patients had an absence of perineural invasion. Each patient had both tumor and adjacent samples. The gene expression levels were profiled with 25,492 probes from the Agilent-052909 CBC_lncRNAmRNA_V3 platform, which included both lncRNAs and mRNAs. To systematically compare the difference between pancreatic ductal adenocarcinoma patients with perineural invasion and pancreatic ductal adenocarcinoma patients without perineural invasion, we combined the gene expression levels from tumor samples and adjacent samples for each patient. Therefore, there were 25,492 × 2 = 50,984 gene expression features. Our goal was to identify the key genes from either tumor or adjacent samples that could discriminate the patients with perineural invasion and without perineural invasion.

Identification of Key Genes Using Monte-Carlo Feature Selection

As we can see, the feature number was much greater than the sample size. If we directly used so many features to classify the patients, it would obviously overfit. To partially solve this problem, we adopted the Monte-Carlo feature selection method (Draminski et al., 2008) to rank the features. The Monte-Carlo feature selection method randomly chooses a number of features and then constructs a number of tree classifiers (Chen et al., 2018a; Pan et al., 2018; Wang et al., 2018). Based on these tree classifiers, it assigns each feature an importance value. If a feature is selected by many tree classifiers, it is more important than others. Let us formulate the algorithm formally. Suppose d is the total number of features, we randomly select m features (m≪d) for s times and construct t trees for each of the s subsets. At last, s⋅t classification trees will be constructed. A feature g’s relative importance (RI) can be reflected by how many times it is used to set a decision rule by the s⋅t trees and how much it contributes to the classification of the s⋅t trees, and is calculated with the equation below: where wAcc is the weighted classification accuracy of decision treeτ, IG(n(τ) is the information gain of node ng(τ), no.inn(τ) is the number of samples under node ng(τ), no.inτ is the number of samples in decision tree and τ, u and v are parameters. To be more specific, wAcc is defined as follows: where c is the number of classes (it is 2 in this study) and nij is the number of samples from class i that are classified as class j(i,j=1,2,…,c) The features were ranked based on their RI values from large to small as F where N is the total number of features (50,984 for this study).

Construction of SVM Predictor for Perineural Invasion

Although all features were ranked using Monte-Carlo feature selection, it was not clear how many top features should be selected to construct a final predictor for perineural invasion. To choose the final key features for the predictor, we adopted the Incremental Feature Selection (IFS) method (Wang et al., 2017; Zhang et al., 2017; Chen et al., 2018b,c; Li et al., 2018) to optimize the key features and their predictor. We tested 500 different feature sets (F1,F2,…,F500), whereF = [f1,f2, …,f]. In other words, feature set F contains the top i features in F from equation (2). For each feature set, we constructed a support vector machine (SVM) predictor. Based on the number of features and their accuracy, we can balance the model complexity and performance and choose the final key features and optimized model. In this study, the SVM predictor was constructed using R function svm from package e1017 and leave-one-out cross validation (LOOCV) was used to evaluate the performance of the SVM predictor.

Results and Discussion

The Top Discriminative Genes Between Patients Were With Perineural Invasion and Without Perineural Invasion

The gene expression profiles in the tumor and adjacent tissues can represent the difference between pancreatic ductal adenocarcinoma patients with perineural invasion and without perineural invasion. The gene expression in the tumor directly shows the activity of pancreatic ductal adenocarcinoma while the gene expression in the adjacent tissue reflect the microenvironment of the tumor. Therefore, we combined the gene expression profiles in tumors and in adjacent tissues for each patient and compared the combined profiles between pancreatic ductal adenocarcinoma patients with perineural invasion and without perineural invasion using Monte-Carlo feature selection. Based on the RI values, which represented how well a gene feature can classify the two groups of patients, we ranked all the features and further analyzed the top 500 discriminative genes.

The Final Key Features and SVM Predictor for Perineural Invasion

With IFS method (Chen et al., 2017a,b,c,d; Li and Huang, 2017; Liu et al., 2017), we evaluated the prediction accuracy of different feature sets and plotted the IFS curve in which the X-axis was the number of features and the Y-axis was their prediction accuracy evaluated with LOOCV. The IFS curve was shown in Figure 1. It can be seen that when 175 genes were used, the accuracy was the highest, at 0.96. But when only 26 genes were used, the accuracy became 0.94. Balancing both model complexity and performance, we chose the 26 genes as the final key features and their SVM predictor as the optimized predictor for perineural invasion. The 26 key features were given in Table 1. With the 26 key features, 15 features were from tumor tissues while 11 features were from adjacent tissues. These results suggested that not only the tumor tissue, but also the adjacent tissue, was informative for perineural invasion prediction.
FIGURE 1

The IFS curve for final key feature selection. The X-axis was the number of features. The Y-axis was their prediction accuracy evaluated with LOOCV. When 175 genes were used, the accuracy was the highest, at 0.96. But when only 26 genes were used, the accuracy became 0.94. Balancing both model complexity and performance, we chose the 26 genes as the final key features and their SVM predictor as the optimized predictor for perineural invasion.

TABLE 1

The 26 key discriminative features between patients with perineural invasion and without perineural invasion.

RankTissueProbeRIProbe sequenceblastn Scoreblastn IdentitiesChromosome positionGene symbol
1Adjacentp126010.0518GGCTGAAGGAGAACTTCAATATCATATATTTTAAA GGTTGACTCACAGTTTGGAACAAGA111bits (60)60/60 (100%)chr4:3545147-3545206AL590235.1
2Tumorp186020.0395CTTGCCATTTAGCCACTGTTACTGATAATTTGGAT GGAAAAGAAAAATAAATTATATCAC111bits (60)60/60 (100%)chr12:25944921-25944862RASSF8-AS1
3TumorA_24_P888010.0370CAGACGGAGTTCATGAACCTTTTGACCTTTCAG AGCAGACCTATGACTTCTTGGGTGAAA111bits (60)60/60 (100%)chr2:110123870-110123811NPHP1
4TumorA_33_P33291280.0352GGTGGCTGGACAATCATTTGCTAAAGATATT AGGCATGTGACTCATGAATCCAATCAATCNo significant similarity found
5AdjacentA_21_P00105060.0271TTAGGCCAAGTGTGGAGAAATCAATGATGT TGACGATGAGGCTCCCTGAGAGAAATCACA111bits (60)60/60 (100%)chr1:2565049-2565108TNFRSF14
6AdjacentA_24_P9417590.0243ATGCTTCAAGTAATGCAATACAAAACATA ACCCATTTAATGATGAATTACTTGAATGTAT111bits (60)60/60 (100%)chr14:30619533-30619592G2E3
7Adjacentp148430.0238AGTGACTGATTTGAAACCAGTTGTACCGTA TTATTAGGAAAGGCGCCTTGATGAAAAGAT80.5 bits (43)43/43 (100%)chr7:19813727-19813685AC004543.1
8Adjacentp286940.0238CATTTACTGGGCCTGAAATGGGAAAATGAA AGATGTGGCAAGAAACTGACAAGGGCCCAA111bits (60)60/60 (100%)chr1:1660100-1660159FO704657.1/SLC35E2B
9Tumorp46840.0236AAGGTGTTGAAGCATAGACGCTGGAACATAAAATG ACTCATGATCTCACTGGGAGAAGGG111bits (60)60/60 (100%)chr14:101635450-101635391LINC02320
10AdjacentA_23_P1700880.0214ATCCTTTCTGTGCTGCTTTAGGCATCTGCC CTTACGTGGTTCGTGTCCAGCTCTGTCAAC106 bits (57)59/60 (98%)chr9:137392642-137392583EXD3
11TumorA_23_P3983720.0198TCAGGTAGAGAAAGCAAAAAATCTCTGGCCGTAA ACCGTGCTCTCTAATTTATCGGCAGC111bits (60)60/60 (100%)chr9:136115009-136114950TMEM250
12TumorA_24_P2776730.0191CAAGGTACTGAGCGATAATATTCAGGGCATTACC AAGTGCACTATCCGGCGCTTGGCCCG111bits (60)60/60 (100%)chr6:26246918-26246859HIST1H4G
13TumorA_19_P008035750.0190AAGCAGCTTGTATAATTCCAACTGGTGTTT CATTTCTGTTCTAATGCTAAGTGGTAACGC111bits (60)60/60 (100%)chr17:17192194-17192253MPRIP
14TumorA_33_P38029660.0190GAGGGGGTTAACATAACGCGGACCGATCCCA AATGGCATTGATGAGTGTACCTCCCACGANo significant similarity found
15AdjacentA_32_P2071240.0166CCCTGCCCTTCCCATCTTAGGGTGTCGTCT GAGACAGACTCTTATTCCCTCAATAAAGAG111bits (60)60/60 (100%)chrX:120932228-120932169CT47A12
16TumorA_23_P4101280.0165AAACCAACAAATAAAAGCATGATAAATTGACT ATATCAAAATTTAAAACTTCTCTATGAC111bits (60)60/60 (100%)chr22:42027707-42027766WBP2NL
17Adjacentp284850.0157AGTCTCAGGGCTAGACGTATTCCAAATATTTGGAT AATTCAAAGTAATTTGCACAGACAT111bits (60)60/60 (100%)chr7:28738207-28738148CREB5
18Tumorp132890.0155CTAGGGTGCTCTATGCTGTGATGCTATCAAATC TTCATGGATTTTTCCAGGATCCTCAAA111bits (60)60/60 (100%)chr5:92432262-92432203AC114316.1/AC124854.1
19TumorA_33_P33493340.0149CAGGCTTGTATGATCTATTCCTTACCACAAAAG AAGTAGACAATTGCCACTTTTATTTCT111bits (60)60/60 (100%)chr14:55049328-55049387SOCS4
20AdjacentA_23_P400780.0146GGGTATTTGTCGACCAAAATGATGCCAATTTG TAAATTAAAATGTCACCTAGTGGCCCTT111bits (60)60/60 (100%)chr2:61478760-61478701AC016727.1/XPO1
21AdjacentRNA143544| tRNA_461_680.0145AGAAAAACCATTTCATAACTTTGTCAAAGTTAAA TTATAGGCTAAATCCTATATACCTTA106 bits (57)59/60 (98%)chrM:7526-7585MT-TD
22TumorA_33_P32598170.0142AGAAATGAAAGCCAACTACAGGGAAATGGC GAAGGAGCTTTCTGAAATCATGCATGAGCA111bits (60)60/60 (100%)chr13:98797175-98797116DOCK9
23TumorRNA95815| RNS_897_1090.0141TAAAAGGAGAAAGGGAGGGGCCTTGTGAGGTG AAGGGTGTCCTTATACAGGTGTGACAGC111bits (60)60/60 (100%)chr20:56660085-56660144
24AdjacentA_24_P338950.0139TCCAGAAGATGAGAGAAACCTCTTTATCCAACA GATAAAAGAAGGAACATTGCAGAGCTA111bits (60)60/60 (100%)chr1:212619495-212619554ATF3
25Adjacentp291860.0138AGGCGGGGGATGCTGTGTGGCACCTCCTATTG TCTCTTTTTGCGTTTTCTCCCATTCTCG111bits (60)60/60 (100%)chr1:19188401-19188460UBR4
26TumorA_33_P33517850.0137GCATCAAAATCAACAAAAAACCAGAATATAGTC CCAAAAGAGAAATCCACCAAGTACCAT111bits (60)60/60 (100%)chr20:8871665-8871724PLCB1
The IFS curve for final key feature selection. The X-axis was the number of features. The Y-axis was their prediction accuracy evaluated with LOOCV. When 175 genes were used, the accuracy was the highest, at 0.96. But when only 26 genes were used, the accuracy became 0.94. Balancing both model complexity and performance, we chose the 26 genes as the final key features and their SVM predictor as the optimized predictor for perineural invasion. The 26 key discriminative features between patients with perineural invasion and without perineural invasion.

Compare the SVM Predictor With Other Classification Methods

To compare the SVM predictor with other classification methods, we tried three other classification algorithms: decision tree (R function rpart from package rpart), nearest neighbor (R function knn with k = 1 from package class), and naïve Bayes (R function naiveBayes from package e1071). The highest accuracies of decision tree, nearest neighbor, naïve Bayes were 0.76 with 24 features, 0.94 with 44 features, and 0.94 with 185 features, respectively. Their performances were worse than SVM and required more features.

Compare the Monte-Carlo Feature Selection With Other Seven Feature Selection Methods

There have been many feature selection methods. Each has its assumption and application scenario. Therefore, we compared the Monte-Carlo feature selection results with seven other feature selection methods in Weka (Frank et al., 2016): chi-squared statistic (ChiSquared), correlation (Correlation), gain ratio (GainRatio), information gain (InfoGain), OneR classifier (OneR), ReliefF (ReliefF), and symmetrical uncertainty (SymmetricalUncert). The default parameters in Weka were used for the seven feature selection methods. We checked the ranks of the 26 key features selected by the Monte-Carlo method in the other seven feature selection methods. Their ranks were listed in Table 2. It can be seen that most of the features ranked on top with other methods as well. The first feature by Monte-Carlo ranked fourth by OneR, the third feature ranked second by GainRatio, the fourth feature ranked fourth by GainRatio, the fifth feature ranked fifth by SymmetricalUncert, the sixth feature ranked fourth by ReliefF, the seventh feature ranked eight by Correlation, and the eighth feature ranked third by SymmetricalUncert.
TABLE 2

Ranking of the 26 key features selected by the Monte-Carlo method in the other seven feature selection methods.

Monte CarloBest Rank in other seven methodsChiSquaredCorrelationGainRatioInfoGainOneRReliefFSymmetricalUncert
1OneR 4612511242115
2ChiSquared 14144952249430630
3GainRatio 2181121216267119
4GainRatio 4205014152272208
5SymmetricalUncert 51338131052716025
6ReliefF 4264316839613467
7Correlation 81081351918398029
8SymmetricalUncert 3117412963061213
9ChiSquared 25254717038546619469
10Correlation 54051404876054572
11GainRatio 1191205118106711
12GainRatio 2912316972911411583854346
13Correlation 1311171373314
14Correlation 6456931103716022
15ChiSquared 1, InfoGain 1, SymmetricalUncert 11891111371
16SymmetricalUncert 412239141140301284
17ReliefF 381002787238155152838269
18ChiSquared 535310641757366572192
19GainRatio 317193531760911510
20ChiSquared 222258140634409433597
21ChiSquared 1515359325126232548966
22ChiSquared 16162125025455665
23ChiSquared 414130313950373932970
24ChiSquared 626228824041201972038259
25ReliefF 42910141129021014110141654942910141
26ReliefF 527118014381421187452288
Ranking of the 26 key features selected by the Monte-Carlo method in the other seven feature selection methods. Similarly, for the seven methods, the top 500 ranked genes were further evaluated with IFS and their accuracies were used to represent how different they were between two groups of samples. The IFS results of the seven feature selection methods in Weka was shown in Figure 2. It can be seen that the peak LOOCV SVM accuracies of ChiSquared, Correlation, GainRatio, InfoGain, OneR, ReliefF, and SymmetricalUncert were 0.88, 0.94, 0.90, 0.88, 0.76, 0.92, and 0.88, respectively. They were all smaller than the highest accuracy of Monte-Carlo feature selection, which was 0.96.
FIGURE 2

The IFS curves of seven other feature selection methods from Weka. (A) The IFS curve of ChiSquaredAttributeEval; (B) The IFS curve of CorrelationAttributeEval; (C) The IFS curve of GainRatioAttributeEval; (D) The IFS curve of InfoGainAttributeEval; (E) The IFS curve of OneRAttributeEval; (F) The IFS curve of ReliefFAttributeEval; (G) The IFS curve of SymmetricalUncertAttributeEval. The IFS curves of seven other feature selection methods from Weka were plotted. Their peak accuracies were 0.88, 0.94, 0.90, 0.88, 0.76, 0.92, and 0.88, all smaller than the highest accuracy of Monte-Carlo feature selection, which was 0.96.

The IFS curves of seven other feature selection methods from Weka. (A) The IFS curve of ChiSquaredAttributeEval; (B) The IFS curve of CorrelationAttributeEval; (C) The IFS curve of GainRatioAttributeEval; (D) The IFS curve of InfoGainAttributeEval; (E) The IFS curve of OneRAttributeEval; (F) The IFS curve of ReliefFAttributeEval; (G) The IFS curve of SymmetricalUncertAttributeEval. The IFS curves of seven other feature selection methods from Weka were plotted. Their peak accuracies were 0.88, 0.94, 0.90, 0.88, 0.76, 0.92, and 0.88, all smaller than the highest accuracy of Monte-Carlo feature selection, which was 0.96. We compared the best Monte-Carlo genes with the best genes selected by the other seven methods in Weka using R package SuperExactTest[1] (Wang et al., 2015). The number of overlapped genes between Monte-Carlo and SymmetricalUncert, ReliefF, OneR, InfoGain, GainRatio, Correlation, and ChiSquared were 10, 10, 4, 11, 13, 5, and 12, respectively. The enrichment p values between Monte-Carlo and SymmetricalUncert, ReliefF, OneR, InfoGain, GainRatio, Correlation, and ChiSquared were 4.88E-31, 3.41E-22, 1.78E-08, 3.85E-33, 7.46E-31, 4.57E-14, and 4.40E-36, respectively. The Monte-Carlo selected genes were most like the ChiSquared selected genes and most unlike the OneR selected genes.

The Biological Functions of Key Genes for Perineural Invasion

The probes of Agilent-052909 CBC_lncRNAmRNA_V3 microarray were poorly annotated. We mapped the probe sequence onto the human genome using blastn[2] with default parameters against Genome (GRCh38.p12 reference, Annotation Release 109) and identified the best match genes for these probes. The biological functions of the 15 genes from tumor tissues, the 11 genes from adjacent tissues, and the combined 26 genes were analyzed using GATHER[3]. The enrichment results were shown in Table 3. For tumor signature genes, they were significantly enriched onto GO:0016043: cell organization and biogenesis and GO:0006996: organelle organization and biogenesis with a p value of 0.0004 and 0.006, respectively. For adjacent genes, TNFRSF14 was involved in hsa04060: Cytokine-cytokine receptor interaction. DOCK9, NPHP1, and SOCS4 from tumors and CREB5 and XPO1 from adjacent tissues were all targets of transcription factor NF-κB.
TABLE 3

The enriched functions of the 15 genes from tumor tissues, the 11 genes from adjacent tissues, and the combined 26 genes using GATHER.

Function# GenesGenesp valueBayes factor
The 15 genes from tumorGene OntologyGO:0016043: cell organization and biogenesis3HIST1H4G NPHP1 SOCS40.00044
GO:0006996: organelle organization and biogenesis2HIST1H4G NPHP10.0062
V$HEN1_02: HEN12NPHP1 SOCS40.00034
TRANSFACV$NFKB_Q6: NF-kappaB3DOCK9 NPHP1 SOCS40.0023
V$HNF1_Q62NPHP1 SOCS40.0042
The 11 genes from adjacentKEGG Pathwayhsa04060: Cytokine-cytokine receptor interaction1TNFRSF140.0022
TRANSFACV$USF_01: upstream stimulating factor4ATF3 CREB5 TNFRSF14 XPO1<0.00017
V$MYC_Q24ATF3 CREB5 TNFRSF14 XPO10.00025
V$NMYC_01: N-Myc4ATF3 CREB5 TNFRSF14 XPO10.00044
V$MYCMAX_02: c-Myc:Max heterodimer4ATF3 CREB5 TNFRSF14 XPO10.0013
V$AML_Q63ATF3 CREB5 TNFRSF140.0022
V$SRF_Q6: serum response factor2ATF3 CREB50.0032
Combined 26 genesTRANSFACV$NFKB_Q6: NF-kappaB5CREB5 DOCK9 NPHP1 SOCS4 XPO10.00083
The enriched functions of the 15 genes from tumor tissues, the 11 genes from adjacent tissues, and the combined 26 genes using GATHER. Bockman DE et al. found that a large number of molecules, such as LIF (Bressy et al., 2018), CCL2–CCR2 (Jessen and Mirsky, 2016), and NCAM (Deborde et al., 2016), were involved in PNI by studying the paracrine mechanism of signal transduction between nerves and cancer cells (Wang et al., 2014). For instance, cellular adhesion molecules LICAM mediates the homologous interaction between the tumor and nerves and increases PNI to promote the development of cancer (Ben et al., 2014; Lund et al., 2015). According to Giulia Gasparini et al., nerve growth factor (NGF) may be involved in the migration of glial cells in PNI. The results suggested that high levels of NGF and its affinity receptor TrKA were associated with the frequency of occurrence and severity of PNI, as well as decreased survival time and increased pain in patients with PDAC (Barbacid, 1995; Demir et al., 2010; Wang et al., 2014; Gasparini et al., 2019). The importance of GDNF-RET signal transduction in PDAC nerve invasion has been emphasized in many studies (Gil et al., 2010; Demir et al., 2012). Demir et al. have shown that in PDAC, soluble GFRα1 released by nerves can promote the binding of neural GDNF and RET in pancreatic adenocarcinoma, thus enhancing PNI (He et al., 2014; Mulligan, 2018). The synthesis, secretion, and transport of these cytokines are carried out by organelle organization, such as ribosomes and endoplasmic reticulum (Alrawashdeh et al., 2019). This evidence supports our findings that there is a close relationship between GO:0006996 (organelle organization), hsa04060 (Cytokine-cytokine receptor interaction), and PDAC PNI. In addition, some studies have shown that the activation of the NF- κB signaling pathway affects a wide range of biological processes, including immunity, inflammation, stress response, B cell development, and lymphoid organogenesis (Yu et al., 2017; Balaji et al., 2018), while PNI in PDAC is associated with lymph node metastasis (Chatterjee et al., 2012). We investigated their clinical relevance with the survival of 117 pancreatic ductal adenocarcinoma patients from Kaplan Meier-plotter[4] (Nagy et al., 2018). 17 genes were included in the database. 11 of them (NPHP1, WBP2NL, EXD3, G2E3, DOCK9, CT47A12, TMEM250, PLCB1, XPO1, HIST1H4G, and SLC35E2B) were significant with a p value smaller than 0.05 and one (ATF3) was marginally significant with a p value of 0.074. The Kaplan Meier plot of these 12 survival-associated genes were shown in Supplementary Figure 1. To select the most possible key genes, we constructed the network of the 26 genes using STRING database[5] version 11.0 (Szklarczyk et al., 2018). The network of the identified genes was shown in Figure 3. It can be seen that six genes from tumors were mapped onto the network and they can be grouped into three categories: (1) the XPO1, UBR4, EXD3 cluster in which XPO1 was the hub gene with degree of six (RAN, RANGAP1, CDC42, JUN, UBR4, EXD3); (2) the ATF3, CREB5 cluster in which ATF3 was the hub gene with degree of five (JUN, ATF4, CDC42, AHI1, CREB5); and (3) the TNFRSF14 cluster in which the degree of TNFRSF14 was three (JUN, BTLA, TNFSF14). Therefore, the three genes (XPO1, ATF3, and TNFRSF14) from tumors were the hubs.
FIGURE 3

The STRING network of the 26 genes. The 26 genes were mapped onto the STRING network (https://string-db.org/). The light-yellow nodes were genes from adjacent tissues while the light-blue nodes were genes from tumor tissues. The genes from tumors can be grouped into three clusters and the hub genes of these three clusters were ATF3, XPO1, and TNFRSF14, which were highlighted in red.

The STRING network of the 26 genes. The 26 genes were mapped onto the STRING network (https://string-db.org/). The light-yellow nodes were genes from adjacent tissues while the light-blue nodes were genes from tumor tissues. The genes from tumors can be grouped into three clusters and the hub genes of these three clusters were ATF3, XPO1, and TNFRSF14, which were highlighted in red. Probe A_21_P0010506, ranked 5th in Table 1, was mapped onto TNFRSF14. TNFRSF14, also known as HVEM, encodes a member of the TNF receptor superfamily that activates either proinflammatory or inhibitory signaling pathways (Pasero et al., 2012). Recent reports indicate that HVEM and its ligands may also be involved in tumor progression and resistance to immune response (Derre et al., 2010). The tumor microenvironment of pancreatic cancer is rich in the expression of immune-inhibitory molecules, such as PDL1, galectin-9, HVEM, or HLA-G (Sideras et al., 2014). High tumor expression of these molecules has been identified to be associated with improved cancer-specific survival (Sideras et al., 2017). The combination of such immune biomarkers could be a powerful prognostic tool for pancreatic cancer patients, as well as targets for future immunotherapy. Probe A_23_P40078, ranked 20th in Table 1, was mapped onto XPO1. Exportin 1 (XPO1), also called chromosome maintenance region 1 (CRM1), is known as a medium of nucleo-cytoplasmic shuttling of mature microRNAs (Muqbil et al., 2013). Recent studies show that up- or down-regulation of specific miRNAs and their target genes are directly involved in the progression and prognosis of human cancers like PDAC (Calin and Croce, 2006; Zhang et al., 2007). Azmi et al. suggested that XPO1 inhibition can down-regulate the expression of miR-145 target pathways via up-regulating the expression of tumor suppressive miR-145, therefore leading to the inhibition of migration and proliferation of PDAC (Azmi et al., 2017). High expression of XPO1, a common feature of PDAC and other cancers, results in functional inactivation of tumor suppressor proteins (TSPs) via constant nuclear protein export (Turner and Sullivan, 2008; Huang et al., 2009; Gao et al., 2014). Gao et al. developed some newly specific inhibitors of nuclear export targeting XPO1, which have been proven to inhibit the proliferation of pancreatic cancer cells and tumor invasion effectively (Gao et al., 2014). Therefore, blocking nuclear export could become an attractive therapeutic strategy for the treatment of PDAC (Mao and Yang, 2013). Probe A_24_P33895, ranked 24th in Table 1, was mapped onto ATF3. Activating transcription factor 3 (ATF3) is involved in the complex process of cellular stress response (Hackl et al., 2010) and is a key mediator of the PERK/ATF4 pathway (Jiang et al., 2004). Several studies have identified rapid increases of ATF3 expression during pancreatic insult (Allen-Jennings et al., 2001; Hartman et al., 2004; Jiang et al., 2004), but it is still unclear how the increase affects the response of the pancreas to injury. Fazio et al. demonstrated that ATF3 reduces the severity of pancreatic injury as a key transcriptional regulator of pancreatic acinar cells (Fazio et al., 2017). However, long-term activation of ATF3 may increase the sensitivity to factors that promote PDAC.

Conclusion

Pancreatic cancer is a common cancer and pancreatic ductal adenocarcinoma (PDAC) is the most aggressive subtype, with a 5-year survival rate of only 3–5%. Perineural invasion (PNI) is associated with the poor prognosis of PDAC. Adjacent tissues are normal tissues that grow around tumors. There are often some capillaries and immune cells in the adjacent tissues due to the influences of tumor invasion. Adjacent and tumor tissues constitute the tumor microenvironment (Degos et al., 2019; Harmon et al., 2019). Many studies have been conducted on the adjacent tissues of patients, and the results suggest that the expression of corresponding genes in adjacent tissues can be used to predict the prognosis of patients (Lee et al., 2019). To explore the mechanism of PNI, by analyzing the gene expression profiles of tumors and adjacent tissues from 28 pancreatic ductal adenocarcinoma patients with perineural invasion and 22 pancreatic ductal adenocarcinoma patients without perineural invasion, we identified 26 key features, within which 15 features were from tumor tissues while 11 features were from adjacent tissues. These results merit further validation in large cohort studies.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: the NCBI Gene Expression Omnibus (GSE102238).

Author Contributions

J-HZ, Q-LY, and J-WW contributed to the study design. YC conducted the literature search. Q-HY, Z-JW, and TH acquired the data. J-HZ and TH wrote the article. Q-LY and J-WW performed the data analysis. J-HZ, TH, and Q-LY revised the article and gave the final approval of the version to be submitted. All authors have read and agreed to the published version of the manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  79 in total

1.  Weighted gene co-expression network analysis reveals key genes involved in pancreatic ductal adenocarcinoma development.

Authors:  Matteo Giulietti; Giulia Occhipinti; Giovanni Principato; Francesco Piva
Journal:  Cell Oncol (Dordr)       Date:  2016-05-30       Impact factor: 6.730

2.  Perineural invasion in pancreatic cancer.

Authors:  Ichiro Hirai; Wataru Kimura; Koichiro Ozawa; Shun Kudo; Koichi Suto; Hiroshi Kuzu; Akira Fuse
Journal:  Pancreas       Date:  2002-01       Impact factor: 3.327

3.  Updated long-term outcomes and prognostic factors for patients with unresectable locally advanced pancreatic cancer treated with intraoperative radiotherapy at the Massachusetts General Hospital, 1978 to 2010.

Authors:  Sophie Cai; Theodore S Hong; Saveli I Goldberg; Carlos Fernandez-del Castillo; Sarah P Thayer; Cristina R Ferrone; David P Ryan; Lawrence S Blaszkowsky; Eunice L Kwak; Christopher G Willett; Keith D Lillemoe; Andrew L Warshaw; Jennifer Y Wo
Journal:  Cancer       Date:  2013-09-04       Impact factor: 6.860

4.  The roles of ATF3 in glucose homeostasis. A transgenic mouse model with liver dysfunction and defects in endocrine pancreas.

Authors:  A E Allen-Jennings; M G Hartman; G J Kociba; T Hai
Journal:  J Biol Chem       Date:  2001-05-22       Impact factor: 5.157

5.  Neoadjuvant chemoradiation for localized adenocarcinoma of the pancreas.

Authors:  R R White; H I Hurwitz; M A Morse; C Lee; M S Anscher; E K Paulson; M R Gottfried; J Baillie; M S Branch; P S Jowell; K M McGrath; B M Clary; T N Pappas; D S Tyler
Journal:  Ann Surg Oncol       Date:  2001-12       Impact factor: 5.344

6.  Paracrine regulation of pancreatic cancer cell invasion by peripheral nerves.

Authors:  Ziv Gil; Oren Cavel; Kaitlyn Kelly; Peter Brader; Avigail Rein; Sizhi P Gao; Diane L Carlson; Jatin P Shah; Yuman Fong; Richard J Wong
Journal:  J Natl Cancer Inst       Date:  2010-01-12       Impact factor: 13.506

7.  Predicting and analyzing early wake-up associated gene expressions by integrating GWAS and eQTL studies.

Authors:  JiaRui Li; Tao Huang
Journal:  Biochim Biophys Acta Mol Basis Dis       Date:  2017-11-03       Impact factor: 5.187

Review 8.  The repair Schwann cell and its function in regenerating nerves.

Authors:  K R Jessen; R Mirsky
Journal:  J Physiol       Date:  2016-03-21       Impact factor: 5.182

9.  Identification of Differentially Expressed Genes between Original Breast Cancer and Xenograft Using Machine Learning Algorithms.

Authors:  Deling Wang; Jia-Rui Li; Yu-Hang Zhang; Lei Chen; Tao Huang; Yu-Dong Cai
Journal:  Genes (Basel)       Date:  2018-03-12       Impact factor: 4.096

10.  miR-5591-5p regulates the effect of ADSCs in repairing diabetic wound via targeting AGEs/AGER/JNK signaling axis.

Authors:  Qiang Li; Sizhan Xia; Yating Yin; Yanping Guo; Feifei Chen; Peisheng Jin
Journal:  Cell Death Dis       Date:  2018-05-01       Impact factor: 8.469

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.