| Literature DB >> 32005189 |
Palloma Porto Almeida1, Cristina Padre Cardoso1,2, Leandro Martins de Freitas3.
Abstract
BACKGROUND: Although the pancreatic ductal adenocarcinoma (PDAC) presents high mortality and metastatic potential, there is a lack of effective therapies and a low survival rate for this disease. This PDAC scenario urges new strategies for diagnosis, drug targets, and treatment.Entities:
Keywords: Artificial neural network; Meta-analysis; Pancreatic ductal adenocarcinoma
Mesh:
Substances:
Year: 2020 PMID: 32005189 PMCID: PMC6995241 DOI: 10.1186/s12885-020-6533-0
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Characteristics of studies used in the meta-analysis
| Accession number | Study | Array platform | Differentially expressed genes | Samples | ||
|---|---|---|---|---|---|---|
| Upregulated | Downregulated | Tumor | Normal | |||
| GSE23397 | * | Affymetrix Human Exon 1.0 ST Array | 4031 | 870 | 15 | 6 |
| GSE28735 | [ | 245 | 146 | 45 | 45 | |
| GSE41368 | [ | 1200 | 462 | 6 | 6 | |
| GSE32676 | [ | Affymetrix Human Genome U133 Plus 2.0 Array | 686 | 319 | 25 | 7 |
| GSE71989 | [ | 3052 | 661 | 13 | 8 | |
| GSE15471 | [ | 1546 | 227 | 39 | 39 | |
| GSE62165 | [ | Affymetrix Human Genome U219 Array | 2638 | 1266 | 118 | 13 |
| GSE43795 | [ | Illumina HumanHT-12 V4.0 expression beadchip | 1978 | 1343 | 6 | 5 |
| GSE71729 | [ | Agilent-014850 Whole Human Genome Microarray 4x44K G4112F | 285 | 175 | 145 | 46 |
| GSE60979 | [ | Agilent-028004 SurePrint G3 Human GE 8x60K Microarray | 1365 | 1336 | 49 | 12 |
| Total | 461 | 187 | ||||
| GSE16515 | [ | Affymetrix Human Genome U133 Plus 2.0 Array | – | – | 36 | 16 |
| GSE62452 | [ | Affymetrix Human Gene 1.0 ST Array | 69 | 61 | ||
| Total | 105 | 77 | ||||
* No publication available
- Analysis of differentially expressed genes was not applied to the validation dataset
Fig. 1Artificial neural network architecture. A graphical representation of a fully connected artificial intelligence algorithm (PDAC-ANN). PDAC-ANN is a set of mathematical equations; in each layer, it transforms expression values up to the last layer. The expression values from AHNAK2, KRT19, LAMB2, LAMC2, and S100P genes are data inserted in the input layer (green neurons), the hidden layers (blue neurons) process the expression values, and the output layer (red neurons) give the classification in normal or PDAC sample as a probability
Description of the core-genes involved in the PDAC biological process
| Gene symbol | Gene name | Gene symbol | Gene name |
|---|---|---|---|
| AHNAK2 | AHNAK nucleoprotein 2 | KRT19 | keratin 19 |
| ANLN | anillin actin binding protein | LAMA3 | laminin subunit alpha 3 |
| ANO1 | anoctamin 1 | LAMB3 | laminin subunit beta 3 |
| ASPM | abnormal spindle microtubule assembly | LAMC2 | laminin subunit gamma 2 |
| CAPG | capping actin protein, gelsolin like | LCN2 | lipocalin 2 |
| CEACAM5 | carcinoembryonic antigen related cell adhesion molecule 5 | MET | MET proto-oncogene, receptor tyrosine kinase |
| CEACAM6 | carcinoembryonic antigen related cell adhesion molecule 6 | NQO1 | NAD(P)H quinone dehydrogenase 1 |
| COL10A1 | collagen type X alpha 1 chain | OAS1 | 2′-5′-oligoadenylate synthetase 1 |
| CXCL5 | C-X-C motif chemokine ligand 5 | S100A14 | S100 calcium binding protein A14 |
| DKK1 | dickkopf WNT signaling pathway inhibitor 1 | S100P | S100 calcium binding protein P |
| FXYD3 | FXYD domain containing ion transport regulator 3 | SERPINB5 | serpin family B member 5 |
| GABRP | gamma-aminobutyric acid type A receptor pi subunit | SLC2A1 | solute carrier family 2 member 1 |
| GCNT3 | glucosaminyl (N-acetyl) transferase 3, mucin type | SLC44A4 | solute carrier family 44 member 4 |
| GJB2 | gap junction protein beta 2 | SLC6A14 | solute carrier family 6 member 14 |
| GPRC5A | G protein-coupled receptor class C group 5 member A | SLPI | secretory leukocyte peptidase inhibitor |
| GPX2 | glutathione peroxidase 2 | TCN1 | transcobalamin 1 |
| IFI27 | interferon alpha inducible protein 27 | TFF1 | trefoil factor 1 |
| ITGA2 | integrin subunit alpha 2 | TMC5 | transmembrane channel like 5 |
| ITGA3 | integrin subunit alpha 3 | TMPRSS4 | transmembrane protease, serine 4 |
| TSPAN1 | tetraspanin 1 | ||
| AOX1 | aldehyde oxidase 1 | ||
Fig. 2Variation in protein expression data from the GC list retrieved from immunohistochemical staining images in HPA. The protein expression data shows that 14 genes have more than 75% of images with high plus medium expression in pancreatic cancer, evidencing the expression of predicted core-genes in the pancreatic tissue. The genes with protein expression confirmed in IHC staining images were highlighted in red. Data credit: Human Protein Atlas
Fig. 3Representative immunohistochemistry staining of AHNAK2, KRT19, LAMB2, LAMC2, and S100P in Pancreatic Ductal Adenocarcinoma (Tumor) and normal pancreatic tissue (Normal). The proteins presented more than 75% of images with high plus medium expression in HPA. Scales bars represent 400 μm. Image courtesy of Human Protein Atlas
Fig. 4PCA and hierarchical analysis of the merged data set into one data. a. PCA analysis clearly showed two distinct groups corresponding to normal and tumor samples. b. Clustering analysis. The red band indicates the PDAC samples with similar gene expression on 40-core-gene, and the blue band indicates the normal samples
Classification report of the validation test set
| Precision | Recall | F1-score | Support | |
|---|---|---|---|---|
| Normal | 0.83 | 0.83 | 0.83 | 77 |
| Tumor | 0.88 | 0.88 | 0.88 | 105 |
| Avg/total | 0.86 | 0.86 | 0.86 | 182 |
Confusion matrix of the training and validation test samples
| Actual normal | Actual tumor | ||
|---|---|---|---|
| Training | Classified normal | 169 | 49 |
| Classified tumor | 18 | 412 | |
| Specificity = 90.4 | Sensitivity = 89.4 | ||
| Test | Classified normal | 64 | 13 |
| Classified tumor | 13 | 92 | |
| Specificity = 83.1 | Sensitivity = 87.6 |