| Literature DB >> 26388997 |
Enery Lorenzo1, Katia Camacho-Caceres1, Alexander J Ropelewski2, Juan Rosas1, Michael Ortiz-Mojer1, Lynn Perez-Marty1, Juan Irizarry1, Valerie Gonzalez1, Jesús A Rodríguez1, Mauricio Cabrera-Rios1, Clara Isaza3.
Abstract
Establishing how a series of potentially important genes might relate to each other is relevant to understand the origin and evolution of illnesses, such as cancer. High-throughput biological experiments have played a critical role in providing information in this regard. A special challenge, however, is that of trying to conciliate information from separate microarray experiments to build a potential genetic signaling path. This work proposes a two-step analysis pipeline, based on optimization, to approach meta-analysis aiming to build a proxy for a genetic signaling path.Entities:
Keywords: cancer biology; signaling pathways; traveling salesman problem
Year: 2015 PMID: 26388997 PMCID: PMC4573573 DOI: 10.3390/microarrays4020287
Source DB: PubMed Journal: Microarrays (Basel) ISSN: 2076-3905
Figure 1Representation of a multiple criteria optimization problem with two performance measures.
Figure 2Multiple Criteria Optimization Problem solved using Data Envelopment Analysis (BCC model). The efficient solutions are identified through the use of piecewise-linear segments.
Figure 3A case with genes characterized by two performance measures. Referring to this figure, and following the proposed method, at this point it is recommended to identify the first 10 efficient frontiers. This can be easily done by identifying the genes in the first efficient frontier through DEA, then removing them from the set and continue with a second DEA iteration.
Figure 4Representation of the many options for a cyclic path for 5 genes.
List 28 genes found through DEA as being differentially expressed in cervix cancer and cross validated for the direction of expression change [3].
| Gene Probe | Gene Name | Sign of expression change from healthy tissues to cancer tissues | Examples of cancer types where the gene is involved | Reference | |
|---|---|---|---|---|---|
| Database 1 [ | Database 2 [ | ||||
| 202575_at | CRABP2 | - | - | Head and Neck, Breast | [ |
| 205402_x_at | PRSS2 | - | - | Colorectal, Gastric Tumorigenesis | [ |
| 218677_at | S100A14 | - | - | Esophageal squamous cell carcinoma cells, oral squamous cell carcinoma | [ |
| 202096_s_at | TSPO | - | - | Thyroid, Breast | [ |
| 212249_at | PIK3R1 | - | - | Endometrial, Colorectal | [ |
| 212567_s_at | MAP4 | - | - | Breast, non small cell lung carcinomas | [ |
| 211366_x_at | CASP1 | - | - | Cervical squamous carcinoma cells | [ |
| 212889_x_at | GADD45GIP1 | - | - | SKOV3 and HeLa cell lines | [ |
| 206626_x_at | SSX1 | - | - | Prostate, multiple myeloma | [ |
| 213450_s_at | ICOSLG | - | - | Metastatic melanoma, ductal pancreatic adenocarcinoma | [ |
| 220405_at | SNTG1 | - | - | ||
| 208032_s_at | GRIA3 | - | - | Pancreatic | [ |
| 205690_s_at | BUD31 | - | - | ||
| 206543_at | SMARCA2 | - | - | Prostate, Skin | [ |
| 212291_at | HIPK1 | + | + | Acute myeloid leukemia | [ |
| 211615_s_at | LRPPRC | + | + | Lung adenocarcinoma cell lines, oesophageal squamous cell carcinoma, stomach, colon, mammary and endometrial adenocarcinoma, and lymphoma | [ |
| 222027_at | NUCKS1 | + | + | Breast | [ |
| 205362_s_at | PFDN4 | + | + | Colorectal | [ |
| 211929_at | HNRNPA3 | + | + | Non-small cell lung cancer | [ |
| 203738_at | C5orf22 | + | + | ||
| 201794_s_at | SMG7 | + | + | ||
| 200607_s_at | RAD21 | + | + | Breast | [ |
| 201011_at | RPN1 | + | + | Hematologic malignancies | [ |
| 201761_at | MTHFD2 | + | + | Bladder, breast | [ |
| 203880_at | COX17 | + | + | Non-small cell lung cancer | [ |
| 212255_s_at | ATP2C1 | + | + | Breast, Cervical | [ |
| 205112_at | PLCE1 | + | + | Gastric adenocarcinoma, colorectal | [ |
| 201663_s_at201664_at | SMC4 | + | + | Breast, cervical | [ |
Figure 5Highest Correlated Cyclic Path among the 28 genes identified in Stage 1.
Adjacent genes in the solutions for the correlated cyclic path found adding five genes at a time.
| Number of Genes | Adjacent Genes |
|---|---|
| 5 | (CRABP2 with PRSS2) and (S100A14 with TSPO) |
| 10 | (PIK3R1 with MAP4) and (GADD45GIP1 with ICOSLG) |
| 15 | (SSX1 with BUD31), (ICOSLG with SNTG1), and (S100A14 with TSPO) |
| 20 | (LRPPRC with C5orf22) and (S100A14 with TSPO) |
| 25 | (S100A14 with TSPO), (SSX1 with GRIA3), (LRPPRC with MTHFD2), (RAD21 with BUD31), and (RPN1 with COX17) |
| 28 | (LRPPRC with MTHFD2) and (RPN1 with COX17) |
Selected genes localization.
| Gene | Location |
|---|---|
| HIPK1 | 1p13.2 |
| NUCKS1 | 1q32.1 |
| SMG7 | 1q25.3 |
| CRABP2 | 1q21.3 |
| S100A14 | 1q21.1 |
| HNRNPA3 | 2q31.2 |
| LRPPRC | 2p21 |
| MTHFD2 | 2p13.1 |
| SMC4 | 3q26.1 |
| ATP2C | 3q22.1 |
| RPN1 | 3q21.3 |
| MAP4 | 3p21.31 |
| COX17 | 3q13.33 |
| C5orf22 | 5p13.3 |
| PIK3R1 | 5q13.1 |
| BUD31 | 7q22.1 |
| PRSS2 | 7q34 |
| SNTG1 | 8q11.21 |
| RAD21 | 8q24.11 |
| SSX1 | Xp11.23 |
| GRIA3 | Xq25 |
| PFDN4 | 20q13.2 |
| CASP1 | 11q22.3 |
| PLCE1 | 10q23.33 |
| ICOSLG | 21q22.3 |
| GADD45G | 19p13.2 |
| SMARCA2 | 9p22.3 |
| TSPO | 22q13.31 |
An example of how the number of genes deemed significant changes when choosing different cutoff values for fold change and p-value.
| Fold change | Differentially expressed genes (number) | Number of genes Overexpressed | Number of genes Underexpressed | |
|---|---|---|---|---|
| 10−2 | 2 | 934 | 645 | 289 |
| 10−2 | 8 | 29 | 23 | 6 |
| 10−2 | 24 | 2 | 1 | 1 |
| 10−7 | 2 | 649 | 516 | 133 |
| 10−7 | 8 | 27 | 22 | 5 |
| 10−7 | 24 | 2 | 1 | 1 |
| 10−12 | 2 | 130 | 121 | 9 |
| 10−12 | 8 | 12 | 11 | 1 |
| 10−12 | 24 | 2 | 1 | 1 |