| Literature DB >> 26471143 |
Katia I Camacho-Cáceres1, Juan C Acevedo-Díaz1, Lynn M Pérez-Marty1, Michael Ortiz1, Juan Irizarry1, Mauricio Cabrera-Ríos1, Clara E Isaza1,2.
Abstract
Microarrays can provide large amounts of data for genetic relative expression in illnesses of interest such as cancer in short time. These data, however, are stored and often times abandoned when new experimental technologies arrive. This work reexamines lung cancer microarray data with a novel multiple criteria optimization-based strategy aiming to detect highly differentially expressed genes. This strategy does not require any adjustment of parameters by the user and is capable to handle multiple and incommensurate units across microarrays. In the analysis, groups of samples from patients with distinct smoking habits (never smoker, current smoker) and different gender are contrasted to elicit sets of highly differentially expressed genes, several of which are already associated to lung cancer and other types of cancer. The list of genes is provided with a discussion of their role in cancer, as well as the possible research directions for each of them.Entities:
Keywords: Biomarker; lung cancer; meta-analysis; multi-criteria optimization
Mesh:
Year: 2015 PMID: 26471143 PMCID: PMC4940807 DOI: 10.1002/cam4.540
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Figure 1Problem representation where G = {g i}, i = 1,2,3,…,n and .
Figure 2Representation of the Pareto‐efficient frontier of the MCO problem.
All the possible combinations of a minimization problem for two criteria
| Outcome number |
|
|
| Outcome |
|---|---|---|---|---|
| 1 | 0 | 0 | 0 |
|
| 2 | 0 | −1 | −1 |
|
| 3 | 0 | W | W |
|
| 4 | −1 | 0 | −1 |
|
| 5 | −1 | −1 | −2 |
|
| 6 | −1 | W | W‐1 |
|
| 7 | W | 0 | W |
|
| 8 | W | −1 | W‐1 |
|
| 9 | W | W | 2W |
|
Figure 3Graphical and Mathematical representation of the sample problem. (A) The six candidate solutions of the sample problem. (B) Mathematical formulation of the problem.
Figure 4Pareto‐efficient solutions for the sample problem.
Figure 5Organization of database GDS3257. “C” indicates cancer and “H” indicates controls.
Figure 6Local Pareto‐efficient frontiers of all groups. For the first and second groups, two genes are at the local Pareto‐efficient frontier, and only one gene for the third group.
Figure 7Globally‐optimal Pareto‐efficient frontier consisting of and genes.
Figure 8Diagram representing six analyses between four different conditions (HNS, HCS, CNS, CCS). The edges of the graph list the genes in the associated Pareto‐efficient frontier.
Figure 9Diagram representing six analyses between four different conditions for women samples (HNSW, HCSW, CNSW, CCSW). The edges of the graph list the genes in the associated Pareto‐efficient frontier.
Figure 10Diagram representing six analyses between four different conditions for men samples (HNSM, HCSM, CNSM, CCSM). The edges of the graph list the genes in the associated Pareto‐efficient frontier.
Scientific names the genes identified in the analyses of this work
| Official symbol | Official name |
|---|---|
| RAGE | Receptor for Advanced Glycosylation End Products |
| SPP1 | Secreted PhosphoProtein 1 |
| XIST | X Inactive Specific Transcript (nonprotein coding) |
| RPS4Y1 | Ribosomal Protein S4, Y‐linked 1 |
| CYP1B1 | Cytochrome P450, family 1, subfamily B, polypeptide 1 |
| FABP4 | Fatty Acid Binding Protein 4, adipocyte |
| CEACAM6 | Carcinoembryonic Antigen‐related Cell Adhesion Molecule 6 (nonspecific cross reacting antigen) |
| MSMB | Microseminoprotein, beta |
| SCGB1A1 | Secretoglobin, family 1A, member 1 (uteroglobin) |
| ADH1B | Alcohol Dehydrogenase 1B (class I), beta polypeptide |
| CYP4B1 | Cytochrome P450, family 4, subfamily B, polypeptide 1 |
| KRT15 | Keratin 15 |
| FGG | Fibrinogen Gamma chain |
Figure 11Groups for meta‐analysis with four PMs.
Summary from Pareto‐efficient frontier genes and their related cancer
| Gene name | Examples of cancer types where the gene is involved | Reference |
|---|---|---|
| RAGE | Pancreas, colon and prostate, colorectal, gastric, liver, lung |
|
| SPP1 | Oral, lung, bone, bladder, prostate, cervical, breast, head and neck, liver |
|
| XIST | Meninges, breast, ovarian |
|
| RPS4Y1 | Meninges |
|
| CYP1B1 | Lung, cervical, head and neck, prostate |
|
|
| ||
| FABP4 | Prostate and breast, ovarian |
|
| MSMB | Prostate |
|
| CEACAM6 | Head and neck, breast, colon, lung |
|
| SCGB1A1* | Lung |
|
|
| ||
| FGG | Liver |
|
| KRT15 | Lung, ovarian |
|
| ADH1B | Esophageal, colorectal, head and neck |
|
| CYP4B1 | Bladder |
|
| SCGB1A1* | Lung |
|
Summary of important genes expressed using volcano plot
| P‐value | Fold change | Differential expression | Overexpressed | Underexpressed |
|---|---|---|---|---|
| 10−2 | 2 | 934 | 645 | 289 |
| 10−2 | 8 | 29 | 23 | 6 |
| 10−2 | 24 | 2 | 1 | 1 |
| 10−7 | 2 | 649 | 516 | 133 |
| 10−7 | 8 | 27 | 22 | 5 |
| 10−7 | 24 | 2 | 1 | 1 |
| 10−12 | 2 | 130 | 121 | 9 |
| 10−12 | 8 | 12 | 11 | 1 |
| 10−12 | 24 | 2 | 1 | 1 |