| Literature DB >> 30040709 |
Yazeli E Cruz-Rivera1, Jaileene Perez-Morales2, Yaritza M Santiago1, Valerie M Gonzalez1, Luisa Morales3, Mauricio Cabrera-Rios1, Clara E Isaza1,3.
Abstract
In 2017, approximately 5 million Americans were living with Alzheimer's disease (AD), and it is estimated that by 2050 this number could increase to 16 million. In this study, we apply mathematical optimization to approach microarray analysis to detect differentially expressed genes and determine the most correlated structure among their expression changes. The analysis of GSE4757 microarray dataset, which compares expression between AD neurons without neurofibrillary tangles (controls) and with neurofibrillary tangles (cases), was casted as a multiple criteria optimization (MCO) problem. Through the analysis it was possible to determine a series of Pareto efficient frontiers to find the most differentially expressed genes, which are here proposed as potential AD biomarkers. The Traveling Sales Problem (TSP) model was used to find the cyclical path of maximal correlation between the expression changes among the genes deemed important from the previous stage. This leads to a structure capable of guiding biological exploration with enhanced precision and repeatability. Ten genes were selected (FTL, GFAP, HNRNPA3, COX1, ND2, ND3, ND4, NUCKS1, RPL41, and RPS10) and their most correlated cyclic structure was found in our analyses. The biological functions of their products were found to be linked to inflammation and neurodegenerative diseases and some of them had not been reported for AD before. The TSP path connects genes coding for mitochondrial electron transfer proteins. Some of these proteins are closely related to other electron transport proteins already reported as important for AD.Entities:
Keywords: Alzheimer’s disease; correlation; optimization; traveling salesman problem
Mesh:
Substances:
Year: 2018 PMID: 30040709 PMCID: PMC6087431 DOI: 10.3233/JAD-170799
Source DB: PubMed Journal: J Alzheimers Dis ISSN: 1387-2877 Impact factor: 4.472
Fig.1(a) MCO representation: Each gene under analysis is represented by two performance measures to be maximized, in this case, the absolute difference of medians and the absolute difference of means (b) MCO Solution: the maximization directions help to form a cone that originates on a particular gene to be evaluated. If this cone is empty (it does not contain another gene), then the gene is a Pareto Efficient solution. Otherwise, the gene is a dominated solution and therefore not Pareto Efficient.
Fig.2(a) TSP Initial Setup: in this graph, the nodes represent genes and the undirected arcs represent correlation between each pair of nodes. Each arc contains the absolute value of the correlation between the gene expression changes of the genes at its extremes (b) TSP Solution: the cyclic correlation path with the largest sum of absolute correlations, in this case those assigned to pairs (1,2)-(2,5)-(5,3)-(3,4)-(4,1).
List of 10 potential biomarkers identified in the first 3 frontiers through the MCO problem
| Accession Number | Identifier |
|---|---|
| 1553551_s_at | |
| 1553538_s_at | |
| 224373_s_at | |
| 1555653_at | |
| 1553588_at | |
| 201492_s_at | |
| 212788_x_at | |
| 203540_at | |
| 200095_x_at | |
| 229353_s_at |
Fig.4Gene coordinated behavior pathway as determined by the Traveling Sales Problem solution.
Fig.3Correlation matrix indicating how strong the correlations between the expression changes for the 10 potential biomarkers are. Values close to 1 indicates strong correlations, the strength decreases as the correlation coefficient does.
Biological processes, chromosomal location, and subcellular location of the 10 genes identified by the MCO and TSP method
| Gene name | Abbreviation | Biological Process | Subcellular location | Chromosomal location | Literature Review Expression | Expression due to our model | Reference |
|---|---|---|---|---|---|---|---|
| Ferritin light chain | iron homeostasis; neutrophil degranulation | cytosol, lysosome, extracellular exosome | 19q13.33 | Underexpression | overexpression | Uniprot [91] | |
| Glial fibrillary acidic protein | cytoskeletal organization, response to wounding, negative regulation of neuron projection development | cytoskeleton, cytosol, mitochondrion | 17q21.31 | Overexpression | overexpression | Gencards [92] | |
| Heterogenous nuclear ribonucleoprotein A3 | mRNA splicing, mRNA transport, RNA metabolic process | nucleus, cytosol | 2q31.2 | Underexpression | overexpression | Uniprot [93] | |
| Cytochrome C oxidase subunit 1 | aerobic respiration, aging, cerebellum development, response to electrical stimulus and oxidative stress | mitochondrion | 9q32-33.3 | Overexpression | overexpression | Uniprot [94] | |
| NADH Ubiquinone oxidoreductase chain 2 | receptor binding, reactive oxygen species metabolic process | mitochondrion inner membrane | Overexpression | overexpression | Uniprot | ||
| NADH Uniquinone oxidoreductase chain 3 | cellular response to glucocorticoid stimulus; oxidoreductase process, response to light intensity | mitochondrion inner membrane | 10p22.12 | Overexpression | overexpression | Uniprot [95, 96] | |
| NADH Ubiquinone oxidoreductase chain 4 | aging, cerebellum process, mitochondrial electron transport | mitochondrion inner membrane | 1q24.2 | Overexpression | overexpression | Uniprot | |
| Nuclear ubiquitine casein and cyclin dependent kinase substrate 1 | cellular glucose homeostasis, regulation of insulin receptor and regulation of viral transcription | nucleus | 1q32.1 | overexpression | overexpression | Uniprot [97] | |
| 60S ribosomal protein 41 | rRNA processing, translation, viral transcription | cytosol | 12q13.2 | overexpression | overexpression | Uniprot, Genecards [98] | |
| 40S ribosomal protein S10 | ribosomal small subunit | ||||||
| assembly, rRNA processing, viral transcription | nucleus | 6p21.31 | overexpression | overexpression | Uniprot |
Fig.5Cellular localization of the selected genes’ protein products
Fig.6Mitochondrial localization of COX1, Mt-ND3, and Mt-ND4.
Fig.7Diagram summarizing a response to SIRT3 lower expression leading to a decrease level of MT-ND2 and MT-ND4 [35]. This decrease levels results in neuronal damage and subsequent neurodegeneration.
Fig.8Flowchart summarizing a relation between RPL41 and ATF4. A) Depletion of glutathione induces oxidative stress which causes activation of RPL41 that results in phosphorylation of ATF4 causing neuronal dysfunction [52]. B) Deletion of ATF4 causes neurons to resist cell death and preserve glutathione levels [58].