| Literature DB >> 32276350 |
Ziling Fan1, Yuan Zhou2, Habtom W Ressom2.
Abstract
The recent advancement of omic technologies provides researchers with the possibility to search for disease-associated biomarkers at the system level. The integrative analysis of data from a large number of molecules involved at various layers of the biological system offers a great opportunity to rank disease biomarker candidates. In this paper, we propose MOTA, a network-based method that uses data acquired at multiple layers to rank candidate disease biomarkers. The networks constructed by MOTA allow users to investigate the biological significance of the top-ranked biomarker candidates. We evaluated the performance of MOTA in ranking disease-associated molecules from three sets of multi-omic data representing three cohorts of hepatocellular carcinoma (HCC) cases and controls with liver cirrhosis. The results demonstrate that MOTA allows the identification of more top-ranked metabolite biomarker candidates that are shared by two different cohorts compared to traditional statistical methods. Moreover, the mRNA candidates top-ranked by MOTA comprise more cancer driver genes compared to those ranked by traditional differential expression methods.Entities:
Keywords: differential network; metabolomics; multi-omic integration; transcriptomics
Year: 2020 PMID: 32276350 PMCID: PMC7241240 DOI: 10.3390/metabo10040144
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Framework of Multi-Omic inTegrative Analysis (MOTA), demonstrating how an intra-omic network is constructed based on data from Omic 1 and other omic datasets (Omic 2 and Omic 3) to add inter-omic connections to the network. The resulting network allows us to rank disease-associated molecules from Omic 1. Partial correlation (pc) is calculated for feature pairs in the Omic 1 dataset, and is used to determine intra-omic connections. For other omic datasets, canonical correlation (cc) is calculated for feature pairs between Omic 1 and other omic datasets (e.g., Omic 2, Omic 3, etc.), and is used to determine inter-omic connections. The MOTA activity score of a node is calculated based on the network topology and statistical significance of the feature represented by the node itself and other features whose nodes (from any of the omic datasets) are connected to it.
Multi-omic datasets acquired from three cohorts. The number of features in each omic dataset and the number of serum and tissue samples analyzed by multi-omic approaches are shown. HCC, hepatocellular carcinoma, CIRR, cirrhosis, TU, Tanta University, GU, Georgetown University.
| Datasets | Omic Studies | Serum | Tissue | ||
|---|---|---|---|---|---|
| HCC | CIRR | HCC | CIRR | ||
| TU Datasets | Metabolomics (66) | 39 | 48 | ||
| GU1 Datasets | Metabolomics (53) | 40 | 44 | ||
| GU2 Datasets | Metabolomics (3672) | 37 | 24 | ||
Figure 2Network constructed by MOTA using the GU1 dataset, which consists of metabolomic, proteomic, and glycomic data, to rank metabolites. The size of a metabolite node is proportional to the corresponding MOTA activity score.
Ranking of metabolites in TU datasets. The p-value is calculated using Student’s t-test.
| Feature | Rank | MOTA | Rank | |
|---|---|---|---|---|
| tyrosine | 0.42 | 36 | 11.29 | 1 |
| alpha tocopherol | 0.85 | 50 | 10.24 | 2 |
| pyroglutamic acid | 0.01 | 4 | 8.96 | 3 |
| glycine | 0.01 | 5 | 8.62 | 4 |
| ethanolamine | 0.00 | 1 | 8.34 | 5 |
| phenylalanine | 0.01 | 2 | 7.92 | 6 |
| citric acid | 0.13 | 16 | 7.42 | 7 |
| threitol | 0.08 | 12 | 7.27 | 8 |
| tyramine | 0.95 | 53 | 7.23 | 9 |
| aspartic acid | 0.08 | 13 | 7.18 | 10 |
| ribitol /arabitol | 0.06 | 10 | 7.08 | 11 |
| creatinine | 0.02 | 7 | 7.01 | 12 |
| malic acid | 0.22 | 20 | 7.00 | 13 |
| Proline | 0.45 | 38 | 7.00 | 14 |
| lactulose | 0.26 | 23 | 6.43 | 15 |
| linoleic acid | 0.02 | 6 | 6.42 | 16 |
| hydroxybenzyl alcohol | 0.34 | 33 | 6.40 | 17 |
| malonic acid | 0.26 | 24 | 6.34 | 18 |
| xanthine | 0.29 | 29 | 6.30 | 19 |
| sorbose | 0.01 | 3 | 6.26 | 20 |
| myo-inositol | 0.31 | 30 | 6.23 | 21 |
| stearic acid | 0.08 | 11 | 6.20 | 22 |
| diglycerol | 0.21 | 19 | 6.18 | 23 |
| lauric acid | 0.06 | 8 | 6.18 | 24 |
Biomarker candidates overlapping between the GU1, TU, and combined TU and GU1 datasets ranked by t-test, iDINGO, and MOTA.
| Rank | GU1 Cohort | TU Cohort | GU1+TU Cohort | No. of Overlaps |
|---|---|---|---|---|
| Ranking using | ||||
| 1 |
| glutamic acid | ethanolamine | 2 |
| 2 | phenylalanine | lactic acid |
| |
| 3 |
| alpha tocopherol | citric Acid | |
| 4 | pyroglutamic acid | valine | isoleucine | |
| 5 | glycine |
| threitol | |
| 6 | linoleic acid | alpha-D-glucosamine 1-phosphate | ribose | |
| 7 | creatinine | norvaline | malic acid | |
| 8 | lauric acid | citric Acid | phenylalanine | |
| 9 | ribitol /arabitol | norleucine | stearic acid | |
| 10 | threitol |
| trans-aconitic acid | |
| Ranking using | ||||
| 1 | linoleic acid | norvaline | valine | 2 |
| 2 |
| cystine | ethanolamine | |
| 3 | leucine |
| butanediol | |
| 4 | proline | tagatose | ribose | |
| 5 | ethanolamine |
| glycine | |
| 6 | valine | trans-3-hydroxy-L-proline |
| |
| 7 | glutamic acid | N,N-dimethyl-1-4-phenylenediamine | tyrosine | |
| 8 |
| cholesterol | malic acid | |
| 9 | aspartic acid | butanediol |
| |
| 10 | glycine | arachidic acid | tagatose | |
| Ranking using | ||||
| 1 |
|
|
| 4 |
| 2 |
|
|
| |
| 3 | pyroglutamic acid |
| glycine | |
| 4 | glycine | creatinine | lactic acid | |
| 5 |
|
| creatinine | |
| 6 | phenylalanine | mimosine |
| |
| 7 | citric acid | lactic acid | cholesterol | |
| 8 | threitol | cholesterol |
| |
| 9 |
| threitol | citric Acid | |
| 10 | aspartic acid | ribose | isoleucine | |
Note: Metabolite candidates that appeared in the top-10 ranked lists of all three cohorts are highlighted with the same color.
Top-5 significant Gene Ontology (GO) terms based on genes selected by DESeq2, iDINGO, and MOTA.
| DESeq2 | iDINGO | MOTA | ||||
|---|---|---|---|---|---|---|
| No. of GO Terms with FDR < 0.05 | 0 | 7 | 10 | |||
| GO Terms | FDR | GO Terms | FDR | GO Terms | FDR | |
| Gene | chromatin organization | 1.0 | chromatin assembly | 0.014 | extracellular matrix organization | 1.27 × 10−5 |
| kidney development | 1.0 | nucleosome assembly | 0.014 | extracellular structure organization | 1.90 × 10−5 | |
| renal system development | 1.0 | nucleosome organization | 0.016 | positive regulation of protein kinase B signaling | 7.56 × 10−4 (1.43 × 10−7) | |
| nucleosome assembly | 1.0 | Chromatin assembly or disassembly | 0.019 | cell chemotaxis | 1.48 × 10−3 (6.56 × 10−7) | |
| urogenital system development | 1.0 | DNA packaging | 0.214 | elastic fiber assembly | 1.58 × 10−3 (5.98 × 10−7) | |
Number of cancer driver genes selected using DESeq2, iDINGO, and MOTA.
| Top | DESeq2 | iDINGO | MOTA |
|---|---|---|---|
| Top 10 | 1 | 0 | 2 |
| Top 50 | 3 | 1 | 6 |
| Top 100 | 4 | 3 | 8 |