| Literature DB >> 29029625 |
Chenggang Yu1, Hyung Jun Woo1, Xueping Yu1, Tatsuya Oyama1, Anders Wallqvist1, Jaques Reifman2.
Abstract
BACKGROUND: Researchers have previously developed a multitude of methods designed to identify biological pathways associated with specific clinical or experimental conditions of interest, with the aim of facilitating biological interpretation of high-throughput data. Before practically applying such pathway analysis (PA) methods, we must first evaluate their performance and reliability, using datasets where the pathways perturbed by the conditions of interest have been well characterized in advance. However, such 'ground truths' (or gold standards) are often unavailable. Furthermore, previous evaluation strategies that have focused on defining 'true answers' are unable to systematically and objectively assess PA methods under a wide range of conditions.Entities:
Keywords: Gene set enrichment analysis; Method evaluation; Pathway analysis
Mesh:
Year: 2017 PMID: 29029625 PMCID: PMC5640951 DOI: 10.1186/s12859-017-1866-7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Gene Expression Omnibus (GEO) dataset sample size distribution. The majority of datasets deposited in GEO contain a small number of samples. Over 34% of datasets deposited in GEO before 2016 contain ≤6 samples, and over 57% of them contain ≤12 samples. This distribution remains mostly the same for datasets added to GEO in 2016
The 10 gene expression datasets used to assess the performance of pathway analysis methods
| Dataset IDa | Study | Tissue Type | Sample Size | References | |
|---|---|---|---|---|---|
| Treatment | Control | ||||
| GSE5281 | Alzheimer’s disease | Brain tissue | 35 | 38 | [ |
| GSE20295 | Parkinson’s disease | Brain tissue | 40 | 53 | [ |
| GSE3365 | Crohn’s disease | Blood | 59 | 42 | [ |
| GSE48018 | Influenza vaccination | Blood | 110 | 111 | [ |
| GSE20194 | Breast cancer | Cancer tissue | 222 | 56 | [ |
| GSE4115 | Lung cancer | Bronchial epithelium | 97 | 90 | [ |
| GSE37069 | Burn injury | Blood | 29 | 36 | [ |
| GSE36809 | Trauma injury | Blood | 75 | 37 | [ |
| N/A | Drug treatment of MCF7 cells | Cell line | 57 | 323 | [ |
| N/A | Drug treatment of PC3 cells | Cell line | 32 | 184 | [ |
aGene Expression Omnibus (GEO) database ID
Fig. 2The procedure used to compute two metrics, (a) recall and (b) discrimination. Recall provides a measure of consistency of the overlap between the pathways identified from the original dataset with those obtained from sub-datasets randomly sampled from the original dataset. The computation of discrimination employs two datasets from different studies and multiple sub-datasets resampled from each of the two datasets. Discrimination measures the fraction of sub-datasets that yield higher recall with the original dataset from which they were resampled in comparison to the recall for another randomly selected dataset
Fig. 3Comparison of recall values computed for the six pathway analysis methods, based on the top 20 most significantly up-regulated (a, b) and top 20 most significantly down-regulated pathways (c, d), which each method identified for the Parkinson’s disease dataset. For each box-and-whisker plot, the horizontal line, top and bottom sides of the box, and vertical line show the median, upper and lower quartiles, and range, respectively, of the recall values. The comparisons are separated into two groups for clarity and to distinguish the two different permutation methods. a and c compare methods ORA, GSA, GSEA, and AFC. b and d compare methods GSEAs, GSEA, AFCs, and AFC. The sample size corresponds to the number of samples contained in the sub-datasets randomly resampled from the original datasets
Average recall values for the six pathway analysis (PA) methods, using the 10 gene expression datasets
| Sample Size | ORA | GSA | GSEA | GSEAs | AFC | AFCs |
|---|---|---|---|---|---|---|
| Significantly up-regulated pathways | ||||||
| 20 MSPa | ||||||
| 6 | 0.19 |
| 0.25 |
| ||
| 12 | 0.28 | 0.31 | 0.30 |
|
| 0.31 |
| 24 | 0.38 | 0.45 |
| 0.35 |
| 0.45 |
| 48 | 0.51 | 0.58 |
| 0.48 |
| 0.60 |
| 50 MSP | ||||||
| 6 |
| 0.27 | 0.39 |
| ||
| 12 |
| 0.43 | 0.46 | 0.35 |
| 0.42 |
| 24 |
| 0.55 | 0.52 | 0.46 |
| 0.55 |
| 48 |
| 0.68 | 0.60 | 0.59 |
| 0.68 |
| Significantly down-regulated pathways | ||||||
| 20 MSP | ||||||
| 6 | 0.37 |
| 0.35 |
| ||
| 12 | 0.49 | 0.31 | 0.40 |
|
| 0.34 |
| 24 | 0.58 | 0.43 | 0.45 |
|
| 0.46 |
| 48 | 0.68 | 0.55 |
| 0.51 |
| 0.60 |
| 50 MSP | ||||||
| 6 | 0.39 |
| 0.48 |
| ||
| 12 | 0.51 | 0.45 | 0.54 |
|
| 0.47 |
| 24 | 0.60 | 0.56 | 0.60 |
|
| 0.59 |
| 48 | 0.70 | 0.68 | 0.65 |
|
| 0.70 |
aMSP most significant pathway
bUnderlined text indicates the minimal value in a row
cBold text indicates the maximal value in a row
The computations are based on each method’s identification of the top 20 or 50 most significantly up- or down-regulated pathways for all datasets (and 500 randomly resampled sub-datasets for each original dataset). The sample size corresponds to the number of samples contained in the sub-datasets. A higher average recall reflects a more consistent PA method
Fig. 4Comparison of discrimination values computed for the six pathway analysis methods, based on the top 20 most significantly up-regulated (a, b) and top 20 most significantly down-regulated pathways (c, d), which each method identified for the 10 gene expression datasets. The sample size corresponds to the number of samples contained in the sub-datasets randomly resampled from the 10 datasets