| Literature DB >> 24423189 |
Hyunjung Kang, Ikjung Choi, Sooyoung Cho, Daeun Ryu, Sanghyuk Lee, Wankyu Kim1.
Abstract
BACKGROUND: Gene set analysis (GSA) is useful in deducing biological significance of gene lists using a priori defined gene sets such as gene ontology (GO) or pathways. Phenotypic annotation is sparse for human genes, but is far more abundant for other model organisms such as mouse, fly, and worm. Often, GSA needs to be done highly interactively by combining or modifying gene lists or inspecting gene-gene interactions in a molecular network. DESCRIPTION: We developed gsGator, a web-based platform for functional interpretation of gene sets with useful features such as cross-species GSA, simultaneous analysis of multiple gene sets, and a fully integrated network viewer for visualizing both GSA results and molecular networks. An extensive set of gene annotation information is amassed including GO & pathways, genomic annotations, protein-protein interaction, transcription factor-target (TF-target), miRNA targeting, and phenotype information for various model organisms. By combining the functionalities of Set Creator, Set Operator and Network Navigator, user can perform highly flexible and interactive GSA by creating a new gene list by any combination of existing gene sets (intersection, union and difference) or expanding genes interactively along the molecular networks such as protein-protein interaction and TF-target. We also demonstrate the utility of our interactive and cross-species GSA implemented in gsGator by several usage examples for interpreting genome-wide association study (GWAS) results. gsGator is freely available at http://gsGator.ewha.ac.kr.Entities:
Mesh:
Year: 2014 PMID: 24423189 PMCID: PMC3898093 DOI: 10.1186/1471-2105-15-13
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of the supported features in GSA tools
| gsGator | O | O | O | O | O | O | O | O | O | 7 species | |
| DAVID | O | O | O | O | | O | | O | | All species | [ |
| g:Profiler | O | | O | O | O | O | O | | | 85 species | [ |
| Algal Functional Annotation Tool | O | O | | | | | | | | 2 fungal species | [ |
| GeneTerm Linker | O | O | | | | | | | | H. sapiens yeast | [ |
| KOBAS 2.0 | O | | | O | | | O | | | 1327 species | [ |
| ADGO 2.0 | O | | | O | | | | | | 8 species | [ |
| ToppCluster | O | O | O | O | O | O | | | | H. sapiens | [ |
| PhenoFam | | O | | | | | | | | 48 species | [ |
| agriGO | O | | | | | | | | | 38 species | [ |
| Lists2Networks | O | O | O | O | O | O | | O | O | Mammals | [ |
| GeneCodis3 | O | O | | | | O | | O | | 16 species | [ |
| FuncAssociate (ver 2.0) | O | | | | | | | | | 37 species | [ |
| GeneWeaver | O | O | O | O | O | O | 9 species | [ | |||
Figure 1A typical example of interactive and cross-species GSA in gsGator. A. Preparation of input gene set using set operation, orthology mapping, and network expansion tools. Two human gene sets (H1, H2) are combined to generate H1∩H2 by intersection operation. An orthologous set of M1 is generated by orthology mapping to mouse genes from H1∩H2. The M1 set is expanded via a molecular network, resulting in M2. B. GSA is performed between predefined gene sets in the public category and M1, M2 in the private category. The GSA result is shown as a table or a network among gene sets, where significantly associated gene sets are connected to each other.
The coverage of gene functional annotation and molecular networks by major annotation DBs
| | 42,059 | 33,584 | 45,727 | 22,930 | 57,992 | 4,497 | 6,353 | | |
| 34,710 (82.5%) | 27,274 (81.2%) | 18,734 (41.0%) | 20,106 (87.7%) | 48,128 (83.0%) | 2,531 (56.3%) | 6,303 (99.2%) | | ||
| 17,198 (40.9%) | 0 | 6,513 (14.2%) | 16,665 (72.7%) | 9,966 (17.2%) | 0 | 5,620 (88.5%) | | ||
| | 0 | 447 (1.1%) | 848 (2%) | 377 (0.9%) | 0 | 583 (1.4%) | 2,255 (5.4%) | ||
| 9,621 (22.9%) | 6,911 (20,6%) | 2,681 (5.9%) | 5,622 (24.5%) | 1,588 (2.7%) | 47 (1.0%) | 5,479 (86.2%) | | ||
| 1,814 (4.3%) | 634 (1.5%) | 1,365 (3.2%) | 138 (0.3%) | 0 | 1,658 (3.9%) | 5,609 (13.3%) |
Figure 2The gain of annotation coverage for human genes by orthology mapping for phenotype (orange) and protein-protein interaction network (blue) in gsGator. (At: A. thaliana, Ce: C. elegans, Dm: D. melanogaster, Mm: M. musculus, Sc: S. cerevisiae, All: all the 5 model organisms).
Figure 3A schematic diagram of preparing the list of input genes in the three case examples. In case 1, the input human genes are simply mapped to mouse genes by orthology. In case 2, the human genes are expanded by molecular network e.g. protein-protein interaction (PPI). In case 3, the input genes are first, network expanded and then mapped to mouse genes by orthology. Some of the human genes may have no orthologous (E and I) or multiple orthologous genes (H → h, h’) in mouse.
The GSA result by a simple cross-species GSA (case example 1)
| 16 | 87 | 2 | 1.9E-05 | 9.1.E-03 | Prenatal lethality |
| 17 | 975 | 3 | 3.2E-05 | 1.4.E-02 | Decreased body size |
| 18 | 125 | 2 | 4.0E-05 | 1.7.E-02 | Infertility |
| 19 | 237 | 2 | 1.4E-04 | 5.4.E-02 | Increased susceptibility to bacterial infection |
| 20 | 365 | 2 | 3.4E-04 | 9.1.E-02 | Complete prenatal lethality |
| 21 | 519 | 2 | 6.8E-04 | 1.1.E-01 | Male infertility |
| 22 | 574 | 2 | 8.3E-04 | 1.3.E-01 | Complete postnatal lethality |
| 23 | 1258 | 2 | 3.9E-03 | 2.9.E-01 | Premature death |
| 24 | 1642 | 2 | 6.5E-03 | 3.9.E-01 | No abnormal phenotype detected |
The related annotations to eye color is indicated in bold type.
The GSA result by a network-expanded GSA (case example 2)
| 2 | 3 | 2 | 2.7.E-05 | 7.2.E-03 | Polycystic ovarian syndrome |
The related annotations to adiposity is indicated in bold type.
The GSA result by a network-expanded GSA only (case example 3)
| 1 | 5 | 4 | 9.3.E-12 | 4.9.E-09 | Pregnancy loss, recurrent |
| 2 | 12 | 4 | 9.1.E-10 | 2.4.E-07 | Stroke |
| 3 | 6 | 3 | 3.3.E-08 | 5.9.E-06 | Stroke, ischemic |
| 4 | 2 | 2 | 1.4.E-06 | 1.5.E-04 | Fetal loss |
| 6 | 4 | 2 | 8.6.E-06 | 7.6.E-04 | Peripheral vascular disease |
| 7 | 5 | 2 | 1.4.E-05 | 1.1.E-03 | Polycystic ovary syndrome |
| 8 | 7 | 2 | 3.0.E-05 | 2.0.E-03 | Cerebrovascular disease |
| 9 | 11 | 2 | 7.8.E-05 | 4.6.E-03 | Myocardial infarction |
| 10 | 12 | 2 | 9.3.E-05 | 5.0.E-03 | Cardiovascular disease |
| 11 | 25 | 2 | 4.2.E-04 | 2.0.E-02 | Myocardial infarct |
The related annotations to venous thrombosis is indicated in bold type.
The GSA result by a network-expanded and cross-species GSA (case example 3)
| 4 | 150 | 5 | 7.0.E-08 | 1.2.E-04 | Increased leukocyte cell number |
| 5 | 57 | 4 | 7.9.E-08 | 1.2.E-04 | Abnormal cell adhesion |
| 6 | 1258 | 9 | 1.5.E-07 | 1.9.E-04 | Premature death |
| 7 | 75 | 4 | 2.4.E-07 | 2.6.E-04 | Decreased susceptibility to endotoxin shock |
| 9 | 94 | 4 | 6.0.E-07 | 5.0.E-04 | Increased monocyte cell number |
| 10 | 30 | 3 | 1.3.E-06 | 9.1.E-04 | Abnormal cellular extravasation |
| 13 | 33 | 3 | 1.7.E-06 | 1.0.E-03 | Decreased susceptibility to bacterial infection induced morbidity/mortality |
| 14 | 153 | 4 | 4.1.E-06 | 2.1.E-03 | Decreased erythrocyte cell number |
| 15 | 153 | 4 | 4.1.E-06 | 2.1.E-03 | Increased neutrophil cell number |
| 16 | 50 | 3 | 6.1.E-06 | 2.9.E-03 | Impaired macrophage chemotaxis |
| 21 | 60 | 3 | 1.1.E-05 | 3.8.E-03 | Impaired neutrophil recruitment |
| 22 | 8 | 2 | 1.4.E-05 | 4.4.E-03 | Increased susceptibilityto infection induced morbidity/mortality |
| 23 | 65 | 3 | 1.4.E-05 | 4.4.E-03 | Abnormal leukocyte migration |
| 24 | 211 | 4 | 1.5.E-05 | 4.6.E-03 | Increased sensitivityto induced morbidity/mortality |
| 25 | 69 | 3 | 1.6.E-05 | 4.8.E-03 | Increased eosinophil cell number |
| 26 | 70 | 3 | 1.7.E-05 | 4.8.E-03 | Abnormal myelopoiesis |
| 27 | 9 | 2 | 1.7.E-05 | 4.8.E-03 | Hemothorax |
| 28 | 72 | 3 | 1.8.E-05 | 4.9.E-03 | Abnormal cell migration |
| 32 | 86 | 3 | 3.1.E-05 | 7.3.E-03 | Decreased angiogenesis |
| 33 | 13 | 2 | 3.8.E-05 | 8.5.E-03 | Decreased susceptibility to induced choroidal neovascularization |
| 34 | 100 | 3 | 4.9.E-05 | 1.1.E-02 | Abnormal inflammatory response |
| 35 | 289 | 4 | 4.9.E-05 | 1.1.E-02 | Anemia |
| 37 | 18 | 2 | 7.3.E-05 | 1.5.E-02 | Abnormal uterine environment |
| 38 | 20 | 2 | 9.1.E-05 | 1.8.E-02 | Abnormal physiological neovascularization |
| 40 | 22 | 2 | 1.1.E-04 | 2.0.E-02 | Pregnancy-related premature death |
| 43 | 24 | 2 | 1.3.E-04 | 2.3.E-02 | Increased susceptibility to fungal infection |
| 44 | 33 | 2 | 2.5.E-04 | 4.3.E-02 | Decreased cerebral infarction size |
| 45 | 180 | 3 | 2.7.E-04 | 4.6.E-02 | Pallor |
The related annotations to venous thrombosis is indicated in bold type.