| Literature DB >> 20152038 |
Alexander Lachmann1, Avi Ma'ayan.
Abstract
BACKGROUND: Systems biologists are faced with the difficulty of analyzing results from large-scale studies that profile the activity of many genes, RNAs and proteins, applied in different experiments, under different conditions, and reported in different publications. To address this challenge it is desirable to compare the results from different related studies such as mRNA expression microarrays, genome-wide ChIP-X, RNAi screens, proteomics and phosphoproteomics experiments in a coherent global framework. In addition, linking high-content multilayered experimental results with prior biological knowledge can be useful for identifying functional themes and form novel hypotheses.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20152038 PMCID: PMC2843617 DOI: 10.1186/1471-2105-11-87
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1System architecture of the L2N software. The system contains several related components (blue ellipses). Registered users are able to upload lists of genes/proteins. Set operations allow users to create new lists from existing lists. Overlap analyses use background databases of gene-list libraries to compute enrichment. Lists of genes are able to be expanded using background knowledge networks of PPIs, co-expression, and co-annotations. Users have the capability to share their lists with other users and communicate with other users through a messaging system.
Figure 2Bottom center: Screenshot from the overlap component showing sample analysis. The background knowledge category chosen is KEA kinase-substrate enrichment. Top right: Expand Lists component screenshot. Uploaded lists can be expanded using protein-protein interactions network, co-expression network, or co-annotation network. Users can set two parameters: sensitivity and number of steps. This allows controlling the size of the expanded lists. Top left: Network of lists representation. Uploaded user lists (blue) are connected if they overlap significantly. Green and black nodes represent labeled gene-lists from a specific library (kinases and their known substrates) that overlap with the uploaded lists. Green nodes represent annotated lists that overlap with only one input list whereas black nodes are lists that overlap with two or more input lists.
GMT files used for gene-list overlap analysis in L2N
| Name | Description | Source | GMT File Creator |
|---|---|---|---|
| WikiPathways_pathways | Biological pathways | Ma'ayan lab | |
| Reactome_pathways | Biological pathways | Ma'ayan lab | |
| GenMAPP_patwhays | Biological pathways | MSigDB | |
| KEGG_pathways | Signaling pathways | MSigDB | |
| BioCarta_pathways | Signaling pathways | MSigDB | |
| Chromosome_location | Chromosome location | MSigDB | |
| TFs_predicted_binidng_sites | TRANSFAC predicted sites | Ma'ayan lab | |
| TFs_chip_interactions | ChIP interactions collected from literature by the Ma'ayan lab | Various sources from individual publications | Ma'ayan lab |
| Gene_Ontology_BP | Gene Ontology Biological Process | Ma'ayan lab | |
| Gene_Ontology_MF | Gene Ontology Molecular Function | Ma'ayan lab | |
| Gene_Ontology_CC | Gene Ontology Cellular Component | Ma'ayan lab | |
| HMDB_metabolites | Genes associated with Metabolites | Ma'ayan lab | |
| OMIM_disease_genes | Genes associated with a specific disease | Ma'ayan lab | |
| OMIM_disease_neighborhoods | Genes associated with a specific disease expanded using protein interactions and the genes2networks algorithm | Ma'ayan lab | |
| KEA_kinases | Substrates of protein kinases collected from different databases | Ma'ayan lab | |
| PFAM_InterPro_domains | Proteins that share a structural domain | Ma'ayan lab | |
| Predcited_microRNAs | Predicted microRNAs that bind to mRNA of sets of genes | Ma'ayan lab | |
| Protein_interaction_hubs | Proteins that interacts with many other proteins | Protein interactions collected from several mammalian protein interactions databases | Ma'ayan lab |
Columns include: a) File name as it appears in the drop-down box in the overlap feature on the web-page; b) Description of the content of the gene-list library; c) URL of the source of the data. d) Library created by.