| Literature DB >> 35580053 |
Christopher A Mancuso1, Patrick S Bills2, Douglas Krum2, Jacob Newsted2, Renming Liu1, Arjun Krishnan1,3.
Abstract
Biomedical researchers take advantage of high-throughput, high-coverage technologies to routinely generate sets of genes of interest across a wide range of biological conditions. Although these technologies have directly shed light on the molecular underpinnings of various biological processes and diseases, the list of genes from any individual experiment is often noisy and incomplete. Additionally, interpreting these lists of genes can be challenging in terms of how they are related to each other and to other genes in the genome. In this work, we present GenePlexus (https://www.geneplexus.net/), a web-server that allows a researcher to utilize a powerful, network-based machine learning method to gain insights into their gene set of interest and additional functionally similar genes. Once a user uploads their own set of human genes and chooses between a number of different human network representations, GenePlexus provides predictions of how associated every gene in the network is to the input set. The web-server also provides interpretability through network visualization and comparison to other machine learning models trained on thousands of known process/pathway and disease gene sets. GenePlexus is free and open to all users without the need for registration.Entities:
Year: 2022 PMID: 35580053 PMCID: PMC9252732 DOI: 10.1093/nar/gkac335
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.The workflow of the GenePlexus web-server. First the user uploads a gene set of interest and choses the network and representation and how the negative genes should be selected. Next, the data is prepared, the model is trained and the results are created. Finally, the user can retrieve gene predictions, gain insight into the trained model and visualize the network connectivity of the top genes interactively through their browser.
Figure 2.Uploading and validating the gene set. (A) The user can either paste gene IDs or upload them from a file. (B) Upon clicking the Done button, the genes are converted in Entrez ID space and the overlap of the gene set with the genes in each network is displayed.
Figure 3.Genome-wide Prediction. For every gene in the genome-scale molecular network that was used to train the model, a score is calculated of how associated it is to the user-supplied gene set and displayed as an interactive table.
Figure 4.Interpretability features of GenePlexus. (A) The model trained using the user-supplied gene set is compared to thousands of models pre-trained on known gene sets from the (A) GeneOntology and (B) DisGeNet databases. (C) The network connectivity of the top associated genes are displayed as an interactive graph.