| Literature DB >> 31066453 |
Uku Raudvere1, Liis Kolberg1, Ivan Kuzmin1, Tambet Arak1, Priit Adler1,2, Hedi Peterson1,2, Jaak Vilo1,2,3.
Abstract
Biological data analysis often deals with lists of genes arising from various studies. The g:Profiler toolset is widely used for finding biological categories enriched in gene lists, conversions between gene identifiers and mappings to their orthologs. The mission of g:Profiler is to provide a reliable service based on up-to-date high quality data in a convenient manner across many evidence types, identifier spaces and organisms. g:Profiler relies on Ensembl as a primary data source and follows their quarterly release cycle while updating the other data sources simultaneously. The current update provides a better user experience due to a modern responsive web interface, standardised API and libraries. The results are delivered through an interactive and configurable web design. Results can be downloaded as publication ready visualisations or delimited text files. In the current update we have extended the support to 467 species and strains, including vertebrates, plants, fungi, insects and parasites. By supporting user uploaded custom GMT files, g:Profiler is now capable of analysing data from any organism. All past releases are maintained for reproducibility and transparency. The 2019 update introduces an extensive technical rewrite making the services faster and more flexible. g:Profiler is freely available at https://biit.cs.ut.ee/gprofiler.Entities:
Mesh:
Year: 2019 PMID: 31066453 PMCID: PMC6602461 DOI: 10.1093/nar/gkz369
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The growth of available species/strains in g:Profiler since 2012. Each column represents an archive version, the current version is highlighted in blue. The isolated peak in 1185_e69_eg16 refers to a version where we incorporated bacterial species to g:Profiler but due to differences in the underlying database structure we dropped them again from further releases.
Figure 2.Example of g:GOSt multiquery Manhattan plot. (A) Name of the query. (B) X-axis shows the functional terms grouped and colour-coded by data source. (C, D) The position of terms in the plots is fixed and terms from the same (GO) branch are close to each other. (E) P-values in the table outputs are color-coded from yellow (insignificant) to blue (highly significant). (F) Data sources that were not included in the analysis are shown in grey. (G) Hovering over the term circle shows a tooltip with the most relevant information. (H) In case of a multiquery, the same term is also highlighted on other plots. (I) The P-values from other queries are indicated in red next to the y-axis for easier comparison. (J) A click allows to pin the circles to the plot together with a numeric ID and creates a more detailed result table below the image.