Yi-An Chen1, Rodolfo S Allendes Osorio1, Kenji Mizuguchi1,2. 1. Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, 567-0085, Japan. 2. Institute for Protein Research, Osaka University, Osaka, 565-0871, Japan.
Abstract
SUMMARY: We introduce the newest version of TargetMine, which includes the addition of new visualization options; integration of previously disaggregated functionality; and the migration of the front-end to the newly available Bluegenes service.
AVAILABILITY AND IMPLEMENTATION: TargetMine is accessible online at https://targetmine.mizuguchilab.org/bluegenes. Users do not need to register to use the software. Source code for the different components listed in the article is available from TargetMine's organizational account at http://github.com/targetmine.
SUPPLEMENTARY INFORMATION: A brief reference user guide is available as Supplementary data at Bioinformatics online.
1 Introduction
The last decade has seen a steady increase in the number of studies related to multi-omics analysis (Krassowski ; Tarazona ). The number of reviews on ‘Multi-omics Analysis’ listed on PubMed increased from 4 in 2012 to 145 in 2021 (https://pubmed.ncbi.nlm.nih.gov. Last accessed, 1 February 2022). Multi-omics analysis can be used not only to improve the classification of biological data but also to predict variables (such as clinical outcomes), and it might even have the potential to elucidate regulatory mechanisms that span several molecular layers (Tarazona ).
A main challenge in multi-omics analysis lies in data integration (Canzler ; Krassowski ; Tarazona ). Approaches to data integration include early integration, in which data are concatenated into a single matrix; intermediate integration, in which different omics layers are analyzed jointly; and late integration, in which the results of separate analyses are integrated (Adossa ). This categorization has been extended by Picard to also consider mixed and hierarchical integration strategies. At the same time, the development of platforms for the storage of multi-omics data also remains a strong research focus, with (Eloe-Fadrosh ; Tang ; Zhou ) being only a few examples across different domains, all of them reported in this year’s Nucleic Acids Research special issue on Databases (Rigden and Fernández, 2022).
In this context, the TargetMine Data Warehouse has evolved into an integrative data analysis platform. TargetMine incorporates various types of omics data, drawn from a variety of data sources and models, to provide deep coverage of the biological data space, with a focus on target prioritization and broad-based biological knowledge discovery (Chen , 2019).
TargetMine has become established as a useful resource for the drug discovery community, as suggested by the number of citations of the original paper recorded by PubMed. By integrating new data from different, heterogeneous sources and providing new widgets for their analysis, TargetMine continues to work toward becoming a comprehensive solution for multi-omics data analysis, especially in terms of data storage and biological interpretation (Tarazona ).
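As a rough illustration of the difference between the early and late integration strategies mentioned above, the following minimal Python sketch may help. The two omics layers, the sample names and the per-layer analysis (a simple mean) are all hypothetical and chosen only for illustration; they are not taken from TargetMine or from the cited reviews.

```python
from statistics import mean

# Hypothetical toy data: two omics layers measured on the same three samples.
expression = {"s1": [2.0, 4.0], "s2": [1.0, 3.0], "s3": [5.0, 7.0]}
methylation = {"s1": [0.2], "s2": [0.8], "s3": [0.4]}


def shared_samples(layers):
    """Return, in sorted order, the samples present in every layer."""
    return sorted(set.intersection(*(set(layer) for layer in layers)))


def early_integration(*layers):
    """Early integration: concatenate all features into one row per sample."""
    return {
        sample: [value for layer in layers for value in layer[sample]]
        for sample in shared_samples(layers)
    }


def late_integration(*layers, analyse=mean):
    """Late integration: analyse each layer separately, then combine results.

    The per-layer analysis (a mean) and the combination rule (another mean)
    are placeholders for whatever method a real study would use.
    """
    return {
        sample: mean(analyse(layer[sample]) for layer in layers)
        for sample in shared_samples(layers)
    }


combined_matrix = early_integration(expression, methylation)
combined_scores = late_integration(expression, methylation)
```

In the early case a single downstream analysis sees all features at once, whereas in the late case each layer is reduced to a result before anything is combined. Intermediate integration, which models the layers jointly, is not shown here.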
2 New in TargetMine
2.1 Integration and new visualization tools
Up until now, TargetMine also included an Auxiliary Toolkit (Chen ), accessible through a separate user interface. This has now been integrated into a single user experience. The Composite Network Graph, added to the report pages of gene lists, allows interactive visualization of gene-to-gene interactions among the list members, together with their relations to other genes, microRNAs, chemical compounds and/or transcription factors found within TargetMine. Similarly, the Enrichment Display Graph, also included in the gene list report page, uses bar graphs to show the proportion of genes in the list with a given annotation compared with the proportion in the whole genome, and heatmaps to show how the individual genes in the list are matched to the corresponding enriched elements.
Completely new display widgets have also been added to TargetMine. The gene report page now includes a Gene Expression Graph (see Fig. 1), and the report page for chemical compounds now has a Bio-activity Graph. As their names suggest, these graphs allow dynamic inspection of the expression levels of individual genes and the bio-activity levels of individual chemical compounds, respectively. The first includes controls for displaying gene expression at the different levels of detail at which it is measured, whereas the second provides controls for clearly distinguishing between assays.
Fig. 1.
Sample image of TargetMine’s new interface for a list of genes. Elements have been slightly adjusted for better display here. MicroRNA associations to original genes are shown in graph format. New MGeND enrichment for the list of genes is also shown.
Details and user guides for all the aforementioned visualization tools are provided as Supplementary Material to this article.
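The comparison underlying an enrichment display of this kind (the proportion of annotated genes in a list versus the proportion in the whole genome) can be sketched as follows. This is a minimal illustration only: the function name, the counts and the choice of a hypergeometric test are assumptions made for the example, not a description of how the Enrichment Display Graph is implemented.

```python
from math import comb


def enrichment(list_hits, list_size, genome_hits, genome_size):
    """Compare annotation frequency in a gene list with the whole genome.

    Returns (fraction in list, fraction in genome, p-value), where the
    p-value is the hypergeometric probability of seeing at least
    `list_hits` annotated genes when drawing `list_size` genes at random,
    without replacement, from the genome.
    """
    list_fraction = list_hits / list_size
    genome_fraction = genome_hits / genome_size
    most_possible = min(list_size, genome_hits)
    favourable = sum(
        comb(genome_hits, k) * comb(genome_size - genome_hits, list_size - k)
        for k in range(list_hits, most_possible + 1)
    )
    p_value = favourable / comb(genome_size, list_size)
    return list_fraction, genome_fraction, p_value


# Hypothetical counts: 5 of 10 listed genes carry an annotation that
# 20 of 100 genes in the (toy) genome carry.
list_fraction, genome_fraction, p_value = enrichment(5, 10, 20, 100)
```

The two fractions are the quantities a bar graph would place side by side; the p-value indicates how surprising the difference is under random sampling. In practice a correction for multiple testing would normally be applied across annotations.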
2.2 Bluegenes migration
TargetMine is based on InterMine (Kalderimis ; Smith ), a data warehousing system that provides easy query and analysis of various heterogeneous data sources. A new front-end for InterMine, named Bluegenes (https://github.com/intermine/bluegenes), has been released to replace the old Java Server Pages (JSP)-based interface.
Because several custom elements of TargetMine were implemented as components of the JSP-based interface, they all needed to be refactored into new Bluegenes tools. Figure 1 shows an example of the new interface used for TargetMine, in particular, the one used to report information on a list of genes. Users familiar with the application will notice the modern look and feel achieved with the new front-end.
One major advantage of this approach is that each tool can be implemented as its own project and can thus be maintained individually (i.e. each is kept in its own GitHub repository). An extensive list of all the migrated tools and their corresponding repositories is provided as Supplementary Material.
2.3 New data sources
To continuously improve the coverage of the biological data space, several new data types and sources were added. These include protein binding pockets from PoSSuM (Ito ), genomic variants with clinical annotation from MGeND (see Fig. 1) (Kamada ), clinical trial data from WHO (https://trialsearch.who.int/. Last accessed, 11 March 2022) and genome annotations from NCBI (https://www.ncbi.nlm.nih.gov/genome. Last accessed, 11 March 2022). New data are accommodated by extending the data model currently used by TargetMine, which can be generally described as an object-oriented definition, transpiled into a relational database for storage purposes. More details on how this is implemented can be found in (Chen ). Applications of the new additions will be reported elsewhere.
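The general idea of an object-oriented definition being translated into a relational schema can be illustrated with the toy Python sketch below. The `Gene` class, its attributes, the type mapping and the use of SQLite are all hypothetical and serve only to show the principle; TargetMine's actual model and storage layer are those described in (Chen ).

```python
import sqlite3
from dataclasses import dataclass, fields


@dataclass
class Gene:
    """Hypothetical class; not taken from TargetMine's real data model."""
    primary_identifier: str
    symbol: str
    length: int


# Assumed mapping from Python attribute types to SQL column types.
SQL_TYPES = {str: "TEXT", int: "INTEGER", float: "REAL"}


def create_table_sql(cls):
    """Derive a CREATE TABLE statement from the attributes of a class."""
    columns = ", ".join(f"{f.name} {SQL_TYPES[f.type]}" for f in fields(cls))
    return f"CREATE TABLE {cls.__name__.lower()} ({columns})"


def insert(conn, obj):
    """Store one object as a row in the table derived from its class."""
    names = [f.name for f in fields(obj)]
    placeholders = ", ".join("?" for _ in names)
    conn.execute(
        f"INSERT INTO {type(obj).__name__.lower()} "
        f"({', '.join(names)}) VALUES ({placeholders})",
        [getattr(obj, name) for name in names],
    )


conn = sqlite3.connect(":memory:")
conn.execute(create_table_sql(Gene))
insert(conn, Gene("G0001", "GENE1", 1200))  # illustrative values only
rows = conn.execute("SELECT symbol, length FROM gene").fetchall()
```

Under this scheme, adding a new data type amounts to adding a class (or new attributes to an existing one) and regenerating the schema, which is the sense in which extending the model accommodates new sources.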
3 Discussion
We believe TargetMine to be a highly valued data warehouse within the drug discovery research community, as indicated by the daily access it receives from countries across five continents. In response to the support shown by the community, we constantly strive to improve the service, with monthly data updates and regular software updates reflecting our commitment to this end.
Here, we introduced some of the major updates made to TargetMine over the past couple of years, namely, its migration to a new front-end and the development of new visualization widgets tailored to specific data elements within the repository.