Literature DB >> 35894632

TargetMine 2022: A new vision into drug target analysis.

Yi-An Chen¹, Rodolfo S Allendes Osorio¹, Kenji Mizuguchi^1,2.

Abstract

SUMMARY: We introduce the newest version of TargetMine, which includes the addition of new visualization options; integration of previously disaggregated functionality; and the migration of the front-end to the newly available Bluegenes service.
AVAILABILITY AND IMPLEMENTATION: TargeteMine is accessible online at https://targetmine.mizuguchilab.org/bluegenes. Users do not need to register to use the software. Source code for the different components listed in the article is available from TargetMine's organizational account at http://github.com/targetmine. SUPPLEMENTARY INFORMATION: A brief reference user guide is available as Supplementary data at Bioinformatics online.

Entities: Chemical

Year: 2022 PMID： 35894632 PMCID： PMC9477527 DOI： 10.1093/bioinformatics/btac507

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.931

1 Introduction

The last decade has seen a steady increase in the number of studies related to multi-omics analysis (Krassowski ; Tarazona ). References for ‘Multi-omics Analysis’ reviews listed on PubMed increases from 4 in 2012 to 145 in 2021 (https://pubmed.ncbi.nlm.nih.gov. Last accessed, 1 February 2022). Multi-omics analysis can not only be used to improve the classification of biological data but also for the prediction of variables (such as clinical outcomes), and it might even have the potential to elucidate regulatory mechanisms that include several molecular layers (Tarazona ). A main challenge in multi-omics analysis lies in data integration (Canzler ; Krassowski ; Tarazona ). Approaches on data integration include early integration—data are concatenated into a single matrix; intermediate integration—jointly analyze different omics layers together; and late integration—integrate the analysis results (Adossa ). This categorization has been extended by Picard to also consider mixed and hierarchical integration strategies. At the same time, the development of platforms for the storage of multi-omics data also remains a strong research focus, with (Eloe-Fadrosh ; Tang ; Zhou ) being only a few examples across different domains, all of them reported in this year’s Nucleic Acid Research’s special issue on Databases (Rigden and Fernández, 2022). In this context, the TargetMine Data Warehouse has evolved into an integrative data analysis platform. TargetMine incorporates various types of omics data, sourced from a variety of data sources and models to provide a deep coverage of the biological data space, with a focus on target prioritization and broad-based biological knowledge discovery (Chen , 2019). Consolidated as a useful resource for the drug discovery scientific community (as suggested by the number of citations of the original paper as recorded by PubMed), through the integration of new data from different, heterogeneous sources, and by providing new widgets for its analysis, TargetMine continues to strive in becoming an integral solution to multi-omics data analysis, especially in terms of data storage and biological interpretation (Tarazona ).

2 New in TargetMine

2.1 Integration and new visualization tools

Up until now, TargetMine also included an Auxiliary Toolkit (Chen ), accessible through a separated user interface. This has now been integrated into a single-user experience. The display of a Composite Network Graph, added to report pages of gene lists, allows interactive visualization of gene-to-gene interactions among the list members, together with their relation to other genes, microRNA, chemical compounds and/or transcription factors found within TargetMine. Similarly, the Enrichment Display Graph, also included in the gene list report page, shows through bar graphs and heatmaps, the proportion of genes with a given annotation compared with the annotation of the whole genome, or how the individual genes in the list are matched to the corresponding enriched elements, respectively. Completely new display widgets have also been added to TargetMine. The gene report page now includes a Gene Expression Graph (see Fig. 1); and the report page for chemical compounds has now a Bio-activity Graph. As suggested by their names, both these graphs allow to dynamically inspect either the expression or bio-activity levels of individual genes or chemical compounds, respectively. Whilst the first includes controls to handle the display at different levels of detail where the gene expression is measured; the second provides controls to clearly identify different assays.

Fig. 1.

Sample image of TargetMine’s new interface for a list of genes. Elements have been slightly adjusted for better display here. MicroRNA associations to original genes are shown in graph format. New MGeND enrichment for the list of genes is also shown Details and user guides for all the aforementioned visualization tools are provided as Supplementary Material to this article.

2.2 Bluegenes migration

TargetMine is based on InterMine (Kalderimis ; Smith ), a data warehousing system that provides easy query and analysis of various heterogeneous data sources. Paired to InterMine, a new front-end named Bluegenes (https://github.com/intermine/bluegenes), meant to replace the old Java Server Pages (JSP)-based interface has been released. As several customly implemented elements of TargetMine were implemented as components of the JSP-based interface, they all needed to be refactored into new Bluegenes tools. Figure 1 shows an example of the new interface used for TargetMine, in particular, the one used to report information of a list of genes. Users familiar with the application will notice the new, modern feel and look achieved with the new front-end. One major advantage of this approach is that each element can be implemented as its own project, and thus can be individually maintained (i.e. is kept on its own GitHub repository). An extensive list of all the migrated tools and their corresponding repositories is provided as Supplementary Material.

2.3 New data sources

In order to continuously improve the coverage of the biological data space, some new data types and sources were added. These new data sources include protein binding pockets from PoSSuM (Ito ), genomic variant with clinical annotation from MGeND (see Fig. 1) (Kamada ), clinical trial data from WHO (https://trialsearch.who.int/. Last accessed, 11 March 2022) and also genome annotations from NCBI (https://www.ncbi.nlm.nih.gov/genome. Last accessed, 11 March 2022). New data are accommodated by extending the data model currently used by TargetMine, which can be generally described as an Object Oriented definition, transpilled into a Relational database for storage purposes. More details on how this is implemented can be found in (Chen ). Applications of the new additions will be reported elsewhere.

3 Discussion

We believe TargetMine to be a highly valued data warehouse within the drug discovery research community, as proved by the continuous access that it has on a daily basis, from countries across five continents. As a response to the support shown by the community, we constantly strive to improve the service, with monthly data updates and constant software updates being a proof of our commitment toward this end. Here, we introduced some of the major updates made to TargetMine over the past couple of years, namely, its migration to a new front-end and the development of new visualization widgets, customly targeted to specific data elements within the repository. Click here for additional data file.

14 in total

Review 1. Prospects and challenges of multi-omics data integration in toxicology.

Authors: Sebastian Canzler; Jana Schor; Wibke Busch; Kristin Schubert; Ulrike E Rolle-Kampczyk; Hervé Seitz; Hennicke Kamp; Martin von Bergen; Roland Buesen; Jörg Hackermüller
Journal: Arch Toxicol Date: 2020-02-08 Impact factor: 5.153

Review 2. Computational strategies for single-cell multi-omics integration.

Authors: Nigatu Adossa; Sofia Khan; Kalle T Rytkönen; Laura L Elo
Journal: Comput Struct Biotechnol J Date: 2021-04-27 Impact factor: 7.271

3. InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data.

Authors: Richard N Smith; Jelena Aleksic; Daniela Butano; Adrian Carr; Sergio Contrino; Fengyuan Hu; Mike Lyne; Rachel Lyne; Alex Kalderimis; Kim Rutherford; Radek Stepan; Julie Sullivan; Matthew Wakeling; Xavier Watkins; Gos Micklem
Journal: Bioinformatics Date: 2012-09-27 Impact factor: 6.937

4. PoSSuM v.2.0: data update and a new function for investigating ligand analogs and target proteins of small-molecule drugs.

Authors: Jun-ichi Ito; Kazuyoshi Ikeda; Kazunori Yamada; Kenji Mizuguchi; Kentaro Tomii
Journal: Nucleic Acids Res Date: 2014-11-17 Impact factor: 16.971

TargetMine 2022: A new vision into drug target analysis.

1 Introduction

2 New in TargetMine

2.1 Integration and new visualization tools

2.2 Bluegenes migration

2.3 New data sources

3 Discussion

Review 1. Prospects and challenges of multi-omics data integration in toxicology.

Review 2. Computational strategies for single-cell multi-omics integration.

3. InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data.

4. PoSSuM v.2.0: data update and a new function for investigating ligand analogs and target proteins of small-molecule drugs.

5. An integrative data analysis platform for gene set analysis and knowledge discovery in a data warehouse framework.

6. The TargetMine Data Warehouse: Enhancement and Updates.

Review 7. State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing.

8. The 2022 Nucleic Acids Research database issue and the online molecular biology database collection.

9. CyanoOmicsDB: an integrated omics database for functional genomic analysis of cyanobacteria.

10. MGeND: an integrated database for Japanese clinical and genomic information.