| Literature DB >> 32374843 |
Gaurhari Dass1, Manh-Tu Vu1, Pan Xu2, Enrique Audain3, Marc-Phillip Hitz3, Björn A Grüning4, Henning Hermjakob1,2, Yasset Perez-Riverol1.
Abstract
The Omics Discovery Index is an open source platform that can be used to access, discover and disseminate omics datasets. OmicsDI integrates proteomics, genomics, metabolomics, models and transcriptomics datasets. Using an efficient indexing system, OmicsDI integrates different biological entities including genes, transcripts, proteins, metabolites and the corresponding publications from PubMed. In addition, it implements a group of pipelines to estimate the impact of each dataset by tracing the number of citations, reanalysis and biological entities reported by each dataset. Here, we present the OmicsDI REST interface (www.omicsdi.org/ws/) to enable programmatic access to any dataset in OmicsDI or all the datasets for a specific provider (database). Clients can perform queries on the API using different metadata information such as sample details (species, tissues, etc), instrumentation (mass spectrometer, sequencer), keywords and other provided annotations. In addition, we present two different libraries in R and Python to facilitate the development of tools that can programmatically interact with the OmicsDI REST interface.Entities:
Year: 2020 PMID: 32374843 PMCID: PMC7319562 DOI: 10.1093/nar/gkaa326
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Most relevant methods provided by OmicsDI REST Interface
| Category | Method | Description | Type of method |
|---|---|---|---|
|
| /dataset/search | Search for datasets in OmicsDI | GET |
| /dataset/{database}/{accession} | Retrieve a specific dataset from OmicsDI | GET | |
| /dataset/batch | Retrieve a batch of datasets | GET | |
| /dataset/getFileLinks | Retrieve all file links for a given dataset | GET | |
| /dataset/{database}/{accession}/files | Retrieve the list of dataset's files | GET | |
| /dataset/latest | Retrieve the latest datasets added to OmicsDI | GET | |
| /dataset/getSimilarByPubmed | Retrieve similar datasets based on PubMed identifier | GET | |
| /dataset/getSimilar | Retrieve the related datasets to one Dataset | GET | |
|
| /database/all | Retrieve OmicsDI databases/repositories | GET |
|
| /term/getTermByPattern | Search dictionary Terms | GET |
| /term/frequentlyTerm/list | Retrieve frequently terms | GET | |
|
| /statistics/organisms | Return statistics per organisms | GET |
| /statistics/tissues | Return statistics per tissues | GET | |
| /statistics/omics | Return statistics per Omics type | GET | |
| /statistics/diseases | Return statistics per diseases | GET | |
| /statistics/domains | Return statistics per Repository | GET | |
|
| /seo/home | Retrieve JSON+LD for home page | GET |
| /seo/search | Retrieve JSON+LD for browse page | GET | |
| /seo/api | Retrieve JSON+LD for api page | GET | |
| /seo/schema/{database}/{accession} | Retrieve JSON+LD Schema for dataset page | GET | |
| /seo/database | Retrieve JSON+LD for databases page | GET | |
| /seo/dataset/{database}/{accession} | Retrieve JSON+LD for dataset page | GET |
Figure 1.Data file geolocation feature in OmicsDI Rest API. The user request localization defines the instance of the dataset that will be retrieved by the API. In this example, if a user based in the United States requests ‘File3’, the URL of ‘File3’ provided by the MassIVE repository at the University of California, San Diego, will be returned. If a user based in Europe requests ‘File3’, the URL of ‘File3’ provided by the PRIDE repository at EMBL-EBI in Cambridge, UK, will be provided.
Figure 2.For a given dataset, the OmicsDI Rest Interface allows to the dataset files closest to the IP client request. For example, for dataset www.omicsdi.org/dataset/pride/PXD003213 the API marks as primary source the (A) MassIVE files when the query is performed from the United States (curl –header ‘X-Forwarded-For: 66.165.239.58’www.omicsdi.org/ws/dataset/pride/PXD003213?); and (B) PRIDE files are marked as primary when the request is performed from the United Kingdom (curl –header ‘X-Forwarded-For: 193.62.193.80’www.omicsdi.org/ws/dataset/pride/PXD003213).