| Literature DB >> 33936065 |
Sanchita Bhattacharya1,2, Zicheng Hu1,2, Atul J Butte1,2.
Abstract
The field of immunology is rapidly progressing toward a systems-level understanding of immunity to tackle complex infectious diseases, autoimmune conditions, cancer, and beyond. In the last couple of decades, advancements in data acquisition techniques have presented opportunities to explore untapped areas of immunological research. Broad initiatives are launched to disseminate the datasets siloed in the global, federated, or private repositories, facilitating interoperability across various research domains. Concurrently, the application of computational methods, such as network analysis, meta-analysis, and machine learning have propelled the field forward by providing insight into salient features that influence the immunological response, which was otherwise left unexplored. Here, we review the opportunities and challenges in democratizing datasets, repositories, and community-wide knowledge sharing tools. We present use cases for repurposing open-access immunology datasets with advanced machine learning applications and more.Entities:
Keywords: data reuse; democratization; immunology; open-access; public repositories
Year: 2021 PMID: 33936065 PMCID: PMC8086961 DOI: 10.3389/fimmu.2021.647536
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Democratization of datasets and computational tools. The Jupyter logo was used under Copyright © 2017 Project Jupyter Contributors. https://github.com/jupyter/jupyter.github.io/blob/master/assets/main-logo.svg; The Scikit learn logo is under Copyright © The scikit-learn developers. Source:-https://commons.wikimedia.org/wiki/File:Scikit_learn_logo_small.svg; NumPy logo source:- The NumPy logo is created by NumPy Team, 2020; https://github.com/numpy/numpy/blob/main/branding/logo/logomark/numpylogoicon.svg; Python logos are trademarks or registered trademarks of the Python Software Foundation, used with permission from the Foundation. Source:- https://legacy.python.org/community/logos/; Galaxy Project: https://galaxyproject.org/images/galaxy-logos/; Gen3:- The logo was used under the permission from Center for Translational Data Science at University of Chicago. Shiny- Shiny are trademarks of RStudio, PBC. https://github.com/rstudio/hex-stickers/blob/master/PNG/shiny.png; The R logo is © 2016 The R Foundation. (CC-BY-SA 4.0); Docker- Docker and the Docker logo are trademarks of Docker, Inc. in the United States and/or other countries. https://www.docker.com/company/newsroom/media-resources; Github- GITHUB®, the GITHUB® logo design are exclusive trademarks registered in the United States by GitHub, Inc, source:-https://github.com/logos.
List of publications leveraging open-access immunological datasets.
| Authors | Pubmed ID | Datasets | Study type | Description |
|---|---|---|---|---|
| Orange et al. ( | 29468833 | Transcriptomics and histology | Machine learning | Identify RA subgroups using machine learning models |
| Hu et al. ( | 32801215 | CyTOF | Machine learning | Identify latent CMV infection using a deep learning model |
| Gielis et al. ( | 31849987 | TCR sequencing | Machine learning | Predict antigen specificity using a machine learning model |
| Berry et al. ( | 20725040 | Transcriptomics | Meta-analysis | Identify transcription signature |
| Sweeney et al. ( | 27384347 | Transcriptomics | Meta-analysis | Classify viral and bacterial infections using transcription signature |
| Jiang et al. ( | 30127393 | Transcriptomics | Meta-analysis | Identify T cell suppression and exclusion signatures. |
| McClain et al. ( | 32743603 | Transcriptomics | Biomarker analyses and validation using public datasets | Host response to SARS-CoV-2 infection through RNA sequencing |
| Kidd et al. ( | 26619012 | Transcriptomics | Drug repurposing | Mapping the effects of drugs on the state-transition of immune cells |