| Literature DB >> 28600904 |
Abstract
Docker virtualization allows for software tools to be executed in an isolated and controlled environment referred to as a container. In Docker containers, dependencies are provided exactly as intended by the developer and, consequently, they simplify the distribution of scientific software and foster reproducible research. The Docker paradigm is that each container encapsulates one particular software tool. However, to analyze complex biomedical data sets, it is often necessary to combine several software tools into elaborate workflows. To address this challenge, several Docker containers need to be instantiated and properly integrated, which complicates the software deployment process unnecessarily. Here, we demonstrate how an extension to Docker, Docker compose, can be used to mitigate these problems by providing a unified setup routine that deploys several tools in an integrated fashion. We demonstrate the power of this approach by example of a Docker compose setup for a drug target screening platform consisting of five integrated web applications and shared infrastructure, deployable in just two lines of codes.Entities:
Keywords: deployment; docker; high-throughput screening; scalability; virtualization
Mesh:
Year: 2017 PMID: 28600904 PMCID: PMC6042832 DOI: 10.1515/jib-2017-0016
Source DB: PubMed Journal: J Integr Bioinform ISSN: 1613-4516
Figure 1:The web tools integrated here via Docker compose constitute a drug target screening platform. (i) OpenLabFramework is used to track functional genomics experiments that yield modified cell lines. (ii) These cell lines are then typically seeded across dozens of microtiter plates in a high-throughput screening experiment logistically tracked by SAVANAH. Here, each well constitutes its own experiment, e.g. a single gene knockout. The outcome is often a fluorescence based value of metabolic activity. (iii) Cells left-over from high-throughput screens can be lysed and deposited on reverse-phase protein arrays. MIRACLE tracks samples throughout this process and allows for the analysis of protein readout data. (iv) Primary and secondary screening results are processed and normalized in HiTSeekR to facilitate identification of samples that exhibit a phenotype of interest. (v) Genes of interest are finally subjected to systems biology analysis, leading to the generation of new hypotheses.
DockerHub URLs for the web applications used in the drug discovery platform.
| Software | Docker image location |
|---|---|
| OpenLabFramework | |
| SAVANAH | |
| MIRACLE | |
| HiTSeekR | |
| KeyPathwayMiner |
|
Figure 2:Overview of the high-throughput screening platform with individual Docker containers for each application (purple boxes). Pre-defined network connections facilitate protected internal communication among the tools where needed (black arrows). Service containers (green boxes) are shared by all applications (black arrows omitted here for clarity). An NGINX Docker container serves as the single entry point to the platform and enables user access to individual tools through the web browser (purple arrows).