| Literature DB >> 24106693 |
Daniele D'Agostino1, Andrea Clematis, Alfonso Quarati, Daniele Cesini, Federica Chiappori, Luciano Milanesi, Ivan Merelli.
Abstract
Cloud computing opens new perspectives for small-medium biotechnology laboratories that need to perform bioinformatics analysis in a flexible and effective way. This seems particularly true for hybrid clouds that couple the scalability offered by general-purpose public clouds with the greater control and ad hoc customizations supplied by the private ones. A hybrid cloud broker, acting as an intermediary between users and public providers, can support customers in the selection of the most suitable offers, optionally adding the provisioning of dedicated services with higher levels of quality. This paper analyses some economic and practical aspects of exploiting cloud computing in a real research scenario for the in silico drug discovery in terms of requirements, costs, and computational load based on the number of expected users. In particular, our work is aimed at supporting both the researchers and the cloud broker delivering an IaaS cloud infrastructure for biotechnology laboratories exposing different levels of nonfunctional requirements.Entities:
Mesh:
Year: 2013 PMID: 24106693 PMCID: PMC3782806 DOI: 10.1155/2013/138012
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Principal operations of a typical drug discovery pipeline.
Characteristics of the operations of the drug discovery pipeline.
| TI | VS | ER | LO | |
|---|---|---|---|---|
|
| ||||
| Number ofcores | ≥16 | ≥8 | ≥32 | ≥4 |
| Network | — | — | Infiniband | — |
| RAM (GB) | ≥32 | ≥16 | ≥64 | ≥16 |
| HDD (GB) | — | ≥500 | ≥1000 | — |
| Reference Amazon EC21 virtual cluster | 20 × D | 10 × 2 × D | 1–10 × 2 × D | 1 × C |
|
| ||||
| Security | Low | Medium | Medium | High |
| Availability/Resiliency | Low | Medium | Medium | Low |
|
| ||||
| Input data and size | 10 K–100 K Parameter simulations | 5 K–10 K Compounds | 1–10 Simulations 50 K–250 K Atoms | 5–10 Compounds |
| Service demand time ( | 16–160 h | 8–16 h | 2–1200 h | 1–6 h |
| Arrival rate ( | 4 | 3/4 | 2/3 | 1/2 |
|
| ||||
| Commercial | MATLAB | FlexX, Glide, ICM | AMBER, CHARMM | MacroModel, Medstere |
| Free/open source | R | Dock, Autodock | GROMACS, NAMD | Omega |
1See Table 2.
Characteristics of the considered Amazon EC2 instance types.
| Amazon EC2 instance types | RAM | EC2 compute unit | Storage | On Demand instances | Heavy utilization reserved instances |
|---|---|---|---|---|---|
| A: High-Memory Double eXtra Large | 34.2 GB | 13 | 850 GB | $0.9 | $0.176 |
| B: High-Memory Quadruple XL | 68.4 GB | 26 | 1690 GB | $1.8 | $0.352 |
| C: Cluster Compute Quadruple XL | 23 GB | 33.5 | 1690 GB | $1.3 | $0.297 |
| D: Cluster Compute Eight XL | 60.5 GB | 88 | 3370 GB | $2.4 | $0.361 |
Commercial and open source cloud broker Platforms.
| Cloud | Users1 | CI2 | Type of X as a Service | QoS/SLA | Billing/ | SW3 | |
|---|---|---|---|---|---|---|---|
| Zimory4 | Management system | CB, SP, CC | All | All | Security, resiliency/Y | APIs | |
| ComputeNext5 | Service brokerage | SP, CC | — | All (I) | —/Y | Y/Y | APIs |
| CompatibleOne6 | Cloud Broker | All | All | All | Security/Y | OS APIs | |
| Gravitant cloudMatrix7 | Services brokerage | CB, SP, CC | All | All | Security/Y | Y/Y | APIs |
| enStratius8 | Management system | CB, SP, CC | All | S, I | Security | Y | APIs |
| RightScale myCloud9 | Management system | SP, CC | All (H) | S, P | Security | Y | APIs |
| Scalr10 | Management system | SP, CC | All | All | [OS] | ||
| Standing Cloud11 | Marketplace | All | PB | All | |||
| Artisan Infrastructure12 | IaaS provider | SP | All | Security, resiliency/Y | |||
| StratusLab13 | IaaS distribution | CB, SP | — | I | — | — | OS APIs |
| Contrail14 | Components | CP, CB, SP | All | P, I | Security, reliability/Y | OS APIs | |
| RESERVOIR15 | Federated clouds management | CB, SP | H | I | —/Y | OS APIs | |
| MOSAIC16 | Component framework | SP, CB | H, P | All | Security/Y | —/Y | OS APIs |
1Cloud Provider, Cloud Broker, Service Provider, Cloud Consumer.
2Cloud Infrastructure: PuBlic, Private, Hybrid.
3License: Open Source.
4 http://www.zimory.com.
5 https://www.computenext.com.
6 http://www.compatibleone.org.
7 http://www.gravitant.com.
8 http://www.enstratius.com.
9 http://www.rightscale.com.
10 http://www.scalr.com.
11 http://www.standingcloud.com.
12 http://www.artisaninfrastructure.com.
13 http://stratuslab.eu.
14 http://contrail-project.eu.
15 http://www.reservoir-fp7.eu.
16 http://www.mosaic-cloud.eu.
Figure 2UML sequential diagram of a request for service execution in a hybrid cloud.
Figure 3The brokering system architecture.
Figure 4Private zone utilization U at varying arrival rates.
Figure 5Annual CB's revenue.
VM configurations and hourly provisioning price per type of requests.
| Instance type | RAM | Cores | Storage | Operation | AZ price/h | PZ price/h |
|---|---|---|---|---|---|---|
| C | 32 GB | 8 | 1690 GB | LO | $1.3 | $0.297 |
| D | 64 GB | 16 | 3370 GB | TI | $2.4 | $0.361 |
| E | 128 GB | 32 | 1-2 TB | VS, ER | $4.8 | $0.722 |
Figure 6Average waiting times, in hours, per pipeline.
Figure 7Private zone utilization U with 75% sporadic users.
Figure 8Annual CB's revenue with 75% sporadic users.
Figure 9Average waiting times per pipeline with 75% sporadic users.
Figure 10Private zone utilization U with 75% frequent users.
Figure 11Annual CB's revenue with 75% frequent users.
Figure 12Average waiting times per pipeline with 75% frequent users.
Figure 13Private zone utilization U, with a 15-node system.
Figure 14Annual CB's revenue with a 15-node system.
Figure 15Percentage revenue increases with respect to the basic scenario (15 versus 10 nodes).
Figure 16Average waiting times per pipeline with a 15-node system.