| Literature DB >> 32529017 |
Abstract
BACKGROUND: Laboratories performing clinical high-throughput sequencing for oncology and germline testing are increasingly migrating their data storage to cloud-based solutions. Cloud-based storage has several advantages, such as low per-GB prices, scalability, and minimal fixed costs; however, while these solutions tout ostensibly simple usage-based pricing plans, practical cost analysis of cloud storage for NGS data storage is not straightforward.Entities:
Year: 2020 PMID: 32529017 PMCID: PMC7276491 DOI: 10.1016/j.plabm.2020.e00168
Source DB: PubMed Journal: Pract Lab Med ISSN: 2352-5517
| Vendor | Storage Tier (see legend for abbreviations) | Cost per GB-Month (a) | Retrieval Time | Retrieval Cost per GB (b) | Cost per Test (6 GB Exome over 10 years) | ||
|---|---|---|---|---|---|---|---|
| Strategy A | Strategy B | Strategy C | |||||
| AWS | S3 | 2.1–2.3 cents | Immediate | – | $12.39 | $3.29 (2 years S3 then 8 years Deep Glacier) | $0.88 (3 months S3 then 10 years Deep Glacier) |
| S3-IA | 1.25 cents | Immediate | 1.0 cents | $6.77 | |||
| Glacier | 0.4 cents | 3–5 h (typical); 1–5 min (expedited) | 0.25–3.0 cents | $2.17 | |||
| Deep Glacier | 0.099 cents | 12–48 h | 0.25–2.0 cents | $0.54 | |||
| GCP | Regional | 2.0–2.3 cents | Immediate | – | $10.83 | $5.57 (2 years Regional then 8 years Coldline) | $4.09 (3 months Regional then 10 years Coldline) |
| Nearline | 1.0 cents | Immediate | 1.0 cents | $5.41 | |||
| Coldline | 0.7 cents | Immediate | 2.0 cents | $3.79 | |||
| Archive (c) | 0.25 cents | Immediate | 5.0 cents | $1.35 | |||
| Azure | ZRS Hot | 2.12–2.3 cents | Immediate | – | $12.40 | $3.30 (2 years LRS Hot then 8 years LRS Archive) | $0.86 (3 months LRS Hot then 10 years LRS Archive) |
| ZRS Cool | 1.25 cents | Immediate | – | $6.77 | |||
| LRS Hot | 1.7–2.08 cents | Immediate | – | $9.92 | |||
| LRS Cool | 1.0–1.5 cents | Immediate | 1.0 cents | $5.41 | |||
| LRS Archive | 0.099–0.2 cents | <15 h | 2.0 cents | $0.54 | |||
Strategy A: 1000 exomes per year (6TB generated per year), stored for 10 years at indicated storage level.
Strategy B: 1000 exomes per year (6TB generated per year), stored for 2 years in “hot” storage and 8 years in “cold” storage.
Strategy C: 1000 exomes per year (6TB generated per year), stored for 3 months in “hot” storage and 10 years in “cold” storage.
(Simulating 20 years total, no re-access).
All prices, features and storage classes are current as of May 2020. An updated, online version of this table can be found at https://ngscosts.info.
Not all storage classes, features and costs from each vendor are represented. Displayed classes are meant to be representative of range available from each vendor.
Prices are subject to change.
Abbreviations; AWS: Amazon Web Serivces, GCP: Google Cloud Platform, S3: Simple Storage Service, S3-IA: S3 Infrequent Access, ZRS: Zone-redundant storage, LRS: Locally-redundant storage.
Notes: (a) Ranges indicate when prices may vary by region and/or total data stored(b) Data retrieved to locations outside of the vendors cloud or cross-region may incur network transfer costs.(c) GCP Archive does not come with a Service Level Agreement for availability.
Fig. 2Historical prices of cloud storage across major cloud vendors and products over time. Storage tiers with an asterisk are either retired names or products. Please note: chart is based on historical or archived web content and dates of price drops or product introductions are approximate. AWS: Amazon Web Services, S3: Simple Storage Service, S3-IA: S3 Infrequent Access, GCP: Google Cloud Platform, GB: Gigabyte.
Fig. 1Screenshot from the application. Panels on the left side of the application enable control of parameters and customization. File sizes, test volumes, compression, storage tiers and retention times are configurable (see Methods for additional description of options). Outputs are interactively updated, and track total stored data over time, per-test marginal costs as well as the total lifetime cost.