| Literature DB >> 25025055 |
Ginés D Guerrero1, Baldomero Imbernón2, Horacio Pérez-Sánchez2, Francisco Sanz3, José M García4, José M Cecilia2.
Abstract
Bioinformatics is an interdisciplinary research field that develops tools for the analysis of large biological databases, and, thus, the use of high performance computing (HPC) platforms is mandatory for the generation of useful biological knowledge. The latest generation of graphics processing units (GPUs) has democratized the use of HPC as they push desktop computers to cluster-level performance. Many applications within this field have been developed to leverage these powerful and low-cost architectures. However, these applications still need to scale to larger GPU-based systems to enable remarkable advances in the fields of healthcare, drug discovery, genome research, etc. The inclusion of GPUs in HPC systems exacerbates power and temperature issues, increasing the total cost of ownership (TCO). This paper explores the benefits of volunteer computing to scale bioinformatics applications as an alternative to own large GPU-based local infrastructures. We use as a benchmark a GPU-based drug discovery application called BINDSURF that their computational requirements go beyond a single desktop machine. Volunteer computing is presented as a cheap and valid HPC system for those bioinformatics applications that need to process huge amounts of data and where the response time is not a critical factor.Entities:
Mesh:
Year: 2014 PMID: 25025055 PMCID: PMC4082831 DOI: 10.1155/2014/474219
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Hardware features for our local test-bed infrastructure.
| Intel system | |
|---|---|
| Processor | Intel Xeon E5620 @ 2.4 GHz |
| GPU 0 | Nvidia 7300GT |
| Memory | 16 GB DDR3 @ 1333 MHz |
| Maximum power draw | 80 W |
| Experimental idle power | 38 W |
|
| |
| GPU 1: Nvidia GTX 465 | |
| GPU family | GF100 |
| Manufacturing process | 40 nm |
| Core clock | 607 MHz |
| Memory size | 1024 MB |
| Memory clock | 2 × 1603 MHz |
| Memory bus width | 256 bits |
| Memory bandwidth | 102.6 GB/sec |
| Stream processors | 352 |
| Maximum power draw | 200 W |
| Experimental idle power | 24 W |
|
| |
| GPU 2: Nvidia GTX 480 | |
| GPU family | GF100 |
| Manufacturing process | 40 nm. |
| Core clock | 700 MHz |
| Memory size | 1536 MB |
| Memory clock | 2 × 1848 MHz |
| Memory bus width | 384 bits |
| Memory bandwidth | 177.4 GB/sec |
| Stream processors | 480 |
| Maximum power draw | 250 W |
| Experimental idle power | 37 W |
|
| |
| GPU 3: Nvidia Tesla C2070 | |
| GPU family | GF100 |
| Process | 40 nm. |
| Core clock | 573.5 MHz |
| Memory size | 6143 MB |
| Memory clock | 2 × 1494 MHz |
| Memory bus width | 384 bits |
| Memory bandwidth | 143.4 GB/sec |
| Stream processors | 448 |
| Maximum power draw | 247 W |
| Experimental idle power | 107 W |
|
| |
| GPU 4: Nvidia GTX 590 | |
| GPU family | GF100 |
| Manufacturing process | 40 nm. |
| Core clock | 1215 MHz |
| Memory size | 2 × 1536 MB |
| Memory clock | 2 × 1707 MHz |
| Memory bus width | 2 × 384 bits |
| Memory bandwidth | 2 × 327.7 GB/sec |
| Stream processors | 1024 |
| Maximum power draw | 365 W |
| Experimental idle power | 140 W |
|
| |
| GPU 5: Nvidia Tesla K20c | |
| GPU family | GK110 |
| Manufacturing process | 28 nm. |
| Core clock | 705 MHz |
| Memory size | 5120 MB |
| Memory clock | 2 × 2600 MHz |
| Memory bus width | 320 bits |
| Memory bandwidth: | 208 GB/sec |
| Stream processors | 2496 |
| Maximum power draw | 225 W |
| Experimental idle power | 27 W |
GPU-based machines in the Ibercivis project to date. The nodes are divided into operative systems and GPU brands.
| Windows | Linux | Darwin | Total | |
|---|---|---|---|---|
| Nvidia | 918 | 106 | 68 | 1092 |
| ATI + Intel | 445 | 10 | 50 | 505 |
|
| ||||
| Total | 1363 | 116 | 118 |
|
Execution time in seconds of BINDSURF for the execution of ligand A (l1c4), ligand B (2byr), and ligand C (3p4w) in our local infrastructure with different Monte Carlo steps. Lowest RMSD values obtained for ligands B and C are 3 and 2 Angstroms, while there is no crystal structure available for ligand A.
| Steps | GTX 465 | GTX 480 | GTX 590 | Tesla C2070 | Tesla K20 |
|---|---|---|---|---|---|
| Ligand A (l1c4) | |||||
| 5 | 57.79 | 57.70 | 59.11 | 58.57 | 68.52 |
| 10 | 57.97 | 57.97 | 57.64 | 59.15 | 68.78 |
| 50 | 61.66 | 61.73 | 62.08 | 63.68 | 72.40 |
| 500 | 114.17 | 114.09 | 118.37 | 126.62 | 116.66 |
| 5000 | 788.52 | 788.81 | 842.34 | 942.35 | 678.61 |
| 50000 | 8888.45 | 8890.91 | 9546.22 | 10702.00 | 7371.31 |
|
| |||||
| Ligand B (2byr) | |||||
| 5 | 49.24 | 49.42 | 49.69 | 50.71 | 60.81 |
| 10 | 49.49 | 49.65 | 50.13 | 51.18 | 60.87 |
| 50 | 53.39 | 53.42 | 54.22 | 55.73 | 65.05 |
| 500 | 107.20 | 107.10 | 111.58 | 121.64 | 114.42 |
| 5000 | 733.35 | 732.86 | 783.29 | 891.92 | 686.69 |
| 50000 | 7162.95 | 7163.43 | 7688.34 | 8813.30 | 6569.21 |
|
| |||||
| Ligand C (3p4w) | |||||
| 5 | 75.41 | 75.35 | 75.75 | 76.41 | 86.44 |
| 10 | 75.53 | 75.45 | 76.00 | 76.70 | 86.05 |
| 50 | 78.64 | 79.23 | 80.20 | 80.74 | 89.44 |
| 500 | 127.35 | 127.52 | 131.72 | 139.09 | 129.28 |
| 5000 | 761.68 | 761.46 | 813.74 | 903.31 | 640.37 |
| 50000 | 7925.02 | 7924.22 | 8520.64 | 9494.24 | 6219.92 |
Execution time in seconds of BINDSURF for the execution of ligand A (l1c4), ligand B (2byr), and ligand C (3p4w) in Ibercivis with different Monte Carlo steps. It is divided into total time (i.e., including submission overheads) and processing time.
| Steps | Total time | Processing time |
|---|---|---|
| Ligand A (l1c4) | ||
| 5 | 60.60 | 30893 |
| 10 | 61.88 | 24325 |
| 50 | 61.85 | 28469 |
| 500 | 130.79 | 31719 |
| 5000 | 1144.02 | 20840 |
| 50000 | 11467.09 | 28469 |
|
| ||
| Ligand B (2byr) | ||
| 5 | 65.5 | 24158 |
| 10 | 64.58 | 24116 |
| 50 | 69.72 | 43664 |
| 500 | 133.73 | 91752 |
| 5000 | 1161.78 | 27727 |
| 50000 | 10895.33 | 42469 |
|
| ||
| Ligand C (3p4w) | ||
| 5 | 72.82 | 46150 |
| 10 | 76.59 | 20016 |
| 50 | 86.77 | 1933 |
| 500 | 177.47 | 2016 |
| 5000 | 1190.07 | 31843 |
| 50000 | 12660.21 | 46150 |
Averaged power consumption in Watts when processing ligand A (l1c4), ligand B (2byr), and ligand C (3p4w) in a local machine with different Monte Carlo steps. The runtimes are shown in Table 3.
| Steps | GTX 465 | GTX 480 | GTX 590 | Tesla C2070 | Tesla K20 |
|---|---|---|---|---|---|
| 5 | 240.57 | 246.45 | 270.99 | 291.78 |
|
| 10 | 236.79 | 246.50 | 271.44 | 293.32 |
|
| 50 | 240.92 | 251.37 | 274.40 | 297.85 |
|
| 500 | 276.09 | 293.75 | 309.07 | 325.52 |
|
| 5000 | 312.79 | 351.90 | 355.19 | 357.10 |
|
| 50000 | 318.79 | 362.36 | 364.24 | 356.14 |
|
Averaged economic costs in $ for processing 6000 redocking simulations of ligands (ligand A (l1c4), ligand B (2byr), and ligand C (3p4w)) in a local machine. We also vary the number of Monte Carlo steps to increase the computational cost.
| Steps | GTX 465 | GTX 480 | GTX 590 | Tesla C2070 | Tesla K20 |
|---|---|---|---|---|---|
| 5 | 167.74 |
| 162.16 | 166.99 | 200.20 |
| 10 | 166.28 |
| 161.48 | 168.22 | 200.13 |
| 50 | 181.50 |
| 172.72 | 180.08 | 210.55 |
| 500 | 394.53 |
| 318.83 | 349.33 | 334.95 |
| 5000 | 3063.61 | 2017.71 | 2158.96 | 2475.37 |
|
| 50000 | 33020.51 | 21209.85 | 22812.22 | 26228.97 |
|