| Literature DB >> 30458034 |
Xiaoyong Tang1,2, Xiaoyi Liao1.
Abstract
Recently, computational Grids have proven to be a good solution for processing large-scale, computation intensive problems. However, the heterogeneity, dynamics of resources and diversity of applications requirements have always been important factors affecting their performance. In response to these challenges, this work first builds a Grid job scheduling architecture that can dynamically monitor Grid computing center resources and make corresponding scheduling decisions. Second, a Grid job model is proposed to describe the application requirements. Third, this paper studies the characteristics of commercial interconnection networks used in Grids and forecast job transmission time. Fourth, this paper proposes an application-aware job scheduling mechanism (AJSM) that includes periodic scheduling flow and a heuristic application-aware deadline constraint job scheduling algorithm. The rigorous performance evaluation results clearly demonstrate that the proposed application-aware job scheduling mechanism can successful schedule more Grid jobs than the existing algorithms. For successful scheduled jobs, our proposed AJSM method is the best algorithm for job average processing time and makespan.Entities:
Mesh:
Year: 2018 PMID: 30458034 PMCID: PMC6245787 DOI: 10.1371/journal.pone.0207596
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1China National Grid.
Fig 2The Grid scheduling architecture.
Grid job A characteristics.
| Job executing software | |
| Software version | |
| Software license | |
| Computational nodes | |
| Manycore demand | |
| The size of whole job | |
| The job arrival time | |
| The job execution time | |
| The job deadline |
Fig 3A data transfer bandwidth variance curve.
Fig 4The application-aware periodic job scheduling flow.
The settings of simulated Grid computing center.
| Grid Center | Nodes | Storage(TB) | Software | Multicore | Manycore | Memory(GB) | |||
|---|---|---|---|---|---|---|---|---|---|
| CPUs | Cores | Speed(GHz) | Capacity(GHz) | ||||||
| 2048 | 1.47 | 78 | 2 | 6 | 2.93 | 140.64 | 1 | 48 | |
| 17920 | 12.4 | 112 | 2 | 12 | 2.2 | 188.1 | 2.3 | 64 | |
| 2560 | 0.408 | 48 | 2 | 6 | 2.66 | 515 | 1 | 24 | |
| 128 | 1.5 | 20 | 4 | 8 | 2.4 | 128 | |||
| 40960 | 230 | 75 | 2381.1 | 1.9 | 32 | ||||
| 912 | 600 | 132 | 2 | 6 | 2.5 | 64 | |||
| 980 | 160 | 44 | 2 | 12 | 2.5 | 128 | |||
| 7168 | 262 | 88 | 2 | 6 | 2.93 | 515 | 1 | 32 | |
| 1650 | 45 | 26 | 4 | 4 | 2.0 | 48 | |||
| 300 | 23 | 46 | 2 | 12 | 2.5 | 32 | |||
| 1 | 0.8 | 10 | 1 | 12 | 3.0 | 16 | |||
Three jobs characteristics.
| Job | Software | Version | License | Computational nodes | Manycore demand | Execution Time(s) | Size | Deadline(s) |
|---|---|---|---|---|---|---|---|---|
| CP2K | 4.1 | 20 | 45 | No | 2737 | 0.8 | 3000 | |
| NAMD | 2.12 | 1024 | Yes | 17890 | 0.3 | 20000 | ||
| CASTEP | 16.4 | 20 | No | 1751 | 1.37 | 2000 |
The experimental results of job transmission time prediction.
| Job | Prediction(s) | Actual(s) | Error rate(%) | Job | Prediction(s) | Actual(s) | Error rate(%) |
|---|---|---|---|---|---|---|---|
| 32.5 | 28.9 | 12.4 | 212.3 | 219.6 | 3.3 | ||
| 346.9 | 332.1 | 4.5 | 72.5 | 108.9 | 33.4 | ||
| 34.8 | 33.9 | 2.7 | 56.7 | 57.3 | 1 | ||
| 873.7 | 940.8 | 7.1 | 77.8 | 85.3 | 8.8 | ||
| 708.4 | 755.6 | 6.2 | 347.8 | 785.6 | 55.7 |
Fig 5Performance impact of jobs with 60 scheduling points.
(a) Total Processing Time; (b) Average Processing Time; (c) Makespan; (d) Job Rejection Ratio.
Fig 6Performance impact of jobs with 120 scheduling points.
(a) Total Processing Time; (b) Average Processing Time; (c) Makespan; (d) Job Rejection Ratio.