1/36
ComplexHPC Spring School Day 2: KOALA Tutorial The KOALA Scheduler
Nezih Yigitbasi Delft University of Technology
4
May 10, 2011
4 May 10, 2011 1/36 Outline Koala Architecture Job Model System - - PowerPoint PPT Presentation
ComplexHPC Spring School Day 2: KOALA Tutorial The KOALA Scheduler Nezih Yigitbasi Delft University of Technology 4 May 10, 2011 1/36 Outline Koala Architecture Job Model System Components Support for different application
1/36
May 10, 2011
2/36
3/36
4/36
May 13, 2011
5/36
6/36
scheduler decides on component placement
job components same total job size
job component placement fixed
components
scheduling
execution
7/36
8/36
9/36
10/36
11/36
12/36
13/36
14/36
15/36
See: H.H. Mohamed and D.H.J. Epema, “An Evaluation of the Close-to-Files Processor and Data Co-Allocation Policy in Multiclusters,” IEEE Cluster 2004.
16/36
See: O.O.Sonmez, H.H. Mohamed and D.H.J. Epema, “Communication-aware Job Scheduling Policies for the Koala Grid Scheduler”, IEEE e-Science 2006.
17/36
Components
Clusters
Component
18/36
(Prime) to very communication-intensive (Wave) average job response time (s) Prime Poisson Wave
19/36
average job response time (s)
FCM CA FCM CA [w/o Delft] [with Delft]
20/36
21/36
22/36
23/36
24/36
25/36
Scheduler CS-Runner
Node
submits PSA(s)
JDL
grow/shrink messages registers
Launcher Head Node KCM
submits launchers deploys, monitors, and preempts tasks monitors/informs idle/demanded resources CS Policies:
grid-wide basis
per cluster Application Level Scheduling:
26/36
Clusters
CS User-1 CS User-2 CS User-3
27/36
Clusters
CS User-1 CS User-2 CS User-3
28/36
See: O. Sonmez, B. Grundeken, H.H. Mohamed, Alex Iosup, D.H.J. Epema, Scheduling Strategies for Cycle Scavenging in Multicluster Grid Systems, CCGrid 2009.
Equi-PerSite is fair and superior to Equi-All
Number of Completed Jobs Equi-All Equi-All Equi-PerSite Equi-PerSite WBlock WBurst WBlock WBurst Makespan [s]
with the same parameter range
job startup overhead + information delay Number of Jobs
29/36
30/36
31/36
*G. Karypis and V. Kumar. Multilevel graph partitioning schemes. In Int. Conf. Par.
Proc., pages 113–122, 1995.
32/36
5/10/11 32
*H. Topcuoglu, S. Hariri, and M. Wu. Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE TPDS, 13(3):260–274, 2002.
33/36
34/36
35/36
36/36