On the Price of Heterogeneity in Parallel Systems P . Brighten - - PowerPoint PPT Presentation

on the price of heterogeneity in parallel systems
SMART_READER_LITE
LIVE PREVIEW

On the Price of Heterogeneity in Parallel Systems P . Brighten - - PowerPoint PPT Presentation

On the Price of Heterogeneity in Parallel Systems P . Brighten Godfrey and Richard M. Karp SPAA06 - July 31, 2006 1 Introduction Consider a parallel system in which processing speed, Bottleneck bandwidth to each node has a


slide-1
SLIDE 1

On the Price of Heterogeneity in Parallel Systems

P . Brighten Godfrey and Richard M. Karp

SPAA’06 - July 31, 2006

1

slide-2
SLIDE 2

Introduction

  • Consider a parallel system in which

each node has a capacity

  • Does increasing heterogeneity of the

capacity distribution help or hurt?

processing speed, Bottleneck bandwidth to Internet, memory, ...

2

slide-3
SLIDE 3

Introduction

System A All nodes have equal capacity System B Same total capacity; higher variance Does A or B perform better?

...can do either, depending on what system we’re talking about AND the conditions under which we’re running the system

Yes

3

slide-4
SLIDE 4

Example

  • Set of jobs, each with a length
  • n processors, each with speed ci
  • Assign jobs to processors to minimize

makespan: time until last processor completes its jobs

  • Completion time of processor i: sum of job

lengths given to it, divided by ci Minimum Makespan Scheduling

4

slide-5
SLIDE 5

Example 1

Job lengths Processor speeds C Processor speeds C’ Completion time

4 sec

Completion time

2 sec

5

slide-6
SLIDE 6

Example 2

Job lengths Processor speeds C Processor speeds C’

1 sec

Completion time

≈ 2 sec

Completion time

6

slide-7
SLIDE 7

So increasing heterogeneity can help or hurt. Can we make any generalizations?

7

slide-8
SLIDE 8

In This Paper

a general framework to quantify the worst-case effect

  • f increasing heterogeneity

8

slide-9
SLIDE 9

Contents

  • Model
  • Results
  • Conclusion

9

slide-10
SLIDE 10

sup

W,C,C′: CC′

g(C′, W) g(C, W)

cost function (processing time in optimal schedule)

Model

Price of Heterogeneity of g node capacities (CPU speed) workload (jobs to run)

g usually optimal value to some combinatorial

  • ptimization

problem.

10

slide-11
SLIDE 11

∀k

k

  • i=1

c′

i ≥ k

  • i=1

ci and

n

  • i=1

c′

i = n

  • i=1

ci

C′ C

  • when

Defining Heterogeneity

So majorization is consistent with both variance and negative entropy.

C′ C = ⇒

  • var(C′) ≥ var(C)

−H(C′) ≥ −H(C)

  • Majorization partial order

C = (c1, . . . , cn) C′ = (c′

1, . . . , c′ n)

  • Capacity vectors

(sorted decreasing)

11

slide-12
SLIDE 12

Majorization example

(4, 0, 0, 0) (1, 1, 1, 1) (2, 2, 0, 0) (4/3, 4/3, 4/3, 0) (1.5, 1.5, 1, 0) Homogeneous Most Heterogeneous (3.8, .1, .1, 0) Serial Most Parallel

12

slide-13
SLIDE 13

So Price of Heterogeneity also bounds the the Value of Parallelism!

13

slide-14
SLIDE 14

Using the Price of Heterogeneity

  • Justified generalizations

(Constant vs. unbounded PoH)

  • Comparison across systems
  • Worst cases for testing

What characteristics place a cost function in one or the other category?

14

slide-15
SLIDE 15

Contents

  • Model
  • Results
  • Conclusion

15

slide-16
SLIDE 16

Results

Problem PoH

Minimum makespan scheduling 2-1/n Scheduling on related machines O(1) PCS, unit length jobs ≤ 16 Precedence Constrained Sched. O(log n)

  • Sched. with release times

Unbounded Minimum network diameter ≤ 2

16

slide-17
SLIDE 17

One way to bound PoH

  • Goal: show C’ is as good as more

homogeneous capacities C C C’

  • Total capacity “simulated” by each C’ node

must be not much more than its own capacity

17

slide-18
SLIDE 18

Simulation Lemma

  • A ß-simulation is a mapping from C to C’

such that no C’-node gets more than ß times its capacity.

  • Lemma: a (2-1/n)-simulation always exists

for any C and more heterogeneous C’

18

slide-19
SLIDE 19

Lay of the land

  • Minimum Makespan Scheduling & a class of

generalizations: O(1) PoH

  • Scheduling with release times: unbounded PoH
  • Precedence Constrained Scheduling (PCS):

O(log n) PoH

19

slide-20
SLIDE 20

PCS

  • Like Min. Makespan Scheduling, except...
  • Given set of precedence constraints:

“Job i must finish before job k starts”

20

slide-21
SLIDE 21

PCS

  • Simulation technique cannot succeed

... Time C-machines 1 2 3 ...

  • Design capacity distributions such that some

C’-machines simulate multiple C-machines

  • Factor n/4 increase in schedule length!

21

slide-22
SLIDE 22

PCS

  • Instead, use LP relaxation of PCS due to

Chudak and Shmoys

  • Can apply Simulation Lemma to optimal

values of the LP

  • Key relaxed constraint: machine can only

execute one job at a time

  • LP is within O(log n) of optimal => PoH of

PCS is O(log n)

22

slide-23
SLIDE 23

Contents

  • Model
  • Results
  • Conclusion

23

slide-24
SLIDE 24

Conclusion

  • Introduced a framework to characterize

worst-case effect of increasing heterogeneity

  • “Batch” scheduling problems have low PoH
  • Even PCS has O(log n) PoH, while release

times cause unbounded PoH

  • Does PCS have O(1) PoH?

24