Swarm-based In Incast Congestion Control in in a Datacenter - - PowerPoint PPT Presentation

swarm based in incast congestion control in in
SMART_READER_LITE
LIVE PREVIEW

Swarm-based In Incast Congestion Control in in a Datacenter - - PowerPoint PPT Presentation

Swarm-based In Incast Congestion Control in in a Datacenter Serving Web Applications Haoyu Wang* , Haiying Shen * and Guoxin Liu ^ *U *Universit ity of of Vir irgin inia ia, ^C ^Cle lemson Univ iversit ity Outline Introduction


slide-1
SLIDE 1

Swarm-based In Incast Congestion Control in in a Datacenter Serving Web Applications

Haoyu Wang*, Haiying Shen* and Guoxin Liu^

*U *Universit ity of

  • f Vir

irgin inia ia, ^C ^Cle lemson Univ iversit ity

slide-2
SLIDE 2
  • Introduction
  • Approach description
  • Evaluation
  • Conclusion

Outline

2

slide-3
SLIDE 3
  • Introduction
  • Approach description
  • Evaluation
  • Conclusion

Outline

3

slide-4
SLIDE 4

4

Introduction

Incast congestion is a common problem in modern datacenters

  • 1. TCP timeout and retransmission
  • 2. Throughput loss
  • 3. Increased latency
  • 4. Application failure

Glenn from Morgan Stanley, NSDI 2015

slide-5
SLIDE 5

5

Introduction

Incast congestion

Incast is a many-to-one communication pattern commonly found in cloud data

  • centers. It begins when a singular parent server places a request for data
  • bjects to a large number of servers simultaneously.

The Nodes respond to the singular Parent. The result is a micro burst of many machines simultaneously sending TCP data streams to one machine

slide-6
SLIDE 6

6

Introduction

Incast congestion

Incast is a many-to-one communication pattern commonly found in cloud data

  • centers. It begins when a singular parent server places a request for data
  • bjects to a large number of servers simultaneously.

The servers respond to the singular parent, resulting a micro burst of many machines simultaneously sending TCP data streams to one machine

slide-7
SLIDE 7

7

Introduction

slide-8
SLIDE 8

8

Introduction

Previous work Sliding Window MCN’95 The window size changes after the congestion is detected ICTCP (Improved sliding window protocol) Staggered flow

slide-9
SLIDE 9

9

Introduction

Previous work Sliding Window The window size changes after the congestion is detected ICTCP (Improved sliding window protocol) Conext’10 Staggered flow

slide-10
SLIDE 10

10

Introduction

Previous work Sliding Window The window size changes after the congestion is detected ICTCP (Improved sliding window protocol) Conext’10 Staggered flow MASCOTS’12, COMPSACW’13

slide-11
SLIDE 11
  • Introduction
  • Approach description
  • Evaluation
  • Conclusion

Outline

11

slide-12
SLIDE 12

12

Approach Description

A multilevel tree with proximity-aware swarm

Hub: The server connecting with the font-end server and has the largest spare capacity to handle I/O among each rack

slide-13
SLIDE 13

13

Approach Description A swarm structure is formed only for one data request

  • 1. The transient structure does not need to be maintained
  • 2. Transmitting data through a much smaller structure greatly reduces the latency
  • 3. Data servers without requested data objects do not need to participate in the

structure

Determine a suitable number of hubs:

𝑂 = 𝑇𝑓 𝐶𝑒 ∗ 𝐶𝑣 𝑡 ∗ 𝑛

Building multi-level tree of hubs:

  • 1. The hubs under the same aggregation router are linked together in the tree
  • 2. A hub’s child always has a smaller number of requested data objects than its parent
slide-14
SLIDE 14

14

Approach Description

Pseudocode of multi-level tree generation

  • 1. Cluster target data servers in each rack into a swarm
  • 2. /* Select a hub from each swarm*/
  • 3. For each swarm do
  • 4. Select the data server with the largest number of requested

data objects as the hub; Enqueue the hub into queue 𝑅ℎ

  • 5. Sort the hubs in 𝑅ℎ in ascending order of number of

requested data objects

slide-15
SLIDE 15

15

Approach Description

Pseudocode of multi-level tree generation

  • 1. /*Create multi-level tree from hubs*/
  • 2. While 𝑅ℎ>N do
  • 3. Dequeue a hub ℎ𝑗from 𝑅ℎ
  • 4. Select a hub ℎ𝑘 with the smallest number of data objects and

under the same aggregation router as ℎ𝑗; Link ℎ𝑗 as child to ℎ𝑘

  • 5. While ℎ𝑘 has less than children and ℎ𝑗 has children do
  • 6. Transmit the last child from ℎ𝑗 to be a child of ℎ𝑘
slide-16
SLIDE 16

16

Approach Description

Two-level data transmission speed control

In order to avoid overloading the front-end server:

  • 1. At the front-end server

The front-end server periodically adjusts the assigned bandwidth to each hub after each short time period

  • 2. At the aggregation router

For multi front-end servers under the same router, we adjust the request transmission speed of each front-end server

slide-17
SLIDE 17
  • Introduction
  • Approach description
  • Evaluation
  • Conclusion

Outline

17

slide-18
SLIDE 18

18

Evaluation

Simulation setup: 3000 data servers with fat tree structure TCP retransmission timeout: 10ms Comparison methods:

  • 1. One-all
  • 2. Sliding window protocol (SW) MCN’95
  • 3. ICTCP Conext’10
slide-19
SLIDE 19

19

Evaluation

Performance of SICC

slide-20
SLIDE 20

20

Evaluation

Performance of SICC

slide-21
SLIDE 21

21

Evaluation

Performance of multi-level tree of hubs

slide-22
SLIDE 22

22

Evaluation

Computing time of multi-level tree generation

slide-23
SLIDE 23
  • Introduction
  • Approach description
  • Evaluation
  • Conclusion

Outline

23

slide-24
SLIDE 24

24

Conclusion

  • 1. Incast congestion is a common problem in modern datacenters
  • 2. We proposed Swarm-based Incast Congestion Control method

(SICC)

  • 1. Proximity-aware swarm based data transmission
  • 2. Two-level data transmission speed control
  • 3. other enhancements
  • 3. Experiments show that SICC achieves higher throughput and

lower latency

slide-25
SLIDE 25

25

Conclusion

Thank you!

Question