Assessing the Quality of Automatically Built Network Representations - - PowerPoint PPT Presentation

assessing the quality of automatically built network
SMART_READER_LITE
LIVE PREVIEW

Assessing the Quality of Automatically Built Network Representations - - PowerPoint PPT Presentation

Assessing the Quality of Automatically Built Network Representations Lionel Eyraud-Dubois (ENS-Lyon, France) Martin Quinson (University of Nancy, France) May 16, 2007 Workshop on Programming Models for Grid Computing associated to CCGrid07


slide-1
SLIDE 1

Assessing the Quality of Automatically Built Network Representations

Lionel Eyraud-Dubois (ENS-Lyon, France) Martin Quinson (University of Nancy, France) May 16, 2007 Workshop on Programming Models for Grid Computing associated to CCGrid07

slide-2
SLIDE 2

Scheduling on a large-scale distributed platform

◮ Let GP = (VP, EP) denote the platform graph

1 1 1 1 10 P1 P2 P4 P3

◮ Each edge Pi → Pj is labeled by ci,j:

time necessary to send a unit-size message between Pi and Pj

◮ Communication model:

◮ full-overlap of communications and computations ◮ 1-port for incoming communications and 1-port for outgoing communications

◮ Each node Pi has a processing speed wi ∈ R

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 2/16

slide-3
SLIDE 3

Scheduling on a large-scale distributed platform

◮ Let GP = (VP, EP) denote the platform graph

1 1 1 1 10 P1 P2 P4 P3

◮ Each edge Pi → Pj is labeled by ci,j:

time necessary to send a unit-size message between Pi and Pj

◮ Communication model:

◮ full-overlap of communications and computations ◮ 1-port for incoming communications and 1-port for outgoing communications

◮ Each node Pi has a processing speed wi ∈ R

Eh wait!

Where does this graph instance comes from?

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 2/16

slide-4
SLIDE 4

Building a Network Representation

Motivation

◮ Modern platforms are heterogeneous and dynamic ◮ Distributed applications must be network aware and reactive ◮ Information on the network needed (at least) for:

◮ Service and distributed application deployment ◮ Communication-aware scheduling ◮ Group communication ◮ Proximity Neighbor Selection in P2P systems

Several levels of information (almost as many as layers in the OSI model)

◮ Physical inter-connexion map (wires in the walls) ◮ Routing infrastructure (path of network packets, from router to switch) ◮ Application level (focus on effects – bandwidth, latency – not causes)

Network mapping process

◮ Step 1: End-to-end measurements ◮ Step 2: Reconstruct a graph

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 3/16

slide-5
SLIDE 5

Classical Measurements in a Grid Environment?

Use of low-level network protocols (like SNMP or BGP)

◮ Example: Remos ◮ Use of SNMP restricted for security reasons (DoS or spying)

Use of traceroute or ping (ie on ICMP)

◮ Examples: TopoMon, Lumeta, IDmaps, Global Network Positioning ◮ Use of ICMP more and more restricted by admins (for same reasons)

Over the lifetime of the project, we have noticed that the number

  • f replying destinations in our lists decays at the rate of 2-3% per

month. – Authors of the Skitter project

Pathchar

◮ Works without privilege on the network, but must be root on hosts

⇒ not adapted to grid settings

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 4/16

slide-6
SLIDE 6

Classical Measurements in a Grid Environment?

Use of low-level network protocols (like SNMP or BGP)

◮ Example: Remos ◮ Use of SNMP restricted for security reasons (DoS or spying)

Use of traceroute or ping (ie on ICMP)

◮ Examples: TopoMon, Lumeta, IDmaps, Global Network Positioning ◮ Use of ICMP more and more restricted by admins (for same reasons)

Over the lifetime of the project, we have noticed that the number

  • f replying destinations in our lists decays at the rate of 2-3% per

month. – Authors of the Skitter project

Pathchar

◮ Works without privilege on the network, but must be root on hosts

⇒ not adapted to grid settings

Measurements must be at application-level (no privilege)

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 4/16

slide-7
SLIDE 7

Solutions relying on application-level measurements

NWS (Network Weather Service – UCSB)

◮ De facto standard, used in Globus, DIET, NINF, to gather info on network ◮ Reports bandwidth, latency, CPU availability, and future trends ◮ Only quantitative values, not topological information

(but one can label a big clique with NWS provided values)

ENV (Effective Network View – UCSD)

◮ Use interference measurement to build a tree representation

ECO (Efficient Collective Communication)

◮ Use application-level measurements to optimize collective communications ◮ Should be generalized, if possible

Existing reconstruction algorithms

◮ Reconstructed topology: clique (NWS, ECO) or tree (ENV, Lat. clustering) ◮ Goal of this work: assess quality of clique and spanning tree algorithms

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 5/16

slide-8
SLIDE 8

Presentation outline

Introduction Problem Statement State of the art The ALNeM project Goals and architecture The GRAS development framework Experimentations Evaluation methodology Experiments on a real platform Experiments on simulator

Renater platform GridG platforms

Conclusions

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Introduction 6/16

slide-9
SLIDE 9

ALNeM (Application-Level Network Mapper)

Presentation

◮ Long-term goal: be a tool to provide topology to network-aware applications ◮ Short-term goal: allow the study of network mapping algorithms

?

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 7/16

slide-10
SLIDE 10

ALNeM (Application-Level Network Mapper)

Presentation

◮ Long-term goal: be a tool to provide topology to network-aware applications ◮ Short-term goal: allow the study of network mapping algorithms

DB S S S S S S S S

Architecture

◮ Lightweight distributed measurement infrastructure (collection of sensors) ◮ MySQL measurement database

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 7/16

slide-11
SLIDE 11

ALNeM (Application-Level Network Mapper)

Presentation

◮ Long-term goal: be a tool to provide topology to network-aware applications ◮ Short-term goal: allow the study of network mapping algorithms

DB S S S S S S S S

Architecture

◮ Lightweight distributed measurement infrastructure (collection of sensors) ◮ MySQL measurement database

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 7/16

slide-12
SLIDE 12

ALNeM (Application-Level Network Mapper)

Presentation

◮ Long-term goal: be a tool to provide topology to network-aware applications ◮ Short-term goal: allow the study of network mapping algorithms

Algorithm 1 Right platform Wrong topology Wrong values Algorithm 2 Algorithm 3 DB

Architecture

◮ Lightweight distributed measurement infrastructure (collection of sensors) ◮ MySQL measurement database ◮ Topology builder, with several reconstruction algorithms

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 7/16

slide-13
SLIDE 13

ALNeM (Application-Level Network Mapper)

Presentation

◮ Long-term goal: be a tool to provide topology to network-aware applications ◮ Short-term goal: allow the study of network mapping algorithms

Algorithm 1 Right platform Wrong topology Wrong values Algorithm 2 Algorithm 3 DB S S S S S S S S

Architecture

◮ Lightweight distributed measurement infrastructure (collection of sensors) ◮ MySQL measurement database ◮ Topology builder, with several reconstruction algorithms

Development on simulator, use in real life

◮ Implemented using GRAS (part of the SimGrid framework)

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 7/16

slide-14
SLIDE 14

The GRAS project

◮ Goal: Easing infrastructure development (motivated by ALNeM)

Development of real distributed applications using a simulator

Development

Without GRAS

Research

Code Simulation Application

rewrite

Code

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 8/16

slide-15
SLIDE 15

The GRAS project

◮ Goal: Easing infrastructure development (motivated by ALNeM)

Development of real distributed applications using a simulator

Application Simulation

With GRAS

Research & Development

Code

Development

Without GRAS

Research

Code Simulation Application

rewrite

Code

◮ Framework for Rapid Development of Distributed Infrastructure

◮ Develop and tune on the simulator; Deploy in situ without modification

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 8/16

slide-16
SLIDE 16

The GRAS project

◮ Goal: Easing infrastructure development (motivated by ALNeM)

Development of real distributed applications using a simulator

  • API

GRDK GRE SimGrid

With GRAS

Research & Development

Code

Development

Without GRAS

Research

Code Simulation Application

rewrite

Code

GRAS

◮ Framework for Rapid Development of Distributed Infrastructure

◮ Develop and tune on the simulator; Deploy in situ without modification

How: One API, two implementations

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 8/16

slide-17
SLIDE 17

The GRAS project

◮ Goal: Easing infrastructure development (motivated by ALNeM)

Development of real distributed applications using a simulator

  • API

GRDK GRE SimGrid

With GRAS

Research & Development

Code

Development

Without GRAS

Research

Code Simulation Application

rewrite

Code

GRAS

◮ Framework for Rapid Development of Distributed Infrastructure

◮ Develop and tune on the simulator; Deploy in situ without modification

How: One API, two implementations

◮ Efficient Grid Runtime Environment (result = application = prototype)

◮ Performance concern: efficient communication of structured data

How: Efficient wire protocol (avoid data conversion)

◮ Portability concern: because of grid heterogeneity

How: ANSI C + autoconf + no dependency

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality The ALNeM project 8/16

slide-18
SLIDE 18

Presentation outline

Introduction The ALNeM project Experimentations Evaluation methodology Experiments on a real platform Experiments on simulator

Renater platform GridG platforms

Conclusions

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 9/16

slide-19
SLIDE 19

Evaluation methodology

Goal: Quantify similarity between initial and reconstructed platforms. Not so easy!

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 10/16

slide-20
SLIDE 20

Evaluation methodology

Goal: Quantify similarity between initial and reconstructed platforms. Not so easy!

4 evaluation approaches

◮ Visual evaluation (structural comparison) ◮ Compare end-to-end measurements (communication-level) ◮ Compare application running time (application-level)

  • Comm. schema

// comm # steps Token-ring Ring No 1 Broadcast Tree No 1 All2All Clique Yes 1 Parallel Matrix Multiplication Hypercube Yes √procs

◮ Compare interference amount:

Interf ((a, b) , (c, d)) = 1 iff BW (a → b) BW (a → b c → d) ≈ 0.5

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 10/16

slide-21
SLIDE 21

Evaluation methodology

Goal: Quantify similarity between initial and reconstructed platforms. Not so easy!

4 evaluation approaches

◮ Visual evaluation (structural comparison) ◮ Compare end-to-end measurements (communication-level) ◮ Compare application running time (application-level)

  • Comm. schema

// comm # steps Token-ring Ring No 1 Broadcast Tree No 1 All2All Clique Yes 1 Parallel Matrix Multiplication Hypercube Yes √procs

◮ Compare interference amount:

Interf ((a, b) , (c, d)) = 1 iff BW (a → b) BW (a → b c → d) ≈ 0.5

Apply all approaches on several platform

◮ In simulation: collect data on “real” platform, compare reconstructed to initial ◮ In situ (some comparisons not applicable)

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 10/16

slide-22
SLIDE 22

Experiments on a real platform

The Grid’5000 platform

◮ Test-bed for Grid researchers ◮ 9 sites in France, targets 5000 procs (2500 currently)

Graphical evaluation

gdx bordeaux toulouse capricorne sagittaire icluster2 helios azur lille parasol paravent

Real platform

gdx parasol paravent lille icluster2 helios azur sagittaire capricorne toulouse bordeaux

Latency spanning tree

gdx paravent parasol lille sagittaire capricorne icluster2 helios azur toulouse bordeaux

Bandwidth spanning tree

◮ Some links are missing, of course ◮ Bandwidth induced graph better, but maybe G5K more focused on bandwidth

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 11/16

slide-23
SLIDE 23

Experiments on a real platform: Grid’5000

gdx bordeaux toulouse capricorne sagittaire icluster2 helios azur lille parasol paravent

Real platform

gdx parasol paravent lille icluster2 helios azur sagittaire capricorne toulouse bordeaux

Latency spanning tree

gdx paravent parasol lille sagittaire capricorne icluster2 helios azur toulouse bordeaux

Bandwidth spanning tree

End-to-end measurement

◮ Compare real measurements to the one in simulator on reconstructed platform

0.5 1 2

Performance factor Clique TreeBW TreeLat Bandwidth Latency

◮ Clique very good, but trivial result ◮ Latency:

◮ Over-estimated when missing links ( longer path) ◮ Under-estimated when routing on G5K optimizes

bandwidth

◮ Bandwidth mis-estimated:

◮ Technical issue in simulator:

assumes constant TCP window size but it varies with clusters in G5K (simulator validation issue)

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 12/16

slide-24
SLIDE 24

Experiments on simulator: Renater platform

◮ Real platform built manually (real measurements + admin feedback)

Visual evaluation

Toulouse Marseille Lyon Strasbourg Paris Rennes Poitiers Nice

Real platform

Toulouse Marseille Lyon Strasbourg Paris Rennes Poitiers Nice

Latency spanning tree

Toulouse Marseille Lyon Strasbourg Nice Poitiers Rennes Paris

Bandwidth spanning tree

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 13/16

slide-25
SLIDE 25

Experiments on simulator: Renater platform

◮ Real platform built manually (real measurements + admin feedback)

Visual evaluation

Toulouse Marseille Lyon Strasbourg Paris Rennes Poitiers Nice

Real platform

Toulouse Marseille Lyon Strasbourg Paris Rennes Poitiers Nice

Latency spanning tree

Toulouse Marseille Lyon Strasbourg Nice Poitiers Rennes Paris

Bandwidth spanning tree

End to end measurements

0.5 1 2

Performance factor Clique Bandwidth Latency TreeBW TreeLat

◮ Again, clique very good (of course) ◮ TreeBW:

◮ Very good for bandwidth ◮ Underestimates latency

◮ TreeLat not satisfactory on this criterion

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 13/16

slide-26
SLIDE 26

Experiments on simulator: Renater platform

Application-level measurements

0.5 1 2

Execution time factor Clique TreeBW TreeLat all2all bcast pmm token ◮ Token and broadcast: same conclusion than end2end ◮ All2all and pmm: completely new light

◮ Clique dramatically underestimates times:

No contention between parallel communication

◮ Tree* overestimate times: Missing links (as before) ◮ Effect more visible for all2all than pmm:

all2all only performs communications

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 14/16

slide-27
SLIDE 27

Experiments on simulator: Renater platform

Application-level measurements

0.5 1 2

Execution time factor Clique TreeBW TreeLat all2all bcast pmm token ◮ Token and broadcast: same conclusion than end2end ◮ All2all and pmm: completely new light

◮ Clique dramatically underestimates times:

No contention between parallel communication

◮ Tree* overestimate times: Missing links (as before) ◮ Effect more visible for all2all than pmm:

all2all only performs communications

Interference measurements

500 1000

# occurences Correct positives False positives False negatives # actual interferences Clique TreeBW TreeLat

◮ ∀a, b, c, d, Interfinit(ab, cd) ?

= Interfrecons(ab, cd)

◮ Tree*

◮ Most existing interferences right ◮ Lot of false positive

◮ Clique

◮ No false positive ◮ Very few actual interferences

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 14/16

slide-28
SLIDE 28

Experiments on simulator: GridG platforms

◮ GridG is a synthetic platform generator [Lu, Dinda – SuperComputing03]

Generates realistic platforms

◮ Experiment: 15 platforms (40 hosts – default GridG parameters)

End to end measurements

Clique TreeBW TreeLat

0.015625 0.03125 0.0625 0.125 0.25 0.5 1 2 4 8 16

Performance factor Bandwidth Latency

Application-level measurements

0.5 1 2 4 8

Execution time factor Clique TreeBW TreeLat all2all bcast pmm token

Interpretation

◮ Results are not better on these platforms ◮ Worse: they are not even stable over the experiments

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Experimentations 15/16

slide-29
SLIDE 29

Conclusions

Contributions

◮ Completed a framework for reconstruction algorithm evaluation

◮ Several criterion of similarity between initial and reconstructed platforms

visual (structural), end-to-end, application timings, interferences

◮ Allows comparison of reconstruction algorithms from application POV ◮ Runs on simulator or in-situ thanks to GRAS (& SimGrid)

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Conclusions 16/16

slide-30
SLIDE 30

Conclusions

Contributions

◮ Completed a framework for reconstruction algorithm evaluation

◮ Several criterion of similarity between initial and reconstructed platforms

visual (structural), end-to-end, application timings, interferences

◮ Allows comparison of reconstruction algorithms from application POV ◮ Runs on simulator or in-situ thanks to GRAS (& SimGrid)

◮ Compared classical algorithms from the literature

◮ Evaluated algorithms: Clique, Bandwidth or Latency Spanning Tree ◮ Evaluation condition: Real platform, Simulator (manual and synthetics) ◮ Conclusion: None of these algorithms are satisfactory ◮ Spanning trees: miss edges, leading to performance under-estimation ◮ Clique: do not capture any existing interference

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Conclusions 16/16

slide-31
SLIDE 31

Conclusions

Contributions

◮ Completed a framework for reconstruction algorithm evaluation

◮ Several criterion of similarity between initial and reconstructed platforms

visual (structural), end-to-end, application timings, interferences

◮ Allows comparison of reconstruction algorithms from application POV ◮ Runs on simulator or in-situ thanks to GRAS (& SimGrid)

◮ Compared classical algorithms from the literature

◮ Evaluated algorithms: Clique, Bandwidth or Latency Spanning Tree ◮ Evaluation condition: Real platform, Simulator (manual and synthetics) ◮ Conclusion: None of these algorithms are satisfactory ◮ Spanning trees: miss edges, leading to performance under-estimation ◮ Clique: do not capture any existing interference ◮ Conclusion2: We search for a routed graph (wrong route perf. over-estimation)

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Conclusions 16/16

slide-32
SLIDE 32

Conclusions

Contributions

◮ Completed a framework for reconstruction algorithm evaluation

◮ Several criterion of similarity between initial and reconstructed platforms

visual (structural), end-to-end, application timings, interferences

◮ Allows comparison of reconstruction algorithms from application POV ◮ Runs on simulator or in-situ thanks to GRAS (& SimGrid)

◮ Compared classical algorithms from the literature

◮ Evaluated algorithms: Clique, Bandwidth or Latency Spanning Tree ◮ Evaluation condition: Real platform, Simulator (manual and synthetics) ◮ Conclusion: None of these algorithms are satisfactory ◮ Spanning trees: miss edges, leading to performance under-estimation ◮ Clique: do not capture any existing interference ◮ Conclusion2: We search for a routed graph (wrong route perf. over-estimation)

Future works

◮ Better reconstruction algorithms (ongoing)

◮ Add some links to spanning trees to improve them ◮ TreeBW and TreeLat both use only half of the info. Can we combine them?

◮ Other measurements from the sensors (new inputs to algorithms)

◮ Interference (but very expensive to acquire), packet loss, etc.

  • M. Quinson, L. Eyraud-Dubois

Automatically Built Network Representations Quality Conclusions 16/16