V-Combiner: Speeding-up Iterative Graph Processing on a - - PowerPoint PPT Presentation

v combiner speeding up iterative graph processing on a
SMART_READER_LITE
LIVE PREVIEW

V-Combiner: Speeding-up Iterative Graph Processing on a - - PowerPoint PPT Presentation

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging Azin Heidarshenas , Serif Yesil , Dimitrios Skarlatos , Sasa Misailovic , Adam Morrison*, Josep Torrellas University of Illinois


slide-1
SLIDE 1

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging

Azin Heidarshenas†, Serif Yesil†, Dimitrios Skarlatos†, Sasa Misailovic†, Adam Morrison*, Josep Torrellas†

University of Illinois Urbana-Champaign† Tel-Aviv University*

International Conference on Supercomputing (ICS), June 2020

slide-2
SLIDE 2

Iterative graph processing

2

Update all vertices in parallel Converged? Finish yes Page Rank Community Detection HITS Belief Propagation

Computational complexity ∝ #Iterations

50-200 Iterations parallel for v in vertices for u in v.neighbors … // update v

slide-3
SLIDE 3

Graph processing can be approximate

3

4 1 2 3

Vertex Page Rank 1 0.0510103 3 0.0255164 4 7.3626e-05 2 5.16674e-05

Example: CEO of Company X wants to invest only on the most influential customers in their network

Computing Page Ranks of Vertices 2 and 4 is useless. …

2000 1000 hub hub 2000

slide-4
SLIDE 4

Pruning graphs can be effective

4

Build Graph Graph Algorithm Build Graph Graph Algorithm Pre-processing Compute Pre-processing Compute Time Prune Removing useless computation

Removing certain vertices / edges

(pruning)

Original graph Approximate graph

slide-5
SLIDE 5

Overview of Sparsification and K-core

5

4 1 2 3 4 1 2 3

Sparsification1 Prunes only edges, probabilistically from dense regions K-core2 Prunes vertices (along with their edges), until the remaining vertices have a degree of at least K

[1] Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 2013 [2] K-core decomposition of large networks on a single PC, VLDB, 2015

slide-6
SLIDE 6

At the highest accuracy (~80%), Sparsification achieves 1.6x for Page Rank. Degree of pruning

Limitations of Sparsification and K-core

6

Degree of pruning Desirable speedup > 2x

Accuracy is the ratio of vertices found in the top ranking. Accuracy is the ratio of vertices with correct communities.

High speedup is achieved only at low Accuracy (<60%) for Community Detection.

slide-7
SLIDE 7

Addressing the Limitations

7

4 1 2 3 4 1 2 3

Sparsification1 Prunes only edges, probabilistically from dense regions K-core2 Prunes vertices (along with their edges), until the remaining vertices have a degree of at least K

[1] Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 2013 [2] K-core decomposition of large networks on a single PC, VLDB, 2015

4 1 3 2

V-Combiner Prunes and merges certain vertices into hubs (in the direction of information flow), so that hubs stay connected to the rest of the graph

slide-8
SLIDE 8

Overview of V-Combiner

8

Baseline V-Combiner

Build Graph Graph Algorithm Build Graph Graph Algorithm Pre-processing Compute Pre-processing Compute Recovery Time Prune + Merge More merging vs. pre-processing time vs. performance savings Post-processing

slide-9
SLIDE 9

Different Vertex Merging Scenarios

9

Edges Information flow Example App. Page Rank,

  • Comm. Detection

Directed One-way HITS Directed Two-way Belief Propagation Undirected Two-way

Merge in-neighbors Merge in-neighbors Merge out-neighbors Merge all neighbors

slide-10
SLIDE 10

Supernode Subnode Regular Regular Supernode: Large in-degree (but not too large)

Classification of Vertices in V-Combiner

10

4 1 2 3

Subnode: Small in- and out-degree, at least one supernode in its out- neighborhood Regular: Neither a supernode nor a subnode

Large in-degree for supernodeà More mergings per supernode Small in- and out-degree for subnode à Less distortion after pruning

slide-11
SLIDE 11

Prune + Merge in V-Combiner

11

for e in edges //MERGE if e.dst is a subnode and e.src is NOT a subnode then // Increment in-degree of the supernode by one //PRUNE if e.src is a subnode and e.dst is NOT a subnode then // Decrement in-degree of the e.dst by one

4 1 3 2 4 1 2 3

Vertex Old in-degree New in-degree 1 6 6 2 1 3 5 5 4 2 1

One increment and one decrement cancel out.

slide-12
SLIDE 12

No subnodes in the approximate graph Recover using the in-neighbors’ values and the graph algorithm operator

  • More efficient using Delta graph
  • As if an extra iteration of the algorithm is run, but only for the subnodes

Recovery in V-Combiner

12

Approximate graph 4 1 3 2 4 1 2 3 Delta graph

For Page Rank: Pr[2] = 0.85 Pr[1] / 2 + 0.15

slide-13
SLIDE 13

Evaluation Setup

13

End-to-end speedup measured. 44 Intel Xeon cores, no hyper-threading and DVFS 4 graph applications:

  • Page Rank (PR)
  • Community Detection (CD)
  • Hyperlink-Induced Topic Search (HITS)
  • Belief Propagation (BP)

5 graph inputs

  • Friendster social network (FS)
  • Twitter social network (TW)
  • Page-Level Domain graph (PLD)
  • Arabic domain network (AR)
  • Dbpedia network (DB)
slide-14
SLIDE 14

Accuracy Metrics

14

Top-K Accuracy: The ratio of vertices in the top ranking of the exact result that are also in the top ranking of the approximate result

  • Page Rank
  • HITS
  • Belief Propagation

Classification Accuracy:

The ratio of vertices that have been correctly assigned to their communities

  • Community Detection

Accuracy threshold of 90%.

slide-15
SLIDE 15

End-to-End Performance

15

Build Algorithm

slide-16
SLIDE 16

End-to-End Performance: V-Combiner

16

Prune/Merge Recovery Build Algorithm

1.25 end-to-end speedup at mean accuracy of 91.8%

slide-17
SLIDE 17

End-to-End Performance: Sparsification

17

Prune/Merge Recovery Build Algorithm

Sparsification fails to meet accuracy threshold in 1 benchmark

slide-18
SLIDE 18

End-to-End Performance: K-core

18

Prune/Merge Recovery Build Algorithm

K-core fails to meet accuracy threshold in 4 benchmarks

slide-19
SLIDE 19

More in the Paper

  • Details of other scenarios of the merging
  • Choosing the merging parameters
  • Algorithm performance and accuracy analysis
  • Analysis of connectivity
  • Analysis of the average length of the paths
  • Analysis of pruning/merging parameters

19

slide-20
SLIDE 20

Take-away

  • Iterative graph processing is computationally expensive and

can be approximate.

  • V-Combiner is a pruning + merging + recovery technique
  • It has the following advantages over the state-of-the-art

pruning techniques: – Preserving average length of the paths – Maintaining connectivity – Improving load balancing – Modest pre-processing overhead

20

slide-21
SLIDE 21

V-Combiner: Speeding-up Iterative Graph Processing on a Shared-memory Platform with Vertex Merging

Azin Heidarshenas†, Serif Yesil†, Dimitrios Skarlatos†, Sasa Misailovic†, Adam Morrison*, Josep Torrellas†

University of Illinois Urbana-Champaign† Tel-Aviv University*