Learning Data Systems Components
Tim Kraska <kraska@mit.edu>
[Disclaimer: I am NOT talking on behalf of Google]
Work partially done at Google
Sorting, B-Tree, Hash-Map, Scheduling, Join, Priority Queue, Bloom Filter, Caching, Range Filter
B-Trees
[Figure: a B-Tree as a hierarchy of key ranges. Inner nodes partition the key space (A-B, B-C, C-G, G-J, K-N, N-R, Q-S, S-U, U-V, V-X, ...), the next level refines it (AA-AL, AL-AP, ..., BA-BE, BI-BL, BL-BR, ...), and a lookup walks from the root down to the leaf page containing the key.]
[Figure: a bookshelf analogy with shelves labeled Children's Books, O'Reilly Books, and Travel Books, holding titles such as Harry Potter, Curious George, The Gruffalo, Make Way for Ducklings, DaVinci Code, The Girl, Bill Bryson, The Source, A Day in the Life, and ML With Python; finding a book means narrowing down by range, just like a B-Tree lookup.]
Hash-Map, Tree, Sorting, Join, Range-Filter, Priority Queue, Scheduling, Cache Policy, Bloom-Filter
Another Example: Index All Integers from 900 to 800M
900 901 902 903 904 905 906 907 908 909 … 800M

[Figure: a multi-level B-Tree built over this dense integer array.]

B-Tree?
A More Concrete Example: Index All Integers from 900 to 800M
900 901 902 903 904 905 906 907 908 909 … 800M

data_array[lookup_key - 900]
Goal: Index All Integers from 900 to 800M

900 901 902 903 904 905 906 907 908 909 … 800M

And if only every second integer is present:

900 902 904 906 908 910 912 914 916 918 … 800M

data_array[(lookup_key - 900) / 2]
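The two slide examples above can be sketched directly (my own illustration, not the talk's code): both lookups reduce to O(1) arithmetic on the key, and both are special cases of a linear model pos = a + b * key.

```python
# Sketch (illustration): O(1) "indexes" for dense keys.
# Dense integers starting at 900: the key itself encodes its position.
data_array = list(range(900, 2000))            # stand-in for keys 900 .. 800M

def lookup_dense(lookup_key):
    return data_array[lookup_key - 900]        # one subtraction, no tree walk

# Every second integer: divide the offset by the stride.
even_array = list(range(900, 2000, 2))         # keys 900, 902, 904, ...

def lookup_even(lookup_key):
    return even_array[(lookup_key - 900) // 2]

# Both are special cases of a linear model: pos = a + b * key.
```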
Traditional data structures (typically) make no assumptions about the data
But knowing the data distribution might allow for significant performance gains and might even change the complexity of data structures (e.g., O(log n) → O(1) for lookups, or O(n) → O(1) for storage)
Building A System From Scratch For Every Use Case Is Not Economical
Conceptually a B-Tree maps a key to a page
[Figure: a B-Tree mapping a key to a page. For simplicity, assume all pages are continuously stored in main memory.]

Alternative view: a B-Tree maps a key to a position with a fixed min/max error - the record lies in [pos, pos + page-size].
[Figure: a Model replacing the B-Tree - it maps a key to a position in the sorted array, with the true position guaranteed to lie in [pos, pos + page-size].]

Finding an item: search the range [pos - err_min, pos + err_max], where err_min and err_max are known from the training process.

This is a form of regression model: learning key → pos is equivalent to modeling the CDF of the (observed) key distribution:

pos_estimate = P(X ≤ key) * #keys = F(key) * #keys
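As a concrete sketch (my own toy example, not from the talk), the position estimate really is just the empirical CDF scaled by the number of keys:

```python
import bisect

# Toy illustration of pos_estimate = F(key) * #keys on a small sorted array.
keys = [3, 7, 7, 10, 15, 22, 22, 22, 30, 41]   # sorted; duplicates allowed

def empirical_cdf(key):
    # P(X <= key) over the observed keys.
    return bisect.bisect_right(keys, key) / len(keys)

def pos_estimate(key):
    return empirical_cdf(key) * len(keys)
```

For key 15, five keys are ≤ 15, so the estimate is 5 - which is why a lookup searches a small error range around the estimate rather than trusting it exactly.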
Idea: replace the B-Tree over the sorted array with a model that maps key → position, trading tree traversal for a few multiplications. For now, assume a read-only sorted array.

Baseline: Cache-Optimized B-Tree
Problem I: Tensorflow is designed for large models
Problem II: B-Trees are great for overfitting
Problem III: B-Trees are cache-efficient
Problem IV: Search does not take advantage of the prediction
Problem I: for small models we use Tensorflow only for training and extract the weights afterwards (i.e., no Tensorflow during inference time)
TuPAQ [SOCC15]
Problem II + III:
Index over 100M records (i.e., 1M pages), page size 100. Each stage narrows the search range:

Precision gain, stage 1: 100M → 1M (min/max error: 1M)
Precision gain, stage 2: 1M → 10k
Precision gain, stage 3: 10k → 100
Solution: Recursive Model Index (RMI)
Model on stage 1: f0(key_type key). Models on stage 2: f1[] (e.g., the first model in the second stage is f1[0](key_type key)).

Lookup code for a 2-stage RMI:

pos_estimate = f1[f0(key)](key)
pos = exp_search(key, pos_estimate, data)

Number of operations with linear regression models:

weights2 = weights_stage2[offset]   (offset = f0(key))
pos_estimate = weights2.a + weights2.b * key
pos = exp_search(key, pos_estimate, data)

→ 2x multiplies, 2x additions, 1x array-lookup
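A runnable sketch of the idea (my own simplification: stage 1 is a fixed linear partition, stage 2 fits one least-squares line per bucket, and the min/max training error bounds the final search window):

```python
import bisect, random

# Sketch of a 2-stage RMI with linear regression models. Names like f0 and
# weights_stage2 follow the slides; the scaffolding around them is my own.
data = sorted(random.Random(0).randrange(10**6) for _ in range(10_000))
N, FANOUT = len(data), 100

def f0(key):
    """Stage 1: map a key to one of the FANOUT stage-2 models."""
    return min(max(int(key * FANOUT / 10**6), 0), FANOUT - 1)

# Fit one least-squares line (key -> position) per stage-2 bucket.
buckets = [[] for _ in range(FANOUT)]
for pos, key in enumerate(data):
    buckets[f0(key)].append((key, pos))

weights_stage2 = []
for pts in buckets:
    xs = [k for k, _ in pts]
    ys = [p for _, p in pts]
    if len(pts) < 2 or max(xs) == min(xs):
        weights_stage2.append((0.0, ys[0] if ys else 0.0))
        continue
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    b = sum((x - mx) * (y - my) for x, y in pts) / sum((x - mx) ** 2 for x in xs)
    weights_stage2.append((b, my - b * mx))        # (slope, intercept)

def predict(key):
    b, a = weights_stage2[f0(key)]   # stage 1 picks the model (array lookup)
    return int(a + b * key)          # stage 2: one multiply + one add

# The min/max error over the training data bounds the "last mile" search.
err = max(abs(pos - predict(key)) for pos, key in enumerate(data))

def lookup(key):
    est = predict(key)
    lo, hi = max(est - err, 0), min(est + err + 1, N)
    i = bisect.bisect_left(data, key, lo, hi)      # search only the error window
    return i if i < N and data[i] == key else -1
```

Because every training key's position was included when computing `err`, the window is guaranteed to contain the key if it exists, which is what makes the worst case degrade gracefully.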
Worst-case performance is that of a B-Tree
[Figure: searching within the error bounds. The model predicts a position near the actual one; instead of plain binary search over [pos - err_min, pos + err_max], the search is biased toward the prediction, e.g., quaternary search with Q1 = prediction - 2x std err and Q3 = prediction + 2x std err.]
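The exp_search called in the lookup code can be sketched like this (my own version of exponential search outward from the prediction; the quaternary-search variant in the figure is a further refinement):

```python
import bisect

# Exponential search from a model prediction: double the step until the key is
# bracketed, then binary-search only that window. Cost is O(log of the model's
# error), not O(log N).
def exp_search(key, pos_estimate, data):
    n = len(data)
    if n == 0:
        return -1
    pos = min(max(pos_estimate, 0), n - 1)
    step = 1
    if data[pos] < key:                       # true position lies to the right
        while pos + step < n and data[pos + step] < key:
            step *= 2
        lo, hi = pos + step // 2, min(pos + step + 1, n)
    else:                                     # true position lies at/left of pos
        while pos - step > 0 and data[pos - step] >= key:
            step *= 2
        lo, hi = max(pos - step, 0), pos - step // 2 + 1
    i = bisect.bisect_left(data, key, lo, hi)
    return i if i < n and data[i] == key else -1
```

The better the prediction, the fewer doubling steps are needed, so search cost shrinks with model accuracy.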
Type           Config                  Lookup time  Speedup  Size (MB)  Size vs. B-Tree
BTree          page size: 128          260 ns       1.0X     12.98 MB   1.0X
Learned index  2nd stage size: 10000   222 ns       1.17X    0.15 MB    0.01X
Learned index  2nd stage size: 50000   162 ns       1.60X    0.76 MB    0.05X
Learned index  2nd stage size: 100000  144 ns       1.67X    1.53 MB    0.12X
Learned index  2nd stage size: 200000  126 ns       2.06X    3.05 MB    0.23X

60% faster at 1/20th the space, or 17% faster at 1/100th the space

Setup: 200M records of map data (e.g., restaurant locations), index on longitude. Intel E5 CPU with 32GB RAM, without GPU/TPUs. No special SIMD optimization (there is a lot of potential).
[Chart: lookup time (ns, 0-350) vs. index size (MB, 0.5-256) for FAST, a fixed-size lookup table, a read-optimized B-Tree with interpolation search, and the learned index; smaller and faster is better.]
A Comparison To ARTful Indexes (Radix-Tree)

Experimental setup: continuous keys from 0 to 256M. Reported lookup throughput: 10M/s ≈ 100ns(1). Size: not measured, but the paper reports an overhead of ≈8 bytes per key (dense, best case): 256M * 8 bytes ≈ 1953MB.

(1) Numbers from the paper

Viktor Leis, Alfons Kemper, Thomas Neumann: The Adaptive Radix Tree: ARTful Indexing for Main-Memory Databases. ICDE 2013
Generated code:

Record lookup(key) { return data[0 + 1 * key]; }

which simplifies to:

Record lookup(key) { return data[key]; }

Lookup latency: 10ns (learned index) vs 100ns* (ARTful)
Space: 0MB vs 1953MB - infinitely better :)
What about Updates and Inserts?
Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, Tim Kraska: A-Tree: A Bounded Approximate Index Structure https://arxiv.org/abs/1801.10207
Updates: training a simple multi-variate regression model can be done in one pass over the data.
Inserts (e.g., Timestamps)

[Figure: new inserts arriving over time at the end of the key space.]

If the learned model can generalize to inserts, insert complexity is O(1), not O(log N): space can be reserved per node in the RMI where inserts are expected, whereas traditional indexes split and rebalance regardless of the distribution.
Hash-Map, Tree, Sorting, Join, Range-Filter, Priority Queue, Scheduling, Cache Policy, Bloom-Filter
[Figure: a key goes into a Hash Function vs. a key goes into a Model; both produce a bucket position.]

Goal: Reduce Conflicts

25% - 70% reduction in Hash-Map conflicts
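A toy sketch of the idea (my own construction, mirroring the slide): replace the random hash function with a model of the key CDF, so keys spread over the buckets in key order instead of colliding at the balls-into-bins rate.

```python
import bisect, random

# Sketch: a CDF model as a hash function. With a good model, n keys spread
# almost perfectly over n buckets.
rng = random.Random(1)
keys = rng.sample(range(10**9), 1000)     # 1000 distinct keys
M = 1000                                  # number of buckets
model = sorted(keys)                      # a "perfect CDF model": the sorted keys

def learned_hash(key):
    rank = bisect.bisect_right(model, key)       # #keys <= key, i.e. F(key) * n
    return min(rank * M // len(model), M - 1)    # scale the CDF to a bucket

def random_hash(key):
    return (key * 2654435761) % M                # Knuth-style multiplicative hash

def conflicts(h):
    used, c = set(), 0
    for k in keys:
        b = h(k)
        c += b in used
        used.add(b)
    return c
```

With the perfect model, the learned hash is essentially order-preserving bucketing and conflicts almost vanish, while the conventional hash collides on many more keys; the 25%-70% figure above is for realistic, imperfect models.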
Type                                                               Time (ns)  Utilization
Stanford AVX Cuckoo, 4 Byte value                                  31 ns      99%
Stanford AVX Cuckoo, 20 Byte record - Standard Hash                43 ns      99%
Commercial Cuckoo, 20 Byte record - Standard Hash                  90 ns      95%
In-place chained Hash-map, 20 Byte record, learned hash functions  35 ns      100%
Hash-Map, Tree, Sorting, Join, Range-Filter, Priority Queue, Scheduling, Cache Policy, Bloom-Filter
Is this key in my set? A Bloom filter answers "no" or "maybe yes" - it can return false positives but never false negatives.

[Figure: a key goes through Hash Functions 1-3 into a bit array (classic Bloom filter) vs. a key goes into a Model (learned version).]

36% space improvement over a Bloom filter at the same false positive rate
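A toy sketch of the construction (my own, not the talk's code): a "learned Bloom filter" is a classifier that predicts membership, plus a small backup structure holding exactly the members the classifier misses, so false negatives are impossible. In the real design the backup is itself a Bloom filter; an exact set is used here only to keep the example tiny.

```python
# Sketch: learned Bloom filter = classifier + backup for the classifier's misses.
in_set = set(range(0, 1000, 2))            # members: even keys below 1000

def model_says_yes(key):
    # Imperfect classifier: "members are keys below 900" - it has false
    # positives (odd keys < 900) and false negatives (even keys 900-998).
    return 0 <= key < 900

backup = {k for k in in_set if not model_says_yes(k)}   # the model's misses

def maybe_contains(key):
    # "Maybe yes" if the model fires, or the backup catches a missed member.
    return model_says_yes(key) or key in backup
```

The classifier absorbs most members, so the backup filter only has to cover the model's false negatives, which is where the space saving comes from.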
Hash-Map, Tree, Sorting, Join, Range-Filter, Priority Queue, Scheduling, Cache Policy, Bloom-Filter

How Would You Design Your Algorithms/Data Structures If You Have a Model for the Empirical Data Distribution?
Traditional data structures usually are carefully, manually tuned for each use case to achieve performance, and they usually increase in size with N.

(This has nothing to do with learned structures - thanks, Alkis, for the analogy.)
How Would You Design Your Algorithms/Data Structures If You Have a Model for the Empirical Data Distribution?

[Chart: time or space vs. N for O(N^2), O(N), O(log N), O(1) - a learned model can move a structure to a lower curve, e.g., data_array[(lookup_key - 900)].]
Tim Kraska <kraska@mit.edu>

Technical Report: Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, Neoklis Polyzotis: The Case for Learned Index Structures