Streaming space complexity of nearly all functions of one variable - - PowerPoint PPT Presentation

streaming space complexity of nearly all functions of one
SMART_READER_LITE
LIVE PREVIEW

Streaming space complexity of nearly all functions of one variable - - PowerPoint PPT Presentation

Streaming space complexity of nearly all functions of one variable Vladimir Braverman, Stephen Chestnut, David P. Woodruff, Lin F. Yang January 7, 2016 A stream of m = 7 items from [ n ] = [4] 4 , 2 , 3 , 2 , 4 , 2 , 2 0 0


slide-1
SLIDE 1

Streaming space complexity of nearly all functions of one variable

Vladimir Braverman, Stephen Chestnut, David P. Woodruff, Lin F. Yang

January 7, 2016

slide-2
SLIDE 2

A stream of m = 7 items from [n] = [4] 4, 2, 3, 2, 4, 2, 2 f =         f 2

i =0

slide-3
SLIDE 3

A stream of m = 7 items from [n] = [4] 4, 2, 3, 2, 4, 2, 2 f =     1     f 2

i =

1

slide-4
SLIDE 4

A stream of m = 7 items from [n] = [4] 2, 3, 2, 4, 2, 2 f =     1 1     f 2

i =

2

slide-5
SLIDE 5

A stream of m = 7 items from [n] = [4] 3, 2, 4, 2, 2 f =     1 1 1     f 2

i =

3

slide-6
SLIDE 6

A stream of m = 7 items from [n] = [4] 2, 4, 2, 2 f =     2 1 1     f 2

i =

6

slide-7
SLIDE 7

A stream of m = 7 items from [n] = [4] 4, 2, 2 f =     2 1 2     f 2

i =

9

slide-8
SLIDE 8

A stream of m = 7 items from [n] = [4] 2, 2 f =     3 1 2     f 2

i =

14

slide-9
SLIDE 9

A stream of m = 7 items from [n] = [4] 2 f =     4 1 2     f 2

i =

21

slide-10
SLIDE 10

A stream of m = 7 items from [n] = [4] f =     4 1 2     f 2

i =

21 How much storage for a streaming (1 ± ǫ)-approximation to

i f 2 i ?

slide-11
SLIDE 11

Classify g : Z≥0 → R Is there a streaming (1 ± ǫ)-approximation for

i g(fi)

using only poly( 1

ǫ log nm) bits?

Previous works g(x) = 1(x = 0): [FM85],[KNW10] g(x) = xp: [F85],[AMS96],[IW05],[I06] g(x) = x log x: [CDM06],[CCM07],[HNO08] monotonic g: [BO10],[BC15] ǫ = Ω(

1 polylog(n))

m = poly(n) g(0) = 0 g(x) > 0, ∀x > 0

slide-12
SLIDE 12

Recursive Subsampling [Indyk & Woodruff 2005]

An α-heavy hitter is any item i∗ such that g(fi∗) ≥ α

i g(fi).

Theorem (Braverman & Ostrovsky 2010)

ǫ2 log3 n-heavy hitters ⇒ (1 ± ǫ)-approximation to

  • i

g(fi).

slide-13
SLIDE 13

Recursive Subsampling [Indyk & Woodruff 2005]

An α-heavy hitter is any item i∗ such that g(fi∗) ≥ α

i g(fi).

Theorem (Braverman & Ostrovsky 2010)

ǫ2 log3 n-heavy hitters ⇒ (1 ± ǫ)-approximation to

  • i

g(fi). Heavy hitters by CountSketch[Charikar, Chen & Farach-Colton 2002] Find i∗ such that f 2

i∗ ≥ α i f 2 i

Estimate fi∗ O(α−1 log2 n) bits.

slide-14
SLIDE 14

Three properties are sufficient and almost necessary for ˜ O(1) bits g(x) x

slide-15
SLIDE 15

Slow-jumping g(x) x x y g(x) g(y) g(y) g(x) y x 2 YES: g(x) = x2 log x NO: g(x) = x3

slide-16
SLIDE 16

Slow-dropping g(x) x x g(x) y g(y) g(y) g(x) 1 YES: g(x) = Θ( 1 log x ) NO: g(x) = Θ(1 x )

slide-17
SLIDE 17

Predictable g(x) x g(x) g(y)

  • y−x≪x

g(y) = (1 ± ǫ)g(x)

  • r

g(y − x) g(x) YES: g(x) = (2 + sin x)1(x > 0) NO: g(x) = (2 + sin x)x2

slide-18
SLIDE 18

Predictable g(x) x g(x) g(y) g(y − x)

  • y−x≪x

g(y) = (1 ± ǫ)g(x)

  • r

g(y − x) g(x) YES: g(x) = (2 + sin x)1(x > 0) NO: g(x) = (2 + sin x)x2

slide-19
SLIDE 19

Three properties are sufficient and almost necessary for O(1) bits slow-jumping

g(y) g(x)

y

x

2, slow-dropping g(y) g(x), and predictable whenever 0 < y − x ≪ x g(y) = (1 ± ǫ)g(x) or g(y − x) g(x). g(x) lower bound fails x3 Ω(n1/3) slow-jumping 1/x Ω(n) slow-dropping g(x) = (2 + sin x)x2 Ω(n) predictability

slide-20
SLIDE 20

Almost necessary? 2−i(x) x 1

1 2 1 4

i(x) = max{j ∈ N : 2j divides x}