Budget Feasible Mechanisms for Experimental Design Thibaut Horel - - PowerPoint PPT Presentation

budget feasible mechanisms for experimental design
SMART_READER_LITE
LIVE PREVIEW

Budget Feasible Mechanisms for Experimental Design Thibaut Horel - - PowerPoint PPT Presentation

Budget Feasible Mechanisms for Experimental Design Thibaut Horel Joint work with Stratis Ioannidis and S. Muthukrishnan February 26, 2013 Thibaut Horel () February 26, 2013 1 / 1 Motivation Data mining engine Thibaut Horel () February 26,


slide-1
SLIDE 1

Budget Feasible Mechanisms for Experimental Design

Thibaut Horel Joint work with Stratis Ioannidis and S. Muthukrishnan February 26, 2013

Thibaut Horel () February 26, 2013 1 / 1

slide-2
SLIDE 2

Motivation

Data mining engine

Thibaut Horel () February 26, 2013 2 / 1

slide-3
SLIDE 3

Motivation

Data mining engine

c1$ c 3 $ c2$

Thibaut Horel () February 26, 2013 2 / 1

slide-4
SLIDE 4

Motivation

Data mining engine

c1$ c 3 $ c2$ p1$ p 2 $

B$

Thibaut Horel () February 26, 2013 2 / 1

slide-5
SLIDE 5

Motivation

Data mining engine

p1$ p 2 $

B$

Thibaut Horel () February 26, 2013 2 / 1

slide-6
SLIDE 6

Challenges

Value of data? How to optimize it? Strategic users?

Thibaut Horel () February 26, 2013 3 / 1

slide-7
SLIDE 7

Challenges

Value of data? How to optimize it? Strategic users?

Thibaut Horel () February 26, 2013 3 / 1

slide-8
SLIDE 8

Challenges

Value of data? How to optimize it? Strategic users?

Thibaut Horel () February 26, 2013 3 / 1

slide-9
SLIDE 9

Challenges

Value of data? How to optimize it? Strategic users?

Thibaut Horel () February 26, 2013 3 / 1

slide-10
SLIDE 10

Contributions

case of the linear regression deterministic mechanism generalization (randomized mechanism)

Thibaut Horel () February 26, 2013 4 / 1

slide-11
SLIDE 11

Contributions

case of the linear regression deterministic mechanism generalization (randomized mechanism)

Thibaut Horel () February 26, 2013 4 / 1

slide-12
SLIDE 12

Contributions

case of the linear regression deterministic mechanism generalization (randomized mechanism)

Thibaut Horel () February 26, 2013 4 / 1

slide-13
SLIDE 13

Contributions

case of the linear regression deterministic mechanism generalization (randomized mechanism)

Thibaut Horel () February 26, 2013 4 / 1

slide-14
SLIDE 14

Outline

Thibaut Horel () February 26, 2013 5 / 1

slide-15
SLIDE 15

Outline

Thibaut Horel () February 26, 2013 6 / 1

slide-16
SLIDE 16

Reverse auction

set of N sellers: A = {1, . . . , N}; a buyer V value function of the buyer, V : 2A → R+ ci ∈ R+ price of seller’s i good B budget constraint of the buyer

Goal

Find S ⊂ A maximizing V (S) Find payment pi to seller i ∈ S

Thibaut Horel () February 26, 2013 7 / 1

slide-17
SLIDE 17

Reverse auction

set of N sellers: A = {1, . . . , N}; a buyer V value function of the buyer, V : 2A → R+ ci ∈ R+ price of seller’s i good B budget constraint of the buyer

Goal

Find S ⊂ A maximizing V (S) Find payment pi to seller i ∈ S

Thibaut Horel () February 26, 2013 7 / 1

slide-18
SLIDE 18

Reverse auction

set of N sellers: A = {1, . . . , N}; a buyer V value function of the buyer, V : 2A → R+ ci ∈ R+ price of seller’s i good B budget constraint of the buyer

Goal

Find S ⊂ A maximizing V (S) Find payment pi to seller i ∈ S

Thibaut Horel () February 26, 2013 7 / 1

slide-19
SLIDE 19

Reverse auction

set of N sellers: A = {1, . . . , N}; a buyer V value function of the buyer, V : 2A → R+ ci ∈ R+ price of seller’s i good B budget constraint of the buyer

Goal

Find S ⊂ A maximizing V (S) Find payment pi to seller i ∈ S

Thibaut Horel () February 26, 2013 7 / 1

slide-20
SLIDE 20

Reverse auction

set of N sellers: A = {1, . . . , N}; a buyer V value function of the buyer, V : 2A → R+ ci ∈ R+ price of seller’s i good B budget constraint of the buyer

Goal

Find S ⊂ A maximizing V (S) Find payment pi to seller i ∈ S

Thibaut Horel () February 26, 2013 7 / 1

slide-21
SLIDE 21

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-22
SLIDE 22

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-23
SLIDE 23

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-24
SLIDE 24

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-25
SLIDE 25

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-26
SLIDE 26

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-27
SLIDE 27

Objectives

Payments (pi)i∈S must be: individually rational: pi ≥ ci, i ∈ S truthful: reporting one’s true cost is a dominant strategy budget feasible:

i∈S pi ≤ B

Mechanism must be: computationally efficient: polynomial time good approximation: V (OPT) ≤ αV (S) with: OPT = arg max

S⊂A

  • V (S) |
  • i∈S

ci ≤ B

  • Thibaut Horel ()

February 26, 2013 8 / 1

slide-28
SLIDE 28

Known results

When V is submodular: randomized budget feasible mechanism, approximation ratio: 7.91 (Chen et al., 2011) deterministic mechanisms for:

◮ Knapsack: 2 +

√ 2 (Chen et al., 2011)

◮ Matching: 7.37 (Singer, 2010) ◮ Coverage: 31 (Singer, 2012) Thibaut Horel () February 26, 2013 9 / 1

slide-29
SLIDE 29

Known results

When V is submodular: randomized budget feasible mechanism, approximation ratio: 7.91 (Chen et al., 2011) deterministic mechanisms for:

◮ Knapsack: 2 +

√ 2 (Chen et al., 2011)

◮ Matching: 7.37 (Singer, 2010) ◮ Coverage: 31 (Singer, 2012) Thibaut Horel () February 26, 2013 9 / 1

slide-30
SLIDE 30

Known results

When V is submodular: randomized budget feasible mechanism, approximation ratio: 7.91 (Chen et al., 2011) deterministic mechanisms for:

◮ Knapsack: 2 +

√ 2 (Chen et al., 2011)

◮ Matching: 7.37 (Singer, 2010) ◮ Coverage: 31 (Singer, 2012) Thibaut Horel () February 26, 2013 9 / 1

slide-31
SLIDE 31

Outline

Thibaut Horel () February 26, 2013 10 / 1

slide-32
SLIDE 32

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

N users

Thibaut Horel () February 26, 2013 11 / 1

slide-33
SLIDE 33

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

xi: public features (e.g. age, gender, height, etc.)

Thibaut Horel () February 26, 2013 11 / 1

slide-34
SLIDE 34

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

yi: private data (e.g. disease, etc.)

Thibaut Horel () February 26, 2013 11 / 1

slide-35
SLIDE 35

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

Gaussian Linear model: yi = βTxi + εi β∗ = arg min

β

  • i

|yi − βTxi|2

Thibaut Horel () February 26, 2013 11 / 1

slide-36
SLIDE 36

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

c1$ c 3 $ c2$

Thibaut Horel () February 26, 2013 11 / 1

slide-37
SLIDE 37

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

c1$ c 3 $ c2$ p1$ p 2 $

B$

Thibaut Horel () February 26, 2013 11 / 1

slide-38
SLIDE 38

Linear Regression

Linear regression

x1 x2 x3 y2 y1 y3

c1$ c 3 $ c2$ p1$ p 2 $

B$

Thibaut Horel () February 26, 2013 11 / 1

slide-39
SLIDE 39

Experimental design

Public vector of features xi ∈ Rd Private data yi ∈ R Gaussian linear model: yi = βTxi + εi, β ∈ Rd, εi ∼ N(0, σ2) Which users to select? Experimental design ⇒ D-optimal criterion

Experimental Design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to

|S| ≤ k

Thibaut Horel () February 26, 2013 12 / 1

slide-40
SLIDE 40

Experimental design

Public vector of features xi ∈ Rd Private data yi ∈ R Gaussian linear model: yi = βTxi + εi, β ∈ Rd, εi ∼ N(0, σ2) Which users to select? Experimental design ⇒ D-optimal criterion

Experimental Design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to

|S| ≤ k

Thibaut Horel () February 26, 2013 12 / 1

slide-41
SLIDE 41

Experimental design

Public vector of features xi ∈ Rd Private data yi ∈ R Gaussian linear model: yi = βTxi + εi, β ∈ Rd, εi ∼ N(0, σ2) Which users to select? Experimental design ⇒ D-optimal criterion

Experimental Design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to

|S| ≤ k

Thibaut Horel () February 26, 2013 12 / 1

slide-42
SLIDE 42

Budgeted Experimental design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to
  • i∈S

ci ≤ B the non-strategic optimization problem is NP-hard V is submodular previous results give a randomized budget feasible mechanism deterministic mechanism?

Thibaut Horel () February 26, 2013 13 / 1

slide-43
SLIDE 43

Budgeted Experimental design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to
  • i∈S

ci ≤ B the non-strategic optimization problem is NP-hard V is submodular previous results give a randomized budget feasible mechanism deterministic mechanism?

Thibaut Horel () February 26, 2013 13 / 1

slide-44
SLIDE 44

Budgeted Experimental design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to
  • i∈S

ci ≤ B the non-strategic optimization problem is NP-hard V is submodular previous results give a randomized budget feasible mechanism deterministic mechanism?

Thibaut Horel () February 26, 2013 13 / 1

slide-45
SLIDE 45

Budgeted Experimental design

maximize V (S) = log det

  • Id +
  • i∈S

xixT

i

  • subject to
  • i∈S

ci ≤ B the non-strategic optimization problem is NP-hard V is submodular previous results give a randomized budget feasible mechanism deterministic mechanism?

Thibaut Horel () February 26, 2013 13 / 1

slide-46
SLIDE 46

Main result

Theorem

There exists a budget feasible, individually rational and truthful mechanism for budgeted experimental design which runs in polynomial time. Its approximation ratio is: 10e − 3 + √ 64e2 − 24e + 9 2(e − 1) ≃ 12.98

Thibaut Horel () February 26, 2013 14 / 1

slide-47
SLIDE 47

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ V (OPT−i∗) ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-48
SLIDE 48

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ V (OPT−i∗) ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-49
SLIDE 49

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ V (OPT−i∗) ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-50
SLIDE 50

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ L∗ ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-51
SLIDE 51

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ L∗ ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-52
SLIDE 52

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ L∗ ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-53
SLIDE 53

Sketch of proof

Mechanism (Chen et. al, 2011) for submodular V

Find i∗ = arg maxi V ({i}) Compute SG greedily Return:

◮ {i∗} if V ({i∗}) ≥ L∗ ◮ SG otherwise

Valid mechanism, approximation ratio: 8.34 Problem: OPT−i∗ is NP-hard to compute Solution: Replace V (OPT−i∗) with L∗: computable in polynomial time close to V (OPT−i∗) Knapsack (Chen et al., 2011) Coverage (Singer, 2012)

Thibaut Horel () February 26, 2013 15 / 1

slide-54
SLIDE 54

Sketch of proof (2)

L∗ = arg max

λ∈[0,1]n

  • log det
  • Id +
  • i

λixixT

i

  • |

n

  • i=1

λici ≤ B

  • polynomial time? convex optimization problem

close to V (OPT−i∗)?

Technical lemma

L∗ ≤ 2V (OPT) + V ({i∗})

Thibaut Horel () February 26, 2013 16 / 1

slide-55
SLIDE 55

Sketch of proof (2)

L∗ = arg max

λ∈[0,1]n

  • log det
  • Id +
  • i

λixixT

i

  • |

n

  • i=1

λici ≤ B

  • polynomial time? convex optimization problem

close to V (OPT−i∗)?

Technical lemma

L∗ ≤ 2V (OPT) + V ({i∗})

Thibaut Horel () February 26, 2013 16 / 1

slide-56
SLIDE 56

Sketch of proof (2)

L∗ = arg max

λ∈[0,1]n

  • log det
  • Id +
  • i

λixixT

i

  • |

n

  • i=1

λici ≤ B

  • polynomial time? convex optimization problem

close to V (OPT−i∗)?

Technical lemma

L∗ ≤ 2V (OPT) + V ({i∗})

Thibaut Horel () February 26, 2013 16 / 1

slide-57
SLIDE 57

Sketch of proof (2)

L∗ = arg max

λ∈[0,1]n

  • log det
  • Id +
  • i

λixixT

i

  • |

n

  • i=1

λici ≤ B

  • polynomial time? convex optimization problem

close to V (OPT−i∗)?

Technical lemma

L∗ ≤ 2V (OPT) + V ({i∗})

Thibaut Horel () February 26, 2013 16 / 1

slide-58
SLIDE 58

Outline

Thibaut Horel () February 26, 2013 17 / 1

slide-59
SLIDE 59

Generalization

Generative model: yi = f (xi) + εi, i ∈ A prior knowledge of the experimenter: f is a random variable uncertainty of the experimenter: entropy H(f ) after observing {yi, i ∈ S}, uncertainty: H(f | S)

Value function: Information gain

V (S) = H(f ) − H(f | S), S ⊂ A V is submodular ⇒ randomized budget feasible mechanism

Thibaut Horel () February 26, 2013 18 / 1

slide-60
SLIDE 60

Generalization

Generative model: yi = f (xi) + εi, i ∈ A prior knowledge of the experimenter: f is a random variable uncertainty of the experimenter: entropy H(f ) after observing {yi, i ∈ S}, uncertainty: H(f | S)

Value function: Information gain

V (S) = H(f ) − H(f | S), S ⊂ A V is submodular ⇒ randomized budget feasible mechanism

Thibaut Horel () February 26, 2013 18 / 1

slide-61
SLIDE 61

Generalization

Generative model: yi = f (xi) + εi, i ∈ A prior knowledge of the experimenter: f is a random variable uncertainty of the experimenter: entropy H(f ) after observing {yi, i ∈ S}, uncertainty: H(f | S)

Value function: Information gain

V (S) = H(f ) − H(f | S), S ⊂ A V is submodular ⇒ randomized budget feasible mechanism

Thibaut Horel () February 26, 2013 18 / 1

slide-62
SLIDE 62

Generalization

Generative model: yi = f (xi) + εi, i ∈ A prior knowledge of the experimenter: f is a random variable uncertainty of the experimenter: entropy H(f ) after observing {yi, i ∈ S}, uncertainty: H(f | S)

Value function: Information gain

V (S) = H(f ) − H(f | S), S ⊂ A V is submodular ⇒ randomized budget feasible mechanism

Thibaut Horel () February 26, 2013 18 / 1

slide-63
SLIDE 63

Generalization

Generative model: yi = f (xi) + εi, i ∈ A prior knowledge of the experimenter: f is a random variable uncertainty of the experimenter: entropy H(f ) after observing {yi, i ∈ S}, uncertainty: H(f | S)

Value function: Information gain

V (S) = H(f ) − H(f | S), S ⊂ A V is submodular ⇒ randomized budget feasible mechanism

Thibaut Horel () February 26, 2013 18 / 1

slide-64
SLIDE 64

Generalization

Generative model: yi = f (xi) + εi, i ∈ A prior knowledge of the experimenter: f is a random variable uncertainty of the experimenter: entropy H(f ) after observing {yi, i ∈ S}, uncertainty: H(f | S)

Value function: Information gain

V (S) = H(f ) − H(f | S), S ⊂ A V is submodular ⇒ randomized budget feasible mechanism

Thibaut Horel () February 26, 2013 18 / 1

slide-65
SLIDE 65

Conclusion

Experimental design + Auction theory = powerful framework deterministic mechanism for the general case? other learning tasks? approximation ratio ≃ 13. Lower bound: 2

Thibaut Horel () February 26, 2013 19 / 1

slide-66
SLIDE 66

Conclusion

Experimental design + Auction theory = powerful framework deterministic mechanism for the general case? other learning tasks? approximation ratio ≃ 13. Lower bound: 2

Thibaut Horel () February 26, 2013 19 / 1

slide-67
SLIDE 67

Conclusion

Experimental design + Auction theory = powerful framework deterministic mechanism for the general case? other learning tasks? approximation ratio ≃ 13. Lower bound: 2

Thibaut Horel () February 26, 2013 19 / 1