Probabilities and Provenance on Trees and Treelike Instances Antoine - - PowerPoint PPT Presentation

probabilities and provenance on trees and treelike
SMART_READER_LITE
LIVE PREVIEW

Probabilities and Provenance on Trees and Treelike Instances Antoine - - PowerPoint PPT Presentation

Probabilities and Provenance on Trees and Treelike Instances Antoine Amarilli 1 , Pierre Bourhis 2 , Pierre Senellart 1 , 3 , 4 September 7th, 2016 1 Tlcom ParisTech 2 CNRS CRIStAL 3 National University of Singapore 4 cole normale


slide-1
SLIDE 1

Probabilities and Provenance

  • n Trees and Treelike Instances

Antoine Amarilli1, Pierre Bourhis2, Pierre Senellart1,3,4 September 7th, 2016

1Télécom ParisTech 2CNRS CRIStAL 3National University of Singapore 4École normale supérieure 1/7

slide-2
SLIDE 2

How to travel to Highlights from Paris?

2/7

slide-3
SLIDE 3

How to travel to Highlights from Paris?

2/7

slide-4
SLIDE 4

How to travel to Highlights from Paris?

2/7

slide-5
SLIDE 5

How to travel to Highlights from Paris?

2/7

slide-6
SLIDE 6

How to travel to Highlights from Paris?

(Metro|RER)*|(Bus|Tram)*

2/7

slide-7
SLIDE 7

How to travel to Highlights from Paris?

2/7

slide-8
SLIDE 8

How to travel to Highlights from Paris?

(Metro|RER)*|(Bus|Tram)*

2/7

slide-9
SLIDE 9

How to travel to Highlights from Paris?

50%

(Metro|RER)*|(Bus|Tram)*

2/7

slide-10
SLIDE 10

How to travel to Highlights from Paris?

50%

2/7

slide-11
SLIDE 11

How to travel to Highlights from Paris?

50% 90%

(Metro|RER)*|(Bus|Tram)*

2/7

slide-12
SLIDE 12

How to travel to Highlights from Paris?

(Metro|RER)*|(Bus|Tram)*

50% 90% 42% 37% 90% 83% 78% 72%

2/7

slide-13
SLIDE 13

How to travel to Highlights from Paris?

(Metro|RER)*|(Bus|Tram)*

50% 90% 42% 37% 90% 83% 78% 72%

What is the probability that I can attend Highlights 2016?

2/7

slide-14
SLIDE 14

Problem statement

Input:

? Query Q

(Metro|RER)*|(Bus|Tram)*

3/7

slide-15
SLIDE 15

Problem statement

Input:

? Query Q

(Metro|RER)*|(Bus|Tram)* Database D or graph

3/7

slide-16
SLIDE 16

Problem statement

Input:

? Query Q

(Metro|RER)*|(Bus|Tram)* Database D or graph

% Probabilities on facts or edges

50%

3/7

slide-17
SLIDE 17

Problem statement

Input:

? Query Q

(Metro|RER)*|(Bus|Tram)* Database D or graph

% Probabilities on facts or edges

50%

Output: the probability that the query is true under the distribution (assuming independence of all probabilistic events)

3/7

slide-18
SLIDE 18

Problem statement

Input:

? Query Q

(Metro|RER)*|(Bus|Tram)* Database D or graph

% Probabilities on facts or edges

50%

Output: the probability that the query is true under the distribution (assuming independence of all probabilistic events) Complexity: already #P-hard in the input database! (from #MONOTONE-SAT)

3/7

slide-19
SLIDE 19

Using treewidth to make the problem tractable

4/7

slide-20
SLIDE 20

Using treewidth to make the problem tractable

4/7

slide-21
SLIDE 21

Using treewidth to make the problem tractable

4/7

slide-22
SLIDE 22

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-23
SLIDE 23

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-24
SLIDE 24

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-25
SLIDE 25

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-26
SLIDE 26

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-27
SLIDE 27

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-28
SLIDE 28

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-29
SLIDE 29

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-30
SLIDE 30

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-31
SLIDE 31

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-32
SLIDE 32

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-33
SLIDE 33

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-34
SLIDE 34

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-35
SLIDE 35

Using treewidth to make the problem tractable

Treewidth by example:

4/7

slide-36
SLIDE 36

Using treewidth to make the problem tractable

Treewidth by example:

  • Trees have treewidth 1
  • Cycles have treewidth 2
  • k-cliques and (k − 1)-grids have treewidth k − 1

4/7

slide-37
SLIDE 37

Using treewidth to make the problem tractable

Treewidth by example:

  • Trees have treewidth 1
  • Cycles have treewidth 2
  • k-cliques and (k − 1)-grids have treewidth k − 1

→ Treelike: the treewidth is bounded by a constant

4/7

slide-38
SLIDE 38

Tractability on treelike instances (RER|metro)* |(bus|tram)* MSO query Treelike data

5/7

slide-39
SLIDE 39

Tractability on treelike instances Tree automaton (RER|metro)* |(bus|tram)* MSO query Treelike data

5/7

slide-40
SLIDE 40

Tractability on treelike instances Tree automaton Tree encoding (RER|metro)* |(bus|tram)* MSO query Treelike data

5/7

slide-41
SLIDE 41

Tractability on treelike instances Tree automaton Tree encoding (RER|metro)* |(bus|tram)* MSO query linear [Courcelle] Treelike data Query answer

TRUE

5/7

slide-42
SLIDE 42

Tractability on treelike instances Tree automaton Tree encoding (RER|metro)* |(bus|tram)* MSO query Treelike data

5/7

slide-43
SLIDE 43

Tractability on treelike instances Tree automaton Tree encoding (RER|metro)* |(bus|tram)* MSO query Provenance circuit

linear Treelike data

5/7

slide-44
SLIDE 44

Tractability on treelike instances Tree automaton Tree encoding (RER|metro)* |(bus|tram)* MSO query Provenance circuit

linear Treelike data Probability

42%

linear

5/7

slide-45
SLIDE 45

Tractability on treelike instances Tree automaton Tree encoding (RER|metro)* |(bus|tram)* MSO query Provenance circuit

linear Treelike data Probability

42%

linear

Theorem For any fixed Boolean MSO query q and k ∈ N, given a database D of treewidth ≤ k with independent probabilities, we can compute in linear time the probability that D satisfies q

5/7

slide-46
SLIDE 46

Lower bound

What can we do for unbounded-treewidth instances?

6/7

slide-47
SLIDE 47

Lower bound

What can we do for unbounded-treewidth instances? ... not much.

6/7

slide-48
SLIDE 48

Lower bound

Theorem For any graph signature σ, there is a first-order query q such that for any constructible unbounded-treewidth class I, probability evaluation of q on I is #P-hard under RP reductions

6/7

slide-49
SLIDE 49

Lower bound

Theorem For any graph signature σ, there is a first-order query q such that for any constructible unbounded-treewidth class I, probability evaluation of q on I is #P-hard under RP reductions Proof idea: extract instances of a hard problem as topological minors using recent polynomial bounds [Chekuri and Chuzhoy, 2014]

6/7

slide-50
SLIDE 50

Lower bound

Theorem For any graph signature σ, there is a first-order query q such that for any constructible unbounded-treewidth class I, probability evaluation of q on I is #P-hard under RP reductions

3 1 2 4 3 4 2 1

maps vertices to vertices maps edges to vertex-disjoint paths Proof idea: extract instances of a hard problem as topological minors using recent polynomial bounds [Chekuri and Chuzhoy, 2014]

6/7

slide-51
SLIDE 51

Future and ongoing work

  • Improving the lower bound:
  • From graphs to arbitrary arity databases
  • From FO down to unions of conjunctive queries with =

7/7

slide-52
SLIDE 52

Future and ongoing work

  • Improving the lower bound:
  • From graphs to arbitrary arity databases
  • From FO down to unions of conjunctive queries with =
  • Complexity in query and database — currently Ω
  • 22...2|Q|

× |D|

  • → Which queries can efficiently be compiled to automata?

7/7

slide-53
SLIDE 53

Future and ongoing work

  • Improving the lower bound:
  • From graphs to arbitrary arity databases
  • From FO down to unions of conjunctive queries with =
  • Complexity in query and database — currently Ω
  • 22...2|Q|

× |D|

  • → Which queries can efficiently be compiled to automata?
  • Non-Boolean queries: efficient enumeration of query results?

7/7

slide-54
SLIDE 54

Future and ongoing work

  • Improving the lower bound:
  • From graphs to arbitrary arity databases
  • From FO down to unions of conjunctive queries with =
  • Complexity in query and database — currently Ω
  • 22...2|Q|

× |D|

  • → Which queries can efficiently be compiled to automata?
  • Non-Boolean queries: efficient enumeration of query results?
  • Other tasks: probabilistic conditioning

“Knowing that I’m here, what’s the probability that RER B is up?”

7/7

slide-55
SLIDE 55

Future and ongoing work

  • Improving the lower bound:
  • From graphs to arbitrary arity databases
  • From FO down to unions of conjunctive queries with =
  • Complexity in query and database — currently Ω
  • 22...2|Q|

× |D|

  • → Which queries can efficiently be compiled to automata?
  • Non-Boolean queries: efficient enumeration of query results?
  • Other tasks: probabilistic conditioning

“Knowing that I’m here, what’s the probability that RER B is up?” Thanks for your attention!

7/7

slide-56
SLIDE 56

References I

Chekuri, C. and Chuzhoy, J. (2014). Polynomial bounds for the grid-minor theorem. In STOC. Courcelle, B. (1990). The monadic second-order logic of graphs. I. Recognizable sets

  • f finite graphs.
  • Inf. Comput., 85(1).
slide-57
SLIDE 57

Image credits

  • Slide 2:
  • https://commons.wikimedia.org/wiki/File:

Paris_Metro_map.svg (cropped), user Umx on Wikimedia Commons, public domain

  • http://www.parisvoyage.com/images/cartoon18.jpg,

ParisVoyage, fair use

  • http://www.vianavigo.com/fileadmin/galerie/pdf/CGU_t_.pdf

(cropped), RATP, fair use

  • Slides 4 and 5: https://commons.wikimedia.org/wiki/File:

Carte_Transilien_RER_sch%C3%A9matique.svg (modified), user Benjamin Smith on Wikimedia Commons, license CC BY-SA 4.0 international.