Symbolic Search and Abstraction Heuristics for Cost-Optimal Planning
Álvaro Torralba
Advisors: Daniel Borrajo and Carlos Linares López
Universidad Carlos III de Madrid – June 2, 2015
Álvaro Torralba PhD Defense June 2, 2015 1 / 54
◮ International Planning Competition: 1998, 2000, 2002, 2004, 2006,
◮ Standard set of benchmark domains: 1998-2011
◮ Time limit: 30 minutes
◮ Memory limit: 4GB RAM
◮ Coverage: number of problems solved
◮ Time: solve problems faster
◮ Bidirectional search
◮ Symbolic search ⇒ GAMER: winner of IPC 2008
◮ Abstraction heuristics
◮ Variable ordering
◮ Reduction rules
[Figure: decision diagram with nodes labeled "T at A", "P in T" and "P at A", and terminal nodes T and F]
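The effect of the variable ordering and of the reduction rules can be illustrated with a small reduced ordered BDD. The following is only a minimal sketch, not GAMER's implementation: the class `BDD`, its methods and the example function `pairs` are hypothetical names introduced for illustration, and the diagram is built by brute-force Shannon expansion, which is only feasible for a handful of variables.

```python
class BDD:
    """Tiny reduced ordered BDD manager (illustrative sketch only)."""

    def __init__(self, order):
        self.order = list(order)  # fixed variable ordering, top to bottom
        self.unique = {}          # (level, low, high) -> node id; 0 and 1 are terminals

    def mk(self, level, low, high):
        # Reduction rule 1: a test whose two branches coincide is redundant.
        if low == high:
            return low
        # Reduction rule 2: isomorphic nodes are shared through the unique table.
        key = (level, low, high)
        if key not in self.unique:
            self.unique[key] = len(self.unique) + 2
        return self.unique[key]

    def build(self, func, level=0, assignment=()):
        """Build the BDD of a Boolean function by Shannon expansion."""
        if level == len(self.order):
            return 1 if func(dict(zip(self.order, assignment))) else 0
        low = self.build(func, level + 1, assignment + (False,))
        high = self.build(func, level + 1, assignment + (True,))
        return self.mk(level, low, high)

    def size(self):
        return len(self.unique)  # number of internal (non-terminal) nodes


def pairs(v):
    # (x1 and y1) or (x2 and y2) or (x3 and y3)
    return (v["x1"] and v["y1"]) or (v["x2"] and v["y2"]) or (v["x3"] and v["y3"])


interleaved = BDD(["x1", "y1", "x2", "y2", "x3", "y3"])
interleaved.build(pairs)
grouped = BDD(["x1", "x2", "x3", "y1", "y2", "y3"])
grouped.build(pairs)
print(interleaved.size(), grouped.size())  # prints "6 14"
```

The same function needs 6 internal nodes when related variables are adjacent in the ordering and 14 when they are separated, and the gap grows exponentially with the number of pairs.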
◮ Monolithic relation ⇒ may use exponential memory
◮ Solution in GAMER ⇒ One TR for each action
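The equivalence between a monolithic transition relation and one TR per action can be sketched with explicit sets standing in for BDDs. This is an illustration under stated assumptions, not the planner's code: the facts and the STRIPS-style actions below are a hypothetical truck-and-package task, and `transition_relation` and `image` are names introduced here.

```python
from itertools import product

FACTS = ["T at A", "T at B", "P at A", "P at B", "P in T"]

# Hypothetical actions: name -> (preconditions, add list, delete list).
ACTIONS = {
    "move A B": ({"T at A"}, {"T at B"}, {"T at A"}),
    "move B A": ({"T at B"}, {"T at A"}, {"T at B"}),
    "load A": ({"T at A", "P at A"}, {"P in T"}, {"P at A"}),
    "unload B": ({"T at B", "P in T"}, {"P at B"}, {"P in T"}),
}


def all_states():
    """Enumerate every truth assignment to FACTS as a frozenset of true facts."""
    for bits in product([False, True], repeat=len(FACTS)):
        yield frozenset(f for f, b in zip(FACTS, bits) if b)


def transition_relation(action):
    """Set of (state, successor) pairs induced by one action."""
    pre, add, delete = action
    return {(s, (s - delete) | add) for s in all_states() if pre <= s}


def image(states, relation):
    """Successors of `states` under `relation`."""
    return {succ for (pred, succ) in relation if pred in states}


per_action = {name: transition_relation(a) for name, a in ACTIONS.items()}
monolithic = set().union(*per_action.values())

init = {frozenset({"T at A", "P at A"})}
via_monolithic = image(init, monolithic)
# Disjunctive partitioning: compute one image per action, then take the union.
via_partition = set().union(*(image(init, tr) for tr in per_action.values()))
print(via_monolithic == via_partition)  # prints "True"
```

Both computations return the same successor set; the partitioned version never has to build the union of all relations, which is the object that may blow up in a symbolic representation.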
◮ 20: conjunction tree
◮ 100k: aggregate TRs
[Figure: decision diagrams over the variables v1, v2 and v3 with terminal nodes T and F]
◮ Baseline (B)
◮ Not pruning invalid states: M∅
◮ Pruning invalid states: cBDD or e-deletion (e-del)
[Figure: three scatter plots of solving time in seconds (axes from 0 to 2,000): Oh2 and O; e-del and Oh2; e-del and M100k]
[Figure: plot with the series CGAMER-BD and CGAMER-FW]
◮ Analyzed different methods for image computation
◮ Best method: aggregate TRs

◮ Pruning invalid states (especially useful in backward search)
◮ Best encoding for symbolic search: e-deletion
◮ Smaller abstract state space → easier to search
◮ Use optimal distances in abstract state space as heuristic
◮ Preserve transitions → admissible estimation
◮ Ignore some variables in the problem
◮ Limitation: ignoring a single variable may relax too much
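How ignoring variables yields an admissible heuristic, and how much a single ignored variable can cost, can be sketched on a small example. The task below (one truck, one package, two locations) and the names `DOMAINS`, `OPERATORS`, `restrict` and `abstract_distances` are assumptions made for illustration; goal distances in the projected state space are computed with a backward uniform-cost search.

```python
import heapq
from itertools import product

# Hypothetical finite-domain task: a truck moves between A and B, a package
# is at A, at B, or inside the truck (T). Goal: package at B.
DOMAINS = {"truck": ["A", "B"], "package": ["A", "B", "T"]}
GOAL = {"package": "B"}
OPERATORS = [  # (name, preconditions, effects, cost)
    ("move A B", {"truck": "A"}, {"truck": "B"}, 1),
    ("move B A", {"truck": "B"}, {"truck": "A"}, 1),
    ("load A", {"truck": "A", "package": "A"}, {"package": "T"}, 1),
    ("load B", {"truck": "B", "package": "B"}, {"package": "T"}, 1),
    ("unload A", {"truck": "A", "package": "T"}, {"package": "A"}, 1),
    ("unload B", {"truck": "B", "package": "T"}, {"package": "B"}, 1),
]


def restrict(partial, pattern):
    """Drop every variable outside the pattern from a partial assignment."""
    return {v: val for v, val in partial.items() if v in pattern}


def abstract_distances(pattern):
    """Return h(state): optimal goal distance in the projection onto `pattern`."""
    variables = sorted(pattern)
    states = list(product(*(DOMAINS[v] for v in variables)))

    def holds(partial, state):
        return all(state[variables.index(v)] == val for v, val in partial.items())

    # Reversed abstract transitions: successor -> [(predecessor, cost)].
    predecessors = {s: [] for s in states}
    for s in states:
        for _, pre, eff, cost in OPERATORS:
            if holds(restrict(pre, pattern), s):
                succ = dict(zip(variables, s))
                succ.update(restrict(eff, pattern))
                predecessors[tuple(succ[v] for v in variables)].append((s, cost))

    # Backward uniform-cost search from all abstract goal states.
    queue = [(0, s) for s in states if holds(restrict(GOAL, pattern), s)]
    heapq.heapify(queue)
    dist = {}
    while queue:
        d, s = heapq.heappop(queue)
        if s in dist:
            continue
        dist[s] = d
        for pred, cost in predecessors[s]:
            if pred not in dist:
                heapq.heappush(queue, (d + cost, pred))
    return lambda state: dist.get(tuple(state[v] for v in variables), float("inf"))


init = {"truck": "A", "package": "A"}
h_all = abstract_distances({"truck", "package"})(init)
h_package = abstract_distances({"package"})(init)
h_truck = abstract_distances({"truck"})(init)
print(h_all, h_package, h_truck)  # prints "3 2 0"
```

With both variables the estimate equals the true cost of 3. Ignoring the truck lowers it to 2, and ignoring the package removes the goal from the abstraction entirely, so the estimate collapses to 0.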
[Figure: transition systems with transitions labeled moveA,B, moveB,A, load/unloadA and load/unloadB]
[Figure: transition system over the states AA, Ac, AB, BA, Bc and BB, with ✗ marks on some states]
◮ ADD representation of heuristics reduces memory
◮ Variable ordering has a huge impact
◮ Contradicts our hypothesis
(original problem)
(M&S abstraction)
⋆ membership in perimeter
⋆ frontier mapping
[Figure: four log-scale scatter plots of expanded nodes, comparing SPM&S bop 10k with M&S bop 10k, LM-CUT, SP and SPPDB]
◮ Considered very promising for years
◮ Never really able to outperform A∗ or bidirectional uniform-cost search
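For reference, the bidirectional uniform-cost baseline named above can be sketched on an explicit graph. This is a minimal explicit-state illustration, not the symbolic implementation discussed in the talk; the function name `bidirectional_uniform_cost` and the example graph are assumptions, and edge costs are assumed to be non-negative.

```python
import heapq


def bidirectional_uniform_cost(graph, start, goal):
    """Cost of a cheapest start-goal path; `graph` maps node -> [(successor, cost)]."""
    reverse = {}
    for node, edges in graph.items():
        reverse.setdefault(node, [])
        for succ, cost in edges:
            reverse.setdefault(succ, []).append((node, cost))

    edges_of = [graph, reverse]           # index 0: forward, index 1: backward
    dist = [{start: 0}, {goal: 0}]        # best known g-values per direction
    closed = [set(), set()]
    queues = [[(0, start)], [(0, goal)]]
    best = 0 if start == goal else float("inf")

    while queues[0] and queues[1]:
        # No undiscovered path can be cheaper than the sum of both minimum keys.
        if queues[0][0][0] + queues[1][0][0] >= best:
            break
        # Expand in the direction whose frontier has the smaller g-value.
        side = 0 if queues[0][0][0] <= queues[1][0][0] else 1
        g, node = heapq.heappop(queues[side])
        if node in closed[side]:
            continue  # stale queue entry
        closed[side].add(node)
        for succ, cost in edges_of[side].get(node, []):
            if g + cost < dist[side].get(succ, float("inf")):
                dist[side][succ] = g + cost
                heapq.heappush(queues[side], (g + cost, succ))
            if succ in dist[1 - side]:
                # The two searches meet at succ: candidate solution cost.
                best = min(best, dist[side][succ] + dist[1 - side][succ])
    return best


example = {
    "S": [("A", 1), ("G", 5), ("B", 2)],
    "A": [("G", 1)],
    "B": [("G", 2)],
    "G": [],
}
print(bidirectional_uniform_cost(example, "S", "G"))  # prints "2"
```

The search stops as soon as the cheapest open nodes of both directions can no longer combine into a path cheaper than the best meeting point found so far, which is what makes the result optimal without a heuristic.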
⋆ If it succeeds → done!
◮ SymBA∗: a symbolic bidirectional heuristic search algorithm
◮ Bidirectional search in abstract state spaces
◮ Synergy: Symbolic search + Bidirectional search + Perimeter
◮ Future work: domain-independent abstraction strategies (better
◮ GAMER: baseline symbolic planner
◮ Top explicit-state search planners and portfolios
◮ Domain/instance selection, bugs, ...
[Table: results per domain, with columns Barman, Cave, Childsnack, Citycar, Floortile, GED, Hiking, Maintenan, Openstack, Parking, Tetris, Tidybot, Transport, Visitall and Total]
◮ Analysis of image computation
◮ State-invariant pruning