Welcome to the Jet Age How AI and DL Makes Online Shopping Smarter - - PowerPoint PPT Presentation

welcome to the jet age
SMART_READER_LITE
LIVE PREVIEW

Welcome to the Jet Age How AI and DL Makes Online Shopping Smarter - - PowerPoint PPT Presentation

Welcome to the Jet Age How AI and DL Makes Online Shopping Smarter at Walmart Today 1. GPU GA for Smart Merchant Selection 2. Deep Reinforcement Learning for Packing 2 J-Wal 2007. Introduction 3 US eCommerce site dedicated to savings


slide-1
SLIDE 1

Welcome to the Jet Age

How AI and DL Makes Online Shopping Smarter at Walmart

slide-2
SLIDE 2
  • 1. GPU GA for Smart Merchant Selection
  • 2. Deep Reinforcement Learning for Packing

Today

J-Wal 2007.

2

slide-3
SLIDE 3

Introduction

3

US eCommerce site dedicated to savings Revolutionary pricing engine Top technology and fulfillment platform Acquired by Walmart in mid 2016

slide-4
SLIDE 4

Introduction

4

Machine learning and Cognitive computing Numerical HPC algorithm design and implementation Specialized in GPU computing and parallel algorithms Creator of Alea GPU

slide-5
SLIDE 5

Smart Merchant Selection

slide-6
SLIDE 6

Jet Pricing Engine

6

Users shop for products Platform decides about most optimal fulfillment during shopping and at checkout Savings come from

  • Cheapest net item prices
  • Pack items together for fewer boxes to ship
  • Conditions, merchant commission, basket rules
  • More efficient fulfillment
slide-7
SLIDE 7

Savings Potential Larger Carts – More Savings

7

Larger carts – more savings

slide-8
SLIDE 8

Full Search Embarrassingly Parallel

8

Low Prices Cart pricing can be executed independently

2

Large number of ways to fulfill a shopping cart

1

Ideal problem to solve in parallel with GPUs

slide-9
SLIDE 9

Full Search Exponential Complexity

9

Number of combinations = num offers for item 1 * . . . * num offers for item k Complexity =

slide-10
SLIDE 10

Full Search on GPU

10

50x 300x

slide-11
SLIDE 11

Example

11

Somebody wanted to build a computer

slide-12
SLIDE 12

Exponential Complexity

12

= 70’442’237’952’000 combinations Number of combinations = Offers for item 1 * Offers for item 2 * * Offers for item 10 . . . . = 32 * 17 * 19 * 16 * 29 * 9 * 25 * 10 * 16 * 17 * 24 = 1013.85 combinations

slide-13
SLIDE 13

Performance Gap

13

13.85 = Log10 70’442’237’952’000 Real time

Real time gap

slide-14
SLIDE 14

Genetic Algorithm

14

Apply Genetic Algorithms to solve the problem

  • Standard GA does not work
  • Search space is astronomically large
  • Need a reliable high quality approx. solution
  • Calculations in near real-time
  • Rely on AI & ML to choose GA configuration
  • Generation iteration is serial, extending the population size

dramatically allows to reduce iterations

slide-15
SLIDE 15

Genetic Algorithm Schematic Description

15

Generate initial population Evaluate fitness Parents selection Elite selection Crossover and mutation New population Evaluate stopping criterion

AI & ML

slide-16
SLIDE 16

Convergence Embedding with TSNE

16

Initial population

  • 1. generation
  • 4. generation

Best part from full search Special greedy «boundary» points Score rapidly improves Catches best points found by full search

slide-17
SLIDE 17

In the News

17

slide-18
SLIDE 18

Deep Reinforcement Learning for Packing

slide-19
SLIDE 19

Optimal Packing Non-standard Multi-container Loading Problem

19

30 – 40 different container types Choose the best containers to pack in as few containers as possible Respect many constraints Add optional coolant for fresh Minimize waste volume

slide-20
SLIDE 20

Learning Approach

20

GA is powerful but

  • Slow (complex constraints)
  • Hard to move to GPU (constraints, placement heuristics)

Deep Reinforcement Learning

  • More natural (cost resp. reward based)
  • More flexible
  • Bootstrapping with solutions from GA
  • Requires retraining when container types change
slide-21
SLIDE 21

Deep Reinforcement Learning

21

Learning a behavioral strategy which maximizes long term sum of rewards by a direct interaction with an unknown and uncertain environment

Environment

Reward State Action While not terminal do: Agent perceives state st Agent performs action at Agent receives reward rt Environment evolves to state st+1

Agent

slide-22
SLIDE 22

Placements in Containers

22

Free subspaces keep track of potential placements

slide-23
SLIDE 23

Reinforcement Learning Setup

23

Immediate reward

  • Number of containers used so far
  • Waste volume
  • Constraints violations

Final reward

  • Total shipping costs

States

  • Opened containers
  • Free subspaces of each opened

container

  • Remaining boxes to pack

Action

  • Option to open new container
  • Choose an orientation of the box
  • Choose a free subspace in a container

to place the box

slide-24
SLIDE 24

Reinforcement Learning Performance

24

Compare to baseline random search

slide-25
SLIDE 25
slide-26
SLIDE 26
slide-27
SLIDE 27

Questions?