IMPROVING PRECISION OF E-COMMERCE SEARCH RESULTS HAYSTACK Europe - - PowerPoint PPT Presentation

improving precision of e commerce search results
SMART_READER_LITE
LIVE PREVIEW

IMPROVING PRECISION OF E-COMMERCE SEARCH RESULTS HAYSTACK Europe - - PowerPoint PPT Presentation

IMPROVING PRECISION OF E-COMMERCE SEARCH RESULTS HAYSTACK Europe 2019 - Berlin 06.11.2019 1 ABOUT US Jens Krsten Tech Lead & Developer Search @otto.de Arne Vogt Business Designer Search @otto.de HAYSTACK Europe 2019 - Berlin


slide-1
SLIDE 1

HAYSTACK Europe 2019 - Berlin 1

IMPROVING PRECISION OF E-COMMERCE SEARCH RESULTS

06.11.2019

slide-2
SLIDE 2

HAYSTACK Europe 2019 - Berlin 06.11.2019

Jens Kürsten Tech Lead & Developer Search @otto.de Arne Vogt Business Designer Search @otto.de

ABOUT US

slide-3
SLIDE 3

About OTTO and otto.de

06.11.2019 3

OTTO‘s headquarter in Hamburg

▪ Founded in 1949 ▪ Number of employees 4,900 ▪ Revenue in 2018/19 3.2 billion Euro

HAYSTACK Europe 2019 - Berlin

▪ On average 1.6 million visits on otto.de per day ▪ Up to 10 ordersper second ▪ More than 3 million items on otto.de ▪ More than 400 OTTO market partners ▪

  • Approx. 6,800 brands on otto.de

▪ Expansion of the business model towards becoming a marketplace

slide-4
SLIDE 4

Key Figures Product Search @otto.de in 2018

06.11.2019 4

Ø search queries per day search queries in 2018

  • max. search queries per day

unique search terms in 2018

~0.9 million ~320 million ~3 million ~40 million

HAYSTACK Europe 2019 - Berlin

slide-5
SLIDE 5

Our Key Requirement for Search Relevance @otto.de

06.11.2019 5 HAYSTACK Europe 2019 - Berlin

BUSINESS USER

Search relevance @otto.de is determinedby

  • our user queries
  • product data (quality)
  • different performance indicators of our products
  • different business goals for different categories

Finding the balance between the user‘s intent and the business‘ perspective is our key requirement for search relevance @otto.de

!

slide-6
SLIDE 6

HAYSTACK Europe 2019 - Berlin 6

WHAT IS THE PROBLEM?

06.11.2019

slide-7
SLIDE 7

One Challenge wrt. Search Relevance @otto.de: Understanding the User‘s Intent

06.11.2019 7

Query results for category searches are often too fuzzy: recall is good, but precision can be quite bad

HAYSTACK Europe 2019 - Berlin

slide-8
SLIDE 8

06.11.2019

One Challenge wrt. Search Relevance @otto.de: Understanding the User‘s Intent

8

Fuzzy search results lead to difficulties in ranking

HAYSTACK Europe 2019 - Berlin

slide-9
SLIDE 9

06.11.2019

One Challenge wrt. Search Relevance @otto.de: Understanding the User‘s Intent

9

Results via navigation deliver much higher precison for the same category

HAYSTACK Europe 2019 - Berlin

slide-10
SLIDE 10

Topical Relevance vs. Business Value

06.11.2019 10 HAYSTACK Europe 2019 - Berlin

10 20 30 40 50 60 70

Impact Rank Position

Topical Relevancevs. Business Value - Query "tie"

Business Value Relevance

slide-11
SLIDE 11

HAYSTACK Europe 2019 - Berlin 12

HOW IS IMPROVING THE PRECISION GOING TO AFFECT THE USER?

06.11.2019

slide-12
SLIDE 12

First Business Objective: Search Effectiveness

06.11.2019 HAYSTACK Europe 2019 - Berlin 13

We regard an order in a search session as a sign of success

Successfulsearch session: Unsuccessful search session:

slide-13
SLIDE 13

Second Business Objective: Search Efficiency

06.11.2019 HAYSTACK Europe 2019 - Berlin 14

We regard a search session with less search interactions as more efficient

5 Search Interactions 1 search order Ratio 5:1 2 Search Interactions 1 search order Ratio 2:1

slide-14
SLIDE 14

Hypothesis for improving the precision

Hypothesis 1: Search Effectiveness We assume that some of our users have a low involvement in the search task or the online shop. They are easily frustrated due to the current lack of precision and leave the shop before they find what they are looking for. → An improvement in precision will therefore lead to a higher search conversion rate Hypothesis 2: Search Efficiency We assume that some of our users have a high involvement in the search task. They will tolerate the lack of precision and still find what they are looking for. It just cost them more effort (time, clicks, thoughts). →An improvement in precision will therefore lead to a lower ratio of search interactions to orders

06.11.2019 HAYSTACK Europe 2019 - Berlin 15

How will an improvement in precision influence our users?

slide-15
SLIDE 15

HAYSTACK Europe 2019 - Berlin 16

OUR APPROACH

06.11.2019

slide-16
SLIDE 16

Our basic discovery approach

06.11.2019 HAYSTACK Europe 2019 - Berlin 17

In our discoveries we loosely follow the design thinking process

understanding the problem finding the solution testing the solution

slide-17
SLIDE 17

Our basic discovery approach

06.11.2019 HAYSTACK Europe 2019 - Berlin 18

In our discoveries we loosely follow the design thinking process

understanding the problem finding the solution testing the solution

slide-18
SLIDE 18

Our Idea for a Solution of the Problem : Automatic Filter Selection

06.11.2019 19 HAYSTACK Europe 2019 - Berlin

Use the data our customers leave behind

slide-19
SLIDE 19

Our Idea for a Solution of the Problem: Automatic Filter Selection

06.11.2019 HAYSTACK Europe 2019 - Berlin 20

Use the data our customers leave behind

clicks & orders filter attribute values for relevance searchterm & product performance filtered search results

slide-20
SLIDE 20

It took us four iterations to define the prototype

06.11.2019 HAYSTACK Europe 2019 - Berlin 22

Iteration 1

Scope: brand searches Insight: potential too low

Iteration 2

Scope: category searches Insight: potential ok, but there might be more

Iteration 3

Scope: all searches Insight: higher potential, but also higher risk

Iteration 4

Scope: Shaping the prototype Insight: Definition of cut-off, decision for data fields and metrics

slide-21
SLIDE 21

Our basic discovery approach

06.11.2019 HAYSTACK Europe 2019 - Berlin 23

In our discoveries we loosely follow the design thinking process

understanding the problem finding the solution testing the solution

slide-22
SLIDE 22

Offline Evaluation of Search Relevance Improvements

06.11.2019 24 HAYSTACK Europe 2019 - Berlin

ONLINE OFFLINE

query and click logs relevance assessment of different configurations judgements

  • n-site testing

new configuration

slide-23
SLIDE 23

Offline Evaluation Architecture

06.11.2019 25 HAYSTACK Europe 2019 - Berlin

queries hits configs metrics # queries # clicks per product (in time slices) query judgement & score pairs (optionallysampled) web shop tracking data

slide-24
SLIDE 24

Metrics in the Making

OFFLINE

  • Topical relevance metrics
  • Precision@n
  • NDCG
  • Average Precision
  • ERR
  • Adressing temporal changes in frequency and significance
  • Traffic weight as metric factor at query-level
  • Adressing significance as business performance predictor
  • Traffic weight * business importance at query-level

06.11.2019 27 HAYSTACK Europe 2019 - Berlin

slide-25
SLIDE 25

06.11.2019 28 HAYSTACK Europe 2019 - Berlin

Offline Evaluation Setup for Automatic Filter Selection

We evaluated 12 configurations based on different product data, interaction data and filter/attribute value selection on a query-set with 100.000 entries

!

assortment category producttype clicks add to baskets x% of interaction precision @ k average precision @ k Product data as filter fields: Interaction data: Filter value selection based on: Evaluated metrics:

OFFLINE

slide-26
SLIDE 26

Filter Attribute Value Selection Strategy

06.11.2019 29 HAYSTACK Europe 2019 - Berlin

Produkttyp Values Clicks Cumulated Sum Coverage LED-Fernseher 100 100 50% 4k Fernseher 80 180 90% Curved TV 10 190 95% Smart TV 5 195 97,5% … … … … … … 200 100%

OFFLINE

slide-27
SLIDE 27

Offline Evaluation Results for Automatic Filter Selection

06.11.2019 30 HAYSTACK Europe 2019 - Berlin

OFFLINE

slide-28
SLIDE 28

Offline Evaluation Results for Automatic Filter Selection

06.11.2019 31 HAYSTACK Europe 2019 - Berlin

Every configuration leads to increased precision.

!

OFFLINE

slide-29
SLIDE 29

Offline Evaluation Results for Automatic Filter Selection

06.11.2019 32 HAYSTACK Europe 2019 - Berlin

Higher attribute granularity → higher precision

!

OFFLINE

slide-30
SLIDE 30

Offline Evaluation Results for Automatic Filter Selection

06.11.2019 33 HAYSTACK Europe 2019 - Berlin

Using click events performs better than using add2basket events.

!

OFFLINE

slide-31
SLIDE 31

Our basic discovery approach

06.11.2019 HAYSTACK Europe 2019 - Berlin 34

In our discoveries we loosely follow the design thinking process

understanding the problem finding the solution testing the solution

slide-32
SLIDE 32

Technical Integration

06.11.2019 35 HAYSTACK Europe 2019 - Berlin

Business Rules Query Preprocessor (querqy)*

"krawatte" => FILTER: class:krawatten

*https://github.com/renekrie/querqy

X

slide-33
SLIDE 33

Query Selection for Auto Filtering

06.11.2019 36 HAYSTACK Europe 2019 - Berlin

230k Queries

  • 1. No Nonsense
  • Identical hit count
  • 0-hits
  • Unclear judgements
  • 2. Business Rules
  • No brands
  • Pos. metric change
  • Hit set >30

40k Filter Rules

slide-34
SLIDE 34

User Interaction Challenge

06.11.2019 37 HAYSTACK Europe 2019 - Berlin

slide-35
SLIDE 35

Data Update Challenges

  • Filtering data removes existing

interaction patterns

  • Missing „trending“ attribute

selections may lead to missing products

  • Frequency of interaction data updates

06.11.2019 38 HAYSTACK Europe 2019 - Berlin

slide-36
SLIDE 36

On-Site Test Results*

06.11.2019 39 HAYSTACK Europe 2019 - Berlin

Hypothesis 1: Search effectiveness An improvement in precision will lead to a higher search conversion rate KPI: conversion rate search Test result: -0,49% Hypothesis 2: Search efficiency An improvement in precision will lead to a lower ratio of search interactions to orders KPI: Ratio of search interactions to search orders Test result: -0,73% (the lower the better)

* only one week of data, not significant (yet)

slide-37
SLIDE 37

We generate data with the A/B-Test…

06.11.2019 HAYSTACK Europe 2019 - Berlin 40

… and use the insights for the next iteration

Next Iteration

slide-38
SLIDE 38

Next Steps // The Future

Products and user interests change over time → a fixed set of filters is not an option on the long term

06.11.2019 41 HAYSTACK Europe 2019 - Berlin

We aim for:

  • a self-learning system:
  • identifying queries with fuzzy search results in need of filtering
  • finding appropriate filter attributes and values

With plenty of query and product features we can train a machine learning algorithm to predict a relation between seachterm and product characteristics, determining a query re-formulation to improve precision

slide-39
SLIDE 39

42 06.11.2019

Connect with us.

jens.kuersten@otto.de @faultfinder80 arne.vogt@otto.de

We are hiring ;)

HAYSTACK Europe 2019 - Berlin