M d li U B h i d I t ti Modeling User Behavior and Interactions Lecture 2: Interpreting Behavior Data Eugene Agichtein Emory University Emory University
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Modeling User Behavior and Interactions M d li U B h i d I t ti - - PowerPoint PPT Presentation
Modeling User Behavior and Interactions M d li U B h i d I t ti Lecture 2: Interpreting Behavior Data Eugene Agichtein Emory University Emory University Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
2
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Relevance Feedback (RF)
3
Adapted from: M. Hearst, SUI, 2009
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
7
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Initial query
Initial query
Δ Δ
Δ
R i d
x non-relevant documents
Revised query
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
∈ ∈
nr j r j
D d j nr D d j r m
r r
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
vector feedback positive vector query
vector query ⋅ + ⋅ = β α T i ll β vector feedback negative p ⋅ −γ β Typically, γ < β 4 8 4 8 Original query
1 2 4 1 2 1 1 4 2 4 8 2 8 4 4 0 16 Positive Feedback Negative feedback
25 γ
2 1 1 4
3 7 0 -3 8 4 4 0 16 Negative feedback
25 . = γ
New query 7 q y
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
3 0 528 04/04/90 Science Panel Backs NASA Satellite Plan But Urges
Launches of Smaller Probes
Staying Within Budget
for Climate Research 6 0 524 08/22/90 Report Provides Support for the Critics Of Using Big
Satellites to Study Climate
Canada
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
12
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
2 0 500 08/13/91 NASA H 't S d I i S t t
Do Some Spy Work of Their Own py
5 0 492 12/02/87 T l i ti T l f T C i
Rocket Launchers 8 0 490 06/14/90 R f S lli B S A T C $90 Milli
13
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
14
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
15
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
16
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
17
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Nk= total number of docs containing tk
18
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
19
20
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
– K1, K2, … KR
1 2 R
3 2 1 1 ⎟ ⎞ ⎜ ⎛
76 . 5 3 3 2 1 1 3 1 ≈ ⎟ ⎠ ⎞ ⎜ ⎝ ⎛ + + ⋅
21
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
– contribution of ith rank position:
i
y
contribution of ith rank position: – Ex: has DCG score of
45 . 5 ) 6 log( 1 ) 5 log( ) 4 log( 1 ) 3 log( 3 ) 2 log( 1 ≈ + + + +
– best possible ranking as score NDCG = 1
22
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Jürgen Koenemann and Nicholas J. Belkin. (1996) A Case For Interaction: A Study of Interactive Information Retrieval Behavior and Effectiveness. CHI 1996
23
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
24
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Opaque Penetrable
25
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Penetrable interface required fewer iterations to arrive at final query +15% +17 34% +17-34% Penetrable RF performed 15% Penetrable RF performed 15% better than opaque and transparent
26
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Relevance feedback improves results 66% of the
Requires >= 5 judged documents, otherwise unstable Requires queries for which the set of relevant
27
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
28
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
29
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
Segment Object Class Examine Retain View Listen Select (click) Print Bookmark
Retain Print Bookmark Save Purchase D l t Subscribe
Reference Delete Copy / paste Quote Forward Reply
Annotate Link Cite Mark up Rate Organize Publish
30
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
31
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
[J hi t l 2005] Click [Joachims et al., 2005]
32
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
[Agichtein et al., 2006] Click
ency All queries PTR=1
Higher clickthrough at top non-relevant than at top relevant document
ck Freque PTR=1 PTR=3 ative Clic
1 2 3 5 10
Rel
Result Position
33
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
[Agichtein et al., 2006] Click
0.144 0.14 0.16
PTR=1 PTR=3
0 063 0.08 0.1 0.12
cy deviation
PTR=3
0 001 0.010 0 001 0.063 0.02 0.04 0.06
Click frequenc
1 2 3 5 10
Result position C Result position 34
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
0.2 0.4 viation
1
1 2 3 4 5
h Frequency Dev
2 3 4 Click Click
0.4 Result position Clickthroug
4 5 6 Click
Result position
7 8
35
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
36
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
Click
Click Model for Web Search Ranking, WWW 2009
37
Click
Click Model for Web Search Ranking, WWW 2009
38
Click
Click Model for Web Search Ranking, WWW 2009
Eugene Agichtein, Emory University, IR Lab
39
Click
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
40
[Piwowarski et al., 2009] Click
Building query chains Building query chains Analysing the chains Analysing the chains Validation of the model Validation of the model
based on time deltas & query
Bayesian Network (BN) model
clicked documents
q y similarities
Trees with features from the BN
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
41
[Pi ki t l 2009] Click Grouping atomic sessions Grouping atomic sessions [Piwowarski et al., 2009] Time threshold Time threshold Similarity threshold Similarity threshold
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
42
[Pi ki t l 2009] Click [Piwowarski et al., 2009]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
43
[Piwowarski et al., 2009] Click Probability (Chain state=… / observations) = (0.2, 0.4, 0.01, 0.39, 0)
Probability (Search state=… / observations) = (0.1, 0.42, …)
Probability (Page state=… / observations) = (0.25, 0.2, …)
Probability (Click state=… / observations) = (0.02, 0.5, …)
44
Probability ([not] Relevant / observations) = (0 4 0 5)
44
= (0.4, 0.5)
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
44
[Piwowarski et al., 2009] Click
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
45
[Piwowarski et al., 2009] Click
p ,
J2 ith sessions of S2 J2 with sessions of S2
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
46
[Piwowarski et al., 2009] Click
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
47
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
48
[Fox et al., 2003]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
49
[Fox et al., 2003]
Result-Level Session-Level Diff Secs, Duration Secs Averages of result-level measures (Dwell Time and Position) S crolled, ScrollCnt, AvgS ecsBetweenS croll, TotalS crollTime, MaxS croll Query count TimeToFirstClick, TimeToFirstS croll Results set count Page Page Position Absolute Position Results visited Page, Page Position, Absolute Position Results visited Visits End action Exit Type ImageCnt, PageS ize, ScriptCnt ageC t, ageS e, Sc ptC t Added to Favorites, Printed
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
50
[F t l 2003]
[Fox et al., 2003]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
51
[Fox et al., 2003]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
52
[Fox et al., 2003] Only clickthrough Only clickthrough Combined measures Combined measures with confidence of > 0.5 (80-20 train/test split)
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
53
[Agichtein et al., 2006]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
54
[Agichtein et al., SIGIR2006] Presentation Presentation ResultPosition ResultPosition Position of the URL in Current ranking Position of the URL in Current ranking QueryTitleOverlap QueryTitleOverlap Fraction of query terms in result Title Fraction of query terms in result Title Clickthrough Clickthrough Clickthrough Clickthrough DeliberationTime DeliberationTime Seconds between query and first click Seconds between query and first click ClickFrequency ClickFrequency Fraction of all clicks landing on page Fraction of all clicks landing on page ClickDeviation ClickDeviation Deviation from expected click frequency Deviation from expected click frequency Browsing Browsing DwellTime DwellTime Result page dwell time Result page dwell time p g p g DwellTimeDeviation DwellTimeDeviation Deviation from expected dwell time for query Deviation from expected dwell time for query Sample Behavior Features
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
55
[Agichtein et al., SIGIR2006]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
56
[Agichtein et al., SIGIR2006]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
57
[Agichtein et al., SIGIR2006]
0.78 0.8 SA+N CD 0.72 0.74 0.76
ion
UserBehavior Baseline 0.66 0.68 0.7
Precis
SA+N
0.6 0.62 0.64 0.1 0.2 0.3 0.4
Recall
58
Segment Object Class Segment Object Class Examine View Listen
Retain Print Bookmark Save Purchase Subscribe
Reference Purchase Delete Subscribe Copy / paste Quote Forward Reply
A t t Quote Reply Link Cite M k R t O i Annotate Mark up Rate Publish Organize
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
59
[Buscher et al 2008]
[Buscher et al., 2008]
2.
Fixation detection and saccade classification
3
Reading (red) and skimming (yellow) detection line by line
3.
Reading (red) and skimming (yellow) detection line by line
See G. Buscher, A. Dengel, L. van Elst: “Eye Movements as Implicit Relevance Feedback”, in CHI '08
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
61
[Buscher et al., 2008]
Input: viewed documents
Gaze-Filter
TF x IDF based on read or ki d
Gaze-Length-
skimmed passages Interest(t) x TF x IDF
Filter
based on length of coherently read text
Reading Speed
ReadingScore(t) x TF x IDF based on read vs. skimmed passages containing term t
Baseline
TF x IDF containing term t based on opened entire documents entire documents
[Buscher et al 2008] [Buscher et al., 2008]
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
63
Eugene Agichtein, Emory University RuSSIR 2009, Petrozavodsk, Karelia
64
Click
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
65
Marti Hearst, Search User Interfaces, 2009, Chapter 6 “Query Reformulation”: http://searchuserinterfaces.com/ Kelly, D. and Teevan, J. Implicit feedback for inferring user preference: a
Joachims, T., Granka, L., Pan, B., Hembrooke, H., and Gay, G. Accurately interpreting clickthrough data as implicit feedback., SIGIR 2005 Agichtein, E., Brill, E., Dumais, S., and Ragno, R. Learning user interaction models for predicting web search result preferences SIGIR 2006 models for predicting web search result preferences, SIGIR 2006 Buscher, G., Dengel, A., and van Elst, L. Query expansion using gaze-based feedback on the subdocument level., SIGIR 2008 Chapelle O and Y Zhang A Dynamic Bayesian Network Click Model for Chapelle, O, and Y. Zhang, A Dynamic Bayesian Network Click Model for Web Search Ranking, WWW 2009 Piwowarski, B, Dupret, G, Jones, R: Mining user web search activity with layered bayesian networks or how to capture a click in its context, y y p , WSDM 2009
Eugene Agichtein, Emory University, RuSSIR 2009 (Petrozavodsk, Russia)
66