for Microblog Search A Preliminary Study Maram Hasanain, Rana - - PowerPoint PPT Presentation

for microblog search
SMART_READER_LITE
LIVE PREVIEW

for Microblog Search A Preliminary Study Maram Hasanain, Rana - - PowerPoint PPT Presentation

Query Performance Prediction for Microblog Search A Preliminary Study Maram Hasanain, Rana Malhas, Tamer Elsayed 11 July 2014 SoMeRA14 Workshop in conjunction with SIGIR14 Why? sigir awards Expectation high quality results Reality


slide-1
SLIDE 1

Query Performance Prediction for Microblog Search

A Preliminary Study

Maram Hasanain, Rana Malhas, Tamer Elsayed

11 July 2014 SoMeRA’14 Workshop in conjunction with SIGIR’14

slide-2
SLIDE 2

Why?

Expectation high quality results Reality Some queries are difficult Poor results

sigir awards

2

slide-3
SLIDE 3

What’s QPP?

3

Query Performance Prediction (QPP)

estimated performance

Query Retrieval model Result list (R) sigir awards

slide-4
SLIDE 4

QPP in Microblog Search?

  • QPP is not a new problem
  • Microblog search is different

4

RQ1: How well the existing state-of-the-art predictors perform in the context of microblog search? RQ2: Will the predictors performance be

consistent across different retrieval models, specifically temporal ones?

slide-5
SLIDE 5

Setup of the Study …

5

slide-6
SLIDE 6

Overview

  • Examine frequently-used predictors for tweets

search

  • 2 types of predictors:
  • Content-based: consider terms in tweets and

queries

  • Temporal: also consider time factor
  • 2 types of retrieval models:
  • Content-based

e.g. Query Likelihood

  • Temporal

e.g. Time-based Exponential Priors

6

slide-7
SLIDE 7

QP Predictors

Content-based predictors

  • Standard deviation (σ)
  • Normalized Standard Deviation (NSD)
  • Normalized Query Commitment (NQC)
  • KL-divergence
  • Clarity (CLR)
  • Information Gain
  • Weighted information gain (WIG)

7

Post- retrieval

slide-8
SLIDE 8

QP Predictors

  • Inverse document frequency (IDF)

SumIDF, MaxIDF, AvgIDF,…

  • Collection-query similarity (SCQ)

SumSCQ, MaxSCQ, AvgSCQ,…

  • Simplified clarity score (SCS)

8

Pre- retrieval

slide-9
SLIDE 9

QP Predictors

Temporal predictor

  • KL-divergence

Temporal Clarity (t-CLR)

9

Post-retrieval

slide-10
SLIDE 10

Retrieval Models

Content-based

  • Query Likelihood (QL)

Temporal

  • QL with temporal prior (t-EXP)
  • Temporal relevance modeling (t-QRM)

10

slide-11
SLIDE 11

Evaluation

11

slide-12
SLIDE 12

Setup

Datasets Evaluating retrieval

Evaluation measure: Average precision (AP)

12

Tweets2011 Tweets2013

Source TREC’11-12 TREC’13 Tweets ~16M ~243M Queries 108 60 Time span ~2 weeks ~2 months

slide-13
SLIDE 13

Setup

Evaluating prediction

  • Correlation between predicted AP & actual AP.
  • Linear correlation: Pearson’s r
  • Rank correlation: Kendall’s-τ

Training/Testing

  • 75% of queries for parameter tuning
  • Repeat and average with 120 trials

13

slide-14
SLIDE 14

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60

QL t-EXP t-QRM Pearson’s correlation Retrieval model t-CLR CLR WIG NSD NQC SumIdf

Results (Tweets2011)

t-CLR is best SumIdf: Comparable quality CLR: Decline in quality 14

NQC: Increase in performance

Not significant WIG: Decline in quality

slide-15
SLIDE 15

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60

QL t-EXP t-QRM Pearson’s correlation Retrieval model t-CLR CLR WIG NSD NQC SumIdf

Results (Tweets2013)

CLR is best t-CLR has good performance NQC: Increase in performance 15 Not significant

slide-16
SLIDE 16

Combining Predictors

  • Using linear regression
  • Feature selection to find best predictors

combination

  • Only over Tweet2011
  • 40% of queries for parameter tuning
  • Train & test combined model by cross-validation

with 60% of queries.

16

slide-17
SLIDE 17

{t -CLR,CLR,WIG,SCS} {t -CLR,WIG,SCS} {t-CLR,NQC,NSD,SumIDF} 0.00 0.10 0.20 0.30 0.40 0.50 0.60

QL t-EXP t-QRM Pearson's correlation Retrieval model

Combined Best

Combining Predictors (Tweets2011)

Pre-retrieval predictors in best combinations t-CLR in best combinations 46.5% 27.8% 21.6%

17

slide-18
SLIDE 18

Summary

  • First comprehensive study focusing on testing QPP

in microblog search with different retrieval models.

  • Temporal predictors might be more suitable for

microblog search.

  • Combining predictors improved prediction quality.
  • Some pre-retrieval predictors are showing

promising results.

18

slide-19
SLIDE 19

Future Work

  • Experiment with more temporal predictors &

retrieval models

  • Develop new…
  • Temporal predictors
  • Predictors considering tweet-specific features
  • Use QPP in …
  • Selective query expansion
  • Dynamic query expansion

19

slide-20
SLIDE 20

Thank You 

20