for Microblog Search A Preliminary Study Maram Hasanain, Rana - - PowerPoint PPT Presentation

▶

Apr 18, 2023 320 likes •533 views

Query Performance Prediction for Microblog Search A Preliminary Study Maram Hasanain, Rana Malhas, Tamer Elsayed 11 July 2014 SoMeRA14 Workshop in conjunction with SIGIR14 Why? sigir awards Expectation high quality results Reality

SLIDE 1

Query Performance Prediction for Microblog Search

A Preliminary Study

Maram Hasanain, Rana Malhas, Tamer Elsayed

11 July 2014 SoMeRA’14 Workshop in conjunction with SIGIR’14

SLIDE 2

Why?

Expectation high quality results Reality Some queries are difficult Poor results

sigir awards

SLIDE 3

What’s QPP?

Query Performance Prediction (QPP)

estimated performance

Query Retrieval model Result list (R) sigir awards

SLIDE 4

QPP in Microblog Search?

QPP is not a new problem
Microblog search is different

RQ1: How well the existing state-of-the-art predictors perform in the context of microblog search? RQ2: Will the predictors performance be

consistent across different retrieval models, specifically temporal ones?

SLIDE 5

Setup of the Study …

SLIDE 6

Overview

Examine frequently-used predictors for tweets

search

2 types of predictors:
Content-based: consider terms in tweets and

queries

Temporal: also consider time factor
2 types of retrieval models:
Content-based

e.g. Query Likelihood

Temporal

e.g. Time-based Exponential Priors

SLIDE 7

QP Predictors

Content-based predictors

Standard deviation (σ)
Normalized Standard Deviation (NSD)
Normalized Query Commitment (NQC)
KL-divergence
Clarity (CLR)
Information Gain
Weighted information gain (WIG)

Post- retrieval

SLIDE 8

QP Predictors

Inverse document frequency (IDF)

SumIDF, MaxIDF, AvgIDF,…

Collection-query similarity (SCQ)

SumSCQ, MaxSCQ, AvgSCQ,…

Simplified clarity score (SCS)

Pre- retrieval

SLIDE 9

QP Predictors

Temporal predictor

KL-divergence

Temporal Clarity (t-CLR)

Post-retrieval

SLIDE 10

Retrieval Models

Content-based

Query Likelihood (QL)

Temporal

QL with temporal prior (t-EXP)
Temporal relevance modeling (t-QRM)

SLIDE 11

Evaluation

SLIDE 12

Setup

Datasets Evaluating retrieval

Evaluation measure: Average precision (AP)

Tweets2011 Tweets2013

Source TREC’11-12 TREC’13 Tweets ~16M ~243M Queries 108 60 Time span ~2 weeks ~2 months

SLIDE 13

Setup

Evaluating prediction

Correlation between predicted AP & actual AP.
Linear correlation: Pearson’s r
Rank correlation: Kendall’s-τ

Training/Testing

75% of queries for parameter tuning
Repeat and average with 120 trials

SLIDE 14

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60

QL t-EXP t-QRM Pearson’s correlation Retrieval model t-CLR CLR WIG NSD NQC SumIdf

Results (Tweets2011)

t-CLR is best SumIdf: Comparable quality CLR: Decline in quality 14

NQC: Increase in performance

Not significant WIG: Decline in quality

SLIDE 15

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60

QL t-EXP t-QRM Pearson’s correlation Retrieval model t-CLR CLR WIG NSD NQC SumIdf

Results (Tweets2013)

CLR is best t-CLR has good performance NQC: Increase in performance 15 Not significant

SLIDE 16

Combining Predictors

Using linear regression
Feature selection to find best predictors

combination

Only over Tweet2011
40% of queries for parameter tuning
Train & test combined model by cross-validation

with 60% of queries.

SLIDE 17

{t -CLR,CLR,WIG,SCS} {t -CLR,WIG,SCS} {t-CLR,NQC,NSD,SumIDF} 0.00 0.10 0.20 0.30 0.40 0.50 0.60

QL t-EXP t-QRM Pearson's correlation Retrieval model

Combined Best

Combining Predictors (Tweets2011)

Pre-retrieval predictors in best combinations t-CLR in best combinations 46.5% 27.8% 21.6%

SLIDE 18

Summary

First comprehensive study focusing on testing QPP

in microblog search with different retrieval models.

Temporal predictors might be more suitable for

microblog search.

Combining predictors improved prediction quality.
Some pre-retrieval predictors are showing

promising results.

SLIDE 19

Future Work

Experiment with more temporal predictors &

retrieval models

Develop new…
Temporal predictors
Predictors considering tweet-specific features
Use QPP in …
Selective query expansion
Dynamic query expansion

SLIDE 20

Query Performance Prediction for Microblog Search

A Preliminary Study

Why?

What’s QPP?

QPP in Microblog Search?

RQ1: How well the existing state-of-the-art predictors perform in the context of microblog search? RQ2: Will the predictors performance be

consistent across different retrieval models, specifically temporal ones?

Setup of the Study …

Overview

search

queries

QP Predictors

Content-based predictors

QP Predictors

SumIDF, MaxIDF, AvgIDF,…

SumSCQ, MaxSCQ, AvgSCQ,…

QP Predictors

Temporal predictor

Temporal Clarity (t-CLR)

Retrieval Models

Content-based

Temporal

Evaluation

Setup

Datasets Evaluating retrieval

Evaluation measure: Average precision (AP)

Tweets2011 Tweets2013

Setup

Evaluating prediction

Training/Testing

Results (Tweets2011)

Results (Tweets2013)

Combining Predictors

combination

with 60% of queries.

Combining Predictors (Tweets2011)

Summary

in microblog search with different retrieval models.

microblog search.

promising results.

Future Work

retrieval models

Thank You 