Vox Populi Annotation: Measuring Intensity of Ideological - - PowerPoint PPT Presentation

vox populi annotation measuring intensity of ideological
SMART_READER_LITE
LIVE PREVIEW

Vox Populi Annotation: Measuring Intensity of Ideological - - PowerPoint PPT Presentation

Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments LREC, Marrakech, Morocco, May 28-30, 2008 Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science


slide-1
SLIDE 1

Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments

LREC, Marrakech, Morocco, May 28-30, 2008 Wei-Hao Lin and Alexander Hauptmann Language Technologies Institute School of Computer Science Carnegie Mellon University

1

slide-2
SLIDE 2

Goal: Annotating Intensity of Expressing Ideology at the Sentence Level

2

slide-3
SLIDE 3

Sentence of High Intensity

  • In the first weeks of the Intifada, for example,

Palestinian public protests and civilian demonstrations were answered brutally by Israel, which killed tens of unarmed protesters.

3

slide-4
SLIDE 4

Sentence of Low Intensity

  • The Rhodes aggrements of 1949 set them as

the ceasefire lines between Israel and the Arab states.

4

slide-5
SLIDE 5

Annotating Intensity is Hard

  • Hard to define Strong, Medium, and Weak
  • Hard to train annotators
  • Hard to achieve high inter-rater agreement

5

slide-6
SLIDE 6

Solution: Vox Populi Annotation

  • Aggregate group judgments on a simple,

forced binary question

  • “Which side do you think the sentence was

written from?”

6

slide-7
SLIDE 7

Two Problems

  • How many annotators are needed?
  • Are these group judgments random?

7

slide-8
SLIDE 8

Number of Annotators

  • A statistical testing problem
  • The more annotators, the finer difference in

intensity we can discern.

8

slide-9
SLIDE 9

Number of Annotators

  • 5

10 15 20 25 0.0 0.2 0.4 0.6 0.8 1.0 sample size p value

  • 0.9

0.75 0.6

9

slide-10
SLIDE 10

Reliability

  • Reliable = two groups agree with each other
  • Measure Pearson’s correlation coefficient

10

slide-11
SLIDE 11

Annotation Study

  • 250 sentences from editorials on the Israeli-

Palestinian conflict

  • 18 participants
  • “Do you think the sentence is written from

the Israeli or Palestinian perspective?”

11

slide-12
SLIDE 12

Distribution of Intensity

Vox Populi Intensity Frequency 0.0 0.2 0.4 0.6 0.8 1.0 10 20 30 40 50

12

slide-13
SLIDE 13

Reliability Assessment

  • 1

2 3 4 5 6 0.1 0.0 0.1 0.2 0.3 0.4 0.5 group size correlation

  • Vox Populi

random 0.5 random 0.99

13

slide-14
SLIDE 14

Where to recruit many annotators?

14

slide-15
SLIDE 15

Conclusion

  • Vox Populi Annotation for hard annotation

tasks

  • Solution to two problems in VPA
  • Positive correlation observed in an empirical

annotation study

15