A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook - - PowerPoint PPT Presentation

a machine learning analysis of twitter sentiment to the
SMART_READER_LITE
LIVE PREVIEW

A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook - - PowerPoint PPT Presentation

A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook Shootings Nan Wang, Blesson Varghese Queens University Belfast Peter Donnelly University of Toronto 12-CRS-0106 REVISED 8 FEB 2013 1 10/26/16 Outline Motivation


slide-1
SLIDE 1

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 1

A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook Shootings

Nan Wang, Blesson Varghese

Queen’s University Belfast

Peter Donnelly University of Toronto

slide-2
SLIDE 2

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 2

Outline

  • Motivation
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-3
SLIDE 3

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 3

Outline

  • Motivation
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-4
SLIDE 4

12-CRS-0106 REVISED 8 FEB 2013

A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook Shootings

  • Motivation

– Apply and evaluate machine learning approaches for sentiment analysis on social network – Provide insights gathered from social networks to decision makers – Engage non-CS audiences with research outputs through interactive visualisation

4 10/26/16

slide-5
SLIDE 5

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 5

Outline

  • Motivation & Contribution
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-6
SLIDE 6

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 6

Methodology

slide-7
SLIDE 7

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 7

Methodology

  • Pro-Gun Public Sentiment Score

– Baseline – Correction for Volume of Tweets – Correction for Volume of Tweets & Population

g: geographic region t: time frame

slide-8
SLIDE 8

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 8

Outline

  • Motivation & Contribution
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-9
SLIDE 9

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 9

Machine Learning Approaches

  • Feature Extraction

– N-gram

Uni-gram Bi-gram Tri-gram Not Not sure Not sure if sure sure if sure if gun if if gun if gun shot gun gun shot gun shot or shot shot or shot or fire

  • r
  • r firework
  • r firework

firework

slide-10
SLIDE 10

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 10

Machine Learning Approaches

  • Feature Extraction

– Hashtags #PrayForNewtown, #NRA, #guncontrol – Reply/Mention Tags @BarackObama, @Death, @cnnbrk

slide-11
SLIDE 11

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 11

Machine Learning Approaches

  • Modelling

– Support Vector Machine (SVM) – Naïve Bayes (NB) – Maximum Entropy (ME) – Decision Tree (Single, Bagged, Boosted) – Random Forest (RF) – Neural Network (NN)

slide-12
SLIDE 12

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 12

Machine Learning Approaches

  • Evaluation
slide-13
SLIDE 13

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 13

Machine Learning Approaches

  • Evaluation
slide-14
SLIDE 14

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 14

Outline

  • Motivation & Contribution
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-15
SLIDE 15

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 15

Case Study: Sandy Hook Elementary School Shooting

  • Data Description

– Timeframe

Friday, 12/07/2012 00:00:01 GMT ~ Tuesday, 01/15/2013 23:59:59 GMT

– Data Size

7 million tweets

  • Triple-class Sentiment

– Positive

“The only thing that stops a bad guy with a gun, is a good guy with a gun”

– Negative

“We NEED strict gun control. #Newtwon”

– Neutral

“Not sure if gun shot of firework.”

slide-16
SLIDE 16

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 16

Case Study: Sandy Hook Elementary School Shooting

  • Tweets Statistics
slide-17
SLIDE 17

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 17

Case Study: Sandy Hook Elementary School Shooting

  • Tweets Statistics
slide-18
SLIDE 18

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 18

Case Study: Sandy Hook Elementary School Shooting

  • Visualisation http://www.gunsontwitter.com/

– Motion Chart

slide-19
SLIDE 19

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 19

Case Study: Sandy Hook Elementary School Shooting

  • Visualisation

– Line Graph

slide-20
SLIDE 20

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 20

Case Study: Sandy Hook Elementary School Shooting

  • Visualisation

– Geo Map

Baseline PGPSS

12/07/2012 ~ 01/15/2013 12/13/2012 ~ 12/15/2012

slide-21
SLIDE 21

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 21

Outline

  • Motivation & Contribution
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-22
SLIDE 22

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 22

Limitations

  • Size of Twitter Corpus
  • Complexity of Feature Selection

– emoticon – Part-Of-Speech tagging

  • Trade-off between Performance & Computing

Power

slide-23
SLIDE 23

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 23

Outline

  • Motivation & Contribution
  • Methodology
  • Machine Learning Approaches
  • Case Study: Sandy Hook Elementary School Shooting
  • Limitations
  • Conclusion
  • Q & A
slide-24
SLIDE 24

12-CRS-0106 REVISED 8 FEB 2013

10/26/16 24

Conclusion

  • This paper

– Evaluates of machine learning approaches for twitter sentiment analysis – Investigates tweets’ relevance to gun violence – Visualises public sentiment related data on multiple geographic/temporal level interactively

slide-25
SLIDE 25

12-CRS-0106 REVISED 8 FEB 2013

Q & A

25 10/26/16