12-CRS-0106 REVISED 8 FEB 2013
10/26/16 1
A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook Shootings
Nan Wang, Blesson Varghese
Queen’s University Belfast
Peter Donnelly University of Toronto
A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook - - PowerPoint PPT Presentation
A Machine Learning Analysis of Twitter Sentiment to the Sandy Hook Shootings Nan Wang, Blesson Varghese Queens University Belfast Peter Donnelly University of Toronto 12-CRS-0106 REVISED 8 FEB 2013 1 10/26/16 Outline Motivation
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 1
Nan Wang, Blesson Varghese
Queen’s University Belfast
Peter Donnelly University of Toronto
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 2
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 3
12-CRS-0106 REVISED 8 FEB 2013
– Apply and evaluate machine learning approaches for sentiment analysis on social network – Provide insights gathered from social networks to decision makers – Engage non-CS audiences with research outputs through interactive visualisation
4 10/26/16
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 5
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 6
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 7
– Baseline – Correction for Volume of Tweets – Correction for Volume of Tweets & Population
g: geographic region t: time frame
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 8
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 9
– N-gram
Uni-gram Bi-gram Tri-gram Not Not sure Not sure if sure sure if sure if gun if if gun if gun shot gun gun shot gun shot or shot shot or shot or fire
firework
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 10
– Hashtags #PrayForNewtown, #NRA, #guncontrol – Reply/Mention Tags @BarackObama, @Death, @cnnbrk
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 11
– Support Vector Machine (SVM) – Naïve Bayes (NB) – Maximum Entropy (ME) – Decision Tree (Single, Bagged, Boosted) – Random Forest (RF) – Neural Network (NN)
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 12
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 13
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 14
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 15
– Timeframe
Friday, 12/07/2012 00:00:01 GMT ~ Tuesday, 01/15/2013 23:59:59 GMT
– Data Size
7 million tweets
– Positive
“The only thing that stops a bad guy with a gun, is a good guy with a gun”
– Negative
“We NEED strict gun control. #Newtwon”
– Neutral
“Not sure if gun shot of firework.”
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 16
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 17
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 18
– Motion Chart
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 19
– Line Graph
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 20
– Geo Map
Baseline PGPSS
12/07/2012 ~ 01/15/2013 12/13/2012 ~ 12/15/2012
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 21
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 22
– emoticon – Part-Of-Speech tagging
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 23
12-CRS-0106 REVISED 8 FEB 2013
10/26/16 24
– Evaluates of machine learning approaches for twitter sentiment analysis – Investigates tweets’ relevance to gun violence – Visualises public sentiment related data on multiple geographic/temporal level interactively
12-CRS-0106 REVISED 8 FEB 2013
25 10/26/16