Probabilistic Models in Political Science Pablo Barber a Center - - PowerPoint PPT Presentation

probabilistic models in political science
SMART_READER_LITE
LIVE PREVIEW

Probabilistic Models in Political Science Pablo Barber a Center - - PowerPoint PPT Presentation

Probabilistic Models in Political Science Pablo Barber a Center for Data Science New York University www.pablobarbera.com 4 / 54 5 / 54 Two approaches to the study of social media and politics: 1. How social media platforms transform


slide-1
SLIDE 1

Probabilistic Models in Political Science

Pablo Barber´ a Center for Data Science New York University www.pablobarbera.com

slide-2
SLIDE 2
slide-3
SLIDE 3
slide-4
SLIDE 4

4 / 54

slide-5
SLIDE 5

5 / 54

slide-6
SLIDE 6

Two approaches to the study of social media and politics:

  • 1. How social media platforms transform political

communication

. Are social media creating ideological “echo chambers”?

  • 2. Social media as digital traces of political behavior

. Can we infer latent individual traits (e.g. political ideology) from online ties (follows, likes...)?

6 / 54

slide-7
SLIDE 7

Inferring political ideology using Twitter data

I Two common patterns about social behavior:

  • 1. Homophily: clustering in social networks along common

traits (“birds of a feather tweet together”)

  • 2. Selective exposure: preference for information that

reinforces current views and for avoiding opinion challenges.

I Social media networks replicate offline networks. I Key assumption: individuals prefer to follow political

accounts they perceive to be ideologically close.

I These decisions contain information about allocation of

scarce resource (attention).

I Use this information to estimate ideological locations of

politicians and individuals on the latent same scale.

7 / 54

slide-8
SLIDE 8
  • Political Accounts

NYTimeskrugman senrobportman maddow FiveThirtyEight HRC WhiteHouse BarackObama

BarackObama WhiteHouse GOP maddow FoxNews HRC . . .

  • pol. account m

ryanpetrik 1 1 1 1 . . . user 2 1 1 . . . user 3 1 1 . . . user 4 1 1 1 . . . user 5 1 1 . . . . . . user n 1 1 . . . 8 / 54

slide-9
SLIDE 9

Spatial following model

I Users’ and politicians’ ideology (✓i and j) are defined as

latent variables to be estimated.

I Data: “following” decisions, a matrix of binary choices (Yij). I Spatial following model: for n users, indexed by i, and m

political accounts, indexed by j: P(yij = 1|↵j, i, , ✓i, j) = logit−1 ⇣ ↵j + i − (✓i − j)2⌘ where:

↵j measures popularity of politician j i measures political interest of user i is a normalizing constant

More 9 / 54

slide-10
SLIDE 10

Intuition of the model

Probability that Twitter user i follows politician j, as a function of the user’s ideology:

φj1 = −1.51 αj1 = 3.51 φj2 = 1.09 αj2 = 2.59

−2 2

θi, Ideology of Twitter user i Pr(yij = 1)

10 / 54

slide-11
SLIDE 11

Estimation

I Goal of learning:

I ✓i: ideological positions of users i = 1, . . . , n I j: ideological positions of political accounts j = 1, . . . , m

I Likelihood function:

p(y|✓, , ↵, , ) =

n

Y

i=1 m

Y

j=1

logit−1(⇡ij)yij(1 − logit−1(⇡ij))1−yij where ⇡ij = ↵j + i − (✓i − j)2

I Exact inference is intractable → MCMC (approx. inference) I Estimation:

I First stage: HMC in Stan with random sample of Y to compute

posterior distribution of j-indexed parameters.

I Second stage: parallelized MH in R for rest of i-indexed

parameters (assuming independence), on NYU’s HPC.

11 / 54

slide-12
SLIDE 12

Data

I m = list of 620 popular political accounts in the U.S.

→ Legislators, president, candidates, other political figures, media outlets, journalists, interest groups. . .

I n = followers of at least one of these accounts

→ 30.8M users (∼75% of U.S. users) → 100K of these were matched with voter files

I States: AK, CA, FL, OH, PA. I Unique, perfect matches on first and last name, and county.

I Code:

I Method: github.com/pablobarbera/twitter ideology I Applications: github.com/SMAPPNYU/echo chambers I Data collection: streamR, Rfacebook packages for R

(available on CRAN)

I Data analysis: github.com/pablobarbera/pytwools (python) 12 / 54

slide-13
SLIDE 13

Results

  • @sensanders

@HillaryClinton @nancypelosi @VP @BarackObama Median House D Median Senate D @senjohnmccain Median Senate R Median House R @sentedcruz

  • @motherjones

@maddow @NPR @msnbc @nytimes @cnnbrk @washingtonpost @FoxNews @DRUDGE_REPORT @glennbeck @limbaugh

  • @HRC

@glaad @OccupyWallSt @dailykos @aclu @hrw @BrookingsInst @RANDCorporation @CatoInstitute @AEI @Heritage @nra @redstate

Political Actors Media Interest Groups −1.5 0.0 1.5 −1.5 0.0 1.5 −1.5 0.0 1.5

Position on latent ideological scale

13 / 54

slide-14
SLIDE 14

Validation

This method is able to correctly classify and scale Twitter users

  • n the left-right dimension:
  • 1. Political accounts

I Correlation with measures based on roll-call votes.

  • 2. Ordinary citizens

I Individual and aggregate-level survey responses I Voting registration files

It is also able to predict change over time.

14 / 54

slide-15
SLIDE 15

Political elites

Ideal Points of Members of the 113th U.S. Congress

House Senate

  • ρD = 0.63

ρR = 0.46

  • ρD = 0.66

ρR = 0.63

−2 −1 1 2 −2 −1 1 2 −2 −1 1 2

Estimated Twitter Ideal Points Ideology Estimates Based on Roll−Call Votes (Simon Jackman's ideal point estimates)

15 / 54

slide-16
SLIDE 16

Ordinary Users

Comparison with ideology estimates from aggregated surveys (Lax and Phillips, 2012; Tausanovitch and Warshaw, 2013)

AL AZ AR CA CO CT DE FL GA ID IL IN IA KS KY LA ME MD MA MI MN MS MO MT NE NV NH NJ NM NY NC ND OH OK OR PA RI SC SD TN TX UT VT VA WA WV WI WY

ρ = −0.916 40% 45% 50% 55% −0.4 −0.2 0.0 0.2 0.4 0.6

Ideology of Median Twitter User in Each State Mean Liberal Opinion (Lax and Phillips, 2012)

ρ = 0.791

  • −1.0

−0.5 0.0 0.5 −0.6 −0.4 −0.2 0.0 0.2 0.4

Ideology of Median Twitter User in Each City Public Preference Estimate (Tausanovitch and Warshaw, 2013) 16 / 54

slide-17
SLIDE 17

Ordinary Users

Republicans are more conservative than Democrats

  • Arkansas

California Florida Ohio Pennsylvania −2 −1 1 2 Dem Rep Dem Rep Dem Rep Dem Rep Dem Rep

Party Registration θi, Twitter−Based Ideology Estimates

Predictive accuracy for party affiliation is 83%

17 / 54

slide-18
SLIDE 18

Application: Ideology of Presidential Candidates

  • @washingtonpost

@nytimes @msnbc @JimWebbUSA @HillaryClinton @POTUS @LincolnChafee @MotherJones @GovernorOMalley @SenSanders @SenWarren @tedcruz @RealBenCarson @ScottWalker @RandPaul @rushlimbaugh @BobbyJindal @GovernorPerry @GovernorPataki @JohnKasich @marcorubio @DRUDGE_REPORT @RepPaulRyan @GrahamBlog @GovMikeHuckabee @JebBush @RickSantorum @FoxNews @GovChristie @CarlyFiorina @realDonaldTrump @WSJ Average Republican in 114th Congress Average Twitter User Average Democrat in 114th Congress

−2 −1 1 2

Position on latent ideological scale

Twitter ideology scores of potential Democratic and Republican presidential primary candidates

Barber´ a “Who is the most conservative Republican candidate for president?” The Washington Post, June 16 2015

19 / 54

slide-19
SLIDE 19

Application: Twitter as an Ideological Echo Chamber?

Tweets mentioning Obama Tweets mentioning Romney 50K 100K −2 2 −2 2

Estimated Ideology Count of Sent Tweets Obama Romney −2 −1 1 2 −2 −1 1 2 −2 −1 1 2

Estimated Ideology of Author Estimated Ideology of Retweeter

0.00% 0.25% 0.50% 0.75% 1.00% 1.25% % of Tweets

Barber´ a (2015) “Birds of the Same Feather Tweet Together: Bayesian Ideal Point Estimation Using Twitter Data.” Political Analysis

20 / 54

slide-20
SLIDE 20

Application: Twitter as an Ideological Echo Chamber?

Barber´ a, Jost, Nagler, Tucker, & Bonneau (2015) “Tweeting From Left to Right: Is Online Political Communication More Than an Echo Chamber?” Psychological Science

21 / 54

slide-21
SLIDE 21

Other applications

Ideology of media outlets Ideological Asymmetries Multidimensional Policy Spaces

22 / 54

slide-22
SLIDE 22

Two approaches to the study of social media and politics:

  • 1. How social media platforms transform political

communication

. As voters are able to directly interact with politicians, does the quality of political representation improve?

Social media as digital traces of political behavior

. Are legislators’ and citizens’ social media messages a valid proxy for the attention they give to different political issues?

23 / 54

slide-23
SLIDE 23

Political Representation

Public Opinion Policy

24 / 54

slide-24
SLIDE 24

Political Representation

Issues Voters Discuss Issues Legislators Discuss Do Legislators Accurately Represent Voters’ Interests? Who Leads? Who Follows?

Barber´ a, Nagler, Egan, Bonneau, Jost, & Tucker (2014) “Leaders or Followers? Measuring Political Responsiveness in the U.S. Congress Using Social Media Data.” APSA Conference Paper.

25 / 54

slide-25
SLIDE 25

Outline

  • 1. Analyze tweets sent by Members of U.S. Congress and their

followers using topic modeling techniques.

  • 2. Estimate the importance (frequency of discussion) of 100

different issues in the revealed expressed political agenda for legislators and constituents

  • 3. Political Congruence: are Members of Congress

discussing the same set of issues as their constituents?

  • 4. Political Responsiveness: do topics discussed by

Members of Congress temporally precede or follow topics discussed by the voters?

26 / 54

slide-26
SLIDE 26

Data

651,116 tweets by Members of U.S. Congress, from Jan. 1, 2013 to

  • Dec. 31, 2014 (113th Congress), collected by the Social Media and

Political Participation Lab (SMaPP) using Twitter’s Streaming API.

  • ● ●
  • ● ●
  • ● ● ●
  • ● ● ●
  • ● ●
  • ● ●
  • ● ●
  • 10

20 Feb−13 May−13 Aug−13 Nov−13 Feb−14 May−14 Aug−14 Nov−14 Feb−15

Tweets per Week

Members

  • f Congress
  • House Democrats

House Republicans Senate Democrats Senate Republicans

27 / 54

slide-27
SLIDE 27

Citizens’ Tweets

Collected all tweets for 3 samples of citizens:

  • 1. Informed public:

I Followers of 5 major media outlets (CNN, FoxNews, MSNBC,

NYT, WSJ) located in U.S. (filtered by time zone)

I Random sample of 10,000 (out of ∼30M)

  • 2. Republican Party Supporters:

I Follow 3+ Rep MCs and no Dem MCs I Random sample of 10,000 (out of 203,140)

  • 3. Democratic Party Supporters:

I Follow 3+ Dem MCs and no Rep MCs I Random sample of 10,000 (out of 67,843) 28 / 54

slide-28
SLIDE 28

Table : Number of tweets in dataset

Group N Avg. Min Max Tweets House Republicans 238 1,215 70 8,857 267,311 House Democrats 207 1,177 113 5,993 222,491 Senate Republicans 46 1,532 73 6,627 67,412 Senate Democrats 56 1,616 150 10,736 87,307 Informed Public 10K 948 2 5,861 9,487,382

  • Rep. Supporters

10K 1,091 2 8,804 10,911,813

  • Dem. Supporters

10K 1,306 2 5,122 13,058,947

Period of analysis: January 1, 2013 to December 31, 2014.

29 / 54

slide-29
SLIDE 29

Political Representation

Public Opinion Policy Media Media data:

I 273,007 tweets from 36 largest media outlets in U.S. (print,

broadcast, online) over same period.

30 / 54

slide-30
SLIDE 30

From Tweets to Topics

4 steps in our analysis

  • 1. Tweets from Members of Congress are preprocessed and

split by day, party and chamber (N=2,920 documents)

  • 2. Latent Dirichlet Allocation (Blei, 2003):

I Each document is a mixture over K = 100 latent topics. I Topics are distributions over V = 75, 000 n-grams (up to

trigrams, selected by frequency; keeping hashtags)

I Estimated parameters:

ˆ β Distribution of n-grams over topics (K × V) ˆ θ Distribution of topics over documents (K × N)

  • 3. Similar text processing for tweets from citizens and NYT

tweets (split by day and group)

  • 4. Using simulation, compute posterior distribution of ˆ

✓F for

  • bserved n-grams for citizens and media

31 / 54

slide-31
SLIDE 31

Latent Dirichlet allocation (LDA)

I Topic models are powerful tools for exploring large data sets

and for making inferences about the content of documents

!"#$%&'() *"+,#)

+"/,9#)1 +.&),3&'(1 "65%51 :5)2,'0("'1 .&/,0,"'1

  • .&/,0,"'1

2,'3$1 4$3,5)%1 &(2,#)1 6$332,)%1 )+".()1 65)&65//1 )"##&.1 65)7&(65//1 8""(65//1

  • I Many applications in information retrieval, document

summarization, and classification

New+document+ What+is+this+document+about?+

Words+w1,+…,+wN+

θ

Distribu6on+of+topics+

weather+ .50+ finance+ .49+ sports+ .01+

I LDA is one of the simplest and most widely used topic

models

32 / 54

slide-32
SLIDE 32

Latent Dirichlet Allocation

I Document = random mixture over latent topics I Topic = distribution over n-grams

Probabilistic model with 3 steps:

  • 1. Choose ✓i ∼ Dirichlet(↵)
  • 2. Choose k ∼ Dirichlet()
  • 3. For each word in document i:

I Choose a topic zm ∼ Multinomial(✓i) I Choose a word wim ∼ Multinomial(i,k=zm)

where: ↵=parameter of Dirichlet prior on distribution of topics over docs. ✓i=topic distribution for document i =parameter of Dirichlet prior on distribution of words over topics k=word distribution for topic k

33 / 54

slide-33
SLIDE 33

Estimation

I Applications that aggregate by author or day outperform

tweet-level analyses (Hong and Davidson, 2010)

I K is fixed at 100 based on cross-validated model fit.

  • Perplexity

logLikelihood 0.80 0.85 0.90 0.95 1.00 10 20 30 40 50 60 70 80 90 100 110 120

Number of topics Ratio wrt worst value

I Text is parsed with scikit-learn in python I Estimation: Collapsed Gibbs Sampler in C++ (Griffits and

Steyvers, 2004), ported to R by Gr¨ un and Hornik (2011)

34 / 54

slide-34
SLIDE 34

Validation

j.mp/lda-congress-demo

35 / 54

slide-35
SLIDE 35

Congruence

Are Members of Congress discussing the same set of issues as their constituents?

Table : Contemporaneous Pearson Correlations in Topic Distribution

Dem Rep Group Mcs MCs Democratic Members of Congress 1.00 0.22 Republican Members of Congress 0.22 1.00 Informed Public 0.33 0.39 Republican Party Supporters 0.17 0.62 Democratic Party Supporters 0.58 0.33 Media 0.39 0.61

36 / 54

slide-36
SLIDE 36

Responsiveness

Do legislators influence the public? Does the public influence legislators? To explore causal relationships between topic distributions, we use a Granger-causality framework (Granger, 1969):

I Regress proportion of tweets on topic k at time t by each

group on lagged proportions for all groups, using five lags.

I Do legislators’ tweets predict tweets by the public, controlling

for the media, and vice versa? → Changes in tweets as proxies for changes in salience of issues

37 / 54

slide-37
SLIDE 37

Results: Democratic legislators

Public Legislators 0.09 <0.00 Media 0.01 0.25 0.09 0.04

38 / 54

slide-38
SLIDE 38

Results: Republican legislators

Public Legislators 0.03 <0.00 Media 0.01 0.25 0.11 0.07

39 / 54

slide-39
SLIDE 39

Conclusions

  • 1. Social media as variable
  • 2. Social media as data

Future work / open questions:

I More complex generative models for tweets that exploit

platform features (Author-Topic; Dynamic; Hierarchical)

I Text- vs network-based estimates of political ideology I Predicting latent probability to turn out to vote based on

tweet text, using voting registration records

I Multilingual topic modeling I Detecting irony and sarcasm (Trump!) I Identifying bots and spam with user and text features only

40 / 54

slide-40
SLIDE 40

Thanks! website: pablobarbera.com twitter: @p barbera github: pablobarbera

41 / 54

slide-41
SLIDE 41

Backup slides (index)

Model with covariates Model identification Unequal representation Comparative responsiveness

42 / 54

slide-42
SLIDE 42

Model with Covariates

Baseline model: P(yij = 1) = logit−1 ⇣ ↵j + i − (✓i − j)2⌘ Model with geographic covariate: P(yij = 1) = logit−1 ⇣ ↵j + i − (✓i − j)2 + sij ⌘ where sij = 1 if user i and political actor j are located in the same state, and sij = 0 otherwise. ˆ ≈ 1.20 and ˆ ≈ 0.90

43 / 54

slide-43
SLIDE 43

Model with Covariates

Comparing Parameter Estimates Across Different Model Specifications

φ

j, Elites' Ideology Estimates

θi, Users' Ideology Estimates

G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G

−2 −1 1 2 −2 −1 1 2 −2 −1 1 2

Parameter Estimates (Model with Geographic Covariate) Parameter Estimates (Baseline Model)

Index 44 / 54

slide-44
SLIDE 44

Identification

P(yijt = 1) = logit−1 ⇣ ↵j + i − (✓it − j)2⌘ Additive aliasing: = logit−1 ⇣ (↵j + k) + (i − k) − (✓it − j)2⌘ = logit−1 ⇣ ↵j + i − ((✓it + k) − (j − k))2⌘ Multiplicative aliasing: = logit−1 ⇣ ↵j + i − k2 ((✓it − j) × k)2⌘

Index 45 / 54

slide-45
SLIDE 45

Identifying restrictions

Indeterminacy Approach 1 Approach 2 Additive aliasing (1) Fix α0

j = 0 or β0 i = 0

Fix µα = 0 or µβ = 0 Additive aliasing (2) Fix φ0

j = +1 or θ0 i = +1

Fix µφ = 0 or µθ = 0 Multiplicative aliasing Fix φ00

j = −1 or θ00 i = −1

Fix σφ = 1 or σθ = 1

phi theta

G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G GG G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G G

−2.5 0.0 2.5 −2.5 0.0 2.5 Approach 1 Approach 2 −2.5 0.0 2.5 −2.5 0.0 2.5

Estimated value of parameter True value of parameter Index 46 / 54

slide-46
SLIDE 46

Application: Ideology of Media Outlets and Journalists

  • Mother Jones

New Yorker MSNBC Think Progress Daily Kos Slate Al Jazeera NPR News The Atlantic TIME New York Times FiveThirtyEight Los Angeles Times Salon Huffington Post Newsweek ABC News CBS News U.S. News CNN Christian Science Monitor CNBC Washington Post USA Today Financial Times The Economist Reuters Wall Street Journal Forbes Politico The Hill Washington Times Fox News Drudge Report Real Clear Politics Red State −2 −1 1 Estimated Ideological Ideal Point (Accounts Weighted by Number of Followers)

Barber´ a & Sood (2014) “Follow Your Ideology: A Measure of Ideological Location of Media Sources”, MPSA Conference

Index 47 / 54

slide-47
SLIDE 47

Application: Ideology of Media Outlets and Journalists

  • ● ●
  • Fox News

USA Today Wall Street Journal CBS News U.S. News ABC News CNN ABC Washington Post NBC News Los Angeles Times TIME Time Magazine New York Times NPR News MSNBC −1 1 2

φj, Estimated Ideological Ideal Points

Barber´ a & Sood (2014) “Follow Your Ideology: A Measure of Ideological Location of Media Sources”, MPSA Conference

Index 47 / 54

slide-48
SLIDE 48

Application: Ideological Asymmetries in Pol. Comm.

  • Super Bowl

Newtown Shooting Oscars 2014 Syria Winter Olympics Boston Marathon Minimum Wage Marriage Equality Budget

  • Gov. Shutdown

State of the Union 2012 Election 0.00 0.25 0.50 0.75 1.00

Estimated Rate of Cross−Ideological Retweeting (Exponentiated Coefficient from Poisson Regression)

  • Liberals

Conservatives

Barber´ a, Jost, Nagler, Tucker, & Bonneau (2015) “Tweeting From Left to Right: Is Online Political Communication More Than an Echo Chamber?” Psychological Science

Index 48 / 54

slide-49
SLIDE 49

Application: Multidimensional Policy Spaces in Europe

P(yij = 1) = logit1 αi + βj −

d

X

k=1

γd(θik − φjk)2 ! Estimated ideological positions for 120 parties in 28 European countries

Left-Right Dimension Pro/Anti-European Union Dimension

Grunen AN0 2011 FPO OVP SPO FDL ERC C CD&V CDA CDU−CSU KD CU Cs ODS DL PO CpE Syriza Cons DPP DIMAR SLD PD D66 Ecol. Reform Verts Fine Gael Centre M5S LNNK FDP FI GD ZZA GL Green Green U ANEL Labour PL PvdA Lab PiS SEL FG V Left Lib LibDem FP Venstre Mod. Coalition FN PN NEOS ND N−VA NSi LN VLD Sinn Fein PASOK PVV PC VVD Piraten Podemos PSL PP IRL MR Reform SNP SDS CDS SAP SocDem SPD SD PS SP PS sp.a SF Fianna Fail PSOE SD Linke Finns UPM UCD UDI UPyD UKIP IU SP Unity

r=0.81

0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0

Ideology estimates (Surveys) Ideology estimates (Twitter)

Grunen AN0 2011 FPO OVP SPO FDL ERC C CD&V CDA KD CU Cs ODS DL PO Syriza Cons DPP DIMAR SLD PD D66 Ecol. Reform Verts Fine Gael Centre M5S LNNK FDP FI GD ZZA GL Green Green U ANEL Labour PL PvdA Lab PiS SEL FG V Left Lib LibDem FP Venstre Mod. Coalition FN PN NEOS ND N−VA LN VLD Sinn Fein PASOK PVV PC VVD Piraten Podemos PSL PP IRL MR Reform SNP SDS CDS SAP SocDem SPD SD SP PS sp.a SF Fianna Fail PSOE SD Linke Finns UPM UCD UPyD UKIP IU SP Unity

r=0.77

0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0

Ideology estimates (Surveys) Ideology estimates (Twitter)

Barber´ a, Popa, & Schmitt (2015) “Analyzing the Common Multidimensional Political Space for Voters, Parties, and Legislators in Europe”, MPSA Conference

Index 49 / 54

slide-50
SLIDE 50

Unequal representation

We also analyze whether correspondence between citizens and legislators is higher for:

I Co-partisans (party supporters) I Issues owned by each party (e.g. economy for Republicans;

social issues for Democrats)

I Constituents (vs general public) I Informed public vs random sample of U.S. Twitter users I Individuals with income above median

50 / 54

slide-51
SLIDE 51

Electoral Institutions and Political Representation

What institutional configurations foster better representation? Theoretical expectations Country Government Instit. Congr. Responsiv. Germany Coalition Prop. High Low Spain Single-party Prop. Medium Medium UK Coalition Maj. Medium Medium France Single-party Maj. Low High Barber´ a & Bølstad (2015) “A Comparative Study of the Quality

  • f Political Representation Using Social Media Data”, EPSA

Conference Paper.

Index 51 / 54

slide-52
SLIDE 52

Electoral Institutions and Political Representation

j.mp/EPSA-lda-demo

Index 52 / 54