An Introduction to Visual Analysis of Social Networks Nan Cao @ - - PowerPoint PPT Presentation

an introduction to visual analysis of social networks
SMART_READER_LITE
LIVE PREVIEW

An Introduction to Visual Analysis of Social Networks Nan Cao @ - - PowerPoint PPT Presentation

An Introduction to Visual Analysis of Social Networks Nan Cao @ HKUST nancao@cse.ust.hk April 2011 Agenda Introductions to visual analysis Community representation Analysis on Rich Context Social Medias Introduction Equation Tag


slide-1
SLIDE 1

An Introduction to Visual Analysis

  • f Social Networks

Nan Cao @ HKUST nancao@cse.ust.hk April 2011

slide-2
SLIDE 2

Agenda

  • Introductions to visual analysis
  • Community representation
  • Analysis on Rich Context Social Medias
slide-3
SLIDE 3

Introduction

  • How can we understand and interpreted the analysis

results in an intuitive way ?

  • The data mining results are not 100% correct, how can

we estimate the errors and refine them precisely ?

Equation Tag Clouds extracted from “Mining Organizational Structure in Social Network”

slide-4
SLIDE 4

Introduction

  • Traditional data mining techniques

– An automatic analysis process bases on various models for different purposes – Maximize the power of machines

  • Traditional data visualization techniques

– Leverage human’s capability on pattern recognition and represent the multidimensional data in an intuitive way using various visual encodings – Maximize the power for human beings

  • Visual Analysis

– A semi-automatic analysis process that combines analysis model (DM), visual representation (Visualization) as well as user interactions (HCI) together. – Seamlessly connect humans with machines for the analysis purpose

slide-5
SLIDE 5

Introduction

slide-6
SLIDE 6

Introduction

Raw Data Abstract Data

Data Mining

Layout / Coloring / Sizing

filtering Visual Form

Display

rendering View

Reference Model For Information Visualization and Visual Analysis

References [1] Readings in Information Visualization: Using Vision to Think, Stuart K. Card, Jock Mackinlay, Ben

  • Shneiderma. 1999

[2] prefuse: A Toolkit for Interactive Information Visualization, Jeffery Heer, Stuart K. Card, James A. Landay, ACM sigCHI, 2005 User

interactions

slide-7
SLIDE 7

Visualization On Social Networks

www.visualcomplexity.com

slide-8
SLIDE 8

Visualization On Social Networks

www.visualcomplexity.com Visualization is not to generate beautiful figures. More importantly, it help users to understand the information insights

slide-9
SLIDE 9

Agenda

  • Introductions to visual analysis
  • Community representation
  • Analysis on Rich Context Social Medias

Raw Data Abstract Data

Data Mining

Layout / Coloring / Sizing

filtering Visual Form

Display

rendering View User

interactions

slide-10
SLIDE 10

Community (Cluster) Representations

  • Graph Layout Problem

– Graph layout, as a branch of graph theory, applies topology and geometry to derive two-dimensional representations of graphs – Wikipedia

  • Layouts for cluster representations

– Group the nodes with strong connections together (same as community detection). – Reduce overlaps of the nodes – Minimize the average edge length (reduce line crossings) – Keep a good symmetry of the graph (It is easier for users to identify patterns in a symmetry structure)

slide-11
SLIDE 11

Graph Layout

Orthogonal Hierarchical Force-Directed

Edge oriented Structure

  • riented

Cluster

  • riented

Hierarchy

  • riented

Radial

slide-12
SLIDE 12

Graph Layout

Orthogonal Hierarchical Force-Directed

Edge oriented Structure

  • riented

Cluster

  • riented

Hierarchy

  • riented

Radial

Graph layout, as a branch of graph theory, applies topology and geometry to derive 2D representations of graphs – Wikipeia

slide-13
SLIDE 13

13

Force-directed graph layout

  • Graph layout = Energy minimization
  • Hence, the drawing algorithm is an

iterative optimization process

  • Convergence to global minimum is not

guaranteed!

Layou t Ene rgy Radom Layout Fine Result

slide-14
SLIDE 14

Force-directed graph layout

  • Cluster Properties

– Proximity preservation: similar nodes are

drawn closely

  • Aesthetical properties

– Symmetry preservation: isomorphic sub-

graphs are drawn identically

– Minimized Edge length: reduce edge

intersections

– No external influences: “Let the graph speak

for itself”

slide-15
SLIDE 15

15

Spring Embed Model

 Edges are springs  Vertices are repelling particles  Force on vertex:

fuv is force on spring

guv is repelling force

 

 

 

V u uv E v u uv

g f v F

} , {

) (

References: [3]A heuristic for drawing graph, P.Eades, 84. [4]Graph Drawing by Force-Directed Graph, Fruchterman, 91. [5]Drawing Graph Nicely Using Simulated Annealing, Davidson, 96. [6]A Fast Adaptive Layout Algorithm for Undirected Graphs, Frick, 94. [7]Spring Algorithms and Symmetry, Eades and X Lin, 99

slide-16
SLIDE 16

Model Comparison

MDS:

Clustering Model Layout Model

 

 

j i ij j i

d X X

2

|| || min

 

 

j i ij j i

d X X d

2 2

|| || 1 min

Spectrum:

 

LX X Tr

T

min

 

 

n E j i j i ij T

X X LX X

, 2

) ( min 2 1 min 

Spring Embed Model [3-7] MDS Layout Model [8]

[8] Graph Drawing by Stress Majorization, 2002, Graph Drawing [9] An r-Dimensional Quadratic Placement Algorithm, Kenneth M. Hall, 1970 [10] ACE: A fast multiscale eigenvector computation for drawing huge graphs, Y.Koren, L. Carmel and D. Harel, InfoVis 2002

Spectrum Model [9, 10]

slide-17
SLIDE 17

Agenda

  • Introductions to visual analysis
  • Community representation
  • Explorative Analysis on Rich Context Social

Media

Raw Data Abstract Data

Data Mining

Layout / Coloring / Sizing

filtering Visual Form

Display

rendering View User

interactions

slide-18
SLIDE 18

Rich Context Social Network

Each vertex has

multiple attributes

age / sex / jobs location : city /county /state contact : emails / phones Degree / Closeness / Betweenness / Spectrum friends colleagues classmate family

The vertexes are connected by multiple relations

  • How to analysis the network topology by considering multiple

relationships?

  • How to analysis the network beyond the graph topology by

considering the vertex attributes?

slide-19
SLIDE 19

Visual Analysis on Complex Relational Patterns (1)

[11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007 Demo:http://www.youtube.com/watch?v=7G3MxyOcHKQ

slide-20
SLIDE 20

Visual Analysis on Complex Relational Patterns (1)

[11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007 Demo:http://www.youtube.com/watch?v=7G3MxyOcHKQ

slide-21
SLIDE 21

Visual Analysis on Complex Relational Patterns (1)

[11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007 Demo:http://www.youtube.com/watch?v=7G3MxyOcHKQ

slide-22
SLIDE 22

Visual Analysis on Complex Relational Patterns (2)

[12] FacetAtlas: Multifaceted Visualization for Rich Text Corpora, Nan Cao, et al. IEEE TVCG 2010

slide-23
SLIDE 23

23

multiple facets

  • Symptoms
  • Treatments
  • Causes
  • Tests & Diagnosis
  • Prognosis
  • Prevention
  • Complications
slide-24
SLIDE 24

24

Type2

Metabolic Syndrome

Type1

Gestational Diabetes

Diabetes

(Q1) How to model the document contents into multifaceted relation data? (Q2) How to intuitively visualize multifaceted document contents and their relations? (Q3) How to find the insight patterns visually driven by users’ interests?

slide-25
SLIDE 25

25

Type2

Metabolic Syndrome

Type1

Gestational Diabetes

Diabetes

(Q1) How to model the document contents into multifaceted relation data? (Q2) How to intuitively visualize multifaceted document contents and their relations? (Q3) How to find the insight patterns visually driven by users’ interests?

How to visualize the relations

  • f multifaceted document contents?
slide-26
SLIDE 26

26

(Q1) How to model the document contents into multifaceted relational data ?

document set entity set multifaceted entity relational data model facet segmentation symptom disease treatment entity extraction

type 1 diabetes type 2 diabetes take medications blood sugar control thirst blurred vision

Internal relations External relations

slide-27
SLIDE 27

Rich Context Social Network

Each vertex has

multiple attributes

age / sex / jobs location : city /county /state contact : emails / phones Degree / Closeness / Betweenness / Spectrum friends colleagues classmate family

The vertexes are connected by multiple relations

  • How to analysis the network topology by considering multiple

relationships?

  • How to analysis the network beyond the graph topology by

considering the vertex attributes?

slide-28
SLIDE 28

Visual Analysis on Multidimensional Patterns (1)

[13] The FlowVizMenu and Parallel Scatterplot Matrix: Hybrid Multidimensional Visualizations for Network Exploration. IEEE TVCG 2010 Demo: http://www.youtube.com/watch?v=f9Z0mPOnG_M

  • Centrality :

– Degree – Closeness – Betweenness – Eigenvector

  • Cluster Coefficient
  • Node Index

Scatter Plot Matrix

slide-29
SLIDE 29

Parallel Coordinates

Index Cluster Coef Degree Eigenvector Closeness min max [14] A. Inselberg and B. Dimsdale. Parallel coordinates: a tool for visualizing multi- dimensional geometry, InfoVis 2000

slide-30
SLIDE 30

P-SPLOMs

  • Combine the parallel

coordinates with the scatter plot matrix

– Provide flexible interactions and let users to explore the whole dataset from multiple aspects will help on the pattern detectoin

slide-31
SLIDE 31

Demo

slide-32
SLIDE 32

References

  • [1] Readings in Information Visualization: Using Vision to Think, Stuart K. Card, Jock Mackinlay,

Ben Shneiderma. 1999

  • [2] prefuse: A Toolkit for Interactive Information Visualization, Jeffery Heer, Stuart K. Card, James
  • A. Landay, ACM sigCHI, 2005
  • [3]A heuristic for drawing graph, P.Eades, 84.
  • [4]Graph Drawing by Force-Directed Graph, Fruchterman, 91.
  • [5]Drawing Graph Nicely Using Simulated Annealing, Davidson, 96.
  • [6]A Fast Adaptive Layout Algorithm for Undirected Graphs, Frick, 94.
  • [7]Spring Algorithms and Symmetry, Eades and X Lin, 99
  • [8] Graph Drawing by Stress Majorization, 2002, Graph Drawing
  • [9] An r-Dimensional Quadratic Placement Algorithm, Kenneth M. Hall, 1970
  • [10] ACE: A fast multiscale eigenvector computation for drawing huge graphs, Y.Koren, L. Carmel

and D. Harel, InfoVis 2002

  • [11] NodeTrix: A Hybrid Visualization of Social Networks, Nathalie Henry et al. IEEE TVCG 2007
  • [12] FacetAtlas: Multifaceted Visualization for Rich Text Corpora, Nan Cao, et al. IEEE TVCG 2010
  • [13] The FlowVizMenu and Parallel Scatterplot Matrix: Hybrid Multidimensional Visualizations for

Network Exploration. IEEE TVCG 2010

  • [14] A. Inselberg and B. Dimsdale. Parallel coordinates: a tool for visualizing multi-dimensional

geometry, InfoVis 2000

slide-33
SLIDE 33

An Introduction to Visual Analysis

  • f Social Networks

Nan Cao @ HKUST nancao@cse.ust.hk April 2011