Text visualization Lucas Rizoli CPSC 533C, November 2006 Web-pages - - PowerPoint PPT Presentation

text visualization
SMART_READER_LITE
LIVE PREVIEW

Text visualization Lucas Rizoli CPSC 533C, November 2006 Web-pages - - PowerPoint PPT Presentation

Text visualization Lucas Rizoli CPSC 533C, November 2006 Web-pages Email Instant messages Digitized books, articles 2 Why use text in visualization? 3 Green Blue Blue Blue Red Red 4 Reading is fast 5 New York Justice Five [from


slide-1
SLIDE 1

Text visualization

Lucas Rizoli CPSC 533C, November 2006

slide-2
SLIDE 2

2

Web-pages Email Instant messages Digitized books, articles

slide-3
SLIDE 3

3

Why use text in visualization?

slide-4
SLIDE 4

4

Red Blue Blue Green Red Blue

slide-5
SLIDE 5

5

Reading is fast

slide-6
SLIDE 6

6

New York Five

[from http://en.wikipedia.org/wiki/Image:Statue-Of-Liberty.jpg]

Justice

slide-7
SLIDE 7

7

Text can be a dense representation Text can be inexact

slide-8
SLIDE 8

8

Fast Dense Inexact

slide-9
SLIDE 9

9

Difficulties of using text

slide-10
SLIDE 10

10

Space Arrangement Orientation Legibility Meaning

slide-11
SLIDE 11

11

[from http://www.futureofthebook.org/mitchellstephens/]

slide-12
SLIDE 12

12

[from http://www.textarc.org/]

slide-13
SLIDE 13

13

[from http://www.textarc.org/]

slide-14
SLIDE 14

14

Index Searching Explicit in data

slide-15
SLIDE 15

15

[from http://enron.trampolinesystems.com/]

slide-16
SLIDE 16

16

[from http://jheer.org/enron/]

slide-17
SLIDE 17

17

[from http://jheer.org/enron/]

slide-18
SLIDE 18

18

[from http://www.idlewords.com/2004/03/your_literary_masterpiece_was_delicious.htm]

slide-19
SLIDE 19

19

Graph Analyzing Derived from data Human supervision of automated processes

slide-20
SLIDE 20

20

Reliance on meta-data Says little about content

slide-21
SLIDE 21

21

[from Tat, A., & Carpendale, M. S. T. (2002)]

slide-22
SLIDE 22

22

[from Tat, A., & Carpendale, M. S. T. (2002)]

Wordiness Direction of conversation CAPS Exclamations

slide-23
SLIDE 23

23

[from Tat, A., & Carpendale, M. S. T. (2002)]

slide-24
SLIDE 24

24

[from Havre, S., Hetzler, E., Whitney, P., & Nowell, L. (2002)]

slide-25
SLIDE 25

25

[from Viégas, F. B., Golder, S., & Donath, J. (2006)]

slide-26
SLIDE 26

26

[from http://alumni.media.mit.edu/~fviegas/projects/themail/study/index.htm]

slide-27
SLIDE 27

27

[from Viégas, F. B., Golder, S., & Donath, J. (2006)]

slide-28
SLIDE 28

28

Unique visual representation Exploration Derived from data Increasingly semantic Greater reliance on human users

slide-29
SLIDE 29

29

Trouble pre-processing data Many assumptions made

slide-30
SLIDE 30

30

Finding meaning in text is difficult

slide-31
SLIDE 31

31

Adjusting for word frequency Full semantic processing

slide-32
SLIDE 32

32

Take-home lessons

slide-33
SLIDE 33

33

Text in visualization Fast, dense, inexact Complicated to apply

slide-34
SLIDE 34

34

Visualizing text Range of levels and methods Meta-data adds structure Pre-processing is hard, important

slide-35
SLIDE 35

35

  • Ceglowski, M. (2004). Your Literary Masterpiece Was Delicious. Retrieved November 6, 2006

from http://www.idlewords.com/2004/03/your_literary_masterpiece_was_delicious.htm

  • Havre, S., Hetzler, E., Whitney, P., & Nowell, L. (2002). ThemeRiver: Visualizing Thematic

Changes in Large Document Collections. Visualization and Computer Graphics, IEEE Transactions, 8, 9-20.

  • Heer, J. (2004). Exploring Enron: Visualizing ANLP Results. Retrieved November 6, 2006 from

http://jheer.org/enron/v1/

  • Paley, W. B. (2002). TextArc: Showing Word Frequency and Distribution in Text. In Wong, P. C.,

& Keith Andrews (Eds.), Proceedings of the IEEE Symposium on Information Visualization (Infovis '02) Poster Compendium. Los Alamitos, CA, USA: IEEE Press.

  • Paley, W. B. (n.d.). TextArc.org Home. Retrieved November 6, 2006 from http://textarc.org/
  • Tat, A., & Carpendale, M. S. T. (2002). Visualizing Human Dialog. Proceedings of the IEEE

Conference on Information Visualization (Infovis '02). London, UK: IEEE Press.

  • Trampoline Systems (n.d.). Trampoline Enron Explorer. Retrieved November 6, 2006 from

http://enron.trampolinesystems.com/

  • Viégas, F. B., Golder, S., & Donath, J. (2006). Visualizing Email Content: Portraying

Relationships from Conversational Histories. Proceedings of the SIGCHI conference on Human factors in computing systems (CHI '06). Montréal, Québec, Canada: ACM Press.