Visualizing public health data for communicable disease management - - PowerPoint PPT Presentation

visualizing public health data for communicable disease
SMART_READER_LITE
LIVE PREVIEW

Visualizing public health data for communicable disease management - - PowerPoint PPT Presentation

Visualizing public health data for communicable disease management and control Anamaria Crisan PhD Candidate, Computer Science The University of British Columbia Data visualization in the GenEpi current paradigm = Communication of scientific


slide-1
SLIDE 1

Visualizing public health data for communicable disease management and control

Anamaria Crisan

PhD Candidate, Computer Science The University of British Columbia

slide-2
SLIDE 2

Data visualization in the GenEpi current paradigm

  • f scientific research

= Communication

2

slide-3
SLIDE 3

Do you have an

Outbreak? Yes. No.

Do all the

Science!

But you want to

Monitor

right?

Duh. Inform

the masses!

3

https://www.ratbotcomics.com/comics/pgrc_2014/1/1.html

slide-4
SLIDE 4

Do you have an

Outbreak? Yes. No.

Do all the

Science!

But you want to

Monitor

right?

Duh. Inform

Maybe data

Visualization? Infographics are pretty

the masses!

4

slide-5
SLIDE 5

Do you have an

Outbreak? Yes. No.

Do all the

Science!

But you want to

Monitor

right?

Duh. Inform

Did it work? Maybe data

Visualization?

the masses!

5

Infographics are pretty

slide-6
SLIDE 6

Do you have an

Outbreak? Yes. No.

Do all the

Science!

But you want to

Monitor

right?

Duh. Inform

Did it work? Maybe data

Visualization? No : (

the masses!

Different Infographics?

6

slide-7
SLIDE 7

Do you have an

Outbreak? Yes. No.

Do all the

Science!

But you want to

Monitor

right?

Duh.

the masses!

Inform

Did it work? Maybe data

Visualization? No : ( Different Infographics? Declare Victory Yes! (maybe?)

7

slide-8
SLIDE 8

Data Visualization!

  • Many different visualization designs
  • Design impacts data interpretation
  • How to choose which is best?
  • Feelings (ad hoc)
  • Impressions (ad hoc)
  • Systematic assessment (lacking)

Challenge : Multiple Alternatives

8

slide-9
SLIDE 9

www.vishealth.org

OPTION A OPTION B OPTION C

  • Same objective (understand treatment efficacy), same data, different visualizations
  • Tested accuracy, timeliness, and preference with 2,038 participants
  • Option A was most accurate, easier (faster) to read, and preferred

9

Challenge : Multiple Alternatives

slide-10
SLIDE 10

10

Challenge : Multiple Alternatives

slide-11
SLIDE 11

Lack of Systematic Thinking about Data Visualization

§ Peer-reviewed, systematic approaches

§ Automated systems & packages

§ Benchmark comparisons / evaluation § Attempts to standardize § Formal instruction § Community dialogue

§ papers, reviews, blog posts

§ Ad hoc – visualization mainly by intuition, trial & error

§ Some automated systems & packages

§ No real comparison / evaluation § No attempts to standardize § No formal instruction § Some community dialogue

§ blog posts (kind of), twitter

Bioinformatics Methods GenEpi Data Visualization

11

slide-12
SLIDE 12

Introducing

GEviT

12

slide-13
SLIDE 13

Introducing GEviT

§ GEviT = Genomic Epidemiology Visualization Typology

§ A way to describe data visualization for analysis § Organizes qualitative descriptors into a typology § What does GEviT do and not do?

13

slide-14
SLIDE 14

Introducing GEviT

14

§ GEviT = Genomic Epidemiology Visualization Typology

§ A way to describe data visualization for analysis § Organizes qualitative descriptors into a typology § What does GEviT do and not do? Evaluation Systematic Preliminary GEviT provides a base

§ Deliverables : 1. Typology 2. Interactive Gallery

GEviT does not evaluate

§ Massive undertaking that would take many years § Needs GEviT to conduct evaluations

Completion: Fall 2017 Completion: Fall 2117?

slide-15
SLIDE 15

Introducing GEviT

15

§ GEviT = Genomic Epidemiology Visualization Typology

§ A way to describe data visualization for analysis § Organizes qualitative descriptors into a typology § What does GEviT do and not do? Evaluation Systematic Preliminary GEviT provides a base

§ Deliverables : 1. Typology 2. Interactive Gallery

GEviT does not evaluate

§ Massive undertaking that would take many years § Needs GEviT to conduct evaluations

Completion: Fall 2017 Completion: Fall 2117?

slide-16
SLIDE 16

Introducing GEviT

16

Systematic Preliminary GEviT provides a base

§ Deliverables : 1. Typology 2. Interactive Gallery

Completion: Fall 2017

§ GEviT = Genomic Epidemiology Visualization Typology

§ A way to describe data visualization for analysis § Organizes qualitative descriptors into a typology § What does GEviT do and not do? Evaluation GEviT does not evaluate

§ Massive undertaking that would take many years § Needs GEviT to conduct evaluations

Completion: Fall 2117?

slide-17
SLIDE 17

How does GEviT do this?

§ Create a why-what-how typology

§ Uses methods from qualitative methods & infovis research § Typology, not ontology : typologies are lighter weight than ontologies

§ May be used in conjunction with a ontology § Blame epistemologists for their systems of classification

17

slide-18
SLIDE 18

How does GEviT do this?

§ Create a why-what-how typology

§ Uses methods from qualitative methods & infovis research § Typology, not ontology : typologies are lighter weight than ontologies

§ May be used in conjunction with a ontology § Blame epistemologists for their systems of classification

§ Why are data being visualized?

§ i.e. show transmission in a hospital

18

slide-19
SLIDE 19

How does GEviT do this?

§ Create a why-what-how typology

§ Uses methods from qualitative methods & infovis research § Typology, not ontology : typologies are lighter weight than ontologies

§ May be used in conjunction with a ontology § Blame epistemologists for their systems of classification

§ Why are data being visualized?

§ i.e. show transmission in a hospital

§ What data are being visualized?

§ i.e. patient location, duration in hospital, test outcomes, SNPs, clusters

19

slide-20
SLIDE 20

How does GEviT do this?

§ Create a why-what-how typology

§ Uses methods from qualitative methods & infovis research § Typology, not ontology : typologies are lighter weight than ontologies

§ May be used in conjunction with a ontology § Blame epistemologists for their systems of classification

§ Why are data being visualized?

§ i.e. show transmission in a hospital

§ What data are being visualized?

§ i.e. patient location, duration in hospital, test outcomes, SNPs, clusters

§ How are data being visualized?

§ i.e. timeline, phylogenetic tree (high-level) § i.e. test outcome = shape ; patient location = colour; cluster = spatial arrangement (low-level)

20

slide-21
SLIDE 21

21

Development

GEviT

slide-22
SLIDE 22

GEviT Development (Example)

OPTION A OPTION B

22

WHY: Show within hospital transmission

slide-23
SLIDE 23

OPTION A OPTION B

23

HO HOW (hi high-le level) l): Ti Timeline HO HOW (hi high-le level): ): Phylogeny HO HOW (hi high-le level) l): Ti Timeline HO HOW (hi high-le level): ): Node-lin link graph

GEviT Development (Example)

Same why, different high-level how

slide-24
SLIDE 24

OPTION A OPTION B

24

WHAT: Lo Location [ [ HOW: Co Colour ] WHAT: Lo Location [ [ HOW : Co Colour ]

GEviT Development (Example)

Same why, same what , same how

slide-25
SLIDE 25

OPTION A OPTION B

25

WH WHAT: Test Performed [ [ HOW : Glyph ] WH WHAT: Test Performed [ HOW : Li Line ] WH WHAT: Test Result [ [ HOW : Co Colour]

GEviT Development (Example)

Same why, sameish what , different how

slide-26
SLIDE 26

OPTION A OPTION B

26

WH WHAT: Clusters [ [ HOW : Co Colour ]

GEviT Development (Example)

Same why, different what and how

slide-27
SLIDE 27

OPTION A OPTION B

27

GEviT Development (Example)

WH WHAT: Tran ansmission Confidence [ [ HOW: Co Colour ]

Same why, different what and how

slide-28
SLIDE 28

28

What How Options

High-level Low-level High-Level Low-level A B Admin Patient ID Timeline Annotation X X

  • P. Sample ID

Annotation, Colour X Genomic SNP Distance Phylogeny Annotation, Position X Clusters Position, Colour X Spatial Location Timeline Colour X X Laboratory Test Performed Timeline Glyph X Line, Colour X Test Result Colour X Temporal Admission Date Timeline Position X X Episode Duration Bar X X Test Date Position X X Transmission Transmission Confidence Node-link graph Colour X

How is what you see

GEviT Development (Example)

slide-29
SLIDE 29

29

What How Options

High-level Low-level High-Level Low-level A B Admin Patient ID Timeline Annotation X X

  • P. Sample ID

Annotation, Colour X Genomic SNP Distance Phylogeny Annotation, Position X Clusters Position, Colour X Spatial Location Timeline Colour X X Laboratory Test Performed Timeline Glyph X Line, Colour X Test Result Colour X Temporal Admission Date Timeline Position X X Episode Duration Bar X X Test Date Position X X Transmission Transmission Confidence Node-link graph Colour X

GEviT Development (Example)

slide-30
SLIDE 30

30

What How Options

High-level Low-level High-Level Low-level A B Admin Patient ID Timeline Annotation X X

  • P. Sample ID

Annotation, Colour X Genomic SNP Distance Phylogeny Annotation, Position X Clusters Position, Colour X Spatial Location Timeline Colour X X Laboratory Test Performed Timeline Glyph X Line, Colour X Test Result Colour X Temporal Admission Date Timeline Position X X Episode Duration Bar X X Test Date Position X X Transmission Transmission Confidence Node-link graph Colour X

GEviT Development (Example)

slide-31
SLIDE 31

31

What How Options

High-level Low-level High-Level Low-level A B Admin Patient ID Timeline Annotation X X

  • P. Sample ID

Annotation, Colour X Genomic SNP Distance Phylogeny Annotation, Position X Clusters Position, Colour X Spatial Location Timeline Colour X X Laboratory Test Performed Timeline Glyph X Line, Colour X Test Result Colour X Temporal Admission Date Timeline Position X X Episode Duration Bar X X Test Date Position X X Transmission Transmission Confidence Node-link graph Colour X

GEviT Development (Example)

slide-32
SLIDE 32

32

What How Options

High-level Low-level High-Level Low-level A B Admin Patient ID Timeline Annotation X X

  • P. Sample ID

Annotation, Colour X Genomic SNP Distance Phylogeny Annotation, Position X Clusters Position, Colour X Spatial Location Timeline Colour X X Laboratory Test Performed Timeline Glyph X Line, Colour X Test Result Colour X Temporal Admission Date Timeline Position X X Episode Duration Bar X X Test Date Position X X Transmission Transmission Confidence Node-link graph Colour X

GEviT Development (Example)

slide-33
SLIDE 33

33

What How Options

High-level Low-level High-Level Low-level A B Admin Patient ID Timeline Annotation X X

  • P. Sample ID

Annotation, Colour X Genomic SNP Distance Phylogeny Annotation, Position X Clusters Position, Colour X Spatial Location Timeline Colour X X Laboratory Test Performed Timeline Glyph X Line, Colour X Test Result Colour X Temporal Admission Date Timeline Position X X Episode Duration Bar X X Test Date Position X X Transmission Transmission Confidence Node-link graph Colour X

GEviT Development (Example)

slide-34
SLIDE 34

What is GEviT’s status?

34

Systematic Portion of Project Anticipation Completion: Fall 2017

Prototype Gallery: https://amcrisan.shinyapps.io/gevit_gallery_prototype/

slide-35
SLIDE 35

GEviT Takeaways

35

Data Visualization!

§ Data visualization rigor is possible § Rigor is also desirable

§ Analytic, rather than ad hoc, process to data visualization

§ Rigor will be necessary

§ Clinical application of GenEpi § Ensuring visualization is interpretable

§ GEviT provides a basis

§ Evolving framework § Enables evaluation

slide-36
SLIDE 36

The Bigger Picture

Autodesk Research (2017). Same Stats, Different Graphs: https://www.autodeskresearch.com/publications/samestats

Same stats, different graphs

36

slide-37
SLIDE 37

37

Thanks! Contact

@amcrisan <slide url> http://cs.ubc.ca/~acrisan

  • Dr. Gardy, Dr. Munzner, &

the UBC infovis group

slide-38
SLIDE 38

38

Extra Slides

slide-39
SLIDE 39

Do all the

Science! Data Visualization!

the masses!

Inform

Missed EDA Opportunity

  • EDA = Exploratory Data Analysis
  • Incorporate data visualization as a

part of scientific process

  • Science != just statistics + wetlab
  • Visualization != just pretty pictures
  • Visualization can …
  • Check methdological assumptions
  • Show unanticipated patterns
  • Generate new hypothesis

39

slide-40
SLIDE 40

How does GEviT add Rigor to Data Visualization in GenEpi?

§ Ad hoc – visualization mainly by intuition, trial & error § No real comparison / evaluation § No attempts to standardize § No formal instruction § Some community dialogue

GEviT introduces design alternatives, systematic reasoning GEviT enables comparison for design & evaluation GEviT standardizes descriptions of elements in visualization design GEviT bridges GenEpi with infovis pedagogy GEviT intended to start a bigger dialogue about data visualization practices

40

slide-41
SLIDE 41

Design Trajectory of GEviT Gallery:

EXAMPLE: Info about vis EXAMPLE: Info about vis

41

Treevis.net Setviz.net Vishealth.org