Вы находитесь на странице: 1из 24

Multivariate Data &

Representations

Information Visualization
April. 30, 2008
Carsten Görg

Slides adapted
from John Stasko

Housekeeping

• Second assignment due today

• Important Dates on Webpage

Summer term 2008 2

1
Agenda

• Data forms and representations


• Basic representation techniques
• Multivariate (>3) techniques

Summer term 2008 3

Data Sets

• Data comes in many different forms


• Typically, not in the way you want it

• How is stored (in the raw)?

Summer term 2008 4

2
Example

• Cars
− make
− model
− year
− miles per gallon
− cost
− number of cylinders
− weights
− ...

Summer term 2008 5

Data Tables

• Often, we take raw data and transform it


into a form that is more workable
• Main idea:
− Individual items are called cases
− Cases have variables (attributes)

Summer term 2008 6

3
Data Table Format

Case1 Case2 Case3 ...

Variable1 Value11 Value21 Value31

Variable2 Value12 Value22 Value32

Variable3 Value13 Value23 Value33

...

Think of as a function
f(case1) = <Val11, Val12,…>

Summer term 2008 7

Example

Mary Jim Sally Mitch ...

M-Nr. 145 294 563 823

Age 23 17 47 29

Hair brown black blonde red

Grade 2.9 3.7 3.4 2.1

...

People in class

Summer term 2008 8

4
Example

Baseball
statistics

Summer term 2008 9

Variable Types

• Three main types of variables


− N-Nominal (equal or not equal to other
values)
Example: gender
− O-Ordinal (obeys < relation, ordered set)
Example: degrees – Bachelor, Master, PhD
− Q-Quantitative (can do math on them)
Example: age

Summer term 2008 10

5
Metadata

• Descriptive information about the data


− Might be something as simple as the type of
a variable, or could be more complex
− For times when the table itself just isn’t
enough
− Example: if variable1 is “l”, then variable3
can only be 3, 7 or 16

Summer term 2008 11

How Many Variables?

• Data sets of dimensions 1, 2, 3 are


common
• Number of variables per class
− 1 - Univariate data
− 2 - Bivariate data
− 3 - Trivariate data
− >3 - Hypervariate data

Summer term 2008 12

6
Representation

• What’s a common way of visually


representing multivariate data sets?
• Graphs! (not the vertex-edge ones)

Summer term 2008 13

Good Example

www.nationmaster.com

Summer term 2008 14

7
Basic Symbolic Displays

• Graphs Å
• Charts
• Maps
• Diagrams

From:
S. Kosslyn, “Understanding charts
and graphs”, Applied Cognitive
Psychology, 1989.

Summer term 2008 15

1. Graph

Showing the relationships between variables’


values in a data table

100

80

60
East
40 W est
20 North

0
1st 2nd 3rd 4th
Q tr Q tr Qtr Qtr

Summer term 2008 16

8
Properties

• Graph
− Visual display that illustrates one or more
relationships among entities
− Shorthand way to present information
− Allows a trend, pattern or comparison to be
easily comprehended

Summer term 2008 17

Issues

• Critical to remain task-centric


− Why do you need a graph?
− What questions are being answered?
− What data is needed to answer those
questions?
− Who is the audience?

money

time
Summer term 2008 18

9
Graph Components

• Framework
− Measurement types, scale
• Content
− Marks, lines, points
• Labels
− Title, axes, ticks

Summer term 2008 19

Other Symbolic Displays

• Chart
• Map
• Diagram

Summer term 2008 20

10
2. Chart

• Structure is important, relates entities to each other


• Primarily uses lines, enclosure, position to link entities

Examples: flowchart, family tree, org chart, ...

Summer term 2008 21

3. Map

• Representation of spatial relations


• Locations identified by labels

Summer term 2008 22

11
Choropleth Map

Areas are filled


and colored
differently to
indicate some
attribute of that
region

Summer term 2008 23

Cartography

• Cartographers and map-makers have a


wealth of knowledge about the design
and creation of visual information artifacts
− Labeling, color, layout, …
• Information visualization researchers
should learn from this older, existing area

Summer term 2008 24

12
4. Diagram

• Schematic picture of object or entity


• Parts are symbolic

Examples: figures, steps in a manual, illustrations,...


Summer term 2008 25

Details

• What are the constituent pieces of these


four symbolic displays?

• What are the building blocks?

Summer term 2008 26

13
Visual Structures

• Composed of
− Spatial substrate
− Marks
− Graphical properties of marks

Summer term 2008 27

Space

• Visually dominant
• Often put axes on space to assist
• Use techniques of
composition, alignment, folding,
recursion, overloading to
1) increase use of space
2) do data encodings

Summer term 2008 28

14
Marks

• Things that occur in space


− Points
− Lines
− Areas
− Volumes

Summer term 2008 29

Graphical Properties

• Size, shape, color, orientation...

Spatial properties Object properties

Expressing Position
Grayscale
extent Size

Differentiating Orientation Color


marks Shape
Texture

Summer term 2008 30

15
Back to Data

• What were the different types of data


sets?
• Number of variables per class
− 1 - Univariate data
− 2 - Bivariate data
− 3 - Trivariate data
− >3 - Hypervariate data

Summer term 2008 31

Univariate Data

• Representations

7 Bill
Tukey box plot
5
low Middle 50% high
3

1
Mean

0 20

Summer term 2008 32

16
What goes where
• In univariate representations, we often think of the data
case as being shown along one dimension, and the
value in another
Line Bar
graph graph

Y-axis is quantitative Y-axis is quantitative


variable variable

See changes over Compare relative point


consecutive values values
Summer term 2008 33

Alternative View

• We may think of graph as representing


independent (data case) and dependent
(value) variables
• Guideline:
− Independent vs. dependent variables
Put independent on x-axis
See resultant dependent variables along y-axis

Summer term 2008 34

17
Bivariate Data

• Representations

Scatter plot is common

price

Two variables, want to


see relationship

Each mark is now mileage Is there a linear, curved or


a data case random pattern?

Summer term 2008 35

Trivariate Data

• Representations

3D scatter plot is possible


price

horsepower

mileage

Summer term 2008 36

18
Alternative Representation

Still use 2D but have


mark property
represent third
variable

Summer term 2008 37

Alternative Representation

Represent each variable


in its own explicit way

Summer term 2008 38

19
Hypervariate Data

• Ahhh, the tough one


• Number of well-known visualization
techniques exist for data sets of 1-3
dimensions
− line graphs, bar graphs, scatter plots OK
− We see a 3-D world (4-D with time)
• What about data sets with more than 3
variables?
− Often the interesting, challenging ones

Summer term 2008 39

Multiple Views

Give each variable its own display

1
A B C D E
1 4 1 8 3 5 2
2 6 3 4 2 1
3 5 7 2 4 3 3
4 2 6 3 1 5

A B C D E
Summer term 2008 40

20
Scatterplot Matrix

Represent each possible


pair of variables in their
own 2-D scatterplot

Useful for what?


Misses what?

Summer term 2008 41

Chernoff Faces

Encode different variables’ values in characteristics


of human face

Cute applets: http://www.cs.uchicago.edu/~wiseman/chernoff/


http://hesketh.com/schampeo/projects/Faces/chernoff.html

Summer term 2008 42

21
Paper Recap

“Multidimensional Information Visualization


Through Sliding Rods”
Tom Lanning, Kent Wittenburg,
Micheal Heinrichs, Christina Fyock, Glenn Li
AVI 2000

Summer term 2008 43

Introduction

• Two types of interaction paradigms for


Web Information Finding
− Browsing
− Query/Response
• Motivation for MultiNav
− Easy to use techniques for multidimensional
visualization
− Integrate attribute info. with individual item
browsing

Summer term 2008 44

22
MultiNav

Summer term 2008 45

Attributes as sliding rods

Summer term 2008 46

23
Sources Used

CMS book
Referenced articles
Marti Hearst SIMS 247 lectures
Kosslyn ‘89 article
A. Marcus, Graphic Design for Electronic Documents
and User Interfaces
M. Monmonier, How to Lie with Maps
W. Cleveland, The Elements of Graphing Data
C. H. Yu, Visualization Techniques of Different Dimensions
http://seamonkey.ed.asu.edu/~behrens/asu/reports/compre/comp1.html
http://www.csc.ncsu.edu/faculty/healey/PP/PP.html

Summer term 2008 47

24

Вам также может понравиться