Академический Документы
Профессиональный Документы
Культура Документы
BUSS1020
Quantitative
Business
Analysis
1
BUSS1020
2
http://visual.ly/nachos-101
http://visual.ly/power-infographics
BUSS1020
LEARNING OBJECTIVES
In this section you will learn:
To construct tables and charts to visualize and
organise numerical data
To construct tables and charts to visualize and
organise categorical data
The principles of properly presenting summarized
data and graphs
BUSS1020
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
BUSS1020
ORGANISING AND
VISUALISING DATA
Data is organised and visualized so as to
reveal, glean insight from and communicate
the information,
especially the main features and patterns,
that are hidden within it.
First step: find the story in the data
Last step: communicate the story, well
BUSS1020
DCOVA
ORGANIZING CATEGORICAL
DATA: TABLES
Categorical
Data
Tallying Data
One
Categorical
Variable
Summary
Table
Two
Categorical
Variables
Contingency
Table
BUSS1020
SUMMARY TABLE
A summary table indicates the frequency, amount,
percentage or proportion of items in each of a set of categories
This allows you to see:
the relative frequency of each category
the differences between the categories
Banking Preference?
ATM
Telephone
Percent
16%
2%
17%
In person at branch
41%
Internet
24%
FIDELITY INVESTMENTS
Once considered stopping its bill-paying
service for customers
The service was losing money and used by
very few customers
However, an analysis of their customer
database showed that those using this service
were among the most loyal and the most
profitable customers.
Fidelity retained the service, so as not to lose
those customers who contributed enormously
to their profit margin.
BUSS1020
10
BUSS1020
11
SUMMARY TABLE
Summary table for
Retirement Fund data: Type
Frequency table results for Type:
Count = 316
Type
Growth
Value
Frequency
227
89
Percent of Total
71.8
28.2
Retirement Funds.xlsx
BUSS1020
12
SUMMARY TABLE
Frequency table results for Risk:
Count = 316
Risk
Frequency
Low
Average
High
What type and
Percent of
Total
212
67.1
91
28.8
13
4.1
scale is Risk?
Cumulative
Percent of
Total
67.1
95.9
100 Funds.xlsx
Retirement
13
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
14
VISUALIZING
CATEGORICAL DATA (1
VAR)
DCOVA
Chap 2, section 3, pg 53
Categorical
Data
Visualizing Data
Contingency
Table For Two
Variables
Summary
Table For
One
Variable
Bar
Chart
Pareto
Chart
Side-By-Side
Bar Chart
Pie Chart
BUSS1020
15
%
16%
Automated or
live
telephone
2%
Drive-through
service at
branch
17%
In person at
branch
41%
Internet
24%
BUSS1020
16
Retirement Funds.xlsx
BUSS1020
17
Banking Preference?
ATM
Automated or live
telephone
%
16%
2%
Drive-through service at
branch
17%
In person at branch
41%
Internet
24%
BUSS1020
18
Retirement Funds.xlsx
BUSS1020
19
VISUALIZING
CATEGORICAL DATA:
THE PARETO CHART
Chap 2, section 3, pg 55
BUSS1020
20
Chap 2 pg 57
100.00%
90.00%
90.00%
80.00%
80.00%
70.00%
70.00%
60.00%
60.00%
50.00%
50.00%
40.00%
40.00%
30.00%
30.00%
20.00%
20.00%
10.00%
10.00%
0.00%
0.00%
BUSS1020
21
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
22
DCOVA
Summary
Table
Two
Categorical
Variables
Contingency
Table
BUSS1020
23
DCOVA
CONTINGENCY TABLE
Can show pattern or relationship between
two or more categorical variables
BUSS1020
24
CONTINGENCY TABLE
Contingency table for
Retirement Fund data: Type vs Risk
Contingency table results:
Rows: Type
Columns: Risk
Growth
Value
Total
Average
74
17
91
High
10
3
13
Low
143
69
212
Total
227
89
316
25
Growth
Row %
Average
High
Low
Total
74
10
143
227
(32.6%) Cell
(4.41%)
(63%)
(100%)
format
(81.32%) (76.92%) (67.45%) (71.84%)
Count
Column %
(Row percent)
Value
(Column percent)
17
3
69
89
(19.1%) (3.37%) (77.53%)
(100%)
(18.68%) (23.08%) (32.55%) (28.16%)
Total
91
(28.8%)
(100%)
13
212
(4.11%) (67.09%)
(100%)
(100%)
316
(100%)
(100%)
BUSS1020
26
Type
Intermediate
Government
Intermediate
Government
Intermediate
Government
Intermediate
Government
Intermediate
Government
Intermediate
Government
Assets Fees
Expense
Ratio
Return
2009
3-Year
Return
5-Year
Return
7268.1No
0.45
6.9
6.9
475.1No
0.50
9.8
7.5
Risk
Below
5.5 average
Below
6.1 average
193.0No
0.71
6.3
7.0
5.6Average
18603.5No
.13
5.4
6.6
5.5Average
142.6No
0.60
5.9
6.7
5.4Average
1401.6No
0.54
5.7
6.4
6.2Average
BUSS1020
27
BUSS1020
28
29
Chap 2-29
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
30
VISUALIZING
CATEGORICAL DATA (2
VAR)
DCOVA
Chap 2 Sect 3 pg 57
Categorical
Data
Visualizing Data
Contingency
Table For
Two
Variables
Summary
Table For One
Variable
Bar
Chart
Pareto
Chart
Side-By-Side
Bar Chart
Pie Chart
BUSS1020
31
VISUALIZING
CATEGORICAL DATA:
SIDE-BY-SIDE BAR
The side by side bar chart represents the data
CHARTS
from a contingency table.
No
Errors
Errors
Total
Small
Amount
50.75%
30.77%
47.50%
Medium
Amount
29.85%
61.54%
35.00%
Large
Amount
19.40%
7.69%
17.50%
100.0%
100.0%
100.0%
Total
No Errors
0.0% 10.0%20.0%30.0%40.0%50.0%60.0%70.0%
Small Medium Large
32
BUSS1020
33
BUSS1020
34
BUSS1020
35
BUSS1020
36
http://www.perceptualedge.com/examples.ph
p
BUSS1020
37
www.sharebuilder.com
http://www.perceptualedge.com/example12.php
BUSS1020
38
Is this better?
Why?
Good points?
Negatives?
http://www.perceptualedge.com/example12.php
BUSS1020
39
http://flowingdata.com/2012/06/15/what-3-d-pie-charts-are-good-for/
BUSS1020
40
SOME PRINCIPLES OF
GRAPHING
Maximise message; minimise noise
Include a title and label axes
Include a reference to the source
Keep things in correct proportions
BUSS1020
41
http://www.perceptualedge.com/example9.php
BUSS1020
42
HOW DO CATEGORY
PROPORTIONS CHANGE?
Over time
Spatially
Interactive graphics
BUSS1020
43
BUSS1020
http://www.nytimes.com/interactive/2013/10/02/us/uninsured-americans-map.html?_r=0
44
http://www.nytimes.com/interactive/2014/08/13/upshot/where-people-in-each-state-were-born.html?abt=0002&abg
=0
BUSS1020
45
SOCRATIVE QUESTION
ON THIS PLOT
https://b.socrative.com
Room:9633CA5F
BUSS1020
46
SOCRATIVE QUESTION
ON THIS PLOT
https://b.socrative.com
BUSS1020
47
Room:9633CA5F
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
48
TABLES FOR
ORGANIZING
NUMERICAL DATA
DCOVA
Chap 2 pg 44 - 51
Numerical
Data
Ordered Array
Frequency
Distributions
Cumulative
Distributions
BUSS1020
49
Chap 2 pg 44
19
22
19
25
17
20
27
18
20
32
18
21
38
18
22
42
19
32
19
33
20
41
21
45
Night Students
18
23
18
28
BUSS1020
50
ORGANIZING
NUMERICAL DATA:
FREQUENCY
DISTRIBUTION
A frequency distribution is a summary table:
Chap 2 pg 45
51
ORGANIZING
NUMERICAL DATA:
FREQUENCY
DISTRIBUTION
EXAMPLE
Example:
The Bureau of Meteorology
(BOM)
measures the rainfall (in mm.) in July 2013 for
20 Sydney suburbs:
24, 35, 17, 21, 24, 37, 26, 46, 58, 30, 32, 13,
12, 38, 41, 43, 44, 27, 53, 27
52
FREQUENCY
DISTRIBUTION EXAMPLE
Sort raw data in ascending order:
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53,
58
Find range: 58 - 12 = 46
Select number of classes: 5
Compute class interval (width): 10 (46/5 then round up)
Determine class boundaries (limits):
Class
Class
Class
Class
Class
1:
2:
3:
4:
5:
10
20
30
40
50
20
30
40
50
60
53
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44,
46, 53, 58
Class
Frequency
10 - 20
20 - 30
30 - 40
40 - 50
50 - 60
Midpoints
15
25
35
45
55
Total
3
6
5
4
2
20
BUSS1020
54
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44,
46, 53, 58
Relative
Frequency
Frequenc
y
Class
10 - 20
20 - 30
30 - 40
40 - 50
50 - 60
Total
3
6
5
4
2
20
.15
.30
.25
.20
.10
1.00
Percentag
e
15
30
25
20
10
100
BUSS1020
55
12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44,
46, 53, 58
10 X< 20
15%
20 X< 30
45%
30 X< 40
70%
40 X< 50
90%
Cumulativ Cumulati
e
ve
Frequency Percenta
15%
3 ge
30%
25%
14
20%
18
50 X< 60
10%
Class
Frequenc Percenta
y
ge
20
BUSS1020
56
BUSS1020
57
FREQUENCY
DISTRIBUTIONS: SOME
TIPS
58
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
59
DCOVA
Chap 2-60
Numerical
Data
Ordered Array
Also
Stem-and-Leaf
Display
Frequency Distributions
and
Cumulative Distributions
Histogram
Polygon
Ogive
Boxplot
BUSS1020
60
HANS ROSLING
Data Visualisation
http://www.youtube.com/watch?v=jbkSRLYSojo
BUSS1020
61
ORGANIZING
NUMERICAL DATA:
HISTOGRAM
Chap 2-62
BUSS1020
62
Class
10 - 20
20 - 30
30 - 40
3
6
5
.15
.30
.25
Percentag
e
15
30
25
40 - 50
50 - 60
4
2
.20
.10
20
10
Total
20
1.00
100
Chap 2-63
(In a percentage
histogram the
vertical axis would
be defined to show
the percentage of
observations per
class)
BUSS1020
63
VISUALIZING
NUMERICAL DATA:
THE HISTOGRAM
Chap 2-64
64
VISUALIZING
NUMERICAL DATA:
THE POLYGON
Chap 2-65
65
VISUALIZING
NUMERICAL DATA:
THE FREQUENCY
POLYGON
Class
3
6
5
4
2
10 - 20
Chap 2 pg 63
Class
Midpoin Frequency
t
15
20 - 30
25
30 - 40
35
40 - 50
45
50 - 60
55
66
NUMERICAL DATA:
THE OGIVE
(CUMULATIVE %
POLYGON)
Class
Chap 2 pg 64
Lower
% less
class than lower
boundar boundary
y
10 - 20
0
20 - 30
15
30 - 40
45
40 - 50
70
50 - 60
90
60 - 70
100
The percentage
of observations
less than each
lower class
boundary are
plot, versus the
lower class
10
20
30
40
50
60
BUSS1020
67
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
68
VISUALIZING TWO
NUMERICAL VARIABLES:
THE SCATTER PLOT
Chap 2 pg 67
BUSS1020
69
Cost
per
day
23
125
26
140
29
146
33
160
38
167
42
170
50
188
55
195
60
200
BUSS1020
70
BUSS1020
71
BUSS1020
72
BUSS1020
73
BUSS1020
74
VISUALIZING TWO
NUMERICAL VARIABLES:
THE TIME-SERIES PLOT
Chap 2 pg 67
BUSS1020
75
Year
Number
of
Franchise
s
1996
43
1997
54
1998
60
1999
73
2000
82
2001
95
2002
107
2003
99
2004
95
76
6000
5000
4000
3000
2000
1000
500
1000
1500
2000
2500
3000
3500
Number of days
4000
4500
5000
BUSS1020
77
http://flowingdata.com/2015/07/02/changing-pr
ice-of-food-items-and-horizon-graphs/
BUSS1020
78
http://www.nytimes.com/interactive/2010/05/27/technology/20100527-ap
ple.html?_r=0
BUSS1020
79
AGENDA
Introduction
Categorical Data
Organising One Variable Categorical Data
Visualising One Variable Categorical Data
Organising Two Variable Categorical Data
Visualising Two Variable Categorical Data
Numerical Data
Organising Numerical Data
Visualising One Variable Numerical Data
Visualising Two Variable Numerical Data
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
DCOVA
BUSS1020
80
PRINCIPLES OF
GRAPHICAL
EXCELLENCE
Chap 2 pg 76
Further:
3D graphs should have a meaningful 3rd dimension.
Usually 2D is sufficient.
The scale on the vertical axis should (usually) begin at zero.
BUSS1020
81
http://www.perceptualedge.com/example14.php
Which plot is
better?
83
93
BUSS1020
03 04 05
82
GRAPHICAL ERRORS:
CHART JUNK, DISTORTION
Bad Presentation
Minimum Wage
1960: $1.00
1970:
$1.60
1980:
$3.10
1990: $3.80
Good Presentation
Minimum Wage
in the US
4
2
0
1960 1970 1980 1990
BUSS1020
83
http://www.perceptualedge.com/example18.php
BUSS1020
84
GRAPHICAL ERRORS:
NO RELATIVE BASIS
Good Presentation
Bad Presentation
HDs received
by students.
Freq.
HDs received
by students.
30%
300
200
20%
100
10%
0%
Y1
Y2
Y3
HO
Y1
Y2
Y3
HO
BUSS1020
85
GRAPHICAL ERRORS:
COMPRESSING THE VERTICAL AXIS
Bad Presentation
Good Presentation
Quarterly Sales
200
Quarterly Sales
50
100
25
0
Q1
Q2
Q3
Q4
Q1
Q2
Q3
Q4
BUSS1020
86
SUMMARY
Introduction
Categorical Data
Organising One Variable
Categorical Data:
Summary Tables
Numerical Data
87