Академический Документы
Профессиональный Документы
Культура Документы
Tables
?
Frequency
distributions
?
Graphical
representation?
Learning Objectives
Introduction
In statistics we have two distinct types:
1. Descriptive Statistics comprises collecting, presenting
data (tables and graphs) and describing data (central
tendency, dispersion, skewness, kurtosis).
2.
Ordinal data
Interval data
Ratio data
Tables
Tables come in a variety of formats, from simple tables to
frequency distribution, that allow data sets to be summarised in a
form that allows users to be able to access important information.
Proposed voting behaviour by 1110 university
students
(Source: University Student Survey October 2008)
Party
Frequenc or
Party
Frequenc
y
y%
Conservativ
400
Conservativ
36
e
e
Labour
510
Labour
46
Democrat
78
Democrat
7
Green
55
Green
5
Other
67
Other
6
Total
1110
Total
100
Example
1.1
Simple
table
illustrating
the voting
intentions
of 1110
students
10
Simple Tables
Month
Pink
Blue
Total
Januar
y
5200
2100
7300
Single
Less than 15
hrs per week
15 hrs or
more per
week
Total
Under
30
330
30+
June
Total
7000
5200
12200
35250
22600
57850
Married
30+
358
Under
30
1162
1719
241
643
1521
2049
599
1805
2005
484
Example 1.3
Viewing habits
of adult males
11
Frequency Distributions
Consider the set of data that represents the
number of insurance claims processed each
day by an insurance firm over a period of 40
days:
3, 5, 9, 6, 4, 7, 8, 6, 2, 5, 10, 1, 6, 3, 6, 5, 4,
7, 8, 4, 5, 9, 4, 2, 7, 6, 1, 3, 5, 6, 2, 6, 4, 8, 3,
1, 7, 9, 7, 2.
12
Frequency Distributions
Consider the set of data that represents the number of insurance
claims processed each day by an insurance firm over a period of 40
days: 3, 5, 9, 6, 4, 7, 8, 6, 2, 5, 10, 1, 6, 3, 6, 5, 4, 7, 8, 4, 5, 9, 4, 2,
7, 6, 1, 3, 5, 6, 2, 6, 4, 8, 3, 1, 7, 9, 7, 2.
SCORE
1
2
3
4
5
6
7
8
9
10
TALLY
111
1111
1111
1111
1111
1111 11
1111
111
111
1
FREQUENCY, f
3
4
4
5
5
7
5
3
3
1
f = 40
Example
1.4
Frequency
distribution
13
435
444
453
462
474
493
430
431
432
432
433
433
434
436
445
453
462
475
494
439
440
440
441
442
442
443
436
446
453
463
476
495
449
450
450
451
451
451
452
436
447
454
464
477
497
457
457
458
459
459
460
460
438
447
455
465
478
498
468
469
470
471
471
472
473
438
447
455
466
479
498
482
482
483
485
486
488
489
438
448
456
468
481
500
502
502
505
508
509
511
515
14
Counting frequencies
MILEAGE
TALLY
400 - 419
420 - 439
1111 1111 11
1111 1111 1111 1111
1111 11
1111 1111 1111 1111
1111 1111 1111
1111 1111 1111 1111
1111
1111 1111 1111
1111 111
440 - 459
460 - 479
480 - 499
500 - 519
FREQUENCY
f
12
27
Example 1.5
Grouped
frequency
distribution
34
24
15
8
f = 120
15
16
Bar charts
Pie charts
Histograms
Frequency polygons
Scatter plots
Time series plots
Data type
Which graph to use?
Category Bar chart, pie chart, cross tab
or
tables (or contingency tables)
nominal
Ordinal
Bar chart, pie chart, scatter
plots.
Interval or Histogram, frequency polygon,
ratio
histogram.
Cumulative frequency curve (or
ogive), scatter plots, time series
plots.
17
Bar charts
Categorical data is represented
largely by bar and pie charts.
Bar charts are very useful in
providing a simple pictorial
representation of several sets of
data on one graph.
Example 1.7
Bar chart for
proposed
voting
behaviour
See text for
the Excel
solution
18
19
Pie charts
In a pie chart the relative
frequencies are
represented by a slice of a
circle. Each section
represents a category, and
the area of a section
represents the frequency
or number of objects
within a category.
They are particularly
useful in showing relative
proportions, but their
effectiveness tends to
diminish for more than
eight categories.
Example 1.11
Pie chart for
proposed
voting
behaviour
20
Voting
Behaviour
Angle
Calculation
Conservativ
e
Labour
400
(360/1110)*4
00
(360/1110)*5
10
(360/1110)*7
8
(360/1110)*5
5
(360/1110)*6
7
510
Democrat
78
Green
55
Other
67
Total =
1110
Angle
(1 decimal
place)
129.70
165.40
25.30
17.80
21.70
359.9
21
Histograms
A graph of the data in a frequency distribution is called a
histogram. The area of each bar is a measure of the frequency of
occurrence (number of values) within each category. If the bar
widths are the same (constant) then the height of the bar is
directly related to the frequency and this information can then be
used to construct the histogram.
Example 1.12
Histogram for
the number of
insurance claims
processed
22
Histogram Example
Example 1.13
Histogram for
the miles
recorded by 120
salesman
23
Frequency Polygon
A frequency polygon is formed from a histogram by joining the midpoints of the tops of the rectangles by straight lines. The mid-points of
the first and last class are joined to the x-axis to either side at a
distance equal to (1/2)th the class interval of the first and last class.
Example 1.15
Frequency
Polygon for the
miles recorded
by 120 salesman
Glyn Davis & Branko Pecar
24
% Raise in Productivity, Y
4.2
8.1
6.8
4.3
5.0
7.5
4.7
5.9
6.9
5.7
5.7
5.4
7.5
3.8
5.9
6.3
5.7
4.0
5.2
2.2
25
Scatter plots
Example 1.16
Scatter plot for
the % raise in
productivity
against
productivity
26
Time series
Time series analysis is concerned with data collected over a period
of time. It attempts to isolate and evaluate various factors which
contribute to changes over time in such variable series as imports
and exports, sales, unemployment and prices. If we can evaluate the
main components which determine the value of say sales for a
particular month then we can project the series into the future to
obtain a forecast.
Example 1.17
Year
2001
2002
2003
2004
27
28
Conclusion
In this presentation we explored summarising data sets using the
following three concepts:
Tables
Frequency
distributions
Graphs
29