Академический Документы
Профессиональный Документы
Культура Документы
Descriptive Statistics
Chapter Outline
2.1 Frequency Distributions and Their Graphs
2.2 More Graphs and Displays
2.3 Measures of Central Tendency
2.4 Measures of Variation
2.5 Measures of Position
Overview
Descriptive Statistics
Describes the important characteristics of a set of
data.
Organize, present, and summarize data:
1. Graphically
2. Numerically
Important Characteristics of
Quantitative Data
Shape, Center, and Spread
Center: A representative or average value that
indicates where the middle of the data set is located.
Variation: A measure of the amount that the values
vary among themselves.
Distribution: The nature or shape of the distribution
of data (such as bell-shaped, uniform, or skewed).
Overview
2.1 Frequency Distributions and Their Graphs
2.2 More Graphs and Displays
2.3 Measures of Central Tendency
2.4 Measures of Variation
2.5 Measures of Position
Section 2.1
Frequency Distributions
and Their Graphs
Frequency Distributions
Frequency Distribution
Grouped
Age of Frequency, f
Voters
25
18-30
202
38
31-42
508
217
43-54
620
1462
55-66
413
932
67-78
158
15
78-90
32
Peas per
pod
Freq, f
Freq,
Peas per pod
f
1
18
12
frequency
Frequency Histogram
A bar graph that represents the frequency distribution.
The horizontal scale is quantitative and measures the
data values.
The vertical scale measures the frequencies of the
classes.
Consecutive bars must touch.
data values
Larson/Farber 4th ed.
10
Frequency Histogram
Ex. Peas per Pod
Peas per pod
Freq, f
18
12
relative frequency
Sample size
n
Relative Frequency Histogram
Has the same shape and the same horizontal scale as
the corresponding frequency histogram.
The vertical scale measures the relative frequencies,
not frequencies.
12
Freq, f
Exam Scores
Freq, f
30-39
30-39
40-49
40-49
50-59
50-59
60-69
60-69
70-79
70-79
13
80-89
80-89
10
90-99
90-99
Exam Scores
Freq, f
15
range
class width =
number of classes
17
18
Shape of Distributions
Symmetric
Skewed
Data is skewed if it is not symmetric and if it extends
more to one side than the other.
Uniform
Data is uniform if it is equally distributed (on a
histogram, all the bars are the same height or
approximately the same height).
Skewed left
Uniform
Skewed Right
Outliers
Outliers
Unusual data values as compared to the rest of the set.
They may be distinguished by gaps in a histogram.
Section 2.2
More Graphs and Displays
22
Other Graphs
Besides Histograms, there are other methods of
graphing quantitative data:
24
Split each data value at the same place value to form the stem and a leaf. (Want 5-20 stems).
Arrange all possible stems vertically so there are no missing stems.
Write each leaf to the right of its stem, in order.
Create a key to recreate the data.
Variations of stem plots:
1. Split stems
2. Back to back stem plots.
25
26
Dot Plots
Dot plot
Consists of a graph in which each data value is plotted as
a point along a scale of values
Figure 2-5
Time Series
(Paired data)
Quantitative
data
Time Series
Data set is composed of quantitative entries taken at
regular intervals over a period of time.
e.g., The amount of precipitation measured each
day for one month.
Use a time series chart to graph.
time
Larson/Farber 4th ed.
28
Time-Series Graph
Number of Screens at Drive-In Movies Theaters
Figure 2-8
Ex. www.eia.doe.gov/oil_gas/petroleum/
Pareto Chart
A vertical bar graph in which the
height of each bar represents
frequency or relative frequency.
Frequency
Pie Chart
A circle is divided into sectors
that represent categories.
Categories
Larson/Farber 4th ed.
30
Never Married
55.3
Married
127.7
Widowed
13.9
Divorced
22.8
Total: 219.7
55.3
219.7
127.7
219.7
13.9
219.7
22.8
219.7
0.25 or 25%
31
Figure 2-6
Section 2.3
Measures of Central Tendency
33
34
Sample mean:
x
x
n
Round-off rule for measures of center:
Carry
one more decimal place than is in the original values. Do
not round until the last step.
35
36
11
13
%
median is the exact middle value: x 6
11
13
67
%
6.5
median is the mean of the by two numbers: x
2
37
Mode is 1.10
b) 27 27 27 55 55 55 88 88 99
Bimodal -
c) 1 2 3 6 7 8 9 10
No Mode
27 & 55
Section 2.4
Measures of Variation
41
42
Range
Range
The difference between the maximum and minimum
data entries in the set.
The data must be quantitative.
Range = (Max. data entry) (Min. data entry)
43
45
(
x
x
)
s2
n 1
( x x )
s s
n 1
2
46
2.
3.
4.
5.
6.
x
n
xx
( x x )2
( x x ) 2
2
(
x
x
)
s2
n 1
( x x ) 2
s
n 1
47
( x x )
s
n 1
2
Squares: (x x)2
5.2
(2.1)2 = 4.41
6.2
6.2 7.3 =
)2 =
7.5
7.5 7.3 =
)2 =
8.4
8.4 7.3 =
)2 =
9.2
9.2 7.3 =
)2 =
(x x) =
x x
x 36.5
s s2
Round to one more decimal than the data.
Dont round until the end.
Include the appropriate units.
Squares: (x x)2
6.6
6.8
2
(
x
x
)
s2
n 1
7.5
7.7
7.9
x 36.5
(x x) =
x x
2
s s2
Round to one more decimal than the data.
Dont round until the end.
Include the appropriate units.
Sample
Statistics:
Population
Parameters:
Mean
Standard
Deviation
Variance
s2
( x x )
s s
n 1
2
(
x
)
2
N
Larson/Farber 4th ed.
51
s0
( When would s = 0 ?)
53
54
Using Technology
The gas mileage of 2 cars is sampled over various
conditions:
Car A:
Car B:
Range
4
s = 12.8
s = 74.5
Chebyshevs Theorem
Chebyshevs Theorem
For data with any distribution, the proportion (or
fraction) of any set of data lying within K standard
deviations of the mean is always at least 1-1/K2, where
K is any positive number greater than 1.
For K = 2, at least 3/4 (or 75%) of all values lie
within 2 standard deviations of the mean
For K = 3, at least 8/9 (or 89%) of all values lie
within 3 standard deviations of the mean