Академический Документы
Профессиональный Документы
Культура Документы
Descriptive
2.1 Frequency
2 Statistics
Distributions and
Their Graphs
2.2 More Graphs and
Displays
2.3 Measures of Central
Tendency
2.4 Measures of Variation
Case Study
2.5 Measures of Position
Uses and Abuses
Real Statistics
Real Decisions
Technology
32
25, 5, 18, 12, 60, 44, 24, 22, 2, 7, 15, 39, 58, 53, 36, 42, 16, 20, 1, 5, 39, 51, 44, 23, 3, 13, 37,
56, 58, 13, 47, 23, 1, 17, 39, 13, 24, 0, 39, 10, 41, 1, 48, 17, 18, 3, 72, 20, 3, 9, 0, 12, 33, 21, 40,
68, 25, 40, 59, 4, 67, 29, 13, 18, 19, 13, 16, 41, 19, 26, 68, 49, 5, 26, 49, 26, 45, 41, 19, 49
Class Frequency, f 20
0 9 15 18
16
1019 19 14
Frequency
2029 14 12
10
3039 7 8
40 49 14 6
5059 6 4
2
6069 4
7079 1
5
.5
.5
.5
.5
.5
.5
.5
4.
14
24
34
44
54
64
74
Age
0 + 0 + 1 + 1 + 1 + + 67 + 68 + 68 + 72
Mean =
80
2226
=
80
Find an average.
L 27.8 years
Range = 72 - 0
= 72 years Find how the data vary.
33
GUIDELINES
Constructing a Frequency Distribution from a Data Set
Study Tip 1. Decide on the number of classes to include in the frequency distribution.
The number of classes should be between 5 and 20; otherwise, it may
distribution, it
In a frequency be difficult to detect any patterns.
class has the
is best if each
An swers shown 2. Find the class width as follows. Determine the range of the data, divide
same width.
inimum data the range by the number of classes, and round up to the next convenient
will use the m
wer limit of number.
value for the lo
Sometimes it
the first class. 3. Find the class limits. You can use the minimum data entry as the lower
e convenient to
may be mor limit of the first class. To find the remaining lower limits, add the class
that is slightly
choose a value width to the lower limit of the preceding class. Then find the upper
minimum
lower than the limit of the first class. Remember that classes cannot overlap. Find the
ency distri-
value. The frequ
uc ed will vary remaining upper class limits.
bution prod
slightly. 4. Make a tally mark for each data entry in the row of the appropriate
class.
5. Count the tally marks to find the total frequency f for each class.
Note to Instructor
Let students know that there are many
EXAMPLE 1
correct versions for a frequency Constructing a Frequency Distribution from a Data Set
distribution. To make it easy to check
answers, however, they should follow The following sample data set lists the number of minutes 50 Internet
the conventions shown in the text. subscribers spent on the Internet during their most recent session. Construct a
frequency distribution that has seven classes.
50 40 41 17 11 7 22 44 28 21 19 23 37 51 54 42 88
Insight 41 78 56 72 56 17 7 69 30 80 56 29 33 46 31 39 20
18 29 34 59 73 77 36 39 30 62 54 67 39 31 53 44
whole num-
If you obtain a
ulating the SOLUTION
ber when calc
a frequency
class width of 1. The number of classes (7) is stated in the problem.
n, us e the next
distributio
r as the class 2. The minimum data entry is 7 and the maximum data entry is 88, so the range
whole numbe
is ensures is 81. Divide the range by the number of classes and round up to find that the
width. Doing th
ou gh space in class width is 12.
you have en
distribution
your frequency 88 - 7 Maximum entry - Minimum entry
values.
for all the data Class width =
7 Number of classes
81 Range
=
7 Number of classes
Lower Upper
limit limit L 11.57 Round up to 12.
7 18 3. The minimum data entry is a convenient lower limit for the first class. To find
19 30 the lower limits of the remaining six classes, add the class width of 12 to the
lower limit of each previous class. The upper limit of the first class is 18,
31 42
which is one less than the lower limit of the second class. The upper limits of
43 54 the other classes are 18 + 12 = 30, 30 + 12 = 42, and so on. The lower and
55 66 upper limits for all seven classes are shown.
67 78 4. Make a tally mark for each data entry in the appropriate class.
79 90 5. The number of tally marks for a class is the frequency for that class.
The frequency distribution is shown in the following table. The first class, 718,
has six tally marks. So, the frequency for this class is 6. Notice that the sum of
Study Tip
the frequencies is 50, which is the number of entries in the sample data set. The
sum is denoted by g f, where g is the uppercase Greek letter sigma.
k letter
p e rc a se Gree gh-
The up 2 is used throu
sigm a 1 g dica a
te Frequency Distribution for Internet Usage (in minutes)
t sta tis tics to in s.
ou value
tion of Minutes Number of
summa online Class Tally Frequency, f subscribers
718 6
1930 10
Note to Instructor
3142 13
Be sure that students interpret the class 4354 8
width correctly as the distance
5566 5
between lower (or upper) limits of Check that the sum
consecutive classes. A common error is 6778 6
of the frequencies
to use a class width of 11 for the class 7990 2 equals the number
g f = 50
718. Students should be shown that in the sample.
this class actually has a width of 12.
Try It Yourself 1
Construct a frequency distribution using the Akhiok population data set listed
in the Chapter Opener on page 33. Use eight classes.
a. State the number of classes.
b. Find the minimum and maximum values and the class width.
c. Find the class limits.
d. Tally the data entries.
e. Write the frequency f for each class. Answer: Page A29
DEFINITION
The midpoint of a class is the sum of the lower and upper limits of the
class divided by two. The midpoint is sometimes called the class mark.
1Lower class limit2 + 1Upper class limit2
Midpoint =
2
The relative frequency of a class is the portion or percentage of the data
that falls in that class. To find the relative frequency of a class, divide the
frequency f by the sample size n.
Class frequency
Relative frequency =
Sample size
f
=
n
The cumulative frequency of a class is the sum of the frequency for that
class and all previous classes. The cumulative frequency of the last class is
equal to the sample size n.
After finding the first midpoint, you can find the remaining midpoints by
adding the class width to the previous midpoint. For instance, if the first
midpoint is 12.5 and the class width is 12, then the remaining midpoints are
12.5 + 12 = 24.5
24.5 + 12 = 36.5
36.5 + 12 = 48.5
48.5 + 12 = 60.5
and so on.
You can write the relative frequency as a fraction, decimal, or percent. The
sum of the relative frequencies of all the classes must equal 1 or 100%.
EXAMPLE 2
Midpoints, Relative and Cumulative Frequencies
Using the frequency distribution constructed in Example 1, find the midpoint,
relative frequency, and cumulative frequency for each class. Identify any patterns.
SOLUTION The midpoint, relative frequency, and cumulative frequency for the
first three classes are calculated as follows.
Relative Cumulative
Class f
Midpoint frequency frequency
7 + 18 6
718 6 = 12.5 = 0.12 6
2 50
19 + 30 10
1930 10 = 24.5 = 0.2 6 + 10 = 16
2 50
31 + 42 13
3142 13 = 36.5 = 0.26 16 + 13 = 29
2 50
The remaining midpoints, relative frequencies, and cumulative frequencies are
shown in the following expanded frequency distribution.
Interpretation There are several patterns in the data set. For instance, the
most common time span that users spent online was 31 to 42 minutes.
Try It Yourself 2
Using the frequency distribution constructed in Try It Yourself 1, find the
midpoint, relative frequency, and cumulative frequency for each class. Identify
any patterns.
a. Use the formulas to find each midpoint, relative frequency, and cumulative
frequency.
b. Organize your results in a frequency distribution.
c. Identify patterns that emerge from the data. Answer: Page A29
DEFINITION
A frequency histogram is a bar graph that represents the frequency
Study Tip
distribution of a data set. A histogram has the following properties.
1. The horizontal scale is quantitative and measures the data values.
are integers,
If data entries 2. The vertical scale measures the frequencies of the classes.
m each lower
subtract 0.5 fro 3. Consecutive bars must touch.
e lower class
limit to find th
find the upper
boundaries. To
s, add 0.5 to
class boundarie Because consecutive bars of a histogram must touch, bars must begin and
it. The upper
each upper lim l end at class boundaries instead of class limits. Class boundaries are the numbers
class will equa
boundary of a that separate classes without forming gaps between them. You can mark the
dary of the
the lower boun horizontal scale either at the midpoints or at the class boundaries, as shown in
clas s.
next higher
Example 3.
EXAMPLE 3
Constructing a Frequency Histogram
Draw a frequency histogram for the frequency distribution in Example 2.
Describe any patterns.
SOLUTION First, find the class boundaries. The distance from the upper limit of
the first class to the lower limit of the second class is 19 - 18 = 1. Half this
Class Frequency, distance is 0.5. So, the lower and upper boundaries of the first class are as follows:
Class boundaries f
First class lower boundary = 7 - 0.5 = 6.5
718 6.518.5 6
First class upper boundary = 18 + 0.5 = 18.5
1930 18.530.5 10
3142 30.5 42.5 13 The boundaries of the remaining classes are shown in the table. Using the class
midpoints or class boundaries for the horizontal scale and choosing possible
4354 42.554.5 8
frequency values for the vertical scale, you can construct the histogram.
5566 54.566.5 5
6778 66.578.5 6 Internet Usage Internet Usage
7990 78.590.5 2 (labeled with class midpoints) (labeled with class boundaries)
14 13 14 13
(number of subscribers)
(number of subscribers)
12 12
10 10
Frequency
Frequency
10 10
8 8
8 8
6 6 6 6
6 5 6 5
4 4
2 2
2 2
12.5 24.5 36.5 48.5 60.5 72.5 84.5 6.5 18.5 30.5 42.5 54.5 66.5 78.5 90.5
Broken axis Time online (in minutes) Time online (in minutes)
Interpretation From either histogram, you can see that more than half of the
subscribers spent between 19 and 54 minutes on the Internet during their most
recent session.
Try It Yourself 3
Use the frequency distribution from Try It Yourself 1 to construct a frequency
histogram that represents the ages of the residents of Akhiok. Describe any
patterns.
a. Find the class boundaries.
b. Choose appropriate horizontal and vertical scales.
c. Use the frequency distribution to find the height of each bar.
d. Describe any patterns for the data. Answer: Page A30
EXAMPLE 4
Constructing a Frequency Polygon
Draw a frequency polygon for the frequency distribution in Example 2.
Study Tip SOLUTION To construct the frequency polygon, use the same horizontal and
vertical scales that were used in the histogram labeled with class midpoints in
d its Example 3. Then plot points that represent the midpoint and frequency of each
A histogram an
g frequency
correspondin class and connect the points in order from left to right. Because the graph
e of te n drawn
polygon ar should begin and end on the horizontal axis, extend the left side to one class
u have not
together. If yo width before the first class midpoint and extend the right side to one class width
ucted the
already constr after the last class midpoint.
gi n construct-
histogram, be
cy polygon
ing the frequen Internet Usage
propriate
by choosing ap
vertical scales.
horizontal and 14
l scale should
(number of subscribers)
The horizonta , 12
class midpoints
consist of the e shou ld 10
al sc al
Frequency
Try It Yourself 4
Use the frequency distribution from Try It Yourself 1 to construct a frequency
polygon that represents the ages of the residents of Akhiok. Describe any patterns.
a. Choose appropriate horizontal and vertical scales.
b. Plot points that represent the midpoint and frequency for each class.
c. Connect the points and extend the sides as necessary.
d. Describe any patterns for the data. Answer: Page A30
A relative frequency histogram has the same shape and the same horizontal
scale as the corresponding frequency histogram. The difference is that the
vertical scale measures the relative frequencies, not frequencies.
EXAMPLE 5
Picturing the World Constructing a Relative Frequency Histogram
Old Faithful, a geyser at
Yellowstone National Park, Draw a relative frequency histogram for the frequency distribution in
erupts on a regular basis. The Example 2.
time spans of a sample of erup-
tions are given in the relative SOLUTION The relative frequency histogram is shown. Notice that the shape of
frequency histogram. (Source: the histogram is the same as the shape of the frequency histogram constructed
Yellowstone National Park) in Example 3. The only difference is that the vertical scale measures the
relative frequencies.
Old Faithful Eruptions
0.40
Internet Usage
Relative frequency
0.30
0.28
0.20
(portion of subscribers)
0.24
Relative frequency
0.10 0.20
0.16
2.0 2.6 3.2 3.8 4.4
Duration of eruption 0.12
(in minutes) 0.08
0.04
Fifty percent of the
eruptions last less than
6.5 18.5 30.5 42.5 54.5 66.5 78.5 90.5
how many minutes? Time online (in minutes)
Interpretation From this graph, you can quickly see that 0.20 or 20% of the
Internet subscribers spent between 18.5 minutes and 30.5 minutes online, which
is not as immediately obvious from the frequency histogram.
Try It Yourself 5
Use the frequency distribution from Try It Yourself 1 to construct a relative
frequency histogram that represents the ages of the residents of Akhiok.
a. Use the same horizontal scale as used in the frequency histogram.
b. Revise the vertical scale to reflect relative frequencies.
c. Use the relative frequencies to find the height of each bar. Answer: Page A30
If you want to describe the number of data entries that are equal to or
below a certain value, you can easily do so by constructing a cumulative
frequency graph.
DEFINITION
A cumulative frequency graph, or ogive (pronounced o jive ), is a line
graph that displays the cumulative frequency of each class at its upper
class boundary. The upper boundaries are marked on the horizontal axis,
and the cumulative frequencies are marked on the vertical axis.
GUIDELINES
Constructing an Ogive (Cumulative Frequency Graph)
1. Construct a frequency distribution that includes cumulative frequencies
as one of the columns.
2. Specify the horizontal and vertical scales. The horizontal scale consists
of upper class boundaries, and the vertical scale measures cumulative
frequencies.
3. Plot points that represent the upper class boundaries and their
corresponding cumulative frequencies.
4. Connect the points in order from left to right.
5. The graph should start at the lower boundary of the first class (cumu-
lative frequency is zero) and should end at the upper boundary of the
last class (cumulative frequency is equal to the sample size).
EXAMPLE 6
Constructing an Ogive
Draw an ogive for the frequency distribution in Example 2. Estimate how many
subscribers spent 60 minutes or less online during their last session. Also, use
the graph to estimate when the greatest increase in usage occurs.
Upper class Cumulative SOLUTION Using the frequency distribution, you can construct the ogive
boundary f frequency shown. The upper class boundaries, frequencies, and cumulative frequencies are
shown in the table. Notice that the graph starts at 6.5, where the cumulative
18.5 6 6 frequency is 0, and the graph ends at 90.5, where the cumulative frequency is 50.
30.5 10 16
42.5 13 29 Internet Usage
54.5 8 37
66.5 5 42 50
(number of subscribers)
Cumulative frequency
78.5 6 48 40
90.5 2 50
30
20
10
Interpretation From the ogive, you can see that about 40 subscribers spent
60 minutes or less online during their last session. The greatest increase in usage
occurs between 30.5 minutes and 42.5 minutes because the line segment is
steepest between these two class boundaries.
Another type of ogive uses percent as the vertical axis instead of frequency
(see Example 5 in Section 2.5).
Try It Yourself 6
Use the frequency distribution from Try It Yourself 1 to construct an ogive that
represents the ages of the residents of Akhiok. Estimate the number of
residents who are 49 years old or younger.
a. Specify the horizontal and vertical scales.
b. Plot the points given by the upper class boundaries and the cumulative
frequencies.
c. Construct the graph.
d. Estimate the number of residents who are 49 years old or younger.
Answer: Page A30
EXAMPLE 7
Using Technology to Construct Histograms
Use a calculator or a computer to construct a histogram for the frequency
SOLUTION MINITAB, Excel, and the TI-83 each have features for graphing
using
Detailed instructions for histograms. Try using this technology to draw the histograms as shown.
ITAB, Excel, and the TI-83
MIN
hn olo gy
are shown in the Tec
this
Guide that accompanies
text. For instan ce, here are
14
a
instructions for creating 12
histogram on a TI-8 3. 10
10
Frequency
Frequency
8
STAT ENTER 6
5
2
Enter frequencies in L2. 0 0
12.5 24.5 36.5 48.5 60.5 72.5 84.5 12.5 24.5 36.5 48.5 60.5 72.5 84.5
Turn on Plot 1.
Highlight Histogram.
Xlist: L1
Freq: L2
ZOOM 9
WINDOW
Xscl=12
GRAPH
Try It Yourself 7
Use a calculator or a computer to construct a frequency histogram that
represents the ages of the residents of Akhiok listed in the Chapter Opener on
page 33. Use eight classes.
a. Enter the data.
b. Construct the histogram. Answer: Page A30
Exercises
2.1
Building Basic Skills and Vocabulary
1. What are some benefits of representing data sets using frequency
Help distributions?
2. What are some benefits of representing data sets using graphs of frequency
distributions?
True or False? In Exercises 58, determine whether the statement is true or false.
1. Organizing the data into a If it is false, rewrite it as a true statement.
frequency distribution may make
patterns within the data more 5. The midpoint of a class is the sum of its lower and upper limits.
evident.
2. Sometimes it is easier to identify 6. The relative frequency of a class is the sample size divided by the frequency
patterns of a data set by looking of the class.
at a graph of the frequency
distribution. 7. An ogive is a graph that displays cumulative frequency.
3. Class limits determine which 8. Class limits are used to ensure that consecutive bars of a histogram do
numbers can belong to that class. not touch.
Class boundaries are the numbers
that separate classes without
forming gaps between them.
Reading a Frequency Distribution In Exercises 9 and 10, use the given frequency
4. Frequency for a class is the number
distribution to find the
of data entries in each class.
Relative frequency of a class is the (a) class width.
percent of the data that fall in each (b) class midpoints.
class.
(c) class boundaries.
5. False. The midpoint of a class is the
sum of the lower and upper limits 9. Employee Age 10. Tree Height
of the class divided by two.
6. False. The relative frequency of a Class Frequency, f Class Frequency, f
class is the frequency of the class 2029 10 16 20 100
divided by the sample size.
3039 132 2125 122
7. True
4049 284 26 30 900
8. False. Class boundaries are used to
ensure that consecutive bars of a
5059 300 3135 207
histogram do not touch. 6069 175 36 40 795
9. See Odd Answers, page A## 7079 65 4145 568
10. See Selected Answers, page A## 8089 25 46 50 322
11. See Odd Answers, page A##
12. See Selected Answers, page A##
11. Use the frequency distribution in Exercise 9 to construct an expanded
frequency distribution, as shown in Example 2.
13. (a) Number of classes = 7 Graphical Analysis In Exercises 13 and 14, use the frequency histogram to
(b) Least frequency L 10
(a) determine the number of classes.
(c) Greatest frequency L 300
(b) estimate the frequency of the class with the least frequency.
(d) Class width = 10
(c) estimate the frequency of the class with the greatest frequency.
14. (a) Number of classes = 7
(b) Least frequency L 100 (d) determine the class width.
(c) Greatest frequency L 900
13. 14.
(d) Class width = 5
Employee Age Tree Height
15. (a) 50
(b) 12.513.5 pounds 300 900
Frequency
Frequency
(b) 68 70 inches 200 600
(b) 70 inches
24.5
34.5
44.5
54.5
64.5
74.5
84.5
18 23 28 33 38 43 48
Height (in inches)
Age (in years)
15. 16.
Adult Male Rhesus Monkeys Adult Male Ages 2029
55 55
50
Cumulative frequency
Cumulative frequency
50
45 45
40 40
35 35
30 30
25 25
20 20
15
15
10
10
5
5
8.5 10.5 12.5 14.5 16.5 18.5 20.5 22.5
62 64 66 68 70 72 74 76 78
Weight (in pounds)
Height (in inches)
19. (a) Class with greatest relative Graphical Analysis In Exercises 19 and 20, use the relative frequency histogram to
frequency: 8 9 inches
Class with least relative (a) identify the class with the greatest and the least relative frequency.
frequency: 1718 inches (b) approximate the greatest and least relative frequency.
(b) Greatest relative frequency (c) approximate the relative frequency of the second class.
L 0.195
19. Atlantic Croaker Fish 20. Emergency Response Time
Least relative frequency
L 0.005 0.20 40%
(c) Approximately 0.015
Relative frequency
Relative frequency
0.16
30%
20. (a) Class with greatest relative
0.12
frequency: 19 20 minutes 20%
0.08
Class with least relative
frequency: 2122 minutes 0.04 10%
9 15
Frequency
Class with least frequency:
6 10
6.256.75
23. See Odd Answers, page A## 3 5
24. See Selected Answers, page A##
225
275
325
375
425
475
525
575
625
675
725
775
indicates that the data set for this exercise is available electronically.
DATA
25. See Odd Answers, page A## Constructing a Frequency Distribution and a Frequency Histogram In Exercises
26. See Selected Answers, page A## 2528, construct a frequency distribution and a frequency histogram for the data
27. See Odd Answers, page A## set using the indicated number of classes. Describe any patterns.
28. See Selected Answers, page A## 25. Sales
29. See Odd Answers, page A## DATA
Number of classes: 6
30. See Selected Answers, page A##
Data set: July sales (in dollars) for all sales representatives at a company
2114 2468 7119 1876 4105 3183 1932 1355
4278 1030 2000 1077 5835 1512 1697 2478
3981 1643 1858 1500 4608 1000
26. Pepper Pungencies
DATA
Number of classes: 5
Data set: Pungencies (in 1000s of Scoville units) of 24 tabasco peppers
35 51 44 42 37 38 36 39 44 43 40 40
32 39 41 38 42 39 40 46 37 35 41 39
27. Reaction Times
DATA
Number of classes: 8
Data set: Reaction times (in milliseconds) of a sample of 30 adult females
to an auditory stimulus
507 389 305 291 336 310 514 442 307 337
373 428 387 454 323 441 388 426 469 351
411 382 320 450 309 416 359 388 422 413
28. Fracture Times
DATA
Number of classes: 5
Data set: Amount of pressure (in pounds per square inch) at fracture time
for 25 samples of brick mortar
2750 2862 2885 2490 2512 2456 2554 2532 2885
2872 2601 2877 2721 2692 2888 2755 2853 2517
2867 2718 2641 2834 2466 2596 2519
38. See Selected Answers, page A## 38. Children of the President
DATA
39. See Odd Answers, page A## Number of classes: 6
40. See Selected Answers, page A## Data set: Number of children of the U.S. presidents (Source: infoplease.com)
41. Histogram (5 Classes)
0 5 6 0 2 4 0 4 10 15 0 6 2 3
8
7
0 4 5 4 8 7 3 5 3 2 6 3 3 0
6 2 2 6 1 2 3 2 2 4 4 4 6 1 2
Frequency
5
4
3
2
1 Extending Concepts
2 5 8 11 14
Data 39. What Would You Do? You work at a bank and are asked to recommend the
DATA amount of cash to put in an ATM each day. You dont want to put in too
Histogram (10 Classes)
much (security) or too little (customer irritation). Here are the daily
6 withdrawals (in 100s of dollars) for a period of 30 days.
5
Frequency
4 72 84 61 76 104 76 86 92 80 88
3 98 76 97 82 84 67 70 81 82 89
2 74 73 86 81 85 78 82 80 91 83
1
(a) Construct a relative frequency histogram for the data, using eight
1.5 5.5 9.5 13.5 17.5
Data classes.
Histogram (20 Classes) (b) If you put $9000 in the ATM each day, what percent of the days in a
month should you expect to run out of cash? Explain your reasoning.
5
4
(c) If you are willing to run out of cash for 10% of the days, how much cash,
Frequency
3
in hundreds of dollars, should you put in the ATM each day? Explain
your reasoning.
2
1
40. What Would You Do? You work in the admissions department for a college
1 3 5 7 9 11 13 15 17 19
DATA and are asked to recommend the minimum SAT scores that the college will
Data accept for a position as a full-time student. Here are the SAT scores for a
In general, a greater number of sample of 50 applicants.
classes better preserves the actual
1325 1072 982 996 872 849 785 706 669 1049
values of the data set but is not as
helpful for observing general 885 1367 935 980 1188 869 1006 1127 979 1034
trends and making conclusions. 1052 1165 1359 667 1264 727 808 955 544 1202
In choosing the number of classes, 1051 1173 410 1148 1195 1141 1193 768 812 887
an important consideration is the 1211 1266 830 672 917 988 791 1035 688 700
size of the data set. For instance,
(a) Construct a relative frequency histogram for the data using 10 classes.
you would not want to use 20
classes if your data set contained (b) If you set the minimum score at 986, what percent of the applicants will
20 entries. In this particular you be accepting? Explain your reasoning.
example, as the number of classes (c) If you want to accept the top 88% of the applicants, what should the
increases, the histogram shows
minimum score be? Explain your reasoning.
more fluctuation. The histograms
with 10 and 20 classes have classes 41. Writing What happens when the number of classes is increased for a
with zero frequencies. Not much is DATA frequency histogram? Use the data set listed and a technology tool to
gained by using more than five create frequency histograms with 5, 10, and 20 classes. Which graph displays
classes. Therefore, it appears that the data best?
five classes would be best.
2 7 3 2 11 3 15 8 4 9 10 13 9
7 11 10 1 2 12 5 6 4 2 9 15
EXAMPLE 1
Constructing a Stem-and-Leaf Plot
The following are the numbers of league-leading runs batted in (RBIs) for
baseballs American League during a recent 50-year period. Display the data in
a stem-and-leaf plot. What can you conclude? (Source: Major League Baseball)
155 159 144 129 105 145 126 116 130 114 122 112 112 142 126
118 118 108 122 121 109 140 126 119 113 117 118 109 109 119
139 139 122 78 133 126 123 145 121 134 124 119 132 133 124
129 112 126 148 147
SOLUTION Because the data entries go from a low of 78 to a high of 159, you
Study Tip
should use stem values from 7 to 15. To construct the plot, list these stems to the
left of a vertical line. For each data entry, list a leaf to the right of its stem. For
instance, the entry 155 has a stem of 15 and a leaf of 5.The resulting stem-and-leaf
af plot, you
In a stem-and-le plot will be unordered. To obtain an ordered stem-and-leaf plot, rewrite the plot
many leaves
should have as with the leaves in increasing order from left to right. It is important to include a
tr ies in the
as there are en key for the display to identify the values of the data.
se t.
original data
RBIs for American League Leaders RBIs for American League Leaders
7 8 Key: 15 5 = 155 7 8 Key: 15 5 = 155
8 8
Insight 9
10 58999
9
10 5 8 9 9 9
em-and-leaf
You can use st 11 6422889378992 11 2 2 2 3 4 6 7 8 8 8 9 9 9
tif y unusual
plots to iden 12 962621626314496 12 1 1 2 2 2 3 4 4 6 6 6 6 6 9 9
es ca lle d outliers.
data valu 13 0993423 13 0 2 3 3 4 9 9
e data value
In Example 1, th 14 4520587 14 0 2 4 5 5 7 8
. u will
Yo
78 is an outlier 15 59 15 5 9
ou t outliers
learn more ab
in Section 2.3. Unordered Stem-and-Leaf Plot Ordered Stem-and-Leaf Plot
Interpretation From the ordered stem-and-leaf plot, you can conclude that
more than 50% of the RBI leaders had between 110 and 130 RBIs.
Try It Yourself 1
Use a stem-and-leaf plot to organize the Akhiok population data set listed in
the Chapter Opener on page 33. What can you conclude?
a. List all possible stems.
b. List the leaf of each data entry to the right of its stem and include a key.
c. Rewrite the stem-and-leaf plot so that the leaves are ordered.
d. Use the plot to make a conclusion. Answer: Page A30
EXAMPLE 2
Constructing Variations of Stem-and-Leaf Plots
Note to Instructor Organize the data given in Example 1 using a stem-and-leaf plot that has two
If you are using MINITAB or Excel, ask lines for each stem. What can you conclude?
students to use this technology to SOLUTION Construct the stem-and-leaf plot as described in Example 1, except
construct a stem-and-leaf plot. now list each stem twice. Use the leaves 0, 1, 2, 3, and 4 in the first stem row and
the leaves 5, 6, 7, 8, and 9 in the second stem row. The revised stem-and-leaf plot
is shown.
RBIs for American League Leaders RBIs for American League Leaders
7 Key: 15 5 = 155 7 Key: 15 5 = 155
7 8 7 8
8 8
Insight 8
9
8
9
ples 1 and 2. 9 9
Compare Exam
using two
Notice that by 10 10
, you obtain a
lines per stem 10 5 8 9 9 9 10 5 8 9 9 9
picture of
more detailed 11 4 2 2 3 2 11 2 2 2 3 4
the data. 11 6 8 8 9 7 8 9 9 11 6 7 8 8 8 9 9 9
12 2 2 1 2 3 1 4 4 12 1 1 2 2 2 3 4 4
12 9 6 6 6 6 9 6 12 6 6 6 6 6 9 9
13 0 3 4 2 3 13 0 2 3 3 4
13 9 9 13 9 9
14 4 2 0 14 0 2 4
14 5 5 8 7 14 5 5 7 8
15 15
15 5 9 15 5 9
Unordered Stem-and-Leaf Plot Ordered Stem-and-Leaf Plot
Interpretation From the display, you can conclude that most of the RBI
leaders had between 105 and 135 RBIs.
Try It Yourself 2
Using two rows for each stem, revise the stem-and-leaf plot you constructed in
Try It Yourself 1.
a. List each stem twice.
b. List all leaves using the appropriate stem row. Answer: Page A30
You can also use a dot plot to graph quantitative data. In a dot plot, each
data entry is plotted, using a point, above a horizontal axis. Like a stem-and-leaf
plot, a dot plot allows you to see how data are distributed, determine specific
data entries, and identify unusual data values.
EXAMPLE 3
Constructing a Dot Plot
Use a dot plot to organize the RBI data given in Example 1.
155 159 144 129 105 145 126 116 130
114 122 112 112 142 126 118 118 108
122 121 109 140 126 119 113 117 118
109 109 119 139 139 122 78 133 126
123 145 121 134 124 119 132 133 124
129 112 126 148 147
SOLUTION So that each data entry is included in the dot plot, the horizontal
axis should include numbers between 70 and 160. To represent a data entry, plot
a point above the entrys position on the axis. If an entry is repeated, plot
another point above the previous point.
70 75 80 85 90 95 100 105 110 115 120 125 130 135 140 145 150 155 160
Interpretation From the dot plot, you can see that most values cluster
between 105 and 148 and the value that occurs the most is 126. You can also
see that 78 is an unusual data value.
Try It Yourself 3
Use a dot plot to organize the Akhiok population data set listed in the Chapter
Opener on page 33. What can you conclude from the graph?
a. Choose an appropriate scale for the horizontal axis.
b. Represent each data entry by plotting a point.
c. Describe any patterns for the data. Answer: Page A30
EXAMPLE 4
Motor Vehicle Occupants Constructing a Pie Chart
Killed in 2001
The numbers of motor vehicle occupants killed in crashes in 2001 are shown in
Vehicle type Killed the table. Use a pie chart to organize the data. What can you conclude? (Source:
U.S. Department of Transportation, National Highway Traffic Safety Administration)
Cars 20,269
Trucks 12,260 SOLUTION Begin by finding the relative frequency, or percent, of each category.
Motorcycles 3,067 Then construct the pie chart using the central angle that corresponds to each
Other 612 category. To find the central angle, multiply 360 by the categorys relative
frequency. For example, the central angle for cars is 36010.562 L 202. From
the pie chart, you can see that most fatalities in motor vehicle crashes were
those involving the occupants of cars.
Try It Yourself 4
The numbers of motor vehicle occupants killed in crashes in 1991 are shown
in the table. Use a pie chart to organize the data. Compare the 1991 data with
the 2001 data. (Source: U.S. Department of Transportation, National Highway Safety
Administration)
Technology can be used to construct pie charts. For instance, an Excel pie
chart for the data in Example 4 is shown.
EXAMPLE 5
Constructing a Pareto Chart
Picturing the World In a recent year, the retail industry lost $41.0 million in inventory shrinkage.
The five top-selling vehicles Inventory shrinkage is the loss of inventory through breakage, pilferage, shoplift-
in the United States for ing, and so on. The causes of the inventory shrinkage are administrative error
January of 2004 are shown in ($7.8 million), employee theft ($15.6 million), shoplifting ($14.7 million), and
the following Pareto chart. vendor fraud ($2.9 million). If you were a retailer, which causes of inventory
One of the top five vehicles shrinkage would you address first? (Source: National Retail Federation and Center
was a car. The other four for Retailing Education, University of Florida)
vehicles were trucks. (Source:
Associated Press) SOLUTION Using frequencies for the vertical axis, you can construct the Pareto
chart as shown.
Five Top-Selling Vehicles
for January of 2004 Causes of Inventory Shrinkage
Number sold (in thousands)
70
62 16
60
14
Millions of dollars
50
41 12
40
31 28 10
30 26
20 8
10 6
4
2
es
do
ry
m
rer
a
eri
am
era
plo
eR
F-S
ta C
ilv
Ex
dg
Do
yo
rd
ole
theft error
Fo
fraud
Fo
To
evr
Cause
Ch
Vehicle
Interpretation From the graph, it is easy to see that the causes of inventory
How many vehicles
shrinkage that should be addressed first are employee theft and shoplifting.
from the top five did
Ford sell in January
of 2004? Try It Yourself 5
Every year, the Better Business Bureau (BBB) receives complaints from
customers. In a recent year, the BBB received the following complaints.
7792 complaints about home furnishing stores
5733 complaints about computer sales and service stores
14,668 complaints about auto dealers
9728 complaints about auto repair shops
4649 complaints about dry cleaning companies
Use a Pareto chart to organize the data. What source is the greatest cause of
complaints? (Source: Council of Better Business Bureaus)
a. Find the frequency or relative frequency for each data entry.
b. Position the bars in decreasing order according to frequency or relative
frequency.
c. Interpret the results in the context of the data. Answer: Page A31
EXAMPLE 6
Interpreting a Scatter Plot
The British statistician Ronald Fisher (see page 29) introduced a famous data set
called Fishers Iris data set.This data set describes various physical characteristics,
such as petal length and petal width (in millimeters), for three species of iris. In
the scatter plot shown, the petal lengths form the first data set and the petal
widths form the second data set. As the petal length increases, what tends to
happen to the petal width? (Source: Fisher, R. A., 1936)
Note to Instructor
A complete discussion of types of
Fishers Iris Data Set
correlation occurs in Chapter 9. You
25
may want, however, to discuss positive
Petal width (in millimeters)
10 20 30 40 50 60 70
Length of Petal length (in millimeters)
employment Salary
(in years) (in dollars)
SOLUTION The horizontal axis represents the petal length, and the vertical axis
5 32,000 represents the petal width. Each point in the scatter plot represents the petal
4 32,500 length and petal width of one flower.
8 40,000 Interpretation From the scatter plot, you can see that as the petal length
4 27,350 increases, the petal width also tends to increase.
2 25,000
10 43,000 Try It Yourself 6
7 41,650 The lengths of employment and the salaries of 10 employees are listed in the
6 39,225 table at the left. Graph the data using a scatter plot. What can you conclude?
9 45,100 a. Label the horizontal and vertical axes.
3 28,000 b. Plot the paired data.
c. Describe any trends. Answer: Page A31
You will learn more about scatter plots and how to analyze them in
Chapter 9.
Note to Instructor SOLUTION Let the horizontal axis represent the years and the vertical axis
Consider asking students to find a time represent the number of subscribers (in millions). Then plot the paired data
series plot in a magazine or newspaper and connect them with line segments.
and bring it to class for discussion.
Cellular Telephone Subscribers
130
Subscribers (in millions)
120
110
100
90
80
70
60
50
40
30
20
10
1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001
Year
Interpretation The graph shows that the number of subscribers has been
increasing since 1991, with greater increases recently.
Try It Yourself 7
Use the table in Example 7 to construct a time series chart for a subscribers
average local monthly cellular telephone bill for the years 1991 through 2001.
What can you conclude?
a. Label the horizontal and vertical axes.
b. Plot the paired data and connect them with line segments.
c. Describe any patterns you see. Answer: Page A31
Exercises
2.2
Building Basic Skills and Vocabulary
1. Name some ways to display quantitative data graphically. Name some ways
Help to display qualitative data graphically.
2. What is an advantage of using a stem-and-leaf plot instead of a histogram?
What is a disadvantage?
Student Putting Graphs in Context In Exercises 36, match the plot with the description of
Study Pack the sample.
3. 2 8 9 Key: 2 8 = 28 4. 6 78 Key: 6 7 = 67
3 2223457789 7 455888
1. Quantitative: stem-and-leaf plot, 4 0245 8 1355889
dot plot, histogram, scatter plot, 5 1 9 00024
time series chart
6 56
Qualitative: pie chart, Pareto chart
7 2
2. Unlike the histogram, the stem-
and-leaf plot still contains the 5. 6.
original data values. However, some
data are difficult to organize in a
stem-and-leaf plot. 50 52 54 56 58 60 62 64 66
3. a 4. d 5. b 6. c 160 162 164 166 168 170 172 174 176
7. 27, 32, 41, 43, 43, 44, 47, 47, 48, 50,
51, 51, 52, 53, 53, 53, 54, 54, 54, 54, (a) Prices (in dollars) of a sample of 20 brands of jeans
55, 56, 56, 58, 59, 68, 68, 68, 73, 78,
(b) Weights (in pounds) of a sample of 20 first grade students
78, 85
(c) Volumes (in cubic centimeters) of a sample of 20 oranges
Max: 85; Min: 27
8. 129, 133, 136, 137, 137, 141, 141, (d) Ages (in years) of a sample of 20 residents of a retirement home
141, 141, 143, 144, 144, 146, 149,
149, 150, 150, 150, 151, 152, 154, Graphical Analysis In Exercises 710, use the stem-and-leaf plot or dot plot to list
156, 157, 158, 158, 158, 159, 161, the actual data entries. What is the maximum data entry? What is the minimum
166, 167 data entry?
Max: 167; Min: 129
9. 13, 13, 14, 14, 14, 15, 15, 15, 15, 15, 7. 2 7 Key: 2 7 = 27 8. 12 Key: 12 9 = 12.9
16, 17, 17, 18, 19 3 2 12 9
Max: 19; Min: 13 4 1334778 13 3
10. 214, 214, 214, 216, 216, 217, 218, 5 0112333444456689 13 677
218, 220, 221, 223, 224, 225, 225, 6 888 14 1111344
227, 228, 228, 228, 228, 230, 230,
7 388 14 699
231, 235, 237, 239
8 5 15 000124
Max: 239; Min: 214
15 678889
11. Anheuser-Busch spends the most
on advertising and Honda spends 16 1
the least. (Answers will vary.) 1 6 67
12. Value increased the most between
2000 and 2003. (Answers will vary.) 9. 10.
13. Tailgaters irk drivers the most, and
too-cautious drivers irk drivers the
least. (Answers will vary.)
13 14 15 16 17 18 19 215 220 225 230 235
Bus r-
et
er
rs
da
ch
euse
vrol
Coo
Mill
Hon
16. Key: 31 9 = 319
Che
Anh
2000 2001 2002 2003 2004
29 8
Company Year
30 5
31 9 (Source: Nielsen Media Research)
32 7
33 13. How Other Drivers Irk Us 14. Driving and Cell Phone Use
34 5 Too cautious 2% Ignoring signals 50
Number of incidents
35 1 Speeding 3%
36 40
7% Using cell
37 Driving slow phone 21% 30
38 13% 20
39 03 Using two
No signals parking spots 10
40 39
41 059 13% 4%
Swerved Sped Cut off Almost
42 Other 10% Bright lights Tailgating up a car hit a car
43 4% 23% Incident
44 689
45 05 (Adapted from Reuters/Zogby) (Adapted from USA TODAY)
46 05
47 99
48
Graphing Data Sets In Exercises 1528, organize the data using the indicated type
49 1
of graph. What can you conclude about the data?
50 3 15. Elephants: Water Consumed Use a stem-and-leaf plot to display the data. The
It appears that the majority of the DATA data represent the amount of water (in gallons) consumed by 24 elephants
elephants eat between 390 and in one day.
480 pounds of hay each day.
(Answers will vary.)
33 45 34 47 43 48 35 69 45 60 46 51
41 60 66 41 32 40 44 39 46 33 53 53
17. Key: 17 5 = 17.5
16 48 16. Elephants: Hay Eaten Use a stem-and-leaf plot to display the data. The data
17 113455679 DATA represent the amount of hay (in pounds) eaten daily by 24 elephants.
18 13446669 449 450 419 448 479 410 446 465 415 455 345 305
19 0023356 491 479 390 393 403 298 503 327 460 351 409 319
20 18
It appears that most farmers 17. Apple Prices Use a stem-and-leaf plot to display the data. The data
charge 17 to 19 cents per pound of DATA represent the price (in cents per pound) paid to 28 farmers for apples.
apples. (Answers will vary.) 19.2 19.6 16.4 17.1 19.0 17.4 17.3 20.1 19.0 17.5
18. See Selected Answers, page A## 17.6 18.6 18.4 17.7 19.5 18.4 18.9 17.5 19.3 20.8
19.3 18.6 18.6 18.3 17.1 18.1 16.8 17.9
18. Advertisements Use a dot plot to display the data. The data represent
DATA the number of advertisements seen or heard in one week by a sample of
30 people from the United States.
598 494 441 595 728 690 684 486 735 808 734 590 673 545 702
481 298 135 846 764 317 649 732 582 637 588 540 727 486 703
19. Housefly Life Spans 19. Life Spans of House Flies Use a dot plot to display the data. The data
DATA represent the life span (in days) of 40 house flies.
4 5 6 7 8 9 10 11 12 13 14 9 9 4 4 8 11 10 5 8 13 9 6 7 11
Life span (in days) 13 11 6 9 8 14 10 6 10 10 8 7 14 11
It appears that the life span of a 7 8 6 11 13 10 14 14 8 13 14 10
housefly tends to be between 4
and 14 days. (Answers will vary.) 20. Nobel Prize Use a pie chart to display the data. The data represent the
20. Nobel Prize Laureates
number of Nobel Prize laureates by country during the years 19012002.
8
6 9 7 8 7 10
4
2 24. Hourly Wages Use a scatter plot to display the data in the table. The data
represent the number of hours worked and the hourly wage (in dollars) for
Miami, FL
Atlanta, GA
Concord, NH
Boise, ID
Denver, CO
14.00
13.00 35 11.80
12.00 33 11.51
11.00
40 13.65
10.00
33 12.05
9.00
28 10.54
25 30 35 40 45 50
Hours 45 10.33
37 11.57
It appears that hourly wage increases as
the number of hours worked increases. 28 10.17
(Answers will vary.)
Table for Exercise 25 25. Salaries Use a scatter plot to display the data shown in the table. The data
represent the number of students per teacher and the average teacher
Number of Average salary (in thousands of dollars) for a sample of 10 school districts. Describe
students teachers any trends shown.
per teacher salary
26. UV Index Use a time series chart to display the data. The data represent the
17.1 28.7 ultraviolet index for Memphis, TN, on June 14 23 during a recent year.
17.5 47.5 (Source: Weather Services International)
18.9 31.8 June 14 June 15 June 16 June 17 June 18
17.1 28.1 9 4 10 10 10
20.0 40.3 June 19 June 20 June 21 June 22 June 23
18.6 33.8 10 10 10 9 9
14.4 49.8
27. Egg Prices Use a time series chart to display the data. The data represent
16.5 37.5 the prices of Grade A eggs (in dollars per dozen) for the indicated years.
13.3 42.5 (Source: U.S. Bureau of Labor Statistics)
18.4 31.9
1990 1991 1992 1993 1994 1995
25. Teachers Salaries 1.00 1.01 0.93 0.87 0.87 1.16
55 1996 1997 1998 1999 2000 2001
Avg. teachers salary
50
45
1.31 1.17 1.09 0.92 0.96 0.93
40
35
28. T-Bone Steak Prices Use a time series chart to display the data. The data
30 represent the prices of T-bone steak (in dollars per pound) for the indicated
25 years. (Source: U.S. Bureau of Labor Statistics)
13 15 17 19 21
Students per teacher 1990 1991 1992 1993 1994 1995
It appears that a teachers average
5.45 5.21 5.39 5.77 5.86 5.92
salary decreases as the number of 1996 1997 1998 1999 2000 2001
students per teacher increases. 5.87 6.07 6.40 6.71 6.82 7.31
(Answers will vary.)
26. See Selected Answers, page A##
27. Price of Grade A Eggs Extending Concepts
Price of Grade A eggs
(in dollars per dozen)
1.35
1.25 A Misleading Graph? In Exercises 29 and 30,
1.15
1.05 (a) explain why the graph is misleading.
0.95 (b) redraw the graph so that it is not misleading.
0.85
29.
(in thousands of dollars)
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
110
in 1996. (Answers will vary.) 100
28. See Selected Answers, page A## 90
29. See Odd Answers, page A## 3rd 2nd 1st 4th
30. See Selected Answers, page A## Quarter
30.
Sales for Company B
1st 2nd 3rd 4th
1st quarter quarter quarter quarter quarter
20%
20% 15% 45% 20%
3rd quarter
45% 2nd quarter
15%
EXAMPLE 1
Finding a Sample Mean
The prices (in dollars) for a sample of room air conditioners (10,000 Btus per
hour) are listed. What is the mean price of the air conditioners?
Study Tip 500 840 470 480 420 440 440
Try It Yourself 1
The ages of employees in a department are listed. What is the mean age?
34 27 50 45 41 37 24
57 40 38 62 44 39 40
a. Find the sum of the data entries.
b. Divide the sum by the number of data entries.
c. Interpret the results in the context of the data. Answer: Page A31
DEFINITION
The median of a data set is the value that lies in the middle of the data
when the data set is ordered. If the data set has an odd number of entries,
the median is the middle data entry. If the data set has an even number
Study Tip
of entries, the median is the mean of the two middle data entries.
the
t, there are
In a data se lues
er of dat
same numb ian as there
a va EXAMPLE 2
ed
above the m r
elow th e median. Fo Finding the Median
are b 2 , th ree
Example
instance, in e low $ 4 70 Find the median of the air conditioner prices given in Example 1.
s are b
of the price 70.
e above $4
and three ar SOLUTION To find the median price, first order the data.
420 440 440 470 480 500 840
Because there are seven entries (an odd number), the median is the middle, or
fourth, data entry. So, the median air conditioner price is $470.
Try It Yourself 2
One of the families of Akhiok is planning to relocate to another city. The ages
of the family members are 33, 37, 3, 7, and 59. What will be the median age of
the remaining residents of Akhiok after this family relocates?
a. Order the data entries.
b. Find the middle data entry. Answer: Page A31
Akhiok, Alaska is a fishing village on
Kodiak Island.
(Photograph Roy Corral.)
EXAMPLE 3
Finding the Median
The air conditioner priced at $480 is discontinued. What is the median price of
the remaining air conditioners?
Try It Yourself 3
Find the median age of the residents of Akhiok using the population data set
listed in the Chapter Opener on page 33.
a. Order the data entries.
b. Find the mean of the two middle data entries.
c. Interpret the results in the context of the data. Answer: Page A31
DEFINITION
The mode of a data set is the data entry that occurs with the greatest
frequency. If no entry is repeated, the data set has no mode. If two entries
occur with the same greatest frequency, each entry is a mode and the data
set is called bimodal.
EXAMPLE 4
Finding the Mode
Find the mode of the air conditioner prices given in Example 1.
EXAMPLE 5
Finding the Mode
At a political debate a sample of audience members was asked to name the
Political Frequency,
political party to which they belong. Their responses are shown in the table.
party f
What is the mode of the responses?
Democrat 34
Republican 56 SOLUTION The response occurring with the greatest frequency is Republican.
So, the mode is Republican.
Other 21
Did not respond 9 Interpretation In this sample, there were more Republicans than people of
any other single affiliation.
Try It Yourself 5
In a survey, 250 baseball fans were asked if Barry Bondss home run record
would ever be broken. One hundred sixty-nine of the fans responded yes, 54
responded no, and 27 didnt know. What is the mode of the responses?
a. Identify the entry that occurs with the greatest frequency.
b. Interpret the results in the context of the data. Answer: Page A31
Although the mean, the median, and the mode each describe a typical entry
of a data set, there are advantages and disadvantages of using each, especially
when the data set contains outliers.
DEFINITION
An outlier is a data entry that is far removed from the other entries in the
data set.
Ages in a class
EXAMPLE 6
20 20 20 20 20 20 21
21 21 21 22 22 22 23 Comparing the Mean, the Median, and the Mode
23 23 23 24 24 65 Find the mean, the median, and the mode of the sample ages of a class shown
at the left. Which measure of central tendency best describes a typical entry of
Outlier
this data set? Are there any outliers?
SOLUTION
gx 475
Picturing the World Mean: x =
n
=
20
L 23.8 years
160 4
3
140
2
1
Jan. Feb. Mar.
Month 20 25 30 35 40 45 50 55 60 65
Mean Age
Mode Median Outlier
Notice in the graph that
each month the mean price
is about $40,000 more Try It Yourself 6
than the median price. Remove the data entry of 65 from the preceding data set. Then rework the
What factors would cause example. How does the absence of this outlier change each of the measures?
the mean price to be greater
than the median price? a. Find the mean, the median, and the mode.
b. Compare these measures of central tendency with those found in Example 6.
Answer: Page A31
DEFINITION
A weighted mean is the mean of a data set whose entries have varying
weights. A weighted mean is given by
g 1x # w2
x =
gw
where w is the weight of each entry x.
EXAMPLE 7
Finding a Weighted Mean
You are taking a class in which your grade is determined from five sources: 50%
from your test mean, 15% from your midterm, 20% from your final exam, 10%
from your computer lab work, and 5% from your homework. Your scores are
86 (test mean), 96 (midterm), 82 (final exam), 98 (computer lab), and 100
(homework). What is the weighted mean of your scores?
g 1x # w2 88.6
x = = 88.6
gw
=
1
So, your weighted mean for the course is 88.6.
Try It Yourself 7
An error was made in grading your final exam. Instead of getting 82, you
scored 98. What is your new weighted mean?
a. Multiply each score by its weight and find the sum of these products.
b. Find the sum of the weights.
c. Find the weighted mean.
d. Interpret the results in the context of the data. Answer: Page A31
DEFINITION
Study Tip The mean of a frequency distribution for a sample is approximated by
distribution g 1x # f2
If the frequency x = Note that n = gf
pulation, then n
represents a po
e frequency
the mean of th where x and f are the midpoints and frequencies of a class, respectively.
approximated
distribution is
by
g 1x # f 2
m = N GUIDELINES
Finding the Mean of a Frequency Distribution
where N = g
f.
In Words In Symbols
1. Find the midpoint of 1Lower limit2 + 1Upper limit2
x =
each class. 2
EXAMPLE 8
Class midpoint Finding the Mean of a Frequency Distribution
Use the frequency distribution at the left to approximate the mean number of
minutes that a sample of Internet subscribers spent online during their most
1x f 2
Frequency, recent session.
x f #
SOLUTION
12.5 6 75.0
24.5 10 245.0 g 1x # f2 2089
x = = L 41.8
36.5 13 474.5 n 50
48.5 8 388.0 So, the mean time spent online was approximately 41.8 minutes.
60.5 5 302.5
72.5 6 435.0 Try It Yourself 8
84.5 2 169.0 Use a frequency distribution to approximate the mean age of the residents of
n = 50 g = 2089.0 Akhiok. (See Try It Yourself 2 on page 37.)
a. Find the midpoint of each class.
b. Find the sum of the products of each midpoint and corresponding frequency.
c. Find the sum of the frequencies.
d. Find the mean of the frequency distribution. Answer: Page A32
DEFINITION
A frequency distribution is symmetric when a vertical line can be drawn
through the middle of a graph of the distribution and the resulting halves
are approximately mirror images.
A frequency distribution is uniform (or rectangular) when all entries, or
classes, in the distribution have equal frequencies. A uniform distribution
is also symmetric.
A frequency distribution is skewed if the tail of the graph elongates
more to one side than to the other. A distribution is skewed left
(negatively skewed) if its tail extends to the left. A distribution is skewed
right (positively skewed) if its tail extends to the right.
When a distribution is symmetric and unimodal, the mean, median, and mode
are equal. If a distribution is skewed left, the mean is less than the median and
Insight the median is usually less than the mode. If a distribution is skewed right, the
mean is greater than the median and the median is usually greater than the
ll in
ill always fa mode. Examples of these commonly occurring distributions are shown.
The mean w e distribution
n th
the directio
we d. Fo r instance,
is ske
tr ution is
ib
when a dis is to 40 40
ft , the mean
skewed le ed ian. 35 35
em
the left of th 30 30
25 25
20 20
15 15
10 10
5 5
1 3 5 7 9 11 13 15 1 3 5 7 9 11 13 15
Mean Mean
Median Median
Mode
40 40
35 35
30 30
25 25
20 20
15 15
10 10
5 5
1 3 5 7 9 13 15 1 3 5 9 11 13 15
Mean Mode Mode Mean
Median Median
Exercises
2.3
Building Basic Skills and Vocabulary
True or False? In Exercises 14, determine whether the statement is true or false.
Help If it is false, rewrite it so it is a true statement.
1. The median is the measure of central tendency most likely to be affected by
an extreme value (an outlier).
2. Every data set must have a mode.
Student
Study Pack
3. Some quantitative data sets do not have a median.
4. The mean is the only measure of central tendency that can be used for data
at the nominal level of measurement.
5. Give an example in which the mean of a data set is not representative of a
1. False. The mean is the measure of typical number in the data set.
central tendency most likely to be
affected by an extreme value (or 6. Give an example in which the median and the mode of a data set are
outlier). the same.
2. False. Not all data sets must have
a mode.
Graphical Analysis In Exercises 710, determine whether the approximate shape
3. False. All quantitative data sets of the distribution in the histogram is symmetric, uniform, skewed left, skewed
have a median.
right, or none of these. Justify your answer.
4. False. The mode is the only
measure of central tendency 7. 8.
22
15
that can be used for data at the 20
18
nominal level of measurement. 16 12
14
5. A data set with an outlier within 12 9
it would be an example. (Answers 10
8 6
will vary.) 6
4 3
6. Any data set that is symmetric 2
has the same median and mode.
25,000 45,000 65,000 85,000 85 95 105 115 125 135 145 155
7. The shape of the distribution is
skewed right because the bars 9. 10.
18
have a tail to the right. 16
15
8. Symmetric. If a vertical line is drawn 12
12
down the middle, the two halves
9
look approximately the same. 8
6
9. The shape of the distribution is 4
uniform because the bars are 3
24. (a) x = not possible 26. Eating Disorders The number of weeks it took to reach a target weight for
median = not possible a sample of five patients with eating disorders treated by psychodynamic
mode = Domestic psychotherapy (Source: The Journal of Consulting and Clinical Psychology)
(b) Mode, because the data are at 15.0 31.5 10.0 25.5 1.0
the nominal level of
measurement. 27. Eating Disorders The number of weeks it took to reach a target weight for
25. (a) x = 22.6 a sample of 14 patients with eating disorders treated by psychodynamic
psychotherapy and cognitive behavior techniques (Source: The Journal of
median = 19
Consulting and Clinical Psychology)
mode = 14
(b) Median, because the 2.5 20.0 11.0 10.5 17.5 16.5 13.0
distribution is skewed. 15.5 26.5 2.5 27.0 28.5 1.5 5.0
26. (a) x = 16.6 28. Aircraft The number of aircraft 11 airlines have in their fleets (Source:
median = 15 Airline Transport Association)
mode = none 819 366 573 280 375 567
(b) Mean, because there are no 444 145 102 26 37
outliers.
29. Weights (in pounds) of 30. Grade Point Averages of
27. (a) x L 14.11
Dogs at a Kennel Students in a Class
median = 14.25
mode = 2.5 1 02 Key: 1 0 = 10 0 8 Key: 0 8 = 0.8
(b) Mean, because there are no 2 147 1 568
outliers. 3 78 2 1345
28. (a) x L 339.5 4 155 3 09
median = 366 5 07 4 00
mode = none 6 5
(b) Median, because the
7
distribution is skewed. 8
29. (a) x = 41.3 9
median = 39.5
10 6
mode = 45 31. Time (in minutes) it Takes 32. Top Speeds (in miles per hour) of
(b) Median, because the Employees to Drive to Work High-Performance Sports Cars
distribution is skewed.
30. (a) x L 2.5
median = 2.35
mode = 4.0 5 10 15 20 25 30 35 40
(b) Mean, because there are no 200 205 210 215 220
outliers.
31. (a) x L 19.5 Graphical Analysis In Exercises 33 and 34, the letters A, B, and C are marked on the
median = 20 horizontal axis. Determine which is the mean, which is the median, and which is the
mode = 15 mode. Justify your answers.
(b) Median, because the 33. Sick Days Used by Employees 34. Hourly Wages of Employees
distribution is skewed.
16 16
32. See Selected Answers, page A## 14 14
33. A = mode, because its the data 12 12
Frequency
Frequency
10 10
entry that occurred most often.
8 8
B = median, because the 6 6
distribution is skewed right. 4 4
2 2
C = mean, because the
distribution is skewed right. 10 14 16 18 20 22 24 26 28 10 12 14 16 18 20 22 26 28
AB C Days Hourly wageA B C
34. See Selected Answers, page A##
35. Mode, because the data are at the In Exercises 3538, determine which measure of central tendency best represents
nominal level of measurement. the graphed data without performing any calculations. Explain your reasoning.
36. Median, because the distribution is
35. Are You Getting 36. Heights of Players on a
skewed.
Enough Sleep? Hockey Team
37. Mean, because there are no outliers.
120 8
38. Median, because the distribution is 7
100
Frequency
skewed. 6
Frequency
80
5
39. 89.3 60 4
40 3
40. $32,640 20 2
41. 2.8 1
Need more Need less Get the
correct amount 69 70 71 72 73 74 75 76
Response Height (in inches)
Frequency
30 6
25 5
20 4
15 3
10 2
5 1
55 60 65 70 75 80 85 18 20 22 24 26 28 30
Heart rate (beats per minute) BMI
Finding the Weighted Mean In Exercises 39 42, find the weighted mean of
the data.
39. Final Grade The scores and their percent of the final grade for a statistics
student are given. What is the students mean score?
Score Percent of final grade
Homework 85 15%
Quiz 80 10%
Quiz 92 10%
Quiz 76 10%
Project 100 15%
Speech 90 15%
Final Exam 93 25%
40. Salaries The average starting salaries (by degree attained) for 25 employees
at a company are given.What is the mean starting salary for these employees?
8 with MBAs: $42,500
17 with BAs in business: $28,000
41. Grades A student receives the following grades, with an A worth 4 points,
a B worth 3 points, a C worth 2 points, and a D worth 1 point. What is the
students mean grade point score?
B in 2 three-credit classes D in 1 two-credit class
A in 1 four-credit class C in 1 three-credit class
42. 82 42. Scores The mean scores for a statistics course (by major) are given. What
43. 65.5 is the mean score for the class?
44. 70.1 8 engineering majors: 83
45. 35.0
5 math majors: 87
46. 15.3
11 business majors: 79
47. Class Frequency, f Midpoint
34 3 3.5
56 8 5.5 Finding the Mean of Grouped Data In Exercises 4346, approximate the mean of
78 4 7.5 the grouped data.
910 2 9.5
43. Heights of Females The heights 44. Heights of Males The heights (in
1112 2 11.5
(in inches) of 16 female students inches) of 21 male students in a
1314 1 13.5
in a physical education class physical education class
gf = 20
Height Height
Hospitalization (in inches) Frequency (in inches) Frequency
8 6062 3 6365 2
7
6 6365 4 6668 4
Frequency
5
4
6668 7 6971 8
3 6971 2 7274 5
2
1 7577 2
3.5
5.5
7.5
9.5
11.5
13.5
Days hospitalized
45. Ages The ages of residents of a 46. Phone Calls The lengths of long-
Positively skewed town distance calls (in minutes) made
by one person in one year
Age Frequency
09 57 Length Number
1019 68 of call of calls
2029 36 15 12
3039 55 610 26
4049 71 1115 20
5059 44 1620 7
6069 36 2125 11
7079 14 2630 7
8089 8 3135 4
3640 4
4145 1
6 66 72 67 66 69 73 64 62 71 73
5
4
68 72 71 65 69 66 74 72 68 69
3
2 50. Six-Sided Die
1
DATA
144 179 214 249 284 Number of classes: 6
Number of beds
Data set: The results of rolling a six-sided die 30 times
Positively skewed 1 4 6 1 5 3 2 5 4 6
49. Class Frequency, f Midpoint 1 2 4 3 5 6 3 2 1 1
6264 3 63
5 6 2 4 4 3 1 6 2 4
6567 7 66
6870 9 69 51. Coffee Content During a quality assurance check, the actual coffee content
7173 8 72 (in ounces) of six jars of instant coffee was recorded as 6.03, 5.59, 6.40, 6.00,
7476 3 75 5.99, and 6.02.
gf = 30 (a) Find the mean and the median of the coffee content.
Heights of Males (b) The third value was incorrectly measured and is actually 6.04. Find the
9 mean and median of the coffee content again.
8
7 (c) Which measure of central tendency, the mean or the median, was
Frequency
6
5
4
affected more by the data entry error?
3
2 52. U.S. Exports The following data are the U.S. exports (in billions of dollars)
1
to 19 countries for a recent year. (Source: U.S. Department of Commerce)
63 66 69 72 75
Heights
(to the nearest inch) Canada 160.8 Japan 51.4
Mexico 97.5 United Kingdom 33.3
Symmetric
Germany 26.6 South Korea 22.6
50. See Selected Answers, page A##
Taiwan 18.4 Singapore 16.2
51. (a) x = 6.005
Netherlands 18.3 France 19.0
median = 6.01
(b) x = 5.945 China 22.1 Brazil 12.4
median = 6.01 Australia 13.1 Belgium 13.3
(c) Mean Malaysia 10.3 Italy 10.1
52. (a) x L 29.63 Switzerland 7.8 Thailand 4.9
median = 18.3 Saudi Arabia 4.8
(b) x L 22.34 (a) Find the mean and median.
median = 17.25
(b) Find the mean and median without the U.S. exports to Canada.
(c) Mean
(c) Which measure of central tendency, the mean or the median, was
affected more by the elimination of the Canadian export data?
Measures of Variation
2.4 Range Deviation, Variance, and Standard Deviation Interpreting Standard
What You Deviation Standard Deviation for Grouped Data
Should Learn
How to find the range of a
data set Range
How to find the variance
and standard deviation of a In this section, you will learn different ways to measure the variation of a data
population and of a sample set. The simplest measure is the range of the set.
How to use the Empirical Rule
and Chebychevs Theorem to
interpret standard deviation DEFINITION
How to approximate the The range of a data set is the difference between the maximum and
sample standard deviation for minimum data entries in the set.
Range = 1Maximum data entry2 - 1Minimum data entry2
grouped data
EXAMPLE 1
Finding the Range of a Data Set
Two corporations each hired 10 graduates. The starting salaries for each are
shown. Find the range of the starting salaries for Corporation A.
Starting Salaries for Corporation A (1000s of dollars)
Salary 41 38 39 45 47 41 44 41 37 42
Insight SOLUTION Ordering the data helps to find the least and greatest salaries.
37 38 39 41 41 41 42 44 45 47
le 1
ts in Examp
Both data se , a Minimum Maximum
of 41.5
have a mean
Range = 1Maximum salary2 - 1Minimum salary2
e
of 4 1, and a mod
median e tw o se ts
yet th
of 41. And = 47 - 37
icantly.
differ signif
the
nce is that = 10
The differe co n d set
e se
entries in th riat io n . So, the range of the starting salaries for Corporation A is 10, or $10,000.
r va
have greate se ct ion is
th is
Your goal in re Try It Yourself 1
w to measu
to learn ho a d at a set.
n of
the variatio Find the range of the starting salaries for Corporation B.
a. Identify the minimum and maximum salaries.
b. Find the range.
c. Compare your answer with that for Example 1. Answer: Page A32
In Example 2, notice that the sum of the deviations is zero. Because this is
true for any data set, it doesnt make sense to find the average of the deviations.
To overcome this problem, you can square each deviation. In a population data
Study Tip
set, the mean of the squares of the deviations is called the population variance.
uares
add the sq DEFINITION
When you you
ations,
of the devi lled The population variance of a population data set of N entries is
ute a quantity ca
comp noted
g 1x - m22
squares, d e
the sum of
SSx . Population variance = s2 =
N
The symbol s is the lowercase Greek letter sigma.
DEFINITION
The population standard deviation of a population data set of N entries is
the square root of the population variance.
g 1x - m22
Population standard deviation = s = 2s2 =
A N
Note to Instructor
We have used the formulas here that are
GUIDELINES
derived from the definition of the popu- Finding the Population Variance and Standard Deviation
lation variance and standard deviation
because we feel they are easier to In Words In Symbols
remember than the shortcut formula. If gx
you prefer to use the shortcut formula, 1. Find the mean of the population data set. m =
N
we have included it on page 91.
2. Find the deviation of each entry. x - m
3. Square each deviation. 1x - m22
SSx = g 1x - m22
Sum of Squares of Starting Salaries 4. Add to get the sum of squares.
g 1x - m22
for Corporation A
s2 =
1x M22
Salary Deviation Squares 5. Divide by N to get the population variance.
N
x xM 6. Find the square root of the variance to get g 1x - m22
41 0.25 the population standard deviation. s =
-0.5
A N
38 -3.5 12.25
39 -2.5 6.25
45 3.5 12.25
47 5.5 30.25 EXAMPLE 3
41 -0.5 0.25
44 2.5 6.25 Finding the Population Standard Deviation
41 -0.5 0.25 Find the population variance and standard deviation of the starting salaries for
37 -4.5 20.25 Corporation A given in Example 1.
42 0.5 0.25 SOLUTION The table at the left summarizes the steps used to find SSx.
g = 0 SSx = 88.5 88.5
SSx = 88.5, N = 10, s2 = L 8.9, s = 28.85 L 3.0
10
So, the population variance is about 8.9, and the population standard deviation
is about 3.0, or $3000.
DEFINITION
Study Tip The sample variance and sample standard deviation of a sample data set
of n entries are listed below.
d the
hen you fin
Note that w
variance, yo
u g 1x - x 22
population n u m b e r of Sample variance = s2 =
, the n - 1
divide by N fi nd
g 1x - x22
when yo u
entries, but ce , yo u
varian Sample standard deviation = s = 2s2 =
the sample ss A n - 1
e b y n - 1, one le
divid r o f e ntries.
m b e
than the nu
GUIDELINES
Finding the Sample Variance and Standard Deviation
Symbols in Variance and Standard In Words In Symbols
Deviation Formulas
gx
1. Find the mean of the sample data set. x =
Population Sample n
Variance s2 s2 2. Find the deviation of each entry. x - x
3. Square each deviation. 1x - x 22
Standard
deviation
s s 4. Add to get the sum of squares. SSx = g 1x - x22
g 1x - x 22
Mean m x 5. Divide by n - 1 to get the sample variance. s2 =
n - 1
g 1x - x22
Number 6. Find the square root of the variance to get
N n
of entries the sample standard deviation. s =
A n - 1
Deviation x - m x - x
Sum of
squares g1x - m22 g1x - x22
EXAMPLE 4
Finding the Sample Standard Deviation
See MINITAB and TI-83 The starting salaries given in Example 1 are for the Chicago branches of
steps on pages 114 and 115. Corporations A and B. Each corporation has several other branches, and you
plan to use the starting salaries of the Chicago branches to estimate the starting
salaries for the larger populations. Find the sample standard deviation of the
starting salaries for the Chicago branch of Corporation A.
SOLUTION
88.5 88.5
SSx = 88.5, n = 10, s2 = L 9.8, s = L 3.1
9 A 9
So, the sample variance is about 9.8, and the sample standard deviation is about
3.1, or $3100.
Try It Yourself 4
Find the sample standard deviation of the starting salaries for the Chicago
branch of Corporation B.
a. Find the sum of squares, as you did in Try It Yourself 3.
b. Divide by n - 1 to get the sample variance.
c. Find the square root of the sample variance. Answer: Page A32
EXAMPLE 5
Office Rental Rates Using Technology to Find the Standard Deviation
35.00 33.50 37.00 Sample office rental rates (in dollars per square foot per year) for Miamis
23.75 26.50 31.25 central business district are shown in the table. Use a calculator or a computer
to find the mean rental rate and the sample standard deviation. (Adapted from
36.50 40.00 32.00
Cushman & Wakefield Inc.)
39.25 37.50 34.75
37.75 37.25 36.75 SOLUTION MINITAB, Excel, and the TI-83 each have features that
27.00 35.75 26.00 automatically calculate the mean and the standard deviation of data sets. Try
using this technology to find the mean and the standard deviation of the office
37.00 29.00 40.50
rental rates. From the displays, you can see that x L 33.73 and s L 5.09.
24.50 33.00 38.00
Descriptive Statistics
Variable N Mean Median TrMean StDev
Rental Rates 24 33.73 35.38 33.88 5.09
Variable SE Mean Minimum Maximum Q1 Q3
Rental Rates 1.04 23.75 40.50 29.56 37.44
Note to Instructor
The standard deviations reported by
A B 1-Var Stats
MINITAB and Excel represent sample
1 Mean 33.72917 x=33.72916667
standard deviations. The TI-83 also
2 Standard Error 1.038864 x=809.5
reports s, the population standard 3 Median 35.375
deviation. Ask students to compare x2=27899.5
4 Mode 37
the values of s and s shown from the 5 Standard Deviation 5.089373 Sx=5.089373342
same data. 6 Sample Variance 25.90172 x=4.982216639
7 Kurtosis -0.74282
8 Skewness -0.70345 n=24
9 Range 16.75
10 Minimum 23.75
11 Maximum 40.5 Sample Mean
12 Sum 809.5
13 Count 24 Sample Standard Deviation
Try It Yourself 5
Sample office rental rates (in dollars per square foot per year) for Seattles
central business district are listed. Use a calculator or a computer to find the
mean rental rate and the sample standard deviation. (Adapted from Cushman &
Wakefield Inc.)
Frequency
Frequency
Frequency
ep 6 6 6
must b 5
s=0
5 s 1.2 5 s 3.0
4 4 4
3 3 3
2 2 2
1 1 1
1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7 8 9
Data value Data value Data value
EXAMPLE 6
Estimating Standard Deviation
Without calculating, estimate the population standard deviation of each data set.
1. 8
2. 8
3. 8
7 N=8 7 N=8 7 N=8
= 4 = 4 = 4
Frequency
6 6 6
Frequency
Frequency
5 5 5
4 4 4
3 3 3
2 2 2
1 1 1
0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7
Data value Data value Data value
SOLUTION
1. Each of the eight entries is 4. So, each deviation is 0, which implies that
s = 0.
2. Each of the eight entries has a deviation of ;1. So, the population standard
deviation should be 1. By calculating, you can see that
s = 1.
3. Each of the eight entries has a deviation of ;1 or ;3. So, the population
standard deviation should be about 2. By calculating, you can see that
s L 2.24.
Try It Yourself 6
Write a data set that has 10 entries, a mean of 10, and a population standard
deviation that is approximately 3. (There are many correct answers.)
a. Write a data set that has five entries that are three units less than 10 and five
entries that are three units more than 10.
b. Calculate the population standard deviation to check that s is approxi-
mately 3. Answer: Page A32
12
(in percent)
10
8
6
4
Empirical Rule (or 68-95-99.7 Rule)
2 For data with a (symmetric) bell-shaped distribution, the standard
62 64 66 68 70 72 74 76 78 deviation has the following characteristics.
Height (in inches)
1. About 68% of the data lie within one standard deviation of the mean.
About what percent of the 2. About 95% of the data lie within two standard deviations of the mean.
heights lie within two
standard deviations 3. About 99.7% of the data lie within three standard deviations of
of the mean? the mean.
EXAMPLE 7
Insight Using the Empirical Rule
at lie more
Data values th In a survey conducted by the National Center for Health Statistics, the sample
dard devia-
than two stan mean height of women in the United States (ages 2029) was 64 inches, with a
e mean are
tions from th sample standard deviation of 2.75 inches. Estimate the percent of the women
un us ual. Data
considered whose heights are between 64 inches and 69.5 inches.
more than
values that lie
deviations
three standard SOLUTION The distribution of the womens heights is shown. Because the
n are very
from the mea distribution is bell shaped, you can use the Empirical Rule. The mean height is
unusual. 64, so when you add two standard deviations to the mean height, you get
x + 2s = 64 + 212.752 = 69.5.
Heights of Women in the U.S. Because 69.5 is two standard deviations above the mean height, the percent of
Ages 2029 the heights between 64 inches and 69.5 inches is 34% + 13.5% = 47.5%.
Interpretation So, 47.5% of women are between 64 and 69.5 inches tall.
Try It Yourself 7
34%
Estimate the percent of the heights that are between 61.25 and 64 inches.
13.5%
a. How many standard deviations is 61.25 to the left of 64?
b. Use the Empirical Rule to estimate the percent of the data between x - s
and x.
55.75 58.5 61.25 64 66.75 69.5 72.25
x 2s x x + 2s c. Interpret the result in the context of the data. Answer: Page A32
x 3s xs x+s x + 3s
EXAMPLE 8
Insight Using Chebychevs Theorem
ebychevs
In Example 8, Ch The age distributions for Alaska and Florida are shown in the histograms.
m tells yo u that at
Theore Decide which is which. Apply Chebychevs Theorem to the data for Florida
population
least 75% of the using k = 2. What can you conclude?
de r the age of
of Florida is un
a tru e statement,
88.8. This is
ly as strong
but it is not near 120 2500
= 31.6 = 39.2
ad in g the 100
made from re = 19.5 2000 = 24.8
histogram . 80
ychevs 1500
In general, Cheb 60
cautious
Theorem gives 1000
th e percent
estimates of 40
in k st an dard
lying with 20 500
io ns of th e m ean.
deviat
th eo rem
Remember, the
l di st rib ut ions. 5 15 25 35 45 55 65 75 85 5 15 25 35 45 55 65 75 85
applies to al Age (in years) Age (in years)
SOLUTION The histogram on the right shows Floridas age distribution. You can
tell because the population is greater and older. Moving two standard deviations to
the left of the mean puts you below 0, because m - 2s = 39.2 - 2124.82 = -10.4.
Moving two standard deviations to the right of the mean puts you at
m + 2s = 39.2 + 2124.82 = 88.8. By Chebychevs Theorem, you can say that at
least 75% of the population of Florida is between 0 and 88.8 years old.
Try It Yourself 8
Apply Chebychevs Theorem to the data for Alaska using k = 2.
a. Subtract two standard deviations from the mean.
b. Add two standard deviations to the mean.
c. Apply Chebychevs Theorem for k = 2 and interpret the results.
Answer: Page A32
g 1x - x22f
Sample standard deviation = s =
A n - 1
EXAMPLE 9
Number of Children Finding the Standard Deviation for Grouped Data
in 50 Households You collect a random sample of the number of children per household in a
region. The results are shown at the left. Find the sample mean and the sample
1 3 1 1 1
standard deviation of the data set.
1 2 2 1 0
1 1 0 0 0 SOLUTION These data could be treated as 50 individual entries, and you could
1 5 0 3 6 use the formulas for mean and standard deviation. Because there are so many
repeated numbers, however, it is easier to use a frequency distribution.
3 0 3 1 1
1x x22 1x x 22 f
1 1 6 0 1
3 6 6 1 2 x f xf x x
2 3 0 1 1 0 10 0 -1.8 3.24 32.40
4 1 1 2 2 1 19 19 -0.8 0.64 12.16
0 3 0 2 4 2 7 14 0.2 0.04 0.28
3 7 21 1.2 1.44 10.08
4 2 8 2.2 4.84 9.68
5 1 5 3.2 10.24 10.24
6 4 24 4.2 17.64 70.56
g = 50 g = 91 g = 145.40
g xf 91
x = = L 1.8 Sample mean
n 50
Use the sum of squares to find the sample standard deviation.
Study Tip s =
g 1x - x22f
A n - 1
=
145.4
A 49
L 1.7 Sample standard deviation
las for
that formu
Remember ire yo u to So, the sample mean is 1.8 children, and the standard deviation is 1.7 children.
a requ
grouped dat frequencies.
the
multiply by
Try It Yourself 9
Change three of the 6s in the data set to 4s. How does this change affect the
sample mean and sample standard deviation?
a. Write the first three columns of a frequency distribution.
b. Find the sample mean.
c. Complete the last three columns of the frequency distribution.
d. Find the sample standard deviation. Answer: Page A32
When a frequency distribution has classes, you can estimate the sample
mean and standard deviation by using the midpoint of each class.
EXAMPLE 10
Using Midpoints of Classes
The circle graph at the right shows
the results of a survey in which
1000 adults were asked how much
they spend in preparation for
personal travel each year. Make a
frequency distribution for the
data. Then use the table to
estimate the sample mean and the
sample standard deviation of the
data set. (Adapted from Travel
Industry Association of America)
Class x f xf x x 1x x22 1x x 22 f
099 49.5 380 18,810 - 142.5 20,306.25 7,716,375.0
100199 149.5 230 34,385 - 42.5 1,806.25 415,437.5
200299 249.5 210 52,395 57.5 3,306.25 694,312.5
300399 349.5 50 17,475 157.5 24,806.25 1,240,312.5
400499 449.5 60 26,970 257.5 66,306.25 3,978,375.0
500+ 599.5 70 41,965 407.5 166,056.25 11,623,937.5
g = 1,000 g = 192,000 g = 25,668,750.0
Study Tip
g xf 192,000
x = = = 192 Sample mean
n 1,000
s is open, as
When a clas st Use the sum of squares to find the sample standard deviation.
ass, you mu
in the last cl
g 1x - x22f
gle va lue to
assign a sin For 25,668,750
sent th e midpoint. s = = L 160.3 Sample standard deviation
rep re
le, we se lecte d A n - 1 A 999
this examp
599.5. So, the sample mean is $192 per year, and the sample standard deviation is
about $160.3 per year.
Try It Yourself 10
In the frequency distribution, 599.5 was chosen to represent the class of $500 or
more. How would the sample mean and standard deviation change if you used
650 to represent this class?
a. Write the first four columns of a frequency distribution.
b. Find the sample mean.
c. Complete the last three columns of the frequency distribution.
d. Find the sample standard deviation. Answer: Page A32
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short
84 CHAPTER 2 Descriptive Statistics
Exercises
2.4
Building Basic Skills and Vocabulary
In Exercises 1 and 2, find the range, mean, variance, and standard deviation of the
Help population data set.
1. 11 10 8 4 6 7 11 6 11 7
2. 13 23 15 13 18 13 15
14 20 20 18 17 20 13
Student
Study Pack In Exercises 3 and 4, find the range, mean, variance, and standard deviation of the
sample data set.
3. 15 8 12 5 19 14 8 6 13
1. Range = 7, mean = 8.1, 4. 24 26 27 23 9 14 8
variance L 5.7,
8 26 15 15 27 11
standard deviation L 2.4
2. Range = 10
Mean L 16.6 Graphical Reasoning In Exercises 5 and 6, find the range of the data set
represented by the display or graph.
Variance L 10.2
Standard deviation L 3.2 5. 2 39 Key: 2 3 = 23
3. Range = 14, mean L 11.1, 3 002367
variance L 21.6, 4 012338
standard deviation L 4.6 5 0119
4. Range = 19 6 1299
Mean L 17.9 7 59
Variance L 59.6 8 48
Standard deviation L 7.7 9 0256
5. 73 6. 10
7. The range is the difference 6. Brides Age at First Marriage
between the maximum and
minimum values of a data set. The 8
advantage of the range is that it is
Frequency
6
easy to calculate. The disadvantage
is that it uses only two entries from 4
the data set.
8. A deviation 1x - m2 is the
2
11. (a) Range = 25.1 11. Marriage Ages The ages of 10 grooms at their first marriage are given below.
(b) Range = 45.1
24.3 46.6 41.6 32.9 26.8 39.8 21.5 45.7 33.9 35.1
(c) Changing the maximum value
of the data set greatly affects (a) Find the range of the data set.
the range. (b) Change 46.6 to 66.6 and find the range of the new data set.
12. 53 , 3 , 3 , 7 , 7 , 76 (c) Compare your answer to part (a) with your answer to part (b).
13. (a) has a standard deviation of 24
and (b) has a standard deviation of 12. Find a population data set that contains six entries, has a mean of 5, and has
16, because the data in (a) have a standard deviation of 2.
more variability.
14. (a) has a standard deviation of 2.4
and (b) has a standard deviation of
5 because the data in (b) have
Using and Interpreting Concepts
more variability. 13. Graphical Reasoning Both data sets have a mean of 165. One has a standard
15. When calculating the population deviation of 16, and the other has a standard deviation of 24. Which is
standard deviation, you divide the which? Explain your reasoning.
sum of the squared deviations by
n, then take the square root of that
(a) 12 89 Key: 12 8 = 128 (b) 12
value. When calculating the sample 13 558 13 1
standard deviation, you divide the 14 12 14 235
sum of the squared deviations by 15 0067 15 04568
n - 1, then take the square root of
that value. 16 459 16 112333
16. When given a data set, one would 17 1368 17 1588
have to determine if it represented 18 089 18 2345
the population or was a sample 19 6 19 02
taken from the population. If the
20 357 20
data are a population, then s is
calculated. If the data are a sample, 14. Graphical Reasoning Both data sets represented below have a mean of 50.
then s is calculated. One has a standard deviation of 2.4, and the other has a standard deviation
17. Company B of 5. Which is which? Explain your reasoning.
18. Player B (a) (b)
20 20
15 15
Frequency
Frequency
10 10
5 5
42 45 48 51 54 57 60 42 45 48 51 54 57 60
Data value Data value
19. (a) Los Angeles: 17.6, 37.35, 6.11 Comparing Two Data Sets In Exercises 1922, you are asked to compare two data
Long Beach: 8.7, 8.71, 2.95 sets and interpret the results.
(b) It appears from the data that
19. Annual Salaries Sample annual salaries (in thousands of dollars) for municipal
the annual salaries in Los
employees in Los Angeles and Long Beach are listed.
Angeles are more variable than
the salaries in Long Beach. Los Angeles: 20.2 26.1 20.9 32.1 35.9 23.0 28.2 31.6 18.3
20. (a) Dallas: 18.1, 37.33, 6.11 Long Beach: 20.9 18.2 20.8 21.1 26.5 26.9 24.2 25.1 22.2
Houston: 13, 12.26, 3.50 (a) Find the range, variance, and standard deviation of each data set.
(b) It appears from the data that
(b) Interpret the results in the context of the real-life setting.
the annual salaries in Dallas are
more variable than the salaries 20. Annual Salaries Sample annual salaries (in thousands of dollars) for municipal
in Houston. employees in Dallas and Houston are listed.
21. (a) Males: 405; 16,225.3; 127.4
Dallas: 34.9 25.7 17.3 16.8 26.8 24.7 29.4 32.7 25.5
Females: 552; 34,575.1; 185.9 Houston: 25.6 23.2 26.7 27.7 25.4 26.4 18.3 26.1 31.3
(b) It appears from the data that
the SAT scores for females are (a) Find the range, variance, and standard deviation of each data set.
more variable than the SAT (b) Interpret the results in the context of the real-life setting.
scores for males.
21. SAT Scores Sample SAT scores for eight males and eight females are listed.
22. (a) Public teachers: 5.1, 2.95, 1.72
Private teachers: 4.2, 1.99, 1.41 Male SAT scores: 1059 1328 1175 1123 923 1017 1214 1042
(b) It appears from the data that Female SAT scores: 1226 965 841 1053 1056 1393 1312 1222
the annual salaries for public (a) Find the range, variance, and standard deviation of each data set.
teachers are more variable than
the salaries for private teachers. (b) Interpret the results in the context of the real-life setting.
23. (a) Greatest sample standard 22. Annual Salaries Sample annual salaries (in thousands of dollars) for public
deviation: (ii) and private elementary school teachers are listed.
Data set (ii) has more entries
Public teachers: 38.6 38.1 38.7 36.8 34.8 35.9 39.9 36.2
that are farther away from the
Private teachers: 21.8 18.4 20.3 17.6 19.7 18.3 19.4 20.8
mean.
Least sample standard (a) Find the range, variance, and standard deviation of each data set.
deviation: (iii) (b) Interpret the results in the context of the real-life setting.
Data set (iii) has more entries
that are close to the mean.
Reasoning with Graphs In Exercises 2326, you are asked to compare three
(b) The three data sets have the
data sets.
same mean but have different
standard deviations. 23. (a) Without calculating, which data set has the greatest sample standard
deviation? Which has the least sample standard deviation? Explain
your reasoning.
(i) (ii) (iii)
6 6 6
5 5 5
Frequency
Frequency
Frequency
4 4 4
3 3 3
2 2 2
1 1 1
4 5 6 7 8 9 10 4 5 6 7 8 9 10 4 5 6 7 8 9 10
Data value Data value Data value
(b) How are the data sets the same? How do they differ?
24. (a) Greatest sample standard 24. (a) Without calculating, which data set has the greatest sample standard
deviation: (i) deviation? Which has the least sample standard deviation? Explain
Data set (i) has more entries your reasoning.
that are farther away from the
mean. (i) 0 9 (ii) 0 9 (iii) 0
Least sample standard 1 58 1 5 1 5
deviation: (iii) 2 3377 2 333777 2 33337777
Data set (iii) has more entries 3 25 3 5 3 5
that are close to the mean. 4 1 4 1 4
(b) The three data sets have the
Key: 4 1 = 41 Key: 4 1 = 41 Key: 4 1 = 41
same mean, median, and mode
but have different standard (b) How are the data sets the same? How do they differ?
deviations.
25. (a) Greatest sample standard 25. (a) Without calculating, which data set has the greatest sample standard
deviation: (ii) deviation? Which has the least sample standard deviation? Explain
Data set (ii) has more entries your reasoning.
that are farther away from the (i) (ii) (iii)
mean.
Least sample standard
deviation: (iii)
Data set (iii) has more entries
that are close to the mean.
(b) The three data sets have the
same mean, median, and mode 10 11 12 13 14 10 11 12 13 14 10 11 12 13 14
but have different standard
deviations. (b) How are the data sets the same? How do they differ?
26. (a) Greatest sample standard
26. (a) Without calculating, which data set has the greatest sample standard
deviation: (iii)
deviation? Which has the least sample standard deviation? Explain
Data set (iii) has more entries your reasoning.
that are farther away from the
mean. (i) (ii) (iii)
Least sample standard
deviation: (i)
Data set (i) has more entries
that are close to the mean.
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8
(b) The three data sets have the
same mean and median but
have different modes and (b) How are the data sets the same? How do they differ?
standard deviations.
27. Writing Discuss the similarities and the differences between the Empirical
27. Similarity: Both estimate
proportions of the data contained Rule and Chebychevs Theorem.
within k standard deviations of 28. Writing What must you know about a data set before you can use the
the mean. Empirical Rule?
Difference: The Empirical Rule
assumes the distribution is bell
shaped; Chebychevs Theorem Using the Empirical Rule In Exercises 2934, you are asked to use the Empirical Rule.
makes no such assumption.
29. The mean value of land and buildings per acre from a sample of farms is
28. You must know that the $1000, with a standard deviation of $200. The data set has a bell-shaped
distribution is bell shaped. distribution. Estimate the percent of farms whose land and building values
29. 68% per acre are between $800 and $1200.
30. Between $500 and $1900 30. The mean value of land and buildings per acre from a sample of farms is
31. (a) 51 (b) 17 $1200, with a standard deviation of $350. Between what two values do about
32. (a) 38 (b) 19 95% of the data lie? (Assume the data set has a bell-shaped distribution.)
33. $1250, $1375, $1450, $550 31. Using the sample statistics from Exercise 29, do the following. (Assume the
34. $1950, $475, $2050 number of farms in the sample is 75.)
35. 24 (a) Use the Empirical Rule to estimate the number of farms whose land
36. 148.07, 56.672; so, at least 75% of and building values per acre are between $800 and $1200.
the 400-meter dash times lie
(b) If 25 additional farms were sampled, about how many of these farms
between 48.07 and 56.67 seconds.
would you expect to have land and building values between $800 per
37. Sample mean L 2.1 acre and $1200 per acre?
Sample standard deviation L 1.3
32. Using the sample statistics from Exercise 30, do the following. (Assume the
number of farms in the sample is 40.)
(a) Use the Empirical Rule to estimate the number of farms whose land
and building values per acre are between $500 and $1900.
(b) If 20 additional farms were sampled, about how many of these farms
would you expect to have land and building values between $500 per
acre and $1900 per acre?
33. Using the sample statistics from Exercise 29 and the Empirical Rule,
determine which of the following farms, whose land and building values
per acre are given, are outliers (more than two standard deviations from
the mean).
$1250, $1375, $1125, $1450, $550, $800
34. Using the sample statistics from Exercise 30 and the Empirical Rule,
determine which of the following farms, whose land and building values
per acre are given, are outliers (more than two standard deviations from
the mean).
$1875, $1950, $475, $600, $2050, $1600
35. Chebychevs Theorem Old Faithful is a famous geyser at Yellowstone National
Park. From a sample with n = 32, the mean duration of Old Faithfuls
eruptions is 3.32 minutes and the standard deviation is 1.09 minutes. Using
Chebychevs Theorem, determine at least how many of the eruptions lasted
between 1.14 minutes and 5.5 minutes. (Source: Yellowstone National Park)
36. Chebychevs Theorem The mean time in a womens 400-meter dash is 52.37
seconds, with a standard deviation of 2.15. Apply Chebychevs Theorem to
the data using k = 2. Interpret the results.
Calculating Using Grouped Data In Exercises 37 44, use the grouped data
formulas to find the indicated mean and standard deviation.
37. Pets per Household The results of a
Number of households
12 11
random sample of the number of 10
10
pets per household in a region are
8 7 7
shown in the histogram. Estimate
6 5
the sample mean and the sample
4
standard deviation of the data set.
2
0 1 2 3 4
Number of pets
38. Sample mean L 1.7 38. Cars per Household A random sample of households in a region and the
Sample deviation L 0.8 number of cars per household are shown in the histogram. Estimate the
39. See Odd Answers, page A## sample mean and the sample deviation of the data set.
40. See Selected Answers, page A##
41. See Odd Answers, page A## 24
Number of households
25
42. See Selected Answers, page A## 20
15
15
10 8
5 3
0 1 2 3
Number of cars
39. Football Wins The number of wins for each National Football League team
DATA in 2003 are listed. Make a frequency distribution (using five classes) for the
data set. Then approximate the population mean and the population
standard deviation of the data set. (Source: National Football League)
14 10 6 6 10 8 6 5 12 12 5
5 13 10 4 4 12 10 5 4 10 9
7 5 11 8 7 5 12 10 7 4
40. Water Consumption The number of gallons of water consumed per day by a
DATA small village are listed. Make a frequency distribution (using five classes)
for the data set. Then approximate the population mean and the population
standard deviation of the data set.
167 180 192 173 145 151 174
175 178 160 195 224 244 146
162 146 177 163 149 188
41. Amount of Caffeine The amount of caffeine in a sample of five-ounce servings
of brewed coffee is shown in the histogram. Make a frequency distribution
for the data. Then use the table to estimate the sample mean and the sample
standard deviation of the data set.
14
Number of 5-ounce servings
30 13
25 12
Number responding
25
10 9
20
8
15
12 6 5
10
10
4
5 2
2 2 1
1
70.5 92.5 114.5 136.5 158.5 0 1 2 3 4
Caffeine (in milligrams) Number of supermarket trips
Figure for Exercise 41 Figure for Exercise 42
42. Supermarket Trips Thirty people were randomly selected and asked how
many trips to the supermarket they made in the past week. The responses
are shown in the histogram. Make a frequency distribution for the data.
Then use the table to estimate the sample mean and the sample standard
deviation of the data set.
43. See Odd Answers, page A## 43. U.S. Population The estimated distribution (in millions) of the U.S.
44. See Selected Answers, page A## population by age for the year 2009 is shown in the circle graph. Make a
3.44 # frequency distribution for the data. Then use the table to estimate the
45. CVheights = 100 L 4.73 sample mean and the sample standard deviation of the data set. Use 70 as
72.75
18.47 # the midpoint for 65 years and over. (Source: U.S. Census Bureau)
CVweights = 100 L 9.83
187.83
65 years and over
It appears that weight is more Under 21
18.5
17.8
19.9
16.6
years 513
16.3
15
78.3 years
14.0
35.2 12
12.4
12.1
11.9
9
16.9 1417 6
6.3
years
1.3
40.0 29.8 3
35 44
years 38.3
1824
5 15 25 35 45 55 65 75 85 95
2534 years years Age (in years)
44. Japans Population Japans estimated population for the year 2010 is shown
in the bar graph. Make a frequency distribution for the data. Then use the
table to estimate the sample mean and the sample standard deviation of the
data set. (Source: U.S. Census Bureau, International Data Base)
Extending Concepts
45. Coefficient of Variation The coefficient of variation CV describes the
DATA standard deviation as a percent of the mean. Because it has no units, you
can use the coefficient of variation to compare data with different units.
Standard deviation
CV = * 100%
Mean
The following table shows the heights (in inches) and weights (in pounds)
of the members of a basketball team. Find the coefficient of variation for
each data set. What can you conclude?
Heights Weights
72 180
74 168
68 225
76 201
74 189
69 192
72 197
79 162
70 174
69 171
77 185
73 210
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short
SECTION 2.4 Measures of Variation 91
46. (a) Male: 127.4 46. Shortcut Formula You used SSx = g 1x - x22 when calculating variance
Female: 185.9 and standard deviation. An alternative formula that is sometimes more
47. (a) x = 550, s L 302.8 convenient for hand calculations is
1g x22
(b) x = 5500, s L 3028
(c) x = 55, s L 30.28 SSx = g x2 - .
n
(d) When each entry is multiplied
by a constant k, the new You can find the sample variance by dividing the sum of squares by n - 1
sample mean is k # x , and the
and the sample standard deviation by finding the square root of the
new sample standard deviation
sample variance.
is k # s.
48. (a) x = 550, s L 302.8 (a) Use the shortcut formula to calculate the sample standard deviation for
(b) x = 560, s L 302.8 the data set given in Exercise 21.
(c) x = 540, s L 302.8 (b) Compare your results with those obtained in Exercise 21.
(d) Adding or subtracting a
constant k to each entry makes 47. Team Project: Scaling Data Consider the following sample data set.
the new sample mean x + k
100 200 300 400 500
with the sample standard
deviation being unaffected. 600 700 800 900 1000
49. 10 (a) Find x and s.
1
Set 1 - 2 = 0.99 and solve for k. (b) Multiply each entry by 10. Find x and s for the revised data.
k
50. (a) P L -2.61
(c) Divide the original data by 10. Find x and s for the revised data.
The data are skewed left. (d) What can you conclude from the results of (a), (b), and (c)?
(b) P L 4.12
48. Team Project: Shifting Data Consider the following sample data set.
The data are skewed right.
100 200 300 400 500
600 700 800 900 1000
49. Chebychevs Theorem At least 99% of the data in any data set lie within
how many standard deviations of the mean? Explain how you obtained
your answer.
50. Pearsons Index of Skewness The English statistician Karl Pearson (18571936)
introduced a formula for the skewness of a distribution.
31x - median2
P = Pearsons index of skewness
s
Exercises
Exercises
1. Mean Price Estimate the mean price of a pair of 4. Standard Deviation Estimate the standard deviation
sunglasses sold at (a) an optical store, (b) a sunglass for the number of pairs of sunglasses sold at
specialty store, and (c) a department store. Use $200 (a) optical stores, (b) sunglass specialty stores, and
as the midpoint for $151+. (c) department stores.
2. Revenue Which type of outlet had the greatest total 5. Standard Deviation Of the 13 distributions, which has
revenue? Explain your reasoning. the greatest standard deviation? Explain your
reasoning.
3. Revenue Which type of outlet had the greatest
revenue per location? Explain your reasoning. 6. Bell-Shaped Distribution Of the 13 distributions, which
is more bell shaped? Explain.
Measures of Position
2.5 Quartiles Percentiles and Other Fractiles The Standard Score
What You
Should Learn
How to find the first, second,
and third quartiles of a data
set
Quartiles
How to find the interquartile In this section, you will learn how to use fractiles to specify the position of a
range of a data set data entry within a data set. Fractiles are numbers that partition, or divide, an
How to represent a data set
ordered data set into equal parts. For instance, the median is a fractile because
graphically using a box-and- it divides an ordered data set into two equal parts.
whisker plot
How to interpret other
fractiles such as percentiles DEFINITION
How to find and interpret the The three quartiles, Q1, Q2, and Q3, approximately divide an ordered data
standard score (z-score) set into four equal parts. About one quarter of the data fall on or below
the first quartile Q1. About one half the data fall on or below the second
quartile Q2 (the second quartile is the same as the median of the data set).
About three quarters of the data fall on or below the third quartile Q3 .
EXAMPLE 1
Finding the Quartiles of a Data Set
The test scores of 15 employees enrolled in a CPR training course are listed.
Find the first, second, and third quartiles of the test scores.
13 9 18 15 14 21 7 10 11 20 5 18 37 16 17
SOLUTION First, order the data set and find the median Q2. Once you find Q2,
divide the data set into two halves. The first and third quartiles are the medians
of the lower and upper halves of the data set.
5 7 9 10 11 13 14 15 16 17 18 18 20 21 37
Q1 Q2 Q3
Try It Yourself 1
Find the first, second, and third quartiles for the ages of the Akhiok residents
using the population data set listed in the Chapter Opener on page 33.
a. Order the data set.
b. Find the median Q2.
c. Find the first and third quartiles Q1 and Q3. Answer: Page A33
EXAMPLE 2
Using Technology to Find Quartiles
The tuition costs (in thousands of dollars) for 25 liberal arts colleges are listed.
Use a calculator or a computer to find the first, second, and third quartiles.
23 25 30 23 20 22 21 15 25 24 30 25 30
20 23 29 20 19 22 23 29 23 28 22 28
SOLUTION MINITAB, Excel, and the TI-83 each have features that
automatically calculate quartiles. Try using this technology to find the first,
second, and third quartiles of the tuition data. From the displays, you can see
that Q1 = 21.5, Q2 = 23, and Q3 = 28.
Try It Yourself 2
The tuition costs (in thousands of dollars) for 25 universities are listed. Use a
calculator or a computer to find the first, second, and third quartiles.
20 26 28 25 31 14 23 15 12 26 29 24 31
19 31 17 15 17 20 31 32 16 21 22 28
a. Enter the data.
b. Calculate the first, second, and third quartiles.
c. What can you conclude? Answer: Page A33
After finding the quartiles of a data set, you can find the interquartile range.
Insight
The IQR is a
measure of DEFINITION
gives you
variation that e The interquartile range (IQR) of a data set is the difference between the
ow much th
an idea of h e d at a third and first quartiles.
of th
middle 50%
ries. It ca n also be used Interquartile range (IQR2 = Q3 - Q1
va data
utliers. Any
to identify o re th an
sm o
value that lie
IQ R s to th e left of Q1
1.5
g h t of Q3 is an
or to the ri an
stance, 37 is
outlier. For in te st sc ores EXAMPLE 3
e 15
outlier of th
1.
in Example Finding the Interquartile Range
Find the interquartile range of the 15 test scores given in Example 1. What can
you conclude from the result?
SOLUTION From Example 1, you know that Q1 = 10 and Q3 = 18. So, the
interquartile range is
IQR = Q3 - Q1
= 18 - 10
= 8.
Interpretation The test scores in the middle portion of the data set vary by at
most 8 points.
Try It Yourself 3
Find the interquartile range for the ages of the Akhiok residents listed in the
Chapter Opener on page 33.
a. Find the first and third quartiles, Q1 and Q3 .
b. Subtract Q1 from Q3 .
c. Interpret the result in the context of the data.
Answer: Page A33
40 50 60 70 Minimum Maximum
entry Q1 Median, Q 2 Q3 entry
How many U.S. presidents
ages are represented
by the box?
EXAMPLE 4 See MINITAB and TI-83 steps
Drawing a Box-and-Whisker Plot on pages 114 and 115.
SOLUTION The five-number summary of the test scores is below. Using these
five numbers, you can construct the box-and-whisker plot shown.
Try It Yourself 4
Draw a box-and-whisker plot that represents the ages of the residents of
Akhiok listed in the chapter opener on page 33.
a. Find the five-number summary of the data set.
b. Construct a horizontal scale and plot the five numbers above it.
c. Draw the box, the vertical line, and the whiskers.
d. Make some conclusions. Answer: Page A33
Study Tip
how one individual compares with others in a group. They can also be used to
identify unusually high or unusually low values. For instance, test scores and
childrens growth measurements are often expressed in percentiles. Scores or
you
It is important that measurements in the 95th percentile and above are unusually high, while those
wh at a percentile
understand in the 5th percentile and below are unusually low.
e, if the
means. For instanc
th-old
weight of a six-mon
th pe rcentile,
infant is at the 78
the infant we igh s m or e than EXAMPLE 5
th-old
78% of all six-mon
infants. It does no
t mean that Interpreting Percentiles 100
78% of
SAT Scores
the infant weigh s 90
t. The ogive represents the cumulative 80
some idea l we igh
frequency distribution for SAT test 70
Percentile
scores of college-bound students in a 60
recent year. What test score represents 50
40
the 64th percentile? How should you
30
interpret this? (Source: College Board 20
Online) 10
60
50
Ages of Residents of Akhiok 40
30
95 20
10
85
Interpretation This means that 64%
75
of the students had an SAT score of 200 400 600 800 1000 12001400 1600
65 Score
1100 or less.
Percentile
55
45
Try It Yourself 5
35
25 The ages of the residents of Akhiok are represented in the cumulative
15 frequency graph at the left. At what percentile is a resident whose age is 45?
5
a. Use the graph to find the percentile that corresponds to the given age.
5 10 15 20 25 30 35 40 45 50 55 60 65 70 b. Interpret the results in the context of the data. Answer: Page A33
Ages
DEFINITION
The standard score, or z-score, represents the number of standard
deviations a given value x falls from the mean m. To find the z-score for a
given value, use the following formula.
Value - Mean x - m
z = =
Standard deviation s
EXAMPLE 6
Finding z-Scores
The mean speed of vehicles along a stretch of highway is 56 miles per hour with
a standard deviation of 4 miles per hour. You measure the speed of three cars
traveling along this stretch of highway as 62 miles per hour, 47 miles per hour,
and 56 miles per hour. Find the z-score that corresponds to each speed. What can
you conclude?
Try It Yourself 6
The monthly utility bills in a city have a mean of $70 and a standard deviation
of $8. Find the z-scores that correspond to utility bills of $60, $71, and $92. What
In Example 6, you used z-scores to compare data values within the same
data set. You can also use z-scores to compare data values from different
data sets.
EXAMPLE 7
Jacksonville 5 11 0 .312 276 331
Houston 5 11 0 .312 255 380 Comparing z-Scores from Different Data Sets
West
yz-Kansas City
W
13
L
3
T
0
Pct
.812
PF
484
PA
332
During the 2003 regular season the Kansas City Chiefs, one of 32 teams in the
x-Denver 10 6 0 .625 381 301 National Football League (NFL), scored 63 touchdowns. During the 2003
Oakland 4 12 0 .250 270 379
San Diego 4 12 0 .250 313 441 regular season the Tampa Bay Storm, one of 16 teams in the Arena Football
NATIONAL CONFERENCE League (AFL), scored 119 touchdowns. The mean number of touchdowns in the
East NFL is 37.4, with a standard deviation of 9.3. The mean number of touchdowns
W L T Pct PF PA
yz-Philadelphia 12 4 0 .750 374 287 in the AFL is 111.7, with a standard deviation of 17.3. Find the z-score that
x-Dallas
Washington
10
5
6
11
0
0
.625
.312
289
287
260
372
corresponds to the number of touchdowns for each team. Then compare your
N.Y. Giants 4 12 0 .250 243 387 results. (Source: The National Football League and the Arena Football League)
SOLUTION
NATIONAL CONFERENCE The z-score that corresponds to the number of touchdowns for each team is
EASTERN DIVISION calculated below.
Team Won Lost Tie Pct PF PA
x-New York 8 8 0 .500 857 825
y-Detroit 8 8 0 .500 799 819
y-Las Vegas
Buffalo
8
5
8
11
0
0
.500
.313
756
554
821
751
Kansas City Chiefs Tampa Bay Storm
SOUTHERN DIVISION
x - m x - m
z = z =
Team Won Lost Tie Pct PF PA s s
x-Tampa Bay 12 4 0 .750 849 689
y-Orlando
y-Georgia
12
8
4
8
0
0
.750 805
.500 731
670
701
63 - 37.4 119 - 111.7
= =
Carolina 0 16 0 .000 553 886 9.3 17.3
y--clinched playoff berth, x--clinched division title
L 2.8 L 0.4
Try It Yourself 7
During the 2003 regular season the Kansas City Chiefs scored 16 field goals.
During the 2003 regular season the Tampa Bay Storm scored 12 field goals. The
mean number of field goals in the NFL is 23.6, with a standard deviation of 6.0.
The mean number of field goals in the AFL is 11.7, with a standard deviation of
4.6. Find the z-score that corresponds to the number of field goals for each
team. Then compare your results. (Source: The National Football League and the
Arena Football League)
Exercises
2.5
Building Basic Skills and Vocabulary
In Exercises 1 and 2, (a) find the three quartiles and (b) draw a box-and-whisker plot
Help of the data.
1. 4 7 7 5 2 9 7 6 8 5 8 4 1 5 2 8 7 6 6 9
DATA
2. 2 7 1 3 1 2 8 9 9 2 5 4 7 3 7 5 4 7
DATA 2 3 5 9 5 6 3 9 3 4 9 8 8 2 3 9 5
Student
Study Pack 3. The points scored per game by a basketball team represent the third
quartile for all teams in a league. What can you conclude about the teams
points scored per game?
4. A salesperson at a company sold $6,903,435 of hardware equipment last
1. (a) Q1 = 4.5, Q2 = 6, Q3 = 7.5
year, a figure that represented the eighth decile of sales performance at the
(b)
company. What can you conclude about the salespersons performance?
1 4.5 6 7.5 9 5. A students score on the ACT placement test for college algebra is in the
63rd percentile. What can you conclude about the students test score?
0 1 2 3 4 5 6 7 8 9
6. A doctor tells a childs parents that their childs height is in the 87th
2. (a) Q1 = 3, Q2 = 5, Q3 = 8 percentile for the childs age group. What can you conclude about the
(b) childs height?
1 3 5 8 9
True or False? In Exercises 710, determine whether the statement is true or false.
0 1 2 3 4 5 6 7 8 9
If it is false, rewrite it as a true statement.
3. The basketball team scored more 7. The second quartile is the median of an ordered data set.
points per game than 75% of the
teams in the league. 8. The five numbers you need to graph a box-and-whisker plot are the
4. The salesperson sold more minimum, the maximum, Q1, Q3, and the mean.
hardware equipment than 9. The 50th percentile is equivalent to Q1.
80% of the other salespeople.
5. The student scored above 63% 10. It is impossible to have a negative z-score.
of the students who took the
ACT placement test.
6. The child is taller than 87% of Using and Interpreting Concepts
the other children in the same
age group. Graphical Analysis In Exercises 1116, use the box-and-whisker plot to identify
7. True (a) the minimum entry. (d) the second quartile.
8. False. The five numbers you need (b) the maximum entry. (e) the third quartile.
to graph a box-and-whisker plot
are the minimum, the maximum, (c) the first quartile. (f ) the interquartile range.
Q1, Q3, and the median.
11. 12.
9. False. The 50th percentile is 10 13 15 17 20 100 130 205 270 320
equivalent to Q2.
10. False. The only way to have a 10 11 12 13 14 15 16 17 18 19 20 21 100 150 200 250 300
negative z-score is if the value
is less than the mean.
13. 14.
11. (a) Min = 10 (b) Max = 20
900 1250 1500 1950 2100 25 50 65 70 85
(c) Q1 = 13 (d) Q2 = 15
(e) Q3 = 17 (f ) IQR = 4 900 25 30 35 40 45 50 55 60 65 70 75 80 85
1500 2000
0 2 4.5 6.5 10 3 9 2 1 7 5 3 2 2 6
4 0 10 0 3 5 7 8 6 5
0 2 4 6 8 10
Number of days 21. Butterfly Wingspans The lengths (in inches) of a sample of 22 butterfly
DATA wingspans
21. (a) Q1 = 3.2, Q2 = 3.65, Q3 = 3.9
(b) Butterfly Wingspans 3.2 3.1 2.9 4.6 3.7 3.8 4.0 3.0
2.8 3.3 3.6 3.9 3.7 3.9 4.1 2.9
3.2 3.8 3.9 3.5 3.7 3.3
2.8 3.2 3.65 3.9 4.6
22. Hourly Earnings The hourly earnings (in dollars) of a sample of 25 railroad
2 3 4 5
DATA equipment manufacturers
Wingspan (in inches)
15.60 18.75 14.60 15.80 14.35 13.90 17.50 17.55 13.80
22. See Selected Answers, page A## 14.20 19.05 15.35 15.20 19.45 15.95 16.50 16.30 15.25
15.05 19.10 15.20 16.22 17.75 18.40 15.25
23. (a) 5 23. TV Viewing Refer to the data set given in Exercise 19 and the box-and-
(b) 50% whisker plot you drew that represents the data set.
(c) 25% (a) About 75% of the people watched no more than how many hours of
24. (a) $17.65 television per day?
(b) 50% (b) What percent of the people watched more than 4 hours of television
(c) 50% per day?
25. A : z = -1.43 (c) If you randomly selected one person from the sample, what is the
B:z = 0 likelihood that the person watched less than 2 hours of television per
C : z = 2.14 day? Write your answer as a percent.
A z-score of 2.14 would be unusual. 24. Manufacturer Earnings Refer to the data set given in Exercise 22 and the
26. B : z = 0.77 box-and-whisker plot you drew that represents the data set.
C : z = 1.54 (a) About 75% of the manufacturers made less than what amount per hour?
A : z = -1.54
(b) What percent of the manufacturers made more than $15.80 per hour?
None of the z-scores are unusual.
(c) If you randomly selected one manufacturer from the sample, what is
73 - 63 the likelihood that the manufacturer made less than $15.80 per hour?
27. (a) Statistics: z = L 1.43
7 Write your answer as a percent.
26 - 23
Biology: z = L 0.77
3.9
(b) The student did better on the
Graphical Analysis In Exercises 25 and 26, the midpoints A, B, and C are marked on
statistics test. the histogram. Match them to the indicated z-scores. Which z-scores, if any, would
be considered unusual?
60 - 63
28. (a) Statistics: z =
7 25. z = 0 26. z = 0.77
L -0.43 z = 2.14 z = 1.54
20 - 23
Biology: z = z = -1.43 z = -1.54
3.9
L -0.77 Statistics Test Scores Biology Test Scores
(b) The student did better on the
16 16
statistics test. 14 14
78 - 63 12 12
Number
Number
34 ,000 - 35,000 31. Life Span of Tires A certain brand of automobile tire has a mean life span of
31. (a) z1 =
2250 35,000 miles and a standard deviation of 2250 miles. (Assume the life spans
L -0.44 of the tires have a bell-shaped distribution.)
37,000 - 35,000 (a) The life spans of three randomly selected tires are 34,000 miles,
z2 = L 0.89
2250 37,000 miles, and 31,000 miles. Find the z-score that corresponds to
31,000 - 35,000 each life span. According to the z-scores, would the life spans of any of
z3 =
2250 these tires be considered unusual?
L -1.78 (b) The life spans of three randomly selected tires are 30,500 miles,
None of the selected tires have 37,250 miles, and 35,000 miles. Using the Empirical Rule, find the
unusual life spans. percentile that corresponds to each life span.
(b) For 30,500, 2.5th percentile
32. Life Span of Fruit Flies The life spans of a species of fruit fly have a bell-shaped
For 37,250, 84th percentile distribution, with a mean of 33 days and a standard deviation of 4 days.
For 35,000, 50th percentile
(a) The life spans of three randomly selected fruit flies are 34 days, 30 days,
34 - 33
32. (a) z1 = = 0.25, and 42 days. Find the z-score that corresponds to each life span and
4
determine if any of these life spans are unusual.
30 - 33
z2 = = -0.75, (b) The life spans of three randomly selected fruit flies are 29 days, 41 days,
4
and 25 days. Using the Empirical Rule, find the percentile that
42 - 33
z3 = = 2.25 corresponds to each life span.
4
The life span of 42 days is
unusual. Interpreting Percentiles In Exercises 3338, use the cumulative frequency distrib-
(b) For 29, 16th percentile ution to answer the questions. The cumulative frequency distribution represents
For 41, 97.5th percentile the heights of males in the United States in the 20 29 age group. The heights have
For 25, 2.5th percentile a bell-shaped distribution (see Picturing the World, page 80) with a mean of
33. About 67 inches; 20% of the 69.2 inches and a standard deviation of 2.9 inches. (Source: National Center for
heights are below 67 inches. Health Statistics)
34. 99th percentile
74 - 69.2 Adult Males Ages 2029
35. z1 = L 1.66
2.9
62 - 69.2 100
z2 = L -2.48 90
2.9
80
80 - 69.2 70
Percentile
z3 = L 3.72
2.9 60
50
The heights that are 62 and
40
80 inches are unusual. 30
70 - 69.2 20
36. z1 = L 0.28 10
2.9
66 - 69.2 62 64 66 68 70 72 74 76 78
z2 = L -1.10
2.9 Height (in inches)
68 - 69.2
z3 = L -0.41
2.9
33. What height represents the 20th percentile? How should you interpret this?
None of the heights are unusual.
34. What percentile is a height of 76 inches? How should you interpret this?
35. Three adult males in the 2029 age group are randomly selected. Their
heights are 74 inches, 62 inches, and 80 inches. Use z -scores to determine
which heights, if any, are unusual.
36. Three adult males in the 2029 age group are randomly selected. Their
heights are 70 inches, 66 inches, and 68 inches. Use z -scores to determine
which heights, if any, are unusual.
71.1 - 69.2 37. Find the z-score for a male in the 2029 age group whose height is
37. z = L 0.66
2.9 71.1 inches. What percentile is this?
About the 70th percentile
38. Find the z-score for a male in the 2029 age group whose height is
66.3 - 69.2 66.3 inches. What percentile is this?
38. z = = -1
2.9
About the 11th percentile
39. (a) Q1 = 42, Q2 = 49, Q3 = 56
(b) Ages of Executives
Extending Concepts
39. Ages of Executives The ages of a sample of 100 executives are listed.
DATA
27 42 49 56 82
31 62 51 44 61 47 49 45 40 52 60 51 67 47 63 54 59 43 63 52
50 54 61 41 48 49 51 54 39 54 47 52 36 53 74 33 53 68 44 40
25 35 45 55 65 75 85 60 42 50 48 42 42 36 57 42 48 56 51 54 42 27 43 43 41 54 49
Ages 49 47 51 28 54 36 36 41 60 55 42 59 35 65 48 56 82 39 54 49
(c) Half of the ages are between 61 56 57 32 38 48 64 51 45 46 62 63 59 63 32 47 40 37 49 57
42 and 56 years.
(d) 49, because half of the Over the hill or on top?
executives are older and half Number of 100 top executives
are younger. in the following age groups:
40. 5 TOP EXECUTIVES
36
41. 33.75 31
42. 10.975
43. 19.8 16
13
2 1 1
24.5 34.5 44.5 54.5 64.5 74.5 84.5
Age
(a) Order the data and find the first, second, and third quartiles.
(b) Draw a box-and-whisker plot that represents the data set.
(c) Interpret the results in the context of the data.
(d) On the basis of this sample, at what age would you expect to be an
executive? Explain your reasoning.
(e) Which age groups, if any, can be considered unusual? Explain your
reasoning.
Midquartile Another measure of position is called the midquartile. You can find
the midquartile of a data set by using the following formula.
Q1 + Q3
Midquartile =
2
In Exercises 4043, find the midquartile of the given data set.
40. 5 7 1 2 3 10 8 7 5 3
41. 23 36 47 33 34 40 39 24 32 22 38 41
42. 12.3 9.7 8.0 15.4 16.1 11.8 12.7 13.4
12.2 8.1 7.9 10.3 11.2
43. 21.4 20.8 19.7 15.2 31.9 18.7 15.6 16.7
19.8 13.4 22.9 28.7 19.8 17.2 30.1
Uses
It can be difficult to see trends or patterns from a set of raw data. Descriptive
statistics helps you do so. A good description of a data set consists of three
features: (1) the shape of the data, (2) a measure of the center of the data, and
(3) a measure of how much variability there is in the data. When you read
reports, news items, or advertisements prepared by other people, you are
seldom given raw data sets. Instead, you are given graphs, measures of central
tendency, and measures of variation. To be a discerning reader, you need to
understand the terms and techniques of descriptive statistics.
Abuses
Cropped Vertical Axis Misleading statistical graphs are common in newspapers
and magazines. Compare the two time series charts below. The data are the same
for each. However, the first graph has a cropped vertical axis, which makes it
appear that the stock price has increased greatly over the 10-year period. In the
second graph, the scale on the vertical axis begins at zero. This graph correctly
shows that stock prices increased only modestly during the 10-year period.
62 80
60 70
58 60
56 50
54
40
52
50 30
48 20
46 10
1996 1998 2000 2002 2004 1996 1998 2000 2002 2004
Year Year
Effect of Outliers on the Mean Outliers, or extreme values, can have significant
effects on the mean. Suppose, for example, that in recruiting information, a
company stated that the average commission earned by the five people in its
salesforce was $60,000 last year. This statement would be misleading if four of the
five earned $25,000 and the fifth person earned $200,000.
Exercises
1. Cropped Vertical Axis In a newspaper or magazine, find an example of a
graph that has a cropped vertical axis. Is the graph misleading? Do you
think this graph was intended to be misleading? Redraw the graph so that
it is not misleading.
2. Effect of Outliers on the Mean Describe a situation in which an outlier can
make the mean misleading. Is the median also affected significantly by
outliers? Explain your reasoning.
105
Cyan Magenta Yellow Black Pantone 299
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
106 CHAPTER 2 Descriptive Statistics
Chapter Summary
2
What did you learn? Review Exercises
Section 2.1
How to construct a frequency distribution including limits, boundaries, 1
midpoints, relative frequencies, and cumulative frequencies
How to construct frequency histograms, frequency polygons, relative 26
frequency histograms, and ogives
Section 2.2
How to graph quantitative data sets using the exploratory data analysis tools 7, 8
of stem-and-leaf plots and dot plots
How to graph and interpret paired data sets using scatter plots and time 9, 10
series charts
How to graph qualitative data sets using pie charts and Pareto charts 11, 12
Section 2.3
How to find the mean, median, and mode of a population and a sample 13, 14
gx gx
m = ,x =
N n
How to find a weighted mean of a data set and the mean of a frequency 1518
g1x # w2 g1x # f2
distribution x = ,x =
gw n
How to describe the shape of a distribution as symmetric, uniform, or 1924
skewed and how to compare the mean and median for each
Section 2.4
How to find the range of a data set 25, 26
How to find the variance and standard deviation of a population and a sample 2730
g1x - m2 2
g1x - x2 2
s = ,s =
A N A n - 1
How to use the Empirical Rule and Chebychevs Theorem to interpret 3134
standard deviation
How to approximate the sample standard deviation for grouped data 35, 36
g1x - x2 f 2
s =
A n - 1
Section 2.5
How to find the quartiles and interquartile range of a data set 3739, 41
How to draw a box-and-whisker plot 40, 42
How to interpret other fractiles such as percentiles 43, 44
How to find and interpret the standard score ( z -score) z = 1x - m2>s 4548
Review Exercises
2
1. See Odd Answers, page A## Section 2.1
2. See Selected Answers, page A## In Exercises 1 and 2, use the following data set. The data set represents the income
3. Liquid Volume 12-oz Cans DATA (in thousands of dollars) of 20 employees at a small business.
12
30 28 26 39 34 33 20 39 28 33
10
Frequency
8 26 39 32 28 31 39 33 31 33 32
6
4 1. Make a frequency distribution of the data set using five classes. Include the
2 class midpoints, limits, boundaries, frequencies, relative frequencies, and
cumulative frequencies.
11.875
11.915
11.955
11.995
12.035
12.075
12.115
Actual volume (in ounces) 2. Make a relative frequency histogram using the frequency distribution in
Exercise 1. Then determine which class has the greatest relative frequency
4. See Selected Answers, page A##
and which has the least relative frequency.
5.
Class Midpoint Frequency, f
7993 86 9 In Exercises 3 and 4, use the following data set. The data represent the actual liquid
94108 101 12
DATA volume (in ounces) in 24 twelve-ounce cans.
109123 116 5 11.95 11.91 11.86 11.94 12.00 11.93 12.00 11.94
124138 131 3 12.10 11.95 11.99 11.94 11.89 12.01 11.99 11.94
139153 146 2 11.92 11.98 11.88 11.94 11.98 11.92 11.95 11.93
154 168 161 1
3. Make a frequency histogram using seven classes.
gf = 32
4. Make a relative frequency histogram of the data set using seven classes.
Meals Purchased
14
12 In Exercises 5 and 6, use the following data set. The data represent the number of
Frequency
Number of meals 5. Make a frequency distribution with six classes and draw a frequency polygon.
6. See Selected Answers, page A##
6. Make an ogive of the data set using six classes.
7. 1 3 7 8 9
2 012333445557889
3 11234578 Section 2.2
4 347 In Exercises 7 and 8, use the following data set.The data represent the average daily
5 1 DATA high temperature (in degrees Fahrenheit) during the month of January for Chicago,
8. See Selected Answers, page A## Illinois. (Source: National Oceanic and Atmospheric Administration)
9. Height of Buildings
33 31 25 22 38 51 32 23 23 34 44 43 47 37 29 25
60
55
28 35 21 24 20 19 23 27 24 13 18 28 17 25 31
Number of stories
50
45 7. Make a stem-and-leaf plot of the data set. Use one line per stem.
40
35
30
8. Make a dot plot of the data set.
25
20 9. The following are the heights (in feet) and the number of stories of nine
400 500 600 700 800 notable buildings in Miami. Use the data to construct a scatter plot. What
Height (in feet)
type of pattern is shown in the scatter plot? (Source: Skyscrapers.com)
The number of stories appears to Height (in feet) 764 625 520 510 484 480 450 430 410
increase with height.
Number of stories 55 47 51 28 35 40 33 31 40
10. U.S. Unemployment Rate 10. The U.S. unemployment rate over a 12-year period is given. Use the data to
DATA construct a time series chart. (Source: U.S. Bureau of Labor Statistics)
Unemployment rate
8
7
6
5 Year 1992 1993 1994 1995 1996 1997
4
3
Unemployment rate 7.5 6.9 6.1 5.6 5.4 4.9
2
1 Year 1998 1999 2000 2001 2002 2003
Unemployment rate 4.5 4.2 4.0 4.7 5.8 6.0
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
2002
2003
Year
In Exercises 11 and 12, use the following data set. The data set represents the top
seven American Kennel Club registrations (in thousands) in 2003. (Source: American
Kennel Club)
160
(in thousands)
140
120 12. Make a pie chart of the data set.
100
80
60
40
20
Section 2.3
13. Find the mean, median, and mode of the data set.
Labrador
retriever
Golden
retriever
Beagle
German
shepherd
Dachshund
Yorkshire
terrier
Boxer
9 7 8 6 9 12 11 5 9 10
Breed
14. Find the mean, median, and mode of the data set.
12. American Kennel Club 28 35 29 29 33 32 29 33 31 29
Boxer
Yorkshire 9%
Labrador
15. Estimate the mean of the frequency distribution you made in Exercise 1.
terrier
10% retriever
36% 16. The following frequency distribution shows the number of magazine
Dachshund
10% subscriptions per household for a sample of 60 households. Find the mean
Beagle number of subscriptions per household.
11%
German Golden
shepherd retriever Number of magazines 0 1 2 3 4 5 6
11% 13%
Frequency 13 9 19 8 5 2 4
13. Mean = 8.6
17. Six test scores are given. The first five test scores are 15% of the final grade,
Median = 9
and the last test score is 25% of the final grade. Find the weighted mean of
Mode = 9 the test scores.
14. Mean = 30.8
65 72 84 89 70 90
Median = 30
Mode = 29 18. Four test scores are given. The first three test scores are 20% of the final
15. 31.7
grade, and the last test score is 40% of the final grade. Find the weighted
mean of the test scores.
16. 2.1
17. 79.5 81 95 89 87
18. 87.8 19. Describe the shape of the distribution in the histogram you made in
19. Skewed Exercise 3. Is the distribution symmetric, uniform, or skewed?
20. Skewed 20. Describe the shape of the distribution in the histogram you made in
Exercise 4. Is the distribution symmetric, uniform, or skewed?
21. Skewed left In Exercises 21 and 22, determine whether the approximate shape of the
22. Skewed right distribution in the histogram is skewed right, skewed left, or symmetric.
23. Median 21. 22.
12 12
24. Mean
10 10
25. 2.8 8 8
26. 3.84 6 6
4 4
27. Population mean = 9
2 2
Standard deviation L 3.2
2 6 10 14 18 22 26 30 34 2 6 10 14 18 22 26 30 34
28. Population mean = 69
Standard deviation L 7.8 23. For the histogram in Exercise 21, which is greater, the mean or the median?
29. Sample mean = 2453.4
24. For the histogram in Exercise 22, which is greater, the mean or the median?
Standard deviation L 306.1
30. Sample mean = 38,653.5 Section 2.4
Standard deviation L 6762.6
25. The data set represents the mean price of a movie ticket (in U.S. dollars) for
31. Between $21.50 and $36.50 a sample of 12 U.S. cities. Find the range of the data set.
32. 68%
7.82 7.38 6.42 6.76 6.34 7.44 6.15 5.46 7.92 6.58 8.26 7.17
26. The data set represents the mean price of a movie ticket (in U.S. dollars) for
a sample of 12 Japanese cities. Find the range of the data set.
19.73 16.48 19.10 18.56 17.68 17.19
16.63 15.99 16.66 19.59 15.89 16.49
27. The mileage (in thousands) for a rental car companys fleet is listed. Find
the population mean and standard deviation of the data.
6 14 3 7 11 13 8 5 10 9 12 10
28. The age of each Supreme Court justice as of August 20, 2003 is listed. Find
the population mean and standard deviation of the data. (Source: Supreme
Court of the United States)
78 83 73 67 67 63 55 70 65
29. Dormitory room prices (in dollars for one school year) for a sample of
four-year universities are listed. Find the sample mean and the sample
standard deviation of the data.
2445 2940 2399 1960 2421 2940 2657 2153
2430 2278 1947 2383 2710 2761 2377
30. Sample salaries (in dollars) of public school teachers are listed. Find the
sample mean and standard deviation of the data.
46,098 36,259 35,084 38,617 42,690 26,202 47,169 37,109
31. The mean rate for cable television from a sample of households was $29.00
per month, with a standard deviation of $2.50 per month. Between what
two values do 99.7% of the data lie? (Assume a bell-shaped distribution.)
32. The mean rate for cable television from a sample of households was $29.50
per month, with a standard deviation of $2.75 per month. Estimate the
percent of cable television rates between $26.75 and $32.25. (Assume that
the data set has a bell-shaped distribution.)
33. 30 33. The mean sale per customer for 40 customers at a grocery store is $23.00,
34. 15 with a standard deviation of $6.00. On the basis of Chebychevs Theorem,
35. Sample mean L 2.5 at least how many of the customers spent between $11.00 and $35.00?
Standard deviation L 1.2 34. The mean length of the first 20 space shuttle flights was about 7 days, and
36. Sample mean = 2.4 the standard deviation was about 2 days. On the basis of Chebychevs
Standard deviation L 1.7 Theorem, at least how many of the flights lasted between 3 days and 11
days? (Source: NASA)
37. 56
38. 70 35. From a random sample of households, the number of television sets are
39. 14 listed. Find the sample mean and standard deviation of the data.
40. Height of Students Number of televisions 0 1 2 3 4 5
Number of households 1 8 13 10 5 3
50 56 63 70 75 36. From a random sample of airplanes, the number of defects found in their
fuselages are listed. Find the sample mean and standard deviation of the data.
50 55 60 65 70 75
Heights Number of defects 0 1 2 3 4 5 6
Number of airplanes 4 5 2 9 1 3 1
41. 4
42. Weight of Football Players
Section 2.5
In Exercises 3740, use the following data set. The data represent the heights
145 173 190 208 240 (in inches) of students in a statistics class.
50 51 54 54 56 59 60 61 61 63
140
150
160
170
180
190
200
210
220
230
240
Weights 64 65 68 69 70 70 71 71 75
43. 23% scored higher than 68. 37. Find the height that corresponds 38. Find the height that corresponds
44. 88th percentile to the first quartile. to the third quartile.
45. z = 2.33, unusual 39. Find the interquartile range. 40. Make a box-and-whisker plot of
46. z = -1.5, not unusual the data.
47. z = 1.25, not unusual
48. z = -2.125, unusual 41. Find the interquartile range of the data from Exercise 14.
42. The weights (in pounds) of the defensive players on a high school football
team are given. Make a box-and-whisker plot of the data.
173 145 205 192 197 227 156 240 172 185
208 185 190 167 212 228 190 184 195
43. A students test grade of 68 represents the 77th percentile of the grades.
What percent of students scored higher than 68?
44. In 2004 there were 728 oldies radio stations in the United States. If one
station finds that 84 stations have a larger daily audience than it does, what
percentile does this station come closest to in the daily audience rankings?
(Source: Radioinfo.com)
In Exercises 4548, use the following information. The weights of 19 high school
football players have a bell-shaped distribution, with a mean of 192 pounds and a
standard deviation of 24 pounds. Use z-scores to determine if the weights of the
following randomly selected football players are unusual.
45. 248 pounds 46. 156 pounds 47. 222 pounds 48. 141 pounds
Chapter Quiz
2
Take this quiz as you would take a quiz in class. After you are done, check your work
against the answers given in the back of the book.
1. See Odd Answers, page A##
2. 125.2, 13.0 1. The data set is the number of minutes a sample of 25 people exercise
3. (a) U.S. Sporting Goods
DATA each week.
Recreational
Footwear transport 108 139 120 123 120 132 123 131 131
13% 34%
157 150 124 111 101 135 119 116 117
127 128 139 119 118 114 127
Clothing
22%
(a) Make a frequency distribution of the data set using five classes. Include
class limits, midpoints, frequencies, boundaries, relative frequencies, and
Equipment cumulative frequencies.
31%
(b) Display the data using a frequency histogram and a frequency polygon
(b) U.S. Sporting Goods on the same axes.
(in billions of dollars)
16
14
12 (c) Display the data using a relative frequency histogram.
Sales
10
8
6
(d) Describe the distributions shape as symmetric, uniform, or skewed.
4
2 (e) Display the data using a box-and-whisker plot.
Recreational
transport
Equipment
Clothing
Footwear
7. The number of wins for each Major League Baseball team in 2003 are listed.
DATA (Source: Major League Baseball)
101 95 86 71 63 90 86 83 68 43
96 93 77 71 101 91 86 83 66 88
87 85 75 69 68 100 85 84 74 64
(a) Find the quartiles of the data set.
(b) Find the interquartile range.
(c) Draw a box-and-whisker plot.
FPO www.dfamilk.com
94 95 96 97 98 99 00 01 02 03
The following data set was supplied by a dairy Year
farmer. It lists the monthly milk production (in (Source: National Agricultural Statistics Service)
pounds) for 50 Holstein dairy cows. (Source:
Matlink Dairy, Clymer, NY) Rate per Cow, 19942003
19,000
2825 2072 2733 2069 2484 18,500
Pounds of milk
4285 2862 3353 1449 2029 18,000
1258 2982 2045 1677 1619 17,500
17,000
2597 3512 2444 1773 2284 15% increase over
16,500
1884 2359 2046 2364 2669 16,000 a 10-year period
3109 2804 1658 2207 2159
94 95 96 97 98 99 00 01 02 03
2207 2882 1647 2051 2202 Year
3223 2383 1732 2230 1147 (Source: National Agricultural Statistics Service)
2711 1874 1979 1319 2923 From 1994 to 2003, the number of dairy cows
2281 1230 1665 1294 2936 in the United States decreased and the yearly
milk production increased.
Exercises
In Exercises 14, use a computer or calculator. If In Exercises 68, use the frequency distribution
possible, print your results. found in Exercise 3.
1. Find the sample mean of the data. 6. Use the frequency distribution to estimate the
sample mean of the data. Compare your
2. Find the sample standard deviation of the data.
results with Exercise 1.
3. Make a frequency distribution for the data.
7. Use the frequency distribution to find the
Use a class width of 500.
sample standard deviation for the data.
4. Draw a histogram for the data. Does the Compare your results with Exercise 2.
distribution appear to be bell shaped?
8. Writing Use the results of Exercises 6 and 7 to
5. What percent of the distribution lies within write a general statement about the mean and
one standard deviation of the mean? Within standard deviation for grouped data. Do the
two standard deviations of the mean? How do formulas for grouped data give results that are
these results agree with the Empirical Rule? as accurate as the individual entry formulas?
Graph
Plot... 130
Time Series Plot... 120
Chart...
Contour Plot... 40
30
20
10
0
2 Variances...
Correlation...
Covariance...
Normality Test...
(See Example 4, page 96.)
Graph
Plot...
Time Series Plot... 35
Chart...
Histogram...
Boxplot...
Test Score
25
Matrix Plot...
Draftsman Plot...
Contour Plot... 15
(See Example 7, page 55.) (See Example 4, page 77.) (See Example 4, page 96.)
Type: Type:
Xlist: L1 Xlist: L1
Ylist: L2 Freq: 1
Mark: + .
Section 1.3
1. (1a) Focus: Effect of exercise on senior citizens.
(1b) Population: Collection of all senior citizens.
(1c) Experiment
(2a) Focus: Effect of radiation fallout on senior citizens.
(2b) Population: Collection of all senior citizens.
(2c) Sampling
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
TRY IT YOURSELF ANSWERS A31
Relative frequency
Frequency, Mid- Relative Cumulative 0.25
Class f point frequency frequency 0.20
0.15
0 9 15 4.5 0.1875 15 0.10
10 19 19 14.5 0.2375 34 0.05
20 29 14 24.5 0.1750 48
4.5
14.5
24.5
34.5
44.5
54.5
64.5
74.5
30 39 7 34.5 0.0875 55
40 49 14 44.5 0.1750 69 Age
50 59 6 54.5 0.0750 75
6a. Use upper class boundaries for the horizontal scale and
60 69 4 64.5 0.0500 79 cumulative frequency for the vertical scale.
70 79 1 74.5 0.0125 80
b. See part (c).
f
a f = 80 g = 1 c. Ages of Akhiok Residents
n
Cumulative frequency
80
72
c. 42.5% of the population is under 20 years old. 6.25% of 64
56
48
the population is over 59 years old. 40
32
3a. b. Use class midpoints for the 24
Class boundaries 16
8
horizontal scale and frequency
0.5
9.5
19.5
29.5
39.5
49.5
59.5
69.5
79.5
- 0.59.5 for the vertical scale.
9.519.5 Age
19.529.5
29.539.5 d. Approximately 69 residents are 49 years old or younger.
39.549.5 7a. Enter data.
49.559.5 b. 20
59.569.5
69.579.5
c. Ages of Akhiok Residents d. Same as 2c.
20 0 80
0
16
Frequency
12
8
Section 2.2
4
1a. 0
4.5
14.5
24.5
34.5
44.5
54.5
64.5
74.5
1
Age
2
4a. Same as 3b. 3
b. See part (c). 4
c. Ages of Ahkiok Residents 5
20
6
18
16 7
Frequency
14
12 b. Key: 3 3 = 33
10
8
6
4
0 527153101339045
2
1 8256337307823893699
5.5
4.5
14.5
24.5
34.5
44.5
54.5
64.5
74.5
84.5
2 54203340159666
Age 3 9697993
d. The population increases up to the age of 14.5 and then 4 42471800199519
decreases. Population increases again between the ages of 5 831689
34.5 and 44.5, but then after 44.5, the population decreases. 6 0878
7 2
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A32 TRY IT YOURSELF ANSWERS
5 136889
6 0788 c. As a percentage of total motor vehicle deaths, car deaths
7 2 decreased by 10%, truck deaths increased by 9%, and
d. It seems that most of the residents are under 40. motorcycle deaths stayed about the same.
2ab. Key: 3 3 = 33 5a. Cause Frequency, f
0 0011123334 Auto Dealers 14,668
0 55579 Auto Repair 9,728
1 02233333 Home Furnishing 7,792
1 56677888999 Computer Sales 5,733
2 00123344 Dry Cleaning 4,649
2 556669
b. Causes of BBB Complaints
3 3 16,000
14,000
3 679999
Frequency
12,000
10,000
4 00111244 8,000
6,000
4,000
4 578999 2,000
5 13
Auto
dealers
Auto
repairs
Home
furnishing
Computer
sales
Dry
cleaning
5 6889
6 0 Cause
6 788 c. It appears that the auto industry (dealers and repair shops)
7 2 account for the largest portion of complaints filed at the
BBB.
7
3a. Use ages for the horizontal axis. 6ab. Salaries c. It appears that the longer
50,000 an employee is with the
b.
Salary (in dollars)
f
gf = 34,145 g L 1 a = 360 Year
n
Section 2.3
1a. 578 b. 41.3
c. The typical age of an employee in a department store is
41.3 years old.
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
TRY IT YOURSELF ANSWERS A33
2a. 0, 0, 1, 1, 1, 2, 3, 3, 4, 5, 5, 5, 9, 10, 12, 12, 13, 13, 13, 13, 13, 15, Section 2.4
16, 16, 17, 17, 18, 18, 18, 19, 19, 19, 20, 20, 21, 22, 23, 23, 24,
24, 25, 25, 26, 26, 26, 29, 36, 39, 39, 39, 39, 40, 40, 41, 41, 41, 1a. Min = 23, or $23,000; Max = 58, or $58,000
42, 44, 44, 45, 47, 48, 49, 49, 49, 51, 53, 56, 58, 58, 60, 67, 68, b. 35, or $35,000
68, 72 c. The range of the starting salaries for Corporation B is 35,
b. 23 or $35,000 (much larger than the range of Corporation A).
3a. 0, 0, 1, 1, 1, 2, 3, 3, 3, 4, 5, 5, 5, 7, 9, 10, 12, 12, 13, 13, 13, 13, 2a. 41.5, or $41,500
13, 15, 16, 16, 17, 17, 18, 18, 18, 19, 19, 19, 20, 20, 21, 22, 23, b. Salary, x Deviation, x M
23, 24, 24, 25, 25, 26, 26, 26, 29, 33, 36, 37, 39, 39, 39, 39, 40,
(1000s of dollars) (1000s of dollars)
40, 41, 41, 41, 42, 44, 44, 45, 47, 48, 49, 49, 49, 51, 53, 56, 58,
58, 59, 60, 67, 68, 68, 72 23 - 18.5
b. 23.5 29 - 12.5
c. Half of the residents of Akhiok are younger than 23.5 32 - 9.5
years old and half are older than 23.5 years old. 40 - 1.5
4a. 0, 0, 1, 1, 1, 2, 3, 3, 3, 4, 5, 5, 5, 7, 9, 10, 12, 12, 13, 13, 13, 13, 41 - 0.5
13, 15, 16, 16, 17, 17, 18, 18, 18, 19, 19, 19, 20, 20, 21, 22, 23, 41 - 0.5
23, 24, 24, 25, 25, 26, 26, 26, 29, 33, 36, 37, 39, 39, 39, 39, 40, 49 7.5
40, 41, 41, 41, 42, 44, 44, 45, 47, 48, 49, 49, 49, 51, 53, 56, 58, 50 8.5
58, 59, 60, 67, 68, 68, 72
52 10.5
b. 13 c. The mode of the ages is 13 years old. 58 16.5
5a. Yes b. The mode of the responses to the survey is
gx = 415 g1x - m2 = 0
Yes.
6a. 21.6; 21; 20
3ab. m = 41.5, or $41,500
b. The mean in Example 6 1 x L 23.82 was heavily
influenced by the age 65. Neither the median nor the Salary, x x M 1x M22
mode was affected as much by the age 65.
23 - 18.5 342.25
7ab.
29 - 12.5 156.25
Score, Weight, 32 - 9.5 90.25
Source x w x w # 40 - 1.5 2.25
Test Mean 86 0.50 43.0 41 - 0.5 0.25
Midterm 96 0.15 14.4 41 - 0.5 0.25
Final 98 0.20 19.6 49 7.5 56.25
Computer Lab 98 0.10 9.8 50 8.5 72.25
Homework 100 0.05 5.0 52 10.5 110.25
gw = 1.00 g1x # w2 = 91.8 58 16.5 272.25
gx = 415 g1x - m2 = 0 g1x - m22 = 1102.5
c. 91.8 d. The weighted mean for the course is 91.8.
8abc. c. 110.3 d. 10.5, or $10,500
Frequency, e. The population variance is 110.3 and the population
Class Midpoint, x f #
x f standard deviation is 10.5, or $10,500.
0 9 4.5 15 67.50 4a. See 3ab. b. 122.5 c. 11.1, or $11,100
10 19 14.5 19 275.50 5a. Enter data. b. 37.89; 3.98
20 29 24.5 14 343.00 6a. 7, 7, 7, 7, 7, 13, 13, 13, 13, 13 b. 3
30 39 34.5 7 241.50 7a. 1 standard deviation b. 34%
40 49 44.5 14 623.00 c. The estimated percent of the heights that are between
50 59 54.5 6 327.00 61.25 and 64 inches is 34%.
60 69 64.5 4 258.00 8a. 0 b. 70.6
70 79 74.5 1 74.50 c. At least 75% of the data lie within 2 standard deviations
N = 80 g(x # f 2 = 2210 of the mean. At least 75% of the population of Alaska is
between 0 and 70.6 years old.
d. 27.6
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A34 TRY IT YOURSELF ANSWERS
1x x22 1x x22 # f
c. 0 13 23.5 41.5 72
x x
- 1.70 2.8900 28.90 0 10 20 30 40 50 60 70 80
d. 1.5 71 - 70
z2 = = 0.125
8
10a.
Class x f xf 92 - 70
z3 = = 2.75
0 99 49.5 380 18,810 8
100 199 149.5 230 34,385 c. From the z-score, $60 is 1.25 standard deviations below the
200 299 249.5 210 52,395 mean, $71 is 0.125 standard deviation above the mean, and
$92 is 2.75 standard deviations above the mean.
300 399 349.5 50 17,475
7a. NFL: m = 23.6, s = 6.0
400 499 449.5 60 26,970
500+ 650.0 70 45,500 AFL: m = 11.7, s = 4.6
n = 1000 gxf = 195,535 b. Kansas City: z = -1.27
Tampa Bay: z = 0.07
b. 195.5
1x x22 1x x22f
c. The number of field goals scored by Kansas City is
c.
x x 1.27 standard deviations below the mean and the number
of field goals scored by Tampa Bay is 0.07 standard
- 146.04 21,327.68 8,104,518.4 deviations above the mean. Comparing the two measures
- 46.04 2,119.68 487,526.4 of position indicates that Tampa Bay has a higher position
53.96 2,911.68 611,452.8 within the AFL than Kansas City has in the NFL.
153.96 23,703.68 1,185,184.0
253.96 64,495.68 3,869,740.8
454.46 206,533.89 14,457,372.3
g1x - x22f = 28,715,794.7
d. 169.5
Section 2.5
1a. 0, 0, 1, 1, 1, 2, 3, 3, 3, 4, 5, 5, 5, 7, 9, 10, 12, 12, 13, 13, 13, 13,
13, 15, 16, 16, 17, 17, 18, 18, 18, 19, 19, 19, 20, 20, 21, 22, 23,
23, 24, 24, 25, 25, 26, 26, 26, 29, 33, 36, 37, 39, 39, 39, 39, 40,
40, 41, 41, 41, 42, 44, 44, 45, 47, 48, 49, 49, 49, 51, 53, 56, 58,
58, 59, 60, 67, 68, 68, 72
b. 23.5 c. 13, 41.5
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
ODD ANSWERS A3
9. (a) 10
25.
(b) and (c)
Frequency, Mid- Relative Cumulative
Class Midpoint Class boundaries Class f point frequency frequency
20 29 24.5 19.529.5 1000 2019 12 1509.5 0.5455 12
30 39 34.5 29.539.5 2020 3039 3 2529.5 0.1364 15
40 49 44.5 39.549.5 3040 4059 2 3549.5 0.0909 17
50 59 54.5 49.559.5 4060 5079 3 4569.5 0.1364 20
60 69 64.5 59.569.5 5080 6099 1 5589.5 0.0455 21
70 79 74.5 69.579.5 6100 7119 1 6609.5 0.0455 22
80 89 84.5 79.589.5 f
gf = 22 g L 1
11. N
Frequency, Mid- Relative Cumulative
Class f point frequency frequency July Sales for
Representatives
20 29 10 24.5 0.01 10
14
30 39 132 34.5 0.13 142 12
Frequency
10
40 49 284 44.5 0.29 426 8
50 59 300 54.5 0.30 726 6
4
60 69 175 64.5 0.18 901 2
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A4 ODD ANSWERS
27. 31.
Frequency, Mid- Relative Cumulative Frequency, Mid- Relative Cumulative
Class f point frequency frequency Class f point frequency frequency
291318 5 304.5 0.1667 5 3336 8 34.5 0.3077 8
319 346 4 332.5 0.1333 9 3740 6 38.5 0.2308 14
347374 3 360.5 0.1000 12 4144 5 42.5 0.1923 19
375402 5 388.5 0.1667 17 4548 2 46.5 0.0769 21
403 430 6 416.5 0.2000 23 4952 5 50.5 0.1923 26
431 458 4 444.5 0.1333 27 f
gf = 26 g L 1
459 486 1 472.5 0.0333 28 n
487514 2 500.5 0.0667 30
Heights of Douglas-Fir Trees
f
gf = 30 g = 1 0.35
Relative frequency
n 0.30
0.25
Reaction Times for Females 0.20
0.15
6 0.10
0.05
Frequency
34.5
38.5
42.5
46.5
50.5
2
Heights (in feet)
304.5
332.5
360.5
388.5
416.5
444.5
472.5
500.5
25
n
20
Bowling Scores 15
0.40 10
Relative frequency
0.35
0.30 5
0.25
0.20
49.5 57.5 65.5 73.5
0.15
Ages
0.10
0.05
Location of the greatest increase in frequency: 62 65
157.5
181.5
205.5
229.5
253.5
Scores
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
ODD ANSWERS A5
Relative frequency
Class f frequency frequency 0.35
0.30
24 9 0.3214 9 0.25
0.20
57 6 0.2143 15 0.15
0.10
8 10 7 0.2500 22 0.05
11 13 3 0.1071 25
63.5
69.5
75.5
81.5
87.5
93.5
99.5
105.5
14 16 2 0.0714 27
Dollars (in hundreds)
17 19 1 0.0357 28
f (b) 16.7%, because the sum of the relative frequencies
gf = 28 g L 1
n for the last three classes is 0.167.
(c) $9600, because the sum of the relative frequencies for
Gallons of Gasoline Purchased the last two classes is 0.10.
41. Histogram (5 Classes) Histogram (10 Classes)
Cumulative frequency
30
25 8 6
20 7
5
6
Frequency
Frequency
15
5 4
10 4 3
5 3 2
2
1.5 7.5 13.5 19.5 1 1
Gasoline (in gallons)
2 5 8 11 14 1.5 5.5 9.5 13.5 17.5
Data Data
Location of the greatest increase in frequency: 24
37. Histogram (20 Classes)
Frequency, Mid- Relative Cumulative 5
Class f point frequency frequency 4
Frequency
47 57 1 52 0.05 1 3
58 68 1 63 0.05 2 2
69 79 5 74 0.25 7 1
80 90 8 85 0.40 15
1 3 5 7 9 11 13 15 17 19
91101 5 96 0.25 20 Data
f
gf = 20 g = 1 In general, a greater number of classes better preserves the
N
actual values of the data set but is not as helpful for observing
general trends and making conclusions. In choosing the
Exam Scores number of classes, an important consideration is the size of
10 the data set. For instance, you would not want to use
20 classes if your data set contained 20 entries. In this
Frequency
6
particular example, as the number of classes increases, the
histogram shows more fluctuation. The histograms with 10
4
and 20 classes have classes with zero frequencies. Not much
2
is gained by using more than five classes. Therefore, it
41 52 63 74 85 96 107 appears that five classes would be best.
Scores
Classes with least frequency: 4757 and 58 68 1. Quantitative: stem-and-leaf plot, dot plot, histogram,
scatter plot, time series chart
Qualitative: pie chart, Pareto chart
3. a 4. d 5. b 6. c
7. 27, 32, 41, 43, 43, 44, 47, 47, 48, 50, 51, 51, 52, 53, 53, 53, 54,
54, 54, 54, 55, 56, 56, 58, 59, 68, 68, 68, 73, 78, 78, 85
Max: 85; Min: 27
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A6 ODD ANSWERS
9. 13, 13, 14, 14, 14, 15, 15, 15, 15, 15, 16, 17, 17, 18, 19 25. Teachers Salaries
Max: 19; Min: 13 55
15. Key: 3 3 = 33 13 15 17 19 21
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
19. Housefly Life Spans
Year
Atlanta, GA
Concord, NH
Boise, ID
Denver, CO
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
ODD ANSWERS A7
Frequency
5
21. (a) x = not possible 4
3
median = not possible 2
1
mode = Worse
3.5
5.5
7.5
9.5
11.5
13.5
(b) Mode, because the data are at the nominal level of
Days hospitalized
measurement.
23. (a) x L 170.63 49. Class Frequency, f Midpoint
median = 169.3
6264 3 63
mode = none 6567 7 66
(b) Mean, because there are no outliers. 6870 9 69
25. (a) x = 22.6 7173 8 72
median = 19 7476 3 75
mode = 14 gf = 30
(b) Median, because the distribution is skewed.
27. (a) x L 14.11 Heights of Males Symmetric
9
median = 14.25 8
7
Frequency
mode = 2.5 6
5
(b) Mean, because there are no outliers. 4
3
2
29. (a) x = 41.3 1
median = 39.5 63 66 69 72 75
Heights
mode = 45 (to the nearest inch)
(b) Median, because the distribution is skewed.
31. (a) x L 19.5 51. (a) x = 6.005 (b) x = 5.945
median = 20 median = 6.01 median = 6.01
mode = 15 (c) Mean
(b) Median, because the distribution is skewed. 53. (a) Mean, because Car A has the highest mean of the
33. A = mode, because its the data entry that occurred most three.
often. (b) Median, because Car B has the highest median of the
B = median, because the distribution is skewed right. three.
C = mean, because the distribution is skewed right. (c) Mode, because Car C has the highest mode of the
three.
35. Mode, because the data are at the nominal level of
measurement.
37. Mean, because there are no outliers.
39. 89.3 41. 2.8 43. 65.5 45. 35.0
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A8 ODD ANSWERS
55. (a) x L 49.2 (b) median = 46.5 23. (a) Greatest sample standard deviation: (ii)
(c) Key: 3 6 = 36 (d) Positively skewed Data set (ii) has more entries that are farther away
1 13 from the mean.
2 28 Least sample standard deviation: (iii)
3 6667778 Data set (iii) has more entries that are close to the
mean.
4 13467 mean
(b) The three data sets have the same mean but have
5 1113
different standard deviations.
6 1234 median
25. (a) Greatest sample standard deviation: (ii)
7 2246
Data set (ii) has more entries that are farther away
8 5 from the mean.
9 0 Least sample standard deviation: (iii)
57. Two different symbols are needed because they describe a Data set (iii) has more entries that are close to the
measure of central tendency for two different sets of data mean.
(sample is a subset of the population).
(b) The three data sets have the same mean, median, and
mode but have different standard deviations.
Section 2.4 (page 84) 27. Similarity: Both estimate proportions of the data
1. Range = 7, mean = 8.1, variance L 5.7, contained within k standard deviations of the mean.
standard deviation L 2.4 Difference: The Empirical Rule assumes the distribution
3. Range = 14, mean L 11.1, variance L 21.6, is bell shaped; Chebychevs Theorem makes no such
standard deviation L 4.6 assumption.
5. 73 29. 68% 31. (a) 51 (b) 17
7. The range is the difference between the maximum and 33. $1250, $1375, $1450, $550 35. 24
minimum values of a data set. The advantage of the range 37. Sample mean L 2.1
is that it is easy to calculate. The disadvantage is that it Sample standard deviation L 1.3
uses only two entries from the data set.
Max - Min 14 - 4
9. The units of variance are squared. Its units are 39. Class width = = = 2
5 5
meaningless. (Example: dollars 2 )
11. (a) Range = 25.1 Class f Midpoint, x xf
(b) Range = 45.1 4 5 10 4.5 40.5
(c) Changing the maximum value of the data set greatly 67 6 6.5 39.0
affects the range. 89 3 8.5 25.5
13. (a) has a standard deviation of 24 and (b) has a standard 10 11 7 10.5 73.5
deviation of 16, because the data in (a) have more 12 14 6 13.0 78.0
variability.
N = 32 gxf = 261
15. When calculating the population standard deviation, you
1x M22 1x M22f
divide the sum of the squared deviations by n, then take
x M
the square root of that value. When calculating the
sample standard deviation, you divide the sum of the - 3.7 13.69 136.90
squared deviations by n - 1 , then take the square root - 1.7 2.89 17.34
of that value. 0.3 0.09 0.27
17. Company B 2.3 5.29 37.03
19. (a) Los Angeles: 17.6, 37.35, 6.11 4.8 23.04 138.24
Long Beach: 8.7, 8.71, 2.95 g1x - m22f = 329.78
(b) It appears from the data that the annual salaries in
Los Angeles are more variable than the salaries in gxf 261
Long Beach. m = = L 8.2
N 32
21. (a) Males: 405; 16,225.3; 127.4
g1x - m22 f 329.78
Females: 552; 34,575.1; 185.9 s = = L 3.2
C N B 32
(b) It appears from the data that the SAT scores for
females are more variable than the SAT scores for
males.
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
ODD ANSWERS A9
1x x22 1x x22f
point, (c) x = 55, s L 30.28
f x xf x x (d) When each entry is multiplied by a constant k, the
1 70.5 70.5 - 44 1936 1936 new sample mean is k # x , and the new sample stan-
12 92.5 1110.0 - 22 484 5808 dard deviation is k # s.
25 114.5 2862.5 0 0 0 49. 10
10 136.5 1365.0 22 484 4840 1
Set 1 - = 0.99 and solve for k.
2 158.5 317.0 44 1936 3872 k2
n = 50 gxf = 5725 g1x - x22 f = 16,456
Section 2.5 (page 100)
gxf 5725 1. (a) Q1 = 4.5, Q2 = 6, Q3 = 7.5
x = = = 114.5
n 50 (b)
g1x - x2 f 2
16,456
s = = L 18.33 1 4.5 6 7.5 9
C n - 1 A 49
0 1 2 3 4 5 6 7 8 9
43. Class f Midpoint, x xf
3. The basketball team scored more points per game than
0 4 19.9 2.0 39.80
75% of the teams in the league.
5 13 35.2 9.0 316.80
14 17 16.9 15.5 261.95 5. The student scored above 63% of the students who took
the ACT placement test.
18 24 29.8 21.0 625.80
7. True
25 34 38.3 29.5 1129.85
35 44 40.0 39.5 1580.00 9. False. The 50th percentile is equivalent to Q2.
45 64 78.3 54.5 4267.35 11. (a) Min = 10 13. (a) Min = 900
65+ 39.0 70.0 2730.00 (b) Max = 20 (b) Max = 2100
n = 297.4 gxf = 10,951.55 (c) Q1 = 13 (c) Q1 = 1250
1x x22 1x x22f
(d) Q2 = 15 (d) Q2 = 1500
x x (e) Q3 = 17 (e) Q3 = 1950
- 34.82 1212.43 24,127.36 (f) IQR = 4 (f) IQR = 700
- 27.82 773.95 27,243.04 15. (a) Min = -1.9
- 21.32 454.54 7,681.73 (b) Max = 2.1
- 15.82 250.27 7,458.05 (c) Q1 = -0.5
- 7.32 53.58 2,052.11 (d) Q2 = 0.1
2.68 7.18 287.20 (e) Q3 = 0.7
17.68 312.58 24,475.01
(f) IQR = 1.2
33.18 1100.91 42,935.49
17. Q1 = B, Q2 = A, Q3 = C, because about one quarter of
g1x - x22f = 136,259.99 the data fall on or below 17, 18.5 is the median of the
entire data set, and about three quarters of the data fall on
gxf 10,951.55 or below 20.
x = = L 36.82
n 297.4 19. (a) Q1 = 2, Q2 = 4, Q3 = 5
g1x - x22f 136,259.99 (b) Watching Television
s = = L 21.44
C n - 1 A 296.4
3.44 #
45. CVheights = 100 L 4.73 0 2 4 5 9
72.75
18.47 # 0 1 2 3 4 5 6 7 8 9
CVweights = 100 L 9.83 Hours
187.83
It appears that weight is more variable than height.
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A10 ODD ANSWERS
2 3 4 5 25 35 45 55 65 75 85
Wingspan (in inches) Ages
23. (a) 5 (b) 50% (c) 25% (c) Half of the ages are between 42 and 56 years.
25. A : z = -1.43 (d) 49, because half of the executives are older and half
are younger.
B:z = 0
41. 33.75
C : z = 2.14
43. 19.8
A z -score of 2.14 would be unusual.
73 - 63
27. (a) Statistics: z =
7
L 1.43 Uses and Abuses for Chapter 2 (page 105)
26 - 23 1. Answers will vary.
Biology: z = L 0.77
3.9 2. The salaries of employees at a business could contain an
(b) The student did better on the statistics test. outlier.
78 - 63 The median is not affected by an outlier because the
29. (a) Statistics: z = L 2.14 median does not take into account the outliers numerical
7
value.
29 - 23
Biology: z = L 1.54
3.9
Review Answers for Chapter 2 (page 107)
(b) The student did better on the statistics test.
34 ,000 - 35,000 1.
31. (a) z1 = L -0.44
2250 Mid- Frequency, Rel Cum
37,000 - 35,000 Class point Boundaries f freq freq
z2 = L 0.89
2250 2023 21.5 19.523.5 1 0.05 1
31,000 - 35,000 2427 25.5 23.527.5 2 0.10 3
z3 = L -1.78
2250 2831 29.5 27.531.5 6 0.30 9
None of the selected tires have unusual life spans. 3235 33.5 31.535.5 7 0.35 16
(b) For 30,500, 2.5th percentile 3639 37.5 35.539.5 4 0.20 20
f
For 37,250, 84th percentile gf = 20 g = 1
For 35,000, 50th percentile n
33. About 67 inches; 20% of the heights are below 67 inches. 3. Liquid Volume 12-oz Cans
74 - 69.2 12
35. z1 = L 1.66
2.9 10
Frequency
8
62 - 69.2 6
z2 = L -2.48
2.9 4
2
80 - 69.2
z3 = L 3.72
11.875
11.915
11.955
11.995
12.035
12.075
12.115
2.9
The heights that are 62 and 80 inches are unusual. Actual volume (in ounces)
71.1 - 69.2
37. z = L 0.66
2.9
About the 70th percentile
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
ODD ANSWERS A11
10
8
6
101112 106.5 100.5112.5 3 0.12 3
4 113124 118.5 112.5124.5 11 0.44 14
2
125136 130.5 124.5136.5 7 0.28 21
71
86
101
116
131
146
161
176
Relative frequency
4 347 Frequency 10 0.40
5 1 8 0.32
6 0.24
9. Height of Buildings 4 0.16
60 2 0.08
55
Number of stories
94.5
106.5
118.5
130.5
142.5
154.5
166.5
106.5
118.5
130.5
142.5
154.5
50
45
40 Minutes Minutes
35
30
25 (d) Skewed
20
(e) (f)
400 500 600 700 800
Height (in feet) Weekly Exercise Weekly Exercise
Cumulative frequency 25
The number of stories appears to increase with height. 20
101 117.5 123 131.5 157
11. American Kennel Club 15
Number registered
160 10
(in thousands)
2. 125.2, 13.0
Breed 3. (a) (b)
U.S. Sporting Goods U.S. Sporting Goods
(in billions of dollars)
24
Mode = 9 18
12
Clothing 6
19. Skewed 21. Skewed left 23. Median 25. 2.8 13%
Recreational
transport
Equipment
Clothing
Footwear
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A12 ODD ANSWERS
4. (a) 751.6, 784.5, none (c) Yes. City A has the highest mean and lowest range
The mean best describes a typical salary because there and standard deviation.
are no outliers. 4. (a) Tell your readers that on average, the price of
(b) 575; 48,135.1; 219.4 automobile insurance premiums is higher in this city
than in other cities.
5. Between $125,000 and $185,000
(b) Location, weather, population
6. (a) z = 3.0, unusual
(b) z L -6.67 , very unusual
(c) z L 1.33
(d) z = -2.2 , unusual
7. (a) 71, 84.5, 90
(b) 19
(c) Wins for Each Team
43 71 84.5 90 101
40 50 60 70 80 90 100
Number of wins
2200
(in dollars)
2000
1800
1600
1400
City A
City B
City D
City C
City
(c) Yes. From the Pareto chart you can see that City A has
the highest average automobile insurance premium
followed by City B, City D, and City C.
3. (a) Find the mean, range, and population standard
deviation for each city.
(b) City A City B
x = $2191.00 x = $2029.20
s L $351.86 s L $437.54
range = $1015.00 range = $1336.00
City C City D
x = $1772.00 x = $1909.30
s L $418.52 s L $361.14
range = $1347.00 range = $1125.00
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
SELECTED ANSWERS A1
Selected Answers
6. Use sampling because it would be impossible to ask every (b) and (c)
consumer whether he or she would still buy a product
Class Midpoint Class boundaries
with a warning label.
8. Take a census because the U.S. Congress keeps records on 16 20 18 15.520.5
the ages of its members. 21 25 23 20.525.5
10. Stratified sampling is used because the persons are divided 26 30 28 25.530.5
into strata and a sample is selected from each stratum. 31 35 33 30.535.5
12. Cluster sampling is used because the disaster area was 36 40 38 35.540.5
divided into grids and 30 grids were then entirely selected. 4145 43 40.545.5
Certain grids may have been much more severely damaged 46 50 48 45.550.5
than others, so this is a possible source of bias.
14. Systematic sampling is used because every twentieth 12.
engine part is sampled. It is possible for bias to enter into
Frequency, Mid- Relative Cumulative
the sample if, for some reason, the assembly line performs
Class f point frequency frequency
differently on a consistent basis.
18. Simple random sampling is used because each telephone 16 20 100 18 0.03 100
has an equal chance of being dialed and all samples of 21 25 122 23 0.04 222
1012 phone numbers have an equal chance of being 26 30 900 28 0.30 1122
selected. The sample may be biased because only homes 31 35 207 33 0.07 1329
with telephones have a chance of being sampled. 36 40 795 38 0.26 2124
20. Sampling. The population of cars is too large to easily 41 45 568 43 0.19 2692
record their color. Cluster sampling is advised because it
46 50 322 48 0.11 3014
would be easy to randomly select car dealerships then
record the color for every car sold at the selected f
gf = 3014 g = 1
dealerships. n
26. Stratified sampling ensures that each segment of the
population is represented.
28. (a) Advantage: Usually results in a savings in the survey
cost.
(b) Disadvantage: There tends to be a lower response
rate and this can introduce a bias into the sample.
Sampling technique: Convenience sampling
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A2 SELECTED ANSWERS
24. 30.
Frequency, Mid- Relative Cumulative Frequency, Mid- Relative Cumulative
Class f point frequency frequency Class f point frequency frequency
30 113 5 71.5 0.1724 5 10 23 11 16.5 0.3438 11
114197 7 155.5 0.2414 12 24 37 9 30.5 0.2813 20
198281 8 239.5 0.2759 20 38 51 6 44.5 0.1875 26
282365 2 323.5 0.0690 22 52 65 2 58.5 0.0625 28
366449 3 407.5 0.1034 25 66 80 4 72.5 0.1250 32
450533 4 491.5 0.1379 29
f
f gf = 32 g L 1
gf = 29 g = 1 n
n
ATM Withdrawals Class with greatest relative
26.
0.40 frequency: 10 23
Relative frequency
Frequency, Mid- Relative Cumulative 0.35
0.30 Class with least relative
Class f point frequency frequency 0.25
0.20
frequency: 5265
32 35 3 33.5 0.1250 3 0.15
0.10
36 39 9 37.5 0.3750 12 0.05
40 43 8 41.5 0.3333 20
16.5
30.5
44.5
58.5
72.5
44 47 3 45.5 0.1250 23 Dollars
48 51 1 49.5 0.0417 24
32.
f
gf = 24 g = 1
n Frequency, Mid- Relative Cumulative
Class f point frequency frequency
Pungencies of Peppers Cl a ss with greatest frequency: 7 8 7 7.5 0.28 7
9
36 39 9 10 8 9.5 0.32 15
8
7 Class with least frequency: 4851 11 12 6 11.5 0.24 21
Frequency
6
5 13 14 3 13.5 0.12 24
4
3
2 15 16 1 15.5 0.04 25
1
f
33.5 37.5 41.5 45.5 49.5 gf = 25 g = 1
Pungencies n
(in 1000s of Scoville units)
Acres on Small Farms Class with greatest relative
28. 0.35 frequency: 9 10
Relative frequency
0.30
Frequency, Mid- Relative Cumulative Class with least relative
0.25
Class f point frequency frequency 0.20
frequency: 1516
2456 2542 7 2499 0.28 7 0.15
0.10
2543 2629 3 2586 0.12 10 0.05
2630 2716 2 2673 0.08 12
7.5 9.5 11.513.515.5
27172803 4 2760 0.16 16 Acres
2804 2890 9 2847 0.36 25
34. Frequency, Relative Cumulative
f
gf = 25 g = 1 Class f frequency frequency
n
16 22 2 0.10 2
Pressure at Fracture Time Class with greatest frequency: 23 29 3 0.15 5
10 28042890 30 36 8 0.40 13
9
8
Class with least frequency: 37 43 5 0.25 18
Frequency
7
6
5 2630 2716 44 50 0 0.00 18
4
3
2 51 57 2 0.10 20
1
f
2499 2673 2847 gf = 20 g = 1
Pressure n
(in pounds per square inch)
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
SELECTED ANSWERS A3
Daily Saturated Fat Intake Location of the greatest 40. (a) SAT Scores
Cumulative frequency increase in frequency: 30 36 0.20
Relative frequency
20 0.18
0.16
15 0.14
0.12
10 0.10
0.08
5 0.06
0.04
0.02
15.5
22.5
29.5
36.5
43.5
50.5
57.5
457.5
553.5
649.5
745.5
841.5
937.5
1033.5
1129.5
1225.5
1321.5
Daily saturated fat intake
(in grams) SAT scores
36. (b) 48%, because the sum of the relative frequencies for
Frequency, Relative Cumulative
the last four classes is 0.48.
Class f frequency frequency
(c) 698, because the sum of the relative frequencies for
1 5 5 0.2083 5 the last seven classes is 0.88.
6 10 9 0.3750 14
11 15 3 0.1250 17 Section 2.2
16 20 4 0.1667 21
18. Advertisements
21 25 2 0.0833 23
26 30 1 0.0417 24
150 250 350 450 550 650 750 850
f
gf = 24 g = 1 Number of ads
n
It appears that most of the 30 people from the United
States see or hear between 450 and 750 advertisements
Length of Long-Distance Location of the greatest per week. (Answers will vary.)
Phone Calls increase in frequency: 6 10
Cumulative frequency
30
22. 2003 NASA Space
Shuttle Expenditures
25
Dollars (in millions)
20 700
15 600
500
10 400
5 300
200
100
0.5 10.5 20.5 30.5
Length of call (in minutes)
Vehicle and
extravehicular activity
Reusable solid
rocket motor
External tank
Main engine
Flight hardware
upgrades
Solid rocket booster
38.
n 6
2 1 4 7 10 13 16 19
Number of children
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
A4 SELECTED ANSWERS
7.00
6.50
6.00 145164 8 154.5 1236.0
5.50
5.00
165184 7 174.5 1221.5
185204 3 194.5 583.5
2001
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
205224 1 214.5 214.5
Year 225244 1 234.5 234.5
It appears that the price of a T-bone steak steadily N = 20 gxf = 3490.0
1x M22 1x M22f
increased from 1991 to 2001.
30. (a) The pie chart should be displaying all four quarters, x M
not just the first three. - 20 400 3200
(b) Sales for Company B 0 0 0
4th 1st
quarter quarter 20 400 1200
20% 20%
40 1600 1600
60 3600 3600
2nd
quarter
15% g1x - m22f = 9600
2 5 4
3 4 3
2
4 6
1
5 4
1 2 3 4 5 6
6 5 Number rolled
gf = 30
Uniform
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long
SELECTED ANSWERS A5
44.
Class f Midpoint, x xf Review Answers for Chapter 2
0.5 9.5 11.9 5 59.5 2. Income of Employees
Relative frequency
10.5 19.5 12.1 15 181.5 0.35
0.30
20.5 29.5 14.0 25 350.0 0.25
30.5 39.5 18.5 35 647.5 0.20
0.15
40.5 49.5 16.6 45 747.0 0.10
50.5 59.5 16.3 55 896.5 0.05
60.5 69.5 17.8 65 1157.0
21.5
25.5
29.5
33.5
37.5
70.5 79.5 12.4 75 930.0 Income
80.5 89.5 6.3 85 535.5 (in thousands of dollars)
90.5 99.5 1.3 95 123.5
The class with the greatest relative frequency is 3235 and
n = 127.2 gxf = 5628
that with the least is 2023.
x x 1x x22 1x x22 f 4. Liquid Volume 12-oz Cans
Relative frequency
0.45
- 39.25 1540.5625 18,332.69 0.40
0.35
- 29.25 855.5625 10,352.31 0.30
0.25
- 19.25 370.5625 5,187.88 0.20
0.15
- 9.25 85.5625 1,582.91 0.10
0.75 0.5625 9.34 0.05
11.875
11.915
11.955
11.995
12.035
12.075
12.115
20.75 430.5625 7,664.01
Actual volume (in ounces)
30.75 945.5625 11,724.98
40.75 1660.5625 10,461.54 6. Meals Purchased
50.75 2575.5625 3,348.23
Cumulative frequency
35
g1x - x22f = 70,547.56 30
25
gxf
20
5628 15
x = = L 44.25 10
n 127.2 5
g1x - x22 f 70,547.56
78.5
93.5
108.5
123.5
138.5
153.5
168.5
s = = L 23.64
C n - 1 A 126.2 Number of meals
13.8 15.125
15.8
17.65 19.45 CHAPTER 3
13.5 14.5 15.5 16.5 17.5 18.5 19.5
Hourly earnings
(in dollars)
TY1 AC QC TY2 FR Larson Texts, Inc Final Pages for Statistics 3e LARSON Short Long