
Chapter 1


The processing of statistical information has a history that extends back to the beginning of
mankind. In early biblical times nations compiled statistical data to provide descriptive
information about all sorts of things, such as taxes, wars, agricultural crops, and even
athletic events. Today, with the development of probability theory, we are able to use
statistical methods that not only describe important features of the data but also
allow us to proceed beyond the data into the area of decision making through generalizations
and predictions.

1.1 Descriptive Statistics

Definition 1.1.1
1. Statistics is a discipline of study dealing with the collection, analysis, interpretation,
and presentation of data.
2. Descriptive Statistics are methods concerned with collecting and describing a set of
data so as to yield meaningful information. They rely on graphs, charts, and tables
and on the calculation of various statistical measures to organize and summarize the data.

1.2 Inferential Statistics

Definition 1.2.1
1. A population is the complete collection of individuals, items, or data under
consideration in a statistical study.
2. A sample is a subset of a population. It is the portion of the population selected for analysis.
3. Inferential Statistics are methods concerned with the analysis of a subset of data
leading to predictions or inferences about the entire data set.

Example 1.2.2
1. Situation: Quiz in a Math 5 class of 50 students.
Population: 50 students
Sample: 11 students
Descriptive Statistics: 70% of the class, or 35 students, failed the quiz.
Inferential Statistics: The students in the Math 5 class should increase their studying time to
more than 1 hour, since 11 of the students who failed admitted that they
allocated only 1 hour of studying before the quiz.

2. Situation: Data from WeatherReports.com on the number of days of precipitation
in Cagayan de Oro City for the past 20 years.
Population: Number of days with precipitation in September for the last 20 years
Sample: Number of days with precipitation in September for the last 5 years
Descriptive Statistics:
Month                            Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
No. of days with precipitation    10   7   6   6   8  13  14  14  15  15  11  11
Inferential Statistics: Next year, September is expected to have an average of 15 days with
precipitation.

1.3 Variable, Observation and Data Set

Definition 1.3.1
1. A variable is a characteristic of interest concerning the individual elements of a
population or a sample.
2. An observation is the value of a variable for one particular element from the sample
or population.
3. A data set consists of the observations of a variable for the elements of a sample.

Example 1.3.2
1. Situation: Customers were asked about their favorite ice cream flavors sold at a grocery store.
Variable: Customers' choice of ice cream flavor
Observation: Mango flavor
Data set: {Mango flavor, Ube flavor, Vanilla flavor, . . .}

1.4 Quantitative Variable

Definition 1.4.1
1. A quantitative variable is a variable where the characteristic of interest results in
a numerical value.
2. A discrete variable is a quantitative variable whose values are countable. Discrete
variables usually result from counting.
3. A continuous variable is a quantitative variable that can assume any numerical value
over an interval or over several intervals. A continuous variable usually results from
making a measurement of some type.

Example 1.4.2
1. The following table gives several discrete variables and the set of possible values for each one.
In each case the value of the variable is determined by counting.

Discrete Variable                                    Possible Observations

The number of defective needles in                   0, 1, 2, ..., 100
100 boxes of syringes
The number of individuals with blood type A          0, 1, 2, ..., 30
in groups of 30
The number of students who passed out of the         0, 1, 2, ..., 150
150 who enrolled in Math 5 classes
2. The following table gives several continuous variables and the set of possible values for
each one. All three continuous variables involve measurement.

Continuous Variable                           Possible Observations

The length of time intended                   All the real numbers between a and b,
by students for studying                      where a is the smallest amount of time
                                              for studying and b is the largest

The household income for households with      All the real numbers between a
incomes less than or equal to Php20,000       and Php20,000, where a is the
                                              smallest household income in the
                                              population

The cholesterol reading for those             All real numbers between 200 and b,
individuals having cholesterol readings       where b is the largest cholesterol
equal to or greater than 200 mg/dl            reading of all such individuals

1.5 Qualitative Variable

Definition 1.5.1 A qualitative variable is a variable determined when the description
of the characteristic of interest results in a nonnumeric value.

Example 1.5.2 The following table gives several examples of qualitative variables along with
a set of categories into which they may be classified.

Qualitative Variable Possible Observations

Marital status Single, Married, Divorced
Gender Male, Female
Crime Classification Theft, Murder
Pain Level None, Low, Moderate, Extreme

1.6 Scales of Measurement

Data can be classified into four levels of measurement:
1. Nominal
2. Ordinal
3. Interval
4. Ratio

Definition 1.6.1 The nominal level of measurement is characterized by data that
consist of names, labels, or categories only. The nominal scale applies to data that are used
for category identification. Nominal scale data cannot be arranged in an ordering scheme.
The arithmetic operations of addition, subtraction, multiplication, and division are not
performed for nominal data.

Example 1.6.2 The following table gives several qualitative variables and a set of possible
nominal level data values.

Qualitative Variable Possible Observations

Blood type A, B, AB, O
City in the Philippines Manila, Cebu, Davao
Type of Crime Theft, Defamation
Religion Christian, Islam, other

Definition 1.6.3 The ordinal level of measurement is characterized by data that apply
to categories that can be ranked. The ordinal scale applies to data that can be arranged in
some order, but differences between data values either cannot be determined or are
meaningless. Ordinal scale data can be arranged in an ordering scheme.

Example 1.6.4 Table 1.6.4 gives several qualitative variables and a set of possible ordinal
level data values. Arithmetic operations are not performed on ordinal level data, but an
ordering scheme exists.

Qualitative Variable      Possible Observations

Pain level                None, low, moderate, severe
Socioeconomic class       Upper, middle, lower
Performance rating        Poor, good, excellent

Definition 1.6.5 The interval level of measurement results from counting or
measuring. Interval scale data can be arranged in an ordering scheme and differences can
be calculated and interpreted. The interval scale applies to data that can be arranged in
some order and for which differences in data values are meaningful. The value zero is
arbitrarily chosen for interval data and does not imply an absence of the characteristic being
measured. Ratios are not meaningful for interval data.

Example 1.6.6
1. IQ scores represent interval level data. Jose's IQ score equals 104 and Juan's IQ score equals
140. Juan has a higher IQ than Jose; that is, IQ scores can be arranged in order. Juan's IQ
score is 36 points higher than Jose's IQ score; that is, differences can be calculated and
interpreted. However, we cannot conclude that Juan is 1.3 times (140/104 ≈ 1.3) as
intelligent as Jose. An IQ score of zero does not indicate a complete lack of intelligence.
2. Test scores represent interval level data. Mara scored 90 on a test and Clara scored 70 on
a test. Mara scored higher than Clara did on the test; that is, the test scores can be arranged
in order. Mara scored 20 points higher than Clara did on the test; that is, differences can be
calculated and interpreted. We cannot conclude that Mara knows twice as much as Clara
about the subject matter. A test score of 0 does not indicate an absence of knowledge
concerning the subject matter.

Definition 1.6.7 The ratio level of measurement results from counting or measuring.
The ratio scale applies to data that can be ranked and for which all arithmetic operations
including division can be performed. Division by zero is, of course, excluded. Ratio scale
data can be arranged in an ordering scheme and differences and ratios can be calculated and
interpreted. Ratio data has an absolute zero and a value of zero indicates a complete absence
of the characteristic of interest.

Example 1.6.8 The grams of fat consumed per day for adults in the Philippines is ratio scale
data. Mark consumes 70 grams of fat per day and Anthony consumes 35 grams per day. Mark
consumes twice as much fat as Anthony per day, since 70/35 = 2. For an individual who
consumes 0 grams of fat on a given day, there is a complete absence of fat consumed on that
day. Notice that a ratio is interpretable and an absolute zero exists.

1.7 Summation and Notation

Suppose we have the following list of weight losses of individuals who took the fitness program:
15, 10, 18, 6, and we take the sum 15 + 10 + 18 + 6. Using the Greek letter Σ to indicate
“summation of,” we can write the sum of the four weights as $\sum_{i=1}^{4} x_i$, which we read
“summation of $x_i$, i going from 1 to 4.” The numbers 1 and 4 are called the lower and upper
limits of the summation. Hence

\[
\sum_{i=1}^{4} x_i = x_1 + x_2 + x_3 + x_4 = 15 + 10 + 18 + 6 = 49 .
\]

Also,

\[
\sum_{i=2}^{3} x_i = x_2 + x_3 = 10 + 18 = 28 .
\]

In general, the symbol $\sum_{i=1}^{n}$ means that we replace i wherever it appears after the
summation symbol by 1, then by 2, and so on up to n, and then add up the terms. Therefore we
can write

\[
\sum_{i=1}^{3} x_i^2 = x_1^2 + x_2^2 + x_3^2, \qquad
\sum_{j=2}^{5} x_j y_j = x_2 y_2 + x_3 y_3 + x_4 y_4 + x_5 y_5 .
\]

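Example 1.7.1 If $x_1 = 3$, $x_2 = 5$, and $x_3 = 7$, find

1. $\sum_{i=1}^{3} x_i$
Solution:
\[ \sum_{i=1}^{3} x_i = x_1 + x_2 + x_3 = 3 + 5 + 7 = 15 \]

2. $\sum_{i=1}^{3} 2x_i^2$
Solution:
\[ \sum_{i=1}^{3} 2x_i^2 = 2x_1^2 + 2x_2^2 + 2x_3^2 = 18 + 50 + 98 = 166 \]

3. $\sum_{i=2}^{3} (x_i - i)$
Solution:
\[ \sum_{i=2}^{3} (x_i - i) = (x_2 - 2) + (x_3 - 3) = 3 + 4 = 7 \]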
4. Σ (x +

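Example 1.7.2 Given $x_1 = 2$, $x_2 = -3$, $x_3 = 1$, $y_1 = 4$, $y_2 = 2$, and $y_3 = 5$, evaluate

1. $\sum_{i=1}^{3} x_i y_i$
Solution:
\[ \sum_{i=1}^{3} x_i y_i = x_1 y_1 + x_2 y_2 + x_3 y_3 = (2)(4) + (-3)(2) + (1)(5) = 7 \]

2. $\left( \sum_{i=2}^{3} x_i \right) \left( \sum_{j=1}^{2} y_j^2 \right)$
Solution:
\[ \left( \sum_{i=2}^{3} x_i \right) \left( \sum_{j=1}^{2} y_j^2 \right)
   = (x_2 + x_3)(y_1^2 + y_2^2) = (-2)(20) = -40 \]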
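
The rule of replacing i by each integer from the lower limit to the upper limit maps directly
onto a loop. The following is a minimal Python sketch, not part of the original notes; the
helper name `sigma` and the dictionaries keyed by subscript are illustrative assumptions, chosen
so that the 1-based subscripts of Examples 1.7.1 and 1.7.2 carry over unchanged.

```python
def sigma(term, lower, upper):
    """Add term(i) for i = lower, lower + 1, ..., upper (both limits included)."""
    return sum(term(i) for i in range(lower, upper + 1))

# Example 1.7.1: x1 = 3, x2 = 5, x3 = 7 (hypothetical dict keyed by subscript)
x = {1: 3, 2: 5, 3: 7}
total = sigma(lambda i: x[i], 1, 3)                   # sum of x_i, i = 1..3
twice_squares = sigma(lambda i: 2 * x[i] ** 2, 1, 3)  # sum of 2 * x_i^2, i = 1..3
shifted = sigma(lambda i: x[i] - i, 2, 3)             # sum of (x_i - i), i = 2..3

# Example 1.7.2: a sum of products versus a product of two sums
u = {1: 2, 2: -3, 3: 1}
v = {1: 4, 2: 2, 3: 5}
sum_of_products = sigma(lambda i: u[i] * v[i], 1, 3)
product_of_sums = sigma(lambda i: u[i], 2, 3) * sigma(lambda j: v[j] ** 2, 1, 2)

print(total, twice_squares, shifted, sum_of_products, product_of_sums)
```

The printed values are 15, 166, 7, 7, and -40, which agree with the hand computations in the
two examples.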

I. Classify each of the following as descriptive statistics or inferential statistics. Write DS
for descriptive statistics and IS for inferential statistics.
1. The average scores of the students and the percentages of periodical examination scores
are computed.
2. Ten percent of the instant noodles sampled by a quality technician are found to
be underweight. Based on this finding, the packing machine is adjusted to
increase the amount of pack.
3. Manila Bulletin published the numerical quantities concerning stocks of the
different Philippine corporations.
4. Based on a study of 500 single parent households by a social researcher, a
magazine reports that 25% of all single parent households are headed by a high
school dropout.

II. Identify each of the following as either a quantitative or a qualitative variable. Write QN
for quantitative variable and QL for qualitative variable.
1. The color of several vehicles that were involved in road accidents.
2. The length of time for the medicine to take effect after ingestion.
3. The rating given to a performer as poor, good, or excellent.
4. The country code assigned for each country.
5. The number of years a couple had been married.

III. Indicate the scale of measurement for each of the following variables. Write N for
nominal, O for ordinal, I for interval and R for ratio.
1. racial origin
2. military ranks
3. temperature scale
4. cellular phone numbers
5. medical diagnoses
IV. If $x_1 = 4$, $x_2 = -3$, $x_3 = 6$, and $x_4 = -1$, evaluate the following:
1. $\sum x_i^2 (x_i - 3)$
2. $\sum (x_i + 1)^2$
V. Given $x_1 = -2$, $x_2 = 3$, $x_3 = 1$, $y_1 = 4$, $y_2 = 0$, and $y_3 = 5$, find the value of
the following:
1. $\sum_{i=1}^{3} x_i y_i^2$
2. $\left( \sum_{i=1}^{2} x_i \right) \left( \sum_{i=2}^{3} y_i \right)$
II. Supply the empty spaces for sample or population that would be appropriate for the
corresponding data given.

Population                                   Sample
1. ________                                  1. A criminal justice study of 350 prison inmates
2. Legal aliens living in the Philippines    2. ________
3. Alzheimer patients in the Philippines     3. ________
4. ________                                  4. A psychological study of 200 individuals
                                                who suffer anemia
III. Identify the variable and the number of observations in the data set.
1. In a sociological study involving 38 low-income households, the number of children per
household was recorded for each household.
2. A national survey was conducted among 3000 households, and one question asked for
the number of televisions per household. One thousand five hundred households completed
the survey.
3. The number of hours of research work was determined for 25 college professors. The
minimum number was 0 hours and the maximum was 28 hours.
4. Classify the variables in numbers 1, 2 and 3 of III as discrete or continuous.
Chapter 2


Definition 2.1
1. Raw data are information obtained by observing values of a variable.
2. Data obtained by observing values of a qualitative variable are referred to as qualitative data.
3. Data obtained by observing values of a quantitative variable are referred to as
quantitative data.
4. Quantitative data obtained from a discrete variable are also referred to as discrete data.
5. Quantitative data obtained from a continuous variable are called continuous data.

2.1 Frequency Distribution for Qualitative Data

Definition 2.1.1 A frequency distribution for qualitative data is a list of all categories
and the number of elements that belong to each of the categories.

Example 2.1.2
1. The following are the offenses with which individuals were charged in PNP Tangub.
rape robbery burglary arson murder robbery rape defamation
arson theft arson burglary theft robbery theft theft
theft burglary murder murder theft theft theft defamation defamation

Make a frequency distribution of the data.

Answer :
Offense      Tally      Frequency
Rape         II         2
Robbery      III        3
Burglary     III        3
Arson        III        3
Murder       III        3
Theft        IIIIIIII   8
Defamation   III        3

2. The following are the flavors of ice cream sold at a grocery store, coded as 0-vanilla, 1-
chocolate, 2-ube, 3-mango, 4-melon, 5-banana, 6-durian and 7-avocado


1 5 1 6 2 7
1 4 1 4 3 5
0 6 0 3 4 6
7 6 6 2 1 7
2 2 2 0 7 2
Make a frequency distribution of the given data.
Answer :

Flavor       Tally    Frequency

Vanilla      III      3
Chocolate    IIIII    5
Ube          IIIIII   6
Mango        II       2
Melon        III      3
Banana       II       2
Durian       IIIII    5
Avocado      IIII     4
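
Tallying by hand is easy to get wrong, so it can help to let a program count the codes. The
following is a minimal Python sketch using the standard `collections.Counter`; the list name
`FLAVORS` and the variable names are illustrative assumptions, while the codes are the thirty
values of Example 2.1.2(2).

```python
from collections import Counter

# Code i stands for FLAVORS[i], following the coding scheme in Example 2.1.2(2).
FLAVORS = ["Vanilla", "Chocolate", "Ube", "Mango", "Melon", "Banana", "Durian", "Avocado"]

codes = [
    1, 5, 1, 6, 2, 7,
    1, 4, 1, 4, 3, 5,
    0, 6, 0, 3, 4, 6,
    7, 6, 6, 2, 1, 7,
    2, 2, 2, 0, 7, 2,
]

counts = Counter(codes)  # maps each code to the number of times it occurs
frequency = {FLAVORS[code]: counts[code] for code in range(len(FLAVORS))}

for flavor, f in frequency.items():
    # One "I" per observation reproduces the tally column.
    print(f"{flavor:<10} {'I' * f:<8} {f}")
```

The frequencies add up to 30, the number of observations, which is a quick check that no code
was skipped or counted twice.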

Definition 2.1.3 A relative frequency of a category is obtained by dividing the frequency
for a category by the sum of all the frequencies. The sum of all relative frequencies will always
be equal to one.

Definition 2.1.4 The percentage for a category is obtained by multiplying the relative
frequency for that category by 100. The sum of the percentages for all the categories will always
be equal to 100%.

Example 2.1.5
1. Make a relative frequency and percentage distribution of the offenses in Example 2.1.2(1).
Answer :

Offense      Frequency   Relative frequency   Percentage

Rape         2           2/25 = 0.08          0.08 × 100 = 8%
Robbery      3           3/25 = 0.12          0.12 × 100 = 12%
Burglary     3           3/25 = 0.12          0.12 × 100 = 12%
Arson        3           3/25 = 0.12          0.12 × 100 = 12%
Murder       3           3/25 = 0.12          0.12 × 100 = 12%
Theft        8           8/25 = 0.32          0.32 × 100 = 32%
Defamation   3           3/25 = 0.12          0.12 × 100 = 12%
2. Make a relative frequency and percentage distribution of the ice cream flavors in
Example 2.1.2(2).
Answer :
Flavor       Frequency   Relative frequency   Percentage
Vanilla      3           3/30 = 0.10          0.10 × 100 = 10%
Chocolate    5           5/30 ≈ 0.17          0.17 × 100 = 17%
Ube          6           6/30 = 0.20          0.20 × 100 = 20%
Mango        2           2/30 ≈ 0.07 ≈ 0.06   0.06 × 100 = 6%
Melon        3           3/30 = 0.10          0.10 × 100 = 10%
Banana       2           2/30 ≈ 0.07          0.07 × 100 = 7%
Durian       5           5/30 ≈ 0.17          0.17 × 100 = 17%
Avocado      4           4/30 ≈ 0.13          0.13 × 100 = 13%
(The Mango entry is lowered from 0.07 to 0.06 so that the rounded relative frequencies add up to 1.)
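
Definitions 2.1.3 and 2.1.4 amount to one division and one multiplication per category. The
following is a minimal Python sketch for the offenses of Example 2.1.2(1); the dictionary and
variable names are illustrative assumptions, and the frequencies are the ones counted from the
25 recorded offenses.

```python
# Frequencies of the 25 offenses in Example 2.1.2(1).
frequency = {
    "Rape": 2, "Robbery": 3, "Burglary": 3, "Arson": 3,
    "Murder": 3, "Theft": 8, "Defamation": 3,
}

n = sum(frequency.values())  # total number of observations

# Relative frequency = category frequency / sum of all frequencies.
relative = {offense: f / n for offense, f in frequency.items()}

# Percentage = relative frequency multiplied by 100.
percentage = {offense: 100 * r for offense, r in relative.items()}

for offense in frequency:
    print(f"{offense:<11} {frequency[offense]:>2} "
          f"{relative[offense]:.2f} {percentage[offense]:.0f}%")
```

Because every relative frequency here is an exact two-decimal value, the relative frequencies
sum to 1 and the percentages to 100 without any rounding adjustment.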

Definition 2.1.6 A bar graph is a graph composed of bars whose heights are the
frequencies of the different categories. A bar graph displays graphically the same information
concerning qualitative data that a frequency distribution shows in tabular form.

Example 2.1.7
1. Make a bar graph of the offenses in Example 2.1.2(1).
Answer :

2. Make a bar graph of the ice cream flavors in Example 2.1.2(2).

Answer :

Definition 2.1.8 A pie chart is also used to graphically display qualitative data. To construct
a pie chart, a circle is divided into portions that represent the relative frequencies or
percentages belonging to different categories. To construct a pie chart for the frequency
distribution, construct a table that gives angle sizes for each category. The 360° in a circle are
divided into portions that are proportional to the category sizes.
Example 2.1.9
1. Make a pie graph of the offenses in Example 2.1.2(1).
Answer :

Offenses     Relative frequency   Angle size

Rape         0.08                 360 × 0.08 = 28.8°
Robbery      0.12                 360 × 0.12 = 43.2°
Burglary     0.12                 360 × 0.12 = 43.2°
Arson        0.12                 360 × 0.12 = 43.2°
Murder       0.12                 360 × 0.12 = 43.2°
Theft        0.32                 360 × 0.32 = 115.2°
Defamation   0.12                 360 × 0.12 = 43.2°

2. Make a pie graph of the flavors in Example 2.1.2(2).

Answer :

Flavors      Relative frequency   Angle size

Vanilla      0.10                 360 × 0.10 = 36°
Chocolate    0.17                 360 × 0.17 = 61.2°
Ube          0.20                 360 × 0.20 = 72°
Mango        0.06                 360 × 0.06 = 21.6°
Melon        0.10                 360 × 0.10 = 36°
Banana       0.07                 360 × 0.07 = 25.2°
Durian       0.17                 360 × 0.17 = 61.2°
Avocado      0.13                 360 × 0.13 = 46.8°
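
The angle column of a pie chart table is the relative frequency scaled by 360. The following
is a minimal Python sketch for the offenses table of Example 2.1.9(1); the variable names are
illustrative assumptions, and rounding to one decimal place is a choice made here to match the
precision of the table.

```python
# Relative frequencies of the offenses, as listed in Example 2.1.9(1).
relative = {
    "Rape": 0.08, "Robbery": 0.12, "Burglary": 0.12, "Arson": 0.12,
    "Murder": 0.12, "Theft": 0.32, "Defamation": 0.12,
}

# Each category receives a share of the 360 degrees proportional to its relative frequency.
angles = {offense: round(360 * r, 1) for offense, r in relative.items()}

for offense, angle in angles.items():
    print(f"{offense:<11} {angle}")

# The slices must fill the whole circle.
total_angle = sum(angles.values())
```

Checking that the angles add up to 360 is a simple way to catch a relative frequency that was
copied or rounded incorrectly.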
2.2 Frequency Distribution for Quantitative Data
Consider a frequency distribution of test scores for 75 students in a college entrance examination:

IQ Score Frequency
80-94 8
95-109 14
110-124 24
125-139 16
140-154 13
Test score is a quantitative variable and according to the above table, eight of the individuals
have scores between 80 and 94, fourteen have scores between 95 and 109, twenty-four have
scores between 110 and 124, sixteen have scores between 125 and 139, and thirteen have
scores between 140 and 154.

Class Limits, Class Boundaries, Class Marks, and Class Width

The frequency distribution given in the preceding table is composed of five
classes. The classes are: 80−94, 95−109, 110−124, 125−139, and 140−154.
Each class has a lower class limit and an upper class limit.
The lower class limits for this distribution are 80, 95, 110, 125, and 140.
The upper class limits are 94, 109, 124, 139, and 154.
If the lower class limit for the second class, 95, is added to the upper class limit for the first
class, 94, and the sum is divided by 2, the upper boundary for the first class and the lower
boundary for the second class are determined: (94 + 95)/2 = 94.5.
The next table gives all the boundaries for the preceding table.
If the lower class limit is added to the upper class limit for any class and the sum is divided
by 2, the class mark for that class is obtained.
The class mark for a class is the midpoint of the class and is sometimes called the class
midpoint rather than the class mark.
The difference between the boundaries for any class gives the class width for a
distribution. The class width for the distribution in the preceding table is 15.

Class Limits Frequency Class Boundaries Class Width Class Marks

80-94 8 79.5-94.5 15 87.0
95-109 14 94.5-109.5 15 102.0
110-124 24 109.5-124.5 15 117.0
125-139 16 124.5-139.5 15 132.0
140-154 13 139.5-154.5 15 147.0
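
The boundaries, widths, and marks in this table follow mechanically from the class limits. The
following is a minimal Python sketch; the function name `class_summary` is hypothetical, and it
assumes that the gap between consecutive classes is the same everywhere, so that half of that
gap can also be used to extend the first and the last class.

```python
def class_summary(limits):
    """Return (lower boundary, upper boundary, width, mark) for each class."""
    # Assumption: one common gap between an upper limit and the next lower limit.
    half_gap = (limits[1][0] - limits[0][1]) / 2
    rows = []
    for lower, upper in limits:
        lower_boundary = lower - half_gap
        upper_boundary = upper + half_gap
        width = upper_boundary - lower_boundary  # difference between the boundaries
        mark = (lower + upper) / 2               # midpoint of the class limits
        rows.append((lower_boundary, upper_boundary, width, mark))
    return rows

limits = [(80, 94), (95, 109), (110, 124), (125, 139), (140, 154)]
summary = class_summary(limits)

for (lower, upper), row in zip(limits, summary):
    print(f"{lower}-{upper}", *row)
```

For the first class this gives boundaries 79.5 and 94.5, width 15, and class mark 87, in
agreement with the table.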
When forming a frequency distribution, the following general guidelines should be followed:
1. Each data value must belong to one, and only one, class.
2. When possible, all classes should be of equal width.

Definition 2.2.1
A data set consisting of the observations for some variable is referred to as
raw data or ungrouped data.
Data presented in the form of a frequency distribution are called grouped data.

The following were grades obtained by students in their preliminary examination:

Ungrouped data        Grouped data

80   87               Grades   Frequency
92   90               80-85    3
95   83               86-91    3
83   88               92-97    2

Question: How do we turn ungrouped data into grouped data if the number of classes is not given?

1. Solve for $C = \sqrt{n}$, where n = number of data values.
If C is not a whole number, then round C up to the next whole number.
The value C will be the number of classes in the desired frequency distribution.

2. Solve for $D = \dfrac{\text{highest value} - \text{lowest value}}{C}$.
The value D must be of the same unit as the given data. For example, if the given data
are whole numbers and D has decimal digits, then D must be rounded up to
the next whole number.

3. Therefore, for the first class,
LCL = lowest value
UCL = lowest value + D
Continue the process to arrive at C classes.
Given: Grades obtained by students in their preliminary examination
80 87
92 90
95 83
83 88
Solve for $C = \sqrt{n} = \sqrt{8} = 2.83$.
Since C is not a whole number, C ≈ 3 is the number of classes.
Solve for
\[ D = \frac{\text{highest value} - \text{lowest value}}{C} = \frac{95 - 80}{3} = \frac{15}{3} = 5 . \]
Therefore, LCL1 = smallest value = 80
UCL1 = smallest value + D = 80 + 5 = 85
LCL2 = 86
UCL2 = 86 + D = 86 + 5 = 91
LCL3 = 92
UCL3 = 92 + D = 92 + 5 = 97
Thus, we have the following frequency distribution
Grades frequency
80-85 3
86-91 3
92-97 2
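
The three steps above can be sketched as a short program. The following is a minimal Python
sketch rather than a definitive implementation; the function name `group_data` is hypothetical,
and the code assumes whole-number data, so that each new class starts one unit above the
previous upper class limit, as in the worked example.

```python
import math

def group_data(values):
    """Group whole-number data into ceil(sqrt(n)) classes; returns (LCL, UCL, frequency)."""
    n = len(values)
    c = math.ceil(math.sqrt(n))                     # step 1: number of classes C, rounded up
    d = math.ceil((max(values) - min(values)) / c)  # step 2: D, rounded up to a whole number
    classes = []
    lower = min(values)                             # step 3: first LCL is the lowest value
    for _ in range(c):
        upper = lower + d                           # UCL = LCL + D
        freq = sum(lower <= v <= upper for v in values)
        classes.append((lower, upper, freq))
        lower = upper + 1                           # next LCL (assumes a unit of 1)
    return classes

grades = [80, 92, 95, 83, 87, 90, 83, 88]
grade_classes = group_data(grades)
for lcl, ucl, freq in grade_classes:
    print(f"{lcl}-{ucl} {freq}")
```

For the eight grades this reproduces the classes 80-85, 86-91, and 92-97 with frequencies 3, 3,
and 2. Data measured to two decimal places would need the unit 0.01 in place of 1.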

Example 2.2.2
1. The following are the scores of an 80–item exam. Make a frequency distribution of the
data containing class boundaries, class width and class mark.

50 65 70 35 40 57 66 65 70 35
29 33 44 56 66 60 44 50 58 46
67 78 79 47 35 36 44 57 60 57
Answer :
Solve for $C = \sqrt{n} = \sqrt{30} = 5.48$.
Since C is not a whole number, C ≈ 6 is the number of classes.
Solve for
\[ D = \frac{\text{highest value} - \text{lowest value}}{C} = \frac{79 - 29}{6} = \frac{50}{6} \approx 8.33 . \]
Since D is not a whole number and the data are whole numbers, D ≈ 9.
Therefore, LCL1 = smallest value = 29
UCL1 = smallest value + D = 29 + 9 = 38
LCL2 = 39
UCL2 = 39 + D = 39 + 9 = 48
LCL3 = 49
UCL3 = 49 + D = 49 + 9 = 58
LCL4 = 59
UCL4 = 59 + D = 59 + 9 = 68
LCL5 = 69
UCL5 = 69 + D = 69 + 9 = 78
LCL6 = 79
UCL6 = 79 + D = 79 + 9 = 88

Exam Scores Frequency Class Boundaries Class Width Class Marks

29-38 6 28.5-38.5 10 33.5
39-48 6 38.5-48.5 10 43.5
49-58 7 48.5-58.5 10 53.5
59-68 7 58.5-68.5 10 63.5
69-78 3 68.5-78.5 10 73.5
79-88 1 78.5-88.5 10 83.5
2. The following are the working hours of the employees in a company. Make a frequency
distribution of the data containing class boundaries, class width and class mark.

9.00 7.60 8.26 8.30 8.21 7.90 8.21 8.31 8.57 8.86
8.86 8.04 7.70 7.82 7.82 8.04 8.28 8.30 9.01 8.87
8.50 7.90 8.30 8.04 8.26 8.27 8.50 8.57 8.51 8.87
8.51 8.26 8.21 8.04 7.82 8.04 8.30 8.50 8.86 8.51
Answer :
Solve for $C = \sqrt{n} = \sqrt{40} = 6.32$.
Since C is not a whole number, C ≈ 7 is the number of classes.
Solve for
\[ D = \frac{\text{highest value} - \text{lowest value}}{C} = \frac{9.01 - 7.57}{7} = \frac{1.44}{7} \approx 0.21 . \]
Therefore, LCL1 = smallest value = 7.57
UCL1 = smallest value + D = 7.57 + 0.21 = 7.78
LCL2 = 7.79
UCL2 = 7.79 + D = 7.79 + 0.21 = 8.00
LCL3 = 8.01
UCL3 = 8.01 + D = 8.01 + 0.21 = 8.22
LCL4 = 8.23
UCL4 = 8.23 + D = 8.23 + 0.21 = 8.44
LCL5 = 8.45
UCL5 = 8.45 + D = 8.45 + 0.21 = 8.66
LCL6 = 8.67
UCL6 = 8.67 + D = 8.67 + 0.21 = 8.88
LCL7 = 8.89
UCL7 = 8.89 + D = 8.89 + 0.21 = 9.10
Working hours Frequency Class Boundaries Class Width Class Marks
7.57-7.78 2 7.565-7.785 0.22 7.675
7.79-8.00 5 7.785-8.005 0.22 7.895
8.01-8.22 8 8.005-8.225 0.22 8.115
8.23-8.44 10 8.225-8.445 0.22 8.335
8.45-8.66 8 8.445-8.665 0.22 8.555
8.67-8.88 5 8.665-8.885 0.22 8.775
8.89-9.10 2 8.885-9.105 0.22 8.995
1. Group the following weights into the classes 100 to under 125, 125 to under 150, and so forth:
111 120 127 129 130 145 145 150 153 155 160
161 165 167 170 171 174 175 177 179 180 180
185 185 190 195 195 201 210 220 224 225 230
245 248
Make a frequency distribution of the data containing class boundaries, class width and class mark.

2. The price for 500 aspirin tablets is determined for each of twenty randomly selected stores
as part of a larger consumer study. The prices are as follows:
2.50 2.95 2.65 3.10 3.15 3.05 3.05 2.60 2.70 2.75
2.80 2.80 2.85 2.80 3.00 3.00 2.90 2.90 2.85 2.85
Group these data into seven classes and make a frequency distribution of the data containing
class boundaries, class width and class mark.

2.3 Histograms
A histogram is a graph that displays the classes on the horizontal axis and the frequencies of
the classes on the vertical axis. The frequency of each class is represented by a vertical bar
whose height is equal to the frequency of the class. A histogram is similar to a bar graph.
However, a histogram utilizes classes or intervals and frequencies while a bar graph utilizes
categories and frequencies.

Example 2.3.1 A histogram for the aspirin prices in Example ??

A symmetric histogram is one that can be divided into two pieces such that each is the
mirror image of the other.

This type of histogram is often referred to as a mound-shaped histogram or a bell-shaped
histogram. A symmetric histogram in which each class has the same frequency is called a
uniform or rectangular histogram.

A skewed to the right histogram has a longer tail on the right side. The histogram shown
above is skewed to the right.

A skewed to the left histogram has a longer tail on the left side. The histogram shown
above is skewed to the left.

2.4 Cumulative Frequency Distribution and Cumulative Relative Frequency

A cumulative frequency distribution gives the total number of values that fall below
various class boundaries of a frequency distribution. A cumulative relative frequency is
obtained by dividing a cumulative frequency by the total number of observations in the data
set. Cumulative percentages are obtained by multiplying cumulative relative frequencies
by 100.

Example 2.4.1
1. Make a cumulative and cumulative relative frequency distribution of the scores of the 80–
item exam.
Answer :

Exam Scores Frequency Relative frequency Percentage

29-38 6 0.20 20%
39-48 6 0.20 20%
49-58 7 0.24 24%
59-68 7 0.23 23%
69-78 3 0.10 10%
79-88 1 0.03 3%

Exam Scores   Cumulative Frequency   Cumulative Relative Frequency   Cumulative Percentage

29-38 6 0.20 20%
39-48 12 0.40 40%
49-58 19 0.64 64%
59-68 26 0.87 87%
69-78 29 0.97 97%
79-88 30 1.00 100%
2. Make a cumulative and cumulative relative frequency distribution of the working hours of the
employees.
Answer :

Working hours Frequency Relative frequency Percentage

7.57-7.78 2 0.05 5%
7.79-8.00 5 0.13 13%
8.01-8.22 8 0.20 20%
8.23-8.44 10 0.25 25%
8.45-8.66 8 0.20 20%
8.67-8.88 5 0.13 13%
8.89-9.10 2 0.04 4%

Working Hours   Cumulative Frequency   Cumulative Relative Frequency   Cumulative Percentage

7.57-7.78 2 0.05 5%
7.79-8.00 7 0.18 18%
8.01-8.22 15 0.38 38%
8.23-8.44 25 0.63 63%
8.45-8.66 33 0.83 83%
8.67-8.88 38 0.96 96%
8.89-9.10 40 1.00 100%
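
A cumulative frequency column is a running total of the frequency column. The following is a
minimal Python sketch using the standard `itertools.accumulate` on the class frequencies of the
exam scores in Example 2.4.1(1); the variable names are illustrative assumptions. It divides
each cumulative frequency by n, as in the definition, so a value may differ in the last
reported digit from a table built by adding already-rounded relative frequencies.

```python
from itertools import accumulate

# Class frequencies of the exam scores (classes 29-38, 39-48, ..., 79-88).
frequencies = [6, 6, 7, 7, 3, 1]
n = sum(frequencies)

# Running totals: number of values at or below each upper class boundary.
cumulative = list(accumulate(frequencies))

# Cumulative relative frequency = cumulative frequency / total number of observations.
cumulative_relative = [cf / n for cf in cumulative]

# Cumulative percentage = cumulative relative frequency multiplied by 100.
cumulative_percentage = [100 * r for r in cumulative_relative]

for cf, r, p in zip(cumulative, cumulative_relative, cumulative_percentage):
    print(f"{cf:>2} {r:.2f} {p:.0f}%")
```

The last cumulative frequency always equals n, and the last cumulative relative frequency
always equals 1.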

2.5 Ogives

An ogive is a graph in which a point is plotted above each class boundary at a height equal to
the cumulative frequency corresponding to that boundary. Ogives can also be constructed for
a cumulative relative frequency distribution as well as a cumulative percentage distribution.
The following table is the ogive of the preceding data.

2.6 Stem-and-Leaf Display

In a stem-and-leaf display each value is divided into a stem and a leaf. The leaves for each
stem are shown separately. The stem-and-leaf diagram preserves the information on
individual observations.

Example 2.6.1 The following are the scores of an 80–item exam.

50 65 70 35 40 57 66 65 70 35
29 33 44 56 66 60 44 50 58 46
67 78 79 47 35 36 44 57 60 57

Thus, we have the stem-and-leaf display of the data. The first row represents the number 29,
the second row represents the numbers 33, 35, 35, 35, and 36, etc. The first column in the plot
is a cumulative frequency that starts at both ends of the data and meets in the middle. The row
that contains the median of the data is marked with parentheses around the count of
observations for that row. For the rows above the median, the number in the first column is
the number of items in that row plus the number of items in all the rows above. Rows below the
median are just the opposite.

Count   Stem | Leaves
   1      2  | 9
   6      3  | 3 5 5 5 6
  12      4  | 0 4 4 4 6 7
  (7)     5  | 0 0 6 7 7 7 8
  11      6  | 0 0 5 5 6 6 7
   4      7  | 0 0 8 9
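
Splitting each score into a tens digit (the stem) and a units digit (the leaf) can be done with
integer division. The following is a minimal Python sketch for two-digit whole numbers; the
function name `stem_and_leaf` is hypothetical, and the cumulative count column of the display
above is deliberately left out.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Map each stem (tens digit) to its sorted list of leaves (units digits)."""
    stems = defaultdict(list)
    for value in sorted(values):         # sorting first keeps the leaves in order
        stem, leaf = divmod(value, 10)   # e.g. 57 -> stem 5, leaf 7
        stems[stem].append(leaf)
    return dict(stems)

scores = [50, 65, 70, 35, 40, 57, 66, 65, 70, 35,
          29, 33, 44, 56, 66, 60, 44, 50, 58, 46,
          67, 78, 79, 47, 35, 36, 44, 57, 60, 57]

display = stem_and_leaf(scores)
for stem, leaves in display.items():
    print(stem, "|", " ".join(str(leaf) for leaf in leaves))
```

Every original score can be read back from the display as 10 × stem + leaf, which is what is
meant by preserving the information on individual observations.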
Chapter 3


3.1 Parameters and Statistics

Definition 3.1.1
Any numerical value describing a characteristic of a population is called a
parameter. Any numerical value describing a characteristic of a sample is called a
statistic.

3.2 Measures of Central Location for Ungrouped Data

Definition 3.2.1 Measure of central location or measure of central tendency – measure
indicating the center of a set of data, arranged in an increasing or decreasing order of
magnitude.

Definition 3.2.2
If the set of data $x_1, x_2, \ldots, x_N$, not necessarily all distinct, represents a finite
population of size N, then the population mean is
\[ \mu = \frac{\sum_{i=1}^{N} x_i}{N} . \]
If the set of data $x_1, x_2, \ldots, x_n$, not necessarily all distinct, represents a finite
sample of size n, then the sample mean is
\[ \bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} . \]
Example 3.2.3
1. Compute the sample mean of the grades obtained by students in their preliminary
examination:
80 87
92 90
95 83
83 88

Since n = 8 and
\[ \sum_{i=1}^{8} x_i = x_1 + x_2 + x_3 + x_4 + x_5 + x_6 + x_7 + x_8
   = 80 + 92 + 95 + 83 + 87 + 90 + 83 + 88 = 698, \]
the sample mean is
\[ \bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{\sum_{i=1}^{8} x_i}{8} = \frac{698}{8} = 87.25 . \]
2. The following were the working hours of Mary on seventeen days of February: 8.76, 8.88,
9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19. Find the
mean for this sample of hours.
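Since n = 17 and
\[ \sum_{i=1}^{17} x_i = x_1 + x_2 + x_3 + \cdots + x_{16} + x_{17}
   = 8.76 + 8.88 + 9.2 + \cdots + 8.49 + 9.19 = 147.62, \]
the sample mean is
\[ \bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} = \frac{\sum_{i=1}^{17} x_i}{17} = \frac{147.62}{17} \approx 8.68 . \]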
3. If a class of 40 students has a total preliminary grade of 3612, what is the population mean
of the grades?
Given N = 40 and $\sum_{i=1}^{40} x_i = 3612$. Hence,
\[ \mu = \frac{\sum_{i=1}^{40} x_i}{N} = \frac{3612}{40} = 90.30 . \]
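
The mean formula is a sum followed by one division, whether the data form a sample or a
population. The following is a minimal Python sketch that recomputes the three parts of
Example 3.2.3; the function name `mean` and the variable names are illustrative assumptions.

```python
def mean(values):
    """Arithmetic mean: the sum of the observations divided by their number."""
    return sum(values) / len(values)

grades = [80, 92, 95, 83, 87, 90, 83, 88]
hours = [8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89,
         8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19]

grade_mean = mean(grades)            # sample mean of the eight grades
hours_mean = round(mean(hours), 2)   # sample mean of the seventeen days, two decimals

# Part 3 gives only the total and the size, which is all the formula needs.
population_mean = 3612 / 40

print(grade_mean, hours_mean, population_mean)
```

Note that part 3 needs no individual grades at all: the total and the population size already
determine the mean.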

Definition 3.2.4 The median of a set of observations arranged in an increasing or
decreasing order of magnitude is the middle value when the number of observations is odd or
the arithmetic mean of the two middle values when the number of observations is even.
Example 3.2.5
1. Compute the sample median of the grades obtained by students in their preliminary
examination:
80 87
92 90
95 83
83 88
Arranging the grades in increasing order of magnitude, we get
80 83 83 87 88 90 92 95
Since the number of observations is even and the middle values are 87 and 88, the sample
median is
\[ \tilde{x} = \frac{87 + 88}{2} = 87.5 . \]
2. The following were the working hours of Mary on seventeen days of February: 8.76, 8.88,
9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19. Find the
median for this population of hours.
Arranging the hours in increasing order of magnitude, we get

7.99 8.00 8.01 8.10 8.49 8.66 8.67 8.67 8.76 8.76 8.88 8.89 9.02 9.12 9.19 9.2 9.21

Since the number of observations is odd, the median is the middle (9th) value,
and hence the population median is $\tilde{\mu} = 8.76$.
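
Definition 3.2.4 distinguishes an odd from an even number of observations, and that case split
is the whole algorithm. The following is a minimal Python sketch; the function name `median` is
an illustrative assumption (Python's standard `statistics.median` performs the same
computation).

```python
def median(values):
    """Middle value of the sorted data, or the mean of the two middle values."""
    ordered = sorted(values)
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        return ordered[middle]                          # odd n: the single middle value
    return (ordered[middle - 1] + ordered[middle]) / 2  # even n: average the middle pair

grades = [80, 92, 95, 83, 87, 90, 83, 88]
hours = [8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89,
         8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19]

grade_median = median(grades)  # 8 values: average of the 4th and 5th ordered values
hours_median = median(hours)   # 17 values: the 9th ordered value

print(grade_median, hours_median)
```

The results, 87.5 and 8.76, match the two parts of Example 3.2.5.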

Definition 3.2.6 The mode of a set of observations is that value which occurs most often or
with the greatest frequency.

Remark 3.2.7 The mode does not always exist. This is certainly true when all observations
occur with the same frequency. If no such value exists, we say that the data set has no mode.
For some sets of data there may be several values occurring with the greatest frequency, in
which case we have more than one mode. If two such values exist, we say the data set is
bimodal. If three such values exist, we say the data set is trimodal. There is no symbol that
is used to represent the mode.

Example 3.2.8
1. Find the mode of the grades obtained by students in their preliminary examination
80 87
92 90
95 83
83 88
The value that occurs with the greatest frequency is 83. Thus, the mode is 83.

2. Find the mode in the working hours of Mary on seventeen days of February: 8.76, 8.88,
9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19.
The values that occur with the greatest frequency are 8.67 and 8.76. Therefore, the
modes are 8.67 and 8.76. In this case the set of data has two modes and is called bimodal.
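
Finding the mode means counting how often each value occurs and keeping the values with the
highest count. The following is a minimal Python sketch; the function name `modes` is
hypothetical, and returning an empty list when several distinct values all share one frequency
is an assumption made here to mirror Remark 3.2.7.

```python
from collections import Counter

def modes(values):
    """Return the sorted list of modes; an empty list means the data set has no mode."""
    counts = Counter(values)
    highest = max(counts.values())
    # Assumption: if several distinct values all occur equally often, report no mode.
    if len(counts) > 1 and all(c == highest for c in counts.values()):
        return []
    return sorted(value for value, c in counts.items() if c == highest)

grades = [80, 92, 95, 83, 87, 90, 83, 88]
hours = [8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89,
         8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19]

grade_modes = modes(grades)  # one mode
hours_modes = modes(hours)   # two modes, so the data set is bimodal

print(grade_modes, hours_modes)
```

The length of the returned list tells whether the data set has no mode, one mode, or is
bimodal or trimodal.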

3.3 Measures of Dispersion for Ungrouped Data

Definition 3.3.1 Measure of Dispersion or Variation - measure that describes the variability
of a data set.
Definition 3.3.2 The range for a data set is equal to the difference of the maximum value
and the minimum value in the data set.

Example 3.3.3
1. Find the range of the grades obtained by students in their preliminary examination
80 87
92 90
95 83
83 88
The maximum value is 95 and the minimum value is 80. Thus the range is 95 − 80 = 15.

2. Find the range in the working hours of Mary on seventeen days of February: 8.76, 8.88,
9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19.
The maximum value is 9.21 and the minimum value is 7.99. Thus the range is 9.21 − 7.99 = 1.22

Variance and Standard Deviation

Definition 3.3.4 The variance and the standard deviation of a data set measure the
spread of the data about the mean of the data set. The variance of a sample of size n is
represented by $s^2$ and is given by
$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}$$
or
$$s^2 = \frac{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2}{n(n - 1)}$$
and the variance of a population of size N is represented by $\sigma^2$ and is given by
$$\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}$$
or
$$\sigma^2 = \frac{N \sum_{i=1}^{N} x_i^2 - \left(\sum_{i=1}^{N} x_i\right)^2}{N^2}$$

The sample standard deviation is
$$s = \sqrt{s^2}$$
and the population standard deviation is
$$\sigma = \sqrt{\sigma^2}$$

Example 3.3.5
1. Find the variance and sample standard deviation of the grades obtained by students in their
preliminary examination
80 87
92 90
95 83
83 88
xi xi2
80 6400
83 6889
83 6889
87 7569
88 7744
90 8100
92 8464
95 9025
Total 698 61080

Since n = 8, $\sum_{i=1}^{8} x_i = 698$, and $\sum_{i=1}^{8} x_i^2 = 61080$, the variance is
$$\begin{aligned}
s^2 &= \frac{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2}{n(n - 1)} \\
    &= \frac{8(61080) - (698)^2}{8(8 - 1)} \\
    &= \frac{488640 - 487204}{56} \\
    &= 25.64
\end{aligned}$$
and so the standard deviation is
$$s = \sqrt{s^2} = \sqrt{25.64} = 5.06.$$

2. Find the variance and standard deviation of the working hours of Mary on seventeen days
of February: 8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66, 8.00, 8.01,
8.10, 8.49, 9.19.
xi xi2
8.76 76.7376
8.88 78.8544
9.2 84.64
9.02 81.3604
7.99 63.8401
8.67 75.1689
9.21 84.8241
9.12 83.1744
8.89 79.0321
8.67 75.1689
8.76 76.7376
8.66 74.9956
8.00 64.00
8.01 64.1601
8.10 65.61
8.49 72.0801
9.19 84.4561
Total 147.62 1284.8404

Since n = 17, $\sum_{i=1}^{17} x_i = 147.62$, and $\sum_{i=1}^{17} x_i^2 = 1284.8404$, the variance is
$$\begin{aligned}
s^2 &= \frac{n \sum_{i=1}^{n} x_i^2 - \left(\sum_{i=1}^{n} x_i\right)^2}{n(n - 1)} \\
    &= \frac{17(1284.8404) - (147.62)^2}{17(17 - 1)} \\
    &= \frac{21842.2868 - 21791.6644}{272} \\
    &= 0.19
\end{aligned}$$
and so the standard deviation is
$$s = \sqrt{s^2} = \sqrt{0.19} = 0.44.$$
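The computational formula for the sample variance can be checked with a short Python sketch. The function name `sample_variance` is an assumption made for this illustration; the standard library's `statistics.variance` is used only as an independent cross-check.

```python
import math
import statistics

def sample_variance(xs):
    """Sample variance via (n*sum(x^2) - (sum x)^2) / (n*(n - 1))."""
    n = len(xs)
    sum_x = sum(xs)
    sum_x2 = sum(x * x for x in xs)
    return (n * sum_x2 - sum_x ** 2) / (n * (n - 1))

grades = [80, 87, 92, 90, 95, 83, 83, 88]
hours = [8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89,
         8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19]

var_grades = sample_variance(grades)
sd_grades = math.sqrt(var_grades)
var_hours = sample_variance(hours)

print(round(var_grades, 2), round(sd_grades, 2))
print(round(var_hours, 2))

# Cross-check against the definition-based formula in the standard library.
print(abs(var_grades - statistics.variance(grades)) < 1e-9)
```

Note that the code takes the square root of the unrounded variance, whereas the worked examples take the square root of the variance already rounded to two decimals, so the last digit of a standard deviation can differ slightly.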

3.4 Measures of Central Location for Grouped Data

The mean for grouped data is given by
$$\bar{x} = \frac{\sum_{i=1}^{m} c_i f_i}{n},$$
where m is the number of classes, n is the size of the sample or population, $c_i$ represents the class
marks, and $f_i$ represents the class frequencies.

Example 3.4.1
1. Compute the mean of the grouped data

Grades frequency Class Boundaries Class Width Class Mark

80-85 3 79.5-85.5 6 82.5
86-91 3 85.5-91.5 6 88.5
92-97 2 91.5-97.5 6 94.5
The sample size is n = 8 and the number of classes is m = 3. The class marks in the table are
c1 = 82.5, c2 = 88.5, c3 = 94.5 and the frequencies are f1 = 3, f2 = 3, f3 = 2. Therefore the mean is
$$\begin{aligned}
\bar{x} &= \frac{\sum_{i=1}^{3} c_i f_i}{8} \\
        &= \frac{c_1 f_1 + c_2 f_2 + c_3 f_3}{8} \\
        &= \frac{(82.5)(3) + (88.5)(3) + (94.5)(2)}{8} \\
        &= \frac{247.5 + 265.5 + 189}{8} = \frac{702}{8} \\
        &= 87.75
\end{aligned}$$

2. Compute the mean of the grouped data

Mary’s Working Hours frequency Class Boundaries Class Width Class Mark
7.99-8.23 4 7.985-8.235 0.25 8.11
8.24-8.48 1 8.235-8.485 0.25 8.36
8.49-8.73 3 8.485-8.735 0.25 8.61
8.74-8.98 4 8.735-8.985 0.25 8.86
8.99-9.23 5 8.985-9.235 0.25 9.11
The sample size is n = 17 and the number of classes is m = 5. The class marks in the table are
c1 = 8.11, c2 = 8.36, c3 = 8.61, c4 = 8.86, c5 = 9.11 and the frequencies are f1 = 4, f2 = 1, f3 = 3,
f4 = 4, f5 = 5. Therefore the mean is
$$\begin{aligned}
\bar{x} &= \frac{\sum_{i=1}^{5} c_i f_i}{17} \\
        &= \frac{c_1 f_1 + c_2 f_2 + c_3 f_3 + c_4 f_4 + c_5 f_5}{17} \\
        &= \frac{(8.11)(4) + (8.36)(1) + (8.61)(3) + (8.86)(4) + (9.11)(5)}{17} \\
        &= \frac{32.44 + 8.36 + 25.83 + 35.44 + 45.55}{17} = \frac{147.62}{17} \\
        &= 8.68
\end{aligned}$$


The median for grouped data is found by locating the value that divides the data into two
equal parts. It is given by
$$\tilde{x} = a + \frac{b}{f}\left(\frac{n}{2} - c\right)$$
where
a = lower class boundary of the median class
b = class width of the median class
f = frequency of the median class
n = sample or population size
c = sum of the frequencies of the classes below the median class
Note: the median class is the class containing the middle of the ordered data, that is, the first
class whose cumulative frequency reaches n/2.

Example 3.4.2
1. Find the median of the grouped data

Grades frequency Class Boundaries Class Width Class Mark

80-85 3 79.5-85.5 6 82.5
86-91 3 85.5-91.5 6 88.5
92-97 2 91.5-97.5 6 94.5
The median grade for the data in the table is a value such that 4 grades are less than the value and
4 grades are greater than the value. The median grade must occur in the class 86-91, and thus
86-91 is called the median class. Thus,
$$\begin{aligned}
\tilde{x} &= a + \frac{b}{f}\left(\frac{n}{2} - c\right) \\
          &= 85.5 + \frac{6}{3}\left(\frac{8}{2} - 3\right) \\
          &= 85.5 + 2(4 - 3) \\
          &= 85.5 + 2 \\
          &= 87.5
\end{aligned}$$

2. Find the median of the grouped data

Mary’s Working Hours frequency Class Boundaries Class Width Class Mark
7.99-8.23 4 7.985-8.235 0.25 8.11
8.24-8.48 1 8.235-8.485 0.25 8.36
8.49-8.73 3 8.485-8.735 0.25 8.61
8.74-8.98 4 8.735-8.985 0.25 8.86
8.99-9.23 5 8.985-9.235 0.25 9.11
The median for the data in the table is a value such that half of the 17 observations (8.5) lie
below it and half lie above it. The cumulative frequency first reaches 8.5 in the class 8.74-8.98, and
thus 8.74-8.98 is called the median class. Thus,
$$\begin{aligned}
\tilde{x} &= a + \frac{b}{f}\left(\frac{n}{2} - c\right) \\
          &= 8.735 + \frac{0.25}{4}\left(\frac{17}{2} - 8\right) \\
          &= 8.735 + 0.0625(0.5) \\
          &= 8.735 + 0.03125 \\
          &= 8.77
\end{aligned}$$
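The grouped mean and grouped median formulas translate directly into code. The sketch below is illustrative only: the function names and the way the table is passed in (lists of class marks, lower class boundaries and frequencies, with a common class width) are assumptions, not part of the original notes.

```python
def grouped_mean(class_marks, freqs):
    """Mean of grouped data: sum(c_i * f_i) / n, where n = sum(f_i)."""
    n = sum(freqs)
    return sum(c * f for c, f in zip(class_marks, freqs)) / n

def grouped_median(lower_boundaries, width, freqs):
    """Median of grouped data: a + (b / f) * (n / 2 - c).

    The median class is taken as the first class whose cumulative
    frequency reaches n / 2. Assumes equal class widths.
    """
    n = sum(freqs)
    below = 0  # c: sum of frequencies of classes below the current one
    for a, f in zip(lower_boundaries, freqs):
        if below + f >= n / 2:
            return a + (width / f) * (n / 2 - below)
        below += f

# Mary's working hours (grouped)
marks = [8.11, 8.36, 8.61, 8.86, 9.11]
lower = [7.985, 8.235, 8.485, 8.735, 8.985]
freqs = [4, 1, 3, 4, 5]

print(round(grouped_mean(marks, freqs), 2))
print(round(grouped_median(lower, 0.25, freqs), 2))

# Grades (grouped)
print(grouped_median([79.5, 85.5, 91.5], 6, [3, 3, 2]))
```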

The modal class is defined to be the class with the maximum frequency. The mode for
grouped data is defined to be the class mark of the modal class.

Example 3.4.3
1. Find the mode of the grouped data.
Grades frequency Class Boundaries Class Width Class Mark
80-85 3 79.5-85.5 6 82.5
86-91 3 85.5-91.5 6 88.5
92-97 2 91.5-97.5 6 94.5
The classes with the highest frequency are 80-85 and 86-91, and their class marks are 82.5 and
88.5. Thus, the modes are 82.5 and 88.5. The grouped data is bimodal.

2. Find the mode of the grouped data

Mary’s Working Hours frequency Class Boundaries Class Width Class Mark
7.99-8.23 4 7.985-8.235 0.25 8.11
8.24-8.48 1 8.235-8.485 0.25 8.36
8.49-8.73 3 8.485-8.735 0.25 8.61
8.74-8.98 4 8.735-8.985 0.25 8.86
8.99-9.23 5 8.985-9.235 0.25 9.11
The class with the highest frequency is 8.99-9.23 and its class mark is 9.11. Therefore, the
mode is 9.11.

3.5 Measure of Dispersion for Grouped data


The range for grouped data is the difference between the upper boundary of the
class having the largest values and the lower boundary of the class having the smallest values.

Example 3.5.1
1. Find the range of the grouped data.

Grades frequency Class Boundaries Class Width Class Mark

80-85 3 79.5-85.5 6 82.5
86-91 3 85.5-91.5 6 88.5
92-97 2 91.5-97.5 6 94.5
The upper boundary of the class having the maximum value is 97.5 and the lower boundary of
the class having minimum value is 79.5. Therefore the range is 97.5 − 79.5 = 18.

2. Find the range of the grouped data


Mary’s Working Hours frequency Class Boundaries Class Width Class Mark
7.99-8.23 4 7.985-8.235 0.25 8.11
8.24-8.48 1 8.235-8.485 0.25 8.36
8.49-8.73 3 8.485-8.735 0.25 8.61
8.74-8.98 4 8.735-8.985 0.25 8.86
8.99-9.23 5 8.985-9.235 0.25 9.11
The upper boundary of the class having the maximum value is 9.235 and the lower boundary of
the class having minimum value is 7.985. Therefore the range is 9.235 − 7.985 = 1.25.

Variance and Standard Deviation

The variance for grouped data is given by
$$s^2 = \frac{n \sum_{i=1}^{m} c_i^2 f_i - \left(\sum_{i=1}^{m} c_i f_i\right)^2}{n(n - 1)}$$
where $c_i$ are class marks and $f_i$ are class frequencies, and the standard deviation is given by
$$s = \sqrt{s^2}$$
Example 3.5.2
1. Find the variance and standard deviation of the grouped data.

Grades frequency Class Boundaries Class Width Class Mark

80-85 3 79.5-85.5 6 82.5
86-91 3 85.5-91.5 6 88.5
92-97 2 91.5-97.5 6 94.5
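Note that n = 8 and m = 3.
$$\begin{aligned}
\sum_{i=1}^{3} c_i^2 f_i &= c_1^2 f_1 + c_2^2 f_2 + c_3^2 f_3 \\
  &= (82.5)^2(3) + (88.5)^2(3) + (94.5)^2(2) \\
  &= (6806.25)(3) + (7832.25)(3) + (8930.25)(2) \\
  &= 20418.75 + 23496.75 + 17860.5 \\
  &= 61776
\end{aligned}$$
$$\begin{aligned}
\sum_{i=1}^{3} c_i f_i &= c_1 f_1 + c_2 f_2 + c_3 f_3 \\
  &= (82.5)(3) + (88.5)(3) + (94.5)(2) \\
  &= 247.5 + 265.5 + 189 \\
  &= 702
\end{aligned}$$
$$\begin{aligned}
s^2 &= \frac{n \sum_{i=1}^{m} c_i^2 f_i - \left(\sum_{i=1}^{m} c_i f_i\right)^2}{n(n - 1)} \\
    &= \frac{8(61776) - (702)^2}{8(8 - 1)} \\
    &= \frac{494208 - 492804}{8(7)} = \frac{1404}{56} \\
    &= 25.07
\end{aligned}$$
and so
$$s = \sqrt{s^2} = \sqrt{25.07} = 5.01$$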

2. Find the variance and standard deviation of the grouped data

Mary’s Working Hours frequency Class Boundaries Class Width Class Mark
7.99-8.23 4 7.985-8.235 0.25 8.11
8.24-8.48 1 8.235-8.485 0.25 8.36
8.49-8.73 3 8.485-8.735 0.25 8.61
8.74-8.98 4 8.735-8.985 0.25 8.86
8.99-9.23 5 8.985-9.235 0.25 9.11
Note that n = 17 and m = 5.
$$\begin{aligned}
\sum_{i=1}^{5} c_i^2 f_i &= c_1^2 f_1 + c_2^2 f_2 + c_3^2 f_3 + c_4^2 f_4 + c_5^2 f_5 \\
  &= (8.11)^2(4) + (8.36)^2(1) + (8.61)^2(3) + (8.86)^2(4) + (9.11)^2(5) \\
  &= (65.7721)(4) + (69.8896)(1) + (74.1321)(3) + (78.4996)(4) + (82.9921)(5) \\
  &= 263.0884 + 69.8896 + 222.3963 + 313.9984 + 414.9605 \\
  &= 1284.3332
\end{aligned}$$
$$\begin{aligned}
\sum_{i=1}^{5} c_i f_i &= c_1 f_1 + c_2 f_2 + c_3 f_3 + c_4 f_4 + c_5 f_5 \\
  &= (8.11)(4) + (8.36)(1) + (8.61)(3) + (8.86)(4) + (9.11)(5) \\
  &= 32.44 + 8.36 + 25.83 + 35.44 + 45.55 \\
  &= 147.62
\end{aligned}$$
$$\begin{aligned}
s^2 &= \frac{n \sum_{i=1}^{m} c_i^2 f_i - \left(\sum_{i=1}^{m} c_i f_i\right)^2}{n(n - 1)} \\
    &= \frac{17(1284.3332) - (147.62)^2}{17(17 - 1)} \\
    &= \frac{21833.6644 - 21791.6644}{17(16)} = \frac{42}{272} \\
    &= 0.15
\end{aligned}$$
and so
$$s = \sqrt{s^2} = \sqrt{0.15} = 0.39$$
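As a cross-check of the grouped variance formula, here is a minimal Python sketch applied to the grouped working-hours table. The function name `grouped_variance` is hypothetical and chosen only for this illustration.

```python
import math

def grouped_variance(class_marks, freqs):
    """Grouped sample variance:
    (n*sum(c^2 f) - (sum(c f))^2) / (n*(n - 1)), with n = sum(f)."""
    n = sum(freqs)
    sum_cf = sum(c * f for c, f in zip(class_marks, freqs))
    sum_c2f = sum(c * c * f for c, f in zip(class_marks, freqs))
    return (n * sum_c2f - sum_cf ** 2) / (n * (n - 1))

# Mary's working hours (grouped): class marks and frequencies
marks = [8.11, 8.36, 8.61, 8.86, 9.11]
freqs = [4, 1, 3, 4, 5]

variance = grouped_variance(marks, freqs)
std_dev = math.sqrt(variance)
print(round(variance, 2), round(std_dev, 2))
```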

3.6 Chebyshev’s Theorem

Chebyshev's Theorem: At least the fraction $1 - \frac{1}{k^2}$ of the measurements of any set of data
must lie within k standard deviations of the mean.

Equivalence: If $\bar{x}$ is the mean and s is the standard deviation of a set of data, then at least
$\frac{k^2 - 1}{k^2}$ of the data set will fall between $\bar{x} - ks$ and $\bar{x} + ks$.
If k = 2, then at least 3/4 or 75% of the data will fall between $\bar{x} - 2s$ and $\bar{x} + 2s$.
If k = 3, then at least 8/9 or 88.9% of the data will fall between $\bar{x} - 3s$ and $\bar{x} + 3s$.
If k = 4, then at least 15/16 or 93.75% of the data will fall between $\bar{x} - 4s$ and $\bar{x} + 4s$.
If k = 5, then at least 24/25 or 96% of the data will fall between $\bar{x} - 5s$ and $\bar{x} + 5s$.
Example 3.6.1
1. If the IQs of a random sample of 1080 students at a large university have a mean score of
120 and a standard deviation of 8, use Chebyshev’s theorem to determine the interval
containing at least 810 of the IQs in the sample.
Given: $\bar{x}$ = 120, s = 8
Since 810/1080 = 0.75 = 75% = 1 − 1/2², we take k = 2, so at least 75% of the IQ scores will fall between

(x − 2s, x + 2s) = (120 − 2(8), 120 + 2(8))

= (120 − 16, 120 + 16)
= (104, 136)
2. In what range can we be sure that at least 960 of the IQs will fall?
Given: $\bar{x}$ = 120, s = 8
Since 960/1080 = 0.889 = 88.9% = 1 − 1/3², we take k = 3, so at least 88.9% of the IQ scores will fall between
(x − 3s, x + 3s) = (120 − 3(8), 120 + 3(8))
= (120 − 24, 120 + 24)
= (96, 144)

3. If the heights of a random sample of 5000 employees at a manufacturing company have a
mean of 167 cm and a standard deviation of 4.5 cm, use Chebyshev's theorem to determine
the interval containing at least 4800 of the heights in the sample.
Given: $\bar{x}$ = 167, s = 4.5
Since 4800/5000 = 0.96 = 96% = 1 − 1/5², we take k = 5, so at least 96% of the heights will fall between
(x − 5s, x + 5s) = (167 − 5(4.5), 167 + 5(4.5))
= (167 − 22.5, 167 + 22.5)
= (144.5, 189.5)
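The three examples follow one pattern: turn the required count into a fraction, solve $1 - 1/k^2 = \text{fraction}$ for k, and build the interval. A minimal Python sketch of that pattern is given below; the function name `chebyshev_interval` is an assumption made for this illustration.

```python
import math

def chebyshev_interval(mean, sd, fraction):
    """Return (k, lower, upper) such that, by Chebyshev's theorem, at least
    `fraction` of the data lies in (mean - k*sd, mean + k*sd).

    k solves 1 - 1/k**2 = fraction; requires 0 <= fraction < 1.
    """
    k = 1 / math.sqrt(1 - fraction)
    return k, mean - k * sd, mean + k * sd

# IQ scores: at least 810 of 1080, then at least 960 of 1080
print(chebyshev_interval(120, 8, 810 / 1080))
print(chebyshev_interval(120, 8, 960 / 1080))

# Heights: at least 4800 of 5000
print(chebyshev_interval(167, 4.5, 4800 / 5000))
```

Because of floating-point rounding, the printed k values may differ from 2, 3 and 5 in the last decimal places.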

3.7 z Score
z Score: An observation x from a data set with mean µ (or $\bar{x}$) and standard deviation σ (or s) has a z
score or z value defined by
$$z = \frac{x - \mu}{\sigma} \quad \text{or} \quad z = \frac{x - \bar{x}}{s}.$$

A z score measures how many standard deviations an observation is above or below the mean.
A positive z score measures the number of standard deviations an observation is above the
mean and a negative z score gives the number of standard deviations an observation is below
the mean.

Example 3.7.1
1. Maria’s grade in Math is 82 and 89 in English. If the class mean grade in Math was 68 and
standard deviation was 8 while the grades in English had a mean score of 80 and a standard
deviation of 6, can we conclude that Maria is a better student in English than Math?
Math: x = 82, $\bar{x}$ = 68, s = 8
$$z = \frac{x - \bar{x}}{s} = \frac{82 - 68}{8} = \frac{14}{8} = 1.75$$
English: x = 89, $\bar{x}$ = 80, s = 6
$$z = \frac{x - \bar{x}}{s} = \frac{89 - 80}{6} = \frac{9}{6} = 1.5$$
Since Maria's z score in Math is greater than her z score in English, Maria performed better,
relative to her class, in Math than in English.

2. Two soap companies argued over which of their powdered soaps dissolves more quickly
and efficiently. In an actual demo, both soap A from company A and soap B from company B
had a dissolving time of 9 min. If the mean dissolving time of all soaps from company A was
10.0 min with a standard deviation of 5.25 min, while the dissolving time of all soaps from company B
had a mean of 11.5 min and a standard deviation of 3 min, can we conclude that soaps from
company A dissolve quicker than soaps from company B?
Soap A: x = 9, $\bar{x}$ = 10.0, s = 5.25
$$z = \frac{x - \bar{x}}{s} = \frac{9 - 10.0}{5.25} = -\frac{1}{5.25} = -0.19$$
Soap B: x = 9, $\bar{x}$ = 11.5, s = 3
$$z = \frac{x - \bar{x}}{s} = \frac{9 - 11.5}{3} = -\frac{2.5}{3} = -0.83$$
Since the z score of soap B is less than that of soap A, the dissolving time of soap B lies farther
below its company's mean, so relative to its own company's soaps, soap B dissolved quicker
than soap A.
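The z score comparisons above can be reproduced with a one-line function. This is a small illustrative sketch; the name `z_score` is chosen here and is not from the original notes.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / sd

# Maria's grades
z_math = z_score(82, 68, 8)
z_english = z_score(89, 80, 6)
print(z_math, z_english)

# Soap dissolving times (minutes)
z_soap_a = z_score(9, 10.0, 5.25)
z_soap_b = z_score(9, 11.5, 3)
print(round(z_soap_a, 2), round(z_soap_b, 2))
```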

3.8 Coefficient of Variation

The coefficient of variation is equal to the standard deviation divided by the mean, multiplied
by 100%. It is given by the formula
$$CV = \frac{\sigma}{\mu} \cdot 100\% \quad \text{or} \quad CV = \frac{s}{\bar{x}} \cdot 100\%.$$

Example 3.8.1
1. A national sampling of prices for new and used motorcycles found that the mean price for
a new motorcycle is 60,100 and the standard deviation is 6,125 and that the mean price for a
used motorcycle is 25,485 with a standard deviation equal to 2,630. Compute their CVs.
New Motorcycle: $\bar{x}$ = 60100, s = 6125
$$CV = \frac{s}{\bar{x}} \times 100\% = \frac{6125}{60100} \times 100\% = 0.1019 \times 100\% = 10.19\%$$
Used Motorcycle: $\bar{x}$ = 25485, s = 2630
$$CV = \frac{s}{\bar{x}} \times 100\% = \frac{2630}{25485} \times 100\% = 0.1032 \times 100\% = 10.32\%$$
2. The mean dissolving time of all powdered soaps from company A was 10.0 min and the standard
deviation was 5.25 min, while the dissolving time of all powdered soaps from company B had a
mean of 11.5 min and a standard deviation of 3 min. Compute their CVs.
Soaps from Company A: $\bar{x}$ = 10.0, s = 5.25
$$CV = \frac{s}{\bar{x}} \times 100\% = \frac{5.25}{10.0} \times 100\% = 0.5250 \times 100\% = 52.50\%$$
Soaps from Company B: $\bar{x}$ = 11.5, s = 3
$$CV = \frac{s}{\bar{x}} \times 100\% = \frac{3}{11.5} \times 100\% = 0.2609 \times 100\% = 26.09\%$$
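A short Python sketch of the coefficient of variation follows. The function name `coefficient_of_variation` is an assumption for illustration; it returns the CV as a percentage.

```python
def coefficient_of_variation(sd, mean):
    """CV in percent: (standard deviation / mean) * 100."""
    return sd / mean * 100

cv_new = coefficient_of_variation(6125, 60100)    # new motorcycles
cv_soap_a = coefficient_of_variation(5.25, 10.0)  # soaps from company A
cv_soap_b = coefficient_of_variation(3, 11.5)     # soaps from company B

print(round(cv_new, 2), round(cv_soap_a, 2), round(cv_soap_b, 2))
```

Because the CV is unit-free, it allows the relative spread of data sets with very different means to be compared, as with the two soap companies here.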
3.9 Pearsonian Coefficient of Skewness
The Pearsonian Coefficient of Skewness is given by
$$SK = \frac{3(\bar{x} - \tilde{x})}{s} \quad \text{or} \quad SK = \frac{3(\mu - \tilde{\mu})}{\sigma}$$
1. Compute the Pearsonian Coefficient of Skewness of the grades obtained by students in their
preliminary examination
80 87
92 90
95 83
83 88
Note that $\bar{x}$ = 87.25, $\tilde{x}$ = 87.5, and s = 5.06. Therefore,
$$SK = \frac{3(\bar{x} - \tilde{x})}{s} = \frac{3(87.25 - 87.5)}{5.06} = \frac{-0.75}{5.06} = -0.15$$
2. Compute the Pearsonian Coefficient of Skewness of the grouped data

Mary’s Working Hours frequency Class Boundaries Class Width Class Mark
7.99-8.23 4 7.985-8.235 0.25 8.11
8.24-8.48 1 8.235-8.485 0.25 8.36
8.49-8.73 3 8.485-8.735 0.25 8.61
8.74-8.98 4 8.735-8.985 0.25 8.86
8.99-9.23 5 8.985-9.235 0.25 9.11
Note that $\bar{x}$ = 8.68, $\tilde{x}$ = 8.77, and s = 0.39. Therefore,
$$SK = \frac{3(\bar{x} - \tilde{x})}{s} = \frac{3(8.68 - 8.77)}{0.39} = \frac{-0.27}{0.39} = -0.69$$
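Both skewness computations can be checked with the following sketch. The function name `pearson_skewness` is hypothetical, and the inputs are the rounded mean, median and standard deviation quoted in the two examples.

```python
def pearson_skewness(mean, median, sd):
    """Pearsonian coefficient of skewness: 3 * (mean - median) / sd."""
    return 3 * (mean - median) / sd

sk_grades = pearson_skewness(87.25, 87.5, 5.06)  # ungrouped grades
sk_hours = pearson_skewness(8.68, 8.77, 0.39)    # grouped working hours

print(round(sk_grades, 2), round(sk_hours, 2))
```

A negative value indicates that the mean lies below the median, that is, the distribution is skewed to the left.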

3.10 Empirical Rule

Empirical Rule: Given a bell-shaped distribution of measurements, then approximately
68% of the observations lie within 1 standard deviation of the mean,
95% of the observations lie within 2 standard deviations of the mean, and
99.70% of the observations lie within 3 standard deviations of the mean.

Equivalently,
68% of the data will fall between $\bar{x} - s$ and $\bar{x} + s$,
95% of the data will fall between $\bar{x} - 2s$ and $\bar{x} + 2s$, and
99.70% of the data will fall between $\bar{x} - 3s$ and $\bar{x} + 3s$.
Example 3.10.1
1. Assuming the incomes for all households last year produced a bell-shaped distribution
with a mean equal to 200,000 and a standard deviation equal to 56,540, deduce an
approximation based on the empirical rule.
Given: $\bar{x}$ = 200,000, s = 56,540
68% of the incomes will fall between
(x − s, x + s) = (200,000 − 56,540, 200,000 + 56,540)
= (143,460, 256,540)
95% of the incomes will fall between
(x − 2s, x + 2s) = (200,000 − 2(56,540), 200,000 + 2(56,540))
= (200,000 − 113,080, 200,000 + 113,080)
= (86,920, 313,080)
99.70% of the incomes will fall between
(x − 3s, x + 3s) = (200,000 − 3(56,540), 200,000 + 3(56,540))
= (200,000 − 169,620, 200,000 + 169,620)
= (30,380, 369,620)
2. Deduce an approximation based on empirical rule of the grades obtained by students in
their preliminary examination assuming the distribution is bell-shaped
80 87
92 90
95 83
83 88
Given: x = 87.25, s = 5.06.
68% of the grades will fall between
(x − s, x + s) = (87.25 − 5.06, 87.25 + 5.06)
= (82.19, 92.31)
95% of the grades will fall between
(x − 2s, x + 2s) = (87.25 − 2(5.06), 87.25 + 2(5.06))
= (87.25 − 10.12, 87.25 + 10.12)
= (77.13, 97.37)

99.70% of the grades will fall between

(x − 3s, x + 3s) = (87.25 − 3(5.06), 87.25 + 3(5.06))

= (87.25 − 15.18, 87.25 + 15.18)
= (72.07, 102.43)

3. Based on empirical rule deduce an approximation of the working hours of Mary on

seventeen days of February: 8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66,
8.00, 8.01, 8.10, 8.49, 9.19 assuming that the distribution is bell-shaped.
Given: x = 8.68, s = 0.44.
68% of the hours will fall between

(x − s, x + s) = (8.68 − 0.44, 8.68 + 0.44)

= (8.24, 9.12)

95% of the hours will fall between

(x − 2s, x + 2s) = (8.68 − 2(0.44), 8.68 + 2(0.44))

= (8.68 − 0.88, 8.68 + 0.88)
= (7.8, 9.56)

99.70% of the hours will fall between

(x − 3s, x + 3s) = (8.68 − 3(0.44), 8.68 + 3(0.44))

= (8.68 − 1.32, 8.68 + 1.32)
= (7.36, 10.00)

3.11 Measures of Position

Measures of position are used to describe the location of a particular observation in
relation to the rest of the data set. They are also called fractiles or quantiles.

Definition 3.11.1 Percentiles are values that divide a set of observations into 100 equal
parts. These values, denoted by P1, P2, . . . P99, are such that 1% of the data falls below P1, 2%
falls below P2, . . . and 99% falls below P99.

Definition 3.11.2 Deciles are values that divide a set of observations into 10 equal parts.
These values, denoted by D1, D2, . . . D9, are such that 10% of the data falls below D1, 20%
falls below D2, . . . and 90% falls below D9.
Definition 3.11.3 Quartiles are values that divide a set of observations into 4 equal parts.
These values, denoted by Q1, Q2 and Q3, are such that 25% of the data falls below Q1, 50% falls
below Q2 and 75% falls below Q3.

D1 = P10, D2 = P20, D3 = P30, D4 = P40, D5 = P50, D6 = P60, D7 = P70, D8 = P80, D9 = P90
Q1 = P25, Q2 = P50, Q3 = P75

Procedure for finding the pth percentile $P_p$:

1. Arrange the data set in increasing order.
2. Compute the index $i = \frac{(p)(n)}{100}$, where n is the number of observations.
3. If i is not an integer, the next integer greater than i locates the position of the pth percentile
in the arranged data set.
4. If i is an integer, the pth percentile is the average of the observations in positions i and i + 1
in the arranged data set.

Example 3.11.4
1. Given the grades obtained by students in their preliminary examination
80 87
92 90
95 83
83 88
Compute the following
a. P15
b. D2
c. Q3
Arranging the grades in an increasing order, we get

80 83 83 87 88 90 92 95

Note that n = 8.
a. Solve for P15.
Compute the index $i = \frac{(p)(n)}{100} = \frac{15(8)}{100} = \frac{120}{100} = 1.2$.
Since i = 1.2 is not an integer, we round up to 2. Thus, P15 is the 2nd observation.
Therefore, P15 = 83.
b. Solve for D2.
Since D2 = P20, we compute P20.
Compute the index $i = \frac{(p)(n)}{100} = \frac{20(8)}{100} = \frac{160}{100} = 1.6$.
Since i = 1.6 is not an integer, we round up to 2. Thus, P20 is also the 2nd observation.
Therefore, D2 = P20 = 83.
c. Solve for Q3.
Since Q3 = P75, we compute P75.
Compute the index $i = \frac{(p)(n)}{100} = \frac{75(8)}{100} = \frac{600}{100} = 6$.
Since i = 6 is an integer, Q3 is the average of the 6th and 7th observations.
Therefore, $Q_3 = \frac{90 + 92}{2} = \frac{182}{2} = 91$.
2. The following were the working hours of Mary on seventeen days of February: 8.76, 8.88,
9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89, 8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19. Compute
the following
a. D7
b. Q1
c. P67
d. P83
e. P98
Arranging the hours in an increasing order, we get

7.99 8.00 8.01 8.10 8.49 8.66 8.67 8.67 8.76 8.76 8.88 8.89 9.02 9.12 9.19 9.2 9.21

Note that n = 17.

a. Solve for D7.
Since D7 = P70, we compute P70.
Compute the index $i = \frac{(p)(n)}{100} = \frac{70(17)}{100} = \frac{1190}{100} = 11.9$.
Since i = 11.9 is not an integer, we round up to 12. Thus, D7 is the 12th observation.
Therefore, D7 = 8.89.
b. Solve for Q1.
Since Q1 = P25, we compute P25.
Compute the index $i = \frac{(p)(n)}{100} = \frac{25(17)}{100} = \frac{425}{100} = 4.25$.
Since i = 4.25 is not an integer, we round up to 5. Thus, Q1 is the 5th observation.
Therefore, Q1 = 8.49.
c. Solve for P67.
Compute the index $i = \frac{(p)(n)}{100} = \frac{67(17)}{100} = \frac{1139}{100} = 11.39$.
Since i = 11.39 is not an integer, we round up to 12. Thus, P67 is the 12th observation.
Therefore, P67 = 8.89.
d. Solve for P83.
Compute the index $i = \frac{(p)(n)}{100} = \frac{83(17)}{100} = \frac{1411}{100} = 14.11$.
Since i = 14.11 is not an integer, we round up to 15. Thus, P83 is the 15th observation.
Therefore, P83 = 9.19.
e. Solve for P98.
Compute the index $i = \frac{(p)(n)}{100} = \frac{98(17)}{100} = \frac{1666}{100} = 16.66$.
Since i = 16.66 is not an integer, we round up to 17. Thus, P98 is the 17th observation.
Therefore, P98 = 9.21.
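The percentile procedure used in these examples can be sketched in Python as follows. The function name `percentile` is chosen for this illustration, and the sketch assumes p is a whole number from 1 to 99. Library routines such as `statistics.quantiles` use different interpolation conventions and may give slightly different answers.

```python
def percentile(data, p):
    """p-th percentile by the index rule i = p*n/100 (p an integer, 1..99).

    If i is not an integer, take the observation at the next integer position.
    If i is an integer, average the observations at positions i and i + 1.
    """
    xs = sorted(data)
    # Integer arithmetic avoids floating-point trouble: i = q + r/100.
    q, r = divmod(p * len(xs), 100)
    if r == 0:
        return (xs[q - 1] + xs[q]) / 2  # positions q and q + 1 (1-based)
    return xs[q]                        # position q + 1 (1-based)

grades = [80, 87, 92, 90, 95, 83, 83, 88]
hours = [8.76, 8.88, 9.2, 9.02, 7.99, 8.67, 9.21, 9.12, 8.89,
         8.67, 8.76, 8.66, 8.00, 8.01, 8.10, 8.49, 9.19]

print(percentile(grades, 15), percentile(grades, 20), percentile(grades, 75))
print(percentile(hours, 70), percentile(hours, 25), percentile(hours, 98))
```

Deciles and quartiles reuse the same function through D_k = P_(10k) and Q1 = P25, Q2 = P50, Q3 = P75.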
Chapter 4


4.1 Sample Space

Definition 4.1.1
Any process that generates a set of data is called an experiment.
The set of all possible outcomes of a statistical experiment is called the sample space and is
represented by the symbol S.
Each outcome in a sample space is called an element or a member of the sample
space or simply a sample point.
An event is a subset of a sample space.
If an event is a set containing only one element of the sample space, then it is called a simple
event.
A compound event is one that can be expressed as the union of simple events.
The null space or empty space is a subset of the sample space that contains no
elements. We denote this event by ∅.
The number of elements in a sample space S is denoted by n(S).

Example 4.1.2
1. Experiment: Tossing a coin
Let H = head, T = tail
S = {H, T}
n(S) = 2

2. Experiment: Tossing a die
S = {1, 2, 3, 4, 5, 6}
n(S) = 6
Event of getting an odd number
E = {1, 3, 5}
n(E) = 3

3. Experiment: Tossing 2 coins
Let H = head, T = tail
S = {HH, HT, TH, TT}
n(S) = 4
Event of having the same results
E = {HH, TT}
n(E) = 2

4. Experiment: Tossing 3 coins
Let H = head, T = tail
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
n(S) = 8
Event that at least 2 heads occur
E = {HHH, HHT, HTH, THH}
n(E) = 4
Event of having the same results
E = {HHH, TTT}
n(E) = 2

5. Experiment: Tossing 2 dice
S = {(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6),
(2, 1), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6),
(3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3, 6),
(4, 1), (4, 2), (4, 3), (4, 4), (4, 5), (4, 6),
(5, 1), (5, 2), (5, 3), (5, 4), (5, 5), (5, 6),
(6, 1), (6, 2), (6, 3), (6, 4), (6, 5), (6, 6)}
n(S) = 36
Event that the sum is 5
E = {(1, 4), (2, 3), (3, 2), (4, 1)}
n(E) = 4
Event that the first number is 1
E = {(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6)}
n(E) = 6

4.2 Counting Techniques

1. Multiplication Rule or Fundamental Principle of Counting. If an operation can be
performed in n1 ways, if for each of these a second operation can be performed in n2 ways, if
for each of these a third operation can be performed in n3 ways, and so on, then the sequence
of k operations can be performed in n1 · n2 · n3 · · · nk ways.

Example 4.2.1
1. How many sample points are in the sample space when a pair of dice is thrown?
n1 =number of possible outcomes when the 1st die is thrown,
n2 =number of possible outcomes when the 2nd die is thrown
There are 6 possible outcomes when the 1st die is thrown and there are also 6 possible outcomes
when the 2nd die is thrown. Thus, there are n1 · n2 = 6 · 6 = 36 sample points.

2. How many lunches are possible consisting of soup, a sandwich, dessert, and a drink if one
can select from 4 soups, 3 kinds of sandwiches, 5 desserts and 4 drinks?
There are
n1 = 4 ways to select a soup,
n2 = 3 ways to select a sandwich,
n3 = 5 ways to select a dessert, and
n4 = 4 ways to select a drink.
Therefore, there are n1 · n2 · n3 · n4 = 4 · 3 · 5 · 4 = 240 possible lunches.

3. How many even three-digit numbers can be formed from the digits 1, 2, 5, 6 and 9 if each
digit can be used only once?
n1 = ways to select the hundreds' digit,
n2 = ways to select the tens' digit,
n3 = ways to select the ones' digit.
We want the number to be even, so the ones' digit must be 2 or 6 and n3 = 2. Since there are 5 digits,
one of them is already used for the ones' digit, and each digit can be used only once, n2 = 4 and
n1 = 3. Therefore, there are n1 · n2 · n3 = 3 · 4 · 2 = 24 even three-digit numbers that can be formed
from the digits 1, 2, 5, 6 and 9 if each digit can be used only once.

2. The number of arrangements of n distinct objects in a row is n!.

Note: n! = n · (n − 1) · (n − 2) · · · 3 · 2 · 1
0! = 1

Example 4.2.2
1. How many distinct arrangements on 5 chairs for 5 persons?
There are n! = 5! = 5 · 4 · 3 · 2 · 1 = 120 distinct arrangements on 5 chairs for 5 persons.

2. How many possible arrangements of the letters of the word “MATH”?

There are n! = 4! = 4 · 3 · 2 · 1 = 24 possible arrangements of the letters of the word “MATH”.

3. How many possible arrangements of the letters of the word “LOGARITHM” are there if it starts
with a vowel and ends with a consonant?
There are n1 · n2 · n3 · n4 · n5 · n6 · n7 · n8 · n9 possible arrangements. There are n1 = 3 ways
to select the first letter from the vowels O, A and I, and n9 = 6 ways to select a consonant for the last letter.
Moreover, there are n2 · n3 · n4 · n5 · n6 · n7 · n8 = 7! arrangements of the 7 remaining letters in a row.
Therefore, there are n1 · n2 · n3 · n4 · n5 · n6 · n7 · n8 · n9 = 3 · 7! · 6 = 90720 possible arrangements of the
letters of the word “LOGARITHM” if it starts with a vowel and ends with a consonant.

3. The number of arrangements of n nondistinct objects is
$$\frac{n!}{n_1! \cdot n_2! \cdot n_3! \cdots n_k!}$$
where $n = n_1 + n_2 + \cdots + n_k$.
Example 4.2.3
1. How many possible arrangements of the letters of the word “STATISTICS” are there?
The word contains
n1 = 3 copies of S,
n2 = 3 copies of T,
n3 = 1 copy of A,
n4 = 2 copies of I,
n5 = 1 copy of C,
so n = 10.
Therefore, there are
$$\frac{n!}{n_1! \cdot n_2! \cdot n_3! \cdot n_4! \cdot n_5!} = \frac{10!}{3! \cdot 3! \cdot 1! \cdot 2! \cdot 1!} = \frac{3628800}{72} = 50400$$
possible arrangements of letters.
2. How many possible color combinations of flags can be formed from 3 red, 4 yellow and 2 blue flags?
There are
n1 = 3 red flags,
n2 = 4 yellow flags,
n3 = 2 blue flags,
so n = 9.
Therefore, there are
$$\frac{n!}{n_1! \cdot n_2! \cdot n_3!} = \frac{9!}{3! \cdot 4! \cdot 2!} = \frac{362880}{288} = 1260$$
possible color combinations of flags.
4. Arrangement of n distinct objects in a circle is (n − 1)!.
Example 4.2.4
1. How many possible arrangements of 6 petals of Gumamela?
The possible arrangement of 6 petals of Gumamela is (6 − 1)! = 5! = 120.

2. How many possible arrangements of 10 persons in a round table?

The possible arrangements of 10 persons in a round table is (10 − 1)! = 9! = 362880.
5. The number of permutations of n distinct objects taken r at a time is $_nP_r = \frac{n!}{(n - r)!}$.
Example 4.2.5
1. How many permutations of the letters A, B and C are there, taken 1, 2 or 3 at a time?
Taken 1 at a time
$$_3P_1 = \frac{3!}{(3 - 1)!} = \frac{3!}{2!} = 3$$
Therefore, there are 3 permutations of the letters A, B and C taken 1 at a time.
Taken 2 at a time
$$_3P_2 = \frac{3!}{(3 - 2)!} = \frac{3!}{1!} = 6$$
Therefore, there are 6 permutations of the letters A, B and C taken 2 at a time.
Taken 3 at a time
$$_3P_3 = \frac{3!}{(3 - 3)!} = \frac{3!}{0!} = 6$$
Therefore, there are 6 permutations of the letters A, B and C taken 3 at a time.

2. Two lottery tickets are drawn from 20 for first and second prizes. Find the number of
sample points in the sample space.
There are
$$_{20}P_2 = \frac{20!}{(20 - 2)!} = \frac{20!}{18!} = 20 \cdot 19 = 380$$
sample points in the sample space.
6. The number of combinations of n distinct objects taken r at a time is $_nC_r = \frac{n!}{r!(n - r)!}$.

Example 4.2.6
1. How many combinations of the letters A, B and C are there, taken 1, 2 or 3 at a time?
Taken 1 at a time
$$_3C_1 = \frac{3!}{1!(3 - 1)!} = \frac{3!}{1! \cdot 2!} = 3$$
Therefore, there are 3 combinations of the letters A, B and C taken 1 at a time.
Taken 2 at a time
$$_3C_2 = \frac{3!}{2!(3 - 2)!} = \frac{3!}{2! \cdot 1!} = 3$$
Therefore, there are 3 combinations of the letters A, B and C taken 2 at a time.
Taken 3 at a time
$$_3C_3 = \frac{3!}{3!(3 - 3)!} = \frac{3!}{3! \cdot 0!} = 1$$
Therefore, there is 1 combination of the letters A, B and C taken 3 at a time.

2. From 5 SOE students, 4 SBA students and 3 SAS students, find the number of committees
of 6 persons that can be formed with 3 SOE, 2 SBA and 1 SAS students.
There are
$n_1 = {}_5C_3 = \frac{5!}{3!(5 - 3)!} = 10$ ways to select from SOE students,
$n_2 = {}_4C_2 = \frac{4!}{2!(4 - 2)!} = 6$ ways to select from SBA students,
$n_3 = {}_3C_1 = \frac{3!}{1!(3 - 1)!} = 3$ ways to select from SAS students.
Therefore, there are
n1 · n2 · n3 = 10 · 6 · 3 = 180 possible committees.

3. How many possible combinations of numbers in a 6-55 lotto game?

There are
$$_{55}C_6 = \frac{55!}{6!(55 - 6)!} = \frac{55!}{6! \cdot 49!} = 28,989,675$$
possible combinations of numbers.
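The counting rules of this section map onto functions in Python's standard `math` module (`math.factorial`, `math.perm` and `math.comb`, the latter two available from Python 3.8). The sketch below recomputes several of the worked answers; the helper `distinct_arrangements` is a hypothetical name introduced here for rule 3.

```python
import math
from collections import Counter

def distinct_arrangements(word):
    """n! / (n1! * n2! * ... * nk!) for a word with repeated letters."""
    total = math.factorial(len(word))
    for count in Counter(word).values():
        total //= math.factorial(count)  # each partial quotient is an integer
    return total

lunches = 4 * 3 * 5 * 4                   # multiplication rule
row_of_five = math.factorial(5)           # n distinct objects in a row
statistics_word = distinct_arrangements("STATISTICS")
round_table = math.factorial(10 - 1)      # n distinct objects in a circle
lottery_prizes = math.perm(20, 2)         # ordered selection: 20P2
committees = math.comb(5, 3) * math.comb(4, 2) * math.comb(3, 1)
lotto_6_55 = math.comb(55, 6)             # unordered selection: 55C6

print(lunches, row_of_five, statistics_word, round_table)
print(lottery_prizes, committees, lotto_6_55)
```

The only difference between `math.perm` and `math.comb` is whether order matters: drawing first and second prizes is ordered, while a lotto ticket is an unordered set of numbers.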

4.3 Probability of an Event

Definition 4.3.1 The probability of an event E is the sum of the probabilities of all the
sample points in E. Therefore,
$$0 \le P(E) \le 1, \quad P(\emptyset) = 0, \quad P(S) = 1,$$
and, when all sample points are equally likely,
$$P(E) = \frac{n(E)}{n(S)}.$$
Example 4.3.2
1. A coin is tossed twice. What is the probability that at least 1 head occurs?
Let H = head, T = tail
S = {HH, HT, TH, TT}
Event that at least 1 head occurs
E = {HH, HT, TH}
Thus, n(S) = 4 and n(E) = 3. Therefore, $P(E) = \frac{n(E)}{n(S)} = \frac{3}{4}$.
2. If three coins are tossed, find the probability of having the same results.
Let H = head, T = tail
S = {HHH, HHT, HTH, THH, HTT, THT, TTH, TTT}
Event of having the same results
E = {HHH, TTT}
Thus, n(S) = 8 and n(E) = 2. Therefore, $P(E) = \frac{n(E)}{n(S)} = \frac{2}{8} = \frac{1}{4}$.
3. If two dice are thrown, find the probability that the sum is 5.
S = {(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6),
(2, 1), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6),
(3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3, 6),
(4, 1), (4, 2), (4, 3), (4, 4), (4, 5), (4, 6),
(5, 1), (5, 2), (5, 3), (5, 4), (5, 5), (5, 6),
(6, 1), (6, 2), (6, 3), (6, 4), (6, 5), (6, 6)}
Event that the sum is 5
E = {(1, 4), (2, 3), (3, 2), (4, 1)}
Thus, n(S) = 36 and n(E) = 4. Therefore, $P(E) = \frac{n(E)}{n(S)} = \frac{4}{36} = \frac{1}{9}$.
4. If a card is drawn from an ordinary deck, find the probability that it is a heart.
We note that in an ordinary deck there are 52 cards, of which 13 are hearts. Thus, n(S) = 52
and n(E) = 13. Therefore, $P(E) = \frac{n(E)}{n(S)} = \frac{13}{52} = \frac{1}{4}$.

5. Find the probability of winning the 6-55 lotto game given one ticket.
We note that there are n(S) = 28,989,675 possible combinations in the 6-55 lotto game. Since
n(E) = 1 for one ticket, we have
$$P(E) = \frac{n(E)}{n(S)} = \frac{1}{28,989,675} \approx 0.000000034$$
6. Find the probability of winning the 6-45 lotto game given six tickets.
We note that there are $n(S) = {}_{45}C_6 = \frac{45!}{6!(45 - 6)!} = 8,145,060$ possible combinations in the 6-45 lotto
game. Since n(E) = 6 for six different tickets, we have
$$P(E) = \frac{n(E)}{n(S)} = \frac{6}{8,145,060} \approx 0.000000737$$
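For small experiments, P(E) = n(E)/n(S) can be verified by listing the sample space. The following sketch assumes equally likely outcomes; the helper name `probability` is hypothetical, and `fractions.Fraction` keeps the answers exact.

```python
from fractions import Fraction
from itertools import product

def probability(sample_space, event):
    """P(E) = n(E) / n(S) for equally likely sample points.

    `event` is a function that returns True for the sample points in E.
    """
    favourable = [point for point in sample_space if event(point)]
    return Fraction(len(favourable), len(sample_space))

two_coins = list(product("HT", repeat=2))        # HH, HT, TH, TT
three_coins = list(product("HT", repeat=3))      # 8 sample points
two_dice = list(product(range(1, 7), repeat=2))  # 36 sample points

p_at_least_one_head = probability(two_coins, lambda s: "H" in s)
p_same_results = probability(three_coins, lambda s: len(set(s)) == 1)
p_sum_is_five = probability(two_dice, lambda s: sum(s) == 5)

print(p_at_least_one_head, p_same_results, p_sum_is_five)
```

Enumeration becomes impractical for large sample spaces such as the lotto examples, where the counting formulas are used instead.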
Chapter 5


5.1 Normal Curve

A continuous random variable X having a bell-shaped distribution is called a normal
random variable. The mathematical equation for the probability distribution of the normal
variable depends on the parameters µ and σ, its mean and standard deviation.

Definition 5.1.1
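1. If X is a normal random variable with mean µ and variance σ², then the equation of the normal
curve is
$$n(x; \mu, \sigma) = \frac{1}{\sigma\sqrt{2\pi}} \, e^{-\frac{1}{2}\left(\frac{x - \mu}{\sigma}\right)^2}, \quad -\infty < x < \infty.$$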

2. The distribution of a normal random variable with mean zero and standard deviation equal to
1 is called a standard normal distribution.

Properties of the Normal Curve

1. The mode, which is the point on the horizontal axis where the curve is a maximum, occurs at
x = µ.
2. The curve is symmetric about a vertical axis through the mean µ.
3. The normal curve approaches the horizontal axis asymptotically as we proceed in either
direction away from the mean.
4. The total area under the curve and above the horizontal axis is equal to 1.

5.2 Transformation of Normal Random Variable

The transformation of all the observations of any normal random variable X to a new set of
observations of a normal random variable Z with mean zero and variance 1 is given by
$$z = \frac{x - \mu}{\sigma}.$$

If X is between the values x = x1 and x = x2 (x1 < X < x2), the random variable Z will
fall between the corresponding values $z_1 = \frac{x_1 - \mu}{\sigma}$ and $z_2 = \frac{x_2 - \mu}{\sigma}$ (z1 < Z < z2). Thus,
$$P(x_1 < X < x_2) = P(z_1 < Z < z_2).$$
Example 5.2.1
I. Find the following z values.
1. P (z < 2.64)
P (z < 2.64) = 0.9959
2. P (z < −1.61)
P (z < −1.61) = 0.0537
3. P (z < 0.84)
P (z < 0.84) = 0.7995
4. P (z > 1.38)
P (z > 1.38) = 1 − P (z < 1.38) = 1 0.9162 = 0.0838 or
P (z > 1.38) = P (z < −1.38) = 0.0838
5. P (z > −2.75)
P (z > −2.75) = 1 − P (z < −2.75) = 1 0.0030 = 0.9970 or
P (z > −2.75) = P−(z <−( 2.75)) = P (z < 2.75) = 0.9970
6. P (z > −0.68)
P (z > −0.68) = 1− P (z < −0.68) = 1 0.7517 = 0.2483 or
P (z > −0.68) = P−(z <−( 0.68)) = P (z < 0.68) = 0.2483
7. P (−2.67 < z < 1.32)
P (−2.67 < z < 1.32) = P (z < 1.32) − P (z < −2.67) = 0.9066 − 0.0038 = 0.9028
8. P (1 < z < 2)
P (1 < z < 2) = P (z < 2) − P (z < 1) = 0.9772 − 0.8413 = 0.1359
9. P (−2.74 < z < −0.11)
P (−2.74 < z < −0.11) = P (z < −0.11) − P (z < −2.74) = 0.4562 − 0.0031 = 0.4531
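
The three patterns used in part I (left tail, right tail by complement or symmetry, and area between two values) can be sketched as small helpers. The function names are hypothetical, and the standard library's NormalDist stands in for the printed table.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal: mean 0, standard deviation 1

def left_tail(a):
    """P(z < a): the quantity a cumulative normal table lists directly."""
    return Z.cdf(a)

def right_tail(a):
    """P(z > a), obtained as the complement 1 - P(z < a)."""
    return 1.0 - Z.cdf(a)

def between(a, b):
    """P(a < z < b) = P(z < b) - P(z < a)."""
    return Z.cdf(b) - Z.cdf(a)

# By symmetry, P(z > a) equals P(z < -a); both routes should agree.
print(round(right_tail(1.38), 4), round(left_tail(-1.38), 4))  # table: 0.0838
print(round(left_tail(2.64), 4))                               # table: 0.9959
print(round(between(1, 2), 4))                                 # table: 0.1359
```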

II. Given µ = 50 and σ = 3, find the following probabilities.

1. P (x < 55)
x = 55, z = (x − µ)/σ = (55 − 50)/3 = 5/3 = 1.67
P (x < 55) = P (z < 1.67) = 0.9525

2. P (x < 44)
x = 44, z = (x − µ)/σ = (44 − 50)/3 = −6/3 = −2
P (x < 44) = P (z < −2) = 0.0228
3. P (x > 46)
x = 46, z = (x − µ)/σ = (46 − 50)/3 = −4/3 = −1.33
P (x > 46) = P (z > −1.33) = 1 − P (z < −1.33) = 1 − 0.0918 = 0.9082
4. P (45 < x < 60)
x1 = 45, z1 = (x1 − µ)/σ = (45 − 50)/3 = −5/3 = −1.67
x2 = 60, z2 = (x2 − µ)/σ = (60 − 50)/3 = 10/3 = 3.33
P (45 < x < 60) = P (−1.67 < z < 3.33) = P (z < 3.33) − P (z < −1.67) = 0.9996 − 0.0475 = 0.9521
5. P (51 < x < 57)
x1 = 51, z1 = (x1 − µ)/σ = (51 − 50)/3 = 1/3 = 0.33
x2 = 57, z2 = (x2 − µ)/σ = (57 − 50)/3 = 7/3 = 2.33
P (51 < x < 57) = P (0.33 < z < 2.33) = P (z < 2.33) − P (z < 0.33) = 0.9901 − 0.6293 = 0.3608

Finding the x value and k if P is given

From k = (x − µ)/σ we have x = kσ + µ.

Example 5.2.2 Given µ = 72 and σ = 6, find the values of k and x in the following, where k is the
z value whose tabulated area matches the given probability.

1. P (x < k) = 0.5596
⇒ k = 0.15 and so x = kσ + µ = 0.15(6) + 72 = 0.9 + 72 = 72.9
2. P (x < k) = 0.9406
⇒ k = 1.56 and so x = kσ + µ = 1.56(6) + 72 = 9.36 + 72 = 81.36
3. P (x > k) = 0.2516
P (x > k) = 1 − P (x < k) = 0.2516, thus P (x < k) = 1 − 0.2516 = 0.7484
⇒ k = 0.67 and so x = kσ + µ = 0.67(6) + 72 = 4.02 + 72 = 76.02
4. P (x > k) = 0.6615
P (x > k) = 1 − P (x < k) = 0.6615, thus P (x < k) = 1 − 0.6615 = 0.3385
⇒ k = −0.42 and so x = kσ + µ = −0.42(6) + 72 = −2.52 + 72 = 69.48
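
The reverse lookup can be sketched in code as well. The helper names below are hypothetical, and NormalDist.inv_cdf replaces the step of searching the table for the closest area, so k is not rounded to two decimals and x may differ slightly from the table-based answers.

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal

def x_for_left_area(p, mu, sigma):
    """Return (k, x) such that P(z < k) = p and x = k * sigma + mu."""
    k = Z.inv_cdf(p)
    return k, k * sigma + mu

def x_for_right_area(p, mu, sigma):
    """Return (k, x) such that P(z > k) = p, using P(z < k) = 1 - p."""
    return x_for_left_area(1.0 - p, mu, sigma)

# Values taken from the example: mu = 72, sigma = 6.
k, x = x_for_left_area(0.9406, mu=72, sigma=6)
k_right, x_right = x_for_right_area(0.6615, mu=72, sigma=6)

print(round(k, 2), round(x, 2))
print(round(k_right, 2), round(x_right, 2))
```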

5.3 Applications of the Normal Distribution

Example 5.3.1
1. If n = 300, µ = 50 and σ = 10, find the lowest passing grade if the lowest 10 percent were
given a failing grade.
P (x < k) = 0.10
⇒ k = −1.28 and so x = kσ + µ = −1.28(10) + 50 = −12.8 + 50 = 37.2 ≈ 37
Therefore, the lowest passing grade is approximately 37 when the lowest 10% are given a failing grade.

2. If n = 80, µ = 77 and σ = 5.2, find the maximum grade of the lowest 15.34% of the class.
P (x < k) = 0.1534
⇒ k = −1.02 and so x = kσ + µ = −1.02(5.2) + 77 = −5.304 + 77 = 71.696 ≈ 72
Therefore, 72 is the maximum grade of the lowest 15.34% of the class.

3. If n = 80, µ = 77 and σ = 5.2, find the minimum grade of the highest 23.18% of the class.
P (x > k) = 0.2318
P (x > k) = 1 − P (x < k) = 0.2318, thus P (x < k) = 1 − 0.2318 = 0.7682
⇒ k = 0.73 and so x = kσ + µ = 0.73(5.2) + 77 = 3.796 + 77 = 80.796 ≈ 81
Therefore, 81 is the minimum grade of the highest 23.18% of the class.

4. A certain type of storage battery lasts on the average 3.0 years, with a standard deviation
of 0.5 year. Assuming that the battery lives are normally distributed, find the probability that
a given battery will last less than 2.3 years.
x = 2.3 transforming to z = (x − µ)/σ = (2.3 − 3.0)/0.5 = −0.7/0.5 = −1.4
P (x < 2.3) = P (z < −1.4) = 0.0808

5. An electrical firm manufactures light bulbs that have a length of life that is normally
distributed with mean equal to 800 hours and standard deviation of 40 hours. Find the
probability that a bulb burns between 778 and 834 hours.
x1 = 778 transforming to z1 = (x1 − µ)/σ = (778 − 800)/40 = −22/40 = −0.55
x2 = 834 transforming to z2 = (x2 − µ)/σ = (834 − 800)/40 = 34/40 = 0.85
P (778 < x < 834) = P (−0.55 < z < 0.85) = P (z < 0.85) − P (z < −0.55) = 0.8023 − 0.2912 = 0.5111

6. If the average height of miniature poodles is 30 centimeters, with a standard deviation of
4.1 centimeters, what percentage of miniature poodles exceeds 35 centimeters in height, assuming
that the heights follow a normal distribution and can be measured to any desired degree of
accuracy?
x = 35 transforming to z = (x − µ)/σ = (35 − 30)/4.1 = 5/4.1 = 1.22
P (x > 35) = P (z > 1.22) = 1 − P (z < 1.22) = 1 − 0.8888 = 0.1112.
Therefore, 11.12% of miniature poodles exceed 35 centimeters in height.

7. The grade-point averages of 300 college freshmen follow approximately a normal
distribution with a mean of 2.1 and a standard deviation of 0.8. How many of these
freshmen would you expect to have a grade-point average between 2.45 and 2.76?
x1 = 2.45 transforming to z1 = (x1 − µ)/σ = (2.45 − 2.1)/0.8 = 0.35/0.8 = 0.4375
x2 = 2.76 transforming to z2 = (x2 − µ)/σ = (2.76 − 2.1)/0.8 = 0.66/0.8 = 0.825
P (2.45 < x < 2.76) = P (0.4375 < z < 0.825) = P (z < 0.825) − P (z < 0.4375) ≈
0.7967 − 0.6700 = 0.1267 (using the table entries for z = 0.83 and z = 0.44)
Therefore, 12.67%, or approximately 38 of the 300 freshmen, should have a grade-point average
between 2.45 and 2.76.
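
Application problems that ask "how many" multiply the probability by the group size. The sketch below illustrates this with the grade-point figures from item 7; the function name expected_count is hypothetical, and because the z values are not rounded, the proportion differs slightly from the table-based 0.1267.

```python
from statistics import NormalDist

def expected_count(n, mu, sigma, lo, hi):
    """Proportion of a normal population between lo and hi, and n times it."""
    dist = NormalDist(mu, sigma)
    proportion = dist.cdf(hi) - dist.cdf(lo)
    return proportion, n * proportion

# 300 freshmen, mean 2.1, standard deviation 0.8, interval 2.45 to 2.76.
proportion, count = expected_count(300, 2.1, 0.8, 2.45, 2.76)

print(round(proportion, 4), round(count))
```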
Chapter 6


6.1 Statistical Hypothesis

Definition 6.1.1
1. A statistical hypothesis is an assertion or conjecture concerning one or more populations.
2. The procedure for establishing a set of rules that lead to the acceptance or rejection of a
statistical hypothesis is called hypothesis testing.
3. A hypothesis formulated with the hope that it will be rejected is called a null
hypothesis and is denoted by H0. The rejection of H0 leads to the acceptance of an alternative
hypothesis, denoted by H1.

6.2 Level of Significance

Definition 6.2.1
1. Rejection of the null hypothesis when it is true is called a type I error.
2. Acceptance of the null hypothesis when it is false is called a type II error.
The probability of committing a type I error is called the level of significance of the test
and is denoted by α. Thus, α = P (type I error).
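
The meaning of α as a probability can be illustrated by simulation: if H0 is true and the test is repeated many times, H0 should be wrongly rejected in roughly a fraction α of the repetitions. The sketch below is an assumption-laden illustration, not a method from these notes: it uses a two-sided z criterion, made-up values µ0 = 8, σ = 0.5, n = 50, and a fixed random seed.

```python
import random
from statistics import NormalDist, fmean

def type_one_error_rate(mu0, sigma, n, alpha, trials, seed=0):
    """Fraction of samples drawn under a true H0 that still lead to rejection."""
    rng = random.Random(seed)
    z_crit = NormalDist().inv_cdf(1.0 - alpha / 2.0)  # two-sided critical value
    rejections = 0
    for _ in range(trials):
        # Sample from the population described by H0, so H0 is true here.
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        z = (fmean(sample) - mu0) / (sigma / n ** 0.5)
        if abs(z) > z_crit:
            rejections += 1  # rejecting a true H0 is a type I error
    return rejections / trials

rate = type_one_error_rate(mu0=8.0, sigma=0.5, n=50, alpha=0.05, trials=4000)
print(rate)  # expected to be near alpha = 0.05, not exactly equal
```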

6.3 One-Tailed and Two-Tailed Tests

Definition 6.3.1
1. A test of any statistical hypothesis where the alternative is one-sided, such as

H0 : θ = θ0,
H1 : θ > θ0,

or

H0 : θ = θ0,
H1 : θ < θ0,

is called a one-tailed test.

2. A test of any statistical hypothesis where the alternative is two-sided, such as

H0 : θ = θ0,
H1 : θ ≠ θ0,

is called a two-tailed test. The alternative hypothesis θ ≠ θ0 states that either θ < θ0 or θ > θ0, that is,

H0 : θ = θ0,
H1 : θ < θ0 or θ > θ0.

3. A test is significant if the null hypothesis is rejected at the 0.05 level of significance and
is considered to be highly significant if the null hypothesis is rejected at the 0.01 level of significance.

Steps in Hypothesis Testing

1. State the null hypothesis H0 that θ = θ0.
2. Choose an appropriate alternative hypothesis H1 from one of the alternatives θ < θ0, θ > θ0 or
θ ≠ θ0.
3. Choose a level of significance α.
4. Select the appropriate test statistic and establish the critical region.
5. Compute the value of the test statistic from the sample data.
6. Decision: Reject H0 if the test statistic has a value in the critical region; otherwise, accept H0.

6.3.1 Test Concerning Means: Single Means

H0        Value of Test Statistic                    H1        Critical Region

µ = µ0    z = (x̄ − µ0)/(σ/√n); σ known or n ≥ 30     µ < µ0    z < −zα
                                                     µ > µ0    z > zα
                                                     µ ≠ µ0    z < −zα/2 and z > zα/2
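
The steps of the test for a single mean can be sketched as one function. The function name, the argument names, and the labels "less", "greater" and "two-sided" are hypothetical; critical values come from NormalDist.inv_cdf rather than a printed table, so they are not rounded to two decimals.

```python
import math
from statistics import NormalDist

def z_test_single_mean(xbar, mu0, sigma, n, alpha, alternative):
    """Return (z, reject) for H0: mu = mu0 against the chosen alternative.

    alternative is "less" (mu < mu0), "greater" (mu > mu0)
    or "two-sided" (mu != mu0).
    """
    # Step 5: value of the test statistic from the sample data.
    z = (xbar - mu0) / (sigma / math.sqrt(n))

    # Step 4: critical region for the chosen alternative and alpha.
    std = NormalDist()
    if alternative == "less":
        reject = z < -std.inv_cdf(1.0 - alpha)
    elif alternative == "greater":
        reject = z > std.inv_cdf(1.0 - alpha)
    elif alternative == "two-sided":
        reject = abs(z) > std.inv_cdf(1.0 - alpha / 2.0)
    else:
        raise ValueError("alternative must be 'less', 'greater' or 'two-sided'")

    # Step 6: reject H0 only if z falls in the critical region.
    return z, reject

# Illustrative call: sample mean 7.8, hypothesized mean 8, sigma 0.5, n = 50.
z, reject = z_test_single_mean(7.8, 8.0, 0.5, 50, alpha=0.01, alternative="two-sided")
print(round(z, 2), reject)
```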

Example 6.3.2
1. A manufacturer of sports equipment has developed a new synthetic fishing line that he
claims has a mean breaking strength of 8 kilograms with a standard deviation of 0.5
kilogram. Test the hypothesis that µ = 8 kilograms against the alternative that µ ≠ 8
kilograms if a random sample of 50 lines is tested and found to have a mean breaking
strength of 7.8 kilograms. Use a 0.01 level of significance.
1. H0 : µ = 8 kilograms.
2. H1 : µ ≠ 8 kilograms.
3. α = 0.01
4. Critical region: z < −2.58 and z > 2.58, where z = (x̄ − µ0)/(σ/√n).
5. Computations: x̄ = 7.8 kilograms, σ = 0.5 kilogram, n = 50, and hence
z = (7.8 − 8)/(0.5/√50) = −2.83
6. Decision: Reject H0 and conclude that the average breaking strength is not equal to 8.

2. A random sample of 100 recorded deaths in the United States during the past year showed
an average life span of 71.8 years, with a standard deviation of 8.9 years. Does this seem to
indicate that the average life span today is greater than 70 years? Use a 0.05 level of
significance. Solution:
1. H0 : µ = 70 years.
2. H1 : µ > 70 years.
3. α = 0.05
4. Critical region: z > 1.65, where z = (x̄ − µ0)/(σ/√n).
5. Computations: x̄ = 71.8 years, σ = 8.9 years, n = 100, and hence
z = (71.8 − 70)/(8.9/√100) = 2.02
6. Decision: Reject H0 and conclude that the average life span today is greater than 70 years.

3. The average length of time for students to register for summer classes at a certain college
has been 50 minutes with a standard deviation of 10 minutes. If a random sample of 60
students had an average registration time of 52 minutes, test the hypothesis that the
population mean is now less than 50, using a level of significance of 0.025.
1. H0 : µ = 50 minutes.
2. H1 : µ < 50 minutes.
3. α = 0.025
4. Critical region: z < −1.96, where z = (x̄ − µ0)/(σ/√n).
5. Computations: x̄ = 52 minutes, σ = 10 minutes, n = 60, and hence
z = (52 − 50)/(10/√60) = 1.55
6. Decision: Accept H0; the sample gives no evidence that the average registration time is now less than 50 minutes.

6.4 Exercise
1. An electrical firm manufactures light bulbs that have a length of life that is approximately
normally distributed with a mean of 800 hours and a standard deviation of 40 hours. Test the
hypothesis that µ = 800 hours against the alternative µ ≠ 800 hours if a random sample of 30
light bulbs has an average life span of 788 hours. Use a 0.04 level of significance.

2. The average height of females in the freshman class of a certain college has been 160.5
centimeters with a standard deviation of 6.9 centimeters. Is there a reason to believe that
there has been a change in the average height if a random sample of 50 females in the present
freshman class has an average height of 162.5 centimeters? Use a 0.02 level of significance.

3. Test the hypothesis that the average content of containers of a particular lubricant is 10
liters if the contents of a random sample of 30 containers are 10.2, 9.7, 10.1, 10.3, 10.1, 9.8,
9.9, 10.4, 10.3, 9.8, 10.21, 9.71, 10.11, 10.31, 10.11, 9.81, 9.91, 10.41, 10.31, 9.81, 9.83, 10.33,
10.43, 9.93, 9.83, 10.13, 10.33, 10.13, 9.73, and 10.23. Use a 0.01 level of significance and
assume that the distribution of contents is normal.
