Академический Документы
Профессиональный Документы
Культура Документы
probabilities
Differentiate between the binomial and normal
distributions
Describe the concepts and uses of the standard
normal distribution
2/13/2020 2
Probability is:
3. The exam has five questions and each question has four multiple
choice in which one of the choice is the correct answer. If a
student answers all the question by guess, what is the chance that
he will answer 3 out of 5 correctly?
4. A mother in the delivery room to give birth and the health worker
informed her as she will deliver at 9:30 pm. She is eager to give birth
of a daughter. What is the probability that she will get what she
2/13/2020 4
wants?
Why Probability in Statistics?
2/13/2020 5
Why Probability in Healthcare?
2/13/2020 9
Types of Probability (Continued)
B. Subjective Probability
• Personalistic (represents one’s degree of belief in the
occurrence of an event).
• different for all observers (subjective)
• Examples:
Personal assessment of which is more effective to provide cure
– traditional/modern medicine
Personal assessment of which sports team will win a match.
• It also uses classical and relative frequency methods
to assess the likelihood of an event.
2/13/2020 10
• E.g., If someone says that he is 95% certain that a cure
for AIDS will be discovered within 5 years, then he
means that:
2/13/2020 11
Classical Probability
Example:
• Rolling a die -
– There are 6 possible outcomes:
– Total ways = {1, 2, 3, 4, 5, 6}.
• Each is equally likely
– P(i) = 1/6, i=1,2,...,6.
P(1) = 1/6
P(2) = 1/6
…….
P(6) = 1/6
2/13/2020 SUM = 1 12
• Definition: If an event can occur in N mutually exclusive
and equally likely ways, and if m of these posses a
characteristic, E, the probability of the occurrence of E
= m/N.
2/13/2020 13
• Another “equally likely” setting is the tossing of a
coin –
– There are 2 possible outcomes in the set of all
possible outcomes {H, T}.
P(H) = 0.5
P(T) = 0.5
SUM = 1
2/13/2020 14
Relative Frequency Probability
2/13/2020 15
• If you toss a coin 100 times and head comes up 40
times,
P(H) = 40/100 = 0.4.
• If we toss a coin 10,000 times and the head comes up
5562,
P(H) = 0.5562.
• Therefore, the longer the series and the longer sample
size, the closer the estimate to the true value.
2/13/2020 16
• Since trials cannot be repeated an infinite number
of times, theoretical probabilities are often
estimated by empirical probabilities based on a
finite amount of data
• Example: Of 158 people who attended a dinner
party, 99 were ill.
P (Illness) = 99/158 = 0.63 = 63%.
• In 1998, there were 2,500,000 registered live births; of
these, 200,000 were LBW infants.
• Therefore, the probability that a newborn is LBW is
estimated by
P (LBW) = 200,000/2,500,000
2/13/2020 17
= 0.08
Basic Definitions
• Experiment: Process that leads to one of several
possible outcomes *, e.g.:
Coin toss (Heads, Tails)
Throw die (1, 2, 3, 4, 5, 6)
Pick a card (AH, KH, QH, ...)
Introduce a new product
• Each trial of an experiment has a single observed
outcome.
• The precise outcome of a random experiment is
unknown before a trial.
2/13/2020 18
Basic Definitions…
Sample Space or Event Set: Set of all possible
outcomes (universal set) for a given experiment
E.g.: Roll a regular six-sided die
– S = {1,2,3,4,5,6}
2/13/2020 19
Basic Definitions…
Set - a collection of elements or objects of interest
Empty set (denoted by )
a set containing no elements
2/13/2020 20
Complement of a Set
2/13/2020 22
Sets: A Intersecting with B
A
B
A B
2/13/2020 23
• The union of A and B, A U B, is the event that either
A happens or B happens or they both happen
simultaneously
P ( A or B ) = P ( A U B )
• In the example above, the union of A and B is the
event that the newborn is either LBW or from a
multiple birth, or both
2/13/2020 24
Sets: A Union B
A
B
2/13/2020
A B 25
Mutually Exclusive Events
sets having no elements in common, having no
intersection, whose intersection is the empty set
Two events A and B are mutually exclusive if they
cannot both happen at the same time
P (A ∩ B) = 0
• Example:
– A coin toss cannot produce heads and tails simultaneously.
– Weight of an individual can’t be classified simultaneously as
“underweight”, “normal”, “overweight”
2/13/2020 26
Mutually Exclusive or Disjoint Sets
B
A
2/13/2020 27
Independent Events
• If the probability of the first one happening is the
same no matter how the second one turns out. OR
• The outcome of one event has no effect on the
occurrence or non-occurrence of the other.
P(A∩B) = P(A) x P(B) (Independent events)
P(A∩B) ≠ P(A) x P(B) (Dependent events)
Example:
– The outcomes on the first and second coin tosses
are independent
2/13/2020 28
The birth of a son or a daughter
are mutually exclusive events
because the two events will not
happen at the same time.
2/13/2020 29
Properties of Probability
1. The numerical value of a probability always lies
between 0 and 1, inclusive.
0 P(E) 1
A value 0 means the event can not occur
A value 1 means the event definitely will occur
A value of 0.5 means that the probability that the
event will occur is the same as the probability that it
will not occur.
2/13/2020 30
Properties of Probability…
2/13/2020 31
Properties of Probability…
2/13/2020 32
• In the example, the complement of A is the event that
a newborn is not LBW
• In other words, A is the event that the child weighs
2500 grams at birth
P(Ā) = 1 − P(A)
P(not low bwt) = 1 − P(low bwt)
= 1− 0.076
= 0.924
2/13/2020 33
Basic Probability Rules
1. Addition rule
If events A and B are mutually exclusive:
P(A or B) = P(A) + P(B)
P(A and B) = 0
More generally:
P(A or B) = P(A) + P(B) - P(A and B)
P(event A or event B occurs or they both occur)
2/13/2020 34
Example: The probabilities below represent years of
schooling completed by mothers
2/13/2020 35
• What is the probability that a mother has
completed < 12 years of schooling?
P( 8 years) = 0.056 and
P(9-11 years) = 0.159
• Since these two events are mutually exclusive,
P( 8 or 9-11) = P( 8 U 9-11)
= P( 8) + P(9-11)
= 0.056+0.159
= 0.215
2/13/2020 36
• What is the probability that a mother has completed 12
or more years of schooling?
P(12) = P(12 or 13-15 or 16)
= P(12 U 13-15 U 16)
= P(12)+P(13-15)+P(16)
= 0.321+0.218+0.230
= 0.769
2/13/2020 37
If A and B are not mutually exclusive events, then
subtract the overlapping:
P(AU B) = P(A)+P(B) − P(A ∩ B)
2/13/2020 38
Basic Probability Rules …
2. Multiplication rule
– If A and B are independent events, then
P(A ∩ B) = P(A) × P(B)
– More generally,
P(A ∩ B) = P(A) P(B|A) = P(B) P(A|B)
P(A and B) denotes the probability that A and B
both occur at the same time.
2/13/2020 39
Conditional Probability
• Refers to the probability of an event, given that
another event is known to have occurred.
• “What happened first is assumed”
• Think in stages. Think of the two events A and B
occurring chronologically, one after the other, either
in time or space.
2/13/2020 40
Conditional Probability
P( A B)
P( A B) , where P( B) 0
P( B)
Independent events:
P( A B) P( A)
P( B A) P( B)
2/13/2020 41
Conditional Probability (continued)
P( A B) P( A B) so P( A B) P( A B) P( B)
P( B)
P( B A) P( A)
P ( A B ) P ( A)
so P( A B) P( A)P(B)
P ( B / A) P ( B )
2/13/2020 42
Example for Conditional Probability
A study investigating the effect of prolonged exposure to
bright light on retina damage in premature infants.
Retinopathy Retinopathy TOTAL
YES NO
Bright light 18 3 21
Reduced light 21 18 39
TOTAL 39 21 60
2/13/2020 44
Conditional Probability (continued)
= 18/21 = 0.86
2/13/2020 45
Conditional Probability (continued)
= 21/39 = 0.54
2/13/2020 47
Example
• In a study of optic-nerve degeneration in Alzheimer’s
disease, postmortem examinations were conducted on
10 Alzheimer’s patients. The following table shows the
distribution of these patients according to sex and
evidence of optic-nerve degeneration.
2/13/2020 48
Optic-nerve Degeneration
Sex
Present Not Present
Female 4 1
Male 4 1
2/13/2020 49
Solution
• P(Optic-nerve degeneration/Female) =
.
Order the letters: A, B, and C
C
. ABC
. .. B
C B .. ACB
. . .
A
B A C
.. BAC
. .
C
C
A
A
B
BCA
2/13/2020
. B
A
. CAB
CBA
52
Factorial
P n!
n r (n r )!
Forex ample :
6! 6! 6 * 5 * 4 * 3 * 2 *1
P 6 * 5 * 4 120
(6 3)!
6 3
2/13/2020 3! 3 * 2 *1 54
Combinations (Order is not Important)
Suppose that when we pick 3 letters out of the 6 letters A, B, C, D, E, and F
we chose BCD, or BDC, or CBD, or CDB, or DBC, or DCB. (These are the
6 (3!) permutations or orderings of the 3 letters B, C, and D.) But these are
orderings of the same combination of 3 letters. How many combinations of 6
different letters, taking 3 at a time, are there?
2/13/2020 56
• Random variables can be either discrete or
continuous
2/13/2020 57
• With categorical variables, we obtain the frequency
distribution of each variable
• With numeric variables, the aim is to determine
whether or not normality may be assumed
– If not we may consider transforming the variable or
categorize it for analysis (eg age group)
0 ≤ P(X = x) ≤ 1
∑ P(X = x) = 1
2/13/2020 60
The following data shows the number of diagnostic
services a patient receives
2/13/2020 61
• What is the probability that a patient receives exactly 3
diagnostic services?
P(X=3) = 0.031
• What is the probability that a patient receives at most
one diagnostic service?
P (X≤1) = P(X = 0) + P(X = 1)
= 0.671 + 0.229
= 0.900
• What is the probability that a patient receives at least
four diagnostic services?
P (X≥4) = P(X = 4) + P(X = 5)
= 0.010 + 0.006
2/13/2020 = 0.016 62
Probability distributions can also
be displayed using a graph
0.8
Probability, X=x
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0 1 2 3 4 5
No. of diagnostic services, x
2/13/2020 63
The Expected Value of a Discrete
Random variable
• If a random variable is able to take on a large number
of values, then a probability mass function might not
be the most useful way to summarize its behavior
2/13/2020 64
The Expected Value …
• It is represented by E(X) or µ
2/13/2020 65
• Example: For the diagnostic service data:
2/13/2020 66
The Variance of a Discrete Random
Variable
2/13/2020 67
Example: For the diagnostic service data
σ2 = ∑(xi-µ)2P(X=xi)
= (0− 0.5)2(0.671) +(1 − 0.5)2(0.229)
+(2 − 0.5)2(0.053) +(3 − 0.5)2(0.031)
+(4 − 0.5)2(0.010) +(5 − 0.5)2(0.006)
= 0.782
2/13/2020 68
• Examples of discrete probability distributions
are the binomial distribution and the Poisson
distribution.
2/13/2020 69
Binomial Distribution
2/13/2020 70
Binomial Distribution…
2/13/2020 71
• The outcomes are mutually exclusive and
exhaustive
• Suppose that 72% of infants born survive to age
70 years
P(Y = 1) = p = 0.72
P(Y = 0) = 1 − p = 0.28
2/13/2020 72
A binomial probability distribution occurs
when the following requirements are met.
• P (X=x) = , x = 0, 1, 2, ..., n.
2/13/2020 74
• n denotes the number of fixed trials
• x denotes the number of successes in
the n trials
• p denotes the probability of success
• q denotes the probability of failure (1- p)
2/13/2020 76
• If the probability that any individual in the
population is a smoker to be P=0.40, then the
probability that x=4 smokers out of n=10 subjects
selected is:
P(X=4) =10C4(0.4)4(1-0.4)10-4
= 10C4(0.4)4(0.6)6 = 210(.0256)(.04666)
= 0.25
2/13/2020 78
2/13/2020 79
The probability in the above table can
be converted into the following graph
0.3
0.25
Probability
0.2
0.15
0.1
0.05
0
0 1 2 3 4 5 6 7 8 9 10
No. of Smokers
2/13/2020 80
Exercise
Each child born to a particular set of parents
has a probability of 0.25 of having blood type
O. If these parents have 5 children.
What is the probability that
a. Exactly two of them have blood type O
b. At most 2 have blood type O
c. At least 4 have blood type O
d. 2 do not have blood type O.
2/13/2020 81
Solution for ‘a’
5
P(x 2) = (0.25) (0.75)
2 5-2
2
0.2637
2/13/2020 82
The Mean and Variance of a
Binomial Distribution
• Once n and P are specified, we can compute the
proportion of success,
P = x/n
2/13/2020 83
Example:
• 70% of a certain population has been immunized for
polio. If a sample of size 50 is taken, what is the
“expected total number”, in the sample who have been
immunized?
µ = np = 50(.70) = 35
2/13/2020 85
Poisson Distribution
the distribution of the number of occurrences x of some
random event in an interval of time or space, or some volume
of matter.
For example, a hospital administrator has been studying
daily emergency admissions over a period of several months
and has found that admissions have averaged three per day.
He or she is then interested in finding the probability that no
emergency admissions will occur on a particular day.
2/13/2020 86
2/13/2020 87
2/13/2020 88
2/13/2020 89
B. Continuous Probability
Distributions
• A continuous random variable X can take on any
value in a specified interval or range
2/13/2020 90
Distribution of serum
triglyceride
2/13/2020 91
• Instead of assigning probabilities to specific outcomes
of the random variable X, probabilities are assigned to
ranges of values
• The probability associated with any one particular
value is equal to 0
• Therefore, P(X=x) = 0
• Also, P(X ≥ x) = P(X > x)
• We calculate:
Pr [ a < X < b], the probability of an
interval of values of X.
2/13/2020 92
The Normal distribution (ND)
• The ND is the most important probability distribution in
statistics
• Frequently called the “Gaussian distribution” or bell-
shape curve.
• Variables such as blood pressure, weight, height, serum
cholesterol level, and IQ score — are approximately
normally distributed
2/13/2020 93
A random variable is said to have a normal distribution
if it has a probability distribution that is symmetric and
bell-shaped
2/13/2020 94
• The ND is vital to statistical work, most estimation
procedures and hypothesis tests underlie ND
• The concept of “probability of X=x” in the discrete
probability distribution is replaced by the “probability
density function f(x)
• The ND is also an approximating distribution to other
distributions (e.g., binomial)
2/13/2020 95
• A random variable X is said to follow ND, if
and only if, its probability density function
is:
1 x-
2
1
2
f(x) = e , - < x < .
2
2/13/2020 96
π (pi) = 3.14159
e = 2.71828, x = Value of X
σ2 = Variance of X.
2/13/2020 97
2/13/2020 98
1. The mean µ tells you about location -
– Increase µ - Location shifts right
– Decrease µ – Location shifts left
– Shape is unchanged
2. The variance σ2 tells you about narrowness or flatness
of the bell -
– Increase σ2 - Bell flattens. Extreme values are
more likely
– Decrease σ2 - Bell narrows. Extreme values are
less likely
– Location is unchanged
2/13/2020 99
2/13/2020 100
Properties of the Normal Distribution
1. It is symmetrical about its mean, .
2/13/2020 101
5. As the value of increases, the curve becomes more
and more flat and vice versa.
6. Perpendiculars of:
± 1SD contain about 68%;
±2 SD contain about 95%;
±3 SD contain about 99.7%
of the area under the curve.
Next slide
7. The distribution is completely determined by the
parameters and .
2/13/2020 102
2/13/2020 103
• We have different normal distributions depending on
the values of μ and σ2.
• We cannot tabulate every possible distribution
• Tabulated normal probability calculations are
available only for the ND with µ = 0 and σ2=1.
2/13/2020 104
Normal Probability Distributions
All of these are normal probability density functions, though each has a different mean and
variance.
Normal Distribution: =40, =1
0.4
Normal Distribution: =30, =5 Normal Distribution: =50, =3
0.2 0.2
0.3
f(w)
0.2
f(x)
f(y)
0.1 0.1
0.1
0.0
35 40 45
0.0 0.0
w 0 10 20 30 40 50 60 35 45 50 55 65
x y
0.2
2/13/2020
Z~N(0,1) 105
Standard Normal Distribution
2/13/2020 106
The Standard Normal Distribution
The standard normal random variable, Z, is the normal random
variable with mean = 0 and standard deviation = 1: Z~N(0,12).
0 .4
0 .3
=1
f(z)
{
0 .2
0 .1
0 .0
-5 -4 -3 -2 -1 0 1 2 3 4 5
=0
Z
Approximately 68% of the area under the standard normal curve lies
between ±1, about 95% between ±2, and about 99% between ±2.5 SD
2/13/2020 107
Z - Transformation
Value (x)
50 65 80 95 110 125 140 155 170
-4 -3 -2 -1 0 1 2 3 4
SDs from mean using
(x-110)/15 = (x-μ)/σ
2/13/2020 109
Finding normal curve areas
1. The table gives areas between +∞ and the value of Zo.
3. Read the value of the area (P) from the body of the
table where the row and column intersect. Values of P
are in the form of a decimal point and four places.
2/13/2020 110
Some Useful Tips
2/13/2020 111
Standard Normal
Table
P(Z>2.575) Z
0.0 2.575
2/13/2020 112
2/13/2020 113
Finding Probabilities of the Standard
Normal Distribution: P(0 < Z < 1.56)
Standard Normal Probabilities
Standard Normal Distribution z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
0.0 0.0000 0.0040 0.0080 0.0120 0.0160 0.0199 0.0239 0.0279 0.0319 0.0359
0.4 0.1 0.0398 0.0438 0.0478 0.0517 0.0557 0.0596 0.0636 0.0675 0.0714 0.0753
0.2 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064 0.1103 0.1141
0.3 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443 0.1480 0.1517
0.3 0.4 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808 0.1844 0.1879
0.5 0.1915 0.1950 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157 0.2190 0.2224
0.6 0.2257 0.2291 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486 0.2517 0.2549
f(z)
0.2 0.7 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794 0.2823 0.2852
0.8 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078 0.3106 0.3133
0.9 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340 0.3365 0.3389
0.1 1.0 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577 0.3599 0.3621
1.56 1.1 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790 0.3810 0.3830
{
1.2 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980 0.3997 0.4015
0.0 1.3 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147 0.4162 0.4177
-5 -4 -3 -2 -1 0 1 2 3 4 5 1.4 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279 0.4292 0.4306 0.4319
Z 1.5 0.4332 0.4345 0.4357 0.4370 0.4382 0.4394 0.4406 0.4418 0.4429 0.4441
1.6 0.4452 0.4463 0.4474 0.4484 0.4495 0.4505 0.4515 0.4525 0.4535 0.4545
1.7 0.4554 0.4564 0.4573 0.4582 0.4591 0.4599 0.4608 0.4616 0.4625 0.4633
1.8 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706
1.9 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767
2.0 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817
2/13/2020 114
Finding Probabilities of the Standard
Normal Distribution: P(1< Z < 2)
To find P(1 Z 2): z
.
.00
.
...
. .
1. Find table area for 2.00 .
0.9
.
0.3159 ...
0.3
Area between 1 and 2
P(1 Z 2) = .9772 - .8413 = 0.1359
f(z)
0.2
0.1
0.0
-5 -4 -3 -2 -1 0 1 2 3 4 5
Z
2/13/2020 115
Finding Probabilities of the Standard
Normal Distribution: P(Z < -2.47)
z ... .06 .07 .08
To find P(Z<-2.47): . . . .
. . . .
Find table area for 2.47 . . . .
P(0 < Z < 2.47) = .4932 2.3 ... 0.4909 0.4911 0.4913
2.4 ... 0.4931 0.4932 0.4934
P(Z < -2.47) = .5 - P(0 < Z < 2.47)2.5 ... 0.4948 0.4949 0.4951
= .5 - .4932 = 0.0068 .
.
.
0.2
0.1
0.0
-5 -4 -3 -2 -1 0 1 2 3 4 5
Z
2/13/2020 116
Finding Values of the Standard Normal
Random Variable: P(0 < Z < z) = 0.40
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
To find z such that 0.0
0.1
0.0000
0.0398
0.0040
0.0438
0.0080
0.0478
0.0120
0.0517
0.0160
0.0557
0.0199
0.0596
0.0239
0.0636
0.0279
0.0675
0.0319
0.0714
0.0359
0.0753
0.2 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064 0.1103 0.1141
0.3 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443 0.1480 0.1517
0.2
0.0
2/13/2020 120
Applications of the Normal
Distribution
• The ND is used as a model to study many different
variables.
2/13/2020 121
Example:
• The diastolic blood pressures of males 35–44 years of
age are normally distributed with µ = 80 mm Hg and σ2
= 144 mm Hg2
σ = 12 mm Hg
2/13/2020 122
a. What is the probability that a randomly selected male
has a BP above 95 mm Hg?
2/13/2020 123
b. What is the probability that a randomly selected male
has a DBP above 110 mm Hg?
Z = 110 – 80 = 2.50
12
2/13/2020 124
c. What is the probability that a randomly selected male
has a DBP below 60 mm Hg?
Z = 60 – 80 = -1.67
12
2/13/2020 125
d. What value of DBP cuts off the upper 5% of this
population?
• Looking at the table, the value Z = 1.645 cuts off an
area of 0.05 in the upper tail
• We want the value of X that corresponds to Z = 1.645
Z=X–μ
σ
1.645 = X – μ, X = 99.7
σ
• Approximately 5% of the men in this population have
a DBP greater than 99.7 mm Hg
2/13/2020 126
Other Distributions
1. Student t-distribution
2. F- Distribution
3. 2 -Distribution
2/13/2020 127
The student t- distribution
• We use the normal distribution
if our sample is taken from a normally distributed
population whose population standard deviation (δ) is
known, or
if the sample size is large (i.e. greater than 30) so
that we can use the Central Limit Theorem (CLT).
• The student t-distribution (t- statistic) is used only when we
get the following three conditions:
a sample from normally distributed population is used,
Population variance is unknown, and
the sample size is small (less than 30)
2/13/2020 128
Otherwise we use non-parametric test
Student’s t distributions cont…
Unimodal;
Asymptotic to the horizontal axis;
Symmetrical about zero,
Dependent on n or the degrees of freedom (v = n-1);
More variable than the standard normal distribution;
If degree of freedom or v is large, approximately the same as
the standard normal distribution
2/13/2020 129
Student’s t distributions cont…
The test - statistic for students t-distribution is :
i.e.
2/13/2020 130
Table : Student’s t-distribution
t table with right tail probabilities
2/13/2020 131
Table : Student’s t-distribution cont…
2/13/2020 132
Table 3: Chi-square table
Right tail areas for the Chi-square Distribution
d/\area .995 .990 .975 .950 .900 .750 .500 .250 .100 .050 .025 .010 .005
1 0.0000 0.0001 0.0009 0.0039 0.0157 0.1015 0.4549 1.3233 2.7055 3.8414 5.0238 6.6349 7.8794
2 0.0100 0.0201 0.0506 0.1025 0.2107 0.5753 1.3862 2.7725 4.6051 5.9914 7.3777 9.2103 10.596
3 0.0717 0.1148 0.2158 0.3518 0.5843 1.2125 2.3659 4.1083 6.2513 7.8147 9.3484 11.344 12.838
4 0.2069 0.2971 0.4844 0.7107 1.0636 1.9225 3.3566 5.3852 7.7794 9.4877 11.143 13.276 14.860
5 0.4117 0.5543 0.8312 1.1454 1.6103 2.6746 4.3514 6.6256 9.2363 11.070 12.832 15.086 16.749
6 0.6757 0.8720 1.2373 1.6353 2.2041 3.4546 5.3481 7.8408 10.644 12.591 14.449 16.811 18.547
7 0.9892 1.2390 1.6898 2.1673 2.8331 4.2548 6.3458 9.0375 12.017 14.067 16.012 18.475 20.277
8 1.3444 1.6465 2.1797 2.7326 3.4895 5.0706 7.3441 10.218 13.361 15.507 17.534 20.090 21.954
2/13/2020 133
Test of associations for 2x2 tables
2/13/2020 134
The Chi-squared test (2-test with d.f. = (r-1)x(c-1))
nad bc
2
2
for 2 x 2 table
(a c)(b d )( a b)(c d )
2
Oij Eij
2
2/13/2020 135
The Chi-squared test…..
2/13/2020 136
Example
• Consider the following 2X2 table
Head injury Wearing helmet
Yes No Total
2/13/2020 137
THANKS
2/13/2020 138