Академический Документы
Профессиональный Документы
Культура Документы
A good condence interval is as narrow as possible and has a condence coecient near to 1. The narrower the
interval, the more exactly we have located the parameter. The larger the condence coecient, the more the
condence we have that a particular interval enclose the parameter.
Using the equation above, the (1)100% condence interval (CI) for is obtained as
For example, if =0 .05, then z/ 2 = z0.05/ 2 = z0.025 =1 .96 (from Normal Table).
EXAMPLE 1: Unoccupied seats on flights cause airlines to lose revenue. Suppose a large airline wants to estimate its
average number of unoccupied seats per flight over the past year. 225 flight records are randomly selected and
the number of unoccupied seats is noted, with a sample mean of 11.6 seats and a standard deviation of 4.1 seats.
Calculate a 90 percent confidence interval for , the mean number of unoccupied seats per flight during the past year.
Answer: n = 225
x-bar = 11.6
standard deviation, = 4.1
Margin of error, E = s.e.* Z
s.e. = / n = 4.1/225 = 0.2733
Z = 1.645 at 90% confidence
So, E = 0.2733x1.645 = 0.4496 = 0.45 (approx.)
CONCLUSION: We are 90% condent that the true mean number of unoccupied seats () will lie between 11.15 to 12.05.
EXAMPLE 2:
A large company wants to estimate the average amount of money that is owed to it by delinquent customers, .
The company takes a random sample of n=100 of its delinquent accounts. The mean of this sample is $230 and the
standard deviation is $90. Construct a 95% and 99% confidence interval for the population parameter .
(1.96)/
230 17.64
212.36 247.64
206.78 253.22
EXAMPLE 3:
Unoccupied seats on flights cause airlines to lose revenue. A large airline wants to estimate its average number of
unoccupied seats per flight over the past year. To accomplish this, the records of 225 flights are randomly selected, and the
number of unoccupied seats is noted for each of the sampled flights. If you do the calculation by hand: The summary statisti
number of unoccupied seats for the 225 flights was: sample mean 11.6, and standard deviation 4.1 (t* = 1.97).
a. Find the 95% confidence interval for the population mean (make sure you check the conditions before you make your calcu
b. Interpret it IN CONTEXT.
c. Find the margin of error.
d. List two ways the margin of error could be reduced.
a xbar std.error
11.6 0.535733
Condence Interval
11.0642667 12.13573
b We are 95% condent that our estimate of 11,6 is within 0.54 of the true number of unoccupied seats
We are 95% condent that the true mean unoccupied seats ( ) will lie between 11.06 to 12.14.
c Margin of error, E = s.e.* Z
s.e. = / n = 4.1/225 = 0.2733
EXAMPLE 4:
What is the condence level for each of the following condence intervals for m ?
x = xbar
1.96(/)
1.645(/)
2.575(/)
1.282(/)
.99(/)
EXAMPLE 5:
Find Z/2 for each of the following:
a. = 0.10 b. = 0.01
c. = 0.05 d. = 0.2 FOLLOW THE SOLUTION WITH THE SAME COLORED CELLS
EXPLANATIONS FOR THE SOLUTIONS ON THE RIGHT
/2 Blues
0.05 0.005
0.025 0.1
greens
0,5 - /2
See Z-Table 0.4500 0.4950 pinks
to nd it 0.4750 0.4000
oranges
Write the row 1.645 2.578
and colums 1.96 1.28
EXAMPLE 6:
A random sample of 70 observations from a normally distributed population possesses a sample mean equal to 26,2
and a standard deviation equal to 4,1.
Answers
a 1- =.95 Z /2 = 0,5/2 25.24 We are 95% condent that the true mean of the population will be between 25,24 and
= 0.05 0.025 27.16
b The Condence coecient of .95 means that in repeated sampling , 95% of all condence
interval constructed will include
c 1- =.99 Z /2 = 0,5/2 24.94
= 0.01 0.005 27.46
d As the condence coecient increases, the width of the condence
interval also increases
e Yes. Since the sample size is 70, the Central Limit Theorem applies.
This ensures the distribution of is normal,
regardless of the original distribution.
TH CONFIDENCE INTERVALS
arrower the
s to estimate its
n of 4.1 seats.
ring the past year.
ustomers, .
is $230 and the
ge number of
omly selected, and the
nd: The summary statistics for the
(t* = 1.97).
fore you make your calculations)
noccupied seats
ean equal to 26,2
n was not normal? Explain
CS
TI
CHAPTER: 5 INFERENCE BASED ON A SINGLE SAMPLE
IS
AT
ST
E
PROBLEM 1: Estimate average amount of money owed by a banks delinquent deptors.
V
TI
Use the data to nd a 95% ondence interval for and interpret the results.
IP
R
SC
x mean x-xmean Squares 2
DE
AMOUNT
E
195 US 233.28 -38.28 1465.3584 8079.6816
II!
Mean 233.28
Standard Error 9.033988347
Median 222
Mode 221
Standard Deviation 90.3398834705
Sample Variance 8161.2945454546
Kurtosis 0.2548102344
Skewness 0.4767998287
Range 495
Minimum 17
Maximum 512
Sum 23328
Count 100
Condence Level(95.0%) 17.9253928186
PROBLEM 2 SAME "UNOCCUPIED" PROBLEM TO SEE DESCRIPTIVE STATISTICS!
Unoccupied seats on flights cause airlines to lose revenue.
A large airline wants to estimate its average number of
NOSHOWS
12
2
unoccupied seats per flight over the past year. To do this 11 11.15 12.05
225 flights are randomly selected, and the number of 19 11.6 0.45
unoccupied seats is noted for each of the sampled flights. 6
(data is in airnoshow worksheet) 10 Interpretation:
Decriptive statistics for the data are displayed below. 8 At the 90% condence level, we estim
11 unoccupied seats per flight to be bet
Descriptive Statistics 13 (if you use minitab, these values will
10
Mean 11.5955555556 2
Standard Error 0.2735058818 20
Median 12 13
Mode 10 9
Standard Deviation 4.1025882268 10
Sample Variance 16.8312301587 13
Kurtosis 0.2430943946 7
Skewness -0.2936457933 18
Range 23 16
Minimum 0 12
Maximum 23 9
Sum 2609 12
Count 225 1
Condence Level(90.0%) 0.451745442 10
0
9
14
9
7
15
19
15
13
15
15
12
9
14
11
6
14
10
15
12
10
8
13
14
11
11
17
13
6
13
12
0
13
15
12
18
16
9
19
13
16
10
10
13
14
17
12
11
23
10
2
18
13
9
14
4
17
1
13
5
7
10
11
19
12
7
14
10
11
13
18
4
17
14
4
13
1
12
11
12
9
10
11
9
17
12
16
11
12
7
14
16
10
5
11
17
17
12
9
6
16
10
10
19
12
11
6
9
16
9
3
10
13
9
10
10
16
16
16
7
13
15
17
13
13
8
15
11
16
12
6
14
14
9
9
8
10
12
12
12
13
5
10
9
11
8
15
9
9
18
11
6
15
11
10
8
13
9
19
17
8
10
10
9
10
14
6
9
4
12
19
16
17
15
17
14
14
11
6
13
16
12
12
11
8
13
7
18
9
12
9
14
18
11
10
14
14
13
12
14
17
6
The population standart deviation is almost unknown. Although it is still true that xbar = /
n
the sample standard deviation s may provide a poor approximation for when the sample size is small.
Solutions:
Instead of using the standard normal statistic which requires knowledge of a good approximation to
we dene and use t statistics in which
sample standard deviation, s, replaces the po
standard deviation.
Degrees of Freedom (df)
Some estimates are based on more information than others.
For example, an estimate of the variance based on a sample size of 100 is based on more
information than an estimate of the variance based on a sample size of 5. The degrees of freedom
(df) of an estimate is the number of independent pieces of information on which the estimate is based.
The actual amount of variability in the sampling distribution of t depends on the sample size n.
A convenient way of expressing the dependence is to say that the t-statistic has (n-1) degrees of freedom (df).
EXAMPLE 1
Consider the pharmaceutical company that desires an estimate of the mean increase in blood pressure of patients
who take a new drug. The blood pressure increases (points) for the n=6 patients who take a new drug.
The blood pressure increases (points) or the n=6 patients in the human testing phase are shown in the table below.
Use this information and construct a 95% condence interval for the mean increase in blood pressure
associated with the new drug for all patients in the population.
1.28587396 3.280126 We can be condent that the mean increase in blood pressure
associated with taking this new drug is between 1,286 and 3,28 points.
EXAMPLE 2
Suppose a manufacturer of printers for personal computers wishes to estimate the mean number of
characters printed before the printhead fails. The printer manifacturer test n = 15 randomly selected
printheads and records the number of characters printed until failure for each. These 15 measurements
(in millions characters) are listed in below table.
a. Form a 99% condence interval for the mean number of characters printed before printheads fails
Interpret the results.
n
mple size is small.
proximation to
use t statistics in which
ard deviation, s, replaces the population
s of freedom
estimate is based.
Mean 2.283333
Standard Error 0.387657
Median 2.4
Mode #N/A
Standard Devia 0.949561
Sample Varianc 0.901667
Kurtosis -0.38918
Skewness -0.57308
Range 2.6
286 and 3,28 points. Minimum 0.8
Maximum 3.4
Sum 13.7
Count 6
ean number of
domly selected
15 measurements
printheads fails
Standard Error
Standard Deviation
Sample Variance
Condence Level(99.0%)
Large Sample Confidence Interval for a Population Proportion
Estimating a Population Proportion
EXAMPLE 1
A food-products company conducted a market study by randomly sampling and interviewing 1,000 consumers
to determine which brand of breakfast cereal they prefer. Suppose 313 consumers were found to prefer the
company's brand. How would you estimate the true fraction of all customers who prefer the company's
cereal brand?
p-hat = # of consumers sampled who prefer company's brand / # of consumers sampled
p-hat =313/1000 = ,313
FORMULA p = x / n Where:
POPULATION PROPORTION x is the number of items youre interested in, and
n is the total number of items in the population.
CALCULATING P-HAT The actual calculation of p-hat is not challenging. To do it, you need two numbers. One
is the sample size (n) and the other is the number of occurrences of the event or parameter in question (X).
The equation for p-hat is p-hat = X/n. In words: You nd p-hat by dividing the number of occurrences
of the desired event by the sample size.
FORMULA
CONFIDENCE INTERVAL
Thus, if 313 or 1000 consumers prefer the company's cereal brand, a 95%
condence interval for the population of all customers who prefer the company brand is
((.313.687)/1000)
.313 1.96* (.284, .342)
The company can be 95% condent that the interval from 28.4% to 34.2% contains the true
percentage of all customers who prefer its brand.
EXAMPLE 2
Suppose that Bureau of Economic and Business Research at the University randomly samples
484 consumers and nds that 257 are optimistic about the state of the economy. Use 90%
condence interval to estimate the proportion of all consumers in Florida who are optimistic
about the state of the economy.
Point estimate of the proportion of Florida consumers, who are optimistic about the economy:
p = x/n 257/484 0.531 we rst check to be sure that the sample size is
suciently large that normal distribution provides
np-hat 257.004 a reasonable approximation for the sampling distribution
nq-hat 227 of p-hat.
((.531.469)/484)
.313 1,645* (.494, .568)
ADJUSTED CONFIDENCE INTERVAL If number of success is less than 15
EXAMPLE 3
According to True Odds: How Risk Affects your Everyday Life (Walsh, 1997), the probability of
being victim of a violent crime is less then .01. Suppose that in a random sample of 200 Americans,
3 victim of a violent crime. Estimate the true proportion of Americans, who were
victims of a violent crime using a 95% condence interval.
p almost 0 and 3 number of success is less than 15. We doupt whether the sample size 200 is large enough to
apply large sample method. Alternatively we wil apply the adjustment explained on the left box.
(3+2)/(200+4) =5/204
(((.025.975)/204@))
0.025 1.96*
mers sampled
ng distribution
Americans,
In "Confidence Interval" Example 2 we estimated the mean overdue amount for all delinquent accounts in a large credit c
A sample of 100 delinquent accounts produced the 95% condence interval:
mean was 230 So, 95% condence means 2 standard deviations above and below the mean.
= (2*90)/5 =36
Thus, 1296 account will have to be sampled randomly
to estimate the mean overdue amount to within n = 36 ^2 = 1296
$5 with 95% condence.
Determining the sample size
In order to estimate with a sampling error and with
100(1-)% condence, the requird sample size is
found as follows:
Z/2(/())=
n=(("Z/2" )
2
2/2)
PROBLEM:The manufacturer of ocial NFL footballs uses a machine to inflate its new balls
to a pressure of 13,5 pounds. When the machine is properly calibrated, the mean inflation pressure
is 13,5 pounds, but uncontrollable factors cause the pressures of individual footballs to vary
randomly from about 13,3 to 13,7 pounds. For quality control purposes, the manufacturer wishes to
estimate the mean inflation pressure to within 0,025 pound of its true value with a 99% condence
interval. What sample size should be used?
SOLUTION:We desire a 99% condence interval that estimates with a sampling error of SE=0,025 pound.
For a 99% condence interval, we have Z/2 = Z 0,05 = 2.575
Range of observations 13,7 - 13,3 =,4 A conservative estimate based on chebychev's rule is
=R/4 =.1.
n=(("Z/2" ) n=((("2" n=106,09
)
2 2/ 2
575)2(.1)2)/
(0,025)2))
(
counts in a large credit cooperation.
ence- that is
rval from $35,28
e increased to
This sheet
is for
Extra Info
he problem :
Finite Population Correction for Simple Random Sampling
We learned "In Large Sample Confidence Interval" . In some sampling situations we use finite population correction.
In some sampling situations, the sample size n may represent 5% or perhaps 10% of the total number of sampling units
in the population. When the sample size large relative to the number of measurementsin the population, the standad
errors of the estimators of or p (p cap, estimator of p) should be multiplied by a finite population correction factor.
a.Find an approximate 95% condence interval for the mean amout of foi
N b. Estimate the total square feet of foil on all rolls by multiplying the con
Interpret the result.
Rule of Thumb for Finite
Population Correction Function: SOLUTION: Each roll of foil is a sampling unit, and there are N=1462 units
Use this correction when n/N >0,05 The sample size is n = 100. Because n/N = 100/1462 = 0,68 and exc
we need to apply nite population correction factor. We have n=100,x bar=
a.
47,4 2
N 12,4/
(100)
b. For nite population of size N the sum of all measurements in the pop
total
24_(=1)^
lower limitN(45,01) =1462(45,01)= 6580
=
Upper limiN(49.79)=1462(49.79) = 7279