Ch02 - Some Basic Statistika Concept

Design of Engineering Experiments
Chapter 2 – Some Basic Statistical Concepts

• Describing sample data
– Random samples
– Sample mean, variance, standard deviation
– Populations versus samples
– Population mean, variance, standard deviation
– Estimating parameters
• Simple comparative experiments
– The hypothesis testing framework
– The two-sample t-test
– Checking assumptions, validity
Chapter 2 Design & Analysis of Experiments 1

7E 2009 Montgomery
Portland Cement Formulation (page 24)

7E 2009 Montgomery
Graphical View of the Data
Dot Diagram, Fig. 2.1, pp. 24

7E 2009 Montgomery
If you have a large sample, a
histogram may be useful

7E 2009 Montgomery
Box Plots, Fig. 2.3, pp. 26

7E 2009 Montgomery
The Hypothesis Testing Framework
• Statistical hypothesis testing is a useful

framework for many experimental
situations
• Origins of the methodology date from the
early 1900s
• We will use a procedure known as the two-
sample t-test

7E 2009 Montgomery
The Hypothesis Testing Framework
• Sampling from a normal distribution

• Statistical hypotheses: H : µ = µ
0 1 2
H1 : µ1 ≠ µ 2
7E 2009 Montgomery
Estimation of Parameters
1 n
y = ∑ yi estimates the population mean µ
n i =1
n
1
S =
2
∑
n − 1 i =1
( yi − y ) estimates the variance σ
2 2

7E 2009 Montgomery
Summary Statistics (pg. 36)
Formulation 1 Formulation 2
“New recipe” “Original recipe”
y1 = 16.76 y2 = 17.04
S = 0.100
1
2
S22 = 0.061
S1 = 0.316 S2 = 0.248
n1 = 10 n2 = 10

7E 2009 Montgomery
How the Two-Sample t-Test Works:
Use the sample means to draw inferences about the population means
y1 − y2 = 16.76 − 17.04 = −0.28
Difference in sample means
Standard deviation of the difference in sample means
σ2
σ =
2
y
n
This suggests a statistic:
y1 − y2
Z0 =
σ 12 σ 22
+
n1 n2
7E 2009 Montgomery
Use S and S to estimate σ and σ
1
2 2
2
2
1
2
2
y1 − y2
The previous ratio becomes
2 2
S S
+
1 2
n1 n2
However, we have the case where σ = σ = σ 2
1
2
2
2
Pool the individual sample variances:

( n − 1) S 2
+ ( n − 1) S 2
Sp = 1
2 1 2 2
n1 + n2 − 2
7E 2009 Montgomery
The test statistic is
y1 − y 2
t0 =
1 1
Sp +
n1 n2
• Values of t0 that are near zero are consistent with the null
hypothesis
• Values of t0 that are very different from zero are consistent
with the alternative hypothesis
• t0 is a “distance” measure-how far apart the averages are
expressed in standard deviation units
• Notice the interpretation of t0 as a signal-to-noise ratio
7E 2009 Montgomery
The Two-Sample (Pooled) t-Test
( n1 − 1) S12 + (n2 − 1) S 22 9(0.100) + 9(0.061)
S =
2
= = 0.081
n1 + n2 − 2 10 + 10 − 2
p
S p = 0.284
y1 − y2 16.76 − 17.04
t0 = = = −2.20
1 1 1 1
Sp + 0.284 +
n1 n2 10 10
The two sample means are a little over two standard deviations apart
Is this a "large" difference?

7E 2009 Montgomery
William Sealy Gosset (1876, 1937)
Gosset's interest in barley cultivation led

him to speculate that design of
experiments should aim, not only at
improving the average yield, but also at
breeding varieties whose yield was
insensitive (robust) to variation in soil and
climate.
Developed the t-test (1908)
Gosset was a friend of both Karl Pearson

and R.A. Fisher, an achievement, for each
had a monumental ego and a loathing for
the other.
Gosset was a modest man who cut short

an admirer with the comment that “Fisher
would have discovered it all anyway.”

7E 2009 Montgomery
• So far, we haven’t really
done any “statistics” t0 = -2.20
• We need an objective
basis for deciding how
large the test statistic t0
really is
• In 1908, W. S. Gosset
derived the reference
distribution for t0 …
called the t distribution
• Tables of the t
distribution – see
textbook appendix
7E 2009 Montgomery
• A value of t0 between
–2.101 and 2.101 is t0 = -2.20
consistent with
equality of means
• It is possible for the
means to be equal and
t0 to exceed either
2.101 or –2.101, but it
would be a “rare
event” … leads to the
conclusion that the
means are different
• Could also use the
P-value approach
7E 2009 Montgomery
t0 = -2.20
• The P-value is the area (probability) in the tails of the t-distribution beyond -2.20 + the
probability beyond +2.20 (it’s a two-sided test)
• The P-value is a measure of how unusual the value of the test statistic is given that the null
hypothesis is true
• The P-value the risk of wrongly rejecting the null hypothesis of equal means (it measures
rareness of the event)
• The P-value in our problem is P = 0.042

7E 2009 Montgomery
Computer Two-Sample t-Test Results

7E 2009 Montgomery
Checking Assumptions –
The Normal Probability Plot

7E 2009 Montgomery
Importance of the t-Test
• Provides an objective framework for simple
comparative experiments
• Could be used to test all relevant hypotheses
in a two-level factorial design, because all
of these hypotheses involve the mean
response at one “side” of the cube versus
the mean response at the opposite “side” of
the cube
7E 2009 Montgomery
Confidence Intervals (See pg. 44)
• Hypothesis testing gives an objective statement
concerning the difference in means, but it doesn’t
specify “how different” they are
• General form of a confidence interval
L ≤ θ ≤ U where P ( L ≤ θ ≤ U ) = 1 − α
• The 100(1- α)% confidence interval on the
difference in two means:
y1 − y2 − tα / 2,n1 + n2 − 2 S p (1/ n1 ) + (1/ n2 ) ≤ µ1 − µ 2 ≤
y1 − y2 + tα / 2,n1 + n2 − 2 S p (1/ n1 ) + (1/ n2 )

7E 2009 Montgomery
7E 2009 Montgomery
Other Chapter Topics
• Hypothesis testing when the variances are
known
• One sample inference
• Hypothesis tests on variances
• Paired experiments

7E 2009 Montgomery

Ch02 - Some Basic Statistika Concept

Загружено:

Сведения о документе

Исходное описание:

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Ch02 - Some Basic Statistika Concept

Загружено:

Авторское право:

Доступные форматы

Design of Engineering Experiments

Chapter 2 – Some Basic Statistical Concepts

Chapter 2 Design & Analysis of Experiments 1

Chapter 2 Design & Analysis of Experiments 2

Chapter 2 Design & Analysis of Experiments 3

Chapter 2 Design & Analysis of Experiments 4

Chapter 2 Design & Analysis of Experiments 5

• Statistical hypothesis testing is a useful

Chapter 2 Design & Analysis of Experiments 6

• Sampling from a normal distribution

Chapter 2 Design & Analysis of Experiments 8

“New recipe” “Original recipe”

Chapter 2 Design & Analysis of Experiments 9

Pool the individual sample variances:

Chapter 2 Design & Analysis of Experiments 13

Gosset's interest in barley cultivation led

Developed the t-test (1908)

Gosset was a friend of both Karl Pearson

Gosset was a modest man who cut short

Chapter 2 Design & Analysis of Experiments 14

Chapter 2 Design & Analysis of Experiments 17

Chapter 2 Design & Analysis of Experiments 18

Chapter 2 Design & Analysis of Experiments 19

Chapter 2 Design & Analysis of Experiments 21

Chapter 2 Design & Analysis of Experiments 23

Вам также может понравиться