Академический Документы
Профессиональный Документы
Культура Документы
207
APPROACH TO ANALYZE
Develop a
focused problem
statement
Process Analysis
Organize
potential
causes
Hypothesis
Testing and
Regression
Analysis
Design of
Experiments
208
Brainstorm on Xs
209
Understanding
Process
210
UNDERSTANDING A PROCESS
211
FLOWCHARTS
Step 1
Step 2
Yes
Step 3
Step 6
Decision
End
No
Step 4
Step 5
212
Activity flowcharts
Deployment flowcharts
Sales
Technical
Shipping
Coordinator
213
ACTIVITY FLOWCHARTS
Hotel Check-out Process
Activity flowcharts
are specific about
what happens in a
process. They often
capture decision
points, rework
loops, complexity,
etc.
1
Approach front desk
2
Is there
a line?
YES
Process Name
3
Wait
NO
Numbered
steps
4
Step up to desk
5
Clerk
available?
NO
Clear
direction of
flow (top to
bottom or
left to right)
6
Wait
Key of symbols
YES
Start/End
Consistent
level of
detail
7
Give room number
Action/Task
8
Check bill
Decision
Sequence
9
Charges
correct?
NO
10
Correct charges
YES
11
Pay bill
Date of creation
or update &
name of creator
Clear starting
and ending
points
214
DEPLOYMENT FLOWCHARTS
People or groups
listed across the top Invoicing Process
Sales
Billing
Steps listed in
column of person or
group doing step or
in charge
Shipping
Customer
Delivers goods
Notifies sales of
completed delivery
Receives
delivery
Elapsed
Time
Time flows
down the
page
5 days
Sends invoice to
customer
10 days
4
10
Notifies billing
of invoice
Receives invoice
11
5
Files invoice
Checks invoice
against receipt
12
Pays bill
Receives and
records payment
7
Reviews weekly
report of overdue
accounts
Horizontal lines
clearly identify
handoffs
215
Activity
Flowchart
Deployment
Flowchart
To display the
complexity and
decision points of a
process
To help highlight
handoff areas in
processes between
people or functions
To illustrate where in
the process you will
collect data
To identify rework
loops and bottlenecks
216
>
217
FOUR PERSPECTIVES
At this stage of a DMAIC project, you are trying to define the current
situation, as it is. Therefore, your flowchart(s) should map what is really
happening in the process.
218
COPY PROCESS
Put original
on glass
Close Lid
Adjust
Settings
Press
START
Remove
originals
and copies
219
COPY PROCESS
Yes
Take Original
Copier Yes
in Use?
Wait?
No
Leave
No
Place
Original
No
Select
Size
Glass Yes
Dirty?
Select
Orientation
Clean
Select
Number
Paper? No
Find
Paper
Yes
Yes
No
Box
Open?
Knife? No
Find
Knife
Open
Box
Yes
Yes Paper No
Loaded?
Find
Help
No
Yes
Start
Copier
Copy Yes
Made?
Quality No
Ok?
Stop
Copier
Yes
No
Adjust
Yes Another
Page?
Find
Help
Fix
No
Problem?
No
Remove
Original
Adjust?
Yes
Collect
Copies
Staple
copies
Clear
modes
Leave
220
Value-Added Step:
Nonvalue-Added Step:
221
EXAMPLES
Value-Added Activities
Entering order
Ordering materials
Preparing drawing
Assembling
Legally mandated testing
Packaging
Shipping to customer
Nonvalue-Added Activities
Waiting
Storing
Staging
Counting
Inspecting
Recording
Obtaining Approvals
Testing
Reviewing
Copying
Filing
Revising/Reworking
Tracking
222
Process Step
Cumulative
Time
VA Time
NVA Time
Notes
223
You can track specific types of nonvalue-added time with a valueanalysis matrix. This helps clarify not only the types of waste present
in the process, but also the percentage of overall process time each
nonvalue-added step adds.
Process Step
10
Time (Hours)
12
10
10
20
10
10
20
Total %Total
100
100%
2%
10
10%
6%
Delay
52
52%
Transporting/Motion
30
30%
100
100%
Value-Added
Nonvalue-added
Fixing errors
Prep/Set-up
Control/Inspection
Total
Six Sigma GB Material Oct 2013
224
Graphical Data
Analysis
225
226
Theory: Variation in
training, technique, and
procedures at different
locations accounts for
much of the variation in
how long it takes to
complete oil change/lubes
4
3
2
1
0
7
10
11
12 13
14 15
16 17
19 20
21
Location B
18
5
4
Count
2
1
3
0
7
10 11
12 13
14 15
16
17 18
19 20
21
2
6
Location C
1
5
4
8 9 10 11 12 13 14 15 16 17 18 19 20 21
Minutes
3
2
1
0
7
10
11
12 13
14 15
16 17
18
19 20
21
227
5 10 15 20 25 30 35 40 45 50 55 60 >60
Time With Customer
(in minutes)
5 10 15 20 25 30 35 40 45 50 55 60 >60
Time With Customer
(in minutes)
228
Scatter Plot of
Time Needed to Assemble the Product vs.
Workers Time on the Job
Time
(mins)
10
9
8
7
6
5
2
2
2
Product A
Product B
3
2
2
1
1
10 11 12 >12
Months on job
229
230
Scatter Plot of
Time Needed to Assemble the Product vs.
Workers Time on the Job
(mins)
10
Stratification
using different
symbols allows
you to look at
multiple patterns
at once.
Time to Assemble
(Hours)
8
7
6
5
2
2
2
Product A
Product B
3
2
2
1
1
10 11 12 >12
Months on job
The pattern
formed by the
scatter is an
important clue to
how the two
variables are
related.
231
Organization of Potential
Cause
232
233
Means
Means/Objective
Means
Means/Objective
Means
Means/Objective
Means
Problem
Statement
Objective
Means
Means/Objective
Means
Means/Objective
Means
Means/Objective
Means
234
235
BRAIN STORMING
What is Brainstorming ?
Brainstorming is a simple but effective technique for
generating many ideas of a group of people within a
short span of time to solve a given problem
236
Defer Evaluation
Put critical faculties in cold storage- even constructive criticism. This
is to ensure a proper climate of acceptance of all sorts of ideas. No
idea should be treated as stupid.
Fantasize Freely
Dont operate with your brakes on. The participants are encouraged,
urged to let themselves go and generate ideas, no matter how fanciful
these ideas are.
Generate Quantity
Generate as many ideas as possible. A pearl diver will be more
successful in finding pearls, perhaps the pearl, when he brings up 200
oysters than when he surfaces only 15-20 oysters.
Build on Ideas
Idea of one participant is more effectively built up by another
participant.
Six Sigma GB Material Oct 2013
237
PRINCIPLES OF BRAINSTORMING
Deferment of evaluation develops the appropriate
psychologically safe climate for ideation
The uniqueness of each participants knowledge is
tapped to develop new insights
Ideas of one participant tend to trigger off ideas in the
brains of other group members
Free association encourages fruitful ideation
The pressure of time bound sessions in a nonthreatening atmosphere is conducive to a high
productivity of ideas
Six Sigma GB Material Oct 2013
238
239
Machine
Problem
/ Effect
Material
Method
240
Driver
Impatience
Craze
Vehicle
Spark plugs
Contacts
Life
Heavy
Poor
anticipation
Always
late
Lack of
awareness
Riding on
clutch
Bad
attitude
Poor
skill
Wrong
gears
Body
Shape
Inexperience
Wrong
culture
High H.P
Spurious
Crossings
One way
No turn
Circuitous
Road
Road
Fuel mix
Carburetor
Engine
Cylinders
Restrictions
Technical
details
Spares
High Petrol
Consumption
Impurities
Incorrect
Octane no.
Traffic
Tyres Inferior
Frequent
Petrol
Faulty
stops Negligence
pressure
Speed Breakers
Additives
Ignorance
Potholes
Irregular
Incorrect viscosity
Low pressure
servicing
Poor
Clogged
Oil
condition
filters
False
Steep
Not changed
economy
Low level
Maintenance
Materials
241
242
243
Validation of Potential
Cause
244
Process
variation
Measurement
variation
Gage R&R
Customer
to
Customer
ANOVA: p=.93
TEV: p=1.00
Plant
to
Plant
Season
to
Season
Method of
Shipment
Shipment
to
Shipment
ANOVA: p=.43
ANOVA: p=.54
ANOVA: p=.08 TEV: p=1.00
TEV: p=.79
TEV: p=1.00
ANOVA: p=.38
TEV: p=1.00
Day of
Week to
Day of
Week
M
Six Sigma GB Material Oct 2013
Other
ROOT CAUSE
ANOVA: p=.000
TEV: p=.000
Sa
Su
245
Steps
246
Validation of Causes
247
H
H
H
H
o
a
A
A
<
=
>
Hypotheses of
Means
B
B
Hypotheses of
Standard
Deviations
249
Hypothesis Testing
We want to take a practical problem and change it to a statistical problem
We use relatively small samples to estimate population parameters
There is always a chance that we can select a weird sample
Sample may not represent a typical set of observations
Inferential statistics allows us to estimate the probability of getting a weird
sample
Example
If we wanted to know a coin was fair, we could flip it a number of times
and track how many heads we saw
By chance we would expect about 50% of the flips to be heads
If we flipped the coin 10 times and got 10 heads, we would be fairly
confident the coin is not fair
There is one chance out of 1000 that we could have gotten 10 heads with
a fair coin
Therefore, we would say we are willing to take a 0.1% chance of being
wrong about our unfair coin
250
Yield
Study One
Study Two
251
Overall Approach
Practical Problem
Statistical Problem
y = f ( x1 , x2 ,..., x k )
Practical Solution
Statistical Solution
252
Key Terms
Ho = Null Hypothesis ,Ha = Alternative
Hypothesis ,P-Value = Probability Value
253
Hypothesis Testing
H o :
H a:
a
a
b
b
254
Hypothesis Testing
What is it for Statisticians ?
Ho:
Ha:
Ho:
Ha:
Ho:
Ha:
Ho:
Ha:
255
Hypothesis Testing
What is it for the Average Person ?
Ho: Age doesnt matter in a companys hiring practices
Ha: Age does matter in a companys hiring practices
Ho: Data is Normal
Ha: Data is not Normal
Ho: Batch X Avg. Cycle Time = Batch Y Avg. Cycle Time
Ha: Batch X Avg. Cycle Time = Batch Y Avg. Cycle Time
Ho = _______________________________________
Ha = _______________________________________
Six Sigma GB Material Oct 2013
256
257
258
Hypotheses of
Means
Ho: o = 13.6
Ha: o < 13.6
Hypotheses of
Standard
Deviations
Ho: A = B
Ha: A > B
DECIDE:
What does the evidence suggest?
Reject Ho?
or
259
NOTE:
Not Guilty
Six Sigma GB Material Oct 2013
Innocent
260
Decision Errors
In deciding to Reject or Not, we could make one of
two decision errors
Your Decision
Ho True
Accept Ho
Reject Ho
Correct
Type I
Error
(-Risk)
(
The
Truth
Type II Error
Ho False
( -Risk)
Correct
261
Example: A Trial
Jurys Decision
Hes Not Guilty
Actually
Innocent
The
Truth
Actually
Guilty
Correct
Hes Guilty
Type I
Error
Consequence:
Innocent Man
Goes to Jail
(-Risk)
(
Type II Error
( -Risk)
Correct
262
Desired
Precise but not Accurate
LSL
USL
Current
Situation
LSL
USL
Hypotheses of
Means
H o:
H 1:
H 2:
Ho:
H 1:
H 2:
o
o
o
o
o
o
=
>
<
=
>
<
1
1
1
1
1
1
Hypotheses of
Standard
Deviations
263
The
Small P-Value
Large SNR (e.g., t or Z
statistic)
Ho is Rejected
Large P-Value
Small SNR
Ho is Not Rejected
264
.999
.99
Bartlett's Test
P-Value
: 0.315
Levene's Test
Test Statistic: 0.925
P-Value
: 0.341
.95
Probability
.80
.50
.20
.05
.01
.001
0.7
1.2
10
11
12
Mach 1
1.7
Average: 10.0799
StDev: 0.943184
N: 25
Descriptive Statistics
Variable: Mach 1
Analysis of Variance
Source
DF
SS
Factor
1
0.12
Error
48
53.71
Total
49
53.83
Level
Mach 1
Mach 2
N
25
25
Pooled StDev =
Mean
10.080
9.980
1.058
A-Squared:
P-Value:
MS
0.12
1.12
F
0.11
P
0.740
9.0
StDev
0.943
1.161
9.5
10.0
10.5
11.0
11.5
12.0
0.889
0.020
Mean
StDev
Variance
Skewness
Kurtosis
N
10.0799
0.9432
0.889597
0.511741
-1.14078
25
Minimum
1st Quartile
Median
3rd Quartile
Maximum
8.9821
9.3571
9.7873
10.8440
12.0670
10.5
0.7365
10.4693
10.6151
265
266
267
Steps Continued
9. Develop the Sampling Plan
10. Select Samples
11. Conduct test and collect data
12. Calculate the test statistic (z, t, or F) from the data.
13. Determine the probability of that calculated test statistic occurring by
chance.
14. If that probability is less than alpha, reject Ho and accept Ha. If that
probability is greater than alpha, do not reject Ho. ( Practically Accept
H0)
15. Replicate results and translate statistical conclusion to practical
solution.
268
Significance Level
Common statement: ... so unlikely ...
You should ask: How unlikely were they?
The answer: The significance level (
)
269
Y = f( x 1 , x 2 , x 3 ,..., x k )
Remember this simple equation ?
DATA TYPE :Discrete
Counts of Discrete Events ( 1, 2, 3, 4 Defects)
Qualitative Descriptions
Democrat / Republican
Good / Bad
Machine 1 / Machine 2
Continuous
Decimal sub-divisions are meaningful
Time, Weight, Thickness, etc...
Six Sigma GB Material Oct 2013
270
Purpose of roadmap
271
Analyze Roadmap
Single X
Continuous
(Normal)
Continuous (Non
Normal)
Single Y
Discrete
Discrete
Chi-Square
Test of Proportion
Continuous
Logistic
Regression
272
Test of Comparisons
Y = Continuous
Y = Discrete
Comparison
Type
Against
Standard
Between Two
Mean
Variance
Defective
Defects
1 Sample t
Chi-Square
Test
F-test
1 sample p
1 sample
defect rate
2 sample
defect rate
Among Many
ANOVA
2 Sample t
OR
Paired t
2 Sample p
Chi-square
Note: The test mentioned for Y (Continuous) is applicable only when Y follows
Normal Distribution.
In case Y does not satisfy the Normality, then we need to use Non Parametric
tests
For carrying out ANOVA, the condition of Equality of variance has to be
satisfied.
273
Scenario #1
A Supervisor wants to know if two operators add
significantly different amounts of Material A during the blending process
274
Scenario #2
The Personnel Department wants to see if there is a link between age
(old and young) and whether that person gets hired
275
Scenario #3
A team wants to see if there is relationship between
ambient temperature and the viscosity of a material
276
Scenario #4
For outstanding payment analysis, Sales dept. wants to see if there is a
link between average amount outstanding and dealers.
277
Scenario #5
For accident analysis, safety dept. wants to see if there is a link
between unit weight per container and injuries to consumers
278
Correlation and
Regression
279
Correlation
280
40
35
30
25
20
15
25
35
45
55
65
75
Speed (km/h)
Six Sigma GB Material Oct 2013
281
Scatter diagram
A scatter diagram depicts the relationship as a
pattern that can be directly read.
If Y increases with X, then X and Y are positively
correlated.
If Y decreases as X increases, then the two types of
data are negatively correlated.
If no significant relationship is apparent between X
and Y, then the two data types are not correlated.
282
No Correlation
r = .006
2
R = .0036%
Strong
Negative Correlation
r = -.90
R2 = 81%
Moderate
Negative Correlation
r = -.73
R2 = 53%
283
Severed
Length
(mm)
1046
Sl. No.
Conveyor
Speed
(cm/sec)
8.1
16
Conveyor
Speed
(cm/sec)
6.7
Severed
Length
(mm)
1024
7.7
1030
17
8.2
1034
7.4
1039
18
8.1
1036
5.8
1027
19
6.6
1023
7.6
1028
20
6.5
1011
6.8
1025
21
8.5
1030
7.9
1035
22
7.4
1014
6.3
1015
23
7.2
1030
7.0
1038
24
5.6
1016
10
8.0
1036
25
6.3
1020
11
8.0
1026
26
8.0
1040
12
8.0
1041
27
5.5
1013
13
7.2
1029
28
6.9
1025
14
6.0
1010
29
7.0
1020
15
6.3
1020
30
7.5
1022
284
S evered L en g th (m m )
1045
1040
1035
1030
1025
1020
1015
1010
1005
1000
5
5.5
6.5
7.5
8.5
285
286
REGRESSION
287
Y = a+b X
288
Hypothesis Testing of
Variance
289
Objectives
290
Characteristics of F-Distribution
292
>
Reduce variation
>
Increase tolerances
293
For the two tail test, the test statistic is given by:
S
F =
S
2
1
2
2
S12 and S 22 are the sample variances for the two samples.
The null hypothesis is rejected if the computed test
statistic is greater than the critical (table) value with
confidence level / 2 and numerator and denominator
degrees of freedom.
F-Test
Performing the Test
The F-Test tests two distributions
The calculation can be performed manually using a calculator. The
formula is:
Fcalc = s12/s22
where : s12 = variance of one distribution , s22 = variance of the other
distribution. The larger variance always serves as the numerator.
The critical f-value is read from an f-table, where n-1 equals the
number of degrees of freedom for the numerator and denominator.
If the numerator and denominator have different sampling sizes, the
correct value must be used for each factor.
295
Example
Factor Levels
1
2
2
F-Test
Test Statistic: 0,632
P-Value
P-Value
: 0,505
: 0,390
80
85
Response
90
Response
89.7
81.4
84.5
84.8
87.3
79.7
85.1
81.7
83.7
84.5
84.7
86.1
83.2
91.9
86.3
79.3
82.6
89.1
83.7
88.5
Factor
1
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
2
296
F-Test
Interpreting the Results
297
F-Test
F-Test
Test Statistic: 0,632
P-Value
: 0,505
Levene's Test
Test Statistic: 0,775
P-Value
: 0,390
298
Factor Levels
1
Bartlett's Test
Test Statistic: 2,159
P-Value
: 0,340
If the data is
normally
distributed:
Bartletts Test
Levene's Test
Test Statistic: 1,327
P-Value
: 0,291
3
0
10
299
Hypothesis Testing of
Means
300
301
9-3
9-3
The degrees of
freedom for
the t-distribution
is df = n - 1.
z-distribution
t-distribution
The test statistic for the one sample case is given by:
X
t =
s/ n
The current rate for producing 5 amp fuses at General Electric Co. is
250 per hour. A new machine has been purchased and installed that,
according to the supplier, will increase the production rate. A sample of
10 randomly selected hours from last month revealed the mean hourly
production on the new machine was 256, with a sample standard
deviation of 6 per hour. At the .05 significance level can General
conclude that the new machine is faster?
EXAMPLE 1 continued
Step 1:
H 0 : 250
X
t =
s/ n
NOTE
t n1, / 2
t n1, / 2
t n1, / 2
Six Sigma GB Material Oct 2013
( n1 1) s + ( n 2 1) s
s =
n1 + n 2 2
2
p
t =
Thus.
Test Statistic:
Six Sigma GB Material Oct 2013
1
1
s
+
n2
n1
2
p
2
2
2
(
)
+
(
)
n
1
s
n
1
s
1
1
2
2
s 2p =
n1 + n 2 2
Step 3: Determine t
X1 X2
1
1
s
+
n
n
1
2
2
p
309
t-Test
Comparing Two Independent Samples
Enter the before yield in C1 and the after yield in C2. This
method is called unstacked.
Enter all the values for the yield in C1 and the status in C2.
Minitab identifies C2 as the index variable (subscript).
310
EXAMPLE 2
EXAMPLE 2
Step 1:
continued
H 0 : 2 1
H1: 2 > 1
t=
X1 X 2
1 1
s +
n1 n2
2
p
313
Formulation 2
(contains no lead)
89.5
91.5
91.0
89.0
91.5
92.0
92.0
90.5
90.0
91.0
314
315
Paired t-Test...
In a paired t-test
- The data must be continuous
- The data must be random
- The population of the differences should be normally distributed.
- The following test statistic should be used...
d
sd /
n
Where:
sd
d=
n
( d ) 2
d
n
n 1
2
sd
318
Paired Comparison
Another good example of paired comparison is the comparison of
measurements performed using an online system, to
measurements performed in a lab using the same samples.
This method is also suitable for examining measurement systems
to determine whether testers obtain the same mean value using the
same samples.
Lets look at the file shoe.mtw.
Were testing shoe material. We have a sample of 10 boys, and
each boy wears two shoes, each of a different material.
In this case, the boys represent blocks.
319
Paired Comparison
Boy
Material A
Material B
1
2
3
4
5
6
7
8
9
10
13.2(L)
8.2(L)
10.9(R)
14.3(L)
10.7(R)
6.6(L)
9.5(L)
10.8(L)
8.8(R)
13.3(L)
14.0(R)
8.8(R)
11.2(L)
14.2(R)
11.8(L)
6.4(R)
9.8(R)
11.3(R)
9.3(L)
13.6(R)
320
Paired Comparison
T-Test of the Mean
Variable
Delta
Mean
StDev
SE Mean
10
-0.410
0.387
0.122
-3.35
0.0085
0.4
2.5%
5%
Oserved Value
Prob
0.3
0.2
0.1
0.0
-4
-3
-2
-1
T-V alue
321
We are using the same data and will analyze it again, this time by
comparing two independent samples.
Mean
StDev
SE Mean
Mtrl A
10
10.63
2.45
0.78
Mtrl B
10
11.04
2.52
0.80
?
322
Blocking
323
Example 1
A pharmaceutical dispenser that is supposed to dispense 25 ml of agent
was calibrated to dispense 25 ml quantities into 10 previously-weighed
containers. The actual quantities dispensed were:
25.01 ml, 24.89 ml, 25.10 ml, 24.95 ml, 24.97 ml,
25.04 ml, 25.08 ml, 24.91 ml, 25.07 ml, 24.85 ml
Test the null hypothesis that says the dispenser provides 25 ml agent
against the test hypothesis that this is not the case.
324
Example 2
Water hardness is measured in order to determine calcium ion concentrations (in
ppm). The hardness of water in hot and cold water lines for a manufacturing
process were measured. A technician objected, stating that warm water was
harder than cold water. The hardness of the various samples is as follows:
Warm water:
133.4,
133.3,
135.4,
136.5,
137.1,
137.6,
138.4,
139.5
136.3,
137.1,
Cold water:
134.1,
135.9,
134.7,
135.6,
136.0,
135.8,
131.7,
132.2
134.7,
135.2,
Test the null hypothesis that the warm and cold water have the same calcium
concentration, against the test hypothesis that warm water has a higher
concentration.
325
Example 3
A chemical company manufactures paint thinner. The content of ethyl
alcohol in the pint thinner is set at 3%. To determine whether the
manufacturing process has exceeded the 3% threshold, 20 samples of
thinner are taken. The ethyl alcohol concentrations were determined as
follows:
4.2,
5.3,
3.5,
4.3,
3.7,
3.2,
3.5,
2.8,
3.5,
3.7,
2.8,
3.3,
2.7,
3.0,
3.1,
3.0,
3.7,
3.3,
3.4,
2.3
Test the null hypothesis that the process is unchanged (3% ethyl
alcohol), against the test hypothesis that the mean values of the
process are more than 3%.
326
Example 4
A manufacturer of foils has implemented a new process to reduce the weight of the
product. The foil strength is an important variable affecting the weight. Foils in eight
different strengths were manufactured using both the old and new methods. The weight
(in grams) for each combination is shown below:
Strength
Old Process
New Process
154
152
159
152
169
171
176
167
183
182
199
194
200
204
213
208
Test the null hypothesis that there is no difference in weight between the old and new
process, against the test hypothesis that the new process has reduced the weight of the
foil.
327
Example 5
Two brands (A and B) of air conditioner dust filters were tested to determine
whether one was better than the other. All filters were tested on the same system,
and the dust quantity (in grams) filtered over a 6 hour period was measured. The
data obtained for the two filters is as follows:
Filter A:
9.1, 11.8,
Filter B: 15.6,
1.5, 7.2,
4.2,
9.6,
8.7, 10.2,
4.4,
7.8, 4.3
9.3, 16.9, 5.1, 14.5, 19.0, 10.3, 12.5, 13.3, 16.1, 2.6
Test the null hypothesis there there is no difference in average dust quantity
filtered against the test hypothesis that one filter is better than the other.
328
Analysis of Variance
(ANOVA)
Objectives
330
Ho: 1 = 2 = 3 = 4
HA: At least one k is different
Advantage:
To test the NULL HYPOTHESIS (all 4 mean values are equal), we would have
to test hypotheses for 6 combinations using the technique previously described
(t-test). Using the ANOVA technique, we can decide whether to reject the null
hypothesis or keep the null hypothesis with a single test.
331
H o : 1 = 2 = 3 = 4
Are some of the 4 population means different?
333
DIET A
DIET B
DIET C
DIET D
62
63
68
56
60
67
66
62
63
71
71
60
59
64
67
61
65
68
63
66
68
64
63
59
335
DF
SS
MS
Diet
228.00
76.00
13.57
0.000
Error
20
112.00
5.60
Total
23
340.00
Individual 95% CIs For Mean
Based on Pooled StDev
Level
Mean
StDev
---+---------+---------+---------+---
61.000
1.826
(------*------)
66.000
2.828
68.000
1.673
61.000
2.619
(-----*----)
(----*-----)
(----*----)
---+---------+---------+---------+---
Pooled StDev =
2.366
59.5
63.0
66.5
70.0
336
ANOVA Table
The ANOVA table is an important result of ANOVA
DF
3
SS
228.00
Error
20
112.00
Total
23
340.00
MS
76.00
13.57
0.000
5.60
337
Caution:
line is without
warranty
68
67
Coagtime
66
65
64
63
62
61
1
Diet
338
Dimple 2
Dimple 3
Dimple 4
277
281
304
250
268
299
295
277
281
317
317
268
263
286
299
272
290
304
281
295
304
286
281
263
339
Hypothesis Testing of
Proportion
Black Belt Training
1 Proportion Test
vs
target
value
P
Practical Question
(example)
Statistical Question
Ho : P = target value
Ha : P target value
341
obs
p
p (1 p
p
)(1 n
1
2
1
+ 1
where
p =
X
n
1
1
+ X
+ n
2
2
342
Product A
Product B
East
West
32
135
32
80
42
98
Product C
Discrete
Type of Data ? ______________
Age
Whats the X ? _____________
Discrete
Type of Data ? ______________
Chi-Square
What type of tool would you use ? ________________________
344
The Data
Total
Hired
Not Hired
Old
30
150
180
Young
45
230
275
Total
75
380
455
345
The Hypothesis
With the Chi-Square Test for Independence,
statisticians assume most variables in life are
independent, therefore:
Ho:
Ha:
346
Step #1
We must develop an Observed Frequency Table by
breaking our 2 variables into different levels:
Age: Old & Young
Hiring Practices: Hired & Not Hired
Not Hired
Old
30
150
Young
45
230
347
Step #2
Calculate Column & Row Totals
Hired
Not Hired
Total
Old
30
150
180
Young
45
230
275
Total
75
380
455
348
Step #3
Develop an expected frequency table. That is, what should
this table look like if these if these 2 factors are really
independent?
Hired
Not Hired
Old
Young
How do we do that?
Six Sigma GB Material Oct 2013
349
Step #3 Continued
Develop an expected frequency table. That is, what should this table
look like if these if these 2 factors are really independent?
Hired
Old
75 x 180
455
= 29.6
Not Hired
Total
___
180
Young
___
___
275
Total
75
380
455
350
Step #3 Continued
29.6 is what we would expect if the 2 factors are really independent
Hired
Not Hired
Total
Old
29.6
150.3
___
180
Young
45.3
___
Total
75
___
229.7
275
380
455
351
Step #4
Subtract the expected value from the observed (O-E)
Hired
Not Hired
Total
Old
30-29.6=.4
-0.3
___
180
Young
___
-0.3
___
0.3
275
Total
75
380
455
352
Step #5
Square the Differences (O-E)^2
Hired
Old
(.4)*(.4)=.16
Not Hired
Total
.09
___
180
Young
___
.09
___
.09
275
Total
75
380
455
353
Step #6
Compute the Relative Squared Differences (O-E)^2 / E
Hired
Old
Young
Total
75
Not Hired
Total
.0006
___
180
___
.0004
275
380
455
354
So What?
The sum of the relative squared differences is
distributed as a Chi Square distribution!
355
Evaluate The
P Value
Examine
Contingency
Table
Make
Decision
356
357
Total
180
45
45.33
230
229.67
275
Total
75
380
455
Chi-Sq =
0.004 + 0.001 +
0.002 + 0.000 = 0.007
DF = 1, P-Value = 0.932
Note:
The observed and expected
counts are the same values you
calculated a moment ago
A P-Value !
358
Another Example . . .
Hired
Not Hired
Old
45
135
Young
45
230
359
Chi-Square Comments
Chi-Square is the least insightful and usually one of
the more difficult to analyze tools we learned this
week. But that is what happens when we deal with
attribute data.
You must have at least FIVE expected frequencies for the
Chi-Square Test to work or Minitab will crash.
Your data should have been gathered to assure
randomness. Beware of other hidden factors (Xs).
360
Excercise 1
A) You determine the faulty orders from 2 regions.
Faulty Orders
Correct Orders
Region 1
110
420
Region 2
110
400
Error 2
Correct Orders
Region 1
90
20
420
Region 2
60
50
400
361
Design of Experiment
362
363
Objectives Of Experimentation
The following are some of the objectives of experimentation in an
industry :
364
Design of experiments
365
Traditional approach
One-factor-At-A-Time
This is a traditional method of experimentation which tests, then changes, one factor
at a time to allow for observation and comparison. Note on the example below, all 8
factors are varied one-at-a-time . It is efficient because it takes only 16 runs.
A1 and A2 are evaluated by comparing Result - 1 and Result - 2
B1, B2 and B3 are evaluated by comparing Result-2, Result-3 and Result-4.
C1, C2, and C3 are evaluated by comparing Result-4, Result-5 and Result-6
Etc.
Run No.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
A
1
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
B
1
1
2
3
3
3
3
3
3
3
3
3
3
3
3
3
C
1
1
1
1
2
3
3
3
3
3
3
3
3
3
3
3
D
1
1
1
1
1
1
2
3
3
3
3
3
3
3
3
3
E
1
1
1
1
1
1
1
1
2
3
3
3
3
3
3
3
F
1
1
1
1
1
1
1
1
1
1
2
3
3
3
3
3
G
1
1
1
1
1
1
1
1
1
1
1
1
2
3
3
3
H
1
1
1
1
1
1
1
1
1
1
1
1
1
1
2
3
Re sult
Result 1
Result 2
Result 3
Result 4
Result 5
Result 6
Result 7
Result 8
Result 9
Result 10
Result 11
Result 12
Result 13
Result 14
Result 15
Result 16
366
Traditional approach
367
One-Factor-At-A-Time
Problem: Fuel economy we want is 30 MPG
Try changing each input variable at two settings believed to be
associated with dramatically changing fuel economy. See what
happens.
Speed
55
60
60
60
Octane
85
85
90
85
Tire Pressure
30
30
30
35
MPG
23
29
23
24
How many more Combinations would you need to figure out the best
combination of variables? (3 Variables at two settings; 2x2x2 = 8 total)
How can you explain the above results? (Combination 2 is the answer)
If there were more variables, how long would it take to get a good solution?
(Multiply by another 2 for each one)
What if theres a specific combination of two or more variables that leads to
the best mileage? (Too hard for me to figure out; What do you think?)
Six Sigma GB Material Oct 2013
368
Speed
55
60
55
60
55
60
55
60
Octane
85
85
90
90
85
85
90
90
Tire Pressure
30
30
30
30
35
35
35
35
MPG
23
29
37
23
37
24
30
36
OFAT Runs
369
370
371
372
373
Speed Y
374
375
Brain storming and Cause & Effect diagram : To list all the possible factors
Classification of factors
Interactions to be studied
376
Classification of factors
Tools like brainstorming and cause & effect diagrams helps in identification of
factors and preparing a complete list of the factors involved in any experiment.
Factors listed can be classified into three categories :
1. Experimental Factors
Experimental factors are those which we really experiment with by varying them at
various levels.
2. Control Factors
Control Factors are those which are kept at a constant (controlled) level throughout
experimentation.
Note : At the planning stage itself all the factors viz. Experimental, Control and error should
be recognized. This will help to tackle them appropriately during experimentation.
Six Sigma GB Material Oct 2013
377
A WORD OF ADVICE
REMEMBER:
379
The experiment is Lf
380
Wear
The experiment is Lf
4 [Taking 2
2
21
381
382
Number of Factors :
Number of Levels :
Possible Runs :
Nos. we plan to carry out:
2
2
22
4
383
Factor -1
+1
Temp.
40C
80C
Conc.
Low
High
-1 signifies one level (normally lower) and +1 signifies the other level
(normally higher)
It is now believed that residue depends on concentration of Acid and Temperature of bath.
RUN
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
CONC.
Low
Low
Low
Low
Low
Low
Low
Low
High
High
High
High
High
High
High
High
TEMP.
40
40
40
40
80
80
80
80
40
40
40
40
80
80
80
80
RESIDUE
20.4
19.3
17.6
16.3
9.7
16.4
14.8
12.3
17.4
17.7
23.2
20.4
15
24
15.6
15.2
385
386
Factor -1
+1
Temp.
R.T.
90C
Time
3 mins 10 mins
Conc.
Low
High
-1 signifies one level (normally lower) and +1 signifies the other level
(normally higher)
WHICH EXPERIMENT?
23 FULL FACTORIAL EXPERIMENT
388
NO. TEMP.
1
RT
2
90
3
RT
4
90
5
RT
6
90
7
RT
8
90
TIME
3 mins
3 mins
10 mins
10 mins
3 mins
3 mins
10 mins
10 mins
CONC.
Low
Low
Low
Low
High
High
High
High
12
RT
3mins
Low
65
16
13
90
10 mins
High
45
14
90
3mins
High
45
15
90
10 mins
Low
41
16
90
3mins
Low
44
Note here the second column gives the run order on which the experiment
has to be conducted.