Академический Документы
Профессиональный Документы
Культура Документы
y = b0 + x ′b + x ′Bx
RESPONSE
75
65
55
5
4
45
55 3 X1
45
X2 35
25 2
INSTRUCTORS:
SECTION I: Introduction
Every data collection either explicitly or implicitly reflects the choice of an experimental design
strategy. This choice of design strategy, more than any other factor, determines how successful
an experimental program will be. The impact of using poor design strategies include technical
stagnation, inefficient use of resources, incorrect conclusions, and incomplete or missed
solutions.
This short seminar examines the technical considerations involved in the selection of an
experimental design strategy. Well selected design strategies are powerful tools for the
systematic study of processes (laboratory or plant). This material represents a core set of
knowledge which has proven to be applicable to many problems at Air Products and Chemicals,
Inc.
This seminar was originally developed by T. Bzik, S. Mehta, D. Neogi, J. Sheesley, and C. A.
Valenzuela from the Research and Engineering Systems Department. The notes are reviewed
and modified every year to incorporate changes in the discipline.
2. Box, George E.P. and Behnken, D.W. (1960), "Some New Three Level Designs for the Study of
Quantitative Variables," Technometrics, 2, pp 455-475.
3. Box, George E.P. and Draper, Norman R. (1987), Empirical Model-Building and Response
Surfaces, John Wiley and Sons, Inc., NY.
4. Box, George E.P.; Hunter, William G.; and Hunter, J. Stuart (1978), Statistics for Experimenters,
John Wiley and Sons, Inc., NY.
5. Cornell, John A. (1990), Experiments with Mixtures, 2nd Edition, John Wiley and Sons, New
York.
6. Draper, Norman R. and Smith, H. (1998). Applied Regression Analysis, 3rd Edition, John
Wiley and Sons, Inc., NY.
7. Montgomery, Douglas C. (1997), Design and Analysis of Experiments, John Wiley and Sons,
4th Edition.
9. Neter, J., Wasserman W., and Kutner M. H. (1985). Applied Linear Statistical Models, Richard
D. Irwin, Inc. Illinois.
10. Taguchi,G.and Konishi,S.(1987), Taguchi Methods: Orthogonal Arrays and Linear Graphs,
American Supplier Institute, Inc.
11. Schmidt, S. R. and Launsby, R. G. (1992), Understanding Industrial Designed Experiments, Air
Academy Press.
12. Wonnacott, T. H., Wonnacott, R. J. (1981), Regression: A Second Course in Statistics, Robert E.
Krieger Publishing Co.
Frequently conclusions are easily drawn from a well designed
experiment, even when rather elementary methods of analysis
are employed. Conversely, even the most sophisticated analyses
cannot salvage a badly designed experiment.
Box, Hunter, and Hunter
Table of Contents
ONE-AT-A-TIME..................................................................................................................................................... 6
THE DATA............................................................................................................................................................. 7
REGRESSION ANALYSIS ................................................................................................................................ 8
A BOX-BEHNKEN DESIGN... ................................................................................................................................ 11
The best (parsimonious) model... ......................................................................................................... 12
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-3
y = φ( x 1 , x 2 , . . . , x n )
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-4
A Typical Strategy
While there are various ways to analyse the results of
experiments, this course will emphasize REGRESSION
ANALYSIS as the principal tool of choice.
data
structure
Input
Data
Models
y = f ( x1 , x2 , . . . , x n )
regression
Analysis
selection
Best
Model
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-5
• Regression analysis
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-6
ONE-AT-A-TIME
A computer generated example: 3 variables and one
response:
x1 , x2 , x3 y
y = f ( x1 , x2 , x3 )
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-7
THE DATA
Obs # X1 X2 X3 Y
1 1 3 3 38.99
2 2 3 3 66.53
3 3 3 3 92.94
4 4 3 3 122.34
5 5 3 3 147.62
6 3 1 3 40.25
7 3 2 3 68.02
8 3 3 3 94.18
9 3 4 3 119.48
10 3 5 3 146.48
11 3 3 1 95.26
12 3 3 2 94.90
13 3 3 3 94.04
14 3 3 4 93.58
15 3 3 5 91.36
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-8
REGRESSION ANALYSIS
R2 = 0.9997
RMSE = 0.7334
y = −68 . 61 + 55 . 67 x 1 − 9 . 13 x 1 x 3 + 9 . 17 x 2 x 3 − 0 . 16 x 12 − 0 . 19 x 22 − 0 . 17 x 32
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-9
X1 X2 X3 Pred Y True Y
5 5 5 197.74 230.00
1 2 3 12.24 30.00
1 1 1 -13.42 22.00
3 3 3 94.08 94.00
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-10
Conclusions:
• Unreliable models
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-11
A Box-Behnken design...
(same number of experiments)
Obs # X1 X2 X3 Y
1 5 5 3 231.45
2 5 1 3 64.13
3 1 5 3 59.77
4 1 1 3 21.80
5 5 3 5 144.49
6 5 3 1 150.48
7 1 3 5 38.47
8 1 3 1 40.76
9 3 5 5 142.04
10 3 5 1 149.69
11 3 1 5 39.27
12 3 1 1 42.38
13 3 3 3 94.60
14 3 3 3 94.70
15 3 3 3 95.58
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-12
Total 48899.1126 14
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-13
Standardized
Ob Observed Predicted Residuals
Residuals Leverage Cook's D Atkinson's T
s Response Response
Internal External
1 231.45 231.9515 -0.5015 0.5667 -0.5559 -0.5357 0.0808 -0.8664
Shapiro-Wilks W : 0.9381
Significance level: 0.5923
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-14
Project : Introduction
Analysis : RS Model 2
Prediction : Model Prediction 1
Response : Y1 (Y1)
y = 10 + 3x1 + 2 x2 − x3 + 8 x1 x2 + ε
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Introduction Intro-15
ε = N ( 0,1)
________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
SECTION II
REGRESSION ANALYSIS
Regression Analysis REG - 1
Table of Contents
BASIC ASSUMPTIONS........................................................................................................................................3
NOMENCLATURE.................................................................................................................................................4
MULTICOLLINEARITY ....................................................................................................................................46
VARIANCE INFLATION FACTORS ..............................................................................................................48
EIGENVALUES AND CONDITION NUMBERS........................................................................................49
POSSIBLE REMEDIAL ACTIONS .................................................................................................................51
THE CORRELATION COEFFICIENT ...........................................................................................................52
M O D E L B U I L D I N G ........................................................................................................................................55
POLYNOMIAL EXPANSIONS.........................................................................................................................71
Basic concepts
POPULATION
SAMPLE
BASIC ASSUMPTIONS
y = β0 + β1 x1 + β2 x 2 +...+ βn x n + ε
Estimated Model:
y$ = b0 + b1 x + b2 x 2 +...+ bn x n
NOMENCLATURE
INDEPENDENT VARIABLES
(deterministic or random variables
with constant variance)
y = β 0 + β1 x1 +... + β n x n + ε
COEFFICIENTS
(Random variables)
σ 2b
β ERROR TERM
DEPENDENT (random variable)
VARIABLE OR
σ 2ε
RESPONSE
(Random variable)
σ 2y
0
E(y)
⎛ E ⎞
−⎜ ⎟ m
W = K0e ⎝ RT ⎠
[
∏ iA ] p i
which becomes
m
+ ∑ pi ln[ Ai ]
E
ln W = ln K 0 −
RT 1
WHAT ABOUT:
− bx
• y = ae
a + bx
• y =
c + dx
θ − θ 2t −θ1t
• Y = 1
e − e +ε
θ1 − θ 2
The objective is to find the estimates of β such that the sum of the
square errors is minimized. This is known as the Least Squares
Method (LSE).
Minimize Q = ∑e 2
i = ∑ ( y i − y$ i ) 2
Min Q = ∑ ( y − b0 − b1 x1 )
2
∂Q
=0 ⇒ nb 0 + b1 ∑ x i = ∑y
∂b0
i
i i
∂Q
=0 ⇒ b0 ∑ x i + b1 ∑ x i2 = ∑x y
∂b1
i i
i i i
Solving for b0 , b1
∑ ( x − x )( y − y )
i i
b1 = i
∑( x − x )i
i
2
b0 = y − b1 x
the OLS equation becomes
y$ i = b0 + b1 x i
Matrix Notation
In matrix notation the equations become
y = Xβ + ε ====> y$ = Xb
y: nx1
X: nx(p+1)
b: (p+1)x1
ε : nx1
⎢. . . . ⎥
⎢ ⎥
⎣1 x n1 xn2 ... x np ⎦
Q = Y ′Y − b′ X ′Y − Y ′Xb + b′ X ′Xb
Q = Y ′Y − 2b′ X ′Y + b′ X ′Xb
∂Q
= −2 X ′Y + 2 X ′Xb = 0
∂b
( X ′X )b = X ′y
whose solution is
b = ( X ′X ) − 1 X ′y
σ b2
Mean = E ( b1 ) = β1
s 2 ( b ) = MSE( X ′X )−1
THE IMPORTANCE OF ( X ′X )
−1
⎡n 0 0 0 ⎤
⎢0 0 ⎥
X ′X = ⎢
∑x 2
i1 0
⎥
⎢0 0 ∑ xi22 0 ⎥
⎢
⎣0 0 0 ∑ xi23 ⎥⎦
−1
3. The matrix ( X ′X ) is part of the equation for the variances
of b, Y, etc. Hence, it is important to structure the data
collection procedure to compute it accurately.
Multicollinearity is a typical problem, leading to a
determinant which is close to zero.
A FEW DETAILS...
σ 2
σ 2
σ 2
Notes:
Assumptions regarding ε:
normal distribution N( 0, σ )
2
2. E( ε i ) = 0
Variance = σ2
MODEL EVALUATION
Typical questions:
ANOVA TABLE
ei = yi − yˆ i
y
yˆ i − y i
x x
y i − y = ( y i − y$ i ) + ( y$ i − y )
ANOVA TABLE
∑ (y
i
i − y ) 2 = ∑ ( y i − y$ i ) 2 + ∑ ( y$ i − y ) 2
i i
Here SST represents the total sum of squares; SSE is the sum of
squares due to error and SSR is the sum of squares due to regression.
Typically, this is shown as a table.
Mean
SOURCE SS df Squares F
TOTAL ∑ ( yi − yi )2 n-1
i
Best estimate
of σ
2
SSE
Estimate of σ = MSE = 2
n − p −1
X1 X2 Y
(Temp, 0 C ) (Time, min.) (Gloss)
PLOTS
Project Name : MIT One at a TIME Project Name : MIT One at a TIME
Plot Name : User Plot1 [RS Model 2] Plot Name : User Plot1 [RS Model 2]
3/10/2004 10:54:13 AM 3/10/2004 10:58:19 AM
45 45
40 40
Observed Gloss(Y1)
Observed Gloss(Y1)
35 35
30 30
25
25
0 1 2 3 4 5 6 7 8 9 10 11
70 80 90 100
Temp(X1)
Time(X2)
3D Graph 1
45
40
35
Gloss
30
12
25 10
8
6
p
m
20 4
100
Te
95 2
90
85
80 0
75
70
Tim e 65
Stat-Studio: Output
Data Structure Evaluation
Project : Gloss Properties of Coating
Analysis : RS Model 1
Continuous variables
Mi
Variable Label Max
n
X1 Temp (C) 0 10
For achieving reliable modeling results, it is essential that the experimental data has a proper robust
structure.This structure should be such that it supports an unbiased estimation of the quadratic model
used in response surface methodology.
The data structure was investigated through a Variance Inflation Factor (VIF) analysis.
Results:
VIF for each term in the model
Intercept = 0
X1 X2
Temp (C) Time (Min)
1.1268 1.1268
Conclusion:
Data has the (near) orthogonal structure required for the robust modeling.
Second order polynomial modeling should be robust.
Design is balanced.
Main-effects:
X1 X2
X1 1 0.3355
X2 0.3355 1
3/18/2004 2:11:11 PM
Results
Project : Gloss Properties of Coating
Analysis : RS Model 1
Pure Error 0 0 0
Total 599.7855 19
Shapiro-Wilks W : 0.9733
Significance level: 0.8998
Parity Plot
44
42
40
38
36
Observed Response
34
32
30
28
26
24
22
20
20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45
Predicted Response
H 0 : β 1 = ... = β i = 0
H α : At least one β i ≠ 0
Pure Error 0 0 0
Total 599.7855 19
CONCLUSION:
H0: βi = 0
Hα : βi ≠ 0
Decision rule:
CONCLUSION:
Typically, we rely on these criteria to assess how good a model is: the
best model will have
SSR
where, R 2
= can be obtained from the ANOVA table,
SST
500.329
R2 = = 0.834
599.786
RMSE = 2.4188
a) The mean value of gloss (y) for all coatings rated at 5 0C and a curing
time of 90 minutes is:
CONFIDENCE INTERVALS
1. The 95% confidence interval of the gloss for one particular coating
is
2. The 95% confidence interval of the mean gloss for all coatings
rated at 5 0C and a curing time of 90 minutes is
1
Standardized Residuals
-1
-2
-3
21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39
Predicted Response
Prediction Plots
Project Name : Gloss Properties of Coating
Plot Name : User Plot1
10/3/2002 2:59:56 PM
Predicted Gloss(Y1)
Observed Gloss(Y1)
44
42
40
38
36
34
32
30
28
26
24
22
20
-1 0 1 2 3 4 5 6 7 8 9 10 11
Tem p (C)(X1)
Predicted Gloss(Y1)
Observed Gloss(Y1)
44
42
40
38
36
34
32
30
28
26
24
22
20
70 80 90 100
Tim e (Min)(X2)
Response Plots
42.3
39.9
37.5
35.1
32.7
30.3
27.9
25.5
23.1
20.7
42.3
39.9
37.5
35.1
32.7
30.3
27.9
25.5
23.1
20.7
RESIDUAL ANALYSIS
1. RANDOM PATTERN
Residual
Predicted y
In this case, residuals are randomly scattered about the center line or
zero residual. The model is likely to be good (provided the other
measures of performance are good as well).
2. FUNNEL
Residual
Predicted y
This pattern occurs when the model has nonconstant variance. The
model, therefore, is inadequate. Transformations of the independent or
dependent variables are typically needed.
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Regression Analysis REG - 38
Residual
Predicted y
This pattern typically occurs when the errors are not independent.
Again, transformations of the independent or dependent variables may
be needed.
EXAMPLE (PLASMA)
X Y
(Age) (Level)
.000 17.000
.000 11.200
1.000 9.200
1.000 12.600
2.000 7.400
2.000 10.500
3.000 8.300
3.000 5.800
4.000 4.600
4.000 6.500
5.000 5.300
5.000 3.800
6.000 3.200
6.000 4.500
Total 199.095 13
Shapiro-Wilks W : 0.9125
Significance level: 0.411
The residuals...
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Regression Analysis REG - 42
1
Standardized Residuals
-1
-2
-3
2 3 4 5 6 7 8 9 10 11 12 13
Predicted Response
WARNINGS....
Y ′ = ln(Y )
1
Standardized Residuals
-1
-2
-3
1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2.0 2.1 2.2 2.3 2.4 2.5 2.6 2.7
Predicted Response
Is it better?
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Regression Analysis REG - 45
MULTICOLLINEARITY
−1
We have seen the importance of the matrix ( X ′X ) . Some problems
that may arise when computing the matrix are:
−1
2) If any two variables are highly correlated, the matrix ( X ′X )
will have a determinant which is close to zero.
coefficients.
• Unstable estimates
coefficients
DETECTING MULTICOLLINEARITY:
X ′X
y = f ( x1 , x 2, ..., x p )
1
VIFi =
1 − Ri2
A rule-of-thumb (there’s no agreement among
statisticians!) is that if VIF exceeds a value of 10, there’s
strong reason to suspect multicollinearity.
Variance
Variable Inflation
INTERCEP 0.00000000
X1 1.12684384
X2 1.12684384
2. Condition number:
Largest Eigenvalue
CN i =
Eigenvalue i
Variance Variance
Condition Proportion Proportion
Number Eigenvalue Number X1 X2
If the objective is
to use the model
for:
Transform variables
sxy =
∑(x i − x )( y i − y )
Let
n −1
then
s xy
r =
sx s y
• The sign of r is the same as the sign of the slope of the linear
trend component.
Y | Y |
45 + 100 + A
| A | AA
| AA | AA
40 + A | A
| AA | AA
| AA 80 + A
35 + A | AA
| AA | AA
| AA | A
30 + A | AA
| AA 60 + A
| AA | AA
25 + A | AA
| AA |
| |
20 + 40 +
--+---+---+---+---+---+-- --+---+---+---+---+---+-
10 12 14 16 18 20 10 12 14 16 18 20
X X
Y | 2000 + A
| |
| | A
| | A
| Y | A
| | A
| | A
| | A
15 + AAAAAAAAAAAAAAAAAAAAA 1000 + A
| | AA
| | A
| | AA
| | AA
| | AA
| |AAAA
| |
| 0 +
--+---+---+---+---+---+-- -+---+---+---+---+---+-
10 12 14 16 18 20 10 12 14 16 18 20
X X
MODEL BUILDING
MOTIVATION
Y = φ( x1 , x2 ,..., x n ) + ε
The function φ is continuous in the region of interest. In building an
empirical model, the typical steps are:
START
Experimental RAW
Design DATA DATA
Strategy
data
structure
MODELS
y = b0 + x ′b + x ′Bx
subset
model
regression
ANALYSIS
selection
Best
Model
k k k −1 k
y = b0 + ∑ bi xi + ∑ b x + ∑ ∑ bij xi x j 2
ii i
1 1 i =1 j >i
⎛ E ⎞
−⎜ ⎟ m
W = K 0e ⎝ RT ⎠
[
∏ iA ] p i
which becomes
m
E
ln W = ln K 0 − + ∑ pi ln A i
RT 1
EXAMPLE (POLYMER)
POLYMER EXAMPLE
Suppose we are trying to maximize the density (g/ml) of a polymer
which is a function of time and temperature.
Time: 2 - 60 minutes
THE DATA
Obs X1 X2 Y
• Notice that the three replicates of the center point are placed last in
the data set to allow SAS to compute lack-of-fit statistics.
Comments
*********************************************;
* Original File :CCD1.SAS ;
* Study Type : Response Surface ;
* Factor Coding : Coded ;
*********************************************;
DATA Exp2;
INPUT
DSN_ID /* Design ID Points (optional)*/
X1 /* Reaction Time (min) */
X2 /* Reaction Temp (Celsius) */
Y /* Yield (g/ml) */
;
CARDS;
1 60 190 101
2 50 155 72 Variable names
3 50 225 101
4 30 140 70
5 30 170 91
6 10 155 70
7 10 225 83 Data
8 2 190 70
9 30 240 .
10 30 190 98
11 30 190 103
12 30 190 97
;
RUN;
title 'CCD Example with a Missing Point ';
title2 'Input Data';
proc print data=exp2;
run;
Obtain echo
report
USEFUL TRANSFORMATIONS
Y′ = Y
Y ′ = log10 (Y )
Y ′ = ln( Y )
1
Y′ =
Y
X4 = X3 (power)
X5 = x (a common power)
X6 = log(X) (logarithmic)
X7 = exp(X) (exponential)
2 Level Variable:
Levels X[1]
A -1
B 1
3 Level Variable:
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Regression Analysis REG - 65
B 0 1
C -1 -1
4 Level Variable:
B 0 1 0
C 0 0 1
D -1 -1 -1
5 Level Variable:
B 0 1 0 0
C 0 0 1 0
D 0 0 0 1
E -1 -1 -1 -1
SSE( x1 , x2 ) − SSE( x1 , x2 , x3 )
(# of Extra Terms)
F=
SSE( x1 , x2 , x3 )
n − p −1
Hypotheses:
Ha: At least one new coefficient not equal to zero (i.e., β3≠0)
Decision rule:
Reject H0 if
X1 = % Copper
X2 = Temperature
X1 X2 X3 Y
(% Copper) (Temp) (Manufacturer) (Thermal Conductivity)
7 100 B 43
1 86 A 23.7
5 82 B 34.3
8 86 B 35.8
10 84 B 38
0 75 A 22.2
1 80 A 23.1
6 83 A 30
6 91 B 33
9 88 B 38
2 73 A 26.6
10 75 B 36.2
5 81 A 31.6
6 74 A 29
8 87 B 34
4 79 A 30.1
6 94 B 33.9
3 70 A 28.2
3 89 A 30
Levels X3[1]
A -1
B 1
Pure Error 0 0 0
Total 599.7855 19
E ( y ) = β 0 + β 1 x1 + β 2 x 2 + β 3 ( −1)
E( y ) = β 0 + β1 x1 + β 2 x 2 + β 3 (1)
POLYNOMIAL EXPANSIONS
y = β 0 + β1 x + β 2 x 2 + β 3 x 3 + ε
Polynomial expansions:
1. Graph, both the data and the model as part of a routine analysis.
3. Check key assumptions! Make sure they are at least crudely met.
12. Watch out for of data points which are relatively far away from
the rest of the data.
16. Never force fit a regression analysis into software. Find software
which fits your regression analysis/problem well.
ANALYZING A TYPICAL
COMPUTER REPORT
I. SAS OUTPUT
ANALYSIS OF VARIANCE
(15)
(14) Sum of Mean (3) (4,6)
Source DF Squares Square F Value Prob>F
p-value (6) - Used to judge whether the effect associated with a given F or t
statistic was statistically significant or not. Smaller p-values indicate stronger
statistical evidence of a given result (leading to the conclusion that the associated
coefficient is not zero). The decision criterion is that a result is statistically
significant when p-value < α (usually α= 0.05).
residual (7) - (observed value - predicted value) = error in prediction. Very useful
in detecting problems with regression models.
mean (9) - the arithmetic mean of the dependent variable, i.e., "y"
R-square (11) - The square of the multiple correlation coefficient. Also called the
coefficient of determination. The percentage of the variation (1=100%) in the
dependent variable which can be explained through use of the regression model.
Ranges between 0 and 1.
standard error of estimate (16) - The standard deviation with which the
parameter in question was estimated. A measure of uncertainty of the estimated
value.
All examples and figures provided in this section of the seminar were constructed
with the use of SAS/PC the PC based version of SAS (Statistical Analysis System,
SAS Inc., Cary, NC). SAS software runs in many computing environments (PC,
IBM Mainframe, VAX, HP, ...).
The following program illustrates how to read in a simple set of data into SAS, print
the data, request a regression analysis with confidence limits, perform a graphical
residual analysis, and examine a plot of the model.
If confidence limits for the mean were desired one would simply change the cli in
the above to clm instead. If a no-intercept regression was desired as well the option
list would have been: / p clm noint
If a multiple regression model was desired one would simply add the additional
independent variable names in the model statement.
PROC STEPWISE;
MODEL YIELD = TEMP RPM FLOW FEED / MAXR;
Table of Contents
1. INTRODUCTION..............................................................................................................................................2
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-2
1. Introduction
Experiment:
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-3
Mostly
• improving existing products/processes
• developing new products/processes
• developing new applications (“fishing”)
Frequently
• fundamental models: “scale-up” or “optimizing”
control
control Industrial R&D resulting
resulting
variables
variables response
response
[x1,..,x
XX==[x ,..,xn]] Semi-Black Box yy
1 n
Approach
y = Xβ + ε
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-4
2. A Photographic Process
The concept phase
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-5
Problem Discussion
• How many factors can be studied ?
• resources, constraints in Lab and Plants
• Which factors are to be studied ?
• What factors are most important ?
• What model to be used ?
• linear, polynomial, theoretical (first
principles)
• Which experimental design strategy?
• One-at-a-time, Taguchi, Factorial, Box-
Behnken, ...
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-6
Goal:
To obtain a maximum IQ near 100 with the
least variation
Objective:
To find an empirical model of IQ as a
function of
» Temperature
» Ph
» Humidity
» Speed
» Metol conc
» HQ conc
» KI conc
Optimize the response surface
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-7
Regression Analysis
Variance of coefficients:
^
s ( β ) = MSE ( X ′X ) −1
2
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-8
The
The Design
Design Pyramid
Pyramid
Design Model
theoretical
empirical
(response surface)
screening
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-9
The
The Design
Design Pyramid
Pyramid
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-10
3. FACTORIAL DESIGNS
Factorial experiments are a series of experiments in
which several factors are controlled in order to
investigate their effects on some dependent variable.
Definitions
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-11
y
X2 = +1
X2 = -1
X1
Figure 1. No Interaction
y
X2 = +1
X2=-1
X1
Figure 2. Interaction
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-12
Example of a 22 design
Two independent variables are given below.
X1
-1 +1
-1 103 125
X2
+1 198 142
Effect Calculation
x1 1/2[(125-103) + (142 - 198)] = -17
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-13
Full-Factorial Design
A full factorial design is one in which
all possible combinations of the factors at
all levels involved in the experiment are
used. The number N of all possible
combinations to be performed during the
experiment is given by
N = nk
Where
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-14
A three-factor design...
Run A B C
1 +1 +1 +1
2 +1 -1 -1
3 -1 +1 -1
4 -1 -1 +1
N
∑ x = 0
ji
i=1
• A 2-level design matrix is said to be orthogonal if it is
balanced and if the sum of every dot product for all
possible variable pairs is zero,
N
∑ x ji xui = 0 for all j ≠ u
i =1
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-15
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-16
2-Way 3-Way
Main Effects Interactions Interactions
A AB ABC
B AC
C BC
Run A B C AB AC BC ABC
1 -1 -1 -1
2 -1 -1 +1
3 -1 +1 -1
4 -1 +1 +1
5 +1 -1 -1
6 +1 -1 +1
7 +1 +1 -1
8 +1 +1 +1
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-18
EXAMPLE (YIELD)
This is a 23 design...
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-19
Level
Independent
Variable Low High
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-20
Coding
⎜⎛ zi −
⎛ High + Low ⎞ ⎟⎞
⎝ ⎝ 2 ⎠⎠
xi =
⎛ High − Low ⎞
⎝ 2 ⎠
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-21
⎛ ⎞
⎜ ⎟
⎜ z1 − 4 ⎟ = ⎛⎜ z1 − 4 ⎞⎟
Pressure: ⎜ ⎛ 6 − 4⎞ ⎟ ⎝ 2 ⎠
⎜ ⎜⎝ ⎟⎟
⎝ 2 ⎠⎠
⎛ ⎞
⎜ ⎟
⎜ z 2 − 150 ⎟ = ⎛⎜ z 2 − 150 ⎞⎟
Temperature: ⎜ ⎛ 200 − 100 ⎞ ⎟ ⎝ 50 ⎠
⎜ ⎜⎝ ⎟⎟
⎠⎠
⎝ 2
⎛ ⎞
⎜ ⎟
⎜ 3 z − 40 ⎟ = ⎛⎜ z3 − 15 ⎞⎟
Reaction Time: ⎜ ⎛ 20 − 10 ⎞ ⎟ ⎝ 5 ⎠
⎜ ⎜⎝ ⎟⎟
⎝ 2 ⎠⎠
0 0 0
The point with coordinates ( z1 , z2 ,. . . , zk )
zmax + zj
min
j
z =
0
j ; j = 1, 2, .. . , k
2
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-22
Factors on
Natural Coded
Scale Factors Yield
Obs. z1 z2 z3 x1 x2 x3 y
1 100 2 10 -1 -1 -1 2
2 200 2 10 1 -1 -1 6
3 100 6 10 -1 1 -1 4
4 200 6 10 1 1 -1 8
5 100 2 20 -1 -1 1 10
6 200 2 20 1 -1 1 18
7 100 6 20 -1 1 1 8
8 200 6 20 1 1 1 12
Center Points
9 150 4 15 0 0 0 8
10 150 4 15 0 0 0 9
11 150 4 15 0 0 0 8.8
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-23
3/17/2005 10:17:27 AM
Results
Project : New Design Project
Analysis : RS Model 2
Replication Error 0 0 0
Total 174.5818 10
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-24
Shapiro-Wilks W : 0.9166
Significance level: 0.5271
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-25
DESIGN-EASE Analysis
Yield
18.00
A
c 15.33
t
u
a 12.67
l B-
Y 10.00 B+
i
e
l
d 7.33
B+
B-
4.67
2.00
A- A+
DESIGN-EASE Analysis
Yield
18.00
A
c 15.33 C+
t
u
a 12.67
l
Y 10.00
i
e C+
l
d 7.33 C-
4.67
C-
2.00
A- A+
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-26
DESIGN-EASE Analysis
Yield
18.00
A
c 15.33
t C+
u
a 12.67
l
Y 10.00 C+
i
e
l
d 7.33
C-
4.67
C-
2.00
B- B+
DESIGN-EASE Analysis
Actual Yield
8.00 12.00
B+ 4.00 8.00
T
e
m 10.00 18.00 C+
p
e
m
i
T
B- 2.00 6.00 C-
A- A+
Pressure
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-27
RANDOMIZATION
Coded Actual
Experimen Random
t x1 x2 x3 z1 z2 z3 Order
#
1 - - - 2 100 10 2
2 + - - 6 100 10 4
3 - + - 2 200 10 6
4 + + - 6 200 10 1
5 - - + 2 100 20 7
6 + - + 6 100 20 5
7 - + + 2 200 20 8
8 + + + 6 200 20 3
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-28
Number of levels
Number of Factors 2 3 4
2 4 9 16
3 8 27 64
4 16 81 256
5 32 243 1024
6 64 729 4096
7 128 2187 16384
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-29
ADVANTAGES OF FULL
FACTORIAL DESIGNS
1. Orthogonality
2. No aliasing
DISADVANTAGES?
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-30
FACTORIAL DESIGN
PROCEDURE
6. Compute effects.
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-31
4. SCREENING DESIGNS
• Used at the beginning of the program when
process knowledge is relatively limited
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-32
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-33
Plackett-Burman Designs
• Developed by R. L. Plackett and J. P. Burman
during World War II.
• Fractional factorial:
2k-p = 4, 8, 16, 32, 64, 128, ... .
• Plackett-Burman Designs:
4, 8, 12, 16, 20, 24, 28, ...
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-34
n=8 (+ + + - + - -)
n = 12 (+ + - + + + - - - + -)
n = 16 (+ + + + - + - + + - - + - - -)
n = 20 (+ + - - + + + + - + - + - - - - + + -)
n = 24 (+ + + + + - + - + + - - + + - - + - + - - - -)
RUN A B C D E F G
1 + + + - + - -
2 + + - + - - +
3 + - + - - + +
4 - + - - + + +
5 + - - + + + -
6 - - + + + - +
7 - + + + - + -
8 - - - - - - -
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-35
1 + + - + + + - - - + -
2 + - + + + - - - + - +
3 - + + + - - - + - + +
4 + + + - - - + - + + -
5 + + - - - + - + + - +
6 + - - - + - + + - + +
7 - - - + - + + - + + +
8 - - + - + + - + + + -
9 - + - + + - + + + - -
10 + - + + - + + + - - -
11 - + + - + + + - - - +
12 - - - - - - - - - - -
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-36
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-37
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-38
EXP X1 X2 X3 X4 X5 RESP
1 + + + - + 90
2 + + - + - 85
3 + - + - - 82
4 - + - - + 50
5 + - - + + 91
6 - - + + + 48
7 - + + + - 60
8 - - - - - 64
X1 Effect
(90 + 85 + 82 + 91)/4 - (50 + 48 + 60 + 64)/4 =
87.00 - 55.50 = 21.50 (Rank = 1)
X2 Effect
(90 + 85 + 50 + 60)/4 - (82 + 91 + 48 + 64)/4 =
71.25 - 71.25 = 0.00 (Rank = 5)
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-39
X1: 60 to 100
X2: 2 to 8
X3: 0 to 50
X4: 25 to 75
X5: 20 to 50
X6: 10 to 30
DATA
OBS SEL X1 X2 X3 X4 X5 X6
1 86.3093 100 2 50 25 50 10
2 86.3841 60 2 50 75 50 30
3 83.0953 60 8 0 75 50 10
4 85.8008 60 2 50 75 20 30
5 86.0550 100 8 50 25 20 30
6 77.7010 60 8 50 25 20 10
7 88.5746 100 2 0 25 50 30
8 87.9833 100 8 0 75 20 30
9 81.8652 60 8 0 25 50 30
10 86.8871 100 8 50 75 50 10
11 87.4913 100 2 0 75 20 10
12 78.7226 60 2 0 25 20 10
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-40
CORRELATION ANALYSIS
SEL X1 X2 X3 X4 X5 X6
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-41
MODEL1
Dependent Variable: SEL
Analysis of Variance
Sum of Mean
Source DF Squares Square F Value Prob>F
Parameter Estimates
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-42
5. COMMON DESIGN/ANALYSIS
PITFALLS
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Experimental Design Des-43
_____________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-1
SECTION IV
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-2
Table of Contents
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-3
1. RESPONSE SURFACE
CONCEPTS
Y = φ( x1 , x2 ,..., x n ) + ε
START
Experimental RAW
Design DATA DATA
Strategy
data
structure
MODELS
y = b0 + x ′b + x ′Bx
subset
model
regression
ANALYSIS
selection
Best
Model
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-4
k k k −1 k
y = b0 + ∑ bi xi + ∑ b x + ∑ ∑ bij xi x j
2
ii i
1 1 i =1 j >i
⎛ E ⎞ m
∏[ A ]
−⎜ ⎟ pi
⎝ RT ⎠
W = K 0e i
1
which becomes
m
E
ln W = ln K 0 − + ∑ pi ln A i
RT 1
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-5
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-6
Empirical Designs
(Response Surface Designs)
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-7
TYPICAL STRATEGY:
Response (y)
X2
x1
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-8
y$ = b0 + x ′b + x ′Bx
where, b0, the intercept, is a scalar.
Then
∂y$
= b + 2Bx = 0
∂x
1
x 0 = − B −1 b
2
The predicted response is
1
y$ 0 = b0 + x 0 b
2
The canonical form is
k
y$ = y$ 0 + ∑ λ i wi2
1
where
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-9
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-10
Response
Response Surface
SurfaceShapes:
Shapes: Plane
Plane
3-D
CONTOUR
2.0
61
67
70
74 1.5
RESPONSE
X2
66 1.0
58
64
55
58 0.5
9
52
8
50 0.0
2.00 7 X1 6.00 7.50
1.33
6.75 8.25 9.00
0.67
X2 0.00 6 X1
Response
Response Surface
SurfaceShapes:
Shapes: Twisted
TwistedPlane
Plane
3-D
CONTOUR
2.0 33
38
65 1.5
43
X2
RESPONSE
54 1.0
48
43 0.5 53
9
58
8 63
0.0
32 7 X1 6.00 6.75 7.50 8.25 9.00
2.00
1.33 X1
X2 0.67 6
0.00
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-11
Response
ResponseSurface
Surface Shapes:
Shapes: Maxima
Maxima
3-D CONTOUR
55.0
66
75 47.5
X2
70
RESPONSE
65 40.0
66
55
5 32.5
4 62
45 54
58
55 3 X1 25.0 50 58
45
35 2.00 2.75 3.50 4.25 5.00
X2 25 2
X1
Response
ResponseSurface
SurfaceShapes:
Shapes: Stationary
StationaryRidge
Ridge
3-D CONTOUR
70
95.00 55
X2
RESPONSE
85.67 40
70
74
78
82
86
90
90
86
82
78
74
70
94
94
76.33
11 25
8
67.00
70 5 X1
50 10
30 10 2 2.00 4.25
X2 6.50 8.75 11.00
X1
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-12
Response
ResponseSurface
SurfaceShapes:
Shapes: Rising
RisingRidge
Ridge
3-D
CONTOUR
65.0
74
76.00 57.5
X2
RESPONSE
66.67 50.0
70
54
58
57.33 66
18.5 42.5
15.5
58
50
62
48.00 54
12.5 X1 35.0
50
Response
ResponseSurface
Surface Shapes:
Shapes: Saddle
Saddle
3-D CONTOUR
65.0
60
170 150
75
57.5
RESPONSE
90
130
105
X2
50.0 135
120
90
12
12
0
11 42.5
10
5
50 10 X1
90
65
55
75
45 9 35.0
X2 35 9.00 9.75 10.50 11.25 12.00
X1
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-13
• Box-Behnken
• Central Composite
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-14
Box-Behnken
• Near orthogonal...
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-15
Three-variable BB design
Run A B C
#
1 + + 0
2 + - 0
3 - + 0
4 - - 0
5 + 0 +
6 + 0 -
7 - 0 +
8 - 0 -
9 0 + +
10 0 + -
11 0 - +
12 0 - -
13 0 0 0
14 0 0 0
15 0 0 0
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-16
Four-variable BB design
Run A B C D
#
1 - - 0 0
2 - + 0 0
3 + - 0 0
4 + + 0 0
5 0 0 - -
6 0 0 - +
7 0 0 + -
8 0 0 + +
9 0 0 0 0
10 - 0 0 -
11 - 0 0 +
12 + 0 0 -
13 + 0 0 +
14 0 - - 0
15 0 - + 0
16 0 + - 0
17 0 + + 0
18 0 0 0 0
19 - 0 - 0
20 - 0 + 0
21 + 0 - 0
22 + 0 + 0
23 0 - 0 -
24 0 - 0 +
25 0 + 0 -
26 0 + 0 +
27 0 0 0 0
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-17
X2
X3
X1
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-18
Run A B
#
1 + +
2 + -
3 - +
4 - -
5 0 0
6 0 0
7 0 0
8 α 0
9 -α 0
10 0 α
11 0 -α
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-19
Three-variable CCD
Run A B C
#
1 - - -
2 - - +
3 - + -
4 - + +
5 + - -
6 + - +
7 + + -
8 + + +
9 0 0 0
10 0 0 0
11 0 0 0
12 0 0 0
13 0 0 0
14 0 0 0
15 α 0 0
16 -α 0 0
17 0 α 0
18 0 -α 0
19 0 0 α
20 0 0 -α
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-20
( )
1/ 4
α = nf
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-21
α=1
[ ]
1
⎛ F + T − F F ⎞⎟
2 4
α =⎜
⎜ 4 ⎟
⎝ ⎠
where
F=number of factorial points
T=number of axial + number of center points.
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-22
Rule of Thumb:
Uniform Precision:
N
λ4 =
( F + 4F )
1
(4)
2
+4
# 2 3 4 5 6
Controls
λ4 0.7844 0.8385 0.8704 0.8918 0.9070
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-23
CODING A CCD:
design values: -α -1 0 1 α
-------------------------------------------------------------------
real values: 100 (200-∆) 200 (200+∆) 300
Max − Center ∆
=
α 1
300 − 200
= 66 . 66 = ∆
1. 5
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-24
Suppose we want to find out the 'best' model after testing many subset
combinations.
From the statistical point of view, the ‘best’ model is one that satisfies
the following conditions:
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-25
EXAMPLE (POLYMER)
Time: 2 - 60 minutes
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-26
THE DATA
Obs X1 X2 Y
1 60 190 101 Star point
2 50 155 72 Corner point
3 50 225 101 Corner point
4 30 140 70 Star point
5 30 240 . Star point: missing!
6 10 155 70 Corner point
7 10 225 83 Corner point
8 2 190 70 Star point
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-27
3/17/2005 10:25:53 AM
Results
Project : POLYMER EXAMPLE
Analysis : RS Model 1
Replication Error 0 0 0
Total 1932.1818 10
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-28
Shapiro-Wilks W : 0.9376
Significance level: 0.6869
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-29
SSE ( p )
C( p ) = − (n − 2 p) + 1
MSE
where
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-30
CCD Example
Best Subset Regression models using REG
All Possible Models Regression (using original data)
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-31
CCD Example
55
50
45
C(p), Mallows Statistics,
40
# of Parameters
# of Parameters
35
30 C(p)
25
20 Linear (# of
Parameters)
15
10
5
0
0 1 2 3 4 5
# of Regressors in the Model
Temperature 5.3612
-6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6
Standardized Estimates
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Response Surface RS-32
105.9
100.7
95.6
90.4
85.3
80.1
74.9
69.8
64.6
59.4
Time(X1): 21.8474
Temperature(X2): 179.7679
Predicted Density(Y1): 90.9542
__________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
NonLinear Models Nlin-1
SECTION V
Mechanistic Model Building
Table of Contents
1: Introduction
Mechanistic Modeling
Mathematical Models- Classification based on Structure
2: Non-linear Regression
Comparison with Linear Regression
Regression Methodology and Output
Multi-Response Regression
Examples
4: APPENDIX
Data Analysis using Excel
• Introduction
Objective: Develop fundamental understanding on the influence
of various control variables on desired responses.
Design Based
Model Building
Control variables
Mechanistic Experimental
Description Design
(Model Form)
Regression
Design Based Analysis
Mechanistic Modeling
Model
Validation
Computational Modeling Center, 2004
© Air Products and Chemicals, Inc.
NonLinear Models Nlin-4
Application Areas
• Kinetics, reaction rate expressions,
• Thermodynamic correlations,
• Transport correlations,
• Financial,
• Econometric,
• Business, or
• Engineering
Examples:
• Kinetic data analysis for Langmuir-Hinshelwood-Hougen-
Watson type of rate expressions or multiple uses of the
Arrhenius rate expression
abx
r =
1 + bx
• Adsorption isotherm characterization, e.g. Dual-Site-
Langmuir isotherm. Equation of state or thermodynamic
characterization of multi-component mixtures.Growth rate
models etc.
Approach:
• Step 1 – Identify appropriate responses that can be measured in the lab
and have enough sensitivity w.r.t. parameters.
Mathematical Models
y = b0 + b1 z1 + b2 z 2 + L + b p z p
y = ∑ bi z i
i
e.g.,
y = b0 + b1 x1 + b2 x12
b) Non-Linear Models:
Non-linear in parameters
e.g.,
− (θ1 +θ 2 t ) 2
y=e
θ1
y=
θ1 − θ 2
[e −θ t − e −θ t ]
2 1
ax1
y=
(1 + bx1 + cx 2 ) (Langmuir - Hinshelwood)
⎛ E ⎞
− ⎜⎜ ⎟⎟
⋅
y = k0 ⋅ e ⎝ 1 . 987 Temp ⎠
(Arrhenius)
y = W a X bZ c
y = β 1e − β 2 x1
y = b1e − b2 x1
b1 x1
y=
(1 + b2 x1 + b3 x2 )
Y = f ( X ,θ )
Regression Example
Three component kinetic system
A chemist is studying a system in which a reactant A
decomposes to form the desired product B, which in turn
decomposes into and undesired by-product C according to the
scheme:
First-order kinetics,
dC A
= −k C A
dt 1
dC B
= −k CB + k C A
dt 2 1
dCC
= k CB
dt 2
Where:
Y1 = U1 = CA is the concentration of A
Y2 = U2 = CB is the concentration of B
Y3 = U3 = CC is the concentration of C
POPULATION
SAMPLE
NOMENCLATURE
INDEPENDENT VARIABLES
(deterministic or random variables
with constant variance)
y = β 0 + β1 x1 +... +β n x n + ε
Parameters
(Random variables)
ERROR TERM
DEPENDENT (random variable)
VARIABLE OR
RESPONSE
(Random variable)
y = β 0 + β 1 x1 + β 2 x 2 + ... + β n x n + ε
Estimated Model:
yˆ = b0 + b1 x + b2 x 2 + ... + bn x n
Linear Non-Linear
Y = f ( X ,θ ) + ε
Where
Y = {Y1, ….. Yn}, a vector of responses
X = {X1, ….. Xk}, a vector of independent variables
θ = {θ1, ….., θp}, a vector of parameters
ε = {ε1, ….. εn}, a vector of experimental errors
Possible Constraints
L .B . ≤ C ( X , Y , Θ ) ≤ U .B .
or
Expectation Function:
E ( Y ) = f ( X ,θ )
Assumptions
• The errors are additive to the true model. No errors are
made in determining the control variable vector ξ .
or
ε i ~ N (0, σ 2 )
The errors follow a normal distribution with mean 0 and
variance σ2. The scatter about the model does not change
as a function of where you are in the design space.
Residual
Predicted y
• The errors are uncorrelated and independent.
The size and direction of any error is not related to the size
or direction of any other error
Regression Methodology
Least Square Criterion for Regression
Error Sum of Squares for the Non-Linear Model:
N exp
S (θ ) = ∑ {Yu − f ( X u ,θ )}2
obs
u =1
---(Objective Function)
Where
Nexp is the number of observations.
Regression Algorithm
Let
θ 0 = (θ10 ,θ 20 ,K ,θ p 0 )
If we set
f u0 = f (ξ u ,θ 0 )
β i0 = (θ i − θ i 0 )
⎡ ∂f (ξ u ,θ ) ⎤
Z =⎢
0
⎥ ( Jacobian)
iu
⎣ ∂θ i ⎦θ =θ0
We have,
p
Yu − f u0 = ∑ β i0 Z iu0 + ε u
i =1
LINEAR FORM!!
θˆ = ( Z ′Z ) −1 Z ′y
Numerical Algorithms:
General Concerns:
• Converge slowly
• Oscillate
• Diverge
NOTE:
Regression Output
Diagnostic Plots
Lack-of-Fit ANOVA
Variance-covariance matrix
Indicates the hidden dependencies between various Y responses.
N exp
min ∑ j j
w ( y obs
− y pred 2
j )
Θ j =1
L .B . ≤ C ( X , Y , Θ ) ≤ U .B .
Weights:
• Equal – (Ordinary) Least Squares
• 1/sqr(Y) – Approximate Relative Error Criteria for Y>0
• 1/Y – Requires Y>0 and Faster Than Relative Fanning
Predicted y
Transformations
Possible reasons for the use of transformations:
β
Ex: Use of log transformation to linearize: y = x
θx
y= + εi
µ+x
Inverse transformation leads to
1 1 µ 1
= + • + εi
*
y θ θ x
Computer Packages:
1. Statistical Studio
2. Athena
3. SimuSolv
4. SAS
5. STATISTICA
6. SIGMAPLOT
7. Delta Graph
8. NAG ROUTINES
9. EXCEL (solver) – see appendix A
Practical Considerations
• Missing data
Single Response: No problem
Multi-response: More difficult, may be handled in next
version.
Example 1
Computer Code in SAS
data one;
do x = 2 to 10 by 1;
y = 2*(x**.7);
y = y + 0.2*rannor(5934491);
output;
end;
* Specify/perform the nonlinear regression and write out a data set with predicted ;
* and confidence limits. ;
* Print the the data set written out by the nlin procedure.
run;
Example 2
A chemist is studying a system in which a reactant A
decomposes to form the desired product B, which in
turn decomposes into and undesired by-product C
according to the scheme:
β1 β 2
A⇒ B ⇒ C
First-order kinetics,
dη
− 1 =β η
dx 1 1
dη
− 2 = β η −β η
dx 2 2 1 1
Where:
η1 is the concentration of A
η2 is the concentration of B
β1 −β 2 x −β1x
η2 = (e −e )
β1 − β2
Experimental Result:
Time at which reaction is Relative concentration of
terminated product B
x y
1 0.22
2 0.51
4 0.48
5 0.29
6 0.20
7 0.12
Iterative Phase
Sum of
Iter beta1 beta2 Squares
Estimation Summary
Method Gauss-Newton
Iterations 8
Subiterations 1
Objective 0.036312
Observations Read 6
Observations Used 6
Observations Missing 0
Approx
Parameter Estimate Std Error Approx 95% Confidence Limits
beta1 0.5153 0.1374 0.1339 0.8968
beta2 0.3475 0.0549 0.1951 0.4999
0.5
y, conc. of product B
0.4
0.3
0.2
0.1
0
0 1 2 3 4 5 6 7 8
x, time at which reaction is terminated
y = Zβ + ε
is
b = ( Z ' Z ) −1 Z ' y
Mean = E (b) = β
σ b2
and
But the true coefficients in the model are the very thing we’re
trying to estimate!
T T −1
Minimize z ( Z Z ) z over the candidate space.
D-Optimality
Design Algorithms
Popular Methods
• Sequential or Dykstra method
• Simple exchange (Wynn-Mitchell) method
• DETMAX algorithm (exchange with excursions)
• Modified Fedorov (simultaneous switching).
The entrance of the parameter estimates into the solution for the
optimal design points indicates that the use of
Stagewise/sequential design
Sequential Design
Objectives:
Approach:
Advantages:
Y = exp(-p1*x1*exp(-p2*(1/x2-1/620)))
10 Experiments
Expt. No. Time Temperature Conversion
1 10 600 0.991906293
2 40 600 0.967431911
3 80 600 0.934848257
4 100 600 0.91931483
5 150 600 0.881679952
6 10 640 0.860220306
7 40 640 0.548607927
8 80 640 0.301761912
9 100 640 0.222038881
10 150 640 0.106088191
Iterative Phase
Sum of
Iter p1 p2 Squares
Estimation Summary
Method Gauss-Newton
Iterations 6
R 1.088E-6
PPC 1.57E-9
RPC(p2) 0.000013
Object 0.000011
Objective 2.569E-6
Observations Read 10
Observations Used 10
Observations Missing 0
Approx
Parameter Estimate Std Error Approximate 95% Confidence Limits
p1 0.00372 6.877E-6 0.00370 0.00373
p2 27676.3 36.5567 27592.0 27760.6
Response
1.0
0.9
0.8
0.7
Predicted Response
Observed Response
0.6
0.5
0.4
0.3
0.2
0.1
P r o j e c t N a m e : I so m e r i z a ti o n o f B i c y c l o H e x a n e
T a sk N a m e : A u to m a te d A p p r o a c h
P l o t N a m e : U se r P l o t1
0 3 / 1 3 / 2 0 0 1 9 :4 7 :4 0 A M
1.000
0.919
0.837
0.756
0.675
0.593
0.512
0.431
0.349
0.268
0.187
0.105
T im e ( X 1 ) : 5 . 0 3 7 9 6 4 e + 0 0 1
T e m p e r a tu r e ( X 2 ) : 6 .1 6 2 4 5 6 e + 0 0 2
Pr e d ic t e d R e s p o n s e ( Y 1 ) : 8 .6 6 5 5 3 3 e - 0 0 1
1.000
0.919
0.837
0.756
0.675
0.593
0.512
0.431
0.349
0.268
0.187
0.105
Time(X1):
Temperature(X2):
Predicted Response (Y1):
Model Validation
Model validation checks for the performance of the model
in the regions of the design space,
O Which were left out by D-optimal design
discontinuities etc.
D-optimal points
Validation points
Design Space
Validates the model form throughout the design
space.
Nature of D-optimality
Robustness of D-optimality
The “Good Side” of the Force
Robustness is as narrow and focused as the target (tightest fit for
specific model in minimum # experiments).
REFERENCES
Box, G.E.P., and Lucas, H.L., 1959. Design of experiments in non-linear
situations. Biometrika 46, 77-90.
Bates, D.M., and Watts, D.G., 1988, Nonlinear Regression Analysis and its
Applications, John Wiley and Sons, Inc., New York.
Englezos, P., and Kalogerakis, N., 2001, Applied Parameter Estimation for
Chemical Engineers, Marcel Dekker, New York.
Atkinson, A.C., and Hunter, W.G., 1968. The design of experiments for parameter
estimation. Technometrics, 10, 271-289.
Draper, N.R., and Smith, H., 1981. Applied Regression Analysis, Second Edition.
John Wiley and Sons, Inc., New York.
Johnson, M.E., and Nachtsheim, C.J., 1983. Some guidelines for constructing
exact D-optimal design on convex design spaces. Technometrics, 25, 271-277.
Box, M.J., and Draper, N.R., 1971. Factorial Designs, the |X'X|
criterion, and some related matters. Technometrics, 13, 731-742.
1. Integrated Rate models for homogeneous systems, e.g. initial rate measurements for various
feed concentrations and or temperature,
r = ki CαA CBβ CCγ ... − k j CDδ CEλ ...
2. Differential Models for the concentration : Time dependent studies where species
concentrations are measured as a function of time. Changing concentrations are described by a
system of differential equations
dC
= f (C, t, Θ)
dt
3. Langmuir-Hinshelwood-Hougen-Watson models in heterogeneous catalysis. These models
have highly nonlinear algebraic rate expressions with equality and/or inequality constraints. The
simplest example of this type is:
abx
y=
1 + bx
Model 3 can be put into general linear form through use of the reciprocal transformation (some, but not
all nonlinear models can be put into general linear model form via transformations).
1 1 1
= +
y abx a
Now the Linest function in Excel can be used to provide the interval estimates for the coefficients
(1/ab) and 1/a, which can then be used to obtain the point estimates of the original parameters a and b.
Statistical issues arising from estimation in transformed metrics is discussed on pages 13-15. For
example, the standard errors of a and b may need to be calculated by accounting for the propagation of
error relationships during the transformation, a nontrivial exercise. When such situations arise it is
advisable to treat the model in its original form and perform nonlinear regression analysis.
The solver is an add-in and can be selected from the Tools menu. To use the solver to perform
nonlinear parameter estimation follow the procedure given below. The procedure is demonstrated in
next section :
1. Data Tabulation : Start with a worksheet containing the data, the independent variable(s) , x
and observed dependent variable(s), yobs to be fitted.
2. Model Prediction : Assign unknown parameters to certain manipulating cells. To start, guess
some values for these parameters. Provide the formula for the model to predict the response(s),
ypred for each experimental data point. This step is straightforward for algebraic models but for
differential models, predictions will require additional calculations for the solution of differential
equations, see Example 4.
3. Calculate Residuals : For each data point tabulate the residuals (ypred -yobs).
4. Formulate Objective function : This step is essentially related to the choice of the objective
function (O.F.) for regression, e.g. ordinary least squares (OLS) or weighted least squares(WLS) or
any other liklihood based function to be minimized. OLS is the simplest and most commonly used
and O.F. in this case is sum of squares of the residuals, After making the choice, simply provide the
expression for O.F. in a target cell.
5. Use Solver : Load the solver from the Tools menu and minimize the O.F. in the target cell by
changing the parameters in the manipulating cells, see Fig. below. Minimization is a repetitive
procedure and the values of the cells will change during the course of minimization (McFedries,
1995). After finding the minimum, Excel generates Answer and Sensitivity reports.
6. Obtain Regression Statistics : Solver provides the parameter estimates but it doesn’t have any
means for obtaining standard errors and other useful model statistics. This issue can be partially
Solver provides user with a number of options for controlling the way the minimization is
performed, see Fig. below. The user can specify the maximum time or maximum number of iterations
during the solution procedure. Furthermore, it allows the user to select between Newton and conjugate
gradient methods for the search of minimum and also permits the automatic scaling of the equations.
The derivatives required during the minimization are calculated numerically either by forward or central
difference scheme. The most recent update of solver, known as Premium Solver Plus, which is
compatible with the Excel 95 provides greater speed and robustness. It is capable of solving Linear
problems up to 16000 variables and nonlinear problems up to 4000 variables and without any restriction
on the number of constraints. More information about the solver and its algorithm can be obtained from
site www.frontsys.com.
Test Problems
1. Negative Exponential Growth Model : This two parameter model along with the
associated data set is taken from SAS User’s guide. This model is used to describe ionic transport
through membranes :
y = a (1 − exp( −bx ))
1.2 0.015
1
0.01
0.8
RESIDUALS
0.005
0.6
y
0
Measured
0.4 0 5 10 15 20 25
Predicted -0.005
0.2
-0.01
0
0 50 100 150 200 250 -0.015
x x
Excel worksheet with the plots showing fitted response and normality of residuals
ac ( x 2 − x 3 / 1.632 )
f =
1 + bx1 + cx 2 + dx 3
P-Hydrogen P-Pentane P-Isopentane Msd. Rate Pred. Rate Residual Square Res. Initial
205.8 90.9 37.1 3.541 3.664623518 0.123624 0.01528277 Parameter
404.8 92.9 36.3 2.397 2.439770065 0.04277 0.00182928 Estimates
209.7 174.9 49.4 6.694 6.382446885 -0.31155 0.09706534 a = 30.0
401.6 187.2 44.9 4.722 4.91661242 0.194612 0.03787399 b = 0.1
224.9 92.7 116.3 0.593 0.728752532 0.135753 0.01842875 c = 0.05
402.6 102.2 128.9 0.268 0.572920716 0.304921 0.09297664 d = 0.5
212.7 186.9 134.4 2.797 3.108380767 0.311381 0.09695798
406.2 192.6 134.9 2.451 2.500413157 0.049413 0.00244166
133.3 140.8 87.6 3.196 3.884481591 0.688482 0.4740069
470.9 144.2 86.9 2.021 2.269038164 0.248038 0.06152293
300 68.3 81.7 0.896 0.642288292 -0.25371 0.06436963
301.6 214.6 101.7 5.084 4.348546959 -0.73545 0.54089118
297.3 142.2 10.5 5.686 6.305280984 0.619281 0.38350894
314 146.7 157.1 1.193 1.242021333 0.049021 0.00240309
305.7 142 86 2.648 2.855305461 0.207305 0.04297555
300.1 143.7 90.2 3.303 2.802950297 -0.50005 0.25004971
305.4 141.1 87.4 3.054 2.787346017 -0.26665 0.07110435
305.2 141.5 87 3.302 2.812240848 -0.48976 0.23986403
300.1 83 66.4 1.271 1.571592021 0.300592 0.09035556
106.6 209.6 33 11.648 11.67934651 0.031347 0.0009826
417.2 83.9 32.9 2.002 2.202588555 0.200589 0.04023577
251 294.4 41.5 9.604 9.898804916 0.294805 0.08690994
250.3 148 14.7 7.754 7.035909783 -0.71809 0.51565356
145.1 291 50.2 11.59 11.50758569 -0.08241 0.00679212
SS= 3.23448228
14
0.8
12 Observed 0.6
10 Pedicted 0.4
Residuals
8 0.2
6 0
-0.2 0 100 200 300 400
4
-0.4
2 -0.6
0 -0.8
0 100 200 300 400 -1
n-Pentane partial pressure (psia) n-Pentane partial pressure (psia)
Fitted responses vs. various predictor variables and the plot showing normality of residuals
Conclusions
We have demonstrated the use of Excel for nonlinear parameter estimation on a number of kinetic
problems. The results from the first two examples presented above show that the parameter estimates
and standard errors obtained using Excel matched very well with those from SAS. Our interest in using
Excel for nonlinear regression analysis is justified by its comparable performance, easy use, familiarity
and popularity of Excel throughout the company. For those situations which require model confidence
limit estimation, more advanced statistical tools than those currently available in Excel would be
required.
References
1. Bates, D.M., D.G. Watts, Nonlinear Regression Analysis and its Applications, John Wiley and
Sons, N.Y., 1988.
2. Berk, K.N., P.M. Carey, Data Analysis with Microsoft Excel, Doxbury Press, N.Y., 1998.
3. Billo, E.J., Excel for Chemists : A Comprehensive Guide, Wiley-VCH, New York, 1997.
4. McFedries, P., Excel for Windows 95 - Unleashed, Sams Publishing , Indianapolis, 1995.
5. SAS User’s Guide, Vol. 2, Version 6, SAS Institute, Carey, NC, 1990.
MIXTURE EXPERIMENTS
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-2
• Examples
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-3
• Factorial Experiments:
• Mixture Experiments:
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-4
0 ≤ xi ≤ 1, i = 1, 2, ..., q
q
∑ xi = x1 + x 2 + . . . + x q = 1. 0
i=1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-5
Other Constraints
Single Factor:
a < xi < b
Multifactor:
a < ( xi + x j ) < b
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-6
Triangular Region: x1 + x2 + x3 = 1
(0,0,1)
SIMPLEX
Coordinate System
x3
(0,1,0)
(0,0,0) x2
(1,0,0)
x1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-7
Design Spaces
100% x4
100% x1 100% x2
50% x4
50% x3
100% x3
100% x1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-8
x3 = 1
70% x3
x2 = 1 x1=0.5 x1 = 1
x2=0.5
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-9
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-10
Objectives
• To study the effects of the ingredients on
some measurable response in an attempt
to find the blend or blends that produce
the optimal response or a set of optimal
responses
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-11
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-12
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-13
Simplex-Lattice Design
1 2
xi = 0, , ,...,1
m m
q m # of points model
3 1 3 linear
3 2 6 quadratic
3 3 7 spec. cubic
3 3 10 cubic
4 1 4 linear
4 2 10 quadratic
4 3 14 spec. cubic
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-14
x1=1
(1,0,0)
(1/2,1/2,0) (1/2,0,1/2)
(0,1,0) (0,0,1)
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-15
Simplex Lattices
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-16
q
∑ x i = x1 + x 2 + . . . + x q = 1. 0
i=1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-17
y = β0 + β1 x1 + β2 x 2
Since (x1 + x2)=1 we can replace
bo with bo(x1 + x2)
y = ( β0 + β1 ) x1 + ( β0 + β2 ) x2
= β1′x1 + β2′x2
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-18
y = ( β0 + β1 + β11 ) x1 + ( β0 + β2 + β22 ) x 2
+ ( β12 − β11 − β22 ) x1 x 2
= β1′x1 + β2′x 2 + β12′ x1 x 2
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-19
Mixture Models
q q q q
y = ∑βi xi + ∑∑βij xi x j + ∑∑δij xi x j (xi − x j ) + ∑∑∑βijk xi x j xk
i=1 i <j i <j i < j <k
q q q
y= ∑βx +∑∑β x x +∑∑∑β
i =1
i i
i <j
ij i j
i <j <k
ijk xi x j xk
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-20
INTERPRETATION OF
COEFFICIENTS
0.25β12
y = β1x1 + β2x2
y=β1
x1=1, x2=0 x1=0.5, x2=0.5 x1=0, x2=1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-21
# of Degree of Polynomial
factors, q Linear Quad S.Cubic F.cubic
2 2 3 - -
3 3 6 7 10
4 4 10 14 20
5 5 15 25 35
6 6 21 41 56
. . . . .
. . . . .
q q q(q+1)/2 (q2+5)/6 (q+1)(q+2)/6
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-22
• Response of interest is
Taste of the punch (y)
Quantified on a scale of 1-25:
1 = not tasty
25 = very tasty
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-23
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-24
3-COMPONENT JUICE
BLEND
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-25
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-26
ANALYSIS
Subset Models
Project : juice blend
Analysis : Mixture Model 1
3/18/2004 3:35:50 PM
Results
Project : juice blend
Analysis : Mixture Model 1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-27
Total 260.9167 11
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-28
Shapiro-Wilks W : 0.9514
Significance level: 0.7807
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-29
Model Diagnostic
Project Name : juice blend
Plot Name : Residual Plot for Taste(Y1) [Mixture Model 1]
3/18/2004 3:42:04 PM
2
Standardized Residuals
-1
-2
-3
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Observed Response
26
24
Observed Response
22
20
18
16
14
12
10
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
Predicted Response
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-30
24
Grapefruit
22
Pineapple
20
Taste
18
16
14
12
Orange
10
-0.4 -0.3 -0.2 -0.1 -0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7
Deviation from Reference Blend
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-31
Design Variable
Exp. X1 X2 X3
1 1 0 0
2 0 1 0
3 0 0 1
4 1/2 1/2 0
5 1/2 0 1/2
6 0 1/2 1/2
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-32
xi ≥ 0, ∑xi =1
i =1
0 ≤ Li ≤ xi , i = 1, 2 , ..., q
USE PSEUDOCOMPONENTS
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-33
Pseudocomponents
L = ∑ Li < 1
Let, i =1
x i − Li
zi =
Then, 1− L
or conversely, x i = (1 − L ) z i + Li
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-34
Example of Pseudocomponents
L 1 = 0 . 30 ≤ x 1
L 2 = 0 . 25 ≤ x 2
L 3 = 0 . 02 ≤ x 3
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-35
x1 = 1
x3 > 0.02
x1=0.73
x2=0.25
x3=0.02 z1 = 1
x1=0.30
x2=0.25
x3=0.45
x1 > 0.30
x1=0.30 z2 = 1 z3 = 1
x2=0.68
x3=0.02
x2 = 1 x3 = 1
x2 > 0.25
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-36
A {3,2} Simplex-Lattice in
Pseudocomponents
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-37
0 ≤ Li ≤ x i ≤ U i ≤ 1
Where, the Ui’s are the Upper bounds
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-38
U2 = 1.0-(L1 + L2)
= 1.0-(0.4+0.1)
= 0.5
0 . 40 ≤ x 1 ≤ 0 . 80
0 . 10 ≤ x 2 ≤ 0 . 50
0 . 10 ≤ x 3 ≤ 0 . 30
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-39
Design Space
Special
Quadratic Cubic
Design Design
watermelon pineapple orange watermelon pineapple orange
0.55 0.25 0.20 0.55 0.25 0.20
0.80 0.10 0.10 0.80 0.10 0.10
0.80 0.10 0.10 0.40 0.50 0.10
0.40 0.50 0.10 0.60 0.10 0.30
0.60 0.10 0.30 0.40 0.30 0.30
0.40 0.30 0.30 0.40 0.40 0.20
0.40 0.40 0.20 0.70 0.10 0.20
0.70 0.10 0.20 0.60 0.30 0.10
0.60 0.30 0.10 0.50 0.20 0.30
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-40
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-41
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-42
MIXTURE
FACTORIAL
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-43
y ( z ) = β0 + β1 z1 + β2 z 2 + β12 z1 z 2 + ε
Second degree model in the mixture variables x1,
x2, and x3
y ( x ) = α1 x1 + α2 x2 + α3 x3
+ α12 x1 x2 + α13 x1 x3 + α23 x2 x3 + ε
Combined model is the cross product (24 terms)
y( x, z ) = y( x ) • y( z )
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-44
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-45
Independent Variables:
Mixture Variables
Mixture Total : 1
Lower Upper
Variable Label
Bound Bound
Plasticizer
X1 0 1
1
Plasticizer
X2 0 1
2
Plasticizer
X3 0 1
3
X5 Drying Temperature -1 1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-46
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-47
0 1 0 -1 1 0.2814
0 0 1 1 1 -2.3993
Total 201.7165 37
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-48
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-49
3
Observed Response
-1
-2
-3
-4
-4 -3 -2 -1 0 1 2 3 4 5
Predicted Response
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-50
Plasticizer 2 0.4055
Plasticizer 1 0.0908
Plasticizer 3 -0.0083
-0.5 -0.4 -0.3 -0.2 -0.1 -0.0 0.1 0.2 0.3 0.4
Standardized Estimates
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-51
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-52
Screening Design
y$ ( X ) = b1 x1 + b2 x 2 +...+ bq x q
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-53
RAYS
x1 = 2x3
x2 = 1 x3 = 1
x2 = x3
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-54
EXAMPLE:
Project : Octane-Blending Experiment
Independent Variables:
Mixture Variables
Mixture Total : 1
Lower Upper
Variable Label
Bound Bound
X1 Straightrun 0 0.21
X2 Reformate 0 0.62
X3 Naptha 0 0.12
X5 Polymer 0 0.12
X6 Alkylate 0 0.74
Design Properties:
Mixture Design Strategy : Mixture-Screening
Number of points needed to estimate the model : 7
Overall Centroid Points : 1
Number of repeats : 1
Total number of points : 9
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-55
Number of
# Type Used
points
1 Vertices + 112
DATA SHEET
Reformat Cat. Nat Motor
Straightrun e Naptha Naptha Polymer Alkylate Gasoline Octane
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-56
0 0.62 0 0.38 0 0 0
0.0500000
0.20999999 0 0.12 0.62 2 0 0
0.6700000
0.20999999 0 0 0 0.12 2 0
0.0900000
0.20999999 0.62 0 2 0 0 0.08
0.0599999
0 0.62 0.12 0 0.12 9 0.08
0.1799999
0 0 0 0.62 0.12 9 0.08
0.7400000
0 0 0.12 0.06 0 1 0.08
0.2508000 0.2508000 0.3355999
0 1 0.0612 1 0.0612 9 0.0404
0.0624074 0.3101851 0.0624074 0.4046296 0.0416666
0.1187037 0 1 9 1 5 7
0.2453703 0.2453703 0.0620370 0.2950000
0.11092592 7 0 7 4 2 0.0412963
0.3101851 0.0624074 0.0624074 0.4046296 0.0416666
0.1187037 9 1 0 1 5 7
0.2453703 0.0620370 0.2453703 0.2950000
0.11092592 7 4 7 0 2 0.0412963
0.3453846 0.0653846 0.3453846 0.0653846 0.0438461
0.13461538 3 1 3 1 0 5
0.2409259 0.0605555 0.2409259 0.0605555 0.2890740
0.10796296 2 5 2 5 6 0
0.2352083 0.2352083 0.1806250 0.0352083
0.20999999 3 0.051875 3 0.051875 1 3
0.0570588 0.0623529 0.0570588 0.0623529
0.10294117 0.62 2 4 2 4 0.0382353
0.2273076 0.2273076 0.0561538 0.2400000 0.0376923
0.09153846 9 0.12 9 5 1 1
0.0623529 0.0570588 0.0570588 0.0623529
0.10294117 4 2 0.62 2 4 0.0382353
0.2273076 0.0561538 0.2273076 0.2400000 0.0376923
0.09153846 9 5 9 0.12 1 1
0.0510714 0.0510714 0.7400000 0.0339285
0.05892857 0.0325 3 0.0325 3 1 7
0.2319230 0.0576923 0.2319230 0.0576923 0.2461538
0.09461538 7 1 7 1 5 0.08
0.2349107 0.0591964 0.2349107 0.0591964 0.0395535
0.10535714 1 3 1 3 0.266875 7
0.7400000
0 0 0.12 0.14 0 1 0
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-57
Models:
q q
y = ∑ β i x i + ∑ β − i x i−1
i =1 i =1
q q q
y = ∑ β i x i + ∑ ∑ β ij x i x j + ∑ β − i x i− 1
i =1 i <j i =1
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-58
Independent Variables:
Mixture Variables
Mixture Total : 1
Lower Upper
Variable Label
Bound Bound
X1 X1 0.01 0.648
X2 X2 0 0.954
X3 X3 0.036 0.924
Responses Models:
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-59
X1 X2 X3 Octane Number
0.278 0.267 0.454 83.45593658
0.648 0 0.352 89.35029034
0.01 0.954 0.036 126.8244268
0.648 0.316 0.036 99.16556145
0.076 0 0.924 60.35789469
0.01 0.066 0.924 86.21433551
0.01 0.51 0.48 110.2195226
0.648 0.032 0.32 90.99558018
0.074 0.801 0.125 102.1432641
0.52 0.355 0.125 99.97015188
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-60
X2 1.9799
X1 1.794
Model Terms
X3 0.9299
1/X1 0.809
1/X3 -0.1682
-0.2 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0
Standardized Estimates
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-61
USE OF RATIOS:
x1 x2
w1 = , w2 = , x1 + x 2 + x3 = 1
x2 x3
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-62
Model Reduction
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-63
80LIFE
70
60
^ _ _ ^
50
(y - y) = (y - y) + (y - y)
tot err pe lof
40 TEMP
0 10 20 30 40 50 60 70 80 90 100
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-64
R-SQUARE
INTERPRETATION
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-65
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.
Mixture Design Mix-66
REFERENCES
_________________________________
Computational Modeling Center, 2005
©Air Products and Chemicals, Inc.