Temperature ( |
Pressure Ratio ( |
Purity |
---|---|---|
-225 | 1.1 | 82.8 |
-225 | 1.3 | 83.5 |
-215 | 1.1 | 84.7 |
-215 | 1.3 | 85.0 |
-220 | 1.2 | 84.1 |
-220 | 1.2 | 84.5 |
-220 | 1.2 | 83.9 |
-220 | 1.2 | 84.3 |
Chapter 11
(AST301) Design and Analysis of Experiments II
11 Response Surface Methods and Designs
11.1 Introduction
Factorial designs are very useful for factor screening - that is, identifying the most important factors that affect the performance of a process. Sometimes this is called process characterization.
Once the appropriate subset of process variables is identified, the next step is usually process optimization, or finding the set of operating conditions for the process variables that result in the best process performance.
Response surface methodology, or RSM, is a collection of mathematical and statistical techniques useful for the modeling and analysis of problems in which a response of interest is influenced by several variables and the objective is to optimize this response.
For example, suppose that a chemical engineer wishes to find the levels of temperature
The process yield is a function of the levels of temperature and pressure, say
We usually represent the response surface graphically, where
To help visualize the shape of a response surface, we often plot the contours of the response surface.
In the contour plot, lines of constant response are drawn in the
Figure: A three-dimensional response surface showing the expected yield of temperature
As an example, consider an investigation into the effect of the concentration of the reactant and the amount of the catalyst on the conversion (yield) in a chemical process.
The objective of the experiment is to determine if adjustments to either of these two factors would increase the yield.
Let the reactant concentration be factor
The experiment is replicated three times, so there are 12 runs. The order in which the runs are made is random, so this is a completely randomized experiment.
On the basis of the
The Response Surface
The regression model
Because the model is first-order (that is, it contains only the main effects), the fitted response surface is a plane.
From examining the contour plot, we see that yield increases as reactant concentration increases and catalyst amount decreases.
Often, we use a fitted surface such as this to find a direction of potential improvement for a process.
RSM
In most RSM problems, the form of the relationship between the response and the independent variables is unknown.
Thus, the first step in RSM is to find a suitable approximation for the true functional relationship between
Usually, a low-order polynomial in some region of the independent variables is employed.
If the response is well modeled by a linear function of the independent variables, then the approximating function is the first-order model
If there is curvature in the system, then a polynomial of higher degree must be used, such as the second-order model
Almost all RSM problems use one or both of these models.
The method of least squares is used to estimate the parameters in the approximating polynomials.
RSM is a sequential procedure.
Often, when we are at a point on the response surface that is remote from the optimum, the objective is to lead the experimenter rapidly and efficiently along a path of improvement toward the general vicinity of the optimum.
Once the region of the optimum has been found, a more elaborate model, such as the second-order model, may be employed, and an analysis may be performed to locate the optimum.
Summary: (Steps of RSM)
- Find a suitable approximation for
using LS {maybe a low-order polynomial} - Move towards the region of the optimum
- When curvature is found find a new approximation for
{generally a higher order polynomial} and perform the “Response Surface Analysis”
The eventual objective of RSM is to determine the optimum operating conditions for the system or to determine a region of the factor space in which operating requirements are satisfied.
11.2 Method of Steepest Ascent
(A procedure for moving sequentially from an initial “guess” towards to region of the optimum)
Frequently, the initial estimate of the optimum operating conditions for the system will be far from the actual optimum.
In such circumstances, the objective of the experimenter is to move rapidly to the general vicinity of the optimum.
When we are remote from the optimum, we usually assume that a first-order model is an adequate approximation to the true surface in a small region of the
The method of steepest ascent is a procedure for moving sequentially in the direction of the maximum increase in the response.
If minimization is desired, then we call this technique the method of steepest descent.
The fitted first-order model is
The direction of steepest ascent is the direction in which
We usually take as the path of steepest ascent the line through the center of the region of interest and normal to the fitted surface.
Thus, the steps along the path are proportional to the regression coefficients.
The actual step size is determined by the experimenter based on process knowledge or other practical considerations.
Experiments are conducted along the path of steepest ascent until no further increase in response is observed.
Then a new first-order model may be fit, a new path of steepest ascent determined, and the procedure continued.
Eventually, the experimenter will arrive in the vicinity of the optimum. This is usually indicated by lack of fit of a first-order model. At that time, additional experiments are conducted to obtain a more precise estimate of the optimum.
Example 11.1
A chemical engineer is interested in determining the operating conditions that maximize the yield of a process. Two controllable variables influence process yield: reaction time and reaction temperature. The engineer is currently operating the process with a reaction time of 35 minutes and a temperature of
The engineer decides that the region of exploration for fitting the first-order model should be
Note that the design used to collect these data is a
Replicates at the center are used to estimate the experimental error and to allow for checking the adequacy of the first-order model. Also, the design is centered about the current operating conditions for the process.
A first-order model may be fit to these data by least squares. Employing the methods for two-level designs, we obtain the following model in the coded variables
Before exploring along the path of steepest ascent, the adequacy of the first-order model should be investigated.
The
- Obtain an estimate of error.
- Check for interactions (cross-product terms) in the model.
- Check for quadratic effects (curvature) (See Section 11.2.3 for details).
The replicates at the center can be used to calculate an estimate of error as follows:
- we have no reason to question the adequacy of the first-order model
To move away from the design center-the point
along the path of steepest ascent, we would move 0.775 units in the direction for every 0.325 units in the direction.Thus, the path of steepest ascent passes through the point
and has a slope .The engineer decides to use 5 minutes of reaction time as the basic step size. Using the relationship between
and , we see that 5 minutes of reaction time is equivalent to a step in the coded variable of . Therefore, the steps along the path of steepest ascent are and .
- The engineer computes points along this path and observes the yields at these points until a decrease in response is noted
A new first-order model is fit around the point
Once again, a
The first-order model fit to the coded variables in Table 11.4 is
The analysis of variance for this model, including the interaction and pure quadratic term checks, is shown in Table 11.5.
The interaction and pure quadratic checks imply that the first-order model is not an adequate approximation.
This curvature in the true surface may indicate that we are near the optimum.
At this point, additional analysis must be done to locate the optimum more precisely.
Step-by-step procedure
Choose a step size in one of the process variables, say
. Usually, we would select the variable we know the most about, or we would select the variable that has the largest absolute regression coefficient .The step size in the other variables is
Convert the
from coded variables to the natural variables.
Test for Curvature
By using two-level factorial and fractional factorial designs (augmented with center points), the experimenter is able to
- Obtain an estimate of pure error.
- The replicated center points can be used to calculate an estimate of the pure error
where is the sample variance of the center point responses.
- Overall check for interaction effects in the model.
- The sum of squares for testing for interaction is found by accumulating sum of squares associated with all two-factor interaction terms which can be estimated.
- Overall check for quadratic effects (curvature).
If there is no curvature present, then the average response corresponding to the factorial points (say
) should be similar to the average response corresponding to the center points (say ).Thus,
is an measure of the overall curvature in the surface over that design space.
If the difference
is small, then there is no quadratic curvature.On the other hand, if
is large, then quadratic curvature is present. A single-degree-of-freedom sum of squares for pure quadratic curvature is given by where, in general, is the number of factorial design points and is the number of centre point.
- When points are added to the center of the
design, the test for curvature actually tests the hypotheses
Further reading: https://online.stat.psu.edu/stat503/lesson/11
Exercises
11.1. A chemical plant produces oxygen by liquefying air and separating it into its component gases by fractional distillation. The purity of the oxygen is a function of the main condenser temperature and the pressure ratio between the upper and lower columns. Current operating conditions are temperature
11.4. For the first-order model
11.5. The region of experimentation for three factors are time (
Is the point