Академический Документы
Профессиональный Документы
Культура Документы
What is regression?
refers to the method of predicting the value of one variable on the basis of the known value/s of other variable/s done by establishing a simplified statement, or a model, of the relationship between a variable and another variable or set of variables
Example of a Statement
Y = + 1X1 + 2X2 + 3X3 + This statement is called a regression equation This implies that the value of Y, except for a random error, is determined by the values of the variables X1, X2 and X3. When the observed variations in the values of Y can largely be accounted for by the values of X1, X2 and X3 then the statement is acceptable.
A Graphical Presentation
Y
Income
X1
X2
X3
X4
X5
X6
X7
Education Fig. 1. General form of regression of Y on X or the path of the means of Y values for fixed values of X
Least squares procedure - method for finding the straight line that minimizes the squared deviations of points around it
A Graphical Presentation
Y
Fig. 1. Least-squares equation minimizing sum of squares of vertical distances and estimating the regression of Y on X
Components of deviations
SST = sum of squared deviations between the mean and the observed values (d.f. = n -1) SSR = sum of squared deviations between the observed values and the corresponding fitted values (d.f. = p - 1, or the number of explanatory variables) SSE = unexplained deviations; = SST - SSR with d.f. = n - p
F =
MSR MSE
where, MSR = SSR/(p - 1) MSE = SSE/(n - p) Thus, F is a measure of the relative size of the variations accounted for by the fitted regression line to the unexplained remaining variations.
Partial Fi - measure of the significance of the contribution of the regression coefficient bi , given that the other variables are included in the model.
Stepwise Regression
Used when there is no definite model to be tested, or when from a set of variables a subset is to be identified as the explanatory variables
Variables are entered into (or step in) or removed from (or step out) the model one at a time
The Model: Y = a + b1X1 + b2X2 + b3X3 + b4X4 If X4 is a categorical variable, the adjusted model is: Y = a + b1X1 + b2X2 + b3X3 + b4DX41 + b42DX42 + b43DX43