Академический Документы
Профессиональный Документы
Культура Документы
By
Dr. D. ISRAEL
1
Objective
2
Requirements
Dependent variables should be intervally scaled.
Independent variables should also be intervally
scaled.
Sometimes, Nominal variables can also be used
as predictors.
3
Meaning of related Concepts
Multiple Regression Equation
A mathematical model that depicts the relationship between
independent variables and a dependent variable. It is of the form
y=a+b1x1+b2x2+….+bnxn+e where y = the value of dependent variable.
a=the intercept which is a constant and meaningless.
b1=the partial regression coefficient for the independent variable 1
b2=the partial regression coefficient for the independent variable 2
x1=the actual value of the independent variable
bn=the partial regression coefficient for the independent variable n.
e=the error term which is nothing but the residuals (the difference between
actual and predicted y values)
4
LEAST SQUARE ANALYSIS
•This is the process with which the
multiple regression analysis is based.
•This is aimed at producing a minimum
sum of squares of residuals.
5
STANDARD ERROR OF
ESTIMATE (SEE)
•This is the Standard Deviation of the
residuals of the regression model.
•It is a rule that at least 68% of the residuals
should be within 1 SEE
6
BETA COEFFICIENT ()
•This is an indicator of the relative importance of a
particular independent variable on the dependent variable.
•This is computed on the standardised scores of
independent variables.
•A variable with high Beta coefficient indicates that that
variable is the most important one in explaining the
dependent variable
•The significance of this data coefficient can be tested
through a ‘ t ’ test.
•It is also known as a Beta weight or standardised
regression coefficient.
7
Unstandardised Regression Coefficient
8
Multiple R
Nothing but simple correlation between y and Y.
It is also called a multiple correlation index.
Multiple R2
It is also known as Coefficient of multiple determination.
It shows the strength of association between independent
variables as a group are significantly related to the dependent
variable.
If the multiple R2 is insignificant, the entire regression model
will be useless.
9
The significance of multiple R2 is tested through F value.
R2 value can be increased by increasing the number of variables.
R2 in a multiple Regression will never go below than the highest
bivariate R2 of any individual independent variable with a
dependent variable.
R² will be larger when the multi collinearity is low.
If the independent variables are statistically independent then R²
will be the sum of bivariate R² of each independent variable
with a dependent variable.
R² is calculated as the proportion of explained variation to total
variation.
10
Adjusted R2
11
R2 Increment
Useful for estimating the predictive power of an
independent variable when it is added to the regression
equation, as compared to the one without such variable.
It is the only method for comparing the importance of
independent variables in two or more models.
12
Relationship between R2 and Adjusted R2
13
Multicollinearity
14
How to control Multicollinearity?
15
Tolerance :
16
Variance Influence Factor (VIF)
17
Taking significant difference between two R2s
[R22 – R12]
(K2 – K1)
F=
( 1 - R22 )
[n - K2 – 1]
Where
R22 = R2 of the second model
R12 = R2 of the first model
18
n = total sample size Contd…
K1 : Number of independent variables in the first model
19
DUMMY VARIABLE
20
STEPWISE REGRESSION
21
Questions ?
22
23