Академический Документы
Профессиональный Документы
Культура Документы
Artin Armagan
Armagan
Simple Linear Regression Analysis
Multiple Linear Regression
Outline
1 Simple Linear Regression Analysis
Using simple regression to describe a linear relationship
Inferences From a Simple Regression Analysis
Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
2 Multiple Linear Regression
Using Multiple Linear Regression to Explain a
Relationship
Inferences From a Multiple Regression Analysis
Assessing the Fit of the Regression Line
Comparing Two Regression Models
Multicollinearity
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Example
● ●
12
12
●
●
10
10
● ●
8
8
y
y
●
6
6
4
4
● ●
2
1 2 3 4 5 6 1 2 3 4 5 6
x x
y = 1 + 2x ŷ = −0.2 + 2.2x
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
●
55000
●
●
●
●
●
45000
COST
35000
● ●
●
●
●
25000
●
●
10 20 30 40 50 60 70
NUMPORTS
ˆ Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
300000
●
●
200000
●
VALUE
100000
● ●
●
●
●
50000
●● ●
●
●
●● ● ●● ●●●● ●
●● ●
●
● ●●● ●
● ●
●● ●● ●●
● ●
● ● ●
●
●● ●●●
●● ●●
● ●
● ● ●●● ●
●● ●
● ●●
● ●
●●●
●●●●
● ● ● ● ●
● ●●
SIZE
ˆ Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Assumptions
It is assumed that there exists a linear deterministic
relationship between x and the mean of y , µy |x :
µy |x = β0 + β1 x
Since the actual observations deviate from this line, we
need to add a noise term giving
yi = β0 + β1 xi + ei .
The expected value of this error term is zero: E(ei ) = 0.
The variance of each ei is equal to σe2 . This assumptions
suggests a constant variance along the regression line.
The ei are normally distributed.
The ei are independent.
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Sampling Distribution of b0
E(b0 ) = β0
1 x̄ 2
Var (b0 ) = σe2 n + Pn 2 2
i=1 i −x̄ )
(x
The sampling distribution of b0 is normal.
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Sampling Distribution of b1
E(b1 ) = β1
σe2
Var (b0 ) = Pn 2 2
i=1 i −x̄ )
(x
The sampling distribution of b1 is normal.
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Properties of b0 and b1
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Estimating σe2
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
H0 : β1 = (≥, ≤)β1∗
Ha : β1 6= (<, >)β1∗
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
b1 −β1∗
To test this hypothesis, a t statistic is used, t = sb1 .
A significance level, α, is specified to decide whether or not
reject the null hypothesis.
Possible alternative hypotheses and corresponding
decision rules are
Alternative Decision Rule
Ha : β1 6= β1∗ Reject H0 if |t| > tα/2,n−2
Ha : β1 < β1∗ Reject H0 if t < −tα,n−2
Ha : β1 > β1∗ Reject H0 if t > tα,n−2
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
ANOVA
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
SSR
R2 =
SST
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
For simple
√ linear regression the correlation coefficient is
r = ± R2.
This does not apply to multiple linear regression.
If the sign of r is positive, then the relationship between the
variables is direct, otherwise is inverse.
r ranges between −1 and 1.
A correlation of 0 merely implies no linear relationship.
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
The F Statistic
MSR
F =
MSE
where MSR = SSR/1 and MSE = SSE/(n − 2).
The degrees of freedom corresponding to SSR and SSE
add up to the total degrees of freedom, n − 1.
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
The F Statistic
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using simple regression to describe a linear relationship
Simple Linear Regression Analysis Inferences From a Simple Regression Analysis
Multiple Linear Regression Assessing the Fit of the Regression Line
Prediction with a Sample Linear Regression Equation
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Formulation
y = b0 + b1 x1 + b2 x2 + b3 x3 + ... + bp xp
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Meddicorp Sales
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Meddicorp Sales
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Assumptions
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
R 2 and Radj
2
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
R 2 and Radj
2
2 SSE/(n − p − 1)
Radj =1−
SST /(n − 1)
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
F Statistic
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
F Statistic
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
F Statistic
H0 : βl+1 = ... = βp = 0
Ha : At least one of βl+1 , ..., βp is not equal to zero.
This implies, under the reduced model we have
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Meddicorp
H0 : β3 = ... = β4 = 0
Ha : At least one of β3 , β4 is not equal to zero.
(181176−175855)/2
F = 175855/20 = 0.303
3.49 is the 5% F critical value with 2 numerator and 20
denominator degrees of freedom. Thus we accept H0 .
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Detecting multicollinearity
Pairwise correlations.
Large F , small t.
Variance inflation factor (VIF).
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan
Using Multiple Linear Regression to Explain a Relationship
Inferences From a Multiple Regression Analysis
Simple Linear Regression Analysis
Assessing the Fit of the Regression Line
Multiple Linear Regression
Comparing Two Regression Models
Multicollinearity
Armagan