Академический Документы
Профессиональный Документы
Культура Документы
Analysis
Fall 2011
• Why
Wh use OLS?
• Relatively easy to use
• The goal of minimizing RSS is intuitively /
theoretically appealing
• This basically says we want the estimated regression
equation to be as close as possible to the observed data
• OLS estimates have a number of useful
characteristics
(2.5)
• Dependent variable:
• FINAIDi: financial aid (measured in dollars of
grant) awarded to the ith applicant
• Theoretical Model:
(2.9)
(2.10)
where:
h
– PARENTi: The amount (in dollars) that the parents of the ith
student are judged able to contribute to college expenses
– HSRANKi: The ith student’s GPA rank in high school, measured
as a percentage (i.e. between 0 and 100)
(2.11)
• (2.12)
• (2 13)
(2.13)
(2 14)
(2.14)
(2.15)
(3.2)
(3 3)
(3.3)
• Background:
• You have been hired to determine the best location
for the next Woody’s restaurant (a moderately priced,
24 ho r family
24-hour, famil resta
restaurant
rant chain)
• Objective:
j
• How to decide location using the six basic steps of
applied
app ed regression
eg ess o a analysis,
a ys s, discussed
d scussed ea
earlier?
e
(3.4)
• You take the data set and enter it into the computer
• You then run an OLS regression (after thinking the model over one
last time!)
• The resulting model is:
(3.5)
• The classical assumptions must be met in order for OLS estimators to be the
best available
• The seven classical assumptions are:
I. The regression model is linear, is correctly specified, and has an
additive error term
II The error term has a zero population mean
II.
III. All explanatory variables are uncorrelated with the error term
IV. Observations of the error term are uncorrelated with each other
(no serial correlation)
V. The error term has a constant variance (no heteroskedasticity)
VI. No explanatory variable is a perfect linear function of any other
explanatory variable(s) (no perfect multicollinearity)
VII. The error term is normally distributed (this assumption is optional
but usually is invoked)
• Example:
– Including both annual sales (in dollars) and the annual sales tax
paid in a regression at the level of an individual store, all in the
same city
– Since the stores are all in the same city, there is no variation in the
percentage sales tax
Hypothesis Testing
(5.2)
H0: βk ≥ 0 H0: βk ≥ S
HA: βk < 0 HA: βk < S
• As stated in Section 5.2, the decision rule for the t-test is to:
Reject H0 if |tk| > tc and if tk also has the sign implied by HA
• In this example, this amounts to the following three
conditions:
For β1: Reject
j H0 if ||2.1|| > 1.943 and if 2.1 is p
positive.
For β2: Reject H0 if |5.6| > 1.943 and if 5.6 is positive.
For β3: Reject H0 iff |–0.1|
| | > 1.943 and iff –0.1 is positive.
• Figure 5.6 illustrates all three of these outcomes
We can test for the predictive power of the entire model using
the F statistic
• Generally these compare two sources of variation
• F = V1/V2 and
dhhas ttwo df parameters
t
• Here V1 = ESS/K has K df
• And V2 = RSS/(n-K-1) has n-K-1 df
Numerator d.f.
denom.
d.f. Value of F at a
specific significance level
F Test Hypotheses
H 0 : R2 = 0 is an equivalent hypothesis
Reject H0 if F≥Fc
Do Not Reject
j H0 if F<Fc
• From E-Views
E Views output
output, F=15
F=15.65,
65 Fc(0.05;3,29)=2.93
Fc(0 05;3 29)=2 93
• Fc is well below the calculated F-value of 15.65, so we can reject
the null hypothesis
yp and conclude that the Woody's
y equation
q does
indeed have a significance of overall fit.
Key Terms from Chapter 5
• What happens if we estimate Equation 6.2 when Equation 6.1 is the truth?
• We get bias!
• What this means is that:
(6.4)
• The amount of bias is a function of the impact of the omitted variable on the
dependent variable times a function of the correlation between the included
and the omitted variable
• Or,
Or more formally:
(6.7)
• So, the bias exists unless:
1. the true coefficient equals zero, or
2. the included and omitted variables are uncorrelated
• What if:
– You have an unexpected result
result, which leads you to believe that you have
an omitted variable
– You have two or more theoretically sound explanatory variables as
potential
t ti l “candidates”
“ did t ” ffor iinclusion
l i as th
the omitted
itt d variable
i bl tto th
the equation
ti iis
to use
• How do you choose between these variables?
• One possibility is expected bias analysis
– Expected bias: the likely bias that omitting a particular variable would have
caused in the estimated coefficient of one of the included variables
• Omitted variable
• Irrelevant variable
• Specification bias
• Sequential specification search
• Specification
p error
• The four specification criteria
• Expected bias
• Sensitivity analysis
• Here, the natural log of Y is the dependent variable and the natural log
of X is the independent variable:
(7.5)
• IIn a double-log
d bl l equation,ti an iindividual
di id l regression
i coefficient
ffi i t can b
be
interpreted as an elasticity because:
(7 6)
(7.6)
• Note that the elasticities of the model are constant and the slopes are
nott
• This is in contrast to the linear model, in which the slopes are
constant but the elasticities are not
(7 11)
(7.11)
Yt = β0 + β1X1t-1
1t 1 + β2X2t + εt (7 16)
(7.16)
(7.18)
• If functional forms are similar, and if theory does not specify exactly which form
to use, there are at least two reasons why we should avoid using goodness of fit
over the
th sample l tto determine
d t i which
hi h equation
ti tto use:
1. Fits are difficult to compare if the dependent variable is transformed
2
2. An incorrect function form may provide a reasonable fit within the sample but
have the potential to make large forecast errors when used outside the range of
the sample
• The first of these is essentially due to the fact that when the dependent variable
is transformed, the total sum of squares (TSS) changes as well
• The second is essentially die to the fact that using an incorrect functional
amounts to a specification error similar to the omitted variables bias discussed in
Section 6.1
• Thiss seco
second
d case is
s illustrated
us a ed in Figure
gu e 7.8
8
Multicollinearity
(8 16)
(8.16)
• Perfect multicollinearity
• Severe imperfect multicollinearity
• Dominant variable
• Auxiliary (or secondary) equation
• Variance inflation factor
• Redundant variable
Serial Correlation
(9 10)
(9.10)
if d < dL Reject H0
if d > dU Do not reject H0
if dL ≤ d ≤ dU Inconclusive
• In rare circumstances, perhaps first differenced
equations, a two-sided d test might be appropriate
• In such a case
case, steps 1 and 2 are still used
used, but step 3 is now:
if d < dL Reject H0
if d > 4 – dL Reject H0
if 4 – dU > d > dU Do Not Reject H0
Otherwise Inconclusive
Figure 9.6 gives an example of a one-sided Durbin Watson d test
(9.20)
Heteroskedasticity
(10.9)
• Impure heteroskedasticity
• Pure heteroskedasticity
• Proportionality factor Z
• The Park test
• The White test
• Heteroskedasticity-corrected standard errors
• Places to look:
– your textbooks and notes from previous economics classes
– economics journals
• For example, Table 11.1 contains a list of the journals cited so far in this
textbook (in order of the frequency of citation)
• Before any quantitative analysis can be done, the data must be:
– collected
– organized
– entered into a computer
• Usually, this is a time-consuming and frustrating task because of:
– the difficulty of finding data
– the existence of definitional differences between theoretical variables
and their empirical counterparts
– and the high probability of data entry errors or data transmission errors
• But time spent thinking about and collecting the data is well spent, since a
researcher who knows the data sources and definitions is much less likely
to make mistakes using or interpreting regressions run on that data
• We will now discuss three data collection issues in a bit more detail
© 2011 Pearson Addison-Wesley. All rights reserved. 1-٢٨٨
What Data to Look For
• Checking for data availability means deciding what specific variables you
want to study:
y
– dependent variable
– all relevant independent variables
• At least 5 issues to consider here:
1. Time periods:
– If the dependent variable is measured annually, the explanatory variables
should also be measured annually and not, say, monthly
2. Measuring
gqquantity:
y
– If the market and/or quality of a given variable has changed over time, it makes
little sense to use quantity in units
– Example: TVs have changed so much over time that it makes more sense to use
quantity in terms of monetary equivalent: more comparable across time
© 2011 Pearson Addison-Wesley. All rights reserved. 1-٢٨٩
What Data to Look For (cont.)
(cont )
Time-Series
Time Series Models
2 Dynamic
2. D i models:
d l
– Now serial correlation causes bias in the coefficients produced by OLS
• Compounding all this this is the fact that the consequences, detection, and
remedies for serial correlation that we discussed in Chapter 9 are all either
incorrect or need to be modified in the presence of a lagged dependent
variable
• We will now discuss the issues of testing and correcting for serial correlation
in dynamic models in a bit more detail
© 2011 Pearson Addison-Wesley. All rights reserved. 1-٣٢٩
Testing for Serial Correlation
in Dynamic Models
3. Estimate Equation 12.18 using OLS and then test the null hypothesis
that a3 = 0 with the following
g test statistic:
LM = N*R2 (12.19)
where: N = the sample
p size
R2 is the unadjusted coefficient of determination
both of the auxiliary equation, Equation 12.18
For large samples, LM has a chi-square distribution with degrees
of freedom equal to the number of restrictions in the null hypothesis (in
this case, one).
If LM is greater than the critical chi-square value from Statistical
Table B-8, then we reject the null hypothesis that a3 = 0 and conclude
that
h there
h is
i indeed
i d d serial
i l correlation
l i i the
in h original
i i l equation
i
Yt = γYt–1 + vt (12.22)
where vt is a classical error term
• Can you see that if | γ | < 1, then the expected value of Yt will eventually
approachh 0 (and
( d th
therefore
f be
b stationary)
t ti ) as th
the sample
l size
i gets t bi
bigger and
d
bigger? (Remember, since vt is a classical error term, its expected value = 0)
• Similarly,
y, can you
y see that if | γ | > 1,, then the expected
p value of Yt will
continuously increase, making Yt nonstationary?
• This is nonstationarity due to a trend, but it still can cause spurious
regression results
• Dynamic model
• Ad hoc distributed lag model
• Lagrange Multiplier Serial Correlation test
• Granger causality
• Nonstationary series
• Dickey
Dickey–Fuller
Fuller test
• Unit root
• Random walk
• Cointegration
(13.7)
Simultaneous Equations
• STAGE ONE:
– R
Run OLS on theth reduced-form
d df equations
ti f each
for h off the
th
endogenous variables that appear as explanatory variables in the
structural equations in the system
– That is, estimate (using OLS):
(14.18)
(14.19)
• STAGE TWO:
– S
Substitute
b tit t the
th Ys
Y from
f the
th reduced
d d form
f for
f the
th Ys
Y that
th t appear on
the right side (only) of the structural equations, and then estimate
these revised structural equations with OLS
– That is, estimate (using OLS):
(14.20)
(14.21)
• Endogenous variable
• Predetermined variable
• Structural equation
• Reduced-form equation
• Simultaneity
y bias
• Two-Stage Least Squares
• Identification
• Order condition for identification
Forecasting
(15.2)
• Figure 15.1
15 1 illustrates two examples
where the θs and the φφs are the coefficients of the autoregressive
g and
moving-average processes, respectively, and p and q are the number
of past values used of Y and ε, respectively
• Unconditional forecast
• Conditional forecast
• Leading indicator
• Confidence interval (of forecast)
• Autoregressive process
• Moving-average process
• ARIMA(p,d,q)
2. Unobservable Heterogeneity:
• In Equation 16.2, we added observable factors to the equation to avoid omitted
variable bias, but not all omitted factors in economics are observable
• This “unobservable omitted variable” problem is called unobserved
heterogeneity
2. Variables that change over time but are the same for all
individuals in a given time period:
• e.g., the retail price index and the national unemployment rate
• Note that ai, β2, D2i, and β0 are subtracted out because they’re in both
equations
q
• We’ve therefore shown that estimating panel data with the fixed effects
model does indeed drop the ai out of the equation
• Hence, the fixed effects model will not experience bias due to time-
invariant omitted variables!
• Example: The death penalty and the murder rate:
– Figures 16.2 and 16.3 illustrates the importance of the fixed-effects model:
the unlikely (positive) result from the cross-section model is reversed by
the fixed effects model!
• Treatment group
• Control group
g p
• Differences estimator
• Difference in differences
• Unobserved heterogeneity
• The Hawthorne effect
• Panel data
• The fixed effects model
• The random effects model
• Hausman test
Statistical Principles