Академический Документы
Профессиональный Документы
Культура Документы
1 What is econometrics?
8 Summary
Econometrics is
[T]he study of the application of statistical methods to the analysis of
economic phenomena (Tintner, 1953).
[T]he art and science of using statistical methods for the measurement of
economic relations (Chow, 1985).
Theory may be ambiguous as to the eect of some policy change, and in any
case theory rarely tells us how large the eect might be.
Delivery via ten Lectures and ten Practical lectures (PLs): 37.5 hours.
We expect you to (1) complete the assigned reading, (2) attend the lectures
and participate actively in class discussions, (2) develop and practice
programming skills, and (3) participate constructively in group work.
Also attempt weekly problem set for independent study.
Your feedback on the course: via the Rep and questionnaire survey.
Assessment:
Cross-sectional data
Data are collected on individuals, families, rms, governments, or some other units
at a given point in time.
Examples
Estimate the CAPM for a stock using daily stock return and market index.
Panel data
The same cross-sectional units are followed over time.
Examples
Eects of some rm characteristics on debt maturity may exhibit time lag.
There are two variables, x and y, and we would like to study how y varies
with changes in x.
Ex: in the wage model, x is years of schooling and y is hourly wage.
y = β0 + β1 x + u.
n n
ûi2 = (yi − β̂0 − β̂1 xi )2 .
X X
Min S(β̂0 , β̂1 ) = (1)
i=1 i=1
First-order conditions: ∂S(β̂0 , β̂1 )/∂ β̂0 = 0 and ∂S(β̂0 , β̂1 )/∂ β̂1 = 0, or
n
X
2 (yi − β̂0 − β̂1 xi ) = 0. (2)
i=1
n
X
2 xi (yi − β̂0 − β̂1 xi ) = 0. (3)
i=1
n n n n
−1 −1 −1 −1
X X X X
n (yi − β̂0 − β̂1 xi ) = n yi − n β̂0 − n β̂1 xi = 0. (4)
i=1 i=1 i=1 i=1
Recall:
n n n
n −1 yi − n−1 β̂0 − n−1
X X X
β̂1 xi = ȳ − β̂0 − β̂1 x̄ = 0, (5)
i=1 i=1 i=1
which implies
β̂0 = ȳ − β̂1 x.
¯ (6)
n
X n
X
xi (yi − β̂0 − β̂1 xi ) = xi [yi − (ȳ − β̂1 x̄) − β̂1 xi ] = 0. (7)
i=1 i=1
n
" n
#
X X
xi (yi − ȳ ) = β̂1 xi (xi − x̄) .
i=1 i=1
Given two useful facts about the summation operator (see Appendix A.6):
n
X n
X n
X
xi (yi − ȳ ) = (xi − x̄)(yi − ȳ ), where (yi − ȳ ) = 0, (9)
i=1 i=1 i=1
n n n
(xi − x̄)2 ,
X X X
xi (xi − x̄) = where (xi − x̄) = 0. (10)
i=1 i=1 i=1
We obtain
Pn
i=1 (xi − x̄)(yi − ȳ ) Cov (xi , yi )
β̂1 = P n 2
= . (11)
i=1 (xi − x̄) Var (xi )
salary = β0 + β1 roe + u.
\ = 936.191 + 18.501roe.
salary
n
X
ûi = 0. (12)
i=1
û is the residual computed from the sample data, and is dierent from the
unobserved error u in y = β0 + β1 x + u .
Likewise, β̂0 and β̂1 are dierent from the parameter values, β0 and β1 .
2 The sample covariance (and therefore the sample correlation) between the
explanatory variable(s) and the residuals is always zero (see (3)):
n
X
xi ûi = 0. (13)
i=1
n
(yi − ȳ )2 ,
X
SST = (15)
i=1
n
(ŷi − ȳ )2 ,
X
SSE = (16)
i=1
n
ûi2 .
X
SSR = (17)
i=1
To measure how well the OLS regression line ts the data, we use:
SSE SSR
R2 = =1− . (19)
SST SST
The R -squared shows the fraction of the sample variation in y that is
explained by x.
By construction, 0 ≤ R 2 ≤ 1:
2
R =0 means no linear relationship, while R2 = 1 means a perfect linear
relationship (OLS provides a perfect t to the data).
As R2 increases, the yi are closer and closer to falling on the OLS regression
line.
\ = 936.191 + 18.501roe.
salary
n = 209, R 2 = 0.0132.
How do we interpret the R -squared in this case?
What is BLUE?
An estimator is BLUE if it is the best linear unbiased estimator.
Proof of eciency of OLS is more complicated but a full proof using matrix
notation is available in Appendix E.
Looked at three main data structures: cross-sectional, time series, and panel
data.
Introduced the simple linear regression model and derived the OLS estimator.
What about models with multiple regressors, i.e., multiple regression analysis?