Академический Документы
Профессиональный Документы
Культура Документы
1:
Data on U.S. gasoline consumption for the years 1953 to 2004 are given in Table F2.2. Note,
the consumption data appear as total expenditure. To obtain the per capita quantity
variable, divide GASEXP by GASP times Pop. The other variables do not need transformation.
a. Compute the multiple regression of per capita consumption of gasoline on per capita
income, the price of gasoline, all the other prices and a time trend. Report all results. Do the
signs of the estimates agree with your expectations?
b. Test the hypothesis that at least in regard to demand for gasoline; consumers do not
differentiate between changes in the prices of new and used cars.
c. Estimate the own price elasticity of demand, the income elasticity, and the cross price
elasticity with respect to changes in the price of public transportation. Do the computations
at the 2004 point in the data.
d. Re-estimate the regression in logarithms so that the coefficients are direct estimates of
the elasticities. (Do not use the log of the time trend.) How do your estimates compare with
the results in the previous question? Which specification do you prefer?
e. Compute the simple correlations of the price variables. Would you conclude that
multicollinearity is a “problem” for the regression in part a or part d?
f. Notice that the price index for gasoline is normalized to100 in 2000, whereas the other
price indices are anchored at 1983 (roughly). If you were to renormalize the indices so that
they were all 100.00 in2004, then how would the results of the regression in part a change?
How would the results of the regression in part d change?
g. This exercise is based on the model that you estimated in part d. We are interested in
investigating the change in the gasoline market that occurred in 1973. First, compute the
average values of log of per capita gasoline consumption in the years 1953–1973 and 1974–
2004 and report the values and the difference. If we divide the sample into these two
groups of observations, then we can decompose the change in the expected value of the log
of consumption into a change attributable to change in the regressors and a change
attributable to a change in the model coefficients, as shown in Section 4.7.3. Using the
Oaxaca–Blinder approach described there, compute the decomposition by partitioning the
sample and computing separate regressions. Using your results, compute a confidence
interval for the part of the change that can be attributed to structural change in the market,
that is, change in the regression coefficients.
Solution:
(a)
Stata Operation:
. gen gpc=1000000*gasexp/(gasp*pop)
. gen t =year-1952
. sum
The regression result reveals that the per capita gasoline consumption is positively related
with income, pnc, ppt, pd, pn and negatively related with gasp, puc and ps. The demand for
gasoline per capita also increases over time. Therefore, most of the signs of the coefficients
of explanatory variables agree with our expectations.
The sign of coefficient of pnc (price index for new cars) is supposed to be negative. It is so
happened that as the price index of used cars, puc, increases people may prefer to buy more
new cars and thereby increasing the demand for gasoline.
(b) To test the hypothesis that at least in regard to demand for gasoline, consumers do not
differentiate between the changes in the price of new and used cars, we set
. test pnc=puc
( 1) pnc - puc = 0
F( 1, 42) = 0.24
Prob > F = 0.6233
The estimated value of F and P-value indicates that there is no significant evidence to reject
the null hypothesis.
(c)
. mfx, at ( 27113 123.901 133.9 133.3 209.1 114.8 172.2 222.8 52) eyex
. scalar ep=bgasp*(gasp2004/gpc2004)
. scalar ei=bincome*(income2004/gpc2004)
. scalar ce=bppt*(ppt2004/gpc2004)
. scalar list ep ei ce
ep = -.22279148
ei = .9489883
ce = .23431171
(d)
. gen Lgpc=log( gpc)
. reg Lgpc Lincome Lgasp Lpnc Lpuc Lppt Lpd Lpn Lps t
The above estimates differ from the regression results previously estimates without
logarithm. The TSS as well as ESS and RSS greatly differs [ much smaller] from the regression
of without regression. The R squared and Adjusted R squared are almost same for both the
models. However, the coefficients of the logarithm model capture most of the expected
signs than the regression model of without logarithm. Therefore, logarithm model would be
more preferable.
(e)
gasp 1.0000
pnc 0.9361 1.0000
puc 0.9228 0.9939 1.0000
ppt 0.9270 0.9807 0.9824 1.0000
pd 0.9389 0.9933 0.9878 0.9585 1.0000
pn 0.9627 0.9885 0.9822 0.9899 0.9773 1.0000
ps 0.9394 0.9785 0.9769 0.9975 0.9563 0.9936 1.0000
Lgasp 1.0000
Lpnc 0.9667 1.0000
Lpuc 0.9674 0.9940 1.0000
Lppt 0.9665 0.9891 0.9910 1.0000
Lpd 0.9776 0.9932 0.9945 0.9864 1.0000
Lpn 0.9839 0.9900 0.9902 0.9942 0.9923 1.0000
Lps 0.9742 0.9902 0.9912 0.9985 0.9886 0.9979 1.0000
Therefore, the simple correlation of the price variables show that there is indeed a
multicollinearity problem in both the regressions specified in part (a) and (d).
(f) In the linear case, if we normalize to 100, the results will be the same because the
coefficient will be divided by the same scale factor, so that x*b would be unchanged, where x
is a variable and b is the coefficient. In the log linear case, since log(k*x)=log(k)+log(x), the
renormalization would simply affect the constant term. The price coefficients would be
unchanged.
(g)
. scalar ybar0=r(b)
. scalar ybar1=r(b)
. gen cons=1
. mean Lincome Lgasp Lpnc Lpuc Lppt Lpd Lpn Lps t cons if year<1974
. matrix x0=e(b)
. mean Lincome Lgasp Lpnc Lpuc Lppt Lpd Lpn Lps t cons if year>1973
. matrix x1=e(b)
. reg Lgpc Lincome Lgasp Lpnc Lpuc Lppt Lpd Lpn Lps t if year<1974
. matrix b0=e(b)
. matrix var0=e(v)
. r eg Lgpc Lincome Lgasp L pnc Lpuc Lppt Lpd Lpn Lp s t if year>1973
. m atrix b1=e(b)
. m atrix var1=e(v)
. m atrix vtotal=var0+var1
. m atrix vdb=x0*vtotal*x0'
. s calar dy_dxs=dy_dx[1,1]
. s calar dy_dbs=dy_db[1,1]
. s calar vdbs=vdb[1,1]
. d i s p l a y " d y b a r = " y b a r 1 - y b a r0
dy b a r = .
DY_DX=0.12274279
DY_DB= 0.27263373
LOWER= 0.18484676
UPPER= 0.3604207
Application 5.3:
ln(G/Pop) = α+βP ln Pg +βI ln(Income/Pop)+ γnc ln Pnc + γuc ln Puc+ γpt ln Ppt + τyear+ δd ln Pd
a. Carry out a test of the hypothesis that the three aggregate price indices are not significant
determinants of the demand for gasoline.
b. Consider the hypothesis that the microelasticities are a constant proportion of the
elasticity with respect to their corresponding aggregate. Thus, for some positive θ
(presumably between 0 and 1), γnc = θδd,γuc = θδd,γpt = θδs. The first two imply the simple
linear restriction γnc = γuc. By taking ratios, the first (or second) and third imply the nonlinear
restriction γnc /γpt = δd /δs or γncδs −γptδd =0.
Describe in detail how you would test the validity of the restriction.
c. Using the gasoline market data in TableF2.2, test the two restrictions suggested here,
separately and jointly.
Solution:
(a)
S ource SS df MS N umber of ob s = 52
F ( 9, 42 ) = 35 1.33
Model 2. 87044 868 9 .31893 8742 P rob > F = 0. 0000
R es i d u a l .0 38128 217 42 .00090 7815 R -squar ed = 0. 9869
A dj R-s quare d = 0. 9841
Total 2 .9085 769 51 .0570 3092 R oot MS E = .0 3013
( 1) Lpd = 0
( 2) Lpn = 0
( 3) Lps = 0
F( 3, 4 2) = 23.2 5
Prob > F = 0.0 000
*F=34.868735
The critical value from the F table for (3,42) df is 2.827. So, there is sufficient evidence to
reject the hypothesis. It can therefore be said that the three aggregate price indices for
durables, non-durables and consumer services are also significant determinants for the
demand for gasoline.
(b) The restricted model is quite non-liners. It would, therefore, be tedious to estimate and
examine the loss of fit. So, we can test the restriction using the unrestricted model. Let,
δf1 δ∞
G=[ ]
δf2 δ∞
0 0 0 1 -1 0 0 0 0 0
=[ ]
0 0 0 δs 0 δd 0 -γpt 0 γnc
The parameter estimates are thus, f= [-0.17399, 0.10091]. The covariance matrix to use for
the tests is Gs2(X’X)-1G’. The statistics for the joint test is as follows:
X2 = f[Gs2(X’X)-1G’]-1f= 0.4772
This is less than the critical value for a chi-squared with 2 df. So, we would not reject the
joint hypothesis. For the individual hypothesis, we need only compute the equivalent of a t
ratio for each element of f. Thus,
Neither the hypothesis would be rejected. Given the earlier result, it is to be expected.
(c)
. reg Lgpc Lgasp Lincome Lpnc Lpuc Lppt t Lpd Lpn Lps
------------------------------------------------------------------------------
Lgpc | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
(1) | .3343761 .2874485 1.16 0.251 -.2457184 .9144706
------------------------------------------------------------------------------
------------------------------------------------------------------------------
Lgpc | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
_nl_1| .1009092 .3482736 0.29 0.773 -.6019353 .8037538
------------------------------------------------------------------------------