Академический Документы
Профессиональный Документы
Культура Документы
unexplained part
logY=a+blogX
1/Y*dY/dX=b/X
b=(dY/Y)/(dX/X)=%Y/%X
%Y=b*%X
In case we get large residuals, we should try using log(LN) for getting a better
correlation and model more appropriate
MSE=Mean Square Error (Mean of squared residuals)
RMSE(Root MSE) gives a comparable idea like SD
Principle of Parsimony (Dont overcomplicate for the sake of complicating)
Within sample forecast is used for checking whether model is accurate.
Out of sample forecast are for predictive purpose
Class 2:
NON-LINEAR RELATIONSHIPS
Real world data is rarely linear
If a researcher was to ignore this consistently under/over-estimated results
ONE solution is to TRANSFORM the data by taking LOGS
This shrinks the scale and MAY transform the data to be more linear
NOTE: In financial literature taking logs and then the difference Rate of
return
In the regression results coefficients of X refers to the %age change in Y for
every 1 %age change of X
One way to compare to different research methods RMSE(Root Mean Square
Error)
RMSEResiduals options in RegressionSquare averageSq Root
MULTICOLLINEARITY
Multicollinearity may give you meaningless results, but gives you very high p
value for t-test and very low p-value for F-test
Two or more x variables are related to each other.
We only want Y to be related to x
Putting x variables that are related to each other gives us meaningless results
o
o
Cluster Analysis
Binary Factor Analysis
Class 4:
GRETL:
File->Open Data->Import->Select No as input is not time based.
CTRL-AModel->Ordinary least squares
In the new window for model->Grpahs->Residual Plot->Select anyone1 graph
To check heteroskedasticity:
Click on Test->Heteroskedasticity-oenf ->Koenker
If the value is <.05, it is not random
Class 5:
Multivariable regression analysis is used for causality i.e. y=f(x1,x2,x3,....)
Anova is used for relationship between multivariables without causality
Class 6
In case of 3 variable and time series, check the dependent variables
relationship with the independent variable. Then check the residuals and
look for time series dependency in both of them. Then check for the 3 in a
combined regression test. ve sign need not mean that it is uncorrelated.
TIME SERIES RM STATIONARY
When conducting research using times series data, we will
usually find a relationship between two or more factors.
To avoid the problem of SPURIOUS (happens to show sig results
even though not related) regressions we make sure our data is
STATIONARY (always reverts back to long run mean)
If using non-stationary data we convert it to stationary by
looking at CHANGES
This way we are looking at how CHANGE in one variable
influences CHANGE in another
Go to variable->Unit root tests->Augmented Dickey-Fuller Test
Go to Add->First differences of selected variables
Go to variable->Go to Correlogram.
Auto-regression:
Useful if you have no causal factors in your research (either due to not
finding any or due to a time constraint)
Useful if this is simply your research aim
e.g. Salest=......Salest-1
But then what lag?
Use CORRELOGRAM as a tool look for spikes the cross the blue line in
the PACF
Class 7
Double Click on advertisementClcik on edit valuesChange the last
value and press apply.
Go to modelOrdinary Least SquaresShow the model
In the opened model Go to Analysis->Go to Forecasts
How to know whether t or t is to be taken.
Click Straits VariableUnit root testADF TestChange the criterion to
t-statIf asymptotic t-values are not <0.05, it is not stationary Add first
differences of selected variables and repeat the test again.
How to know the lag
Go to variableCorrelogram->Check the most significant lag
Go to ModelTime SeriesARIMAOne of the following two ways
1. Choose Sales and make differences as 1
2. Choose d_Sales and make the differences as 0
ARIMA:
Auto Regressive (PACF)
Integrated (Another word for difference)
Moving Average (ACF)
Schwartz criteria helps us determine the best ARIMA combination.
Steps:
1. Decide whether Time series or cross section
2. Make it Stationary
3. Forecast
4. Add the change to actual values in case change is taken to make it
stationary.