Академический Документы
Профессиональный Документы
Культура Документы
statisticsbyjim.com
1 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
Psychic predictions are things that just pop into mind and are not
often verified against reality. Unsurprisingly, predictions in the
regression context are more rigorous. We need to collect data for
relevant variables, formulate a model, and evaluate how well the
model fits the data.
4. If you have a model that adequately fits the data, use it to make
predictions.
While this process involves more work than the psychic approach, it
provides valuable benefits. With regression, we can evaluate the
bias and precision of our predictions:
2 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
Why might we want to use BMI to predict body fat percentage? It’s
more expensive to obtain your body fat percentage through a direct
measure like DXA. If you can use your BMI to predict your body fat
percentage, that provides valuable information more easily and
cheaply. Let’s see if BMI can produce good predictions!
3 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
4 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
You should also assess the residual plots. If you see patterns in the
residual plots, you know that your model is incorrect and that you
need to reevaluate it. Non-random residuals indicate that the
predicted values are biased. You need to fix the model to produce
unbiased predictions.
The residual plots below also confirm the unbiased fit because the
data points fall randomly around zero and follow a normal
distribution.
In the statistical output below, the p-values indicate that both the
linear and squared terms are statistically significant. Based on all of
this information, we have a model that provides a statistically
significant and unbiased fit to these data. We have a valid
regression model. However, there are additional issues we must
5 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
Later, I’ll generate predictions and show you how to assess the
precision.
6 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
Goodness-of-Fit Measures
R-squared and S indicate how well the model fits the observed
data. We need predictions for new observations that the analysis
did not use during the model estimation process. Assessing that
type of fit requires a different goodness-of-fit measure, the
predicted R-squared.
7 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
Regression predictions are valid only for the range of data used to
estimate the model. The relationship between the independent
variables and the dependent variable can change outside of that
range. In other words, we don’t know whether the shape of the
curve changes. If it does, our predictions will be invalid.
The graph shows that the observed BMI values range from 15-35.
We should not make predictions outside of this range.
8 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
I’ll use the software to predict the body fat percentage for a BMI of
18. The prediction output is below.
The output indicates that the mean value associated with a BMI of
18 is estimated to be ~23% body fat. Again, this mean applies to
the population of middle school girls. Let’s assess the precision
using the confidence interval (CI) and the prediction interval (PI).
The confidence interval is the range where the mean value for girls
with a BMI of 18 is likely to fall. We can be 95% confident that this
mean is between 22.1% and 23.9%. However, this confidence
interval does not help us evaluate the precision of individual
9 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
predictions.
10 of 11 10/05/2020, 06:17
Making Predictions with Regression Analysis - Statistics By Jim about:reader?url=https://statisticsbyjim.com/regression/predictions-regre...
11 of 11 10/05/2020, 06:17