Академический Документы
Профессиональный Документы
Культура Документы
21
4.2.6 Assessment
The last step is we add Assessment node and connect it with the three nodes
which are Decision tree node, Regression node, and Neural Network node.
Figure 18
Then, click Run at Assessment node to get the result.
Figure 19
From the result, we can see that Decision Tree is the best model for now, because
the regression is not included in this table .This is because the value of
misclassification rate from testing data set is 0.
22
5. SUMMARY AND DISCUSSION
Before this, from the Assessment result, we state that the Decision Tree is the best model
because the test misclassification rate is equal to zero. But now, we can see that, regression
model also have a zero misclassification rate for the test data.
Figure 21
Model Training: Misclassification
Rate
Test: Misclassification Rate
Decision Tree 0.036036036 0
Neural Network 0.027027027 0.0208333333
Regression 0.018 0
Figure 20
23
Figure 21 shows the Lift Chart for the regression model. From the lift chart, the cumulative %
response is 100% through the 30th percentile. At the 40th percentile, the next observation with
the highest predicted probability is a non-response, so the cumulative response drops to 91.25%.
Thus, Regression is the best model.
24
REFERENCES
1) Retrieved from : http://www.amstat.org/publications/jse/jse_data_archive.html
2) Journal of Statistics Education Data Archive (2006), "Fish Catch data set (1917)
3) http://support.sas.com/documentation/cdl/en/stsug/62259/HTML/default/viewer.htm#uga
ppdatasets_sect8.htm