Вы находитесь на странице: 1из 1

15.062 Data Mining – Spring 2003 Nitin R.

Patel

Comparison of Data Mining techniques – large data sets – Guidelines (… and only guidelines)

H: high, M:medium, L:low.


Multiple Logistic Discriminant Naïve Neural Trees k-Nearest
Linear Regression Analysis Bayes Nets Neighbors
Regression
Accuracy M M M HM H M HM

Intepretability H H M H L H L

Speed- H H H HM L HM H
Training

Speed- H H H H H HM L
Deployment

Effort in HM HM HM HM L L ML
choice and
transformation
of indep.Vars.
Effort to tune L L L ML H ML ML
performance
parameters
Robustness to ML ML ML ML HM H HM
Outliers in
indep vars
Robustness to H H HM H L ML L
irrelevant
variables
Ease of M M M H M H ML
handling of
missing
values
Natural H H ML M H H L
handling both
categorical
and
continuous
variables

Вам также может понравиться