Академический Документы
Профессиональный Документы
Культура Документы
CS 540 section 2
Louis Oliphant
oliphantcs.wisc.edu
Inductive learning
Trivially, there is a consistent decision tree Ior any training set with one path to leaI
Ior each example (unless f nondeterministic in x) but it probably won't generalize to
new examples
Idea: a good attribute splits the examples into subsets that are
(ideally) "all positive" or "all negative"
classiIication is wrong
attributes values are incorrect because
oI errors getting or preprocessing the data
Missing data:
OverIitting
meaningless regularity is Iound in the data
Handling
continuous Ieatures
noisy data
missing values
Pruning