Generic view of a pattern classifier
Discriminant Functions
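Let $g_i(x) = -R(\alpha_i \mid x)$
(the maximum discriminant corresponds to the minimum risk!)
For the minimum error rate, we take
$g_i(x) = P(\omega_i \mid x)$
(the maximum discriminant corresponds to the maximum posterior!)
Equivalent choices, since $p(x)$ is the same for every class and $\ln$ is monotonically increasing:
$g_i(x) = p(x \mid \omega_i) P(\omega_i)$
$g_i(x) = \ln p(x \mid \omega_i) + \ln P(\omega_i)$
($\ln$: natural logarithm!)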
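As a minimal sketch of the log form $g_i(x) = \ln p(x \mid \omega_i) + \ln P(\omega_i)$, the example below evaluates the discriminants for a one-dimensional problem and picks the largest. The Gaussian class-conditional densities, the two classes in `CLASSES`, and all function names are assumptions made for illustration; they do not come from the slides. Class indices in the code start at 0.

```python
import math

def gaussian_pdf(x, mean, std):
    """Univariate normal density, used here as an assumed p(x | w_i)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))

def log_discriminants(x, classes):
    """Return g_i(x) = ln p(x | w_i) + ln P(w_i) for every class."""
    return [
        math.log(gaussian_pdf(x, c["mean"], c["std"])) + math.log(c["prior"])
        for c in classes
    ]

def classify(x, classes):
    """Return the (0-based) index of the class with the largest discriminant."""
    g = log_discriminants(x, classes)
    return max(range(len(g)), key=g.__getitem__)

# Hypothetical two-class problem: priors sum to 1, equal standard deviations.
CLASSES = [
    {"mean": 0.0, "std": 1.0, "prior": 0.7},
    {"mean": 3.0, "std": 1.0, "prior": 0.3},
]

if __name__ == "__main__":
    for x in (0.0, 1.7, 1.9, 3.0):
        print(x, classify(x, CLASSES))
```

With these assumed parameters the decision switches between the two classes slightly to the right of the midpoint 1.5, because the larger prior of the first class pushes the boundary toward the second.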
Discriminant functions
Discriminant functions do not change the decision when scaled by some positive constant k.
The decision is not affected when a constant is added to all discriminant functions.
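The two invariances can be checked numerically. In this small sketch the discriminant values in `g` and the constants `k` and `b` are hypothetical numbers chosen for illustration. A negative scale factor is included to show why the constant has to be positive.

```python
def decide(g):
    """Return the (0-based) index of the largest discriminant value."""
    return max(range(len(g)), key=g.__getitem__)

# Hypothetical discriminant values g_i(x) at one fixed x.
g = [-1.28, -6.62, -3.05]
k, b = 2.5, 10.0  # any k > 0 and any constant b

scaled = [k * gi for gi in g]    # k * g_i(x): same ordering
shifted = [gi + b for gi in g]   # g_i(x) + b: same ordering
flipped = [-k * gi for gi in g]  # negative factor: ordering reversed

if __name__ == "__main__":
    print(decide(g), decide(scaled), decide(shifted), decide(flipped))
```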
Discriminant Functions
The feature space is divided into c decision regions:
if $g_i(x) > g_j(x)$ for all $j \neq i$, then $x$ is in $R_i$
($R_i$ means: assign $x$ to $\omega_i$)
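To make the idea of decision regions concrete, the sketch below labels points on a one-dimensional grid by the largest discriminant and merges neighbouring points with the same label into intervals. The three linear discriminants in `DISCRIMINANTS`, the grid, and the function names are assumptions for illustration only. Labels are 0-based, so label 0 stands for $R_1$.

```python
def region_label(x, discriminants):
    """Return the index i whose discriminant g_i(x) is largest."""
    values = [g(x) for g in discriminants]
    return max(range(len(values)), key=values.__getitem__)

def decision_regions(grid, discriminants):
    """Merge consecutive grid points with equal labels into (label, start, end) runs."""
    runs = []
    for x in grid:
        label = region_label(x, discriminants)
        if runs and runs[-1][0] == label:
            runs[-1] = (label, runs[-1][1], x)  # extend the current run
        else:
            runs.append((label, x, x))          # start a new run
    return runs

# Hypothetical discriminants for c = 3 categories.
DISCRIMINANTS = [
    lambda x: -x,        # largest for x < -0.5
    lambda x: 0.5,       # largest for -0.5 < x < 1.5
    lambda x: x - 1.0,   # largest for x > 1.5
]
GRID = [i / 10 for i in range(-30, 31)]  # -3.0, -2.9, ..., 3.0
RUNS = decision_regions(GRID, DISCRIMINANTS)

if __name__ == "__main__":
    for label, start, end in RUNS:
        print(f"R_{label + 1}: [{start}, {end}]")
```

Points that lie exactly on a boundary have tied discriminants; this sketch assigns them to the lower index, which is an arbitrary choice.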
The two-category case
A two-category classifier is called a dichotomizer; it has two discriminant functions $g_1$ and $g_2$
Let $g(x) \equiv g_1(x) - g_2(x)$
Decide $\omega_1$ if $g(x) > 0$; otherwise decide $\omega_2$
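Dichotomizer: the computation of $g(x)$
$g(x) = P(\omega_1 \mid x) - P(\omega_2 \mid x)$
or
$g(x) = \ln \frac{p(x \mid \omega_1)}{p(x \mid \omega_2)} + \ln \frac{P(\omega_1)}{P(\omega_2)}$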
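A dichotomizer needs only the sign of a single function. The sketch below uses the log form, where $g(x)$ is the log-likelihood ratio plus the log prior ratio, $g(x) = \ln \frac{p(x \mid \omega_1)}{p(x \mid \omega_2)} + \ln \frac{P(\omega_1)}{P(\omega_2)}$. The Gaussian densities and the parameters in `CLASS_1` and `CLASS_2` are assumptions for illustration.

```python
import math

def log_gaussian(x, mean, std):
    """Log of the univariate normal density (an assumed class-conditional model)."""
    return -0.5 * ((x - mean) / std) ** 2 - math.log(std * math.sqrt(2.0 * math.pi))

def g(x, class_1, class_2):
    """g(x) = ln[p(x|w1) / p(x|w2)] + ln[P(w1) / P(w2)]."""
    mean_1, std_1, prior_1 = class_1
    mean_2, std_2, prior_2 = class_2
    log_likelihood_ratio = log_gaussian(x, mean_1, std_1) - log_gaussian(x, mean_2, std_2)
    log_prior_ratio = math.log(prior_1 / prior_2)
    return log_likelihood_ratio + log_prior_ratio

def decide(x, class_1, class_2):
    """Return 1 (decide w1) if g(x) > 0, otherwise 2 (decide w2)."""
    return 1 if g(x, class_1, class_2) > 0 else 2

# Hypothetical (mean, std, prior) triples.
CLASS_1 = (0.0, 1.0, 0.7)
CLASS_2 = (3.0, 1.0, 0.3)

if __name__ == "__main__":
    for x in (0.0, 1.5, 2.0, 3.0):
        print(x, round(g(x, CLASS_1, CLASS_2), 3), decide(x, CLASS_1, CLASS_2))
```

For equal standard deviations $g(x)$ is linear in $x$; with the assumed numbers it equals $-3x + 4.5 + \ln(7/3)$, so the boundary $g(x) = 0$ sits at $x = 1.5 + \ln(7/3)/3$.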
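Multivariate normal density in $d$ dimensions:
$p(x) = \frac{1}{(2\pi)^{d/2} |\Sigma|^{1/2}} \exp\left[-\frac{1}{2}(x-\mu)^T \Sigma^{-1} (x-\mu)\right]$
where $\mu = E[x]$ is the mean vector
and $\Sigma = E[(x-\mu)(x-\mu)^T]$ is the $d \times d$ covariance matrix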
The normal distribution has the maximum entropy of all distributions with a
given mean and variance.
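A quick numerical illustration of the maximum-entropy property: the sketch below compares the closed-form differential entropies (in nats) of a normal, a Laplace, and a uniform distribution that all share the same variance $\sigma^2$. It checks only these three families, so it illustrates the statement rather than proving it. The function names are chosen for this example.

```python
import math

def gaussian_entropy(sigma):
    """Differential entropy of N(mu, sigma^2): 0.5 * ln(2 * pi * e * sigma^2)."""
    return 0.5 * math.log(2.0 * math.pi * math.e * sigma ** 2)

def laplace_entropy(sigma):
    """Laplace with variance sigma^2 has scale b = sigma / sqrt(2); entropy is 1 + ln(2b)."""
    scale = sigma / math.sqrt(2.0)
    return 1.0 + math.log(2.0 * scale)

def uniform_entropy(sigma):
    """Uniform with variance sigma^2 has width w = sigma * sqrt(12); entropy is ln(w)."""
    width = sigma * math.sqrt(12.0)
    return math.log(width)

if __name__ == "__main__":
    for sigma in (0.5, 1.0, 3.0):
        print(sigma, gaussian_entropy(sigma), laplace_entropy(sigma), uniform_entropy(sigma))
```

The mean does not appear because differential entropy is unchanged by shifting a distribution.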
2D Gaussian
Mahalanobis Distance
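The Mahalanobis distance from $x$ to the mean $\mu$ of a distribution with covariance matrix $\Sigma$ is $r = \sqrt{(x-\mu)^T \Sigma^{-1} (x-\mu)}$. As a minimal sketch for the two-dimensional case, the function below inverts the 2 x 2 covariance matrix in closed form. The function name and the example matrices are assumptions for illustration.

```python
import math

def mahalanobis_2d(x, mean, cov):
    """Mahalanobis distance in 2D: sqrt((x - mean)^T cov^{-1} (x - mean))."""
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det <= 0:
        raise ValueError("covariance matrix must be positive definite")
    # Inverse of [[a, b], [c, d]] is (1 / det) * [[d, -b], [-c, a]].
    r_squared = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.sqrt(r_squared)

if __name__ == "__main__":
    identity = ((1.0, 0.0), (0.0, 1.0))
    correlated = ((2.0, 1.0), (1.0, 2.0))
    print(mahalanobis_2d((3.0, 4.0), (0.0, 0.0), identity))     # reduces to Euclidean distance
    print(mahalanobis_2d((1.0, 1.0), (0.0, 0.0), correlated))   # along the correlation
    print(mahalanobis_2d((1.0, -1.0), (0.0, 0.0), correlated))  # against the correlation
```

With the identity covariance the result is the ordinary Euclidean distance. With positively correlated features, a point lying along the correlation direction is closer in this metric than a point at the same Euclidean distance lying against it.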