Академический Документы
Профессиональный Документы
Культура Документы
Raymond J. Mooney
University of Texas at Austin
1
Bayesian Networks
Roots (sources) of the DAG that have no parents are given prior
probabilities.
Earthquake
Burglary
P(B)
.001
P(E)
Earthquake
Burglary
Alarm
B
T
Alarm
JohnCalls
MaryCalls
A
P(J)
.90
.05
E
T
.002
P(A)
.95
.94
.29
.001
A
MaryCalls
JohnCalls
P(M)
.70
.01
CPT Comments
P ( x1 , x2 ,...xn ) = P( xi | Parents( X i ))
i =1
Example
P( J M A B E )
= P( J | A) P( M | A) P( A | B E ) P(B) P(E )
= 0.9 0.7 0.001 0.999 0.998 = 0.00062
X1
X2
Xn
???
Earthquake
Burglary
For example, if we know the alarm went off that someone called
about the alarm, then it makes earthquake and burglary dependent
since evidence for earthquake decreases belief in burglary. and
vice versa.
Alarm
MaryCalls
JohnCalls
10
P(B)
???
P(B)
.001
Earthquake
Burglary
Alarm
JohnCalls
MaryCalls
???
.001
Earthquake
Burglary
Alarm
JohnCalls
P(J)
P(J)
.90
.90
.05
.05
11
MaryCalls
12
Types of Inference
Sample Inferences
Diagnostic (evidential, abductive): From effect to cause.
13
14
15
16
Burglary
JohnCalls
17
Alarm
Earthquake
MaryCalls
18
Assumes the CPT for each node is a matrix (M) with a column
for each value of the nodes variable and a row for each
conditioning case (all rows must sum to 1).
Matrix M for
the Alarm node
Value of Alarm
T
F
TT
0.95 0.05
Values
FF 0.001 0.999
21
Node Clustering
Medical diagnosis
Pathfinder system outperforms leading experts
in diagnosis of lymph-node disease.
Microsoft applications
Problem diagnosis: printer problems
Recognizing user intents for HCI
Number of values for merged node is exponential
in the number of nodes merged.
Still reasonably tractable for many network
topologies requiring relatively little merging to
eliminate loops.
24
Statistical Revolution
Across AI there has been a movement from logicbased approaches to approaches based on
probability and statistics.
Social networks
Biochemical compounds
Web sites
Biochemical Data
Predicting mutagenicity
[Srinivasan et. al, 1995]
Grad
Student
Research
Project
27
Collective Classification
Traditional learning methods assume that
objects to be classified are independent (the
first i in the i.i.d. assumption)
In structured data, the class of an entity can
be influenced by the classes of related
entities.
Need to assign classes to all objects
simultaneously to produce the most
probable globally-consistent interpretation.
Logical AI Paradigm
Represents knowledge and data in a binary
symbolic logic such as FOPC.
Probabilistic AI Paradigm
probabilistic reasoning.
Conclusions
Bayesian learning methods are firmly based on
probability theory and exploit advanced methods
developed in statistics.
Nave Bayes is a simple generative model that works
fairly well in practice.
A Bayesian network allows specifying a limited set
of dependencies using a directed graph.
Inference algorithms allow determining the
probability of values for query variables given
values for evidence variables.
33