Advanced Pattern Mining
Vu Manh Cam
Nguyen Quy Ky Nguyen
Luong Anh Tuan
Nguyen Kim Chinh
Outline
Pattern Pruning
Data Pruning
Pattern Fusion
Pattern Clustering
Anti-monotonic Constraints
Monotonic Constraints
Succinct Constraints
Convertible Constraints
Example:
Data-space pruning
Two properties:
Data Succinctness
Data Anti-monotonicity
Data Succinctness
Data Anti-monotonicity
Example
If Ti together with S does not satisfy C1, then Ti can be pruned.
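The pruning rule above can be sketched in Python. This is only an illustration under stated assumptions: the slides do not say what C1 is, so the constraint used here (sum of item prices at least `min_sum`, with non-negative prices), the `price` table, and the helper names `can_prune` and `prune_database` are all hypothetical.

```python
def can_prune(transaction, pattern, price, min_sum):
    """Return True if the transaction Ti cannot help the pattern S satisfy C1.

    Hypothetical C1: sum of item prices in a pattern >= min_sum.
    Any pattern grown from S inside Ti uses only items of Ti | S, so with
    non-negative prices the sum over that union is the best achievable value.
    """
    best_sum = sum(price[item] for item in set(transaction) | set(pattern))
    return best_sum < min_sum


def prune_database(transactions, pattern, price, min_sum):
    """Keep only the transactions that may still contribute to C1."""
    return [t for t in transactions if not can_prune(t, pattern, price, min_sum)]


# Hypothetical prices and a database projected on the pattern S = {b}.
price = {"a": 40, "b": 10, "c": 20, "d": 5, "e": 60}
transactions = [{"a", "b"}, {"b", "c", "d"}, {"a", "b", "e"}, {"a", "b", "c"}]
kept = prune_database(transactions, {"b"}, price, min_sum=60)
print(kept)
```

With these made-up prices, the first two transactions can never reach the threshold together with S, so they are dropped before further mining.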
3/7/15
Introduction
For a dataset D[m, n] where n is very large but m is only about 100 to 1000, a new mining strategy is needed.
Pattern-Fusion
Core Patterns
Intuitively, for a pattern $\alpha$, a subpattern $\beta$ is a $\tau$-core pattern of $\alpha$ if $\beta$ shares a similar support set with $\alpha$, i.e.,
$$\frac{|D_\alpha|}{|D_\beta|} \ge \tau, \quad 0 < \tau \le 1$$
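Robustness
A pattern $\alpha$ is $(d, \tau)$-robust if $d$ is the maximum number of items that can be removed from $\alpha$ such that the resulting pattern is still a $\tau$-core pattern of $\alpha$.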
Transaction (# of Ts)
(abe) (100)
(bcf) (100)
(acf) (100)
(abcef) (100)
Core patterns of (abcef):
(ab), (ac), (af), (ae), (bc), (bf), (be), (ce), (fe), (e), (abc), (abf), (abe), (ace), (acf), (afe), (bcf), (bce), (bfe), (cfe), (abcf), (abce), (bcfe), (acfe), (abfe), (abcef)
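The core patterns in this example can be recomputed with a short Python sketch. It assumes that the support set of a pattern is the set of transactions containing it and that the threshold is 0.5; that value is not stated in the extracted text, but it reproduces the 26 patterns listed for (abcef). The names `support` and `core_patterns` are hypothetical helpers.

```python
from itertools import combinations

# Transaction database from the example: each distinct transaction occurs 100 times.
database = {"abe": 100, "bcf": 100, "acf": 100, "abcef": 100}


def support(pattern, db):
    """Number of transactions in db that contain every item of the pattern."""
    items = set(pattern)
    return sum(count for t, count in db.items() if items <= set(t))


def core_patterns(alpha, db, tau):
    """Enumerate all non-empty subpatterns beta of alpha with |D_alpha| / |D_beta| >= tau."""
    sup_alpha = support(alpha, db)
    cores = []
    for size in range(1, len(alpha) + 1):
        for combo in combinations(alpha, size):
            beta = "".join(combo)
            # Equivalent to sup_alpha / support(beta) >= tau, without a division.
            if sup_alpha >= tau * support(beta, db):
                cores.append(beta)
    return cores


cores = core_patterns("abcef", database, tau=0.5)
print(len(cores))  # 26
print(cores)
```

Note that (cf) is not a core pattern here: it occurs in 300 transactions while (abcef) occurs in 100, so the ratio falls below 0.5. The single items a, b, c and f are excluded for the same reason.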
For each seed pattern thus picked, we find all the patterns within a bounding ball centered at the seed pattern.
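A minimal sketch of the ball search follows. The slides do not give the distance function here, so this example assumes the distance between two patterns is one minus the Jaccard similarity of their support sets; the `radius` value, the candidate `pool`, and the function names are hypothetical.

```python
def support_set(pattern, transactions):
    """IDs of the transactions that contain every item of the pattern."""
    items = set(pattern)
    return {tid for tid, t in enumerate(transactions) if items <= set(t)}


def pattern_distance(p, q, transactions):
    """Assumed metric: 1 - |Dp & Dq| / |Dp | Dq| over support sets."""
    dp = support_set(p, transactions)
    dq = support_set(q, transactions)
    union = dp | dq
    if not union:
        # Neither pattern occurs anywhere; treat them as identical.
        return 0.0
    return 1.0 - len(dp & dq) / len(union)


def ball(seed, pool, transactions, radius):
    """All patterns in the pool within `radius` of the seed pattern."""
    return [p for p in pool if pattern_distance(seed, p, transactions) <= radius]


transactions = ["abe", "bcf", "acf", "abcef"]
pool = ["ab", "ae", "cf", "abe", "bcf"]
near = ball("ab", pool, transactions, radius=0.5)
print(near)
```

Patterns whose support sets nearly coincide with the seed's fall inside the ball, which is what makes them candidates for fusion into a larger pattern.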
A bounded-breadth pattern tree traversal
It avoids the explosion in mining mid-sized patterns
Efficiency
Introduction
Minimum Support: 2
Transactions:
(a, b, c, d)
(a, b, d, e)
(b, e, f)
Frequent patterns:
(a) : 2
(b) : 3
(d) : 2
(e) : 2
(a, b) : 2
(a, d) : 2
(b, d) : 2
(b, e) : 2
(a, b, d) : 2
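The frequent patterns of this small example can be checked with a brute-force Python sketch. It enumerates every candidate itemset, which is only feasible for a toy database like this one and is not meant as an efficient mining algorithm; the function name `frequent_patterns` is a hypothetical helper.

```python
from itertools import combinations

# The three transactions of the example.
transactions = [
    {"a", "b", "c", "d"},
    {"a", "b", "d", "e"},
    {"b", "e", "f"},
]
min_support = 2


def frequent_patterns(transactions, min_support):
    """Count every non-empty itemset and keep those meeting min_support."""
    items = sorted(set().union(*transactions))
    result = {}
    for size in range(1, len(items) + 1):
        for candidate in combinations(items, size):
            count = sum(1 for t in transactions if set(candidate) <= t)
            if count >= min_support:
                result[candidate] = count
    return result


patterns = frequent_patterns(transactions, min_support)
for pattern, count in patterns.items():
    print(pattern, ":", count)
```

Running it yields the nine patterns listed above. Several of them share the same support, which is the redundancy that pattern compression aims to reduce.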
Compressing Frequent Patterns
Key Problems
Distance Measure
Clustering Criterion
$\delta$-clustering