improvement
Seddik Ilias
Meftah Boudjelal
Benyettou Abdelkader
Keywords: entropy; dictionary; coding
I. INTRODUCTION
A. Data Pre-Treatment
In this step we build the vector that will be coded by the
transform. The input data is subdivided into blocks and, if
necessary, reordered. The purpose of this reordering is to make
each block more regular by increasing the correlation between
each element of the input data and the element that follows it.
The processing applied depends on the nature of the input data.
For text, the Burrows & Wheeler transform [4] is applied; it
sorts blocks of text lexicographically, which increases
repetitions. For image inputs, the matrix is divided into blocks
and each block is reordered according to the flow geometry inside
the block [5], or simply by choosing, among the horizontal,
vertical and diagonal directions, the one that minimizes the
resulting entropy. The same operation is applied to video, on the
image sequence. The goal of these pre-treatments is to
minimize high frequencies (defined here as a large difference
between a coefficient and the one that follows it) for images,
sounds and videos, and to reduce the disorder of characters in
textual data.
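For the text case, the block sorting mentioned above can be illustrated with a deliberately naive Burrows & Wheeler transform in Python. This is a minimal sketch, not the implementation used in the paper: the function names `bwt` and `inverse_bwt` are hypothetical, the block is assumed to be a non-empty string, and the row index is returned alongside the last column so that the block can be restored.

```python
def bwt(block):
    """Naive Burrows-Wheeler transform of a non-empty string.

    Returns the last column of the sorted rotation table and the row
    index at which the original block appears in that table.
    """
    rotations = sorted(block[i:] + block[:i] for i in range(len(block)))
    last_column = "".join(rotation[-1] for rotation in rotations)
    return last_column, rotations.index(block)


def inverse_bwt(last_column, index):
    """Rebuild the original block from the last column and the row index."""
    table = [""] * len(last_column)
    for _ in range(len(last_column)):
        # Prepend the last column to every row, then sort again.
        table = sorted(last_column[i] + table[i] for i in range(len(last_column)))
    return table[index]


transformed, row = bwt("banana")
print(transformed, row)               # nnbaaa 3
print(inverse_bwt(transformed, row))  # banana
```

Identical characters end up next to each other in the output ("nn", "aaa"), which is the increase in repetitions that the pre-treatment aims for. Sorting all rotations is quadratic in memory, so this form is only suitable for small illustrative blocks.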
B. Processing method
C(1) = M(1)                                  % the first element is kept unchanged
for i = 2:length(M)
    C(i) = position of M(i) in V_{M(i-1)}    % rank of M(i) among the followers of M(i-1)
end
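The loop above can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' code: the names `build_followers`, `encode` and `decode` are hypothetical, each follower vector is assumed to be ordered by decreasing number of occurrences (ties kept in order of first appearance), and positions are 1-based as in the pseudocode.

```python
from collections import Counter, defaultdict


def build_followers(M):
    """Map each symbol to the list of symbols that follow it in M.

    Assumption: followers are ordered by decreasing count, ties in
    order of first appearance.
    """
    counts = defaultdict(Counter)
    for previous, current in zip(M, M[1:]):
        counts[previous][current] += 1
    return {s: [sym for sym, _ in c.most_common()] for s, c in counts.items()}


def encode(M, V):
    """C(1) = M(1); C(i) = 1-based position of M(i) in V[M(i-1)]."""
    C = [M[0]]
    for i in range(1, len(M)):
        C.append(V[M[i - 1]].index(M[i]) + 1)
    return C


def decode(C, V):
    """Invert encode: each position is looked up in the followers
    of the previously decoded symbol."""
    M = [C[0]]
    for position in C[1:]:
        M.append(V[M[-1]][position - 1])
    return M


M = list("abracadabra")
V = build_followers(M)
C = encode(M, V)
print(C)                   # ['a', 1, 1, 1, 2, 1, 3, 1, 1, 1, 1]
print(decode(C, V) == M)   # True
```

Most coded values are small positions, which is what makes the output easier to compress. The follower vectors act as the dictionary, so in this sketch the decoder needs V as well as C.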
Example: a = [2 5 3 2 5 6 3 2 4 5 2 5 1 7 5 1 1 2 3 1 7 2 3 1]
V1 is a two-line array: the first line is the vector of the
symbols that follow the character 1 in a, and the second line
gives the number of occurrences of each of them after this
character.
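The two lines of V1 can be recomputed from the example vector with a few lines of Python. The helper name `follower_table` is hypothetical, and the column order (most frequent follower first, ties in order of first appearance) is an assumption, since the source does not state how the columns are ordered.

```python
from collections import Counter

a = [2, 5, 3, 2, 5, 6, 3, 2, 4, 5, 2, 5, 1, 7, 5, 1, 1, 2, 3, 1, 7, 2, 3, 1]


def follower_table(data, symbol):
    """Return (followers, occurrences) for the given symbol.

    followers: symbols that appear immediately after `symbol` in data.
    occurrences: how many times each of them appears in that position.
    """
    counts = Counter(nxt for cur, nxt in zip(data, data[1:]) if cur == symbol)
    ordered = counts.most_common()  # assumed ordering: most frequent first
    return [sym for sym, _ in ordered], [n for _, n in ordered]


followers, occurrences = follower_table(a, 1)
print(followers)    # [7, 1, 2]
print(occurrences)  # [2, 1, 1]
```

The last element of a is a 1 with nothing after it, so it contributes no follower.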
Figure 3. Coefficient distribution before (a) and after (b) application of the transform
[Figure: worked example on the vector a, with the labels "Last coefficient decoded" and Vf(1) to Vf(4)]

III. CODING

Type    Input data    Original entropy    Transformed data entropy
Text    Paper 1       4.983               3.830
Text    Paper 2       4.601               3.592
Image   Pic           1.210               1.004
Image   Barbara       15.735              5.367
Image   Boat          11.385              5.128
Image   Flinstones    14.724              4.946
Video   Claire        6.418               3.209
Video   Salesman      6.801               4.605
Video   Foreman       7.173               4.589
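For reference, a zero-order Shannon entropy in bits per element can be computed as in the sketch below. The source does not say how the entropy values listed above were measured (several exceed 8 bits, so the symbols may not be single bytes), so this is only one plausible reading; `shannon_entropy` is a hypothetical helper name and the input is the example vector a.

```python
import math
from collections import Counter


def shannon_entropy(data):
    """Zero-order Shannon entropy of a sequence, in bits per element."""
    counts = Counter(data)
    total = len(data)
    return -sum(n / total * math.log2(n / total) for n in counts.values())


a = [2, 5, 3, 2, 5, 6, 3, 2, 4, 5, 2, 5, 1, 7, 5, 1, 1, 2, 3, 1, 7, 2, 3, 1]
print(f"{shannon_entropy(a):.3f} bits per element")
```

A transform lowers this measure when it concentrates the values on a few symbols, for example many small positions instead of a spread of raw values.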
TABLE III. COMPRESSION PERFORMANCE IN TERMS OF RATIO (AVERAGE BITS PER ELEMENT) AND RUNNING TIMES (SEC) FOR DIFFERENT TYPES OF DATA

Type    Input data    Original size    Compressed vectors size    Dictionary size    Compression ratio    Running times (sec)
Text    Paper 1       53161            24520                      1556               3.92                 0.078
Text    Paper 2       82199            35518                      1340               3.58                 0.171
Image   Pic           513216           56302                      3009               1.06                 2.44
Image   Barbara       262144           163500                     14888              5.44                 1.48
Image   Boat          262144           168140                     9338               5.41                 1.24
Image   Flinstones    262144           162330                     12617              5.33                 2.15
Video   Claire        18779904         753212                     41795              3.22                 61.3
Video   Salesman      17071922         9827967                    42384              4.62                 69.2
Video   Foreman       11406644         6418926                    30393              4.52                 41.1
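The table does not state how the ratio column is derived from the size columns. One reading, offered here only as an assumption, is that all sizes are in bytes and the ratio is the total output (compressed vectors plus dictionary) in bits divided by the number of input elements. The function name `bits_per_element` is hypothetical.

```python
def bits_per_element(original_size, compressed_size, dictionary_size, bits_per_unit=8):
    """Average number of output bits per input element.

    Assumption: all three sizes are expressed in bytes, so the output
    is converted to bits with bits_per_unit = 8.
    """
    return bits_per_unit * (compressed_size + dictionary_size) / original_size


# Paper 1 and Barbara rows of the table
print(round(bits_per_element(53161, 24520, 1556), 2))     # 3.92
print(round(bits_per_element(262144, 163500, 14888), 2))  # 5.44
```

Under this reading the Paper 1 and Barbara rows are reproduced exactly; not every row agrees with it, so the formula should be treated as a plausible interpretation rather than the definition used in the paper.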