Академический Документы
Профессиональный Документы
Культура Документы
Abstract lieve that digital video will soon become a part of every-
day life. Unlike still images, video sequences provide more
The advance of technology makes video acquisition de- information about how objects and scenarios change over
vices better and less costly, thereby increasing the num- time, but at the cost of increased space for storage and wider
ber of applications that can effectively utilize digital video. bandwidth for transmission. Therefore, the topic of video
Compared to still images, video sequences provide more in- compression have drawn more and more attentions during
formation about how objects and scenarios change over recent years. The MPEG 4 standard suggests the usage of
time. However, video needs more space for storage and object planes in video compression. By segmenting the ob-
wider bandwidth for transmission. Hence is raised the topic ject planes and determine the motion parameter for each
of video compression. The MPEG 4 compression standard one correctly, good compression results can be achieved.
suggests the usage of object planes. If the object planes Hence, algorithms of tracking objects are needed. Also, in
are segmented correctly and the motion parameters are de- the context of reconnaissance, robot technology, etc., there
rived for each object plane accordingly, a better compres- are great interests in moving object tracking algorithms.
sion ratio can be expected. Therefore, to take full advantage Therefore in this paper, we present an algorithm of moving
of the MPEG 4 standard, algorithms for tracking objects object tracking.
are needed. It is also obvious that there is great interest in Many existing algorithms [1]-[4] segment each video
moving object tracking algorithms in the fields of reconnais- frame to determine the objects; this action can be compu-
sance, robot technology, etc. So, we propose an algorithm tationally expensive, and it is not necessary if the goal is to
to track moving objects in video sequences. determine the moving objects. Alternatively, we proposed
The algorithm first separates the moving objects from the an algorithm [5] that derives the objects based on the motion
background in each frame. Then, four sets of variables are between frames. While initial results were promising, the
computed based on the positions, the sizes, the grayscale tracking algorithm in [5] is limited and not able to handle
distributions and the presence of textures of the objects. A some complex situations such as new tracks (object starts
rule-based method is developed to track the objects between moving), ceased tracks (object stops moving) and possible
frames, based on the values of the variables. Preliminary collisions (objects move together). Therefore, we present
experimental results show that the algorithm performs well. a rule-based method to deal with these situations. The rest
The tests also show that the algorithm obtains success in of the paper is organized as follows: Section 2 will review
indicating new tracks (object starts moving), ceased tracks some necessary background on the wavelet transform and
(object stops moving) and possible collisions (objects move camera motion models. We will review the algorithm that
together). we developed in [5] in Section 3 and introduce the tracking
algorithm in Section 4. The experimental results are shown
in Section 5, and the conclusions are given in Section 6.
1. Introduction
2. Background
Because of the advance in technology, there are more
affordable digital video acquisition devices in the market. 2.1. Wavelet Transform and Filter Banks
This means more applications for digital video. Having
witnessed the success of web camera applications and the Due to the extensive study done on it, the wavelet trans-
appearance of high definition digital video cameras, we be- form is now a very powerful tool in signal analysis and rele-
vant fields. Unlike some traditional transforms, the Fourier
transform for example, the wavelet transform can achieve
both spatial and frequency localization. In the area of dis-
crete signal analysis, the wavelet transform is closely re-
lated to filter banks. A two-channel filter bank is shown in
Fig. 1 [6].
For digital images, the signal is two dimensional. There- through equation
fore, in image analysis and compression, two 1-D wavelet 2 3 2 32 3
u m1 m2 m3 x
analyses are usually applied to the horizontal and vertical 4 v 5=4 m4 m5 m6 54 y 5
directions of the images separately; the structures in Fig. 1
w m7 m8 1 1
are cascaded, as shown in Fig. 2. Using the decomposition
structure in Fig. 2 repeatedly, a wavelet pyramid can be cre- where (x; y ) is the original coordinates. The transformed
ated for multi-resolution analysis. A wavelet pyramid of the coordinates (x0 ; y 0 ) can be obtained using equations
image “Lena” is given in Fig. 3 as an example [7].
If we examine Fig. 3 carefully, we can see that in the m1 x + m2 y + m3 u
x0 = = ;
high frequency bands, the coefficients have large ampli- m7 x + m8 y + 1 w
tudes at the location of the edges. This property is some-
m4 x + m5 y + m6 v
times used in edge detection. y0 = = :
m7 x + m8 y + 1 w
2.2. Camera Motion Estimation The bilinear model, described by the equations below, is an
approximation of the projective model,
References
[1] D. Wang, Unsupervised Video segmentation Based On Watersheds
And Temporal Tracking, IEEE Trans. Circuits Syst. Video Technol.,
8(5):539-546, September 1998.
[2] G. L. Foresti, Object Recognition And Tracking For Remote Video
Surveillance, IEEE Trans. Circuits Syst. Video Technol., 9(7):1045-1062,
October 1999.
[3] P. Salembier, F. Marques, M. Parda s, J. R. Morros, I. Corset, S.
Jeannin, L. Bouchard, F. Meyer, B. Marcotegui, Segmentation-Based
Video Coding System Allowing The Manipulation Of Objects, IEEE
Trans. Circuits Syst. Video Technol., 7(1):60-74, February 1997.
[4] A. J. Lipton, H. Fujiyoshi, R. S. Patil, Moving Target Classification
(b) frame 5 And Tracking From Real-time Video, Applications of Computer Vision,
1998. WACV ’98. Proceedings., Fourth IEEE Workshop on, pp. 8-14,
1998.
Figure 6. detected object areas in frame 4 and [5] Y. Wang, R.E. Van Dyck, J. F. Doherty, Tracking Moving Objects in
frame 5 Video Sequences, Proc. Conference on Information Sciences and Systems,
Princeton, NJ, March 2000.
[6] M. Vetterli, J. Kovacevic, Wavelets And Subband Coding, Prentice-
ject (135.5, 332.1), we evaluate the dispersions of the object Hall, INC., Upper Saddle River, NJ, 1995.
and the tracks. The dispersion for the new object is 66.0 [7] M. Antonini, M. Barlaud, P. Mathieu, and I. Daubechies, Image
while those for the two tracks are 35.2 and 42.5. Because Coding Using Wavelet Transform IEEE Trans. on Image Processing,
42:5 < 66:0 < 35:2 + 42:5 = 77:7, we can claim that there 1(2):205-220, April 1992.
is possible track collision in frame 6. [8] S. Mann, R. W. Picard, Video Orbits of The Projective Group: A
Simple approach To Featureless Estimation Of Parameters, IEEE Trans.
on Image Processing, 6(9):1281-95, September 1997.
6. Conclusion and comments [9] S. Lertrattanapanich, N. K. Bose, Latest Results On High-resolution
Reconstruction From Video Sequence, Technical Report Of IEICE.
DSP99-140, pp. 59-65, December 1999.
In this paper, we first reviewed an algorithm to isolate
the moving objects in video sequences and then presented a
rule-based tracking algorithm. The preliminary experimen-
tal results demonstrate the effectiveness of the algorithm
even in some complicated situations, such as new track,
ceased track, track collision, etc.
The goal of the algorithm is to identify and track the
moving object quickly. However, the projective model for