Академический Документы
Профессиональный Документы
Культура Документы
Abstract
Image compression is necessary now days to transmit the image data with reduced
bandwidth and storage capacity. Region based compression is a special type of compression
technique in which the required portion is not altered and the remaining portion of the image is
compressed using discrete wavelet transform. The image is segmented by using Kmeans
clustering algorithm (unsupervised classification) and then Area of Interest (AOI) is subtracted
from the original image. The background image (remaining portion) is compressed using various
wavelet functions and its performance is also analysed. Then the compressed image is added
with the AOI and its performance is analysed. This image is encoded using any one of the
lossless coding technique.
1.Introduction
The seven spectral bands and its frequency spectrum are given below.
by converting the seven spectral bands into three primary color bands. The primary color bands
are Red, Blue and Green. The advantage of these primary colors is, objects can be easily
partitioned in these regions. Red signifies the NIR band, Green signifies the visible red band, and
Blue signifies the visible green band.In the CIR image, water features are very dark (the Seine
River)and green vegetation appears red (urban areas).
2. K-means Algorithm
The K-means algorithm is an iterative technique that is used to partition an image into K clusters.
The basic algorithm is:
2. Assign each pixel in the image to the cluster that minimizes the distance between the
pixel and the cluster center
3. Re-compute the cluster centers by averaging all of the pixels in the cluster
4. Repeat steps 2 and 3 until convergence is attained (e.g. no pixels change clusters)
In this case, distance is the squared or absolute difference between a pixel and a cluster center.
The difference is typically based on pixel color, intensity, texture, and location, or a weighted
combination of these factors. K can be selected manually, randomly, or by a heuristic.
This algorithm is guaranteed to converge, but it may not return the optimal solution. The quality
of the solution depends on the initial set of clusters and the value of K.
In statistics and machine learning, the k-means algorithm is a clustering algorithm to partition n
objects into k clusters, where k < n. It is similar to the expectation-maximization algorithm for
mixtures of Gaussians in that they both attempt to find the centers of natural clusters in the data.
The model requires that the object attributes correspond to elements of a vector space. The
objective it tries to achieve is to minimize total intra-cluster variance, or, the squared error
function.
The most common form of the algorithm uses an iterative refinement heuristic known
as Lloyd's algorithm. Lloyd's algorithm starts by partitioning the input points into k initial sets,
either at random or using some heuristic data. It then calculates the mean point, or centroid, of
each set. It constructs a new partition by associating each point with the closest centroid. Then
the centroids are recalculated for the new clusters, and algorithm repeated by alternate
application of these two steps until convergence, which is obtained when the points no longer
switch clusters (or alternatively centroids are no longer changed).
Lloyd's algorithm and k-means are often used synonymously, but in reality Lloyd's
algorithm is a heuristic for solving the k-means problem, as with certain combinations of starting
points and centroids, Lloyd's algorithm can in fact converge to the wrong answer. Other
variations exist, but Lloyd's algorithm has remained popular, because it converges extremely
quickly in practice. In terms of performance the algorithm is not guaranteed to return a global
optimum. The quality of the final solution depends largely on the initial set of clusters, and may,
in practice, be much poorer than the global optimum. Since the algorithm is extremely fast, a
common method is to run the algorithm several times and return the best clustering found. A
drawback of the k-means algorithm is that the number of clusters k is an input parameter. An
inappropriate choice of k may yield poor results. The algorithm also assumes that the variance is
an appropriate measure of cluster scatter.
• Simple to use: Extensive hand-labeling of images by experts to generate training data is not
necessary
• Probabilistic framework allows to approximate the actual edge probabilities for optimal
segmentation
Our goal is to segment colors in an automated fashion using the L*a*b* color space and K-
means clustering.
Thematic Mapper 4, 3, and 2 bands cover the near infrared (NIR), the visible red, and the visible
green parts of the electromagnetic spectrum. When they are mapped to the red, green, and blue
planes, respectively, of an RGB image the result is a standard color-infrared (CIR) composite.
The final input argument to multiband read specifies which bands to read, and in which order, so
that you can construct a composite in a single step.
Variable CIR is a 512-by-512-by-3 array of class uint8. It is an RGB image, but with false
colors. When the image is displayed, red signifies the NIR band, green signifies the visible red
band, and blue signifies the visible green band. In the CIR image, water features are very dark.
The LAN file, paris.lan, contains a 7-band 512-by-512 Landsat image. A 128-byte header is
followed by the pixel values, which are band interleaved by line (BIL) in order of increasing
band number. They are stored as unsigned 8-bit integers, in little-endian byte order. The first step
is to read bands 4, 3, and 2 from the LAN file using the MATLAB function multiband read.
Seine River) and green vegetation appears red (parks and shade trees).
Step 2: Convert Image from RGB Color Space to L*a*b* Color Space.
There are three colors: green, blue, and red. Notice how easily you can visually distinguish
these colors from one another. The L*a*b* color space (also known as CIELAB or CIE L*a*b*)
enables you to quantify these visual differences.
The L*a*b* color space is derived from the CIE XYZ tri stimulus values. The L*a*b* space
consists of a luminosity layer 'L*', chromaticity-layer 'a*' indicating where color falls along the
red-green axis, and chromaticity-layer 'b*' indicating where the color falls along the blue-green
axis. All of the color information is in the 'a*' and 'b*' layers. You can measure the difference
between two colors using the Euclidean distance metric.
Convert the image to L*a*b* color space using makecform and applycform.
MAKECFORM Create a color transformation structure.
makecform(TYPE) creates the color transformation structure, C
that defines the color space conversion specified by TYPE. To
perform the transformation, pass the color transformation structure