Вы находитесь на странице: 1из 13

Leaf Image Analysis towards Plant Identification

Debnath Bhattacharyya1, Tai-hoon Kim2,*, and Gang-soo Lee3


1
Information Technology Department,
MCKV Institute of Engineering, Liluah,
Howrah – 711204, India
debnathb@gmail.com
2
GVSA and School of Information Science
Unviersity of Tasmania, Australia
taihoonk@utas.edu.au
3
Department of Computer Engineering
Hannam University, Daejeon, Korea
gslee@hnu.kr

Abstract. Plants can be classified and identified by naturally or artificially as per


the botanists. Plants can be identified by their leaves also. There are different
varieties of plants grown throughout the world. Their identifications are studied
using various laboratory methods. The morphological and genetically
characteristics are employed to classify different leafs as well as plants. However,
the presence of wide morphological varieties through evolution among the various
leaf cultivars made it more complex and difficult to classify them. Leaf structures
play a very crucial role in determining the characteristics of a plant. The broad and
narrow shaped leaves, leaf arrangement, leaf margin characteristics features which
differentiate various leaf of a plant. In this paper, we propose the methods to
identify the leaf using an image analysis based approach.

Keywords: image processing, pattern recognition, image segmentation,


classification.

1 Introduction

The Plants are basically identified by their leaves. There are different varieties of trees
grown throughout the world. Some are important cash crop. Some are used in medicine.
The tree identification is very important in day to day life. Their identifications had been
studied using various laboratory methods [2, 3, 4]. The morphological and genetically
characteristics were employed to classify different leafs. However, the presence of wide
morphological varieties through evolution among the various leaf cultivars made it more
complex and difficult to classify them. Therefore manual identification as well as
classification of these leaves is a tedious task. During the last few decades computational
biologists have studied various diversities among leaf due to huge number of
evolutionary changes [1, 5, 6]. Leaf structures play a very crucial role in determining the

*
Corresponding author.

T.-h. Kim et al. (Eds.): SIP 2011, CCIS 260, pp. 113–125, 2011.
© Springer-Verlag Berlin Heidelberg 2011
114 D. Bhattacharyya, T.-h. Kim, and G.-s. Lee

characteristics of a plant. The broad and narrow shaped leaves, leaf arrangement, leaf
margin etc. characteristics features which differentiate various leaf of a tree. This project
proposed the methods to identify the leaf using an image analysis based approach.
The Leaf structure: To identify a leaf, first we have to know the structure of the
leaf. The following is the description of the structure of the leaf. An example of a
sample jpeg image of a leaf structure is shown in Fig. 1.
An entire leaf has the following parts:-
a. The Leaf Apex: The Apex is the tip of the leaf.
b. The Leaf Blade: The blade is the entire leaf unit. Sometimes this is made
up of several smaller leaflets.
c. The Leaf Margin: The margin is the term used to describe the edge of the
leaf.
d. The Leaf Base: Base is the name given to the part of the blade that is
closest to the stem.
e. The Petiole: The leaf parts that is closest to the stem. The Petiole is the
thin stalk that connects the leaf blade to the stem.
f. The Stipule: The leaf parts that is closest to the stem. Stipules are tiny
leaf-like structures that may or may not be present at the base of the
petiole.
g. Petiolate and Sessile: Leaves which have petioles are called Petiolate. If a
leaf has no petiole it is called Sessile.
Importance of Identifying Leaf:
Identify Plants: If we can able to identify leaf, we can easily able to identify plants.
Disease identification: The structure of the leaf will be changed, if the leaf is
affected by some disease. So, we can quickly identify the disease of the plant [7, 8, 9].
If the disease is early detected then we can take the step to prevent the diseases that
occur in the plant. In this way we prevent the production fall in agriculture system. In
this way we can achieve green revolution which helps our country to grow rapidly.
Plant leaves classification: If we can able to identify leaf, we can easily able to
classify the plants.
Pollen leaves classification: If we can able to identify leaf, we can easily able to
classify the pollen [10]. It is found that pollen grain of some plant families is
responsible for producing allergy or asthma of human being. So this type of pollen
should be identified.
Seed classification: If we can able to identify leaf, we can easily able to classify the
seed [11]. This classification helps the farmer very much.
Crop Productivity: If we can able to identify the leaves, it helps to identify the
different varieties of seeds, diseases, changing the color of leaves in heat, etc. This
helps in crop productivity.
Identification of individual crop plants in the field: If we can able to identify the
leaves, it helps to identify of individual crop plants in the field and locating their exact
position of the crop [12]. Thus it helps in modern automated farming.
Leaf Image Analysis towards Plant Identification 115

Recreation: Many people enjoy gardening or home landscape work. They need to
identify which plants are useful and which ones are weeds. Other people simply like
being able to name the plants they see when they walk through a city park or down a
country road.
Food and Fiber: It is useful to be able to recognize plants which are sources of food
(fruits, vegetables, spices, etc.) and fiber (lumber, firewood, paper pulp, etc.). This
ability can also be profitable.
Career Skill: The ability to identify plant materials is an important skill for people who
are interested in horticulture, forestry, landscape architecture, parks and recreation,
botany, and a number of other career areas.

2 Previous Works

In this paper several methods [9] for recognizing of leaf are proposed. They are
described below:-

• Leaf Blade Shape


• Simple and Compound Leaf Grouping
• Leaf Arrangement
• Leaf Venation
• Leaf Margin
• In this paper we will discuss all the above points.

2.1 Leaf Blade Shape

The first step in identifying a plant is to look at its leaves and determine their shape,
and grouping.
One way to identify a plant is by the shape of its leaf blade (Fig. 2). The main part
of a leaf is the blade. There are nine major shape categories:

• Linear: Narrow, with parallel or nearly parallel sides, (Fig. 3).


• Lanceolate: Lance shaped, longer than broad and tapering to a point at the tip
(Fig. 4).
• Oblong: Much longer than wide, with nearly parallel sides (Fig. 5).
• Elliptic: Shaped like an ellipse, wider at the center and tapering to equal tips
(Fig. 6).
• Ovate: Egg-shaped, broader at the base than the tip (Fig. 7).
• Cordate: Heart-shaped, either the leaf shape or base (Fig. 8).
• Reniform: Kidney-shaped (Fig. 9).
• Spatulate: In the shape of a spatula or spoon (Fig. 10).
• Orbicular: Round (Fig. 11).
116 D. Bhattacharyya, T.-h. Kim, and G.-s. Lee

Fig. 1. Entire leaf structure

Fig. 2. Leaf blade shape Fig. 3. Leaf with linear blade shape

Fig. 4. Leaf lanceolate blade shape Fig. 5. Leaf with oblong blade shape

Fig. 6. Leaf elliptic blade shape Fig. 7. Leaf with ovate blade shape
Leaf Image Analysis towards Plant Identification 117

Fig. 8. Leaf cordate blade shape Fig. 9. Leaf with reniform


blade shape

Fig. 10. Leaf spatulate blade shape Fig. 11. Leaf with orbicular blade shape

2.2 Leaf Grouping

After determination of the leaf shape, we need to examine how the leaves are grouped
on the stem. This is called Leaf Grouping. There are two types of leaf grouping.
Simple Grouping: Some leaves are simple, meaning they appear alone (Fig. 12).
There are three main types of Simple leaf groupings as follows:
• Entire
• Palmately lobed
• Pinnately lobed
Compound Grouping: Other leaves are compound, meaning that they appear in
groups and are made up of leaflets. Three kinds of compound leaves are as follows:
• Palmate: Here leaflets form a fan shape (Fig. 13).
• Pinnate: Leaflets are opposite each other on the stem (Fig. 14).
• Bipinnate: Leaflets are in pairs of pinnate groups (Fig. 15).

Fig. 12. Leaf with simple grouping Fig. 13. Leaf with palmate
compound grouping
118 D. Bhattacharyya, T.-h. Kim, and G.-s. Lee

Fig. 14. Leaf with pinnate compound grouping Fig. 15. Leaf with orbicular
blade shape

2.3 Leaf Arrangement

After we have determined the shape and grouping of the plant's leaves, we need to
examine the arrangement of the leaves on the stem. Here are the three principle ways
leaves are arranged on the stem.
• Opposite: Leaves are directly opposite each other on the stem (Fig. 16).
• Alternate: Leaves are on both sides, but not directly across from one another
(Fig. 17).
• Whorled: Leaves are arranged alternately around the stem.

Fig. 16. Opposite leaves arrangement Fig. 17. Alternate Leaves


arrangement

2.4 Leaf Venation


The lines that appear on the surface of a leaf and look like blood vessels are called
Veins. The two main types are Palmate and Pinnate (Fig. 18 and Fig. 19, respectively).

2.5 Leaf Margin

After we have determined the shape and grouping of the plant's leaves, the arrangement
of the leaves on the stem, and the structure of the leaf veins, we need to examine the leaf
margin characteristics.
The shape of the edge or margin of a leaf is another way it can be identified. Three
kinds of margins are Entire, Toothed, and Lobed.
Leaf Image Analysis towards Plant Identification 119

Fig. 18. Leaf with palmate leaf venation Fig. 19. Leaf with pinnate leaf
Venation

Entire Margins: Margin smooth no bumps (Fig. 20).


Toothed Margins: A number of margins are named by the kinds of "teeth" they
have (Fig. 21 and Fig. 22). Leaf Margins are as follows:
• Toothed: shallow bumps;
• Dentate: pointing outward;
• Serrate: pointing to the leaf's tip;
• Crenate: broad and round; and
• Incised: deeply cut with sharp, irregular teeth.

Fig. 20. Leaf with smooth margin Fig. 21. Toothed leaf margin

Fig. 22. Leaves with various types of toothed margins


120 D. Bhattacharyya, T.-h. Kim, and G.-s. Lee

Lobed Margins: Leaves with lobed margins are as follows (Fig. 23):
• Lobed: deep indents or sinuses;
• Sinuate: very wavy margin; and
• Undulate: wavy margin, but not as wavy as a sinuate margin.

Fig. 23. Leaves with lobed margins

3 Our Work
In this paper we applied some methods and algorithms to identify the leaf. These are
explained as follows:

3.1 Algorithms for Database Creation


The leaf samples are stored in the database. The images are in .bmp format. Here we
consider only the leaf margins. There are nine types of leaf margins. These nine types
of leaves are stored in the database. Algorithm is stated hereunder:
Step 1: Start
Step 2: List all .bmp files from the folder
Step 3: Initialize the variable Numfids with total no. of listed .bmp files
Step 4: Declare an array to store the loaded .bmp files
Step 5: Read the .bmp files from the folder
Step 6: Store the loaded .bmp file in an array for future use
Step 7: Stop

3.2 Algorithms for Preprocessing of Images in the Database

The leaf samples which are stored in the database are scaled in standard size and then
converted into gray scale image. From these gray scale images edge are detected using
‘sobel’ technique. Algorithm is given hereunder:
Step 1: Start
Step 2: Initialize the variable Numfids with total no. of listed .bmp files
Step 3: Read the images from the stored array
Step 4: Scale the images with size 186,200
Step 5: Convert the color images to gray scale images
Step 6: Identify the edge of the gray images using sobel technique
Step 7: Store the properties of the detected edge in an array
Step 8: Stop
Leaf Image Analysis towards Plant Identification 121

3.3 Finding Out the Mathematical Characteristics of the Stored Image in the
Database

The leaf samples which are stored in the database edges are identified. These edges
are logical in nature. The edges area will store ‘1’ in the array. From that array we
calculate the correlation coefficient for each image and store in another array. The
correlation coefficient is calculated with the following formula mentioned below.
The correlation coefficient of x and y is:

Correlation coefficient=cov(x,y)/σx* σy
Where cov(x,y) is covariance of x and y,
σx and σy are standard deviation of the values of x and y respectively.

σ2x = 1/n ∑ x2i – (1/n ∑ xi)2 and σ2y= 1/n ∑ y2i – (1/n ∑ yi)2
The covariance of x and y is
n
Cov(x,y)= 1/n ∑ ( xi- X) ( yi-Y)
i=1

where X and Y are the arithmetic means of the value assumed by x and y respectively,
X=1/n(x1+x2+……………+xn)
and Y= 1/n(x1+x2+……………+xn)
The algorithm is given hereunder:
Step 1: Start
Step 2: Initialize the variable Numfids with total no. of listed .bmp files
Step 3: Read the array which store the detected edge properties of the stored
images
Step 4: Find out the correlation coefficient from the array
Step 5: Store the correlation coefficient value in an array
Step 6: Stop

3.4 Algorithms for Preprocessing of Input Image and Mathematical


Characteristic

Same algorithms are used to process the input image for identification, from the
algorithm stated in 3.2 to 3.3.

3.5 Algorithms for Comparison of Characteristics of the Input Image with the
Stored Images

After the mathematical calculation we get the value of correlation coefficient of the
stored images and the input images. Then we calculate the matching percentage with
the following formula:
122 D. Bhattacharyya, T.-h. Kim, and G.-s. Lee

Percentage = (Correlation coefficient of each stored image/ Correlation coefficient


of the input image)*100;
The matching percentage if greater than 80% then we say 'image is fully matched'.
If it is greater than 60 but less than 80 then we say 'image is partially matched' else we
say that 'image is not matched'.
Step 1: Start
Step 2: Initialize the variable Numfids with total no. of listed .bmp files
Step 3: Read the array which store the correlation coefficient value of the stored
images
Step 4: If Correlation coefficient of the stored image=correlation coefficient of
the input image?
Then Print “Image is found” Else Print “Image is not found”
Step 5: Stop

3.6 Another Algorithms for Comparison of Characteristics of the Input Image


with the Stored Images

Step 1: Start
Step 2: Initialize the variable Numfids with total no. of listed .bmp files
Step 3: Read the array which store the correlation coefficient value of the stored
images
Step 4: Calculate Matching percentage= (Correlation coefficient of the individual
stored image/correlation coefficient of the input image)*100
Step 5: Store the matching percentage in an array
Step 6: Initialize the variable Per with the maximum matching value from the
array which stores the matching percentage value
Step 7: if per>=80 then Print “image is fully matched”
Step 8: if (per>=60) && (per<80) Print “image is partially matched, Else Print
“image is not matched”
Step 9: Stop

4 Result and Analysis

Table 1, shows the data of input file and Table 2 shows the detail experimental result.
Fig. 24 to Fig. 27 shows the Algorithm output. Fig. 28 shows the Matlab implementation
shot.

Table 1. Detail information of input leaf

Serial no Name of input file Value of the correlation


coefficient of the input file
1. toothed_large.bmp -0.0911
Leaf Image Analysis towards Plant Identification 123

Table 2. Detail of output information after matching

Value of the Matching


Serial Name of correlation percentage with Output
no stored files coefficient of the the input image
stored file file
Crenate.bmp 0.0891 -97.7624 image is not
1.
found
Dentate.bmp 0.0619 -67.9869 image is not
2.
found
Entire.bmp 0.0344 -37.7459 image is not
3.
found
0.1592 -174.8101 image is not
4. Irregular.bmp
found
Lobed.bmp 0.0159 -17.4670 image is not
5.
found
Serrate.bmp 0.1006 -110.4112 image is not
6.
found
Sinuate.bmp -0.1114 81.8079 image is not
7.
found
Undulate.bmp 0.2544 -279.2548 image is not
8.
found
Image is
matches up to
Toothed.bmp -0.0911 100 100 %
9.
image is fully
matched

Fig. 24. Original Toothed Leaf Fig. 25. Toothed Leaf after scaling
124 D. Bhattacharyya, T.-h. Kim, and G.-s. Lee

Fig. 26. Leaf after gray conversion Fig. 27. After Edge Detection

Fig. 28. Matlab Implementation Screen shot

5 Conclusion
This approach is useful in MLP classifier that helps in identification of various plant
based on its morphological or genetically characteristics. Since the leaf structure
varies in different cultivars, the outcome of the present work clearly shows the exact
characteristics present in each variety of leaves. The figure of percentage required for
the presence of similar characteristics of different leaves. For a more detailed and
accurate classification other morphological features like fruits, stem, flower, and root
can be considered for further processing.

Acknowledgement. This work was supported by the Security Engineering Research


Center, granted by the Korea Ministry of Knowledge Economy.
Leaf Image Analysis towards Plant Identification 125

References
1. Sanyal, P., Bhattacharya, U., Bandyopadhyay, S.: Analysis of SEM Images of Stomata of
Different Tomato Cultivars Based on Morphological Features. In: IEEE International
Conference, AMS 2008 (2008)
2. Sabino, D.M.U., da, L., Costa, F., Rizzatti, E.G., Zago, M.A.: A texture approach to
leukocyte recognition. Real-Time Imaging 10, 205–216 (2004)
3. Soille, P.: Morphological image analysis applied to crop field mapping. Image and Vision
Computing 18(13), 1025–1032 (2000)
4. Stojanovic, R., Mitropoulos, P., Koulamas, C., Karayiannis, Y., Koubias, S.,
Papadopoulos, G.: Real-time vision-based system for textile fabric inspection. Real-Time
Imaging 7(6), 507–518 (2001)
5. Polder, G., van der Heijden, G.W.A.M., Young, I.T.: Hyperspectral Image Analysis For
Measuring Ripeness Of Tomatoes. In: ASAE International Meeting (2000)
6. Tzionas, P., Papadakis, E.S., Manolakis, D.: Plant leaves classification based on
morphological features and a fuzzy surface selection technique. In: Fifth International
Conference on Technology and Automation, Thessaloniki, Greece, pp. 365–370 (2005)
7. Zhao-yan, L., Fang, C., Yi-bin, Y., Xiu-qin, R.: Identification of rice seed varieties using
neural network. Journal of Zhejiang University SCIENCE , 1095–1100 (2005)
8. Neuman, M., Sapirstein, H.D., Shwedyk, E., Bushuk, W.: Wheat grain color analysis by
digital image processing: I. Methodology. J. Cereal Sci. 10, 175–182 (1989)
9. Damian, M., Cernadas, E., Formella, A., Sa-Otero, P.M.: Pollen classification of three
types of plants of the family Urticaceae,
http://trevinca.ei.uvigo.es/~formella/inv/aaa/
formella-2002-pollen.pdf,
http://cas.psu.edu/docs/CASDEPT/Hort/LeafID/Arrangement.html
10. Slaughter, D.C., Harrell, R.C.: Discriminating fruit in a natural outdoor scene for robotic
harvest. Trans. Of the ASAE 32(2), 757–763 (1989)
11. Tian, L., Slaughter, D.C., Norris, R.F.: Machine Vision Identification Of Tomato
Seedlings For Automated Weed Control,
http://www.age.uiuc.edu/faculty/lft/papers/tomato.pdf
12. Sanyal, P., Bhattacharya, U., Parui, S.K., Bandyopadhyay, S.K.: Color Texture Analysis of
Rice Leaves for Detection of Blast Disease. In: 20th CSI Conference Proceedings, pp. 45–48
(2006)

Вам также может понравиться