Академический Документы
Профессиональный Документы
Культура Документы
UMI
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
Using Image Analysis and GIS for coffee Mapping
Sandra Bolanos
Department of Geography
McGill University, Montreal
April, 2007
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
1*1 Library and
Archives Canada
Bibliotheque et
Archives Canada
NOTICE: AVIS:
The author has granted a non L'auteur a accorde une licence non exclusive
exclusive license allowing Library permettant a la Bibliotheque et Archives
and Archives Canada to reproduce, Canada de reproduire, publier, archiver,
publish, archive, preserve, conserve, sauvegarder, conserver, transmettre au public
communicate to the public by par telecommunication ou par Nntemet, preter,
telecommunication or on the Internet, distribuer et vendre des theses partout dans
loan, distribute and sell theses le monde, a des fins commerciales ou autres,
worldwide, for commercial or non sur support microforme, papier, electronique
commercial purposes, in microform, et/ou autres formats.
paper, electronic and/or any other
formats.
Canada
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
To my husband, fo r his unconditional support,
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Abstract
Currently, the system that developing countries use to map coffee crops is
map coffee crops in a cost-effective way, they would have an advantage in terms of
setting the prices and getting a higher return for their product.
mapped using the spectral and spatial information derived from satellite images and
topographic data. I also believe that during the spatial analysis of the data, the
information about the shape and context of a particular area) provides more
Colombia of 70km2 each. I integrated satellite images and topographic grids using
iii
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Abstract
coffee systems. Then in a second step, the classification was refined using decision
rules from the statistical analysis of objects (composed of at least 3 pixels). Finally,
using the same objects, the classification was refined further, through linear spectral
unmixing.
The results show that the rule-based classification combined with an object-based
approach can improve the overall accuracy of the classification by 3% and by 17.47%
for the coffee class when compared to the pixel-based analysis. The linear spectral
unmixing, however, does not contribute to improve the classification accuracy when
Landsat images are used to map the different types of coffee systems
iv
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Resume
pour cartographier les recoltes de cafe est cher, penible et imprecis. Ainsi, les cartes
de cafe ne peuvent etre mises a jour frequemment et la production ne peut pas etre
estimee avec exactitude. Si ces pays pouvaient cartographier les recoltes de cafe d'une
maniere plus economique, ils auraient un avantage pour fixer les prix et obtenir des
Je pose ici lhypothese que les zones de cafe peuvent etre correctement
l'analyse spatiale des donnees, l'information additionnelle foumie par une approche
systemes de cafe dans trois secteurs d'etude en Colombie de 70km2 chacun. J'ai
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Resume
integre des images satellites et des grilles topographiques en utilisant les 2 approches
"pixel-basee et object-basee dans une approche hierarchique et les resultats ont ete
compares. Des objets ont ete formes sur la base de la similitude spectrale des bandes
de cafe. Alors, dans une deuxieme etape, la classification est affinee en utilisant des
regies de decision sur l'analyse statistique de plus petits objets (composes de 5 pixel
Les resultats prouvent que la classification basee sur les regies a combine avec
3% et par 17.47% pour la classe de cafe une fois comparee a l'analyse Pixel-basee. Le
classification quand des images de Landsat sont employees pour tracer les differents
vi
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Acknowledgements
This research could not be achieved without the support of the J.W.
and without the support of both; my advisor, Dr. Raja Sengupta, at McGill University
and Dr. Thomas Oberthur, my mentor at the International Center for Tropical
for providing financial support for this research through the macro project New
McGill University, for the Warren Fellowship in GIS I got. I sincerely thank to the
professors and classmates at the Geography department for their valuable support
I also want to acknowledge German Escobar, Herman Usma and Natalia Uribe
for their technical support during the fieldwork. I want to give special
vii
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Acknowledgements
and financial support for my fieldwork. I hope this methodology can be integrated
into their information system and improve the strategic planning by providing a better
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Contents
A B S T R A C T ............................................................................................................................................................................I l l
R E S U M E ...................................................................................................................................................................................V
C O N T E N T S ...........................................................................................................................................................................IX
L IS T O F T A B L E S ................................................................................................... XI
L IS T O F F I G U R E S ..........................................................................................................................................................X I I
I N T R O D U C T I O N ................................................................................................................................................................ 1
1 .1 . R e s e a r c h O b je c t iv e s ............................................................................................. :.................................... 4
1.1.1 What are the physical features that characterize different coffee systems?...................... 4
1.1.2 What is the accuracy that can be obtained as a result o f combining spectral and
elevation data to map coffee systems?.................................................................................................. 4
S T U D Y A R E A A N D D A T A ....................................... 6
2.1 S t u d y A r e a .........................................................................................................................................................6
2 .2 G IS a n d I m a g e D a t a : ....................................................................................................................................8
2 .3 F ie l d W o r k M e t h o d o l o g y ..................................................................................................................... 10
C O N T E X T ......................................... 13
3.1 T h e U s e o f S at el l it e Im a g e r y f o r C r o p M a p p i n g ...................................................................13
3 .2 T h e U s e o f I m a g e A n a l y s is f o r C o ffe e M a p p i n g ..................................................................... 16
3 .3 T h e U s e o f O bject B a s e d Im a g e A n a l y s i s ............................................. !.....................................19
3 .4 In t e g r a t e d G e o g r a p h ic I n f o r m a t i o n S y s t e m s .............................. 20
3 .5 T h e U s e o f D ig it a l E l e v a t io n M o d e l s (D E M s ) f o r L a n d C o v e r A n a l y s i s 24
M E T H O D S .................................................................................. 25
4 .1 D E M E v a l u a t i o n .......................................................................... 25
4 .2 I m a g e P r e p r o c e s s in g ................................................................................................................................. 26
4.2.1 Radiometric Correction.........................................................................................................27
4.2.2 Terrain Illumination Correction........................................................................................... 30
4.3.4 Soil Background Influence:...................................................................................................33
4 .4 H ie r a r c h ic a l C l a s s if ic a t io n ..................................................... :......................................................33
4 .5 S e g m e n t a t io n : b u il d in g t h e u n it s f o r o bjec t - b a s e d a n a l y s i s .......................................34
4 .6 S c a l e effec ts o n im a g e s e g m e n t a t i o n .......................................................................................... 35
4 .7 S h a p e a s a d d i t io n a l fe a t u r e f o r l a n d c o v e r c h a r a c t e r i z a t i o n : .............................37
4 .8 S i g n a t u r e E v a l u a t io n : S t a t is t ic a l a n a l y s is o f s a m p l e s ..................................................3 8
4 .9 R u l e - b a s e d im a g e c l a s s if ic a t io n .......................................................................................................42
4 .1 0 th e L in e a r M ix in g M o d e l f o r S p e c t r a l U n m ix in g i n c o f f e e f ie l d s !............. 44
4 .1 1 A c c u r a c y A s s e s s m e n t ...............................................................................................................................47
R E S U L T S ................................................................................................................................................................................ 52
5 .1 D E M E v a l u a t io n .....................................................................................................................................5 2
ix
with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Contents
5 .2 I m a g e P r e p r o c e s s in g : ................................................................................................................. 54
5 .3 P r e l im in a r y C l a s s if ic a t io n :.................................................................................................................54
5 .4 E x p l o r a t o r y A n a l y s is : P ixel B a s e d A n a l y s is ............................................................. 57
5.4.1 ND VI vs. percentage o f shade............................................................................................. 57
5.4.2 Spectral separation between coffee and non-coffee fields...................................... 58
5 .5 P ixel - b a s e d c l a s s if ic a t i o n ................................................................................................................... 60
5 .6 Im a g e S e g m e n t a t i o n : C h o o s i n g t h e A p p r o p r ia t e S p a t ia l U n i t s ................................ 63
5 .7 S ig n a t u r e A n a l y s is : .................................................................................................................................. 65
5 .8 R u l e d - b a s e d C l a s s if ic a t io n u s in g o b je c t s : ................................................................................68
5 .9 E v a l u a t in g L in e a r S pe c t r a l U n m i x in g T o M a p d if f e r e n t ty pe s o f c o ffe e
s y s t e m s ................................................................................................................................................................................ 73
5 .1 0 A c c u r a c y A s s e s s m e n t .............................................................................................................................. 75
D IS C U S S IO N ...................... 79
SUMMARY A N D CONCLUSIO NS.....................................................................................................86
BIBLIOGRAPHY.................................................................................. 89
ANNEXES
ETM+ CALIBRATION PARAMETERS............................................................................................ 101
D ATA DISTRIBUTION: SPECTRAL A N D TOPOGRAPHIC CHARACTERISTICS 102
BOX PLOTS FOR COFFEE SYSTEMS VS. NON-COFFEE SYSTEMS.................................... 107
NORMALITY TEST................................................................................
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
List of Tables
Table 2-1: Spectral Bands in Landsat ETM+ image................................... 8
Table 5-2. Differences between coffee and non-coffee systems..................................60
Table 5-3: Features ranges used in pixel-based classification..................................... 61
Table 5-4: Accuracy statistics for per-pixel classification........................................... 61
Table 5-5 Comparison between mean objects and pixels values fo coffee..............64
Table 5-6 Comparison between mean objects and pixels values for Non-coffee ... 65
Table 5-7: Non-significant differences between coffee and non-coffee systems.
Radiometric features.............................................................................................. 67
Table 5-8: Significant differences between coffee- and non-coffee systems Shape
features.............................................................................................................................. 67
Table 5-9: Non-significant differences between coffee and non-coffee systems:
Shape Features................................................................................................................. 67
Table 5-10: Range values for object-based classification.................... 68
Table 5-11: Accuracy statistics for object based classification................................ 69
Table 5-12: Accuracy statistics after the spectral unmixing........................................ 76
Table 6-13: Comparison between different methodologies......................................... 79
xi
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
List of Figures
Figure 2-1: Study Area and distribution of Sampling Points......................................... 7
Figure 2-2: Structure for Field Sampling......................................................................12
Figure 3-3: Spectral Behavior of Vegetation....................... 14
Figure 3-4: Representation of polygon classification functionality.............................23
Figure 4-5: Box Plot Diagram............................................. 39
Figure 4-6: The Spectral Angle Mapping in a two-dimensional space....................... 46
Figure 4-7: Methodological Sequence...........................................................................51
Figure 5-8: DEM evaluation...........................................................................................53
Figure 5-9: Overlap between image and rivers generated from D E M ........................53
Figure 5-10: Terrain illumination correction................................................................ 54
Figure 5-11: Preliminary classification..................................................... 56
Figure 5-12: Correlation between percentage of cover and N D V I.............................58
Figure 5-13: Scatter plot in all bands for coffee and non-coffee areas....................... 59
Figure 5-14: Box Plot for both land cover classes in band 1 ........ 60
Figure 5-15: Ruled-Based classification map obtained from pixel-based analysis... 62
Figure 5-16: Omnidirectional variogram.......................................................................63
Figure 5-17: Ruled-Based classification using object-based analysis........................ 70
Figure 5-18: Spatial distribution of coffee systems in the area....................................72
Figure 5-19: Spectral separation between mean values of coffee and non-coffee areas
in bands blue and green.............................................. 73
Figure 5-20: Spectral separation between mean values of coffee systems and non
coffee areas, in bands green and red ............................................................................... 74
Figure 5-21: Classification map resulting from the linear spectral unmixing 75
Figure 5-22: Spatial distribution of the error: the RMS image.................................... 78
xii
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Chapter 1
Introduction
Coffee ranks as one of the world's most important commodities, being second in
petroleum, and represents the main source of income for 25 million families on the
hillsides of Latin America, Southeast Asia and East Africa (Gresser and Tickell, 2002).
In Colombia, coffee has traditionally been an essential part of the economy, the social
networks, and the culture in the tropical mountain forest (Garcia, 2001).
Coffees role in the economy of developing countries has always been of primary
importance: in 2000, coffee represented 79% of the exports in Burundi, 54% in Ethiopia,
43% in Uganda, 31% in Rwanda and between 7 and 24% for many other countries in
Latin America (Gresser and Tickell, 2002; Varangais et al., 2003). However, in 2002,
coffee reached its lowest historical price. Continuous global overproduction caused
coffee prices to fall below production cost for many producers (Gresser and Tickell,
2002), which led to social and economic problems such as unemployment, migration and
diminishing well being (Del Corral et al., 2002; Hoyos, 2002; Ponte, 2002; Varangais et
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Introduction
productivity. This has generated environmental problems due to deforestation and the
use of agrochemicals (Evenson and Gollin, 2003, Lee, 2002; Roach, 2003; Smithsonian,
1997a; Rainforest Alliance, 2000). Considering the strong impact on the environment,
many international certification programs are using differential coffee brands to promote
the use of shade coffee (Rainforest Alliance, 2000, Smithsonian 1997b, Dietsh et al.
2004, Perfecto et al. 2005) which requires less fertilizers due to improved soil
management (i.e., leaf litter from the trees serves to fertilize the soil), thus influencing
grow specialty coffee, which benefits from shade (Muschler, 2001). Shaded systems also
contribute to food security and economic income from accompanying species (e.g. fruits).
However, the inventory and monitoring of areas of shade coffee plantations is a tedious
task with the current methodologies. Therefore, is not clear to what extent areas suitable
Also, not much has been done to evaluate the impacts of changes in coffee
systems on the environment in large geographical regions. Any evaluation of the impacts
methodologies for coffee mapping have yet to be developed (Croome, 1989; Moreira,
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Introduction
2004) because the existing methods are expensive (e.g. use of infrared photography by
NASA, 2005 or IKONOS by Widayatti et al, 2005), inefficient (use of manual editing
Landsat data provides an affordable means of mapping vegetation, and has been
widely used for crop mapping and monitoring. Many studies have focused on few
agricultural crops, specifically on sugar cane, wheat, maize, and rice, which have a
seasonal pattern of growth (Maas 1988, Van Niel and Me Vicar, 2004, Wall et al, 1984,
Zwiggelaar, 1998). But permanent crops, such as coffee have not yet been analyzed to
the same extent. Due to the spectral similarity with forest, shade-coffee systems have
been difficult to reliably map under different shade densities (Moreira, 2004).
research evaluated the potential of merging GIS data and Landsat imagery to characterize
diverse coffee systems using an integrated approach. The proposed approach consisted
of image and spatial data segmentation for landscape analysis. The main hypothesis is
that different coffee growing systems can be mapped using spectral information derived
from Landsat imagery and topographic features derived from DEMs. It is also
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Introduction
1.1.1 What are the physical features that characterize different coffee systems?
It is well known that coffee grows on hillsides. Therefore, it is expected that the
degree of slope can be used as complementary information for determining the existence
associated with the amount of solar radiation (Matoso et al, 2004). The correlations
leaf area index (Asrar et. al., 1984), plant health (Apan, 2004) and yield (Doberman and
Ping, 2004; Hatfield, 1983; Vina et al., 2004) have been widely analyzed in the literature
(Moran et al., 1997; Moulin et al., 1998; Wiegand et al., 1991; Zwiggelaar, 1988).
Therefore, it is hypothesized that the type of coffee system (shade, non-shade) can be
identified using specific topographic (altitude, slope and aspect) and spectral signatures.
1.1.2 What is the accuracy that can be obtained as a result o f combining spectral and
analysis should be done to evaluate their differences. If differences exist for some
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Introduction
that the accuracy will be lower for shaded systems, due to the confusion with forest and
To answer the above questions, I combine topographic (altitude slope and aspect)
and spectral data to differentiate between the two coffee systems, from question 1 above.
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Chapter 2
The study area encompasses the region of coffee cultivation in the Department of
7650 W longitude, elevation values between 1270 to 2070 meters above sea level, and
slopes between 15 and 30 degrees. With precipitation ranges from 1200-1800 mm per
year and temperatures from 8 C to 30 C, the natural vegetation in the area consists
DAP A - funded by the German Ministry for International Cooperation and Development
Positioning System (GPS) device that provided the latitude, longitude, and elevation
value. DAPAs main objective is to help farmers to make decisions that lead to the
production and marketing of high value products (such as tropical fruits, nuts, specialty
coffees and medicinal plants) for diversification on particular farm sites not suitable to
sustain commodity coffee production. Fig.2-1 shows the location of the study area and
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Figure 2-1: Study Area and distribution of Sampling Points
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Study Area and Data
A Landsat image ETM+path 09 row 059 from August 24th 2001. The image was
obtained from GLCF1 web site. The spectral information carried by this image is
Nominal Spatial
Spectral Bands Colour
Resolution (m)
28.5 0.450-0.515 Blue
28.5 0.525 -0.605 Green
28.5 0.630 - 0.690 Red
28.5 0.760 - 0.900 Near IR
28.5 1.550 - 1.750 MidIR
60 10.40 -12.5 Thermal IR
28.5 2.080 - 2.35 MidIR
12.5 0.52 - 0.92 Panchromatic
T able 2-1: Spectral Bands in Landsat E' TV1+ image
A total of 26,355 points that contains the geographic position of the all coffee
farms surveyed by a Garmin eTrex GPS system in the three municipalities. This
dataset belongs to the coffee information system -SICA-, built and managed by
FNC. It represents an exhaustive sampling of all coffee farms in the study area.
In addition the X,Y and Z coordinates, this data set contains the farm size, age of
the plantation, density (i.e. number of plants), variety, and illumination conditions
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Study Area and Data
longitude, and elevation in the center of each farm. Those farms were located in
the illumination conditions (percentage of cover over the coffee) and geographic
selected on the criterion that the minimum percentage of shade was 30%. The
in the study area, the sampling contains mainly agroforestry systems (coffee
grown with citric fruits and/or introduced species of secondary forest such as Inga
300 points randomly taken in areas that do not contain any type of coffee system
at all (called non-coffee areas, from this point forward). Those points were taken
by visual interpretation of the image, and were selected due to a) their spectral
similarity with coffee systems and b) their location: away from areas with high
density of coffee farms, determined by the SICA inventory (see above). This set
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Study Area and Data
of points was divided in two groups: 140 points for training and 140 points for
accuracy assessment.
The coffee systems can be extensively classified as: a) monoculture, also known
plantain over the coffee bushes and c) agroforestry systems (partially shaded coffee).
Sites were picked, in an attempt to cover as much variation as possible. In the first step, a
reconnaissance trip was done in order to test techniques for data collection in the field
a representative4 square of 30 meters long was selected (Figure 2-2a) and georeferenced
with a Trimble Pro-XRS GPS system provided with OmniSTAR real-time differential
correction. A real-time differential correction occurs when the base station (a GPS
receiver) compares its precisely known location to the position co-ordinates as obtained
from satellite signals to estimate the errors in the satellite signal as it receives the data
(Leick, 1995). The correction is calculated by the difference between the calculated and
known coordinates and broadcasted to the roving GPS receiver via radio signal if the
source is land based, or via a satellite signal if it is satellite based. The correction is then
applied to the position the differential GPS is calculating to correct its positional estimate.
The premise underlying this concept is that the receivers will experience similar
10
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Study Area and Data
atmospheric errors. Differential correction reduces the error in location from some
Crown closure5, also called canopy closure (i.e. % of shade over coffee) was
measured by the amount of ground area the tree crowns occupies (see Fig. 2-2b).
Transects located within the polygon were used to measure canopy closure. The percent
intervals along the transects. Each diagonal (in north East and South West directions)
was surveyed with a tubular densitometer by walking on it and stopping every two
meters, looking up and counting the number of times a tree was at the crosshairs (taken as
a positive record). The number of positives divided by the total number of readings (i.e.
the length of the diagonal divided by two) gives us the percent of tree canopy cover over
the coffee field within the sample area, which is taken as a representative of the
percentage of cover over the coffee in the whole farm. Given that our pixel is 30m
length, the minimum farm size was one hectare. Figure 2-2 shows the structure for
sampling. .
5The progressive reduction o f space between crowns as they spread laterally, increasing canopy cover.
11
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Study Area and Data
j
\i
30 m
X
i1
N
/
Figure 2-2: Structure for Field Sampling: a) sampling square b) canopy closure taken
from Becker et al., 2004
12
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Chapter 3
Context
Remote sensing technology offers a broad array of opportunities for crop mapping
and monitoring (Moran, 1997). However, previous studies have focused on crops such as
wheat, com, maize, soybeans and rice, which have a seasonal pattern of growth, allowing
for the adoption of multi-seasonal approaches (Daughtry et al, 1992; Badhwar, 2001;
Lobell, and Asner, 2003; Vina et al., 2004). Some tropical crops are included in these
seasonal varieties; but permanent crops such as coffee, usually require mapping
methodologies that call for an intense process of editing after the initial image processing.
The use of image analysis for crop mapping is grounded in the physical and chemical
characteristics of the plants, which determine their spectral response to solar irradiation
(Figure 3-1). Some of these characteristics include chlorophyll absorption, leaf pigment
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
Leaf
Cell
Pigments Structure Water Content Leaf
Reflectance
Water
Vegetation
Water Content
Sand
Chlorophyll
Absorption
0 ,4 0 ,6
Figure 3-3: Spectral Behavior of Vegetation, Sand and Water for Different Wavelengths.
Adapted from Chuvieco, (2002).
As mentioned earlier, remote sensing has primarily been used to map seasonal
crops with several combinations of the near infrared and red bands of the spectrum
(Haboudane et al, 2002; Sims, 2002; Daughtry, 1992; Zwiggelaar, 1998; Apan et al.,
2004, Vina et al., 2004). Some approaches use raw bands (especially the information in
the green, red and near infrared part of the spectrum), while others use spectral
temporal variation in spectral response of seasonal crops for mapping and monitoring
(Wall et al., 1984; Odenweller, 1984; Badhwar, 1984; Fisher, 1994; Grignetti etal., 1997;
Ippoliti-Ramilo, 2003; Markley, 2003; Van Niel and McVicar, 2004). Combinations of
bands into vegetation indices are useful to differentiate between land cover classes as
well. The basis behind vegetation indices is the difference between the absorption by
chlorophyll in the red part of the spectrum (0.650pm) and the reflectance in Near Infrared
14
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
-N IR - (0.79 - 0.90(Am) in healthy vegetation. The most widely used index has been the
Normalized Difference Vegetation Index (NDVI), which is the difference between NIR
and red values divided by their sum. Healthier, denser and more productive crops have
higher values of NDVI. The use of NDVI and other vegetation Indices - Plant Pigment
Ratio, Photosynthetic Vigor Ratio, and Greenness - have also been incorporated into
cost-effective systems for crop management and monitoring (Mettemicht, 2003; Sims,
2002 ).
Satellite imagery has also been widely used to tune crop growth and yield models and
improve their predictive capacity (Maas, 1988; Moulin et al, 1998). The two ways in
been through the use of empirical and semiempirical models. In the former, a regression
equation between vegetation indices and measured yield in seasonal crops is found, and
this function is used to predict future yield of the same crop (Vina et al., 2004). In the
latter, semi-empirical models use other plant parameters, such as harvest index (dry
biomass/total biomass) to predict future yields but utilize remote sensing information for
calibration and parameterization. Calibration usually involves the use of Leaf Area Index
with satellite imagery -specifically red, near infrared and NDVI - and parameterization
involves the calculation of light use efficiency of the plant, inferred from measurements
in the visible bands. Usually semi-empirical models can be more universally applied,
requiring less calibration for application to other regions and temporal domains than the
15
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Context
Mapping coffee plantations have been of special interest to the remote sensing
community in Brazil (Moreira, 2004), and interest in studies of plantation dynamics has
grown in Colombia over the last decade; (Guhl, 2004). The total area used for coffee
plantations can provide an indicator of the production limits in terms of the area available
quantification of the total existing and potentially developable coffee area might serve to
(Rubiano, 2004) and for the creation of special databases for precision crop management
(Moran, 1997).
plantations, coffee yields are being predicted using infrared photography (NASA, 2004).
In a special partnership program with the private industry, NASA6 supports Community
Coffee Inc., a company which buys coffee primarily from Guatemala and Brazil, in
monitoring coffee growth and yield trends. The project involves the use of remote
tracking coffee growth and assessing crop health. NASA provided satellite imagery of an
area in Guatemala and supplemented it with infrared aerial photography of the same area.
The resulting product was a composite colour image of the area showing the size,
16
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
location, health status and yields of coffee-growing areas. However, for larger areas, the
and Moreira (2004) have used Landsat images for coffee mapping. They integrated
Moreira (2004) and Croome (1989) found useful spectral band combinations to
distinguish between shade and non-shade coffee using spectral enhancements of Landsat
Thematic Mapper (TM) images. Nevertheless, they highlight the need for more research
in coffee mapping. The methodologies used by Moreira (2004) and Croome (1989)
involved intensive editing after image processing, as well as the use of masks over steep
areas which isolates the coffee without the influence of topographic shade.
lightly shaded and non-shaded. In his study, band 2 of TM (0.52 - 0.60 pm) was
determined to be the most useful channel to distinguish coffee from other vegetation
types, and also to identify spectral variations within different coffee systems (shade and
non-shade). Croome suggested that a color composition of the far infrared (TM7), green
(TM2) and infrared (TM5) will accurately identify the coffee plantations. By using an
affine transformation7 algorithm, bands TM2 (green) and TM7 (medium infrared) were
combined to identify the three coffee systems (shade, partial shade and non-shade). The
results from the TM image were evaluated with aerial photography. However, in slopes
greater than 20 degrees, the influence of shadows resulted in poor coffee discrimination.
A n affine transformation is a linear combination followed by an offset factor, which performs a multidimensional
manipulation o f the image data
17
R eproduced w ith perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Context
Croome also used the first two components of a Principal Component Analysis (PCA) to
distinguish between shade and non-shade coffee but is not clear why this approach led to
accuracte results. Although Croome (1989) evaluated his results using aerial
In his study, Moreira (2004) analyzed the spectral and temporal behavior of coffee
crops in Landsat Images. He found that the images acquired during the dry season were
more efficient to map coffee plantations at an early development stage and under
production. Band 4 (near infrared) of Landsat Thematic Mapper and Enhanced Thematic
Mapper (ETM+) gave the best performance for spectral differentiation between coffee
and other land cover classes during the dry season. During the rainy season, mature
coffee was mixed with pastures, forest and coffee at an early development stage.
According to the author, higher accuracies were obtained during the dry season using
band 4 of TM because the canopy structure is denser in coffee areas as compared to the
sparser surroundings. In Moreiras study, the variability within the reflectance values for
active producing coffee plantations :caused by different plant ages, development stages
and space between plants indicating a need for further field work for the identification
of coffee crop in Landsat images. For instance, the loss of leaves during the harvest
resulted in lower band 4 (NIR) values, and higher values in band 3 (Red). As with
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
The traditional approach in remote sensing imagery has been per-pixel classification
using multidimensional spectral band analysis (i.e. several spectral bands). In this
approach the objects in the image are grouped into categories using statistical analysis on
a per-pixel basis (De Jong, S. and Van der Meer, F., 2004), a process known as image
classification. Some of these algorithms for image classification, such as the well-known
maximum likehood classifier, assume normal distributions for data analysis, which is not
always the case. Frequently, the classification analysis ends with filtering in order to
eliminate noisy pixels in the final land cover map. By introducing the concepts of
neighborhood, distance and location, homogeneous pixels can be grouped into objects
through a segmentation of the image (Haralick and Shapiro, 1985; Blaschke and Strobl
2001), the main conceptual framework behind object-based remote sensing. In contrast
analysis are image objects, not single pixels. Image objects are created through a process
called image segmentation, which is the grouping of pixels into homogeneous objects.
correspond to homogeneous areas in the image, reducing the amount of data for analysis
and reducing uncertainty (Lucieer, 2005). It is assumed that those homogeneous areas
(called objects) have low internal variance. Image information can be represented in
different scales based on the average size of image objects. The concept of hierarchy in
the image (Woodcock and Harward, 1992). Using different levels, context-based
19
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
features, such as texture features (e.g. homogeneity, compactness) can be also be used to
In addition to texture, shape features and spatial distribution have been used to
patch refers to a spatial and functional unit of the landscape and has a spatial geometric
and thematic pattern. Coffee plantations can be seen as homogeneous objects with
different levels of spectral patchiness depending on the canopy cover of the trees over the
coffee (in shaded coffee systems) and the density of the coffee bushes (in Sun or non
analysis and display of spatially referenced data. Through spatial analysis of topographic
topographic indices, can be derived using GIS routines (Wilson and Gallant, 2000). It
has been demonstrated that when these biophysical attributes are combined with spectral
information from satellite imagery, the accuracy in the characterization of land cover is
improved (Wheatley et al., 2000, Walsh et al., 1998, Michalak, 1993). For instance,
Walsh et al. (1998) used data from Landsat TM, and landscape metrics to assess the
20
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
drainages. In another example, Janssen et al. (1990) combined objects from digitized
polygons with a per-pixel classification: Objects were assigned to the land cover class
with the highest frequency. The 12% improvement in classification accuracy obtained by
The term Integrated Geographic Information Systems (IGIS) has been used to
describe the link between an image analysis system and GIS (Hinton, 1996). GIS and
remote sensing have been widely linked for spatial data analysis, such as: image
illumination correction and orthorectification using digital elevation models (Justice et al.
1980), generation of digital elevation models from satellite images (Bolanos, 2000), land
cover change analysis (Michalak, 1993; Muller and Zeller, 2002; Xiuwan 2002), natural
hazards research (Nagarajan et al. 1998, Walsh et al., 1998) and sustainable land
Traditionally, data stored in a GIS needs to be converted into raster format in order to
be integrated into the image processing chain. Raster layers generated from remote
sensing data are incorporated into a GIS database through a transformation (e.g.
causes certain problems due to generalization such as loss of accuracy (when data are
21
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
converted between formats) and constrains the full range of applications, caused by
the image analysis system, GIS functions can no longer be applied to the data (e.g.
attribute and topological queries, etc). Similarly, products derived for image analysis
(e.g. land cover maps) loose their raw radiometric data, and are generalized into a limited
number of classes.
Obviously, a full integration can optimize the extraction of information from imagery
and vector data by utilizing all of the relations and functions that GIS and image analysis
might provide in a two-way flow (Hinton, 1996, Janssen and Molenaar, 1995). For
instance, in the selection o f training areas for image classification, image and topographic
statistics might be combined to make optimum use of the full range of terrain data and
information and vector data can also be used as additional bands in image segmentation,
in which the creation of objects at different scales minimizes much of the heterogeneity
in the image (heuristic optimization). Figure 3-2 shows the conceptual framework in
image segmentation integrating topographic and imagery data using an object based
approach.
22
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
Figure 3-4: Representation of polygon classification functionality: Attributes of objects stored in the
database table are used for image segmentation and classification. Those attributes are generated form
spectral information (e.g. mean of red) or from GIS data (e.g. soils, topography). The output classification
still contains a database table associated to the object using the polygon identifier. Adapted from Hinton
(1996).
The object-based approach is used to generate image objects or GIS statistics within
and between polygons, and then to incorporate those statistics directly into a database.
Those statistics, stored as polygon attributes, can be used for image classification or
generation of thematic maps. The result is a description of the image content by objects
with a number of attributes (e.g. mean reflectance, size, shape and neighborhood
synthesis. The user can generate thematic or statistic raster products by evaluating the
occurrence of spatial and non-spatial attributes through operations such as attribute query,
and neighborhood analysis. One of the most common spatial attributes used to
characterize the environment of a specific land cover class is the elevation value, derived
23
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Context
3.5 The Use o f D igital Elevation M odels (DEMs) for Land Cover A nalysis
The use of DEMs to represent terrain conditions as a surrogate for soil information -
often unavailable- for landscape characterization is an approach that has been widely
used in the literature (Walsh et al., 1998, Franklin et al. 2000). Topographic indices such
as relative elevation, slope, wetness index, upslope length and curvature have also been
used to explain spatial variability in crop yields and biomass (Zeleke, et al 2004).
Therefore, there is a need to carefully evaluate the use of digital elevation data in
landscape characterization based upon a detailed knowledge of the methods used in their
24
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Chapter 4
Methods
The DEM used in this research was generated by the 11-day Shuttle Radar
Topographic Mission (SRTM) of NASA and obtained from the GLCF8. The data
database of the Earth. The reported horizontal accuracy of this DEM is 3m, and 6 to
10m for the absolute and relative vertical accuracy, respectively (Rabus, 2003).
quality. For instance, sinks (i.e., local depressions) and horizontal shifts between the
images and the DEM affect subsequent procedures such as image orthorectification and
DEM were evaluated and the descriptive statistics are presented in the results section.
For the vertical accuracy, the elevation available in 240 data points obtained in the field
8GLCF is the acronym for the Global Land Cover Facility project, supported by NASA.
25
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
using a high precision Trimble PRO-XRS differential GPS system were used. For the
horizontal accuracy, a stream network produced from the DEM was overlaid on the
Landsat image. The stream network was produced by performing a full drainage analysis
on the DEM, as described and proposed by Jenson and Dominique (1988). In this
analysis, primary and secondary topographic attributes such as slope9, aspect, flow
direction and flow accumulation are derived from a DEM and the stream network layer is
produced using the higher flow accumulation values. The methods to derive primary
topographic attributes such as slope and aspect (i.e., orientation of the line of steepest
Based on slope and aspect values, the primary flow direction determines the direction
of water moving over the land surface. Once the flow direction is established for each
cell, a value, equal to the number of cell that flow to it is assigned, resulting in a flow
accumulation data set. The flow accumulation is used to produce a raster drainage
network dataset, in which cells with a greater accumulation value of threshold are
selected. As the threshold decreases, the density of the network increases. Finally, a
9 Slope measures the rate of change of elevation in the direction of steepest descent Slope influences the
flow of water, and hence, the soil water content
26
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
The geometric correction involves modeling relationships between the image and
a set of accurately measured ground coordinates (in X, Y and Z). The product is an
orthorectified image, an image that appears as if the satellite or the viewer is looking
Earth coordinates, and is corrected to remove errors due to elevation changes (Cheng et
al., 2000). The image used in this research was orthorectified by Earth Satellite
Corporation using methods described by Tucker et al. (2004). It is expected that the
positional error of the image is less than 50m, as described by Tucker (2004). A few
GPS points were also located in roads intersections to verify this. The radiometric
correction for this image, however, deserved a more detailed analysis, as described
below.
incoming solar radiation and diffuses radiation from the sky, or skylight. This skylight
prevents absolute darkness in shadows where direct sunlight is absent and acts as a
masking agent, reducing the clarity of images obtained from airborne or spacebome
sensors.
by adjusting for sensor measurements, on-ground gains and offsets, Sun angle and Earth-
Sun distance. Conversion from digital numbers to reflectance values and constants
follows the equations provided by Markham and Barker (1986) and Chavez (1996). To
27
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
implement this conversion to reflectance values, Irish (2002) proposed equation 4-3 to
L=Gain*DN+Offset
Equation 4-1
Where:
2 1 1
L ; Spectral calibrated radiance, in units w m ' sr fim
DN ; Digital Number (0, 255),
(Lnax/^.n) upper and lower spectral radiance limit, units wm ~2s r ]jum~\
Gain= jy"'"'
(C^max C/min/ | Rescaled gain, in units wm ^ s r 1fa n 1.
The lower and upper radiance limits are related to their correspondent minimum
(Q m m ) and maximum (Q*) digital number, equal to 0 and 255 respectively, in Landsat
images.
The reflectance values for bands 1 to 5 and 7 are calculated using the Dark Object
Vca=(x*L M *d$)
(,*cos(&))
Equation 4-2
Where:
28
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
d : The radius vector (ratio of mean to actual Sun-Earth distance) that can be
computed as:
d= 1
(1 0.016729*cos(0.9856(Z)(9]/ 4))) w here^O E is the julian day number10.
Band 6 is also converted from spectral radiance (as described above) to at-satellite
emmissivity11 and using pre-launch calibration constants (see annex 1), using the formula
K2
T=
In
J
Equation 4-3
Satellite imagery data are observed from viewing angles that vary from near to far
range12 of the image, which causes differences in reflectance of the same type of
coverage depending on their position within the image. To correct for this variation, a
10 The Julian day number is a continuous count o f days that have elapsed since January 1st.
11 It means that the material absorbs all electromagnetic radiation that falls on to it
12 The width o f the track covered by a sensing system. Also called the range
29
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
on viewing and illumination geometry (Susaki et al., 2004). A BDRF model consists of a
set of equations that relate surface physical properties (e.g. soil reflectance, canopy
inclination, distribution and anisotropic properties of canopy and soil are required (Qi et
al., 2000). However, the BDRF effects mainly affect satellite images that have a wide
swath7 (e.g. NOAA-AVHRR or MODIS that have a swath of approximately 2300 km)
but have a negligible effect on images covering smaller areas. Therefore, the BDRF
effect is considered to be negligible in the image subset used for this research.
The area selected for this study is mountainous, as are many agricultural areas in
the tropics. In such areas, the steepness of the terrain causes the slopes of hills facing
away from the Sun to appear darker and the slopes facing towards the Sun to appear
brighter than corresponding flat areas. This differential illumination effect causes
classification from satellite images. In order to reduce this variation, the illumination
effect is corrected for each pixel in the image by a proper simulation of illumination
determined by elevation, slope, aspect, Sun elevation and Sun azimuth (Eastman, 2001).
13 The ratio of the radiation reflected from an object to the total amount incident upon it, for a particular
portion of the spectrum
17 The width of the track covered by a sensing system. Also called the range
30
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
There are two different approaches to correct for the varying illumination and
effects (Pons and Sugranes, 1994; Sandmeier and Klaus, 1997; Dymond, 1999). The
difficulty with this approach is that the estimation of radiances, transmittances, and
diffuse solar fluxes for each land cover in the image is necessary before this technique
can be applied (Richter, 1997; Shepherd and Dymond, 2003). The second approach is
empirical, and uses band ratios (Justice et al., 1980; Holben and Justice, 1981; Colby,
Although, is the simplest and most used approach, the main drawback is the
differentiated diffuse radiation16 in each band. The most common of these methods is
called cosine correction which has proved to over-estimate the value of slopes close to
because it leads to the appropriate correction for topographic effects of coffee fields in
Landsat ETM+ (Uribe, 2005). Teillet et al (1982) and Meyer et al. (1993) found that the
14 A radiative transfer model is the fate of electromagnetic radiation as it is scattered, absorbed or reflected
while passing through a medium
15 Eastman (2001) proposes a topographic correction based on a linear regression between a hillshade and
the image, and uses the slope of this regression to calibrate the hillshade and subtract it from the original
image to apply the correction.
16 Diffuse radiation is the radiation scattered by particles in the atmosphere, and is wavelength dependent
31
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
being proportional to the cosine of the emergent angle of reflection defined with respect
to the surface normal vector (i.e. the incidence angle). The c-correction algorithm is a
semi-empirical function, that uses a c-factor, defined as c=h/m, where m represents the
slope and b the intercept of a linear regression between L, the radiance and cos the
incidence angle (see Fig. 4-1). The c-correction is defined by Teillet et al. (1982) as:
r cos# + c
L h LT
COS/+C
Equation 4-4
0S= Solar zenith angle (i.e. 90 - solar elevation angle. See Fig. 4-1)
According to Smith et. al. (1980), the incidence angle can be calculated as:
Equation 4-5
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
Therefore, the c-correction simulates the effect of indirect illumination from sky:
the c-factor simulates the effect of path radiance17 on the terrain at given slope and aspect.
Figure 4-1: Representation of the solar zenith angle Z and the incidence angle i
Soil background also has an effect in the radiometric spectral signature of land cover
(Huete, 1988; Rondeaux, 1996). However, due to the density of the coffee bushes and
the pixel size, the effect of soil background on the signatures of coffee systems was not
4.4 Hierarchical Classification: separating the more suitable areas for coffee
classification results (Benz et al., 2004). A preliminary classification that separates the
areas that have higher probability to have coffee fields will help to focus the image
analysis for such areas (e.g. areas of agroforestry systems). In this stage, the minimum
17 Path radiance is the effect of the atmosphere acting as a reflector itself, adding scattered radiance to the
signal detected by the sensor (Lillesand and Kiefer, 1994)
33
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
pastures, water, forest, urban and agricultural areas (that include coffee systems as well as
any other type of agriculture or agroforestry systems). The algorithm uses the mean
reflectance on each band for a signature and each pixel is assigned to the class with the
mean closest to the value of that pixel (Mather, 2001). Subsequent steps with object-
based and pixel-based approach were run using only the mask of both forest and
agriculture.
The reflectance from bands 1 to 7, the at-satellite temperature (obtained from the
thermal infrared band) and the DEM can be integrated in a database to run segmentation
and classification algorithms in e-cognition software (Baatz et al, 2004). Meinel and
and they showed accurate shape estimates (e.g, area, diameter, shape index) using e-
cognition. The software combines neighboring pixels that show a degree of spectral
similarity, which is lower than a threshold defined by the scale parameter18. The smaller
the scale parameter, the larger the number of objects and the smaller their size. The
intrinsic features (e.g. color, shape), topological features (e.g. position regarding to
another object) and context features (e.g. texture, a measure of the amount of tonal
change between neighbor objects). However, context features characterize an area only if
the pixel size is much smaller than the objects of interest. Therefore, only intrinsic
18 The segmentation results (polygons, delineating objects) can be converted into a GIS-readable vector
format.
34
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
(Blaschke and Strobl 2001). Therefore, the objects created through image segmentation
process must be meaningful; in order to analyze their properties and make the
correspondence between the objects (as defined) and their attributes. Statistical and
geostatistical19 models can quantitatively characterize spatial structures. As the main tool
for geostatistical analysis, the variogram is a quantitative descriptive statistic that has
and Tate, 2000). A variogram analysis consists of the experimental variogram calculated
using the data and the variogram model fitted to the data. The experimental variogram is
calculated by averaging one-half the squared differences of the values over all pairs of
observations with the specified separation distance (called lag distance) and direction
(Goovaerts, 1997):
I n (h )
Equation 4-6
Where n(h) is the number of pairs in lag h, dn(xi) represents the image digital
19 Geostatistics is a branch of statistics that deals specifically with the spatial relations indexed over
continuous space and focus on spatial variability.
35
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
and semi-variance in the j-axis. Variogram analysis has also been used to characterize
the local variance in satellite images to determine optimal pixel size (Atkinson and
Curran 1997). It has been shown (Woodcock and Strahler, 1987, Woodcock et al.,
1988a, 1988b) that the variograms range (i.e. the x coordinate of the sill point of the
variogram, see fig. 4-2) is directly related to the texture and/or object size (i.e. the scale
of spatial variation), while the sill (i.e. the y coordinate) estimates the a priori variance at
certain support (i.e. pixel size). The nugget effect represents variability at distances
smaller than the pixel, and is attributed to the measurement error, caused by noise. The
the ground and the sampling framework, constituted by the pixel size, the sampling
1.0
0.8
0u)
I 0.6
i
W 0.4
0.2 Nugget
0.0
0 2000 4000 6000 8000 10000
Lag (meters)
36
with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
Assuming that the variance is the same in all directions (i.e. isotropy) the
variogram is used in this research to estimate the size of the homogeneous areas in the
image, for the creation of objects, by analyzing coffee fields in aerial photographs.
Despite their multiples advantages, the segmentation of an image into objects will
involve the modifiable areal unit problem -M A U P- (Openshaw and Taylor, 1981). It
refers to the fact that the same pixels can be aggregated differently and that the results of
the spatial analysis depend to some extent of the aggregation strategy used. Spatial
aggregation reduces the size of data by converting raw data into spatial units that are
modifiable, and affect the statistical analysis when the information is grouped at different
levels of spatial resolution (called the scale component of the MAUP). The variability in
the statistical results obtained as a result of the different ways in which the information is
grouped at a given scale is the aggregation component of the MAUP, which also affects
the spatial analysis (Wrigley et al., 1997). In order to explore how much variation is
attributed to the aggregation system, statistics from pixels and objects are compared and
the effects of aggregation are analyzed using the differences. As described by Wong
(1996) reporting the scale-sensitivity of the results (e.g. the error due to aggregation
Shape features are based on the statistics of the spatial distribution of the pixels that
form each image object and have been widely used to characterize different types of land
37
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
cover (Southworth, 2002). Some of the general shape features used for land cover
characterization are: area, ratio length/width (calculated by dividing the length by the
width of the bounding box surrounding the object), border length, shape index20
(calculated by dividing an image objects border length by four times the square root of
its area), density (calculated by dividing the area of the image object by its radius: a
square image object has the highest density), compactness (i.e., the product of its length
and width, divided by the number of pixels forming the object) and asymmetry:
expressed as the ratio of the lengths of minor and major axes of an ellipse surrounding the
object. As an alternative way to characterize the objects shape, the elliptic and
rectangular fits compare the object to an ellipse and rectangle, respectively. While 0
The descriptive statistics (e.g. means, standard deviations, minimum and maximum
values) provide measures of central tendency21 and variability in spectral bands and
topographic data that characterize each land cover, and assist in the analysis of their
differences. A first step in using these measures is plotting the samples on a two-
dimensional feature space, in which each axis represents a spectral band, gives an idea of
20The shape index is used to describe the smoothness of the image object boundaries: the more fractal an
image object appears, the higher its shape index.
21 The tendency of the set of measurements to cluster
38
R eproduced w ith perm ission o f the copyright owner. F urther reproduction prohibited w ith o u t perm ission.
Methods
Graphs such as the histogram, which represents the distribution (values and
frequency) of the data values, and the box plot were analyzed for each land cover class in
each band in order to evaluate the data distribution. The box plot is a graph based on
quartiles of the data set (see Figure 4-3). Quartiles are values that partition the data set
into four groups, each containing 25% of the measurements. The lower quartile contains
25% of the measurements, the middle quartile is the median and the upper contains 75%
of the data set. A box plot is drawn on the interquartile range, the distance between the
lower and upper quartile. The two lines emanating from the borders of the box (the
whiskers) extend 1.5 times the box length (i.e. 1.5 times the interquartile range) and it
contains the smallest and largest measurement. Values that are beyond the whiskers are
extreme values that represent rare occurrences. In fact, in normal distributions less than
1% of the data are expected to fall outside the whiskers. Measurements that fall beyond
three times the interquartile range are called outliers. Outliers are very extreme
measurements that stand out from the rest of the sample and may be faulty (e.g. incorrect
recorded observations).
> Chillier
> yyjusker
3rd quartile
* Median
G ro u p A G ro u p B
39
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
based on a normal distribution of the sample data for each land cover class. Then, in
order to determine the type of test that best fits the type of data and to make inferences
about the parameters of the population, a test of normality should be carried out. The
Shapiro-Wilk test (Shapiro and Wilk, 1965) tests the hypothesis that a random sample
comes from a normally distributed population. It is recognized as the best test for
given by:
( )2 i
^ = , where x =~ x>
I>,-*)2
<=1
Equation 4-7
Where x,- represents the value for each observation, and a are a series of
weights generated from the means, variances and covariances of the statistics of a
represents the probability of obtaining a result by chance alone. A p-value of .05, for
example, indicates that we would have only a 5% chance of drawing the sample being
tested and reject the hypothesis, when the null hypothesis was actually true. The p-value
measures consistency: a small p-value is strong evidence against the null hypothesis
while a large p-value means little or no evidence against the null hypothesis, i.e., that
40
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
P-values are compared with a , the significance level. The significance level is the
pre-chosen probability that the null hypothesis will be rejected when is true (also known
as type I error, i.e., observing a difference when in fact, there is none). Common levels
of significance are 5% and 1%, and these are chosen arbitrarily. If a test of significance
gives a p-value lower than the a level, such result are normally referred as statistically
significant and the null hypothesis (of equality) is rejected. If the chosen alpha level is
0.05 and the p-value is less than 0.05, then the null hypothesis of normality is rejected. If
the p-value is greater than 0.05, then the null hypothesis cannot be rejected. Note that in
this situation, the hypothesis being tested is accepted (or rejected) if the p-value is greater
(or lower) than a certain significance level. This is called an one-sided hypothesis.
Wilk test, a non-parametric test can be used in order to evaluate their differences of
means. The Mann-Whitney test can be used to test the differences between the medians
in two groups when the data are collected using different subjects in each group (as is the
case in the sampling of remote sensing data for land cover analysis). As a non-parametric
test, the Mann-Whitney test works by looking at the differences in the ranked positions of
scores in different groups. Although it is a non-parametric test, it does assume that the
two distributions are similar in shape. In the case where the only distributional difference
Theoretically, in large samples the Mann-Whitney test can detect differences in spread
even when the medians are very similar. It test the null hypothesis that the two samples
are drawn from the same population, and therefore that the medians are equal. It requires
41
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
differences between groups. The test involves the calculation of a statistic called U,
whose distribution under the null hypothesis is known, which is defined as the minimum
value between:
Equation 4-8
Where rti and n2 are the two samples sizes and Rj and R2 is the sum of the ranks in
sample 1 and 2, respectively. The maximun value of U is the product of the two sample
sizes. With samples above about 20, the U statistic aproximates to a normal distribution,
with mean mu: and standard deviation a u . Using the normal approximation, the z
Equation 4-9
If the chosen alpha level is 0.05 and the p-value associated with this z value is less
Traditional methods for land cover classification with satellite imagery do not
handle the use of ancillary data appropriately (Oruc et al., 2004). In the integration of
GIS and remote sensing data, the use of ruled-based (also know as knowledge-based
(Goodenough et al., 1987; Fierens et al., 1994, Benediktsson et al., 1990) than statistical
42
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
classifiers. For instance, Murai and Omatu (1997) demostrated that ruled-based methods,
when integrated to neural networks, can improve the classification by about 9%.
The ruled based classification is implemented in this research through the use of a very
recursively partitions a data set into smaller subdivisions on the basis of tests defined at
each branch in the tree (Friedl and Brodley, 1997). In this framework, a data set is
the tree, and the pixel (or object) is assigned to a class according to the leaf node into
which the observation falls. As opposite to other classifiers, this method do not rely on
normality assumptions and handles numeric and categorical data in the same routine
(Quinlan, 1986) The decision rules in each node can be based solely on analyst
expertise, which is difficult to implement across different times and geographic places, or
they can be defined based on the statistical analysis of training data. The classification
system used in this research uses an univariate decision tree in which the decision rule in
improve accuracy, the tree is pruned by removing leaf nodes until training samples are
classified correctly (Friedl and Brodley, 1997). The elimination of leaf nodes is stopped
when further removal reduces the classification accuracy with respect to the independent
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
4.10 R efining the classification: Using the Linear M ixing M odel for Spectral
In order to refine the classification results after the ruled-based classification, and
identify the specific type of coffee system, a mask can was created to delineate areas
most likely to have coffee fields, given their spectral and environmental characteristics
(topography and temperature, specifically). However, due to the small size of the
individual coffee fields in this area, this mask contains some pixels with features that are
smaller than the resolution of the sensor. For instance, the reflectance of a pixel that has
coffee in an agroforestry system (i.e. shade coffee), will contain the combined reflectance
The mixed pixel problem has called the attention of many scholars in remote sensing
(Foody, 2004, Foody et al. 1996, Huguenin et al. 1997, Quarmby et al.,1992, Settle and
Drake, 1993, Sohn and McCoy, 1997, Van der Meer, 1997). In order to segregate a pixel
into its components, a technique called spectral unmixing is performed. Different cover
types are discriminated with a pixel by comparing the observed spectra of the pixel with
the end-members. An end-member is the signature of a pure pixel, which only contains
There are several models for spectral unmixing (Ali, 2002): the linear mixture, the
probabilistic, the geometric, geometric-optical and stochastic. The linear mixture model,
also called linear unmixing model, assumes that the reflectance in each pixel is a linear
44
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
Because its simplicity and independence of the normality assumption, the linear mixture
model is often preferred. In equation form, the linear mixture model can be represented
as:
i'i = a n x, + a 12 + ......... + a, x n + c,
Equation 4-10
components within the pixel, r represents the mean spectral reflectance of the pixel in
each band, x represents the proportion of the each component (or land cover class) in the
pixel, amn represents the spectral reflectance of the each component in the pixel for the
specific spectral band and e represents the error component, caused by noise. Assuming
that the sum of proportions within a pixel is one, the proportions are positive and the
number of components does not exceed the number of spectral bands, this equation can
Smith (1991). The method estimates the proportion of each component inside the pixel
The key factor to resolve this model relies in the selection of the end-members.
The selection o f the end-members can be achieved using a spectral library (from field or
laboratory measurements) or from the purest pixels in the image (Van der Meer, 1999).
45
R eproduced w ith perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
If a spectral library is not available for the type of cover of interest, the purest pixels in
the image can be used for end-member selection. This method has the advantage that the
spectral signatures for the end-members are collected under the same atmospheric
conditions.
The identification of the purest pixels in the image can be done using Principal
Component Analysis (PCA), Pixel Purity Index (PPI) and Spectral Angle Mapping (Ali,
2002). For its simplicity, the principle of the Spectral Angle Mapping (SAM) is preferred
to assess the selection of endmembers. The spectral angle mapping calculates the
spectral similarity between two spectra by calculating the spectral angle between them,
treating them as vectors in a space with dimensionality equal to the number of bands
(Van der Meer, 1995, 1997). Figure 4-4 shows a two dimensional example of spectral
angle mapping. This method was implemented for the selection of end-members.
Sand j
Figure 4-6: The Spectral Angle Mapping in a two-dimensional space. Taken from Ali
(2002)
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
with ground truth data. For this research, GPS polygons representing coffee farms in
Huila, surveyed using DGPS units, were compared with the classified image. The
number of samples for accuracy assessment (as well as for training) depends on the
budget and practical issues, but it should be adequate to maintain the validity of the
proposed statistical analysis. Most of the statistical tests are based on binomial or normal
distribution of the error, for each land cover class. Therefore, the sample size should be
computed using the multinomial distribution. Usually, 30 samples are the minimum
required for the t-test to approach a normal distribution, but this number can be adjusted
according to the importance of certain category (or the complexity of the landscape for
certain category) or by the variability within each land cover signature. In any case, the
minimum number of samples must be preserved for all land cover classes. If we
(k=9), and a confidence level of 95% (a =5%) that the classes are in average 20% of the
map area ([X=0.20), we get approximately 47 samples per class, according to the
_ ^ n .o no
b1
Equation 4-11
The value for /? is determined from a chi-square table with 1 degree of freedom
and (1- a/k). If the confidence interval is lower, the required sample size decreases.
47
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
Note, however, that the sampling scheme (simple random, systematic, stratified
across the landscape. Good statistical properties result from random selection of samples,
However, a fully random sampling is not always possible due to accessibility in many
areas. Therefore, for this research, both training and accuracy assessment, the area is
divided up according to land cover classes, and a random sampling is carried within each
land cover (i.e. stratified random sampling). This method has been demonstrated be
The overall error can be estimated using the error matrix and the kappa index of
agreement (Congalton, 1999). The error matrix is a square array of numbers set in rows
(the map labels or classified data) and columns (the reference data) that shows the labels
relative to the labels assigned to the same point in the reference data.
The output of the error matrix is a matrix of percentages that allows the evaluation
of the number of pixels in each sample that are assigned to each class. The error matrix
will give us the errors of omission, commission, the producers and the users accuracies
defined as including an area into a category when it does not belong to that category. An
omission error (or exclusion error) is defined as the exclusion of an area from the
48
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Methods
category in which it truly belongs. Every error is an omission of the correct and
commission to the wrong category. The overall accuracy is the sum of the major
diagonal (which is the correctly classified sample data) divided by the total number of
sample units. The users accuracy (rows) is computed by dividing the total number of
correctly classified pixels in a category by the total number of pixels classified as that
pixels correctly identified in the map (the proportion of pixels or objects that are correctly
recognized by the classifier), while users accuracy (commission error) is the percentage
of the areas identified in the map that actually belongs to that category on the ground (the
proportion of pixels identified by the classifier that agree with the ground truth).
The kappa index is another indicator of accuracy derived from the values in the
error matrix. It indicates the probability that a pixel classified on the image corresponds
to the same category as observed on the ground, taking into account all marginal
appropriate only to determine the accuracy of one category as right or wrong based on the
proportion of correctly classified pixels, the allowable error, and the desired level of
confidence. However it will not be related with the error matrix with n land cover
classes, in which there is one correct and (n-1) incorrect possibilities. This case is better
between the actual agreement (the major diagonal) and the chance agreement indicated
by the row and column totals. This value is computed for each error matrix and is a
measure o f how well the produced classification agrees with the reference data and it
49
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
goes from -1 to +1. The higher the value of kappa, the better the classification
performance: as the values in the off-diagonal increase, the value of kappa decreases.
N T ux - Z < x + #x)
= ^ ----- _i=!-------------
<=i
Equation 4-12
Where N is the total number of samples, r is the number of classes, xu is the entry ii
(the diagonal) and xi+, x +i are the marginal totals of row i and column /, respectively.
observer variability, or the inappropriateness of the data for mapping a specific land
cover class can cause differences in the error matrix. The misapplication of the
classification scheme to the map or reference data (i.e. a reference point which land cover
class is in between one or more classes in the classification scheme), the subjectiveness in
interpretation and reference sampling and land cover change, all contribute to differences
50
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Methods
Accuracy Assessment
51
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Chapter 5
Results
The quality of the DEM was evaluated by comparing the 280 DGPS22 points
taken in the field and analyzing the differences between the GPS height and DEM values
for the study area. The mean error was 24.039m. The standard deviation, minimum
and maximum values of the error were 10.33, -59.6 and 2.30 meters, respectively. The
distribution of the error is shown in figure 5-1. Given the scale of the analysis and the
pixel size (28.5m) of Landsat images, this error was considered to be acceptable for the
purposes of this study. The evaluated DEM will be used for topographic illumination
correction (see section 4.3.3. and 5.2), and to generate secondary topographic attributes
(slope and aspect) that characterize the topography of coffee and non-coffee fields.
22Differential Global Positioning System DGPS -See section 4.7 for theoretical details
52
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
0.000
10.000
m
Urn
1 - 2 0.00 0
g - 3 0.00 0
C
1 - 4 0.0 0 0
5 0.00 0
- 60 .0 0 0
- 70 .0 0 0
Figure 5-8: DEM evaluation. Error Distribution calculated as DEM values-GPS height.
In order to evaluate the overlap in X, Y between the image and DEM, a drainage
section 4.2. The fitting between image and DEM was then analyzed by visual inspection
of the overlap between image and vectors on rivers. Figure 5-2 shows the overlap for
Figure 5-9: Overlap between image and rivers generated from DEM in two subareas.
53
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
The geometric and radiometric corrections were done following the procedure
described in section 4.3. The values for gain, offset, solar spectral irradiance, solar zenith
angle and calibration constants K1 and K2, are given in Annex 1. The terrain
illumination correction was done using the c-correction method described in section 4.3.3
by Uribe (2005). Figure 5-3 shows the image before and after the terrain illumination
Figure 5-10: Subset over the study area showing the image before (left) and after (right) the
terrain illumination correction was applied. Band Combination: Blue:ETM3, Green:ETM5, Red:
ETM4
performed in order to extract the most likely areas to have coffee fields. The
classification scheme contains seven classes: forest, pastures, clouds, shadows, urban,
water and miscellaneous. The miscellaneous category includes areas that contain all
coffee systems and non-coffee areas as well. Non-coffee areas include those land cover
54
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
classes whose spectral signature is either, similar to coffee systems or very different to
the other six classes (pastures, clouds, etc.). The overall classification accuracy obtained
was 97%, as evaluated by the kappa statistic (see section 4.11). Points for classification
training and accuracy assessment at this stage were taken by visual interpretation of the
image. Two sets of fifty points per class were randomly localized for training and
accuracy assessment, respectively (as described in section 4.11). A mask was created
using the miscellaneous class and all the subsequent analyzes on the image and GIS data
were done using this mask. Figure 5-4 shows the result of this preliminary classification.
55
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
Legend
H i Forest
I | C lo uds
Shadows
1
P astures
Urban
M is c e lla n e o u s
3 0 3 6 9 12 Kilometers
56
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
Because the Normalized Difference Vegetation Index (NDVI) has been long
related to biomass content (Baret and Guyot, 1991; Hatchfield, 1993; Lobel, 2003;
Moran, 1997, Moulin et al., 1998; Vina et. al., 2004; Zwiggelaar, 1998), the NDVI was
used to explore the capability of Landsat images to discriminate between non-shade and
shade coffee systems, which have a higher biomass content. The NDVI enhances the
differences between band 4 (reflection by the plant structure) and band 3 (absorption by
the chlorophyll):
N IR -R E D B 4 - B 3
NDVI = ---------------= -----------
NIR + RED B4+B3
Using the reflectance in these two bands, the NDVI was calculated in order to
evaluate the correlation between NDVI values and percentage of shade (overstory) over
coffee bushes, as measured by 44 points taken in the field (see section 2.3).
57
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Results
0,8-i
0.80 -
S Ip 'J H #**
ft ?ft 1 ---
* * * * *
Figure 5-12: Correlation between percentage of cover (measured in the Field) and NDVI values
calculated using reflectances in Landsat
Although the slope of the regression line between the two variables is 0.745, i.e, it
is almost a linear relationship, the NDVI values only explains 12.33% of the outcome
variable (i.e. percentage of shade) with a significance level of 0.05. The linear
relationship, if exists, seems to be stronger at NDVI values of 0.5 to 0.8. This can be
associated to the saturation of the model at NDVI values higher than 0.8. Note also, that
at NDVI values lower than 0.1, the model does not detect any change in percentage of
shade. One possible reason for this is that changes in percentage of shade can be
occurring at short distances (i.e. less than 30m) that will not be detected by a Landsat
In order to choose the bands that will separate coffee from non-coffee fields and
to analyze their representative signature, the spectral reflectance in each one of the seven
58
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
bands for all sample points (280 for coffee systems and 300 for non-coffee areas) was
mapped in a scatter plot, as shown in figure 5-5. From the figure, it is noticeable that
some combinations of bands 3, 4, 5 and 7 will provide better separation between the
coffee systems and non-coffee areas (miscellaneous), and that band 1 shows the highest
overlap between both. For this reason, band 1 was discarded for subsequent analysis.
*SjdbP
00 A3 3
0 A 3 3
/ A 3 0
p PP
0 0 P 0
0 0 P
bl to hi M
/b5 bT
Figure 5-13: Scatter plot in all bands for coffee (dark blue) and non-coffee classes (green)
Samples were also evaluated through the inspection of histograms (see Annex 2)
and box plots for coffee systems and non-coffee areas for bands 1 to 7, temperature,
elevation, slope and aspect, shown in Annex 3. The box plot for band 1 in both land
cover classes confirms the strong overlap in this band (see figure 5-7). Given that the
samples have strong deviations from normality24, the difference between both land cover
59
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Results
I !'!
0 MOM
0.12300
o mm
29... 2?
B.mmo
OJB0OH
S.QsifiOO
class
Figure 5-14: Box Plot for both land cover classes in band 1
image using pixel-based statistics, i.e., range values of sample pixels in bands 2 to 7,
60
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
temperature, elevation, slope and aspect. The criterion used to get coffee systems is
Factor Range
Band 2 0.08 <b2< 0.135
Band 3 0.030 <b3 <0.10
Band 4 0.15 <b4< 0.50
Band 5 0.05 < b5 < 0.30
Band 7 0.025 <b7< 0.17
Temperature 17 < temp < 25
Elevation 1200 < elevation < 1850
Table 5-2: Features ranges used in pixel-based classification
A total of 1429 points were used to evaluate accuracy. For forest, clouds,
shadows, pastures, urban, miscellaneous and water, the points were located by visual
inspection of the image. For the coffee class, the points were selected form the SICA
database (see section 2.2 for details), using two criteria: a) the farms size was bigger than
1 hectare and b) The density was higher than 4500 plants. The overall accuracy was
71.89% and the kappa statistic25 was 0.659. The 95% Confidence Interval ranges from
69.553% to 74.219%. Table 5-3 shows the user and producers accuracy and the kappa
Note that despite the overall accuracy being acceptable, the accuracy achieved for
coffee systems and non-coffee (i.e. miscellaneous) areas, is about 40%, and, according to
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
the confidence interval, it goes up to 60% for the users accuracy. Likewise, note that the
omission error (as measured by the producers accuracy) is less than the commission
error. Therefore, there are more pixels classified as coffee than actually are in the
Legend
Forest
I I C loud s
Shadows
Pastures
Urban
W ater
12 Kilometers a
I
C of fe e S y s t e m s
I M isc e lla n e o u s
62
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Results
The corrected image (following the removal of geometric and radiometric errors)
was segmented using a region growing segmentation technique, where neighboring pixels
whose spectral properties (in bands 2,3,4,5 and 7) are similar enough, were aggregated,
considered to be similar enough if the absolute value of the difference between their tone
intensities was smaller than a threshold (Haralick and Shapiro, 1985). The threshold is
defined by the scale parameter in e-cognition. In order to evaluate the scale component
of the MAUP (see section 4.6) a variogram analysis was performed. It determined an
average objects diameter of 100 meters for coffee fields, which results in a scale
parameter of 10 for the segmentation. To define this scale parameter, the variogram
analysis was done on a representative coffee fields identified on aerial photographs using
the spatial dependence modeler in Idrisi software. Aerial photographs were used for
this part of the analysis, because coffee fields are easier to distinguish due to their higher
Figure 5-16: Left: omni-directional variogram for a coffee field in an aerial photo. Right:
segmented image
63
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
Note that, according to the test, the differences between pixels and objects mean
value for the same point are minimal. From this comparison, I conclude that the
aggregation of pixels into objects will not modify the mean spectral signatures for the
coffee and non-coffee fields in the area. One possible reason for this, is that the objects
size were kept as small and as homogeneous as possible, in order to avoid introducing
variations in the signatures when pixels from different land cover classes are merged into
the same object (that normally occurs when the objects size is increased).
normality test was applied to all signatures for coffee systems and non-coffee areas for
mean values and standard deviations in bands one to seven, elevation, slope and
curvature. Since the algorithm uses a rule-based classifier, minimum and maximum
values in each object can also be used for classification. Those were evaluated for
normality as well (see Annex 4). The general shape features described in section 4.8
were obtained from the segmented image (e.g. area, diameter) and evaluated for
65
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
normality, because they can also be used in the decision rule. Results of the normality
Given that the data had significant deviations from normality, a Mann-Whitney U
test for difference of means (see section 4.9) and a careful analysis of the box plot for
object-based features was used to choose the variables in which the coffee systems were
significantly different from non-coffee areas. Tables 5-5 through 5-8 summarize the
66
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
<57
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
Features that show significant differences between all types of coffee systems and
non-coffee areas were used to create a rule-based classification. For instance, elevation
(as shown in table 5-5), presents significant differences for coffee systems and non-coffee
areas, using the sample data. Although there were significant differences between coffee
systems and non-coffee areas in many features, not all of them lead to accurate
classification results. Therefore, using the group of variables with significant differences
between coffee systems and non-coffee areas, several classifications were done, adding
one feature each time. This can be described as a trial and error process, where features
for classification were entered (or removed), as the classification accuracies were higher
(or lower). The structure of the decision rule is shown in table 5-9.
Factor Range
Mean Band 2 0.08 <b2 < 0.135
Mean Band 3 0.030 < b3 < 0.10
Mean Band 4 0.15 <b 4 < 0.50
Mean Band 5 0.05 <b 5 < 0.30
Mean Band 7 0.025 <b 7 < 0.17
Mean Temperature 17 < temp < 25
Mean Elevation 1200<elevation< 1850
Std. Dev B3 0.005<St.Dev.B3<0.01
Std. Dev. B7 0.005<St.Dev.B4<0.03
Std. Dev. B7 0.005<St.Dev.B7<0.03
Max. B2 0.06<MaxB2 <0.13
Min. Aspect Min Asp>3
Table 5-9: Range values 'or object-based classification
Note that not all features show significant differences between coffee systems and
non-coffee areas. For instance, although the slope is a feature that characterize the
topography of coffee fields, the use of a narrow range, as the one shown in Annexes 2
and 3 (see box plot for slope, object-based statistics) constrains the algorithm to select
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
only those fields that have slopes values higher than 20, which excludes many objects of
The corresponding classification result is shown in figure 5-9. Note, that though
the coffee systems area (shown in red) is overestimated, the classification clearly
eliminates some areas very unlikely to have any type of coffee system. As an example,
note that the area surrounding the urban center, the areas surrounding the clouds and the
Using the same data set described in section 5.5, the overall accuracy was
74.881% and the kappa statistic26 was 0.696. The 95% Confidence Interval ranges from
72.629% to 77.133%. Table 5-7 shows the user and producers accuracy and the kappa
classification, the accuracy for the coffee class is improved by 17%. As indicated by the
69
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
user and producers accuracy, the omission errors are still less than the commission
errors. It means that there is more likely to find that a coffee field in the map is not a
coffee field on the ground, than to find a coffee field in the ground that is not mapped as
such. I other words, the area of coffee class has been overestimated.
Legend
m | Forest
I | C lo uds
Shadows
H
P astures
Urban
W ater
C o ffe e S y s t e m s
3 0 3 6 9 12 Kilometers
Projection: UTM 18 North, Datum: WGS84
M is c e lla n e o u s
Figure 5-17: Ruled-Based classification using object-based analysis: Coffee areas are in red
70
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
As another measure of accuracy, the SICA points, which contain an exhaustive sample of
all coffee farms in the municipalities of Pitalito, Oporapa, Palestina and Timana were
overlaid on this map, in order to evaluate the spatial distribution of the coffee systems
land cover class in the map versus the true spatial distribution of the coffee fields from
71
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
w e
Legend
I i M unicipal B oundary
SICA points
m F o re s t
[ I Clouds
m S h ad o w s
P a s tu re s
12 Kilometers
I W a ter
C offee S y ste m s
M is cellan e o u s
F igure 5-18: Spatial distribution of coffee systems in the area: the magenta points represent the
real location of all coffee farms in the area, as indicated by the SICA database.
72
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
5.9 R efining the Classification Result: Evaluating Linear Spectral Unm ixing
A mask was created using the ruled-based classification showed in figure 5-8.
Using this mask, the image was cut and the principles of spectral angle mapping
described in section 4.11 were used to select the most useful bands that better separate
coffee systems from other coverages. Three components were identified that influenced
the spectral signature of coffee within a pixel: coffee, forest and topographic shadows
that still remain in areas of highest slopes. Pure pixels (i.e. pixels with 100% of one and
only one cover) of forest (740 points), coffee (448 points) and shadows (137 points) were
used to analyze the separation between bands, as shown in Figure 5-10 and 5-11.
0. 1 - ^ _
[
0.099
0.098
0.097
0.096
2 0.095
0.094
0.093
0.092
0.091
0.09
0.064 0.065 0.066 0.067 0.068 0.069 0.07 0.071 0.072 0.073 0.074
b1
F igure 5-19: Spectral separation between mean values of coffee systems and non-coffee areas,
measured at the pixel level. Note that the combination of bands 1 and 2 is not optimum because
all mean values for forest, shadows and Sun coffee fall in the same line
73
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
0.06
Figure 5-20: Spectral separation between mean values of coffee systems and non-coffee areas,
measured at the pixel level. The combination of bands 2 and 3 works better, because of the
existence of wider angle between coffee systems and forest
Following this approach, different combinations of all bands were evaluated and
bands 2,3,4,5 and 7 were chosen as input for the linear spectral unmixing. The mean
values calculated for each land cover in each band were input into Idrisi software and the
Linear Spectral Unmixing28 module was run. Figure 5-12 shows the final map.
74
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
Legend
Fo r es t
I I Clouds
Shadows
Pastures
Urban
B 3 M is c e ll a n eo u s
I Su n Coffee
9 12 Kilometers
S h a d e d C offee
Figure 5-21: Classification map resulting from the linear spectral unmixing. Coffee systems
were divided in sun and shade coffee classes.
The product o f the linear spectral unmixing model was evaluated using the same
set of points described in section 5.5. The overall accuracy was 70.184% and the kappa
75
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
statistic29 was 0.646. The 95% Confidence Interval ranges from 67.810% to 72.557%.
Table 5-7 shows the user and producers accuracy and the kappa statistic for each class.
Note, that the overall accuracy is much higher than the accuracy for shaded coffee
and miscellaneous (i.e. non-coffee areas) classes, because of the influence of other land
cover classes with higher accuracies (e.g. pastures, clouds, shadows, forest, urban and
water). Also, note the low accuracy in shaded coffee class, where omission errors are
The accuracy for the Sun-coffee class, as measured by the kappa statistic is very
low (20% approximately) which does not agrees with the accuracy obtained by using the
training points (i.e. the DAPA set of Sun coffee) for accuracy (that was about 75%). This
spectra or b) the variability of the Sun coffee areas that is not measured by the sample.
Since there were as many sun coffee areas as shaded coffee areas in the study region, this
76
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
As another indicator of accuracy, the RMS error image (see figure 5-11) shows
the spatial distribution of the error component in equation 4-29. Also, the SICA points
provided by FNC (see section 2.2) were overlaid to the image and their spatial
distribution was visually compared against the spatial distribution of the coffee systems
77
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Results
Legend
EZ 3 - 1 - * *?
r I 1.897 - 3:794
f I -i 7 9 4
H g 5.691 . 7.58$
H l i '7 . 5 8 8 - 9.435
H P 9.485 11.382
1 11.382 - 13.279
H | 13 2 7 9 15.176
15.176 - 17.873
H i 17.073 - 18.97
0 3 9 12 K ilom eters
Figure 5-22: Spatial distribution o f the error: the RMS image shows that most o f the study area
presents less than 10% o f error within the unmixing model.
78
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Chapter 6
Discussion
analysis support the hypothesis that coffee systems can be characterized using
radiometric and topographic information. Those characteristics can be used to map areas
with higher probability of having coffee fields given certain radiometric and topographic
characteristics. The table 6-1 summarizes the results for the different methodologies
compared.
Despite the kappa value per class being lower than the desired value for a land
cover classification; it is better than the existing methodologies to map coffee Crops. For
instance, Wood and You (2006) used census data to map the spatial distribution of coffee
crops in Latin America, but the overall accuracy for this map was less than 6%, as
checked through the use of the same 1469 GPS points used here for the accuracy
Moreira (2004) did not described the achieved accuracies, is expected that is their
79
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Discussion
accuracies are lower, given that both authors emphasized the need for manual edition
points and the coffee systems in the land cover map, and by the overall accuracy, the fact
that this methodology narrows the geographic area that is more likely to have certain type
of crop (i.e. coffee, in this case), helps to focus the efforts in field work and subsequent
mapping methodologies to smaller regions, minimizing the costs associated to the image
interpretation.
analysis about the local variation (e.g. minimum and maximum values of reflectance and
topography for each homogeneous area represented by an object). This can be used (e.g.
minimum aspect) during the classification and represents an enormous advantage when
characterizing land cover for thematic mapping. The variation in these characteristics
might vary due to different plant development stages. Variations in band 4 agree with the
different foliage structure that the coffee plants and accompanying introduced species of
forest might have. Variations in bands 5 and 7 can be attributed to the different moisture
conditions, closely related to the stage of growth of the plant and the microclimatic
conditions in which the plantation is located. Nevertheless, note that the similarity in
altitude, slope and aspect values between objects and pixels based box-plots (see Annex
80
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Discussion
2) shows that coffee plantations are fairly homogeneous: the altitude, slope and aspect
values of all pixels within the same object is roughly the same.
consideration that the coffee systems are diverse, and the sampling values might not
moisture and physiology of the cover over coffee, results in variations in the signature
However, the separability of the signatures also relies in the statistical method used to
analyze them. The loss of information involved in substituting ranks for original values
makes the non-parametric tests less powerful than their parametric counterpart. The
difference between samples is then accounted for ranks, instead of raw values, which
does not take into consideration the magnitude of the distances between the values (Field,
2000). Further, the Mann-Whitney test is unable to detect the difference in data
distribution thought it preserves the initial ordering of the original values. The median
value used by the Mann-Whitney test is not as representative as the mean value for a
normally distributed data. To partially overcome this problem, the analysis of the box
plot and the histograms is presented as an alternative to evaluate the separation between
features values for two land cover classes30. A drawback of the Mann-Whitney test is
that regardless of the set of data values, the mean and the variance of a set of distinct
30Although the box plot contains 75% o f the values, those are not necessarily the most common values (i.e.
the values with the higher frequency). An examination to the histogram is needed to assess this.
81
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Discussion
1997): all possible samples of size n, regardless whether a population is finite, yield a
parametric statistics computed with ranked data; however, substituting ranked data into
(Griffith, 1997), but its associated measures of the sampling variation do not come from
While the accuracy of the results can be affected by the overlap between land cover
classes, it is also affected by the image pre-processing. It has been shown (Smith et al.,
1980) that the Lambertian assumption may be more valid when analysis is restricted to
slopes of less than 25 degrees and effective illumination angles of less than 45 degrees.
This effect can be observed in the RMS error image (see fig.5-11), in which higher values
of error are located in areas with higher slopes. Additionally, significant radiometric
errors can be introduced as a result of the calculation of diffuse radiation during the
application of the terrain illumination correction. Ideally the model should be able to
consider the effects of slopes on diffuse irradiance and path radiance and diffuse sky
illumination as well. But it has been demonstrated that this correction is difficult to
The accuracy of the results can also be affected by the definition of objects in the
82
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Discussion
small fields. According to the literature, the different levels of segmentation must be
reduced to a few which are geometrical and semantically robust (Blaschke and Strobl
2001; Schiewe et ah, 2001). This problem focuses on meaningful objects; as often
transitions in land cover depend on scale of the defined objects of interest (Woodcock
Hierarchical Classification helps to focus the efforts in a fewer number of objects, but
parameters should be set for different objects. At greater scales, segmentation might
create an overrepresentation of larger areas at the cost of smaller features. Also, the
value of the scale parameter, product of the scale analysis is difficult to implement in the
software for the definition of objects. Although the target objects size is defined from
the analysis of the variogram, the resulting objects size in the segmentation is achieved
by trial and error, which undermines the efforts of the scale analysis.
With the creation of objects, the particularities of each unit as well as the dissimilarity
among units are smoothed (see Annex 3, band 2, for an example). As Wong (1996)
pointed out, the magnitude of the smoothing relies not only on the spatial autocorrelation,
but also upon the cross-correlation of variables in adjacent units. This effect was not
analyzed due to the lack of appropriate high spatial resolution data to analyze the objects
size in different bands and their cross-correlation. It constitutes one important issue for
83
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Discussion
After the creation and evaluation of objects, the accuracy obtained with the ruled-
based classification using objects could not be improved through spectral unmixing,
which, according to the literature review, was found to be a valuable resource to map
crops and features whose spatial resolution is lower than the pixel size. The spectral
unmixing algorithm used in this case (i.e. the linear mixture model) assumes a linear
response of all coverages within the pixel. It assumes that there is no multiple scattering
and photons interact with a single material only. This is not always true: first, some
coverages might have higher reflectance values that affect the response of a neighboring
sensor implies that the cover in the center of the pixels contributes more to the reflectance
of that pixel, than the perimeter. Finally, as mentioned by Fisher (1997) the adequacy of
any algorithm for land cover mapping is also affected by how the pixel, as a geographic
Nevertheless, despite the existence of many techniques for sub-pixel mapping, the
linear mixture model is the simplest and most used for practical reasons. Fuzzy
classifiers are also another alternative for sub-pixel mapping (Foody, 2004). They allow
for explicit recognition of the possibility that ambiguity might exist regarding the
appropriate map label for some locations; but the difficulty lies on defining the
membership functions for each land cover class (Gianneti et al., 2001).
31 The point spread function (PSF) describes the response of an imaging system to a point source or point
object. It has been analyzed in remote sensing to determine the optimum pixel size (Atkinson, 2004)
84
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Discussion
The accuracy of the results however is not limited to the analysis of the error matrix.
The spatial distribution of the GPS points collected in SICA32, overlaps fairly well with
the coffee systems in the land cover map. Those points provide a holistic view of the
Finally, as noted by Lambin (2001), the integration of GIS data into the image
information measured by the image (i.e. reflectance) with landscape attributes (such as
elevation) and field measurements. This fact also highlights the role of environmental
85
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Chapter 7
This research has evaluated the potential of remote sensing and GIS data to map
different types of coffee systems. Topographic attributes from a DEM were evaluated
and integrated with a Landsat image corrected by terrain and atmospheric effects. First, a
areas suitable to have coffee systems due to their spectral properties. The result of this
mask was divided in two land cover classes, coffee systems and miscellaneous (i.e. areas
with similar chlorophyll and moisture content to the coffee systems). The spectral
separation between both land cover classes was evaluated by analyzing their differences
in all bands. Second, objects were created using the minimum size determined by a
variogram analysis, and a ruled-based classification using objects and pixels was
compared.
and, for the coffee class specifically, the accuracy increased by 17%. Also, a visual
inspection to the maps confirms that the spatial distribution of the coffee class in the
classification map resembles the spatial distribution of the coffee fields in the area.
86
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Summary and Conclusions
Finally, in an attempt to refine the classification result to include the type of coffee
system, pure samples of coffee and forest were extracted from the image and the linear
spectral unmixing was applied. Due to the variability in moisture, chlorophyll and size of
the coffee fields and forest, the spectral unmixing did not improve the classification
result.
One of the main contributions of this methodology is the fact that it uses a
appropriate to map coffee fields and other type of crops, but given the cloudy conditions,
the long periods of revisit/programming of the satellite and the lack of funds to acquire
those images, high resolution data do not represent a feasible resource for crop mapping
in the tropics.
This research also contributes to understand the potentialities and limitation that
remote sensing data have for crop discrimination. With the use of ancillary GIS data
within the image interpretation it was possible to generate statistics within polygons and
associate those statistics to the land cover attributes and then, use them in the
classification rule.
cost-efficient methodologies to map coffee. The present study was designed to explore a
new approach, using a low-cost image and object-based analysis to produce maps that
show areas where coffee field are expected to be, due to certain topographic and
87
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Summary and Conclusions
radiometric conditions. The findings suggest that, in fact, topographic conditions and
spectral information can be used to get areas of coffee fields with and accuracy of 70%,
using only automatic procedures. Therefore, it is expected that this research contributes
with a methodology that can be used to find areas most likely to contain coffee fields,
resulting mask produced after the topographic and radiometric data has been analyzed
will serve for a better planning by providing a narrower geographic area were the
However, the evidence from this study suggests that, to map the different types of
coffee systems more research is needed. Coffee systems are extremely variable (with
respect to reflectance and topography) and their specific environmental conditions make
difficult to determine a specific signature for each one. Further research with a better
spectral sampling of the different coffee systems (i.e. using multispectral images with
higher spatial resolution or radiometric measures in the field) will clarify the specific
signature for each type, and will contribute to improve the results. Due to the cost and
the permanent cloudy conditions in areas with agroforestry and polyculture systems,
higher resolution images are difficult to obtain, and, although this became one of the
main limitations of this research, the findings provide a valuable tool to target those
areas.
88
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Ali, S. (2002). Use o f spectral and temporal unmixing fo r crop identification using
multi-spectral data. International Institute of Geo-information Science and Earth
Observation.
Apan, A., Held., A., Phin, S. and Markley, J. (2004). Detecting sugarcane orange rust
disease using EO-1 Hyperion Hyperspectral imagery. International Journal o f
Remote Sensing, 25 (2), 489-498.
Asrar, G.; Fuchs, M.; Kanemasu, E., Hatfield, J. (1984). Estimating absorbed
photosynthetic radiation and leaf area index from spectral reflectance in wheat
Agronomy Journal, 76(2), 300-306.
Atkinson, P. (2004). Resolution Manipulation and sub-pixel mapping in Van der Meer,
de Jong, S. Remote Sensing and Image Analysis: including the spatial domain
Springier
Atkinson, P., Curran, P. (1997). Choosing an appropriate spatial resolution for remote
sensing investigations. Photogrammetric Engineering and Remote Sensing, 63(12),
1345-1351.
Atkinson, P., Tate, N., (2000). Spatial Scale Problems and Geostatistical Solutions: A
Review. Professional Geographer, 52 (4), 607-623.
Bacon, C. (2005). Confronting the Coffee Crisis: Can Fair Trade, Organic and Specialty
Coffees Reduce Small-Scale Farmer Vulnerability in Northern Nicaragua?. World
Development, 33(3), 497-511.
Baatz, M., Benz, U., Dehghani, S., Heynen, M., Holtje, A., Hofman, P., Lingenfelder, I.,
Matthias, M., Sohlbacl, M., Weber, M., Willhauck G. (2004). Ecognition User Guide
Available at: http://www.definiens-imaging.com/down/trial.htm
Baret, F., Guyot, G. (1991). Potential and Limits of Vegetation Indices for LAI and
APAR assessment. Remote Sensingfo r Environment, 35, 161-173
89
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Becker, M., Congalton, R., Budd, R., Fried, A. (2004) A GLOBE Collaboration to
Develop Land Cover Data Collection and Analysis Protocols. Journal o f Science
Education and Technology, 7(1), pp. 85-96
Benediktsson, J.A., Swain, P. H., Ersoy, O.K. (1990) Neural Network approaches versus
statistical methods in classification of multisource data IEEE Transactions on
Geoscience and Remote Sensing, 28, 540-552.
Benz, U., Hofmann, P., Willhauck, G., Lingenfelder, I., Haynen, M. (2004) Multi
resolution, object oriented fuzzy analysis of remote sensing data for GIS-ready
information. ISPRS Journal ofPhotogrammetric and Remote Sensing, 58, 239-258
Blaschke, T., Strobl, J. (2001). Whats wrong with pixels?. Some recent developments
interfacing remote sensing and GIS. GIS-Zeitschrift fur Geoinformationssysteme, 6,
34-39.
Chavez, Jr., P.S. 1996. Image-based atmospheric corrections - revisited and improved.
Photogrametric Engineering & Remote Sensing, 62: 1025-1036.
Cheng, P., Toutin, Th., Tom, V. (2000) Orthorectification and data fusion of Landsat 7
data. ASPRS 2000, Washington DC, May 22-26.
Congalton, Russell G. (1999) Assessing the accuracy o f remotely sensed data: principles
and practices. London: Lewis Publications.
Croome, R. (1989). The Potential for Satellite Remote Sensing to monitor Coffee, Tea,
Cocoa and Coconut plantings in Papua New Guinea. Working Paper 2. Series:
90
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Daughtry, C.S., Walthall, C.L., Kim, M.S. (1992). Estimating corn leaf chlorophyll
concentration from leaf canopy reflectance. Remote Sensing o f Environment, 74,
229-239.
De Jong, S. and Van der Meer, F. (2004). Remote Sensing Image Analysis: Including the
Spatial Domain. Kluwer Academic Publishers.
Del Corral, M., Velasquez L., Ortiz, O., Hernandez, J., Arango P. (2002) Cuantification
de los Impactos Micro-Macroeconomicos y Sociales de la Crisis Cafetera en
Colombia. Publicaciones del Centro de Estudios Regionales, Cafeteros y
Empresariales CRECE.
Dietsh, T. Phillpot, D., Rice, R. Greenberg, R., Bichier, P. (2004). Conservation Policy in
Coffee Landscapes. Science vol. 303, January 2004. pp.625.
Doberman, A., Ping, J.L. (2004). Geostatistical Integration of Yield Monitor Data and
Remote Sensing Improves Yield Maps. Agronomy Journal, 96, 285-297
Eastman, R. (2001). Idrisi Guide to GIS and Image Processing. Clark Labs, Clark
University, 2001.
Evenson, R., Gollin, D. (2003). Assessing the Impact of the Green Revolution, 1960 to
2000. Science 300 pp.758
Fischer, A. (1994). A model for the seasonal variations of vegetation indices in coarse
resolution data and its inversion to extract crop parameters. Remote Sensing o f
Environment, 48, 220-230.
91
R eproduced w ith perm ission o f the copyright owner. F urther reproduction prohibited w ith o u t perm ission.
Bibliography
Fisher, P. (1997). The pixel: a snare and delusion. International Journal o f Remote
Sensing, 18 (3), 679-685.
Foody, G.M., Lucas, R., Curran, P., Flonzak, M. (1996) Estimation of the areal Extent of
land cover classes that only occur at a sub-pixel level. Canadian Journal o f Remote
Sensing, 22(4), 428-432.
Foody, G.M. (2004) Subpixel Methods in DeJong, S., and Van der Meer, F. Remote
Sensing Image Analysis: including the spatial domain. Kluwer academic Publishers.
Franklin, J., McCullough., Gray, C. (2000) Terrain Variables for predictive mapping of
vegetation Communities in Southern California in: Terrain Analysis, Principles and
Applications edited by Jhon Wilson and Jhon Gallant ed. Wiley 2000
Friedl, M.A., Brodley, C.E. (1997). Decision Tree Classification of Land Cover from
Remotely Sensed Data. Remote Sensing o f Environment, 61, 399-409.
Gianneti, F., Montanarella, L., Salandin, R. (2001). Integrated Use of satellite images,
DEMs, soil and substrate data in studying mountainous lands. JAG v.3(l) pp.25-29
Goodenough, D., Goldberg, M., Plunkett, Selek, J. (1987) An expert system for remote
sensing. IEEE Transactions on Geoscience and Remote Sensing 25, 349-359
Gresser, C., Tickell, S., (2002). Poverty in your cup [Online], Retrieved February 11,
2005. Oxfam International. Available:
Mtp vyww w .marketradefair.coni/assets/englisti/nHigged.pdf
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Grignetti, A., Salvatori, R., Casacchia, R. And Manes, F. (1997). Mediterranean vegetation
analysis by multi-temporal satellite data. International Journal o f Remote Sensing, 18,
1307-1318.
Guhl, A. (2004). Cafe y Cambio de Paisaje en la zona cafetera Colombiana entre 1970 y
1997. Revista CENICAFE 55(1) pp. 29-44.
Haboudane, D. M., Tramblay N., Z. (2002). Integrated narrow band vegetation indices
for prediction of crop chlorophyll content for applications in precision agriculture.
Remote Sensing o f Environment, 81,416-426.
Hatfield, J.L. (1983). Remote Sensing Estimation of Potential and Actual Crop Yield.
Remote Sensing o f Environment, 13,301-311.
Hinton, J. (1996). GIS and remote sensing integration for environmental applications.
International Journal o f Geographical Information Systems, 10 (7), 877-890.
Holben, B., Justice, C. (1981) An examination of spectral band rationing to reduce the
topographic effect on remotely sensed data. International Journal o f Remote Sensing.
Vol. 2, pp. 115-133.
Hoy os, S. (2002). Apertura Economica y Equidad: Los retos de Colombia en la decada
de los noventa. Biblioteca Virtual Banco de la Republica. Retrieved Jan. 04 2005.
Available: h ttp ://w w w .b a n re i3 .g o v .c o /b la a v irtu a l/le tra -c /c o lh o v /c o lo l6 .h tm .
Huguenin, R., Karaska, M., Blaricom, D., Jensen, J. (1997). Subpixel classification of
Bald Cypress and Tupelo Gum Trees in Thematic Mapper Imagery.
Photogrammetric Engineering and Remote Sensing, 63(6), 717-725.
Hutchinson, M., Gallant, J. (2000). Digital Elevation Models and representation of terrain
shape in Terrain Analysis, Principles and Applications edited by Jhon Wilson and
Jhon Gallant ed. Wiley 2000
Irish., 2002. Landsat 7 Science Data Users Handbook. NASA Goddard Spaceflight
Centre, MD. Retreived 23 may 2006. Available:
http://laiidsathandbook.gsfc.nasa.gov/handbook.html.
93
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Janssen, L., Jaarsma, M., Vander Linden, E. T. (1990). Integrating Topographic Data
with Remote Sensing for Land Cover Classification. Photogrammetric Engineering
and Remote Sensing, 56(11), 1503-1506.
Janssen, L., Molenaar, M. (1995) Terrain Objects, their dynamics and their monitoring
by the integration of GIS and remote sensing. IEEE Transactions on Geosience and
Remote Sensing, 33 (3) pp. 749-758
Justice, CO., Wharton, S.W., Holben, B.N. (1980). Application o f Digital Terrain Data
to quantify and reduce the topographic effect on Landsat Data. NASA Technical
memorandum 81988.
Law, K.H., Nichol, J.(2004) Topographic correction for differential illumination effects
on UCONOS satellite imagery. ISPRS. Available at
http: //www.i spr s. org/i stanbul2004/comm3/papers/347. pdf
Lee, James R. (2002). The Trade and Environmental Database, American University
School of International Service. Retrieved Feb. 11, 2005. Available:
htto ://vvw\v.american.ed/TED/SHADECOF.HTM
Lobell, D.B, Asner, G.P. (2003). Comparison of Earth Observing -1 ALI and Landsat
ETM+ for crop identification and Yield prediction in Mexico. IEEE Transactions on
Geoscience and Remote Sensing, 41 (6), 1277-1282.
Markley, J., Crossley, R.; Raines, A. (2003). The development and integration of remote
sensing, GIS and data processing tools for effective harvest management. Spatial
Sciences 2003. 12th Australasian Remote Sensing and Photogrammetry Conference,
Fremantale Australia
94
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Bibliography
Matoso, M., Silva, R., De Freitas, G .B ., Prieto H .E ., Ribeiro, S.L., L u iz F. (2004) Growth
a n d y ie ld o f coffee p la n ts in agroforestry a n d m o n o c u ltu re sy stem s in M in a s G erais,
B ra z il. Agrqforestry Systems, 63 pp. 7 5 -8 2
Meyer, P., Itten, K., Kellenberg, T., Sandmeier, S., Sandmeier, R. (1993). Radiometric
Corrections of topographically indiced effects on Landsat TM data in an alpine
environment. ISPRS Journal o f Photogrammetry and Remote Sensing, 48(4), 17-28.
Meyers, L., Gamst, G., Guarino, A.J. (2006) Applied Multivariate Research. Sage
Publications. London, UK.
Michalak, W. Z. (1993). GIS in land use change analysis: integration of remotely sensed
data into GIS. Applied Geography, 13, 28-44.
Moran, M.S., Inoue, Y., Barnes, E.M. (1997) Opportunities and Limitations for Image-
Based Remote Sensing in Precision Crop Management. Remote Sensing o f
Environment, 61, 319-346.
Moreira, M., Adami, M., Rudorff, B. (2004). Analise espectral e temporal da cultura do
cafe em imagens Landsat. Pesquisa Agropecuaria Brasileira, 39 (3), 223-231.
Moulin, S., Bondeau A., Delecolle, R. (1998). Combining Agricultural crop models and
satellite observations: from field to regional scales. International Journal o f Remote
Sensing, 19(6), 1021-1036.
Muller, D., Zeller, M. (2002). Land Use dynamics in central highlands of Vietnam: a
spatial model combining village survey data with satellite imagery interpretation.
Agricultural Economics, 27, 333-354.
Murai, H., Omatu, S. (1997) Remote Sensing Image Analysis using a neural network and
knowledge-based processing. International Journal o f Remote Sensing, 18(4), 811-
828.
95
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Nagarajan, R.; Mukheijee, A., Roy, A., Khire., M. (1998). Temporal Remote Sensing
data and GIS application in landslide hazard zonation of part of Western ghat, India.
International Journal o f Remote Sensing, 19 (4), 573-585.
Openshaw, S., Taylor, P.J. (1981). The modifiable aereal unit problem. In: Quantitative
Geography: A British view, pp.60-69. Ed. Wringley and Bennet. Routledge & Kegan,
LD.
Perfecto, I., Vandermeer, J., Mas, A., Soto-Pinto, L. (2005). Biodversity, Yield and
shade coffee certification. Ecological Economics, 54, 435-446.
Ponte, S. (2002). The Latte Revolution? Regulation, markets and consumption in the
global coffee chain. World Development, 30(7), 1099-1122.
Qi, J., Kerr, Y.H., Moran, M. S., Weltz, M., Huete, A. R , Sorooshian, S., Bryant, R.
(2000) Leaf Area Index Estimates Using Remotely Sensed Data and BDRF Models in
a Semiarid Region. Remote Sensing o f Environment 73, 18-30.
Quarmby, N.A., Townshed, J., Settle, J., White, K. (1992). Linear mixture modeling
applied to AVHRR data for crop area estimation. International Journal o f Remote
Sensing, 13(3), 415-425.
Rabus B., Eineder, M., Roth, A., Bamler, R. (2003). The shuttle radar topography
mission - a new class ,of digital elevation models acquired by spaceborne radar.
ISPRS Journal o f Photogrammetric Engineering and Remote Sensing, 57, pp.241-
262.
96
R eproduced w ith perm ission o f the copyright owner. F urther reproduction prohibited w ith o u t perm ission.
Bibliography
Rainforest Alliance (2000). The Conservation coffee campaign. Retrieved Feb. 11 2004.
Available: http://www.rainforest-alliance.org/news/2001/coffee-principles.html
Richter, R. (1997). Correction of atmospheric and topographic effects for high spatial
resolution satellite imagery. International Journal o f Remote Sensing, 18 (5), 1099-
1111 .
Roach J. (2003). Coffee Glut Brews Crisis fo r Farmers, Wildlife. National Geographic
News. Retrieved March 18 2005. Available:
http ://www.guiadelcafe!com/articl e.php? sid=254
Rondeaux., G., Steven, M., Baret, F., (1996). Optimization of Soil-Adjusted Vegetation
Indices. Remote Sensing o f Environment, 55, 95-107.
Schiewe, J., Tufte, L., Ehlers, M. (2001). Potential and problems of multi-scale
segmentation methods in remote sensing. GIS-Zeitschrifi fu r
Geoinformationssysteme, 6, 12-17.
Settle, J.J., Drake, N.A. (1993). Linear mixing and the estimation of ground cover
proportions. International Journal o f Remote Sensing, 14(6), 1159-1177.
Shapiro, S., Wilk, M B. (1965), An analysis of variance test for normality (complete
samples). Biometnka, Vol. 52, No. 3/4 (Dec., 1965), pp. 591-611
Shepherd, J., Dymond, J.R. (2003). Correcting Satellite imagery for the variance of
reflectance and illumination with topography. International Journal o f Remote
Sensing, 24 (17), 3503-3514.
Skidmore, A., Bijker, W., Schmidt, K., Kumar, L. (1997). Use of Remote Sensing and
GIS for sustainable land management. ITC Journal, 3 (4), pp.302-315
Sims, D.A., Garmon, J.A. (2002). Relationships between leaf pigment content and
spectral reflectance across a wide range of species, leaf structures and developmental
stages. Remote Sensing o f Environment, 81, 337-354.
97
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Smith, J.A., Lin, T.L., Ranson, K. J. (1980). The Lambertian Assumption and Landsat
Data. Photogrammetric Engineering and Remote Sensing, 46(9), 1183-1189
Smithsonian Migratory Bird Center (1997b). Shade Grown Coffee. Retrieved February
11,2005. Available: littp://Hasionalzoo.si.edii/('onservatioiiAndScience/MigralorvBirds/CoiTee
Sohn, Y., McCoy., R.M. (1997). Mapping Desert Shrub Rangeland Using Spectral
Unmixing and Modeling Spectral Signatures with TM Data. Photogrammetric
Engineering and Remote Sensing, 63(6), 707-716.
Stehman, S. 1992 Comparison of systematic and random sampling for estimating the
accuracy of maps generated from remotely sensed data. Photogrammetric
Engineering and Remote Sensing 58,9,1343
Susaki, J., Hara, K., Kajiwarea, K., Honda, Y. (2004) Robust estimation of BDRF model
parameters. Remote Sensing o f Environment, 89,63-71.
Teillet, P.M. (1986). Image Correction for radiometric effects is remote sensing.
International Journal o f Remote Sensing, 7 (12), 1637-1651
Tucker, C., Grant, D., Dykstra, J. (2004) NASAs Global Orthorectified Landsat Data
Set. Photogrammetric Engineering and Remote Sensing, vol. 20, No. 3, pp.313-322
Van der Meer (1995) Spectral Unmixing of Landsat Thematic Mapper data.
International Journal o f Remote Sensing, 16(16) 3189-3194
Van der Meer, F. (1997). Mineral Mapping and Landsat Thematic Mapper Image
Classification Using Spectral Unmixing. Geocarto International 12, 27-40.
98
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Van der Meer (1999). Image Classification through spectral unmixing in Spatial
Statistics fo r Remote Sensing, Stein et al. (eds). Kluwer Academic Publishers,
Netherlands
Van Niel, T., McVicar, T. (2004). Determining Temporal Windows for crop
discrimination with remote sensing: a case study in southeastern Australia.
Computers and Electronics in Agriculture, 45, 91-108.
Varangais, P., Siegel, P., Lewin, B., Giovannucci, D. Lewin, B. (2003). La Crisis
Cafetalera: efectos y Estrategias para hacerle frente [Online], World Bank Policy
Research Working Paper 2993. World Bank. Retrieved march 18 2005. Available:
http://www.cafedecolombia.coni/docs/ensavosl92003/crisiscafetaleraefectos.pdf
Vina, A., Gitelson, A., Rundisquist, D , Keydan, G., Leavitt, B., Schepers, J., (2004).
Monitoring Maize (Zea mays L.) Phenology with Remote Sensing. Agronomy
Journal, 96, 1139-1141.
Wall, S., Thomas, R., Brown, C. (1984). Landsat Crop Inventory System for Agriculture
in California. Remote Sensing o f Environment, 14, 267-278.
Walsh, S., Butler, D., Malanson, G. (1998). An overview of scale, pattern and process
relationship in geomorphology: a remote sensing and GIS perspective.
Geomorphology, 21, 183-205.
Wheatley, J., Wilson, J., Redmond, R., Ma, Z., Dibenedetto, J. (2000). Automated Land
Cover Mapping Using Landsat Thematic Mapper Images and Topographic Attributes
in: Terrain Analysis, Principles and Applications edited by Jhon Wilson and Jhon
Gallant ed. Wiley 2000
Widayati, A., Verbist B., Meijerink, A. (2002). Application of combined pixel-based and
spatial-based approaches for improved mixed vegetation classification using
IKONOS. Paper presented at the XII Asian Conference on Remote Sensing,
Bangkok, Thailand, August 17-21
Wilson, J. P., Gallant, J. (2000) Terrain Analysis: Principles and Applications. John
Wiley & Sons, Inc.
99
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Bibliography
Woodcock, C., Strahler, A. (1987). The factor of scale in remote sensing. Remote
Sensing o f Environment, 21,311-332.
Woodcock, C., Strahler, A., Jupp, D. (1988a). The Use of Variograms in Remote
Sensing: Scene Models and Simulated Images. Remote Sensing o f Environment, 25,
323-348.
Woodcock, C., Strahler, A., Jupp, D. (1988a). The Use of Variograms in Remote
Sensing: Real Digital Images. Remote Sensing o f Environment, 25, 349-379
Wrigley, N., Holt, T., Steel, D., Tranmer, M. (1996) Analyzing, modeling and resolving
the ecological fallacy in Longley, P., Batty, M.: Spatial Analysis: modeling in GIS
environment John Wiley and Sons.
Xiuwan, C. (2002). Using Remote Sensing and GIS to analyze land cover change and its
impacts on regional sustainable development. International Journal o f Remote
Sensing, 23 (1), 107-124.
Zeleke, T. B., Si, C.B. (2004). Scaling Properties of Topographic Indices and Crop
Yield. Multifractal and Joint Multifractal Approaches. Agronomy Journal, 96, 1082-
1090.
Zwiggelaar, R. (1998). A review of spectral properties of plants and their potential use
for crop/weed discrimination in row-crops. Crop Protection, 17(3), 189-206
100
R eproduced with perm ission o f the copyright owner. Further reproduction prohibited w ith o u t perm ission.
Annex 1
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 2
Coffee Non-Coffee
102
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 2: Data Distribution
86 -
u.I
16-
I
ojraeao 9mm o.wwo o.orooo o.iwoa ft.iaooo a,p
m
Mean = 0.060 Mean = 0.048
Std. Dev. = 0.016 Std. Dev. = 0.005
Coffee Non-Coffee
a.
M m
Mean = 0.316 Mean = 0.363
Std. Dev. = 0.054 Std. Dev. = 0.048
103
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 2: Data Distribution
Coffee Non-Coffee
*
48
U
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 2: Data Distribution
w
u.
Coffee Non-Coffee
U.
105
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 2: Data Distribution
20-
C
#
3cr
i
r rr i innr
I
0-f-
I.- . . *
i }
JU JU
...
.J J
J
it
D
if 3lP*
Mean = 17 Mean = 17
Std. Dev. = 8.6 Std. Dev. = 10.7
Coffee Non-Coffee
It
HI
m
wm
c.at icyiL s u jtij jl' j o
as pac t
asp ect
106
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3
Pixels Objects
2m
m
HB'JS
107
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
class
108
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
eoffii
109
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
coffee Non-coffee
cjites
c la s s
lar.i ex.
RHtyt S#5
110
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
class
f J T- 'iff*#
Ill
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
112
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
O414
sw
l3.'5S
220D.CC
l inck.m
s
sUss ela&a
113
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
iCjO"'
0
It' in
DO.03
els s s
114
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 3: Box Plots
C'^nsriy
115
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 4
Normality Test
a) Pixel values
Shapiro - W ilk
Feature Class Statistic Df. Sig.
Band 2 CofFee- .76 280 .00
Non-coffee .98 300 .00
B and 3 CoFfee- .87 280 .00
Non-coffee .97 300 .00
B and 4 CofFee- .99 280 .26
Non-coffee .99 300 .10
B and 5 CoFFee- .98 280 .00
Non-cofFee .99 300 .02
B and 7 CofFee- .94 280 .00
Non-cofFee .99 300 .08
T em perature CofFee- .99 280 .03
Non-cofFee .99 300 .09
Elevation CoFfee- .98 280 .00
Non-coFFee .95 300 .00
Slope CofFee- .97 280 .00
Non-coFfee .96 300 .00
A spect CoFfee- .98 280 .00
Non-coFfee .95 300 .00
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 4: Normality Test
b) Objects Values
Shapiro - W ilk
Feature Class Statistic Df. Sig.
M ean B2 Coffee .80 280 .000
Non-coffee .75 300 .000
M ean B3 Coffee .84 280 .000
Non-coffee .74 300 .050
M ean B4 Coffee .98 280 .002
Non-coffee .98 300 .000
M ean B5 Coffee .96 280 .000
Non-coffee .96 300 .000
M ean B7 Coffee .89 280 .000
Non-coffee .89 300 .000
M ean Tem perature Coffee .99 280 .071
Non-coffee .99 300 .16
M ean Elevation Coffee .98 280 .00
Non-coffee .95 300 .00
M ean Slope Coffee .96 280 .00
Non-coffee .97 300 .00
M ean Aspect Coffee .98 280 .00
Non-coffee .97 300 .00
StD ev. B2 Coffee .19 280 .00
Non-coffee .86 300 .00
StD ev. B3 Coffee .40 280 .00
Non-coffee .08 300 .00
StD ev. B4 Coffee .76 280 .00
Non-coffee .63 300 .00
StD ev. B5 Coffee .65 280 .00
Non-coffee .57 300 .00
StD ev. B7 Coffee .63 280 .00
Non-coffee .53 300 .00
StD ev. Tem perature Coffee .96 280 .00
Non-coffee .94 300 .00
StD ev. Elevation Coffee .80 280 .00
Non-coffee .89 300 .00
StD ev. Slope Coffee .94 280 .00
Non-coffee .95 300 .00
StD ev. A spect Coffee .72 280 .00
Non-coffee .87 300 .00
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 4: Normality Test
Shapiro - W ilk
Feature Class Statistic Df. Sig.
M in b2 Coffee .81 280 .00
Non-coffee .77 300 .00
M ax b2 Coffee .82 280 .00
Non-coffee .75 300 .00
M in b3 Coffee .80 280 .00
Non-coffee .73 300 .00
M ax b3 Coffee .88 280 .00
Non-coffee .82 300 .00
M in b4 Coffee .98 280 .00
Non-coffee .98 300 .00
M in b5 Coffee .95 280 .00
Non-coffee .97 300 .00
M ax b5 Coffee .97 280 .00
Non-coffee .97 300 .00
M in b7 Coffee .87 280 .00
Non-coffee .90 300 .00 .
M ax b7 Coffee .99 280 .02
Non-coffee .98 300 .00
M in Temp Coffee .99 280 .02
Non-coffee .98 300 .00
M ax Temp Coffee .99 280 .02
Non-coffee .99 300 .01
M in Elevation Coffee .98 280 .00
Non-coffee .95 300 .00
M ax Elevation Coffee .98 280 .00
Non-coffee .95 300 .00
M in Slope Coffee .92 280 .00
Non-coffee .90 300 .00
M ax Slope Coffee .98 280 .00
Non-coffee .98 300 .00
M in Aspect Coffee .97 280 .00
Non-coffee .83 300 .00
M ax Aspect Coffee .95 280 .00
Non-coffee .88 300 .00
118
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.
Annex 4: Normality Test
Feature values:
Shapiro - Wilk
Feature Class Statistic Df. Sig.
Area Coffee .784 280 .00
Non-coffee .861 300 .00
Length Coffee .866 280 .00
Non-coffee .936 300 .00
Width Coffee .904 280 .00
Non-coffee .909 300 .00
Length/Width Coffee .855 280 .00
Non-coffee .954 300 .00
Compactness Coffee .944 280 .00
Non-coffee .967 300 .00
Border Length Coffee .797 280 .00
Non-coffee .871 300 .00
Shape Index Coffee .882 280 .00
Non-coffee .922 300 .00
Density Coffee .982 280 .00
Non-coffee .994 300 .31
Asymmetry Coffee .968 280 .00
Non-coffee .970 300 .00
Perimeter Coffee .857 280 .00
Non-coffee .873 300 .00
119
R eproduced with perm ission o f the copyright owner. F urther reproduction prohibited w itho ut perm ission.