Академический Документы
Профессиональный Документы
Культура Документы
4 Conclusion
I have argued that object identication is not an adequate characterisation of the problem of vision,
and that too much emphasis has been placed on object identication within computer vision.
The problem of vision consists of a multiplicity of tasks, of which object identication is only
one. Others include visually guided behaviours such as walking, grasping, climbing. By continuing to
concentrate eorts on object identication there is a danger that the formulation of the problem (of
object identication) will provide methods which are not of general utility for vision, and which may
not even be of use for object identication.
Object identication is an integral part of the general problem of vision. Consequently it is likely
that solutions to the problem of vision and object identication share a common set of computational
mechanisms. A broader denition of computer vision ensures that computer vision will be useful for
general visually guided behaviours, as well as for identication of objects.
I have suggested that one way to usefully broaden the denition of the problem of vision is to
consider the use of spatio-temporal cues, not only as a means of estimating the atemporal structure
of a scene, but directly as cues for accomplishing particular visual tasks which include (but are not
necessarily uniquely associated with) object identication.
Acknowledgements: Thanks to Stephen Isard for comments on longer versions of this paper, and
for suggesting Tinbergen's experiment as an example of the use of a spatio-temporal stimulus. Thanks
to Raymond Lister and Helen Peddington for useful discussions on drafts of this paper. Thanks also
to Mark Lee, Marcus Rodrigues, David Cli and Inman Harvey for comments on an earlier draft of
this paper.
This work was undertaken as part of an MRC/JCI grant awarded to Mark Lee at the Department
of Computer Science, University of Wales, Aberystwyth.
7
References
[1] DH Ballard and CM Brown. Computer Vision. Prentice-Hall Inc., New Jersey 07632, 1982.
[2] S Zeki. The visual image in mind and brain. Scientic American, 267(3), Sept. 1992.
[3] P Cavanagh. Multiple analyses of orientation in the visual system. In Neural Mechanisms of Visual
Perception, Lam, D and Gilbert, C (Eds)), pages 261{279, 1989.
[4] DH Ballard. Animate vision. Articial Intelligence, 48:57{86, 1991.
[5] RA Brooks. A robust layered control system for a mobile robot. IEEE J. Robot Autom., 2:14{23, 1986.
[6] S Watanabe. Knowing and Guessing. J Wiley and Sons, 1969.
[7] DH Hubel and TN Wiesel. Receptive elds, binocular interaction, and functional architecture in the cat's
visual cortex. Journal of Physiology, 160:160{154, 1962.
[8] HB Barlow. Single units and sensation: A neuron doctrine for perceptual psychology? Perception,
1:371{394, 1972.
[9] SR Lehky and TJ Sejnowski. Neural network model of visual cortex for determining surface curvature
from images of shaded images. Proc R. Soc. London (B), 240:251{278, 1990.
[10] DI Perrett, E Rolls, and W Caan. Visual neurones responsive to faces in the monkey temporal cortex.
Exp. Brain Res., 47:329{342, 1982.
[11] I Fujita, K Tanaka, M Ito, and K Cheng. Columns for visual features in monkey inferotemporal cortex.
Nature, 360(26):343{346, 1992.
[12] MA Georgeson. Human vision combines oriented lters to compute edges. Proc. Roy. Soc. London, (B),
249:235{245, 1992.
[13] GE Hinton. Shape representation in parallel systems. In Proc 7th IJCAI, Vancouver BC, pages 1088{1096,
1981.
[14] DH Ballard. Parameter nets. Articial Intelligence, 22:235{267, 1984.
[15] BL Whorf. Language, Thought, and Reality. MIT Press, New York, 1956.
[16] JHR Maunsell and WT Newsome. Visual processing in monkey extrastriate cortex. Ann. Rev. Neuro-
science, 10:363{401, 1987.
[17] M Mishkin, LG Ungerleider, and KA Macko. Object vision and spatial vision: Two cortical pathways.
Trends in Neurosciences, 6:414{417, 1983.
[18] D Marr. Vision. Freeman: New York, 1982.
[19] D Lowe. Perceptual Organisation and Visual Recognition. Kluwer Academic Publishers, Boston MA, 1985.
[20] Bray, A J. Recognising and Tracking Polyhedral Objects. PhD thesis, School of Cognitive and Computing
Sciences, University of Sussex, UK, 1990.
[21] S Edelman and H Bultfo. Viewpoint-specic representations in three-dimensional object recognition.
MIT AI Memo No. 1239, 1990.
[22] T Poggio and S Edelman. A network that learns to recognize three-dimensional objects. Nature, 343:263{
266, 1990.
[23] N Tinbergen. The Study of Instinct. Clarendon), Oxford University Press, 1951.
[24] G Johansson. Visual perception of biological motion and a model for its analysis. Perception and Psy-
chophysics, 14:201{21, 1973.
[25] G Mather, K Radford, and S West. Low-level visual processing of biological motion. Proc. Roy. Soc.,
249:149{155, 1992.
[26] D Perrett, M Harries, AJ Mistlin, and AJ Chitty. Three stages in the classiication of body movements
by visual neurons. In Images and Understanding, Barlow H, and Blakemore C, Weston-Smith, M (Eds),
pages 95{107, 1990.