The digital computer is an inappropriate model of the visual perceiver." Discuss

Authors Avatar

The digital computer is an inappropriate model of the visual perceiver." Discuss

Some theories of vision are overtly computational, while others deal in concepts which appear to defy implementation on a computer. By considering, among other things what you consider to be the successes and failures of these apparently different approaches in describing the observed properties of visual perception, assess how far you consider the above assertion to be justified.

____________________________________________________________________

 

"The digital computer is an inappropriate model of the visual perceiver."

"Perception refers to the means by which information acquired from the environment .....is transformed into experiences of objects, events etc." (Roth, 1986). In the context of this definition it is possible to argue that a digital computer, devoid of consciousness could never be an appropriate model of the visual perceiver as such. It seems unlikely that anyone should wish to term a series of binary inputs and outputs to a piece of some semi-conductor, however intricate, as experiences in the same sense as we might to a human. However, to put aside the philosophical issues, human visual perception is undoubtedly an astonishing feat. The very fact that it has proven so hard to implement a mechanical visual system comparable to that which we possess bears witness to the scope of the problem. To construct a robot that can simply navigate its way around a novel environment without crashing into things is quite an achievement and yet we fail to see anything remarkable in our in-depth understanding of the visual information presented to us. Millions of times a day this information is modified and updated and yet it goes almost completely unnoticed that what we are taking for granted is in fact an extremely complex operation. But consider for a moment what processes are involved. Consider what processing must be going on which can tell us not only what it is that we are seeing but can also give us clues as to spatial relationships, size and distance and allow us to make inferences concerning structure and property, regardless of semi-occlusion, varying angles of observation and unusual illumination sources. But is the ability to make sense of the real world in a similar way to ourselves really something that machines are incapable of? With the technology that we possess is it not possible to model the human visual system?

Clearly the problem is not one of a lack of information. Television cameras can be considered to be an accurate enough representation of the input which our eyes afford us. Effects of binocular disparity cues can be overcome with the use of more than one camera and we can easily enable these cameras to be mobile to simulate any movements that we may make when assessing a scene. What then is the cause of our inability to model the way we see? The problem we are faced with seems to be how this information is analysed by our brains. How do we transform a pattern of electrical signals generated by the excitation of photoreceptors on a 2-D surface into a 3-D image of the world around us? The interpretation of this information is the area in which the controversy reigns and is where the answer to the problem will be found, if found it can be.

There are, as mentioned earlier, several schools of thought as to how the visual information is processed, some of these seeming diametrically opposed to others. With the notable exception of Gibson's (1950) theory of the importance of optic flow in visual perception, most theorists have worked from the principle of an image on the retina which is then broken down for analysis by feature extraction and pattern detection etc., although there is some disagreement within this camp as to whether this analysis is based on hard-wired knowledge or from memory and experience. Whilst all agree that some level of inferencing is necessary for the perceiver to make sense of the world, the constructive theorists argue for the 'top - down' approach and emphasise the importance of world knowledge in our understanding of what we see. They claim that we acquire knowledge of the world through our experiences and that these can then be drawn upon to assist the inferencing process when analysing the visual input, c.f. Gregory's (1972) claim that perceptions are constructions from "fragmentary scraps of data ....drawn from the brain memory banks, themselves constructions from the snippets of the past." If the ideas of the constructivists such as Gregory and Bruner are to be believed then it seems not intuitively unreasonable that our perception may be prone to errors caused by our expectations. Indeed, Gregory has claimed support for his theories from several forms of visual illusion, for example the Ames room and the Muller - Lyer illusion. Gregory posits a misapplied size - constancy theory to explain the Muller - Lyer illusion in which two equal length vertical lines are perceived as being of different lengths because of other contextual clues. He argues that the two lines are seen as 3 -D objects and can be thought of as the inside and outside corners of a room. The inside corner is known by experience to be further away from the observer than an outside corner, hence is considered longer and hence the illusion. Whilst this explanation seems ingenious, the constructivists arguments seem questionable when we consider how rarely our perception is in error, when the level of inferencing proposed here would suggest that far more perceptive errors would be made. Further, the Muller - Lyer illusion is not universally seen as a 3 - D construct as Gregory suggests and no explanation is offered for this anomaly.

Join now!

Whilst the constructive theories may then have some limited role in our perception it appears that their view is too simplistic and unable to cope with all the observed phenomena. We turn then, to an idea which contrasts with the ideas of the constructivists, the direct perception theory. Direct theorists argue in favour of 'bottom - up' processing, where little world knowledge is required in order to make sense of what is being seen. They suggest that general constraints are used when analysing the visual input, inferencing only playing a minor role when perceptual ambiguities occur. One of the most ...

This is a preview of the whole essay