Lecture 4: Travelling through Space and Time
Course co-ordinator: Johannes M. Zanker, j.zanker@rhul.ac.uk, (Room W 214)
Mobius Belt II M.C.Escher, 1963
the world has three spatial and one temporal dimension
|
|
|
|
| Bacchus
and Ariadne Titian, 1522 National Gallery, London |
![]() |
the third dimension
needs to be reconstructed from the flat images captured by the eyes
|
a multitude of cues can be used in real world (and some of them, 'pictorial cues', are used by painters):
a wide range of depth information can be directly extracted from a static monocular (& natural) image (Gibson 1979)
why do we have two eyes?
apart from extending the visual field, the
combination of information from the two eyes allows precise depth measurements
through stereopsis
the left eye view and right eye view different images; they can be combined
to retrieve stereo and oculomotor signals as depth cues !
and this is how stereopsis works
(for the full story, see Julesz 1971, and Marr & Poggio 1979):
consider an observer fixating the horizon (close to infinity) : parallel optical
axes of left and right eye
the retinal projection of an object on opposite sides of the fovea (indicated disparity angle delta, delta') indicates its depth relative to the plane of fixation![]()
stereopsis is exploited to
produce depth impressions in projected/printed pictures
(Wheatstone stereoscope, red-green
anaglyphs, lenticular stereocards)
single image sterograms (Julesz & Miller1962, Tyler & Clarke 1990, Thimbleby
et al. 1992) were commercialised by Tom Baccei and Cheri Smith in 1991: magic
eye, more magic eye …
(try the flying sausage experiment)
|
|
|
|
|
|
conversely, constant angular size (the two kangaroos have the same size in the image) may be interpreted as difference in object size (the closer kangoroo looks smaller than the distant kangoroo): constancy scaling |
|
the size constancy effect is sometimes believed to be the basis of the Ponzo illusion : the closer red bar between the converging rails looks smaller than the more distant red bar... |
|
In the Ames Room, even the size of a familiar object (such as a person) is perceived largely distorted, because the misleading geometry generates a incorrect frame of reference | ![]() >> top view of room geometry << |
the size illusion in the Ames Room is a case in point for constructivist theories of perception: the knowledge of the rules of perspective and the assumption of rectangular architecture force the visual system to construct the apparent size difference (this way of drawing conclusions about what is seen in the retinal images, is called 'unconscious inference', see Gregory 1998)
(however, think about the following: with the same theory one could also argue the other way round: why are we not using our knowledge about body size to reconstruct veridical, non-rectangular geometry of the room?)
for instructions to build your own Ames room, and more information about this illusion, click here
![]() |
|
from the Max Planck Institut für biologische Kybernetik in Tübingen, with kind permission by H Bülthoff, (see also Gregory 1998)
for instructions to build your own hollow face illusion, click here
how can a ‘realistic’ depth sensation be produced in the cinema, on a flat screen ??
Visit the Science Museum, or IMAX® Cinema for this extraordinary experience
To understand motion we need a bit of minimal physical background, explained
by a simple example:
Consider an astronomer who is watching the night sky to find stars...
|
she might see two stars next to each other : they appear at a spatial distance |
|
she might see a star appearing, dis-appearing, reappearing: it is visible in temporal intervals |
|
she might see a star moving : it is changing position in space and time; this is motion (in space-time: x-t-dagram, motion is characterised by orientation, oblique trajectories) |
motion perception is a prime example for the study of brain function with a mixture of neuroscientific methods: physiology, psychophysics, and computational modelling all contribute to a comprehensive understanding of the fundamental processing mechanisms
If we look out of window and watch cars passing by – how could we illustrate this observation as still image ?
a motion detector has the task to assess displacement as function of time (the phiscal definition of motion): technically, this is called spatio-temporal correlation (and can be illustrated as orientation filter in space-time) (see Reichardt 1961, Borst & Egelhaaf 1989)
![]() |
minimum requirements for a computational model (elementary motion detector = EMD):
|
|
|
|
|
it was know since a long time and used for props at country fairs, etc: the
'zoetrope' (for some activity, see also
here) - a pioneer in
this area was Eadweard
Muybridge who in some way invented cnema (visit the Museum
in Kingston!)
it was 'discovered' experimentally by Exner (1877),
who demonstrated that motion is an independent
sensation in space and time
|
a typical phenomenon:
rapid alternations between two dots in opposite corners of a virtual square
can be perceived as jumping up & down, or as jumping left & right |
|
vertical proximity or horizontal proximity can resolve this ambiguity
>>> Gestalt laws of perceptual organisation:
'proximity' 'common fate' (Wertheimer 1912)
discontinuity in apparent motion stimuli >> ambiguity >> need to
identify objects in successive frames :
solving the 'correspondence problem'
|
|
the appearance of a motion-defined object is sometimes called 'structure from motion' (see Ullman 1979)
according to Gestalt Theory, perception is not just passive image acquisition,
but is an active process to create meaningful percepts:
'laws’of perceptual
organization generate ‘good shapes’ (Wertheimer 1912) -
Gestalt Psychology (Koffka
1935)
|
![]() |
|
|
|
|
|
|
|
|
|
|
after you fixate the centre of a rotating spiral (generating a sensation of expansion) for 2 minutes ... what happens when you look at HM's on the coin ? >>>> (static objects seem to contract) |
|
this effect can be understood as the result of adaptation and opponency mechanisms,
similar to colour
aftereffecrts
classical case for such dynamic afterimages : Waterfall Illusion
(for a brief history, see Wade 1994)
why does the rotating spiral (in the panel shown above) appear to expand ?? it is rotating, not expanding, after all !
consider the motion of an individual contour - it is ambiguous in an aperture (receptive field, indicated by red circle on the right) : from the restricted view you dont't know whether it moves horizontally or vertically - the motion direction of a contour is under-determined within small regions !!![]()
![]()
the most likely solution (direction perpendicular to the line) is perceived
| superimposing 2 component gratings moving in perpendicular directions |
||
![]() |
![]() |
![]() |
| what do you see in such a motion plaid ? (Adelson & Movshon 1982) |
||
(in coloured plaids, for instance you can perceive both directions: gratings slide across each other - this is called 'transparency')
![]() Old Woking High
Street |
|
the shape of a particular aperture forces the visual system to adopt a particular solution when integrating ambiguous motion signals from the central regions of the aperture and disambiguated motion signals from the boundary regions (Castet & Zanker1999)
|
|
| the classical barberspole configuration | a recent version (JMZ, unpublished) |
to download a pdf copy of lecture summary, click here
to download a pdf copy of lecture slides, click here
back to course
outline
last update
21-10-2011
Johannes
M. Zanker