Understanding Motion Perception in Computational Vision

Slide Note
Embed
Share

In computational vision, the concept of motion opponency plays a crucial role in how the brain processes left and right motion inputs. By examining psychophysical results and the construction of motion opponent energy filters, we explore how the brain handles motion information. Additionally, the Velocity Extraction Problem and the Reverse Phi Effect highlight the complexities of extracting velocity information from visual inputs. The phenomenon of Fluted Square Waves further illustrates how motion perception can be influenced by signal processing in the brain.


Uploaded on Nov 25, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Computational Vision CSCI 363, Spring 2023 Lecture 17 Computing Motion 1

  2. Motion-Opponency Psychophysical results suggest that neurons in the brain use a motion-opponent processing (e.g. left - right). Evidence: 1) We cannot see both left and right motion at the same time. If you superimpose a leftward moving sinewave pattern on a rightward moving sinewave pattern, you don't see motion, you see flicker. 2) Motion after-effect: If you adapt to rightward motion, and then look away at static image, you see leftward motion (The waterfall illusion). Demo: http://www.michaelbach.de/ot/mot_adapt/index.html 2

  3. Construction of Motion Opponent energy filters To construct a motion-opponent energy filter, simply subtract the response of a filter tuned to left motion from that of a filter tuned to right motion (or vice versa). 3

  4. Velocity Extraction Problem: Velocity information confounded with contrast. E.g. A weak signal could mean low contrast or low velocity. Solution: Compare the output of different motion channels. (E.g. Left, right and static channels. Change in contrast => Ratio between channels stays the same Change in velocity => Ratio between channels changes. 4

  5. Reverse Phi If a pattern of white and black lines is moved rightward in small steps, people see rightward motion. If the contrast is reversed with each step (white becomes black and vice versa), people see leftward motion. (The Reverse Phi Effect) Demo: https://i.gifer.com/GvPs.gif Energy White energy => rightward motion. Move pattern in steps. Reverse Phi: Move pattern in steps while reversing contrast Dark energy => leftward motion. 5

  6. Fluted Square Wave A square wave that is shifted to the right in 90 deg steps, appears to move right. 90 deg step to right A square wave with the fundamental frequency component removed is a fluted square wave. The highest amplitude component is 3f. When the fluted square wave is shifted to the right in 90 deg steps, it appears to move left! 6

  7. Fluted Square Wave Why? When a square wave that is shifted to the right in 90 deg steps, its fundamental frequency moves right in 90 deg steps. For a fluted square wave, the highest amplitude component is 3f. When the square wave (frequency f) moves 90 deg to the right, the 3f component is being shifted 270 deg to the right, which appears as 90 deg to the left. 7

  8. Energy Response to a fluted square wave x Energy White = Right t Square wave Fluted Square wave Black = Left 8

  9. Moving Plaid Demo Demo of a moving plaid grating: = + Demo: https://www.youtube.com/watch?v=g_sn0WtHK1g 9

  10. Motion Energy for 2D images For a 2D image, we use a 3-D gabor filter: Selects frequency range within an ovoid in spatio-temporal frequency space: tf sfx sfy 10

  11. Velocity lies on a Plane For a 1D image, all measurements of the same velocity lie along a line in SF-TF space (because v = TF/SF) tf sf For a 2D image, all measurements of the same velocity lie on a plane in SF-TF space. tf Find the plane by making multiple measurements and finding best fit. sfx sfy 11

  12. Extra-striate visual areas Folded Cortex Flattened Cortex 12

  13. Dorsal and Ventral Streams 13

  14. Evidence for two processing streams Evidence for separate streams of processing comes from three areas: 1) Lesion studies. Lesions in the ventral areas cause selective deficits in color and orientation discrimination abilities. They can also cause deficits in object or face recognition. Lesions in the dorsal areas cause selective deficits in judgments of motion (e.g. speed or direction). Can also cause deficits in localization of objects. 2) Psychophysics: Hard to see motion at "isoluminance". 3)Connection patterns: Parvocellular->4C ->Superficial cortical layers (color and form) Magnocellular->4C ->4B-> MT 14

  15. Motion Processing in V1 In V1, some simple cells and complex cells are tuned to direction of motion. I.e. they respond most strongly to motion in a given direction and their response falls off as the motion deviates from that direction. Tuning for 180 deg Firing Rate 120o 240o 180o Direction of Motion Direction Tuning Polar Plot (tuning for zero deg) 15

  16. V1 neurons tuned to temporal frequency V1 neurons appear to be tuned to temporal frequency. Their preferred speed depends on the spatial frequency of the pattern. v = t/ x Firing Rate Temporal Frequency Neurons in V1 behave like motion energy filters. 16

  17. Motion Processing in MT MT (The Middle Temporal Area) is thought to be important for processing motion information. Characteristics of MT neurons: 1) Cells tuned for direction of motion (more broadly tuned than V1 cells. 2) Cells tuned for speed. (Some cells specifically tuned for speed. Not dependent on spatial frequency). 3) Large receptive field sizes. (Some are 100x bigger than V1 receptive fields). They range from 1-2deg in diameter in the foveal region and increase in the periphery. 17

  18. Speed Selectivity McKee and Nakayama have shown that people are very good at discriminating two different speeds independent of spatial frequency. The Weber fraction gives a measure of how big a change in speed is necessary to distinguish two different speeds. It is fairly constant over a broad range of speeds: V/V = .05 MT may be the area that first computes speed independent of spatial frequency. 18

  19. Direction Selectivity V1 cells are sensitive to the direction of motion of the spatial frequency components of a stimulus. For example, in plaid stimuli a V1 cell will respond when either sine wave is moving in its preferred direction, but will not respond for the pattern motion in its preferred direction. Some MT cells respond to pattern motion. They respond best when the pattern motion of a plaid is in their preferred direction. 20% of MT cells respond to the pattern motion. 40% respond to component motion. 40% are in between. 19

  20. Responses to Plaids Moving plaid: = + Response to pattern motion MT response V1 response 20 (20% of cells)

  21. Motion opponency MT cells appear to exhibit motion-opponency in their receptive fields: + This has implications for motion transparency. - - Many MT cells have an inhibitory surround. + Motion in the surround inhibits the response to motion in the center. The inhibitory surround may be involved in: Figure-ground segmentation based on motion. Motion parallax Heading judgments. 21

  22. Evidence that MT processes motion 1. Cells in MT prefer moving stimuli to static stimuli. 2. Lesions of MT cause loss of ability to judge motion direction: Newsome et al. performed an experiment to test this in monkeys. Stimulus: Moving dots--Some percentage move in a coherent direction (correlated dots), the rest move in random directions Task: Judge direction of motion (e.g. up vs. down). Measure: Percent correlation to judge the directions of motion. Result: After lesion of MT, monkeys require a greater percentage of correlated dots to make the discrimination (i.e. they were worse at the motion task). Another piece of evidence for MT being involved in motion comes from experiments in which MT cells are electrically stimulated with a micro-electrode (microstimulation). Salzman & Newsome (1994) showed that they could influence a monkey's perception of motion by stimulation of cells in MT.

  23. 2D Motion is just the Beginning 2D image motion contains information about: Relative depth of surfaces 3D motion of objects 3D structure of objects Direction of observer motion Among other things. 23

More Related Content