![]() | ![]() |
Formats:
|
||||||||||||||||||
Figure-ground mechanisms provide structure for selective attention Krieger Mind/Brain Institute and Department of Neuroscience, Johns Hopkins University, 3400 N Charles Street, Baltimore, MD 21218 Correspondence: Rüdiger von der Heydt, Krieger Mind/Brain Institute, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, Phone: 410 516-6416, Fax: 410 516-8648, E-mail: von.der.heydt/at/jhu.edu The publisher's final edited version of this article is available at Nat Neurosci. See commentary "Out of the spotlight: face to face with attention." in Nat Neurosci, volume 10 on page 1344. See other articles in PMC that cite the published article.Abstract Attention depends on figure-ground organization: figures draw attention, while shapes of the ground tend to be ignored. Recent research has demonstrated mechanisms of figure-ground organization in the visual cortex, but how they relate to the attention process remains unclear. Here we show that the influences of figure-ground organization and volitional (top-down) attention converge in single neurons of area V2. While assignment of border ownership was found for attended as well as for ignored figures, attentional modulation was stronger when the attended figure was located on the neuron’s preferred side of border ownership. When the border between two overlapping figures was placed in the receptive field, responses depended on the side of attention, and enhancement was generally found on the neuron’s preferred side of border ownership. This correlation suggests that the neural network that creates figure-ground organization also provides the interface for the top-down selection process. Perception tends to segregate visual images into figures and ground, and process the figure regions, but not the ground regions (Fig. 1
Results We studied the responses of neurons in area V2 under conditions when monkeys performed a shape discrimination task that required selective attention (Fig. 2d–f In one set of experiments, the three figures were separated (Fig. 2d 1. Border ownership assignment in the absence of attention The purpose of the first experiment (Fig. 2d
How does attention affect border ownership modulation in the cells in which both influences converge (cross hatched sector in Fig. 3a We conclude from this experiment that border ownership signals can emerge without the influence of attention and that the overall strength of border ownership modulation is nearly the same for figures that the monkey tries to ignore as for figures at the focus of its attention. 2. Attention and extrinsic border suppression Next we examined configurations in which two figures overlapped (Fig. 2e We found that varying attention had a strong effect: Of 216 cells tested, 103 (48%) showed an influence of the side of attention, and 66 of these (31% of the total) showed both influences combined, in form of significant main effects or interaction (Fig. 3c A comparison of Figure 3b 3. Time course of border ownership and attention effects Border ownership and attention effects both emerged with only a short delay after the beginning of stimulus evoked activity in V2 (Fig. 4
4. Common circuits for figure-ground and attention The test with overlapping figures revealed an asymmetry of the attention effect which can be seen in typical example neurons in Figure 5
The population results show that there was a correlation (Fig. 6 10−5, N = 66, t-test). There was a significant shift to positive values also in the entire population (mean = 0.047, p = 1.6 10−4, N = 215). Thus, the test with overlapping figures reveals that attentional modulation is spatially asymmetric about the RF, and this asymmetry is correlated with border ownership preference. [In some neurons, responses to the occluding edge were enhanced by attention on the foreground compared to attention on the background irrespective of border ownership. However, front-back attention modulation was overall weaker than side-of-attention modulation (Supplementary Fig. S4).]
After observing spatial asymmetry of attentional modulation in this experiment we re-examined the data of experiment 1 (separated figures) and found a similar asymmetry: the attentional enhancement was stronger for figures on the preferred border ownership side than figures on the nonpreferred side (modulation index 9.8% vs. 3.0%, N = 100, t = 4.11, P = 0.0001, paired t-test). Thus, even in the case of separated figures, when spatial mechanisms would be adequate for attentional selection, the neurons are more susceptible to attentional modulation on the side of border ownership preference than on the other side. These observations have implications for the mechanisms underlying selective attention that will be discussed below. 5. Controls and comments The results just described were obtained in two animals performing somewhat different tasks (see Methods): TE signaled the shape of the target figure by making different saccades, whereas LA responded manually. We used a different task in the second animal (LA) to make sure that the attention effects we had observed in TE were not a result of training the animal to make specific eye movements. The results from the two monkeys were virtually the same in every respect: strength of attention effect and similarity of border ownership modulation with and without attention in the separated figure condition; spatial asymmetry of attention modulation (Fig. 4 Regarding the experiment with overlapping figures, it might be argued that the monkey focused attention on the location of the hidden edge when the background figure was the target, in which case its focus of attention would not have been exactly on the RF because this was centered on the visible, occluding edge. Two observations show that this cannot be the explanation for the asymmetry of the attention effect. First, the distribution of post-fixation saccades indicates that attention was directed to the center of the target figure and not to the hidden edge (Supplementary Fig. S3). Second, attention modulation showed the same spatial asymmetry irrespective of the direction of occlusion (Fig. 5 To address the question of whether variations in fixation behavior could have contributed to the correlations shown in Figure 6 We also ruled out the distribution of the RF orientations and positions of the neurons in our sample as a source of correlation. Attention modulation did not depend on the orientation of the stimulus axis (which varied with RF orientation), or the orientation of the stimulus axis relative to the vector connecting RF and fovea (Supplementary Fig. S1). This means that attention modulation was not related to the location of the figures relative to the center of fixation. We further considered the possibility that aspects of the task other than attention could have influenced the results. Because we used squares and trapezoids, the orientation of the edge in the RF varied slightly (typically ±7deg). However, the different shapes of figures contributed equally to the responses in each experimental condition, and there was no interaction between shape and site of attention (14 of 253 cells showed interaction at p < 0.05, not different from proportion expected by chance, p = 0.71). 6. Understanding the mechanisms The asymmetry of receptive fields regarding attentional modulation was an unexpected finding. A plausible explanation for this asymmetry is that the same circuits that produce border ownership modulation also provide a structure for attentional selection. We have previously proposed a model for border ownership assignment based on simple circuits that integrate image context.15,16 If we assume that top-down attention works by activating the same circuits, then all the above findings fall into place. The principle of this idea is illustrated in Figure 7a–b
The grouping cell network is the key to understanding the interplay between attention and figure-ground organization. We assume that selective attention excites G-cells at the focus of attention, or inhibits G-cells surrounding it (Fig. 7c–d Also the details of the results shown in Figure 3 The model accounts for three aspects of the results described above. It explains how the system uses image context to generate border ownership signals; it explains the spatial asymmetry of the attention influence; and it explains why the side of attention enhancement is generally the same as the preferred side of border ownership. The existence of G cells is hypothetical as yet. Our results suggest that border ownership preference is a fixed property of the neurons, implying that G-cells are pre-established (by genetic or experiential factors). Our model postulates that G cell templates come in a range of sizes and cover the visual field densely, but with relatively coarse spatial resolution. Their resolution should be comparable to that of attention,17 which is much lower than the visual acuity and the resolution of the receptive fields of V1. This means that the model can function with a relatively small number of G-cells, about 1% of the cells representing image information.16 G-cells might reside within or outside V2. As pointed out,5,16 the short latency of the border ownership signal (Fig. 4 Discussion Our results demonstrate that selective attention and figure-ground organization involve overlapping populations of neurons in V2. A fraction of the cells showed border ownership selectivity without any attention modulation, others exhibited an influence of attention without border ownership selectivity, and a large fraction (about 40% of the cells) showed both influences combined. Such a result would be expected if border ownership assignment and attentional modulation were produced by two independent mechanisms that interact at this stage. We show that border ownership assignment occurs simultaneously for multiple figures in the display, including figures outside the focus of attention (Fig. 3a The convergence and largely additive effects of figure-ground and attention effects in single neurons of the V2 edge representation also explain the observation that perceived figure ground organization can be influenced by attention1 (inspection of Fig. 1 The most telling result of our experiments is the asymmetry of V2 receptive fields with respect to attentional modulation and its correlation with the border ownership preference of the cells. This correlation indicates that top-down attention processes share neural circuitry with the mechanism underlying context integration in figure-ground organization. Asymmetry with respect to attentional modulation has been demonstrated in receptive fields of V4 neurons,26 a finding that might be related to the mechanism we describe here. The attention effects in our experiments might be interpreted as examples of ‘biased competition’ in which visual objects compete for neural representation and top-down attention can bias the competition in favor of one or the other object.27,28 The attentional modulation of the neural responses to the border between figures (Fig. 4 Methods We studied two adult rhesus monkeys (Macaca mulatta), one male and one female. The details of our general methods have been described.5,6 The animals were prepared by implanting, under general anesthesia, first three small posts for head fixation, and later two recording chambers (one over each hemisphere). Behavioral training was achieved by controlling fluid intake and using small amounts of juice or water to reward correct responses. All animal procedures conformed to US National Institutes of Health and USDA guidelines as verified by the Animal Care and Use Committee of the Johns Hopkins University. Recording Single-neuron activity was recorded extracellularly with epoxy-insulated tungsten microelectrodes inserted through the dura mater within small (3–5 mm) trephinations. Area V2 was identified by its retinotopic organization and by histological reconstruction of the recording sites, as described.5 Action potentials were discriminated using a spike sorting device (Alpha Omega). Only isolated single unit activity was analyzed. Receptive fields were in the lower hemifield at eccentricities ranging between 0.75 and 12 deg (median 2.2 deg). Eye movements were recorded for one eye using an infra-red video based system (Iscan ETL-200) with a resolution of 5120 (H) and 2560 (V). The eye was imaged through a hot mirror (a mirror that selectively reflects infrared), with the camera placed on the axis of fixation. The optical magnification in our system resulted in a resolution of the pupil position signal of 0.03 deg visual angle in the horizontal and 0.06 deg in the vertical. However, noise and drifts of the signal reduced its accuracy. Behavioral tasks Animals performed two tasks, a shape discrimination with initial fixation, and a simple fixation task. Shape discrimination was taught first. Upon appearance of a fixation spot, the animal could initiate a trial by fixating the spot, which was detected by monitoring the eye movements. After fixation was maintained for 0.3 s, a figure was displayed that could be a square or a trapezoid, and the animal was rewarded if it signaled the shape correctly. Monkey TE responded by making a saccade to the figure if it was a trapezoid, and looking off the screen if it was a square. A trial was rewarded only if the first saccade after fixation landed in the correct target zone.29 Monkey LA responded by pulling or pushing a lever. A correct response terminated the trial. After an incorrect response, the trial was terminated and a 3 s delay ensued. Upon termination of a trial, the screen was blanked for 1–1.5 s (plus the additional delay after an error) until the fixation spot came on again and a new trial was enabled. Once the animals performed the shape discrimination reliably, two additional figures were added and the animals were trained to perform the task with one of the figures, the target, as specified by instruction trials at the beginning of each block of trials. In these trials the target figure was shown as solid and the other figures as outlines. Which of the figures was the target varied between blocks. The shape of each figure varied randomly from trial to trial. Once the animals mastered the task with three spatially separated figures, a variant of the display was introduced in which two of the figures partially overlapped. The blocking of trials and the sequence of events in each trial were the same, except that a certain time after stimulus onset the top (occluding) figure was moved so as to expose the bottom figure completely. This occurred after 0.5 s for monkey TE and after 0.2 s for monkey LA. Thus, in trials in which the bottom figure was target, correct performance required that the animal waited until that figure was exposed before responding. Both monkeys performed the tasks well above chance level (TE, 80%, LA, 91% correct). To check if the responses of monkey LA were based on processing the stimulus during the fixation period (in TE this was obviously the case, because he responded with a saccade at the end of the fixation period), we modified the display sequence in some of the training sessions so that the display was blanked when a saccade was detected. In these sessions, in which post-saccadic information could not be used, LA’s performance was also well above chance (72%). The animals learned also to perform a fixation task in which trials were rewarded only if the eye position signal stayed within a window of 0.75 deg radius for 2s. (Note that this scheme actually produces more accurate fixation than suggested by the size of the reward window because noise and drifts of the eye movement signal effectively produce a negative gradient of reward probability away from the fixation point.) This task was used for mapping of receptive fields, for the general characterization of selectivity, and for the standard test of border ownership. The shape of the fixation spot told the animal which task to perform. Design and presentation of stimuli Stimuli were generated on a Pentium 4 Linux workstation with NVIDIA GeForce 6800 graphics card using the anti-aliasing feature of the Open Inventor software, and were presented on a 21-inch EIZO FlexScan T965 color monitor with 1600×1200 resolution, a 100 Hz refresh rate, and a maximum luminance 93 cd/m2. Background luminance was 28 cd/m2 except for conditions in border ownership tests in which figure and background color were flipped. The display was viewed binocularly at a distance of 100 cm and subtended 22.7 by 17.1 deg of visual angle. Stationary bars were used to determine the color preference, and bars and drifting gratings to map the minimum response field of each cell. Orientation tuning curves were recorded using moving bars. Three shapes of figures were used in the main experiments, a square, and two trapezoids that were derived from the square by tilting one side (A) either clockwise or counterclockwise, typically an angle of 7 deg. The figures typically measured 3 deg on a side, but smaller figures were often used in foveal cells. All figures had rounded corners (radius 9% of figure size) to avoid the use of angles as a cue in the task. For the overlapping figures, the amount of overlap was about 13%, and the figures were displaced parallel to the occluding edge by about 9% of the figure size. In each trial, three figures were simultaneously presented with the shape of each figure chosen randomly to be a square with probability 0.5, or either kind of trapezoid with probabilities 0.25. The figures were presented with orientation of side A for the square shape equal to the preferred orientation of the cell under study. The centers of the sides A were arranged on a circle around the fixation point. The spacing depended on the size of the figures and was typically 60 deg polar angle. In experiments with separated figures, border ownership was varied by flipping each figure about side A (see Fig. 2d Procedure Upon isolating a cell we first characterized its selectivity for color, bar size, and orientation, and mapped its RF.5 A standard test of border ownership with a single square, using square sizes of 3 and 8 deg,6 was also performed in most cells. The fixation paradigm was used for these basic tests. Subsequently, one of the selective attention experiments (or both, if time permitted) was performed using the shape discrimination paradigm. Each of the two attention and two border ownership conditions was typically presented 40 times, one per trial. Our sample is not biased with respect to the effect of attention. However, because neurons were usually selected for the main tests after the standard border ownership test was performed, the proportion of border-ownership selective cells in our sample (74%) was higher than average. Among the total of 666 cells in which the standard test was performed, 303 (45%) were found to be border ownership selective. This is virtually the same as the proportion of 184/423 (43%) found with the same test in experiments in which the animal was never trained to pay attention to the stimuli, but, on the contrary, its attention was engaged at the fovea by a demanding fixation task (stereoscopic adjustment within a small fixation target).5–7 This is important, because it shows that the overall frequency of border ownership selective cells was not altered by training the attention task in the present study. Data analysis The spike activity during periods of 200 ms after stimulus onset was analyzed. We chose this interval because eye movement recordings indicated that no systematic shifts of gaze occurred before 160 ms (Supplementary Fig. S5). Because V2 neurons respond with a delay of about 40ms,5 we can assume that eye movements that occurred after 160 ms did not influence the activity during the analysis period. Neurons that responded with less than 4 spikes/s mean firing rate in each of the four border ownership/attention conditions were excluded because we felt that our stimuli were not appropriate for these cells (10%). Analysis of variance (ANOVA) was performed on the square-root transformed spike counts. This transformation serves to homogenize the variances and produces approximately normal distributions. The ANOVA included five factors: site of attention, border ownership, local contrast, shape, and direction of tilt (nested within shape), and was performed on each neuron. The main effects of attention and border ownership and their interaction are discussed in this paper. In Figure 6 1 Click here to view.(263K, pdf) Acknowledgements We wish to thank T.J. Macuda for help with the behavioral training of TE, S. Mihalas, E. Niebur, P.J. O’Herron, and N.R. Zhang for suggestions and critical comments on the manuscript, and O. Garalde for technical assistance. This research was supported by NIH grants EY02966 and EY16281. References 1. Rubin E. Visuell wahrgenommene Figuren. Copenhagen: Gyldendals; 1921. 2. Koffka K. Principles of Gestalt Psychology. New York: Harcourt, Brace and World; 1935. 3. Nakayama K, Shimojo S, Silverman GH. Stereoscopic depth: its relation to image segmentation, grouping, and the recognition of occluded objects. Perception. 1989;18:55–68. [PubMed] 4. Driver J, Baylis GC. Edge-assignment and figure-ground segmentation in short-term visual matching. Cogn. Psychol. 1996;31:248–306. [PubMed] 5. Zhou H, Friedman HS, von der Heydt R. Coding of border ownership in monkey visual cortex. J. Neurosci. 2000;20:6594–6611. [PubMed] 6. Qiu FT, von der Heydt R. Figure and ground in the visual cortex: V2 combines stereoscopic cues with Gestalt rules. Neuron. 2005;47:155–166. [PubMed] 7. Qiu FT, von der Heydt R. Neural representation of transparent overlay. Nat. Neurosci. 2007;10:283–284. [PubMed] 8. Motter BC. Focal attention produces spatially selective processing in visual cortical areas V1, V2, and V4 in the presence of competing stimuli. J. Neurophysiol. 1993;70:909–919. [PubMed] 9. Reynolds JH, Chelazzi L, Desimone R. Competitive mechanisms subserve attention in macaque areas V2 and V4. J. Neurosci. 1999;19:1736–1753. [PubMed] 10. Luck SJ, Chelazzi L, Hillyard SA, Desimone R. Neural mechanisms of spatial selective attention in areas V1, V2, and V4 of macaque visual cortex. J. Neurophysiol. 1997;77:24–42. [PubMed] 11. Bender DB, Youakim M. Effect of attentive fixation in macaque thalamus and cortex. J. Neurophysiol. 2001;85:219–234. [PubMed] 12. Marcus DS, Van Essen DC. Scene segmentation and attention in primate cortical areas V1 and V2. J. Neurophysiol. 2002;88:2648–2658. [PubMed] 13. Friedman HS, Zhou H, von der Heydt R. The coding of uniform color figures in monkey visual cortex. J. Physiol. (Lond). 2003;548:593–613. [PubMed] 14. Motter BC. Neural correlates of feature selective memory and pop-out in extrastriate area V4. J. Neurosci. 1994;14:2190–2199. [PubMed] 15. Schuetze H, Niebur E, von der Heydt R. Modeling cortical mechanisms of border ownership coding. J. Vision. 2003;3/9:114. 16. Craft E, Schuetze H, Niebur E, von der Heydt R. A neural model of figure-ground organization. J. Neurophysiol. 2007;97:4310–4326. [PubMed] 17. Intriligator J, Cavanagh P. The spatial resolution of visual attention. Cognit. Psychol. 2001;43:171–216. [PubMed] 18. Girard P, Hupe JM, Bullier J. Feedforward and feedback connections between areas V1 and V2 of the monkey have similar rapid conduction velocities. J. Neurophysiol. 2001;85:1328–1331. [PubMed] 19. Goldberg ME, Bisley JW, Powell KD, Gottlieb J. Chapter 10 Saccades, salience and attention: the role of the lateral intraparietal area in visual behavior. Prog. Brain Res. 2006;155PB:157–175. [PubMed] 20. Lammea VAF. The neurophysiology of figure-ground segregation in primary visual cortex. J. Neurosci. 1995;15:1605–1615. [PubMed] 21. Zipser K, Lamme VAF, Schiller PH. Contextual modulation in primary visual cortex. J. Neurosci. 1996;16:7376–7389. [PubMed] 22. Lee TS, Mumford D, Romero R, Lamme VAF. The role of the primary visual cortex in higher level vision. Vision Res. 1998;38:2429–2454. [PubMed] 23. Mack A, Tang B, Tuma R, Kahn S, Rock I. Perceptual organization and attention. Cognit. Psychol. 1992;24:475–501. [PubMed] 24. Moore CM, Egeth H. Perception without attention: evidence of grouping under conditions of inattention. J. Exp. Psychol. Hum. Percept. Perform. 1997;23:339–352. [PubMed] 25. Driver J, Davis G, Russell C, Turatto M, Freeman E. Segmentation, attention and phenomenal visual objects. Cognition. 2001;80:61–95. [PubMed] 26. Connor CE, Preddie DC, Gallant JL, Vanessen DC. Spatial attention effects in macaque area V4. J. Neurosci. 1997;17:3201–3214. [PubMed] 27. Grossberg S. How Does A Brain Build A Cognitive Code. Psychol. Rev. 1980;87:1–51. [PubMed] 28. Desimone R, Duncan J. Neural mechanisms of selective visual attention. Annu. Rev. Neurosci. 1995;18:193–222. [PubMed] 29. Schiller PH. The effects of V4 and middle temporal (MT) area lesions on visual performance in the rhesus monkey. Vis. Neurosci. 1993;10:717–746. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Perception. 1989; 18(1):55-68.
[Perception. 1989]Cogn Psychol. 1996 Dec; 31(3):248-306.
[Cogn Psychol. 1996]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]Nat Neurosci. 2007 Mar; 10(3):283-4.
[Nat Neurosci. 2007]J Neurophysiol. 1993 Sep; 70(3):909-19.
[J Neurophysiol. 1993]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]J Physiol. 2003 Apr 15; 548(Pt 2):593-613.
[J Physiol. 2003]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]J Neurosci. 1999 Mar 1; 19(5):1736-53.
[J Neurosci. 1999]J Neurosci. 1994 Apr; 14(4):2190-9.
[J Neurosci. 1994]J Neurophysiol. 2007 Jun; 97(6):4310-26.
[J Neurophysiol. 2007]Cogn Psychol. 2001 Nov; 43(3):171-216.
[Cogn Psychol. 2001]J Neurophysiol. 2007 Jun; 97(6):4310-26.
[J Neurophysiol. 2007]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]J Neurophysiol. 2001 Mar; 85(3):1328-31.
[J Neurophysiol. 2001]Prog Brain Res. 2006; 155():157-75.
[Prog Brain Res. 2006]J Neurosci. 1995 Feb; 15(2):1605-15.
[J Neurosci. 1995]Vision Res. 1998 Aug; 38(15-16):2429-54.
[Vision Res. 1998]J Neurophysiol. 2002 Nov; 88(5):2648-58.
[J Neurophysiol. 2002]Cogn Psychol. 1992 Oct; 24(4):475-501.
[Cogn Psychol. 1992]Cognition. 2001 Jun; 80(1-2):61-95.
[Cognition. 2001]J Neurosci. 1997 May 1; 17(9):3201-14.
[J Neurosci. 1997]Psychol Rev. 1980 Jan; 87(1):1-51.
[Psychol Rev. 1980]Annu Rev Neurosci. 1995; 18():193-222.
[Annu Rev Neurosci. 1995]J Neurosci. 1999 Mar 1; 19(5):1736-53.
[J Neurosci. 1999]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]Neuron. 2005 Jul 7; 47(1):155-66.
[Neuron. 2005]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]Vis Neurosci. 1993 Jul-Aug; 10(4):717-46.
[Vis Neurosci. 1993]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]Neuron. 2005 Jul 7; 47(1):155-66.
[Neuron. 2005]Nat Neurosci. 2007 Mar; 10(3):283-4.
[Nat Neurosci. 2007]J Neurosci. 2000 Sep 1; 20(17):6594-611.
[J Neurosci. 2000]