| Volume 5, Number 3, Article 9, Pages 257-274 |
doi:10.1167/5.3.9 |
http://journalofvision.org/5/3/9/ |
ISSN 1534-7362 |
Visual search for transparency and opacity: Attentional guidance by cue combination?
Jeremy M. Wolfe |
Visual Attention Laboratory, Brigham & Women's Hospital, & Department of Ophthalmology, Harvard Medical School, Boston, MA, USA |
|
Randall S. Birnkrant |
Visual Attention Laboratory, Brigham & Women's Hospital, Boston, MA, USA |
|
Melina A. Kunar |
Visual Attention Laboratory, Brigham & Women's Hospital, & Department of Ophthalmology, Harvard Medical School, Boston, MA, USA |
|
Todd S. Horowitz |
Visual Attention Laboratory, Brigham & Women's Hospital, & Department of Ophthalmology, Harvard Medical School, Boston, MA, USA |
|
Abstract
A series of seven experiments explored search for opaque targets among transparent distractors or vice versa. Static stimuli produced very inefficient search. With moving items, search for an opaque target among transparent distractors was quite efficient while search for transparent targets was less efficient ( Experiment 1). Transparent and opaque items differed from each other on the basis of motion cues, luminance cues, and figural cues (e.g., junction type). Motion cues were not sufficient to support efficient search ( Experiments 2- 5). Violations of the luminance rules of transparency disrupt search ( Experiments 3 and 4). Experiment 5 shows that search becomes inefficient if X-junctions are removed. Experiments 6 and 7 show that efficient search survives if X-junctions are occluded. It appears that guidance of attention to an opaque target is guidance based on "cue combination" (M. S. Landy, L. T. Maloney, E. B. Johnston, & M. Young, 1995). Several cues must be present to produce a difference between opaque and transparent surfaces that is adequate to guide attention.
 |
|
History
Received August 4, 2004; published March 30, 2005
Citation
Wolfe, J. M., Birnkrant, R. S., Kunar, M. A., & Horowitz, T. S. (2005). Visual search for transparency and opacity: Attentional guidance by cue combination?
Journal of Vision, 5(3):9, 257-274,
http://journalofvision.org/5/3/9/,
doi:10.1167/5.3.9.
Keywords
transparency, opacity, visual search, cue combination, visual attention, surface perception
for related articles by these authors
for papers that cite this paper |
How can we select a desired object out of the many
items in our visual field? The human visual system is capable of extracting a
great deal of information about the distal properties of visible objects based
on the distribution of light falling on the retina. However, the last 25
years’ worth of research in attention suggests that, when we search, the
allocation of attention is guided by only a limited set of attributes. These
include basic features such as color, orientation, size, and motion (for a
recent review, see Wolfe & Horowitz, 2004). While it was once assumed that only
attributes that were analyzed early in visual processing would be available to
guide attention, the contemporary consensus is these attributes include
properties better described as “mid-level vision.” For example,
attention can be guided by pictorial depth cues that are unlikely to be the
product of the earliest stages of visual cortical processing (Enns &
Rensink, 1990, 1993; Enns, Rensink, & Douglas, 1990; Epstein, Babler, &
Bownds, 1992; Sun & Perona, 1996; Von Grünau & Dubé, 1994). Moreover, when attributes such as
size, for example, guide search, it is not the retinal size of an object that is
used, but the “post-constancy” size – size modulated by
available depth cues (Aks & Enns, 1993).
Other attributes known to be extracted in early vision
are not available to guide attention. Although line intersections, for example,
are available early on in the visual system (e.g., He & Nakayama, 1992), they do not produce efficient search
(Wolfe & DiMase, 2003). Rensink and
Enns ( 1995) demonstrated that not
only is low-level information not guiding the deployment of attention, early
information about properties like the length of line segments is actually
preempted by higher order representations. Processes such as those generating
the Müller-Lyer illusion can hide information that would otherwise be
available to guide visual search. This class of findings led Nakayama and He ( 1995) to propose that attention is directed
to surfaces, rather than objects, and to suggest that attention would be guided
only by surface properties.
Real world surfaces can be categorized as transparent
or opaque. If surface features are the guiding attributes, can attention be
guided toward transparent and/or opaque objects? Note that transparency and
opacity are unusual features because they are defined only in the presence of
another surface. An item on a computer screen can be red or big or vertical
without reference to another visible item (e.g., “big” could be
relative to a remembered standard). That same item cannot be transparent or
opaque in splendid isolation.
The goal of this work is to determine whether
transparency and/or opacity serve as guiding attributes in visual search.
Mitsudo ( 2003) performed a series of
experiments that show transparency information can be used to create objects
that are sought efficiently (his Experiments 1
and 2) and to disable image attributes that
support efficient search (his Experiment 3).
These results show that transparency information, like intersection information,
is available early in visual processing. Mitsudo shows that we can search for
rectangles that would not exist if early vision were insensitive to the rules of
transparency. We wish to know if observers can guide attention by transparency
itself.
This distinction can be made clearer by considering
guidance by color and orientation. It is possible to guide attention to a
vertical bar that would not exist if the early visual system were insensitive to
color information. Imagine red vertical and horizontal bars on an equiluminant
green background (Cavanagh, Arguin, & Treisman, 1990). At the same time, it is possible to
guide attention on the basis of the redness itself. Imagine a search for the red
bar among green bars on a gray background. It is the latter sort of evidence we
seek in this study. Can attention be guided to transparent among opaque objects
or vice versa?
To answer this question, we performed a series of
visual search experiments. Guidance can
be inferred from the pattern of results from experiments where observers search
for a target item among varying numbers of distractor items. Reaction time (RT)
is plotted against set size (the total number of display items, target +
distractors). This RT × set size function indexes the cost of
adding another distractor to the display, and is taken as a measure of search
efficiency. Shallower slopes represent more efficient searches. If the RT ×
set size function is flat (slope of 0), the target can be detected independently
of the number of distractor items, and is often said to “pop out” of
the display. A flat (or nearly flat) slope is a classic diagnostic for a guiding
attribute in search (Treisman & Gelade, 1980). Steep slopes, on the other hand,
suggest that attention is necessary to discriminate targets from distractors.
Finding a rotated T among rotated L distractors is one example of such an
inefficient search, where the addition of each new distractor item adds around
20–30 ms to the search time (Kwak, Dagenbach, & Egeth, 1991).
A stimulus feature is not a guiding attribute
independent of context. For example, a red item will be found efficiently among
green distractors but not among reddish orange distractors. The difference
between target and distractor is critical (Duncan & Humphreys, 1989). A more curious aspect of the
relationship of targets and distractors is that this relationship can be
asymmetric. In some cases, the roles of target and distractor can be reversed
without much consequence. It is easy to find red among green or green among red.
In other cases, this is not true. For example, it is easier to find a moving
target among stationary items than a stationary target among moving items (Dick,
1989; Royden, Wolfe, & Klempen, 2001). Treisman and Gormican ( 1988) first suggested that search
asymmetries could be a useful tool by noting that the presence of a feature can
guide attention better than its absence. Thus, we could argue that motion is a
feature and stationarity only its absence (but see Rosenholtz, 2001). Applying this search asymmetry
logic, one would test search for A among B, and B among A. If one search
produces shallow slopes and the other steeper slopes, then one could argue that
the target producing shallow slopes bears the feature, and the other target is
defined by the absence of the feature. Note that there are asymmetries where
both searches produce steep slopes but
where one is markedly steeper than the other (e.g., search for upright among
inverted faces, Tong & Nakayama, 1999).
In such cases, neither property can be said to guide search. The difference in
slopes is most likely a reflection of different rates of serial processing of
items. Additionally, in some more complex cases, like detection of shadows, it
is possible for the absence of a specific feature to alter the structure of a
scene. In that case, it might be absence that is detected more readily than
presence (Rensink & Cavanagh, in press).
In this work, we demonstrate that attention can be
guided to the presence of an opaque object with considerable efficiency.
Transparency, the absence of opacity, is harder to find. In initial work, we
found that searches for static opaque surfaces among transparent surfaces, and
vice versa, were extremely inefficient. In Experiment 1, search for a moving opaque item among
moving transparent items was quite efficient while the reverse was less
efficient. Experiment 2 shows that the motion
cues, while apparently necessary, are not sufficient to support efficient
search. Experiments 3 and 4 show that violations of the physics of
transparency disrupt search, while showing again that the motion differences
between stimuli are not sufficient for efficient search. Experiment 5 shows that search becomes inefficient
if X-junctions are removed. Experiments 6 and 7 show that efficient search survives if
X-junctions are occluded. It appears that guidance of attention to an opaque
target is guidance on the basis of "cue combination" (Landy et al., 1995). Several cues must be present to produce
a difference between opaque and transparent surfaces that is adequate to guide
attention. Initial studies: Static stimuli
As can be seen in many works of art and in many
articles in the scientific literature, transparency and opacity can be
effectively portrayed with static stimuli. We examined the ability to use this
information to guide attention in a series of initial investigations with a
variety of stimuli. Figure 1 shows an
example using circular stimuli on a contoured background. In this case, an
appearance of transparency was created by placing a virtual blue disk on an
achromatic background. The disk behaved like a filter that would pass a
percentage of blue light while reflecting blue from its surface. Opaque
distractors were created by shifting the disk so that the contours within the
disk did not align with the contours of the background. The resulting
T-junctions indicated occlusion, and therefore opacity, just as the X-junctions
indicated transparency. Thus, the disk at the center appears more transparent
than the other four disks in Figure
1.
Figure 1. Static transparent and opaque stimuli
used in pilot investigations.
We report these experiments only briefly because pilot
work with a variety of static stimuli consistently produced extremely
inefficient search. Using stimuli like those in Figure 1, search for an opaque disk among
transparent distractors produced average RT x set size slopes of
142 ms/item on target present trials and 356 ms/item for target absent.
Search for a transparent disk among opaque distractors was even worse, with
target present slopes of 210 ms/item and target absent slopes of
339 ms/item. In this example, set sizes were 1, 2, and 3 items and there
were seven observers. Control experiments demonstrated that this was not a
failure to detect transparency away from fixation. Observers could reliably (83%
correct) discriminate a single transparent from opaque stimulus in the periphery
in a 150-ms flash–too brief to perform a voluntary saccadic eye movement.
Nevertheless, these very steep search slopes indicate that observers may have
felt a need to fixate each item in turn.
In spite of the range of beautiful transparency
illustrations in the literature, we never found a static display that produced
search slopes that fell even in the range of typical inefficient (T vs. L)
search. That in itself is of some interest as a negative finding. The
transparency and opacity of these items were introspectively clear, yet search
proved to be extremely inefficient. Accordingly, in the experiments described
more fully below, we used stimuli in which simulated opaque and transparent
surfaces moved over a textured background.
It is interesting that Mitsudo's ( 2003) experiments worked with static
stimuli. In his tasks, the rules of transparency were creating distinctive
objects. In our research, we wished to distinguish between objects that were
identical except for their transparency or opacity. In distinguishing between
transparent and opaque objects like those in Figure 1,
the visual system must decide how to assign contours that lie within the
object's boundaries. This is the problem of "scission" (e.g., Singh &
Anderson, 2002). Apparent transparency is
made more compelling if scission is made more compelling, and scission of
surfaces is made easier if they are moving independently (D’Zmura, Rinner,
& Gegenfurtner, 2000). As we will see,
while relative motion alone is not adequate to guide attention efficiently, when
it is combined with other cues to transparency/opacity, search for an opaque
item proceeds with relative ease.
Experiment 1: Search for moving transparent and opaque targets
In Experiment 1, we had
observers search for transparent targets among opaque distractors or vice versa.
Experiment 1a used small set sizes (1-4), while
Experiment 1b tested the same effect at larger
set
sizes.
Twelve observers from the paid observer panel of the
Visual Attention Laboratory at Brigham and Women’s Hospital in Boston
participated in this experiment. All passed the Ishihara test for color
blindness and had 20/25 corrected vision or better. Observers gave informed
consent and were compensated $10/hour for their
time.
In this and all subsequent experiments, stimuli were
presented on a 21” Mitsubishi monitor running at a refresh rate of 75 Hz
and with a resolution of 1024 x 768 pixels controlled by a Power Macintosh G4
running Mac OS 9.2.2. Stimulus presentation and data collection were controlled
by a Matlab 5.2 (MathWorks) script using the Psychophysics Toolbox (Brainard, 1997; Pelli, 1997). Stimuli were viewed in a dark room at a
viewing distance of 57.4 cm.
The stimuli consisted of simulated opaque and
transparent bars (15° x 1.5°) moving over a grayscale texture of dots
( Figure 2). Backgrounds subtended 23° x
23° and consisted of 500 dots (diameter = 2.0°) positioned randomly
atop a light gray (29.0 cd/m 2) backdrop. Each dot assumed one of four
luminance values (Dot A = 25.1 cd/m 2, Dot B = 14.2 cd/m 2,
Dot C = 6.6 cd/m 2, and Dot D = 1.3 cd/m 2), and dots could
occlude one another. The region surrounding each background was solid black (0.7
cd/m 2).
Figure 2. Movie of typical stimuli used in Experiment 1a: a search for an opaque target among
transparent distractors.
A filtered version of the background was created by
transforming RGB values according to Equation
1: . | (1) |
The multiplicative component acts as a greenish
filter. The additive component simulated a diffuse, achromatic reflectance from
the filter’s surface (and made the stimuli look more compelling). This
resulted in transparent colors with CIE coordinates of
x = 0.272 and
y = 0.385 and a range of luminance
values (backdrop = 16.2 cd/m 2, Dot A = 14.4 cd/m 2,
Dot B = 8.5 cd/m 2, Dot C = 4.1 cd/m 2, and Dot D = 0.1
cd/m 2). Transparent stimuli were created by pasting the filtered
version onto the corresponding region of the unfiltered background. Opaque
stimuli were obtained by randomly cutting a portion of the filtered version and
moving it across the background (with the constraint that the sampled stimuli
never overlapped the original background). As shown in Figure 2, texture within the opaque bar
remained unchanged as the bar moved over the background, whereas the texture
within the transparent bar changed, consistent with a transparent bar moving
over a visible texture. Opaque and transparent
bars were horizontally centered on the background (4.0° from either edge)
and motion was confined to the vertical dimension. (In exploratory work, we
added a modest horizontal component to the motion without changing the results.)
The stimuli were randomly assigned to 15° x 3° cells within a moving 1
x 4 grid (see Figure 2). The overall
vertical motion of all bars was driven by a sinusoidal oscillation with an
amplitude of 4.9° and a period of 2720 ms. Additionally, each opaque and
transparent bar was given an independent vertical motion with an amplitude
0.5° and a period one fifth of the period of the overall motion (544 ms).
The phase of this component was different for each bar. Thus, the added
component served to disrupt the sense that the moving items formed a single
object. The grid of four possible stimulus locations began each trial at the
center of the display. Starting direction of motion was chosen at random. In the
course of their motion, bars’ edges could come as close to each other as
0.5° or go as far away as 2.5°. They could come as close as 0.8°
to the edge of the background.
Search displays included a central fixation point with
a red center (0.8° x 0.8°, 9.1 cd/m2, CIE
coordinates: x = 0.629,
y = 0.342) and yellow surround
(0.2° thick, 43.8 cd/m2, CIE coordinates:
x = 0.400,
y = 0.515). The fixation point occluded
the opaque and transparent stimuli as they passed through the center of the
display.
Observers searched for an opaque bar among transparent
bars and vice versa. Blocks of opaque and transparent search trials consisted of
40 practice trials and 300 test trials. The order of these conditions was
counterbalanced across observers. A new background was generated for each
condition. Before the start of a condition, observers were told what the target
and distractors would be. There were four set sizes: 1, 2, 3, and 4 items. A
target was present on 50% of trials. Set size, presence or absence of the target
item, and position of search stimuli were randomly chosen across trials. A tone
accompanied the appearance of the background at the beginning of each trial,
after which search stimuli appeared 500 ms later. Stimuli remained visible until
the observer pressed one of two keys, a "yes" key if the target was detected or
a "no" key if not. Feedback was provided after each response in the form of text
that remained on screen for 400 ms and a beep if an error was made. Observers
were instructed to respond as quickly and accurately as
possible.
RTs faster than 200 ms and greater than 4000 ms were
discarded. All data from an observer were discarded if error rates in at least
one cell exceeded 25%. Such error thresholds are somewhat arbitrary. We wished
to remove observers who were very incautious outliers in our population. Removal
of these observers "cleans up" the data but does not alter the pattern of
results. As noted below, error rates in these experiments are somewhat higher
than typical in visual search. In general, the higher error rates occurred in
condtions that produced less efficient search–a speed-accuracy
covariance.
Data from one observer were excluded due to the error
criterion. Discarding fast and slow RTs did not result in excluding more than 1%
of any observer’s data. Figure 3 shows
the mean correct RTs as a function of display size for both target present and
target absent trials.
Figure 3. RT x
set size functions for search for opaque among transparent stimuli and vice
versa (opaque target = green symbols, transparent target = red symbols). Solid
lines and symbols are target present data; dashed lines and hollow symbols are
target absent. Slopes are given with the data.
A one-sample t
test on the target-present slopes showed search rates to be significantly
different from 0 ms/item for both opaque targets among transparent distractors,
t(10) = 3.4,
p < .01, and transparent targets
among opaque distractors, t(10) = 6.9,
p < .01. Slopes for the opaque
target were significantly shallower than those for the transparent target,
t(10) = 2.4,
p < .05.
The target-absent data were somewhat unusual.
Typically, target absent RTs are slower than target present, and RT x set size
slopes for target absent trials are steeper than corresponding target present
slopes (Wolfe, 1998). Figure 3 shows that neither of these results
holds in the present case. Target absent slopes are shallower than the
corresponding target present slopes,
t(10) = 2.5,
p < .05 for opaque targets and
t(10) = 4.5,
p < .01 for transparent targets,
and, at least for the larger set sizes, mean RTs are comparable for present and
absent trials. Recently, we have found unusual patterns of results with small
set sizes (Michod, Wolfe, & Horowitz, 2004). Accordingly, we will present a
replication of the basic experiment with larger set sizes (see Experiment 1b) before considering the
implications of this pattern of results.
As shown in Figure
4, error rates are somewhat higher than is typical for visual search
experiments. As a somewhat arbitrary comparison, our recent study of the
featural status of intersections (Wolfe & DiMase, 2003) produced miss error rates between 1.3%
and 6.5%, depending on condition and averaged across set size. False alarm rates
in those experiments averaged less than 1%. False alarms are typically very rare
in search tasks with RT as the dependent measure. The higher error rates here
suggest that the opaque and transparent bars are more confusable than many other
search stimuli. The miss errors increase with set size. This means that the
"true" hit trial RTs, in particular, are probably slower than measured RTs.
Townsend and Ashby ( 1983) suggest
dividing RT by accuracy as a way to estimate the true RT. If we do that for the
data in Experiment 1a, the resulting slopes are
20 ms/item for opaque targets and 33 ms/item for transparent targets. Thus,
while there is evidence for an asymmetry favoring opaque targets, the evidence
that opacity is a guiding attribute is not particularly strong in this
experiment.
Figure 4. Miss
and false alarm (FA) error rates as a function of set size for the opaque and
transparent target conditions of Experiment 1a.
Solid lines and symbols are misses and dashed lines with hollow symbols are
false alarms. Opaque stimuli are shown with green symbols, and transparent
stimuli are shown with red symbols.
Experiment 1b: Larger set sizes
Because some of our recent work (Michod et al., 2004) suggests that search with small set
sizes may produce different patterns of results than search through larger sets
of items, Experiment 1b repeats the basic
experiment for set sizes between 4 and
16.
Twelve observers from the Visual Attention
Laboratory’s paid observer panel participated in this
experiment.
Compared to Experiment
1a, the opaque and transparent bar stimuli were reduced in width (from the
original 15° to 2.9°) allowing for a 4 x 4 grid of possible stimulus
locations. There was 1.0° between adjacent columns of stimuli. Motion of
the stimuli followed the same rules as before but was slightly slowed. A
complete cycle took 3200 ms.
There were four set sizes: 4, 8, 12, and 16 items. In
all other respects, the procedure was similar to that for Experiment
1a.
Data from two observers were excluded due to the error
criterion. Discarding fast and slow RTs did not result in excluding more than 1%
of any observer’s data. Figure 5 shows
the mean correct RTs as a function of display size for both target present and
target absent trials.
Figure 5. RT x set size functions for search for
opaque among transparent stimuli (green symbols) and vice versa (red symbols)
using larger set sizes. Solid lines and symbols are target present data; dashed
lines and hollow symbols are target absent. Slopes are given with the
data.
The target present results for larger set sizes
duplicate the basic pattern seen with the smaller set sizes. Search for an
opaque target is faster and more efficient than search for a transparent target,
t(9) = 3.1,
p < .05. Target absent trials show a
fairly typical pattern for the transparent case. Slopes are marginally steeper
than those for target present, t(9) =
2.2, p = .057. The pattern for target
absent trials in the opaque target condition remains atypical. The slope is
close to zero, t(9) = 0.18,
p =
ns (though, note that the mean RTs are
substantially slower than those shown in Figure
3 for a comparable condition of Experiment
1a).
Error rates are again somewhat higher than typical in
search experiments. Misses averaged 6.8% for opaque targets and 6.9% for
transparent targets, with false alarm rates of 2.6% and 2.4%, respectively.
Correcting hit RTs for accuracy, as described above, changes the slopes to
7.2 ms/item for the opaque targets and 21.8 ms/item for transparent
targets.
Several aspects of the results of Experiment 1 are noteworthy. First, unlike with
static stimuli, all of the search efficiencies are in range of typical
laboratory search tasks that do not require fixation of each potential target in
turn. Moving the stimuli over a textured background made the distinction between
opacity and transparency sufficiently salient to study with these methods.
Second, the results are asymmetric. Search for an opaque target is more
efficient than search for a transparent target. In the typical understanding of
search asymmetries, this would suggest that opacity is the "feature" and
"transparency" is the absence of opacity. The claim of featural status for
opacity is complicated by the relatively high error rates in these experiments.
Correcting for errors preserves the asymmetry, so it is not the case that the
difference between opacity and transparency searches is produced by a
speed-accuracy tradeoff. While the error correction made the slopes in Experiment 1a steep enough to weaken claims for an
efficient opaque feature search, the corrected slopes for opaque targets in Experiment 1b were still quite efficient.
Moreover, the error correction makes a much less dramatic change in the slopes
in Experiment 1a if two observers with rather
high error rates are removed from the analysis.
In the course of several experiments designed to
uncover the guiding cues to opacity and transparency, we have run a number of
replications of the basic experiment described here. These are summarized in Table 1 (RT x set size slopes) and Table 2 (error rates). Here, and elsewhere,
correction for errors changes the specific values for slopes but it does not
change the overall pattern of results. Accordingly, we will report the
uncorrected slopes and give an account of errors.
|
|
Background
dot size
|
Opaque
present
|
Transparent
present
|
Opaque
absent
|
Transparent
absent
|
|
1 to 4
|
Large
|
6.8
|
16.9
|
1.2
|
15.2
|
|
1 to 4
|
Large
|
13.5
|
16.5
|
-2.9
|
7.9
|
|
1 to 4
|
Large
|
6.0
|
14.3
|
-0.3
|
10.0
|
|
1 to 4
|
Large
|
9.4
|
16.0
|
-3.9
|
5.8
|
|
1 to 4
|
Large
|
7.1
|
21.0
|
-1.1
|
7.8
|
|
1 to 4
|
Large
|
5.1
|
20.3
|
3.9
|
15
|
|
1 to 4
|
Small
|
7.4
|
15.7
|
0.3
|
2.9
|
|
4 to 16
|
Large
|
5.3
|
13.7
|
0.2
|
20.5
|
|
4 to 16
|
Small
|
1.6
|
9.4
|
0.1
|
12.4
|
|
1 to 4
|
Scene
|
10.1
|
41.2
|
5.8
|
29.3
|
|
Average:
|
7.2
|
18.5
|
0.3
|
12.7
|
Table 1. Multiple replications of the basic search
for opaque among transparent items and vice versa. Slopes are standard RT x set
size functions and are not error corrected. Note that search for opaque among
transparent is typically quite efficient and is always more efficient than
search for transparent among opaque.
|
|
Background
dot size
|
Opaque
present
|
Transparent
present
|
Opaque
absent
|
Transparent
absent
|
|
1 to 4
|
Large
|
4.2%
|
5.6%
|
3.0%
|
3.8%
|
|
1 to 4
|
Large
|
6.3%
|
6.3%
|
2.8%
|
3.1%
|
|
1 to 4
|
Large
|
5.9%
|
6.3%
|
2.2%
|
3.1%
|
|
1 to 4
|
Large
|
7.7%
|
8.1%
|
4.3%
|
5.9%
|
|
1 to 4
|
Large
|
6.1%
|
8.4%
|
3.6%
|
4.7%
|
|
1 to 4
|
Large
|
4.8%
|
5.5%
|
3.2%
|
3.0%
|
|
1 to 4
|
Small
|
7.8%
|
6.7%
|
4.1%
|
3.5%
|
|
4 to 16
|
Large
|
6.8%
|
6.9%
|
2.6%
|
2.4%
|
|
4 to 16
|
Small
|
3.1%
|
7.1%
|
2.5%
|
1.5%
|
|
1 to 4
|
Scene
|
4.7%
|
3.7%
|
1.3%
|
3.7%
|
|
Average:
|
5.7%
|
6.5%
|
3.0%
|
3.5%
|
Table 2. Corresponding error rates for the multiple
replications of the basic search for opaque among transparent items and vice
versa. In general, these error rates are somewhat higher than those measured in
typical feature search tasks. However, error-corrected slopes remain shallow for
opaque targets. Error rates are higher for transparent targets. Thus, correcting
for errors enhances the asymmetry between opaque and transparent.
In all 10 versions of the experiment, search for opaque
targets among transparent distractors was more efficient than the reverse.
Moreover, target absent slopes in the opaque target condition were always highly
efficient. On these trials, when all items are transparent, it appears to be
very easy to determine that nothing is present. Taken as
a group, the results suggest
that observers can guide their attention to opaque items in the display with
reasonable efficiency. Even corrected for errors, the average target present
slope for finding opaque targets is 12.6 ms/item ( Experiment 1a is actually the worst case). The
average corrected slope is 30.6 ms/item for search for a transparent target. Something is guiding attention toward the
opaque targets and is very efficiently allowing observers to determine when no
opaque target is present.
The results of the basic experiment do not establish
exactly what aspect of the stimulus is critical. As seen in Figure 2, moving transparent and opaque stimuli
differ in a number of ways that involve other guiding attributes. Consider a
single opaque or transparent stimulus as it moves across the background.
Contours internal to the stimulus move if the stimulus is opaque. They are
“stuck” to the opaque surface. Is search for an opaque stimulus
merely an example of search for this motion cue? This hypothesis is tested in Experiment 2. Transparent filters change stimuli in
ways limited by the physics of the situation. For example, regions do not
increase in luminance as they move behind filters. Experiments 3 and 4
assess sensitivity to violations of these physical rules. Moving opaque bars
create accretion and deletion cues at their borders as T-junctions are formed
with abutting background contours. Transparent stimuli create specific types of
X-junctions as background contours traverse their borders. The role of these
spatial constraints is evaluated in Experiments
5- 7.
To anticipate our conclusions, the data suggest that
opaque surfaces are more object-like
than transparent surfaces. Search for the presence of an opaque item is search
for the presence of an object. Search for the presence of a transparent item is
search for the relative absence of an object, and is thus more difficult.
Violating any of the cues to transparency makes the transparent item more
object-like, and thus harder to distinguish from an opaque
item.
Experiment 2: Is efficient search due to motion cues within the bar?
In principle, the search for the opaque item among
transparent (or vice versa) in Experiment 1
could have been a motion search. A dot on the opaque item moves relative to the
rest of the background while a dot under the transparent filter does not. In the
frame of reference of individual items, the dots internal to the transparent
items move as the item moves, while the dots internal to the opaque items are
fixed. If observers performed the task in Experiment
1 solely on the basis of the motion of dots within the search items, then
the same pattern of results should be obtained if we move the same bars (with
the same contents) over a blank, dark background. The search items would not be
transparent and opaque, though the transparent stimuli could be seen as windows
in a black surface, looking through to a textured background. Importantly, the
contours inside the once-transparent bars would still move while the contours in
the formerly opaque bars would remain fixed to the moving
bar.
Fourteen observers from the Visual Attention
Laboratory’s paid observer panel participated in this
experiment.
The stimuli were identical to Experiment 1a, with the addition of two new
conditions in which the background was not shown ( Figure 6). In these conditions, bars appeared
against a uniform black (0.7 cd/m 2) background. The opaque bar
contained an unchanging texture that moved with the bar and the transparent bar
had a scrolling motion of dots that appeared and disappeared. Transparent bars
were consistent with a moving window in a black surface, opening onto a textured
background. Concealing the background eliminates the cue of junction type
because the black background forms only T-junctions with the textures in both
transparent and opaque
bars.
Figure 6. Movie
of typical stimuli used in the no background conditions of Experiment 2.
Observers were tested in four blocked conditions:
search for an opaque bar among transparent bars and vice versa with visible
backgrounds, search for a transparent bar among opaque bars and vice versa with
hidden backgrounds. The first two conditions were a replication of Experiment 1a and are reported in Tables 1 and 2. Blocks consisted of 40 practice trials and
300 test trials. The order of conditions was counterbalanced across observers.
There were four set sizes: 1, 2, 3, and 4
items.
Discarded fast and slow RTs constituted no more than 1%
of any individual's data. Data for one observer were excluded for violating the
error criterion. Figure 7 shows the mean
correct RTs as a function of set size for both target present and target absent
trials. Error data are shown in Table
3.
Figure 7. RT x set size functions for search
for opaque among transparent stimuli (green symbols) and vice versa (red
symbols). Solid lines and symbols are target present data; dashed lines and
hollow symbols are target absent. Slopes are given to the right of the data. The
with background cases (Panel A) are a replication of Experiment 1a, showing an advantage for the opaque
targets. From top to bottom, slopes are given in this order: transparent absent,
transparent present, opaque absent, and opaque present. In the no background
conditions (Panel B), only the moving bars are presented. The asymmetry reverses
and search becomes markedly slower and less efficient.
|
Back-
ground
|
Yes
|
No
|
Yes
|
No
|
|
Target
type
|
Opaque
|
Transparent
|
Opaque
|
Transparent
|
Opaque
|
Transparent
|
Opaque
|
Transparent
|
|
Set size
|
MISS % (target present)
|
FA % (target absent)
|
|
1
|
5.1%
|
2.7%
|
5.3%
|
10.7%
|
2.2%
|
3.9%
|
7.5%
|
2.6%
|
|
2
|
6.3%
|
5.3%
|
9.0%
|
7.8%
|
3.5%
|
2.1%
|
7.6%
|
4.6%
|
|
3
|
4.2%
|
7.8%
|
14.9%
|
6.4%
|
2.2%
|
2.0%
|
7.6%
|
4.6%
|
|
4
|
8.1%
|
9.5%
|
20.1%
|
12.9%
|
1.3%
|
4.5%
|
9.7%
|
10.5%
|
Table 3. Error rates for Experiment 2. Error rates are high for simple
search tasks but note that they are
higher when the background is removed,
showing that better RT performance with a background is not the result of a
speed-accuracy tradeoff. FA = false alarm.
It is clear that the motion cue within the bars in Experiment 2 is not sufficient to produce
performance like that in Experiment 1. Removing
the background made search notably worse. This was especially true for the
opaque target condition, where slopes went from about 6 ms/item to about 80
ms/item. 1 Efficiency of the search for a
transparent target was less affected, though mean RT was somewhat slowed. The
removal of the background introduces an occlusion cue into the nominally
transparent stimuli. Dots in those bars appear and disappear–something
that does not happen in the opaque bars. This may have been the signal that
observers used to perform the task and it might have actually interfered in some
fashion with the motion cue.
Error rates increased when the background was removed,
showing that the difference between conditions was not due to a speed-accuracy
tradeoff.
Another indication that the motion cue is not
sufficient to explain the results of Experiment
1 is that the search asymmetry is reversed in Experiment 2. The scrolling dots of the transparent
bars are easier to search for than the static dots of the opaque bar. Our
initial work indicated that motion of the items was necessary to permit
efficient search for an opaque target among transparent distractors. Experiment 2 showed that the motion of the items in
the absence of the background is not
sufficient. Experiment 3: Preserving motion while violating transparency–impossible filters
Perhaps the sufficient cues in the basic opaque versus
transparent search are relative motion cues between items and the background. A
dot on an opaque surface moves relative to a dot on the background. A dot,
visible through a transparent surface, does not move relative to a dot on the
background. Relative motion cues are more salient than absolute motion: Think of
the motion of a cloud in the open sky compared to the motion of the same cloud
passing in front of the moon (Aubert, 1886, reported in Graham, 1965). Relative motion would have been
eliminated by the removal of the background in Experiment 2. If relative motion cues between
background and search items are sufficient, then performance should not be
impaired if we violate the rules of transparency while preserving the motion. In
previous experiments, the bars consisted of an image of the background as seen
through a greenish filter. All of the contours of the background were preserved
in the filtered image, and color and luminance were consistent with the physical
effects of a green filter. In the present experiment, we created an image that
did not correspond to the effects of any physical filter. All contours were
preserved, but a “false color” filter was introduced, which had the
effect of replacing each underlying gray dot with a dot of randomly generated
RGB values ( Figure 8). These are very
colorful, but not physically plausible. The false color opaque items were simply
patches of the false color image that were randomly selected from the background
in the same manner as the opaque bars of Experiment
1.
Figure 8. Movie
of typical stimuli used in the false color conditions of Experiment 3.
Eleven observers from the Visual Attention
Laboratory’s paid observer panel participated in this
experiment.
Transparent and opaque stimuli were as described in Experiment 1a. False color stimuli were
constructed by replacing the greenish filter transformation described in Experiment 1a with an algorithm that randomly
assigned RGB values to the existing background
dots.
There were four blocked conditions: search for an
opaque bar among transparent bars and vice versa, as well as search for a false
colored opaque bar among false colored transparent bars and vice versa. The
first two conditions were a replication of Experiment 1a and are reported in Tables 1 and 2. The order of conditions was counterbalanced
across observers. There were four set sizes: 1, 2, 3, and 4 items. Methods were
otherwise similar to previous
experiments.
Discarded fast and slow RTs
constituted no more than 1% of any individual's data. Data for one observer were
excluded for violating the error criterion. Figure 9 shows the mean correct RTs as a
function of display size for both target present and target absent trials for
the remaining 10 observers. Error rates are shown in Table 4 and reflect the pattern of errors
obtained in Experiment 2.
Figure 9. RT x set size functions for search
for opaque among transparent stimuli (green symbols) and vice versa (red
symbols). Solid lines and symbols are target present data; dashed lines and
hollow symbols are target absent. Slopes are given to the right of the data. The
plausible filter cases (Panel A) are a replication of Experiment 1a, showing an advantage for the opaque
targets. From top to bottom, slopes are given in this order: transparent absent,
transparent present, opaque present, and opaque absent. In the false color
conditions (Panel B), the items have colors inconsistent with any physically
transparent filter. The asymmetry reverses and search becomes markedly slower
and less efficient.
|
|
Plausible
|
FALSE
|
Plausible
|
FALSE
|
|
Target type
|
Opaque
|
Transparent
|
Opaque
|
Transparent
|
Opaque
|
Transparent
|
Opaque
|
Transparent
|
|
Set size
|
MISS % (target present)
|
FA % (target absent)
|
|
1
|
4.5%
|
2.5%
|
5.2%
|
8.1%
|
4.5%
|
5.1%
|
5.0%
|
2.7%
|
|
2
|
3.9%
|
5.4%
|
8.3%
|
8.4%
|
2.3%
|
2.2%
|
4.6%
|
4.1%
|
|
3
|
7.2%
|
7.7%
|
15.9%
|
9.6%
|
2.3%
|
2.2%
|
5.1%
|
3.4%
|
|
4
|
9.7%
|
9.8%
|
14.0%
|
9.6%
|
1.7%
|
2.7%
|
5.1%
|
3.8%
|
It is clear from Figure
9 that the efficiency of search is adversely affected by changing the items
from simulations of a physically plausible filter to simulations of an
impossible filter. Note that all of the motion cues (relative and absolute) and
all of the form cues (T-junctions vs. X-junctions) are identical in the
plausible filter and false color conditions. These motion and form cues are not
sufficient to produce the relatively
efficient search for opaque targets among transparent distractors, though our
pilot experiments suggested that motion cues might be
necessary. Experiments 2 and 3
suggest that neither absolute nor relative motion cues that differentiate
transparency from opacity are sufficient to guide
attention. Experiment 4: Impossible filters with possible luminance statistics
Perhaps the false color filter of Experiment 3 is too dramatic. A transparent surface
produces both local (dot by dot) and global (whole surface) changes in luminance
and chromatic statistics. The false color transformation made radical changes in
the global statistics. Suppose we preserved the global distribution of color and
luminance created by the greenish filter of Experiment 1 but mapped the specific local colors
onto the wrong dots ( Figure 10). This
amounts to preserving overall Michelson contrast, which has been suggested as an
important cue in transparency perception (Singh & Anderson, 2002). Consider two background dots, A and B.
Under the plausible transparent transformation of Experiment 1, A would be transformed to f(A) as it
passed under the filter and B to f(B). Would the ability to search for the
opaque target among transparent distractors be preserved if we mapped, for
example, A to f(B) and B to
f(A)?
Figure 10. Movie of typical stimuli used in Experiment 4: a search for an opaque target among
false transparent distractors. Note
that X-junctions persist with these falsely transparent stimuli.
Seventeen observers from the Visual Attention
Laboratory’s paid observer panel participated in this
experiment.
The stimuli were similar to those used in Experiment 1a. Opaque and transparent bars were
created as before. False transparent bars were created by generating a
background having the same spatial layout as the actual background but with
randomly scrambled grayscale values. This alternate background was filtered to
produce the substrate for the falsely transparent items. These items had the
same average luminance, contrast, and chrominance values as the transparent
item. When they moved over the background, the motion internal to the item was
the same in false and true transparent items. The cues to the falseness of the
transparency would be impossible luminance transitions across the borders (e.g.,
one spot getting brighter and a neighboring spot of the same color getting
dimmer). The impossible transitions also generate X-junctions that violate the
physical constraints on junctions formed by contours crossing filter boundaries
(e.g., Adelson & Anandan, 1990; Beck
& Ivry, 1988; Metelli, 1974).
Each observer was tested in two pairs of conditions. In
one pair, false transparent stimuli were pitted against opaque stimuli. In the
other, false transparent were pitted against transparent. Set sizes were 1, 2,
3, and 4. Observers were tested for 40 practice and 300 experimental trials in
each condition. Order of conditions was counterbalanced across
observers.
Discarded fast and slow RTs constituted no more than 1%
of any individual's data. RT x set size functions are shown in Figure 11. Beginning with Panel A, it is clear
that search for the false transparent items among opaque items and vice versa
was not efficient, certainly not as efficient as the comparable conditions of Experiment 1. In contrast, Panel B shows that the
false transparent item behaved much like an opaque item. It was found with
relative ease among truly transparent distractors (slope of 12 ms/item on target
present trials). Moreover, we see the now-familiar asymmetry. Search for the
transparent target among falsely transparent distractors was less efficient (31
ms/item), though it was still a faster search than the searches for opaque among
false transparent (37 ms/item). Error rates were unremarkable. Misses averaged
8-9% for the false transparent/opaque searches and 5-6% for the false
transparent/transparent searches.
Figure 11. A. RT
x set size functions for search for false
transparent items among opaque stimuli (blue symbols) and opaque targets
among false transparent distractors
(green symbols). Solid lines and symbols are target present data; dashed lines
and hollow symbols are target absent. Slopes are given to the right of the data.
B. RT x set size functions for search for
false transparent items among
transparent stimuli (blue symbols) and transparent targets among
false transparent distractors (red
symbols). Solid lines and symbols are target present data; dashed lines and
hollow symbols are target absent. Slopes are given to the right of the
data
In this experiment, the false transparent items had the
figural properties (X-junctions vs. T-junctions), the movement properties, and
the average luminance properties of truly transparent items. Only the local
luminance values provided information that contradicted the hypothesis of a
transparent filter. However, this violation was enough to cause the false
transparent items to be treated as if they were opaque. Apparently, they are
quite confusable with opaque items, and easier to discriminate from transparent
items in search tasks. Thus we see that, at least for our stimuli, motion cues
are necessary (pilot data) but not sufficient ( Experiment 2, 3,
and this experiment). The same is true for the luminance cues: The correct local
luminance cues are necessary ( Experiment 3 and
this experiment) but not sufficient (luminance cues were present in the pilot
experiments with stationary stimuli). In the remaining experiments, we turn to
the form cues–the junctions formed as contours cross the filter boundary
and the accretion and deletion of contour information at opaque
boundaries.
Experiment 5: Are the figural cues necessary?
A transparent filter produces X-junctions as contours
in the background pass under the filter. An opaque item produces T-junctions
when the contours in the background are occluded by the overlying surface. Experiments 3 and 4
showed that the mere presence of this junction distinction is not sufficient to
produce the relatively efficient searches of Experiment 1. Are these junctions necessary at all?
Twelve observers from the Visual Attention
Laboratory’s paid observer panel participated in this
experiment.
Opaque bars were constructed as before. Here we
introduce a new type of false transparency. In this experiment, a false
transparent item was created by taking a transparent item from one location and
placing it in the incorrect position in the field. Thus, like a transparent
item, contours were accreting and deleting within the falsely transparent bar.
This preserves the motion and the global luminance statistics within the falsely
transparent item. However, this manipulation eliminates item continuity across
the filter border. Non-accidental X-junctions have been eliminated and
T-junctions predominate. Sample stimuli are shown in Figure 12.
Figure 12. Movie of typical stimuli used in Experiment 5. A search for an opaque target among
false transparent distractors. Note
that these falsely transparent items form T-junctions with the background
contours.
Observers were tested on searches for an opaque bar
among false transparent bars and vice versa. These were blocked conditions
consisting of 40 practice trials and 300 test trials. The conditions were
counterbalanced across observers. There were four set sizes: 1, 2, 3, and 4
items. All other aspects of the experiment were similar to previous
experiments.
Discarded RTs constituted no
more than 1% of any individual's data. Figure
13 shows the mean correct RTs as a function of display size for both target
present and target absent trials. Table 5
displays the error
rates.
Figure 13. RT x set size functions for search for
opaque among false transparent stimuli
(green symbols) and vice versa (red symbols). Solid lines and symbols are target
present data; dashed lines and hollow symbols are target absent. Slopes are
given to the right of the data. In this experiment, the
false transparent items were taken from
a piece of background that did not lie under the putative filter so that while
the contours moved like those beneath a transparent filter, contours in the
background were not completed across the filter boundary.
|
|
Opaque
|
False
transparent
|
Opaque
|
False
transparent
|
|
Target
presence
|
Present
|
Absent
|
|
Set size
|
MISS %
|
FA %
|
|
1
|
4.8%
|
6.3%
|
4.0%
|
4.0%
|
|
2
|
6.2%
|
3.7%
|
4.1%
|
5.0%
|
|
3
|
7.2%
|
8.6%
|
4.0%
|
2.9%
|
|
4
|
13.3%
|
12.9%
|
5.3%
|
4.6%
|
Again the results of the manipulation are obvious. The
search is markedly inefficient. Note that these stimuli had the same motion and
global luminance cues as the stimuli of Experiment
1. This suggests that the X-junctions are necessary to support the good
performance seen in Experiment 1, and their
absence can account for poor performance not only here but also in Experiment 2. Like motion and luminance cues, form
cues to transparency/opacity appear to be necessary but not sufficient.
How necessary are those X-junctions? Is it important
that explicit X-junctions be present in the display or is it enough to have the
same contours appearing in the background filtered or unfiltered? Can we hide
the actual point of intersection? That is the next question.
Experiment 6: Good continuation stimuli
Experiment 5 showed
that performance is impaired when X-junctions are removed from the transparent
items. What happens if they are merely hidden? In Experiment 6, a frame was placed around each
item.
Fourteen observers from the Visual Attention
Laboratory’s paid observer panel participated in this
experiment.
In the framed conditions, a 0.2° thick frame was
placed around each search item as shown in Figure 14. It was colored a mid-level green
(6.2 cd/m 2, CIE coordinates:
x = 0.272,
y = 0.385). Stimuli were otherwise
similar to the previous
experiments.
Figure 14. Movie of typical stimuli used in the
frame conditions of Experiment 6. Placing frames
around each item makes T-junctions out of all explicit points of contact between
the search items and the background. In the transparent items, however, contours
can be seen to complete behind the frame.
Observers were tested in four blocked conditions:
search for an opaque bar among transparent bars and vice versa without frames,
search for an opaque bar among transparent bars and vice versa with frames. In
the no frame conditions, the stimuli were the same as those used in Experiment 1a and are reported in Tables 1 and 2. Each condition consisted of 40 practice
trials and 300 test trials. The order of conditions was counterbalanced across
observers. There were four set sizes: 1, 2, 3, and 4
items.
Discarded fast and slow RTs constituted no more than 1%
of any individual's data. RT x set size functions for the no frame and frame
conditions are shown in Figure
15.
Figure 15. RT x set size functions for search for
opaque among transparent stimuli (green symbols) and vice versa (red symbols).
Solid lines/symbols are target present data; dashed lines and hollow symbols are
target absent. Slopes are given to the right of the data. The no frame cases
(Panel A) are a replication of Experiment 1a,
showing an advantage for the opaque targets. In the frame conditions (Panel B),
each item has a frame around it to eliminate explicit X-junctions. However,
contours can be traced in their path behind the frame (good continuation).
Unlike eliminating the X-junctions, hiding the
X-junctions produces only modest effects in this experiment, consistent with
similarly weak effects in transparency perception with comparable manipulations
(Kasrai & Kingdom, 2002). Looking at
the target present trials, there is a main effect of adding a frame,
F(1,13) = 5.9,
p < .05, reflecting a slight slowing
of RT in the frame conditions. The frames do not significantly increase slopes
for either opaque targets, t(13) = 1.5,
p =
ns, or transparent targets,
t(13) = 1.6,
p =
ns, relative to the corresponding no
frame conditions. The advantage for opaque targets is seen again. The main
effect of target type is significant,
F(1,13) = 10.4,
p < .05, and the slopes for
transparent targets are steeper both with a frame,
t(13) = 2.5,
p < .05, and without a frame,
t(13)= 3.7,
p < .01, compared to the slopes for
opaque targets.
Miss errors average 8% in this experiment. They
increase with set size but are not influenced by the frame/no frame
manipulation.
These results suggest that the figural rules that allow
contours to complete under a narrow occluder are at work here as they are in
other visual search tasks (Rensink & Enns, 1998). In combination with the cues
from the motion of the stimulus, these contours provide enough information for
the system to infer the opacity or transparency of items and to permit
relatively efficient search for opaque targets among transparent
distractors. Experiment 7: Is good continuation needed?
In Experiment 6,
background contours could be traced under the occluding frame of a transparent
item. In Experiment 7, we used a background
texture made of much smaller dots. Individual dots were completely occluded as
they passed under the frame in the transparent case. Thus, dot contours could
not be traced. In the absence of such contours, transparency would have to be
inferred from the disappearance and reappearance of the same dots on either side
of the frame and/or from the bridging of the frame by larger structures in the
image–clusters of dots and/or low spatial frequency
information.
Twelve observers performed the frame and no background
conditions. A different set of 15 observers performed the no frame control
conditions. All observers were from the Visual Attention Laboratory’s paid
observer
panel. |