Psychological Review
2008, Vol. 115, No. 4, 836 – 863
Copyright 2008 by the American Psychological Association
0033-295X/08/$12.00 DOI: 10.1037/a0013395
A Boost and Bounce Theory of Temporal Attention
Christian N. L. Olivers and Martijn Meeter
Vrije Universiteit Amsterdam
What is the time course of visual attention? Attentional blink studies have found that the 2nd of 2 targets
is often missed when presented within about 500 ms from the 1st target, resulting in theories about
relatively long-lasting capacity limitations or bottlenecks. Earlier studies, however, reported quite the
opposite finding: Attention is transiently enhanced, rather than reduced, for several hundreds of milliseconds after a relevant event. The authors present a general theory, as well as a working computational
model, that integrate these findings. There is no central role for capacity limitations or bottlenecks.
Central is a rapidly responding gating system (or attentional filter) that seeks to enhance relevant and
suppress irrelevant information. When items sufficiently match the target description, they elicit transient
excitatory feedback activity (a “boost” function), meant to provide access to working memory. However,
in the attentional blink task, the distractor after the target is accidentally boosted, resulting in subsequent
strong inhibitory feedback response (a “bounce”), which, in effect, closes the gate to working memory.
The theory explains many findings that are problematic for limited-capacity accounts, including a new
experiment showing that the attentional blink can be postponed.
Keywords: attention, time course, attentional blink, awareness
One of the brain’s crucial functions is to prioritize relevant over
irrelevant information. It does this by a set of mechanisms we
collectively call selective attention. By its very definition, selective
attention is selective: It is thought that only one or, at most, a few
objects can be processed at a time. The important question then is
how much time is spent on selecting and processing one object or
set of objects before attention is available again for the next
selection. In other words, what are the dynamics of attention?
What is its time course? Or, as others have phrased it, what is the
“attentional dwell time” (Duncan, Ward, & Shapiro, 1994)?
information”) one needs a paradigm that controls for other factors
that may take time, such as having to switch from one location to
the other, or from one complete task to the other. Probably the
most popular paradigm in this respect is the rapid serial visual
presentation task (RSVP; Lawrence, 1971). For the past 15 years,
the two-targets version of the RSVP task has dominated the
literature. Figure 1a depicts a typical example, in which participants are asked to report two letters from a stream of digits all
presented at a single location (Chun & Potter, 1995). Participants
have little difficulty reporting the first target (T1). However, as
shown in Figure 1b, report of the second target (T2) suffers
considerably when it is presented within about 500 ms from T1. It
is as if attention blinks for half a second while it is busy processing
T1; hence, the phenomenon has been termed the attentional blink
(Raymond, Shapiro, & Arnell, 1992).
To our knowledge, all currently active theories of the attentional
blink attribute it to a limited-capacity processing stage, or bottleneck, with a relatively late locus in the information-processing
stream. The idea is that T1 temporarily uses up vital mental
resources that are then not available to T2. Figure 1c illustrates this
resource depletion. The prototypical account is Chun and Potter’s
(1995) two-stage theory, but similar proposals were made earlier
by Broadbent and Broadbent (1987) and a little later by Jolicoeur
and colleagues (Jolicoeur, 1998, 1999; Jolicoeur & Dell’Acqua,
1998; Jolicoeur, Tombu, Oriet, & Stevanovski, 2002). According
to two-stage theories, all items in the stream can be processed up
to semantic levels (often referred to as conceptual short-term
memory), but representations are vulnerable. For conscious report
of a target, a second processing stage is required which consolidates the item in short-term memory proper. This consolidation
process is limited to about one item at a time, and, it is important
to note, takes time. Thus, while T1 is being consolidated, T2 must
wait. Waiting takes longer if T1 processing takes longer, for
instance, when T1 is masked by a subsequent distractor. However,
The Answer From the Attentional Blink: Attention Is
Slow and Deals With One Object at a Time
If one is interested in the pure time course of attention (“how
long does it take before one can attend to the next piece of relevant
Christian N. L. Olivers and Martijn Meeter, Department of Cognitive
Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
Both authors put an equal amount of work into this project and therefore
regard this as a shared first authorship. The work benefited from NWO
Grants 452-06-007 and 451-05-006 from the Netherlands Organization for
Scientific Research to Christian N. L. Olivers and Martijn Meeter and from
discussions with Sander Nieuwenhuis, Brad Wyble, Howard Bowman,
Mark Nieuwenstein, Jane Raymond, and Vince Di Lollo. We are also very
grateful to David Huber for making us simplify and improve the model so
much more. We thank Andrew Leber for planting the term “bouncer” in
our minds. The computational model is available in Excel format from
http://olivers.cogpsy.nl.
Correspondence concerning this article should be addressed to Christian
N. L. Olivers, Department of Cognitive Psychology, Vrije Universiteit,
Van der Boechorststr, 1, 1081 BT Amsterdam, the Netherlands. E-mail:
[email protected]
836
BOOST AND BOUNCE
A. Typical attentional blink task
837
B. Typical data (Chun & Potter, 1995)
1
What was
the 2nd
letter?
What was
the 1st
letter?
T1
0.5
time
T1
T2
0.25
0
0
200
400
600
800
1000
T1-T2 lag in ms.
C. Typical explanation: Resource depletion
lag
0.95
Attentional R
Resources
T2
9
2
7
K
3
F
8
9
0 75
0.75
0.7
0.45
0.2
5
2
+
-0.05
±100 ms
0
200
400
600
800
1000
Time since target detection (ms)
Figure 1. (A) The prototypical attentional blink task, in which the observer is asked to report the two letter
targets (T1 and T2) embedded in a stream of digit distractors. The lag between the two targets is varied. (B)
Typical performance for T1 and T2 as a function of lag (adapted from “A Two-Stage Model for Multiple
Detection in Rapid Serial Visual Presentation,” M. M. Chun & M. C. Potter, 1995, Journal of Experimental
Psychology: Human Perception and Performance, 21, Figure 2, p. 112. Copyright 1995 by the American
Psychological Association. (C) The classic explanation of the attentional blink: T1 processing leads to a
depletion of resources that then cannot be used for T2.
there is little opportunity for waiting, because a distractor quickly
follows T2, overwrites its vulnerable first-stage representation, and
causes an attentional blink as a result. Thus, distractors play two
important roles in two-stage theories: (a) They render T1 processing more difficult, resulting in longer waiting times and, thus, a
more profound blink; and (b) they interfere with the T2 representation, such that T2 becomes sensitive to the temporary resource
depletion (Brehaut, Enns, & Di Lollo, 1999; Chun & Potter, 1995;
Giesbrecht & Di Lollo, 1998; Grandison, Ghirardelli, & Egeth,
1997; Seiffert & Di Lollo, 1997). Indeed, it has been shown that a
skeletal version of the paradigm, with just the two targets and their
immediately following distractors, is sufficient to generate an
attentional blink (Ward, Duncan, & Shapiro, 1997).
A similar account has been proposed by Shapiro and Raymond, with their colleagues (Isaak, Shapiro, & Martin, 1999;
Raymond, Shapiro, & Arnell, 1995; Shapiro & Raymond, 1994;
Shapiro, Raymond, & Arnell, 1994). According to their interference theory, T1, T2, and the respective subsequent distractors
are uploaded into visual short-term memory (VSTM). Within
VSTM, these items then compete for conscious report. However,
because VSTM resources are limited, the competition is biased in
favor of those items that entered first, with further biases toward
those items that closely resemble the targets. Thus, T1 and the
distractor immediately following it often win the competition for
retrieval, rather than T1 and T2. Within interference theory, the
attentional blink period corresponds to the time needed for T1 to
win the competition and be transferred to a report stage, so that
VSTM can be cleared for T2. As in two-stage theory then, items
are initially represented in a feeble memory system (conceptual
short-term memory or VSTM) and, at that moment, compete for
limited resources. These resources are primarily assigned to T1
because it appears first. While T1 undergoes further processing, T2
suffers from either masking or competition for selection from
surrounding distractors and is lost (see Shapiro, Arnell, & Raymond, 1997, for a unified model).
The temporary loss of control theory (TLC; Di Lollo, Kawahara, Ghorashi, & Enns, 2005; Kawahara, Kumada, & Di Lollo,
2006) has been presented as an alternative to limited-capacity
accounts. It attributes the attentional blink to a temporary loss of
control over the stimulus input. It assumes that, leading up to T1,
the system is configured as an input filter: It adopts an attentional
set for targets and against distractors. Targets pass the filter into
higher processing stages necessary for consolidation and response
planning, whereas distractors are rejected. An important assumption is that, because targets and distractors are usually arbitrarily
defined, the input filter must be actively maintained by a central
executive process. However, when T1 passes the filter, its consolidation also requires the dedication of the central executive. This
means that the central executive can no longer maintain the attentional set, and the input filter becomes vulnerable to stimulusdriven disruption from distractors. As a consequence, targets are
no longer allowed to enter, and an attentional blink is observed. As
the central executive gradually becomes available again, the cor-
838
OLIVERS AND MEETER
rect input filter is reinstated, and performance returns to normal. It
is questionable, however, whether TLC indeed manages to avoid
the limited-capacity resource-depletion argument. Notably, it assumes that T1 occupies a central executive for some time, during
the course of which the system is not ready for T2. It appears then
that limited-capacity resources have entered through the back door.
Several computational models have recently been developed
that are based on the above ideas (Bowman & Wyble, 2007;
Battye, 2006; Chartier, Cousineau, & Charbonneau, 2004; Dehaene, Sergent, & Changeux, 2003; Fragopanagos, Kockelkoren,
& Taylor, 2005; Shih, 2008). As we discuss further in the General
Discussion, they all assume either direct competition between T1
and T2 and/or indirect competition via the drainage of some
limited-capacity resource. In summary then, the consensus is that
the attentional blink reflects a relatively long-lasting limitation on
attention after encountering a relevant visual event.
The Answer From Cueing Paradigms: Attention Is Fast
and Can Deal With Multiple Relevant Items Presented in
Rapid Sequence
Other studies have suggested an attentional time course that is
rather opposite to that suggested by the attentional blink. Some of
these are illustrated in Figure 2a. For example, Nakayama and Mackeben (1989) asked observers to detect and identify a target in a
cluttered display, followed by a mask. Preceding the target display, a
cue indicated the target position with 100% validity. Note that because cue, target, and mask always appeared in the same position, this
procedure is not at all unlike RSVP. The results, however, were rather
different: Target-identification performance increased with increasing
stimulus onset asynchrony (SOA) between cue and target display, up
to about 100 –200 ms. Beyond this peak in performance, increasing
SOAs resulted in gradually decreasing cueing benefits over the time
course of several hundreds of milliseconds. Figure 2b (top left panel)
shows Nakayama and Mackeben’s findings, whereas Figure 2c illustrates the hypothesized steeply rising and slowly decaying underlying
attentional time course function. Nakayama and Mackeben referred to
this temporary enhancement in performance as transient attention and
argued that it is largely automatic, operating at a relatively early level
in visual processing. They distinguished it from a slower, more
sustained attentional component. Similar transient attentional enhancement functions have been found or hypothesized by others using
various paradigms, including cued visual search (Kristjánsson, Mackeben, & Nakayama, 2001), classic spatial cueing (Müller & Rabbitt,
1989), spatial distortion (Suzuki & Cavanagh, 1997), imageclassification techniques (Shimozaki, Chen, Abbey, & Eckstein,
2007), saliency-guided search (Nothdurft, 2002), illusory line motion
(Hikosaka, Miyauchi, & Shimojo, 1993), temporal order judgment
(Scharlau, Ansorge, & Horstmann, 2006), flash-lag effects (Bachmann & Oja, 2003), eye-movement measurements (Mackeben &
Nakayama, 1993), and RSVP (Botella, Barriopedro, & Suero, 2001;
Chua, Goh, & Hon, 2001).
A rapid transient attention function was actually proposed earlier by Weichselgartner and Sperling (1987), using a combination
of cueing and RSVP. In some of their experiments, observers
monitored a single stream of digits, waiting for a particular cue
(which was typically an outline square or highlighted digit). The
task was to report the cued digit as well as the digits following it.
They found that the cued digit and the one immediately following
it had a high likelihood of report, as had digits later in the stream
(from 300 ms onward). In between, at about 200 ms, was a brief
dip. Figure 2b (top right panel) shows this pattern of results. Like
Nakayama and Mackeben (1989) did later, Weichselgartner and
Sperling argued for an initial, rapid, and automatic attentional
component that was then followed by a slower, more controlled
component. Weichselgartner and Sperling referred to these
two components as two attentional “glimpses.” The brief dip at
200 ms represented the transition between these two glimpses.
It deserves mentioning that Raymond et al. (1992) argued that
this dip might actually reflect an attentional blink. We agree, but,
as we explain later, if it is an attentional blink, it is not caused by
capacity limitations (nor was this claimed as such by Raymond et
al., 1992). The typical limited-capacity explanation would be that
the cued digit drains attentional resources, at the expense of the
digit presented at 200 ms. However, there are several aspects of
Weichselgartner and Sperling’s (1987) data that go against this
possibility. For one, the dip disappeared when a faint outline
square was used to indicate the first to-be-reported digit, or when
an auditory instead of visual cue was used. This suggests there may
have been some low-level visual interference involved. More
important is the fact that, within the second glimpse, observers
were able to report more than the two targets so typical for the
attentional blink task. For example, observers might report one
digit in their “first glimpse” and then three subsequent digits in
their “second glimpse” without much trouble. Limited-capacity
theories would predict a second attentional blink in the second
glimpse (see, e.g., Chun & Potter, 1995).
Further evidence that attention can really deal with multiple
targets presented in rapid succession comes from Reeves and
Sperling (1986; see also Sperling & Reeves, 1980). In these
experiments, participants were presented with two concurrent
RSVP streams of characters, one on the left of fixation and
containing letters, the other on the right of fixation and containing
digits. The task was to monitor the left stream for a cue to switch
to the other stream (e.g., the letter C) and, as soon as it occurred,
to start collecting digits from the right stream. Streams could run
at various speeds (including those used in typical attentional blink
studies), and by assessing both accuracy and order of report from
the second stream, Reeves and Sperling determined which items in
the second stream had received most attention since the onset
of the cue. Across RSVP speeds, the data showed a remarkably
consistent distribution as a function of time. After an initial delay
(regarded as the time needed to detect the cue and switch to the
second stream), chances of a digit being reported rose quickly and
then deteriorated more slowly, such that most reports came from a
period of 250 – 600 ms after cue onset. Figure 2b (bottom left
panel) shows performance for one observer for streams running at
9.2 ms/item. The important finding here is that observers had
relatively little trouble reporting multiple targets from this 250 –
600-ms period. Any existing attentional blink theory would predict
that the very first digit identified in the second stream would cause
an attentional blink for the following digits. There was no sign of
an attentional blink. On the contrary, according to Sperling and
Reeves (1980, p. 359), “the distribution directly estimates the
moment of maximal attention. . . i.e., the moment of fastest registration—the moment at which the shutter is fully open.” In Reeves
and Sperling (1986), this performance function was described as
the opening and closing of the attentional gate to short-term
BOOST AND BOUNCE
A. Typical RSVP/cueing tasks
B. Typical data
> Nakayama & Mackeben (1989)
Cueing
Nakayama & Mackeben (1989)
Report target
presence &
identity
P(Corrrect report)
1
Mask
839
1
0.75
0.75
0.5
0.5
0.25
0.25
0
0
1
200
400
600
800
Reeves & Sperling (1986)
Weichselgartner & Sperling (1987)
0
1000 -100
100
300
500
Cue - target onset asynchrony in ms.
0.75
SOA
0.5
Cue
0 25
0.25
(estimated cue detection time)
0
0
> Reeves & Sperling (1986)
> Weichselgartner & Sperling (1987)
Cueing & switching
0
1
9
Report next
H
7
four digits
3
Switch
C
5
4
A
8
In one condition of
2
Weichselgartner and
P
Sperling (1987) there was
5
200
400
600
800
Cue-Target SOA
Cue - target onset asynchrony in ms.
C. Typical explanation:
C. Transient attentional enhancement
T
Stream 1
Stream 2
only one stream and the
cue was an outline square
Attentional Reso
A
ources
0.95
0.7
0.45
0.2
-0.05
0
200
400
700
Cue - target onset asynchrony in ms.
600
800
Time since target detection (ms)
Figure 2. (A) Illustration of the tasks used by Nakayama and Mackeben (1989; top panel) and Sperling and
colleagues (Reeves and Sperling, 1986; Weichselgartner & Sperling, 1987; bottom panel). All paradigms
resemble a combination of rapid serial visual presentation task (RSVP; Lawrence, 1971) and cueing. In
Nakayama and Mackeben, observers saw a cue, a target, and a display-wide mask. Cue and target always
appeared in the same position. The task was to determine the presence and identity of one of two possible targets.
In Reeves and Sperling, observers monitored one RSVP stream for a cue (“C”) to switch to a second stream, from
which they were required to report the next four digits they saw. In Weichselgartner and Sperling, observers
monitored a single stream for an outline square (or another type of cue), after which they had to switch to
reporting digits from the remainder of the stream. (B) Top left panel: Averaged accuracy data for subject NW
of Nakayama and Mackeben (as estimated from their Figure 8). Top right panel: Average target report accuracy
for participant EW (as estimated from Weichselgartner & Sperling’s Figure 2b). Bottom left panel: Average
target-report accuracy for participant AR in a dual-stream RSVP task running at 9.2 ms/item, after the observer
has switched from the first to the second stream (as estimated from Reeves & Sperling’s Figure 3). (C) Both
transient attentional enhancement and transient attentional gating accounts hypothesize the temporary recruitment of attentional resources after a target has been detected (a gamma distribution that returns in both Reeves
& Sperling and Nakayama & Mackeben, with different assumed delays). Data adapted from “Sustained and
Transient Components of Focal Visual Attention,” K. Nakayama & M. Mackeben, 1989, Vision Research, 29,
Figure 8, p. 1683. Copyright 1989 by Elsevier; from “Attention Gating in Short-Term Visual Memory,” A.
Reeves & G. Sperling, 1986, Psychological Review, 93, Figure 3, p. 184. Copyright 1986 by the American
Psychological Association; from “Dynamics of Automatic and Controlled Visual Attention,” E. Weichselgartner
& G. Sperling, 1987, Science, 238, Figure 2, p. 779. Copyright 1987 by the American Association for the
Advancement of Science.
1000
OLIVERS AND MEETER
840
memory and was modeled with a gamma distribution with a
relatively steep rise (within 150 ms after cue detection) and a
relatively slow decay (lasting several hundreds of milliseconds),
just like the function drawn in Figure 2c. Later, Sperling and
Weichselgartner (1995; see also Shih & Sperling, 2002) described
it as a transition function from one attentional episode to the next.
Boost and Bounce Theory
Thus, any theory of temporal attention must deal with a major
paradox: Whereas the attentional blink appears to indicate that the
attentional resources available for perceptual input are reduced
when a relevant event is encountered, the transient attentionalenhancement results appear to indicate that additional attentional
resources are being recruited. What is more, the time courses of
these two functions are remarkably similar. In fact, we have
plotted the two time course functions in Figures 1c and 2c as
vertical mirror images of each other. None of the current theories
can simultaneously explain both these phenomena, simply because
these theories start from opposite premises: the reduction versus
the recruitment of attentional resources.
Our boost and bounce theory seeks to integrate the two phenomena.
In the present article, we present a new theory within which this
similarity in time course is no coincidence. We call this theory the
boost and bounce theory of temporal attention, after its two crucial but
straightforward functions: Attention boosts the visual input by responding in an excitatory manner whenever relevant information (a
target) is encountered. It blocks, or bounces, the visual input in an
inhibitory manner whenever irrelevant information (a distractor) is
encountered. The theory claims that performance is eventually determined by the interaction between these two functions and the stimulus
input. It is important to note that there is no role for capacity limitations or resource depletion in explaining the attentional blink, and its
apparently long time course (in the order of 500 ms) is the result of
underlying microdynamics operating at a much smaller time scale (in
the order of 100 ms).
The theory shares a number of ideas with the TLC account of Di
Lollo et al. (2005)—namely the importance of an input filter and
a crucial role for distractors— but eliminates the necessity to
invoke limited-capacity resources. The theory shares even more
with the temporary suppression account proposed by Raymond et
al. (1992), which, incidentally, was the very first theory of the
attentional blink. According to the temporary suppression account,
T1’s defining property opens an attentional gate to higher systems
so that it can be recognized. However, the post-T1 distractor is also
allowed to enter, potentially leading to a false conjunction with the
target-defining feature and, hence, false identification. The attentional blink occurs because this potential for conjunction errors
calls for a period of suppression (or closing of the gate) to protect
target processing. This then goes at the expense of later targets. For
several reasons, Raymond and colleagues abandoned the suppression account in favor of a limited-capacity account (Raymond et
al., 1995; Shapiro & Raymond, 1994; Shapiro, Arnell, & Raymond, 1997). First, the inhibition was thought to operate “at a
relatively early stage of processing” (Raymond et al., 1992, p.
854), whereas later evidence clearly suggested that blinked items
reach high levels of processing, including the activation of semantic representations (Anderson, 2005; Chua et al., 2001; Luck,
Vogel, & Shapiro, 1996; Maki, Frigen, & Paulson, 1997; Marois,
Yi, & Chun, 2004; Martens, Wolters, & Van Raamsdonk, 2002;
Potter, Dell’Acqua, Pesciarelli, Job, & Peressotti, 2005; Rolke,
Heil, Streb, & Hennighausen, 2001; Sergent, Baillet, & Dehaene,
2005; Shapiro, Driver, Ward, & Sorensen, 1997; Shapiro, Caldwell, & Sorensen, 1997; Visser, Merikle, & Di Lollo, 2005; Vogel,
Luck, & Shapiro, 1998). Second, the hypothesis that T1 needs
protection from conjunction errors predicts that if no conjunction
errors are possible (e.g., when targets and distractors are from
different categories, or T1 involves a detection task, rather than an
identification task), there should be no blink. This prediction did
not hold either (Chun & Potter, 1995; Shapiro et al., 1994). Finally,
there appeared to be a logical inconsistency in the idea that the
post-T1 distractor triggered the inhibition: “[I]t is difficult to
envision a mechanism capable of inhibiting a stimulus that itself
caused the suppression” (Raymond et al., 1995, p. 661).
However, in our view, the baby has been thrown out with the
bathwater. The temporary suppression model can be saved if we
drop the assumptions that (a) suppression occurs only early in the
system, (b) suppression is necessary to protect T1, and (c) an item
cannot trigger its own suppression. In a way, then, our work serves
to rehabilitate the temporal suppression account, be it in a different
guise. In what follows, we explain the theory and show how it is
implemented in a computational model. We then demonstrate that
this model simulates a substantial number of studies on transient
attention, as well as the attentional blink. We also show that the
theory generates predictions that distinguish it from limitedcapacity theories. Finally, we discuss the crucial differences with
existing computational versions of those theories.
Stage 1: Sensory Processing
In line with Chun and Potter (1995), and many other attention
theories, our theory assumes two major information-processing
stages, both of which are illustrated in Figure 3. Although we refer
to these as stages, they are not strictly sequential but, rather,
interact. The Appendix describes a computational model in which
these interactions are implemented. The first stage, which we refer
to as sensory processing, consists of the activation of representations of perceptual features, such as color, shape, and orientation,
but also of high-level representations involving semantic and categorical information. We assume that these different properties are
represented separately, for example, in different feature maps that
feed into one another and together form a hierarchy from simple to
complex processing (e.g., Treisman & Gelade, 1980; Ungerleider
& Mishkin, 1982; Zeki, 1978). We also assume that these initial
sensory signals spread through the system in a mainly feedforward
manner (e.g., Lamme & Roelfsema, 2000).
In line with physiological evidence, representations are activated
rapidly and relatively strongly at the onset of the stimulus (the initial
visual transient, not to be confused with transient attention here) and
then decay to a more sustained level (when the stimulus remains on)
or resting level (when the stimulus is switched off) of activation (e.g.,
Breitmeyer & Ganz, 1976). Multiple representations can be activated
in parallel, but because of items being presented in the same location
in RSVP, an individual item’s activation is affected by preceding and
succeeding items (following, e.g., Keysers, Xiao, Földiák, & Perrett,
2001; Macknik & Livingstone, 1998; Rolls & Tovee, 1994). The
strength of forward masking effects on an item is assumed to depend
on its similarity to the preceding item, whereas the strength of back-
BOOST AND BOUNCE
841
Figure 3. Diagram of the boost and bounce theory. Stimuli in the rapid serial visual presentation task
(Lawrence, 1971) stream are subject to sensory processing, including activation of color, shape, and semantic
properties. The attentional set required for the task is implemented in the working memory gating system, which
is a combination of excitatory and inhibitory gate neurons maintaining feedback loops that respectively modulate
the target- and distractor-related sensory activity. In this way, they open or close the gate to working memory,
within which incoming information is then linked to reportable (e.g., verbal) representations. When a target
arrives, strong attentional enhancement (i.e., excitatory feedback) is triggered, allowing the target to enter
working memory. The gate remains open as long as relevant information enters. However, in the attentional blink
paradigm, the bulk of the excitatory feedback hits the post-T1 distractor, which then triggers a strong inhibitory
feedback response from the gate neurons in turn. An attentional blink is the consequence.
ward masking is assumed to depend on the similarity to the following
item. This can be interpreted as masking being, in essence, affected by
stimulus saliency: The more an item differs from its predecessor, the
more salient it is, the less it is masked. Finally, in accordance with
neurophysiology, the signal adapts, so that it is reduced when an item
was already presented recently (e.g., Legge, 1978).
Stage 2: Working Memory
The second component is working memory. Working memory
serves as the global workspace, central executive, or task monitor
in which the rules applying to the task at hand are implemented
and maintained (cf. Baars, 1989; Baddeley & Hitch, 1974;
Bundesen, Habekost, & Kyllingsbæk, 2005; Dehaene, Kerszberg,
& Changeux, 1998; Desimone & Duncan, 1995; Lavie, Hirst,
Fockert, & Viding, 2004; E. K. Miller & Cohen, 2001). Within our
theory, this means that systems underlying working memory can
flexibly monitor and maintain information and couple the relevant
input to the relevant response. An item can only be reported when
it enters working memory, because only then can it be linked to a
response. Exactly how this stimulus–response mapping occurs is
an important question and not one we are able to answer here. We
simply assume that after receiving verbal instructions and some
practice trials, the crucial links have been established. (In other
words, we cannot yet fully ban the homunculus, but once he has
wired the controls at the start of the experiment, he dozes off and
plays no further role in our model.) In line with Dehaene et al.
(1998), we envisage that these links are established through feedback connections from central working memory neurons to rele-
842
OLIVERS AND MEETER
vant sensory representations on the one hand and to relevant
response representations (whether overt or covert) on the other. An
item may enter working memory—that is, it is selected—when
sufficient (i.e., above threshold) evidence for its presence has
accumulated during the time it was active in the system. The
strength of this activation depends on (a) the bottom-up activity of
the item, as described in the previous section, and (b) the top-down
attentional modulation of this activity received through the feedback connections, as is described next.
Gating
Not only does working memory link relevant stimuli to a response but it also prevents irrelevant information from interfering
with behavior by shielding response systems from unfiltered sensory representations (e.g., Engle, Conway, Tuholski, & Schisler,
1995; Fuster, 1997; Hasher & Zacks, 1988; Jonides, Smith,
Marshuetz, Koeppe, & Reuter-Lorenz, 1998; Lavie, 2000). In
other words, working memory employs an input filter, or attentional set, for relevant information and against irrelevant information. Others have also used the term template matching, but we use
the term gating (following, e.g., Reeves & Sperling, 1986). Gating
occurs through feedback that modulates sensory processing (e.g.,
Lamme & Roelfsema, 2000; Di Lollo, Enns, & Rensink, 2000).
Sets of sensory representations that are important for the task are
enhanced, or boosted, through excitatory feedback, whereas sets of
sensory representations that are irrelevant are tempered, or
bounced, through inhibitory feedback. Of course, the ideas of
enhancing target information and suppressing distractor information are not new, and they constitute the mechanisms behind many
a selective attention theory (Bundesen et al., 2005; Houghton,
1994; Lavie, 2000; Wolfe, 1994, to name but a few). We assume
gating to be fuzzy. For reasons of cognitive economy, as well as
flexibility, there is no one-to-one mapping between every single
sensory neuron and a gate neuron (Postle, 2006). Instead, gate
neurons operate on populations of sensory neurons, on relatively
broad classes or categories of representations. For example, gating
may be driven by “letter-like,” “colored,” or perhaps as specific as
“reddish” stimuli but cannot apply only to something like “the
little maroon espresso cup” at the exclusion of all other stimuli. In
this sense, gating is also contingently automatic: Once an attentional set has been implemented (presumably at the start of the
experiment), items that sufficiently match this set automatically
elicit feedback activity, even if they are not targets (Bargh, 1992;
Folk, Remington, & Johnston, 1992).
The modulation is multiplicative: It needs to interact with stimulusbased activity (Grossberg, 1995). Furthermore, feedback is issued by
gate neurons that are themselves driven by stimuli that are predefined
as targets or distractors. Enhancement and inhibition through feedback thus only really start to take off when a stimulus is present.
Furthermore, we assume that gate neurons respond with an initial
strong burst of activity, after which activity tapers off (e.g., Connors
& Gutnick, 1990; Fuster, Bauer, & Jervey, 1982; Goldman-Rakic,
1995; Tomita, Ohbayashi, Nakahara, Hasegawa, & Miyashita, 1999;
Zipser, Lamme, & Schiller, 1996). Functionally, this corresponds to
the idea that updating (as opposed to maintenance) of working memory only needs to be transient (Hazy, Frank, & O’Reilly, 2006).
Together with others, then, we assume that attention has a strong
transient component (Nakayama & Mackeben, 1989; Weichselgartner
& Sperling, 1987; see also Bowman & Wyble, 2007, and Fragopanagos et al., 2005).
Figure 3 illustrates the boosting and bouncing feedback loops
underlying the gating mechanism. When it is one’s task to report
letters but ignore digits, letter representations receive excitatory
modulation, whereas digit representations are subject to inhibitory
modulation. In both cases, although the relevant links are established, a concrete input from either category is necessary to elicit
feedback from a gate neuron. As mentioned, the feedforward
sweep of activity caused by stimulus presentation is assumed to
travel upward through a hierarchy of representations. Similarly,
top-down feedback activity travels in the opposite direction.
Although initially only the alphanumeric identity representations
that trigger the feedback are modulated by either boosting or
bouncing feedback, it is assumed that the feedback activity rapidly
spreads down the hierarchy to underlying layers of representation
that originally fed into the identity representation. At the bottom of
the hierarchy, this includes strong attentional modulation of the
location of the feedback-eliciting item. This means that modulation
is not limited to only the triggering stimulus but also affects
successive items presented at the same location.
The strength of the modulation is affected by the strength of the
sensory evidence. The stronger the evidence for a target or a
distractor, the stronger the respective excitatory or inhibitory feedback. For this purpose, within the model, the transient attentional
response is weighted by the sensory evidence accumulated during the
first 15 ms of presentation. The sensory evidence itself is affected by
two factors. First, the current state of attention modulates the strength
of the sensory evidence, as bottom-up and top-down mechanisms
continuously interact: Current excitatory feedback enhances the sensory evidence, whereas inhibitory feedback reduces it. Second, as
explained earlier, the less similar targets and distractors are, the
stronger the relative sensory evidence is (because of reduced masking
and increased salience). This also means that when overall target
salience is very high, the need for active gating is strongly reduced
(although not absent), because the bottom-up activity itself provides
strong evidence for what is a target and what is not. Hence, the
strength of the gating is further weighted by the overall similarity
between targets and distractors within the stream.
Last but not least, the spreading of feedback activity through the
visual system takes time. Within our model, after sufficient activity has reached the layers of representation defining the target (i.e.,
a target has been detected), it takes 25 ms for it to start reaching the
lowest layers. The peak of the feedback activity is not until another
70 ms have passed, and hence, the bulk of attention does not arrive
until roughly 100 ms after target detection. This delay means that,
in RSVP paradigms, the triggering stimulus may actually already
be gone and replaced by the next stimulus by the time the full
chain of recurrent processing is operational. Thus, although the
goal of modulation by feedback may be to retrieve the precise
identity of a target stimulus, the ensuing feedback may accidentally also, or even predominantly, modulate the trailing stimuli.
Capacity Limitations
There are two capacity limitations in our model:
1.
No more than one attentional set (or task set) can be
active at the same time. This entails that a reconfiguration
BOOST AND BOUNCE
of selection settings, or switch, is necessary when there
are multiple tasks, multiple locations, and possibly multiple modalities involved. New gate neurons need to be
recruited or configured to filter for a different location or
a different kind of stimulus. The exact mechanisms of
such switches are not implemented in the model yet. For
the time being, we assume that gate neurons activate one
another and that this causes a transition from one attentional set to the next. Following Sperling and Weichselgartner (1995), we modeled this as follows: After a
stochastically determined interval, a new set of attentional gates opens, and the old gates close. We chose a
logistically distributed stochastic interval with a mean
that depends on the type of switch. Endogenously cued
switches (including task switches) have a mean switch
time of 200 ms, whereas exogenously cued switches are
assumed to take only 75 ms (Carlson, Hogendoorn, &
Verstraten, 2006; Cheal & Lyon, 1991; Eriksen, 1990;
Jonides, 1981; Luck & Vecera, 2002; Posner, 1980;
Posner & Cohen, 1984; Shulman, Remington, &
McLean, 1979).
2.
Working memory storage capacity is limited. The likelihood of stimuli entering working memory is not constant
throughout an RSVP stream: When capacity has been
exhausted by previous items that have entered working
memory, new items cannot enter. Capacity was set at five
items (following the evidence that working memory capacity is about four to seven items, depending on additional memory strategies, such as chunking, rehearsal, or
efficient use of additional memory systems, Cowan,
2001; G. A. Miller, 1956).
It is important to note that, within our theory, neither of these
capacity limitations provides an explanation for the attentional blink,
as the attentional blink easily occurs with only two targets, for which
one and the same attentional set applies (e.g., Chun & Potter, 1995).
Explaining Empirical Findings: Transient
Attentional Enhancement
In this and the following section, we describe how boost and
bounce theory explains performance enhancement under some
circumstances but performance deterioration under others. It also
explains many related findings. For each finding, we demonstrate
the viability of our theory through simulations with the computational model specified in the Appendix. It is important to note that
these simulations were generated with just one set of parameter
values (except for parameters governing similarity between items,
where mentioned). Although this precludes optimizing the fit of
particular individual experiments, we wished to capture the overall
pattern of findings with as few assumptions as possible. The
simulations should therefore be evaluated on the basis of their
qualitative, not quantitative, fit of the data pattern.
Transient Spatial Cueing Effects
A relevant event (usually a target or a valid cue) elicits excitatory
feedback from gate neurons to the sensory neurons activated by the
843
event. This results in a transient boost of the neuronal signals coding
for this event, including representations of its features and source
location. When the gate neurons are triggered, the enhancement is
initially strong but then decays back to normal levels. This mechanism
explains the results of the cueing study of Nakayama and Mackeben
(1989), in which observers had to detect a target in a cluttered and
masked display. A 100% valid abrupt onset cue indicated the target
position. Performance first rose rapidly with increasing time between
cue and target, but for cue–target intervals greater than about 100 ms,
it slowly decreased again, while remaining above chance levels.
Because in Nakayama and Mackeben’s study, the target and mask
were always in the same position as the cue, the largest part of their
task can be simulated as a straightforward RSVP. However, because
participants did not know where the abrupt onset cue would appear,
we assume that the initial part of processing was dominated by an
exogenously triggered spatial switch necessary to orient to the new
location. Within our model, such a switch takes on average 75 ms.
Figure 4a shows the performance predicted by the model for a target
presentation duration of 33 ms (which is the duration Nakayama and
Mackeben used for most of their participants).1 Because the cue
remained on during target presentation, and the target was presented
within a cluttered array and followed by a strong mask, we assumed
masking to be stronger here (by increasing the similarity by a factor
5) than in the subsequent RSVP paradigms.
Two Glimpses
The idea of a rapid transient attentional enhancement had surfaced earlier, in a study by Weichselgartner and Sperling (1987),
who referred to it as the first of two attentional “glimpses.” The
second of the two glimpses was thought to reflect a slower, more
sustained attentional process independent of the first glimpse. In
the crucial condition, observers monitored a single stream of digits
for a digit that was uniquely accompanied by a salient cue (e.g., a
surrounding square or a highlighted numeral). The task was to
report this cued digit, as well as the digits that followed it. The
pattern of performance showed two peaks: There was relatively
good performance for the initial target and often the item presented
immediately after, as well as for items presented after about
300 – 400 ms (after which performance gradually decreased). Between these two peaks, performance showed a brief but marked
drop. As shown in Figure 4b, our model reproduces these findings.
However, within our model, the “two glimpses” are not the result
of two independent attentional processes being separated in time
but of the continuous interaction of the boosting and bouncing
gating functions. Note that in Weichselgartner and Sperling’s task,
observers needed to ignore digits until the cue (the surrounding
square) was presented, after which digits were to be reported. This
has two implications in our model: (a) Gate neurons are initially
1
Although the Nakayama and Mackeben task involved a spatial cue, it
is mainly informative about the time course of attention, rather than spatial
selection. This is because the target (and, in a sense, the display-wide
mask) always appeared in the same location as the cue. It is interesting to
note that the time course of transient attention does show remarkable
similarities to that of inhibition of return in more classic spatial cueing
paradigms (Maylor & Hockey, 1985; Posner & Cohen, 1984). Whether or
not transient attention is an important factor in inhibition of return remains
an important question for the future.
OLIVERS AND MEETER
844
A. N
Nakayama & Mackebe
en (1989)
1
0.75
0.5
0.25
0
0
200
400
600
800
B. W
Weichselgartner & Spe
erling (1987
7)
p(Correct)
1
0.75
0.5
0.25
0
-100
100
300
500
700
Cue - target ons
set asynchrony in
n ms.
C. R
Reeves & S
Sperling (1986)
1
0.75
cue, digits are to be treated as targets, and an endogenous shift in
attentional set toward accepting digits is now required. Such an
endogenous task switch takes time. The dynamics of the model are
then as follows: The presentation of the cue results in a boost of the
identity of the simultaneously presented digit, which enters working memory. Because of inherent delays, the boost reaches its
maximum level when the item after the cued item is presented.
Hence, the identity of the post-cue item often also enters working
memory, despite the fact that, within the initial attentional set, it is
still a distractor. Together, these first items in the target series
constitute the first glimpse. At the same time, the cue also acts as
a signal to initiate the task switch, which takes 200 ms on average
(because it is endogenous). During this period, gate neurons engaged with the first task set have not yet switched off, and the
boosted post-cue item sets off inhibition exactly because it is still
being treated as a distractor. In other words, the attentional gate
closes for the next item, resulting in a dip in performance. Later on,
we see that this inhibition, as set off by the post-target distractor,
plays an important role in explaining the attentional blink. In fact,
the brief performance dip in the Weichselgartner and Sperling
results can be regarded as an attentional blink beginning to develop
(cf. Raymond et al., 1992) but then being rapidly reversed as the
second task is activated. This is because, as time passes, the
attentional set is more and more likely to have switched to one in
which digits without accompanying cues are treated as targets. The
attentional gate reopens, and performance rapidly recovers. Excitatory feedback, although already subsiding, is still active and
catches some of these targets in its tail: A second glimpse is
observed (see Figure 4b). Finally, as working memory fills up with
targets, new items are increasingly unlikely to enter working
memory. The model’s prediction then is that, when task switches
are removed, only a single glimpse should emerge. Indeed, recent
data (Olivers, Van der Stigchel, & Hulleman, 2007; Di Lollo et al.,
2005) appear to support this prediction.
0.5
Switching Streams
0.25
0
0
200
400
600
800
Cue-Targ
get SOA (ms)
Figure 4. Model simulations regarding the attentional enhancement findings
of (A) Nakayama and Mackeben (1989), (B) Weichselgartner and Sperling
(1987), and (C) Reeves and Sperling (1986). In Simulation A, backward
masking of the target was assumed to be stronger (by a factor of 5) than in rapid
serial visual presentation paradigms, as Nakayama and Mackeben used cluttered
target displays with relatively strong, long duration masks. In Simulation B, the cue
(outline square, highlighted numeral) was assumed to be more salient than typical
rapid serial visual presentation items, and its color similarity was therefore reduced
to 0.5. SOA ⫽ stimulus onset asynchrony.
set to filter for the additional feature provided by the cue (the
outline square). This means that, initially, items with cue are
treated as targets and cause enhancement, whereas items without
cue are treated as distractors and cause inhibition. (b) After the
Figure 4c shows how the model accounts for the item-report
data from observer AR in Reeves and Sperling (1986) for an RSVP
task consisting of two streams running at 9.2 items/s. In this task,
observers were required to switch streams as well as tasks after a
cue in the first stream (the cue itself was not to be reported). The
data show the proportion of correctly reported items for each
temporal position in the second stream, after the cue. Within the
model, the first 200 ms of performance are therefore dominated by
the fact that working memory needs to switch from filtering for
cue-defining properties to filtering for digits in the second stream.
Unlike the Weichselgartner and Sperling (1987) task, in this task,
no first glimpse is observed, because as long as the attentional
switch has not occurred, participants are still attending the first
stream, which does not contain any reportable items. Then, as the
switch to the second stream is completed, the targets start to enter
and fill up working memory. Because there are targets in only the
second stream, only excitatory feedback is operational, and there is
no inhibition. Nevertheless, performance eventually deteriorates
simply because working memory is full.
BOOST AND BOUNCE
Explaining Empirical Findings: The Attentional Blink
We consider the typical attentional blink task of having to report
two letters from a stream of digits, as it involves no spatial or task
switches. Excitatory feedback links are set up for letters, and
inhibitory links for digits, although note again that these categories
are considered to be fuzzy. The stream is running, but T1 has not
been presented yet: Only distractors are encountered, and during
the first few, the system settles in a stable inhibitory state sufficient
to keep them out of working memory. Direct evidence for such a
pre-T1 inhibitory state comes from Dux, Coltheart, and Harris
(2006); Maki and Padmanabhan (1994); Olivers and Watson
(2006); and Sahraie, Milders, and Niedeggen (2001).
Then, after the initial streak of distractors, T1 appears. Sensory
evidence for a target arrives in the neurons coding for targets,
which, in turn, elicits feedback from the gate neurons in working
memory linked to these neurons. The result is a surge of excitatory
recurrent processing, boosting sensory signals for T1 and allowing
T1’s identity to enter working memory. Figure 5 illustrates the
dynamics of attention following T1. The most important aspect of
the model is that, even though T1 benefits, the peak of the boost in
excitatory feedback actually arrives after T1, especially at the
845
lower layers of the representational hierarchy, which need some
more time before they are reached by the recurrent wave of
activity. At the usual presentation rates of 100 ms/item, this means
that the bulk of attention lands on the post-T1 item.
The attentional blink occurs when this post-T1 item is a distractor. Because of the delay in excitatory feedback, this distractor
accidentally receives the maximal enhancement intended for T1.
As a consequence, the gate neurons now receive a very strong
signal of the wrong category entering working memory. Because
the gating system has been set up to reject distractors, this strong
signal automatically results in a bounce—a strong inhibitory response, which, in effect, closes off working memory from the
RSVP stream. As the inhibition builds up over the course of the
post-T1 distractor’s presentation, the distractor itself still has a
reasonable chance of entering working memory, resulting in substitution errors when the task allows for such errors to be made (cf.
Isaak et al., 1999). However, just as the boost reaches its maximum
after T1, the bounce reaches its maximum after the post-T1 distractor. If this next item turns out to be a target, an attentional blink
arises: Because of the strong inhibitory feedback, this target is less
likely to muster sufficient activity in time to make it into working
Figure 5. Simulation of the basic attentional blink paradigm. (A) The model’s performance is shown for T1
(the open marker on the y axis) and for T2 as a function of T1-T2 lag. (B) The model’s dynamics for a T2 at
lag 2 is shown in detail, with (1) the bottom-up sensory signal, (2) the top-down attentional response to the input
(as a combination of both excitatory and inhibitory feedback), and (3) the combined bottom-up and top-down
signal.
OLIVERS AND MEETER
846
memory. This is the case even though this target itself generates
some net attentional boost (see Figure 5). Over time, as the strong
transient inhibition summoned by the post-T1 distractor gradually
resides, performance gradually improves.
Thus, the attentional blink is completely determined by the
temporal dynamics of excitatory and inhibitory selection mechanisms following T1 and its trailing distractor(s). It is not determined by the need to process and consolidate T1 within working
memory, nor is it determined by limited working memory capacity. Because the most important factor is the delay in feedback
processing, one prediction is that the attentional blink is mainly
time-based, not item-based (even though it is not entirely timebased, because distractor items are still necessary to trigger the
inhibition). If, for example, the stream runs at twice the speed,
twice as many items should fall within the blink. This prediction
has recently been confirmed (Bowman & Wyble, 2007; Martens,
Munneke, Smid, & Johnson, 2006; Popple & Levi, 2007; see
Nieuwenhuis, Gilzenrat, Holmes, & Cohen, 2005, for a partial
manipulation), and, as shown in Figure 6, our model reproduces
these findings.
The past decades have generated a wealth of other findings
related to the attentional blink, many of which we now discuss.
Lag-1 Sparing
Probably the most important phenomenon accompanying the attentional blink is lag-1 sparing. Performance for T2 is typically high
when T2 immediately follows T1 (e.g., Chun & Potter, 1995; Raymond et al., 1992; see Figure 1). In fact, at lag 1, T2 performance
often exceeds T1 performance (Chun & Potter, 1995; Di Lollo et al.,
2005; Olivers et al., 2007; Potter, Staub, & O’Conner, 2002). This has
been a troublesome finding for most limited-capacity theories. How
can T2 be spared when only one item can pass through the bottleneck,
or when resource depletion should be maximal? So far, solutions have
been sought in additional hypotheses and psychological constructs
that allow for the two targets to be temporarily processed together
(Chun & Potter, 1995), bound to the same token (Bowman & Wyble,
2007; Chun, 1997b) before a sluggish attentional gate closes (Visser,
Zuvic, Bischof, & Di Lollo, 1999), in a single batch (Jolicoeur et al.,
2002), temporal window (Visser, Bischof, & Di Lollo, 1999), glimpse
A. Data
p(Corrrect)
1
(Chua et al., 2001), event representation (Kessler et al., 2005), or
episode (Hommel & Akyürek, 2005). We do not deny the existence
of such temporary attentional processes—in fact, many are very
similar to the temporary enhancement function within our model. The
point here is that within most of these theories, the hypotheses of what
causes lag-1 sparing do not explain the attentional blink itself, and
vice versa. As Visser, Bischof, and Di Lollo (1999) wrote, the attentional blink and lag-1 sparing are treated as independent phenomena:
T1 triggers a relatively long episode of up to 500 ms during which
resources are taken away from the stream, resulting in the blink. At
the same time, T1 also triggers a shorter period of about 200 ms,
during which the lag-1 item is assigned additional resources.
In contrast, our theory provides an integrative explanation of both
the attentional blink and lag-1 sparing, by proposing the T1-induced
attentional boost as the common cause. The post-T1 item is spared
and may even yield better performance than T1 itself, simply because
it arrives in the peak of the attentional boost. Because of the boost’s
set delay, the theory predicts that lag-1 sparing is also largely time
based, rather than item based. In other words, relative sparing should
not be confined to lag 1, as long as the later lags fall near the peak of
the attentional boost. The few studies that have looked at the issue all
suggest that the sparing is indeed essentially time based and spreads
to later lags when the RSVP is speeded up (Bowman & Wyble, 2007;
Martens, Munneke, et al., 2006; Nieuwenhuis et al., 2005). Again,
Figure 6 shows that this pattern of results is successfully reproduced
by our model.
Apparent Tradeoffs Between T1 and T2
Potter et al. (2002) systematically varied the SOA between T1 and
T2 and found evidence consistent with the idea that T1 and T2 are in
direct competition for resources. At short SOAs (⬍100 ms), T2
performance was better than T1 performance (a result indicative of
sparing), whereas at long SOAs (⬎100 ms), T2 performance was
inferior to T1 performance (as is indicative of an attentional blink; see
also Bachmann & Hommuk, 2005). However, in this study, SOA was
confounded with differential masking effects. The targets were words,
whereas the distractors surrounding the targets were always ampersands. Thus, at very short SOAs, T1 was probably much more
effectively masked by the closely following and very similar T2 than
0.75
0.75
0.5
05
0.5
0.25
B. Model
1
100 ms/item
0.25
100 ms/item
50 ms/item
50 ms/item
0
0
0
200
400
600
800
1000
0
200
400
600
800
1000
T1-T2 lag (ms)
Figure 6. T2 performance as a function of T1-T2 lag in streams with interitem stimulus onset asynchronies
(SOAs) of 50 ms, versus of 100 ms. T1 performance is shown by open markers on the y axis (lag 0). (A) Data
adapted from “The Simultaneous Type, Serial Token Model of Temporal Attention and Working Memory,” H.
Bowman & B. P. Wyble, 2007, Psychological Review, 114, Figure 19, p. 55. Copyright 2007 by the American
Psychological Association. (B) Model results.
BOOST AND BOUNCE
at long SOAs, when it was followed by a train of dissimilar distractors, whose representation was probably further weakened because of
their repetition (Kanwisher, 1987). T2, on the other hand, suffers more
from the attentional blink at later lags, when more distractors intervene. Such differential effects emerge from our model, and, as can be
seen in Figure 7, it captures the large part of the apparent tradeoff
between T1 and T2, without the necessity to assume a real tradeoff in
terms of attentional resources (see also Kawahara & Enns, in press).
has been induced. In other words, the attentional blink can be
rapidly reversed. Olivers et al. (2007; see also Kawahara, Kumada,
& Di Lollo, 2006) presented observers with the sequence . . .
TDTT. . . . As expected, an attentional blink was found for the
second of the three targets. However, the third target was almost
completely spared, even though it was presented in a temporal
position relative to the first target that would normally be severely
“blinked” (as was confirmed using . . . TDDT. . . sequences). Similar findings have been reported by Nieuwenstein, Chun, Van der
Lubbe, and Hooge (2005) and Nieuwenstein (2006). Just prior to
T2, they inserted distractors carrying the target-defining property
(referred to as cues, these distractors could, e.g., be red when
targets were also red). Again, substantial sparing occurred.
The reversal of the blink also appears problematic for the TLC
account (Di Lollo et al., 2005). TLC hypothesizes that, during the
attentional blink, the central executive control is occupied by T1,
which leaves the input filter vulnerable to exogenous disruption.
Within its current form, TLC does not allow central control to be
reinstated before T1 processing has finished. Instead, T2 might
cause an exogenous reset of the filter to start accepting targets
again (as is proposed by Kawahara, Kumada, & Di Lollo, 2006),
although it is difficult to see how these targets are recognized as
such when they are not allowed to pass the filter in the first place.
Moreover, Nieuwenstein (2006) reported strong evidence that the
attentional filter is actually fully intact during the attentional blink.
He found that when observers searched for red targets in an RSVP
stream, T2 was relatively spared when it was preceded by a red cue
but was hardly spared when it was preceded by a green cue.
However, when he asked observers to detect both red and green
targets, both red and green cues became effective, regardless of the
actual color of T2 itself. TLC cannot account for this, because
green cues should have exogenously reconfigured the input filter
to accept only green targets, not red targets.
The above results imply that, rather than reflecting the irreversible T1-induced draining of resources for 500 ms, the attentional
blink reflects the dynamic, relatively rapid, online response to
important changes in the stimulus stream. Given the speed at
which these streams run (typically at 10 items/s), the data suggest
that the system responds on a time scale of something more like
100 ms, rather than 500 ms. Boost and bounce theory readily
allows for such rapid reversals of the attentional blink following
either targets or cues. A particular target (or target-like cue) is
Spreading the Sparing
We have already mentioned that under sufficiently rapid presentation rates, sparing can spread to lag 2. But even at standard
presentation rates of around 100 ms/item, sparing is not necessarily
limited to lag 1. Di Lollo et al. (2005), as well as Olivers et al.
(2007), asked participants to identify the targets in sequential
triplets of items, like . . . TDT. . ., and . . . TTT. . . (T denoting a
target, D denoting a distractor; all embedded in a stream of
distractors). Note that the final targets in these triplets are in
exactly the same temporal position relative to the first target, and
a limited-capacity account would thus predict an attentional blink
for both these final targets. Yet performance differed remarkably:
There was a clear blink for the final target in the TDT triplet,
whereas there was no blink for any of the targets in the TTT triplet
(this relative sparing of the third target even occurred when performance was analyzed contingent upon T1 and T2 correct). In
other words, lag-2 sparing occurred. Olivers et al. (2007) showed
that with a total of four targets, sparing spreads even further, to lag
3. Work by Nieuwenstein and Potter (2006) demonstrated that
sparing may spread to five subsequent targets. These results are
highly problematic for resource-depletion theories. They predict
that performance should deteriorate, not improve, with increasing
numbers of targets. Boost and bounce theory, on the other hand,
readily explains these findings: As long as no distractor signal
arises, no inhibition is issued (the gate remains open), allowing for
multiple target sparing.
Figure 8 shows the lag-2 and lag-3 sparing found by Olivers et
al. (2007) and the model’s replication.
Rapid Reversal of the Attentional Blink
Even more problematic for limited-capacity accounts is the
finding that sparing can still occur once a proper attentional blink
A. Data
p(Corre
ect)
1
847
B. Model
1
0.75
0.75
05
0.5
05
0.5
0.25
T1
T1
0.25
T2
T2
0
0
0
50
100
150
200
250
0
50
100
150
200
250
T1-T2 lag (ms)
Figure 7. (A) T1 and T2 identification performance, as a function of T1-T2 lag, adapted from “The Time
Course of Competition for Attention: Attention Is Initially Labile,” M. C. Potter, A. Staub, & D. H. O’Conner,
2002, Journal of Experimental Psychology: Human Perception and Performance, 28, Figure 1, p. 1152.
Copyright 2002 by the American Psychological Association. (B) Model results.
OLIVERS AND MEETER
848
A. Data
1
B. Model
1
Spreaded sparing
Spreaded sparing
0.75
p(Corre
ect)
0.75
0.5
0.5
Blink reversal
1 Tgt:
2 Tgts:
3 Tgts:
3 Tgts:
4 Tgts:
0.25
0
0
100
Blink reversal
.. T1 ..
.. T1 (D..) T2 ..
blink
.. T1 T2 T3 D .. Standard0.25
.. T1 D T2 T3 ..
.. T1 T2 T3 T4 ..
0
200
300
400 0
1 Tgt:
2 Tgts:
3 Tgts:
3 Tgts:
4T
Tgts:
t
.. T1 ..
.. T1 (D..) T2 ..
.. T1 T2 T3 D ..
.. T1 D T2 T3 ..
.. T1 T2 T3 T4 ..
100
200
Standard blink
300
400
T1-T2 lag (ms)
Figure 8. (A) Target identification performance in an attentional blink paradigm in which either one, two,
three, or four targets were presented, as adapted from “Spreading the Sparing: Against a Limited-Capacity
Account of the Attentional Blink,” C. N. L. Olivers, S. Van der Stigchel, & J. Hulleman, 2007, Psychological
Research, 71, Figure 3, p. 132. Copyright 2007 by Springer-Verlag. Accuracy is shown for each possible
temporal position, with T1 appearing at temporal position 0; T2 appearing at temporal positions 1, 2, or 3 (at 100,
200, or 300 ms, respectively); T3 at temporal positions 2 or 3 (200 or 300 ms, respective); and T4 at temporal
position 3 (300 ms). The data show that sparing spreads to lags 2 and 3 and that the blink can be rapidly (i.e.,
within a single lag) reversed. (B) Model results.
blinked because the system is still in a strong inhibitory state
following a distractor. Nevertheless, blinked targets or cues do not
go completely unnoticed by the system and can already start
eliciting excitatory feedback. The next item, when it is another
target, benefits from this, resulting in an increased sparing. If the
next item turns out to be a distractor instead, substitution errors
may occur (Chun, 1997a; Isaak et al., 1999). As shown in Figure 8,
the blink’s reversal is indeed found in our simulation of the Olivers
et al. (2007) paradigm. In this simulation, we set dcatsame equal to
dcatdif (0.7) to model the fact that Olivers et al. had equalized
masking across targets and distractors. As shown in Figure 9, the
cueing data of Nieuwenstein et al. (2005) are, to a large extent, also
captured. Here, it is assumed that a distractor carrying the target-
p(Corre
ect)
1
defining property is treated as neither a distractor nor a target but
as an item with neutral relevance.
The Importance of the Distractors
As mentioned earlier, to explain lag-1 sparing, some have suggested that whatever happens to the lag-1 item is independent of
what causes the attentional blink. However, this is directly at odds
with the evidence that the lag-1 item, when it is a distractor, is of
crucial importance in inducing an attentional blink in the first
place. In the previous section, we discussed the finding that . . .
TDT . . . sequences generate a blink for the final target, whereas
. . . TTT . . . sequences do not (Di Lollo et al., 2005; Olivers et al.,
A. Data
1
0.75
B. Model
0.75
Uncued
T2-1 Cue
T2-2 Cue
Cue
T2 3 C
T2-3
0.5
Uncued
T2-1 Cue
T2-2 Cue
T2 3 C
T2-3
Cue
0.5
lag 4
lag 10
lag 4
lag 10
T1-T2 lag
Figure 9. (A) T2 identification performance for red targets among black distractors as a function of T1-T2 lag
and as a function of the number of preceding distractor items with the same color as the targets (i.e., the number
of red cues), as adapted from “Delayed attentional engagement in the attentional blink,” M. R. Nieuwenstein,
M. M. Chun, R. H. J. v. d. Lubbe, & I. T. C. Hooge, 2005, Journal of Experimental Psychology: Human
Perception and Performance, 31, Figure 2, p. 1466. Copyright 2005 by the American Psychological Association.
(B) Model results.
BOOST AND BOUNCE
2007). The only difference between these conditions is the presence of a distractor between the targets. Furthermore, the attentional blink is strongly reduced when the distractor following T1 is
replaced with a blank (Breitmeyer, Ehrenstein, Pritchard, Hiscock,
& Crisan, 1999; Chun & Potter, 1995; Grandison et al., 1997;
Raymond et al., 1992; Seiffert & Di Lollo, 1997), or when the
distractor following T1 is weakened because of inhibition or
repetition blindness (Drew & Shapiro, 2006; Dux et al., 2006).
There are also indications that the post-T1 distractor receives
relatively high levels of processing compared with other items in
the stream. For example, Chua et al. (2001) found that only
distractors presented at lag 1 could semantically prime a subsequent T2. As mentioned earlier, instead of T1, observers often
report the immediately following distractor (provided the type of
task allows for such errors to be made; Chun, 1997a; Isaak et al.,
1999; Raymond et al., 1992). At the same time, responses to a
target at the very end of the stream have been found to be inhibited
when it matches the post-T1 distractor (Loach & Marı́-Beffa,
2003). This suggests that the post-T1 distractor may first be
enhanced and then be suppressed.
Boost and bounce theory readily predicts all these findings. Within
the theory, the post-T1 distractor is essential to observing a blink
exactly because it induces the inhibition that causes the blink. Removing or weakening the post-T1 distractor removes or reduces the
inhibition and, thus, the blink. At the same time, the post-T1 distractor
itself is so heavily boosted that it may act as a strong subconscious
prime or even break through the gate to consciousness.
Figure 10 shows the predictions from our model for the case in
which the lag-1 distractor is removed, compared with data from Chun
and Potter (1995). The model makes the additional assumption that a
gap in a continuous stream is slightly disruptive (i.e., it is a salient but
irrelevant event) that triggers a weak inhibitory response (at 1/10 of
the normal inhibitory response). Without this assumption, the pattern
looks the same, but the distractor after the gap induces a deeper
attentional blink. Figure 11 shows how the attentional blink is reduced
when the post-T1 distractor is weakened (Drew & Shapiro, 2006; Dux
et al., 2006). Figure 5 shows how the post-T1 distractor is first
enhanced and then suppressed but, nevertheless, generates substantial
activity, compared with other distractors.2
Finally, the post-T2 distractor also plays an important role in the
model. When the post-T2 distractor is removed, sensory activation
associated with T2’s identity lingers (as it is no longer masked).
Excitatory feedback elicited by T2 reaches its maximum while this
sensory activity is still around, whereas normally, it enhances the
post-T2 distractor more than T2. T2 is therefore perceived at a
delay, consistent with behavioral and neurophysiological data
(Dell’Acqua, Pascali, Jolicoeur, & Sessa, 2003; Giesbrecht & Di
Lollo, 1998; Jolicoeur & Dell’Acqua, 1998; Vogel & Luck, 2002;
Zuvic, Visser, & Di Lollo, 2000).
Lien, 1997; Maki, Bussard, Lopez, & Digby, 2003; Maki & Padmanabhan, 1994; McAuliffe & Knowlton, 2000; Olivers & Watson,
2006; Raymond et al., 1995; Visser, Bischof, & Di Lollo, 2004).
Although it has been difficult to separate out the exact contributions,
similarity effects appear to occur on the level of visual features
(probably involving masking), as well as on conceptual/semantic
levels. Furthermore, manipulations of target– distractor similarity often also involved variations in similarity within the distractor set itself.
Finally, it is often difficult to assess whether similarity affects overall
performance or specifically affects the attentional blink (i.e., interacts with
lag). This is because potential ceiling effects allow less room for improvement at later lags than at the usually more affected shorter lags.
Within our model, increased similarity between items (within or
between categories) reduces the sensory input signal to gate neurons and, thus, both the strength of excitatory and inhibitory
feedback elicited by targets and distractors, respectively. Furthermore, as targets and distractors become less distinguishable, there
is an increased necessity for top-down gating to extract the relevant items from the stream. The net result is a deeper attentional
blink. Figure 12 shows simulated attentional blinks for a range of
within- and between-category similarities (parameters dcatsame and
dcatdif, respectively).
A New Prediction: The Attentional Blink Can Be
Postponed
Boost and bounce theory also makes new predictions. One
prediction is that the attentional blink is not time-locked to T1 but
to the first post-T1 item that does not resemble a target (and that
therefore induces strong inhibition). The prediction then is that
inserting post-T1 distractors that do not induce strong inhibition
will lead to a postponement of the attentional blink until a distractor is encountered that does. We did this by making the post-T1
distractors carry the target-defining feature, as is illustrated in
Figure 13a. We started from an attentional blink task in which
observers were asked to report the two red letters in a stream of
black digits. We assume that observers set up their gating system
(or attentional set) such that red items led to excitatory feedback,
whereas black items led to inhibitory feedback. In the standard
condition, the red T1 was usually followed by a black distractor
(except at lag 1), and a standard attentional blink pattern was
expected. The interesting case is the T1 ⫹ 1 red condition, in
which the distractor immediately following T1 was also red. Our
model predicts that, simply because it carries a target feature, this
distractor will not induce the inhibition invoked by the standard
black distractor. In fact, the target resembling distractor is treated
as neutral (neither target nor distractor) by the model, and the
excitatory feedback invoked by T1 continues unhindered. The
2
Similarity Effects
Not only the presence but also the nature of the distractors is
important. A number of studies have shown an aggravated attentional
blink when either the pre-target distractors, the post-target distractors,
or all the distractors in the stream, are similar to T1, T2, or both (Chun
& Potter, 1995; Dux & Coltheart, 2005; Ghorashi, Zuvic, Visser, &
Di Lollo, 2003; Giesbrecht, Bischof, & Kingstone, 2003; Isaak et al.,
1999; Kawahara, Enns, & Di Lollo, 2006; Maki, Couture, Frigen, &
849
It is interesting to note that Seiffert and Di Lollo (1997) found a clear
attentional blink, even when the lag-1 distractor was removed, as long as
T1 itself was degraded by a simultaneous mask. This appears at odds with
our model, which assigns a central role to the post-T1 distractor. However,
if we assume that degrading T1 results in slower processing of T1, then the
triggering of excitatory feedback is delayed too. In other words, instead of
the lag-1 item, the lag-2 item now receives the bulk of the processing. If
this item is a distractor, it will result in a blink for subsequent items. Thus,
a full but delayed attentional blink is predicted, as was exactly found by
Seiffert and Di Lollo (1997).
OLIVERS AND MEETER
850
A. Data
p(Corre
ect)
1
0.75
0.75
0.5
0.5
0.25
B. Model
1
0.25
Standard
Standard
Gap at lag 1
0
0
200
400
Gap at lag 1
0
800 0
600
200
400
600
800
T1-T2 lag (ms)
Figure 10. (A) T2 identification performance as a function of T1-T2 lag either in a standard attentional blink
task or when a gap is inserted at lag 1 (in other words, the lag 1 distractor is removed), as adapted from “A
Two-Stage Model for Multiple Detection in Rapid Serial Visual Presentation,” M. M. Chun & M. C. Potter,
1995, Journal of Experimental Psychology: Human Perception and Performance, 21, Figure 4, p. 114.
Copyright 1995 by the American Psychological Association. (B) Model results.
model thus predicts that the attentional blink will not be fully
induced until the next black item appears, which, in this case, is the
T1 ⫹ 2 distractor. In other words, the entire attentional blink
function should shift later in time by one lag. Similarly, in the
T1 ⫹ 2 red condition, we inserted two red distractors after T1. The
model now predicts that the same attentional blink function should
shift even further back. In sum, the attentional blink is postponed.
Figure 13b shows this prediction.
Limited-capacity theories predict something rather different.
First of all, no postponement is predicted, simply because the
attentional blink is time-locked to T1. After all, it is the secondstage processing of T1 that is assumed to be the major culprit
behind the attentional blink. Second, such theories have often
assumed that T1 induces a blink because it is masked by a
distractor. If anything, then, making the post-T1 distractor similar
to T1 (by giving it the same color) should make it more difficult to
maintain the same level of T1 performance, resulting in a deeper
blink for T2. In a similar vein, Shapiro and Raymond’s (1994)
interference theory, with its emphasis on similarity, would predict
that T2 suffers more from the red distractors than from the black
distractors, simply because the red distractors match the target
Method
Participants. Eighteen participants (9 male) were included in
the study, ranging in age from 18 to 33 years (M ⫽ 22.3 years). They
were paid €7 (about $11) per hour. One more participant was run, but
she showed no standard attentional blink (performance consistently at
100%, except for one lag at 95%) and was therefore excluded. All
other participants showed an attentional blink effect (maximum T2
performance ⫺ minimum T2 performance) of at least 10%.
A. Data
1
B. Model
1
0.75
p(Corre
ect)
template better and are therefore stronger competitors for selection. So again, a deeper blink, but no postponement, is predicted.
Finally, similar to the boost and bounce model, TLC theory might
predict that the input filter remains intact as long as items carrying
the target feature enter the system, and an attentional blink is only
induced when the filter is disrupted. However, unlike our model,
TLC predicts the attentional blink to remain time-locked to T1,
because it assumes that it is T1 processing that prevents the central
executive from maintaining the input filter. As soon as the central
executive is available again, the input filter should be reinstantiated, regardless of the type of intervening distractors.
0.75
0.5
0.5
0.25
0.25
Standard
Standard
Repeated distractor
Repeated distractor
0
0
0
200
400
600
800
0
200
400
600
800
T1-T2 lag (ms)
Figure 11. (A) T2 identification performance as a function of T1-T2 lag for two conditions: One in which the
first post-T1 distractor has the same identity as the distractor immediately preceding T1 (“repeated”) and one in
which it has a different identity (“nonrepeated”; equivalent to a standard blink paradigm). Data adapted from “On
the Fate of Distractor Stimuli in Rapid Serial Visual Presentation,” P. E. Dux, V. Coltheart, & I. Harris, 2006,
Cognition, 99, 355–382. Copyright 2006 by Elsevier. (B) Model results.
BOOST AND BOUNCE
A. Data
1
B. Model
1
0.75
0.75
p(Corre
ect)
851
05
0.5
05
0.5
0.25
0
0
200
Similarity:
Low (false font)
0.25
Medium (string)
High (nonword)
0
400
600
800 0
Similarity:
Low (0.2; 5)
Medium (0.5; 2)
High (0.8; 1.25)
200
400
600
800
T1-T2 lag (ms)
Figure 12. (A) T2 identification performance as a function of T1-T2 lag and as a function of target-distractor
similarity, as adapted from “Sources of the Attentional Blink During Rapid Serial Visual Presentation:
Perceptual Interference and Retrieval Competition,” W. S. Maki, T. Couture, K. Frigen, & D. Lien, 1997,
Journal of Experimental Psychology: Human Perception and Performance, 23, 1393–1411. Copyright 1997 by
the American Psychological Association. (B) Model results, using different values of dcatdif and dcatsame,
respectively (which, in our model, stands for the distinctiveness of items between and within categories). The
open symbols on the y axis signify T1 accuracy.
Stimulus, design, and procedure. Stimulus generation and response recording were done using E-Prime (Psychology Software
Tools, Pittsburgh, PA). Backgrounds were gray (40 cd/m2). After a
1000-ms blank period, a 0.5 ⫻ 0.5° black fixation cross was
presented for 1000 ms in the center of the display and was
subsequently replaced by a rapid serial presentation of 24
characters, most of which were black digits, presented in Cou-
rier New (approximately 0.8 ⫻ 0.8° in size). Digits were
randomly drawn from the set 2–9, with the restriction that no
two consecutive digits could be the same. Each item was
presented for 67 ms, followed by a 25-ms blank (SOA ⫽ 92
ms). At position 8 –10 in the stream, a first red letter (T1)
replaced one of the black digits. At various lags (1–12), a
second red letter (T2) replaced another black digit. The partic-
A. Experimental conditions
B. Model predictions
1
T1
5
2
+
9
K
7
2
3
F
8
9
5
2
+
9
K
7
2
3
F
8
9
T1+2 red
9
K
7
2
3
F
8
9
p((Correct)
T2
T1+1 red
5
2
+
0.75
0.5
standard
T1+1 red
T1+2 red
0.25
0
1
2
3
4
5
6
7
8
9 10 11 12
C. Data
1.00
p(Co
orrect)
Standard
0.90
0.80
standard
1 red
2 red
ti
time
0 70
0.70
0
1
2
3
4
5
6
7
8
9 10 11 12
T1-T2 lag
Figure 13. (A) Examples of the three conditions in the new experiment. In the Standard condition, targets were
red (here drawn in gray) letters among black digit distractors. In the T1 ⫹ 1 red condition, the distractor
immediately following T1 was also red. In the T1 ⫹ 2 red condition, the two distractors following T1 were red.
(B) The boost and bounce model’s predictions: The attentional blink is postponed. (C) Average accuracy data
from 18 participants for T2 as a function of T1-T2 lag.
852
OLIVERS AND MEETER
ipant’s task was to report the red letters at the end of the trial,
unspeeded. The letters I, O, Q, and S were excluded. The
important manipulation involved the nature of the digit distractors between T1 and T2. In the standard condition, these were
all black. In the T1 ⫹ 1 red condition, the digit immediately
following T1 was red. In the T1 ⫹ 2 red condition, the two
distractors following T1 were both red. In the standard condition, T2 would follow at lag 1, 2, 3, 4, 5, 6, 8, or 10. In the T1 ⫹
1 red condition, T2 could never appear at lag 1 (as this lag was
occupied by the red distractor), and thus, lags 2, 3, 4, 5, 6, 7, 9,
and 11 were used instead. Similarly, in the T1 ⫹ 2 red condition, lags were 3, 4, 5, 6, 7, 8, 10, and 12. All conditions were
randomly mixed in six blocks of 96 trials each. The first block
was regarded as practice. In all, this resulted in 20 trials per cell.
The experiment lasted a little more than 1 hr.
Results and Discussion
Mean percentages correct for T1 and T2 (contingent upon T1
correct) were calculated, with order reversals counted as correct.
Overall, T1 performance was high at 94%, with little difference
across conditions or lag, with the exception that T1 performance
dropped to 80% when it was immediately followed by T2 (as was
the case only for lag 1 in the standard condition). For other lags
and conditions, there were no significant effects, although there
was a trend toward deteriorated performance when T1 was followed by red distractors, F(1, 17) ⫽ 3.45, MSE ⫽ 0.002, p ⫽ .081
(all other ps ⬎ 0.10). This is to be expected under the assumption
that similar distractors act as somewhat stronger masks for T1.
Figure 13c shows T2 performance contingent upon T1 correct.
In line with the model’s predictions, the pattern of results suggests
a clear postponement of the attentional blink with increasing
number of post-T1 distractors. In the standard condition, sparing
occurred for lag 1, whereas the trough of the blink fell at lag 2. In
the T1 ⫹ 1 red condition, relative sparing occurred at lag 2,
whereas the trough occurred at lag 3. In the T1 ⫹ 2 red condition,
sparing occurred at lag 3, whereas the trough occurred at lag 5.
Because not all lags were the same for all distractor conditions, a
single analysis of variance involving the full experimental design
was not possible. However, the pattern of results in Figure 13c was
statistically confirmed when taking only the common lags into
account. An analysis of variance with distractor condition (standard, T1 ⫹ 1 red, T1 ⫹ 2 red) and the four lags they had in
common (lags 3, 4, 5, 6) as factors revealed a significant Distractor
Condition ⫻ Lag interaction, F(6, 102) ⫽ 5.16, p ⬍ .001. The
same was true when comparing only the standard to the T1 ⫹ 1 red
condition on all common lags (2, 3, 4, 5, 6), F(4, 68) ⫽ 8.52, p ⬍
.001, and when comparing the T1 ⫹ 2 red with the T1 ⫹ 1 red
condition on all common lags (3, 4, 5, 6, 7), F(4, 68) ⫽ 6.44, p ⬍
.001. Finally, the interaction also held when comparing the T1 ⫹
2 red condition with the standard condition on common lags (3, 4,
5, 6, 8, 10), F(5, 85) ⫽ 3.82, p ⬍ .01. In this latter comparison,
there was also a main effect of distractor condition, F(1, 17) ⫽
21.0, p ⬍ .001, indicating that performance was overall worse
when T1 was followed by two red distractors.
The data largely confirm the model’s predictions. When the
post-T1 distractors are at least partly relevant to the gating system,
the inhibition is delayed, leading to a postponed attentional blink.
As we stated earlier, to qualitatively capture as many data patterns
as possible with a limited set of parameters, we kept parameter
values constant for all simulations. With these parameter values,
there are some differences between the model’s performance and
human performance. First, the model shows an overall deeper
blink than did human observers.3 Second, the model shows relatively stronger sparing at the first lag, following the last red
distractor. Third, the data show overall subdued (i.e., flattened)
attentional blink patterns for the T1 ⫹ 1 and T1 ⫹ 2 red conditions,
whereas this is not the case for the model. All these discrepancies
are more due to quantitative than qualitative differences, to the
extent that the model assigns relatively more weight to color than
humans might, resulting in stronger color-based modulation.
In any case, limited-capacity theories have difficulty explaining
the postponement of the attentional blink. Interference theory
(Shapiro & Raymond, 1994) predicts stronger interference and,
hence, a deeper blink from more similar distractors—in this case,
the red ones. Two-stage theory sees T1 as the cause of the blink
and therefore fails to explain its postponement. If anything, it
predicts that red distractors are stronger T1 masks (there was
indeed a trend in our data) and, hence, should result in more
resources being allocated to T1 at the expense of T2. As mentioned, TLC also predicts a data pattern time-locked to T1,
whereas the data suggest that the entire blink curve is shifted
backward in time. Finally, it is worth looking at order reversals. To
explain the relative sparing at lags 2 and 3 in the red distractor
conditions, Bowman and Wyble’s (2007) simultaneous type serial
token (ST2) model (as discussed later) might predict that the
spared item and T1 are bound to the same token, leading to
potential order reversals. However, although our data show substantial order reversals when T2 is spared at lag 1 (23%, in the
standard condition), there was no sign of increased order reversals
in the lag 2 and lag 3 sparing conditions (5% in the T1 ⫹ 1 red and
3% in the T1 ⫹ 2 red conditions, relative to 4% for black
distractors at the same lags). Thus, sparing does not necessarily go
together with order reversals.
General Discussion
We have presented a relatively straightforward theory that explains a considerable number of findings, some of which appeared
contradictory at the start. It proposes that target stimuli lead to
transient attentional enhancement, whereas distractor stimuli lead
to transient attentional suppression. This suppression is particularly strong when distractors are first enhanced, as occurs during
the attentional blink.
All of the individual components of our theory have been proposed
before in the literature on visual attention. What is new about the
theory, however, is the combination of these components and how
they interact, allowing for an integrated account of how selection
takes place in time. The important novel theoretical step is that
selection from rapid visual streams can be accounted for by the
interplay of two identical feedback functions, both rapidly rising and
more slowly decaying but opposite in sign. As a consequence, and as
3
We fully replicated the experiment with a more difficult set of targets
and distractors. Overall, the blink was much deeper, and sparing was
reduced; but again, the attentional blink was clearly postponed by red
distractors.
BOOST AND BOUNCE
a crucial departure from existing theories, there is no central role for
limited-capacity resources in our theory. According to Boost and
Bounce theory, the biggest piece of evidence for such limitations—the
attentional blink— does actually not reflect an extended failure of
attention but, rather, its intact, responsive, and rapid operation, modulating the input within about 100 ms. In fact, the attentional blink is
caused by too strong an attentional response, rather than by too weak
an attentional response. The same strong response accounts for transient attention effects in the cueing studies. Thus, the theory integrates
classic transient attention findings with the classic phenomenon of the
attentional blink with a relatively limited set of assumptions. It explains numerous associated findings, such as lag-1 sparing, sparing at
later lags; the time-based, rather than item-based, nature of the phenomena; apparent trade-offs between T1 and T2; the importance of
the distractors; variations in the processing of distractors; similarity
effects; and rapid reversals of the blink. By regarding the attentional
blink as the direct counterpart of the transient attentional episode, the
theory does away with what seemed to be ad hoc explanations of
some of these findings. Finally, the theory’s computational implementation successfully captures all these effects with a minimal set of
parameters with fixed values and generates new predictions. In short,
the theory can explain more data with fewer assumptions than most
other theories.
There are now a number of theories and computational models of
temporal attention, some of which, at the surface, bear similarity to
ours. Most of these are theories of the attentional blink. Here we
discuss a few of them in detail, whereas others have already been
extensively discussed and dismissed by Bowman and Wyble (2007).
Relation to Other Models and Theories: Temporary
Suppression Models
As mentioned, boost and bounce theory comes closest to the
temporary suppression account originally proposed by Raymond et al
(1992). They proposed that T1 initiates an attentional episode that
includes the post-T1 distractor. This post-T1 distractor then initiates a
period of suppression, during which subsequent items are missed.
However, the accounts also differ in important respects:
1.
As mentioned earlier, Raymond et al. (1992) hypothesized that the inhibition operates at an early visual stage
of processing. To account for the high level representations of blinked items, boost and bounce theory proposes
that the inhibition is initiated late, at the level of working
memory entrance. However, inhibition does trickle down
the representational hierarchy to lower-level representations, affecting the processing of later items.
2.
Whereas Raymond et al.’s (1992) account assumed the
inhibition to be ballistic and nonadaptive to changes in
the stream, in boost and bounce theory, the gating is
dynamic and flexible. This allows for the attentional
blink to be postponed (see the new experiment here) or
rapidly reversed (Di Lollo et al., 2005; Olivers et al.,
2007; Kawahara, Kumada, & Di Lollo, 2006; Nieuwenstein et al., 2005).
3.
Raymond et al. (1992) proposed that the inhibition is necessary to protect T1 from conjunction errors. As such, it
853
predicts that the blink is time-locked to the T1-versus-firstdistractor conflict. In contrast, within boost and bounce
theory, T1 does not need to be protected. The theory does
not assume that the post-T1 item must be inhibited for T1 to
be reported correctly. The inhibition is not triggered by the
potential for conjunction errors, conflict, or competition
between T1 and the post-T1 item (or between T1 and T2 for
that matter) and, thus, is not time-locked to T1. Instead, the
inhibition is triggered by a mismatch between a strong
incoming signal and the attentional set for the target (see
also Olivers & Watson, 2006).
The idea that T1 requires some form of protection has recently
returned in a number of verbal and computational models. For
example, Hommel et al. (2006) proposed that the post-T1 inhibition is a direct consequence of the attentional network being able
to process only one object at a time: The network “silences itself”
in an attempt to preserve T1. The idea of a (potential for) conflict
between T1 and the post-T1 distractor also returns in Battye’s
(2006) account of the blink. As we propose here, his model
assumes that T1 triggers a temporary enhancement of the post-T1
distractors, causing maximum conflict in the output layers of the
network. This conflict is registered and results in stronger lateral
inhibition between the output units. T2, when it arrives, suffers
from this lateral inhibition, resulting in an attentional blink. As the
conflict is being resolved, the blink subsides again. As with Raymond et al.’s (1992) original proposal, one problem with these
models is that the inhibition is ballistic and irreversible, contrary to
what the data suggest (Olivers et al., 2007; the current experiment).
What is more, the idea that higher structures cannot deal with more
than one item at a time is highly reminiscent of resource-depletion
theories and, thus, suffers from the arguments against these theories we made earlier.
ST2
A number of other computational models of the attentional blink
feature an enhanced response following T1, and many also feature
inhibition (Bowman & Wyble, 2007; Chartier et al., 2004;
Dehaene et al., 2003; Fragopanagos et al., 2005; Nieuwenhuis et
al., 2005). However, in all these models, the enhanced T1 representation either directly inhibits the T2 representation (as a type of
lateral inhibition), or it inhibits a type of resource (e.g., working
memory capacity or attentional enhancement) that is then not
available to T2. These models are therefore actually straightforward limited-capacity models, suffering from the same arguments
we made earlier.
Probably the most sophisticated of these models is the ST2
model of Bowman and Wyble (2007). The ST2 model is a
neural network combining Chun and Potter’s (1995) two-stage
theory with Kanwisher’s (1987) type/token distinction. Types
are general, context-free representations of objects (such as letters
or digits), which may be activated in parallel, without capacity
limitations. In contrast, tokens are specific, episodic instances of
an object (e.g., the digit “6” presented about midway in the
stream). For conscious report, the type representations need to be
bound to tokens. This binding occurs in a binding pool. It is
important to note that, according to Bowman and Wyble (2007),
this tokenization takes time and occurs serially: In principle, only
OLIVERS AND MEETER
854
one token at a time can be bound to a type. Furthermore, the
tokenization requires a blaster. The blaster is triggered by, and
causes attentional enhancement of, T1, so that it can be bound to
a token. The blaster also spills over to the next item, so that
occasionally, both items may be bound to the same token. This
explains lag-1 sparing, and, because temporal information is lost
within a token, it also explains the order reversals often found to
accompany lag-1 sparing. The attentional blink is explained by
assuming that the system seeks to prevent false bindings of
post-T1 items to the same token. Therefore, the tokenization
process must be temporarily halted. This is done by inhibiting the
blaster for a period of several hundreds of milliseconds, until the
tokenization of T1 is completed. With a switched-off blaster,
subsequent items no longer receive the attentional enhancement
necessary for tokenization (and, thus, for conscious report), resulting in an attentional blink. With this architecture, ST2 can explain
an impressive range of data (Bowman & Wyble, 2007).
There are obvious similarities between ST2 and boost and bounce
theory, most notably the presence of both excitatory and inhibitory
components. However, there are also fundamental differences:
1.
The ST 2 model is, in essence, a hybrid limitedcapacity/T1 protection model similar to the ones we
discussed above: Tokenization capacity is limited, and to
protect the system from false T1 token bindings, subsequent processing is inhibited. Boost and bounce theory
argues that there is no need to assume a role for such
limitations or protection mechanisms in explaining the
attentional blink. A flexible but slightly delayed gating
mechanism is sufficient.
2.
Within the ST2 model, the inhibition is triggered by T1
and is ballistic. It therefore cannot easily explain the
spread sparing. The ST2 model may be stretched so that
the tokenization process can include more than two successive targets. To account for the Olivers et al. data
(2007), Bowman and Wyble (2007) suggested that the
blaster may rapidly refire when incoming information
remains relevant. As a result, a continuous sequence of
targets can be encoded together into working memory
(Bowman, Wyble, Chennu, & Craston, 2008; Wyble,
Bowman, & Nieuwenstein, in press). This way, the theory might also account for the postponement of the blink,
as found in the present experiment. However, it would
have to assume that the post-T1 distractors are also
tokenized (as they did not inhibit the blaster).
3.
It appears to us that even an ongoing blaster within ST2
could not easily account for rapid reversals of the attentional blink. Here a full blink for a second target was first
induced (suggesting that the inhibition must therefore
have been triggered), but then a third target escaped the
attentional blink when immediately following the second.
This goes against the irreversible inhibition of a blaster.
Now not only continuous firing but also rapid resumption
of the blaster (once it is inhibited) would have to be
assumed. The big question regarding Points 2 and 3, then,
is why ST2 would allow for such rapid lifting of the
inhibition when it deems the inhibition necessary to pro-
tect T1 in the first place. It appears then that the inhibition protects against something that is not really a threat.
In contrast, our boost and bounce model has little trouble
with reversing the blink. It assumes that the gate to
working memory responds dynamically and adaptively to
the incoming sensory information, closing when distractors enter, opening when targets enter.
The Locus Coeruleus Norepinephrine Model
Another model we discuss here is the locus coeruleus norepinephrine (LC-NE) model proposed by Nieuwenhuis et al. (2005).
According to this neurocomputational model, the attentional blink
is the consequence of the dynamics of the LC, which is the brain
stem nucleus responsible for the release of NE in the neocortex.
The NE release is believed to have an attention-enhancing effect
and, according to Nieuwenhuis et al., is a necessary condition for
target awareness. It is interesting to note that LC activity peaks
around 100 ms after target onset and, because of auto-inhibition, is
then followed by a refractory period of between 200 and 400 ms,
during which it cannot fire again (at least in monkeys—little
is known about the exact LC dynamics in humans; Aston-Jones,
Rajkowski, & Cohen, 2000). According to Nieuwenhuis et al.,
lag-1 sparing corresponds directly to the peak, whereas the attentional blink corresponds directly to the refractory period. The
LC-NE model differs from the boost and bounce model in that the
former is, in essence, a limited-capacity resource model. As in
the classic attentional blink models, T1 uses up a vital resource that it
requires for detection (namely, LC firing and NE release). This
resource is then temporarily unavailable to subsequent targets (because of the refractory period). The problem with this is that the
LC-NE model cannot explain why the attentional blink is so dependent on the distractors intervening T1 and T2. This is because the LC
is triggered by T1, and thus, a refractory period follows, regardless of
the presence or nature of the subsequent items. In a similar vein,
because it operates in a ballistic fashion, the LC-NE model cannot
deal with the sparing of multiple targets and the attentional blink
reversal effects mentioned earlier (Di Lollo et al., 2005; Olivers et al.,
2007). Nor can it explain the postponement of the blink in the
currently presented experiment. Nevertheless, the LC-NE system remains a strong contender for contributing to the transient attentional
enhancement effects that are at the core of our model. It is possible
that the NE release further modulates the gating of relevant and
irrelevant sensory input, thus aggravating the attentional blink (see
also Olivers, 2007, for the idea that the LC might be an engine behind
transient attentional enhancement).
Gating Theory
The idea that input is filtered, or gated, by working memory has
been proposed by many but computationally implemented by few.
One of the most prominent mathematical descriptions of how this
takes place as a function of time is the gating theory developed by
Sperling and colleagues (Reeves & Sperling, 1986; Shih & Sperling, 2002; Sperling & Reeves, 1980; Sperling & Weichselgartner,
1995). According to this theory, the occurrence of a relevant event
(e.g., a cue to switch streams or tasks) induces a transition from
one discreet attentional state to another. The time course of this
transition is described by a temporal transition function, which, on
BOOST AND BOUNCE
the basis of Sperling and colleagues’ RSVP work, was hypothesized to be gamma-shaped (see Figure 2c). The idea of gating, as
well as the rapidly rising and slowly decaying signal (with the
shape of a gamma distribution), are at the heart of our model.
However, this is largely where the similarities end:
1.
Boost and bounce model employs an excitatory as well as
an inhibitory gating function. Gating theory mentions
only attentional enhancement.
2.
Gating theory was designed to describe the time course
that accompanies shifts or transitions in attention, when
either a spatial or a task switch is required. As such, it
cannot account for the attentional blink, which also occurs when no such switches are necessary. In contrast, the
core of boost and bounce theory is designed to account
for pure temporal attention phenomena, focusing on situations in which all items appear in the same location and
belong to the same task, with the possibility to plug in
spatial and task-switch modules (as we have provisionally done in the current implementation).
Recently, to account for the attentional blink, Shih (2008) extended gating theory with a working memory consolidation process that refuses to accept further input while it is busy dealing
with a target. Thus, this model, too, is a straightforward limitedcapacity resource model, facing the same problems with explaining spread sparing and blink reversals that we saw before. For
example, to explain Nieuwenstein and Potter’s (2006) whole report
benefits, Shih (p. 5) merely assumed a “specific processing strategy,” without further explanation.
No Capacity Limitations
The most important difference with existing theories is that
boost and bounce theory does away with limited-capacity resources and bottlenecks as core explanations of some classic
temporal attention phenomena. Instead, our theory sees the time
course of attention as the direct result of the close to (but not quite)
optimal operation of run-of-the-mill selection mechanisms that
would serve the system just fine under natural circumstances—
selection mechanisms that also operate in numerous other attention
tasks, such as visual search, partial report, or Stroop-like tasks.
Relevant information leads to enhancement, whereas irrelevant
information leads to inhibition, all within roughly 100 ms. It is
simply that in the artificial case of the RSVP paradigm, these
enhancement and inhibition mechanisms are a fraction too late,
thus exerting their effects on the next object. Again, we do not
deny the existence of capacity limitations. In fact, they play some
role in our own theory, when we assume that working memory
capacity is limited to about four items (or a little more, when
allowing for mechanisms like chunking). This limitation predominantly plays up when more than the maximum number of items
enter working memory (e.g., Nieuwenstein & Potter, 2006; Reeves
& Sperling, 1986), working memory is already filled by some
other task (e.g., Akyürek & Hommel, 2005; Nieuwenstein, Johnson, Kanai, & Martens, 2007), or items in working memory need
to be reported in a certain order (Reeves & Sperling, 1986).
855
Psychological Refractory Period
Capacity limitations or bottlenecks also play an important role
in theories of the psychological refractory period (PRP; Pashler,
1984; Welford, 1952). The PRP is the temporary slowing of
responses to a second target when that target is presented shortly
after a first target. Although the PRP has been linked to the
attentional blink (e.g., Jolicoeur, 1998), we believe there are important methodological differences that would allow for two independent effects. For one, within the PRP paradigm, observers
usually need to respond immediately and as quickly as possible to
both targets. Nevertheless, observers are typically fully aware of
both targets. This leaves open the possibility that the PRP is the
consequence of competition on the response level (reflecting a true
bottleneck), whereas in our view, the attentional blink reflects the
active gating of working memory access, even before this bottleneck can come into play (as the prerequisites for generating a
response are not even there). Second, PRP experiments typically
involve a clear task switch and, often, a sensory modality switch.
Such switches may be regarded as a contamination of the attentional blink (as the blink also occurs perfectly without taskswitching components; Chun & Potter, 2001; Enns, Visser, Kawahara, & Di Lollo, 2001; Potter, Chun, Banks, & Muckenhoupt,
1998). Third, the PRP paradigm is usually devoid of distractors,
whereas it has been shown that distractors play an important role
in the attentional blink paradigm. In all, then, the PRP appears to
reflect processes that occur after selection, whereas in the present
article, we are interested in the time course of selection itself (and
we argue that the attentional blink reflects just that). Marois and
Ivanoff (2005) have mentioned a few more grounds on which the
PRP and the attentional blink might be dissociated. Naturally,
however, our model could be extended with components important
for explaining PRP phenomena.
But What About Previously Reported Resource-Depletion
Effects in the Attentional Blink Paradigm?
Limited-capacity theories predict that T1 and T2 act like two
communicating vessels: The more resources T1 demands, the
fewer are left for T2. Surely there is evidence for that? Well, in our
view, the evidence is actually rather thin. A number of studies have
indeed found that more difficult T1s lead to deeper blinks for T2
(Grandison et al., 1997; Jolicoeur, 1998; Ouimet & Jolicoeur,
2007; Seiffert & Di Lollo, 1997; Shore, McLaughlin, & Klein,
2001; Visser, 2007). However, as also pointed out by McLaughlin,
Shore, and Klein (2001), many of the experimental procedures in
these studies involved a task switch from T1 to T2 (and some even
involved a location switch). Switch costs may be aggravated by a
difficult T1 task, whereas the attentional blink itself is not. Some
experiments required participants to respond immediately to T1,
which, together with the task switch, turns the procedure into a
PRP paradigm, with possible bottlenecks at the response-selection
level (see the previous section). Moreover, others have changed
the post-T1 mask as a way to make T1 more difficult to identify.
However, also in boost and bounce theory, a stronger mask is
predicted to result in a stronger blink, simply because it induces
stronger inhibition. As argued earlier, differential masking can also
partly explain the T1/T2 tradeoff found by Potter et al. (2002).
Other studies failed to find a tradeoff altogether (McLaughlin et
al., 2001; Shapiro et al., 1994; Ward et al., 1997) or reported only
856
OLIVERS AND MEETER
additive effects (i.e., T1 load did not interact with lag or presentation time, suggesting that it affects stages independent of what
causes the attentional blink; Akyürek & Hommel, 2005; Jolicoeur
& Dell’Acqua, 2000; Nieuwenstein et al., 2007; Olivers et al.,
2007). Again, others have reported an opposite tradeoff (deeper
blinks after easier T1s; Chua, 2005).
Conversely, T2 performance can be improved without impeding
T1 performance. Participants in our lab occasionally reported that
they felt they did better when they did not focus so strongly on the
stimulus stream, an observation also made earlier by Kahneman
(1973). Olivers and Nieuwenhuis (2005, 2006; see also Arend,
Johnston, & Shapiro, 2006, and Ho, Mason, & Spence, 2006) put
this observation to the test by comparing performance in a standard
attentional blink condition, in which participants were instructed to
concentrate on the task, with performance in conditions in which
observers were somewhat distracted from the central task. For
example, we asked them to actively think about their holiday
plans, to listen to a repetitive tune and detect an occasional yell in
it, to perform an additional memory task, or to simply concentrate
a little less. All these manipulations had the same effect: Detection
of the second target improved under conditions intended to invoke
a more distributed attentional state, suggesting that taking away
attentional resources may actually be beneficial, rather than detrimental (but see T. Spalek & V. Di Lollo, personal communication,
summer 2005, for a failure to replicate). Similarly, using multiple
RSVP streams, Kristjánsson and Nakayama (2002) found the
attentional blink to be more reduced the further away T2 was from
the stream containing T1 and, thus, the fewer resources were
presumably allocated to the T2 stream. Furthermore, Slagter et al.
(2007) found that extensive meditation practice leads to a reduced
attentional blink. It is important to note that in none of these
studies did the improvement in T2 performance go at the expense
of T1 performance.
In a similar vein, a study by Ferlazzo, Lucido, Di Nocera,
Fagioli, and Sdoia (2007) also suggests that processing resources
are not necessarily limited to one target, at the expense of the other
target. They found a standard attentional blink for the second of
two targets when participants were instructed to report “each of the
two targets.” However, using identical RSVPs, they found a reduced blink when the instruction was to report “the pair of targets.”
The blink was even virtually absent when the instructions were to
report “the sum of the targets” (when they were digits) or “the
syllable formed by the targets” (when they were letters). Ferlazzo
et al. argued that normally, participants treat T1 and T2 report as
separate goals, between which they then need to switch. It is this
goal switch that causes the attentional blink. When participants are
induced to adopt a more holistic, distributed task set, the attentional blink is reduced or absent. Finally, the fact that the attentional blink disappears when the RSVP stream can be treated as a
single, gradually changing object (Raymond, 2003; Kellie & Shapiro, 2004) also points toward the conclusion that what matters is
the selection mechanism itself (i.e., whether or not individual items
need to be selected from the stream) and not so much what
happens to T1 after selection.
Whereas limited-capacity theories have difficulties explaining these results, our theory readily explains why there is no
unequivocal evidence for resource depletion or structural bottlenecks in the attentional blink. This is because resource depletion plays no central role in our theory. Moreover, a more
holistic or distributed approach toward the stream would mean
a weaker inhibitory response to the distractors and, thus, a
weaker blink, simply because distractors are not treated as such
(or not as strongly).
Neurophysiology
The attentional mechanisms we have proposed here are consistent with what is known about the nervous system. The idea that
selection takes place through a rapid spread of recurrent processing
down the representational hierarchy has been proposed before
(Di Lollo et al., 2000) and has been directly linked to neurophysiological processes (Dehaene et al., 1998; Grossberg, 1995;
Lamme & Roelfsema, 2000). There is also ample evidence that
such recurrent processes might be biased in advance of stimulus
presentation, effectively allowing for an attentional filter or template to be set up (e.g., Chelazzi, Duncan, Miller, & Desimone,
1998; Desimone & Duncan, 1995; Motter, 1994). A likely source
for such biasing or gating signals is prefrontal cortex, an area
linked to working memory function (E. K. Miller, Erickson, &
Desimone, 1996; E. K. Miller & Cohen, 2001). The prefrontal
cortex has long-range feedback connections to extrastriate visual
areas as well as the inferotemporal cortex, making it the prime
candidate for hosting the gate neurons that we assume modulate
the rise and fall of attention in response to incoming stimuli. A
recent study by McNab and Klinberg (2008) is especially interesting here. In their functional magnetic resonance imaging study,
they found the posterior part of the middle frontal gyrus to become
active when observers were instructed to expect distractors during
a subsequent working memory task (without the stimuli being
present yet), indicating that this area represents the task settings
necessary to reject distractors. Activity in another structure, the
basal ganglia, was also increased prior to distractor rejection and
correlated negatively with the unnecessary storage of items in
working memory, as indicated by parietal activity (Vogel, McCollough, & Machizawa, 2005). This led Awh and Vogel (2008) to
call the prefrontal cortex/basal ganglia network “the bouncer in the
brain.” It is interesting to note that the basal ganglia, in turn, have
been proposed to provide the transient boost of activity necessary
to update working memory (Hazy et al., 2006), making it another
serious candidate for being an engine behind transient attention in
our model.
Further evidence comes from electroencephalography (EEG)
studies. The frontal cortex is also the source of the event-related
potential known as the frontal selection positivity (FSP; Potts,
2004). This component is known to respond selectively to taskrelevant stimuli and may correlate with the gating mechanisms
governing access to working memory. Two EEG studies using
attentional blink tasks show an encouraging pattern of findings in
this respect. At frontal electrode sites, Martens, Munneke, et al.
(2006) found a marked positivity roughly 250 ms after T1 onset,
relative to a condition in which there was no T1. It is interesting to
note that, after approximately 100 –150 ms, the FSP was followed
by a marked and prolonged negativity in the EEG signal, relative
to the no-target baseline. This negativity reached its peak shortly
after the FSP, gradually returning to baseline over a period of
300 –500 ms. Niedeggen, Hesselman, Sharaie, Milders, and Blakemore (2004) reported a highly similar pattern and indeed linked it
to a “postperceptual frontal gating mechanism that controls the
BOOST AND BOUNCE
access of visual stimuli to higher order evaluation” (p. 584). We
believe that this post-FSP negativity may reflect a direct neurophysiological correlate of the attentional blink, in terms of locus as
well as time course.4
Another EEG component that appears diagnostic of the attentional blink is the P3/P300, with the M300 as its magnetoencephalography counterpart. The P3 is a broad, positive event-related
brain potential probably encompassing multiple components but
with a modal peak latency of about 300 –350 ms after stimulus
onset. It has often been associated with the updating of working
memory (Donchin & Coles, 1988; Vogel & Luck, 2002). With
regard to the attentional blink, there is a positive correlation
between T2 accuracy and the amplitude and/or latency of the P3
elicited by T2 (Kessler et al., 2005; Rolke et al., 2001; Sergent et
al., 2005; Vogel et al., 1998). At the same time, T2 accuracy has
also been found to correlate negatively with the amplitude of the
P3 elicited by T1 (Martens, Johnson, Elmallah, & London, 2006;
McArthur, Budd, & Michie, 1999; Shapiro, Schmitz, Martens,
Hommel, & Schnitzler, 2006). These findings appear to lend
support to the idea of a trade-off in resources between T1 and T2
on the level of working memory. However, one needs to keep in
mind that these relationships are purely correlational, not causal.
This means that a third factor may explain the negative correlation between the P3 to T1 and the P3 to T2. For example,
imagine the following scenario: On a particular trial, T1 is
detected and elicits a particularly strong attentional enhancement, which is then measured as a strong P3. The strong
enhancement also carries over to the post-T1 distractor, with an
equally strong inhibitory response as a result. Because of this
strong suppression, T2 is less likely to enter working memory
and is, hence, less likely to trigger a P3. This explains why
strong blinks correlate with strong P3s to T1 and weak P3s to
T2. Thus, in this scenario, the negative correlation between T1
and T2 does not reflect a direct tradeoff in resources between
the two targets but, rather, a common underlying third factor,
namely a strong attentional boost. At present, excitatory feedback in our model is of fixed strength and length, which is
sufficient to account for averaged data. Naturally, it could be
made variable to account for differences between individual
trials.
Conclusion
Boost and bounce theory comprehensively and coherently explains the time course of attention and associated phenomena in
number of different paradigms involving rapid serial processing.
Moreover, the theory has been made explicit in a working computational model based on a minimum of assumptions and makes
testable new predictions. Finally, it appears to connect well with
current knowledge of the neurophysiology of attention. There is no
doubt that, as a theory of attention, the theory is far from complete.
Notably, the interactions with spatial and feature processing need
to be made explicit, as do the higher-end mechanisms involved in
the setting up and switching of tasks and the selection and consolidation of response representations. What the present work aims
to demonstrate is that the dynamics of attention can largely be
captured by a straightforward gating system, thus fulfilling attention’s classic role as the gatekeeper between sensory and response
processes.
857
4
The linking of inhibitory processing to a negative component is accidental, as the polarity of event-related potential components is a function
of brain anatomy, not physiology.
References
Akyürek, E. G., & Hommel, B. (2005). Short-term memory and the
attentional blink: Capacity versus content. Memory & Cognition, 33,
654 – 663.
Anderson, A. K. (2005). Affective influences on the attentional dynamics
supporting awareness. Journal of Experimental Psychology: General,
134, 258 –281.
Arend, I., Johnston, S., & Shapiro, K. (2006). Task-irrelevant visual motion
and flicker attenuate the attentional blink. Psychonomic Bulletin &
Review, 13, 600 – 607.
Aston-Jones, G., Rajkowski, J., & Cohen, J. D. (2000). Locus coeruleus
and regulation of behavioral flexibility and attention. Progress in Brain
Research, 126, 549 –554.
Awh, E., & Vogel, E. K. (2008). The bouncer in the brain. Nature
Neuroscience, 11, 5– 6.
Baars, B. J. (1989). A cognitive theory of consciousness. Cambridge,
United Kingdom: Cambridge University Press.
Bachmann, T., & Hommuk, K. (2005). How backward masking becomes
attentional blink. Psychological Science, 16, 740 –742.
Bachmann, T., & Oja, A. (2003). Flash-lag without change in feature space
is alive and well at late intervals after stream onset. Perception, S32,
126 –127.
Baddeley, A. D., & Hitch, G. (1974). Working memory. In G. Bower (Ed.),
Recent advances in learning and motivation (Vol. 8, pp. 47–90). New
York: Academic Press.
Bargh, J. (1992). The ecology of automaticity: Toward establishing the
conditions needed to produce automatic processing effects. American
Journal of Psychology, 105(2), 181–199.
Battye, T. G. G. (2006). Connectionist modelling of attention and anxiety.
Unpublished PhD thesis, The Medical Research Council’s Cognition and
Brain Sciences Unit, Cambridge University, United Kingdom.
Botella, J., Barriopedro, M., & Suero, M. (2001). A model of the formation
of illusory conjunctions in the time domain. Journal of Experimental
Psychology: Human Perception and Performance, 27, 1452–1467.
Bowman, H., & Wyble, B. P. (2007). The simultaneous type, serial token
model of temporal attention and working memory. Psychological Review, 114, 38 –70.
Bowman, H., Wyble, B., Chennu, S., & Craston, P. (2008). A reciprocal
relationship between bottom-up trace strength and the attentional blink
bottleneck: Relating the LC–NE and ST2 Models. Brain Research, 1202,
25– 42.
Brehaut, J. C., Enns, J. T., & Di Lollo, V. (1999). Visual masking plays two
roles in the attentional blink. Perception & Psychophysics, 61(7), 1436 –
1448.
Breitmeyer, B., Ehrenstein, A., Pritchard, K., Hiscock, M., & Crisan, J.
(1999). The roles of location specificity and masking mechanisms in the
attentional blink. Perception & Psychophysics, 61(5), 798 – 809.
Breitmeyer, B. G., & Ganz, L. (1976). Implications of sustained and
transient channels for theories of visual pattern masking, saccadic suppression, and information processing. Psychological Review, 83, 1–36.
Broadbent, D. E., & Broadbent, M. H. P. (1987). From detection to
identification: Response to multiple targets in rapid serial visual presentation. Perception & Psychophysics, 42, 105–113.
Bundesen, C., Habekost, T., & Kyllingsbæk, S. (2005). A neural theory of
visual attention: Bridging cognition and neurophysiology. Psychological
Review, 112, 291–328.
Busey, T. A., & Loftus, G. R. (1994). Sensory and cognitive components
of visual informaiton acquisition. Psychological Review, 101, 446 – 469.
858
OLIVERS AND MEETER
Carlson, T. A., Hogendoorn, H., & Verstraten, F. A. J. (2006). The speed
of visual attention: What time is it? Journal of Vision, 6, 1406 –1411.
Chartier, S., Cousineau, D., & Charbonneau, D. (2004). A connectionist model
of the attentional blink effect during a rapid serial visual task. Proceedings
of the 6th International Conference on Cognitive Modeling, 64 – 69.
Cheal, M. L., & Lyon, D. R. (1991). Central and peripheral precuing of
forced-choice discrimination. Quarterly Journal of Experimental Psychology, 43A, 859 – 880.
Chelazzi, L., Duncan, J., Miller, E. K., & Desimone, R. (1998). Responses
of neurons in inferior temporal cortex during memory-guided visual
search. Journal of Neurophysiology, 80, 2918 –2940.
Chua, F. K. (2005). The effect of target contrast on the attentional blink.
Perception & Psychophysics, 67, 770 –788.
Chua, F. K., Goh, J., & Hon, N. (2001). Nature of codes extracted during
the attentional blink. Journal of Experimental Psychology: Human Perception and Performance, 27, 1229 –1242.
Chun, M. M. (1997a). Temporal binding errors are redistributed by the
attentional blink. Perception & Psychophysics, 59(8), 1191–1199.
Chun, M. M. (1997b). Types and tokens in visual processing: A double
dissociation between the attentional blink and repetition blindness. Journal of Experimental Psychology: Human Perception and Performance,
23, 738 –755.
Chun, M. M., & Potter, M. C. (1995). A two-stage model for multiple
detection in RSVP. Journal of Experimental Psychology: Human Perception and Performance, 21, 109 –127.
Chun, M. M., & Potter, M. C. (2001). The attentional blink and task
switching within and across modalities. In K. Shapiro (Ed.), The limits
of attention: Temporal constraints in human information processing (pp.
20 –35). London: Oxford University Press.
Connors, B. W., & Gutnick, M. J. (1990). Intrinsic firing patterns of
diverse neocortical neurons. Trends in Neuroscience, 13, 99 –104.
Cowan, N. (2001). The magical number 4 in short-term memory: A
reconsideration of mental storage capacity. Behavioral and Brain Sciences, 24, 87–114.
Dehaene, S., Kerszberg, M., & Changeux, J.-P. (1998). A neuronal model
of a global workspace in effortful cognitive tasks. Proceedings of the
National Academy of Sciences, USA, 95, 14529 –14534.
Dehaene, S., Sergent, C., & Changeux, J.-P. (2003). A neuronal network
model linking subjective reports and objective physiological data during
conscious perception. Proceedings of the National Academy of Sciences,
USA, 100, 8520 – 8525.
Dell’Acqua, R., Pascali, A., Jolicoeur, P., & Sessa, P. (2003). Four-dot
masking produces the attentional blink. Vision Research, 43, 1907–
1913.
Desimone, R., & Duncan, J. (1995). Neural mechanisms of selective visual
attention. Annual Review of Neuroscience, 18, 193–222.
Di Lollo, V., Enns, J. T., & Rensink, R. A. (2000). Competition for
consciousness among visual events: The psychophysics of reentrant
visual processes. Journal of Experimental Psychology: General, 129,
481–507.
Di Lollo, V., Kawahara, J., Ghorashi, S. M. S., & Enns, J. T. (2005). The
attentional blink: Resource depletion or temporary loss of control?
Psychological Research, 69, 191–200.
Donchin, E., & Coles, M. G. H. (1988). Is the P300 component a manifestation of context updating? Behavioral and Brain Sciences, 11, 357–
374.
Drew, T., & Shapiro, K. (2006). Representational masking and the attentional blink. Visual Cognition, 13, 513–528.
Duncan, J., Ward, R., & Shapiro, K. (1994). Direct measurement of
attentional dwell time in human vision. Nature, 369, 313–315.
Dux, P. E., & Coltheart, V. (2005). The meaning of the mask matters:
Evidence of conceptual interferences in the attentional blink. Psychological Science, 16, 775–779.
Dux, P. E., Coltheart, V., & Harris, I. (2006). On the fate of distractor
stimuli in rapid serial visual presentation. Cognition, 99, 355–382.
Engle, R. W., Conway, A. R. A., Tuholski, S. W., & Schisler, R. J. (1995).
A resource account of inhibition. Psychological Science, 6, 122–125.
Enns, J. T., Visser, T. A. W., Kawahara, J.-I., & Di Lollo, V. (2001). Visual
masking and task switching in the attentional blink. In K. Shapiro (Ed.),
The limits of attention: Temporal constraints in human information
processing (pp. 65– 81). London: Oxford University Press.
Eriksen, C. W. (1990). Attentional search of the visual field. In D. Brogan
(Ed.), Visual search (pp. 3–19). London: Taylor & Francis.
Ferlazzo, F., Lucido, S., Di Nocera, F., Fagioli, S., & Sdoia, S. (2007).
Switching between goals mediates the attentional blink effect. Experimental Psychology, 54, 89 –98.
Folk, C., Remington, R. W., & Johnston, J. C. (1992). Involuntary covert
orienting is contingent on attentional control settings. Journal of Experimental Psychology: Human Perception and Performance, 18, 1030 –1044.
Fragopanagos, N., Kockelkoren, S., & Taylor, J. G. (2005). A neurodynamic
model of the attentional blink. Cognitive Brain Research, 24, 568 –586.
Fuster, J. M. (1997). The prefrontal cortex: Anatomy, physiology, and
neuropsychology of the frontal lobe (3rd ed.). New York: Lippincott
Williams & Wilkins.
Fuster, J. M., Bauer, R. H., & Jervey, J. P. (1982). Cellular discharge in the
dorsolateral prefrontal cortex of the monkey in cognitive tasks. Experimental Neurology, 77, 679 – 694.
Gawne, T. J., & Martin, J. M. (2002). Responses of primate visual cortical
V4 neurons to simultaneously presented stimuli. Journal of Neurophysiology, 88, 1128 –1135.
Ghorashi, S. M. S., Zuvic, S. M., Visser, T. A. W., & Di Lollo, V. (2003).
Focal distraction: Spatial shifts of attention are not required for contingent capture. Journal of Experimental Psychology: Human Perception
and Performance, 24, 699 –706.
Giesbrecht, B., Bischof, W. F., & Kingstone, A. (2003). Visual masking
during the attentional blink: Tests of the object substitution hypothesis.
Journal of Experimental Psychology: Human Perception and Performance, 29, 238 –258.
Giesbrecht, B., & Di Lollo, V. (1998). Beyond the attentional blink: Visual
masking by object substitution. Journal of Experimental Psychology:
Human Perception and Performance, 24, 1454 –1466.
Goldman-Rakic, P. S. (1995). Cellular basis of working memory. Neuron,
14, 477– 485.
Grandison, T. D., Ghirardelli, T. G., & Egeth, H. E. (1997). Beyond
similarity: Masking of the target is sufficient to cause the attentional
blink. Perception & Psychophysics, 59(2), 266 –274.
Grossberg, S. (1995). The attentive brain. American Scientist, 83, 438 – 449.
Hasher, L., & Zacks, R. T. (1988). Working memory, comprehension, and
aging. In G. H. Bower (Ed.), The psychology of learning and motivation:
Advances in research and theory (Vol. 22, pp. 193–225). San Diego,
CA: Academic Press.
Hazy, T. E., Frank, M. J., & O’Reilly, R. (2006). Banishing the homunculus: Making working memory work. Neuroscience, 139, 105–118.
Hikosaka, O., Miyauchi, S., & Shimojo, S. (1993). Focal visual attention
produces illusory temporal order and motion sensation. Vision Research,
33, 1219 –1240.
Ho, C., Mason, O., & Spence, C. (2006). An investigation into the temporal
dimension of the Mozart effect: Evidence from the attentional blink task.
Acta Psychologica, 125, 117–128.
Hommel, B., & Akyürek, E. G. (2005). Lag-1 sparing in the attentional
blink: Benefits and costs of integrating two events into a single episode.
Quarterly Journal of Experimental Psychology, 58, 1415–1433.
Hommel, B., Kessler, K., Schmitz, F., Gross, J., Akyürek, E. G., Shapiro,
K., et al. (2006). How the brain blinks: Towards a neurocognitive model
of the attentional blink. Psychological Research, 70, 425– 435.
Houghton, D. (1994). A model of inhibitory mechanisms in selective
BOOST AND BOUNCE
attention. In D. Dagenbach (Ed.), Inhibitory processes in attention,
memory, and language (pp. 53–112). San Diego, CA: Academic Press.
Huber, D. E., & O’Reilly, R. C. (2003). Persistence and accommodation in
short-term priming and other perceptual paradigms: Temporal segregation through synaptic depression. Cognitive Science, 27, 403– 430.
Isaak, M. I., Shapiro, K. L., & Martin, J. (1999). The attentional blink
reflects retrieval competition among multiple rapid serial visual presentation items: Tests of an interference model. Journal of Experimental
Psychology: Human Perception and Performance, 25, 1774 –1792.
Jolicoeur, P. (1998). Modulation of the attentional blink by on-line response selection: Evidence from speeded and unspeeded task-1 decisions. Memory & Cognition, 26(5), 1014 –1032.
Jolicoeur, P. (1999). Concurrent response-selection demands modulate the
attentional blink. Journal of Experimental Psychology: Human Perception and Performance, 25, 1097–1113.
Jolicoeur, P., & Dell’Acqua, R. (1998). The demonstration of short-term
consolidation. Cognitive Psychology, 32, 138 –202.
Jolicoeur, P., & Dell’Acqua, R. (2000). Selective influence of second target
exposure duration and Task1 load effects in the attentional blink phenomenon. Psychonomic Bulletin & Review, 7, 472– 479.
Jolicoeur, P., Tombu, M., Oriet, C., & Stevanovski, B. (2002). From perception to action: Making the connection. In W. Prinz & B. Hommel (Eds.),
Attention and performance: Vol. XIX. Common mechanisms in perception
and action (pp. 558 –586). Oxford, United Kingdom: Oxford University
Press.
Jonides, J. (1981). Voluntary versus automatic control over the mind’s
eye’s movement. In J. Long & A. Baddeley (Eds.), Attention and
performance IX (pp. 187–203). Hillsdale, NJ: Erlbaum.
Jonides, J., Smith, E. E., Marshuetz, C., Koeppe, R. A., & Reuter-Lorenz,
P. A. (1998). Inhibition in verbal working memory revealed by brain
activation. Proceedings of the National Academy of Sciences, USA, 95,
8410 – 8413.
Kahneman, D. (1973). Attention and effort. New York: Prentice Hall.
Kanwisher, N. (1987). Repetition blindness: Type recognition without
token individuation. Cognition, 27, 117–143.
Kawahara, J.-I., & Enns, J. T. (in press). Selection difficulty and inter-item
competition are independent factors in rapid visual stream perception.
Journal of Experimental Psychology: Human Perception and Performance.
Kawahara, J., Enns, J. T., & Di Lollo, V. (2006). The attentional blink is
not a unitary phenomenon. Psychological Research, 70, 405– 413.
Kawahara, J., Kumada, T., & Di Lollo, V. (2006). The attentional blink is
governed by a temporary loss of control. Psychonomic Bulletin &
Review, 13, 886 – 890.
Kellie, F. J., & Shapiro, K. (2004). Object file continuity predicts attentional blink magnitude. Perception & Psychophysics, 66, 692–712.
Kessler, K., Schmitz, F., Gross, J., Hommel, B., Shapiro, K., & Schnitzler,
A. (2005). Cortical mechanisms of attention in time: Neural correlates of
the lag-1 sparing phenomenon. European Journal of Neuroscience, 21,
2563–2574.
Keysers, C., & Perrett, D. I. (2002). Visual masking and RSVP reveal
neural competition. Trends in Cognitive Sciences, 6, 120 –125.
Keysers, C., Xiao, D.-K., Földiák, P., & Perrett, D. I. (2001). The speed of
sight. Journal of Cognitive Neuroscience, 13(1), 90 –101.
Kristjánsson, Á., Mackeben, M., & Nakayama, K. (2001). Rapid, objectbased learning in the deployment of transient attention. Perception, 30,
1375–1387.
Kristjánsson, Á., & Nakayama, K. (2002). The attentional blink in space
and time. Vision Research, 42, 2039 –2050.
Lamme, V. A. F., & Roelfsema, P. R. (2000). The distinct modes of vision
offered by feedforward and recurrent processing. Trends in Neuroscience, 23, 571–579.
Lavie, N. (2000). Selective attention and cognitive control: Dissociating
attentional functions through different types of load. In S. Monsell & J.
859
Driver (Eds.), Control of cognitive processes: Attention and performance XVIII (pp. 175–194). Cambridge, MA: Bradford, MIT Press.
Lavie, N., Hirst, A., Fockert, J. W. d., & Viding, E. (2004). Load theory of
selective attention and cognitive control. Journal of Experimental Psychology: General, 133, 339 –354.
Lawrence, D. H. (1971). Two studies of visual search for word targets with
controlled rates of presentation. Perception and Psychophysics, 10,
85– 89.
Legge, G. (1978). Sustained and transient mechanisms in human vision:
Temporal and spatial properties. Vision Research, 18, 341–376.
Loach, D., & Marı́-Beffa, P. (2003). Post-target inhibition: A temporal
binding mechanism? Visual Cognition, 10, 513–526.
Luck, S. J., & Vecera, S. P. (2002). Attention: From tasks to mechanisms.
In S. Yantis (Ed.), Stevens’ handbook of experimental psychology: Vol.
1. Sensation and perception (pp. 235–286). New York: Wiley.
Luck, S. J., Vogel, E. K., & Shapiro, K. L. (1996, October 17). Word
meanings can be accessed but not reported during the attentional blink.
Nature, 383, 616 – 618.
Mackeben, M., & Nakayama, K. (1993). Express attentional shifts. Vision
Research, 33, 85–90.
Macknik, S. L., & Livingstone, M. S. (1998). Neuronal correlates of
visibility and invisibility in the primate visual system. Nature Neuroscience, 1, 144 –149.
Maki, W. S., Bussard, G., Lopez, K., & Digby, B. (2003). Sources of
interference in the attentional blink: Target-distractor similarity revisited. Perception & Psychophysics, 65, 188 –201.
Maki, W. S., Couture, T., Frigen, K., & Lien, D. (1997). Sources of the
attentional blink during rapid serial visual presentation: Perceptual interference and retrieval competition. Journal of Experimental Psychology: Human Perception and Performance, 23, 1393–1411.
Maki, W. S., Frigen, K., & Paulson, K. (1997). Associative priming by
targets and distractors during rapid serial visual presentation: Does word
meaning survive the attentional blink? Journal of Experimental Psychology: Human Perception and Performance, 23, 1014 –1034.
Maki, W. S., & Padmanabhan, G. (1994). Transient suppression of processing during rapid serial visual presentation: Acquired distinctiveness
of probes modulates the attentional blink. Psychonomic Bulletin &
Review, 1, 499 –504.
Marois, R., & Ivanoff, J. (2005). Capacity limits of information processing
in the brain. Trends in Cognitive Sciences, 9, 296 –305.
Marois, R., Yi, D. J., & Chun, M. M. (2004). The neural fate of consciously
perceived and missed events in the attentional blink. Neuron, 41, 465–
472.
Martens, S., Johnson, A., Elmallah, K., & London, R. (2006). Cuing and
stimulus effects on the P3 and the AB. Acta Psychologica, 123, 204 –
218.
Martens, S., Munneke, J., Smid, H., & Johnson, A. (2006). Quick minds
don’t blink: Electrophysiological correlates of individual differences in
attentional selection. Journal of Cognitive Neuroscience, 18, 1423–
1438.
Martens, S., Wolters, G., & Van Raamsdonk, M. (2002). Blinks of the
mind: Memory effects of attentional processes. Journal of Experimental
Psychology: Human Perception and Performance, 28, 1275–1287.
Maylor, E. A., & Hockey, R. (1985). Inhibitory component of externally
controlled covert orienting in visual space. Journal of Experimental
Psychology: Human Perception and Performance, 11, 777–787.
McArthur, G., Budd, T., & Michie, P. (1999). The attentional blink and
P300. NeuroReport, 10(17), 3691–3695.
McAuliffe, S. P., & Knowlton, B. J. (2000). Dissociating the effects of
featural and conceptual interference on multiple target processing in
rapid serial visual presentation. Perception & Psychophysics, 62, 187–
195.
McLaughlin, E. N., Shore, D. I., & Klein, R. M. (2001). The attentional
860
OLIVERS AND MEETER
blink is immune to masking-induced data limits. Quarterly Journal of
Experimental Psychology, 54A, 169 –196.
McNab, F., & Klingberg, T. (2008). Prefrontal cortex and basal ganglia
control access to working memory. Nature Neuroscience, 11, 103–107.
Miller, E. K., & Cohen, J. D. (2001). An integrative theory of prefrontal
cortex function. Annual Review of Neuroscience, 24, 167–202.
Miller, E. K., Erickson, C. A., & Desimone, R. (1996). Neural mechanisms
of visual working memory in prefrontal cortex of the macaque. Journal
of Neuroscience, 16, 5154 –5167.
Miller, G. A. (1956). The magical number seven, plus or minus two: Some
limits on our capacity for processing information. Psychological Review,
63, 81–97.
Miller, J. (1989). The control of attention by abrupt visual onsets and
offsets. Perception & Psychophysics, 45, 567–571.
Motter, B. C. (1994). Neural correlates of attentive selection for color or
luminance in extrastriate area V4. Journal of Neuroscience, 14, 2178 –
2189.
Müller, H. J., & Rabbitt, P. M. A. (1989). Reflexive and voluntary
orienting of visual attention: Time course of activation and resistance to
interruption. Journal of Experimental Psychology: Human Perception
and Performance, 15, 315–330.
Nakayama, K., & Mackeben, M. (1989). Sustained and transient components of focal visual attention. Vision Research, 29, 1631–1647.
Niedeggen, M., Hesselman, G., Sharaie, A., Milders, M., & Blakemore, C.
(2004). Probing the prerequisites for motion blindness. Journal of Cognitive Neuroscience, 16, 584 –597.
Nieuwenhuis, S., Gilzenrat, M. S., Holmes, B. D., & Cohen, J. D. (2005).
The role of the locus coeruleus in mediating the attentional blink: A
neurocomputational theory. Journal of Experimental Psychology: General, 134, 291–307.
Nieuwenstein, M. R. (2006). Top-down controlled, delayed selection in the
attentional blink. Journal of Experimental Psychology: Human Perception and Performance, 32, 973–985.
Nieuwenstein, M. R., Chun, M. M., Lubbe, R. H. J. v. d., & Hooge, I. T. C.
(2005). Delayed attentional engagement in the attentional blink. Journal
of Experimental Psychology: Human Perception and Performance, 31,
1463–1475.
Nieuwenstein, M. R., Johnson, A., Kanai, R., & Martens, S. (2007).
Cross-task repetition amnesia: Impaired recall of RSVP targets held in
memory for a secondary task. Acta Psychologica, 125, 319 –331.
Nieuwenstein, M. R., & Potter, M. C. (2006). Temporal limits of selection
and memory encoding: A comparison of whole versus partial report in
rapid serial visual presentation. Psychological Science, 17, 471– 475.
Nothdurft, H.-C. (2002). Attention shifts to salient targets. Vision Research, 42, 1287–1306.
Olivers, C. N. L. (2007). The time course of attention: It’s better than we
thought. Current Directions in Psychological Science, 16, 11–15.
Olivers, C. N. L., & Nieuwenhuis, S. (2005). The beneficial effect of
concurrent task-irrelevant mental activity on temporal attention. Psychological Science, 16(4), 265–269.
Olivers, C. N. L., & Nieuwenhuis, S. (2006). The beneficial effects of
additional task load, positive affect, and instruction on the attentional
blink. Journal of Experimental Psychology: Human Perception and
Performance, 32, 364 –379.
Olivers, C. N. L., Van der Stigchel, S., & Hulleman, J. (2007). Spreading
the sparing: Against a limited-capacity account of the attentional blink.
Psychological Research, 71, 126 –139.
Olivers, C. N. L., & Watson, D. G. (2006). Input control processes in rapid
serial visual presentation: Target selection and distractor inhibition.
Journal of Experimental Psychology: Human Perception and Performance, 32, 1083–1092.
Ouimet, C., & Jolicoeur, P. (2007). Beyond Task 1 difficulty: The duration
of T1 encoding modulates the attentional blink. Visual Cognition, 15,
290 –304.
Pashler, H. (1984). Processing stages in overlapping tasks: Evidence for a
central bottleneck. Journal of Experimental Psychology: Human Perception and Performance, 10, 358 –377.
Popple, A. V., & Levi, D. M. (2007). Attentional blinks as errors in
temporal binding. Vision Research, 47, 2973–2981.
Posner, M. I. (1980). Orienting of attention: The 7th Sir Frederic Bartlett
lecture. Quarterly Journal of Experimental Psychology, 32, 3–25.
Posner, M. I., & Cohen, Y. (1984). Components of visual orienting. In H.
Bouma & D. G. Bouwhuis (Eds.), Attention and performance X: Control
of language processes (pp. 531–556). Hillsdale, NJ: Erlbaum.
Postle, B. R. (2006). Working memory as an emergent property of the mind
and brain. Neuroscience, 139, 23–38.
Potter, M. C., Chun, M. M., Banks, B. S., & Muckenhoupt, M. (1998). Two
attentional deficits in serial target search: The visual attentional blink
and an amodal task-switch deficit. Journal of Experimental Psychology:
Learning, Memory, and Cognition, 24, 979 –992.
Potter, M. C., Dell’Acqua, R., Pesciarelli, F., Job, R., & Peressotti, F.
(2005). Bidirectional semantic priming in the attentional blink. Psychonomic Bulletin & Review, 12(3), 460 – 465.
Potter, M. C., Staub, A., & O’Conner, D. H. (2002). The time course of
competition for attention: Attention is initially labile. Journal of Experimental Psychology: Human Perception and Performance, 28, 1149 –
1162.
Potts, G. F. (2004). An ERP index of task relevance evaluation of visual
stimuli. Brain and Cognition, 56, 5–31.
Raymond, J. E. (2003). New objects, not new features, trigger the attentional blink. Psychological Science, 14, 54 –59.
Raymond, J. E., Shapiro, K. L., & Arnell, K. M. (1992). Temporary
suppression of visual processing in an RSVP task: An attentional blink?
Journal of Experimental Psychology: Human Perception and Performance, 18, 849 – 860.
Raymond, J. E., Shapiro, K. L., & Arnell, K. M. (1995). Similarity
determines the attentional blink. Journal of Experimental Psychology:
Human Perception and Performance, 21, 653– 662.
Reeves, A., & Sperling, G. (1986). Attention gating in short-term visual
memory. Psychological Review, 93, 180 –206.
Rolke, B., Heil, M., Streb, J., & Hennighausen, E. (2001). Missed prime
words within the attentional blink evoke an N400 semantic priming
effect. Psychophysiology, 38, 165–174.
Rolls, E. T., & Tovee, M. J. (1994). Processing speed in the cerebral cortex
and the neurophysiology of visual masking. Proceedings of the Royal
Society London B, 257, 9 –15.
Sahraie, A., Milders, M., & Niedeggen, M. (2001). Attention-induced
motion blindness. Vision Research, 41, 1613–1617.
Scharlau, I., Ansorge, U., & Horstmann, G. (2006). Latency facilitation in
temporal-order judgments: Time course of facilitation as a function of
judgment. Acta Psychologica, 122, 129 –159.
Seiffert, A. E., & Di Lollo, V. (1997). Low-level masking in the attentional
blink. Journal of Experimental Psychology: Human Perception and
Performance, 23, 1061–1073.
Sergent, C., Baillet, S., & Dehaene, S. (2005). Timing of the brain events
underlying access to consciousness during the attentional blink. Nature
Neuroscience, 8, 1391–1400.
Shapiro, K. L., Arnell, K. M., & Raymond, J. E. (1997). The attentional
blink. Trends in Cognitive Sciences, 1, 291–296.
Shapiro, K. L., Caldwell, J., & Sorensen, R. E. (1997). Personal names and
the attentional blink: A visual “cocktail party” effect. Journal of Experimental Psychology: Human Perception and Performance, 23, 504 –514.
Shapiro, K. L., Driver, J., Ward, R., & Sorensen, R. E. (1997). Priming
from the attentional blink: A failure to extract visual tokens but not
visual types. Psychological Science, 8, 95–100.
Shapiro, K. L., & Raymond, J. E. (1994). Temporal allocation of visual
attention. Inhibition or interference? In D. Dagenbach & T. H. Carr
BOOST AND BOUNCE
(Eds.), Inhibitory processes in attention, memory, and language (pp.
151–188). San Diego, CA: Academic Press.
Shapiro, K. L., Raymond, J. E., & Arnell, K. M. (1994). Attention to visual
pattern information produces the attentional blink in RSVP. Journal of
Experimental Psychology: Human Perception and Performance, 20,
357–371.
Shapiro, K., Schmitz, F., Martens, S., Hommel, B., & Schnitzler, A.
(2006). Resource sharing in the attentional blink. NeuroReport, 17,
163–166.
Shih, S.-I. (2008). The attention cascade model and the attentional blink.
Cognitive Psychology, 56, 210 –236.
Shih, S.-I., & Sperling, G. (2002). Measuring and modeling the trajectory
of visual spatial attention. Psychological Review, 109, 260 –305.
Shimozaki, S. S., Chen, K. Y., Abbey, C. K., & Eckstein, M. P. (2007). The
temporal dynamics of selective attention of the periphery as measured by
classification images. Journal of Vision, 7, 1–20.
Shore, D. I., McLaughlin, E. N., & Klein, R. (2001). Modulation of the
attentional blink by differential resource allocation. Canadian Journal of
Experimental Psychology, 55(4), 318 –324.
Shulman, G. L., Remington, R. W., & McLean, J. P. (1979). Moving
attention through visual space. Journal of Experimental Psychology:
Human Perception and Performance, 5, 522–526.
Slagter, H. A., Lutz, A., Greischar, L. L., Francis, A. D., Nieuwenhuis, S.,
Davis, J., et al. (2007). Mental training affects distribution of limited
brain resources. Plos Biology, 5, 1228 –1235.
Sperling, G., & Reeves, A. (1980). Measuring the reaction time of an
unobservable response: A shift of visual attention. In R. S. Nickerson
(Ed.), Attention and performance VIII (pp. 347–360). New York: Academic Press.
Sperling, G., & Weichselgartner, E. (1995). Episodic theory of the dynamics of spatial attention. Psychological Review, 102, 503–532.
Suzuki, S., & Cavanagh, P. (1997). Focused attention distorts visual space:
An attentional repulsion effect. Journal of Experimental Psychology:
Human Perception and Performance, 23, 443– 463.
Tomita, H., Ohbayashi, M., Nakahara, K., Hasegawa, I., & Miyashita, Y.
(1999, October 14). Top-down signal from prefrontal cortex in executive
control of memory retrieval. Nature, 401, 699 –701.
Treisman, A., & Gelade, G. (1980). A feature-integration theory of attention. Cognitive Psychology, 12, 97–136.
Ungerleider, L. G., & Mishkin, M. (1982). Two cortical visual systems. In
D. J. Ingle, M. A. Goodale, & R. J. W. Mansfield (Eds.), Analysis of
visual behavior (pp. S49 –S86). Cambridge, MA: MIT Press.
Visser, T. A. W. (2007). Masking T1 difficulty: Processing time and the
attentional blink. Journal of Experimental Psychology: Human Perception and Performance, 33, 285–297.
861
Visser, T. A. W., Bischof, W. F., & Di Lollo, V. (1999). Attentional
switching in spatial and nonspatial domains: Evidence from the attentional blink. Psychological Bulletin, 125, 458 – 469.
Visser, T. A. W., Bischof, W. F., & Di Lollo, V. (2004). Rapid serial visual
distraction: Task-irrelevant items can produce an attentional blink. Perception & Psychophysics, 66(8), 1418 –1432.
Visser, T. A. W., Merikle, P. M., & Di Lollo, V. (2005). Priming in the
attentional blink: Perception without awareness. Visual Cognition, 12,
1362–1372.
Visser, T. A. W., Zuvic, S. M., Bischof, W. F., & Di Lollo, V. (1999). The
attentional blink with targets in different spatial locations. Psychonomic
Bulletin & Review, 6(3), 432– 436.
Vogel, E. K., & Luck, S. J. (2002). Delayed working memory consolidation during the attentional blink. Psychonomic Bulletin & Review, 9,
739 –743.
Vogel, E. K., Luck, S. J., & Shapiro, K. L. (1998). Electrophysiological
evidence for a postperceptual locus of suppression during the attentional
blink. Journal of Experimental Psychology: Human Perception and
Performance, 24, 1656 –1674.
Vogel, E. K., McCollough, A. W., & Machizawa, M. G. (2005, November
24). Neural measures reveal individual differences in controlling access
to working memory. Nature, 438, 500 –503.
Ward, R., Duncan, J., & Shapiro, K. (1997). Effects of similarity, difficulty, and nontarget presentation on the time course of visual attention.
Perception & Psychophysics, 59, 593– 600.
Watson, D. G., & Humphreys, G. W. (1995). Attention capture by contour
onsets and offsets: No special role for onsets. Perception & Psychophysics, 57, 583–597.
Weichselgartner, E., & Sperling, G. (1987, November 6). Dynamics of
automatic and controlled visual attention. Science, 238, 778 –780.
Welford, A. T. (1952). The “psychological refractory period” and the
timing of high-speed performance: A review and theory. British Journal
of Psychology, 43, 2–19.
Wolfe, J. M. (1994). Guided Search 2.0: A revised model of visual search.
Psychonomic Bulletin & Review, 1, 202–238.
Wyble, B., Bowman, H., & Nieuwenstein, M. R. (in press). The attentional
blink provides episodic distinctiveness: Sparing at a cost. Journal of
Experimental Psychology: Human Perception and Performance.
Zeki, S. M. (1978, August 3). Functional specialization in the visual cortex
of the rhesus monkey. Nature, 274, 423– 428.
Zipser, K., Lamme, V. A., & Schiller, P. H. (1996). Contextual modulation
in primary visual cortex. Journal of Neuroscience, 15, 7376 –7389.
Zuvic, S. M., Visser, T. A. W., & Di Lollo, V. (2000). Direct estimates of
processing delays in the attentional blink. Psychological Research, 63, 192–198.
(Appendix follows)
OLIVERS AND MEETER
862
Appendix
Technical Details of the Model
The model consists of two parts, namely (a) sensory activity
and (b) a gating system that governs entrance to working
memory (WM). Both are defined by mathematical formulas that
describe the activity of sets of interacting model neurons.
Low-level sensory processes that lead up to the relevant perceptual representations (i.e., identity, category, and color) are
not explicitly modeled, and neither are the high-level consolidation and response-generation processes in WM. Such processes, of course, take place, but within our model, they play no
crucial role in explaining the data. The crucial role is played by
the gating between the sensory memory and WM stages. All
simulations were run in discrete time steps of 1 ms. The code
used in the simulations can be found at http://olivers.cogpsy.nl.
Sensory Activity
The stream of inputs to the model are taken from the paradigms
that are simulated. Of each item in the stream, the presentation
duration, identity, category, and color are coded.
A first set of equations models the activity of sensory neurons
that code for the identity, category (letters, digits, or symbols),
and color of an item. We collectively refer to these neurons as
first-stage neurons. The total activity of these sensory neurons
is the product of bottom-up input and of attention already
directed at the stream location, as generated by preceding items
(later, we explain how activity is also convolved with attention
generated by item j itself). Consider an item j that is presented
from time steps ton to toff at location l. Sensory activity for item
j at time step t, aj,t, is the product of the bottom-up input
generated by j and bj, and of the attention directed at the
location generated by preceding items, gl,t, at a delay of 25 ms:
(A1)
aj,t ⫽ gl,t⫺25 bj,t.
Here, gl,t-25 can be regarded as the “prior attentional state,” as
generated by all the previous items (i.e., prior to the attention yet
to be triggered by the current item). The 25 ms simulates signaling
delays in the effects that these previous items exert on the current
item. The bottom-up input bj,t is governed by the following equations:
bj,t ⫽ 0 for t ⬍ ton,
冋
parameter (not made explicit in Equation A2b but set to 2) and a
scale parameter trans, with value 0.04. The scaling constant of the
transient signal, ctrans, is equal to 3. We added the sustained signal
to account for activity generated by sustained stimulus presentations: Although activity seen in brain areas involved in vision
shows a marked transient, it does not decay back to zero with
sustained presentation (e.g., Gawne & Martin, 2002). The sustained signal rises more slowly as the item is presented, with time
constant sus ⫽ 1/10, to a maximum of csus (equal to 0.5). The
resulting function resembles recorded responses of neurons in
the visual system (e.g., Keysers & Perrett, 2002). More sophisticated accounts of how this activation pattern emerges from
mechanisms at the individual neuronal level exist but are beyond the scope of the present model (see, e.g., Huber &
O’Reilly, 2003). After presentation, bj,t falls exponentially (see
Equation A2c), with dec determining the speed of decay (the
higher dec, the faster decay; decay depends on masking, see
below). Salient visual offsets, such as a gap within a rapid serial
visual presentation task (RSVP; Lawrence, 1971) stream, are
also assumed to produce transient signals (see, e.g., J. Miller,
1989; Watson & Humphreys, 1995, for behavioral evidence). In
the model, gaps within a stream therefore produce the same
bottom-up signal as do items.
The strength of the bottom-up signals for item j, sj in Equation
A2b, incorporates forward masking and the adaptation caused by
item repetitions in a stream. Forward masking is stronger the more
similar two items are, and this is implemented by making sj a
function of the contrast between item j and the item that precedes
it (see Equation A3). Adaptation is implemented by making sj a
function of the distance in time between the onset of the current
and the onset of the last presentation of the item:
冉
bj,t ⫽ sj ctranstrans(t ⫺ ton)e⫺trans(t ⫺ ton) ⫹ csus 1 ⫺
(A2a)
冊册
1
1 ⫹ sus(t ⫺ ton)
for ton ⬍ t ⬍ toff,
(A2b)
and
bj,t ⫽ bj,toff e⫺dec(t ⫺ toff) for t ⬎ toff.
(A2c)
The bottom-up input bj,t is equal to 0 before ton (see Equation
A2a). On the time steps that j is presented, it generates a transient
(trans) signal and an additional sustained (sus) signal (see Equation
A2b), both multiplied by strength sj. Following others (e.g., Busey
& Loftus, 1994), we modeled the transient signal as having the
shape of a gamma distribution. Its form is determined by a shape
sj ⫽
冉
冊
␦decay
1
1⫺
.
dcoldcat
ton ⫺ ton,prev
(A3)
In Equation 3, dcol is the color contrast. It is equal to dcoldif (0.9)
if j has a different color than its predecessor and to 1 if color
remains the same. The contrast in category, dcat, is equal to dcatdif
(0.7, unless specified otherwise) if j is from a different category
than its predecessor. It is equal to dcatsame (1, unless specified
otherwise) if the category remains the same. In accordance with
“repetition blindness” effects (Kanwisher, 1987), strength sj is
assumed to be lower if the item was recently presented in the
stream (i.e., the difference between the onset of the current presentation, ton, and the onset of the previous presentation, ton,prev,
is small) than when it was not. Adaptation resulting from the
presentation of items in previous trials is assumed to be negligible,
with ton ⫺ ton,prev effectively equal to infinity. Constant ␦decay was
set to 50. This value results in a negative sj if the previous presentation was less than 50 ms ago, but at that speed, the presentations are
assumed to fuse and effectively become a single presentation. The
strength of the decay in Equation 2c, dec, incorporates backward
masking by making it dependent on the item following j: dec ⫽
.05dcat dcol. Here, dcol and dcat have the same values as above but are
set by the contrast between item j and the following item instead
BOOST AND BOUNCE
of the preceding item. This implies that dec is smaller (and, thus,
decay is slower and masking is weaker) the more dissimilar item
j and the following item are.
Attentional Gating
Items generate a top-down gating response. Targets generate
excitatory attention to their location, distractors generate inhibition. Both excitation and inhibition are transient signals, governed
by a function with the same shape as the one for visual transients.
The assumed mechanism is one where sensory activity drives gate
neurons, which, in turn, modulate the sensory activity. The activation of gate neurons has effects that take time to develop and
time to subside. Behaviorally, this transient response corresponds
to transient attention (Nakayama & Mackeben, 1989). The
strength of the excitatory or inhibitory gating response is assumed
to depend on the strength of the bottom-up evidence for a target or
distractor. For practical purposes, the model takes the sensory
activity during the first 15 ms of presentation as a measure of
perceptual strength. The attentional response to item j on time step
t, fj,t, is excitatory for targets (see Equation A4a) and inhibitory for
distractors (see Equation A4b). Both are set relative to a baseline
of 1:
fj,t ⫽ 1 ⫹ wj
冘
ton ⫹ 15
u ⫽ ton
aj,u[cattatt (t ⫺ ton)e⫺att(t ⫺ ton)]
for t ⬎ ton ⫹ 15 and j: target;
fj,t ⫽
1 ⫹ wj
冘
(A4a)
1
a [cattatt (t ⫺ ton)e⫺att(t ⫺ ton)]
for t ⬎ ton ⫹ 15 and j: distractor. (A4b)
On the first 15 time steps of presentation, the summation is from
ton to t, instead of to ton⫹15, (i.e., from onset to the current time
step, so that in these 15 ms, gating does not depend on future
values). Here, constant catt ⫽ 3, and time constant att ⫽ 0.015.
Strength wj is dependent on the target in the simulated paradigm
and equals 1/(dcatdif ⫻ dcoldif). That is, targets can be defined by
category, color, or both, and the need for gating strength depends
on the relative salience of the bottom-up evidence. Gaps in the
stream are assumed to be slightly disruptive and are therefore
assumed to generate a slight inhibitory signal, with wj ⫽ 0.1. If an
item has the target color but is of the alphanumeric distractor
category (so it is both target-like and distractor-like), it is treated
as neutral, with wj ⫽ 0. Thus, fj,t is the attention generated by the
current item, and it modulates the activity of that item at no delay.
The net attentional gating devoted to the location of the stream,
l, is the product of the attention, both excitatory and inhibitory,
generated by all items preceding the current item:
写
f ,
i i,t
sensory activity generated by the item over the course of
the stream (aj,t, which is, in turn, a function of all prior attention;
see Equation A1), of the new attention generated by the current
item (fj,t; see Equation A4), and of the likelihood that there is space
in WM (i.e., that all available slots have not yet been occupied by
preceding items):
冢
1
p(j in WM) ⫽ 1 ⫺
1 ⫹ cWM
冋冘
册冣
⫹
f a ⫺r
t j,t j,t
p(q ⬍ C).
(A6)
The first part of Equation A6 is a squashing function that keeps the
likelihood of entry into WM between 0 and 1. Scaling constant
cWM is set to 15, threshold constant r to 1. Negative values within
the [ ]⫹ brackets are set to 0. This means that if the sum of sensory
activity multiplied by attention generated by item j is lower than r,
then j has 0 likelihood of entering WM. P(q ⬍ C) is the likelihood
that the number of items already in WM, q, is lower than the
capacity of WM, C. This is calculated by computing the likelihood
that of the previous items in the stream, C have already entered
WM. Capacity C was set to 5, which is within the range of 4 to 7
found by others (Cowan, 2001; G. A. Miller, 1956).
Simulated performance on targets is equal to the likelihood that
these have entered WM at the end of the stream—p(j in WM) (see
Equation A6)—multiplied by .95 to allow for 5% nonspecific
errors (e.g., eye blinks, general lapses of attention, wrong key
presses), plus a guessing correction depending on the number of
response alternatives (n) in the stream:
p(correct) ⫽ 0.95 p(j in WM) ⫹ 1/n[1 ⫺ 0.95p(j in WM)]. (A7)
ton ⫹ 15
u ⫽ ton j,u
gl,t ⫽
863
(A5)
with i ranging over all items preceding item j.
Entry Into WM
Within the model, entry into WM is a stochastic process. The
likelihood that item j enters WM is a function of the product of the
Switching Tasks and Locations
Many RSVP tasks involve switches, in which the targets are
defined differently halfway through the stream (task switch) or are
presented at a different location (location switch). We model such
switches as stochastic. If item i is a cue to switch, the switch can
occur during presentation of any subsequent item. The likelihood
that this has occurred before an item j following i, pswitch,j, follows
a logistic distribution:
p switch,t ⫽
1
.
1 ⫹ e⫺(ton, j ⫺ ton,i ⫺ ␣)/
(A8)
Here, ton,i is the onset time of the cue item, and ton,j is the onset
time of item j. In the case of a task switch, item j is defined as
target or distractor following the new task with likelihood pswitch,j,
and following the old task with likelihood 1⫺ pswitch,j. In the case
of a location switch, item j is processed with likelihood pswitch,j,
whereas only a blank is processed with likelihood 1 ⫺ pswitch,j.
Parameter ␣, the mean of the distribution, is equal to 200 ms for
endogenous task and locations switches and to 75 ms for exogenous cues (salient cues that automatically attract attention to their
location). Parameter , which determines the variance, was always
equal to 0.2␣.
Received August 24, 2006
Revision received June 2, 2008
Accepted June 3, 2008 䡲