Published version: Nearly forthcoming in Psychonomic Bulletin & Review
The Reverse Stroop Effect
Send correspondence and requests to: firstname.lastname@example.org
Frank H. Durgin
Department of Psychology
500 College Avenue
Swarthmore, PA 19081
phone: (610) 328-8678
fax: (610) 328-7814
In classic Stroop interference, manual or oral identification of sensory colors presented as incongruent color words is delayed relative to simple color naming. In the experiment reported here, this effect was shown to all but disappear when the response was simply to point to a matching patch of color. Conversely, strong Reverse Stroop interference occurred with the pointing task. That is, when the sensory color of a color word was incongruent with that word, responses to color words were delayed by an average of 69 msec relative to a word presented in gray. Thus, incongruently-colored words interfere strongly with pointing to a color patch named by the words, but little interference from incongruent color words is found when the goal is to match the color of the word. These results suggest that Stroop effects arise from response compatibility of irrelevant information rather than automatic processing or habit strength.
The Reverse Stroop Effect
The Stroop Effect is one of the easiest and most powerful effects to demonstrate in a classroom, but not the easiest to explain. Nearly every Introductory Psychology book provides a demonstration of the phenomenon: that it is difficult to name the ink color in which different color words are printed. But what is the proper explanation? Perhaps the weakest hypothesis concerning Stroop interference is that "words are processed faster than are colors." It is true that reading words is faster than naming colors, but this seems to be a matter of response compatibility, rather then perceptual speed. After all, the words require no translation (Virzi & Egeth, 1985). In trying to emphasize that the interference effect depends on greater response compatibility between printed and spoken words, however, one risks suggesting the automaticity account (see Besner, Stolz, & Boutilier, 1997, for a recent critique) which suggests that color words interfere with color naming because they are automatically processed.
The present experiment was designed to put both the speed theory and the automaticity theory to rest, if only for a little while, by using the simple non-verbal response of pointing to the appropriate color in a visual array. This will be shown to completely reverse the direction of interference, and thus to produce a Reverse Stroop effect where sensory colors interfere with identifying color words. No one thinks pointing to colors is automatic, although, like naming, pointing can be construed as a simple deictic act (literally, indexical). Nonetheless, the perceptual grouping of matched colors in an array (target and response) seems so likely to be a sufficient guide for pointing (just as the printed word maps easily to the internal array of possible verbal responses), that the symmetry of this task with the traditional task seems quite good. Here, it can be argued, the response is suited to the sensory information, rather than to the verbal.
In the traditional Stroop effect (Stroop, 1935; see MacLeod, 1991 for a review), naming the print color of a word is delayed if the word itself is a color word which names a different color (e.g., responding "red" to the word "blue" displayed in red letters is slower than responding "red" to a red patch of color). Conversely, very little reverse Stroop interference is found when reading a color-word printed in a conflicting color (i.e., responding "blue" in the above example). One promising account of Stroop interference supposes that it is due to response competition, which, when the response is verbal, gives verbal inputs a privileged position (e.g., Fitts & Posner, 1967; MacLeod, 1991; Treisman & Fearnley, 1969). Although it is well established that Stroop interference is still obtained (though reduced) when manual, key-press responses are given (e.g., Keele, 1972; Pritchatt, 1968), these kinds of task may involve implicit verbal coding at the response-selection stage.
At least two prior experiments have attempted to eliminate Stroop interference by the use of color matching. Pritchatt (1968) showed a reduction in interference by a kind of matching task, but little reversal. McClain (1983) reported elimination of Stroop interference using colored buttons, but did not investigate Reverse Stroop interference. In both cases, colored patches were placed on or near buttons that were to be used as responses. Conversely, Besner et al. (e.g., 1997) have routinely studied normal Stroop interference using color-marked buttons.
Treisman and Fearnley (1969) showed that making same/different judgments about pairs of Stroop-like stimuli showed interference only then the comparison was between different modes (verbal and visual) rather than within the same modes (but, cf. Morton and Chambers, 1973). They did not demonstrate a response type for which visual information had a positive advantage, however. Indeed, Egeth, Blecker, and Kamlet (1969) showed that such comparisons were disrupted when the verbal information embedded in the stimuli (namely "DIFF" or "SAME") conflicted with the response itself. These kinds of findings have been used by some to support a "translational" account of Stroop interference (Virzi & Egeth, 1985). In a translational account, the argument is made that the target information must be translated into the appropriate classification modality (e.g., verbal), whereas the distracting information is already presented in that form. If two items share a modality, no translation is require to match them.
Flowers (1975) demonstrated a strong Reverse Stroop effect in a left-right two-alternative sequential word-to-color matching task. A word ("red" or "green") was presented on a colored background, and then, after a blank delay, one side of the screen would turn green, the other red. The subjects had to indicate which side matched the word. Here, again, although only two responses were possible, the response locations varied from trial to trial so that immediate perceptual information always formed the basis for response (in conjunction with verbal information presented moments before). Flowers demonstrated that this effect was clearly modulated by sensory similarity of interfering colors to the target colors. He did not investigate the influence of this paradigm on normal Stroop interference, however, and his task differs from traditional tasks in having a delayed, binary response. Flowers, Warner, and Polansky (1979), however, did perform a direct test of response compatibility with a numerosity analog of the Stroop task, and found reversal of the direction of interference when the response was to tap out the number of items rather than to respond verbally. (The stimuli in this case were collections of single digits, such as three twos, to which it was easier to say "two", but to tap thrice, as it turned out.)
The present paper reports a new kind of manual task which almost completely eliminates the traditional Stroop interference and produces strong color-based interference when the task is to identify the words (Reverse Stroop). The task is to point to a color by moving a mouse cursor to a patch of color corresponding to the desired response. This task is formally similar to a manual key-press inasmuch as pointing acts like naming. But pointing does so by referring to a perceptual entity (the color patch) rather than a mental category (via a categorical response).
The principal findings of the present experiment are that Stroop interference in responding to the sensory color of a conflicting color word can be eliminated and that Reverse Stroop interference (interference with responding to the color named by the word) demonstrated with a pointing procedure in which the responses are color patches. In the Stroop (Color) condition of this experiment, participants were required to respond to the color that the target word was printed in, ignoring the word itself (which named a different color). In the Reverse Stroop (Word) condition, participants were to respond to the color named by the word, ignoring its incongruent physical color. In both cases the response was to move a mouse cursor to a patch of color on the computer screen. Neutral (no-conflict) versions of each condition were also performed. Insofar as the demands of the pointing task lend themselves to the direct use of color rather than verbal information, response-competition theory predicts a strong Reverse Stroop effect when participants are required to respond to the verbal information and to disregard the perceptually salient color match. Simply put, pointing to a matching color can be accomplished without ever internally labeling the color. Pointing to a named color, on the other hand, would seem to require either categorical identification of the surrounding color patches or translation of the word into a visual code. If distracting information represents a possible response, then response competition may ensue when the distracter-based response would be in conflict with the correct response.
As a further consideration, the locations of the response colors were randomly altered from trial to trial for half of the participants to ensure dependence on visual guidance rather than on memory. In terms of information acquisition, fixed target locations ought logically to be faster, but if memory for fixed locations tends to foster a counterproductive dependence on response categorization (e.g., as may occur with manual key-presses) the opposite result could hold for the Color condition of the present task.
Participants. Forty Swarthmore undergraduate students participated in exchange for payment or partial fulfillment of a course requirement. Ten were assigned to each of four experimental conditions. Two additional students were excluded for failure to follow instructions.
Figure 1. Display configuration to scale with a key to actual display colors (left). Target word was presented in gray (neutral condition) or in a conflicting color (incongruent condition), which was matched to one of four square patches at corners of display. Color locations shown are those used in the fixed-location conditions of the Experiment. The central fixation square, which was only visible prior to the appearance of the target word, was about half the linear size of the "u" in "blue" shown here. Note that actual color patches rather than textures were used in experiment (right) -- and no color key was necessary.
The Task. On each trial of the experiment, students had to move a mouse cursor to a location (see Figure 1) which corresponded in color to the color of a word on the screen or to the color which the word named.
Design. Four between-participant conditions were included representing the combination of Task, which was either to identify the Color of the word or the color the Word named, and Color Location of the response patches, which were either Fixed or were Variable from trial to trial. Participants were assigned at random to one of the four conditions and each received two sessions of neutral trials as well as two sessions of incongruent trials in ABBA (or BAAB) order, so that interference effects could be measured within individuals by comparing sessions. For the Word tasks, the neutral stimuli were color words presented in medium gray. For the Color tasks, the neutral stimuli were furniture words ("desk", "lamp", "table", "chair") presented in color. Whether the neutral or incongruent condition came first was varied systematically between subjects.
Twelve distinct incongruent target stimuli were created by the factorial combination of four color words ("red", "yellow", "blue", and "green"), with each of the three colors incongruent with that word. A session consisted of 12 blocks of 12 trials each with stimulus order randomized within each block. The first two blocks of each session were considered warm-up and were not analyzed. In the Variable Color-Location conditions, the locations of the four color patches were chosen pseudo-randomly on each trial by the computer. In the Fixed Color-Location conditions the color positions were consistently as shown in Figure 1. In the neutral Color conditions, the furniture words replaced the color words, and thus were each paired with only three of the remaining colors.
Stimuli and response. The words were presented in 72-point Geneva lowercase letters against a black background at the center of a high-resolution Macintosh Display (28 pixels/cm) viewed, without restraint, at a distance of about 50 cm. The colors used, specified as 8-bit RGB values, were red (255,0,0), green (0,170,51), blue (68,68,255), yellow (255,255,0), and, for the neutral color, gray (170,170,170). A white square outline, 2 pixels wide, with an internal width of 400 pixels (16 deg of visual angle) defined the active portion of the display. The colored response patches were 100 pixels square and were placed at the inside corners of the white square, as depicted in Figure 1.
Each trial began with the square white outline around the display region, and a white square fixation mark, 20 pixels across, at its center. In the Fixed Color-Location condition, the four response color-squares were also present. The student initiated the trial by clicking the mouse on the fixation square. This action caused the fixation square and the mouse cursor to disappear, so as not to mask the word. (The mouse cursor later reappeared as soon as the student moved the mouse outside the region defined by the fixation square.) After a delay of 500 ms, the stimulus word appeared and remained on the screen until response. In the Variable Color-Location condition, the colored response patches appeared simultaneously with the word. Response latency was defined as the time at which the mouse-cursors tip entered one of the four colored response regions. A ballistic motion which passed the cursor through the color patch was therefore sufficient. A physical movement of the mouse by about 3 cm along the table surface sufficed to reach any patch. A beep for incorrect responses provided feedback.
Median response latencies were computed for correct-response trials in each session for each participant. Average median response times (RTs), as well as mean error rates (number of errors per 120-trial session), are shown in Figure 2 for incongruent and neutral trials for each experimental task.
Figure 2. Averages of median reaction times as a function of Task (identify Word or Color), Color Locations (Fixed or Arbitrary), and presence of incongruent information (Neutral or Incongruent). Error bars represent standard errors of the mean. Parenthetical values report error frequencies for the various conditions.
A repeated measures ANOVA was conducted on the RT data with Task (Word or Color) and Color Location (Fixed or Random) as between-subject variables and Irrelevant Dimension (Neutral or Incongruent) and Block (First or Second) as within-subject variables. As is evident from Figure 2, overall latencies in the Word task conditions (654 ms) are substantially longer than those in the Color matching conditions (526 ms) [F(1,36) = 61.1, p < .001], consistent with the less natural mapping from words to directional selection of colored patched. Incongruent trials were also slower, overall, than neutral trials [F(1,36) = 41.7, p< .001]. Crucially, however, because there was an interaction between Irrelevant Dimension and Task [F(1,36) = 24.7, p < .001], separate RT analyses were carried out for each Task. Note that this interaction indicates greater interference for the Word task than for the Color task, as is evident in Figure 2.
RTs were shorter in the second block (585) than in the first (597) [F(1,36) = 7.5, p < .01], and a reliable interaction of Block and Irrelevant Dimension indicated that this improvement was greater for incongruent than for neutral trials [F(1,36) = 9.3, p < .01]. However these order effects, which are consistent with practice, are not relevant to our main questions and will not be considered further.
When the same ANOVA was applied to error scores, the same patterns of findings emerged: Overall errors were higher in the Word condition (2.1 per session) than in the Color condition (1.0) [F(1,36) = 7.5, p < .01], and there were more errors for incongruent trials (2.0) than for neutral trials (1.1) [F(1,36) = 23.9, p < .001]. There was also an interaction between Irrelevant Dimension and Task [F(1,36 = 9.5, p < .01], indicating greater interference effects for the word task, and so error analyses were also conducted separately for each task. There was no reliable effect of Block on error rates [F(1,36) = 2.2, p > 0.1].
Word Matching Task When the color-matching data were excluded from the analysis, the mean RT for incongruent trials (689 msec) was much longer than that for neutral trials (620 msec) [F(1, 18) = 37.2, p <.0001]. Similarly, there were more errors for incongruent trials (2.9) than for neutral trials (1.3) [F(1,18) = 24.5, p < .0001]. There was no evidence of effects of Color Location in either analysis, nor did Color Location interact reliably with any other factors. In summary, strong Reverse Stroop interference effects have been demonstrated in both response times and in error rates.
Color-Matching Task. In contrast to the results of the Word task, very little Stroop interference is evident in the pointing task when the response is to the color in which the word is displayed. For the RT analysis, incongruent trials (531 msec) were, indeed, slower than congruent trials (522 msec) [F(1,18) = 4.6, p < .05]. However, this difference (9 msec) is much smaller than that for the Word task (79 msec), as was indicated by the interaction in the main analysis above. Moreover, the error rates for the incongruent trials did not differ reliably from neutral trials for this task [F(1,18) = 2.3, p > .10]. Again, the interaction in the main error analysis indicates that interference was greater in the Word task than in the Color task. Although some Stroop interference remains, it is clear that it is inconsequential relative to the Reverse Stroop effects.
Rather surprisingly, responses in the Random Color Location condition (504 msec) were faster than those in the Fixed Color Location condition (550 msec) [F(1,18) = 11.6, p < .01]. It might be that fixed response locations facilitated categorical encoding of responses as a strategy, and it might be that this was actually counterproductive in the matching task, which can be handled by perceptual color grouping more rapidly. However, in further experiments, not reported here, this particular difference has not arisen reliably (whereas the others do). Moreover, the effects of Irrelevant Dimension did not vary as a function of Color Location [F(1,18) = 1.8, n.s], so I will not dwell on this further.
The results reported here indicate that using a pointing task can produce strong Reverse Stroop interference while nearly eliminating traditional Stroop interference. It is worth noting that, insofar as the Fixed Color Location conditions are analogous to traditional Stroop paradigms with fixed responses for each color item, it is impressive that these conditions are so effective at reversing the traditional direction of Stroop interference.
The present findings support the response-compatibility/response-competition model of Stroop interference, and are thus consistent with similar findings for numerosity (Flowers et al., 1979). They are also compatible with translational accounts (e.g., Virzi & Egeth, 1985). When responses are matched to visually guided action, visual, rather than verbal responses are faster, and conflicting visual information is more strongly disruptive of responding to verbal information than vice versa.
Accounts of Stroop interference which depend on the purported automaticity of verbal processing of text, on the other hand, are difficult to adapt to the present results. Although a common response to a word is to read it, pointing to matching colors is not plausibly an automatic response, or even a normal response. There is nonetheless clear Reverse Stroop interference in the pointing task. Insofar as the Reverse Stroop parallels the traditional Stroop, the automaticity account fails for both. This is bad news for models of Stroop performance that presume a basis for the effect in automaticity or strength of association (e.g., Cohen, Dunbar, & McClelland, 1990).
Researchers using sorting-task variants of the Stroop effect, in which the stimulus itself was removed to one of several bins, have previously suggested that active manipulation reduces Stroop interference (Chmiel, 1984; Martin, 1981; Taylor & Clive, 1983; Tecce & Happ, 1964) particularly when colored labels are used for the sorting bins (Chmiel, 1984). However, reversals (i.e., strong Reverse Stroop effects) may not have been as evident because sorting tasks maintain an emphasis on conceptual categorization. The present task might be regarded as a single-trial-analyzable version of a sorting task, but the analogy to sorting is actually no stronger than that of naming to sorting. Moreover, the pointing task employed here may succeed because it avoids any dependence on explicit categorization. Scanning tasks (Uleman & Reeves, 1978) have shown suggestive reversal results, but these have departed substantially from any structural similarity to the traditional Stroop.
The nearest predecessor of the present effect is probably the work of Flowers (1975), discussed earlier, in which a Reverse Stroop effect was demonstrated. He used keypress responses on the left or right side to a word presented on a colored background. Following the word, the two colors used, red and green were presented randomly to the left and right, and the keypress was to correspond to the side of the color that matched the word. The variable location of response locations was used to delay response by a variable interval in that experiment, but may also have served to disadvantage categorical responding in favor of visually-guided action (perhaps guided by implicit apparent motion). Flowers did not investigate the traditional direction of the Stroop effect, because he did not have subjects respond to the perceived colors. Moreover, his task departs from most traditional Stroop tasks in having only a binary response set (cf., also, Treisman & Fearnley, 1969). Apart from Flowers et al.s (1979) investigation of response compatibility in the numerosity analog of Stroop interference, the present results stand as a unique demonstration of the symmetry of interference that the stimulus matching results of Treisman and Fearnley (1969), for example, suggested ought to be possible.
In conclusion, the data presented here show that Reverse Stroop interference, the interference of colors with the response to color words, is quite strong in a color-matching pointing task for which normal Stroop interference is minimized. Unlike manual key-pressing, which has traditionally failed to eliminate strong Stroop interference, pointing, even to a fixed set of locations, appears to be resistant to covert verbal labeling. These data are clearly consistent with accounts which stress the importance of response compatibility and consequent response competition in Stroop interference effects.
Besner, D., Stolz, J. A., & Boutilier, C. (1997). The Stroop effect and the myth of automaticity. Psychonomic Bulletin and Review, 4, 221-225.
Chmiel, N. (1984). Phonological encoding for reading: The effect of concurrent articulation in a Stroop task. British Journal of Psychology, 75, 213-220.
Cohen, J. D., Dunbar, K., & McClelland, J. L. (1990). On the control of automatic processes: A parallel distributed processing account of the Stroop effect. Psychological Review, 97, 332-361.
Fitts, P. M., & Posner, M. I. (1967). Human Performance. Monterey, CA: Brooks Cole.
Flowers, J. H. (1975). "Sensory" interference in a word-color matching task. Perception & Psychophysics, 18, 37-43.
Flowers, J. H. (1979). Response and encoding factors in "ignoring" irrelevant information. Memory & Cognition , 7, 86-94.
Egeth, H. E., Blecker, D., & Kamlet, A. S. (1969). Verbal interference in a perceptual comparison task. Perception & Psychophysics, 6, 355-356.
Keele, S. (1972). Attention demands of memory retrieval. Journal of Experimental Psychology, 93, 245-2458.
MacLeod, C. M. (1991). Half a century of research on the Stroop effect: An integrative review. Psychological Review, 109, 163-203.
Martin, M. (1981). Reverse Stroop effect with concurrent tasks. Bulletin of the Psychonomic Society, 17, 8-9.
McClain, L. (1983). Effects of response type and set size on Stroop color-word performance. Perceptual and Motor Skills, 56, 735-743.
Morton, J., & Chambers, S. M. (1973). Selective attention to words and colors. Quarterly Journal of Experimental Psychology, 25, 387-397.
Pritchatt, D. (1968). An investigation into some of the underlying associative verbal processes of the Stroop colour effect. Quarterly Journal of Experimental Psychology, 20, 351-359.
Stroop, J. R. (1935). Studies of interference in serial verbal reactions. Journal of Experimental Psychology, 18, 643-662.
Taylor, A., & Clive, P. B. (1983). Two forms of the Stroop test. Perceptual and Motor Skills, 57, 879-882.
Tecce. J. J., & Happ, S. J. (1964). Effects of shock-arousal on a card-sorting test of color-word interference. Perceptual and Motor Skills, 19, 905-906.
Treisman, A. M., & Fearnley, S. (1969). The Stroop test: Selective attention to colours and words. Nature, 222, 437-439.
Uleman, J. S., & Reeves, J. (1971). Reversal of the Stroop interference effect through scanning. Perception & Psychophysics, 9, 293-295.
Virzi, R. A., & Egeth, H. E. (1985). Toward a translational model of Stroop interference. Memory & Cognition, 13, 304-319.
This work was supported by a Swarthmore Faculty Research Grant and by the Howard Hughes Medical Institute. Thanks to Evoni Story and to Richa Jain for running the experimental participants. Thanks also to Derek Besner, John Flowers, Joel Lachter, Neill Trammell, John Wixted, and two anonymous reviewers for helpful comments on earlier drafts.