Visual search refers to the act of looking for targets in a visual world containing distractors. This could be a search for the cat in the living room, a typo in this paragraph, or a tumor in a chest x-ray. In all these cases and others, fundamental capacity limitations ensure that we cannot fully process all visual input at one time. To deal with those limitations, humans (and any animals with a substantial nervous system) have mechanisms of selective attention that allow them to more extensively process some stimuli before moving on to others. Fortunately, attention is not deployed randomly but is under the systematic influence of bottom-up processes (driven by the stimulus) and top-down processes (driven by the conscious or unconscious desires of the searcher). Other factors, like observer experience and scene structure, also influence search. Together, these factors make most routine searches seem effortless, although the world abounds with search tasks that are more difficult, time-consuming, and error-prone.

History

Visual search as a term and a topic of research is about a century old (e.g., Kingsley, 1932) with extensive work beginning after World War II (e.g., Mackworth, 1948). Of course, people have known forever that they must look for things: “Go bid my woman search for a jewel that too casually hath left mine arm,” says Shakespeare’s Imogen (Cymbeline, Act 2, Scene 3), and aspects of the search process have been part of scientific and philosophical inquiry since antiquity (Hatfield, 1998). For instance, Aristotle discusses the narrowing of attention in De Anima, and the steering of attention is found in Lucretius (De Rerum Natura) [see Attention].

By the 1950s and 1960s, visual search was becoming an active topic of research, with basic laboratory work involving search through arrays of letters (Green & Anderson, 1956; Neisser et al., 1963) and more applied work like searching for tumors in x-rays (Tuddenham & Calvert, 1961) or mariners in the ocean (Koopman, 1956). The topic became central to the study of visual cognition in the 1970s and 1980s with the work of researchers like Sperling, Shiffrin, Egeth, Townsend, Sternberg, and others (see Shiffrin, 1988 for a review) and, probably most importantly, with the publication of Anne Treisman’s feature integration theory (Treisman & Gelade, 1980). Her work crystallized many of the core concepts that have defined the field in the subsequent decades.

Core concepts

**Figure 1**
A search stimulus with multiple targets (see text).

Figure 1 illustrates many core concepts in visual search. Find arrows pointing to the right. First, you must search. You cannot simply see the targets without search. If you swiftly detected a target, find the second one. The tendency to miss a second target is known as satisfaction of search, an important topic in medical image searches. Search for the target could proceed in two ways. There are overt movements of the eyes to look at one spot or another. There are also covert deployments of attention. Notice that, if you fixate your eyes at the center of the figure, you can find the blue target arrow without moving your eyes.

Some aspects of the image may be processed in parallel, across the entire image at once. This includes a limited set of basic features like color and orientation, known as preattentive features because they appear to be available before attention is directed to an object (Wolfe & Horowitz, 2017). These features can be used to guide attention. Thus, if asked to find tilted green arrows, you can guide your attention to arrows with tilted orientations (noticing that they are clustered on the right) or to green arrows (noticing that they are distributed across the image). Notice that it is easier to find a tilted arrow in the horizontal region than a horizontal arrow in the tilted region, a search asymmetry (Treisman & Gormican, 1988).

Guidance can take two main forms. Top-down guidance involves volitional control of the human “search engine”; for example, to look for “green” and “tilted” in the figure. By contrast, the figure’s red item grabs attention in a bottom-up, stimulus-driven manner, largely independent of observer desires. An item having unique, salient features (here, color, shape, and size) seems to immediately grab attention and is said to pop out of the display. Calculating stimulus salience is important in many computational models.
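The interplay of the two forms of guidance can be made concrete with a toy sketch. This is not any published model: the display, the feature names, the scoring rules, and the weight `w_top` are all assumptions chosen for illustration. Bottom-up salience is scored as how much an item differs from the other items, top-down guidance as how well it matches the searched-for features, and the two are mixed into a single priority value.

```python
# Toy priority computation; all names and values here are hypothetical.

def bottom_up(items):
    """Salience per item: mean, over features, of the fraction of other items it differs from."""
    scores = []
    for i, item in enumerate(items):
        others = [o for j, o in enumerate(items) if j != i]
        diffs = [sum(o[f] != item[f] for o in others) / len(others) for f in item]
        scores.append(sum(diffs) / len(diffs))
    return scores

def top_down(items, target):
    """Guidance per item: fraction of the target's features that the item matches."""
    return [sum(item[f] == v for f, v in target.items()) / len(target) for item in items]

def priority(items, target, w_top=0.5):
    """Weighted mix of top-down guidance and bottom-up salience (w_top is an assumed weight)."""
    bu, td = bottom_up(items), top_down(items, target)
    return [w_top * t + (1 - w_top) * b for t, b in zip(td, bu)]

def most_prioritized(scores):
    """Index of the item with the highest score."""
    return max(range(len(scores)), key=lambda i: scores[i])

# Made-up display: mostly small green/blue arrows, one large red item (index 5).
display = [
    {"color": "green", "orientation": "horizontal", "size": "small"},
    {"color": "blue",  "orientation": "horizontal", "size": "small"},
    {"color": "green", "orientation": "horizontal", "size": "small"},
    {"color": "blue",  "orientation": "tilted",     "size": "small"},
    {"color": "green", "orientation": "tilted",     "size": "small"},  # index 4: tilted green
    {"color": "red",   "orientation": "horizontal", "size": "large"},  # index 5: salient singleton
    {"color": "blue",  "orientation": "horizontal", "size": "small"},
    {"color": "green", "orientation": "horizontal", "size": "small"},
]
target = {"color": "green", "orientation": "tilted"}

salient_item = most_prioritized(priority(display, target, w_top=0.0))  # purely bottom-up
guided_item = most_prioritized(priority(display, target, w_top=1.0))   # purely top-down
```

With the weight set entirely to bottom-up salience, the red singleton has the highest priority in this display; with it set entirely to top-down guidance, the tilted green item does. Intermediate weights trade the two off.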

**Figure 2**
Many search experiments measure response time as a function of number of items in a display (set size). Different tasks will produce different slopes of these response time x set size functions, shown here in idealized form without the noise of real data.

Many methods are used to study visual search. The most typical behavioral technique is to present a search display and to ask the observer to respond as quickly and accurately as possible. Search efficiency is defined by the slope of the response time x set size function (often with correction for errors). Figure 2 shows idealized response time data. Pop-out tasks produce efficient search (e.g., finding that salient, red symbol). In contrast, a search for the letter T among Ls would be inefficient. Guided searches lie between efficient and inefficient. The eyes move less often (3–4 times per second) than covert attention is deployed, so if each item must be fixated, the slope will be much steeper (as shown in Figure 2).
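As a minimal sketch of how such a slope might be estimated, the function below fits a least-squares line to mean response times at several set sizes. The function name and all data values are invented for illustration and are not taken from any experiment. For scale, if every item required its own fixation at 3–4 fixations per second, the cost would be roughly 250–333 ms per item.

```python
def search_slope(set_sizes, mean_rts_ms):
    """Least-squares slope (ms/item) and intercept (ms) of response time against set size."""
    n = len(set_sizes)
    mean_x = sum(set_sizes) / n
    mean_y = sum(mean_rts_ms) / n
    sxx = sum((x - mean_x) ** 2 for x in set_sizes)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(set_sizes, mean_rts_ms))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical, noise-free data in the spirit of the idealized functions in Figure 2.
set_sizes = [4, 8, 16, 32]
pop_out_rts = [450, 450, 450, 450]        # flat function: efficient search
inefficient_rts = [620, 740, 980, 1460]   # response time grows with every added item

pop_out_slope, _ = search_slope(set_sizes, pop_out_rts)
inefficient_slope, inefficient_intercept = search_slope(set_sizes, inefficient_rts)
```

Real data are noisy and are usually fit separately for target-present and target-absent trials; this sketch ignores both points.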

**Figure 3**
Scene context guides attention if you search for people.

Additional factors become important in real-world search. To get a sense of this, search for people in the scene in Figure 3. This is quick, although they are not unique in color or size in this image. Here, scene guidance becomes important (Vo et al., 2019). You rapidly understand the gist of the scene: its meaning and layout (Oliva, 2005). Your history of searching similar scenes, both recently (e.g., on the last trial) and across your life, also influences your search (Anderson et al., 2021). Thus, you guide search for humans to horizontal surfaces, not to the sky and probably not up a tree. The specific search history of a radiologist or a birder is an important part of their expertise in their search domains.

Questions, controversies, and new developments

Among current questions in the study of visual search, investigators want to know when and why attention can be captured by a stimulus despite a top-down desire to attend elsewhere (Luck et al., 2021). They try to understand how observers know when to end a search, especially an unsuccessful one (e.g., how many dogs are in the scene above?; Mazor & Fleming, 2022). Modelers debate whether search is primarily serial, parallel, or both (Townsend & Wenger, 2004). They also ask whether artificial aids (artificial intelligence, global positioning systems, etc.) will, with time, impair our normal search processes by delivering the answer without the search (Ying et al., 2024).

Broader connections

Behavioral studies of visual search connect to the study of attention more generally. There is considerable work on the neuroscience of visual search (e.g., on scene search, see Segraves, 2023) [see Visual Cognitive Neuroscience]. Various other tasks like foraging have an obvious relationship to search (Hills et al., 2008). The topic has significant real-world applications to tasks like driving, medical image perception, computer interface design, and a wide range of military and security concerns. Each of these applied domains raises questions about the role of expertise and the possibility that some people might have special aptitudes for specific tasks or for search in general. While there do appear to be reliable individual differences, it has proven difficult to determine in advance who might make a great airport screener or radiologist.

Acknowledgments

J. M. W. is supported by National Institutes of Health EY017001 and CA207490 and National Science Foundation 2146617.

References

Anderson, B. A., Kim, H., Kim, A. J., Liao, M.-R., Mrkonja, L., Clement, A., & Grégoire, L. (2021). The past, present, and future of selection history. Neuroscience & Biobehavioral Reviews, 130, 326-350. https://doi.org/10.1016/j.neubiorev.2021.09.004
Green, B. F., & Anderson, L. K. (1956). Color coding in a visual search task. Journal of Experimental Psychology, 51(1), 19-24. https://doi.org/10.1037/h0047484
Hatfield, G. (1998). Attention in early scientific psychology. In R. D. Wright (Ed.), Visual attention (pp. 3-25). Oxford University Press.
Hills, T. T., Todd, P. M., & Goldstone, R. L. (2008). Search in external and internal spaces: Evidence for generalized cognitive search processes. Psychological Science, 19(8), 802-808. https://doi.org/10.1111/j.1467-9280.2008.02160.x
Kingsley, H. L. (1932). An experimental study of ‘search.’ American Journal of Psychology, 44(2), 314-318. https://doi.org/10.2307/1414831
Koopman, B. O. (1956). The theory of search. I. Kinematic bases. Operations Research, 4(3), 324-346. https://doi.org/10.1287/opre.4.3.324
Luck, S. J., Gaspelin, N., Folk, C. L., Remington, R. W., & Theeuwes, J. (2021). Progress toward resolving the attentional capture debate. Visual Cognition, 29(1), 1-21. https://doi.org/10.1080/13506285.2020.1848949
Mackworth, N. H. (1948). The breakdown of vigilance during prolonged visual search. Quarterly Journal of Experimental Psychology, 1(1), 6-21. https://doi.org/10.1080/17470214808416738
Mazor, M., & Fleming, S. M. (2022). Efficient search termination without task experience. Journal of Experimental Psychology: General, 151(10), 2494-2510. https://doi.org/10.1037/xge0001188
Neisser, U., Novick, R., & Lazar, R. (1963). Searching for ten targets simultaneously. Perceptual and Motor Skills, 17(3), 955-961. https://doi.org/10.2466/pms.1963.17.3.955
Oliva, A. (2005). Gist of the scene. In L. Itti, G. Rees, & J. K. Tsotsos (Eds.), Neurobiology of attention (pp. 251-257). Academic Press.
Segraves, M. A. (2023). Using natural scenes to enhance our understanding of the cerebral cortex’s role in visual search. Annual Review of Vision Science, 9, 435-454. https://doi.org/10.1146/annurev-vision-100720-124033
Shiffrin, R. M. (1988). Attention. In R. Atkinson, R. J. Herrnstein, G. Lindzey, & R. D. Luce (Eds.), Stevens’ handbook of experimental psychology (2nd ed., Vol. 2, pp. 739-812). Wiley.
Townsend, J. T., & Wenger, M. J. (2004). The serial-parallel dilemma: A case study in a linkage of theory and method. Psychonomic Bulletin & Review, 11(3), 391-418. https://doi.org/10.3758/bf03196588
Treisman, A., & Gelade, G. (1980). A feature-integration theory of attention. Cognitive Psychology, 12(1), 97-136. https://doi.org/10.1016/0010-0285(80)90005-5
Treisman, A., & Gormican, S. (1988). Feature analysis in early vision: Evidence from search asymmetries. Psychological Review, 95(1), 15-48. https://doi.org/10.1037/0033-295X.95.1.15
Tuddenham, W. J., & Calvert, W. P. (1961). Visual search patterns in roentgen diagnosis. Radiology, 76, 255-256. https://doi.org/10.1148/76.2.255
Vo, M. L., Boettcher, S., & Draschkow, D. (2019). Reading scenes: How scene grammar guides attention and aids perception in real-world environments. Current Opinion in Psychology, 29, 205-210. https://doi.org/10.1016/j.copsyc.2019.03.009
Wolfe, J. M., & Horowitz, T. S. (2017). Five factors that guide attention in visual search. Nature Human Behaviour, 1, 0058. https://doi.org/10.1038/s41562-017-0058
Ying, Q., Dong, W., & Fabrikant, S. I. (2024). How do in-car navigation aids impair expert navigators’ spatial learning ability? Annals of the American Association of Geographers, 114(7), 1485-1504. https://doi.org/10.1080/24694452.2024.2356858

Visual Search