A word’s meaning is the set of knowledge associated with a particular spoken, written, or signed word form that captures what the word refers to and how it is used and understood in context. This information is referred to as lexical–semantic knowledge and is typically distinguished from grammatical, or syntactic, information. Word meanings are a basic unit of linguistic knowledge and are combined together to express complex meanings in phrases and sentences. Cognitive theories of word meaning explore how word meanings are learned, represented, and processed.
History
Sumerian scholars compiled lists of words and their meanings on clay tablets as early as about 2000 BCE. These early dictionaries mainly clarified rare or specialized terms, but over centuries, dictionaries became broader as dictionary writers (known as lexicographers) endeavored to exhaustively document how all words were used across entire languages (Mugglestone, 2011).
Since the early 20th century, linguists have reflected on the nature of word meanings more broadly as part of a system of linguistic signs. Saussure famously argued that linguistic signs (i.e., words) consist of a signifier that represents its physical form and a signified that represents its meaning, with the relationship between these two components being conventional and largely arbitrary (Reda, 2016; see the section “Core concepts” for examples).
From the mid-20th century, researchers increasingly turned to experimentation to reveal how people learn and process word meanings [see Psycholinguistics]. Since the 1980s, computational models, especially neural network models, and cognitive neuroscience, have led to increasingly well-specified and neurobiologically plausible theories (McClelland & Rogers, 2003) [see Computational Models of Language Learning; Neuroscience of Language; Recurrent Neural Networks]. Large language models have recently transformed the field, demonstrating the richness of lexical–semantic knowledge that can be extracted from massive bodies of text [see Large Language Models].
Core concepts
Arbitrariness
There is often no necessary or intrinsic link between a word’s form and its meaning. Words like “cat,” “can,” and “cap” sound similar but have unrelated meanings; “big” and “large” have similar meanings despite dissimilar forms. This arbitrariness in the form-to-meaning mapping is reduced in some domains (Monaghan et al., 2011); onomatopoeic words (“buzz,” “hiccup”) resemble the things to which they refer, and morphological families, which are built from the same base word (e.g., “hop,” “hops,” “hopping”), systematically share form and meaning. Signed languages are somewhat less arbitrary, as some signs (referred to as iconic) visually resemble what they denote [see Sign Language].
Ambiguity
Nearly all common words are ambiguous, with a single form mapping onto multiple meanings. This phenomenon, known as lexical or semantic ambiguity, takes different forms. Homonyms have multiple unrelated meanings that share the same form by historical accident (e.g., the bark of a tree/dog). Polysemous words have clusters of related senses that have evolved as their usage has extended over time (e.g., the athlete/paint/river/program runs). Such ambiguities are typically resolved by using the surrounding context to settle on the most likely interpretation (Rodd, 2022).
Conventionality
Successful communication between individuals within a language community relies on shared, conventionalized norms as to the agreed meaning of words. These conventions change over time as words gain new senses (e.g., “cloud” for remote data storage; Rodd, 2022). Words can also be used in unconventional ways, being flexibly repurposed by speakers in novel and imaginative ways, for example, in novel metaphors (e.g., science is a glacier; Bowdle & Gentner, 2005).
Combinatoriality and productivity
A finite vocabulary can generate an endless set of phrases and sentences (Nefdt & Potts, 2024) [see Language]. Words occur within grammatical structures that indicate how word meanings combine into larger structures (e.g., “The student emailed the professor” vs. “The professor emailed the student”). This combinatorial aspect of language underlies its creative nature, allowing novel, unusual, or even nonsensical ideas to be conveyed using familiar words (e.g., “Colorless green ideas sleep furiously”; Chomsky, 1957).
Theoretical explanations
Classical feature-based theories view word meanings as bundles of semantic features (Collins & Quillian, 1969); the concept “bird” is linked to features like “is an animal,” “has feathers,” and “can fly.” Although neural network models often retain the notion that word meanings are distributed across many discrete featural units, these units typically do not correspond to interpretable, labeled features.
Distributional theories that assume word meanings are grounded in patterns of linguistic use have strong historical roots in philosophy (“The meaning of a word is its use in the language”; Wittgenstein, 1953/2009, section 43) and linguistics (“You shall know a word by the company it keeps”; Firth, 1957, p. 11). These insights are implemented in computational models that represent words as high-dimensional vectors that summarize their co-occurrence with other words (Landauer & Dumais, 1997). Contemporary large language models significantly extend this approach; words are represented by embeddings that encode complex, nonlinear statistical information about their co-occurrence patterns (i.e., the meaning of “tea” derives from the language with which it occurs: a cup of tea, tea and coffee, tea leaves).
Embodied, or grounded, accounts emphasize the role of perceptual, motor, and emotional systems; understanding “run” involves simulating relevant motor patterns, whereas understanding “cake” may activate visual, tactile, and taste representations. Embodied theories claim that aspects of words’ meanings are represented in the neural circuits used for perception and action (Barsalou, 2008).
Questions, controversies, and new developments
How predictive is word meaning processing?
Widespread evidence suggests that listeners and readers actively anticipate upcoming words, such that aspects of a word’s meaning can become active before its form is encountered on the basis of its preceding context (Ryskin & Nieuwland, 2023; Wong et al., 2025). Open questions include the specificity of these predictions—do they constitute specific word meanings or relatively broad semantic category information?
How does the flexibility of word meanings arise?
Successful communication requires word meanings to be relatively stable across individuals within a language community. Yet, meanings are highly flexible; they shift across contexts and are creatively extended in poetry, metaphor, and everyday conversation. Does this flexibility reflect dynamic updating of stored lexical–semantic knowledge or the additional contributions of more flexible cognitive systems such as episodic memory (Gaskell, 2024)?
How does the structure of word meanings vary across languages?
Current theories of word meaning are overly reliant on evidence from Indo-European languages, especially English. Other languages differ markedly. Agglutinating languages (e.g., Turkish) combine multiple meaning-bearing units into long words. Nonconcatenative languages (e.g., Arabic) form words from a root, which provides the core meaning and a pattern that helps determine the final interpretation.
Broader connections
Communication is inherently social in nature, yet most evidence about word meanings comes from experiments that deliberately minimize and standardize social context. Word meaning should be understood within broader models of social learning, recognizing the motivational forces that lead (most) humans to readily acquire words and use them to form and sustain complex social relationships [see Language Socialization; Social Learning]
Adequate theories of how word meanings are learned and processed are essential to support educational initiatives to improve critical language skills, which vary widely across the population [see Developmental Language Disorder]
Recent developments in artificial intelligence are influencing cognitive theories of word meaning. Examining the representations of words and their meanings that emerge during training in large language models, and that support near-human performance across diverse language tasks, has yielded surprising insights.
Further reading
Meteyard, L., & Vigliocco, G. (2018). Lexico-semantics. In S.-A. Rueschemeyer & M. G. Gaskell (Eds), The Oxford handbook of psycholinguistics (2nd ed., pp. 71–95). Oxford University Press.
Rodd, J. M. (2020). Settling into semantic space: An ambiguity-focused account of word-meaning access. Perspectives on Psychological Science, 15(2), 411–427. https://doi.org/10.1177/1745691619885860
Ryskin, R., & Nieuwland, M. S. (2023). Prediction during language comprehension: What is next? Trends in Cognitive Sciences, 27(11), 1032–1052. https://doi.org/10.1016/j.tics.2023.08.003
References
Barsalou, L. W. (2008). Grounded cognition. Annual Review of Psychology, 59, 617–645. https://doi.org/10.1146/annurev.psych.59.103006.093639
↩Bowdle, B. F., & Gentner, D. (2005). The career of metaphor. Psychological Review, 112(1), 193–216. https://doi.org/10.1037/0033-295X.112.1.193
↩Chomsky, N. (1957). Syntactic structures. Mouton de Gruyter.
↩Collins, A. M., & Quillian, M. R. (1969). Retrieval time from semantic memory. Journal of Verbal Learning & Verbal Behavior, 8(2), 240–247. https://doi.org/10.1016/S0022-5371(69)80069-1
↩Firth, J. R. (1957). A synopsis of linguistic theory. Blackwell.
↩Gaskell, M. G. (2024). EPS mid-career prize: An integrated framework for the learning, recognition and interpretation of words. Quarterly Journal of Experimental Psychology, 77(12), 2365–2384. https://doi.org/10.1177/17470218241284289
↩Landauer, T. K., & Dumais, S. T. (1997). A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 104(2), 211–240. https://doi.org/10.1037/0033-295X.104.2.211
↩McClelland, J. L., & Rogers, T. T. (2003). The parallel distributed processing approach to semantic cognition. Nature Reviews Neuroscience, 4(4), 310–322. https://doi.org/10.1038/nrn1076
↩Monaghan, P., Christiansen, M. H., & Fitneva, S. A. (2011). The arbitrariness of the sign: Learning advantages from the structure of the vocabulary. Journal of Experimental Psychology: General, 140(3), 325–347. https://doi.org/10.1037/a0022924
↩Mugglestone, L. (2011). Dictionaries: A very short introduction. Oxford University Press.
↩Nefdt, R. M., & Potts, C. (2024). Compositionality. In M. C. Frank & A. Majid (Eds.), Open encyclopedia of cognitive science. MIT Press. https://doi.org/10.21428/e2759450.494deacd
↩Reda, G. (2016). Ferdinand de Saussure in the Era of Cognitive Linguistics. Language and Semiotic Studies, 2(2), 89–100. https://doi.org/10.1515/lass-2016-020203
↩Rodd, J. M. (2022). Word-meaning access: The one-to-many mapping from form to meaning. In A. Papafragou, J. C. Trueswell, & L. R. Gleitman (Eds), The Oxford handbook of the mental lexicon (pp. 491–505). Oxford University Press.
↩Ryskin, R., & Nieuwland, M. S. (2023). Prediction during language comprehension: What is next? Trends in Cognitive Sciences, 27(11), 1032–1052. https://doi.org/10.1016/j.tics.2023.08.003
↩Wittgenstein, L. (2009). Philosophical investigations (P. M. S. Hacker & J. Schulte, Trans.). Wiley-Blackwell. (Original work published 1953)
↩Wong, R., Reichle, E. D., & Veldre, A. (2025). Prediction in reading: A review of predictability effects, their theoretical implications, and beyond. Psychonomic Bulletin & Review, 32, 973–1006. https://doi.org/10.3758/s13423-024-02588-z
↩