When communicating with infants, we use infant-directed speech, a specialized way of speaking characterized by a higher pitch, exaggerated intonation, short utterances, and frequent repetition. The language tends to be grammatically simpler, and the vocabulary is often tailored to topics relevant to the child’s world. Sometimes referred to as “baby talk,” this kind of speech is not just playful but conveys emotional warmth and supports early language acquisition. Across many cultures, infants consistently show a preference for infant-directed speech over the speech adults use with one another, and this preference appears to promote greater attention and learning. As children develop, caregivers subtly tune their speech, reducing prosodic exaggeration and increasing linguistic complexity in response to children’s growing perceptual and cognitive abilities. Thus, infant-directed speech serves both as a cue for social connection and as a learning environment that adapts to developmental change.

History

Researchers have long been interested in how caregivers talk to infants, and one influential early contribution was Charles Ferguson's (1964) description of “baby talk” as a structured speech register observed across many languages. In the decades that followed, especially during the 1970s and 1980s, interest in this topic grew alongside claims that children’s language input was too limited to explain how quickly and accurately they acquire grammar (Chomsky, 1965) [see Language Acquisition]. These concerns led to new questions about whether caregivers adjust their speech in ways that support learning (Cooper & Aslin, 1990). Around the same time, researchers also suggested that the melodic quality of infant-directed speech might communicate emotion (Fernald, 1989) and that clear pronunciation could help infants hear and process sounds more effectively (Kuhl et al., 1997). Although early studies yielded mixed results, recent work using large datasets and new analytic tools has shown that caregivers do adjust their speech in systematic ways (Cox et al., 2023). Ongoing research from neuroscience, eye tracking, and speech analysis continues to show how these adjustments support learning in real time and help shape how infants respond to language in their environment (Nencheva & Lew-Williams, 2022).

Core concepts

Infant-directed speech supports early language learning through its distinctive acoustic and linguistic characteristics.

An example audio stimulus from the infant-directed speech condition of the ManyBabies Consortium (2020) providing a collage of several different North American English-speaking mothers speaking to infants.

Its heightened pitch, slower tempo, and rhythmic prosody not only draw infants’ attention but also signal emotional engagement (see Soderstrom, 2007 for a review). According to the hyperarticulation hypothesis (e.g., Kuhl et al., 1997), caregivers may exaggerate sound contrasts to support early speech perception. Such features enhance the perceptual clarity of speech and make it easier to process. Infant-directed speech also features simplified grammar and more clearly articulated sounds (Ko & Jun, 2024). These adjustments can help infants identify word boundaries and build phonological categories.

From early in life, infants show a preference for infant-directed speech over adult-directed speech. This tendency has been observed within the first months of life (Cooper & Aslin, 1990) and replicated across a variety of experimental methods (ManyBabies Consortium, 2020). The prosodic salience of infant-directed speech appears crucial for maintaining infants’ attention (Fernald, 1989); however, the acoustic characteristics of infant-directed speech not only lead to a preference but also have implications for learning. Infants are more likely to detect phrase boundaries and segment words when listening to infant-directed speech (Thiessen et al., 2005). They also learn new word–object pairings more easily when words are presented in infant-directed speech (Shi et al., 2023) and show stronger brain responses to familiar words in infant-directed speech than in adult-directed speech (Zangl & Mills, 2007).

The amount and nature of linguistic input that infant-directed speech provides also matter for development. Children who are engaged in more verbal interactions with caregivers tend to develop larger vocabularies and stronger language skills over time (Hart & Risley, 1995) [see Interaction Engine; Word Learning]. Although early work emphasized differences in the quantity of input by socioeconomic status and how they predict children’s language outcome, recent findings suggest a more nuanced picture: input–outcome associations are robust yet modest (Coffey & Snedeker, 2025), and the structure of input itself varies substantially across cultures (e.g., Casillas et al., 2020). In addition to quantity, features such as lexical diversity, syntactic complexity (Huttenlocher et al., 2010), and how well caregiver speech matches the child’s attention (Schroer & Yu, 2022) have all been linked to vocabulary and grammatical growth.

Questions, controversies, and new developments

Understanding the functions of infant-directed speech and the extent to which they are universal continues to be a major focus of research in developmental science. One key question is whether the features of infant-directed speech, such as expanded vowel space, are designed to help infants learn language or whether they simply reflect caregivers’ emotions, indirectly shaping features of speech. Some researchers argue that these modifications support infants in perceiving and learning speech sounds. Others suggest they may arise naturally from positive emotional states, like smiling, which can alter vocal tract shape and affect the sound of speech. Evidence points to both roles: infant-directed speech engages infants emotionally and supports learning, with its form shaped by cultural practices and children’s developmental needs.

Cross-cultural studies suggest that the features of infant-directed speech vary across languages and communities. Much of what we know about infant-directed speech comes from studies of English-speaking families and research conducted in other Western, educated, industrialized, rich, and democratic settings [see WEIRD]. Yet, findings from more diverse contexts reveal a more complex picture. Some communities provide relatively little direct speech to young children, with much of their early language input coming from overheard speech (Shneidman & Goldin-Meadow, 2012). Cross-linguistic research further shows that grammatical structures thought to be difficult, such as passives, are acquired earlier in some languages than others (Armon-Lotem et al., 2016). Recent work has begun to extend the use of daylong recording methods, such as those based on the Language Environment Analysis system (see Greenwood et al., 2018 for a review) and similar wearable technologies (Casillas et al., 2020), to capture more ecologically sound and culturally varied patterns of language experience. This presents new chances to improve theories of language learning in a global setting.

Broader connections

Infant-directed speech has become a focus of interest not only in developmental science but also in artificial intelligence. One striking contrast lies in how efficiently children acquire language compared to large language models [see Large Language Models]. Although these models require exposure to vast amounts of data, young children learn from a relatively small amount of variable and noisy input (Frank, 2023). This contrast has sparked renewed interest in whether the structure of infant-directed speech plays a role in making that learning possible. Researchers investigate how its features might guide infants’ attention and highlight patterns in language. When speech is presented with cues like gaze, gesture, touch, and prosodic modulation, learning seems to be particularly successful. Moments with coordination among cues like these support both attention and comprehension and may even align infants’ brain activity with speech (Nencheva & Lew-Williams, 2022). Such findings underscore the importance of social interaction in learning and offer insights for designing more human-like artificial intelligence systems.

Acknowledgments

This work was supported by the National Research Foundation of Korea (NRF-2025S1A6B5A02004207). OpenAI’s ChatGPT (GPT-5) was used to refine the grammar and phrasing. The author reviewed and edited all suggestions to ensure that the text represents the author’s original ideas.

References

Armon-Lotem, S., Haman, E., Jensen de López, K., Smoczynska, M., Yatsushiro, K., Szczerbinski, M., van Hout, A., Dabašinskienė, A., Gavarró, A., Hobbs, E., Kamandulytė-Merfeldienė, L., Katsos, N., Kunnari, S., Nitsiou, C., Sundahl Olsen, L., Parramon, X., Sauerland, U., Torn-Leesik, R., & van der Lely, H. (2016). A large-scale cross-linguistic investigation of the acquisition of passive. Language Acquisition, 23(1), 27–56. https://doi.org/10.1080/10489223.2015.1047095
↩
Casillas, M., Brown, P., & Levinson, S. C. (2020). Early language experience in a Tseltal Mayan village. Child Development, 91(5), 1819–1835. https://doi.org/10.1111/cdev.13349
↩
Chomsky, N. (1965). Aspects of the theory of syntax. MIT Press.
↩
Coffey, J. R., & Snedeker, J. (2025). How strong is the relationship between caregiver speech and language development? A meta-analysis. Journal of Child Language. Published online January 10, 2025. https://doi.org/10.1017/S0305000924000692
↩
Cooper, R. P., & Aslin, R. N. (1990). Preference for infant-directed speech in the first month after birth. Child Development, 61(5), 1584–1595. https://doi.org/10.2307/1130766
↩
Cox, C., Bergmann, C., Fowler, E., Keren-Portnoy, T., Roepstorff, A., Bryant, G., & Fusaroli, R. (2023). A systematic review and Bayesian meta-analysis of the acoustic features of infant-directed speech. Nature Human Behaviour, 7(1), 114–133. https://doi.org/10.1038/s41562-022-01452-1
↩
Ferguson, C. A. (1964). Baby talk in six languages. American Anthropologist, 66(6), 103-114. https://doi.org/10.1525/aa.1964.66.suppl_3.02a00060
↩
Fernald, A. (1989). Intonation and communicative intent in mothers’ speech to infants: Is the melody the message? Child Development, 60(6), 1497–1510. https://doi.org/10.2307/1130938
↩
Frank, M. C. (2023). Bridging the data gap between children and large language models. Trends in Cognitive Sciences, 27(11), 990–992. https://doi.org/10.1016/j.tics.2023.08.007
↩
Greenwood, C. R., Schnitz, A. G., Irvin, D., Tsai, S. F., & Carta, J. J. (2018). Automated language environment analysis: A research synthesis. American Journal of Speech-Language Pathology, 27(2), 853–867. https://doi.org/10.1044/2017_AJSLP-17-0033
↩
Hart, B., & Risley, T. R. (1995). Meaningful differences in the everyday experience of young American children. Paul H. Brookes Publishing.
↩
Huttenlocher, J., Waterfall, H., Vasilyeva, M., Vevea, J., & Hedges, L. V. (2010). Sources of variability in children’s language growth. Cognitive Psychology, 61(4), 343–365. https://doi.org/10.1016/j.cogpsych.2010.08.002
↩
Ko, E.-S., & Jun, J. (2024). Phonological variation in child-directed speech is modulated by lexical frequency. Journal of Child Language, 51(2), 288–313. https://doi.org/10.1017/S0305000923000466
↩
Kuhl, P. K., Andruski, J. E., Chistovich, I. A., Chistovich, L. A., Kozhevnikova, E. V., Ryskina, V. L., Stolyarova, E. I., Sundberg, U., & Lacerda, F. (1997). Cross-language analysis of phonetic units in language addressed to infants. Science, 277(5326), 684–686. https://doi.org/10.1126/science.277.5326.684
↩
ManyBabies Consortium. (2020). Quantifying sources of variability in infancy research using the infant-directed speech preference. Advances in Methods and Practices in Psychological Science, 3(1), 24–52. https://doi.org/10.1177/2515245919900809
↩
Nencheva, M. L., & Lew-Williams, C. (2022). Understanding why infant-directed speech supports learning: A dynamic attention perspective. Developmental Review, 66, 101047. https://doi.org/10.1016/j.dr.2022.101047
↩
Schroer, S. E., & Yu, C. (2022). The real-time effects of parent speech on infants' multimodal attention and dyadic coordination. Infancy, 27(6), 1154–1178. https://doi.org/10.1111/infa.12500
↩
Shi, J., Gu, Y., & Vigliocco, G. (2023). Prosodic modulations in child-directed language and their impact on word learning. Developmental Science, 26(4), e13357. https://doi.org/10.1111/desc.13357
↩
Shneidman, L. A., & Goldin-Meadow, S. (2012). Language input and acquisition in a Mayan village: How important is directed speech? Developmental Science, 15(5), 659–673. https://doi.org/10.1111/j.1467-7687.2012.01168.x
↩
Soderstrom, M. (2007). Beyond babytalk: Re-evaluating the nature and content of speech input to preverbal infants. Developmental Review, 27(4), 501–532. https://doi.org/10.1016/j.dr.2007.06.002
↩
Thiessen, E. D., Hill, E. A., & Saffran, J. R. (2005). Infant-directed speech facilitates word segmentation. Infancy, 7(1), 53–71. https://doi.org/10.1207/s15327078in0701_5
↩
Zangl, R., & Mills, D. L. (2007). Increased brain activity to infant-directed speech in 6-and 13-month-old infants. Infancy, 11(1), 31–62. https://doi.org/10.1207/s15327078in1101_2
↩

Infant-Directed Speech