z-logo
open-access-imgOpen Access
Modeling word and morpheme order in natural language as an efficient trade-off of memory and surprisal.
Author(s) -
Michael Hahn,
Judith Degen,
Richard Futrell
Publication year - 2021
Publication title -
psychological review
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 4.688
H-Index - 211
eISSN - 1939-1471
pISSN - 0033-295X
DOI - 10.1037/rev0000269
Subject(s) - morpheme , word order , natural language processing , computer science , natural language , linguistics , word (group theory) , artificial intelligence , natural (archaeology) , psychology , history , philosophy , archaeology
Memory limitations are known to constrain language comprehension and production, and have been argued to account for crosslinguistic word order regularities. However, a systematic assessment of the role of memory limitations in language structure has proven elusive, in part because it is hard to extract precise large-scale quantitative generalizations about language from existing mechanistic models of memory use in sentence processing. We provide an architecture-independent information-theoretic formalization of memory limitations which enables a simple calculation of the memory efficiency of languages. Our notion of memory efficiency is based on the idea of a memory-surprisal trade-off: A certain level of average surprisal per word can only be achieved at the cost of storing some amount of information about the past context. Based on this notion of memory usage, we advance the Efficient Trade-off Hypothesis : The order of elements in natural language is under pressure to enable favorable memory-surprisal trade-offs. We derive that languages enable more efficient trade-offs when they exhibi information locality : When predictive information about an element is concentrated in its recent past. We provide empirical evidence from three test domains in support of the Efficient Trade-off Hypothesis: A reanalysis of a miniature artificial language learning experiment, a large-scale study of word order in corpora of 54 languages, and an analysis of morpheme order in two agglutinative languages. These results suggest that principles of order in natural language can be explained via highly generic cognitively motivated principles and lend support to efficiency-based models of the structure of human language. (PsycInfo Database Record (c) 2021 APA, all rights reserved).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom