Premium
Large‐Scale Modeling of Wordform Learning and Representation
Author(s) -
Sibley Daragh E.,
Kello Christopher T.,
Plaut David C.,
Elman Jeffrey L.
Publication year - 2008
Publication title -
cognitive science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.498
H-Index - 114
eISSN - 1551-6709
pISSN - 0364-0213
DOI - 10.1080/03640210802066964
Subject(s) - connectionism , computer science , encoder , sequence (biology) , natural language processing , representation (politics) , artificial intelligence , scale (ratio) , speech recognition , artificial neural network , operating system , physics , quantum mechanics , biology , politics , political science , law , genetics
The forms of words as they appear in text and speech are central to theories and models of lexical processing. Nonetheless, current methods for simulating their learning and representation fail to approach the scale and heterogeneity of real wordform lexicons. A connectionist architecture termed the sequence encoder is used to learn nearly 75,000 wordform representations through exposure to strings of stress‐marked phonemes or letters. First, the mechanisms and efficacy of the sequence encoder are demonstrated and shown to overcome problems with traditional slot‐based codes. Then, two large‐scale simulations are reported that learned to represent lexicons of either phonological or orthographic wordforms. In doing so, the models learned the statistics of their lexicons as shown by better processing of well‐formed pseudowords as opposed to ill‐formed (scrambled) pseudowords, and by accounting for variance in well‐formedness ratings. It is discussed how the sequence encoder may be integrated into broader models of lexical processing.