Learning Orthographic Structure With Sequential Generative Neural Networks
Author(s) - Alberto Testolin, Ivilin Stoianov, Alessandro Sperduti, Marco Zorzi
Publication year - 2016
Publication title - Cognitive Science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.498
H-Index - 114
eISSN - 1551-6709
pISSN - 0364-0213
DOI - 10.1111/cogs.12258
Subject(s) - connectionism, computer science, artificial intelligence, generative model, probabilistic logic, artificial neural network, generative grammar, Boltzmann machine, recurrent neural network, machine learning, restricted Boltzmann machine
Learning the structure of event sequences is a ubiquitous problem in cognition and particularly in language. One possible solution is to learn a probabilistic generative model of sequences that allows making predictions about upcoming events. Though appealing from a neurobiological standpoint, this approach is typically not pursued in connectionist modeling. Here, we investigated a sequential version of the restricted Boltzmann machine (RBM), a stochastic recurrent neural network that extracts high-order structure from sensory data through unsupervised generative learning and can encode contextual information in the form of internal, distributed representations. We assessed whether this type of network can extract the orthographic structure of English monosyllables by learning a generative model of the letter sequences forming a word training corpus. We show that the network learned an accurate probabilistic model of English graphotactics, which can be used to make predictions about the letter following a given context as well as to autonomously generate high-quality pseudowords. The model was compared to an extended version of simple recurrent networks, augmented with a stochastic process that allows autonomous generation of sequences, and to non-connectionist probabilistic models (n-grams and hidden Markov models). We conclude that sequential RBMs and stochastic simple recurrent networks are promising candidates for modeling cognition in the temporal domain.
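To make the kind of baseline the abstract refers to concrete, the sketch below shows a character-level n-gram (bigram) generative model of letter sequences: it estimates the probability of the next letter given the preceding one and can sample pseudowords. This is not the authors' sequential RBM or their evaluation code; the function names and the tiny training corpus are hypothetical and for illustration only.

```python
# Minimal sketch (assumed, not the authors' implementation) of a
# character-level bigram model of graphotactics: it learns transition
# probabilities between letters, including word-boundary symbols, and
# samples pseudowords from the learned distribution.
import random
from collections import defaultdict

START, END = "^", "$"  # word-boundary markers


def train_bigram(words):
    """Count letter-to-letter transitions and normalize to P(next | prev)."""
    counts = defaultdict(lambda: defaultdict(int))
    for word in words:
        seq = [START] + list(word.lower()) + [END]
        for prev, nxt in zip(seq, seq[1:]):
            counts[prev][nxt] += 1
    model = {}
    for prev, nxts in counts.items():
        total = sum(nxts.values())
        model[prev] = {c: n / total for c, n in nxts.items()}
    return model


def sample_pseudoword(model, max_len=10):
    """Generate a letter string by sampling the transition model."""
    letters, prev = [], START
    while len(letters) < max_len:
        choices, probs = zip(*model[prev].items())
        nxt = random.choices(choices, weights=probs)[0]
        if nxt == END:
            break
        letters.append(nxt)
        prev = nxt
    return "".join(letters)


if __name__ == "__main__":
    # Hypothetical monosyllabic training corpus, for illustration only.
    corpus = ["cat", "can", "cap", "man", "map", "mat", "pan", "pat", "tan", "tap"]
    model = train_bigram(corpus)
    print([sample_pseudoword(model) for _ in range(5)])
```

A higher-order n-gram or a hidden Markov model would condition on longer contexts or latent states, which is closer to the comparison models discussed in the paper; the connectionist models studied there instead encode context in distributed internal representations.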