Separation of Memory and Processing in Dual Recurrent Neural Networks

2021 
We explore a neural network architecture that stacks a recurrent layer and a feedforward layer, both connected to the input, and compare it to a standard recurrent neural network. When noise is introduced into the recurrent units' activation function, the two networks display binary activation patterns that can be mapped onto the discrete states of a finite state machine. However, while the former is equivalent to a Moore machine, the latter can be interpreted as a Mealy machine. The additional feedforward layer reduces the computational load on the recurrent layer, which models only the temporal dependencies. The resulting models are simpler and easier to interpret when the networks are trained on different sample problems, including the recognition of regular languages and the computation of additions in different bases.
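
As a rough illustration of the dual architecture (a minimal sketch, not the authors' implementation), the forward pass below wires a recurrent layer and a feedforward layer to the same input and injects Gaussian noise into the recurrent pre-activations. The assumption that the feedforward layer also reads the recurrent state, as well as all parameter names, shapes, and the noise model, are illustrative choices made here rather than details taken from the abstract.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def dual_rnn_forward(xs, params, noise_std=0.0, rng=None):
    """Forward pass of a dual network: a recurrent layer keeps the memory,
    a feedforward layer does the per-step processing, and both see the input."""
    if rng is None:
        rng = np.random.default_rng(0)
    Wxh, Whh = params["Wxh"], params["Whh"]   # input -> recurrent, recurrent -> recurrent
    Wxf, Whf = params["Wxf"], params["Whf"]   # input -> feedforward, recurrent -> feedforward
    Wfy = params["Wfy"]                       # feedforward -> output
    h = np.zeros(Whh.shape[0])                # recurrent state ("memory")
    ys = []
    for x in xs:                              # xs has shape (T, n_in)
        pre = Wxh @ x + Whh @ h
        # Noise in the activation pushes the recurrent units toward 0/1,
        # which is what enables the finite-state-machine interpretation.
        pre = pre + noise_std * rng.standard_normal(pre.shape)
        h = sigmoid(pre)                      # recurrent layer: temporal dependencies only
        f = sigmoid(Wxf @ x + Whf @ h)        # feedforward layer: per-step processing
        ys.append(Wfy @ f)                    # readout, e.g. per-symbol scores
    return np.array(ys), h
```

For a regular-language task, for instance, xs could be a one-hot encoding of the input string and the readout a per-symbol accept/reject score; the noise would be active during training and noise_std could be set to 0 at test time.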