An Empirical Exploration of Recurrent Network Architectures: Paper Notes

This post contains my notes about the paper titled "An Empirical Exploration of Recurrent Network Architectures". You can read the paper here - http://proceedings.mlr.press/v37/jozefowicz15.pdf Abstract of the paper -  The Recurrent Neural Network (RNN) is an extremely powerful sequence model that is often difficult to train. The Long Short-Term Memory (LSTM) is a specific RNN … Continue reading An Empirical Exploration of Recurrent Network Architectures: Paper Notes