Clear and concise intro to RNN

We motivate why recurrent neural networks are important for dealing with sequence data and review LSTMs and GRU (gated recurrent unit) architectures.   GRU is simplified LSTM.  Notes: BPTT( back propagation through time)