Gated Recurrent Units (GRUs)

GRUs help solve the vanishing gradient problem in RNNs. They allow for long-range dependencies.

The GRU unit is shown below:

The following are the equations associated with a single GRU unit.

r stands for the reset gate and u stands for update gate.

Last updated