Using LSTM for POMDPs (Bakker, 2001)
reward
To the the robot, all T-junctions look the same. Needs short-term memory to disambiguate them!
Previous slide
Next slide
Back to first slide
View graphic version
Back to
J. Schmidhuber
's
Recurrent neural network page