PPT Slide
Reward per time improves slightly when Q-learning is added to the instruction set. Q-learning by itself fails, though.
Back to J. Schmidhuber's Metalearning page