- 1
C. W. Anderson.
Learning and Problem Solving with Multilayer Connectionist
Systems.
PhD thesis, University of Massachusetts, Dept. of Comp. and Inf.
Sci., 1986.
- 2
A. G. Barto, R. S. Sutton, and C. W. Anderson.
Neuronlike adaptive elements that can solve difficult learning
control problems.
IEEE Transactions on Systems, Man, and Cybernetics,
SMC-13:834-846, 1983.
- 3
M. I. Jordan.
Supervised learning and systems with excess degrees of freedom.
Technical Report COINS TR 88-27, University of Massachusetts,
Amherst, 1988.
- 4
D. Nguyen and B. Widrow.
The truck backer-upper: An example of self learning in neural
networks.
In Proceedings of the International Joint Conference on Neural
Networks, pages 357-363. IEEE Press, 1989.
- 5
T. Robinson and F. Fallside.
Dynamic reinforcement driven error propagation networks with
application to game playing.
In Proceedings of the 11th Conference of the Cognitive Science
Society, Ann Arbor, pages 836-843, 1989.
- 6
A. L. Samuel.
Some studies in machine learning using the game of checkers.
IBM Journal of Research and Development, 3:210-229, 1959.
- 7
J. Schmidhuber.
Learning algorithms for networks with internal and external feedback.
In D. S. Touretzky, J. L. Elman, T. J. Sejnowski, and G. E. Hinton,
editors, Proc. of the 1990 Connectionist Models Summer School, pages
52-61. Morgan Kaufmann, 1990.
- 8
J. Schmidhuber.
Recurrent networks adjusted by adaptive critics.
In Proc. IEEE/INNS International Joint Conference on Neural
Networks, Washington, D. C., volume 1, pages 719-722, 1990.
- 9
J. Schmidhuber.
Towards compositional learning with dynamic neural networks.
Technical Report FKI-129-90, Institut für Informatik, Technische
Universität München, 1990.
- 10
J. Schmidhuber.
Adaptive decomposition of time.
In T. Kohonen, K. Mäkisara, O. Simula, and J. Kangas, editors,
Artificial Neural Networks, pages 909-914. Elsevier Science Publishers
B.V., North-Holland, 1991.
- 11
J. Schmidhuber.
Neural sequence chunkers.
Technical Report FKI-148-91, Institut für Informatik, Technische
Universität München, April 1991.
- 12
J. Schmidhuber.
Reinforcement learning in Markovian and non-Markovian
environments.
In R. P. Lippmann, J. E. Moody, and D. S. Touretzky, editors,
Advances in Neural Information Processing Systems 3, pages 500-506. Morgan
Kaufmann, 1991.
- 13
P. J. Werbos.
Consistency of HDP applied to a simple reinforcement learning
problem.
Neural Networks, 3:179-189, 1990.
Juergen Schmidhuber
2003-03-14